Complex numbers are often useful to prove theorems only involving real numbers (e.g., every polynomial over R can be factorised into a product of degree 1 and degree 2 polynomials over R).
In the same way, it's not surprising that more general objects than ordinary functions, which can ALWAYS be differentiated, can be useful in PDEs, etc.
That's the whole purpose of distributions: to have a class of objects extending ordinary functions, which can always be differentiated, and agree with the ordinary derivative whenever they are smooth functions.
For example, look up elliptic regularity. Sometimes you can prove that a PDE has a distribution solution, more easily than an ordinary function. Then prove a SEPARATE theorem that the distribution solution in fact IS just an ordinary function.
Look up things like the fundamental solution and Green's functions for PDEs.
E.g., to solve $Lu=f$ for some linear differential operator $L$, first solve $Lk = \delta$ for the Dirac delta function $\delta$. Then, formally, using convolution:
$L(k*f) = Lk * f = \delta * f = f$.
Thus we've found a general formula $u=k*f$ to generate solutions. Sometimes, as in elliptic regularity, we can use properties of $k$ to prove theorems about the smoothness of $u$, given smoothness conditions on $f$, etc.
Distribution theory can sometimes make such arguments rigorous.
Of course, in any given special case, it might be that results can be proved directly and easily without the general theory; but this is true of any general theory.
Also very important is the Fourier transform, which works on a subspace of distributions called tempered distributions. Again, this generalises the ordinary Fourier transform for nice functions.
It should also be remembered that physicists and engineers were already doing calculations like this before distribution theory. So it's nice to have a fully rigorous general theory to lay more solid foundations (although, some details are rather advanced mathematically. E.g., you need the Lebesgue differentiation theorem to prove that ordinary locally integrable functions define a unique distribution, but this fact is sometimes unmentioned. Also, Banach spaces and convergence in ordinary metric spaces are not enough to define distributions rigorously).