The name Brownian motion comes from the guy named Brown, and his microscope observation of spores moving in the liquid. Einstein got Nobel prize for the theory. The physics are simple, as Einstein put it:
Soon after the appearance of my paper (*) on the movements of
particles suspended in liquids demanded by the molecular theory of
heat, Siedentopf (of Jena) informed me that he and other physicists-in
the first instance, Prof. Gouy (of Lyons)-had been convinced by direct
observation that the so-called Brownian motion is caused by the
irregular thermal movements of the molecules of the liquid.
This is how Zeldovich and Myshkis set up the simple theoretical framework as follows:
Along the avenue on which the x-axis, there is a lot of stores
equally spaced at $h$ distance from one another. From one of them serving
as the reference point, the lady runs out: she only just heard that
somewhere there's a sale on French umbrellas. She randomly chooses the direction along
the alley, in a time $\tau$ she runs to a nearby store. Hearing that there
are no French umbrellas, she forgets where she came from, and
runs again at random to one of two sides. After the time $\tau$ she
again gets a negative response and, once again losing the sense of direction,
randomly selects the direction and so on. Where will she end up at time $n\tau$?
(For simplicity, we assume that the rumor was
false, so the search for French umbrellas goes on infinitely.
Then next passage sets the expectations of what we can get from the model:
Of course, it is impossible to predict exactly, one can
only determine the probability of the fact that the lady at the time $n\tau$
will have a certain coordinate.
Note, how they emphasized the fact the Lady was a bit out of her mind and sense of direction: she doesn't remember where she came from. That's the crucial detail: each move is independent of each other. Then the central limit theorem (CLT) kicks in, so you have the $x(n\tau)=\sum_{i=1}^nx_i=\sum_{i=1}^n(-1)^\xi_ih$, where $\xi_i$ - bernulli process with $p=1/2$. According to CLT this must converge to some kind of normal distrobution.
What's the problem with "modeling volatility"? It is a strong assumption to state that the volatility is constant. In economics and finance it fails again and again. Think of volatility as a noise "level", it doesn't have to remain constant and it doesn't. What guys like Heston noticed, was that volatility seems to be stationary and persistent, hence the family of stochastic volatility models.