Next: The Black-Scholes Equation
Up: thesis
Previous: Derivatives and Options
  Contents
Subsections
In this chapter, I will present the theory of stochastic processes in
an elementary manner sufficient for understanding the theory presented
in the following chapters. A reader interested in a more rigorous
approach could consult Ross[4]. This chapter mostly follows
Roepstorff[5] which has a more physical description of
stochastic processes.
This is probably the simplest example of a discrete time stochastic
process. One way of defining it would be :
which simply means that the variable
(the subscript stands for the
time) has an equal probability of increasing or decreasing by 1 at
each time step. For historical reasons, the term Brownian motion is
also used for random walks. This is usually described as a ``drunken
ant'' with a clock and a coin. At each tick of the clock, the ant
tosses the coin. If it turns out heads, the ant moves one step towards
the right and if it turns out tails it moves one step to the left.
This process is both homogeneous (since the transition probability is
only dependent on the distance between the initial and final points)
and isotropic (the transition probability is independent of the
direction of movement).
The random walk can be taken to be a Markov chain with a transition
matrix
(for one time step) where the element
stands for
the probability that an ant with initial position
will end up at
. For
time steps, the accumulated transition probability is
given by
.
It can be easily seen that the following difference equation holds
 |
(5) |
This is equivalent to
 |
(6) |
Now, if we re-scale so that the step size is
and the time step is
, the above becomes
 |
(7) |
where I have used the more intuitive but less rigorous notation where
is the probability of the ant being at
at time
(it
is important to note that these are still discrete variables).
In the limit
,
becomes
(now, of course,
and
are continuous) and
becomes
. Hence, in this limit, we get the following
equation for the one-dimensional random walk
 |
(8) |
where
is
. Since
tends to a finite limit
when
and
, we see that the
``velocity'' of the ant
. This
shows that the velocity of a particle undergoing a random walk is
infinity
.
Equation 2.4 is, of course, the diffusion equation which
should not come as too much of a surprise as diffusion is the result
of the random movement of molecules. The reader should also be
familiar with the fact that the solution of the diffusion equation
with the initial condition
is given by the
Gaussian
 |
(9) |
Since the discrete random walk actually follows a binomial
distribution, we can also think of this as following from the
classical theorem of Laplace and De Moivre about the convergence of
the binomial distribution towards the normal distribution.
We have not handled the transition from discrete variables to
continuous variables rigorously but the steps above should be
intuitively reasonable. To do the above rigorously, we only have to
identify the discrete probability
with
.
Throughout the above discussion, we have assumed that the probability
of the ant moving left or right is the same (i.e. we have assumed that
the walk is isotropic). This type of random walk is called the simple
random walk. To generalize the above discussion, we have to change
eqn. (2.1) to
 |
(10) |
where
is the probability of moving to the left and
is the
probability of moving to the right (obviously,
). The above
analysis can be applied to this system provided the limit
exists. The diffusion equation is then replaced by
 |
(11) |
where
can be interpreted as a mean drift velocity. The reader
should be easily able to verify that the solution for non-zero
is
related to the solution of the simple random walk
by
 |
(12) |
Hence, any one-dimensional random walk is equivalent to a simple
one-dimensional random walk under a Galilean transformation with
velocity given by
.
Let us suppose that the ant does not live in a one-dimensional world
but in a
-dimensional world. At each step, the ant can move in
any of
directions. If the ant now possesses a
sided fair
dice and a clock, it can walk one step in any direction with equal
probability. We again look at the limit where the step size
of the
ant tends to zero.
The resulting paths have certain interesting geometrical
properties. While these are not important for the purposes of this
thesis, they are interesting in their own right. A few of them are
described below :
- They are continuous everywhere but differentiable nowhere.
- They are recurrent (i.e. the paths will return to the origin
with probability 1) for
and transient (i.e there is a
finite probability different from 1 that the paths will return to the
origin) for
.
- The path is fractal in nature.
For more interesting properties of random walks, please see section
1.2 of Roepstorff[5].
Equation (2.3) is now replaced by
 |
(13) |
(the subscripts refer to the components of
) whose continuum
limit is
 |
(14) |
where
is now
and
is the
-dimensional
Laplacian. The solution to this equation with the initial condition
is given by
 |
(15) |
It is instructive to note that the mean square displacement
 |
(16) |
is proportional to
. In other words, the distribution varies with
time according to a
law.
We are now in a position to discuss the Wiener process which is of
fundamental importance in the theory of option pricing. This process
was first discussed by Einstein in his description of Brownian
motion and was first put into a rigorous form by Wiener.
For simplicity, we will assume
in the rest of this
discussion. This can always be achieved by rescaling the time and
space coordinates.
Consider a particle which was initially at the origin performing a
-dimensional simple random walk. In other words,
at time
. The position of the particle at
time
can be considered as a random variable
(the term random vector might be more appropriate as it emphasis the fact that
has several components). Now, if we can find the probabilities
for all
with non-zero measure,
we would have described the path followed by the particle completely.
We have actually found the answer to this in eqn. (2.11). To
make this clearer, we can express the answer in the following manner
 |
(17) |
with density
 |
(18) |
Hence,
follows a normal distribution.
A stochastic process is a map
, where
ranges over
some interval (
in this case). To properly define a
stochastic process, we need to be able to determine the probabilities
of general events. Before we can do this, it is necessary to consider
compound events of the form ``
'', where
and to devise rules that determine their
probability.
We denote the probability of a compound event by
Varying
we get the joint distribution of
the random variables
. This is
also called the distribution of the process with base
and may be
abbreviated as
where
. The distribution is said to be
finite-dimensional (of order
) since the base is finite (has
elements).
The stochastic process
is said to be the Wiener
process
if the finite-dimensional
distributions are of the form
 |
(19) |
with
given by (2.14) and if the initial
distribution is
 |
(20) |
Hence, the particle starts at the origin (in other words,
with certainty).
We can rewrite eqn. (2.15) as
 |
(21) |
(
) with density
given by
(2.14).
The fact that the right hand side of (2.17) is a product
tells us that the Wiener process is a Markov (memory-independent)
process. This is reassuring as the present state then contains all the
information that is relevant for the future which we have seen is true
for an efficient market.
By noting that
is only dependent on the combination
, we can see that
defines another Wiener process which is
indistinguishable from
except for the scale. In other words, the
Wiener process is scale invariant (provided we re-scale time
accordingly) or self-similar. This follows from the fractal nature of
Brownian motion.
Since any probability distribution is normalized, the zeroth moment of
is 1. Further, the first moment is zero as
. We have already found the second moment in
(2.12). All the above statements can be rephrased in terms of
expectation values. We see that
 |
(22) |
This might seem strange in terms of the Markovian property of the
Wiener process as the origin seems to be special. However,
ought to be interpreted as the conditional expectation of
given
the information that
(the initial condition). Hence, the
origin plays a distinguished role only by convention.
For the rest of this discussion, we will restrict ourselves to the
case
. The generalization to higher dimensions is
self-evident.
We will be particularly interested in the following expected value
 |
(23) |
where
is called the covariance matrix of the process. We
can calculate
using (2.12). Since we are assuming
that
, we obtain the simple result
. Further
is symmetrical by definition. To calculate
, we consider the two increments
and
which are independent with zero mean. Hence,
. We also note that
. Finally, we write
 |
(24) |
Hence, we obtain the final result
 |
(25) |
(the min arises due to the assumption
which can be made
without loss of generality as
is symmetric).
Now, we are in a position to define white noise as a time derivative
of the Wiener process. We note that
 |
(26) |
Hence, 
has covariance
 |
(27) |
is termed white noise which emphasizes that the Fourier
transformed covariance is constant.
A precise meaning can be given to
by considering the concept of
generalized stochastic processes. A good discussion of this concept
can be found in Roepstorff[5].
A martingale is an extension of the concept of a fair game. Let us
assume that we have a gambler who tosses a coin at each time step. If
he calls correctly, he gets $1 and loses $1 otherwise. If we
represent the winnings at time step
by
, then
. If
represents a random variable
representing the amount of money that a gambler has at time step
,
then the the expected value
is
. So, the gambler has zero expected
gain in each time step. This is exactly what is meant by a martingale.
A discrete stochastic process
is said to be a martingale
with respect to a process
if, for all
,
is called a sub-martingale with respect to
if, for
all
,
is a function of
and
where
.
is called a super-martingale with respect to
if, for
all
,
is a function of
and
where
.
While a martingale describes a fair game, the sub-martingales and
super-martingales describe favourable and unfavourable games
respectively. If
is a martingale,
, while
if
is a sub-martingale and
if
is a super-martingale.
Some simple examples of martingales are
- If
and
is a sequence of independent
centered random variables (i.e.
and
), then
is a martingale with respect to
where
and
.
- If
is a sequence of independent random
variables with
and
for all
, then
is a martingale with respect to
where
.
The first example is an important, if simple one. It shows that any
discrete, and hence the continuous, random walk is a martingale.
Martingales are extremely important in finance due to the concept of
risk-neutral valuation. This is due to the fact that the expected
growth rate of all securities in a risk-neutral world is the risk-free
interest rate. Hence,
is a martingale for all securities
in a risk-neutral world. This is why risk-neutral valuation
approach is also referred to as using an equivalent martingale
measure.
Martingales have several interesting properties and several important
limit theorems about them can be proven. These are beyond the scope of
this thesis and the interested reader should consult a good textbook on
stochastic processes such as Ross[4].
A physical way of discussing stochastic processes is through the
Langevin equation. The historical impetus for this equation arose from
Einstein's description of Brownian motion.
Langevin considered the equation of motion of a particle in a fluid
which is classically given by
 |
(28) |
where
is the coefficient of friction. He considered this
equation as correct only for the average motion of the particle. In
that case, the equation would correctly describe the motion for
relatively massive particles (since the random disturbances would be
too small to disturb them) but would only describe the long-term
motion of lighter particles. Since Brownian motion only occurs for
light particles such as pollen grains, this is quite
reasonable. Hence, he generalized this equation to
 |
(29) |
where
is a stochastic process with zero mean and covariance
![\begin{displaymath}
E[F(t)F(s)] = 2D\delta(t-s)
\end{displaymath}](img250.png) |
(30) |
The reader should be able to see that equation (2.28) can
also be written as
 |
(31) |
where
is white noise. We can solve this equation by transforming
to the variable
. We obtain
 |
(32) |
which can be easily solved to obtain
which gives
![\begin{displaymath}
v \sim N\left(v_0 e^{-\gamma T/M}, \frac{D}{M\gamma} \left[1-e^{-2\gamma
t/M} \right]\right)
\end{displaymath}](img256.png) |
(33) |
We now solve the Langevin equation formally and check that the
solution gives us the same expected value and variance as the solution
above. The formal solution for the Langevin equation gives
 |
(34) |
so that
![\begin{displaymath}
E[v(t)] = v_0e^{-\gamma t/M} + \frac{1}{M}\int_0^t
e^{\frac{\gamma}{M}(t-\tau)} E[F(\tau)]d\tau = v_0e^{-\gamma t/M}
\end{displaymath}](img258.png) |
(35) |
and
 |
(36) |
Hence, we see that the expectation and the variance calculated using
both the methods are the same as should be the case.
As
, the particle attains equilibrium with its
surroundings. Hence, the velocity distribution should be Maxwellian
 |
(37) |
which when compared to the solution above yields
which is
the Einstein relation.
The stochastic differential equation for the logarithm of the stock
price
 |
(38) |
where
is a Wiener process can also be considered as a Langevin
equation
 |
(39) |
(which, after changing variables to
and setting
(implying zero viscosity) and
, is the same as the classical Langevin
equation) Equation (2.37) can be readily solved to yield
. We
can formally solve equation (2.38) as above to obtain
 |
(40) |
from which we get
![\begin{displaymath}
E[x] = \left(r-\frac{\sigma^2}{2}\right) t + \frac{\sigma}{\...
...int_0^t E[F(\tau)] d\tau = \left(r-\frac{\sigma^2}{2}\right) t
\end{displaymath}](img271.png) |
(41) |
and
![\begin{displaymath}
E\left[\left(x-\left(r-\frac{\sigma^2}{2}\right)t\right)^2\r...
...int_0^t \int_0^t E[F(\tau)F(\eta)] d\tau \,d\eta =
\sigma^2 t
\end{displaymath}](img272.png) |
(42) |
The main importance of the Langevin equation is that it gives us a
different way of considering stochastic processes. They can also
sometimes suggest better methods of solving stochastic differential
equations. Mathematically, the two approaches are, of course,
equivalent.
Next: The Black-Scholes Equation
Up: thesis
Previous: Derivatives and Options
  Contents
Marakani Srikant
2000-08-15