Stochastic processes

A stochastic process is a sequence of events in which the outcome of any stage depends on some probability. A stochastic process is also called a random process.The stochastic process is said to be continuous, if i.e. with probability one, all of its trajectories are continuous functions in t.

There are many ways in which the stochastic processes can be classified. For example, they can be classified in terms of boundedness, continuity etc., i.e. properties of their sample trajectories. They could also be described with respect to the properties of their distribution functions. However, we find it important to classify the stochastic processes according to their memory. We classify these processes as discrete time stochastic processes and Markovian data processes. The Markov processes are processes with short memory where the process has memory only at its immediate past. With this in mind, in our stochastic approximation process, we approximate a discrete time stochastic process with a scenario tree and a Markovian data process with a scenario lattice.

Stochastic function

This is a user-defined function representing the stochastic process in which she/he wants to approximate. This is the function that will generate trajectories that helps to improve the states of the nodes in the scenario tree or scenario lattice. To be able to do a good approximation, the user of this package should be aware of the number of stages in the scenario tree or scenario lattice that she/he is working on and more importantly, the dimension of the states of nodes in the tree or the scenario lattice. Why is this important? For each iteration, the stochastic approximation algorithm generates one trajectory from the stochastic function that will improve one path in the scenario tree. Therefore, knowing the number of stages and the dimension of the states will help the user to design a stochastic function that generates a trajectory with the same length as the number of stages in the scenario tree and with the same dimension as the dimension of the states in the scenario tree.

We have the following examples in our package:

Gaussian random walk,
Running Maximum process, and,
Sample Gaussian stock paths

We have created the first two example functions in 1 and 2 dimensions. What we mean by dimension is that, a function in 1 dimension generates just one array. Generally, a function in d-dimension generates d arrays. As stated before, the length of the array and the dimension of the array naturally depends on the characteristics of the scenario tree or scenario lattice that you are using.

The position $X_t$ of a Gaussian random walk after $t$ steps is

\[X_t = \sum_{k=1}^{t} Y_k\]

where the random variables $(Y_k)_{k\geq 1}$ are independent and identically distributed and each $Y_k$ follows a normal distribution. For the running maximum process, we take the maximum of each two consecutive numbers in a sample drawn from a normal distribution. Mathematically,

\[M_t = \max \{\sum_{t=1}^{t^\prime} \xi_t : t^\prime \leq t\} \ \text{with} \ \xi_t \sim N(0,1).\]

We insist that the user of this package must write a function that takes no inputs. This function should be, in a way, a hard-coded function that when you call, it produces an array with a certain user pre-determined characteristics.

The above two processes in our package are in 4 stages and therefore can be used with a scenario tree or a scenario lattice with 4 stages. As already stated, the stages in a scenario tree or a scenario lattice starts from 1 to T. Each of these processes generates samples with a common starting point since a stochastic process has a deterministic starting value and the rest of the values are random.

It is important to create a function that generates samples that are dependent on each other in d dimension. Most stochastic processes have states which depends on each other and so it is always important to consider dependency of the samples generated by the stochastic function.

The following shows sample trajectories of our example functions above:

julia> gaussian_path1D() # 4 stages, 1 dimension
[0.0, -0.1566, -1.5676, -1.2084]

julia> gaussian_path1D()
# 2nd run - same starting value, different other values (randomness)
[0.0, -2.1779, -2.4299, -1.3796]

julia> gaussian_path2D() # 4 stages, 2 dimension
[1.0, 0.6968, 2.2257, 3.3988 ; 0.0, -0.3676, -0.1558, -0.2629]

julia> running_maximum1D()
[0.0, 0.0, 0.5574, 0.9278]

julia> running_maximum2D()
[0.0, 2.7430, 2.3931, -0.7218; 0.0, 0.4331, 0.4331, 0.3281]

julia> path() # example of stock prices
[100.0, 78.7084, 109.4443, 74.3815]

Notes

The user of this package must provide this function.
The function must take no inputs. This is so important for the stochastic approximation algorithm.
The length of an array produced by the function must be equal to the number of stages in the scenario tree or the scenario lattice.
The dimension of the array produced must be the same as the dimension of the states of the scenario tree or scenario lattice.
The array should have a common deterministic starting point or be in a certain range of values.
The array produced should be a N x d Array{Float64,2} where N is the number of stages and d is the dimension. This is important for the stochastic approximation process.
In higher dimension, depending on the user, the arrays could be dependent on each other.

Up to this point, the user can now create a random scenario tree and look at its characteristics as well as write a process function that generates trajectories for the stochastic approximation process. It is important to write a function that follows the above criterion as this becomes very important in the stochastic approximation process.

Having looked on the scenario trees, scenario lattices and stochastic processes, we are now ready to have a look on the stochastic approximation process. This is in our next tutorial.