Probability measures on graph trajectories
Abstract.
The aim of this note is to construct a probability measure on the space of trajectories in a continuous time Markov chain having a finite state diagram, or more generally which admits a global bound on its degree and rates. Our approach is elementary. Our main intention is to fill a gap in the literature and to give some additional details in the proof of [CCK-fluctuation, prop. 4.8].
1. Introduction
As is well known, Markov chains model random walks on graphs. Let be a directed graph. Its set of vertices represent the states of the system and its edges indicate transitions between states. There are two flavors of random walk: those in discrete time and those in continuous time. This note will consider the continuous time variant.
The dynamics of continuous time random walk are encoded by a master equation
where is a time dependent matrix of transition rates and is a 1-parameter family of probability distributions on . The solutions to the equation describe the time evolution of probability. For vertices and , the matrix entry is the instantaneous rate of change at time in jumping from state to state along the set of edges of having initial vertex and terminal vertex . The operator is called the master operator; its off diagonal entries are non-negative and the sum of the entries in any column add to zero.
Given a continuous time Markov chain with state diagram , our goal here will be to construct a probability distribution on the space of trajectories in . By a trajectory in , we mean a path of contiguous edges equipped with jump times at each vertex of the path. Note that such a probability distribution amounts to a description of the stochastic process associated with the Markov chain.
Remark 1.1.
We apologize to the reader in advance for our somewhat unconventional treatment: two of us are algebraic topologists and one is a chemical physicist.
2. Preliminaries
For a set , let denote the set of its non-empty subsets of cardinality . An undirected graph consists of data
in which is the set of vertices, is the set of edges and
is a function. We will always assume that is locally finite in the sense that the function is a finite-to-one. With this definition multiple edges connecting a pair of distinct vertices are permitted, but we do not permit loop edges, i.e., edges which connect a vertex to itself.
A directed graph is defined in a similar way, but where now is replaced by a function , where , i.e., the cartesian product with its diagonal deleted. We write , where is is the function which assigns to a directed edge its source, respectively target. Note that the canonical map is a double cover, and the composition defines the underlying undirected graph.
Example 2.1.
Given an undirected graph , we may construct its double. This is the directed graph
in which and is the set of ordered pairs in which . The function is given by , where .
Remark 2.2.
Let be the category of undirected graphs. An object is an undirected graph and a morphism consists of functions , such that . Similarly, one has the category of directed graphs. Then we have an adjoint functor pair
where is the forgetful functor and is given by the double.
2.1. Markov chains
Let be a directed graph. A continuous time Markov chain with state diagram is an assignment of a continuous function
to each edge . The function is called the transition rate of . If , then is to be interpreted as the instantaneous rate of change of probability in jumping from to along .
Remark 2.3.
The foundational material on Markov chains can be found in the texts of Norris [Norris] and Stroock [STR14]. When the transition rates are constant, the Markov chain is said to be time homogeneous. When the rates are not constant, the chain is said to be time inhomogeneous.111In contrast with the homogeneous case, the literature on the inhomogeneous case is scant, with the known results making strong additional assumptions. The only foundational work we are of aware of that treats the time inhomogeneous case is Stroock’s text (cf. [STR14, §5.5.2]).
Remark 2.4.
The canonical map is an embedding. Given a Markov chain on , one has a canonical extension to by defining the rates to be zero on those edges which aren’t in . The Markovian dynamics of the two chains coincide. From this standpoint, there is nothing to lose by assuming that for some undirected graph .
If is infinite, we also require the following growth constraints.
Definition 2.5 (Rate Bound).
For each , there exists a constant , possibly depending on , such that
for and every .
Definition 2.6 (Degree Bound).
Let be the function which assigns to a vertex its degree, i.e., the number of edges meeting it. There is a positive integer such that
Observe that when is a finite, both conditions hold automatically.
2.2. The master equation
Then the rates define a time dependent square matrix , as follows. For , set
where the sum is interpreted as zero when is the empty set. Then the matrix entries of are given by
where the indices range over . The time dependent matrix is called the master operator. Associated with is a linear, first order ordinary differential equation
(1) |
in which is a one parameter family of (probability) distributions on the set of vertices . Equation (1) is called the (forward) Kolmogorov equation or the master equation [STR14, eqn. 5.5.2]. Its solutions describe the evolution of an initial distribution .
Remark 2.7.
If and the transition rates are constant with value 1, then is the graph Laplacian of and (1) is a combinatorial version of the heat (diffusion) equation.
Remark 2.8.
The forward Kolmogorov equation is often written in the literature in adjoint form, i.e., as
where and are the transposed matrices. The backward equation (which we will not consider here) is
2.3. Trajectories
A path of length in consists of a sequence of edges
such that for . We let
denote the -th vertex of the path, i.e., if and .
A trajectory of length and duration is a pair
such that is a path of length and is a sequence of real numbers satisfying
In what follows, it will be convenient to set
Remark 2.9.
For a vertex of the path , the number is called the jump time and the number is called wait time.
3. The probability of a trajectory
Let be as in the previous section. Given a vertex and an interval , the escape rate at is
Fix an initial probability distribution . For , set .
Let
denote the set of trajectories of having length . Define a function
by the formula
(compare [Bellac, eqn. 1.112] in the constant rate case).222The function is a discrete analogue of the Onsager-Machlup Lagrangian [Onsager-Machlup].
Consider the master equation
Let
denote the set of paths of length and let denote the subset of those paths which have terminus .
Theorem 3.1.
The formal solution to the master equation is the vector valued function whose component at is given by the expression
Proof.
Write , where is the diagonal matrix with entries . For , set . Consider the equation
(2) |
We seek a formal solution with and for . Once such a solution is found, we set to obtain the formal solution to the master equation.
For , the -th equation of the system is the first order linear differential equation
(4) |
If , the system is uncoupled and separation of variables gives
For , the solution to (4) can be iteratively solved using the integrating factor. The first iteration gives
where the second sum is over all edges with terminus and whose source is denoted in the integrand by . We then repeat the procedure using in place of , to obtain
where the sum is indexed over all paths of length two satisfying , , and . Applying this procedure a total of times results in the desired expression for . ∎
Let denote the space of trajectories of duration of arbitrary length.
Corollary 3.2.
Let denote the formal solution to the master equation. Then is a probability distribution on for every . In particular, the function is a probability density on .
Proof.
As the rate bound holds, there is a constant , independent of , such for all trajectories of length . As the degree bound holds, there is global bound on the degree function, so the number of paths of length terminating at a vertex is at most . Consequently,
where the sum ranges over paths of length with terminus . Here, we have used the fact that is the volume of the -simplex . By the comparison test, converges. Therefore also converges.
Let be the row vector which is identically one at every vertex. It will suffice to show that . Observe that , since the entries of in any column add to zero. Then for all we have
Consequently, is a constant. But , hence for all . ∎
4. Fundamental solutions
Consider the master equation with initial distribution for a fixed vertex , where is the Kronecker delta function. The solution to this equation is called a fundamental solution and will be denoted by .
In this case the density is supported on the set of trajectories with initial vertex . Let denote the subspace of trajectories which start at the vertex .
Corollary 4.1.
With respect to this assumption, the function is a probability density on .
Remark 4.2.
The general solution to the master equation with initial condition is obtained from the fundamental solutions using the identity
(5) |
Definition 4.3.
Define the propagator by
i.e., the probability of the set of trajectories of duration which start at vertex and terminate at vertex . Note the initial condition .
Setting , equation (5) becomes
which is familiar to the physics literature. By Theorem 3.1, we obtain the path integral representation
where is the set of paths of length which start at and terminate at . Furthermore, the series converges if satisfies the rate and degree bounds.
Example 4.4.
Let be an -regular graph, i.e., the number of edges meeting each vertex is . Assume that the rates are constant with value one. Then the master operator is the negative of the graph Laplacian. In this instance elementary to check that . Let be the number of paths of length in from to . Then a straightforward calculation shows
In this case, is the combinatorial heat kernel.
As is -regular, there are precisely paths of length which start at . Set
Then is the probability of the set of paths (not trajectories), which end at after -jumps, given that such paths start at (where the probability of jumping across an edge meeting any vertex is ).
Let
Then is the Poisson probability mass function with parameter . Consequently,
is the (Poisson) expected value of the random variable . Summarizing, the continuous time random walk on an -regular graph with uniform rate may be thought of as a discrete time random walk subordinated to a Poisson process (cf. [Feller, chap. X§7]).