This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Dynamical large deviations of reflected diffusions

Johan du Buisson [email protected] Institute of Theoretical Physics, Department of Physics, Stellenbosch University, Stellenbosch 7600, South Africa    Hugo Touchette [email protected], [email protected] Department of Mathematical Sciences, Stellenbosch University, Stellenbosch 7600, South Africa
Abstract

We study the large deviations of time-integrated observables of Markov diffusions that have perfectly reflecting boundaries. We discuss how the standard spectral approach to dynamical large deviations must be modified to account for such boundaries by imposing zero-current conditions, leading to Neumann or Robin boundary conditions, and how these conditions affect the driven process, which describes how large deviations arise in the long-time limit. The results are illustrated with the drifted Brownian motion and the Ornstein–Uhlenbeck process reflected at the origin. Other types of boundaries and applications are discussed.

Brownian motion, reflected diffusions, large deviations

I Introduction

The use of stochastic differential equations (SDEs) for modelling noisy diffusive systems often requires that we specify the location of boundaries or “walls” when the system of interest evolves in a confined space or has a state that is inherently bounded Gardiner (1985); Schuss (2013); Bressloff (2014). Examples include molecules diffusing in cells Bressloff (2014); Grebenkov (2007a); Holcman and Schuss (2017), particle transport in porous media Grebenkov (2007a), as well as diffusive limits of population Bressloff (2014) and queueing dynamics Giorno et al. (1986); Ward and Glynn (2003); Linetsky (2005) which have a positive state. In each case, one must also define what happens when a boundary is reached by specifying a boundary type or condition on the density ρ\rho and current JJ entering in the Fokker–Planck equation Gardiner (1985). Reflecting boundaries, for instance, are defined by requiring J=0J=0 at the boundary, whereas absorbing boundaries, related to extinctions in population models, are such that ρ=0\rho=0 at the boundary. Other types of boundaries are possible, including partially reflective Singer et al. (2008), reactive Erban and Chapman (2007); Chapman et al. (2016); Pal et al. (2019), and sticky Harrison and Lemoine (1981); Bou-Rabee and Holmes-Cerfon (2020); Engelbert and Peskir (2014), and arise in biological and chemical applications.

Many studies, starting with Feller Feller (1952, 1954); Peskir (2015), have looked at the effect of boundaries on SDEs at the level of probability distributions (time-dependent or stationary) and mean first-passage times Schuss (2013); Bressloff (2014); Grebenkov (2007a). In this paper, we investigate this effect on the long-time large deviations of time averages of the form

ST=1T0Tf(Xt)𝑑t,S_{T}=\frac{1}{T}\int_{0}^{T}f(X_{t})dt, (1)

where ff is some function of the state XtX_{t} of a bounded SDE. The random variable STS_{T} can be related, depending on the system considered, to various physical quantities that are integrated over time, and is called for this reason a dynamical observable Sekimoto (2010); Seifert (2012); Touchette (2018). For simplicity, we study one-dimensional systems, so that XtX_{t} evolves in a closed interval [a,b][a,b] of \mathbb{R} and consider perfect reflections at the endpoints aa and bb 111We do not consider observables involving integrals of the increments of XtX_{t}, as they are trivial for one-dimensional diffusions.. Other types of boundaries are discussed in the conclusion.

Large deviations have been studied before for reflected SDEs, in particular, by Grebenkov Grebenkov (2007b), Forde et al. Forde et al. (2015), and Fatalov Fatalov (2017), who obtain the rate function of various functionals of reflected Brownian motion, including its area and the residence time at a reflecting point. Pinsky Pinsky (1985a, b, c, d) and Budhiraja and Dupuis Budhiraja and Dupuis (2003) also study the large deviations of bounded diffusions, but do so at the level of empirical densities, the so-called “level 2” of large deviations, rather than time averages, which corresponds to “level 1” Touchette (2009). Finally, many studies Dupuis (1987); Sheu (1998); Majewski (1998); Ignatyuk et al. (1994); Ignatiouk-Robert (2005); Bo and Zhang (2009) consider escape-type events occurring in the low-noise limit, which fall within the Freidlin–Wentzell theory of large deviations Freidlin and Wentzell (1984). In this case, the rare events of interest typically involve the state XtX_{t} at a fixed or random time rather than time averages of XtX_{t}, as in (1).

In this work, we focus on the long-time limit and extend the studies above by deriving the reflective boundary conditions of the spectral problem that underlies the calculation of dynamical large deviations Touchette (2018). Our results clarify the source of the boundary conditions used in Grebenkov (2007b); Fatalov (2017) and extend them to more general SDEs and observables. We also investigate how the presence of reflecting boundaries affects the driven process, introduced in Chetrite and Touchette (2013, 2015a, 2015b) to explain, via a modified SDE, how fluctuations of STS_{T} away from its typical value are created in time. The main result that we obtain for this process, which is also called the auxiliary or effective process Evans (2004); Jack and Sollich (2010, 2015), is that its drift generally differs from the drift of the original SDE everywhere except at the boundaries, due to the J=0J=0 condition which is also satisfied by the driven process.

These results can be applied to study the large deviations of many equilibrium and nonequilibrium diffusions, including manipulated Brownian particles, which necessarily evolve in a confined environment and so can interact with walls. As illustrations, we consider two simple reflected diffusions, namely, the reflected Ornstein-Uhlenbeck process, which models the dynamics of an underdamped Brownian particle pulled linearly towards a reflecting wall as well as the dynamics of queueing systems in the heavy-traffic regime Ward and Glynn (2003), and the reflected Brownian motion with negative drift, which models a Brownian particle pulled to a wall by a constant force such as gravity. Other applications, boundary types, and open problems are discussed in the conclusion.

II Reflected diffusions

We consider a one-dimensional Markov diffusion (Xt)t0(X_{t})_{t\geq 0} defined by the SDE

dXt=F(Xt)dt+σdWt,dX_{t}=F(X_{t})dt+\sigma dW_{t}, (2)

which we restrict to the interval [a,b][a,b] with a<ba<b and either aa or bb (or both) finite. The function F(x)F(x) is called the force or drift, and is assumed to be such that the boundaries of [a,b][a,b] are reachable in finite time from the interior of this interval (regular boundaries) Karlin and Taylor (1981). The constant σ>0\sigma>0 is the noise amplitude multiplying the increments of the Brownian motion WtW_{t}\in\mathbb{R}, representing in SDE form a Gaussian white noise. The more general case where σ\sigma depends on XtX_{t} can be covered using the methods explained in Chetrite and Touchette (2015a).

To complete the model, we must specify the behavior of the process at the boundaries of [a,b][a,b]. Mathematically, this can be done at the level of the SDE or at the level of the Fokker–Planck equation

tρ(x,t)=xF(x)ρ(x,t)+σ22xxρ(x,t),\partial_{t}\rho(x,t)=-\partial_{x}F(x)\rho(x,t)+\frac{\sigma^{2}}{2}\partial_{xx}\rho(x,t), (3)

which governs the evolution of the time-dependent probability density ρ(x,t)\rho(x,t) of XtX_{t}, starting from some initial density ρ(x,0)\rho(x,0) for X0X_{0}. Rewriting this equation as a conservation equation

tρ(x,t)=xJF,ρ(x,t),\partial_{t}\rho(x,t)=-\partial_{x}J_{F,\rho}(x,t), (4)

we identify the probability current

JF,ρ(x,t)=F(x)ρ(x,t)σ22xρ(x,t).J_{F,\rho}(x,t)=F(x)\rho(x,t)-\frac{\sigma^{2}}{2}\partial_{x}\rho(x,t). (5)

Perfect reflections are then imposed by requiring that this current vanish at aa and bb (approaching these points from the interior). Thus,

JF,ρ(a+,t)=0=JF,ρ(b,t)J_{F,\rho}(a^{+},t)=0=J_{F,\rho}(b^{-},t) (6)

at all times tt, where a+=a+0a^{+}=a+0 and b=b0b^{-}=b-0. This follows, as is well known Gardiner (1985), because a reflecting boundary cannot be crossed, so the normal component of the probability current at the boundary, which in one dimension is simply the current itself, must be equal to zero.

If XtX_{t} is ergodic, we also have in the long-time limit

JF,ρ(a+)=0=JF,ρ(b),J_{F,\rho^{*}}(a^{+})=0=J_{F,\rho^{*}}(b^{-}), (7)

where ρ(x)\rho^{*}(x) is the unique stationary distribution of the Fokker–Planck equation 222There is no time variable at this point because we are dealing with the stationary regime.. For one-dimensional diffusions, we have in fact JF,ρ(x)=0J_{F,\rho^{*}}(x)=0 not just at the boundary points but for all x[a,b]x\in[a,b], since the stationary current is constant throughout space in this case. As a result, ρ\rho^{*} must be an equilibrium density having the Gibbs form

ρ(x)=ce2U(x)/σ2,\rho^{*}(x)=c\,e^{-2U(x)/\sigma^{2}}, (8)

where U(x)U(x) is the potential associated with the force by F(x)=U(x)F(x)=-U^{\prime}(x) and cc is a normalization constant.

Perfect reflections at the boundaries can also be imposed directly at the level of XtX_{t} by adding to the SDE a new “noise” term given by the increment of the “local time” at the boundaries, which essentially represents the amount of time that XtX_{t} spends near aa or bb Schuss (2013). This mathematical construction, due to Skorokhod Skorokhod (1961), provides a rigorous way to study reflections in diffusions, but will not be used here as it is too abstract for our purposes. Instead, we think of XtX_{t} as evolving on \mathbb{R} in discrete time according to the Euler–Maruyama scheme

Xt+Δt=Xt+F(Xt)Δt+σΔtZ,X_{t+\Delta t}=X_{t}+F(X_{t})\Delta t+\sigma\sqrt{\Delta t}Z, (9)

and we simply reflect the update Xt+ΔtX_{t+\Delta t} back inside [a,b][a,b] whenever it falls outside this interval Schuss (2013), as illustrated in Fig. 1. Here, Δt\Delta t is the discretized time step while ZN(0,1)Z\sim N(0,1) is a standard normal random variable.

Refer to caption
Figure 1: Mechanical reflection of the Langevin dynamics at a boundary. If the state XtX_{t} crosses the boundary aa during one time step, then it is reflected back to [a,b][a,b] in a mirror-like way with respect to aa. A similar reflection is applied to the boundary at bb. This reflection rule assumes that the reflected point still falls within [a,b][a,b], which is the case if Δt\Delta t is sufficiently small. The boundary layer is shown in grey.

The typical distance Δx\Delta x from a boundary within which XtX_{t} can cross it in a single time step defines an exclusion zone, called the boundary layer, which represents an artefact or error of the Euler–Maruyama scheme. A similar boundary layer is found if one simulates the reflections in a “soft” way by adding a fictitious potential to U(x)U(x) to create strong repulsive walls at aa and bb Menaldi (1983); Pettersson (1997); Kanagawa and Saisho (2000). In both cases, the thickness of the layer generally decreases to 0 as Δt0\Delta t\rightarrow 0, so they give a good approximation of the dynamics of XtX_{t} when Δt\Delta t is sufficiently small. In this limit, it should also be clear that a “particle” with state XtX_{t} entering the boundary layer will leave it instantaneously, so that the net number of crossings at the layer is 0. Viewing ρ(x,t)\rho(x,t) as representing the density of an ensemble of such particles and JF,ρ(x,t)J_{F,\rho}(x,t) as their flux, we then recover the zero-current condition (6) when the layer disappears.

III Markov operators

For the results to come, it is important to rewrite the Fokker–Planck equation in operator form as

tρ(x,t)=ρ(x,t)\partial_{t}\rho(x,t)=\mathcal{L}^{\dagger}\rho(x,t) (10)

in order to identify the Fokker–Planck operator

=xF+σ22xx\mathcal{L}^{\dagger}=-\partial_{x}F+\frac{\sigma^{2}}{2}\partial_{xx} (11)

as the generator of the time evolution of ρ(x,t)\rho(x,t), starting from some initial density ρ(x,0)\rho(x,0). The domain 𝒟()\mathcal{D}(\mathcal{L}^{\dagger}) of this linear operator is naturally the set of all probability densities that (i) can be normalized on [a,b][a,b]; (ii) satisfy the zero-current condition (6), which has the form of a Robin (mixed) boundary condition on ρ\rho; and (iii) are twice-differentiable, since \mathcal{L}^{\dagger} is a second-order differential operator.

Dual to \mathcal{L}^{\dagger} is the Markov generator

=Fx+σ22xx\mathcal{L}=F\partial_{x}+\frac{\sigma^{2}}{2}\partial_{xx} (12)

which governs the evolution of expectations according to

tE[g(Xt)]=E[g(Xt)],\partial_{t}E[g(X_{t})]=E[\mathcal{L}g(X_{t})], (13)

where gg is a test function and E[]E[\cdot] denotes the expectation with respect to ρ(x,t)\rho(x,t). The two generators are dual or adjoint to each other in the sense that

ρ,g=ρ,g\langle\mathcal{L}^{\dagger}\rho,g\rangle=\langle\rho,\mathcal{L}g\rangle (14)

with respect to the standard inner product used in the theory of Markov processes, namely,

ρ,g=abρ(x)g(x)𝑑x=E[g(X)],\langle\rho,g\rangle=\int_{a}^{b}\rho(x)g(x)\,dx=E[g(X)], (15)

where ρ\rho is an arbitrary density in 𝒟()\mathcal{D}(\mathcal{L}^{\dagger}) and gg is a test function. From this definition, as well as the expressions (11) and (12) for \mathcal{L}^{\dagger} and \mathcal{L}, we find

ρ,g=ρ,gg(x)JF,ρ(x)|abσ22ρ(x)g(x)|ab\langle\mathcal{L}^{\dagger}\rho,g\rangle=\langle\rho,\mathcal{L}g\rangle-g(x)J_{F,\rho}(x)\big{|}_{a}^{b}-\frac{\sigma^{2}}{2}\rho(x)g^{\prime}(x)\big{|}_{a}^{b} (16)

via integration by parts. Given that the current JF,ρJ_{F,\rho} vanishes at the boundaries, we must then require that the test functions gg acted on by \mathcal{L} satisfy

g(a+)=0=g(b)g^{\prime}(a^{+})=0=g^{\prime}(b^{-}) (17)

in order for the boundary term in (16) to vanish and, thus, for the operators \mathcal{L} and \mathcal{L^{\dagger}} to be proper duals defined independently of any specific ρ\rho or gg Nagasawa (1961). From this result, the domain 𝒟()\mathcal{D}(\mathcal{L}) of \mathcal{L} is then defined to be the set of test functions that (i) have finite expectation (finite inner product); (ii) satisfy the zero-derivative condition (17), which is a Neumann boundary condition; and (iii) are twice differentiable.

IV Large deviations

We now come to the main point of our work, namely, to understand how reflecting boundaries determine the large deviations of dynamical observables. To this end, we briefly recall the definition of the rate function, which characterises the probability distribution of STS_{T} in the long-time limit, and present the spectral problem underlying the calculation of this function. We then explain how the boundary conditions normally applied to this spectral problem in the case of unbounded diffusions must be modified to account for perfect reflections, and how these new boundary conditions affect the properties of the driven process, which explains how fluctuations of STS_{T} arise in time.

IV.1 Large deviation functions

We consider an ergodic reflected diffusion XtX_{t}, as defined in the previous section, and a dynamical observable STS_{T} of this process having the form (1). We are interested in finding the rate function of STS_{T}, defined by the limit

I(s)=limT1TlnPT(s),I(s)=\lim_{T\rightarrow\infty}-\frac{1}{T}\ln P_{T}(s), (18)

where PT(s)P_{T}(s) is the probability distribution of STS_{T}. In practice, it is very difficult to obtain this distribution exactly, which is why the rate function is sought instead. Indeed, for a large class of Markov processes and observables, it can be shown that PT(s)eTI(s)P_{T}(s)\approx e^{-TI(s)} with sub-exponential corrections in TT, so that I(s)I(s) effectively describes the shape of PT(s)P_{T}(s) at leading order in TT Touchette (2009). This holds in the limit of large integration times TT, typically much longer than the relaxation time-scale of XtX_{t}.

To calculate the rate function, we use the fact that it is dual to another large deviation function, called the scaled cumulant generating function (SCGF) Dembo and Zeitouni (1998) and defined as

λ(k)=limT1TlnE[eTkST],k.\lambda(k)=\lim_{T\rightarrow\infty}\frac{1}{T}\ln E[e^{TkS_{T}}],\quad k\in\mathbb{R}. (19)

Using the Feynman–Kac formula, it can be shown Touchette (2018) that this function coincides with the dominant (real) eigenvalue of a linear operator, called the tilted generator, having the form

k=+kf,\mathcal{L}_{k}=\mathcal{L}+kf, (20)

where \mathcal{L} is the generator of XtX_{t} and ff is the function appearing in the definition (1) of the observable STS_{T}. Having the SCGF, we then obtain I(s)I(s) by taking a Legendre transform, so that

I(s)=k(s)sλ(k(s)),I(s)=k(s)s-\lambda(k(s)), (21)

where k(s)k(s) is the unique root of λ(k)=s\lambda^{\prime}(k)=s. This essentially holds if λ(k)\lambda(k) is differentiable and strictly convex.

The calculation of the rate function thus reduces to solving the spectral problem

krk(x)=λ(k)rk(x),\mathcal{L}_{k}r_{k}(x)=\lambda(k)r_{k}(x), (22)

where rkr_{k} is the eigenfunction of k\mathcal{L}_{k} corresponding to the dominant eigenvalue λ(k)\lambda(k). Given that k\mathcal{L}_{k} is not in general Hermitian, this spectral problem must be solved in conjunction with its dual

klk(x)=λ(k)lk(x),\mathcal{L}_{k}^{{\dagger}}l_{k}(x)=\lambda(k)l_{k}(x), (23)

where lkl_{k} is the eigenfunction of k=+kf\mathcal{L}_{k}^{\dagger}=\mathcal{L}^{\dagger}+kf with the same dominant eigenvalue λ(k)\lambda(k). The boundary conditions that we must impose on these two spectral equations to obtain λ(k)\lambda(k) are discussed next. Independently of these conditions, rkr_{k} and lkl_{k} are dual functions with respect to the standard inner product (15), so they must satisfy lk,rk<\langle l_{k},r_{k}\rangle<\infty. They are also positive functions, since they are the dominant modes of k\mathcal{L}_{k} and k\mathcal{L}_{k}^{\dagger}, respectively. In the literature Chetrite and Touchette (2015a), it is common to normalize them in such a way that

ablk(x)rk(x)𝑑x=lk,rk=1\int_{a}^{b}l_{k}(x)r_{k}(x)\,dx=\langle l_{k},r_{k}\rangle=1 (24)

and

ablk(x)𝑑x=lk,1=1.\int_{a}^{b}l_{k}(x)\,dx=\langle l_{k},1\rangle=1. (25)

The latter integral is consistent, as we will see, with the fact that k\mathcal{L}_{k}^{\dagger} is related to the Fokker–Planck operator acting on normalized densities.

IV.2 Boundary conditions

The boundary conditions that must be imposed on rkr_{k} and lkl_{k} to solve the spectral equations (22) and (23) are determined by considering, as done before, the boundary term that arises when transforming k\mathcal{L}_{k} to its dual k\mathcal{L}_{k}^{\dagger}. Since these operators differ from \mathcal{L} and \mathcal{L}^{\dagger}, respectively, only by the constant term kfkf, which produces no boundary terms of its own, the result of (16) can readily be used to obtain

klk,rk=lk,krkrk(x)JF,lk(x)|abσ22lk(x)rk(x)|ab,\langle\mathcal{L}_{k}^{\dagger}l_{k},r_{k}\rangle=\langle l_{k},\mathcal{L}_{k}r_{k}\rangle-r_{k}(x)J_{F,l_{k}}(x)\big{|}_{a}^{b}-\frac{\sigma^{2}}{2}l_{k}(x)r^{\prime}_{k}(x)\big{|}_{a}^{b}, (26)

where

JF,lk(x)=F(x)lk(x)σ22lk(x)J_{F,l_{k}}(x)=F(x)l_{k}(x)-\frac{\sigma^{2}}{2}l^{\prime}_{k}(x) (27)

is the probability current associated with lkl_{k}.

For diffusions evolving in \mathbb{R} without boundaries, we see that the boundary term in (26) vanishes for any rk(x)r_{k}(x) and lk(x)l_{k}(x) if the latter eigenfunction decays to 0 sufficiently fast as |x||x|\rightarrow\infty. This is consistent with the normalization integral in (25) and implies with lk>0l_{k}>0 that both lk(x)l^{\prime}_{k}(x) and JF,lk(x)J_{F,l_{k}}(x) vanish as |x||x|\rightarrow\infty. There is no condition on rkr_{k} alone, as such, since the Markov generator \mathcal{L}, and by extension k\mathcal{L}_{k}, have no natural boundary conditions, except for the fact that both act on functions that have finite inner product, which for k\mathcal{L}_{k} translates into the integral condition in (24), normalized to 1.

For reflected diffusions, the normalization integrals (24) and (25) are no longer sufficient on their own to define boundary conditions on rkr_{k} and lkl_{k}. Instead, we must impose

JF,lk(a+)=0=JF,lk(b)J_{F,l_{k}}(a^{+})=0=J_{F,l_{k}}(b^{-}) (28)

and

rk(a+)=0=rk(b)r_{k}^{\prime}(a^{+})=0=r_{k}^{\prime}(b^{-}) (29)

in order for the boundary term in (26) to vanish and, importantly, for k\mathcal{L}_{k}^{\dagger} and k\mathcal{L}_{k} to have boundary conditions that are consistent with those imposed on =k=0\mathcal{L}^{\dagger}=\mathcal{L}^{\dagger}_{k=0} and =k=0\mathcal{L}=\mathcal{L}_{k=0}, respectively, when k=0k=0. The same conditions also follow by noticing again that the boundary term in the duality of k\mathcal{L}_{k}^{\dagger} and k\mathcal{L}_{k} does not depend on kfkf, so that (28) simply extends the zero-current condition of \mathcal{L}^{\dagger} to k\mathcal{L}_{k}^{\dagger}, while (29) extends the Neumann boundary condition of \mathcal{L} to k\mathcal{L}_{k}. As a result, 𝒟(k)=𝒟()\mathcal{D}(\mathcal{L}_{k}^{\dagger})=\mathcal{D}(\mathcal{L}^{\dagger}) and 𝒟(k)=𝒟()\mathcal{D}(\mathcal{L}_{k})=\mathcal{D}(\mathcal{L}). These boundary conditions must be imposed, incidentally, not just on the eigenfunctions related to the dominant eigenvalue, but to all eigenfunctions, thereby determining the full spectrum of k\mathcal{L}_{k} which is conjugate to the spectrum of k\mathcal{L}_{k}^{\dagger}. For large deviation calculations, however, we only need the dominant eigenvalue, which is real, and the corresponding eigenfunctions, which are positive.

IV.3 Driven process

While the SCGF and the rate function describe the fluctuations of STS_{T}, they do not provide any information about how these fluctuations are created in time. Recently, it has been shown Chetrite and Touchette (2013, 2015a, 2015b) that this information is provided by a modified Markov process X^t\hat{X}_{t}, called the effective or driven process, which represents in some approximate way the original diffusion XtX_{t} conditioned on the fluctuation ST=sS_{T}=s and so describes the paths of XtX_{t} that lead to or create that fluctuation.

The details of this process can be found in many studies Chetrite and Touchette (2013, 2015a, 2015b), which provide a full derivation of its properties and interpretation. In the context of ergodic diffusions, the driven process satisfies the new SDE

dX^t=Fk(X^t)dt+σdWt,d\hat{X}_{t}=F_{k}(\hat{X}_{t})dt+\sigma dW_{t}, (30)

where the driven force FkF_{k} is a modification of the force FF given in terms of the dominant eigenfunction rkr_{k} by

Fk(x)=F(x)+σ2rk(x)rk(x).F_{k}(x)=F(x)+\sigma^{2}\frac{r_{k}^{\prime}(x)}{r_{k}(x)}. (31)

The parameter kk is the same as that entering in the SCGF: its value is set for a given fluctuation ST=sS_{T}=s according to the Legendre transform (21) as the root of λ(k)=s\lambda^{\prime}(k)=s or, equivalently, as k=I(s)k=I^{\prime}(s) 333These two relations, which follow from the Legendre transform (21), hold when I(a)I(a) is strictly convex Chetrite and Touchette (2015a).. With this choice, the stationary density of the driven process, which is known to be given (Chetrite and Touchette, 2015a, Sec. 5.4) by

ρk(x)=rk(x)lk(x),\rho_{k}^{*}(x)=r_{k}(x)l_{k}(x), (32)

is such that

abρk(x)f(x)𝑑x=s.\int_{a}^{b}\rho_{k}^{*}(x)f(x)\,dx=s. (33)

This shows that we can also interpret the driven process as a change of process that transforms the fluctuation ST=sS_{T}=s into a typical value realized by X^t\hat{X}_{t} in the ergodic limit. The change of drift and density also modifies the stationary current to

JFk,ρk(x)=Fk(x)ρk(x)σ22ρk(x).J_{F_{k},\rho_{k}^{*}}(x)=F_{k}(x)\rho_{k}^{*}(x)-\frac{\sigma^{2}}{2}\rho_{k}^{*}(x)^{\prime}. (34)

Note that for k=0k=0, rk=0=1r_{k=0}=1 while lk=0=ρl_{k=0}=\rho^{*}, leading to Fk=0=FF_{k=0}=F, ρk=0=ρ\rho_{k=0}^{*}=\rho^{*} and JFk=0,ρk=0=JF,ρJ_{F_{k=0},\rho^{*}_{k=0}}=J_{F,\rho^{*}}.

For an ergodic diffusion XtX_{t} evolving on [a,b][a,b] with reflecting boundaries, the driven process X^t\hat{X}_{t} also evolves on [a,b][a,b], since it represents a conditioning of XtX_{t}, and inherits for the same reason a zero-current boundary conditions at x=a+x=a^{+} and x=bx=b^{-}, given in terms of the driven current by

JFk,ρk(a+)=0=JFk,ρk(b).J_{F_{k},\rho_{k}^{*}}(a^{+})=0=J_{F_{k},\rho_{k}^{*}}(b^{-}). (35)

This can be verified, in fact, independently of the interpretation of X^t\hat{X}_{t} by applying the boundary conditions on rkr_{k} and lkl_{k} discussed previously to the driven process. First, note that the Neumann boundary conditions (29) on rkr_{k} implies with the definition of the driven force (31) that the latter is constrained to satisfy

Fk(a+)=F(a+)andFk(b)=F(b).F_{k}(a^{+})=F(a^{+})\quad\textrm{and}\quad F_{k}(b^{-})=F(b^{-}). (36)

Hence, the drift at the boundaries is not modified at the level of the driven process, although it is modified in the interior of [a,b][a,b], as we will see in the next section with specific examples. This is a significant result of our work, which also implies that any density ρ\rho satisfying a zero-current condition at the boundaries with respect to the original drift must also satisfy a zero-current condition at the boundaries with respect to the driven force. In other words, JF,ρ(a+)=0J_{F,\rho}(a^{+})=0 is equivalent to JFk,ρ(a+)=0J_{F_{k},\rho}(a^{+})=0 and similarly for x=bx=b^{-}.

Next, we note that the boundary conditions (28) and (29) can be combined as

rk(x)JF,lk(x)|x=a+,b=0r_{k}(x)J_{F,l_{k}}(x)\big{|}_{x=a^{+},b^{-}}=0 (37)

and

σ22lk(x)rk(x)|x=a+,b=0.\frac{\sigma^{2}}{2}l_{k}(x)r_{k}^{\prime}(x)\big{|}_{x=a^{+},b^{-}}=0. (38)

We recognize these as the boundary terms in (26), which can be combined to obtain

[rk(x)JF,lk(x)σ22lk(x)rk(x)]x=a+,b=0.\left[r_{k}(x)J_{F,l_{k}}(x)-\frac{\sigma^{2}}{2}l_{k}(x)r_{k}^{\prime}(x)\right]_{x=a^{+},b^{-}}=0. (39)

Upon expanding the expression of the current associated with lkl_{k} and combining terms, we then obtain

[F(x)rk(x)lk(x)σ22(rk(x)lk(x))]x=a+,b=0,\left[F(x)r_{k}(x)l_{k}(x)-\frac{\sigma^{2}}{2}\left(r_{k}(x)l_{k}(x)\right)^{\prime}\right]_{x=a^{+},b^{-}}=0, (40)

which is a zero-current condition for the product rklkr_{k}l_{k}. From this result, we finally recover (35) using (36) and the fact that rklkr_{k}l_{k} is the stationary density of the driven process, as shown in (32).

This confirms in a more direct way that the driven process also evolves in [a,b][a,b] with perfect reflections at the boundaries. Of course, since we are dealing with one-dimensional diffusions, the stationary current of the driven process must vanish not just at the boundaries but over the whole of [a,b][a,b], similarly to the original diffusion. This is a known result in the theory of the driven process: if the original Markov process is reversible, then so is the driven process in the case where the dynamical observable has the form of (1) Chetrite and Touchette (2015a). This does not mean that ρ\rho^{*} and ρk\rho^{*}_{k} are the same in [a,b][a,b] – we will see illustrations of this point in the next section. Moreover, note that requiring JFk,ρk=0J_{F_{k},\rho_{k}}=0 at the boundaries is not sufficient to define boundary conditions for rkr_{k} and lkl_{k} since the driven current mixes both FkF_{k} and ρk\rho_{k}. These conditions are determined again by studying the boundary term arising in the duality between k\mathcal{L}_{k} and k\mathcal{L}_{k}^{\dagger}.

IV.4 Symmetrization

The spectral calculation of the SCGF is complicated by the fact that the tilted generator k\mathcal{L}_{k} is not Hermitian in general. A significant simplification is possible when the spectrum of k\mathcal{L}_{k} is real, as is the case, for example, when XtX_{t} is an ergodic, gradient diffusion characterized by the Gibbs stationary distribution (8) and STS_{T} is defined as in (1). Then it is known Touchette (2018) that k\mathcal{L}_{k} can be transformed in a unitary way, therefore preserving its spectrum, to the following Hermitian operator:

k=ρk1ρ=σ22xxVk,\mathcal{H}_{k}=\sqrt{\rho^{*}}\mathcal{L}_{k}\frac{1}{\sqrt{\rho^{*}}}=\frac{\sigma^{2}}{2}\partial_{xx}-V_{k}, (41)

which involves a quantum-like potential given by

Vk(x)=|U(x)|22σ2U′′(x)2kf,V_{k}(x)=\frac{|U^{\prime}(x)|^{2}}{2\sigma^{2}}-\frac{U^{\prime\prime}(x)}{2}-kf, (42)

where UU is the potential of the gradient force and ff the function defining the observable STS_{T}.

Since the new operator k\mathcal{H}_{k} has the same spectrum as k\mathcal{L}_{k}, the spectral problem associated with the SCGF becomes

kψk=λ(k)ψk,\mathcal{H}_{k}\psi_{k}=\lambda(k)\psi_{k}, (43)

where ψk\psi_{k} is the corresponding eigenfunction related to rkr_{k} and lkl_{k} by

ψk(x)=ρ(x)rk(x)=lk(x)/ρ(x).\psi_{k}(x)=\sqrt{\rho^{*}(x)}r_{k}(x)=l_{k}(x)/\sqrt{\rho^{*}(x)}. (44)

From (24), ψk\psi_{k} thus satisfies the normalization condition

abψk(x)2𝑑x=1,\int_{a}^{b}\psi_{k}(x)^{2}\,dx=1, (45)

similarly to that found in quantum mechanics, but for a real eigenfunction. Moreover, using ψk2=rklk=ρk\psi_{k}^{2}=r_{k}l_{k}=\rho^{*}_{k} we find with (40) that ψk\psi_{k} must satisfy for reflected diffusions the zero-current boundary conditions

JF,ψk2(a+)=0=JF,ψk2(b).J_{F,\psi_{k}^{2}}(a^{+})=0=J_{F,\psi_{k}^{2}}(b^{-}). (46)

This can be expressed more explicitly as

ψk(a+)=F(a+)σ2ψk(a+),\psi_{k}^{\prime}(a^{+})=\frac{F(a^{+})}{\sigma^{2}}\psi_{k}(a^{+}), (47)

with a similar expression holding for x=bx=b^{-}.

V Examples

We illustrate in this section the results derived before by applying them to two simple reflected diffusions obtained by constraining the Brownian motion with negative drift and the Ornstein–Uhlenbeck process on the half line. Each of these diffusions can be solved exactly and give rise to interesting properties for the driven process in the presence of a reflecting boundary. Further applications for diffusions evolving in higher dimensions are mentioned in the conclusion.

V.1 Reflected Ornstein–Uhlenbeck process

The first example that we consider is the reflected Ornstein–Uhlenbeck process (ROUP) satisfying the SDE

dXt=γXtdt+σdWtdX_{t}=-\gamma X_{t}dt+\sigma dW_{t} (48)

with γ>0\gamma>0, σ>0\sigma>0, Xt[0,)X_{t}\in[0,\infty), and perfect reflection at x=0x=0. This diffusion is used in engineering as a continuous-space model of queues in the heavy-traffic regime Giorno et al. (1986); Ward and Glynn (2003); Linetsky (2005) and represents, more physically, the dynamics of a Brownian particle attracted to a reflecting wall by a linear force induced, for example, by laser tweezers Ashkin (1997) or an ac trap Cohen and Moerner (2005). For this process, we consider the observable

ST=1T0TXt𝑑t,S_{T}=\frac{1}{T}\int_{0}^{T}X_{t}\,dt, (49)

which for laser tweezers is related to the mechanical power expended by the lasers on the particle van Zon and Cohen (2003).

The tilted generator k\mathcal{L}_{k} associated with this process and observable is given from (12) and (20) by

k=γxx+σ22xx+kx\mathcal{L}_{k}=-\gamma x\partial_{x}+\frac{\sigma^{2}}{2}\partial_{xx}+kx (50)

and is clearly non-Hermitian. However, because the ROUP is gradient and ergodic on [0,)[0,\infty), we can apply the symmetrization transform described before by substituting the potential U(x)=γx2/2U(x)=\gamma x^{2}/2 in (42) to obtain from (41)

k=σ22xxγ2x22σ2+γ2+kx.\mathcal{H}_{k}=\frac{\sigma^{2}}{2}\partial_{xx}-\frac{\gamma^{2}x^{2}}{2\sigma^{2}}+\frac{\gamma}{2}+kx. (51)

This defines with (43) the spectral problem that we need to solve in order to obtain the SCGF. The boundary condition (47) reduces in this case to the simple Neumann condition

ψk(0)=0.\psi_{k}^{\prime}(0)=0. (52)

For x=x=\infty, the normalization (45) requires that ψk(x)20\psi_{k}(x)^{2}\rightarrow 0 as xx\rightarrow\infty and, therefore, ψk(x)0\psi_{k}(x)\rightarrow 0 as xx\rightarrow\infty.

Refer to caption
Figure 2: SCGF λ(k)\lambda(k) for the ROUP with linear observable. Parameters: σ=1\sigma=1 and γ=1\gamma=1.

The spectral problem (43) defines for k\mathcal{H}_{k} above a differential equation whose solutions are taken from the class of parabolic cylinder functions Dν(z)D_{\nu}(z), the dominant solution ψk\psi_{k} having the form

ψk(x)=Dξ(k)(2γxσ2kσγ3/2)\psi_{k}(x)=D_{\xi(k)}\left(\frac{\sqrt{2\gamma}x}{\sigma}-\frac{\sqrt{2}k\sigma}{\gamma^{3/2}}\right) (53)

up to a normalization constant, where we have defined

ξ(k)=k2σ22γ2λ(k)2γ3.\xi(k)=\frac{k^{2}\sigma^{2}-2\gamma^{2}\lambda(k)}{2\gamma^{3}}. (54)

This solution decays to 0 at infinity. By imposing the boundary condition (52) and using well-known properties of the parabolic cylinder functions, we find that λ(k)\lambda(k) is determined implicitly by the transcendental equation

kσ2γ3/2Dξ(k)(2kσγ3/2)+Dξ(k)+1(2kσγ3/2)=0.\frac{k\sigma}{\sqrt{2}\gamma^{3/2}}D_{\xi(k)}\left(-\frac{\sqrt{2}k\sigma}{\gamma^{3/2}}\right)+D_{\xi(k)+1}\left(\frac{\sqrt{2}k\sigma}{\gamma^{3/2}}\right)=0. (55)

To be more precise, λ(k)\lambda(k) is the largest root of this equation; the other roots, which are all real, give the rest of the spectrum of k\mathcal{H}_{k} and therefore of k\mathcal{L}_{k}.

We show in Fig. 2 the plot of λ(k)\lambda(k) obtained by solving the transcendental equation numerically for a given set of parameters γ\gamma and σ\sigma and different values of kk. We can see that λ(0)=0\lambda(0)=0, as follows from the definition of the SCGF, and that λ(k)\lambda(k) appears to be differentiable and strictly convex. As a result, we can take the Legendre transform (21) to obtain the rate function I(s)I(s), shown in Fig. 3. There we see that I(s)I(s) has two very different branches on either side of the minimum and zero ss^{*}, corresponding to the most probable value of STS_{T} at which PT(s)P_{T}(s) concentrates exponentially as TT\rightarrow\infty. The left branch, related to the k<0k<0 branch of the SCGF, is steep and therefore indicates that small fluctuations of STS_{T} close to 0, produced by paths that stay close to the reflecting boundary, are very unlikely. The right branch, on the other hand, is less steep and has overall the shape of a parabola, signalling that the large fluctuations of STS_{T} are Gaussian-distributed.

Refer to caption
Figure 3: Rate function I(s)I(s) for the ROUP with linear observable, compared with the rate function of the normal OUP. Parameters: σ=1\sigma=1 and γ=1\gamma=1.

This is confirmed by comparing I(s)I(s) with the rate function of STS_{T} for the normal Ornstein–Uhlenbeck process (OUP) evolving on the whole of \mathbb{R}, which is known to be

IOUP(s)=γ2s22σ2.I_{\textrm{OUP}}(s)=\frac{\gamma^{2}s^{2}}{2\sigma^{2}}. (56)

This rate function is shown with the dashed line in Fig. 3 and closely approximates, as can be seen, the upper tail of the rate function obtained with reflection. This is explained by noting that large fluctuations of STS_{T} arise from paths that venture far from the reflecting boundary, and so do not “feel” its influence. The zero at ss^{*} is itself a product of the boundary, since s=0s^{*}=0 in the normal OUP. We can determine its value by noting from the ergodic theorem that the most probably value of STS_{T} is the stationary expectation

s=0xρ(x)𝑑x.s^{*}=\int_{0}^{\infty}x\rho^{*}(x)\,dx. (57)

Here ρ(x)\rho^{*}(x) is a truncated Gaussian density with potential U(x)=γx2/2U(x)=\gamma x^{2}/2 restricted to x[0,)x\in[0,\infty), leading in the integral above to s=σ/πγs^{*}=\sigma/\sqrt{\pi\gamma}, which gives s=1/π0.564s^{*}=1/\sqrt{\pi}\approx 0.564 for the parameters used in Fig. 3.

The large Gaussian fluctuations of STS_{T} are also confirmed by analyzing the driven force FkF_{k}, which we find from (31) using the expression (53) for ψk\psi_{k} and the relation (44), yielding

Fk(x)=2γσ12ηDξ(k)(η)Dξ(k)+1(η)Dξ(k)(η),F_{k}(x)=\sqrt{2}\gamma\sigma\frac{\frac{1}{2}\eta D_{\xi(k)}(\eta)-D_{\xi(k)+1}(\eta)}{D_{\xi(k)}(\eta)}, (58)

where

η=2γxσ2kσγ3/2.\eta=\frac{\sqrt{2\gamma}x}{\sigma}-\frac{\sqrt{2}k\sigma}{\gamma^{3/2}}. (59)

To obtain this expression, we have also used some identities of the parabolic cylinder functions du Buisson (2020). The result is plotted in Fig. 4(a) for different values of kk, together with the properly normalized ρk(x)=ψk(x)2\rho_{k}^{*}(x)=\psi_{k}(x)^{2} in Fig. 4(b).

Refer to caption
Refer to caption
Figure 4: (Color online) (a) Drift Fk(x)F_{k}(x) of the driven process for the ROUP with linear observable plotted for different values of kk (see legend). (b) Corresponding stationary density ρk(x)\rho_{k}^{*}(x). Parameters: σ=1\sigma=1 and γ=1\gamma=1.

Comparing the two plots, we see that Fk(x)F_{k}(x) has two zeros for k>0k>0 and, therefore, two critical points: one at x=0x=0, which gives rise to a local minimum in ρk(x)\rho_{k}^{*}(x), and another at some value x>0x>0, which is responsible for the maximum of ρk(x)\rho_{k}^{*}(x). Around the latter critical point, Fk(x)F_{k}(x) is approximately linear with slope γ-\gamma, implying that ρk(x)\rho_{k}^{*}(x) is approximately Gaussian around its maximum. Moreover, as kk is increased, we see that the maximum of ρk(x)\rho_{k}^{*}(x) moves away from x=0x=0, showing that the driven process is repelled from the boundary, thus creating larger typical values of STS_{T}, similarly to the driven process of the normal OUP Chetrite and Touchette (2015a). The difference between the two processes is that the driven force of the ROUP is “bent” near the boundary so as to have Fk(0)=F(0)F_{k}(0)=F(0), consistently with (36), whereas that of the OUP is always linear Chetrite and Touchette (2015a).

For k<0k<0, the picture is different. The driven force Fk(x)F_{k}(x) only has a single critical point at x=0x=0, creating the maximum of ρk(x)\rho_{k}^{*}(x) seen in Fig. 4(b). As kk\rightarrow-\infty, ρk(x)\rho_{k}^{*}(x) gets more concentrated at x=0x=0, as the driven process is attracted to the reflecting boundaries, creating smaller fluctuations of STS_{T}. Such a behavior is very unlikely in the ROUP, which is why the rate function is steep close to s=0s=0. In fact, it is steeper than a parabola because Fk(x)F_{k}(x) is not linear away from x=0x=0: its curvature becomes more pronounced for large negative values of kk, leading ρk(x)\rho_{k}^{*}(x) to be non-Gaussian. Note in all cases that Fk(0)=F(0)=0F_{k}(0)=F(0)=0, consistently again with (36).

This analysis of Fk(x)F_{k}(x) is useful not only in understanding the different stochastic mechanisms underlying or “producing” different fluctuations of STS_{T}, but also in deriving accurate approximations of the rate function by following three steps Chetrite and Touchette (2015b). First, approximate FkF_{k} by some function, say F~θ\tilde{F}_{\theta}, where θ\theta denotes a set of parameters. Second, calculate the stationary distribution ρ~θ\tilde{\rho}^{*}_{\theta} that results from this approximation. Third and finally, calculate the ergodic expectation of STS_{T} with respect to ρ~θ\tilde{\rho}^{*}_{\theta}:

sθ=abf(x)ρ~θ(x)𝑑x,s_{\theta}=\int_{a}^{b}f(x)\tilde{\rho}_{\theta}^{*}(x)\,dx, (60)

as well as the integral

Kθ=12σ2ab[F(x)F~θ(x)]2ρ~θ(x)𝑑x.K_{\theta}=\frac{1}{2\sigma^{2}}\int_{a}^{b}[F(x)-\tilde{F}_{\theta}(x)]^{2}\tilde{\rho}_{\theta}^{*}(x)\,dx. (61)

Then

I(sθ)KθI(s_{\theta})\leq K_{\theta} (62)

with equality if and only if F~θ=Fk\tilde{F}_{\theta}=F_{k} at sθs_{\theta} Chetrite and Touchette (2015b).

It is beyond this paper to prove this result (see Chetrite and Touchette (2015b)). At this point we only want to use it to find useful upper bounds on I(s)I(s), beginning with the left branch of this function, related as mentioned before to the k<0k<0 branch of λ(k)\lambda(k). In this case, we know that Fk(x)F_{k}(x) has a single critical point at x=0x=0, so we approximate it in a simple way as

F~θ(x)=θx.\tilde{F}_{\theta}(x)=-\theta x. (63)

Only θγ\theta\geq\gamma need be considered, since it is clear from Fig. 4(b) that Fk(0)γF^{\prime}_{k}(0)\leq-\gamma for k0k\leq 0. This approximation retains the linear form of F(x)F(x), which means that ρ~θ(x)\tilde{\rho}^{*}_{\theta}(x) is the same truncated Gaussian density as the ROUP but with γ\gamma replaced by θ\theta. As a result, we have sθ=σ/πθs_{\theta}=\sigma/\sqrt{\pi\theta} and obtain

Kθ=(θγ)24θK_{\theta}=\frac{(\theta-\gamma)^{2}}{4\theta} (64)

by direct integration of (61). Changing the θ\theta variable to ss with sθ=ss_{\theta}=s, we then find

I~(s)=Kθ(s)=π4(σπsγsσ)2\tilde{I}(s)=K_{\theta(s)}=\frac{\pi}{4}\left(\frac{\sigma}{\pi s}-\frac{\gamma s}{\sigma}\right)^{2} (65)

as our approximation of I(s)I(s) for s(0,s]s\in(0,s^{*}].

We do not compare this result with the exact rate function obtained from the spectral calculation, as the two are nearly indistinguishable du Buisson (2020). They agree exactly at ss^{*}, since θ=γ\theta=\gamma recovers the drift of the ROUP, and start to differ only close to s=0s=0 because FkF_{k} is curved there, as we noticed, whereas F~θ\tilde{F}_{\theta} is not. Note that the divergence of I(s)I(s) near s=0s=0 predicted by the approximation above is I~(s)σ2/(4πs2)\tilde{I}(s)\sim\sigma^{2}/(4\pi s^{2}) as s0s\rightarrow 0, which is independent of γ\gamma.

Similar calculations can be carried out for k>0k>0 to approximate I(s)I(s) for s>ss>s^{*}. In that case, the form of FkF_{k} suggests that we use

F~θ(x)=γx+θ\tilde{F}_{\theta}(x)=-\gamma x+\theta (66)

as an approximate drift parameterized by θ>0\theta>0. The associated stationary density ρ~θ\tilde{\rho}_{\theta}^{*} can also be obtained in closed form and yields, after solving a number of Gaussian integrals du Buisson (2020),

sθ=σ2πγeθ2/(γσ2)1+erf[θ/(γσ)]+θγs_{\theta}=\sqrt{\frac{\sigma^{2}}{\pi\gamma}}\frac{e^{-\theta^{2}/(\gamma\sigma^{2})}}{1+\operatorname{erf}[\theta/(\sqrt{\gamma}\sigma)]}+\frac{\theta}{\gamma} (67)

and

Kθ=θ22σ2.K_{\theta}=\frac{\theta^{2}}{2\sigma^{2}}. (68)

The presence of the error function in sθs_{\theta} prevents us from expressing KθK_{\theta} in closed form as a function of ss, as in (65). However, the result clearly shows that the rate function becomes a parabola as θ\theta\rightarrow\infty, for in that limit, sθθ/γs_{\theta}\sim\theta/\gamma, leading to I~(s)=Kθ(s)=IOUP(s)\tilde{I}(s)=K_{\theta(s)}=I_{\textrm{OUP}}(s) as ss\rightarrow\infty.

V.2 Reflected Brownian motion with drift

The second example we consider is the reflected Brownian motion with drift (RBMD) governed by the SDE

dXt=μdt+σdWt,dX_{t}=-\mu dt+\sigma dW_{t}, (69)

where μ>0\mu>0, σ>0\sigma>0, and Xt[0,)X_{t}\in[0,\infty), with reflection imposed at x=0x=0 Linetsky (2005). This process was studied by Fatalov Fatalov (2017) and models, similarly to the ROUP, the dynamics of a particle attracted by a force to a reflecting wall. The force is now constant and can be viewed, for instance, as the gravity pulling vertically on a Brownian particle in a container. As for the ROUP, we consider the linear observable defined in (49).

The tilted generator associated with the large deviations of STS_{T} for this process is given by (12) with F(x)=μF(x)=-\mu and leads, after symmetrization with the corresponding potential U(x)=μxU(x)=\mu x, to the Hermitian operator

k=σ22xxμ22σ2+kx.\mathcal{H}_{k}=\frac{\sigma^{2}}{2}\partial_{xx}-\frac{\mu^{2}}{2\sigma^{2}}+kx. (70)

This defines with (43) the spectral problem that gives the SCGF, which needs to be solved with the Robin boundary condition (47)

ψk(0)=μσ2ψk(0)\psi_{k}^{\prime}(0)=-\frac{\mu}{\sigma^{2}}\psi_{k}(0) (71)

at x=0x=0 and ψk(x)0\psi_{k}(x)\rightarrow 0 as xx\rightarrow\infty.

Refer to caption
Figure 5: SCGF λ(k)\lambda(k) for the RBMD with linear observable (inset) and corresponding rate function I(s)I(s). Parameters: σ=1\sigma=1 and μ=1\mu=1. The rate function is defined by spectral means for 0<ss=σ2/(2μ)0<s\leq s^{*}=\sigma^{2}/(2\mu); it is unknown above ss^{*}, as indicated by the question mark.

The eigenfunctions of k\mathcal{H}_{k} satisfying these conditions are now expressed in terms of Airy functions of the first kind. The dominant eigenfunction is

ψk(x)=Ai[(σ22k)2/3(2kxσ2+2σ2λ(k)+μ2σ4)],\psi_{k}(x)=\operatorname{Ai}\left[\left(-\frac{\sigma^{2}}{2k}\right)^{2/3}\left(-\frac{2kx}{\sigma^{2}}+\frac{2\sigma^{2}\lambda(k)+\mu^{2}}{\sigma^{4}}\right)\right], (72)

while λ(k)\lambda(k) is given by the largest root λ\lambda of the following transcendental equation:

(2kσ2)1/3Ai[(σ22k)2/32σ2λ+μ2σ4]+μσ2Ai[(σ22k)2/32σ2λ+μ2σ4]=0.\left(-\frac{2k}{\sigma^{2}}\right)^{1/3}\operatorname{Ai}^{\prime}\left[\left(-\frac{\sigma^{2}}{2k}\right)^{2/3}\frac{2\sigma^{2}\lambda+\mu^{2}}{\sigma^{4}}\right]\\ \quad+\frac{\mu}{\sigma^{2}}\operatorname{Ai}\left[\left(-\frac{\sigma^{2}}{2k}\right)^{2/3}\frac{2\sigma^{2}\lambda+\mu^{2}}{\sigma^{4}}\right]=0. (73)

As before, we can solve this equation numerically for given values of μ\mu and σ\sigma as well as various values of kk so as to build an interpolation of λ(k)\lambda(k), from which we obtain the rate function by computing the Legendre transform (21). The resulting functions are shown in Figs. 5. Unlike the ROUP, λ(k)\lambda(k) is now defined only for k0k\leq 0 because the potential

Vk(x)=μ22σ2kx,V_{k}(x)=\frac{\mu^{2}}{2\sigma^{2}}-kx, (74)

which is related to the classic quantum triangular well, is confining only for k<0k<0, and so has bound states only for this range of parameters. This implies that the Legendre transform of λ(k)\lambda(k) gives I(s)I(s) only for s(0,s]s\in(0,s^{*}], as shown in Fig. 5, where ss^{*} is again the typical value of STS_{T}, found here from (8) and (57) to be s=σ2/(2μ)s^{*}=\sigma^{2}/(2\mu).

Refer to caption
Figure 6: (Color online) Driven force Fk(x)F_{k}(x) for the RBMD with linear observable for various values of kk. Parameters: σ=1\sigma=1 and μ=1\mu=1.

Above this value, STS_{T} does have fluctuations, but its large deviations are not covered by the spectral calculation, which is a sign generally that PT(s)P_{T}(s) scales weaker than exponentially in TT. An example of such a scaling was discussed recently for the OUP Nickelsen and Touchette (2018) using path integral techniques that predict a stretched exponential scaling in TT, although the exact rate function cannot be found. The application of these techniques is beyond the scope of this paper, so we leave the study of the fluctuation region ST>sS_{T}>s^{*} as an open problem 444This region is not considered by Fatalov Fatalov (2017)..

Note, incidentally, that for μ=0\mu=0, STS_{T} has no large deviations at the scale TT because Brownian motion is non-ergodic. The confining force produced by the negative drift along with the reflecting boundary makes that motion ergodic and creates fluctuations ST<sS_{T}<s^{*} that are exponentially unlikely with TT, though it is not strong enough, somehow, to constrain the fluctuations ST>sS_{T}>s^{*} in the same exponential way. A similar effect is seen for the Brownian motion with reset den Hollander et al. (2019).

To understand how the fluctuations of STS_{T} arise below its mean ss^{*}, where PT(s)P_{T}(s) scales exponentially, let us analyze the driven process 555Defining the driven process for ST>sS_{T}>s^{*} is also an open problem, related again to the fact that the large deviations in that region are not given by the spectral elements of k\mathcal{L}_{k} Nickelsen and Touchette (2018).. The explicit expression of Fk(x)F_{k}(x) for the RBMD is too long to show here, so we provide only the formula

Fk(x)=σ2ψk(x)ψk(x),F_{k}(x)=\sigma^{2}\frac{\psi_{k}^{\prime}(x)}{\psi_{k}(x)}, (75)

which follows from (31) and (44), and which leads with (72) to a ratio of the derivative of the Airy function and the Airy function. The result, plotted in Fig. 6 for various values of kk, is similar to the driven force found for the ROUP when k<0k<0. Its shape shows that small fluctuations of STS_{T} are created, as expected, by squeezing the process close to the reflecting boundary by two forces: the negative constant drift μ-\mu of the original RBMD and an added nonlinear force, which can be approximated near x=0x=0 as a linear force with varying “friction” coefficient γk=Fk(0+)\gamma_{k}=-F^{\prime}_{k}(0^{+}). The action of these two forces creates a stationary density ρk\rho_{k}^{*} (not shown) which is approximately a shifted Gaussian density truncated at the boundary and increasingly concentrated there as kk\rightarrow-\infty or, equivalently, ST0S_{T}\rightarrow 0 du Buisson (2020).

These results suggest that we approximate Fk(x)F_{k}(x) for k0k\leq 0 to first order in xx as

F~θ(x)=θxμ,\tilde{F}_{\theta}(x)=-\theta x-\mu, (76)

where θ0\theta\geq 0. This has the same form as the ansatz (66) used for the ROUP, with obvious replacements for the parameters, so we can find ρ~θ\tilde{\rho}_{\theta}^{*} and sθs_{\theta} from the results obtained before. The integral (61) of KθK_{\theta}, however, is different because of the different FF for the RBMD and leads now to

I~(sθ)=(θ4+μ22σ2)μ2σθπeμ2/(θσ2)1erf[μ/(θσ)].\tilde{I}(s_{\theta})=\left(\frac{\theta}{4}+\frac{\mu^{2}}{2\sigma^{2}}\right)-\frac{\mu}{2\sigma}\sqrt{\frac{\theta}{\pi}}\frac{e^{-\mu^{2}/(\theta\sigma^{2})}}{1-\operatorname{erf}[\mu/(\sqrt{\theta}\sigma)]}. (77)

It can be checked that the limit θ0\theta\rightarrow 0 of this approximation gives I~(s0)=0\tilde{I}(s_{0})=0 at s0=ss_{0}=s^{*}, as expected, whereas θ\theta\rightarrow\infty gives a scaling near s=0s=0 similar to the ROUP, namely, I~(s)σ2/(8πs2)\tilde{I}(s)\sim\sigma^{2}/(8\pi s^{2}).

VI Conclusion

We have shown how reflecting boundaries enter in the calculation of large deviation functions describing the likelihood of fluctuations of integrated quantities defined for Langevin-type processes. We have illustrated with basic examples the influence of such boundaries, particularly at the level of the driven process, which provides a mechanism for understanding how large deviations arise in the long-time limit. Our results pave the way for studying more general diffusions that evolve in confined domains of 2\mathbb{R}^{2} or 3\mathbb{R}^{3} with reflecting boundaries, as well as other observables, including particle currents, work-like quantities, and the entropy production, which are defined in terms of the increments of XtX_{t} in addition to XtX_{t} Seifert (2012). The study of such diffusions should bring many new interesting results, as they may have non-zero stationary currents Hoppenau et al. (2016) circulating parallel to a reflecting surface. Observables involving increments of XtX_{t} are also expected to change the boundary term in the duality between k\mathcal{L}_{k} and k\mathcal{L}_{k}^{\dagger} in a non-trivial way, giving rise to more complicated boundary conditions for rkr_{k} and lkl_{k}.

In principle, our results can also be applied, as mentioned, to SDEs with multiplicative noise, that is, SDEs in which the noise amplitude σ\sigma depends on the state XtX_{t}. These often arise in diffusion limits of population models 666Note that jump processes, often used for modelling population dynamics, have no boundaries – they have states connected by transition rates that can be set in an arbitrary way., as well as in finance, and should be treated in the same way as described here 777SDEs with multiplicative noise can also be treated in one dimension by transforming them to SDEs with additive noise using the Lamperti transformation Pavliotis (2014)., assuming that their boundaries are reachable and that they are ergodic. The geometric Brownian motion, for example, is such that Xt0X_{t}\geq 0 but the boundary x=0x=0 cannot be reached in finite time Borodin and Salminen (2015), so it does not make sense to define reflections there. This process is also not ergodic, so we do not expect a priori large deviations to exist.

Similar considerations apply to other boundary types: they should be treated in the same way as reflective boundaries by considering the boundary term arising in the duality of k\mathcal{L}_{k} and k\mathcal{L}_{k}^{\dagger}. However, as for multiplicative SDEs, we must ensure that the process considered with its boundary behavior is ergodic, for otherwise the distribution of observables is not expected to have a large deviation form. This prevents us, in general, from considering absorbing boundary conditions, which lead (without re-entry) to singular distributions concentrated on boundaries and for which, therefore, observables do not fluctuate in the infinite-time limit.

To conclude, we remark that our results could be obtained in a different way by using the contraction principle, which establishes a link between the large deviations of empirical densities and sample means, and which effectively replaces the spectral problem studied here by a minimization problem Chetrite and Touchette (2015b). The boundary conditions that must be imposed on the latter problem are discussed for reflected diffusions by Pinsky Pinsky (1985a, b, c, d) and can be shown to be equivalent to the zero-current conditions imposed on ρk\rho^{*}_{k}, which represents in the contraction principle the optimal stationary density leading to a given fluctuation of STS_{T}. This follows by generalizing a previous equivalence established for unbounded diffusions Chetrite and Touchette (2015b).

In principle, one could also approach the large deviation problem by expressing the expectation E[ekTST]E[e^{kTS_{T}}] of the SCGF as a path integral restricted on an interval. Such integrals have been studied in quantum mechanics in the context of the free quantum particle evolving on the half-line Clark et al. (1980); Carreau et al. (1990); Farhi and Gutmann (1990), but they are not expected to be solvable, except for simple systems. In any case, the main property of E[ekTST]E[e^{kTS_{T}}] that underlies dynamical large deviations is its exponential behavior in TT, which, as we know from the Feynman–Kac equation, is determined by the dominant eigenvalue of the tilted generator k\mathcal{L}_{k}. Path integral techniques only confirm this result and have proved to be useful in large deviation theory mainly when considering the low-noise limit.

Acknowledgements.
We thank Emil Mallmin for useful comments. J.d.B. is also grateful to the Joubert family for an MSc Scholarship administered by Stellenbosch University.

References