This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Linearization and a superposition principle for deterministic and stochastic nonlinear Fokker-Planck-Kolmogorov equations

Marco Rehmeier111Faculty of Mathematics, Bielefeld University, 33615 Bielefeld, Germany. E-Mail: [email protected]
Abstract

We prove a superposition principle for nonlinear Fokker-Planck-Kolmogorov equations on Euclidean spaces and their corresponding linearized first-order continuity equation over the space of Borel (sub-)probability measures. As a consequence, we obtain equivalence of existence and uniqueness results for these equations. Moreover, we prove an analogous result for stochastically perturbed Fokker-Planck-Kolmogorov equations. To do so, we particularly show that such stochastic equations for measures are, similarly to the deterministic case, intrinsically related to linearized second-order equations on the space of Borel (sub-)probability measures.

Keywords: Nonlinear Fokker-Planck equation, McKean-Vlasov stochastic differential equation, diffusion process, superposition principle

2010 MSC: 60J60, 58J65

1 Introduction

In this work we are concerned with nonlinear Fokker-Planck-Kolmogorov equations (FPK-equations) on d\mathbb{R}^{d}, both deterministic

tμt=t,μtμt,t[0,T],\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t},\,\,t\in[0,T], (NLFPK)

and perturbed by a first-order stochastic term driven by a finite-dimensional Wiener process WW

tμt=t,μtμtdiv(σ(t,μt))dWt,t[0,T],\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t}-\text{div}(\sigma(t,\mu_{t}))dW_{t},\,\,t\in[0,T], (SNLFPK)

with solutions being continuous curves of subprobability measures μt𝒮𝒫\mu_{t}\in\mathcal{SP}. Here, \mathcal{L}^{*} denotes the formal dual of a second-order differential operator acting on sufficiently smooth functions φ:d\varphi:\mathbb{R}^{d}\to\mathbb{R} via

t,μφ(x)=i,j=1daij(t,μ,x)ij2φ(x)+i=1dbi(t,μ,x)iφ(x)\mathcal{L}_{t,\mu}\varphi(x)=\sum_{i,j=1}^{d}a_{ij}(t,\mu,x)\partial^{2}_{ij}\varphi(x)+\sum_{i=1}^{d}b_{i}(t,\mu,x)\partial_{i}\varphi(x) (1)

with coefficients aa and bb depending on (t,x)[0,T]×d(t,x)\in[0,T]\times\mathbb{R}^{d} and (in general non-locally) on the solution μt\mu_{t}. These equations are to be understood in distributional sense, see Definition 3.1 and 4.1. The nonlinearity arises from the dependence of \mathcal{L} and σ\sigma on the solution itself, which renders the theory of existence and uniqueness of such equations significantly more difficult compared to the linear case. For a thorough introduction to the field, we refer to [5] and the references therein. As shown in [17], the deterministic nonlinear equation (NLFPK) is naturally associated to a first-order linear continuity equation on 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}), the space of Borel probability measures on 𝒮𝒫\mathcal{SP}, of type

tΓt=𝐋tΓt,t[0,T],\partial_{t}\Gamma_{t}=\mathbf{L}^{*}_{t}\Gamma_{t},\,\,t\in[0,T], (𝒮𝒫\mathcal{SP}-CE)

in the sense of distributions, with the linear operator 𝐋\mathbf{L} acting on sufficiently smooth real functions on 𝒮𝒫\mathcal{SP} via the gradient operator 𝒮𝒫\nabla^{\mathcal{SP}} on 𝒮𝒫\mathcal{SP} as

𝐋tF=𝒮𝒫F,bt+atL2.\mathbf{L}_{t}F=\big{\langle}\nabla^{\mathcal{SP}}F,b_{t}+a_{t}\nabla\big{\rangle}_{L^{2}}.

Precise information on this operator and equation (𝒮𝒫\mathcal{SP}-CE) are given in Section 3, in particular in Definition 3.5 and the paragraph preceding it.
Our first main result, Theorem 3.7 states that each weakly continuous solution (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-CE) is a superposition of solutions to (NLFPK), i.e. (denoting by ete_{t} the canonical projection et:(μt)tTμte_{t}:(\mu_{t})_{t\leq T}\mapsto\mu_{t})

Γt=ηet1\Gamma_{t}=\eta\circ e_{t}^{-1} (2)

for some probability measure η\eta concentrated on solution curves to (NLFPK) in a suitable sense.
We also treat the stochastic case in a similar fashion. More precisely, in Section 4 we establish a new correspondence between the stochastic equation for measures (SNLFPK) and a corresponding second-order equation for curves (Γt)tT(\Gamma_{t})_{t\leq T} in 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}) of type

tΓt=(𝐋t(2))Γt,t[0,T],\partial_{t}\Gamma_{t}=(\mathbf{L}_{t}^{(2)})^{*}\Gamma_{t},\,\,t\in[0,T], (𝒮𝒫\mathcal{SP}-FPK)

where, roughly,

𝐋t(2)=𝐋t+ second-order perturbation.\mathbf{L}^{(2)}_{t}=\mathbf{L}_{t}+\textit{ second-order perturbation}.

The second-order term stems from the stochastic perturbation of (SNLFPK) and will be geometrically interpreted in terms of a (formal) notion of the Levi-Civita connection on 𝒮𝒫\mathcal{SP}. The second main result of this work, Theorem 4.8, is then the stochastic generalization of the deterministic case: For any solution (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-FPK), there exists a solution process (μt)tT(\mu_{t})_{t\leq T} to (SNLFPK) on some probability space such that μt\mu_{t} has distribution Γt\Gamma_{t}. We stress that in both cases, we do not require any regularity of the coefficients.

Let us embed these results into the general research in this direction. Let bt():ddb_{t}(\cdot):\mathbb{R}^{d}\to\mathbb{R}^{d} be an inhomogeneous vector field and consider the (nonlinear) ODE

ddtγt=bt(γt),tT\frac{d}{dt}\gamma_{t}=b_{t}(\gamma_{t}),\,\,t\leq T (ODE)

and the linear continuity equation for curves of Borel (probability) measures on d\mathbb{R}^{d}

tμt=div(btμt),tT,\partial_{t}\mu_{t}=-\operatorname*{div}(b_{t}\mu_{t}),\,\,t\leq T, (CE)

understood in distributional sense. In the seminal paper [1], L. Ambrosio showed the following: Any (probability) solution (μt)tT(\mu_{t})_{t\leq T} to (CE) with an appropriate global integrability condition is a superposition of solution curves to (ODE), i.e. there exists a (probability) measure η\eta on the space of continuous paths with values in the state space of (ODE), C([0,T],d)C([0,T],\mathbb{R}^{d}), which is concentrated on solutions to (ODE) such that

ηet1=μt,tT.\eta\circ e_{t}^{-1}=\mu_{t},\,\,t\leq T.

This allows to transfer existence and uniqueness results between the linear equation (CE) and the nonlinear (ODE). However, the linear equation must be studied on an infinite-dimensional space of (probability) measures. The analogy to our deterministic result from Section 3 is as follows: (ODE) is replaced by (NLFPK), which, in spirit of this analogy, we interpret as a differential equation on the manifold-like state space 𝒮𝒫\mathcal{SP}. Likewise, (CE) is replaced by (𝒮𝒫\mathcal{SP}-CE) and our first main result Theorem 3.7 may be understood as the analogue of Ambrosio’s result to the present setting. By passing from (NLFPK) to (𝒮𝒫\mathcal{SP}-CE), we linearize the equation.
Concerning the stochastic case, consider a stochastic differential equation on d\mathbb{R}^{d}

dXt=b(t,Xt)dt+a~(t,Xt)dBt,t[0,T].dX_{t}=b(t,X_{t})dt+\tilde{a}(t,X_{t})dB_{t},\,\,t\in[0,T]. (SDE)

By Itô’s formula, the one-dimensional marginals μt\mu_{t} of any (weak) martingale solution XX solve the corresponding linear FPK-equation

tμt=lin,tμt,t[0,T],\partial_{t}\mu_{t}=\mathcal{L}_{lin,t}^{*}\mu_{t},\,\,t\in[0,T], (FPK)

where lin\mathcal{L}_{lin} is a linear second-order diffusion operator with coefficients bb and 12a~a~T\frac{1}{2}\tilde{a}\tilde{a}^{T}. Conversely, a superposition principle has successively been developed in increasingly general frameworks (cf. [9, 14, 21, 6]): Under mild global integrability assumptions, for every weakly continuous solution curve of probability measures (μt)tT(\mu_{t})_{t\leq T} to (FPK), there exists a (weak) martingale solution XX to (SDE) with one-dimensional marginals (μt)tT(\mu_{t})_{t\leq T}, thereby providing an equivalence between solutions to (SDE) and (FPK), which offers a bridge between probabilistic and analytic approaches to diffusion processes. As in the deterministic case, the transition from (SDE) to (FPK) provides a linearization, while at the same time it transfers the equation to a much higher dimensional state space. Concerning our stochastic result Theorem 4.8, we replace the stochastic equation on d\mathbb{R}^{d} by the stochastic equation for measures (SNLFPK) and the corresponding second-order equation for measures (FPK) by (𝒮𝒫\mathcal{SP}-FPK) and prove an analogous superposition result for solutions to the latter equation.
The proofs of both the deterministic and stochastic result rely on superposition principles for differential equations on \mathbb{R}^{\infty} and the corresponding continuity equation (for the deterministic case) and for martingale solutions and FPK-equations on \mathbb{R}^{\infty} (for the stochastic case) by Ambrosio and Trevisan ([2], [20]). The key technique is to transfer (𝒮𝒫\mathcal{SP}-CE) and (NLFPK) (and, similarly, (𝒮𝒫\mathcal{SP}-FPK) and (SNLFPK) for the stochastic case) to suitable equations on \mathbb{R}^{\infty} via a homeomorphism between 𝒮𝒫\mathcal{SP} and \mathbb{R}^{\infty} (replaced by 2\ell^{2} for the stochastic case, in order to handle the stochastic integral).
Moreover, our results also blend into the theory of distribution dependent stochastic differential equations, also called McKean-Vlasov equations, i.e. stochastic equations on Euclidean space of type

dXt=b(t,Xt,Xt)dt+a~(t,Xt,Xt)dBt,t[0,T],dX_{t}=b(t,\mathcal{L}_{X_{t}},X_{t})dt+\tilde{a}(t,\mathcal{L}_{X_{t}},X_{t})dB_{t},\,\,t\in[0,T], (DDSDE)

see the classical papers [15, 10, 18] as well as the more recent works [12, 11, 8]. Here, Xt\mathcal{L}_{X_{t}} denotes the distribution of XtX_{t} and is not to be confused with the operators t,μ\mathcal{L}_{t,\mu} and t\mathcal{L}_{t} from above. As in the non-distribution dependent case, where the curve of marginals of any solution to (SDE) solves an equation of type (FPK), a similar observation holds here: Each solution XX to (DDSDE) provides a solution to a nonlinear FPK-equation of type (NLFPK) via μt=Xt\mu_{t}=\mathcal{L}_{X_{t}} and a corresponding superposition principle holds analogously to the linear case as well ([4, 3]).
However, while for (SDE) the passage to (FPK) provides a complete linearization, the situation is different for equations of type (NLFPK). This stems from the observation that (DDSDE) is an equation with two sources of nonlinearity. Hence, it seems natural to linearize (NLFPK) once more in order to obtain a linear equation, which is related to (DDSDE) and (NLFPK) in a natural way. By the results of [17], this linear equation is of type (𝒮𝒫\mathcal{SP}-CE). Similar considerations prevail in the stochastic case, where one considers equations of type (DDSDE) with an additional source of randomness (we shall not pursue this direction in this work).

On the one hand, the superposition principles of Theorem 3.7 and Theorem 4.8 provide new structural results for nonlinear FPK-equations and its corresponding linearized equations on the space of probability measures over 𝒮𝒫\mathcal{SP}, involving a geometric interpretation of the latter. On the other hand, it is our future plan to further study the geometry of 𝒮𝒫\mathcal{SP} as initiated in [17] and this work to develop an analysis on such infinite-dimensional manifold-like spaces, which allows to solve linear equations of type (𝒮𝒫\mathcal{SP}-CE) and (𝒮𝒫\mathcal{SP}-FPK) on such spaces. By means of the results of this work, one can then lift such solutions to solutions to the nonlinear equations for measures (NLFPK) and (SNLFPK), thereby obtaining new existence results for these nonlinear equations for measures.

We point out that although our main aim is to lift weakly continuous solutions to (𝒮𝒫\mathcal{SP}-CE) and (𝒮𝒫\mathcal{SP}-FPK) concentrated on probability measures to a measure on the space of continuous probability measure-valued paths (μt)tT(\mu_{t})_{t\leq T}, for technical reasons we more generally develop our results for vaguely continuous subprobability solutions (i.e. μt𝒮𝒫\mu_{t}\in\mathcal{SP}). We comment on the advantages of this approach in Remark 3.9 for the deterministic case and note that similar arguments prevail in the stochastic case as well. However, due to the global integrability assumptions we consider, we are able to obtain results for probability solutions as desired.

The organization of this paper is as follows. After introducing general notation and recalling basic properties of the spaces 𝒫\mathcal{P} and 𝒮𝒫\mathcal{SP} in Section 2, Section 3 contains the deterministic case, i.e. the superposition principle between solutions to (𝒮𝒫\mathcal{SP}-CE) and (NLFPK). Here, the main result is Theorem 3.7. We use this result to prove an open conjecture of [17] (cf. Proposition 3.12) and present several consequences. In Section 4, we treat the stochastic case for equations of type (SNLFPK), the main result being Theorem 4.8.

Acknowledgements

Financial support by the German Science Foundation DFG (IRTG 2235) is gratefully acknowledged.

2 Notation and Preliminaries

We introduce notation and repeat basic facts on spaces and topologies of measures.

Notation

For a measure space (𝒳,𝒜,μ)(\mathcal{X},\mathcal{A},\mu) and a measurable function φ:𝒳\varphi:\mathcal{X}\to\mathbb{R}, we set μ(φ):=𝒳φ(x)𝑑μ(x)\mu(\varphi):=\int_{\mathcal{X}}\varphi(x)d\mu(x) whenever the integral is well-defined. For x𝒳x\in\mathcal{X}, we denote by δx\delta_{x} the Dirac measure in xx, i.e. δx(A)=1\delta_{x}(A)=1 if and only if xAx\in A and δx(A)=0\delta_{x}(A)=0 else. For a topological space XX with Borel σ\sigma-algebra (X)\mathcal{B}(X) we denote the set of continuous bounded functions by Cb(X)C_{b}(X), the set of Borel probability measures on XX by 𝒫(X)\mathcal{P}(X) and write 𝒫=𝒫(d)\mathcal{P}=\mathcal{P}(\mathbb{R}^{d}). If Y(X)Y\in\mathcal{B}(X), we let (X)Y\mathcal{B}(X)_{\upharpoonright Y} denote the trace of YYon (X)\mathcal{B}(X). For T>0T>0, a family (μt)tT=(μt)t[0,T](\mu_{t})_{t\leq T}=(\mu_{t})_{t\in[0,T]} of finite Borel measures on d\mathbb{R}^{d} is a Borel curve, if tμt(A)t\mapsto\mu_{t}(A) is Borel measurable for each A(d)A\in\mathcal{B}(\mathbb{R}^{d}). A set of functions 𝒢Cb(d)\mathcal{G}\subseteq C_{b}(\mathbb{R}^{d}) is called measure-determining, if μ(g)=ν(g)\mu(g)=\nu(g) for each g𝒢g\in\mathcal{G} implies μ=ν\mu=\nu for any two finite Borel measures μ,ν\mu,\nu on d\mathbb{R}^{d}.
For x,ydx,y\in\mathbb{R}^{d}, the usual inner product is denoted by xyx\cdot y and, with slight abuse of notation, we also denote by xy=k1xkykx\cdot y=\sum_{k\geq 1}x_{k}y_{k} the inner product in 2\ell^{2} (the Hilbert space of square-summable real-valued sequences x=(xk)k1x=(x_{k})_{k\geq 1}). For φCb(d)\varphi\in C_{b}(\mathbb{R}^{d}), we set φ:=supxd|φ(x)|||\varphi||_{\infty}:=\underset{x\in\mathbb{R}^{d}}{\text{sup}}|\varphi(x)|. If φ\varphi has first- and second-order partial derivatives, we denote them by iφ\partial_{i}\varphi and ij2φ\partial^{2}_{ij}\varphi for i,jdi,j\leq d.

We use notation for function spaces as follows. For k0k\in\mathbb{N}_{0}, Cbk(d)C^{k}_{b}(\mathbb{R}^{d}) denotes the subset of functions φ\varphi in Cb(d)C_{b}(\mathbb{R}^{d}) with continuous, bounded partial derivatives up to order kk, with the usual norm φCb2=max(φ,iφ,ij2φ)||\varphi||_{C^{2}_{b}}=\text{max}(||\varphi||_{\infty},||\partial_{i}\varphi||_{\infty},||\partial^{2}_{ij}\varphi||_{\infty}) for k=2k=2. Likewise, Cck(d)C^{k}_{c}(\mathbb{R}^{d}) denotes the subset of all such φ\varphi with compact support; for k=0k=0, we write Cc(d)C_{c}(\mathbb{R}^{d}) instead. For n1n\geq 1, p1p\geq 1 and a measure μ\mu on (d)\mathcal{B}(\mathbb{R}^{d}), we denote by Lp(d,n;μ)L^{p}(\mathbb{R}^{d},\mathbb{R}^{n};\mu) the space of Borel functions φ:dn\varphi:\mathbb{R}^{d}\to\mathbb{R}^{n} such that

dφ(x)p𝑑μ(x)<+,\int_{\mathbb{R}^{d}}||\varphi(x)||^{p}d\mu(x)<+\infty,

where ||||||\cdot|| denotes the standard Euclidean norm on n\mathbb{R}^{n}. For p=2p=2, ,L2(d,n;μ)\langle\cdot,\cdot\rangle_{L^{2}(\mathbb{R}^{d},\mathbb{R}^{n};\mu)} denotes the usual inner product on the Hilbert space L2(d,n;μ)L^{2}(\mathbb{R}^{d},\mathbb{R}^{n};\mu). For T>0T>0 and a topological space YY, we write CTYC_{T}Y for the set of continuous functions φ:[0,T]Y\varphi:[0,T]\to Y. By 𝕊d+\mathbb{S}^{+}_{d} we denote the space of symmetric, positive-semidefinite d×dd\times d-matrices with real entries.

Basic properties of spaces of measures

Probability measures

For a topological space XX, we endow 𝒫(X)\mathcal{P}(X) with the topology of weak convergence of measures, i.e. the initial topology of the maps μμ(φ)\mu\mapsto\mu(\varphi), φCb(X)\varphi\in C_{b}(X). If XX is Polish, then so is 𝒫(X)\mathcal{P}(X).

Subprobability measures

By 𝒮𝒫\mathcal{SP} we denote the set of all Borel subprobability measures on d\mathbb{R}^{d}, i.e. μ𝒮𝒫\mu\in\mathcal{SP} if and only if μ\mu is a non-negative measure on (d)\mathcal{B}(\mathbb{R}^{d}) with μ(d)1\mu(\mathbb{R}^{d})\leq 1. Throughout, we endow 𝒮𝒫\mathcal{SP} with the vague topology, i.e. the initial topology of the maps μμ(g)\mu\mapsto\mu(g), gCc(d)g\in C_{c}(\mathbb{R}^{d}). Hence, a sequence (μn)n1(\mu_{n})_{n\geq 1} converges to μ\mu in 𝒮𝒫\mathcal{SP} if and only if μn(g)nμ(g)\mu_{n}(g)\underset{n\to\infty}{\longrightarrow}\mu(g) for each gCc(d)g\in C_{c}(\mathbb{R}^{d}). Its Borel σ\sigma-algebra is denoted by (𝒮𝒫)\mathcal{B}(\mathcal{SP}). In particular, 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}), the set of Borel probability measures on 𝒮𝒫\mathcal{SP}, is a topological space with the weak topology of probability measures on (𝒮𝒫,(𝒮𝒫))(\mathcal{SP},\mathcal{B}(\mathcal{SP})). The Riesz-Markov representation theorem yields that 𝒮𝒫\mathcal{SP} with the vague topology coincides with the positive half of the closed unit ball of the dual space of Cc(d)C_{c}(\mathbb{R}^{d}) with the weak*-topology. Hence 𝒮𝒫\mathcal{SP} with the vague topology is compact. It is also Polish and μμ(d)\mu\mapsto\mu(\mathbb{R}^{d}) is vaguely lower semicontinuous, see [13, Ch.4.1]. In particular, 𝒫(𝒮𝒫).\mathcal{P}\in\mathcal{B}(\mathcal{SP}). Recall that (𝒫)=(𝒮𝒫)𝒫\mathcal{B}(\mathcal{P})=\mathcal{B}(\mathcal{SP})_{\upharpoonright_{\mathcal{P}}}. Hence, in the sequel we may consider measures Γ𝒫(𝒫)\Gamma\in\mathcal{P}(\mathcal{P}) as elements in 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}) with mass on 𝒫\mathcal{P}.
In contrast to weak convergence in 𝒫\mathcal{P}, vague convergence in 𝒮𝒫\mathcal{SP} can be characterized by countably many functions in a sense made precise by Lemma 3.3. The fact that this is not true for weak convergence in 𝒫\mathcal{P} is the main reason why we formulate all equations for subprobability measures, although we are mainly interested in the case of probability solutions. More details in this direction are stated in Remark 3.9.

3 Superposition Principle for deterministic nonlinear Fokker-Planck-Kolmogorov Equations

Fix T>0T>0 throughout, let each component of the coefficients

a=(aij)i,jd:[0,T]×𝒮𝒫×d𝕊d+,b=(bi)id:[0,T]×𝒮𝒫×dda=(a_{ij})_{i,j\leq d}:[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}\to\mathbb{S}^{+}_{d},\,b=(b_{i})_{i\leq d}:[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}\to\mathbb{R}^{d}

be ([0,T])(𝒮𝒫)(d)/()\mathcal{B}([0,T])\otimes\mathcal{B}(\mathcal{SP})\otimes\mathcal{B}(\mathbb{R}^{d})/\mathcal{B}(\mathbb{R})-measurable and consider the operator t,μ\mathcal{L}_{t,\mu} as in (1).

Definition 3.1.
  1. (i)

    A vaguely continuous curve (μt)tT𝒮𝒫(\mu_{t})_{t\leq T}\subseteq\mathcal{SP} is a subprobability solution to (NLFPK), if for each i,jdi,j\leq d the global integrability condition

    0Td|aij(t,μt,x)|+|bi(t,μt,x)|dμt(x)dt<+\int_{0}^{T}\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu_{t},x)|+|b_{i}(t,\mu_{t},x)|d\mu_{t}(x)dt<+\infty (3)

    holds and for each φCc2(d)\varphi\in C^{2}_{c}(\mathbb{R}^{d}) and t[0,T]t\in[0,T]

    dφ(x)𝑑μt(x)dφ(x)𝑑μ0(x)=0tds,μsφ(x)𝑑μs(x)𝑑s.\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{t}(x)-\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{0}(x)=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}\varphi(x)d\mu_{s}(x)ds. (4)
  2. (ii)

    A probability solution to (NLFPK) is a curve (μt)tT𝒫(\mu_{t})_{t\leq T}\subseteq\mathcal{P} fulfilling (3) and (4) such that tμtt\mapsto\mu_{t} is weakly continuous.

Since vaguely continuous curves of measures are in particular Borel curves, all integrals in the above definition are defined. Below we shortly refer to subprobability and probability solutions and keep in mind the respective continuity conditions. In the literature, more general notions of solutions to (NLFPK) are considered, such as (possibly discontinuous) curves of signed, bounded measures [5]. However, in this work, we restrict attention to continuous (sub-)probability solutions. In presence of the global integrability condition (3), we make the following observation.

Remark 3.2.
  1. (i)

    Any subprobability solution (μt)tT(\mu_{t})_{t\leq T} with μ0𝒫\mu_{0}\in\mathcal{P} is a probability solution. Indeed, to prove this it suffices to show μt(d)=1\mu_{t}(\mathbb{R}^{d})=1 for each tTt\leq T. Since (μt)tT(\mu_{t})_{t\leq T} fulfills (4), it suffices to choose a sequence φl\varphi_{l}, l1l\geq 1, from Cc2(d)C^{2}_{c}(\mathbb{R}^{d}) with the following properties: 0φl10\leq\varphi_{l}\nearrow 1 pointwise such that iφll0\partial_{i}\varphi_{l}\underset{l\to\infty}{\longrightarrow}0, ij2φll0\partial^{2}_{ij}\varphi_{l}\underset{l\to\infty}{\longrightarrow}0 pointwise with all first and second order derivatives bounded by some M<+M<+\infty uniformly in l1l\geq 1 and xdx\in\mathbb{R}^{d}. Considering (4) for the limit ll\to\infty, we obtain, by (3) and dominated convergence, for each t[0,T]t\in[0,T]

    d1𝑑μtd1𝑑μ0=0\int_{\mathbb{R}^{d}}1d\mu_{t}-\int_{\mathbb{R}^{d}}1d\mu_{0}=0

    and hence the claim.

  2. (ii)

    By the above argument, one shows that for any subprobability solution, (4) holds for each φCb2(d)\varphi\in C^{2}_{b}(\mathbb{R}^{d}).

Geometric approach to 𝒮𝒫\mathbf{\mathcal{SP}}

For our goals, it is preferable to consider 𝒮𝒫\mathcal{SP} as a manifold-like space. We refer the reader to the appendix in [17], where for the space of probability measures 𝒫\mathcal{P} the tangent spaces Tμ𝒫=L2(d,d;μ)T_{\mu}\mathcal{P}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu) and a suitable test function class Cb2(𝒫)\mathcal{F}C^{2}_{b}(\mathcal{P}),

FCb2(𝒫)F:μf(μ(φ1),,μ(φn)) for n1,fCb1(n),φiCc(d),F\in\mathcal{F}C^{2}_{b}(\mathcal{P})\iff F:\mu\mapsto f\big{(}\mu(\varphi_{1}),\dots,\mu(\varphi_{n})\big{)}\text{ for }n\geq 1,f\in C^{1}_{b}(\mathbb{R}^{n}),\,\varphi_{i}\in C^{\infty}_{c}(\mathbb{R}^{d}), (5)

have been introduced. Further, based on these choices, a natural pointwise definition of the gradient 𝒫F\nabla^{\mathcal{P}}F as a section in the tangent bundle

T𝒫=μ𝒫Tμ𝒫T\mathcal{P}=\bigsqcup_{\mu\in\mathcal{P}}T_{\mu}\mathcal{P}

for FF as above is given by

𝒫F(μ):=k=1nkf(μ(φ1),,μ(φn))φkTμ𝒫,\nabla^{\mathcal{P}}F(\mu):=\sum_{k=1}^{n}\partial_{k}f\big{(}\mu(\varphi_{1}),\dots,\mu(\varphi_{n})\big{)}\nabla\varphi_{k}\in T_{\mu}\mathcal{P},

which is shown to be independent of the representation of FF in terms of ff and φi\varphi_{i}. The setting in the present paper is nearly identical, but we consider the manifold-like space 𝒮𝒫\mathcal{SP} with the vague topology instead of 𝒫\mathcal{P} with the weak topology as in [17], because 𝒮𝒫\mathcal{SP} is embedded in \mathbb{R}^{\infty} in the following sense. Let

𝒢={gi,i1}\mathcal{G}=\{g_{i},i\geq 1\} (6)

be dense in Cc2(d)C^{2}_{c}(\mathbb{R}^{d}) with respect to ||||Cb2||\cdot||_{C^{2}_{b}} such that no gig_{i} is constantly 0. Clearly, any such set of functions is dense in Cc(d)C_{c}(\mathbb{R}^{d}) with respect to uniform convergence and measure-determining. Such sets of functions are sufficiently extensive to characterize the topology of 𝒮𝒫\mathcal{SP} as well as solutions to (NLFPK):

Lemma 3.3.

Let 𝒢\mathcal{G} be any set of functions with the properties mentioned above and let (μn)n1𝒮𝒫(\mu_{n})_{n\geq 1}\subseteq\mathcal{SP}. Then,

  1. (i)

    (μn)n1(\mu_{n})_{n\geq 1} converges vaguely to μ𝒮𝒫\mu\in\mathcal{SP} if and only if

    μn(gi)nμ(gi)\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i})

    for each gi𝒢g_{i}\in\mathcal{G}.

  2. (ii)

    A vaguely continuous curve (μt)tT𝒮𝒫(\mu_{t})_{t\leq T}\subseteq\mathcal{SP}, which fulfills (3), is a subprobability solution to (NLFPK) if and only if (4) holds for each gi𝒢g_{i}\in\mathcal{G} in place of φ\varphi.

Proof.
  1. (i)

    From μn(gi)nμ(gi)\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i}) for each gi𝒢g_{i}\in\mathcal{G}, one obtains for each fCc(d)f\in C_{c}(\mathbb{R}^{d}) and ϵ>0\epsilon>0 by choosing gi𝒢g_{i}\in\mathcal{G} with fgi<ϵ3||f-g_{i}||_{\infty}<\frac{\epsilon}{3}

    |μn(f)μ(f)||μn(f)μn(gi)|+|μn(gi)μ(gi)|+|μ(gi)μ(f)|ϵ|\mu_{n}(f)-\mu(f)|\leq|\mu_{n}(f)-\mu_{n}(g_{i})|+|\mu_{n}(g_{i})-\mu(g_{i})|+|\mu(g_{i})-\mu(f)|\leq\epsilon (7)

    for all sufficiently large n1n\geq 1.

  2. (ii)

    Let φCc2(d)\varphi\in C^{2}_{c}(\mathbb{R}^{d}) be approximated uniformly up to second-order derivatives by a sequence {gik}k1\{g_{i_{k}}\}_{k\geq 1} from 𝒢\mathcal{G}. Considering (4) for such gikg_{i_{k}} and letting kk\to\infty, the result follows by dominated convergence, which applies due to (3).

Considering 𝒮𝒫\mathcal{SP} as a (infinite-dimensional) manifold-like topological space, any set of functions 𝒢\mathcal{G} as above provides a global chart (i.e., an atlas consisting of a single chart) for 𝒮𝒫\mathcal{SP}, as it yields an embedding 𝒮𝒫\mathcal{SP}\subseteq\mathbb{R}^{\infty} (cf. Lemma 3.4).
Consider \mathbb{R}^{\infty} as a Polish space with the topology of pointwise convergence and the range G(𝒮𝒫)G(\mathcal{SP})\subseteq\mathbb{R}^{\infty} of GG as introduced below with its subspace topology. We write CTG(𝒮𝒫)C_{T}G(\mathcal{SP}) for the set of all elements in CTC_{T}\mathbb{R}^{\infty} with values in G(𝒮𝒫)G(\mathcal{SP}). For u[0,T]u\in[0,T], we denote by eue_{u} the canonical projection on CT𝒮𝒫C_{T}\mathcal{SP}

eu:(μt)tTμue_{u}:(\mu_{t})_{t\leq T}\mapsto\mu_{u}

and, likewise, by eue^{\infty}_{u} the projection on CTC_{T}\mathbb{R}^{\infty}. Subsequently, without further mentioning, we consider the spaces CT𝒮𝒫C_{T}\mathcal{SP} and CTC_{T}\mathbb{R}^{\infty} with σ\sigma-algebras

(CT𝒮𝒫)=σ(et,t[0,T]) and (CT)=σ(et,t[0,T]),\mathcal{B}(C_{T}\mathcal{SP})=\sigma(e_{t},t\in[0,T])\text{ and }\mathcal{B}(C_{T}\mathbb{R}^{\infty})=\sigma(e_{t}^{\infty},t\in[0,T]),

respectively. These algebras coincide with the Borel σ\sigma-algebras with respect to the topology of uniform convergence (because both 𝒮𝒫\mathcal{SP} and \mathbb{R}^{\infty} are Polish). Also, consider CTG(𝒮𝒫)C_{T}G(\mathcal{SP}) with the natural subspace σ\sigma-algebra of (CT)\mathcal{B}(C_{T}\mathbb{R}^{\infty}). We refer to these σ\sigma-algebras as the canonical σ\sigma-algebras on the respective spaces and denote the set of probability measures on the respective σ\sigma-algebras by 𝒫(CT𝒮𝒫)\mathcal{P}(C_{T}\mathcal{SP}) and 𝒫(CT)\mathcal{P}(C_{T}\mathbb{R}^{\infty}).

Lemma 3.4.

Let 𝒢={gi}i1\mathcal{G}=\{g_{i}\}_{i\geq 1} be a set of functions as in (6).

  1. (i)

    The map GG, depending on 𝒢\mathcal{G},

    G:𝒮𝒫,G(μ):=(μ(gi))i1G:\mathcal{SP}\to\mathbb{R}^{\infty},G(\mu):=(\mu(g_{i}))_{i\geq 1} (8)

    is a homeomorphism between 𝒮𝒫\mathcal{SP} and its range G(𝒮𝒫)G(\mathcal{SP}) (hence, formally, a global chart for 𝒮𝒫\mathcal{SP}). In particular, G(𝒮𝒫)G(\mathcal{SP})\subseteq\mathbb{R}^{\infty} is compact. Moreover, if 𝒢={gi,i1}\mathcal{G}^{\prime}=\{g_{i}^{\prime},i\geq 1\} is another set as in (6) with corresponding chart GG^{\prime}, then G=G𝒱G^{\prime}=G\circ\mathcal{V} for a unique homeomorphism 𝒱\mathcal{V} on 𝒮𝒫\mathcal{SP}.

  2. (ii)

    The map

    J:CT𝒮𝒫CT,J((μt)tT):=G(μt)tTJ:C_{T}\mathcal{SP}\to C_{T}\mathbb{R}^{\infty},\,J((\mu_{t})_{t\leq T}):=G(\mu_{t})_{t\leq T}

    is measurable and one-to-one with measurable inverse J1:CTG(𝒮𝒫)CT𝒮𝒫J^{-1}:C_{T}G(\mathcal{SP})\to C_{T}\mathcal{SP}. Further, CTG(𝒮𝒫)CTC_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty} is a measurable set, i.e. CTG(𝒮𝒫)(CT)C_{T}G(\mathcal{SP})\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty}).

Proof.
  1. (i)

    The continuity of GG is obvious by definition of the vague topology on 𝒮𝒫\mathcal{SP} and since 𝒢Cc(d)\mathcal{G}\subseteq C_{c}(\mathbb{R}^{d}). Since 𝒮𝒫\mathcal{SP} is compact with respect to the vague topology, compactness of G(𝒮𝒫)G(\mathcal{SP})\subseteq\mathbb{R}^{\infty} follows. 𝒢\mathcal{G} is measure-determining on d\mathbb{R}^{d}, which implies that GG is one-to-one. Since by definition

    G(μn)nG(μ)μn(gi)nμ(gi) for each gi𝒢,G(\mu_{n})\underset{n\to\infty}{\longrightarrow}G(\mu)\iff\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i})\text{ for each }g_{i}\in\mathcal{G},

    continuity of G1G^{-1} follows from Lemma 3.3 (i). The final assertion follows, since for GG^{\prime} as in the assertion, 𝒱:=G1𝒲G\mathcal{V}:=G^{-1}\circ\mathcal{W}\circ G^{\prime} with 𝒲:G(𝒮𝒫)G(𝒮𝒫)\mathcal{W}:G^{\prime}(\mathcal{SP})\to G(\mathcal{SP}), 𝒲:(μ(gi))i1(μ(gi))i1\mathcal{W}:(\mu(g^{\prime}_{i}))_{i\geq 1}\mapsto(\mu(g_{i}))_{i\geq 1} is a homeomorphism.

  2. (ii)

    Since GG is one-to-one and measurable, so is JJ. Clearly, CTG(𝒮𝒫)C_{T}G(\mathcal{SP}) is the range of JJ and hence J:CT𝒮𝒫CTG(𝒮𝒫)J:C_{T}\mathcal{SP}\to C_{T}G(\mathcal{SP}) is a bijection between standard Borel spaces (the latter, because 𝒮𝒫\mathcal{SP} and G(𝒮𝒫)G(\mathcal{SP}) with the respective topologies are Polish). This yields the measurability of J1J^{-1}. Finally, closedness of G(𝒮𝒫)G(\mathcal{SP})\subseteq\mathbb{R}^{\infty} implies that CTG(𝒮𝒫)CTC_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty} is a measurable set, because G(𝒮𝒫)G(\mathcal{SP}) carries the subspace topology inherited from \mathbb{R}^{\infty}.

By part (i) of the previous lemma it is justified to fix a set 𝒢={gi,i1}\mathcal{G}=\{g_{i},i\geq 1\} for the remainder of the section. In order to switch between test functions on 𝒮𝒫\mathcal{SP} and \mathbb{R}^{\infty} in an equivalent way, we slightly deviate from the test function class presented in [17] (see (5)) and, instead, consider

Cb2(𝒢):={F:𝒮𝒫|F(μ)=f(μ(g1),,μ(gn)),fCb2(n),n1},\mathcal{F}C^{2}_{b}(\mathcal{G}):=\{F:\mathcal{SP}\to\mathbb{R}|F(\mu)=f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)},f\in C^{2}_{b}(\mathbb{R}^{n}),n\geq 1\},

where the restriction fCb2(n)f\in C^{2}_{b}(\mathbb{R}^{n}) is made for consistency with the stochastic case later on only. We summarize our geometric interpretation of 𝒮𝒫\mathcal{SP}, which is of course still a close adaption of the ideas presented in [17]:
For the manifold-like space 𝒮𝒫\mathcal{SP}, we consider smooth test functions FCb2(𝒢)F\in\mathcal{F}C^{2}_{b}(\mathcal{G}), with 𝒢\mathcal{G} being fixed as in (6). For each μ𝒮𝒫\mu\in\mathcal{SP}, we have the tangent space Tμ𝒮𝒫=L2(d,d;μ)T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu) and the gradient

𝒮𝒫F(μ)=k=1nkf(μ(g1),,μ(gn))gkTμ𝒮𝒫\nabla^{\mathcal{SP}}F(\mu)=\sum_{k=1}^{n}\partial_{k}f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}\nabla g_{k}\in T_{\mu}\mathcal{SP}

for Cb2(𝒢)F:μf(μ(g1),,μ(gn))\mathcal{F}C^{2}_{b}(\mathcal{G})\ni F:\mu\mapsto f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)} as a section in the tangent bundle T𝒮𝒫T\mathcal{SP}, which is independent of the representation of FF. Adding to the approach of 𝒮𝒫\mathcal{SP} as a manifold-like space, the global chart GG as in (8) embeds 𝒮𝒫\mathcal{SP} into \mathbb{R}^{\infty}. However, we do not rigorously treat 𝒮𝒫\mathcal{SP} as a (Fréchet-)manifold and consider the embedding 𝒮𝒫\mathcal{SP}\subseteq\mathbb{R}^{\infty} merely as a tool to transfer (NLFPK) and its corresponding continuity equation to equivalent equations over \mathbb{R}^{\infty}, as outlined below.

The continuity equation (𝒮𝒫\mathcal{SP}-CE)

As mentioned in the introduction, we study the linear continuity equation associated to (NLFPK) as derived in [17], which is a first-order equation for curves of measures on 𝒮𝒫\mathcal{SP}. More precisely, in analogy to the derivation in [17], it is readily seen that any subprobability solution (μt)tT(\mu_{t})_{t\leq T} to (NLFPK) induces a curve of elements in 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}), Γt:=δμt\Gamma_{t}:=\delta_{\mu_{t}}, tTt\leq T, with

𝒮𝒫F(μ)𝑑Γt(μ)𝒮𝒫F(μ)𝑑Γ0(μ)=0t𝒮𝒫𝒮𝒫F(μ),b(s,μ)+a(s,μ)L2(μ)𝑑Γs(μ)𝑑s\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}d\Gamma_{s}(\mu)ds (9)

for each tTt\leq T and FCb2(𝒢)F\in\mathcal{F}C^{2}_{b}(\mathcal{G}). Here, we set b(s,μ)=b(s,μ,):ddb(s,\mu)=b(s,\mu,\cdot):\mathbb{R}^{d}\to\mathbb{R}^{d} (similarly for a(s,μ)a(s,\mu)), L2(μ)=L2(d,d;μ)L^{2}(\mu)=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu) and abbreviated

𝒮𝒫F(μ),b(s,μ)+a(s,μ)L2(μ)=dk=1n(kf)(μ(g1),,μ(gn))aij(s,μ,x)ij2gk(x)+bi(s,μ,x)igk(x)dμ(x).\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}=\int_{\mathbb{R}^{d}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}a_{ij}(s,\mu,x)\partial^{2}_{ij}g_{k}(x)+b_{i}(s,\mu,x)\partial_{i}g_{k}(x)d\mu(x).

We rewrite (9) in distributional form in duality with Cb2(𝒢)\mathcal{F}C^{2}_{b}(\mathcal{G}) as

tΓt=𝒮𝒫([bt+at]Γt),tT.\partial_{t}\Gamma_{t}=-\nabla^{\mathcal{SP}}\cdot([b_{t}+a_{t}\nabla]\Gamma_{t}),\,\,t\leq T.

Setting

𝐋tF(μ):=a(t,μ)+b(t,μ),𝒮𝒫F(μ)L2(μ),\mathbf{L}_{t}F(\mu):=\big{\langle}a(t,\mu)\nabla+b(t,\mu),\nabla^{\mathcal{SP}}F(\mu)\big{\rangle}_{L^{2}(\mu)}, (10)

this is just the linear continuity equation (𝒮𝒫\mathcal{SP}-CE). The term aa\nabla has rigorous meaning only, if aa has sufficiently regular components in order to put the derivative \nabla on aa via integration by parts, which we do not assume at any point. Considering 𝒮𝒫\mathcal{SP} as a manifold-like space, one may formally regard to a+ba\nabla+b as a time-dependent section in the tangent bundle T𝒮𝒫T\mathcal{SP}.
More generally, we introduce the following notion of solution to (𝒮𝒫\mathcal{SP}-CE) (see [17]):

Definition 3.5.

A weakly continuous curve (Γt)tT𝒫(𝒮𝒫)(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP}) is a solution to (𝒮𝒫\mathcal{SP}-CE), if the integrability condition

0T𝒮𝒫b(t,μ,)L1(d,d;μ)+a(t,μ,)L1(d,d2;μ)dΓt(μ)dt<+\int_{0}^{T}\int_{\mathcal{SP}}||b(t,\mu,\cdot)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)}+||a(t,\mu,\cdot)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d^{2}};\mu)}d\Gamma_{t}(\mu)dt<+\infty (11)

is fulfilled and for each FCb2(𝒢)F\in\mathcal{F}C^{2}_{b}(\mathcal{G}) and t[0,T]t\in[0,T]

𝒮𝒫F(μ)𝑑Γt(μ)𝒮𝒫F(μ)𝑑Γ0(μ)=0t𝒮𝒫𝐋sF(μ)𝑑Γs(μ)𝑑s\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds (12)

holds (which is just (9)).

The choice of 𝒢\mathcal{G} as in (6) implies that any solution in the above sense fulfills (12) even for each FCb2(𝒮𝒫)F\in\mathcal{F}C^{2}_{b}(\mathcal{SP}), i.e. for the larger class of test functions considered in [17] (upon extending their domain from 𝒫\mathcal{P} to 𝒮𝒫\mathcal{SP}). In particular, this notion of solution is independent of 𝒢\mathcal{G}. The main result of this chapter, Theorem 3.7, states that any solution to (𝒮𝒫\mathcal{SP}-CE) as in Definition 3.5 arises as a superposition of solutions to (NLFPK). Note that for ν𝒮𝒫\nu\in\mathcal{SP}, uniqueness of solutions (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-CE) with Γ0=δν\Gamma_{0}=\delta_{\nu} implies uniqueness of subprobability solutions (μt)tT(\mu_{t})_{t\leq T} to (NLFPK) with μ0=ν\mu_{0}=\nu.

Transferring (NLFPK) and (𝒮𝒫\mathcal{SP}-CE) to \mathbb{R}^{\infty}

We use the global chart G:𝒮𝒫G:\mathcal{SP}\to\mathbb{R}^{\infty} and the map JJ of Lemma 3.4 to reformulate both (NLFPK) and (𝒮𝒫\mathcal{SP}-CE) on \mathbb{R}^{\infty}. Define a Borel vector field B¯=(B¯k)k\bar{B}=(\bar{B}_{k})_{k\in\mathbb{N}} component-wise as follows. For t[0,T]t\in[0,T], consider the Borel set At(𝒮𝒫)A_{t}\in\mathcal{B}(\mathcal{SP}),

At:={μ𝒮𝒫:d|aij(t,μ,x)+|bi(t,μ,x)|dμ(x)<i,jd}A_{t}:=\bigg{\{}\mu\in\mathcal{SP}:\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu,x)+|b_{i}(t,\mu,x)|d\mu(x)<\infty\,\,\forall i,j\leq d\bigg{\}}

and define B:=(Bk)kB:=(B_{k})_{k\in\mathbb{N}} via

Bk(t,μ):=dt,μgk(x)𝑑μ(x),(t,μ)[0,T]×At.B_{k}(t,\mu):=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu}g_{k}(x)d\mu(x),\quad(t,\mu)\in[0,T]\times A_{t}.

Now define B¯:[0,T]×\bar{B}:[0,T]\times\mathbb{R}^{\infty}\to\mathbb{R}^{\infty} via

B¯(t,z):={B(t,G1(z)), if zG(At)0, else,\bar{B}(t,z):=\begin{cases}B(t,G^{-1}(z)),&\quad\text{ if }z\in G(A_{t})\\ 0,&\quad\text{ else,}\end{cases}

which is Borel measurable by Lemma 3.4. Next, consider the differential equation on \mathbb{R}^{\infty}

ddtzt=B¯(t,zt),t[0,T],\frac{d}{dt}z_{t}=\bar{B}(t,z_{t}),\,\,t\in[0,T], (\mathbb{R}^{\infty}-ODE)

which turns out to be the suitable analogue to (NLFPK) on \mathbb{R}^{\infty}. Analogously, the corresponding continuity equation for curves of Borel probability measures Γ¯t\bar{\Gamma}_{t} on \mathbb{R}^{\infty}, i.e.

tΓ¯t=¯(B¯Γ¯t),t[0,T],\partial_{t}\bar{\Gamma}_{t}=-\bar{\nabla}\cdot(\bar{B}\bar{\Gamma}_{t}),\,\,t\in[0,T], (\mathbb{R}^{\infty}-CE)

with ¯\bar{\nabla} as introduced below, is the natural analogue of the linear continuity equation (𝒮𝒫\mathcal{SP}-CE). Roughly, these analogies are to be understood in the sense that solutions to (NLFPK) and (𝒮𝒫\mathcal{SP}-CE) can be transferred to solutions to (\mathbb{R}^{\infty}-ODE) and (\mathbb{R}^{\infty}-CE), respectively, via the chart GG. We refer to the proof of the main result below for more details. Let

pi:zzi,zp_{i}:z\mapsto z_{i},\,\,z\in\mathbb{R}^{\infty}

denote the canonical projection to the ii-th component, set πn=(p1,,pn)\pi_{n}=(p_{1},\dots,p_{n}) and

Cb2():={F¯:|F¯=fπn,fCb2(n),n1}.\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}):=\{\bar{F}:\mathbb{R}^{\infty}\to\mathbb{R}|\bar{F}=f\circ\pi_{n},f\in C^{2}_{b}(\mathbb{R}^{n}),n\geq 1\}.

By ¯\bar{\nabla} we denote the gradient-type operator on \mathbb{R}^{\infty}, acting on F¯=fπnCb2()\bar{F}=f\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}) via

¯F¯(z):=((1f)(πnz),,(nf)(πnz),0,0,).\bar{\nabla}\bar{F}(z):=\big{(}(\partial_{1}f)(\pi_{n}z),\dots,(\partial_{n}f)(\pi_{n}z),0,0,\dots\big{)}. (13)

Again, the restriction to test functions possessing second-order derivatives is made in order to be consistent with the stochastic (second-order) case later on.

Definition 3.6.
  1. (i)

    A curve (zt)tT=((pizt)i1)tTCT(z_{t})_{t\leq T}=((p_{i}\circ z_{t})_{i\geq 1})_{t\leq T}\in C_{T}\mathbb{R}^{\infty} is a solution to (\mathbb{R}^{\infty}-ODE), if for each i1i\geq 1 the \mathbb{R}-valued curve tpiztt\mapsto p_{i}\circ z_{t} is absolutely continuous with weak derivative tpiB¯(t,zt)t\mapsto p_{i}\circ\bar{B}(t,z_{t}) dtdt-a.s.

  2. (ii)

    A curve (Γ¯t)tT𝒫()(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\mathbb{R}^{\infty}) is a solution to (\mathbb{R}^{\infty}-CE), if it is weakly continuous, fulfills the integrability condition

    0T|B¯k(t,z)|𝑑Γ¯t(z)𝑑t<+ for each k1\int_{0}^{T}\int_{\mathbb{R}^{\infty}}|\bar{B}_{k}(t,z)|d\bar{\Gamma}_{t}(z)dt<+\infty\text{ for each }k\geq 1 (14)

    and for each F¯Cb2()\bar{F}\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}) the identity

    F¯(z)𝑑Γ¯t(z)F¯(z)𝑑Γ¯0(z)=0t¯F¯(z)B¯(s,z)𝑑Γ¯s(z)𝑑s\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{t}(z)-\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{0}(z)=\int_{0}^{t}\int_{\mathbb{R}^{\infty}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)d\bar{\Gamma}_{s}(z)ds

    holds for all t[0,T]t\in[0,T].

3.1 Main Result: Deterministic case

The following theorem is the main result for the deterministic case.

Theorem 3.7.

Let a,ba,b be Borel coefficients on [0,T]×𝒮𝒫×d[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}. For any weakly continuous solution (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-CE) in the sense of Definition 3.5, there exists a probability measure η𝒫(CT𝒮𝒫)\eta\in\mathcal{P}(C_{T}\mathcal{SP}), which is concentrated on vaguely continuous subprobability solutions to (NLFPK) such that

ηet1=Γt,t[0,T].\eta\circ e_{t}^{-1}=\Gamma_{t},\,\,t\in[0,T].

Moreover, if Γ0𝒫(𝒫)\Gamma_{0}\in\mathcal{P}(\mathcal{P}), then η\eta is concentrated on weakly continuous probability solutions to (NLFPK).

The proof relies on a superposition principle for measure-valued solution curves of continuity equations on \mathbb{R}^{\infty} and its corresponding differential equation, which we recall in Proposition 3.8 below. More precisely, we proceed in three steps. First, we transfer (Γt)tT(\Gamma_{t})_{t\leq T} to a solution (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} to (\mathbb{R}^{\infty}-CE). Then, by Proposition 3.8 below we obtain a measure η¯𝒫(CT)\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty}) with η¯(et)1=Γt¯\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma_{t}}, which is concentrated on solution curves to (\mathbb{R}^{\infty}-ODE). Finally, we transfer η¯\bar{\eta} back to a measures η𝒫(CT𝒮𝒫)\eta\in\mathcal{P}(C_{T}\mathcal{SP}) with the desired properties. Below, we denote by Cb1()\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty}) the set of test functions of same type as in Cb2()\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}), but with fCb1(n)f\in C^{1}_{b}(\mathbb{R}^{n}) in place of fCb2(n)f\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{n}).

Proposition 3.8.

[Superposition principle on \mathbb{R}^{\infty}, Thm. 7.1. [2]] Let (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} be a solution to (\mathbb{R}^{\infty}-CE) in the sense of Definition 3.6 (ii) with test functions Cb1()\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty}) instead of Cb2()\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}). Then, there exists a Borel measures η¯𝒫(CT)\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty}) concentrated on solutions to (\mathbb{R}^{\infty}-ODE) in the sense of Definition 3.6 (i) such that

η¯(et)1=Γ¯t,tT.\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t},\,\,t\leq T.

We proceed to the proof of the main result.

Proof of Theorem 3.7: Let Γ=(Γt)tT\Gamma=(\Gamma_{t})_{t\leq T} be a weakly continuous solution to (𝒮𝒫\mathcal{SP}-CE) as in Definition 3.5.
Step 1: From (𝒮𝒫\mathcal{SP}-CE) to (\mathbb{R}^{\infty}-CE): Set

Γ¯t:=ΓtG1,\bar{\Gamma}_{t}:=\Gamma_{t}\circ G^{-1},

with GG as in Lemma 3.4, which corresponds to the fixed set of functions 𝒢\mathcal{G}. Since GG is continuous, (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} is a weakly continuous curve of Borel subprobability measures on \mathbb{R}^{\infty}. We show that (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} solves (\mathbb{R}^{\infty}-CE). Indeed, the integrability condition (14) is fulfilled, since (Γt)tT(\Gamma_{t})_{t\leq T} fulfills Definition 3.5. Further, since Γ\Gamma solves (𝒮𝒫\mathcal{SP}-CE), we have for any Cb2(𝒢)F:μf(μ(g1),,μ(gn))\mathcal{F}C^{2}_{b}(\mathcal{G})\ni F:\mu\mapsto f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)} and t[0,T]t\in[0,T]

0t𝒮𝒫𝐋sF(μ)𝑑Γs(μ)𝑑s=𝒮𝒫F(μ)𝑑Γt(μ)𝒮𝒫F(μ)𝑑Γ0(μ)\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds=\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu) (15)

and hence, abbreviating pkB(t,)p_{k}\circ B(t,\cdot) by BtkB^{k}_{t} and setting F¯=fπn\bar{F}=f\circ\pi_{n} for ff as above, we have

0t𝒮𝒫𝐋sF(μ)𝑑Γs(μ)𝑑s\displaystyle\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds =0t𝒮𝒫k=1n(kf)(μ(g1),,μ(gn))(ds,μgk(x)𝑑μ(x))Γs(μ)ds\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu}g_{k}(x)d\mu(x)\bigg{)}\Gamma_{s}(\mu)ds
=0t𝒮𝒫k=1n(kf)(μ(g1),,μ(gn))Bsk(μ)dΓs(μ)ds\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}B^{k}_{s}(\mu)d\Gamma_{s}(\mu)ds
=0t𝒮𝒫k=1n(kf)(p1G(μ),,pnG(μ))B¯skG(μ)dΓs(μ)ds\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}p_{1}\circ G(\mu),\dots,p_{n}\circ G(\mu)\big{)}\bar{B}^{k}_{s}\circ G(\mu)d\Gamma_{s}(\mu)ds
=0tF¯(z)B¯s(z)Γ¯s(z)𝑑s\displaystyle=\int_{0}^{t}\int_{\mathbb{R}^{\infty}}\nabla\bar{F}(z)\cdot\bar{B}_{s}(z)\bar{\Gamma}_{s}(z)ds

and, furthermore, for each s[0,T]s\in[0,T]

𝒮𝒫F(μ)𝑑Γs(μ)=𝒮𝒫f(p1G(μ),,pnG(μ))𝑑Γs(μ)=F¯(z)𝑑Γ¯s(z).\int_{\mathcal{SP}}F(\mu)d\Gamma_{s}(\mu)=\int_{\mathcal{SP}}f\big{(}p_{1}\circ G(\mu),\dots,p_{n}\circ G(\mu)\big{)}d\Gamma_{s}(\mu)=\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{s}(z).

Comparing with (15), it follows that (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} is a solution to (\mathbb{R}^{\infty}-CE) as claimed, because FCb2(𝒢)F\in\mathcal{F}C^{2}_{b}(\mathcal{G}) was arbitrary and hence F¯\bar{F} as above is arbitrary in Cb2()\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}). By standard approximation, one extends the above equation to test functions F¯\bar{F} from Cb1()\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty}).

Step 2: From (\mathbb{R}^{\infty}-CE) to (\mathbb{R}^{\infty}-ODE): Proposition 3.8 implies the existence of a measure η¯𝒫(CT)\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty}) such that

  1. (i)

    η¯(et)1=Γ¯t\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t} for each t[0,T]t\in[0,T]

  2. (ii)

    η¯\bar{\eta} is concentrated on solution paths of (\mathbb{R}^{\infty}-ODE).

Step 3: From (\mathbb{R}^{\infty}-ODE) to (NLFPK): We show that the measure η:=η¯(J1)1\eta:=\bar{\eta}\circ(J^{-1})^{-1}, with JJ as in Lemma 3.4 fulfills all desired properties. Indeed, since

η¯(et)1=Γ¯t=ΓtG1,\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t}=\Gamma_{t}\circ G^{-1},

for each t[0,T]t\in[0,T] we deduce that η¯(et)1\bar{\eta}\circ(e^{\infty}_{t})^{-1} is concentrated on G(𝒮𝒫)G(\mathcal{SP}). By Lemma 3.4, G(𝒮𝒫)G(\mathcal{SP})\subseteq\mathbb{R}^{\infty} is closed. Since by construction η¯\bar{\eta} is concentrated on continuous curves in \mathbb{R}^{\infty}, η¯\bar{\eta} is concentrated on CTG(𝒮𝒫)C_{T}G(\mathcal{SP}). Further, CTG(𝒮𝒫)CTC_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty} is a measurable set and J1:CTG(𝒮𝒫)CT𝒮𝒫J^{-1}:C_{T}G(\mathcal{SP})\to C_{T}\mathcal{SP} is measurable by Lemma 3.4. Therefore, we may define η𝒫(CT𝒮𝒫)\eta\in\mathcal{P}(C_{T}\mathcal{SP}) via

η:=η¯(J1)1.\eta:=\bar{\eta}\circ(J^{-1})^{-1}.

It remains to verify ηet1=Γt\eta\circ e_{t}^{-1}=\Gamma_{t} for all t[0,T]t\in[0,T] and that η\eta is concentrated on subprobability solutions to (NLFPK). Concerning the first matter, we have

ηet1=η¯(J1)1et1=η¯(etJ1)1\eta\circ e_{t}^{-1}=\bar{\eta}\circ(J^{-1})^{-1}\circ e_{t}^{-1}=\bar{\eta}\circ(e_{t}\circ J^{-1})^{-1}

and

Γt=Γt(G1G)1=Γ¯t(G1)1=η¯(G1et)1.\Gamma_{t}=\Gamma_{t}\circ(G^{-1}\circ G)^{-1}=\bar{\Gamma}_{t}\circ(G^{-1})^{-1}=\bar{\eta}\circ(G^{-1}\circ e_{t}^{\infty})^{-1}.

Since etJ1e_{t}\circ J^{-1} and G1etG^{-1}\circ e_{t}^{\infty} coincide as measurable maps on CTG(𝒮𝒫)C_{T}G(\mathcal{SP}) and it was shown above that η¯\bar{\eta} is concentrated on CTG(𝒮𝒫)C_{T}G(\mathcal{SP}), we obtain

ηet1=Γt,tT.\eta\circ e_{t}^{-1}=\Gamma_{t},\,\,t\leq T.

Concerning the second aspect, note that by definition of η\eta and Γ¯t\bar{\Gamma}_{t} and by the equality etJ1=G1ete_{t}\circ J^{-1}=G^{-1}\circ e_{t}^{\infty}, (11) for Γ\Gamma implies that η\eta is concentrated on vaguely continuous curves tμtt\mapsto\mu_{t} in 𝒮𝒫\mathcal{SP} with the global integrability property (3) such that tG(μt)t\mapsto G(\mu_{t}) is a solution to (\mathbb{R}^{\infty}-ODE). Each such curve tμtt\mapsto\mu_{t} is a subprobability solution to (NLFPK). Indeed, due to μtAt\mu_{t}\in A_{t} dtdt-a.s., we have

ddtpkG(μt)=\displaystyle\frac{d}{dt}p_{k}\circ G(\mu_{t})= pkB¯(t,G(μt))dta.s.ddtpkG(μt)=pkB(t,μt)dta.s.\displaystyle\,p_{k}\circ\bar{B}(t,G(\mu_{t}))\quad dt-a.s.\iff\frac{d}{dt}p_{k}\circ G(\mu_{t})=p_{k}\circ B(t,\mu_{t})\quad dt-a.s.
ddtdgk(x)dμt(x)=dt,μtgk(x)dμt(x)dta.s.\displaystyle\iff\frac{d}{dt}\int_{\mathbb{R}^{d}}g_{k}(x)d\mu_{t}(x)=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu_{t}}g_{k}(x)d\mu_{t}(x)\quad dt-a.s.
dgk𝑑μtdgk𝑑μ0=0tds,μsgk(x)𝑑μs(x)𝑑s,t[0,T],\displaystyle\iff\int_{\mathbb{R}^{d}}g_{k}d\mu_{t}-\int_{\mathbb{R}^{d}}g_{k}d\mu_{0}=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}g_{k}(x)d\mu_{s}(x)ds,\,\,t\in[0,T],

and Lemma 3.3 (ii) applies. It remains to prove the additional assertion about probability solutions. To this end, assume Γ0\Gamma_{0} is concentrated on 𝒫\mathcal{P}. Then, η(e0𝒫)=1\eta(e_{0}\in\mathcal{P})=1 and hence the claim follows by Remark 3.2. ∎

The final assertion of the theorem in particular implies: If Γ0𝒫(𝒫)\Gamma_{0}\in\mathcal{P}(\mathcal{P}) for a weakly continuous solution (Γt)tT𝒫(𝒮𝒫)(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP}) to (𝒮𝒫\mathcal{SP}-CE), then Γt𝒫(𝒫)\Gamma_{t}\in\mathcal{P}(\mathcal{P}) for each tTt\leq T. Of course, this is to be expected due to the global integrability condition in Definition 3.5.

Remark 3.9.

Finally, let us explain why we developed the above result for subprobability solutions to (NLFPK) although our principal interest is restricted to probability solutions. If we directly consider solution curves (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-CE) with Γt𝒫(𝒫)\Gamma_{t}\in\mathcal{P}(\mathcal{P}), we cannot prove that η\eta in Theorem 3.7 is concentrated on CT𝒫C_{T}\mathcal{P} (in fact, not even η(CT𝒫)>0\eta(C_{T}\mathcal{P})>0 could be shown). Indeed, inspecting the proof above, one may only prove that ηet1\eta\circ e_{t}^{-1} is concentrated on 𝒫\mathcal{P} for each tTt\leq T. But since 𝒫𝒮𝒫\mathcal{P}\subseteq\mathcal{SP} is not closed, curves in the support of η\eta may be proper subprobability-valued at single times. The deeper reason for this is that the range G(𝒫)G(\mathcal{P}) of GG as in 3.3 as a map on 𝒫\mathcal{P} with the weak topology is not closed in \mathbb{R}^{\infty}. It seems that one cannot resolve this issue by simply changing the function set 𝒢\mathcal{G}, since there exists no countable set of functions, which allows for a characterization of weak instead of vague convergence as in Lemma 3.3. Since 𝒮𝒫\mathcal{SP} with the vague topology is compact and the vague test function class Cc(d)C_{c}(\mathbb{R}^{d}) is separable, it is feasible to carry out the entire development for subprobability measures as above.

We also mention that to our understanding there is no inherent reason why the superposition principle could not be extended to larger spaces of measures (e.g. spaces of signed measures), as long as its topology allows for a suitable identification with \mathbb{R}^{\infty} as in our present case. Our principal motivation from a probabilistic viewpoint was to study curves of probability measures, and we were only forced to extend to 𝒮𝒫\mathcal{SP}, the vague closure of 𝒫\mathcal{P}, by the reasons outlined above. In order to replace 𝒮𝒫\mathcal{SP} by some larger space of measures \mathcal{M}, it seems indispensable that Lemma 3.4 remains true, i.e. that the range of \mathcal{M} under a suitable homeomorphism is closed in \mathbb{R}^{\infty}.

3.2 Consequences and applications

The following existence- and uniqueness results immediately follow from the superposition principle Theorem 3.7 and provide an equivalence between the nonlinear FPK-equation (NLFPK) and its linearized continuity equation (𝒮𝒫\mathcal{SP}-CE).

Corollary 3.10.

Let μ0𝒮𝒫\mu_{0}\in\mathcal{SP} and assume there exists a solution to (𝒮𝒫\mathcal{SP}-CE) with initial condition δμ0\delta_{\mu_{0}}. Then, there exists a subprobability solution to (NLFPK) with initial condition μ0\mu_{0}. Moreover, if μ0𝒫\mu_{0}\in\mathcal{P}, then there exists a probability solution to (NLFPK) with initial condition μ0\mu_{0}.

Proof.

By Theorem 3.7 there exists a probability measure η\eta concentrated on subprobability solutions to (NLFPK) with ηe01=δμ0\eta\circ e_{0}^{-1}=\delta_{\mu_{0}}. Hence, at least one such solution to (NLFPK) with initial condition μ0\mu_{0} exists. The second assertion is treated similarly. ∎

Corollary 3.11.

Let μ0𝒮𝒫\mu_{0}\in\mathcal{SP} and assume there exists at most one vaguely continuous subprobability solution to (NLFPK) with initial condition μ0\mu_{0}. Then, there exists also at most one weakly continuous solution (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-CE) with initial condition δμ0\delta_{\mu_{0}}. If μ0𝒫\mu_{0}\in\mathcal{P}, then, in the case of existence, Γt(𝒫)=1\Gamma_{t}(\mathcal{P})=1 for each t[0,T]t\in[0,T].

Proof.

Let Γ(1)\Gamma^{(1)} and Γ(2)\Gamma^{(2)} be weakly continuous solutions to (𝒮𝒫\mathcal{SP}-CE) with Γ0(i)=δμ0\Gamma^{(i)}_{0}=\delta_{\mu_{0}} for i{1,2}i\in\{1,2\}. By Theorem 3.7, there exist probability measures η(i)\eta^{(i)}, i{1,2}i\in\{1,2\}, concentrated on subprobability solutions to (NLFPK) with initial condition μ0\mu_{0} such that η(i)et1=Γt(i)\eta^{(i)}\circ e_{t}^{-1}=\Gamma_{t}^{(i)} for each t[0,T]t\in[0,T] and i{1,2}i\in\{1,2\}. By assumption, we obtain η(1)=δμ=η(2)\eta^{(1)}=\delta_{\mu}=\eta^{(2)} for a unique element μCT𝒮𝒫\mu\in C_{T}\mathcal{SP} and thus also Γ(1)=Γ(2)\Gamma^{(1)}=\Gamma^{(2)}. If μ0𝒫\mu_{0}\in\mathcal{P}, then μCT𝒫\mu\in C_{T}\mathcal{P} by Remark 3.2, which gives the second assertion. ∎

3.2.1 Application to coupled nonlinear-linear Fokker-Planck-Kolmogorov equations

Using the superposition principle, we prove an open conjecture posed in [17]. Let us shortly recapitulate the necessary framework. In [17], the authors consider a coupled nonlinear-linear FPK-equation of type

{tμt=t,μtμttνt=t,μtνt,\begin{cases}\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t}\\ \partial_{t}\nu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\nu_{t},\end{cases} (16)

i.e. comparing to our situation the first nonlinear equation is of type (NLFPK) and the second (linear) equation is obtained by ”freezing” a solution (μt)tT(\mu_{t})_{t\leq T} to the first equation in the nonlinearity spot of \mathcal{L}. For an initial condition (μ¯,ν¯)𝒫×𝒫(\bar{\mu},\bar{\nu})\in\mathcal{P}\times\mathcal{P}, (16) is said to have a unique solution, if there exists a unique probability solution (μt)tT(\mu_{t})_{t\leq T} to the first equation in the sense of Definition 3.1 with μ0=μ¯\mu_{0}=\bar{\mu} and a unique weakly continuous curve (νt)tT𝒫(\nu_{t})_{t\leq T}\subseteq\mathcal{P}, which solves the second equation with fixed coefficient μt\mu_{t} with ν0=ν¯\nu_{0}=\bar{\nu} (we refer to [17] for more details). The authors associate a linear continuity equation on d×𝒫\mathbb{R}^{d}\times\mathcal{P} to (16) in the following sense: Let 𝕃\mathbb{L} be the operator acting on functions

𝒞:={Φ:(x,μ)φ(x)F(μ)|φCc2(d),FCb2(𝒫)},\mathcal{C}:=\big{\{}\Phi:(x,\mu)\mapsto\varphi(x)F(\mu)|\varphi\in C^{2}_{c}(\mathbb{R}^{d}),F\in\mathcal{F}C^{2}_{b}(\mathcal{P})\big{\}},

via

𝕃tΦ(x,μ):=t,μΦ(,μ)(x)+𝐋tΦ(x,)(μ),\mathbb{L}_{t}\Phi(x,\mu):=\mathcal{L}_{t,\mu}\Phi(\cdot,\mu)(x)+\mathbf{L}_{t}\Phi(x,\cdot)(\mu),

with \mathcal{L} as in (1) and 𝐋\mathbf{L} as in (10). Consider the continuity equation

tΛt=𝕃tΛt,t[0,T]\partial_{t}\Lambda_{t}=\mathbb{L}_{t}^{*}\Lambda_{t},\,\,t\in[0,T] (17)

for weakly continuous curves of Borel probability measures on d×𝒫\mathbb{R}^{d}\times\mathcal{P}. The exact notion of solution can be found in [17], where also the following observation is made: A pair (μt,νt)tT(\mu_{t},\nu_{t})_{t\leq T} solves (16) if and only if Λt:=νt×δμt\Lambda_{t}:=\nu_{t}\times\delta_{\mu_{t}} solves (17). Using our main result, we prove the following conjecture posed in Remark 4.4. of [17].

Proposition 3.12.

If (μt,νt)tT(\mu_{t},\nu_{t})_{t\leq T} is the unique solution to (16) with initial condition (μ¯,ν¯)𝒫×𝒫(\bar{\mu},\bar{\nu})\in\mathcal{P}\times\mathcal{P}, then (νt×δμt)tT(\nu_{t}\times\delta_{\mu_{t}})_{t\leq T} is the unique solution to (17) with initial condition ν¯×δμ¯\bar{\nu}\times\delta_{\bar{\mu}}.

Proof.

By Corollary (3.11), the unique solution to (𝒮𝒫\mathcal{SP}-CE) with initial condition δμ¯\delta_{\bar{\mu}} is (δμt)tT(\delta_{\mu_{t}})_{t\leq T}. Let (Λt(1))tT(\Lambda^{(1)}_{t})_{t\leq T} and (Λt(2))tT(\Lambda^{(2)}_{t})_{t\leq T} be two solutions to (17) with initial condition ν¯×δμ¯\bar{\nu}\times\delta_{\bar{\mu}}. It is straightforward to check that the curves of second marginals (Λt(1)Π21)tT(\Lambda_{t}^{(1)}\circ\varPi_{2}^{-1})_{t\leq T} and (Λt(2)Π21)tT(\Lambda_{t}^{(2)}\circ\varPi_{2}^{-1})_{t\leq T} are probability solutions to (𝒮𝒫\mathcal{SP}-CE) with initial condition δμ¯\delta_{\bar{\mu}} (where we denote by Π2\varPi_{2} the projection from d×𝒫\mathbb{R}^{d}\times\mathcal{P} onto the second coordinate). Hence, for each t[0,T]t\in[0,T]

Λt(1)Π21=δμt=Λt(2)Π21.\Lambda_{t}^{(1)}\circ\varPi_{2}^{-1}=\delta_{\mu_{t}}=\Lambda^{(2)}_{t}\circ\varPi_{2}^{-1}.

Consequently, Λt(i)\Lambda_{t}^{(i)} is of product type, i.e. Λt(i)=γt(i)×δμt\Lambda^{(i)}_{t}=\gamma_{t}^{(i)}\times\delta_{\mu_{t}} for weakly continuous curves (γt(i))tT𝒫(\gamma^{(i)}_{t})_{t\leq T}\subseteq\mathcal{P}, i{1,2}i\in\{1,2\}. It is immediate to show that each curve γ(i)\gamma^{(i)} solves the second equation of (16) with fixed μt\mu_{t} and initial condition ν¯\bar{\nu}. Hence, γt(i)=νt\gamma^{(i)}_{t}=\nu_{t} for each t[0,T]t\in[0,T] and i{1,2}i\in\{1,2\}, which implies Λt(1)=Λt(2)\Lambda^{(1)}_{t}=\Lambda^{(2)}_{t}. Hence, the unique solution to (17) with initial condition ν¯×δμ¯\bar{\nu}\times\delta_{\bar{\mu}} is given by (νt×δμt)tT(\nu_{t}\times\delta_{\mu_{t}})_{t\leq T}. ∎

4 Superposition Principle for stochastic nonlinear Fokker-Planck-Kolmogorov Equations

We make use of the following notation specific to the stochastic case.

For two real-valued n×nn\times n matrices A,BA,B we write AA:B=k,l=1nAklBklB=\sum_{k,l=1}^{n}A_{kl}B_{kl}. We use the same notation for A=(Akl)k,l1A=(A_{kl})_{k,l\geq 1} and B=(Bkl)k,l1B=(B_{kl})_{k,l\geq 1}, if either AA or BB contain only finitely many non-trivial entries.
For the Hilbert space 2\ell^{2} with topology induced by the usual inner product ,2\langle\cdot,\cdot\rangle_{\ell^{2}} and norm ||||2||\cdot||_{\ell^{2}}, we denote the space of continuous 2\ell^{2}-valued functions on [0,T][0,T] by CT2C_{T}\ell^{2}. On 2\ell^{2} and CT2C_{T}\ell^{2}, we unambiguously use the same notation et,pie_{t},p_{i} and πn\pi_{n} as on \mathbb{R}^{\infty} and CTC_{T}\mathbb{R}^{\infty} in the previous section. Reminiscent to the previous section, we set (CT2)=σ(et,t[0,T])\mathcal{B}(C_{T}\ell^{2})=\sigma(e_{t},t\in[0,T]) and denote the set of probability measures on this space by 𝒫(CT2)\mathcal{P}(C_{T}\ell^{2}). For σ\sigma-algebras 𝒜1\mathcal{A}_{1}, 𝒜2\mathcal{A}_{2}, we denote by 𝒜1𝒜2\mathcal{A}_{1}\bigvee\mathcal{A}_{2} the σ\sigma-algebra generated by 𝒜1\mathcal{A}_{1} and 𝒜2\mathcal{A}_{2}.

We call a filtered probability space (Ω,,(t)tT,)(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P}) complete, provided both \mathcal{F} and 0\mathcal{F}_{0} contain all subsets of \mathbb{P}-negligible sets NN\in\mathcal{F} (i.e. (N)=0\mathbb{P}(N)=0). This notion does not require (t)tT(\mathcal{F}_{t})_{t\leq T} to be right-continuous. A real-valued Wiener process W=(Wt)tTW=(W_{t})_{t\leq T} on such a probability space is called an t\mathcal{F}_{t}-Wiener process, if WtW_{t} is t\mathcal{F}_{t}-adapted and WuWtW_{u}-W_{t} is independent of t\mathcal{F}_{t} for each 0tuT0\leq t\leq u\leq T. Pathwise properties of stochastic processes such as continuity are to be understood up to a negligible set with respect to the underlying measure.

As in the previous section, we consider 𝒮𝒫\mathcal{SP} as a compact Polish space with the vague topology. Let d11d_{1}\geq 1 and consider product-measurable coefficients on [0,T]×𝒮𝒫×d[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}

a(t,μ,x)=(aij(t,μ,x))𝕊d+,b(t,μ,x)=(bi(t,μ,x))idd,σ(t,μ,x)=(σij(t,μ,x))i,jdd×d1a(t,\mu,x)=(a_{ij}(t,\mu,x))\in\mathbb{S}^{+}_{d},\,\,b(t,\mu,x)=(b_{i}(t,\mu,x))_{i\leq d}\in\mathbb{R}^{d},\,\sigma(t,\mu,x)=(\sigma_{ij}(t,\mu,x))_{i,j\leq d}\in\mathbb{R}^{d\times d_{1}}

such that σ\sigma is bounded, and let \mathcal{L} be as before, i.e.

t,μφ(x)=bi(t,μ,x)iφ(x)+aij(t,μ,x)ij2φ(x)\mathcal{L}_{t,\mu}\varphi(x)=b_{i}(t,\mu,x)\partial_{i}\varphi(x)+a_{ij}(t,\mu,x)\partial^{2}_{ij}\varphi(x)

for φC2(d)\varphi\in C^{2}(\mathbb{R}^{d}) and (t,μ,x)[0,T]×𝒮𝒫×d(t,\mu,x)\in[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}.

In contrast to the deterministic framework of the previous section, here we consider nonlinear stochastic FPK-equations of type (SNLFPK) on [0,T][0,T], to be understood in distributional sense as follows. With slight abuse of notation, for σd×d1\sigma\in\mathbb{R}^{d\times d_{1}} and xdx\in\mathbb{R}^{d}, we write σx=(i=1dσikxi)kd1\sigma\cdot x=(\sum_{i=1}^{d}\sigma^{ik}x_{i})_{k\leq d_{1}}, which is consistent with the standard inner product notation σx\sigma\cdot x in the case d1=1d_{1}=1.

Definition 4.1.
  1. (i)

    A pair (μ,W)(\mu,W) consisting of an t\mathcal{F}_{t}-adapted vaguely continuous 𝒮𝒫\mathcal{SP}-valued stochastic process μ=(μt)tT\mu=(\mu_{t})_{t\leq T} and an t\mathcal{F}_{t}-adapted, d1d_{1}-dimensional Wiener process W=(Wt)tTW=(W_{t})_{t\leq T} on a complete probability space (Ω,,(t)tT,)(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P}) is a subprobability solution to (SNLFPK), provided

    0Td|bi(t,μt,x)|+|aij(t,μt,x)|+|σik(t,μt,x)|2dμt(x)dt<-a.s.\int_{0}^{T}\int_{\mathbb{R}^{d}}|b_{i}(t,\mu_{t},x)|+|a_{ij}(t,\mu_{t},x)|+|\sigma_{ik}(t,\mu_{t},x)|^{2}d\mu_{t}(x)dt<\infty\quad\mathbb{P}\text{-a.s.} (18)

    for each i,jd,kd1i,j\leq d,k\leq d_{1}, and

    dφ(x)𝑑μt(x)dφ(x)𝑑μ0(x)=0tds,μsφ(x)𝑑μs(x)𝑑s+0tdσ(s,μs,x)φ(x)𝑑μs(x)𝑑Ws\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{t}(x)-\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{0}(x)=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}\varphi(x)d\mu_{s}(x)ds+\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s} (19)

    holds \mathbb{P}-a.s. for each t[0,T]t\in[0,T] and φCc2(d)\varphi\in C^{2}_{c}(\mathbb{R}^{d}).

  2. (ii)

    A probability solution to (SNLFPK) is a pair as above such that μ\mu is a 𝒫\mathcal{P}-valued process (μt)tT(\mu_{t})_{t\leq T} with weakly continuous paths.

Remark 4.2.
  1. (i)

    Since Cc2(d)C^{2}_{c}(\mathbb{R}^{d}) is separable with respect to uniform convergence and since the paths tμt(ω)t\mapsto\mu_{t}(\omega) are vaguely continuous, the exceptional sets in the above definition can be chosen independently of φ\varphi and tt.

  2. (ii)

    The first integral on the right-hand side of (19) is a pathwise (that is, for individual fixed ωΩ\omega\in\Omega) integral with respect to the finite measure μs(ω)ds\mu_{s}(\omega)ds on [0,T]×d[0,T]\times\mathbb{R}^{d}. The second integral is a stochastic integral, which is defined, since the integrand

    (t,ω)dσ(t,μt(ω),x)φ(x)𝑑μt(ω)(x)(t,\omega)\mapsto\int_{\mathbb{R}^{d}}\sigma(t,\mu_{t}(\omega),x)\cdot\nabla\varphi(x)d\mu_{t}(\omega)(x)

    is d1\mathbb{R}^{d_{1}}-valued, bounded, product-measurable and t\mathcal{F}_{t}-adapted (Thm. 3.8 [7]). More precisely,

    0tdσ(s,μs,x)φ(x)𝑑μs(x)𝑑Ws=α=1d10tdσαφdμsdWsα,\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s}=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma^{\alpha}\cdot\nabla\varphi d\mu_{s}dW^{\alpha}_{s},

    where σα=(σiα)id\sigma^{\alpha}=(\sigma^{i\alpha})_{i\leq d} denotes the α\alpha-th column of σ\sigma and the components WαW^{\alpha}, αd1\alpha\leq d_{1}, of WW are real, independent Wiener processes.

By the global integrability assumption (18) and since σ\sigma is bounded, we obtain (in analogy to Remark 3.2) the following conservation of mass, which we use to prove the final assertion of the main result Theorem 4.8.

Lemma 4.3.

Let (μt)tT(\mu_{t})_{t\leq T} be a subprobability solution to (SNLFPK). If μ0𝒫\mu_{0}\in\mathcal{P} \mathbb{P}-a.s., then the paths of tμtt\mapsto\mu_{t} are 𝒫\mathcal{P}-valued \mathbb{P}-a.s. and, hence, in particular weakly continuous.

Proof.

Let (φk)k1Cc2(d)(\varphi_{k})_{k\geq 1}\subseteq C^{2}_{c}(\mathbb{R}^{d}) approximate the constant function 11 as in Remark 3.2. Then, by Itô-isometry, for each t[0,T]t\in[0,T], there exists a subsequence (klt)l1=(kl)l1(k^{t}_{l})_{l\geq 1}=(k_{l})_{l\geq 1} such that

0tdσ(s,μs,x)φkl(x)𝑑μs(x)𝑑Wsl0-a.s.\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi_{k_{l}}(x)d\mu_{s}(x)dW_{s}\underset{l\to\infty}{\longrightarrow}0\,\,\mathbb{P}\text{-a.s.} (20)

Since the stochastic integral is continuous in tt, a classical diagonal argument yields that there exists a subsequence (kl)l1(k_{l})_{l\geq 1} along which (20) holds for all t[0,T]t\in[0,T] on a set of full \mathbb{P}-measure, independent of tt. Let ωΩ\omega^{\prime}\in\Omega be from this set such that also μ0(ω)𝒫\mu_{0}(\omega^{\prime})\in\mathcal{P} and (19) holds for each tt and φ\varphi. Note that the set of all such ω\omega^{\prime} has full \mathbb{P}-measure. Then, similar to the reasoning in Remark 3.2 and by using (20), considering (19) for such ω\omega^{\prime} with φkl\varphi_{k_{l}} in place of φ\varphi for the limit l+l\longrightarrow+\infty, we obtain

μt(ω)(d)=μ0(ω)(d),t[0,T]\mu_{t}(\omega^{\prime})(\mathbb{R}^{d})=\mu_{0}(\omega^{\prime})(\mathbb{R}^{d}),\,\,t\in[0,T]

and hence the result. ∎

Note that the above proof can be adjusted to extend (19) to each φCb2(d)\varphi\in C^{2}_{b}(\mathbb{R}^{d}).

Embedding 𝒮𝒫\mathcal{SP} into 2\ell^{2}

In comparison with the deterministic case, we still consider 𝒮𝒫\mathcal{SP} as a manifold-like space with tangent spaces Tμ𝒮𝒫=L2(d,d;μ)T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu) as before. However, instead of embedding into \mathbb{R}^{\infty} by GG as in the previous section, now we need a global chart

H:𝒮𝒫2H:\mathcal{SP}\to\ell^{2}

in order to handle the stochastic integral term later on. To this end, we replace the set of functions 𝒢={gi,i1}\mathcal{G}=\{g_{i},i\geq 1\} of the deterministic case by

:={hi}i1,hi:=2igigiCb2\mathcal{H}:=\{h_{i}\}_{i\geq 1},\,h_{i}:=2^{-i}\frac{g_{i}}{||g_{i}||_{C^{2}_{b}}} (21)

and consider the map

H:𝒮𝒫2,H:μ(μ(hi))i1.H:\mathcal{SP}\to\ell^{2},\,\,H:\mu\mapsto(\mu(h_{i}))_{i\geq 1}.

The following lemma collects useful properties of \mathcal{H} and HH, which are in the spirit of Lemma 3.3 and 3.4. We point out that we could have used the function class \mathcal{H} instead of 𝒢\mathcal{G} already in Section 3, but we decided to pass from 𝒢\mathcal{G} to \mathcal{H} at this point in order to stress the technical adjustments necessary due to the stochastic case.

Lemma 4.4.
  1. (i)

    The set \mathcal{H} is measure-determining. Further, a process (μt)tT(\mu_{t})_{t\leq T} as in Definition 4.1 is a solution to (SNLFPK) if and only if (19) holds for each hih_{i}\in\mathcal{H} in place of φ\varphi.

  2. (ii)

    HH is a homeomorphism between 𝒮𝒫\mathcal{SP} and its range H(𝒮𝒫)2H(\mathcal{SP})\subseteq\ell^{2}, endowed with the 2\ell^{2}-subspace topology. In particular, H(𝒮𝒫)2H(\mathcal{SP})\subseteq\ell^{2} is compact.

Proof.
  1. (i)

    The first claim is obvious, since 𝒢\mathcal{G} is measure-determining. Concerning the second claim, note that it is clearly sufficient to have (19) for each φCc2(d)\varphi\in C_{c}^{2}(\mathbb{R}^{d}) with φCb21||\varphi||_{C^{2}_{b}}\leq 1. Since the functions giCb21gi||g_{i}||^{-1}_{C^{2}_{b}}g_{i} are dense in the unit ball of Cc2(d)C^{2}_{c}(\mathbb{R}^{d}) with respect to ||||Cb2||\cdot||_{C^{2}_{b}}, it is sufficient to have (19) for each such normalized function. Indeed, if φkkφ\varphi_{k}\underset{k\to\infty}{\longrightarrow}\varphi uniformly up to second-order partial derivatives, then by Itô-isometry

    𝔼[(0tdσ(s,μs,)(φkφ)dμsdWs)2]=𝔼[0t(dσ(s,μs,)(φkφ)dμs)2𝑑s],\mathbb{E}\bigg{[}\bigg{(}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},\cdot)\cdot\nabla(\varphi_{k}-\varphi)d\mu_{s}dW_{s}\bigg{)}^{2}\bigg{]}=\mathbb{E}\bigg{[}\int_{0}^{t}\bigg{(}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},\cdot)\cdot\nabla(\varphi_{k}-\varphi)d\mu_{s}\bigg{)}^{2}ds\bigg{]},

    which converges to 0 as kk\longrightarrow\infty due to the boundedness of σ\sigma. Hence, along a subsequence (kl)l1(k_{l})_{l\geq 1}, we have a.s.

    0tdσ(s,μs,x)φkl(x)𝑑μs(x)𝑑Wsl0tdσ(s,μs,x)φ(x)𝑑μs(x)𝑑Ws.\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi_{k_{l}}(x)d\mu_{s}(x)dW_{s}\underset{l\to\infty}{\longrightarrow}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s}.

    The a.s.-convergence of all other terms in (19) is clear. Therefore, it is sufficient to require (19) for a dense subset of the unit ball of Cc2(d)C^{2}_{c}(\mathbb{R}^{d}). Clearly, this yields at once that it is sufficient to have (19) for each hih_{i}\in\mathcal{H}.

  2. (ii)

    By definition, HH maps into 2\ell^{2}. Since \mathcal{H} is measure-determining, HH is one-to-one, hence bijective onto its range. If μnnμ\mu_{n}\underset{n\to\infty}{\longrightarrow}\mu vaguely in 𝒮𝒫\mathcal{SP}, clearly H(μn)H(\mu_{n}) converges to H(μ)H(\mu) in the product topology. Since for any i1i\geq 1

    supn1|H(μn)i|2i,\underset{n\geq 1}{\text{sup}}|H(\mu_{n})_{i}|\leq 2^{-i},

    the convergence holds in 2\ell^{2} as well, which implies continuity of HH. In particular, H(𝒮𝒫)2H(\mathcal{SP})\subseteq\ell^{2} is compact. Conversely, if H(μn)H(\mu_{n}) converges in 2\ell^{2} to some z=(zi)i1z=(z_{i})_{i\geq 1}, then, by closedness of H(𝒮𝒫)2H(\mathcal{SP})\subseteq\ell^{2}, we have z=H(μ)z=H(\mu) for a unique element μ𝒮𝒫\mu\in\mathcal{SP} and μnnμ\mu_{n}\underset{n\to\infty}{\longrightarrow}\mu vaguely. Indeed, the latter follows as in Lemma 3.4 (i).

For consistency of notation, below we denote the test function class of the manifold-like space 𝒮𝒫\mathcal{SP} by Cb2()\mathcal{F}C^{2}_{b}(\mathcal{H}) to stress that the base functions gig_{i} are now replaced by hih_{i}\in\mathcal{H}. However, the class of test functions remains unchanged, because the transition from gig_{i} to hih_{i} can be incorporated in the choice of ff.

Linearization of (SNLFPK)

As in the deterministic case, also for the stochastic nonlinear equation (SNLFPK) one can consider an associated linear equation for curves in 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}). To the best of our knowledge, such a linearization for stochastic FPK-equations has not yet been considered in the literature. Of course, the basic idea stems from the deterministic case [17] discussed in the previous section. From Itô’s formula one expects this linearized equation to be of second-order.

Let ((μt)tT,W)\big{(}(\mu_{t})_{t\leq T},W\big{)} be a subprobability solution to (SNLFPK) (with underlying measure \mathbb{P}) and choose any F:μf(μ(h1),,μ(hn))F:\mu\mapsto f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)} from Cb2()\mathcal{F}C^{2}_{b}(\mathcal{H}). Again, we abbreviate b(t,μ):=b(t,μ,)b(t,\mu):=b(t,\mu,\cdot) and similarly for aa and σ\sigma. By Itô’s formula, we have \mathbb{P}-a.s.

F(μt)F(μ0)\displaystyle F(\mu_{t})-F(\mu_{0}) =0t𝒮𝒫F(μ),b(s,μ)+a(s,μ)L2(μs)𝑑s\displaystyle=\int_{0}^{t}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu_{s})}ds
+12α=1d10tk,l=1n(klf)(μ(h1),,μ(hn))(dσα(s,μ)hkdμ)(dσα(s,μ)hldμ)ds\displaystyle+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\sum_{k,l=1}^{n}(\partial_{kl}f)(\mu(h_{1}),\dots,\mu(h_{n}))\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{k}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{l}d\mu\bigg{)}ds
+MtF,\displaystyle+M_{t}^{F},

with the martingale MFM^{F} given as

MtF:=α=1d10t[l=1n(lf)(μ(h1),,μ(hn))dσαhldμs]𝑑Wsα.M_{t}^{F}:=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\bigg{[}\sum_{l=1}^{n}(\partial_{l}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\int_{\mathbb{R}^{d}}\sigma^{\alpha}\cdot\nabla h_{l}d\mu_{s}\bigg{]}dW_{s}^{\alpha}.

Since M0F=0M^{F}_{0}=0 \mathbb{P}-a.s., integrating with respect to \mathbb{P} and defining the curve of measures in (𝒮𝒫)\mathbb{P}(\mathcal{SP})

Γt:=μt1,tT\Gamma_{t}:=\mathbb{P}\circ\mu_{t}^{-1},\,\,t\leq T

yields

𝒮𝒫F(μ)𝑑Γt(μ)𝒮𝒫F(μ)𝑑Γ0(μ)=0t𝒮𝒫𝒮𝒫F(μ),b(s,μ)+a(s,μ)L2(μ)𝑑Γs(μ)𝑑s\displaystyle\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}d\Gamma_{s}(\mu)ds
+12α=1d10t𝒮𝒫k,l=1n(klf)(μ(h1),,μ(hn))(dσα(s,μ)hkdμ)(dσα(s,μ)hldμ)dΓs(μ)ds.\displaystyle+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{k}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{l}d\mu\bigg{)}d\Gamma_{s}(\mu)ds. (22)

As for the first-order term, which is interpreted as the pairing of the gradient 𝒮𝒫F\nabla^{\mathcal{SP}}F with the inhomogeneous vector field b+ab+a\nabla in the tangent bundle T𝒮𝒫T\mathcal{SP}, also the second-order term allows for a geometric interpretation: Recall that for a smooth, real function FF on a Riemannian manifold MM with tangent bundle TMTM, the Hessian Hess(F)pHess(F)_{p} at pMp\in M is a bilinear form on TpMT_{p}M with

Hess(F)p(ηp,ξp)=ηpLF(p),ξpTpM,ηp,ξpTpM,Hess(F)_{p}(\eta_{p},\xi_{p})=\big{\langle}\nabla^{L}_{\eta_{p}}\nabla F(p),\xi_{p}\big{\rangle}_{T_{p}M},\,\,\eta_{p},\xi_{p}\in T_{p}M, (23)

where L:TM×TMTM\nabla^{L}:TM\times TM\to TM denotes the Levi-Civita-connection on MM, the unique affine connection compatible with the metric tensor on MM and \nabla denotes the usual gradient on MM. Intuitively, ηpLF(p)\nabla^{L}_{\eta_{p}}\nabla F(p) denotes the change of the vector field F\nabla F in direction ηp\eta_{p} at pp. Recall that we consider 𝒮𝒫\mathcal{SP} as a manifold-like space with gradient 𝒮𝒫\nabla^{\mathcal{SP}} and that hence the reasonable notion of the Levi-Civita connectionL,𝒮𝒫\nabla^{L,\mathcal{SP}} on 𝒮𝒫\mathcal{SP} for σTμ𝒮𝒫=L2(d,d;μ),YT𝒮𝒫\sigma\in T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu),Y\in T\mathcal{SP} at μ\mu is given by

σL,𝒮𝒫Y(μ)=𝒮𝒫Y(μ),σTμ𝒮𝒫,\nabla^{L,\mathcal{SP}}_{\sigma}Y(\mu)=\big{\langle}\nabla^{\mathcal{SP}}Y(\mu),\sigma\big{\rangle}_{T_{\mu}\mathcal{SP}},

whenver 𝒮𝒫Y\nabla^{\mathcal{SP}}Y is defined in T𝒮𝒫T\mathcal{SP}. For the representation of Hess(F)Hess(F) for a test function FCb2()F\in\mathcal{F}C^{2}_{b}(\mathcal{H}), we need to set Y=𝒮𝒫FY=\nabla^{\mathcal{SP}}F. In this case, we can indeed make sense of

(𝒮𝒫)2F:=𝒮𝒫𝒮𝒫F,(\nabla^{\mathcal{SP}})^{2}F:=\nabla^{\mathcal{SP}}\nabla^{\mathcal{SP}}F,

because the gradient

μ𝒮𝒫F(μ)=k=1n(kf)(μ(h1),,μ(hn))hk\mu\mapsto\nabla^{\mathcal{SP}}F(\mu)=\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\nabla h_{k}

is a linear combinations of the ”Cb2()\mathcal{F}C^{2}_{b}(\mathcal{H})-like” functions μkf(μ(h1),,μ(hn))\mu\mapsto\partial_{k}f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}. The linear combination has to be understood in an xx-wise sense with coefficient functions hk\nabla h_{k}, which are independent of the variable of interest μ\mu. Denoting Fk(μ):=(kf)(μ(h1),,μ(hn))F_{k}(\mu):=(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}, we then define

(𝒮𝒫)2F(μ)(x,y):=k=1n(𝒮𝒫Fk(μ))(y)hk(x),(x,y)d×d.(\nabla^{\mathcal{SP}})^{2}F(\mu)(x,y):=\sum_{k=1}^{n}\big{(}\nabla^{\mathcal{SP}}F_{k}(\mu)\big{)}(y)\nabla h_{k}(x),\,\,(x,y)\in\mathbb{R}^{d}\times\mathbb{R}^{d}. (24)

Consequently, we have a reasonable notion of the Levi-Civita connection on 𝒮𝒫\mathcal{SP} at μ\mu for σTμ𝒮𝒫\sigma\in T_{\mu}\mathcal{SP} and 𝒮𝒫F\nabla^{\mathcal{SP}}F for FCb2()F\in\mathcal{F}C^{2}_{b}(\mathcal{H}) as

σL,𝒮𝒫𝒮𝒫F(μ):=(𝒮𝒫)2F(μ),σTμ𝒮𝒫=k,l=1n(klf)(μ(h1),,μ(hn))hk(dσhldμ).\nabla^{L,\mathcal{SP}}_{\sigma}\nabla^{\mathcal{SP}}F(\mu):=\big{\langle}(\nabla^{\mathcal{SP}})^{2}F(\mu),\sigma\big{\rangle}_{T_{\mu}\mathcal{SP}}=\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\nabla h_{k}\bigg{(}\int_{\mathbb{R}^{d}}\sigma\cdot\nabla h_{l}d\mu\bigg{)}. (25)

The section (𝒮𝒫)2F(\nabla^{\mathcal{SP}})^{2}F in T𝒮𝒫T𝒮𝒫T\mathcal{SP}^{*}\otimes T\mathcal{SP}^{*} (and hence σL,𝒮𝒫𝒮𝒫F\nabla^{L,\mathcal{SP}}_{\sigma}\nabla^{\mathcal{SP}}F and Hess(F)Hess(F) below) is independent of the particular representation of FF in (24). Indeed, we have (c.f. Appendix A [17]) for

γμσ(t):=μ(Id+tσ)1\gamma^{\sigma}_{\mu}(t):=\mu\circ(\text{Id}+t\sigma)^{-1}

the following pointwise (in xdx\in\mathbb{R}^{d}) equality for each μ𝒮𝒫,σL2(d,d;μ)\mu\in\mathcal{SP},\sigma\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)

ddt𝒮𝒫F(γμσ(t))\displaystyle\frac{d}{dt}\nabla^{\mathcal{SP}}F\big{(}\gamma^{\sigma}_{\mu}(t)\big{)} =k=1n[ddt(kf)(γμσ(t)(h1),,γμσ(t)(hn))]hk\displaystyle=\sum_{k=1}^{n}\bigg{[}\frac{d}{dt}(\partial_{k}f)\big{(}\gamma^{\sigma}_{\mu}(t)(h_{1}),\dots,\gamma^{\sigma}_{\mu}(t)(h_{n})\big{)}\bigg{]}\nabla h_{k}
=k,l=1n(klf)(μ(h1),,μ(hn))hl,σL2(μ)hk\displaystyle=\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\big{\langle}\nabla h_{l},\sigma\big{\rangle}_{L^{2}(\mu)}\nabla h_{k}
=(𝒮𝒫)2F(μ),σL2(μ).\displaystyle=\big{\langle}(\nabla^{\mathcal{SP}})^{2}F(\mu),\sigma\big{\rangle}_{L^{2}(\mu)}.

Since the gradient 𝒮𝒫F\nabla^{\mathcal{SP}}F is independent of the particular representation of FF and σL2(d,d;μ)\sigma\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu) is arbitrary, also (𝒮𝒫)2F(\nabla^{\mathcal{SP}})^{2}F is independent of the representation of FF.

Considering (23), we then set for FCb2()F\in\mathcal{F}C^{2}_{b}(\mathcal{H}) and σ,σ~L2(d,d;μ)\sigma,\tilde{\sigma}\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)

Hess(F)(μ):(σ,σ~)k,l=1n(klf)(μ(h1),,μ(hn))(dσhldμ)(dσ~hkdμ),Hess(F)(\mu):(\sigma,\tilde{\sigma})\mapsto\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma\cdot\nabla h_{l}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\tilde{\sigma}\cdot\nabla h_{k}d\mu\bigg{)}, (26)

which is a (symmetric) bilinear form on Tμ𝒮𝒫T_{\mu}\mathcal{SP} and rewrite (4) as

𝒮𝒫F𝑑Γt𝒮𝒫F𝑑Γ0=0t𝒮𝒫𝒮𝒫F,bs+asL2+12α=1d1Hess(F)(σsα,σsα)dΓsds\displaystyle\int_{\mathcal{SP}}Fd\Gamma_{t}-\int_{\mathcal{SP}}Fd\Gamma_{0}=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F,b_{s}+a_{s}\nabla\big{\rangle}_{L^{2}}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)(\sigma_{s}^{\alpha},\sigma_{s}^{\alpha})d\Gamma_{s}ds (27)

(with bs:(μ,x)b(s,μ,x)b_{s}:(\mu,x)\mapsto b(s,\mu,x) and similarly for asa_{s} and σs\sigma_{s}). Introducing the second-order operator 𝐋(2)\mathbf{L}^{(2)}, acting on FCb2()F\in\mathcal{F}C^{2}_{b}(\mathcal{H}) via

𝐋t(2)F(μ)=𝒮𝒫F,b(t,μ)+a(t,μ)L2(μ)+12α=1d1Hess(F)(σα(t,μ),σα(t,μ)),\mathbf{L}^{(2)}_{t}F(\mu)=\big{\langle}\nabla^{\mathcal{SP}}F,b(t,\mu)+a(t,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)\big{(}\sigma^{\alpha}(t,\mu),\sigma^{\alpha}(t,\mu)\big{)},

we arrive at the distributional formulation of (𝒮𝒫\mathcal{SP}-FPK)

tΓt=(𝐋t(2))Γt,tT,\partial_{t}\Gamma_{t}=(\mathbf{L}_{t}^{(2)})^{*}\Gamma_{t},\,\,t\leq T,

as in the introduction.

Remark 4.5.

Equation (𝒮𝒫\mathcal{SP}-CE) is the natural analogue to second-order FPK-equations over Euclidean spaces. Indeed, for a stochastic equation on d\mathbb{R}^{d}

dXt=b(t,Xt)dt+σ(t,Xt)dWt,dX_{t}=b(t,X_{t})dt+\sigma(t,X_{t})dW_{t}, (28)

by Itô’s formula, the corresponding linear second-order equation for measures in distributional form is

tμt=(t(2))μt\partial_{t}\mu_{t}=\big{(}\mathcal{L}^{(2)}_{t}\big{)}^{*}\mu_{t}

with

t(2)f=fbt+12σt,Hess(f)σtd,\mathcal{L}^{(2)}_{t}f=\nabla f\cdot b_{t}+\frac{1}{2}\big{\langle}\sigma_{t},Hess(f)\sigma_{t}\big{\rangle}_{\mathbb{R}^{d}},

where Hess(f)Hess(f) denotes the usual Euclidean Hessian matrix of fC2(d)f\in C^{2}(\mathbb{R}^{d}). In this spirit, it seems natural to consider (SNLFPK) as a stochastic equation with state space 𝒮𝒫\mathcal{SP} instead of d\mathbb{R}^{d} as for (28) and (𝒮𝒫\mathcal{SP}-CE) as the corresponding linear Fokker-Planck-type equation on 𝒮𝒫\mathcal{SP}.

By the above derivation, for any subprobability solution process (μt)tT(\mu_{t})_{t\leq T} to (SNLFPK) the curve (Γt)tT(\Gamma_{t})_{t\leq T}, Γt:=μt1\Gamma_{t}:=\mathbb{P}\circ\mu_{t}^{-1} in 𝒫(𝒮𝒫)\mathcal{P}(\mathcal{SP}) solves (𝒮𝒫\mathcal{SP}-CE) in the sense of the following definition.

Definition 4.6.

A weakly continuous curve (Γt)tT𝒫(𝒮𝒫)(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP}) is a solution to (SNLFPK), if the integrability condition

0T𝒮𝒫b(t,μ)L1(d,d;μ)+a(t,μ)L1(d,d2;μ)+σ(t,μ)L2(d,d×d1;μ)2dΓt(μ)dt<\displaystyle\int_{0}^{T}\int_{\mathcal{SP}}||b(t,\mu)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)}+||a(t,\mu)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d^{2}};\mu)}+||\sigma(t,\mu)||^{2}_{L^{2}(\mathbb{R}^{d},\mathbb{R}^{d\times d_{1}};\mu)}d\Gamma_{t}(\mu)dt<\infty (29)

is fulfilled and for each FCb2()F\in\mathcal{F}C^{2}_{b}(\mathcal{H}), 27 holds for each t[0,T]t\in[0,T].

Transferring (SNLFPK) and (𝒮𝒫\mathcal{SP}-FPK) to 2\ell^{2}

Reminiscent to the deterministic case, we use the global chart H:𝒮𝒫2H:\mathcal{SP}\to\ell^{2} to introduce auxiliary equations on 2\ell^{2} and the space of measures on 2\ell^{2}, respectively, as follows. Again, we use the notation

At:={μ𝒮𝒫:d|aij(t,μ,x)|+|bi(t,μ,x)|dμ(x)< 1i,jd},t[0,T].A_{t}:=\bigg{\{}\mu\in\mathcal{SP}:\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu,x)|+|b_{i}(t,\mu,x)|d\mu(x)<\infty\,\,\forall\,1\leq i,j\leq d\bigg{\}},\quad t\in[0,T].

For i,j1i,j\geq 1, αd1\alpha\leq d_{1}, define the measurable coefficients BiB_{i} for (t,μ)(t,\mu) such that μAt\mu\in A_{t}, and Σiα\Sigma^{\alpha}_{i} and AijA_{ij} on [0,T]×𝒮𝒫[0,T]\times\mathcal{SP} by

Bi(t,μ)\displaystyle B_{i}(t,\mu) :=dt,μhi(x)𝑑μ(x),(t,μ)[0,T]×At,\displaystyle:=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu}h_{i}(x)d\mu(x),\quad(t,\mu)\in[0,T]\times A_{t},
Σiα(t,μ)\displaystyle\Sigma^{\alpha}_{i}(t,\mu) :=dσα(t,μ,x)hi(x)𝑑μ(x),\displaystyle:=\int_{\mathbb{R}^{d}}\sigma^{\alpha}(t,\mu,x)\cdot\nabla h_{i}(x)d\mu(x),
Σi(t,μ)\displaystyle\Sigma_{i}(t,\mu) :=(Σiα(t,μ))αd1,\displaystyle:=\big{(}\Sigma^{\alpha}_{i}(t,\mu)\big{)}_{\alpha\leq d_{1}},
Aij(t,μ)\displaystyle A_{ij}(t,\mu) :=Σi,Σjd1(t,μ),\displaystyle:=\big{\langle}\Sigma_{i},\Sigma_{j}\big{\rangle}_{d_{1}}(t,\mu),

and set

B:=(Bi)i1,Σ:=(Σiα)αd1,i1,A:=(Aij)i,j1.B:=(B_{i})_{i\geq 1},\,\Sigma:=(\Sigma^{\alpha}_{i})_{\alpha\leq d_{1},i\geq 1},\,A:=(A_{ij})_{i,j\geq 1}.

Now, transferring to 2\ell^{2}, define B¯,Σ¯\bar{B},\bar{\Sigma} and A¯ij\bar{A}_{ij} on [0,T]×2[0,T]\times\ell^{2} component-wise via

B¯i(t,z):={Bi(t,H1(z)),zH(At)0,else,\displaystyle\bar{B}_{i}(t,z):=\begin{cases}B_{i}(t,H^{-1}(z))&,z\in H(A_{t})\\ 0&,\text{else}\end{cases},

,

Σ¯iα(t,z):={Σiα(t,H1(z)),zH(𝒮𝒫)0,z2\H(𝒮𝒫),\displaystyle\bar{\Sigma}^{\alpha}_{i}(t,z):=\begin{cases}\Sigma_{i}^{\alpha}(t,H^{-1}(z))&,z\in H(\mathcal{SP})\\ 0&,z\in\ell^{2}\backslash H(\mathcal{SP})\end{cases},
Σ¯i(t,z):=(Σ¯iα(t,z))αd1,\bar{\Sigma}_{i}(t,z):=\big{(}\bar{\Sigma}^{\alpha}_{i}(t,z)\big{)}_{\alpha\leq d_{1}},
A¯ij(t,z):=Σ¯i,Σ¯jd1(t,z).\bar{A}_{ij}(t,z):=\big{\langle}\bar{\Sigma}_{i},\bar{\Sigma}_{j}\big{\rangle}_{d_{1}}(t,z).

B¯\bar{B} and Σ¯α\bar{\Sigma}^{\alpha} are 2\ell^{2}-valued, since for z=H(μ)z=H(\mu)

|B¯i(t,z)|d|t,μhi(x)|𝑑μ(x)C2i,|\bar{B}_{i}(t,z)|\leq\int_{\mathbb{R}^{d}}|\mathcal{L}_{t,\mu}h_{i}(x)|d\mu(x)\leq C2^{-i},

where C=C(a,b,d)C=C(a,b,d) is a finite constant independent of t,zt,z and i1i\geq 1. A similar argument is valid for each Σ¯α\bar{\Sigma}^{\alpha}. Each B¯i\bar{B}_{i} and Σ¯iα\bar{\Sigma}^{\alpha}_{i} is product-measurable with respect to the 2\ell^{2}-topology due to the measurability of BB and Σα\Sigma^{\alpha}. Reminiscent to (\mathbb{R}^{\infty}-CE) in the previous section, we associate to (𝒮𝒫\mathcal{SP}-FPK) the FPK-equation on 2\ell^{2}

tΓ¯t=¯(B¯(t,z)Γ¯t)+ij2(A¯ij(t,z)Γ¯t),\partial_{t}\bar{\Gamma}_{t}=-\bar{\nabla}\cdot(\bar{B}(t,z)\bar{\Gamma}_{t})+\partial^{2}_{ij}(\bar{A}_{ij}(t,z)\bar{\Gamma}_{t}), (2\ell^{2}-FPK)

which we understand in the sense of the following definition, with ¯\bar{\nabla} as in (13). Subsequently, we denote by Cb2(2)\mathcal{F}C^{2}_{b}(\ell^{2}) the set of all maps F¯:2\bar{F}:\ell^{2}\to\mathbb{R} of type F¯=fπn\bar{F}=f\circ\pi_{n} for n1n\geq 1 and fCb2(n)f\in C^{2}_{b}(\mathbb{R}^{n}). Also, set

D2F¯ij:={(ij2f)πn,i,jn0, else.D^{2}\bar{F}_{ij}:=\begin{cases}(\partial^{2}_{ij}f)\circ\pi_{n}&,i,j\leq n\\ 0&,\text{ else}.\end{cases}

Consequently, both summands in (31) contain only finitely many non-trivial summands.

Definition 4.7.

A weakly continuous curve (Γ¯t)tT𝒫(2)(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\ell^{2}) is a solution to (2\ell^{2}-FPK), if it fulfills the integrability condition

0T2|B¯i(t,z)|+|A¯ij(t,z)|dΓ¯tdt<,i,j1,\int_{0}^{T}\int_{\ell^{2}}|\bar{B}_{i}(t,z)|+|\bar{A}_{ij}(t,z)|d\bar{\Gamma}_{t}dt<\infty,\quad\forall\,i,j\geq 1, (30)

and for any F¯Cb2(2)\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2}), F¯:=fπn\bar{F}:=f\circ\pi_{n},

2F¯(z)𝑑Γ¯t(z)=2F¯(z)𝑑Γ¯0(z)+0t2¯F¯(z)B¯(s,z)+12D2F¯(z):A¯(s,z)dΓ¯s(z)ds.\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{t}(z)=\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{0}(z)+\int_{0}^{t}\int_{\ell^{2}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)+\frac{1}{2}D^{2}\bar{F}(z):\bar{A}(s,z)d\bar{\Gamma}_{s}(z)ds. (31)

holds for each tTt\leq T.

4.1 Main Result: Stochastic case

The main result of this section is the following superposition principle for solutions to (SNLFPK) and (𝒮𝒫\mathcal{SP}-FPK), which generalizes Theorem 3.7 to stochastically perturbed equations.

Theorem 4.8.

Let σ\sigma be bounded on [0,T]×𝒮𝒫×d[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}. Let (Γt)tT(\Gamma_{t})_{t\leq T} be a weakly continuous solution to (𝒮𝒫\mathcal{SP}-FPK). Then, there exists a complete filtered probability space (Ω,,(t)tT,)(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P}), an adapted d1d_{1}-dimensional Wiener process W=(Wt)tTW=(W_{t})_{t\leq T} and a 𝒮𝒫\mathcal{SP}-valued adapted vaguely continuous process μ=(μt)tT\mu=(\mu_{t})_{t\leq T} such that (μ,W)(\mu,W) solves (SNLFPK) and

μt1=Γt\mathbb{P}\circ\mu_{t}^{-1}=\Gamma_{t}

holds for each t[0,T]t\in[0,T].
Moreover, if Γ0\Gamma_{0} is concentrated on 𝒫\mathcal{P}, i.e. Γ0(𝒫)=1\Gamma_{0}(\mathcal{P})=1, then the paths tμt(ω)t\mapsto\mu_{t}(\omega) are 𝒫\mathcal{P}-valued for \mathbb{P}-a.e. ωΩ\omega\in\Omega and hence even weakly continuous.

As in the proof of 3.7, we proceed in three steps. Since parts of the proof are technically more involved than in the deterministic case, we first present the ingredients of each step and afterwards state the proof of Theorem 4.8 as a corollary.

Step 1: From (𝒮𝒫\mathcal{SP}-FPK) to (𝟐\ell^{2}-FPK):

Lemma 4.9.

For any solution (Γt)tT(\Gamma_{t})_{t\leq T} to (𝒮𝒫\mathcal{SP}-FPK), the curve Γ¯t=ΓtH1\bar{\Gamma}_{t}=\Gamma_{t}\circ H^{-1} is a solution to (2\ell^{2}-FPK).

Proof.

Clearly, tΓ¯tt\mapsto\bar{\Gamma}_{t} is a weakly continuous curve in 𝒫(2)\mathcal{P}(\ell^{2}) due to the continuity of H:𝒮𝒫2H:\mathcal{SP}\to\ell^{2}. (30) holds, since tΓtt\mapsto\Gamma_{t} fulfills (29) and since σ\sigma is bounded. Moreover, we have for s,tTs,t\leq T, F¯=fπnCb2(2)\bar{F}=f\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\ell^{2}) and F:μf(μ(h1),,μ(hn))F:\mu\mapsto f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}

2¯F¯(z)B¯(s,z)+12D2F¯(z):A¯(s,z)dΓ¯s(z)\displaystyle\int_{\ell^{2}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)+\frac{1}{2}D^{2}\bar{F}(z):\bar{A}(s,z)d\bar{\Gamma}_{s}(z)
=𝒮𝒫k=1n(kf)(μ(h1),,μ(hn))Bk(s,μ)+α=1d112k,l=1n(klf)(μ(h1),,μ(hn))Σkα(s,μ)Σlα(s,μ)dΓs(μ)\displaystyle=\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}B_{k}(s,\mu)+\sum_{\alpha=1}^{d_{1}}\frac{1}{2}\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\Sigma_{k}^{\alpha}(s,\mu)\Sigma_{l}^{\alpha}(s,\mu)d\Gamma_{s}(\mu)
=𝒮𝒫𝒮𝒫F(μ),b(s,μ)+a(s,μ)L2(μ)+12α=1d1Hess(F)(σα(s,μ),σα(s,μ))dΓs(μ)\displaystyle=\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)\big{(}\sigma^{\alpha}(s,\mu),\sigma^{\alpha}(s,\mu)\big{)}d\Gamma_{s}(\mu)

and likewise

2F¯(z)𝑑Γ¯t=𝒮𝒫F(μ)𝑑Γt.\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{t}=\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}.

Comparing with (27), the statement follows. ∎

Step 2: From (𝟐\ell^{2}-FPK) to the martingale problem (2\ell^{2}-MGP): We introduce a martingale problem on 2\ell^{2}, which is related to (2\ell^{2}-FPK) in the sense of Remark 4.11 below and is, roughly speaking, the stochastic analogue to (\mathbb{R}^{\infty}-ODE) from the previous section. Recall the notation ete_{t} for the projection et:CT22e_{t}:C_{T}\ell^{2}\to\ell^{2}, et:γγte_{t}:\gamma\mapsto\gamma_{t} for tTt\leq T.

Definition 4.10.

A measure Q¯𝒫(CT2)\bar{Q}\in\mathcal{P}(C_{T}\ell^{2}) is a solution to the 2\ell^{2}-martingale problem (2\ell^{2}-MGP), provided

CT20T|B¯i(t,et)|+|A¯ij(t,et)|dtdQ¯<,i,j1,\int_{C_{T}\ell^{2}}\int_{0}^{T}|\bar{B}_{i}(t,e_{t})|+|\bar{A}_{ij}(t,e_{t})|dtd\bar{Q}<\infty,\quad i,j\geq 1, (32)

and

F¯et0t¯F¯esB¯(s,es)+12D2F¯es:A¯(s,es)ds\bar{F}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}\circ e_{s}:\bar{A}(s,e_{s})ds (33)

is a Q¯\bar{Q}-martingale on CT2C_{T}\ell^{2} with respect to the natural filtration on CT2C_{T}\ell^{2} for any F¯Cb2(2)\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2}).

Remark 4.11.

By construction, any such solution Q¯\bar{Q} induces a weakly continuous solution (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} to (2\ell^{2}-FPK) via Γ¯t:=Q¯et1\bar{\Gamma}_{t}:=\bar{Q}\circ e_{t}^{-1}. Indeed, this is readily seen by integrating (33) with respect to Q¯\bar{Q} and Fubini’s theorem.

In view of Proposition 4.13 below, we extend the coefficients B¯i,Σ¯iα\bar{B}_{i},\bar{\Sigma}^{\alpha}_{i} (and hence also A¯ij\bar{A}_{ij}) to \mathbb{R}^{\infty} via

B¯i:=0=:Σ¯iα on [0,T]×\2.\bar{B}_{i}:=0=:\bar{\Sigma}^{\alpha}_{i}\text{ on }[0,T]\times\mathbb{R}^{\infty}\backslash\ell^{2}.

We still use the notation B¯\bar{B}, Σ¯α\bar{\Sigma}^{\alpha} and A¯\bar{A} and note that they are ([0,T])()/()\mathcal{B}([0,T])\otimes\mathcal{B}(\mathbb{R}^{\infty})/\mathcal{B}(\mathbb{R}^{\infty})-measurable due to Remark 4.12 below. Due to the same remark, we may regard any solution (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} to (2\ell^{2}-FPK) as a solution to a FPK-equation on \mathbb{R}^{\infty} by considering (2\ell^{2}-FPK) with the extended coefficients and test functions F¯Cb2(2)\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2}) extended to \mathbb{R}^{\infty} by considering πn\pi_{n} on \mathbb{R}^{\infty} instead of 2\ell^{2}. Similarly, the formulation of the martingale problem (2\ell^{2}-MGP) as in Definition 4.10 extends to \mathbb{R}^{\infty} in the sense that a measure Q¯𝒫(CT)\bar{Q}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty}) is understood as a solution, provided the process (33) is a Q¯\bar{Q}-martingale on CTC_{T}\mathbb{R}^{\infty} with respect to the natural filtration for each F¯=fπn:\bar{F}=f\circ\pi_{n}:\mathbb{R}^{\infty}\to\mathbb{R} as above.

Remark 4.12.

We recall that 2()\ell^{2}\in\mathcal{B}(\mathbb{R}^{\infty}) and (2)=()2\mathcal{B}(\ell^{2})=\mathcal{B}(\mathbb{R}^{\infty})_{\upharpoonright\ell^{2}}. In particular, any probability measure Γ¯𝒫(2)\bar{\Gamma}\in\mathcal{P}(\ell^{2}) uniquely extends to a Borel probability measure on \mathbb{R}^{\infty} via Γ¯(A):=Γ¯(A2)\bar{\Gamma}(A):=\bar{\Gamma}(A\cap\ell^{2}), A()A\in\mathcal{B}(\mathbb{R}^{\infty}).

We shall need the following superposition principle from [20], which lifts a solution to a FPK-equation on \mathbb{R}^{\infty} to a solution of the associated martingale problem. Note that in [20], the author assumes an integrability condition of order p>1p>1 instead of p=1p=1 as in (30) in order to essentially reduce the proof to the corresponding finite-dimensional result, see [20, Thm.2.14], which requires such a higher order integrability. However, since the latter result was extended to the case of an L1L^{1}-integrability condition by the same author [21, Thm.2.5], it is easy to see that also the infinite-dimensional result [20, Thm.7.1] holds for solutions with L1L^{1}-integrability as in (30).

Proposition 4.13.

[Superposition principle on \mathbb{R}^{\infty}, Thm.7.1. [20] For any weakly continuous solution (Γ¯t)tT𝒫()(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\mathbb{R}^{\infty}) to the \mathbb{R}^{\infty}-extended version of (2\ell^{2}-FPK), there exists Q¯𝒫(CT)\bar{Q}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty}), which solves the \mathbb{R}^{\infty}-extended version of (2\ell^{2}-MGP) such that Q¯et1=Γ¯t\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t} for each t[0,T]t\in[0,T].

Moreover, we have the following consequence for the solutions we are interested in. Note that paths tztH(𝒮𝒫)t\mapsto z_{t}\in H(\mathcal{SP}) are continuous with respect to the product topology if and only if they are 2\ell^{2}-continuous. Hence, we may use the notation CTH(𝒮𝒫)C_{T}H(\mathcal{SP}) unambiguously and consider it as a subset of either CTC_{T}\mathbb{R}^{\infty} or Ct2C_{t}\ell^{2}. Since H(𝒮𝒫)2H(\mathcal{SP})\subseteq\ell^{2} is closed even with respect to the product topology, CTH(𝒮𝒫)C_{T}H(\mathcal{SP}) belongs to (CT2)\mathcal{B}(C_{T}\ell^{2}) and (CT)\mathcal{B}(C_{T}\mathbb{R}^{\infty}).

Lemma 4.14.

If in the situation of the previous proposition each Γ¯t\bar{\Gamma}_{t} is concentrated on the Borel set H(𝒮𝒫)H(\mathcal{SP})\subseteq\mathbb{R}^{\infty}, then Q¯\bar{Q} is concentrated on continuous curves in H(𝒮𝒫)H(\mathcal{SP}). In particular, in this case Q¯\bar{Q} may be regarded as an element of 𝒫(CT2)\mathcal{P}(C_{T}\ell^{2}) and a solution to the martingale problem (2\ell^{2}-MGP) as in Definition 4.10.

Proof.

The closedness of H(𝒮𝒫)H(\mathcal{SP})\subseteq\mathbb{R}^{\infty} yields

Q¯(CTH(𝒮𝒫))=Q¯(q[0,T]{eqH(𝒮𝒫)})=1,\bar{Q}(C_{T}H(\mathcal{SP}))=\bar{Q}\bigg{(}\underset{q\in[0,T]\cap\mathbb{Q}}{\bigcap}\{e_{q}\in H(\mathcal{SP})\}\bigg{)}=1,

due to Q¯et1=Γ¯t\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t} for each tTt\leq T. By the observation above this lemma, it follows

(CT2)CTH(𝒮𝒫)(CT)CTH(𝒮𝒫)(CT)\mathcal{B}(C_{T}\ell^{2})_{\upharpoonright C_{T}H(\mathcal{SP})}\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty})_{\upharpoonright C_{T}H(\mathcal{SP})}\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty})

and we can therefore consider Q¯\bar{Q} as a probability measure on (CT2)\mathcal{B}(C_{T}\ell^{2}) via

Q¯(A):=Q¯(ACTH(𝒮𝒫)),A(CT2)\bar{Q}(A):=\bar{Q}\big{(}A\cap C_{T}H(\mathcal{SP})\big{)},\,\,A\in\mathcal{B}(C_{T}\ell^{2})

with mass on CTH(𝒮𝒫)C_{T}H(\mathcal{SP}). It is clear that this measure fulfills Definition 4.10. ∎

Hence, subsequently we may regard to Q¯\bar{Q} as in Proposition 4.13 as a solution to (2\ell^{2}-MGP) on either \mathbb{R}^{\infty} or 2\ell^{2} without differing the notation. Recall the notation pi:2p_{i}:\ell^{2}\to\mathbb{R}, pi(z)=zi.p_{i}(z)=z_{i}.

Lemma 4.15.

Let Q¯\bar{Q} be a solution to the martingale problem (2\ell^{2}-MGP) on 2\ell^{2}. Then, for any i1i\geq 1, the process

Mi(t):=pietpie00tB¯i(s,es)𝑑sM_{i}(t):=p_{i}\circ e_{t}-p_{i}\circ e_{0}-\int_{0}^{t}\bar{B}_{i}(s,e_{s})ds (34)

is a real-valued, continuous Q¯\bar{Q}-martingale on CT2C_{T}\ell^{2} with respect to the canonical filtration. The covariation Mi,Mj\langle\langle M_{i},M_{j}\rangle\rangle of MiM_{i} and MjM_{j} is Q¯\bar{Q}-a.s. given by

Mi,Mjt=0tA¯ij(s,es)𝑑s,t[0,T].\langle\langle M_{i},M_{j}\rangle\rangle_{t}=\int_{0}^{t}\bar{A}_{ij}(s,e_{s})ds,\,\,t\in[0,T]. (35)
Proof.

For i,j1i,j\geq 1, let nmax(i,j)n\geq\text{max}(i,j), consider pin:np^{n}_{i}:\mathbb{R}^{n}\to\mathbb{R}, pin(x)=xip^{n}_{i}(x)=x_{i} and let

F¯in:2,F¯in(z)=pinπn(z).\bar{F}^{n}_{i}:\ell^{2}\to\mathbb{R},\,\,\bar{F}^{n}_{i}(z)=p^{n}_{i}\circ\pi_{n}(z).

Note that F¯in=pi\bar{F}^{n}_{i}=p_{i} on 2\ell^{2}, independent of nmax(i,j)n\geq\text{max}(i,j). For k1k\geq 1, introduce the stopping time σk:=inf{t[0,T]:et2k}\sigma_{k}:=\text{inf}\{t\in[0,T]:||e_{t}||_{\ell^{2}}\geq k\} with respect to the canonical filtration on CT2C_{T}\ell^{2}. Clearly, σk+\sigma_{k}\nearrow+\infty pointwise. Consider ηkCc2(n)\eta_{k}\in C^{2}_{c}(\mathbb{R}^{n}) such that ηk(x)=1\eta_{k}(x)=1 for |x|k+1|x|\leq k+1.
Since kpin=δki\partial_{k}p^{n}_{i}=\delta_{ki} and klpin=0\partial_{kl}p^{n}_{i}=0 for k,lnk,l\leq n, we have

Mi(t)=F¯inet0t¯F¯inesB¯(s,es)+12D2F¯ines:A¯(s,es)dsM_{i}(t)=\bar{F}^{n}_{i}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}^{n}_{i}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}^{n}_{i}\circ e_{s}:\bar{A}(s,e_{s})ds

and, setting F¯in,k:=(ηkpin)πnCb2(2)\bar{F}^{n,k}_{i}:=(\eta_{k}p^{n}_{i})\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\ell^{2}),

Mi(σkt)=F¯in,ketσk0tσk¯F¯in,kesB¯(s,es)+12D2F¯in,kes:A¯(s,es)ds.M_{i}(\sigma_{k}\wedge t)=\bar{F}^{n,k}_{i}\circ e_{t\wedge\sigma_{k}}-\int_{0}^{t\wedge\sigma_{k}}\bar{\nabla}\bar{F}^{n,k}_{i}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}^{n,k}_{i}\circ e_{s}:\bar{A}(s,e_{s})ds.

Since the latter is a continuous Q¯\bar{Q}-martingale for each k1k\geq 1, it follows that MiM_{i} is a continuous local Q¯\bar{Q}-martingale. Concerning (35), it suffices to prove that for any F¯Cb2(2)\bar{F}\in\mathcal{F}C^{2}_{b}(\mathcal{\ell}^{2}), F¯=fπn\bar{F}=f\circ\pi_{n}, we have

MF¯t=0t¯F¯(es),A¯(s,es)¯F¯(es)ds2𝑑s,\langle\langle M^{\bar{F}}\rangle\rangle_{t}=\int_{0}^{t}\big{\langle}\bar{\nabla}\bar{F}(e_{s}),\bar{A}(s,e_{s})\bar{\nabla}\bar{F}(e_{s})ds\big{\rangle}_{\ell^{2}}ds, (36)

with

MtF¯:=F¯et0t¯F¯(es)B¯(s,es)+12D2F¯(es):A¯(s,es)ds.M^{\bar{F}}_{t}:=\bar{F}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}(e_{s})\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}(e_{s}):\bar{A}(s,e_{s})ds.

Indeed, from here (35) follows by considering (36) for F¯in,k\bar{F}^{n,k}_{i}, localization of the local martingale MiM_{i} and polarization for the quadratic (co-)variation. Concerning (36), it is standard (cf. [19, p.73,74]) to use Itô’s product rule to obtain that

t(MtF¯)20t𝐋¯s(2)F¯2(es)2F¯(es)𝐋¯s(2)F¯(es)dst\mapsto(M^{\bar{F}}_{t})^{2}-\int_{0}^{t}\bar{\mathbf{L}}_{s}^{(2)}\bar{F}^{2}(e_{s})-2\bar{F}(e_{s})\bar{\mathbf{L}}_{s}^{(2)}\bar{F}(e_{s})ds

is a continuous Q¯\bar{Q}-martingale on CT2C_{T}\ell^{2}, where we denote by 𝐋¯t(2)F¯(es)\bar{\mathbf{L}}_{t}^{(2)}\bar{F}(e_{s}) the integrand of the integral term in the definition of MF¯M^{\bar{F}}. A straightforward calculation yields

0t𝐋¯s(2)F¯2(es)2F¯(es)𝐋¯s(2)F¯(es)ds=0t¯F¯(es),A¯(s,es)¯F¯(es)2𝑑s,\int_{0}^{t}\bar{\mathbf{L}}_{s}^{(2)}\bar{F}^{2}(e_{s})-2\bar{F}(e_{s})\bar{\mathbf{L}}_{s}^{(2)}\bar{F}(e_{s})ds=\int_{0}^{t}\big{\langle}\bar{\nabla}\bar{F}(e_{s}),\bar{A}(s,e_{s})\bar{\nabla}\bar{F}(e_{s})\big{\rangle}_{\ell^{2}}ds,

which completes the proof. ∎

We summarize the results of this step in the following proposition.

Proposition 4.16.

Let (Γ¯t)tT(\bar{\Gamma}_{t})_{t\leq T} be a weakly continuous solution to (2\ell^{2}-FPK) such that Γ¯t(H(𝒮𝒫))=1\bar{\Gamma}_{t}(H(\mathcal{SP}))=1 for each t[0,T]t\in[0,T]. Then, there exists a solution Q¯𝒫(CT2)\bar{Q}\in\mathcal{P}(C_{T}\ell^{2}) to the martingale problem (2\ell^{2}-MGP) such that Q¯\bar{Q} is concentrated on CTH(𝒮𝒫)C_{T}H(\mathcal{SP}) with Q¯et1=Γ¯t\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t} for each t[0,T]t\in[0,T]. Further, the results of Lemma 4.15 apply to Q¯\bar{Q}.

Step 3: From (2\ell^{2}-MGP) to (SNLFPK): For a given solution Q¯𝒫(CT2)\bar{Q}\in\mathcal{P}(C_{T}\ell^{2}) to (2\ell^{2}-MGP), set

𝒞:=(CT2)𝒩Q¯\mathcal{C}:=\mathcal{B}(C_{T}\ell^{2})\bigvee\mathcal{N}_{\bar{Q}}

and

𝒞t:=σ(es,st)𝒩Q¯\mathcal{C}_{t}:=\sigma(e_{s},s\leq t)\bigvee\mathcal{N}_{\bar{Q}}

for tTt\leq T, where 𝒩Q¯\mathcal{N}_{\bar{Q}} denotes the collection of all subsets of sets N(CT2)N\in\mathcal{B}(C_{T}\ell^{2}) with Q¯(N)=0\bar{Q}(N)=0. Of course, 𝒞\mathcal{C} and 𝒞t\mathcal{C}_{t} depend on Q¯\bar{Q}, but we suppress this dependence in the notation. Without further mentioning, we understand such Q¯\bar{Q} as extended to 𝒞\mathcal{C} in the canonical way. Then, (CT2,𝒞,(𝒞t)tT,Q¯)(C_{T}\ell^{2},\mathcal{C},(\mathcal{C}_{t})_{t\leq T},\bar{Q}) is a complete filtered probability space. Clearly, (t,γ)Σ¯(t,et(γ))(t,\gamma)\mapsto\bar{\Sigma}(t,e_{t}(\gamma)) is 𝒞t\mathcal{C}_{t}-progressively measurable from [0,T]×CT2[0,T]\times C_{T}\ell^{2} to L(d1,2)L(\mathbb{R}^{d_{1}},\ell^{2}), the space of bounded linear operators from d1\mathbb{R}^{d_{1}} to 2\ell^{2}.

Remark 4.17.

We extend (CT2,𝒞,(𝒞t)tT,Q¯)(C_{T}\ell^{2},\mathcal{C},(\mathcal{C}_{t})_{t\leq T},\bar{Q}) as follows. Let (Ω,′′,(t′′)tT,P)(\Omega^{\prime},\mathcal{F}^{\prime\prime},(\mathcal{F}^{\prime\prime}_{t})_{t\leq T},P) be a complete filtered probability space with a real-valued t′′\mathcal{F}^{\prime\prime}_{t}-Wiener process β\beta on it, define

Ω:=CT2l1Ω,:=𝒞l1′′,t:=𝒞tl1t′′,:=Q¯l1P,\Omega:=C_{T}\ell^{2}\otimes\underset{l\geq 1}{\bigotimes}\Omega^{\prime},\,\,\mathcal{F}^{\prime}:=\mathcal{C}\otimes\underset{l\geq 1}{\bigotimes}\mathcal{F}^{\prime\prime},\,\,\mathcal{F}_{t}^{\prime}:=\mathcal{C}_{t}\otimes\underset{l\geq 1}{\bigotimes}\mathcal{F}^{\prime\prime}_{t},\,\,\mathbb{P}^{\prime}:=\bar{Q}\otimes\underset{l\geq 1}{\bigotimes}P,

let \mathcal{F} and t\mathcal{F}_{t} be the \mathbb{P}^{\prime}-completion of \mathcal{F}^{\prime} and t\mathcal{F}_{t}^{\prime}, respectively, and denote the canonical extension of \mathbb{P}^{\prime} to \mathcal{F} by \mathbb{P}. Further, we denote the Wiener process β\beta on the ii-th copy of Ω\Omega^{\prime} by βi\beta_{i} and extend each βi\beta_{i} to Ω\Omega by βi(ω):=βi(ωi)\beta_{i}(\omega):=\beta_{i}(\omega_{i}) for ω=γ×(ωi)i1Ω\omega=\gamma\times(\omega_{i})_{i\geq 1}\in\Omega. Similarly, we extend each projection ete_{t} from CT2C_{T}\ell^{2} to Ω\Omega via et(ω):=et(γ)e_{t}(\omega):=e_{t}(\gamma) for ω\omega as above, but keep the same notation for this extended process. Obviously, (et)tt(e_{t})_{t\leq t} is a continuous, t\mathcal{F}_{t}-adapted process on Ω\Omega and each βi\beta_{i} is an t\mathcal{F}_{t}-Wiener process on Ω\Omega under \mathbb{P}. Moreover, (et)tT(e_{t})_{t\leq T} and (βi)i1(\beta_{i})_{i\geq 1} are independent on Ω\Omega with respect to \mathbb{P} by construction. Further, it is clear that the process MiM_{i} as in (34) is a \mathbb{P}-martingale with respect to t\mathcal{F}_{t} for each i1i\geq 1 with covariation as in (35) and that (t,γ)Σ¯(t,et(γ))L(d1,2)(t,\gamma)\mapsto\bar{\Sigma}(t,e_{t}(\gamma))\in L(\mathbb{R}^{d_{1}},\ell^{2}) is t\mathcal{F}_{t}-progressively measurable on [0,T]×Ω[0,T]\times\Omega.

Finally, we need the following result, which is a special case of Theorem 2, [16].

Proposition 4.18.

Let Q¯𝒫(CT2)\bar{Q}\in\mathcal{P}(C_{T}\ell^{2}) be a solution to the martingale problem (2\ell^{2}-MGP). Then, there exists a complete filtered probability space with an adapted d1d_{1}-dimensional Wiener process W=(Wα)αd1W=(W^{\alpha})_{\alpha\leq d_{1}} and an 2\ell^{2}-valued adapted continuous process X=(Xt)tTX=(X_{t})_{t\leq T} such that the law of XX on CT2C_{T}\ell^{2} is Q¯\bar{Q} and for i1i\geq 1 and t[0,T]t\in[0,T], we have a.s.

piXtpiX00tB¯i(s,Xs)𝑑s=α=1d10tΣ¯iα(s,Xs)𝑑Wsαp_{i}\circ X_{t}-p_{i}\circ X_{0}-\int_{0}^{t}\bar{B}_{i}(s,X_{s})ds=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\bar{\Sigma}^{\alpha}_{i}(s,X_{s})dW_{s}^{\alpha} (37)

and the exceptional set can be chosen independent of tt and ii.

To see this, consider Theorem 2 of [16] with X=2X=\ell^{2}, U0=d1U_{0}=\mathbb{R}^{d_{1}}, D={pi,i1}D=\{p_{i},i\geq 1\}, the processes M(pi)M(p_{i}) given by MiM_{i} as in (34) on the probability space Ω\Omega of Remark 4.17 and

gs=Σ¯(s,es)L(d1,2).g_{s}=\bar{\Sigma}(s,e_{s})\in L(\mathbb{R}^{d_{1}},\ell^{2}).

These choices fulfill all requirements of [16]. In this case, the 2\ell^{2}-valued process XX is given by Xt=etX_{t}=e_{t} on Ω\Omega. Since all terms in (37) are continuous in tt, the exceptional set may indeed by chosen independently of t[0,T]t\in[0,T] and i1i\geq 1.

The proof of Theorem 4.8 now follows from the above three step-scheme as follows.

Proof of Theorem 4.8: Let (Γt)tT𝒫(𝒮𝒫)(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP}) be a weakly continuous solution to (𝒮𝒫\mathcal{SP}-FPK). By Lemma 4.9 of Step 1, the weakly continuous curve of Borel probability measures on 2\ell^{2}

Γ¯t:=ΓtH1,t[0,T]\bar{\Gamma}_{t}:=\Gamma_{t}\circ H^{-1},\,\,t\in[0,T]

solves (2\ell^{2}-FPK) and each Γ¯t\bar{\Gamma}_{t} is concentrated on H(𝒮𝒫)H(\mathcal{SP}). By Proposition 4.16 of Step 2, there exists a solution Q¯𝒫(CT2)\bar{Q}\in\mathcal{P}(C_{T}\ell^{2}) to the martingale problem (2\ell^{2}-MGP), which is concentrated on CTH(𝒮𝒫)C_{T}H(\mathcal{SP}) such that

Q¯et1=Γ¯t,t[0,T].\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t},\,\,t\in[0,T].

Further, Lemma 4.15 applies to Q¯\bar{Q}. By Lemma 4.15 and Proposition 4.18 of Step 3, there is a d1d_{1}-dimensional t\mathcal{F}_{t}-adapted Wiener process W=(Wα)αd1W=(W^{\alpha})_{\alpha\leq d_{1}} and an t\mathcal{F}_{t}-adapted process XX on some complete filtered probability space (Ω,,(t)tT,)(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P}), which fulfill (37) and XCTH(𝒮𝒫)X\in C_{T}H(\mathcal{SP}) \mathbb{P}-a.s. such that Q¯\bar{Q} is the law of XX.
Possibly redefining XX on a \mathbb{P}-negligible set (which preserves (37) and its adaptedness, the latter due to the completeness of the underlying filtered probability space), we may assume Xt(ω)=H(μt(ω))X_{t}(\omega)=H(\mu_{t}(\omega)) for some μt(ω)𝒮𝒫\mu_{t}(\omega)\in\mathcal{SP} for each (t,ω)[0,T]×Ω(t,\omega)\in[0,T]\times\Omega. The continuity of H1:H(𝒮𝒫)𝒮𝒫H^{-1}:H(\mathcal{SP})\to\mathcal{SP} and tXt(ω)t\mapsto X_{t}(\omega) implies vague continuity of

tμt(ω)=H1Xt(ω)t\mapsto\mu_{t}(\omega)=H^{-1}\circ X_{t}(\omega) (38)

for each ωΩ\omega\in\Omega and t\mathcal{F}_{t}-adaptedness of the 𝒮𝒫\mathcal{SP}-valued process (μt)tT(\mu_{t})_{t\leq T}. Considering (37), Xt=H(μt)X_{t}=H(\mu_{t}) and the definition of B¯\bar{B} and Σ¯iα\bar{\Sigma}^{\alpha}_{i}, we obtain, recalling pi(H(ν))=ν(hi)p_{i}(H(\nu))=\nu(h_{i}) for each ν𝒮𝒫\nu\in\mathcal{SP},

μt(hi)μ0(hi)0tBi(s,μs)𝑑s=α=1d10tΣiα(s,μs)𝑑Wsα,tT\mu_{t}(h_{i})-\mu_{0}(h_{i})-\int_{0}^{t}B_{i}(s,\mu_{s})ds=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\Sigma^{\alpha}_{i}(s,\mu_{s})dW^{\alpha}_{s},\,\,t\leq T

\mathbb{P}-a.s. for each i1i\geq 1. From here, it follows by Lemma 4.4 (i) that (μt)tT(\mu_{t})_{t\leq T} is a solution to (SNLFPK) as in Definition 4.1. Further,

μt1=(Xt1)(H1)1=Γ¯t(H1)1=ΓtH1(H1)1=Γt.\mathbb{P}\circ\mu_{t}^{-1}=(\mathbb{P}\circ X_{t}^{-1})\circ(H^{-1})^{-1}=\bar{\Gamma}_{t}\circ(H^{-1})^{-1}=\Gamma_{t}\circ H^{-1}\circ(H^{-1})^{-1}=\Gamma_{t}.

It remains to prove the final assertion of the theorem. To this end, note that Γ0(𝒫)=1\Gamma_{0}(\mathcal{P})=1 implies Γ¯0(H(𝒫))=1\bar{\Gamma}_{0}(H(\mathcal{P}))=1 and hence μ0𝒫\mu_{0}\in\mathcal{P} \mathbb{P}-a.s. with μ0\mu_{0} as in (38). From here, the assertion follows by Lemma 4.3. ∎

Remark 4.19.

The particular type of noise we consider for (SNLFPK) was partially motivated by [8], where the natural connection of equations of type (SNLFPK) to interacting particle systems with common noise was investigated. Other types of noise terms may be treated in the future, including a possible extension to infinite-dimensional ones. In particular, Proposition 4.18 via [16, Thm.2] in the final step of the proof seems capable of such extensions, since the latter is an infinite-dimensional result.

References

  • [1] L. Ambrosio. Transport Equation and Cauchy Problem for Non-Smooth Vector Fields, pages 1–41. Springer Berlin Heidelberg, Berlin, Heidelberg, 2008.
  • [2] L. Ambrosio and D. Trevisan. Well-posedness of Lagrangian flows and continuity equations in metric measure spaces. Anal. PDE, 7(5):1179–1234, 2014.
  • [3] V. Barbu and M. Röckner. Probabilistic Representation for Solutions to Nonlinear Fokker–Planck Equations. SIAM Journal on Mathematical Analysis, 50(4):4246–4260, 2018.
  • [4] V. Barbu and M. Röckner. From nonlinear Fokker–Planck equations to solutions of distribution dependent SDE. Ann. Probab., 48(4):1902–1920, 07 2020.
  • [5] V.I. Bogachev, N.V. Krylov, M. Röckner, and S.V. Shaposhnikov. Fokker-Planck-Kolmogorov Equations. Mathematical Surveys and Monographs 207. American Mathematical Society, 2015.
  • [6] V.I. Bogachev, M. Röckner, and S.V. Shaposhnikov. On the Ambrosio–Figalli–Trevisan Superposition Principle for Probability Solutions to Fokker–Planck–Kolmogorov Equations. Journal of Dynamics and Differential Equations, 2020.
  • [7] K. L. Chung and R. J. Williams. Introduction to stochastic integration. Modern Birkhäuser classics. Birkhäuser/Springer, New York, second edition, 2014.
  • [8] M. Coghi and B. Gess. Stochastic nonlinear Fokker-Planck equations. arXiv:1904.07894, 2019.
  • [9] A. Figalli. Existence and uniqueness of martingale solutions for SDEs with rough or degenerate coefficients. Journal of Functional Analysis, 254(1):109 – 153, 2008.
  • [10] T. Funaki. A certain class of diffusion processes associated with nonlinear parabolic equations. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, 67(3):331–348, 1984.
  • [11] X. Huang, M. Röckner, and F.-Y. Wang. Nonlinear Fokker–Planck equations for probability measures on path space and path-distribution dependent SDEs. Discrete & Continuous Dynamical Systems - A, 39(6):3017–3035, 2019.
  • [12] X. Huang and F.-Y. Wang. Distribution dependent SDEs with singular coefficients. Stochastic Processes and their Applications, 129(11):4747 – 4770, 2019.
  • [13] O. Kallenberg. Random Measures, Theory and Applications. Springer International Publishing, 2017.
  • [14] T. Kurtz. Equivalence of Stochastic Equations and Martingale Problems, pages 113–130. Springer Berlin Heidelberg, Berlin, Heidelberg, 2011.
  • [15] H. P. McKean. A class of Markov processes associated with nonlinear parabolic equations. Proceedings of the National Academy of Sciences, 56(6):1907–1911, 1966.
  • [16] M. Ondrejat. Brownian Representations of Cylindrical Local Martingales, Martingale Problem and Strong Markov Property of Weak Solutions of SPDEs in Banach Spaces. Czechoslovak Mathematical Journal, 55(4):1003–1039, 2005.
  • [17] P. Ren, M. Röckner, and F.-Y. Wang. Linearization of Nonlinear Fokker-Planck Equations and Applications. arXiv:1904.06795, 2020.
  • [18] M. Scheutzow. Uniqueness and non-uniqueness of solutions of Vlasov-McKean equations. Journal of the Australian Mathematical Society. Series A. Pure Mathematics and Statistics, 43(2):246–256, 1987.
  • [19] D. W. Stroock. Lectures on Stochastic Analysis: Diffusion Theory. London Mathematical Society Student Texts. Cambridge University Press, 1987.
  • [20] D. Trevisan. Well-posedness of Diffusion Processes in Metric Measure Spaces. PhD-thesis, Scuola Normale Superiori Pisa, 2014.
  • [21] D. Trevisan. Well-posedness of multidimensional diffusion processes with weakly differentiable coefficients. Electron. J. Probab., 21:41 pp., 2016.

Marco Rehmeier Faculty of Mathematics, Bielefeld University, Universitätsstraße 25, 33615 Bielefeld, Germany
E-mail address: [email protected]