Linearization and a superposition principle for deterministic and stochastic nonlinear Fokker-Planck-Kolmogorov equations

Marco Rehmeier¹¹1Faculty of Mathematics, Bielefeld University, 33615 Bielefeld, Germany. E-Mail: [email protected]

Abstract

We prove a superposition principle for nonlinear Fokker-Planck-Kolmogorov equations on Euclidean spaces and their corresponding linearized first-order continuity equation over the space of Borel (sub-)probability measures. As a consequence, we obtain equivalence of existence and uniqueness results for these equations. Moreover, we prove an analogous result for stochastically perturbed Fokker-Planck-Kolmogorov equations. To do so, we particularly show that such stochastic equations for measures are, similarly to the deterministic case, intrinsically related to linearized second-order equations on the space of Borel (sub-)probability measures.

Keywords: Nonlinear Fokker-Planck equation, McKean-Vlasov stochastic differential equation, diffusion process, superposition principle

2010 MSC: 60J60, 58J65

1 Introduction

In this work we are concerned with nonlinear Fokker-Planck-Kolmogorov equations (FPK-equations) on $\mathbb{R}^{d}$ , both deterministic

\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t},\,\,t\in[0,T],

(NLFPK)

and perturbed by a first-order stochastic term driven by a finite-dimensional Wiener process $W$

\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t}-\text{div}(\sigma(t,\mu_{t}))dW_{t},\,\,t\in[0,T],

(SNLFPK)

with solutions being continuous curves of subprobability measures $\mu_{t}\in\mathcal{SP}$ . Here, $\mathcal{L}^{*}$ denotes the formal dual of a second-order differential operator acting on sufficiently smooth functions $\varphi:\mathbb{R}^{d}\to\mathbb{R}$ via

\mathcal{L}_{t,\mu}\varphi(x)=\sum_{i,j=1}^{d}a_{ij}(t,\mu,x)\partial^{2}_{ij}\varphi(x)+\sum_{i=1}^{d}b_{i}(t,\mu,x)\partial_{i}\varphi(x)

(1)

with coefficients $a$ and $b$ depending on $(t,x)\in[0,T]\times\mathbb{R}^{d}$ and (in general non-locally) on the solution $\mu_{t}$ . These equations are to be understood in distributional sense, see Definition 3.1 and 4.1. The nonlinearity arises from the dependence of $\mathcal{L}$ and $\sigma$ on the solution itself, which renders the theory of existence and uniqueness of such equations significantly more difficult compared to the linear case. For a thorough introduction to the field, we refer to [5] and the references therein. As shown in [17], the deterministic nonlinear equation (NLFPK) is naturally associated to a first-order linear continuity equation on $\mathcal{P}(\mathcal{SP})$ , the space of Borel probability measures on $\mathcal{SP}$ , of type

\partial_{t}\Gamma_{t}=\mathbf{L}^{*}_{t}\Gamma_{t},\,\,t\in[0,T],

(

\mathcal{SP}

-CE)

in the sense of distributions, with the linear operator $\mathbf{L}$ acting on sufficiently smooth real functions on $\mathcal{SP}$ via the gradient operator $\nabla^{\mathcal{SP}}$ on $\mathcal{SP}$ as

\mathbf{L}_{t}F=\big{\langle}\nabla^{\mathcal{SP}}F,b_{t}+a_{t}\nabla\big{\rangle}_{L^{2}}.

Precise information on this operator and equation ( $\mathcal{SP}$ -CE) are given in Section 3, in particular in Definition 3.5 and the paragraph preceding it.
Our first main result, Theorem 3.7 states that each weakly continuous solution $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -CE) is a superposition of solutions to (NLFPK), i.e. (denoting by $e_{t}$ the canonical projection $e_{t}:(\mu_{t})_{t\leq T}\mapsto\mu_{t}$ )

\Gamma_{t}=\eta\circ e_{t}^{-1}

(2)

for some probability measure $\eta$ concentrated on solution curves to (NLFPK) in a suitable sense.
We also treat the stochastic case in a similar fashion. More precisely, in Section 4 we establish a new correspondence between the stochastic equation for measures (SNLFPK) and a corresponding second-order equation for curves $(\Gamma_{t})_{t\leq T}$ in $\mathcal{P}(\mathcal{SP})$ of type

\partial_{t}\Gamma_{t}=(\mathbf{L}_{t}^{(2)})^{*}\Gamma_{t},\,\,t\in[0,T],

(

\mathcal{SP}

-FPK)

where, roughly,

\mathbf{L}^{(2)}_{t}=\mathbf{L}_{t}+\textit{ second-order perturbation}.

The second-order term stems from the stochastic perturbation of (SNLFPK) and will be geometrically interpreted in terms of a (formal) notion of the Levi-Civita connection on $\mathcal{SP}$ . The second main result of this work, Theorem 4.8, is then the stochastic generalization of the deterministic case: For any solution $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -FPK), there exists a solution process $(\mu_{t})_{t\leq T}$ to (SNLFPK) on some probability space such that $\mu_{t}$ has distribution $\Gamma_{t}$ . We stress that in both cases, we do not require any regularity of the coefficients.

Let us embed these results into the general research in this direction. Let $b_{t}(\cdot):\mathbb{R}^{d}\to\mathbb{R}^{d}$ be an inhomogeneous vector field and consider the (nonlinear) ODE

\frac{d}{dt}\gamma_{t}=b_{t}(\gamma_{t}),\,\,t\leq T

(ODE)

and the linear continuity equation for curves of Borel (probability) measures on $\mathbb{R}^{d}$

\partial_{t}\mu_{t}=-\operatorname*{div}(b_{t}\mu_{t}),\,\,t\leq T,

(CE)

understood in distributional sense. In the seminal paper [1], L. Ambrosio showed the following: Any (probability) solution $(\mu_{t})_{t\leq T}$ to (CE) with an appropriate global integrability condition is a superposition of solution curves to (ODE), i.e. there exists a (probability) measure $\eta$ on the space of continuous paths with values in the state space of (ODE), $C([0,T],\mathbb{R}^{d})$ , which is concentrated on solutions to (ODE) such that

\eta\circ e_{t}^{-1}=\mu_{t},\,\,t\leq T.

This allows to transfer existence and uniqueness results between the linear equation (CE) and the nonlinear (ODE). However, the linear equation must be studied on an infinite-dimensional space of (probability) measures. The analogy to our deterministic result from Section 3 is as follows: (ODE) is replaced by (NLFPK), which, in spirit of this analogy, we interpret as a differential equation on the manifold-like state space $\mathcal{SP}$ . Likewise, (CE) is replaced by ( $\mathcal{SP}$ -CE) and our first main result Theorem 3.7 may be understood as the analogue of Ambrosio’s result to the present setting. By passing from (NLFPK) to ( $\mathcal{SP}$ -CE), we linearize the equation.
Concerning the stochastic case, consider a stochastic differential equation on $\mathbb{R}^{d}$

dX_{t}=b(t,X_{t})dt+\tilde{a}(t,X_{t})dB_{t},\,\,t\in[0,T].

(SDE)

By Itô’s formula, the one-dimensional marginals $\mu_{t}$ of any (weak) martingale solution $X$ solve the corresponding linear FPK-equation

\partial_{t}\mu_{t}=\mathcal{L}_{lin,t}^{*}\mu_{t},\,\,t\in[0,T],

(FPK)

where $\mathcal{L}_{lin}$ is a linear second-order diffusion operator with coefficients $b$ and $\frac{1}{2}\tilde{a}\tilde{a}^{T}$ . Conversely, a superposition principle has successively been developed in increasingly general frameworks (cf. [9, 14, 21, 6]): Under mild global integrability assumptions, for every weakly continuous solution curve of probability measures $(\mu_{t})_{t\leq T}$ to (FPK), there exists a (weak) martingale solution $X$ to (SDE) with one-dimensional marginals $(\mu_{t})_{t\leq T}$ , thereby providing an equivalence between solutions to (SDE) and (FPK), which offers a bridge between probabilistic and analytic approaches to diffusion processes. As in the deterministic case, the transition from (SDE) to (FPK) provides a linearization, while at the same time it transfers the equation to a much higher dimensional state space. Concerning our stochastic result Theorem 4.8, we replace the stochastic equation on $\mathbb{R}^{d}$ by the stochastic equation for measures (SNLFPK) and the corresponding second-order equation for measures (FPK) by ( $\mathcal{SP}$ -FPK) and prove an analogous superposition result for solutions to the latter equation.
The proofs of both the deterministic and stochastic result rely on superposition principles for differential equations on $\mathbb{R}^{\infty}$ and the corresponding continuity equation (for the deterministic case) and for martingale solutions and FPK-equations on $\mathbb{R}^{\infty}$ (for the stochastic case) by Ambrosio and Trevisan ([2], [20]). The key technique is to transfer ( $\mathcal{SP}$ -CE) and (NLFPK) (and, similarly, ( $\mathcal{SP}$ -FPK) and (SNLFPK) for the stochastic case) to suitable equations on $\mathbb{R}^{\infty}$ via a homeomorphism between $\mathcal{SP}$ and $\mathbb{R}^{\infty}$ (replaced by $\ell^{2}$ for the stochastic case, in order to handle the stochastic integral).
Moreover, our results also blend into the theory of distribution dependent stochastic differential equations, also called McKean-Vlasov equations, i.e. stochastic equations on Euclidean space of type

dX_{t}=b(t,\mathcal{L}_{X_{t}},X_{t})dt+\tilde{a}(t,\mathcal{L}_{X_{t}},X_{t})dB_{t},\,\,t\in[0,T],

(DDSDE)

see the classical papers [15, 10, 18] as well as the more recent works [12, 11, 8]. Here, $\mathcal{L}_{X_{t}}$ denotes the distribution of $X_{t}$ and is not to be confused with the operators $\mathcal{L}_{t,\mu}$ and $\mathcal{L}_{t}$ from above. As in the non-distribution dependent case, where the curve of marginals of any solution to (SDE) solves an equation of type (FPK), a similar observation holds here: Each solution $X$ to (DDSDE) provides a solution to a nonlinear FPK-equation of type (NLFPK) via $\mu_{t}=\mathcal{L}_{X_{t}}$ and a corresponding superposition principle holds analogously to the linear case as well ([4, 3]).
However, while for (SDE) the passage to (FPK) provides a complete linearization, the situation is different for equations of type (NLFPK). This stems from the observation that (DDSDE) is an equation with two sources of nonlinearity. Hence, it seems natural to linearize (NLFPK) once more in order to obtain a linear equation, which is related to (DDSDE) and (NLFPK) in a natural way. By the results of [17], this linear equation is of type ( $\mathcal{SP}$ -CE). Similar considerations prevail in the stochastic case, where one considers equations of type (DDSDE) with an additional source of randomness (we shall not pursue this direction in this work).

On the one hand, the superposition principles of Theorem 3.7 and Theorem 4.8 provide new structural results for nonlinear FPK-equations and its corresponding linearized equations on the space of probability measures over $\mathcal{SP}$ , involving a geometric interpretation of the latter. On the other hand, it is our future plan to further study the geometry of $\mathcal{SP}$ as initiated in [17] and this work to develop an analysis on such infinite-dimensional manifold-like spaces, which allows to solve linear equations of type ( $\mathcal{SP}$ -CE) and ( $\mathcal{SP}$ -FPK) on such spaces. By means of the results of this work, one can then lift such solutions to solutions to the nonlinear equations for measures (NLFPK) and (SNLFPK), thereby obtaining new existence results for these nonlinear equations for measures.

We point out that although our main aim is to lift weakly continuous solutions to ( $\mathcal{SP}$ -CE) and ( $\mathcal{SP}$ -FPK) concentrated on probability measures to a measure on the space of continuous probability measure-valued paths $(\mu_{t})_{t\leq T}$ , for technical reasons we more generally develop our results for vaguely continuous subprobability solutions (i.e. $\mu_{t}\in\mathcal{SP}$ ). We comment on the advantages of this approach in Remark 3.9 for the deterministic case and note that similar arguments prevail in the stochastic case as well. However, due to the global integrability assumptions we consider, we are able to obtain results for probability solutions as desired.

The organization of this paper is as follows. After introducing general notation and recalling basic properties of the spaces $\mathcal{P}$ and $\mathcal{SP}$ in Section 2, Section 3 contains the deterministic case, i.e. the superposition principle between solutions to ( $\mathcal{SP}$ -CE) and (NLFPK). Here, the main result is Theorem 3.7. We use this result to prove an open conjecture of [17] (cf. Proposition 3.12) and present several consequences. In Section 4, we treat the stochastic case for equations of type (SNLFPK), the main result being Theorem 4.8.

Acknowledgements

Financial support by the German Science Foundation DFG (IRTG 2235) is gratefully acknowledged.

2 Notation and Preliminaries

We introduce notation and repeat basic facts on spaces and topologies of measures.

Notation

For a measure space $(\mathcal{X},\mathcal{A},\mu)$ and a measurable function $\varphi:\mathcal{X}\to\mathbb{R}$ , we set $\mu(\varphi):=\int_{\mathcal{X}}\varphi(x)d\mu(x)$ whenever the integral is well-defined. For $x\in\mathcal{X}$ , we denote by $\delta_{x}$ the Dirac measure in $x$ , i.e. $\delta_{x}(A)=1$ if and only if $x\in A$ and $\delta_{x}(A)=0$ else. For a topological space $X$ with Borel $\sigma$ -algebra $\mathcal{B}(X)$ we denote the set of continuous bounded functions by $C_{b}(X)$ , the set of Borel probability measures on $X$ by $\mathcal{P}(X)$ and write $\mathcal{P}=\mathcal{P}(\mathbb{R}^{d})$ . If $Y\in\mathcal{B}(X)$ , we let $\mathcal{B}(X)_{\upharpoonright Y}$ denote the trace of $Y$ on $\mathcal{B}(X)$ . For $T>0$ , a family $(\mu_{t})_{t\leq T}=(\mu_{t})_{t\in[0,T]}$ of finite Borel measures on $\mathbb{R}^{d}$ is a Borel curve, if $t\mapsto\mu_{t}(A)$ is Borel measurable for each $A\in\mathcal{B}(\mathbb{R}^{d})$ . A set of functions $\mathcal{G}\subseteq C_{b}(\mathbb{R}^{d})$ is called measure-determining, if $\mu(g)=\nu(g)$ for each $g\in\mathcal{G}$ implies $\mu=\nu$ for any two finite Borel measures $\mu,\nu$ on $\mathbb{R}^{d}$ .
For $x,y\in\mathbb{R}^{d}$ , the usual inner product is denoted by $x\cdot y$ and, with slight abuse of notation, we also denote by $x\cdot y=\sum_{k\geq 1}x_{k}y_{k}$ the inner product in $\ell^{2}$ (the Hilbert space of square-summable real-valued sequences $x=(x_{k})_{k\geq 1}$ ). For $\varphi\in C_{b}(\mathbb{R}^{d})$ , we set $||\varphi||_{\infty}:=\underset{x\in\mathbb{R}^{d}}{\text{sup}}|\varphi(x)|$ . If $\varphi$ has first- and second-order partial derivatives, we denote them by $\partial_{i}\varphi$ and $\partial^{2}_{ij}\varphi$ for $i,j\leq d$ .

We use notation for function spaces as follows. For $k\in\mathbb{N}_{0}$ , $C^{k}_{b}(\mathbb{R}^{d})$ denotes the subset of functions $\varphi$ in $C_{b}(\mathbb{R}^{d})$ with continuous, bounded partial derivatives up to order $k$ , with the usual norm $||\varphi||_{C^{2}_{b}}=\text{max}(||\varphi||_{\infty},||\partial_{i}\varphi||_{\infty},||\partial^{2}_{ij}\varphi||_{\infty})$ for $k=2$ . Likewise, $C^{k}_{c}(\mathbb{R}^{d})$ denotes the subset of all such $\varphi$ with compact support; for $k=0$ , we write $C_{c}(\mathbb{R}^{d})$ instead. For $n\geq 1$ , $p\geq 1$ and a measure $\mu$ on $\mathcal{B}(\mathbb{R}^{d})$ , we denote by $L^{p}(\mathbb{R}^{d},\mathbb{R}^{n};\mu)$ the space of Borel functions $\varphi:\mathbb{R}^{d}\to\mathbb{R}^{n}$ such that

\int_{\mathbb{R}^{d}}||\varphi(x)||^{p}d\mu(x)<+\infty,

where $||\cdot||$ denotes the standard Euclidean norm on $\mathbb{R}^{n}$ . For $p=2$ , $\langle\cdot,\cdot\rangle_{L^{2}(\mathbb{R}^{d},\mathbb{R}^{n};\mu)}$ denotes the usual inner product on the Hilbert space $L^{2}(\mathbb{R}^{d},\mathbb{R}^{n};\mu)$ . For $T>0$ and a topological space $Y$ , we write $C_{T}Y$ for the set of continuous functions $\varphi:[0,T]\to Y$ . By $\mathbb{S}^{+}_{d}$ we denote the space of symmetric, positive-semidefinite $d\times d$ -matrices with real entries.

Basic properties of spaces of measures

Probability measures

For a topological space $X$ , we endow $\mathcal{P}(X)$ with the topology of weak convergence of measures, i.e. the initial topology of the maps $\mu\mapsto\mu(\varphi)$ , $\varphi\in C_{b}(X)$ . If $X$ is Polish, then so is $\mathcal{P}(X)$ .

Subprobability measures

By $\mathcal{SP}$ we denote the set of all Borel subprobability measures on $\mathbb{R}^{d}$ , i.e. $\mu\in\mathcal{SP}$ if and only if $\mu$ is a non-negative measure on $\mathcal{B}(\mathbb{R}^{d})$ with $\mu(\mathbb{R}^{d})\leq 1$ . Throughout, we endow $\mathcal{SP}$ with the vague topology, i.e. the initial topology of the maps $\mu\mapsto\mu(g)$ , $g\in C_{c}(\mathbb{R}^{d})$ . Hence, a sequence $(\mu_{n})_{n\geq 1}$ converges to $\mu$ in $\mathcal{SP}$ if and only if $\mu_{n}(g)\underset{n\to\infty}{\longrightarrow}\mu(g)$ for each $g\in C_{c}(\mathbb{R}^{d})$ . Its Borel $\sigma$ -algebra is denoted by $\mathcal{B}(\mathcal{SP})$ . In particular, $\mathcal{P}(\mathcal{SP})$ , the set of Borel probability measures on $\mathcal{SP}$ , is a topological space with the weak topology of probability measures on $(\mathcal{SP},\mathcal{B}(\mathcal{SP}))$ . The Riesz-Markov representation theorem yields that $\mathcal{SP}$ with the vague topology coincides with the positive half of the closed unit ball of the dual space of $C_{c}(\mathbb{R}^{d})$ with the weak*-topology. Hence $\mathcal{SP}$ with the vague topology is compact. It is also Polish and $\mu\mapsto\mu(\mathbb{R}^{d})$ is vaguely lower semicontinuous, see [13, Ch.4.1]. In particular, $\mathcal{P}\in\mathcal{B}(\mathcal{SP}).$ Recall that $\mathcal{B}(\mathcal{P})=\mathcal{B}(\mathcal{SP})_{\upharpoonright_{\mathcal{P}}}$ . Hence, in the sequel we may consider measures $\Gamma\in\mathcal{P}(\mathcal{P})$ as elements in $\mathcal{P}(\mathcal{SP})$ with mass on $\mathcal{P}$ .
In contrast to weak convergence in $\mathcal{P}$ , vague convergence in $\mathcal{SP}$ can be characterized by countably many functions in a sense made precise by Lemma 3.3. The fact that this is not true for weak convergence in $\mathcal{P}$ is the main reason why we formulate all equations for subprobability measures, although we are mainly interested in the case of probability solutions. More details in this direction are stated in Remark 3.9.

3 Superposition Principle for deterministic nonlinear Fokker-Planck-Kolmogorov Equations

Fix $T>0$ throughout, let each component of the coefficients

a=(a_{ij})_{i,j\leq d}:[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}\to\mathbb{S}^{+}_{d},\,b=(b_{i})_{i\leq d}:[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}\to\mathbb{R}^{d}

be $\mathcal{B}([0,T])\otimes\mathcal{B}(\mathcal{SP})\otimes\mathcal{B}(\mathbb{R}^{d})/\mathcal{B}(\mathbb{R})$ -measurable and consider the operator $\mathcal{L}_{t,\mu}$ as in (1).

Definition 3.1.

(i)

A vaguely continuous curve $(\mu_{t})_{t\leq T}\subseteq\mathcal{SP}$ is a subprobability solution to (NLFPK), if for each $i,j\leq d$ the global integrability condition

\int_{0}^{T}\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu_{t},x)|+|b_{i}(t,\mu_{t},x)|d\mu_{t}(x)dt<+\infty

(3)

holds and for each $\varphi\in C^{2}_{c}(\mathbb{R}^{d})$ and $t\in[0,T]$

\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{t}(x)-\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{0}(x)=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}\varphi(x)d\mu_{s}(x)ds.

(4)

(ii)

A probability solution to (NLFPK) is a curve $(\mu_{t})_{t\leq T}\subseteq\mathcal{P}$ fulfilling (3) and (4) such that $t\mapsto\mu_{t}$ is weakly continuous.

Since vaguely continuous curves of measures are in particular Borel curves, all integrals in the above definition are defined. Below we shortly refer to subprobability and probability solutions and keep in mind the respective continuity conditions. In the literature, more general notions of solutions to (NLFPK) are considered, such as (possibly discontinuous) curves of signed, bounded measures [5]. However, in this work, we restrict attention to continuous (sub-)probability solutions. In presence of the global integrability condition (3), we make the following observation.

Remark 3.2.

(i)

Any subprobability solution $(\mu_{t})_{t\leq T}$ with $\mu_{0}\in\mathcal{P}$ is a probability solution. Indeed, to prove this it suffices to show $\mu_{t}(\mathbb{R}^{d})=1$ for each $t\leq T$ . Since $(\mu_{t})_{t\leq T}$ fulfills (4), it suffices to choose a sequence $\varphi_{l}$ , $l\geq 1$ , from $C^{2}_{c}(\mathbb{R}^{d})$ with the following properties: $0\leq\varphi_{l}\nearrow 1$ pointwise such that $\partial_{i}\varphi_{l}\underset{l\to\infty}{\longrightarrow}0$ , $\partial^{2}_{ij}\varphi_{l}\underset{l\to\infty}{\longrightarrow}0$ pointwise with all first and second order derivatives bounded by some $M<+\infty$ uniformly in $l\geq 1$ and $x\in\mathbb{R}^{d}$ . Considering (4) for the limit $l\to\infty$ , we obtain, by (3) and dominated convergence, for each $t\in[0,T]$

$\int_{\mathbb{R}^{d}}1d\mu_{t}-\int_{\mathbb{R}^{d}}1d\mu_{0}=0$

and hence the claim.
(ii)

By the above argument, one shows that for any subprobability solution, (4) holds for each $\varphi\in C^{2}_{b}(\mathbb{R}^{d})$ .

Geometric approach to $\mathbf{\mathcal{SP}}$

For our goals, it is preferable to consider $\mathcal{SP}$ as a manifold-like space. We refer the reader to the appendix in [17], where for the space of probability measures $\mathcal{P}$ the tangent spaces $T_{\mu}\mathcal{P}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$ and a suitable test function class $\mathcal{F}C^{2}_{b}(\mathcal{P})$ ,

F\in\mathcal{F}C^{2}_{b}(\mathcal{P})\iff F:\mu\mapsto f\big{(}\mu(\varphi_{1}),\dots,\mu(\varphi_{n})\big{)}\text{ for }n\geq 1,f\in C^{1}_{b}(\mathbb{R}^{n}),\,\varphi_{i}\in C^{\infty}_{c}(\mathbb{R}^{d}),

(5)

have been introduced. Further, based on these choices, a natural pointwise definition of the gradient $\nabla^{\mathcal{P}}F$ as a section in the tangent bundle

T\mathcal{P}=\bigsqcup_{\mu\in\mathcal{P}}T_{\mu}\mathcal{P}

for $F$ as above is given by

\nabla^{\mathcal{P}}F(\mu):=\sum_{k=1}^{n}\partial_{k}f\big{(}\mu(\varphi_{1}),\dots,\mu(\varphi_{n})\big{)}\nabla\varphi_{k}\in T_{\mu}\mathcal{P},

which is shown to be independent of the representation of $F$ in terms of $f$ and $\varphi_{i}$ . The setting in the present paper is nearly identical, but we consider the manifold-like space $\mathcal{SP}$ with the vague topology instead of $\mathcal{P}$ with the weak topology as in [17], because $\mathcal{SP}$ is embedded in $\mathbb{R}^{\infty}$ in the following sense. Let

\mathcal{G}=\{g_{i},i\geq 1\}

(6)

be dense in $C^{2}_{c}(\mathbb{R}^{d})$ with respect to $||\cdot||_{C^{2}_{b}}$ such that no $g_{i}$ is constantly $0$ . Clearly, any such set of functions is dense in $C_{c}(\mathbb{R}^{d})$ with respect to uniform convergence and measure-determining. Such sets of functions are sufficiently extensive to characterize the topology of $\mathcal{SP}$ as well as solutions to (NLFPK):

Lemma 3.3.

Let $\mathcal{G}$ be any set of functions with the properties mentioned above and let $(\mu_{n})_{n\geq 1}\subseteq\mathcal{SP}$ . Then,

(i)

$(\mu_{n})_{n\geq 1}$ converges vaguely to $\mu\in\mathcal{SP}$ if and only if

$\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i})$

for each $g_{i}\in\mathcal{G}$ .
(ii)

A vaguely continuous curve $(\mu_{t})_{t\leq T}\subseteq\mathcal{SP}$ , which fulfills (3), is a subprobability solution to (NLFPK) if and only if (4) holds for each $g_{i}\in\mathcal{G}$ in place of $\varphi$ .

Proof.

(i)

From $\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i})$ for each $g_{i}\in\mathcal{G}$ , one obtains for each $f\in C_{c}(\mathbb{R}^{d})$ and $\epsilon>0$ by choosing $g_{i}\in\mathcal{G}$ with $||f-g_{i}||_{\infty}<\frac{\epsilon}{3}$

|\mu_{n}(f)-\mu(f)|\leq|\mu_{n}(f)-\mu_{n}(g_{i})|+|\mu_{n}(g_{i})-\mu(g_{i})|+|\mu(g_{i})-\mu(f)|\leq\epsilon

(7)

for all sufficiently large $n\geq 1$ .

(ii)

Let $\varphi\in C^{2}_{c}(\mathbb{R}^{d})$ be approximated uniformly up to second-order derivatives by a sequence $\{g_{i_{k}}\}_{k\geq 1}$ from $\mathcal{G}$ . Considering (4) for such $g_{i_{k}}$ and letting $k\to\infty$ , the result follows by dominated convergence, which applies due to (3).

∎

Considering $\mathcal{SP}$ as a (infinite-dimensional) manifold-like topological space, any set of functions $\mathcal{G}$ as above provides a global chart (i.e., an atlas consisting of a single chart) for $\mathcal{SP}$ , as it yields an embedding $\mathcal{SP}\subseteq\mathbb{R}^{\infty}$ (cf. Lemma 3.4).
Consider $\mathbb{R}^{\infty}$ as a Polish space with the topology of pointwise convergence and the range $G(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ of $G$ as introduced below with its subspace topology. We write $C_{T}G(\mathcal{SP})$ for the set of all elements in $C_{T}\mathbb{R}^{\infty}$ with values in $G(\mathcal{SP})$ . For $u\in[0,T]$ , we denote by $e_{u}$ the canonical projection on $C_{T}\mathcal{SP}$

e_{u}:(\mu_{t})_{t\leq T}\mapsto\mu_{u}

and, likewise, by $e^{\infty}_{u}$ the projection on $C_{T}\mathbb{R}^{\infty}$ . Subsequently, without further mentioning, we consider the spaces $C_{T}\mathcal{SP}$ and $C_{T}\mathbb{R}^{\infty}$ with $\sigma$ -algebras

\mathcal{B}(C_{T}\mathcal{SP})=\sigma(e_{t},t\in[0,T])\text{ and }\mathcal{B}(C_{T}\mathbb{R}^{\infty})=\sigma(e_{t}^{\infty},t\in[0,T]),

respectively. These algebras coincide with the Borel $\sigma$ -algebras with respect to the topology of uniform convergence (because both $\mathcal{SP}$ and $\mathbb{R}^{\infty}$ are Polish). Also, consider $C_{T}G(\mathcal{SP})$ with the natural subspace $\sigma$ -algebra of $\mathcal{B}(C_{T}\mathbb{R}^{\infty})$ . We refer to these $\sigma$ -algebras as the canonical $\sigma$ -algebras on the respective spaces and denote the set of probability measures on the respective $\sigma$ -algebras by $\mathcal{P}(C_{T}\mathcal{SP})$ and $\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ .

Lemma 3.4.

Let $\mathcal{G}=\{g_{i}\}_{i\geq 1}$ be a set of functions as in (6).

(i)

The map $G$ , depending on $\mathcal{G}$ ,

$G:\mathcal{SP}\to\mathbb{R}^{\infty},G(\mu):=(\mu(g_{i}))_{i\geq 1}$ (8)

is a homeomorphism between $\mathcal{SP}$ and its range $G(\mathcal{SP})$ (hence, formally, a global chart for $\mathcal{SP}$ ). In particular, $G(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ is compact. Moreover, if $\mathcal{G}^{\prime}=\{g_{i}^{\prime},i\geq 1\}$ is another set as in (6) with corresponding chart $G^{\prime}$ , then $G^{\prime}=G\circ\mathcal{V}$ for a unique homeomorphism $\mathcal{V}$ on $\mathcal{SP}$ .
(ii)

The map

$J:C_{T}\mathcal{SP}\to C_{T}\mathbb{R}^{\infty},\,J((\mu_{t})_{t\leq T}):=G(\mu_{t})_{t\leq T}$

is measurable and one-to-one with measurable inverse $J^{-1}:C_{T}G(\mathcal{SP})\to C_{T}\mathcal{SP}$ . Further, $C_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty}$ is a measurable set, i.e. $C_{T}G(\mathcal{SP})\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty})$ .

Proof.

(i)

The continuity of $G$ is obvious by definition of the vague topology on $\mathcal{SP}$ and since $\mathcal{G}\subseteq C_{c}(\mathbb{R}^{d})$ . Since $\mathcal{SP}$ is compact with respect to the vague topology, compactness of $G(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ follows. $\mathcal{G}$ is measure-determining on $\mathbb{R}^{d}$ , which implies that $G$ is one-to-one. Since by definition

G(\mu_{n})\underset{n\to\infty}{\longrightarrow}G(\mu)\iff\mu_{n}(g_{i})\underset{n\to\infty}{\longrightarrow}\mu(g_{i})\text{ for each }g_{i}\in\mathcal{G},

continuity of $G^{-1}$ follows from Lemma 3.3 (i). The final assertion follows, since for $G^{\prime}$ as in the assertion, $\mathcal{V}:=G^{-1}\circ\mathcal{W}\circ G^{\prime}$ with $\mathcal{W}:G^{\prime}(\mathcal{SP})\to G(\mathcal{SP})$ , $\mathcal{W}:(\mu(g^{\prime}_{i}))_{i\geq 1}\mapsto(\mu(g_{i}))_{i\geq 1}$ is a homeomorphism.

(ii)

Since $G$ is one-to-one and measurable, so is $J$ . Clearly, $C_{T}G(\mathcal{SP})$ is the range of $J$ and hence $J:C_{T}\mathcal{SP}\to C_{T}G(\mathcal{SP})$ is a bijection between standard Borel spaces (the latter, because $\mathcal{SP}$ and $G(\mathcal{SP})$ with the respective topologies are Polish). This yields the measurability of $J^{-1}$ . Finally, closedness of $G(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ implies that $C_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty}$ is a measurable set, because $G(\mathcal{SP})$ carries the subspace topology inherited from $\mathbb{R}^{\infty}$ .

∎

By part (i) of the previous lemma it is justified to fix a set $\mathcal{G}=\{g_{i},i\geq 1\}$ for the remainder of the section. In order to switch between test functions on $\mathcal{SP}$ and $\mathbb{R}^{\infty}$ in an equivalent way, we slightly deviate from the test function class presented in [17] (see (5)) and, instead, consider

\mathcal{F}C^{2}_{b}(\mathcal{G}):=\{F:\mathcal{SP}\to\mathbb{R}|F(\mu)=f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)},f\in C^{2}_{b}(\mathbb{R}^{n}),n\geq 1\},

where the restriction $f\in C^{2}_{b}(\mathbb{R}^{n})$ is made for consistency with the stochastic case later on only. We summarize our geometric interpretation of $\mathcal{SP}$ , which is of course still a close adaption of the ideas presented in [17]:
For the manifold-like space $\mathcal{SP}$ , we consider smooth test functions $F\in\mathcal{F}C^{2}_{b}(\mathcal{G})$ , with $\mathcal{G}$ being fixed as in (6). For each $\mu\in\mathcal{SP}$ , we have the tangent space $T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$ and the gradient

\nabla^{\mathcal{SP}}F(\mu)=\sum_{k=1}^{n}\partial_{k}f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}\nabla g_{k}\in T_{\mu}\mathcal{SP}

for $\mathcal{F}C^{2}_{b}(\mathcal{G})\ni F:\mu\mapsto f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}$ as a section in the tangent bundle $T\mathcal{SP}$ , which is independent of the representation of $F$ . Adding to the approach of $\mathcal{SP}$ as a manifold-like space, the global chart $G$ as in (8) embeds $\mathcal{SP}$ into $\mathbb{R}^{\infty}$ . However, we do not rigorously treat $\mathcal{SP}$ as a (Fréchet-)manifold and consider the embedding $\mathcal{SP}\subseteq\mathbb{R}^{\infty}$ merely as a tool to transfer (NLFPK) and its corresponding continuity equation to equivalent equations over $\mathbb{R}^{\infty}$ , as outlined below.

The continuity equation ( $\mathcal{SP}$ -CE)

As mentioned in the introduction, we study the linear continuity equation associated to (NLFPK) as derived in [17], which is a first-order equation for curves of measures on $\mathcal{SP}$ . More precisely, in analogy to the derivation in [17], it is readily seen that any subprobability solution $(\mu_{t})_{t\leq T}$ to (NLFPK) induces a curve of elements in $\mathcal{P}(\mathcal{SP})$ , $\Gamma_{t}:=\delta_{\mu_{t}}$ , $t\leq T$ , with

\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}d\Gamma_{s}(\mu)ds

(9)

for each $t\leq T$ and $F\in\mathcal{F}C^{2}_{b}(\mathcal{G})$ . Here, we set $b(s,\mu)=b(s,\mu,\cdot):\mathbb{R}^{d}\to\mathbb{R}^{d}$ (similarly for $a(s,\mu)$ ), $L^{2}(\mu)=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$ and abbreviated

\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}=\int_{\mathbb{R}^{d}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}a_{ij}(s,\mu,x)\partial^{2}_{ij}g_{k}(x)+b_{i}(s,\mu,x)\partial_{i}g_{k}(x)d\mu(x).

We rewrite (9) in distributional form in duality with $\mathcal{F}C^{2}_{b}(\mathcal{G})$ as

\partial_{t}\Gamma_{t}=-\nabla^{\mathcal{SP}}\cdot([b_{t}+a_{t}\nabla]\Gamma_{t}),\,\,t\leq T.

Setting

\mathbf{L}_{t}F(\mu):=\big{\langle}a(t,\mu)\nabla+b(t,\mu),\nabla^{\mathcal{SP}}F(\mu)\big{\rangle}_{L^{2}(\mu)},

(10)

this is just the linear continuity equation ( $\mathcal{SP}$ -CE). The term $a\nabla$ has rigorous meaning only, if $a$ has sufficiently regular components in order to put the derivative $\nabla$ on $a$ via integration by parts, which we do not assume at any point. Considering $\mathcal{SP}$ as a manifold-like space, one may formally regard to $a\nabla+b$ as a time-dependent section in the tangent bundle $T\mathcal{SP}$ .
More generally, we introduce the following notion of solution to ( $\mathcal{SP}$ -CE) (see [17]):

Definition 3.5.

A weakly continuous curve $(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP})$ is a solution to ( $\mathcal{SP}$ -CE), if the integrability condition

\int_{0}^{T}\int_{\mathcal{SP}}||b(t,\mu,\cdot)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)}+||a(t,\mu,\cdot)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d^{2}};\mu)}d\Gamma_{t}(\mu)dt<+\infty

(11)

is fulfilled and for each $F\in\mathcal{F}C^{2}_{b}(\mathcal{G})$ and $t\in[0,T]$

\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds

(12)

holds (which is just (9)).

The choice of $\mathcal{G}$ as in (6) implies that any solution in the above sense fulfills (12) even for each $F\in\mathcal{F}C^{2}_{b}(\mathcal{SP})$ , i.e. for the larger class of test functions considered in [17] (upon extending their domain from $\mathcal{P}$ to $\mathcal{SP}$ ). In particular, this notion of solution is independent of $\mathcal{G}$ . The main result of this chapter, Theorem 3.7, states that any solution to ( $\mathcal{SP}$ -CE) as in Definition 3.5 arises as a superposition of solutions to (NLFPK). Note that for $\nu\in\mathcal{SP}$ , uniqueness of solutions $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -CE) with $\Gamma_{0}=\delta_{\nu}$ implies uniqueness of subprobability solutions $(\mu_{t})_{t\leq T}$ to (NLFPK) with $\mu_{0}=\nu$ .

Transferring (NLFPK) and ( $\mathcal{SP}$ -CE) to $\mathbb{R}^{\infty}$

We use the global chart $G:\mathcal{SP}\to\mathbb{R}^{\infty}$ and the map $J$ of Lemma 3.4 to reformulate both (NLFPK) and ( $\mathcal{SP}$ -CE) on $\mathbb{R}^{\infty}$ . Define a Borel vector field $\bar{B}=(\bar{B}_{k})_{k\in\mathbb{N}}$ component-wise as follows. For $t\in[0,T]$ , consider the Borel set $A_{t}\in\mathcal{B}(\mathcal{SP})$ ,

A_{t}:=\bigg{\{}\mu\in\mathcal{SP}:\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu,x)+|b_{i}(t,\mu,x)|d\mu(x)<\infty\,\,\forall i,j\leq d\bigg{\}}

and define $B:=(B_{k})_{k\in\mathbb{N}}$ via

B_{k}(t,\mu):=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu}g_{k}(x)d\mu(x),\quad(t,\mu)\in[0,T]\times A_{t}.

Now define $\bar{B}:[0,T]\times\mathbb{R}^{\infty}\to\mathbb{R}^{\infty}$ via

\bar{B}(t,z):=\begin{cases}B(t,G^{-1}(z)),&\quad\text{ if }z\in G(A_{t})\\ 0,&\quad\text{ else,}\end{cases}

which is Borel measurable by Lemma 3.4. Next, consider the differential equation on $\mathbb{R}^{\infty}$

\frac{d}{dt}z_{t}=\bar{B}(t,z_{t}),\,\,t\in[0,T],

(

\mathbb{R}^{\infty}

-ODE)

which turns out to be the suitable analogue to (NLFPK) on $\mathbb{R}^{\infty}$ . Analogously, the corresponding continuity equation for curves of Borel probability measures $\bar{\Gamma}_{t}$ on $\mathbb{R}^{\infty}$ , i.e.

\partial_{t}\bar{\Gamma}_{t}=-\bar{\nabla}\cdot(\bar{B}\bar{\Gamma}_{t}),\,\,t\in[0,T],

(

\mathbb{R}^{\infty}

-CE)

with $\bar{\nabla}$ as introduced below, is the natural analogue of the linear continuity equation ( $\mathcal{SP}$ -CE). Roughly, these analogies are to be understood in the sense that solutions to (NLFPK) and ( $\mathcal{SP}$ -CE) can be transferred to solutions to ( $\mathbb{R}^{\infty}$ -ODE) and ( $\mathbb{R}^{\infty}$ -CE), respectively, via the chart $G$ . We refer to the proof of the main result below for more details. Let

p_{i}:z\mapsto z_{i},\,\,z\in\mathbb{R}^{\infty}

denote the canonical projection to the $i$ -th component, set $\pi_{n}=(p_{1},\dots,p_{n})$ and

\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty}):=\{\bar{F}:\mathbb{R}^{\infty}\to\mathbb{R}|\bar{F}=f\circ\pi_{n},f\in C^{2}_{b}(\mathbb{R}^{n}),n\geq 1\}.

By $\bar{\nabla}$ we denote the gradient-type operator on $\mathbb{R}^{\infty}$ , acting on $\bar{F}=f\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty})$ via

\bar{\nabla}\bar{F}(z):=\big{(}(\partial_{1}f)(\pi_{n}z),\dots,(\partial_{n}f)(\pi_{n}z),0,0,\dots\big{)}.

(13)

Again, the restriction to test functions possessing second-order derivatives is made in order to be consistent with the stochastic (second-order) case later on.

Definition 3.6.

(i)

A curve $(z_{t})_{t\leq T}=((p_{i}\circ z_{t})_{i\geq 1})_{t\leq T}\in C_{T}\mathbb{R}^{\infty}$ is a solution to ( $\mathbb{R}^{\infty}$ -ODE), if for each $i\geq 1$ the $\mathbb{R}$ -valued curve $t\mapsto p_{i}\circ z_{t}$ is absolutely continuous with weak derivative $t\mapsto p_{i}\circ\bar{B}(t,z_{t})$ $dt$ -a.s.

(ii)

A curve $(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\mathbb{R}^{\infty})$ is a solution to ( $\mathbb{R}^{\infty}$ -CE), if it is weakly continuous, fulfills the integrability condition

\int_{0}^{T}\int_{\mathbb{R}^{\infty}}|\bar{B}_{k}(t,z)|d\bar{\Gamma}_{t}(z)dt<+\infty\text{ for each }k\geq 1

(14)

and for each $\bar{F}\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty})$ the identity

\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{t}(z)-\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{0}(z)=\int_{0}^{t}\int_{\mathbb{R}^{\infty}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)d\bar{\Gamma}_{s}(z)ds

holds for all $t\in[0,T]$ .

3.1 Main Result: Deterministic case

The following theorem is the main result for the deterministic case.

Theorem 3.7.

Let $a,b$ be Borel coefficients on $[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}$ . For any weakly continuous solution $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -CE) in the sense of Definition 3.5, there exists a probability measure $\eta\in\mathcal{P}(C_{T}\mathcal{SP})$ , which is concentrated on vaguely continuous subprobability solutions to (NLFPK) such that

\eta\circ e_{t}^{-1}=\Gamma_{t},\,\,t\in[0,T].

Moreover, if $\Gamma_{0}\in\mathcal{P}(\mathcal{P})$ , then $\eta$ is concentrated on weakly continuous probability solutions to (NLFPK).

The proof relies on a superposition principle for measure-valued solution curves of continuity equations on $\mathbb{R}^{\infty}$ and its corresponding differential equation, which we recall in Proposition 3.8 below. More precisely, we proceed in three steps. First, we transfer $(\Gamma_{t})_{t\leq T}$ to a solution $(\bar{\Gamma}_{t})_{t\leq T}$ to ( $\mathbb{R}^{\infty}$ -CE). Then, by Proposition 3.8 below we obtain a measure $\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ with $\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma_{t}}$ , which is concentrated on solution curves to ( $\mathbb{R}^{\infty}$ -ODE). Finally, we transfer $\bar{\eta}$ back to a measures $\eta\in\mathcal{P}(C_{T}\mathcal{SP})$ with the desired properties. Below, we denote by $\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty})$ the set of test functions of same type as in $\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty})$ , but with $f\in C^{1}_{b}(\mathbb{R}^{n})$ in place of $f\in\mathcal{F}C^{2}_{b}(\mathbb{R}^{n})$ .

Proposition 3.8.

[Superposition principle on $\mathbb{R}^{\infty}$ , Thm. 7.1. [2]] Let $(\bar{\Gamma}_{t})_{t\leq T}$ be a solution to ( $\mathbb{R}^{\infty}$ -CE) in the sense of Definition 3.6 (ii) with test functions $\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty})$ instead of $\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty})$ . Then, there exists a Borel measures $\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ concentrated on solutions to ( $\mathbb{R}^{\infty}$ -ODE) in the sense of Definition 3.6 (i) such that

\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t},\,\,t\leq T.

We proceed to the proof of the main result.

Proof of Theorem 3.7: Let $\Gamma=(\Gamma_{t})_{t\leq T}$ be a weakly continuous solution to ( $\mathcal{SP}$ -CE) as in Definition 3.5.
Step 1: From ( $\mathcal{SP}$ -CE) to ( $\mathbb{R}^{\infty}$ -CE): Set

\bar{\Gamma}_{t}:=\Gamma_{t}\circ G^{-1},

with $G$ as in Lemma 3.4, which corresponds to the fixed set of functions $\mathcal{G}$ . Since $G$ is continuous, $(\bar{\Gamma}_{t})_{t\leq T}$ is a weakly continuous curve of Borel subprobability measures on $\mathbb{R}^{\infty}$ . We show that $(\bar{\Gamma}_{t})_{t\leq T}$ solves ( $\mathbb{R}^{\infty}$ -CE). Indeed, the integrability condition (14) is fulfilled, since $(\Gamma_{t})_{t\leq T}$ fulfills Definition 3.5. Further, since $\Gamma$ solves ( $\mathcal{SP}$ -CE), we have for any $\mathcal{F}C^{2}_{b}(\mathcal{G})\ni F:\mu\mapsto f\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}$ and $t\in[0,T]$

\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds=\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)

(15)

and hence, abbreviating $p_{k}\circ B(t,\cdot)$ by $B^{k}_{t}$ and setting $\bar{F}=f\circ\pi_{n}$ for $f$ as above, we have

	$\displaystyle\int_{0}^{t}\int_{\mathcal{SP}}\mathbf{L}_{s}F(\mu)d\Gamma_{s}(\mu)ds$	$\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu}g_{k}(x)d\mu(x)\bigg{)}\Gamma_{s}(\mu)ds$
		$\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(g_{1}),\dots,\mu(g_{n})\big{)}B^{k}_{s}(\mu)d\Gamma_{s}(\mu)ds$
		$\displaystyle=\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}p_{1}\circ G(\mu),\dots,p_{n}\circ G(\mu)\big{)}\bar{B}^{k}_{s}\circ G(\mu)d\Gamma_{s}(\mu)ds$
		$\displaystyle=\int_{0}^{t}\int_{\mathbb{R}^{\infty}}\nabla\bar{F}(z)\cdot\bar{B}_{s}(z)\bar{\Gamma}_{s}(z)ds$

and, furthermore, for each $s\in[0,T]$

\int_{\mathcal{SP}}F(\mu)d\Gamma_{s}(\mu)=\int_{\mathcal{SP}}f\big{(}p_{1}\circ G(\mu),\dots,p_{n}\circ G(\mu)\big{)}d\Gamma_{s}(\mu)=\int_{\mathbb{R}^{\infty}}\bar{F}(z)d\bar{\Gamma}_{s}(z).

Comparing with (15), it follows that $(\bar{\Gamma}_{t})_{t\leq T}$ is a solution to ( $\mathbb{R}^{\infty}$ -CE) as claimed, because $F\in\mathcal{F}C^{2}_{b}(\mathcal{G})$ was arbitrary and hence $\bar{F}$ as above is arbitrary in $\mathcal{F}C^{2}_{b}(\mathbb{R}^{\infty})$ . By standard approximation, one extends the above equation to test functions $\bar{F}$ from $\mathcal{F}C^{1}_{b}(\mathbb{R}^{\infty})$ .

Step 2: From ( $\mathbb{R}^{\infty}$ -CE) to ( $\mathbb{R}^{\infty}$ -ODE): Proposition 3.8 implies the existence of a measure $\bar{\eta}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ such that

(i)

$\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t}$ for each $t\in[0,T]$
(ii)

$\bar{\eta}$ is concentrated on solution paths of ( $\mathbb{R}^{\infty}$ -ODE).

Step 3: From ( $\mathbb{R}^{\infty}$ -ODE) to (NLFPK): We show that the measure $\eta:=\bar{\eta}\circ(J^{-1})^{-1}$ , with $J$ as in Lemma 3.4 fulfills all desired properties. Indeed, since

\bar{\eta}\circ(e^{\infty}_{t})^{-1}=\bar{\Gamma}_{t}=\Gamma_{t}\circ G^{-1},

for each $t\in[0,T]$ we deduce that $\bar{\eta}\circ(e^{\infty}_{t})^{-1}$ is concentrated on $G(\mathcal{SP})$ . By Lemma 3.4, $G(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ is closed. Since by construction $\bar{\eta}$ is concentrated on continuous curves in $\mathbb{R}^{\infty}$ , $\bar{\eta}$ is concentrated on $C_{T}G(\mathcal{SP})$ . Further, $C_{T}G(\mathcal{SP})\subseteq C_{T}\mathbb{R}^{\infty}$ is a measurable set and $J^{-1}:C_{T}G(\mathcal{SP})\to C_{T}\mathcal{SP}$ is measurable by Lemma 3.4. Therefore, we may define $\eta\in\mathcal{P}(C_{T}\mathcal{SP})$ via

\eta:=\bar{\eta}\circ(J^{-1})^{-1}.

It remains to verify $\eta\circ e_{t}^{-1}=\Gamma_{t}$ for all $t\in[0,T]$ and that $\eta$ is concentrated on subprobability solutions to (NLFPK). Concerning the first matter, we have

\eta\circ e_{t}^{-1}=\bar{\eta}\circ(J^{-1})^{-1}\circ e_{t}^{-1}=\bar{\eta}\circ(e_{t}\circ J^{-1})^{-1}

and

\Gamma_{t}=\Gamma_{t}\circ(G^{-1}\circ G)^{-1}=\bar{\Gamma}_{t}\circ(G^{-1})^{-1}=\bar{\eta}\circ(G^{-1}\circ e_{t}^{\infty})^{-1}.

Since $e_{t}\circ J^{-1}$ and $G^{-1}\circ e_{t}^{\infty}$ coincide as measurable maps on $C_{T}G(\mathcal{SP})$ and it was shown above that $\bar{\eta}$ is concentrated on $C_{T}G(\mathcal{SP})$ , we obtain

\eta\circ e_{t}^{-1}=\Gamma_{t},\,\,t\leq T.

Concerning the second aspect, note that by definition of $\eta$ and $\bar{\Gamma}_{t}$ and by the equality $e_{t}\circ J^{-1}=G^{-1}\circ e_{t}^{\infty}$ , (11) for $\Gamma$ implies that $\eta$ is concentrated on vaguely continuous curves $t\mapsto\mu_{t}$ in $\mathcal{SP}$ with the global integrability property (3) such that $t\mapsto G(\mu_{t})$ is a solution to ( $\mathbb{R}^{\infty}$ -ODE). Each such curve $t\mapsto\mu_{t}$ is a subprobability solution to (NLFPK). Indeed, due to $\mu_{t}\in A_{t}$ $dt$ -a.s., we have

	$\displaystyle\frac{d}{dt}p_{k}\circ G(\mu_{t})=$	$\displaystyle\,p_{k}\circ\bar{B}(t,G(\mu_{t}))\quad dt-a.s.\iff\frac{d}{dt}p_{k}\circ G(\mu_{t})=p_{k}\circ B(t,\mu_{t})\quad dt-a.s.$
		$\displaystyle\iff\frac{d}{dt}\int_{\mathbb{R}^{d}}g_{k}(x)d\mu_{t}(x)=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu_{t}}g_{k}(x)d\mu_{t}(x)\quad dt-a.s.$
		$\displaystyle\iff\int_{\mathbb{R}^{d}}g_{k}d\mu_{t}-\int_{\mathbb{R}^{d}}g_{k}d\mu_{0}=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}g_{k}(x)d\mu_{s}(x)ds,\,\,t\in[0,T],$

and Lemma 3.3 (ii) applies. It remains to prove the additional assertion about probability solutions. To this end, assume $\Gamma_{0}$ is concentrated on $\mathcal{P}$ . Then, $\eta(e_{0}\in\mathcal{P})=1$ and hence the claim follows by Remark 3.2. ∎

The final assertion of the theorem in particular implies: If $\Gamma_{0}\in\mathcal{P}(\mathcal{P})$ for a weakly continuous solution $(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP})$ to ( $\mathcal{SP}$ -CE), then $\Gamma_{t}\in\mathcal{P}(\mathcal{P})$ for each $t\leq T$ . Of course, this is to be expected due to the global integrability condition in Definition 3.5.

Remark 3.9.

Finally, let us explain why we developed the above result for subprobability solutions to (NLFPK) although our principal interest is restricted to probability solutions. If we directly consider solution curves $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -CE) with $\Gamma_{t}\in\mathcal{P}(\mathcal{P})$ , we cannot prove that $\eta$ in Theorem 3.7 is concentrated on $C_{T}\mathcal{P}$ (in fact, not even $\eta(C_{T}\mathcal{P})>0$ could be shown). Indeed, inspecting the proof above, one may only prove that $\eta\circ e_{t}^{-1}$ is concentrated on $\mathcal{P}$ for each $t\leq T$ . But since $\mathcal{P}\subseteq\mathcal{SP}$ is not closed, curves in the support of $\eta$ may be proper subprobability-valued at single times. The deeper reason for this is that the range $G(\mathcal{P})$ of $G$ as in 3.3 as a map on $\mathcal{P}$ with the weak topology is not closed in $\mathbb{R}^{\infty}$ . It seems that one cannot resolve this issue by simply changing the function set $\mathcal{G}$ , since there exists no countable set of functions, which allows for a characterization of weak instead of vague convergence as in Lemma 3.3. Since $\mathcal{SP}$ with the vague topology is compact and the vague test function class $C_{c}(\mathbb{R}^{d})$ is separable, it is feasible to carry out the entire development for subprobability measures as above.

We also mention that to our understanding there is no inherent reason why the superposition principle could not be extended to larger spaces of measures (e.g. spaces of signed measures), as long as its topology allows for a suitable identification with $\mathbb{R}^{\infty}$ as in our present case. Our principal motivation from a probabilistic viewpoint was to study curves of probability measures, and we were only forced to extend to $\mathcal{SP}$ , the vague closure of $\mathcal{P}$ , by the reasons outlined above. In order to replace $\mathcal{SP}$ by some larger space of measures $\mathcal{M}$ , it seems indispensable that Lemma 3.4 remains true, i.e. that the range of $\mathcal{M}$ under a suitable homeomorphism is closed in $\mathbb{R}^{\infty}$ .

3.2 Consequences and applications

The following existence- and uniqueness results immediately follow from the superposition principle Theorem 3.7 and provide an equivalence between the nonlinear FPK-equation (NLFPK) and its linearized continuity equation ( $\mathcal{SP}$ -CE).

Corollary 3.10.

Let $\mu_{0}\in\mathcal{SP}$ and assume there exists a solution to ( $\mathcal{SP}$ -CE) with initial condition $\delta_{\mu_{0}}$ . Then, there exists a subprobability solution to (NLFPK) with initial condition $\mu_{0}$ . Moreover, if $\mu_{0}\in\mathcal{P}$ , then there exists a probability solution to (NLFPK) with initial condition $\mu_{0}$ .

Proof.

By Theorem 3.7 there exists a probability measure $\eta$ concentrated on subprobability solutions to (NLFPK) with $\eta\circ e_{0}^{-1}=\delta_{\mu_{0}}$ . Hence, at least one such solution to (NLFPK) with initial condition $\mu_{0}$ exists. The second assertion is treated similarly. ∎

Corollary 3.11.

Let $\mu_{0}\in\mathcal{SP}$ and assume there exists at most one vaguely continuous subprobability solution to (NLFPK) with initial condition $\mu_{0}$ . Then, there exists also at most one weakly continuous solution $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -CE) with initial condition $\delta_{\mu_{0}}$ . If $\mu_{0}\in\mathcal{P}$ , then, in the case of existence, $\Gamma_{t}(\mathcal{P})=1$ for each $t\in[0,T]$ .

Proof.

Let $\Gamma^{(1)}$ and $\Gamma^{(2)}$ be weakly continuous solutions to ( $\mathcal{SP}$ -CE) with $\Gamma^{(i)}_{0}=\delta_{\mu_{0}}$ for $i\in\{1,2\}$ . By Theorem 3.7, there exist probability measures $\eta^{(i)}$ , $i\in\{1,2\}$ , concentrated on subprobability solutions to (NLFPK) with initial condition $\mu_{0}$ such that $\eta^{(i)}\circ e_{t}^{-1}=\Gamma_{t}^{(i)}$ for each $t\in[0,T]$ and $i\in\{1,2\}$ . By assumption, we obtain $\eta^{(1)}=\delta_{\mu}=\eta^{(2)}$ for a unique element $\mu\in C_{T}\mathcal{SP}$ and thus also $\Gamma^{(1)}=\Gamma^{(2)}$ . If $\mu_{0}\in\mathcal{P}$ , then $\mu\in C_{T}\mathcal{P}$ by Remark 3.2, which gives the second assertion. ∎

3.2.1 Application to coupled nonlinear-linear Fokker-Planck-Kolmogorov equations

Using the superposition principle, we prove an open conjecture posed in [17]. Let us shortly recapitulate the necessary framework. In [17], the authors consider a coupled nonlinear-linear FPK-equation of type

\begin{cases}\partial_{t}\mu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\mu_{t}\\ \partial_{t}\nu_{t}=\mathcal{L}^{*}_{t,\mu_{t}}\nu_{t},\end{cases}

(16)

i.e. comparing to our situation the first nonlinear equation is of type (NLFPK) and the second (linear) equation is obtained by ”freezing” a solution $(\mu_{t})_{t\leq T}$ to the first equation in the nonlinearity spot of $\mathcal{L}$ . For an initial condition $(\bar{\mu},\bar{\nu})\in\mathcal{P}\times\mathcal{P}$ , (16) is said to have a unique solution, if there exists a unique probability solution $(\mu_{t})_{t\leq T}$ to the first equation in the sense of Definition 3.1 with $\mu_{0}=\bar{\mu}$ and a unique weakly continuous curve $(\nu_{t})_{t\leq T}\subseteq\mathcal{P}$ , which solves the second equation with fixed coefficient $\mu_{t}$ with $\nu_{0}=\bar{\nu}$ (we refer to [17] for more details). The authors associate a linear continuity equation on $\mathbb{R}^{d}\times\mathcal{P}$ to (16) in the following sense: Let $\mathbb{L}$ be the operator acting on functions

\mathcal{C}:=\big{\{}\Phi:(x,\mu)\mapsto\varphi(x)F(\mu)|\varphi\in C^{2}_{c}(\mathbb{R}^{d}),F\in\mathcal{F}C^{2}_{b}(\mathcal{P})\big{\}},

via

\mathbb{L}_{t}\Phi(x,\mu):=\mathcal{L}_{t,\mu}\Phi(\cdot,\mu)(x)+\mathbf{L}_{t}\Phi(x,\cdot)(\mu),

with $\mathcal{L}$ as in (1) and $\mathbf{L}$ as in (10). Consider the continuity equation

\partial_{t}\Lambda_{t}=\mathbb{L}_{t}^{*}\Lambda_{t},\,\,t\in[0,T]

(17)

for weakly continuous curves of Borel probability measures on $\mathbb{R}^{d}\times\mathcal{P}$ . The exact notion of solution can be found in [17], where also the following observation is made: A pair $(\mu_{t},\nu_{t})_{t\leq T}$ solves (16) if and only if $\Lambda_{t}:=\nu_{t}\times\delta_{\mu_{t}}$ solves (17). Using our main result, we prove the following conjecture posed in Remark 4.4. of [17].

Proposition 3.12.

If $(\mu_{t},\nu_{t})_{t\leq T}$ is the unique solution to (16) with initial condition $(\bar{\mu},\bar{\nu})\in\mathcal{P}\times\mathcal{P}$ , then $(\nu_{t}\times\delta_{\mu_{t}})_{t\leq T}$ is the unique solution to (17) with initial condition $\bar{\nu}\times\delta_{\bar{\mu}}$ .

Proof.

By Corollary (3.11), the unique solution to ( $\mathcal{SP}$ -CE) with initial condition $\delta_{\bar{\mu}}$ is $(\delta_{\mu_{t}})_{t\leq T}$ . Let $(\Lambda^{(1)}_{t})_{t\leq T}$ and $(\Lambda^{(2)}_{t})_{t\leq T}$ be two solutions to (17) with initial condition $\bar{\nu}\times\delta_{\bar{\mu}}$ . It is straightforward to check that the curves of second marginals $(\Lambda_{t}^{(1)}\circ\varPi_{2}^{-1})_{t\leq T}$ and $(\Lambda_{t}^{(2)}\circ\varPi_{2}^{-1})_{t\leq T}$ are probability solutions to ( $\mathcal{SP}$ -CE) with initial condition $\delta_{\bar{\mu}}$ (where we denote by $\varPi_{2}$ the projection from $\mathbb{R}^{d}\times\mathcal{P}$ onto the second coordinate). Hence, for each $t\in[0,T]$

\Lambda_{t}^{(1)}\circ\varPi_{2}^{-1}=\delta_{\mu_{t}}=\Lambda^{(2)}_{t}\circ\varPi_{2}^{-1}.

Consequently, $\Lambda_{t}^{(i)}$ is of product type, i.e. $\Lambda^{(i)}_{t}=\gamma_{t}^{(i)}\times\delta_{\mu_{t}}$ for weakly continuous curves $(\gamma^{(i)}_{t})_{t\leq T}\subseteq\mathcal{P}$ , $i\in\{1,2\}$ . It is immediate to show that each curve $\gamma^{(i)}$ solves the second equation of (16) with fixed $\mu_{t}$ and initial condition $\bar{\nu}$ . Hence, $\gamma^{(i)}_{t}=\nu_{t}$ for each $t\in[0,T]$ and $i\in\{1,2\}$ , which implies $\Lambda^{(1)}_{t}=\Lambda^{(2)}_{t}$ . Hence, the unique solution to (17) with initial condition $\bar{\nu}\times\delta_{\bar{\mu}}$ is given by $(\nu_{t}\times\delta_{\mu_{t}})_{t\leq T}$ . ∎

4 Superposition Principle for stochastic nonlinear Fokker-Planck-Kolmogorov Equations

We make use of the following notation specific to the stochastic case.

For two real-valued $n\times n$ matrices $A,B$ we write $A$ : $B=\sum_{k,l=1}^{n}A_{kl}B_{kl}$ . We use the same notation for $A=(A_{kl})_{k,l\geq 1}$ and $B=(B_{kl})_{k,l\geq 1}$ , if either $A$ or $B$ contain only finitely many non-trivial entries.
For the Hilbert space $\ell^{2}$ with topology induced by the usual inner product $\langle\cdot,\cdot\rangle_{\ell^{2}}$ and norm $||\cdot||_{\ell^{2}}$ , we denote the space of continuous $\ell^{2}$ -valued functions on $[0,T]$ by $C_{T}\ell^{2}$ . On $\ell^{2}$ and $C_{T}\ell^{2}$ , we unambiguously use the same notation $e_{t},p_{i}$ and $\pi_{n}$ as on $\mathbb{R}^{\infty}$ and $C_{T}\mathbb{R}^{\infty}$ in the previous section. Reminiscent to the previous section, we set $\mathcal{B}(C_{T}\ell^{2})=\sigma(e_{t},t\in[0,T])$ and denote the set of probability measures on this space by $\mathcal{P}(C_{T}\ell^{2})$ . For $\sigma$ -algebras $\mathcal{A}_{1}$ , $\mathcal{A}_{2}$ , we denote by $\mathcal{A}_{1}\bigvee\mathcal{A}_{2}$ the $\sigma$ -algebra generated by $\mathcal{A}_{1}$ and $\mathcal{A}_{2}$ .

We call a filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P})$ complete, provided both $\mathcal{F}$ and $\mathcal{F}_{0}$ contain all subsets of $\mathbb{P}$ -negligible sets $N\in\mathcal{F}$ (i.e. $\mathbb{P}(N)=0$ ). This notion does not require $(\mathcal{F}_{t})_{t\leq T}$ to be right-continuous. A real-valued Wiener process $W=(W_{t})_{t\leq T}$ on such a probability space is called an $\mathcal{F}_{t}$ -Wiener process, if $W_{t}$ is $\mathcal{F}_{t}$ -adapted and $W_{u}-W_{t}$ is independent of $\mathcal{F}_{t}$ for each $0\leq t\leq u\leq T$ . Pathwise properties of stochastic processes such as continuity are to be understood up to a negligible set with respect to the underlying measure.

As in the previous section, we consider $\mathcal{SP}$ as a compact Polish space with the vague topology. Let $d_{1}\geq 1$ and consider product-measurable coefficients on $[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}$

a(t,\mu,x)=(a_{ij}(t,\mu,x))\in\mathbb{S}^{+}_{d},\,\,b(t,\mu,x)=(b_{i}(t,\mu,x))_{i\leq d}\in\mathbb{R}^{d},\,\sigma(t,\mu,x)=(\sigma_{ij}(t,\mu,x))_{i,j\leq d}\in\mathbb{R}^{d\times d_{1}}

such that $\sigma$ is bounded, and let $\mathcal{L}$ be as before, i.e.

\mathcal{L}_{t,\mu}\varphi(x)=b_{i}(t,\mu,x)\partial_{i}\varphi(x)+a_{ij}(t,\mu,x)\partial^{2}_{ij}\varphi(x)

for $\varphi\in C^{2}(\mathbb{R}^{d})$ and $(t,\mu,x)\in[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}$ .

In contrast to the deterministic framework of the previous section, here we consider nonlinear stochastic FPK-equations of type (SNLFPK) on $[0,T]$ , to be understood in distributional sense as follows. With slight abuse of notation, for $\sigma\in\mathbb{R}^{d\times d_{1}}$ and $x\in\mathbb{R}^{d}$ , we write $\sigma\cdot x=(\sum_{i=1}^{d}\sigma^{ik}x_{i})_{k\leq d_{1}}$ , which is consistent with the standard inner product notation $\sigma\cdot x$ in the case $d_{1}=1$ .

Definition 4.1.

(i)

A pair $(\mu,W)$ consisting of an $\mathcal{F}_{t}$ -adapted vaguely continuous $\mathcal{SP}$ -valued stochastic process $\mu=(\mu_{t})_{t\leq T}$ and an $\mathcal{F}_{t}$ -adapted, $d_{1}$ -dimensional Wiener process $W=(W_{t})_{t\leq T}$ on a complete probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P})$ is a subprobability solution to (SNLFPK), provided

\int_{0}^{T}\int_{\mathbb{R}^{d}}|b_{i}(t,\mu_{t},x)|+|a_{ij}(t,\mu_{t},x)|+|\sigma_{ik}(t,\mu_{t},x)|^{2}d\mu_{t}(x)dt<\infty\quad\mathbb{P}\text{-a.s.}

(18)

for each $i,j\leq d,k\leq d_{1}$ , and

\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{t}(x)-\int_{\mathbb{R}^{d}}\varphi(x)d\mu_{0}(x)=\int_{0}^{t}\int_{\mathbb{R}^{d}}\mathcal{L}_{s,\mu_{s}}\varphi(x)d\mu_{s}(x)ds+\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s}

(19)

holds $\mathbb{P}$ -a.s. for each $t\in[0,T]$ and $\varphi\in C^{2}_{c}(\mathbb{R}^{d})$ .

(ii)

A probability solution to (SNLFPK) is a pair as above such that $\mu$ is a $\mathcal{P}$ -valued process $(\mu_{t})_{t\leq T}$ with weakly continuous paths.

Remark 4.2.

(i)

Since $C^{2}_{c}(\mathbb{R}^{d})$ is separable with respect to uniform convergence and since the paths $t\mapsto\mu_{t}(\omega)$ are vaguely continuous, the exceptional sets in the above definition can be chosen independently of $\varphi$ and $t$ .

(ii)

The first integral on the right-hand side of (19) is a pathwise (that is, for individual fixed $\omega\in\Omega$ ) integral with respect to the finite measure $\mu_{s}(\omega)ds$ on $[0,T]\times\mathbb{R}^{d}$ . The second integral is a stochastic integral, which is defined, since the integrand

(t,\omega)\mapsto\int_{\mathbb{R}^{d}}\sigma(t,\mu_{t}(\omega),x)\cdot\nabla\varphi(x)d\mu_{t}(\omega)(x)

is $\mathbb{R}^{d_{1}}$ -valued, bounded, product-measurable and $\mathcal{F}_{t}$ -adapted (Thm. 3.8 [7]). More precisely,

\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s}=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma^{\alpha}\cdot\nabla\varphi d\mu_{s}dW^{\alpha}_{s},

where $\sigma^{\alpha}=(\sigma^{i\alpha})_{i\leq d}$ denotes the $\alpha$ -th column of $\sigma$ and the components $W^{\alpha}$ , $\alpha\leq d_{1}$ , of $W$ are real, independent Wiener processes.

By the global integrability assumption (18) and since $\sigma$ is bounded, we obtain (in analogy to Remark 3.2) the following conservation of mass, which we use to prove the final assertion of the main result Theorem 4.8.

Lemma 4.3.

Let $(\mu_{t})_{t\leq T}$ be a subprobability solution to (SNLFPK). If $\mu_{0}\in\mathcal{P}$ $\mathbb{P}$ -a.s., then the paths of $t\mapsto\mu_{t}$ are $\mathcal{P}$ -valued $\mathbb{P}$ -a.s. and, hence, in particular weakly continuous.

Proof.

Let $(\varphi_{k})_{k\geq 1}\subseteq C^{2}_{c}(\mathbb{R}^{d})$ approximate the constant function $1$ as in Remark 3.2. Then, by Itô-isometry, for each $t\in[0,T]$ , there exists a subsequence $(k^{t}_{l})_{l\geq 1}=(k_{l})_{l\geq 1}$ such that

\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi_{k_{l}}(x)d\mu_{s}(x)dW_{s}\underset{l\to\infty}{\longrightarrow}0\,\,\mathbb{P}\text{-a.s.}

(20)

Since the stochastic integral is continuous in $t$ , a classical diagonal argument yields that there exists a subsequence $(k_{l})_{l\geq 1}$ along which (20) holds for all $t\in[0,T]$ on a set of full $\mathbb{P}$ -measure, independent of $t$ . Let $\omega^{\prime}\in\Omega$ be from this set such that also $\mu_{0}(\omega^{\prime})\in\mathcal{P}$ and (19) holds for each $t$ and $\varphi$ . Note that the set of all such $\omega^{\prime}$ has full $\mathbb{P}$ -measure. Then, similar to the reasoning in Remark 3.2 and by using (20), considering (19) for such $\omega^{\prime}$ with $\varphi_{k_{l}}$ in place of $\varphi$ for the limit $l\longrightarrow+\infty$ , we obtain

\mu_{t}(\omega^{\prime})(\mathbb{R}^{d})=\mu_{0}(\omega^{\prime})(\mathbb{R}^{d}),\,\,t\in[0,T]

and hence the result. ∎

Note that the above proof can be adjusted to extend (19) to each $\varphi\in C^{2}_{b}(\mathbb{R}^{d})$ .

Embedding $\mathcal{SP}$ into $\ell^{2}$

In comparison with the deterministic case, we still consider $\mathcal{SP}$ as a manifold-like space with tangent spaces $T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$ as before. However, instead of embedding into $\mathbb{R}^{\infty}$ by $G$ as in the previous section, now we need a global chart

H:\mathcal{SP}\to\ell^{2}

in order to handle the stochastic integral term later on. To this end, we replace the set of functions $\mathcal{G}=\{g_{i},i\geq 1\}$ of the deterministic case by

\mathcal{H}:=\{h_{i}\}_{i\geq 1},\,h_{i}:=2^{-i}\frac{g_{i}}{||g_{i}||_{C^{2}_{b}}}

(21)

and consider the map

H:\mathcal{SP}\to\ell^{2},\,\,H:\mu\mapsto(\mu(h_{i}))_{i\geq 1}.

The following lemma collects useful properties of $\mathcal{H}$ and $H$ , which are in the spirit of Lemma 3.3 and 3.4. We point out that we could have used the function class $\mathcal{H}$ instead of $\mathcal{G}$ already in Section 3, but we decided to pass from $\mathcal{G}$ to $\mathcal{H}$ at this point in order to stress the technical adjustments necessary due to the stochastic case.

Lemma 4.4.

(i)

The set $\mathcal{H}$ is measure-determining. Further, a process $(\mu_{t})_{t\leq T}$ as in Definition 4.1 is a solution to (SNLFPK) if and only if (19) holds for each $h_{i}\in\mathcal{H}$ in place of $\varphi$ .
(ii)

$H$ is a homeomorphism between $\mathcal{SP}$ and its range $H(\mathcal{SP})\subseteq\ell^{2}$ , endowed with the $\ell^{2}$ -subspace topology. In particular, $H(\mathcal{SP})\subseteq\ell^{2}$ is compact.

Proof.

(i)

The first claim is obvious, since $\mathcal{G}$ is measure-determining. Concerning the second claim, note that it is clearly sufficient to have (19) for each $\varphi\in C_{c}^{2}(\mathbb{R}^{d})$ with $||\varphi||_{C^{2}_{b}}\leq 1$ . Since the functions $||g_{i}||^{-1}_{C^{2}_{b}}g_{i}$ are dense in the unit ball of $C^{2}_{c}(\mathbb{R}^{d})$ with respect to $||\cdot||_{C^{2}_{b}}$ , it is sufficient to have (19) for each such normalized function. Indeed, if $\varphi_{k}\underset{k\to\infty}{\longrightarrow}\varphi$ uniformly up to second-order partial derivatives, then by Itô-isometry

\mathbb{E}\bigg{[}\bigg{(}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},\cdot)\cdot\nabla(\varphi_{k}-\varphi)d\mu_{s}dW_{s}\bigg{)}^{2}\bigg{]}=\mathbb{E}\bigg{[}\int_{0}^{t}\bigg{(}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},\cdot)\cdot\nabla(\varphi_{k}-\varphi)d\mu_{s}\bigg{)}^{2}ds\bigg{]},

which converges to $0$ as $k\longrightarrow\infty$ due to the boundedness of $\sigma$ . Hence, along a subsequence $(k_{l})_{l\geq 1}$ , we have a.s.

\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi_{k_{l}}(x)d\mu_{s}(x)dW_{s}\underset{l\to\infty}{\longrightarrow}\int_{0}^{t}\int_{\mathbb{R}^{d}}\sigma(s,\mu_{s},x)\cdot\nabla\varphi(x)d\mu_{s}(x)dW_{s}.

The a.s.-convergence of all other terms in (19) is clear. Therefore, it is sufficient to require (19) for a dense subset of the unit ball of $C^{2}_{c}(\mathbb{R}^{d})$ . Clearly, this yields at once that it is sufficient to have (19) for each $h_{i}\in\mathcal{H}$ .

(ii)

By definition, $H$ maps into $\ell^{2}$ . Since $\mathcal{H}$ is measure-determining, $H$ is one-to-one, hence bijective onto its range. If $\mu_{n}\underset{n\to\infty}{\longrightarrow}\mu$ vaguely in $\mathcal{SP}$ , clearly $H(\mu_{n})$ converges to $H(\mu)$ in the product topology. Since for any $i\geq 1$

$\underset{n\geq 1}{\text{sup}}|H(\mu_{n})_{i}|\leq 2^{-i},$

the convergence holds in $\ell^{2}$ as well, which implies continuity of $H$ . In particular, $H(\mathcal{SP})\subseteq\ell^{2}$ is compact. Conversely, if $H(\mu_{n})$ converges in $\ell^{2}$ to some $z=(z_{i})_{i\geq 1}$ , then, by closedness of $H(\mathcal{SP})\subseteq\ell^{2}$ , we have $z=H(\mu)$ for a unique element $\mu\in\mathcal{SP}$ and $\mu_{n}\underset{n\to\infty}{\longrightarrow}\mu$ vaguely. Indeed, the latter follows as in Lemma 3.4 (i).

∎

For consistency of notation, below we denote the test function class of the manifold-like space $\mathcal{SP}$ by $\mathcal{F}C^{2}_{b}(\mathcal{H})$ to stress that the base functions $g_{i}$ are now replaced by $h_{i}\in\mathcal{H}$ . However, the class of test functions remains unchanged, because the transition from $g_{i}$ to $h_{i}$ can be incorporated in the choice of $f$ .

Linearization of (SNLFPK)

As in the deterministic case, also for the stochastic nonlinear equation (SNLFPK) one can consider an associated linear equation for curves in $\mathcal{P}(\mathcal{SP})$ . To the best of our knowledge, such a linearization for stochastic FPK-equations has not yet been considered in the literature. Of course, the basic idea stems from the deterministic case [17] discussed in the previous section. From Itô’s formula one expects this linearized equation to be of second-order.

Let $\big{(}(\mu_{t})_{t\leq T},W\big{)}$ be a subprobability solution to (SNLFPK) (with underlying measure $\mathbb{P}$ ) and choose any $F:\mu\mapsto f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}$ from $\mathcal{F}C^{2}_{b}(\mathcal{H})$ . Again, we abbreviate $b(t,\mu):=b(t,\mu,\cdot)$ and similarly for $a$ and $\sigma$ . By Itô’s formula, we have $\mathbb{P}$ -a.s.

	$\displaystyle F(\mu_{t})-F(\mu_{0})$	$\displaystyle=\int_{0}^{t}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu_{s})}ds$
		$\displaystyle+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\sum_{k,l=1}^{n}(\partial_{kl}f)(\mu(h_{1}),\dots,\mu(h_{n}))\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{k}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{l}d\mu\bigg{)}ds$
		$\displaystyle+M_{t}^{F},$

with the martingale $M^{F}$ given as

M_{t}^{F}:=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\bigg{[}\sum_{l=1}^{n}(\partial_{l}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\int_{\mathbb{R}^{d}}\sigma^{\alpha}\cdot\nabla h_{l}d\mu_{s}\bigg{]}dW_{s}^{\alpha}.

Since $M^{F}_{0}=0$ $\mathbb{P}$ -a.s., integrating with respect to $\mathbb{P}$ and defining the curve of measures in $\mathbb{P}(\mathcal{SP})$

\Gamma_{t}:=\mathbb{P}\circ\mu_{t}^{-1},\,\,t\leq T

yields

		$\displaystyle\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}(\mu)-\int_{\mathcal{SP}}F(\mu)d\Gamma_{0}(\mu)=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}d\Gamma_{s}(\mu)ds$
		$\displaystyle+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\int_{\mathcal{SP}}\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{k}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma^{\alpha}(s,\mu)\cdot\nabla h_{l}d\mu\bigg{)}d\Gamma_{s}(\mu)ds.$		(22)

As for the first-order term, which is interpreted as the pairing of the gradient $\nabla^{\mathcal{SP}}F$ with the inhomogeneous vector field $b+a\nabla$ in the tangent bundle $T\mathcal{SP}$ , also the second-order term allows for a geometric interpretation: Recall that for a smooth, real function $F$ on a Riemannian manifold $M$ with tangent bundle $TM$ , the Hessian $Hess(F)_{p}$ at $p\in M$ is a bilinear form on $T_{p}M$ with

Hess(F)_{p}(\eta_{p},\xi_{p})=\big{\langle}\nabla^{L}_{\eta_{p}}\nabla F(p),\xi_{p}\big{\rangle}_{T_{p}M},\,\,\eta_{p},\xi_{p}\in T_{p}M,

(23)

where $\nabla^{L}:TM\times TM\to TM$ denotes the Levi-Civita-connection on $M$ , the unique affine connection compatible with the metric tensor on $M$ and $\nabla$ denotes the usual gradient on $M$ . Intuitively, $\nabla^{L}_{\eta_{p}}\nabla F(p)$ denotes the change of the vector field $\nabla F$ in direction $\eta_{p}$ at $p$ . Recall that we consider $\mathcal{SP}$ as a manifold-like space with gradient $\nabla^{\mathcal{SP}}$ and that hence the reasonable notion of the Levi-Civita connection $\nabla^{L,\mathcal{SP}}$ on $\mathcal{SP}$ for $\sigma\in T_{\mu}\mathcal{SP}=L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu),Y\in T\mathcal{SP}$ at $\mu$ is given by

\nabla^{L,\mathcal{SP}}_{\sigma}Y(\mu)=\big{\langle}\nabla^{\mathcal{SP}}Y(\mu),\sigma\big{\rangle}_{T_{\mu}\mathcal{SP}},

whenver $\nabla^{\mathcal{SP}}Y$ is defined in $T\mathcal{SP}$ . For the representation of $Hess(F)$ for a test function $F\in\mathcal{F}C^{2}_{b}(\mathcal{H})$ , we need to set $Y=\nabla^{\mathcal{SP}}F$ . In this case, we can indeed make sense of

(\nabla^{\mathcal{SP}})^{2}F:=\nabla^{\mathcal{SP}}\nabla^{\mathcal{SP}}F,

because the gradient

\mu\mapsto\nabla^{\mathcal{SP}}F(\mu)=\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\nabla h_{k}

is a linear combinations of the ” $\mathcal{F}C^{2}_{b}(\mathcal{H})$ -like” functions $\mu\mapsto\partial_{k}f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}$ . The linear combination has to be understood in an $x$ -wise sense with coefficient functions $\nabla h_{k}$ , which are independent of the variable of interest $\mu$ . Denoting $F_{k}(\mu):=(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}$ , we then define

(\nabla^{\mathcal{SP}})^{2}F(\mu)(x,y):=\sum_{k=1}^{n}\big{(}\nabla^{\mathcal{SP}}F_{k}(\mu)\big{)}(y)\nabla h_{k}(x),\,\,(x,y)\in\mathbb{R}^{d}\times\mathbb{R}^{d}.

(24)

Consequently, we have a reasonable notion of the Levi-Civita connection on $\mathcal{SP}$ at $\mu$ for $\sigma\in T_{\mu}\mathcal{SP}$ and $\nabla^{\mathcal{SP}}F$ for $F\in\mathcal{F}C^{2}_{b}(\mathcal{H})$ as

\nabla^{L,\mathcal{SP}}_{\sigma}\nabla^{\mathcal{SP}}F(\mu):=\big{\langle}(\nabla^{\mathcal{SP}})^{2}F(\mu),\sigma\big{\rangle}_{T_{\mu}\mathcal{SP}}=\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\nabla h_{k}\bigg{(}\int_{\mathbb{R}^{d}}\sigma\cdot\nabla h_{l}d\mu\bigg{)}.

(25)

The section $(\nabla^{\mathcal{SP}})^{2}F$ in $T\mathcal{SP}^{*}\otimes T\mathcal{SP}^{*}$ (and hence $\nabla^{L,\mathcal{SP}}_{\sigma}\nabla^{\mathcal{SP}}F$ and $Hess(F)$ below) is independent of the particular representation of $F$ in (24). Indeed, we have (c.f. Appendix A [17]) for

\gamma^{\sigma}_{\mu}(t):=\mu\circ(\text{Id}+t\sigma)^{-1}

the following pointwise (in $x\in\mathbb{R}^{d}$ ) equality for each $\mu\in\mathcal{SP},\sigma\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$

	$\displaystyle\frac{d}{dt}\nabla^{\mathcal{SP}}F\big{(}\gamma^{\sigma}_{\mu}(t)\big{)}$	$\displaystyle=\sum_{k=1}^{n}\bigg{[}\frac{d}{dt}(\partial_{k}f)\big{(}\gamma^{\sigma}_{\mu}(t)(h_{1}),\dots,\gamma^{\sigma}_{\mu}(t)(h_{n})\big{)}\bigg{]}\nabla h_{k}$
		$\displaystyle=\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\big{\langle}\nabla h_{l},\sigma\big{\rangle}_{L^{2}(\mu)}\nabla h_{k}$
		$\displaystyle=\big{\langle}(\nabla^{\mathcal{SP}})^{2}F(\mu),\sigma\big{\rangle}_{L^{2}(\mu)}.$

Since the gradient $\nabla^{\mathcal{SP}}F$ is independent of the particular representation of $F$ and $\sigma\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$ is arbitrary, also $(\nabla^{\mathcal{SP}})^{2}F$ is independent of the representation of $F$ .

Considering (23), we then set for $F\in\mathcal{F}C^{2}_{b}(\mathcal{H})$ and $\sigma,\tilde{\sigma}\in L^{2}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)$

Hess(F)(\mu):(\sigma,\tilde{\sigma})\mapsto\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\bigg{(}\int_{\mathbb{R}^{d}}\sigma\cdot\nabla h_{l}d\mu\bigg{)}\bigg{(}\int_{\mathbb{R}^{d}}\tilde{\sigma}\cdot\nabla h_{k}d\mu\bigg{)},

(26)

which is a (symmetric) bilinear form on $T_{\mu}\mathcal{SP}$ and rewrite (4) as

\displaystyle\int_{\mathcal{SP}}Fd\Gamma_{t}-\int_{\mathcal{SP}}Fd\Gamma_{0}=\int_{0}^{t}\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F,b_{s}+a_{s}\nabla\big{\rangle}_{L^{2}}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)(\sigma_{s}^{\alpha},\sigma_{s}^{\alpha})d\Gamma_{s}ds

(27)

(with $b_{s}:(\mu,x)\mapsto b(s,\mu,x)$ and similarly for $a_{s}$ and $\sigma_{s}$ ). Introducing the second-order operator $\mathbf{L}^{(2)}$ , acting on $F\in\mathcal{F}C^{2}_{b}(\mathcal{H})$ via

\mathbf{L}^{(2)}_{t}F(\mu)=\big{\langle}\nabla^{\mathcal{SP}}F,b(t,\mu)+a(t,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)\big{(}\sigma^{\alpha}(t,\mu),\sigma^{\alpha}(t,\mu)\big{)},

we arrive at the distributional formulation of ( $\mathcal{SP}$ -FPK)

\partial_{t}\Gamma_{t}=(\mathbf{L}_{t}^{(2)})^{*}\Gamma_{t},\,\,t\leq T,

as in the introduction.

Remark 4.5.

Equation ( $\mathcal{SP}$ -CE) is the natural analogue to second-order FPK-equations over Euclidean spaces. Indeed, for a stochastic equation on $\mathbb{R}^{d}$

dX_{t}=b(t,X_{t})dt+\sigma(t,X_{t})dW_{t},

(28)

by Itô’s formula, the corresponding linear second-order equation for measures in distributional form is

\partial_{t}\mu_{t}=\big{(}\mathcal{L}^{(2)}_{t}\big{)}^{*}\mu_{t}

with

\mathcal{L}^{(2)}_{t}f=\nabla f\cdot b_{t}+\frac{1}{2}\big{\langle}\sigma_{t},Hess(f)\sigma_{t}\big{\rangle}_{\mathbb{R}^{d}},

where $Hess(f)$ denotes the usual Euclidean Hessian matrix of $f\in C^{2}(\mathbb{R}^{d})$ . In this spirit, it seems natural to consider (SNLFPK) as a stochastic equation with state space $\mathcal{SP}$ instead of $\mathbb{R}^{d}$ as for (28) and ( $\mathcal{SP}$ -CE) as the corresponding linear Fokker-Planck-type equation on $\mathcal{SP}$ .

By the above derivation, for any subprobability solution process $(\mu_{t})_{t\leq T}$ to (SNLFPK) the curve $(\Gamma_{t})_{t\leq T}$ , $\Gamma_{t}:=\mathbb{P}\circ\mu_{t}^{-1}$ in $\mathcal{P}(\mathcal{SP})$ solves ( $\mathcal{SP}$ -CE) in the sense of the following definition.

Definition 4.6.

A weakly continuous curve $(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP})$ is a solution to (SNLFPK), if the integrability condition

\displaystyle\int_{0}^{T}\int_{\mathcal{SP}}||b(t,\mu)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d};\mu)}+||a(t,\mu)||_{L^{1}(\mathbb{R}^{d},\mathbb{R}^{d^{2}};\mu)}+||\sigma(t,\mu)||^{2}_{L^{2}(\mathbb{R}^{d},\mathbb{R}^{d\times d_{1}};\mu)}d\Gamma_{t}(\mu)dt<\infty

(29)

is fulfilled and for each $F\in\mathcal{F}C^{2}_{b}(\mathcal{H})$ , 27 holds for each $t\in[0,T]$ .

Transferring (SNLFPK) and ( $\mathcal{SP}$ -FPK) to $\ell^{2}$

Reminiscent to the deterministic case, we use the global chart $H:\mathcal{SP}\to\ell^{2}$ to introduce auxiliary equations on $\ell^{2}$ and the space of measures on $\ell^{2}$ , respectively, as follows. Again, we use the notation

A_{t}:=\bigg{\{}\mu\in\mathcal{SP}:\int_{\mathbb{R}^{d}}|a_{ij}(t,\mu,x)|+|b_{i}(t,\mu,x)|d\mu(x)<\infty\,\,\forall\,1\leq i,j\leq d\bigg{\}},\quad t\in[0,T].

For $i,j\geq 1$ , $\alpha\leq d_{1}$ , define the measurable coefficients $B_{i}$ for $(t,\mu)$ such that $\mu\in A_{t}$ , and $\Sigma^{\alpha}_{i}$ and $A_{ij}$ on $[0,T]\times\mathcal{SP}$ by

	$\displaystyle B_{i}(t,\mu)$	$\displaystyle:=\int_{\mathbb{R}^{d}}\mathcal{L}_{t,\mu}h_{i}(x)d\mu(x),\quad(t,\mu)\in[0,T]\times A_{t},$
	$\displaystyle\Sigma^{\alpha}_{i}(t,\mu)$	$\displaystyle:=\int_{\mathbb{R}^{d}}\sigma^{\alpha}(t,\mu,x)\cdot\nabla h_{i}(x)d\mu(x),$
	$\displaystyle\Sigma_{i}(t,\mu)$	$\displaystyle:=\big{(}\Sigma^{\alpha}_{i}(t,\mu)\big{)}_{\alpha\leq d_{1}},$
	$\displaystyle A_{ij}(t,\mu)$	$\displaystyle:=\big{\langle}\Sigma_{i},\Sigma_{j}\big{\rangle}_{d_{1}}(t,\mu),$

and set

B:=(B_{i})_{i\geq 1},\,\Sigma:=(\Sigma^{\alpha}_{i})_{\alpha\leq d_{1},i\geq 1},\,A:=(A_{ij})_{i,j\geq 1}.

Now, transferring to $\ell^{2}$ , define $\bar{B},\bar{\Sigma}$ and $\bar{A}_{ij}$ on $[0,T]\times\ell^{2}$ component-wise via

\displaystyle\bar{B}_{i}(t,z):=\begin{cases}B_{i}(t,H^{-1}(z))&,z\in H(A_{t})\\ 0&,\text{else}\end{cases},

\displaystyle\bar{\Sigma}^{\alpha}_{i}(t,z):=\begin{cases}\Sigma_{i}^{\alpha}(t,H^{-1}(z))&,z\in H(\mathcal{SP})\\ 0&,z\in\ell^{2}\backslash H(\mathcal{SP})\end{cases},

\bar{\Sigma}_{i}(t,z):=\big{(}\bar{\Sigma}^{\alpha}_{i}(t,z)\big{)}_{\alpha\leq d_{1}},

\bar{A}_{ij}(t,z):=\big{\langle}\bar{\Sigma}_{i},\bar{\Sigma}_{j}\big{\rangle}_{d_{1}}(t,z).

$\bar{B}$ and $\bar{\Sigma}^{\alpha}$ are $\ell^{2}$ -valued, since for $z=H(\mu)$

|\bar{B}_{i}(t,z)|\leq\int_{\mathbb{R}^{d}}|\mathcal{L}_{t,\mu}h_{i}(x)|d\mu(x)\leq C2^{-i},

where $C=C(a,b,d)$ is a finite constant independent of $t,z$ and $i\geq 1$ . A similar argument is valid for each $\bar{\Sigma}^{\alpha}$ . Each $\bar{B}_{i}$ and $\bar{\Sigma}^{\alpha}_{i}$ is product-measurable with respect to the $\ell^{2}$ -topology due to the measurability of $B$ and $\Sigma^{\alpha}$ . Reminiscent to ( $\mathbb{R}^{\infty}$ -CE) in the previous section, we associate to ( $\mathcal{SP}$ -FPK) the FPK-equation on $\ell^{2}$

\partial_{t}\bar{\Gamma}_{t}=-\bar{\nabla}\cdot(\bar{B}(t,z)\bar{\Gamma}_{t})+\partial^{2}_{ij}(\bar{A}_{ij}(t,z)\bar{\Gamma}_{t}),

(

\ell^{2}

-FPK)

which we understand in the sense of the following definition, with $\bar{\nabla}$ as in (13). Subsequently, we denote by $\mathcal{F}C^{2}_{b}(\ell^{2})$ the set of all maps $\bar{F}:\ell^{2}\to\mathbb{R}$ of type $\bar{F}=f\circ\pi_{n}$ for $n\geq 1$ and $f\in C^{2}_{b}(\mathbb{R}^{n})$ . Also, set

D^{2}\bar{F}_{ij}:=\begin{cases}(\partial^{2}_{ij}f)\circ\pi_{n}&,i,j\leq n\\ 0&,\text{ else}.\end{cases}

Consequently, both summands in (31) contain only finitely many non-trivial summands.

Definition 4.7.

A weakly continuous curve $(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\ell^{2})$ is a solution to ( $\ell^{2}$ -FPK), if it fulfills the integrability condition

\int_{0}^{T}\int_{\ell^{2}}|\bar{B}_{i}(t,z)|+|\bar{A}_{ij}(t,z)|d\bar{\Gamma}_{t}dt<\infty,\quad\forall\,i,j\geq 1,

(30)

and for any $\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2})$ , $\bar{F}:=f\circ\pi_{n}$ ,

\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{t}(z)=\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{0}(z)+\int_{0}^{t}\int_{\ell^{2}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)+\frac{1}{2}D^{2}\bar{F}(z):\bar{A}(s,z)d\bar{\Gamma}_{s}(z)ds.

(31)

holds for each $t\leq T$ .

4.1 Main Result: Stochastic case

The main result of this section is the following superposition principle for solutions to (SNLFPK) and ( $\mathcal{SP}$ -FPK), which generalizes Theorem 3.7 to stochastically perturbed equations.

Theorem 4.8.

Let $\sigma$ be bounded on $[0,T]\times\mathcal{SP}\times\mathbb{R}^{d}$ . Let $(\Gamma_{t})_{t\leq T}$ be a weakly continuous solution to ( $\mathcal{SP}$ -FPK). Then, there exists a complete filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P})$ , an adapted $d_{1}$ -dimensional Wiener process $W=(W_{t})_{t\leq T}$ and a $\mathcal{SP}$ -valued adapted vaguely continuous process $\mu=(\mu_{t})_{t\leq T}$ such that $(\mu,W)$ solves (SNLFPK) and

\mathbb{P}\circ\mu_{t}^{-1}=\Gamma_{t}

holds for each $t\in[0,T]$ .
Moreover, if $\Gamma_{0}$ is concentrated on $\mathcal{P}$ , i.e. $\Gamma_{0}(\mathcal{P})=1$ , then the paths $t\mapsto\mu_{t}(\omega)$ are $\mathcal{P}$ -valued for $\mathbb{P}$ -a.e. $\omega\in\Omega$ and hence even weakly continuous.

As in the proof of 3.7, we proceed in three steps. Since parts of the proof are technically more involved than in the deterministic case, we first present the ingredients of each step and afterwards state the proof of Theorem 4.8 as a corollary.

Step 1: From ( $\mathcal{SP}$ -FPK) to ( $\ell^{2}$ -FPK):

Lemma 4.9.

For any solution $(\Gamma_{t})_{t\leq T}$ to ( $\mathcal{SP}$ -FPK), the curve $\bar{\Gamma}_{t}=\Gamma_{t}\circ H^{-1}$ is a solution to ( $\ell^{2}$ -FPK).

Proof.

Clearly, $t\mapsto\bar{\Gamma}_{t}$ is a weakly continuous curve in $\mathcal{P}(\ell^{2})$ due to the continuity of $H:\mathcal{SP}\to\ell^{2}$ . (30) holds, since $t\mapsto\Gamma_{t}$ fulfills (29) and since $\sigma$ is bounded. Moreover, we have for $s,t\leq T$ , $\bar{F}=f\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\ell^{2})$ and $F:\mu\mapsto f\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}$

	$\displaystyle\int_{\ell^{2}}\bar{\nabla}\bar{F}(z)\cdot\bar{B}(s,z)+\frac{1}{2}D^{2}\bar{F}(z):\bar{A}(s,z)d\bar{\Gamma}_{s}(z)$
	$\displaystyle=\int_{\mathcal{SP}}\sum_{k=1}^{n}(\partial_{k}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}B_{k}(s,\mu)+\sum_{\alpha=1}^{d_{1}}\frac{1}{2}\sum_{k,l=1}^{n}(\partial_{kl}f)\big{(}\mu(h_{1}),\dots,\mu(h_{n})\big{)}\Sigma_{k}^{\alpha}(s,\mu)\Sigma_{l}^{\alpha}(s,\mu)d\Gamma_{s}(\mu)$
	$\displaystyle=\int_{\mathcal{SP}}\big{\langle}\nabla^{\mathcal{SP}}F(\mu),b(s,\mu)+a(s,\mu)\nabla\big{\rangle}_{L^{2}(\mu)}+\frac{1}{2}\sum_{\alpha=1}^{d_{1}}Hess(F)\big{(}\sigma^{\alpha}(s,\mu),\sigma^{\alpha}(s,\mu)\big{)}d\Gamma_{s}(\mu)$

and likewise

\int_{\ell^{2}}\bar{F}(z)d\bar{\Gamma}_{t}=\int_{\mathcal{SP}}F(\mu)d\Gamma_{t}.

Comparing with (27), the statement follows. ∎

Step 2: From ( $\ell^{2}$ -FPK) to the martingale problem ( $\ell^{2}$ -MGP): We introduce a martingale problem on $\ell^{2}$ , which is related to ( $\ell^{2}$ -FPK) in the sense of Remark 4.11 below and is, roughly speaking, the stochastic analogue to ( $\mathbb{R}^{\infty}$ -ODE) from the previous section. Recall the notation $e_{t}$ for the projection $e_{t}:C_{T}\ell^{2}\to\ell^{2}$ , $e_{t}:\gamma\mapsto\gamma_{t}$ for $t\leq T$ .

Definition 4.10.

A measure $\bar{Q}\in\mathcal{P}(C_{T}\ell^{2})$ is a solution to the $\ell^{2}$ -martingale problem ( $\ell^{2}$ -MGP), provided

\int_{C_{T}\ell^{2}}\int_{0}^{T}|\bar{B}_{i}(t,e_{t})|+|\bar{A}_{ij}(t,e_{t})|dtd\bar{Q}<\infty,\quad i,j\geq 1,

(32)

and

\bar{F}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}\circ e_{s}:\bar{A}(s,e_{s})ds

(33)

is a $\bar{Q}$ -martingale on $C_{T}\ell^{2}$ with respect to the natural filtration on $C_{T}\ell^{2}$ for any $\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2})$ .

Remark 4.11.

By construction, any such solution $\bar{Q}$ induces a weakly continuous solution $(\bar{\Gamma}_{t})_{t\leq T}$ to ( $\ell^{2}$ -FPK) via $\bar{\Gamma}_{t}:=\bar{Q}\circ e_{t}^{-1}$ . Indeed, this is readily seen by integrating (33) with respect to $\bar{Q}$ and Fubini’s theorem.

In view of Proposition 4.13 below, we extend the coefficients $\bar{B}_{i},\bar{\Sigma}^{\alpha}_{i}$ (and hence also $\bar{A}_{ij}$ ) to $\mathbb{R}^{\infty}$ via

\bar{B}_{i}:=0=:\bar{\Sigma}^{\alpha}_{i}\text{ on }[0,T]\times\mathbb{R}^{\infty}\backslash\ell^{2}.

We still use the notation $\bar{B}$ , $\bar{\Sigma}^{\alpha}$ and $\bar{A}$ and note that they are $\mathcal{B}([0,T])\otimes\mathcal{B}(\mathbb{R}^{\infty})/\mathcal{B}(\mathbb{R}^{\infty})$ -measurable due to Remark 4.12 below. Due to the same remark, we may regard any solution $(\bar{\Gamma}_{t})_{t\leq T}$ to ( $\ell^{2}$ -FPK) as a solution to a FPK-equation on $\mathbb{R}^{\infty}$ by considering ( $\ell^{2}$ -FPK) with the extended coefficients and test functions $\bar{F}\in\mathcal{F}C^{2}_{b}(\ell^{2})$ extended to $\mathbb{R}^{\infty}$ by considering $\pi_{n}$ on $\mathbb{R}^{\infty}$ instead of $\ell^{2}$ . Similarly, the formulation of the martingale problem ( $\ell^{2}$ -MGP) as in Definition 4.10 extends to $\mathbb{R}^{\infty}$ in the sense that a measure $\bar{Q}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ is understood as a solution, provided the process (33) is a $\bar{Q}$ -martingale on $C_{T}\mathbb{R}^{\infty}$ with respect to the natural filtration for each $\bar{F}=f\circ\pi_{n}:\mathbb{R}^{\infty}\to\mathbb{R}$ as above.

Remark 4.12.

We recall that $\ell^{2}\in\mathcal{B}(\mathbb{R}^{\infty})$ and $\mathcal{B}(\ell^{2})=\mathcal{B}(\mathbb{R}^{\infty})_{\upharpoonright\ell^{2}}$ . In particular, any probability measure $\bar{\Gamma}\in\mathcal{P}(\ell^{2})$ uniquely extends to a Borel probability measure on $\mathbb{R}^{\infty}$ via $\bar{\Gamma}(A):=\bar{\Gamma}(A\cap\ell^{2})$ , $A\in\mathcal{B}(\mathbb{R}^{\infty})$ .

We shall need the following superposition principle from [20], which lifts a solution to a FPK-equation on $\mathbb{R}^{\infty}$ to a solution of the associated martingale problem. Note that in [20], the author assumes an integrability condition of order $p>1$ instead of $p=1$ as in (30) in order to essentially reduce the proof to the corresponding finite-dimensional result, see [20, Thm.2.14], which requires such a higher order integrability. However, since the latter result was extended to the case of an $L^{1}$ -integrability condition by the same author [21, Thm.2.5], it is easy to see that also the infinite-dimensional result [20, Thm.7.1] holds for solutions with $L^{1}$ -integrability as in (30).

Proposition 4.13.

[Superposition principle on $\mathbb{R}^{\infty}$ , Thm.7.1. [20] For any weakly continuous solution $(\bar{\Gamma}_{t})_{t\leq T}\subseteq\mathcal{P}(\mathbb{R}^{\infty})$ to the $\mathbb{R}^{\infty}$ -extended version of ( $\ell^{2}$ -FPK), there exists $\bar{Q}\in\mathcal{P}(C_{T}\mathbb{R}^{\infty})$ , which solves the $\mathbb{R}^{\infty}$ -extended version of ( $\ell^{2}$ -MGP) such that $\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t}$ for each $t\in[0,T]$ .

Moreover, we have the following consequence for the solutions we are interested in. Note that paths $t\mapsto z_{t}\in H(\mathcal{SP})$ are continuous with respect to the product topology if and only if they are $\ell^{2}$ -continuous. Hence, we may use the notation $C_{T}H(\mathcal{SP})$ unambiguously and consider it as a subset of either $C_{T}\mathbb{R}^{\infty}$ or $C_{t}\ell^{2}$ . Since $H(\mathcal{SP})\subseteq\ell^{2}$ is closed even with respect to the product topology, $C_{T}H(\mathcal{SP})$ belongs to $\mathcal{B}(C_{T}\ell^{2})$ and $\mathcal{B}(C_{T}\mathbb{R}^{\infty})$ .

Lemma 4.14.

If in the situation of the previous proposition each $\bar{\Gamma}_{t}$ is concentrated on the Borel set $H(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ , then $\bar{Q}$ is concentrated on continuous curves in $H(\mathcal{SP})$ . In particular, in this case $\bar{Q}$ may be regarded as an element of $\mathcal{P}(C_{T}\ell^{2})$ and a solution to the martingale problem ( $\ell^{2}$ -MGP) as in Definition 4.10.

Proof.

The closedness of $H(\mathcal{SP})\subseteq\mathbb{R}^{\infty}$ yields

\bar{Q}(C_{T}H(\mathcal{SP}))=\bar{Q}\bigg{(}\underset{q\in[0,T]\cap\mathbb{Q}}{\bigcap}\{e_{q}\in H(\mathcal{SP})\}\bigg{)}=1,

due to $\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t}$ for each $t\leq T$ . By the observation above this lemma, it follows

\mathcal{B}(C_{T}\ell^{2})_{\upharpoonright C_{T}H(\mathcal{SP})}\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty})_{\upharpoonright C_{T}H(\mathcal{SP})}\subseteq\mathcal{B}(C_{T}\mathbb{R}^{\infty})

and we can therefore consider $\bar{Q}$ as a probability measure on $\mathcal{B}(C_{T}\ell^{2})$ via

\bar{Q}(A):=\bar{Q}\big{(}A\cap C_{T}H(\mathcal{SP})\big{)},\,\,A\in\mathcal{B}(C_{T}\ell^{2})

with mass on $C_{T}H(\mathcal{SP})$ . It is clear that this measure fulfills Definition 4.10. ∎

Hence, subsequently we may regard to $\bar{Q}$ as in Proposition 4.13 as a solution to ( $\ell^{2}$ -MGP) on either $\mathbb{R}^{\infty}$ or $\ell^{2}$ without differing the notation. Recall the notation $p_{i}:\ell^{2}\to\mathbb{R}$ , $p_{i}(z)=z_{i}.$

Lemma 4.15.

Let $\bar{Q}$ be a solution to the martingale problem ( $\ell^{2}$ -MGP) on $\ell^{2}$ . Then, for any $i\geq 1$ , the process

M_{i}(t):=p_{i}\circ e_{t}-p_{i}\circ e_{0}-\int_{0}^{t}\bar{B}_{i}(s,e_{s})ds

(34)

is a real-valued, continuous $\bar{Q}$ -martingale on $C_{T}\ell^{2}$ with respect to the canonical filtration. The covariation $\langle\langle M_{i},M_{j}\rangle\rangle$ of $M_{i}$ and $M_{j}$ is $\bar{Q}$ -a.s. given by

\langle\langle M_{i},M_{j}\rangle\rangle_{t}=\int_{0}^{t}\bar{A}_{ij}(s,e_{s})ds,\,\,t\in[0,T].

(35)

Proof.

For $i,j\geq 1$ , let $n\geq\text{max}(i,j)$ , consider $p^{n}_{i}:\mathbb{R}^{n}\to\mathbb{R}$ , $p^{n}_{i}(x)=x_{i}$ and let

\bar{F}^{n}_{i}:\ell^{2}\to\mathbb{R},\,\,\bar{F}^{n}_{i}(z)=p^{n}_{i}\circ\pi_{n}(z).

Note that $\bar{F}^{n}_{i}=p_{i}$ on $\ell^{2}$ , independent of $n\geq\text{max}(i,j)$ . For $k\geq 1$ , introduce the stopping time $\sigma_{k}:=\text{inf}\{t\in[0,T]:||e_{t}||_{\ell^{2}}\geq k\}$ with respect to the canonical filtration on $C_{T}\ell^{2}$ . Clearly, $\sigma_{k}\nearrow+\infty$ pointwise. Consider $\eta_{k}\in C^{2}_{c}(\mathbb{R}^{n})$ such that $\eta_{k}(x)=1$ for $|x|\leq k+1$ .
Since $\partial_{k}p^{n}_{i}=\delta_{ki}$ and $\partial_{kl}p^{n}_{i}=0$ for $k,l\leq n$ , we have

M_{i}(t)=\bar{F}^{n}_{i}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}^{n}_{i}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}^{n}_{i}\circ e_{s}:\bar{A}(s,e_{s})ds

and, setting $\bar{F}^{n,k}_{i}:=(\eta_{k}p^{n}_{i})\circ\pi_{n}\in\mathcal{F}C^{2}_{b}(\ell^{2})$ ,

M_{i}(\sigma_{k}\wedge t)=\bar{F}^{n,k}_{i}\circ e_{t\wedge\sigma_{k}}-\int_{0}^{t\wedge\sigma_{k}}\bar{\nabla}\bar{F}^{n,k}_{i}\circ e_{s}\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}^{n,k}_{i}\circ e_{s}:\bar{A}(s,e_{s})ds.

Since the latter is a continuous $\bar{Q}$ -martingale for each $k\geq 1$ , it follows that $M_{i}$ is a continuous local $\bar{Q}$ -martingale. Concerning (35), it suffices to prove that for any $\bar{F}\in\mathcal{F}C^{2}_{b}(\mathcal{\ell}^{2})$ , $\bar{F}=f\circ\pi_{n}$ , we have

\langle\langle M^{\bar{F}}\rangle\rangle_{t}=\int_{0}^{t}\big{\langle}\bar{\nabla}\bar{F}(e_{s}),\bar{A}(s,e_{s})\bar{\nabla}\bar{F}(e_{s})ds\big{\rangle}_{\ell^{2}}ds,

(36)

with

M^{\bar{F}}_{t}:=\bar{F}\circ e_{t}-\int_{0}^{t}\bar{\nabla}\bar{F}(e_{s})\cdot\bar{B}(s,e_{s})+\frac{1}{2}D^{2}\bar{F}(e_{s}):\bar{A}(s,e_{s})ds.

Indeed, from here (35) follows by considering (36) for $\bar{F}^{n,k}_{i}$ , localization of the local martingale $M_{i}$ and polarization for the quadratic (co-)variation. Concerning (36), it is standard (cf. [19, p.73,74]) to use Itô’s product rule to obtain that

t\mapsto(M^{\bar{F}}_{t})^{2}-\int_{0}^{t}\bar{\mathbf{L}}_{s}^{(2)}\bar{F}^{2}(e_{s})-2\bar{F}(e_{s})\bar{\mathbf{L}}_{s}^{(2)}\bar{F}(e_{s})ds

is a continuous $\bar{Q}$ -martingale on $C_{T}\ell^{2}$ , where we denote by $\bar{\mathbf{L}}_{t}^{(2)}\bar{F}(e_{s})$ the integrand of the integral term in the definition of $M^{\bar{F}}$ . A straightforward calculation yields

\int_{0}^{t}\bar{\mathbf{L}}_{s}^{(2)}\bar{F}^{2}(e_{s})-2\bar{F}(e_{s})\bar{\mathbf{L}}_{s}^{(2)}\bar{F}(e_{s})ds=\int_{0}^{t}\big{\langle}\bar{\nabla}\bar{F}(e_{s}),\bar{A}(s,e_{s})\bar{\nabla}\bar{F}(e_{s})\big{\rangle}_{\ell^{2}}ds,

which completes the proof. ∎

We summarize the results of this step in the following proposition.

Proposition 4.16.

Let $(\bar{\Gamma}_{t})_{t\leq T}$ be a weakly continuous solution to ( $\ell^{2}$ -FPK) such that $\bar{\Gamma}_{t}(H(\mathcal{SP}))=1$ for each $t\in[0,T]$ . Then, there exists a solution $\bar{Q}\in\mathcal{P}(C_{T}\ell^{2})$ to the martingale problem ( $\ell^{2}$ -MGP) such that $\bar{Q}$ is concentrated on $C_{T}H(\mathcal{SP})$ with $\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t}$ for each $t\in[0,T]$ . Further, the results of Lemma 4.15 apply to $\bar{Q}$ .

Step 3: From ( $\ell^{2}$ -MGP) to (SNLFPK): For a given solution $\bar{Q}\in\mathcal{P}(C_{T}\ell^{2})$ to ( $\ell^{2}$ -MGP), set

\mathcal{C}:=\mathcal{B}(C_{T}\ell^{2})\bigvee\mathcal{N}_{\bar{Q}}

and

\mathcal{C}_{t}:=\sigma(e_{s},s\leq t)\bigvee\mathcal{N}_{\bar{Q}}

for $t\leq T$ , where $\mathcal{N}_{\bar{Q}}$ denotes the collection of all subsets of sets $N\in\mathcal{B}(C_{T}\ell^{2})$ with $\bar{Q}(N)=0$ . Of course, $\mathcal{C}$ and $\mathcal{C}_{t}$ depend on $\bar{Q}$ , but we suppress this dependence in the notation. Without further mentioning, we understand such $\bar{Q}$ as extended to $\mathcal{C}$ in the canonical way. Then, $(C_{T}\ell^{2},\mathcal{C},(\mathcal{C}_{t})_{t\leq T},\bar{Q})$ is a complete filtered probability space. Clearly, $(t,\gamma)\mapsto\bar{\Sigma}(t,e_{t}(\gamma))$ is $\mathcal{C}_{t}$ -progressively measurable from $[0,T]\times C_{T}\ell^{2}$ to $L(\mathbb{R}^{d_{1}},\ell^{2})$ , the space of bounded linear operators from $\mathbb{R}^{d_{1}}$ to $\ell^{2}$ .

Remark 4.17.

We extend $(C_{T}\ell^{2},\mathcal{C},(\mathcal{C}_{t})_{t\leq T},\bar{Q})$ as follows. Let $(\Omega^{\prime},\mathcal{F}^{\prime\prime},(\mathcal{F}^{\prime\prime}_{t})_{t\leq T},P)$ be a complete filtered probability space with a real-valued $\mathcal{F}^{\prime\prime}_{t}$ -Wiener process $\beta$ on it, define

\Omega:=C_{T}\ell^{2}\otimes\underset{l\geq 1}{\bigotimes}\Omega^{\prime},\,\,\mathcal{F}^{\prime}:=\mathcal{C}\otimes\underset{l\geq 1}{\bigotimes}\mathcal{F}^{\prime\prime},\,\,\mathcal{F}_{t}^{\prime}:=\mathcal{C}_{t}\otimes\underset{l\geq 1}{\bigotimes}\mathcal{F}^{\prime\prime}_{t},\,\,\mathbb{P}^{\prime}:=\bar{Q}\otimes\underset{l\geq 1}{\bigotimes}P,

let $\mathcal{F}$ and $\mathcal{F}_{t}$ be the $\mathbb{P}^{\prime}$ -completion of $\mathcal{F}^{\prime}$ and $\mathcal{F}_{t}^{\prime}$ , respectively, and denote the canonical extension of $\mathbb{P}^{\prime}$ to $\mathcal{F}$ by $\mathbb{P}$ . Further, we denote the Wiener process $\beta$ on the $i$ -th copy of $\Omega^{\prime}$ by $\beta_{i}$ and extend each $\beta_{i}$ to $\Omega$ by $\beta_{i}(\omega):=\beta_{i}(\omega_{i})$ for $\omega=\gamma\times(\omega_{i})_{i\geq 1}\in\Omega$ . Similarly, we extend each projection $e_{t}$ from $C_{T}\ell^{2}$ to $\Omega$ via $e_{t}(\omega):=e_{t}(\gamma)$ for $\omega$ as above, but keep the same notation for this extended process. Obviously, $(e_{t})_{t\leq t}$ is a continuous, $\mathcal{F}_{t}$ -adapted process on $\Omega$ and each $\beta_{i}$ is an $\mathcal{F}_{t}$ -Wiener process on $\Omega$ under $\mathbb{P}$ . Moreover, $(e_{t})_{t\leq T}$ and $(\beta_{i})_{i\geq 1}$ are independent on $\Omega$ with respect to $\mathbb{P}$ by construction. Further, it is clear that the process $M_{i}$ as in (34) is a $\mathbb{P}$ -martingale with respect to $\mathcal{F}_{t}$ for each $i\geq 1$ with covariation as in (35) and that $(t,\gamma)\mapsto\bar{\Sigma}(t,e_{t}(\gamma))\in L(\mathbb{R}^{d_{1}},\ell^{2})$ is $\mathcal{F}_{t}$ -progressively measurable on $[0,T]\times\Omega$ .

Finally, we need the following result, which is a special case of Theorem 2, [16].

Proposition 4.18.

Let $\bar{Q}\in\mathcal{P}(C_{T}\ell^{2})$ be a solution to the martingale problem ( $\ell^{2}$ -MGP). Then, there exists a complete filtered probability space with an adapted $d_{1}$ -dimensional Wiener process $W=(W^{\alpha})_{\alpha\leq d_{1}}$ and an $\ell^{2}$ -valued adapted continuous process $X=(X_{t})_{t\leq T}$ such that the law of $X$ on $C_{T}\ell^{2}$ is $\bar{Q}$ and for $i\geq 1$ and $t\in[0,T]$ , we have a.s.

p_{i}\circ X_{t}-p_{i}\circ X_{0}-\int_{0}^{t}\bar{B}_{i}(s,X_{s})ds=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\bar{\Sigma}^{\alpha}_{i}(s,X_{s})dW_{s}^{\alpha}

(37)

and the exceptional set can be chosen independent of $t$ and $i$ .

To see this, consider Theorem 2 of [16] with $X=\ell^{2}$ , $U_{0}=\mathbb{R}^{d_{1}}$ , $D=\{p_{i},i\geq 1\}$ , the processes $M(p_{i})$ given by $M_{i}$ as in (34) on the probability space $\Omega$ of Remark 4.17 and

g_{s}=\bar{\Sigma}(s,e_{s})\in L(\mathbb{R}^{d_{1}},\ell^{2}).

These choices fulfill all requirements of [16]. In this case, the $\ell^{2}$ -valued process $X$ is given by $X_{t}=e_{t}$ on $\Omega$ . Since all terms in (37) are continuous in $t$ , the exceptional set may indeed by chosen independently of $t\in[0,T]$ and $i\geq 1$ .

The proof of Theorem 4.8 now follows from the above three step-scheme as follows.

Proof of Theorem 4.8: Let $(\Gamma_{t})_{t\leq T}\subseteq\mathcal{P}(\mathcal{SP})$ be a weakly continuous solution to ( $\mathcal{SP}$ -FPK). By Lemma 4.9 of Step 1, the weakly continuous curve of Borel probability measures on $\ell^{2}$

\bar{\Gamma}_{t}:=\Gamma_{t}\circ H^{-1},\,\,t\in[0,T]

solves ( $\ell^{2}$ -FPK) and each $\bar{\Gamma}_{t}$ is concentrated on $H(\mathcal{SP})$ . By Proposition 4.16 of Step 2, there exists a solution $\bar{Q}\in\mathcal{P}(C_{T}\ell^{2})$ to the martingale problem ( $\ell^{2}$ -MGP), which is concentrated on $C_{T}H(\mathcal{SP})$ such that

\bar{Q}\circ e_{t}^{-1}=\bar{\Gamma}_{t},\,\,t\in[0,T].

Further, Lemma 4.15 applies to $\bar{Q}$ . By Lemma 4.15 and Proposition 4.18 of Step 3, there is a $d_{1}$ -dimensional $\mathcal{F}_{t}$ -adapted Wiener process $W=(W^{\alpha})_{\alpha\leq d_{1}}$ and an $\mathcal{F}_{t}$ -adapted process $X$ on some complete filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\leq T},\mathbb{P})$ , which fulfill (37) and $X\in C_{T}H(\mathcal{SP})$ $\mathbb{P}$ -a.s. such that $\bar{Q}$ is the law of $X$ .
Possibly redefining $X$ on a $\mathbb{P}$ -negligible set (which preserves (37) and its adaptedness, the latter due to the completeness of the underlying filtered probability space), we may assume $X_{t}(\omega)=H(\mu_{t}(\omega))$ for some $\mu_{t}(\omega)\in\mathcal{SP}$ for each $(t,\omega)\in[0,T]\times\Omega$ . The continuity of $H^{-1}:H(\mathcal{SP})\to\mathcal{SP}$ and $t\mapsto X_{t}(\omega)$ implies vague continuity of

t\mapsto\mu_{t}(\omega)=H^{-1}\circ X_{t}(\omega)

(38)

for each $\omega\in\Omega$ and $\mathcal{F}_{t}$ -adaptedness of the $\mathcal{SP}$ -valued process $(\mu_{t})_{t\leq T}$ . Considering (37), $X_{t}=H(\mu_{t})$ and the definition of $\bar{B}$ and $\bar{\Sigma}^{\alpha}_{i}$ , we obtain, recalling $p_{i}(H(\nu))=\nu(h_{i})$ for each $\nu\in\mathcal{SP}$ ,

\mu_{t}(h_{i})-\mu_{0}(h_{i})-\int_{0}^{t}B_{i}(s,\mu_{s})ds=\sum_{\alpha=1}^{d_{1}}\int_{0}^{t}\Sigma^{\alpha}_{i}(s,\mu_{s})dW^{\alpha}_{s},\,\,t\leq T

$\mathbb{P}$ -a.s. for each $i\geq 1$ . From here, it follows by Lemma 4.4 (i) that $(\mu_{t})_{t\leq T}$ is a solution to (SNLFPK) as in Definition 4.1. Further,

\mathbb{P}\circ\mu_{t}^{-1}=(\mathbb{P}\circ X_{t}^{-1})\circ(H^{-1})^{-1}=\bar{\Gamma}_{t}\circ(H^{-1})^{-1}=\Gamma_{t}\circ H^{-1}\circ(H^{-1})^{-1}=\Gamma_{t}.

It remains to prove the final assertion of the theorem. To this end, note that $\Gamma_{0}(\mathcal{P})=1$ implies $\bar{\Gamma}_{0}(H(\mathcal{P}))=1$ and hence $\mu_{0}\in\mathcal{P}$ $\mathbb{P}$ -a.s. with $\mu_{0}$ as in (38). From here, the assertion follows by Lemma 4.3. ∎

Remark 4.19.

The particular type of noise we consider for (SNLFPK) was partially motivated by [8], where the natural connection of equations of type (SNLFPK) to interacting particle systems with common noise was investigated. Other types of noise terms may be treated in the future, including a possible extension to infinite-dimensional ones. In particular, Proposition 4.18 via [16, Thm.2] in the final step of the proof seems capable of such extensions, since the latter is an infinite-dimensional result.

References

[1] L. Ambrosio. Transport Equation and Cauchy Problem for Non-Smooth Vector Fields, pages 1–41. Springer Berlin Heidelberg, Berlin, Heidelberg, 2008.
[2] L. Ambrosio and D. Trevisan. Well-posedness of Lagrangian flows and continuity equations in metric measure spaces. Anal. PDE, 7(5):1179–1234, 2014.
[3] V. Barbu and M. Röckner. Probabilistic Representation for Solutions to Nonlinear Fokker–Planck Equations. SIAM Journal on Mathematical Analysis, 50(4):4246–4260, 2018.
[4] V. Barbu and M. Röckner. From nonlinear Fokker–Planck equations to solutions of distribution dependent SDE. Ann. Probab., 48(4):1902–1920, 07 2020.
[5] V.I. Bogachev, N.V. Krylov, M. Röckner, and S.V. Shaposhnikov. Fokker-Planck-Kolmogorov Equations. Mathematical Surveys and Monographs 207. American Mathematical Society, 2015.
[6] V.I. Bogachev, M. Röckner, and S.V. Shaposhnikov. On the Ambrosio–Figalli–Trevisan Superposition Principle for Probability Solutions to Fokker–Planck–Kolmogorov Equations. Journal of Dynamics and Differential Equations, 2020.
[7] K. L. Chung and R. J. Williams. Introduction to stochastic integration. Modern Birkhäuser classics. Birkhäuser/Springer, New York, second edition, 2014.
[8] M. Coghi and B. Gess. Stochastic nonlinear Fokker-Planck equations. arXiv:1904.07894, 2019.
[9] A. Figalli. Existence and uniqueness of martingale solutions for SDEs with rough or degenerate coefficients. Journal of Functional Analysis, 254(1):109 – 153, 2008.
[10] T. Funaki. A certain class of diffusion processes associated with nonlinear parabolic equations. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, 67(3):331–348, 1984.
[11] X. Huang, M. Röckner, and F.-Y. Wang. Nonlinear Fokker–Planck equations for probability measures on path space and path-distribution dependent SDEs. Discrete & Continuous Dynamical Systems - A, 39(6):3017–3035, 2019.
[12] X. Huang and F.-Y. Wang. Distribution dependent SDEs with singular coefficients. Stochastic Processes and their Applications, 129(11):4747 – 4770, 2019.
[13] O. Kallenberg. Random Measures, Theory and Applications. Springer International Publishing, 2017.
[14] T. Kurtz. Equivalence of Stochastic Equations and Martingale Problems, pages 113–130. Springer Berlin Heidelberg, Berlin, Heidelberg, 2011.
[15] H. P. McKean. A class of Markov processes associated with nonlinear parabolic equations. Proceedings of the National Academy of Sciences, 56(6):1907–1911, 1966.
[16] M. Ondrejat. Brownian Representations of Cylindrical Local Martingales, Martingale Problem and Strong Markov Property of Weak Solutions of SPDEs in Banach Spaces. Czechoslovak Mathematical Journal, 55(4):1003–1039, 2005.
[17] P. Ren, M. Röckner, and F.-Y. Wang. Linearization of Nonlinear Fokker-Planck Equations and Applications. arXiv:1904.06795, 2020.
[18] M. Scheutzow. Uniqueness and non-uniqueness of solutions of Vlasov-McKean equations. Journal of the Australian Mathematical Society. Series A. Pure Mathematics and Statistics, 43(2):246–256, 1987.
[19] D. W. Stroock. Lectures on Stochastic Analysis: Diffusion Theory. London Mathematical Society Student Texts. Cambridge University Press, 1987.
[20] D. Trevisan. Well-posedness of Diffusion Processes in Metric Measure Spaces. PhD-thesis, Scuola Normale Superiori Pisa, 2014.
[21] D. Trevisan. Well-posedness of multidimensional diffusion processes with weakly differentiable coefficients. Electron. J. Probab., 21:41 pp., 2016.

Marco Rehmeier Faculty of Mathematics, Bielefeld University, Universitätsstraße 25, 33615 Bielefeld, Germany
E-mail address: [email protected]

Linearization and a superposition principle for deterministic and stochastic nonlinear Fokker-Planck-Kolmogorov equations

Abstract

1 Introduction

Acknowledgements

2 Notation and Preliminaries

Notation

Basic properties of spaces of measures

Probability measures

Subprobability measures

3 Superposition Principle for deterministic nonlinear Fokker-Planck-Kolmogorov Equations

Definition 3.1.

Remark 3.2.

Geometric approach to 𝒮​𝒫\mathbf{\mathcal{SP}}

Lemma 3.3.

Proof.

Lemma 3.4.

Proof.

The continuity equation (𝒮​𝒫\mathcal{SP}-CE)

Definition 3.5.

Transferring (NLFPK) and (𝒮​𝒫\mathcal{SP}-CE) to ℝ∞\mathbb{R}^{\infty}

Definition 3.6.

3.1 Main Result: Deterministic case

Theorem 3.7.

Proposition 3.8.

Remark 3.9.

3.2 Consequences and applications

Corollary 3.10.

Proof.

Corollary 3.11.

Proof.

3.2.1 Application to coupled nonlinear-linear Fokker-Planck-Kolmogorov equations

Proposition 3.12.

Proof.

4 Superposition Principle for stochastic nonlinear Fokker-Planck-Kolmogorov Equations

Definition 4.1.

Remark 4.2.

Lemma 4.3.

Proof.

Embedding 𝒮​𝒫\mathcal{SP} into ℓ2\ell^{2}

Lemma 4.4.

Proof.

Linearization of (SNLFPK)

Remark 4.5.

Definition 4.6.

Transferring (SNLFPK) and (𝒮​𝒫\mathcal{SP}-FPK) to ℓ2\ell^{2}

Definition 4.7.

4.1 Main Result: Stochastic case

Theorem 4.8.

Lemma 4.9.

Proof.

Definition 4.10.

Remark 4.11.

Remark 4.12.

Proposition 4.13.

Lemma 4.14.

Proof.

Lemma 4.15.

Proof.

Proposition 4.16.

Remark 4.17.

Proposition 4.18.

Remark 4.19.

References

Geometric approach to $\mathbf{\mathcal{SP}}$

The continuity equation ( $\mathcal{SP}$ -CE)

Transferring (NLFPK) and ( $\mathcal{SP}$ -CE) to $\mathbb{R}^{\infty}$

Embedding $\mathcal{SP}$ into $\ell^{2}$

Transferring (SNLFPK) and ( $\mathcal{SP}$ -FPK) to $\ell^{2}$