[1,3]\fnmStephen S.-T. \surYau

1]\orgdivDepartment of Mathematical Sciences, \orgnameTsinghua University, \orgaddress\cityBeijing, \postcode100084, \countryChina

2]\orgdivSchool of Mathematics, \orgnameRenmin University of China, \orgaddress\cityBeijing, \postcode100872, \countryChina

3]\orgnameBeijing Institute of Mathematical Sciences and Applications, \orgaddress\cityBeijing, \postcode101408, \countryChina

On the Convergence Analysis of Yau-Yau Nonlinear Filtering Algorithm: from a Probabilistic Perspective

\fnmZeju \surSun [email protected] \fnmXiuqiong \surChen [email protected] [email protected] [ [ [

Abstract

At the beginning of this century, a real time solution of the nonlinear filtering problem without memory was proposed in [1, 2] by the third author and his collaborator, and it is later on referred to as Yau-Yau algorithm. During the last two decades, a great many nonlinear filtering algorithms have been put forward and studied based on this framework. In this paper, we will generalize the results in the original works and conduct a novel convergence analysis of Yau-Yau algorithm from a probabilistic perspective. Instead of considering a particular trajectory, we estimate the expectation of the approximation error, and show that commonly-used statistics of the conditional distribution (such as conditional mean and covariance matrix) can be accurately approximated with arbitrary precision by Yau-Yau algorithm, for general nonlinear filtering systems with very liberal assumptions. This novel probabilistic version of convergence analysis is more compatible with the development of modern stochastic control theory, and will provide a more valuable theoretical guidance for practical implementations of Yau-Yau algorithm.

keywords:

nonlinear filtering, DMZ equation, Yau-Yau algorithm, convergence analysis, stochastic partial differential equation

pacs:

[

MSC Classification]60G35, 93E11, 60H15, 65M12

1 Introduction

Filtering is an important subject in the field of modern control theory, and has wide applications in various scenarios such as signal processing [3][4], weather forecast [5][6], aerospace industrial [7][8] and so on. The core objective of filtering problem is pursuing accurate estimation or prediction to the state of a given stochastic dynamical system based on a series of noisy observations [9][10]. For practical implementations, it is also necessary that the estimation or prediction to the state can be computed in a recursive and real-time manner.

In the filtering problems we consider in this paper, the evolution of state processes, as well as the noisy observations, is governed by the following system of stochastic differential equations,

\left\{\begin{aligned} &dX_{t}=f(X_{t})dt+g(X_{t})dV_{t},\quad X_{0}=\xi,\\ &dY_{t}=h(X_{t})dt+dW_{t},\quad Y_{0}=0,\end{aligned}\right.\quad t\in[0,T],

(1)

in the filtered probability space $(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t=0}^{T},P)$ , where $T>0$ is a fixed termination; $X=\{X_{t}:0\leq t\leq T\}\subset\mathbb{R}^{d}$ is the state process we would like to track; $Y=\{Y_{t}:0\leq t\leq T\}\subset\mathbb{R}^{d}$ is the noisy observation to the state process $X$ ; $\{V_{t}:0\leq t\leq T\}$ and $\{W_{t}:0\leq t\leq T\}$ are mutually independent, $\mathcal{F}_{t}$ -adapted, $d$ -dimensional standard Brownian motions; $\xi$ is a $\mathbb{R}^{d}$ -valued random variable with probability density function $\sigma_{0}(x)$ , which is independent of $V_{t}$ and $W_{t}$ ; $f:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ , $g:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d\times d}$ and $h:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ are sufficiently smooth vector- or matrix- valued functions.

Mathematically, for a given test function $\varphi:\mathbb{R}^{d}\rightarrow\mathbb{R}$ , the optimal estimation of $\varphi(X_{t})$ based on the historical observations up to time $t$ is the conditional expectation $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ , where $\mathcal{Y}_{t}:=\sigma\{Y_{s}:0\leq s\leq t\}$ the $\sigma$ -algebra generated by historical observations. Such estimation is ‘optimal’ in the sense that,

E[\varphi(X_{t})|\mathcal{Y}_{t}]=\mathop{\text{arg min}}\limits_{U\text{ is }\mathcal{Y}_{t}\text{-measurable}}E[(\varphi(X_{t})-U)^{2}].

(2)

Therefore, the main task of filtering problem can be specified into finding efficient algorithms to numerically compute the conditional expectation $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ , or equivalently, the conditional probability distribution $P[X_{t}\in\cdot|\mathcal{Y}_{t}]$ or the conditional probability density function (if exists).

With some regularity assumptions on the coefficients $f,g,h$ in the system (1), the conditional probability measure $P[X_{t}\in\cdot|\mathcal{Y}_{t}]$ is absolutely continuous with respect to the Lebesgue measure in $\mathbb{R}^{d}$ , and the conditional probability density function $\sigma(t,x)$ , (without considering a normalization constant), can be described by the well-known DMZ equation, which is named after three researchers: T. E. Duncan [11], R. E. Mortensen [12] and M. Zakai [13], who derived the equation independently in the late 1960s.

The DMZ equation satisfied by the unnormalized conditional probability density function $\sigma(t,x)$ is a second-order stochastic partial differential equation, and does not possess an explicit-form solution in general. In their works [1] and [2], the third author and his collaborator propose a two-stage algorithm framework to compute the solution of the DMZ equation numerically in a memoryless and real-time manner. Later on, this algorithm is referred to as Yau-Yau algorithm.

The basic idea of Yau-Yau algorithm is that the heavy computation burden of numerically solving a Kolmogorov-type partial differential equation (PDE) can be done off-line, and in the meanwhile, the on-line procedure only consists of the basic computations such as multiplication by the exponential function of the observations. In the framework of Yau-Yau algorithm, various kinds of methods to solving the Kolmogorov-type PDEs, such as spectral methods [14][15], proper orthogonal decomposition [16], tensor training [17], etc, are proposed and applied in specific examples of nonlinear filtering problems. Numerical results in the previous works mentioned above show that Yau-Yau algorithm can provide accurate and real-time estimations to the state process of very general nonlinear filtering problems in low and medium-high dimensional space.

In the original works [1] and [2], the convergence analysis of Yau-Yau algorithm is conducted pathwisely. For regular paths of observations with some boundedness conditions, it is proved that the numerical solution provided by Yau-Yau algorithm will converge to the exact solution of DMZ equation both pointwisely and in $L^{2}$ -sense, as the size of time-discretization step tends to zero, while in the works on the practical implementations of Yau-Yau algorithm, such as [14] and [15], the convergence analysis mainly focuses on the capability of numerical methods to approximate the solution of Kolmogorov-type PDEs arising in Yau-Yau algorithms.

In this paper, we will revisit the convergence analysis of Yau-Yau algorithm from a probabilistic perspective. Instead of considering the convergence results pathwisely, we prove that the solution of Yau-Yau algorithm will converge to the exact solution of DMZ equation in expectation, and also, after the normalization procedure, the approximated solution to the filtering problem provided by Yau-Yau algorithm will converge to the conditional expectation $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ .

The advantage of this probabilistic perspective is that for a theoretically rigorous convergence analysis, instead of regularity assumptions on observation paths, we only need to make assumptions on the coefficients $f,g,h$ of the filtering system (1) and the test function $\varphi$ , which are verifiable off-line in advance for practitioners. In the meanwhile, as shown in the main results of this paper in Section 3, those assumptions we need are in fact quite general, and it is straightforward to check that the most commonly used test functions $\varphi(x)=x_{i}$ and $\varphi(x)=x_{i}x_{j}$ , $x=(x_{1},\cdots,x_{d})^{\top}\in\mathbb{R}^{d}$ , corresponding to the conditional mean and conditional covariance matrix, as well as the linear Gaussian systems (with $f(x)=Fx$ , $g(x)\equiv\Gamma$ , and $h(x)=Hx$ , $F,H,\Gamma\in\mathbb{R}^{d\times d}$ ), satisfy all the assumptions.

Moreover, to the best of the authors’ knowledge, most of the theoretical analysis of PDE-based filtering algorithm mainly deals with convergence results with respect to the DMZ equation. In such probabilistic perspective we consider here in this paper, however, it is natural and convenient to make a step forward and discuss the approximation capability of Yau-Yau algorithm to the normalized conditional expectation and conditional probability distribution. In this way, we will provide a thorough convergence analysis of the Yau-Yau algorithm for filtering problems.

The organization of this paper is as follows. Section 2 serves as preliminaries, in which we will summarize some basic concepts of filtering problems and the main procedure of Yau-Yau algorithm. The main theorems in this paper will be stated in Section 3, together with a sketch of the proofs. In the next four sections, we will provide the detailed proofs of the lemmas and theorems. We first focus on the properties of the exact solution of DMZ equation in Section 4 and Section 5, and then deal with the approximated solutions given by Yau-Yau algorithm in Section 6 and Section 7. Finally, Section 8 is a conclusion.

2 Preliminaries

In this section, we would like to briefly summarize the theory of nonlinear filtering, including the change-of-measure approach to deriving the DMZ equation, as well as the main idea and procedures of Yau-Yau algorithm.

In the change-of-measure approach to deriving the DMZ equation corresponding to the filtering system (1), we first introduce a series of reference probability measures $\{\tilde{P}_{t}:0\leq t\leq T\}$ , absolutely continuous to the original probability measure $P$ with Radon derivatives given by

Z_{t}\triangleq\frac{d\tilde{P}_{t}}{dP}\biggr{|}_{\mathcal{F}_{t}}=\exp\biggl{(}-\int_{0}^{t}h(X_{s})^{\top}dW_{s}-\frac{1}{2}\int_{0}^{t}|h(X_{s})|^{2}ds\biggr{)},\ t\in[0,T].

(3)

According to Girsanov’s theorem, as long as the process $\{Z_{t}:0\leq t\leq T\}$ defined in (3) is a martingale, then under the reference probability measure $\tilde{P}_{T}$ , the observation process $\{Y_{t}:0\leq t\leq T\}$ is a standard Brownian motion which is independent of the state process $X$ .

We also introduce the process $\{\tilde{Z}_{t}:0\leq t\leq T\}$ , $\tilde{Z}_{t}=Z_{t}^{-1}$ , to be the inverse of $Z_{t}$ , which is also a Radon derivative and can be expressed by the stochastic integral with respect to $Y$ as follows:

\tilde{Z}_{t}=Z_{t}^{-1}=\frac{dP}{d\tilde{P}_{t}}\biggr{|}_{\mathcal{F}_{t}}=\exp\biggl{(}\int_{0}^{t}h(X_{s})^{\top}dY_{s}-\frac{1}{2}\int_{0}^{t}|h(X_{s})|^{2}ds\biggr{)},\ t\in[0,T].

(4)

Therefore, for any $\mathcal{F}_{t}$ -measurable, integrable random variable $U\in\mathcal{F}_{t}$ , its expectation with respect to measure $P$ can be computed by

E[U]=\tilde{E}\left[\tilde{Z}_{t}U\right],

(5)

where $\tilde{E}$ means the expectation is taken under the probability measure $\tilde{P}_{T}$ .

As an extension of Bayesian formula in the context of continuous-time stochastic processes, the following Kallianpur-Striebel formula allows us to express and calculate the solution of filtering problem, $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ , by a ratio of conditional expectations under $\tilde{P}_{T}$ :

E[\varphi(X_{t})|\mathcal{Y}_{t}]=\frac{\tilde{E}\left[\tilde{Z}_{t}\varphi(X_{t})|\mathcal{Y}_{t}\right]}{\tilde{E}\left[\tilde{Z}_{t}|\mathcal{Y}_{t}\right]},\ t\in[0,T].

(6)

Since the denominator $\tilde{E}\left[\tilde{Z}_{t}|\mathcal{Y}_{t}\right]$ in (6) is independent of the test function $\varphi$ , people often refer to the nominator, $\tilde{E}\left[\tilde{Z}_{t}\varphi(X_{t})|\mathcal{Y}_{t}\right]$ , as the unnormalized conditional expectation of $\varphi(X_{t})$ . The corresponding measure-valued stochastic process $\{\rho_{t}:0\leq t\leq T\}$ defined by

\rho_{t}(A):=\tilde{E}\left[\tilde{Z}_{t}1_{A}|\mathcal{Y}_{t}\right],\ \forall\ A\in\mathcal{F}_{t},\ t\in[0,T],

(7)

is also referred to as unnormalized conditional probability measure, and we also denote the unnormalized conditional expectation by

\rho_{t}(\varphi):=\tilde{E}\left[\tilde{Z}_{t}\varphi(X_{t})|\mathcal{Y}_{t}\right],\ \varphi\text{ is a test function},\ t\in[0,T],

(8)

With sufficient regularity assumptions on the coefficients $f,g,h$ and test function $\varphi$ , the evolution of $\rho_{t}(\varphi)$ is governed by the following well-known DMZ equation:

\rho_{t}(\varphi)=\rho_{0}(\varphi)+\int_{0}^{t}\rho_{s}(\mathcal{L}\varphi)ds+\sum_{j=1}^{d}\int_{0}^{t}\rho_{s}(h_{j}\varphi)dY_{s}^{j},\ t\in[0,T].

(9)

where

\mathcal{L}=\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}+\sum_{i=1}^{d}f_{i}(x)\frac{\partial}{\partial x_{i}}

(10)

is a second-order elliptic operator with $a(x):=(a^{ij}(x))_{1\leq i,j\leq d}=g(x)g(x)^{\top}$ .

If the stochastic measures $\rho_{t}$ , $t\in[0,T]$ , are almost surely absolutely continuous to the Lebesgue measure in $\mathbb{R}^{d}$ , and the density functions (or the Radon derivatives) $\sigma(t,x)$ , as well as the derivatives of $\sigma(t,x)$ , is square-integrable, then $\sigma(t,x)$ is the solution to the following equation, (which is also referred to as the DMZ equation):

\left\{\begin{aligned} &d\sigma(t,x)=\mathcal{L}^{*}\sigma(t,x)dt+\sum_{j=1}^{d}h_{j}(x)\sigma(t,x)dY_{t}^{j},\ t\in[0,T],\\ &\sigma(0,x)=\sigma_{0}(x),\end{aligned}\right.

(11)

which is a second-order stochastic partial differential equation with

\mathcal{L}^{*}(\star)=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}\star)-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}\star).

(12)

the adjoint operator of $\mathcal{L}$ .

In this case, the unnormalized conditional expectation $\rho_{t}(\varphi)$ can be expressed as

\rho_{t}(\varphi)=\tilde{E}\left[\tilde{Z}_{t}\varphi(X_{t})|\mathcal{Y}_{t}\right]=\int_{\mathbb{R}^{d}}\varphi(x)\sigma(t,x)dx,\ \varphi\text{ is a test function},\ t\in[0,T],

(13)

and the (normalized) conditional expectation $E\left[\varphi(X_{t})|\mathcal{Y}_{t}\right]$ can be calculated by

E\left[\varphi(X_{t})|\mathcal{Y}_{t}\right]=\frac{\tilde{E}\left[\tilde{Z}_{t}\varphi(X_{t})|\mathcal{Y}_{t}\right]}{\tilde{E}\left[\tilde{Z}_{t}|\mathcal{Y}_{t}\right]}=\frac{\int_{\mathbb{R}^{d}}\varphi(x)\sigma(t,x)dx}{\int_{\mathbb{R}^{d}}\sigma(t,x)dx}.

(14)

Because the solution of (11) does not have a closed form for general nonlinear filtering systems, efficient numerical methods must be proposed, so that we can get a good approximation to the conditional expectation $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ through the equation (14).

At the beginning of this century, the third author and his collaborator proposed a two-stage algorithm to numerically solve the DMZ equation (11) in a memoryless and real-time manner, which is often referred to as Yau-Yau algorithm. Here, we would like to briefly introduce the basic idea and main procedure of this algorithm.

Firstly, if we consider the exponential transformation

w(t,x):=\exp\left(-h^{\top}(x)Y_{t}\right)\sigma(t,x),\ t\in[0,T],

(15)

then the function $w(t,x)$ satisfies the robust DMZ equation

\frac{\partial w}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial^{2}w}{\partial x_{i}\partial x_{j}}+\sum_{i=1}^{d}F_{i}(t,x)\frac{\partial w}{\partial x_{i}}+J(t,x)w(t,x),

(16)

where the stochastic differential terms in the original DMZ equation (11) are eliminated and

F_{i}(t,x)=\sum_{j=1}^{d}\biggl{(}\frac{\partial a^{ij}}{\partial x_{j}}+a^{ij}\sum_{k=1}^{d}Y_{t}^{k}\frac{\partial h_{k}}{\partial x_{j}}\biggr{)}-f_{i}(x),\quad i=1,\cdots,d,

$\displaystyle J(t,x)=$	$\displaystyle\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}a^{ij}}{\partial x_{i}\partial x_{j}}+\sum_{i,j,k=1}^{d}Y_{t}^{k}\frac{\partial h_{k}}{\partial x_{j}}\frac{\partial a^{ij}}{\partial x_{i}}$	(17)
	$\displaystyle+\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}\biggl{(}\sum_{k=1}^{d}Y_{t}^{k}\frac{\partial^{2}h_{k}}{\partial x_{i}\partial x_{j}}+\sum_{k=1}^{d}\sum_{l=1}^{d}Y_{t}^{k}Y_{t}^{l}\frac{\partial h_{k}}{\partial x_{i}}\frac{\partial h_{l}}{\partial x_{j}}\biggr{)}$
	$\displaystyle-\sum_{i=1}^{d}\frac{\partial f_{i}}{\partial x_{i}}-\sum_{i,j=1}^{d}Y_{t}^{j}\frac{\partial h_{j}}{\partial x_{i}}f_{i}(x)-\frac{1}{2}\|h\|^{2}$

are stochastic functions that depend on the specific value of observation $Y_{t}$ at time $t$ .

Instead of solving equation (11) directly, we will mainly focus on the robust DMZ equation (16), especially the corresponding initial-boundary value (IBV) problems in a closed ball $B_{R}:=\{x\in\mathbb{R}^{d}:|x|\leq R\}$ , with a given radius $R>0$ .

\left\{\begin{aligned} &\frac{\partial u_{R}}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial^{2}u_{R}}{\partial x_{i}\partial x_{j}}+\sum_{i=1}^{d}F_{i}(t,x)\frac{\partial u_{R}}{\partial x_{i}}+J(t,x)u_{R}(t,x),\ t\in[0,T],\\ &u_{R}(0,x)=\sigma_{0}(x)\cdot\mathscr{S}_{R}(x),\ x\in B_{R},\\ &u_{R}(t,x)=0,\ (t,x)\in[\tau_{k-1},\tau_{k}]\times\partial B_{R}.\end{aligned}\right.

(18)

where $\mathscr{S}_{R}(x)$ is a $C^{\infty}$ function supported in $B_{R}$ and satisfies

\mathscr{S}_{R}(x)=\left\{\begin{aligned} &1,\quad|x|\leq R-\frac{1}{R}\\ &0,\quad|x|\geq R\end{aligned}\right.

(19)

and $0\leq\mathscr{S}_{R}(x)\leq 1$ for $R-\frac{1}{R}\leq|x|\leq R$ , so that the initial value is compatible with the boundary condition in (18). And from now on, we would like to drop the subscript, $R$ , in the notation $u_{R}(t,x)$ for the simplicity of notations, and use $u(t,x)$ to denote the solution to the IBV problem (19).

Let $0=\tau_{0}<\tau_{1}<\cdots<\tau_{K}=T$ be a uniform partition of the time interval $[0,T]$ , with $\tau_{k}-\tau_{k-1}=\delta=\frac{T}{K}$ , $k=1,\cdots,K$ . On each time interval $[\tau_{k-1},\tau_{k}]$ , consider the IBV problem of the following parabolic equation

\left\{\begin{aligned} &\frac{\partial u_{k}}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial^{2}u_{k}}{\partial x_{i}\partial x_{j}}+\sum_{i=1}^{d}F_{i}(\tau_{k-1},x)\frac{\partial u_{k}}{\partial x_{i}}+J(\tau_{k-1},x)u_{k}(t,x),\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad(t,x)\in(\tau_{k-1},\tau_{k}]\times B_{R},\\ &u_{k}(\tau_{k-1},x)=u_{k-1}(\tau_{k-1},x),\ x\in B_{R},\\ &u_{k}(t,x)=0,\ (t,x)\in[\tau_{k-1},\tau_{k}]\times\partial B_{R},\end{aligned}\right.

(20)

with the value of coefficients $F(t,x)$ and $J(t,x)$ frozen at the left point $t=\tau_{k-1}$ and initial value $u_{0}(\tau_{0},x):=\sigma_{0}(x)$ .

With another exponential transformation given by

\tilde{u}_{k}(t,x)=\exp\left(h^{\top}(x)Y_{\tau_{k-1}}\right)u_{k}(t,x),\ t\in[\tau_{k-1},\tau_{k}],

(21)

the newly-constructed function $\tilde{u}_{k}$ satisfies

\left\{\begin{aligned} &\frac{\partial\tilde{u}_{k}}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}\tilde{u}_{k}(t,x))\\ &\quad-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}\tilde{u}_{k}(t,x))-\frac{1}{2}|h|^{2}\tilde{u}_{k}(t,x),\ (t,x)\in(\tau_{k-1},\tau_{k}]\times B_{R},\\ &\tilde{u}_{k}(\tau_{k-1},x)=\exp\biggl{(}h^{\top}(x)Y_{\tau_{k-1}}\biggr{)}u_{k}(\tau_{k-1},x),\ x\in B_{R}\\ &\tilde{u}_{k}(t,x)=0,\ (t,x)\in[\tau_{k-1},\tau_{k}]\times\partial B_{R}\end{aligned}\right.

(22)

After the two exponential transformations (15) and (21), the function we would like to use to approximate the unnormalized conditional probability density function $\sigma(\tau_{k},x)$ at time $t=\tau_{k}$ is given by

\sigma(\tau_{k},x)\approx\exp\biggl{(}h^{\top}(x)(Y_{\tau_{k}}-Y_{\tau_{k-1}})\biggr{)}\tilde{u}_{k}(\tau_{k},x)=\tilde{u}_{k+1}(\tau_{k},x),\ k=1,\cdots,K.

(23)

and the value for approximating the conditional expectation $E[\varphi(X_{t})|\mathcal{Y}_{t}]$ is given by

E[\varphi(X_{t})|\mathcal{Y}_{t}]\approx\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx},\ k=1,\cdots,K.

(24)

The main idea of the Yau-Yau algorithm is that the problem of solving the DMZ equation satisfied by the unnormalized probability density function $\sigma(t,x)$ can be separated into two parts. The computationally expensive part of solving the (IBV) problems of parabolic equation (22) can be done off-line, because it is a deterministic Kolmogorov-type PDE which is independent of observations, and at least, the corresponding semi-group, $\{S_{t}:t\in[0,T]\}$ , can be analyzed and approximated off-line. When new observation comes, the remaining task is only about calculating exponential transformations and numerical integrals. The framework of Yau-Yau algorithm is shown in Algorithm 1.

Algorithm 1 The Two-Stage Framework of Yau-Yau Algorithm

1: Initialization: Input the terminal time

T

, the radius

R

of closed ball

B_{R}

, the number of time-discretization steps

K

, the initial distribution of state process

\sigma_{0}(x)

, the test function

\varphi(x)

, and the initial observation

Y_{0}=0

. Let

\delta=\frac{T}{K}

be the time-discretization step size and

\{0=\tau_{0}<\tau_{1}<\cdots<\tau_{K}=T\}

be a uniform partition of

[0,T]

with

\tau_{k}-\tau_{k-1}=\delta

. Initialize

\tilde{u}_{1}(0,x)=\sigma_{0}(x)

2: Off-Line Algorithm: Solve the IBV problem of Kolmogorov-type partial differential equation (22) in closed ball

B_{R}

, and determine or approximate the corresponding semi-group

\{S_{t}:t\in[0,T]\}

3: On-Line Algorithm:

4: for

k=1

K

5: Obtain

\tilde{u}_{k}(\tau_{k},x)

from the Off-Line Algorithm

\tilde{u}_{k}(\tau_{k},x)=S_{\tau_{k}-\tau_{k-1}}\tilde{u}_{k}(\tau_{k-1},x).

6: Renew the initial value of the partial differential equation satisfied by

\tilde{u}_{k+1}(x,t)

\tilde{u}_{k+1}(\tau_{k},x)=\exp\left[h^{\top}(x)(Y_{\tau_{k}}-Y_{\tau_{k-1}})\right]\tilde{u}_{k}(\tau_{k},x).

7: Compute the approximated conditional expectation:

\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}.

8: end for

In the next section, we will give a mathematically rigorous interpretation of the approximation results (23) and (24) from a probabilistic perspective. In particular, we only need assumptions on the test function and the coefficients of the filtering systems to derive the convergence result. These assumptions are also easy to verify off-line before the observations come, and therefore, this convergence analysis will provide a guidance for practitioners to determine the parameters in the implementations of Yau-Yau algorithm for practical use.

3 Main Results

In this section, we would like to state the main result in this paper and also provide a sketch of the proof.

Firstly, besides the smoothness and regularity requirements which guarantee the existence of conditional expectation and the existence of the solution to the DMZ equation, let us further introduce four particular assumptions on the coefficients of the system, the initial distribution and the test function.

For the state equation in the filtering system (1), the drift term $f:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ is assumed to be Lipschitz, and the the diffusion term $g:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d\times d}$ , together with $a(x)=g(x)g(x)^{\top}$ , is assumed to have bounded partial derivatives up to second order, i.e.,

\displaystyle\text{{(A1)}: }\begin{aligned} &\exists\ L>0,\ s.t.\ |f(x)-f(y)|\leq L|x-y|,\\ &|a(x)|\leq L,\ \biggl{|}\frac{\partial a^{ij}(x)}{\partial x_{k}}\biggr{|}\leq L,\ \biggl{|}\frac{\partial^{2}a^{ij}(x)}{\partial x_{k}\partial x_{l}}\biggr{|}\leq L,\ \forall\ x,y\in\mathbb{R}^{d},\ i,j,k,l=1,\cdots,d.\end{aligned}

Assumption (A1) guarantees that the state equation of (1) has a strong solution in $[0,T]$ , and especially, the state equation for linear filter satisfies this assumption.

Also, in order to conduct energy estimations for the (stochastic) partial differential equations, we would like to assume that the diffusion term in the state equation is nondegenerate, in the sense that

\displaystyle\text{{(A2):}}\quad\begin{aligned} &\text{For each $x\in\mathbb{R}^{d}$, there exists a continuous function $\lambda(x)>0$},\\ &s.t.\ \sum_{i,j=1}^{d}a^{ij}(x)\zeta_{i}\zeta_{j}\geq\lambda(x)|\zeta|^{2},\ \forall\ \zeta=(\zeta_{1},\cdots,\zeta_{d})^{\top}\in\mathbb{R}^{d}.\end{aligned}

For the initial distribution $\sigma_{0}$ , we would like to assume that it is smooth enough and possesses finite high-order moments:

\displaystyle\text{{(A3):}}\ \int_{\mathbb{R}^{d}}|x|^{2n}\sigma_{0}(x)dx<\infty,\ \forall\ n\in\mathbb{N}.

Assumption (A3) is satisfied by commonly-used light-tailed distributions such as Gaussian distributions. In fact, in the following convergence analysis, we only require Assumption (A3) to hold for sufficiently large $n\geq 1$ , rather than for all $n\in\mathbb{N}$ .

Finally, it is assumed that the test function $\varphi:\mathbb{R}^{d}\rightarrow\mathbb{R}$ is at most polynomial growth:

\displaystyle\text{{(A4): }}\exists\ L>0,\ m\in\mathbb{N},\ s.t.\ |\varphi(x)|\leq L(1+|x|^{2m}),\quad\forall\ x\in\mathbb{R}^{d},

which is satisfied by most of the commonly-used test functions, such as those correspond to the conditional mean and covariance matrix.

Based on the above assumptions (A1) to (A4), the main result in this paper is stated as follows:

Theorem 1.

Fix a terminal time $T>0$ and the filtering system (1) with smooth coefficients. If Assumptions (A1) to (A4) hold, then for every $\epsilon>0$ , there exists $R>0$ , and $\delta>0$ , such that we can conduct the Yau-Yau algorithm in the closed ball $B_{R}=\{x\in\mathbb{R}^{d}:|x|\leq R\}$ and the uniform partition of $[0,T]$ : $0=\tau_{0}<\tau_{1}<\cdots<\tau_{K}=T$ with $\delta=\tau_{k}-\tau_{k-1}$ for $k=1,\cdots,K$ , and the numerical solution $\{\tilde{u}_{k+1}(\tau_{k},x):k=1,\cdots,K\}$ approximates the exact solution of the filtering problems at each time step $t=\tau_{k}$ well in the sense of mathematical expectation, i.e.,

E\biggl{|}E[\varphi(X_{\tau_{k}})|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{|}<\epsilon,\ \forall\ 1\leq k\leq K.

(25)

Here in this section, we provide a sketch of the proof of Theorem 25, in which the main idea of the proof is illustrated. The detailed proofs of those key estimations here will be given in order in the next four sections.

A Sketch of the Proof of Theorem 25.

Let $\{\tilde{Z}_{t}:0\leq t\leq T\}$ be the Radon derivative $Z_{t}=\left.\frac{dP}{d\tilde{P}_{t}}\right|_{\mathcal{F}_{t}}$ defined in (4). And therefore, for every integrable, $\mathcal{F}_{t}$ -measurable random variable $U$ , we have

E[U]=\tilde{E}[\tilde{Z}_{t}U].

(26)

According to the properties of conditional expectations, the expectation of the approximation error of Yau-Yau algorithm can be estimated as follows:

	$\displaystyle E$	$\displaystyle\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{Z}_{\tau_{k}}\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{Z}_{\tau_{k}}\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle\leq\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle\leq\tilde{E}\biggl{[}\biggl{\|}\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\frac{\int_{B_{R}}\|\varphi(x)\|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggl{\|}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\biggl{\|}\int_{\mathbb{R}^{d}}\varphi(x)\sigma(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\frac{\int_{B_{R}}\|\varphi(x)\|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggl{\|}\int_{\mathbb{R}^{d}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\triangleq I_{1}+I_{2}.$

where we use the fact that $\tilde{u}_{k+1}(\tau_{k},x)$ is $\mathcal{Y}_{\tau_{k}}$ -measurable and for integrable, $\mathcal{Y}_{\tau_{k}}$ -measurable random variable $V$ ,

\tilde{E}[\tilde{Z}_{\tau_{k}}V]=\tilde{E}[\tilde{E}[\tilde{Z}_{\tau_{k}}|\mathcal{Y}_{\tau_{k}}]V].

(27)

Therefore, the remaining task for us is to estimate the two error terms $I_{1}$ and $I_{2}$ , and to show that $I_{1}$ and $I_{2}$ can be arbitrarily small with sufficiently large $R>0$ and sufficiently small $\delta>0$ .

Firstly, for the estimation of

I_{1}=\tilde{E}\biggl{[}\biggl{|}\int_{\mathbb{R}^{d}}\varphi(x)\sigma(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{|}\biggr{]},

(28)

we would like to utilize an intermediate function $\sigma_{R}(t,x)$ , $(t,x)\in[0,T]\times B_{R}$ , which is the solution of IBV problem of the DMZ equation (11) and will be introduced in (43) in Section 4. And we have

$\displaystyle I_{1}\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\varphi(x)\sigma(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\sigma_{R}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
	$\displaystyle+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\varphi(x)\sigma_{R}(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
$\displaystyle\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+\tilde{E}\int_{B_{R}}\|\varphi(x)\|\cdot\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle+\tilde{E}\int_{B_{R}}\|\varphi(x)\|\cdot\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx$
$\displaystyle\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$	(29)

For the estimation of

I_{2}=\tilde{E}\biggl{[}\frac{\int_{B_{R}}|\varphi(x)|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggl{|}\int_{\mathbb{R}^{d}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{|}\biggr{]},

(30)

since in the closed ball $B_{R}$ , $|\varphi(x)|\leq L(1+R^{2m})$ ,

\frac{\int_{B_{R}}|\varphi(x)|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\leq L(1+R^{2m})\frac{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}=L(1+R^{m}).

(31)

and thus,

$\displaystyle I_{2}$	$\displaystyle\leq L(1+R^{2m})\tilde{E}\biggl{[}\biggl{\|}\int_{\mathbb{R}^{d}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$	(32)
	$\displaystyle\leq L(1+R^{2m})\biggl{(}\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\sigma_{R}(\tau_{k},x)dx\biggr{\|}\biggr{]}\biggr{)}$
	$\displaystyle\quad+L(1+R^{2m})\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\sigma_{R}(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
	$\displaystyle\leq L(1+R^{2m})\biggl{(}\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx+\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx\biggr{)}$
	$\displaystyle\quad+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$

Combining (29) and (32), we have

$\displaystyle E$	$\displaystyle\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\leq I_{1}+I_{2}$	(33)
	$\displaystyle\leq\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+L(1+R^{m})\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx$
	$\displaystyle\quad+2L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle\quad+2L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$

According to Theorem 2 in Section 4, for every $n\in\mathbb{N}$ , there exists $C_{1}>0$ , which depends on $d$ , $m$ , $n$ , $L$ , $T$ , such that

		$\displaystyle\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx\leq\frac{C_{1}}{1+R^{2n}}\int_{\mathbb{R}^{d}}\|x\|^{2n}\sigma_{0}(x)dx$		(34)
		$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx\leq\frac{C_{1}}{1+R^{2n}}\int_{\mathbb{R}^{d}}\|x\|^{2(m+n)}\sigma_{0}(x)dx$		(34)

Therefore, for every $\epsilon>0$ , with Assumption (A3) for the initial distribution $\sigma_{0}$ , as long as we take $n>m$ , there exists $R_{1}>0$ , such that

	$\displaystyle\tilde{E}$	$\displaystyle\int_{\|x\|\geq R_{1}}\|\varphi(x)\|\sigma(\tau_{k},x)dx+L(1+R_{1}^{m})\tilde{E}\int_{\|x\|\geq R_{1}}\sigma(\tau_{k},x)dx$		(35)
		$\displaystyle\leq\frac{C_{1}(1+R_{1}^{2m})}{1+R_{1}^{2n}}\int_{\mathbb{R}^{d}}\|x\|^{2n}\sigma_{0}(x)dx+\frac{C_{1}}{1+R_{1}^{2n}}\int_{\mathbb{R}^{d}}\|x\|^{2(m+n)}\sigma_{0}(x)dx<\frac{\epsilon}{3}$		(35)

According to Theorem 3 in Section 5, there exists $C_{2}>0$ , which depends on $d,n,L,T$ , such that

\tilde{E}\int_{B_{R}}|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)|dx\leq\frac{C_{2}}{1+R^{2n}}.

(36)

Therefore, as long as $n>m$ , there exists $R_{2}>0$ , such that

\displaystyle 2L(1+R_{2}^{2m})\tilde{E}\int_{B_{R_{2}}}|\sigma(\tau_{k},x)-\sigma_{R_{2}}(\tau_{k},x)|dx\leq\frac{2C_{2}L(1+R_{2}^{2m})}{1+R_{2}^{2n}}<\frac{\epsilon}{3}

(37)

Let us choose $R=\max\{R_{1},R_{2}\}$ , and for this particular $R$ , according to Theorem 5 in Section 7, there exists a time step $\delta>0$ , such that

\tilde{E}\int_{B_{R}}|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)|dx<\frac{\epsilon}{6L(1+R^{2m})}.

(38)

and thus,

2L(1+R^{2m})\tilde{E}\int_{B_{R}}|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)|dx\leq\frac{2L(1+R^{2m})\epsilon}{6L(1+R^{2m})}<\frac{\epsilon}{3}.

(39)

Take $\eqref{eq:35}$ , $\eqref{eq:37}$ and $\eqref{eq:39}$ back to (33), and we obtain the desired result, that is, we have found $R>0$ and $\delta>0$ , such that

E\biggl{|}E[\varphi(X_{\tau_{k}})|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{|}<\epsilon.

(40)

∎

4 Estimation of the density outside the ball $B_{R}$

In this section, we will provide an estimation of the value of the unnormalized conditional probability density $\sigma(t,x)$ outside a ball $B_{R}\subset\mathbb{R}^{d}$ , with $R\gg 1$ large enough.

Especially, we will show that almost all the density of $\sigma(t,x)$ is contained in the closed ball $B_{R}$ , and the estimations (34) in the proof of Theorem 25 in Section 3 holds with Assumptions (A1) to (A4).

Theorem 2.

With Assumptions (A1) to (A4), there exists a constant $C>0$ which only depends on $T$ , $L$ , $d$ , $m$ and $n$ , such that

\sup\limits_{0\leq t\leq T}\tilde{E}\int_{|x|\geq R}\sigma(t,x)dx\leq\frac{C}{1+R^{2n}}\int_{\mathbb{R}^{d}}|x|^{2n}\sigma_{0}(x)dx,

(41)

and

\sup\limits_{0\leq t\leq T}\tilde{E}\int_{|x|\geq R}|\varphi(x)|\sigma(t,x)dx\leq\frac{C}{1+R^{2n}}\int_{\mathbb{R}^{d}}|x|^{2(m+n)}\sigma_{0}(x)dx

(42)

holds for all $R>0$ .

Proof of Theorem 2.

We first consider the following IBV problem on the ball $B_{R}$ :

\left\{\begin{aligned} &d\sigma_{R}(t,x)=\mathcal{L}\sigma_{R}(t,x)dt+\sum_{j=1}^{d}h_{j}(x)\sigma_{R}(t,x)dY_{t}^{j},\ (t,x)\in(0,T]\times B_{R};\\ &\sigma_{R}(0,x)=\sigma_{0,R}(x)\triangleq\sigma_{0}(x)\cdot\mathscr{S}_{R}(x),\ x\in B_{R};\\ &\sigma_{R}(t,x)=0,\ (t,x)\in[0,T]\times\partial B_{R}.\end{aligned}\right.

(43)

where $\mathscr{S}_{R}(x)$ is the $C^{\infty}$ function defined in (19), such that the initial value is compatible with the boundary conditions.

Let $\psi(x)=\log\left(1+|x|^{2n}\right)$ and define

\Phi(t)=\int_{B_{R}}e^{\psi(x)}\sigma_{R}(t,x)dx.

(44)

Then, according to the IBV problem (43) satisfied by the function $\sigma_{R}(t,x)$ , we have

$\displaystyle d\Phi(t)=$	$\displaystyle\biggl{[}\frac{1}{2}\sum_{i,j=1}^{d}\int_{B_{R}}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}\left[\left(a^{ij}(x)\right)\sigma_{R}(t,x)\right]e^{\psi(x)}dx$	(45)
	$\displaystyle-\sum_{i=1}^{d}\int_{B_{R}}\frac{\partial}{\partial x_{i}}\left(f_{i}(x)\sigma_{R}(t,x)\right)e^{\psi(x)}dx\biggr{]}dt$
	$\displaystyle+\sum_{j=1}^{d}\biggl{[}\int_{B_{R}}e^{\psi(x)}h_{j}(x)\sigma_{R}(t,x)dx\biggr{]}dY_{t}^{j}$
$\displaystyle\triangleq$	$\displaystyle\left[I_{1}(t)-I_{2}(t)\right]dt+\sum_{j=1}^{d}I_{3,j}(t)dY_{t}^{j}.$

By the Gauss-Green formula, we have

$\displaystyle I_{1}(t)=\frac{1}{2}\sum_{i,j=1}^{d}\biggl{[}$	$\displaystyle\int_{B_{R}}e^{\psi(x)}\left(\frac{\partial\psi(x)}{\partial x_{i}}\frac{\partial\psi(x)}{\partial x_{j}}+\frac{\partial^{2}\psi(x)}{\partial x_{i}\partial x_{j}}\right)a^{ij}(x)\sigma_{R}(t,x)dx$	(46)
	$\displaystyle-\int_{B_{R}}\frac{\partial}{\partial x_{j}}\left(e^{\psi(x)}\frac{\partial\psi(x)}{\partial x_{i}}a^{ij}(x)\sigma_{R}(t,x)\right)dx$
	$\displaystyle+\int_{B_{R}}\frac{\partial}{\partial x_{i}}\left(e^{\psi(x)}\frac{\partial}{\partial x_{j}}\left[a^{ij}(x)\sigma_{R}(t,x)\right]\right)dx\biggr{]}$
$\displaystyle=\frac{1}{2}\sum_{i,j=1}^{d}$	$\displaystyle\int_{B_{R}}e^{\psi(x)}\left(\frac{\partial\psi(x)}{\partial x_{i}}\frac{\partial\psi(x)}{\partial x_{j}}+\frac{\partial^{2}\psi(x)}{\partial x_{i}\partial x_{j}}\right)a^{ij}(x)\sigma_{R}(t,x)dx$
	$\displaystyle-\int_{\partial B_{R}}\vec{\mathfrak{M}}_{1}(t,x)\cdot\vec{\mathfrak{n}}dS+\int_{\partial B_{R}}\vec{\mathfrak{M}}_{2}(t,x)\cdot\vec{\mathfrak{n}}dS,$

where $\vec{\mathfrak{n}}$ is the unit outward normal vector of $\partial B_{R}$ , $dS$ denotes the measure on $\partial B_{R}$ ,

\vec{\mathfrak{M}}_{i}(t,x)=\left(\mathfrak{M}_{i,1}(t,x),\cdots,\mathfrak{M}_{i,d}(t,x)\right),\quad i=1,2,

(47)

and

		$\displaystyle\mathfrak{M}_{1,j}(t,x)=\frac{1}{2}\sum_{i=1}^{d}e^{\psi(x)}\frac{\partial\psi(x)}{\partial x_{i}}a^{ij}(x)\sigma_{R}(t,x),\quad j=1,\cdots,d,$		(48)
		$\displaystyle\mathfrak{M}_{2,i}(t,x)=\frac{1}{2}\sum_{j=1}^{d}e^{\psi(x)}\frac{\partial}{\partial x_{j}}\left[a^{ij}(x)\sigma_{R}(t,x)\right],\quad i=1,\cdots,d.$		(48)

Since $\sigma_{R}(t,x)\equiv 0$ , $\forall(t,x)\in[0,T]\times\partial B_{R}$ , $\mathfrak{M}_{1,j}(t,x)\equiv 0$ on $[0,T]\times\partial B_{R}$ and

\int_{\partial B_{R}}\vec{\mathfrak{M}}_{1}(t,x)\cdot\vec{\mathfrak{n}}dS=0.

(49)

Moreover, we have

\nabla\sigma_{R}=\left(\frac{\partial\sigma_{R}}{\partial x_{1}},\cdots,\frac{\partial\sigma_{R}}{\partial x_{d}}\right)=-c\ \vec{\mathfrak{n}},\quad\text{on}\ \partial B_{R},

(50)

where $c(x)>0$ is a continuous function on $\partial B_{R}$ , because $\sigma_{R}\geq 0$ and $\sigma_{R}|_{\partial B_{R}}\equiv 0$ .

Therefore,

$\displaystyle\vec{\mathfrak{M}}_{2}(t,x)\cdot\vec{\mathfrak{n}}=$	$\displaystyle-\vec{\mathfrak{M}}_{2}(t,x)\cdot\nabla\sigma_{R}$	(51)
$\displaystyle=$	$\displaystyle-e^{\psi(x)}\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial\sigma_{R}}{\partial x_{j}}\frac{\partial\sigma_{R}}{\partial x_{i}}$
	$\displaystyle-e^{\psi(x)}\sigma_{R}\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial}{\partial x_{j}}a^{ij}(x)\frac{\partial\sigma_{R}}{\partial x_{i}}$
$\displaystyle=$	$\displaystyle-e^{\psi(x)}\frac{1}{2}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial\sigma_{R}}{\partial x_{j}}\frac{\partial\sigma_{R}}{\partial x_{i}}\leq 0,\quad\text{on}\ \partial B_{R},$

where the last inequality holds because $a(x)=g(x)g(x)^{\top}$ is positive semi-definite. Thus,

I_{1}(t)\leq\frac{1}{2}\sum_{i,j=1}^{d}\int_{B_{R}}e^{\psi(x)}\left(\frac{\partial\psi(x)}{\partial x_{i}}\frac{\partial\psi(x)}{\partial x_{j}}+\frac{\partial^{2}\psi(x)}{\partial x_{i}\partial x_{j}}\right)a^{ij}(x)\sigma_{R}(t,x)dx.

(52)

Similarly,

\displaystyle I_{2}(t)=-\sum_{i=1}^{d}\int_{B_{R}}f_{i}(x)\sigma_{R}(t,x)e^{\psi(x)}\frac{\partial\psi(x)}{\partial x_{i}}dx.

(53)

Therefore,

d\Phi(t)\leq\left(\int_{B_{R}}\mathfrak{F}(x)e^{\psi(x)}\sigma_{R}(t,x)dx\right)dt+\sum_{j=1}^{d}\left(\int_{B_{R}}h_{j}(x)e^{\psi(x)}u(t,x)dx\right)dY_{t}^{j},

(54)

where

\displaystyle\mathfrak{F}(x)=\frac{1}{2}\sum_{i,j=1}^{d}\left(\frac{\partial\psi(x)}{\partial x_{i}}\frac{\partial\psi(x)}{\partial x_{j}}+\frac{\partial^{2}\psi(x)}{\partial x_{i}\partial x_{j}}\right)a^{ij}(x)+\sum_{i=1}^{d}f_{i}(x)\frac{\partial\psi(x)}{\partial x_{i}}.

(55)

Since $\psi(x)=\log\left(1+|x|^{2n}\right)$ , then

\frac{\partial\psi}{\partial x_{i}}=\frac{2n|x|^{2n-2}x_{i}}{1+|x|^{2n}},\ \frac{\partial^{2}\psi}{\partial x_{i}\partial x_{j}}=\frac{4nx_{i}x_{j}|x|^{2n-4}\left((n-1)-|x|^{2n}\right)}{\left(1+|x|^{2n}\right)^{2}}+\frac{2n|x|^{2n-2}\mathscr{\delta}_{ij}}{1+|x|^{2n}},

(56)

where $\mathcal{\delta}_{ij}$ is the Kronecker’s symbol, with $\mathcal{\delta}_{ij}=1$ , if $i=j$ , and $\mathcal{\delta}_{ij}=0$ otherwise.

Notice that

\left|\frac{\partial\psi}{\partial x_{i}}\right|\leq 2n,\ \left|\frac{\partial^{2}\psi}{\partial x_{i}\partial x_{j}}\right|\leq 4n^{2}+2n.

(57)

With the assumption that $|a^{ij}(x)|\leq L$ , we have

|\mathfrak{F}(x)|\leq d^{2}(4n^{2}+n)L+2n\sum_{i=1}^{d}\frac{|f_{i}(x)x_{i}|\cdot|x|^{2n-2}}{1+|x|^{2n}},\ \forall x\in\mathbb{R}^{d}.

(58)

Because $f(x)$ is Lipschitz continuous according to (A1).

|f(x)|\leq L|x|+|f(0)|,\quad\forall x\in\mathbb{R}^{d}.

(59)

Therefore,

$\displaystyle\|\mathfrak{F}(x)\|$	$\displaystyle\leq d^{2}(4n^{2}+n)L+\frac{2n\|x\|^{2n-2}}{1+\|x\|^{2n}}\sum_{i=1}^{d}\frac{\|f_{i}(x)\|^{2}+\|x_{i}\|^{2}}{2}$	(60)
	$\displaystyle\leq d^{2}(4n^{2}+n)L+\frac{n\|x\|^{2n}+n\|x\|^{2n-2}\|f(x)\|^{2}}{1+\|x\|^{2n}}$
	$\displaystyle\leq d^{2}(4n^{2}+n)L+n(L^{2}+1)+n\|f(0)\|^{2}+2nL\|f(0)\|,\ \forall x\in\mathbb{R}^{d}.$

Let us denote by $M(n,d,L)$ the above upper bound of $|\mathfrak{F}(x)|$ :

M(n,d,L):=d^{2}(4n^{2}+n)L+n(L^{2}+1)+n|f(0)|^{2}+2nL|f(0)|,

(61)

which is a constant that depends on $n$ , $d$ and $L$ , but does not depend on $R$ .

Take expectation with respect to the reference probability measure $\tilde{P}$ , we obtain

\frac{d}{dt}\tilde{E}\Phi(t)\leq M(n,d,L)\tilde{E}\Phi(t).

(62)

Here we use the fact that $Y_{t}$ is a Brownian motion with respect to $\tilde{P}$ .

According to the Gronwall’s inequality, we have

	$\displaystyle\sup\limits_{0\leq t\leq T}\tilde{E}\int_{B_{R}}\left(1+\|x\|^{2n}\right)\sigma_{R}(t,x)dx$	$\displaystyle\leq e^{M(n,d,L)T}\int_{B_{R}}\left(1+\|x\|^{2n}\right)\sigma_{0,R}(x)dx$		(63)
		$\displaystyle\leq e^{M(n,d,L)T}\int_{B_{R}}\left(1+\|x\|^{2n}\right)\sigma_{0}(x)dx.$		(63)

Let $R$ tends to infinity, and we have

\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\mathbb{R}^{d}}\left(1+|x|^{2n}\right)\sigma(t,x)dx\leq e^{M(n,d,L)T}\int_{\mathbb{R}^{d}}\left(1+|x|^{2n}\right)\sigma_{0}(x)dx.

(64)

Therefore,

$\displaystyle\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\|x\|\geq R}\sigma(t,x)dx$	$\displaystyle\leq\frac{1}{1+R^{2n}}\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\|x\|\geq R}\left(1+\|x\|^{2n}\right)\sigma(t,x)dx$	(65)
	$\displaystyle\leq\frac{1}{1+R^{2n}}\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\mathbb{R}^{d}}\left(1+\|x\|^{2n}\right)\sigma(t,x)dx$
	$\displaystyle\leq\frac{e^{M(n,d,L)T}}{1+R^{2n}}\int_{\mathbb{R}^{d}}\left(1+\|x\|^{2n}\right)\sigma_{0}(x)dx.$

Moreover, with condition (A4),

$\displaystyle\sup\limits_{0\leq t\leq T}\tilde{E}$	$\displaystyle\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(t,x)dx\leq\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\|x\|\geq R}\left(1+\|x\|^{2m}\right)\sigma(t,x)dx$	(66)
	$\displaystyle\leq\frac{1}{1+R^{2n}}\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\|x\|\geq R}\left(1+\|x\|^{2n}\right)\left(1+\|x\|^{2m}\right)\sigma(t,x)dx$
	$\displaystyle\leq\frac{1}{1+R^{2n}}\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\|x\|\geq R}2\left(1+\|x\|^{2(m+n)}\right)\sigma(t,x)dx$
	$\displaystyle\leq\frac{2}{1+R^{2n}}\sup\limits_{0\leq t\leq T}\tilde{E}\int_{\mathbb{R}^{d}}\left(1+\|x\|^{2(m+n)}\right)\sigma(t,x)dx$
	$\displaystyle\leq\frac{2e^{M(n+m,d,L)T}}{1+R^{2n}}\int_{\mathbb{R}^{d}}\left(1+\|x\|^{2(m+n)}\right)\sigma_{0}(x)dx.$

∎

5 Approximation of $\sigma(t,x)$ by the IBV problem in $B_{R}$

With the estimation in Theorem 2, because almost all the density of $\sigma(t,x)$ is contained in the closed ball $B_{R}$ for $R$ large enough, it is natural to think about approximating $\sigma(t,x)$ by the solution, $\sigma_{R}(t,x)$ , to the corresponding initial-boundary value (IBV) problem (43) of DMZ equation in the ball $B_{R}$ .

It will be rigorously proved in this section that, for $R$ large enough, $\sigma(t,x)$ can be approximated well by $\sigma_{R}(t,x)$ defined in (43), and in particular, the estimation (36) holds in the proof of Theorem 25 in Section 3.

The main result in this section is stated as follows:

Theorem 3.

With Assumptions (A1) to (A4), there exists a constant $C>0$ which only depends on $T$ , $n$ , $d$ and $L$ , such that

\sup\limits_{0\leq t\leq T}\tilde{E}\int_{B_{\sqrt{R}}}|\sigma(t,x)-\sigma_{R}(t,x)|dx\leq\frac{C}{1+R^{n}}

(67)

holds for large enough $R$ (for example, $R>5$ ), where $\sigma_{R}(t,x)$ is the solution of the IBV problem (43).

Proof of Theorem 3.

For each $R>0$ , consider the auxiliary function

\phi(x)=\log\left(1+R^{n}\left(1-\left(1-\frac{|x|^{2n}}{R^{2n}}\right)^{2}\right)\right),\quad x\in B_{R},

(68)

and

\psi(x)=e^{-\phi(x)}-e^{-\phi(R)},\quad x\in B_{R}.

(69)

Define $v(t,x)=\sigma(t,x)-\sigma_{R}(t,x)$ , $(t,x)\in[0,T]\times B_{R}$ . Then, according to the maximum principle for SPDEs (cf. [18], for example), we have $v(t,x)\geq 0$ , for all $(t,x)\in[0,T]\times B_{R}$ and a.s. $\tilde{P}$ . Let $\Phi(t)$ be the stochastic process defined by

\Phi(t)=\int_{B_{R}}\psi(x)v(t,x)dx.

(70)

Since $v(t,x)$ is the solution to the SPDE

	$\displaystyle dv(t,x)=\biggl{[}\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}(x)v(t,x))$	$\displaystyle-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}(x)v(t,x))\biggr{]}dt$		(71)
		$\displaystyle+\sum_{j=1}^{d}h_{j}(x)v(t,x)dY_{t}^{j},$		(71)

the $\mathbb{R}$ -valued stochastic process $\Phi(t)$ satisfies

$\displaystyle d\Phi(t)=$	$\displaystyle\frac{1}{2}\biggl{(}\sum_{i,j=1}^{d}\int_{B_{R}}\psi(x)\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}(x)v(t,x))dx\biggr{)}dt$	(72)
	$\displaystyle-\biggl{(}\sum_{i=1}^{d}\int_{B_{R}}\psi(x)\frac{\partial}{\partial x_{i}}(f_{i}(x)v(t,x))dx\biggr{)}dt$
	$\displaystyle+\sum_{j=1}^{d}\biggl{(}\int_{B_{R}}h_{j}(x)\psi(x)v(t,x)dx\biggr{)}dY_{t}^{j}.$

According to the Gauss-Green formula, we have

	$\displaystyle d\Phi(t)=$	$\displaystyle\frac{1}{2}\left(\sum_{i,j=1}^{d}\int_{B_{R}}a^{ij}(x)\frac{\partial^{2}\psi}{\partial x_{i}\partial x_{j}}v(t,x)dx\right)dt$
		$\displaystyle+\left(\sum_{i=1}^{d}\int_{B_{R}}\frac{\partial\psi}{\partial x_{i}}f_{i}(x)v(t,x)dx\right)dt$
		$\displaystyle+\sum_{j=1}^{d}\left(\int_{B_{R}}h_{j}(x)\psi(x)v(t,x)dx\right)dY_{t}^{j}$
		$\displaystyle+\biggl{[}-\int_{\partial B_{R}}\vec{\mathfrak{M}}_{1}(t,x)\cdot\vec{\mathfrak{n}}dS+\int_{\partial B_{R}}\vec{\mathfrak{M}}_{2}(t,x)\cdot\vec{\mathfrak{n}}dS\biggr{]}dt,$

where, as in the proof of Theorem 2,

\vec{\mathfrak{M}}_{i}(t,x)=\left(\mathfrak{M}_{i,1}(t,x),\cdots,\mathfrak{M}_{i,d}(t,x)\right),\quad i=1,2,

(73)

	$\displaystyle\mathfrak{M}_{1,j}(t,x)$	$\displaystyle=\frac{1}{2}\sum_{i=1}^{d}\frac{\partial\psi}{\partial x_{i}}a^{ij}(x)v(t,x),\ j=1,\cdots,d,$		(74)
	$\displaystyle\mathfrak{M}_{2,i}(t,x)$	$\displaystyle=\psi\biggl{(}\frac{1}{2}\sum_{j=1}^{d}\frac{\partial}{\partial x_{j}}(a^{ij}(x)v(t,x))-f_{i}(x)v(t,x)\biggr{)},\ i=1,\cdots,d,$		(74)

$\vec{\mathfrak{n}}$ denotes the outward normal vector of the boundary $\partial B_{R}$ and $dS$ denotes the measure on $\partial B_{R}$ .

Notice that $\psi|_{\partial B_{R}}\equiv 0$ and

\frac{\partial\psi}{\partial x_{j}}=-e^{-\phi(x)}\frac{\partial\phi}{\partial x_{j}}.

(75)

Moreover,

\frac{\partial\phi}{\partial x_{i}}=\frac{2R^{n}\biggl{(}1-\frac{|x|^{2n}}{R^{2n}}\biggr{)}\frac{2n|x|^{2n-2}x_{i}}{R^{2n}}}{1+R^{n}\biggl{(}1-\biggl{(}1-\frac{|x|^{2n}}{R^{2n}}\biggr{)}^{2}\biggr{)}},

(76)

and therefore,

\frac{\partial\phi}{\partial x_{i}}\biggr{|}_{\partial B_{R}}=0=\frac{\partial\psi}{\partial x_{i}}\biggr{|}_{\partial B_{R}},\ i=1,\cdots,d.

(77)

Hence,

$\displaystyle d\Phi(t)=$	$\displaystyle\frac{1}{2}\left(\int_{B_{R}}e^{-\phi(x)}v(t,x)\sum_{i,j=1}^{d}a^{ij}(x)\left(-\frac{\partial^{2}\phi(x)}{\partial x_{i}\partial x_{j}}+\frac{\partial\phi}{\partial x_{i}}\frac{\partial\phi}{\partial x_{j}}\right)dx\right)dt$	(78)
	$\displaystyle-\biggl{(}\int_{B_{R}}\sum_{i=1}^{d}e^{-\phi(x)}v(t,x)f_{i}(x)\frac{\partial\phi}{\partial x_{i}}dx\biggr{)}dt$
	$\displaystyle+\sum_{j=1}^{d}\left(\int_{B_{R}}h_{j}(x)\psi(x)v(t,x)dx\right)dY_{t}^{j}$

Take expectation with respect to the probability measure $\tilde{P}$ , and we have

$\displaystyle\frac{d\tilde{E}\Phi(t)}{dt}=$	$\displaystyle\frac{1}{2}\tilde{E}\left(\int_{B_{R}}\psi(x)v(t,x)\sum_{i,j=1}^{d}a^{ij}(x)\left(-\frac{\partial^{2}\phi(x)}{\partial x_{i}\partial x_{j}}+\frac{\partial\phi}{\partial x_{i}}\frac{\partial\phi}{\partial x_{j}}\right)dx\right)$	(79)
	$\displaystyle-\tilde{E}\biggl{(}\int_{B_{R}}\sum_{i=1}^{d}\psi(x)v(t,x)f_{i}(x)\frac{\partial\phi}{\partial x_{i}}dx\biggr{)}$
	$\displaystyle+e^{-\phi(R)}\frac{1}{2}\tilde{E}\left(\int_{B_{R}}v(t,x)\sum_{i,j=1}^{d}a^{ij}(x)\left(-\frac{\partial^{2}\phi(x)}{\partial x_{i}\partial x_{j}}+\frac{\partial\phi}{\partial x_{i}}\frac{\partial\phi}{\partial x_{j}}\right)dx\right)$
	$\displaystyle-e^{-\phi(R)}\tilde{E}\biggl{(}\int_{B_{R}}\sum_{i=1}^{d}v(t,x)f_{i}(x)\frac{\partial\phi}{\partial x_{i}}dx\biggr{)}$

For $x\in B_{R}$ , $|x_{i}|\leq|x|\leq R$ , $i=1,\cdots,d$ , and together with the Lipschitz conditions for $f(x)$ ,

\left|\frac{\partial\phi}{\partial x_{i}}(x)\right|\leq\frac{4nR^{n}|x|^{2n-2}|x_{i}|}{R^{2n}\biggl{(}1+\frac{|x|^{2n}}{R^{n}}\biggr{)}}\leq 4n,\ \forall\ x\in B_{R};

(80)

$\displaystyle\biggl{\|}f_{i}(x)\frac{\partial\phi}{\partial x_{i}}\biggr{\|}$	$\displaystyle\leq(\|f(0)\|+L\|x\|)\biggl{\|}\frac{\partial\phi}{\partial x_{i}}\biggr{\|}$	(81)
	$\displaystyle\leq 4n\|f(0)\|+4nL\frac{R^{n}\|x\|^{2n-1}\|x_{i}\|}{R^{2n}\biggl{(}1+\frac{\|x\|^{2n}}{R^{n}}\biggr{)}}$
	$\displaystyle\leq 4n(\|f(0)\|+L),\ \forall\ x\in B_{R}.$

Also, according to direct computations,

$\displaystyle\frac{\partial^{2}\phi}{\partial x_{i}\partial x_{j}}=$	$\displaystyle\frac{8n\|x\|^{2n-4}x_{i}x_{j}(R^{2n}(n-1)-(2n-1)\|x\|^{2n})(R^{3n}+2R^{2n}\|x\|^{2n}-\|x\|^{4n})}{(R^{3n}+2R^{2n}\|x\|^{2n}-\|x\|^{4n})^{2}}$	(82)
	$\displaystyle-\frac{16n^{2}\|x\|^{2n-2}x_{i}x_{j}(R^{2n}\|x\|^{2n-2}-\|x\|^{4n-2})(R^{2n}-\|x\|^{2n})}{(R^{3n}+2R^{2n}\|x\|^{2n}-\|x\|^{4n})^{2}}$
	$\displaystyle+\frac{4n\delta_{ij}\|x\|^{2n-2}(R^{2n}-\|x\|^{2n})}{R^{3n}+2R^{2n}\|x\|^{2n}-\|x\|^{4n}}.$

where $\delta_{ij}$ is the Kronecker’s symbol. Thus,

\biggl{|}\frac{\partial^{2}\phi}{\partial x_{i}\partial x_{j}}\biggr{|}\leq 8n(3n-2)+16n^{2}+4n,\ \forall x\in B_{R}.

(83)

We would like to remark that the estimation in (83) is quite rough. Each term on the right-hand side of (83) corresponds to one term on the right-hand side of (82), and the purpose is just to show the second-order derivatives are also bounded by a constant independent of $R$ .

Notice that $e^{-\phi(R)}=\frac{1}{1+R^{n}}$ . Together with the bounded condition for $a^{ij}(x)$ , we have

\frac{d\tilde{E}\Phi(t)}{dt}\leq C_{1}\tilde{E}\Phi(t)+\frac{C_{1}}{1+R^{n}}\tilde{E}\int_{B_{R}}v(t,x)dx

(84)

where $C_{1}>0$ is a constant which depends on $n,d,L$ , but does not depend on $R$ .

According to Theorem 2, the integral

\tilde{E}\int_{B_{R}}v(t,x)dx\leq\tilde{E}\int_{B_{R}}\sigma(t,x)dx\leq\tilde{E}\int_{\mathbb{R}^{d}}\sigma(t,x)dx,

(85)

which is also bounded by a constant independent of $R$ , thus,

\frac{d\tilde{E}\Phi(t)}{dt}\leq C_{1}\tilde{E}\Phi(t)+\frac{C_{2}}{1+R^{n}}.

(86)

where $C_{2}>0$ is a constant which depends on $T,n,d,L$ .

By Gronwall’s inequality,

\tilde{E}\Phi(t)\leq\frac{C_{3}}{1+R^{n}}

(87)

where $C_{3}>0$ is a constant which depends on $T$ , $n$ , $d$ and $L$ .

On the other hand, for $R>5$ and for all $x\in B_{\sqrt{R}}$ , i.e., $|x|\leq\sqrt{R}$ ,

\phi(x)\in\biggl{[}0,\log\left(3-\frac{1}{R^{n}}\right)\biggr{]},\quad\psi(x)\geq\psi(\sqrt{R})=\frac{R^{n}}{3R^{n}-1}-\frac{1}{1+R^{n}}\geq\frac{1}{3}-\frac{1}{6}=\frac{1}{6}.

(88)

Then,

\tilde{E}\Phi(t)=\tilde{E}\int_{B_{R}}\psi(x)v(t,x)dx\geq\tilde{E}\int_{B_{\sqrt{R}}}\psi(x)v(t,x)dx\geq\frac{1}{6}\tilde{E}\int_{B_{\sqrt{R}}}v(t,x)dx.

(89)

Combining (87) and (89), we obtain that, for all $R\gg 1$ ,

\tilde{E}\int_{B_{\sqrt{R}}}|\sigma(t,x)-\sigma_{R}(t,x)|dx=\tilde{E}\int_{B_{\sqrt{R}}}v(t,x)dx\leq\frac{6C_{3}}{1+R^{n}}.

(90)

∎

6 Regularity of the Approximated Function $u_{k}(t,x)$

In this section, we will discuss the regularity of $u_{k}(t,x)$ , $t\in[0,T]$ , which is the solution of a series of coefficient-frozen equations (20).

The main purpose of this section is to show that under mild conditions, the recursively defined functions $u_{k}(t,x)$ will not explode in the finite time interval $[0,T]$ , even if the time-discretization step $\delta\rightarrow 0$ , in the sense that the $L^{2}$ -norm of $u_{k}(\tau_{k},x)$ ( $k=1,\cdots,K$ ) is square integrable with respect to the probability measure $\tilde{P}$ , and the expectations, $\tilde{E}\int_{B_{R}}|u_{k}(\tau_{k},x)|^{2}dx$ , are uniformly bounded for $k=1,\cdots,K$ .

As shown in the next section, this following theorem is an essential intermediate result for the convergence analysis of this time-discretization scheme.

Theorem 4.

Let $\{u_{k}(t,x):\tau_{k-1}\leq t\leq\tau_{k}\}_{k=1}^{K}$ be the solution to the IBV problem of the coefficients-frozen equation (20). Then, with Assumptions (A1) to (A4), the $L^{2}$ -norm of $u_{k}(\tau_{k},x)$ is square-integrable with respect to the probability measure $\tilde{P}$ , and we have

\tilde{E}\int_{B_{R}}|u_{k}(\tau_{k},x)|^{2}dx\leq C<\infty,\ \forall\ k=1,\cdots,K,

(91)

where $C>0$ is a constant that depends on $d$ , $T$ , $R$ , $L$ , but is uniform in $k=1,\cdots,K$ .

In the proof of Theorem 4, we will consider another exponential transformation given by

\sigma_{k}(t,x)=\exp(h^{\top}(x)Y_{\tau_{k-1}})u_{k}(t,x),\ t\in[\tau_{k-1},\tau_{k}],\ k=1,\cdots,K.

(92)

Direct computation implies that $\sigma_{k}(t,x)$ is the solution of

\left\{\begin{aligned} &\frac{\partial\sigma_{k}(t,x)}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}(x)\sigma_{k}(t,x))-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}(x)\sigma_{k}(t,x))\\ &\qquad-\frac{1}{2}|h(x)|^{2}\sigma_{k}(t,x),\ (t,x)\in[\tau_{k-1},\tau_{k}]\times B_{R},\\ &\sigma_{k}(\tau_{k-1},x)=\exp\left(h^{\top}(x)Y_{\tau_{k-1}}\right)u_{k-1}(\tau_{k-1},x),\ x\in B_{R}\\ &\sigma_{k}(t,x)=0,\ (t,x)\in[\tau_{k-1},\tau_{k}]\times\partial B_{R},\end{aligned}\right.

(93)

and recursively, we can rewrite the initial value in (93) by

\sigma_{k}(\tau_{k-1},x)=\exp\left(h^{\top}(x)(Y_{\tau_{k-1}}-Y_{\tau_{k-2}})\right)\sigma_{k-1}(\tau_{k-1},x),\ k=2,\cdots,K.

(94)

Under the reference probability measure $\tilde{P}$ , $\{Y_{t}:0\leq t\leq T\}$ is a Brownian motion and

Y_{\tau_{k}}-Y_{\tau_{k-1}}\sim\mathcal{N}(0,\delta I_{d}),\ k=1,\cdots,K,

(95)

with $I_{d}\in\mathbb{R}^{d\times d}$ the $d$ -dimensional identity matrix. We would like to study the regularity of $\sigma_{k}(t,x)$ first, utilizing the Markov property of $Y$ , and then derive the regularity results for $u_{k}(t,x)$ .

For the sake of discussing the regularity of $\sigma_{k}(t,x)$ in a recursive manner, we need the following lemma which describes the relationship between $\sigma_{k}(\tau_{k-1},x)$ and $\sigma_{k-1}(\tau_{k-1},x)$ from (94).

Lemma 1.

For $k=2,\cdots,K$ , let $\sigma_{k}(t,x)$ , $t\in[\tau_{k-1},\tau_{k}]$ be the solution of (93). The end-point values $\sigma_{k}(\tau_{k-1},x)$ and $\sigma_{k-1}(\tau_{k-1},x)$ satisfy (94). Let us denote by $L^{4}(B_{R})$ the space of quartic-integrable functions in $B_{R}$ . Assume that $\sigma_{k-1}(\tau_{k-1},\cdot)\in L^{4}(B_{R})$ , and the $L^{4}$ -norm, $\|\sigma_{k-1}(\tau_{k-1},\cdot)\|_{L^{4}}$ , is quartic integrable with respect to $\tilde{P}$ , i.e.,

\tilde{E}\int_{B_{R}}\sigma_{k-1}^{4}(\tau_{k-1},x)dx<\infty

(96)

then $\sigma_{k}(\tau_{k-1},\cdot)\in L^{4}(B_{R})$ , its $L^{4}$ -norm, $\|\sigma_{k}(\tau_{k-1},\cdot)\|_{L^{4}}$ is quartic integrable with respect to $\tilde{P}$ , and for sufficiently small time-discretization step size $\delta=\tau_{k}-\tau_{k-1}$ , we have

\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k-1},x)dx\leq(1+C\delta)\tilde{E}\int_{B_{R}}\sigma_{k-1}^{4}(\tau_{k-1},x)dx

(97)

where $C$ is a constant that depends on $d$ and $R$ .

Proof of Lemma 1.

According to the expression (94) and the definition of $\sigma_{k-1}$ on $[\tau_{k-2},\tau_{k-1}]$ , because of the Markov property of $Y$ , $\exp\left(h^{\top}(x)(Y_{\tau_{k-1}}-Y_{\tau_{k-2}})\right)$ is independent of $\sigma_{k-1}(\tau_{k-1},x)$ .

Because the observation function $h$ is assumed to be smooth enough, and $B_{R}$ is a bounded domain in $\mathbb{R}^{d}$ , there exists a constant $M$ , which may depend on $R$ , such that the maximum of the absolute value of $h$ , together with its partial derivatives up to order $m$ , is bounded above by $M$ .

Therefore, by Fubini’s theorem,

	$\displaystyle\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k-1},x)dx$	$\displaystyle=\tilde{E}\int_{B_{R}}\exp\left(4h^{\top}(x)(Y_{\tau_{k-1}}-Y_{\tau_{k-2}})\right)\sigma_{k-1}^{4}(\tau_{k-1},x)dx$		(98)
		$\displaystyle=\int_{B_{R}}\tilde{E}\exp\left(4h^{\top}(x)(Y_{\tau_{k-1}}-Y_{\tau_{k-2}})\right)\tilde{E}\sigma_{k-1}^{4}(\tau_{k-1},x)dx.$		(98)

Next, let us estimate the expectations of functions of normal random variable $\xi:=Y_{\tau_{k-1}}-Y_{\tau_{k-2}}$ arising in the above expressions, for small time-discretization step $\delta$ .

In fact, because $\xi\sim\mathcal{N}(0,\delta I_{d})$ , we have

\tilde{E}\exp\left(4h(x)^{\top}\xi\right)=\prod_{j=1}^{d}\tilde{E}e^{4h_{j}(x)\xi_{j}}=\prod_{j=1}^{d}\left(\int_{\mathbb{R}}\frac{1}{\sqrt{2\pi\delta}}e^{4h_{j}(x)z}e^{-\frac{z^{2}}{2\delta}}dz\right)

(99)

In the bounded domain $B_{R}$ ,

$\displaystyle\int_{\mathbb{R}}\frac{1}{\sqrt{2\pi\delta}}e^{4h_{j}(x)z}e^{-\frac{z^{2}}{2\delta}}dz$	$\displaystyle=\int_{\mathbb{R}}\frac{1}{\sqrt{2\pi}}e^{4h_{j}(x)\sqrt{\delta}z}e^{-\frac{z^{2}}{2}}dz$	(100)
	$\displaystyle=e^{8h_{j}^{2}(x)\delta}\int_{\mathbb{R}}\frac{1}{\sqrt{2\pi}}e^{-\frac{1}{2}(z-4h_{j}(x)\sqrt{\delta})^{2}}dz$
	$\displaystyle=e^{8h_{j}^{2}(x)\delta}\leq e^{8M^{2}\delta}.$

Therefore, for $\delta\ll 1$ (for example $\delta\leq\frac{1}{16M^{2}d}$ ),

\tilde{E}\exp\left(4h(x)^{\top}\xi\right)\leq e^{8dM^{2}\delta}\leq 1+16dM^{2}\delta.

(101)

Thus,

\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k-1},x)dx\leq(1+16dM^{2}\delta)\tilde{E}\int_{B_{R}}\sigma_{k-1}^{4}(\tau_{k-1},x)dx.

(102)

∎

Now, we are ready to give the proof of Theorem 4.

Proof of Theorem 4.

The idea of this proof is to study the regularity of $\sigma_{k}(t,x)$ , recursively, and then obtain the regularity of $u_{k}(t,x)$ based on the relationship (94).

In fact, according to the Cauchy-Schwartz inequality,

	$\displaystyle\tilde{E}\int_{B_{R}}\|u_{k}(\tau_{k},x)\|^{2}dx$	$\displaystyle=\tilde{E}\int_{B_{R}}\exp\left(-2h^{\top}(x)Y_{\tau_{k-1}}\right)\sigma_{k}^{2}(\tau_{k},x)dx$
		$\displaystyle\leq\tilde{E}\biggl{[}\exp\left(2M\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\right)\int_{B_{R}}\sigma_{k}^{2}(\tau_{k},x)dx\biggr{]}$
		$\displaystyle\leq\biggl{(}\tilde{E}\exp\left(4M\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\right)\biggr{)}^{\frac{1}{2}}\biggl{(}\tilde{E}\biggl{(}\int_{B_{R}}\sigma_{k}^{2}(\tau_{k},x)dx\biggr{)}^{2}\biggr{)}^{\frac{1}{2}}$
		$\displaystyle\leq C_{1}\biggl{(}\tilde{E}\exp\left(4M\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\right)\biggr{)}^{\frac{1}{2}}\biggl{(}\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx\biggr{)}^{\frac{1}{2}}$

with $C_{1}>0$ , a constant depending only on $R$ .

Under the reference probability measure $\tilde{P}$ , $\{Y_{t}:0\leq t\leq T\}$ is a standard $d$ -dimensional Brownian motion, and therefore, the expectation

\tilde{E}\exp\left(4M\sum_{j=1}^{d}|Y_{\tau_{k-1},j}|\right)

is bounded.

Hence, it remains to show that there exists a constant $C_{2}>0$ , such that,

\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx\leq C_{2}<\infty,

(103)

holds uniformly for $k=1,\cdots,K$ .

In the time interval $[\tau_{k-1},\tau_{k}]$ , $\sigma_{k}(t,x)$ is the solution to (93). According to the regularity results of parabolic partial differential equations, we have

\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx\leq e^{C_{4}\delta}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k-1},x)dx,\ \forall\ k=1,\cdots,K.

(104)

where $C_{4}$ is a constant which depends on the coefficients of the filtering system. The techniques in the proof of (104) is standard, and the proof of a counterpart, in which $L^{2}$ -norm (instead of $L^{4}$ -norm) is considered, can be found in the textbook [19]. We also provide a detailed proof in the Appendix, for the readers’ convenience and in order to keep this paper self-contained.

Thus, with the result in Lemma 1, there exists $C_{5},C_{6}>0$ , such that for small enough $\delta$ ,

	$\displaystyle\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx$	$\displaystyle\leq e^{C_{4}\delta}\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k-1},x)dx\leq e^{C_{4}\delta}(1+C_{5}\delta)\tilde{E}\int_{B_{R}}\sigma_{k-1}^{4}(\tau_{k-1},x)dx$		(105)
		$\displaystyle\leq(1+C_{6}\delta)\tilde{E}\int_{B_{R}}\sigma_{k-1}^{4}(\tau_{k-1},x)dx.$		(105)

Inductively, we have

	$\displaystyle\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx$	$\displaystyle\leq(1+C_{6}\delta)^{k}\int_{B_{R}}\sigma_{0}^{4}(x)dx$		(106)
		$\displaystyle\leq(1+C_{6}\delta)^{\frac{T}{\delta}}\int_{B_{R}}\sigma_{0}^{4}(x)dx\leq e^{C_{6}T}\int_{B_{R}}\sigma_{0}^{4}(x)dx.$		(106)

Thus, we have proved the boundedness of $\tilde{E}\int_{B_{R}}\sigma_{k}^{4}(\tau_{k},x)dx$ , and also, the result of Theorem 4 holds. ∎

7 Convergence Analysis of the Time Discretization Scheme

This section serves to show that the solution $u_{k}(t,x)$ of the coefficient-frozen equations (20) can approximate the solution $u(t,x)$ of the original robust DMZ equation (18) well, if the time-discretization step size $\delta$ is small enough.

Also, we will show in this section that, after the exponential transformation $\exp(h^{\top}(x)Y_{\tau_{k}})$ , the $L^{1}$ -norm of the difference between the unnormalized densities $\sigma_{R}(\tau_{k},x)$ (defined by (43)) and $\tilde{u}_{k+1}(\tau_{k},x)$ (defined by (22)) still converges to zero, as $\delta\rightarrow 0$ . In particular, the estimation (38) holds in the proof of Theorem 25 in Section 3.

Theorem 5.

Fix $R>0$ . With Assumptions (A1) to (A4), we can use the solution $u_{k}(t,x)$ of equation (20) to approximate the solution $u(t,x)$ of equation (18). In particular, for every $\epsilon>0$ , there exists a constant $\delta>0$ , such that

\tilde{E}\int_{B_{R}}|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)|dx=\tilde{E}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\left|u(\tau_{k},x)-u_{k}(\tau_{k},x)\right|dx<\epsilon,

(107)

holds for every $k=1,\cdots,K$ .

Proof of Theorem 5.

Since $f$ is globally Lipschitz, $h\in C^{2}(B_{R})$ , and $B_{R}$ is a bounded domain, there exists a constant $M_{0}>0$ , such that the absolute value of each component in $f(x)$ and $h(x)$ , as well as there first and second order derivatives, are dominated by $M$ in the ball $B_{R}$ , i.e.,

	$\displaystyle\max\limits_{x\in B_{R}}\biggl{\{}\max\limits_{1\leq i\leq d}\|f_{i}(x)\|,$	$\displaystyle\max\limits_{1\leq i\leq d}\|h_{i}(x)\|,\max\limits_{1\leq i,j\leq d}\biggl{\|}\frac{\partial f_{i}}{\partial x_{j}}\biggr{\|},$		(108)
		$\displaystyle\max\limits_{1\leq i,j\leq d}\biggl{\|}\frac{\partial h_{i}}{\partial x_{j}}\biggr{\|},\max\limits_{1\leq i,j,k\leq d}\biggl{\|}\frac{\partial^{2}f_{i}}{\partial x_{j}\partial x_{k}}\biggr{\|},\max\limits_{1\leq i\leq d}\biggl{\|}\frac{\partial^{2}h_{i}}{\partial x_{j}\partial x_{k}}\biggr{\|}\biggr{\}}\leq M_{0}.$		(108)

Let $B_{R,t}^{+}=\{x\in B_{R}:u(t,x)-u_{k}(t,x)\geq 0\}$ . According to the technical Lemma 4.1 in [2], we have

\frac{d}{dt}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx=\int_{B_{R,t}^{+}}\frac{\partial}{\partial t}(u(t,x)-u_{k}(t,x))dx,

(109)

for almost all $t\in[0,T]$ .

Then, according to equations (18) and (20) satisfied by $u(t,x)$ and $u_{k}(t,x)$ in $[\tau_{k-1},\tau_{k}]$ ,

		$\displaystyle\frac{d}{dt}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx$		(110)
		$\displaystyle=\int_{B_{R,t}^{+}}\frac{\partial}{\partial t}(u(t,x)-u_{k}(t,x))dx$
		$\displaystyle=\frac{1}{2}\int_{B_{R,t}^{+}}\sum_{i,j=1}^{d}a^{ij}(x)\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(u-u_{k})dx+\int_{B_{R,t}^{+}}\sum_{i=1}^{d}F_{i}(\tau_{k-1},x)\frac{\partial}{\partial x_{i}}(u-u_{k})dx$
		$\displaystyle\quad+\int_{B_{R,t}^{+}}J(\tau_{k-1},x)(u(t,x)-u_{k}(t,x))dx$
		$\displaystyle\quad+\int_{B_{R,t}^{+}}\sum_{i=1}^{d}(F_{i}(t,x)-F_{i}(\tau_{k-1},x))\frac{\partial u}{\partial x_{i}}dx+\int_{B_{R,t}^{+}}(J(t,x)-J(\tau_{k-1},x))u(t,x)dx.$

Because $u(t,x)=u_{k}(t,x)\equiv 0$ on the boundary $\partial B_{R}$ , and $\partial B_{R,t}^{+}\subset\partial B_{R}\cup\{x\in B_{R}:u(t,x)-u_{k}(t,x)=0\}$ , we have $(u-u_{k})|_{\partial B_{R,t}^{+}}=0$ and $\nabla(u-u_{k})|_{\partial B_{R,t}^{+}}=-c(x)\vec{\mathfrak{n}}$ with $\vec{\mathfrak{n}}$ the outward normal vector of $B_{R,t}^{+}$ and $c(x)\geq 0$ on $\partial B_{R,t}^{+}$ . Thus, the first three terms on the right-hand side of (110) can be estimated by

$\displaystyle\frac{1}{2}$	$\displaystyle\int_{B_{R,t}^{+}}\sum_{i,j=1}^{d}a^{ij}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(u-u_{k})dx+\int_{B_{R,t}^{+}}\sum_{i=1}^{d}F_{i}(\tau_{k-1},x)\frac{\partial}{\partial x_{i}}(u-u_{k})dx$	(111)
	$\displaystyle+\int_{B_{R,t}^{+}}J(\tau_{k-1},x)(u(t,x)-u_{k}(t,x))dx$
	$\displaystyle=\frac{1}{2}\int_{B_{R,t}^{+}}\sum_{i,j=1}^{d}\frac{\partial^{2}a^{ij}}{\partial x_{i}\partial x_{j}}(u-u_{k})dx-\int_{B_{R,t}^{+}}\sum_{i=1}^{d}\frac{\partial F_{i}(\tau_{k-1},x)}{\partial x_{i}}(u-u_{k})dx$
	$\displaystyle\quad+\int_{B_{R,t}^{+}}J(\tau_{k-1},x)(u(t,x)-u_{k}(t,x))dx$
	$\displaystyle-\frac{1}{2}\int_{\partial B_{R,t}^{+}}\sum_{i,j=1}^{d}a^{ij}\frac{\partial}{\partial x_{i}}(u-u_{k})\frac{\partial}{\partial x_{j}}(u-u_{k})dS$
	$\displaystyle\quad-\frac{1}{2}\int_{\partial B_{R,t}^{+}}(u-u_{k})\sum_{i,j=1}^{d}\frac{\partial a^{ij}}{\partial x_{i}}\vec{\mathfrak{n}}_{j}dS+\int_{\partial B_{R,t}^{+}}(u-u_{k})\sum_{i=1}^{d}F_{i}(\tau_{k-1},x)\vec{\mathfrak{n}}_{i}dS$
	$\displaystyle\leq\biggl{(}\frac{d^{2}L}{2}+C(d,L,M_{0})\biggl{(}1+\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\biggr{)}^{2}\biggr{)}\int_{B_{R,t}^{+}}(u-u_{k})dx,$
	$\displaystyle\leq C_{1}\biggl{(}1+\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\biggr{)}^{2}\int_{B_{R,t}^{+}}(u-u_{k})dx$

where we use the fact that $a(x)=g(x)g(x)^{\top}$ is positive semi-definite and the definition of $F_{i}(t,x)$ and $J(t,x)$ in (17); $C(d,L,M_{0})$ and $C_{1}$ are constants which depend only on $d,L,M_{0}$ ; and $dS$ denotes the measure on $\partial B_{R,t}^{+}$ .

Also, by the definition of $F_{i}(t,x)$ and $J(t,x)$ in (17), we have the following estimation of the differences

		$\displaystyle\|F(t,x)-F(\tau_{k-1},x)\|\leq C_{2}\|Y_{t}-Y_{\tau_{k-1}}\|,$		(112)
		$\displaystyle\|J(t,x)-J(\tau_{k-1},x)\|\leq C_{3}\biggl{(}1+\sum_{j=1}^{d}(\|Y_{t,j}\|+\|Y_{\tau_{k-1},j}\|)\biggr{)}\|Y_{t}-Y_{\tau_{k}}\|,\ \forall\ x\in B_{R},$		(112)

where $C_{2}$ and $C_{3}$ are constants which only depends on $d,L.M_{0}$ .

Hence,

$\displaystyle\frac{d}{dt}\int_{B_{R,t}^{+}}$	$\displaystyle(u(t,x)-u_{k}(t,x))dx\leq C_{1}\biggl{(}1+\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\biggr{)}^{2}\int_{B_{R,t}^{+}}(u-u_{k})dx$	(113)
	$\displaystyle+C_{2}\|Y_{t}-Y_{\tau_{k-1}}\|\int_{B_{R}}\|\nabla u(t,x)\|dx$
	$\displaystyle+C_{3}\biggl{(}1+\sum_{j=1}^{d}(\|Y_{t,j}\|+\|Y_{\tau_{k-1},j}\|)\biggr{)}\|Y_{t}-Y_{\tau_{k}}\|\int_{B_{R}}\|u(t,x)\|dx$

holds for almost all $t\in[\tau_{k-1},\tau_{k}]$ and almost surely, where $C_{1}$ , $C_{2}$ and $C_{3}$ are constants which depend on the coefficients of the system.

Under the reference probability distribution $\tilde{P}$ , the observation process $\{Y_{t}:0\leq t\leq T\}$ is a standard $d$ -dimensional Brownian motion, and therefore,

\displaystyle\sum_{j=1}^{d}(Y_{t,j}-Y_{\tau_{k-1},j})\sim N(0,d(t-\tau_{k-1})).

(114)

Let $\Omega_{M_{1}}=\left\{\omega:\sup\limits_{0\leq t\leq T}\sum_{j=1}^{d}|Y_{t,j}(\omega)|\leq M_{1}\right\}$ be the event which represents the observation process $Y_{t}$ is not severely abnormal, and $1_{A}(\cdot)$ is the indicator function of the set $A$ .

For a fixed $M_{1}>0$ , let us first take the expectation with respect to $\tilde{P}$ on the event $\Omega_{1,M_{1}}$ for both sides of (113), and we have

	$\displaystyle\frac{d}{dt}\tilde{E}$	$\displaystyle\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$
	$\displaystyle\leq$	$\displaystyle\ C_{1}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\biggl{(}1+\sum_{j=1}^{d}\|Y_{\tau_{k-1},j}\|\biggr{)}^{2}\int_{B_{R,t}^{+}}(u-u_{k})dx\biggr{]}$
		$\displaystyle+C_{2}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\|Y_{t}-Y_{\tau_{k-1}}\|\int_{B_{R}}\|\nabla u\|dx\biggr{]}$
		$\displaystyle+C_{3}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\biggl{(}1+\sum_{j=1}^{d}(\|Y_{t,j}\|+\|Y_{\tau_{k-1},j}\|)\biggr{)}\|Y_{t}-Y_{\tau_{k}}\|\int_{B_{R}}\|u\|dx\biggr{]}$
	$\displaystyle\leq$	$\displaystyle\ C_{1}(1+M_{1})^{2}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$
		$\displaystyle+C_{2}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\|Y_{t}-Y_{\tau_{k-1}}\|\int_{B_{R}}\|\nabla u\|dx\biggr{]}$
		$\displaystyle+C_{3}(1+2M_{1})\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\|Y_{t}-Y_{\tau_{k-1}}\|\int_{B_{R}}\|u\|dx\biggr{]}$
	$\displaystyle\leq$	$\displaystyle\ C_{1}(1+M_{1})^{2}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$
		$\displaystyle+C_{2}\biggl{(}\tilde{E}\|Y_{t}-Y_{\tau_{k-1}}\|^{2}\biggr{)}^{\frac{1}{2}}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\left(\int_{B_{R}}\|\nabla u\|dx\right)^{2}\biggr{]}\right)^{\frac{1}{2}}$
		$\displaystyle+C_{3}(1+2M_{1})\left(\tilde{E}\|Y_{t}-Y_{\tau_{k-1}}\|^{2}\right)^{\frac{1}{2}}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\left(\int_{B_{R}}\|u\|dx\right)^{2}\biggr{]}\right)^{\frac{1}{2}}$
	$\displaystyle=$	$\displaystyle\ C_{1}(1+M_{1})^{2}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$
		$\displaystyle+C_{2}d^{\frac{1}{2}}(t-\tau_{k-1})^{\frac{1}{2}}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\left(\int_{B_{R}}\|\nabla u\|dx\right)^{2}\biggr{]}\right)^{\frac{1}{2}}$
		$\displaystyle+C_{3}(1+2M_{1})d^{\frac{1}{2}}(t-\tau_{k-1})^{\frac{1}{2}}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\left(\int_{B_{R}}\|u\|dx\right)^{2}\biggr{]}\right)^{\frac{1}{2}}.$

Here, the second inequality holds because of the property of the event $\Omega_{M_{1}}$ , the third inequality holds according to the Cauchy-Schwartz inequality and the last equality holds because $Y_{t}$ is a normal distributed random vector.

On the event $\Omega_{M_{1}}$ , the observation process $\{Y_{t}:0\leq t\leq T\}$ is bounded. Therefore, according to the regularity results of parabolic partical differential equations (cf. [19], Section 7.1, Theorem 6), the integrals $\int_{B_{R}}|\nabla u|dx$ and $\int_{B_{R}}|u|dx$ are also bounded for almost every $t\in[0,T]$ , as long as $f\in C^{1}(B_{R})$ and $h\in C^{2}(B_{R})$ . Thus,

	$\displaystyle\frac{d}{dt}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}$	$\displaystyle\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$		(115)
		$\displaystyle\leq C_{4}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{+}}(u(t,x)-u_{k}(t,x))dx\biggr{]}+C_{5}(t-\tau_{k-1})^{\frac{1}{2}},$		(115)

where $C_{4},C_{5}>0$ are constants which depend on $d,L,M_{0},M_{1},T$ .

Similarly, we also have the estimation for the integral on the set $B_{R,t}^{-}=\{x\in B_{R}:u(t,x)-u_{k}(t,x)\leq 0\}$ :

	$\displaystyle\frac{d}{dt}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}$	$\displaystyle\int_{B_{R,t}^{-}}(u(t,x)-u_{k}(t,x))dx\biggr{]}$		(116)
		$\displaystyle\leq C_{4}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R,t}^{-}}(u(t,x)-u_{k}(t,x))dx\biggr{]}+C_{5}(t-\tau_{k-1})^{\frac{1}{2}},$		(116)

and thus

	$\displaystyle\frac{d}{dt}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}$	$\displaystyle\int_{B_{R}}\|u(t,x)-u_{k}(t,x)\|dx\biggr{]}$		(117)
		$\displaystyle\leq C_{4}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(t,x)-u_{k}(t,x)\|dx\biggr{]}+2C_{5}(t-\tau_{k-1})^{\frac{1}{2}}.$		(117)

Therefore,

	$\displaystyle\frac{d}{dt}\biggl{(}e^{-C_{4}(t-\tau_{k-1})}$	$\displaystyle\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(t,x)-u_{k}(t,x)\|dx\biggr{]}\biggr{)}$		(118)
		$\displaystyle\leq 2C_{5}e^{-C_{4}(t-\tau_{k-1})}(t-\tau_{k-1})^{\frac{1}{2}},$		(118)

and

	$\displaystyle\tilde{E}\biggl{[}$	$\displaystyle 1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(t,x)-u_{k}(t,x)\|dx\biggr{]}$		(119)
		$\displaystyle\leq e^{C_{4}(t-\tau_{k-1})}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(\tau_{k-1},x)-u_{k}(\tau_{k-1},x)\|dx\biggr{]}+\frac{4}{3}C_{5}(t-\tau_{k-1})^{\frac{3}{2}}\right).$		(119)

Notice that $u_{k}(\tau_{k-1},x)\equiv u_{k-1}(\tau_{k-1},x)$ by definition. Inductively, we have

$\displaystyle\tilde{E}\biggl{[}$	$\displaystyle 1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$	(120)
	$\displaystyle\leq e^{C_{4}\delta}\left(\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(\tau_{k-1},x)-u_{k-1}(\tau_{k-1},x)\|dx\biggr{]}+\frac{4}{3}C_{5}\delta^{\frac{3}{2}}\right)$
	$\displaystyle\leq e^{C_{4}k\delta}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|\sigma_{0}(x)-\sigma_{0}(x)\|dx\biggr{]}+\frac{4}{3}C_{5}\delta^{\frac{3}{2}}\sum_{i=1}^{k}e^{C_{4}(i-1)\delta}$
	$\displaystyle\leq\frac{4}{3}C_{5}\delta^{\frac{3}{2}}ke^{C_{4}k\delta}\leq C_{6}\delta^{\frac{1}{2}}.$

where $C_{6}$ is a constant which depends on $d,L,M_{0},M_{1},T$ .

Also, for the value we are concerned with in (107),

	$\displaystyle\tilde{E}\biggl{[}$	$\displaystyle 1_{\Omega_{M_{1}}}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$		(121)
		$\displaystyle\leq e^{M_{0}M_{1}}\tilde{E}\biggl{[}1_{\Omega_{M_{1}}}\int_{B_{R}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}\leq C_{6}e^{M_{0}M_{1}}\delta^{\frac{1}{2}}$		(121)

On the event $\Omega_{M_{1}}^{c}=\{\omega:\sup_{0\leq t\leq T}\sum_{j=1}^{d}|Y_{t,j}(\omega)|>M_{1}\}$ , let

\overline{Y}_{T}\triangleq\sup_{0\leq t\leq T}\sum_{j=1}^{d}|Y_{t,j}|,

then,

$\displaystyle\tilde{E}$	$\displaystyle\biggl{[}1_{\Omega_{M_{1}}^{c}}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$	(122)
	$\displaystyle\leq\tilde{E}\biggl{[}1_{\Omega_{1,M_{1}}^{c}}\frac{\overline{Y}_{T}}{M_{1}}\exp\left(M_{0}\sum_{j=1}^{d}\|Y_{\tau_{k},j}\|\right)\int_{B_{R}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$
	$\displaystyle\leq\frac{1}{M_{1}}\left(\tilde{E}\biggl{[}\overline{Y}_{T}^{2}\exp\left(2M_{0}\sum_{j=1}^{d}\|Y_{\tau_{k},j}\|\right)\biggr{]}\right)^{\frac{1}{2}}\left(\tilde{E}\left(\int_{B_{R}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\right)^{2}\right)^{\frac{1}{2}}$
	$\displaystyle\leq\frac{C_{7}}{M_{1}}\left(\tilde{E}\xi^{2}\right)^{\frac{1}{2}}\left(\tilde{E}\int_{B_{R}}\|u(\tau_{k},x)\|^{2}dx+\tilde{E}\int_{B_{R}}\|u_{k}(\tau_{k},x)\|^{2}dx\right)^{\frac{1}{2}}.$

where $C_{7}>0$ is a constant which is related to the volume of the $d$ -dimensional ball $B_{R}$ , and $\xi$ is the random variable given by

\xi=\overline{Y}_{T}\exp\left(M_{0}\sum_{j=1}^{d}|Y_{\tau_{k},j}|\right),

(123)

and

\displaystyle\tilde{E}\xi^{2}=\tilde{E}\biggl{[}\overline{Y}_{T}^{2}\exp\left(2M_{0}\sum_{j=1}^{d}|Y_{\tau_{k},j}|\right)\biggr{]}\leq\left(\tilde{E}\overline{Y}_{T}^{4}\right)^{\frac{1}{2}}\left(\tilde{E}\exp\left(4M_{0}\sum_{j=1}^{d}|Y_{\tau_{k},j}|\right)\right)^{\frac{1}{2}}

(124)

According to the Burkholder-Davis-Gundy inequality (cf. [20], Chapter 3, Theorem 3.28, for example), there exists $C_{8}>0$ , such that

\tilde{E}\overline{Y}_{T}^{4}\leq C_{8}\tilde{E}\sum_{j=1}^{d}|Y_{T,j}|^{4}\leq 3C_{8}dT^{2}.

(125)

and also, because $Y_{\tau_{k},j}$ are normal random variables, the expectation of

\exp\left(4M_{0}\sum_{j=1}^{d}Y_{\tau_{k},j}\right)

is bounded.

For the value $\tilde{E}\int_{B_{R}}|u(\tau_{k},x)|^{2}dx$ , because

u(t,x)=\exp\left(-\sum_{j=1}^{d}h_{j}(x)Y_{t,j}\right)\sigma(t,x),

(126)

then

$\displaystyle\tilde{E}\int_{B_{R}}\|u(\tau_{k},x)\|^{2}dx$	$\displaystyle=\tilde{E}\int_{B_{R}}\exp\biggl{(}-2\sum_{j=1}^{d}h_{j}(x)Y_{\tau_{k},j}\biggr{)}\sigma^{2}(\tau_{k},x)dx$	(127)
	$\displaystyle\leq\tilde{E}\biggl{[}\exp\left(2M_{0}\sum_{j=1}^{d}\|Y_{\tau_{k},j}\|\right)\int_{B_{R}}\sigma^{2}(\tau_{k},x)dx\biggr{]}$
	$\displaystyle\leq\left(\tilde{E}\exp\left(4M_{0}\sum_{j=1}^{d}\|Y_{\tau_{k},j}\|\right)\right)^{\frac{1}{2}}\left(\tilde{E}\left(\int_{B_{R}}\|\sigma(\tau_{k},x)\|^{2}dx\right)^{2}\right)^{\frac{1}{2}}.$

Notice that $\sigma(t,x)$ is the solution to the stochastic partial differential equation

d\sigma(t,x)=\mathcal{L}^{*}\sigma(t,x)dt+\sum_{j=1}^{d}h_{j}\sigma(t,x)dY_{t,j}.

(128)

and the boundedness of

\tilde{E}\left(\int_{B_{R}}|\sigma(\tau_{k},x)|^{2}dx\right)^{2}

(129)

follows from the regularity theory of stochastic partial differential equation.

In the monograph [21], the authors provided a similar regularity result, and proved that $\tilde{E}\int_{B_{R}}|\sigma(\tau_{k},x)|^{2}dx$ is bounded by the initial values. Here in our case, we will prove that there exists $C_{9}>0$ , such that

\tilde{E}\left(\int_{B_{R}}|\sigma(\tau_{k},x)|^{2}dx\right)^{2}\leq C_{9}\left(\int_{B_{R}}|\sigma_{0}(x)|^{2}dx\right)^{2}.

(130)

The detailed proof of (130) can be found in the Appendix.

Therefore, we have

\tilde{E}\int_{B_{R}}|u(\tau_{k},x)|^{2}dx\leq C_{10},

(131)

where $C_{10}>0$ is a constant that does not depend on $\delta$ or $M_{1}$ .

Furthermore, as we have discussed in the previous section, $\tilde{E}\int_{B_{R}}|u_{k}(\tau_{k},x)|^{2}dx$ is also bounded above, and thus, we have

\displaystyle\tilde{E}\biggl{[}

\displaystyle 1_{\Omega_{M_{1}}^{c}}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}|u(\tau_{k},x)-u_{k}(\tau_{k},x)|dx\biggr{]}\leq\frac{C_{11}}{M_{1}},

(132)

where $C_{11}$ is a constant which does not depend on $M_{1}$ or $\delta$ .

In summary, for each $\epsilon>0$ , there exists $M_{1}>0$ , such that

\frac{C_{11}}{M_{1}}<\frac{\epsilon}{2},

(133)

and for this particular $M_{1}$ , there exists $\delta>0$ , such that

C_{6}e^{M_{0}M_{1}}\delta^{\frac{1}{2}}<\frac{\epsilon}{2},

(134)

Therefore, for every $k=1,\cdots,K$ ,

$\displaystyle\tilde{E}$	$\displaystyle\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx$	(135)
	$\displaystyle=\tilde{E}\biggl{[}1_{\Omega_{1,M_{1}}}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$
	$\displaystyle\quad+\tilde{E}\biggl{[}1_{\Omega_{1,M_{1}}^{c}}\int_{B_{R}}e^{h^{\top}(x)Y_{\tau_{k}}}\|u(\tau_{k},x)-u_{k}(\tau_{k},x)\|dx\biggr{]}$
	$\displaystyle\leq\ C_{6}e^{M_{0}M_{1}}\delta^{\frac{1}{2}}+\frac{C_{10}}{M_{1}}<\epsilon.$

∎

8 Conclusion

In this paper, we provide a novel convergence analysis of Yau-Yau algorithm from a probabilistic perspective. With very liberal assumptions only on the coefficients of the filtering systems and the initial distributions (without assumptions on particular paths of observations), we can prove that Yau-Yau algorithm can provide accurate approximations with arbitrary precision to a quite broad class of statistics for the conditional distribution of state process given the observations, which includes the most commonly used conditional mean and covariance matrix. Therefore, the capability of Yau-Yau algorithm to solve very general nonlinear filtering problems is theoretically verified in this paper.

In the process of deriving this probabilistic version of the convergence results, we study the properties of the exact solution, $\{\sigma(t,x):0\leq t\leq T\}$ , to the DMZ equation and the approximated solution $\{\tilde{u}_{k+1}(\tau_{k},x):1\leq k\leq K\}$ , given by Yau-Yau algorithm, respectively.

For the exact solution $\sigma(t,x)$ of the DMZ equation, we have shown in Section 4 and Section 5 that most of the density of $\sigma(t,x)$ will remain in the closed ball $B_{R}$ , and $\sigma(t,x)$ can be approximated well by the corresponding initial-boundary value problem of DMZ equation in $B_{R}$ . This result also implies that it is very unlikely for the state process to reach infinity within finite terminal time.

For the approximated solution $\tilde{u}_{k+1}(\tau_{k},x)$ given by Yau-Yau algorithm, we have first proved in Section 6 that $\tilde{u}_{k+1}(\tau_{k},x)$ , which evolves in a recursive manner, will not explode in finite time interval, even if the time-discretization step $\delta\rightarrow 0$ . And then, in Section 7, the convergence of $\tilde{u}_{k+1}(\tau_{k},x)$ is proved and the convergence rate is also estimated to be $\sqrt{\delta}$ .

It is clear that the properties of exact solutions and approximated solutions, which we have proved in this paper, highly rely on the nice properties of Brownian motion and Gaussian distributions, especially the Markov and light-tail properties. On the one hand, Brownian motion and Gaussian distribution are up to now, among the most commonly used objects in the mathematical modeling of many areas of applications, and can describe most scenarios in practice. On the other hand, for those systems driven by non-Markov or heavy-tailed processes, minimum mean square criteria, together with the conditional expectations (if exist), may not result in a satisfactory estimation of the state process. In this case, the studies of estimations based on other criteria, such as maximum a posteriori (MAP) [22][23][24], will be a promising direction.

Finally, in this paper, we only consider filtering systems and conduct convergence analysis in time interval $[0,T]$ with a fixed finite terminal time $T$ . It is also interesting to study the behavior of the DMZ equation and the approximation capability of Yau-Yau algorithm in the case where the terminal time $T\rightarrow\infty$ , especially for filtering systems with further stable assumptions. We will continue working on how to combine the existing studies on filter stability, such as [25][26], with our techniques developed in this paper, and hopefully, obtain some convergence results of Yau-Yau algorithm for the whole time line $(0,\infty)$ .

Declarations

Funding

This work is supported by National Natural Science Foundation of China (NSFC) grant (12201631) and Tsinghua University Education Foundation fund (042202008).

Conflict of interest/Competing interests

The authors have no competing interests to declare that are relevant to the content of this article.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Data, Materials and/or Code availability

Not applicable.

Author contribution

All authors contributed to the study conception and design. The first draft of the manuscript was written by Zeju Sun and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Appendix A Regularity Results of Parabolic Partial Differential Equation and Stochastic Evolution Equation

In this appendix, we will provide a detailed proof of the regularity results of the parabolic partial differential equation and the stochastic evolution equation.

For the purpose of deriving (104) and (130), the regularity results is slightly different from standard ones considered in square-integrable functional spaces.

Theorem 6.

Let $\sigma(t,x)$ be the solution of the following IBV problem:

\left\{\begin{aligned} &\frac{\partial\sigma(t,x)}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}(x)\sigma(t,x))-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}(x)\sigma(t,x))\\ &\qquad-\frac{1}{2}|h(x)|^{2}\sigma(t,x),\ (t,x)\in[0,T]\times B_{R},\\ &\sigma(0,x)=\sigma_{0}(x),\ x\in B_{R}\\ &\sigma(t,x)=0,\ (t,x)\in[0,T]\times\partial B_{R},\end{aligned}\right.

(136)

where $B_{R}=\{x\in\mathbb{R}^{d}:|x|\leq R\}$ is the ball in $\mathbb{R}^{d}$ with radius $R$ ; $a:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d\times d}$ , $f:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ , $h:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ are smooth enough functions. Assume that the matrix-valued function $a(x)$ is uniformly positive definite, i.e., there exists $\lambda>0$ , such that

\sum_{i,j=1}^{d}a^{ij}(x)\xi_{i}\xi_{j}\geq\lambda|\xi|^{2},\ \forall\ x\in B_{R},\ \xi\in\mathbb{R}^{d}.

(137)

If the initial value $\sigma_{0}(x)$ is quartic-integrable in $B_{R}$ , then there exists a constant $C>0$ , which depends on the coefficients of the system, such that

\int_{B_{R}}\sigma^{4}(T,x)dx\leq e^{CT}\int_{B_{R}}\sigma_{0}^{4}(x)dx.

(138)

Remark 1.

In fact, Assumption (A2) in the main text will imply the coercivity condition (137). This is because the closed ball $B_{R}$ is a compact set of $\mathbb{R}^{d}$ , and the continuous function $\lambda(x)$ in Assumption (A2) will map $B_{R}$ to a compact set. Therefore, there exists $\lambda>0$ , such that $\lambda(x)\geq\lambda>0$ , for all $x\in B_{R}$ .

Proof.

Let us define

\tilde{f}_{i}(x)=f_{i}(x)-\sum_{j=1}^{d}\frac{\partial a^{ij}(x)}{\partial x},\ i=1,\cdots,d.

(139)

Then the parabolic equation (136) can be written in a divergence form

\frac{\partial\sigma(t,x)}{\partial t}=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial}{\partial x_{i}}\left(a^{ij}(x)\frac{\partial}{\partial x_{j}}\sigma(t,x)\right)-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(\tilde{f}_{i}(x)\sigma(t,x))-\frac{1}{2}|h(x)|^{2}\sigma(t,x).

(140)

Hence,

$\displaystyle\frac{d}{dt}$	$\displaystyle\int_{B_{R}}\sigma^{4}(t,x)dx=\int_{B_{R}}4\sigma^{3}(t,x)\frac{\partial\sigma}{\partial t}dx$	(141)
	$\displaystyle=\int_{B_{R}}2\sigma^{3}\sum_{i,j=1}^{d}\frac{\partial}{\partial x_{i}}\left(a^{ij}\frac{\partial}{\partial x_{j}}\sigma\right)dx-\int_{B_{R}}4\sigma^{3}\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(\tilde{f}_{i}\sigma)dx-\int_{B_{R}}2\sigma^{4}\|h\|^{2}dx$
	$\displaystyle=-6\int_{B_{R}}\sigma^{2}\sum_{i,j=1}^{d}a^{ij}\frac{\partial\sigma}{\partial x_{i}}\frac{\partial\sigma}{\partial x_{j}}dx+12\int_{B_{R}}\sum_{i=1}^{d}\tilde{f}_{i}\sigma^{3}\frac{\partial\sigma}{\partial x_{i}}dx-2\int_{B_{R}}\sigma^{4}\|h\|^{2}dx$
	$\displaystyle\leq-6\lambda\int_{B_{R}}\sigma^{2}\|\nabla\sigma\|^{2}dx+12\int_{B_{R}}\sum_{i=1}^{d}\frac{\tilde{f}_{i}\sigma^{2}}{\sqrt{\lambda}}\cdot\left(\sqrt{\lambda}\sigma\frac{\partial\sigma}{\partial x_{i}}\right)dx-2\int_{B_{R}}\sigma^{4}\|h\|^{2}dx$
	$\displaystyle\leq-6\lambda\int_{B_{R}}\sigma^{2}\|\nabla\sigma\|^{2}dx+12\int_{B_{R}}\sum_{i=1}^{d}\biggl{(}\frac{\tilde{f}_{i}^{2}\sigma^{4}}{2\lambda}+\frac{\lambda}{2}\sigma^{2}\biggl{\|}\frac{\partial\sigma}{\partial x_{i}}\biggr{\|}^{2}\biggr{)}dx-2\int_{B_{R}}\sigma^{4}\|h\|^{2}dx$
	$\displaystyle\leq\int_{B_{R}}\biggl{(}\frac{6}{\lambda}\sum_{i=1}^{d}\tilde{f}_{i}^{2}-2\|h\|^{2}\biggr{)}\sigma^{4}(t,x)dx.$

In the bounded domain $B_{R}$ , there exists a constant $C>0$ , such that

\biggl{|}\frac{6}{\lambda}\sum_{i=1}^{d}\tilde{f}_{i}^{2}-2|h|^{2}\biggr{|}\leq C.

(142)

Thus,

\frac{d}{dt}\int_{B_{R}}\sigma^{4}(t,x)dx\leq C\int_{B_{R}}\sigma^{4}(t,x)dx,\ t\in[0,T],

(143)

and by Gronwall’s inequality, we have

\int_{B_{R}}\sigma^{4}(T,x)dx\leq e^{CT}\int_{B_{R}}\sigma_{0}^{4}(x)dx.

(144)

∎

Theorem 7.

Consider the IBV problem of stochastic partial differential equation given by

\left\{\begin{aligned} &d\sigma(t,x)=\mathcal{L}^{*}\sigma(t,x)dt+\sum_{j=1}^{d}h_{j}(x)\sigma(t,x)dY_{t,j},\ t\in[0,T]\\ &\sigma(t,x)=0,\ (t,x)\in[0,T]\times\partial B_{R},\\ &\sigma(0,x)=\sigma_{0}(x),\ x\in B_{R}.\end{aligned}\right.

(145)

where $Y=\{Y_{t}:0\leq t\leq T\}$ is a standard $d$ -dimensional Brownian motion in the filtered probability space $\left(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{0\leq t\leq T},P\right)$ ; $B_{R}=\{x\in\mathbb{R}^{d}:|x|\leq R\}$ is the ball in $\mathbb{R}^{d}$ with radius $R$ , and

\mathcal{L}^{*}(\star)=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial^{2}}{\partial x_{i}\partial x_{j}}(a^{ij}(x)\star)-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(f_{i}(x)\star).

(146)

Assume that the coefficients $a$ , $f$ , $h$ are smooth enough and the Assumption (A2) holds for the matrix-valued function $a(x)$ , which implies that $a(x)$ is uniformly positive definite in $B_{R}$ , i.e., there exists $\lambda>0$ , such that

\sum_{i,j=1}^{d}a^{ij}(x)\xi_{i}\xi_{j}\geq\lambda|\xi|^{2},\ \forall\ x\in B_{R},\ \xi\in\mathbb{R}^{d}.

(147)

If the initial value $\sigma_{0}(x)$ is square-integrable in $B_{R}$ , then there exists a constant $C>0$ , which depends on $T$ , $R$ and the coefficients of the system, such that

E\left(\int_{B_{R}}|\sigma(T,x)|^{2}dx\right)^{2}\leq C\left(\int_{B_{R}}|\sigma_{0}(x)|^{2}dx\right)^{2}.

(148)

Proof.

Let us define

\tilde{f}_{i}(x)=f_{i}(x)-\sum_{j=1}^{d}\frac{\partial a^{ij}(x)}{\partial x},\ i=1,\cdots,d.

(149)

Then the stochastic partial differential equation in (145) can be rewritten in divergence form:

d\sigma(t,x)=\frac{1}{2}\sum_{i,j=1}^{d}\frac{\partial}{\partial x_{i}}\biggl{(}a^{ij}\frac{\partial\sigma}{\partial x_{j}}\biggr{)}-\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(\tilde{f}_{i}\sigma)+\sum_{j=1}^{d}h_{j}\sigma dY_{t,j}.

(150)

Let

\Phi(t)=\int_{B_{R}}\sigma^{2}(t,x)dx,\ t\in[0,T],

(151)

then according to Itô’s formula,

\displaystyle d\Phi(t)=\left(\int_{B_{R}}(2\sigma\mathcal{L}^{*}\sigma+\sigma^{2}|h|^{2})dx\right)dt+\sum_{j=1}^{d}\left(\int_{B_{R}}2h_{j}\sigma^{2}dx\right)dY_{t,j}

(152)

and

	$\displaystyle d\Phi^{2}(t)$	$\displaystyle=2\left(\int_{B_{R}}\sigma^{2}dx\right)\left(\int_{B_{R}}(2\sigma\mathcal{L}^{*}\sigma+\sigma^{2}\|h\|^{2})dx\right)dt$		(153)
		$\displaystyle\quad+2\Phi(t)\sum_{j=1}^{d}\left(\int_{B_{R}}2h_{j}\sigma^{2}dx\right)dY_{t,j}+\sum_{j=1}^{d}\left(\int_{B_{R}}2h_{j}\sigma^{2}dx\right)^{2}dt$		(153)

After taking expectations, we have

$\displaystyle\frac{d}{dt}E\Phi^{2}(t)$	$\displaystyle=\frac{d}{dt}E\left(\int_{B_{R}}\sigma^{2}(t,x)dx\right)^{2}$	(154)
	$\displaystyle=E\biggl{[}2\left(\int_{B_{R}}\sigma^{2}dx\right)\left(\int_{B_{R}}(2\sigma\mathcal{L}^{*}\sigma+\sigma^{2}\|h\|^{2})dx\right)$
	$\displaystyle\qquad+\sum_{j=1}^{d}\left(\int_{B_{R}}2h_{j}\sigma^{2}dx\right)^{2}\biggr{]}$

Notice that

$\displaystyle\int_{B_{R}}2\sigma\mathcal{L}^{*}\sigma dx$	$\displaystyle=\int_{B_{R}}\sigma\sum_{i,j=1}^{d}\frac{\partial}{\partial x_{i}}\biggl{(}a^{ij}\frac{\partial\sigma}{\partial x_{j}}\biggr{)}dx-\int_{B_{R}}2\sigma\sum_{i=1}^{d}\frac{\partial}{\partial x_{i}}(\tilde{f}_{i}\sigma)dx$	(155)
	$\displaystyle=-\int_{B_{R}}\sum_{i,j=1}^{d}a^{ij}\frac{\partial\sigma}{\partial x_{i}}\frac{\partial\sigma}{\partial x_{j}}dx+2\int_{B_{R}}\sum_{i=1}^{d}\tilde{f}_{i}\sigma\frac{\partial\sigma}{\partial x_{i}}dx$
	$\displaystyle\leq-\lambda\int_{B_{R}}\|\nabla\sigma\|^{2}dx+2\int_{B_{R}}\sum_{i=1}^{d}\frac{\tilde{f}_{i}\sigma}{\sqrt{\lambda}}\cdot\biggl{(}\sqrt{\lambda}\frac{\partial\sigma}{\partial x_{i}}\biggr{)}dx$
	$\displaystyle\leq-\lambda\int_{B_{R}}\|\nabla\sigma\|^{2}dx+\int_{B_{R}}\frac{1}{\lambda}\sum_{i=1}^{d}\tilde{f}_{i}^{2}\sigma^{2}dx+\lambda\int_{B_{R}}\|\nabla\sigma\|^{2}dx.$

Hence,

	$\displaystyle\frac{d}{dt}E\left(\int_{B_{R}}\sigma^{2}(t,x)dx\right)^{2}$	$\displaystyle\leq 2E\biggl{[}\biggl{(}\int_{B_{R}}\sigma^{2}dx\biggr{)}\biggl{(}\int_{B_{R}}\biggl{(}\frac{1}{\lambda}\|\tilde{f}\|^{2}+\|h\|^{2}\biggr{)}\sigma^{2}dx\biggr{)}\biggr{]}$		(156)
		$\displaystyle\quad+E\biggl{[}\sum_{j=1}^{d}\left(\int_{B_{R}}2h_{j}\sigma^{2}dx\right)^{2}\biggr{]}$		(156)

In the bounded domain $B_{R}$ , there exists $M>0$ , such that

\frac{1}{\lambda}|\tilde{f}(x)|^{2}+|h(x)|^{2}\leq M,\ |h_{j}(x)|\leq M,\ \forall\ x\in B_{R}.

(157)

Thus,

\frac{d}{dt}E\left(\int_{B_{R}}\sigma^{2}(t,x)dx\right)^{2}\leq(2M+4dM^{2})E\left(\int_{B_{R}}\sigma^{2}(t,x)dx\right)^{2}

(158)

According to Gronwall’s inequality,

E\left(\int_{B_{R}}\sigma^{2}(T,x)dx\right)^{2}\leq e^{(2M+4dM^{2})T}\left(\int_{B_{R}}\sigma_{0}^{2}(x)dx\right)^{2},

(159)

which is the desired result. ∎

References

\bibcommenthead
Yau and Yau [2000] Yau, S.-T., Yau, S.S.-T.: Real time solution of nonlinear filtering problem without memory i. Mathematical research letters 7(6), 671–693 (2000)
Yau and Yau [2008] Yau, S.-T., Yau, S.S.-T.: Real time solution of the nonlinear filtering problem without memory ii. SIAM Journal on Control and Optimization 47(1), 163–195 (2008)
Candy [2016] Candy, J.V.: Bayesian Signal Processing: Classical, Modern, and Particle Filtering Methods, (2016)
Roth et al. [2017] Roth, M., Hendeby, G., Fritsche, C., Gustafsson, F.: The ensemble kalman filter: a signal processing perspective. EURASIP Journal on Advances in Signal Processing 2017, 1–16 (2017)
Galanis et al. [2006] Galanis, G., Louka, P., Katsafados, P., Pytharoulis, I., Kallos, G.: Applications of kalman filters based on non-linear functions to numerical weather predictions. Ann. Geophys 24, 2451–2460 (2006)
Chen and Yu [2014] Chen, K., Yu, J.: Short-term wind speed prediction using an unscented kalman filter based state-space support vector regression approach. Applied energy 113, 690–705 (2014)
Ichard [2015] Ichard, C.: Random media and processes estimation using non-linear filtering techniques: application to ensemble weather forecast and aircraft trajectories. PhD thesis, Université de Toulouse, Université Toulouse III-Paul Sabatier (2015)
Sun et al. [2019] Sun, J., Blom, H.A., Ellerbroek, J., Hoekstra, J.M.: Particle filter for aircraft mass estimation and uncertainty modeling. Transportation Research Part C: Emerging Technologies 105, 145–162 (2019)
Jazwinski [2007] Jazwinski, A.H.: Stochastic Processes and Filtering Theory. Courier Corporation, (2007)
Bain [2009] Bain, A.: Fundamentals of Stochastic Filtering, 1st ed. 2009. edn. Springer, New York, NY (2009)
Duncan [1967] Duncan, T.E.: Probability densities for diffusion processes with applications to nonlinear filtering theory and detection theory. PhD thesis, Stanford University, Stanford, California (May 1967)
Mortensen [1966] Mortensen, R.E.: Optimal control of continuous-time stochastic systems. Technical report, California Univ. Berkeley Electronics Research Lab (1966)
Zakai [1969] Zakai, M.: On the optimal filtering of diffusion processes. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 11(3), 230–243 (1969)
Luo and Yau [2013] Luo, X., Yau, S.S.-T.: Hermite spectral method to 1-d forward kolmogorov equation and its application to nonlinear filtering problems. IEEE Transactions on Automatic Control 58(10), 2495–2507 (2013)
Dong et al. [2020] Dong, W., Luo, X., Yau, S.S.-T.: Solving nonlinear filtering problems in real time by legendre galerkin spectral method. IEEE Transactions on Automatic Control 66(4), 1559–1572 (2020)
Wang et al. [2019] Wang, Z., Luo, X., Yau, S.S.-T., Zhang, Z.: Proper orthogonal decomposition method to nonlinear filtering problems in medium-high dimension. IEEE Transactions on Automatic Control 65(4), 1613–1624 (2019)
Li et al. [2022] Li, S., Wang, Z., Yau, S.S.-T., Zhang, Z.: Solving nonlinear filtering problems using a tensor train decomposition method. IEEE Transactions on Automatic Control (2022)
Chekroun et al. [2016] Chekroun, M.D., Park, E., Temam, R.: The stampacchia maximum principle for stochastic partial differential equations and applications. Journal of Differential Equations 260(3), 2926–2972 (2016)
Evans [2010] Evans, L.C.: Partial Differential Equations, 2nd ed. edn. American Mathematical Society, (2010)
Karatzas [1998] Karatzas, I.: Brownian Motion and Stochastic Calculus, 2nd edn. New York: Springer, (1998)
Rozovsky [2018] Rozovsky, B.L.: Stochastic Evolution Systems Linear Theory and Applications to Non-Linear Filtering, 2nd edn. Cham : Springer, (2018)
Godsill et al. [2001] Godsill, S., Doucet, A., West, M.: Maximum a posteriori sequence estimation using monte carlo particle filters. Annals of the Institute of Statistical Mathematics 53, 82–96 (2001)
Saha et al. [2012] Saha, S., Mandal, P.K., Bagchi, A., Boers, Y., Driessen, J.N.: Particle based smoothed marginal map estimation for general state space models. IEEE transactions on signal processing 61(2), 264–273 (2012)
Kang et al. [2023] Kang, J., Salmon, A., Yau, S.S.-T.: Log-concave posterior densities arising in continuous filtering and a maximum a posteriori algorithm. SIAM Journal on Control and Optimization 61(4), 2407–2424 (2023)
Atar [1998] Atar, R.: Exponential stability for nonlinear filtering of diffusion processes in a noncompact domain. Annals of Probability 26(4), 1552–1574 (1998)
Ocone and Pardoux [1996] Ocone, D., Pardoux, E.: Asymptotic stability of the optimal filter with respect to its initial condition. SIAM Journal on Control and Optimization 34(1), 226–243 (1996)

	$\displaystyle E$	$\displaystyle\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{Z}_{\tau_{k}}\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{Z}_{\tau_{k}}\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle\leq\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]\biggl{\|}\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]}-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\biggr{]}$
		$\displaystyle\leq\tilde{E}\biggl{[}\biggl{\|}\tilde{E}[\varphi(X_{\tau_{k}})\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\frac{\int_{B_{R}}\|\varphi(x)\|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggl{\|}\tilde{E}[\tilde{Z}_{\tau_{k}}\|\mathcal{Y}_{\tau_{k}}]-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle=\tilde{E}\biggl{[}\biggl{\|}\int_{\mathbb{R}^{d}}\varphi(x)\sigma(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\quad+\tilde{E}\biggl{[}\frac{\int_{B_{R}}\|\varphi(x)\|\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggl{\|}\int_{\mathbb{R}^{d}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
		$\displaystyle\triangleq I_{1}+I_{2}.$

$\displaystyle I_{1}\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\varphi(x)\sigma(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\sigma_{R}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
	$\displaystyle+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\varphi(x)\sigma_{R}(\tau_{k},x)dx-\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
$\displaystyle\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+\tilde{E}\int_{B_{R}}\|\varphi(x)\|\cdot\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle+\tilde{E}\int_{B_{R}}\|\varphi(x)\|\cdot\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx$
$\displaystyle\leq$	$\displaystyle\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$	(29)

$\displaystyle I_{2}$	$\displaystyle\leq L(1+R^{2m})\tilde{E}\biggl{[}\biggl{\|}\int_{\mathbb{R}^{d}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$	(32)
	$\displaystyle\leq L(1+R^{2m})\biggl{(}\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx+\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\sigma(\tau_{k},x)dx-\int_{B_{R}}\sigma_{R}(\tau_{k},x)dx\biggr{\|}\biggr{]}\biggr{)}$
	$\displaystyle\quad+L(1+R^{2m})\tilde{E}\biggl{[}\biggl{\|}\int_{B_{R}}\sigma_{R}(\tau_{k},x)dx-\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx\biggr{\|}\biggr{]}$
	$\displaystyle\leq L(1+R^{2m})\biggl{(}\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx+\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx\biggr{)}$
	$\displaystyle\quad+L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$

$\displaystyle E$	$\displaystyle\biggl{\|}E[\varphi(X_{\tau_{k}})\|\mathcal{Y}_{\tau_{k}}]-\frac{\int_{B_{R}}\varphi(x)\tilde{u}_{k+1}(\tau_{k},x)dx}{\int_{B_{R}}\tilde{u}_{k+1}(\tau_{k},x)dx}\biggr{\|}\leq I_{1}+I_{2}$	(33)
	$\displaystyle\leq\tilde{E}\int_{\|x\|\geq R}\|\varphi(x)\|\sigma(\tau_{k},x)dx+L(1+R^{m})\tilde{E}\int_{\|x\|\geq R}\sigma(\tau_{k},x)dx$
	$\displaystyle\quad+2L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma(\tau_{k},x)-\sigma_{R}(\tau_{k},x)\|dx$
	$\displaystyle\quad+2L(1+R^{2m})\tilde{E}\int_{B_{R}}\|\sigma_{R}(\tau_{k},x)-\tilde{u}_{k+1}(\tau_{k},x)\|dx.$

$\displaystyle\|\mathfrak{F}(x)\|$	$\displaystyle\leq d^{2}(4n^{2}+n)L+\frac{2n\|x\|^{2n-2}}{1+\|x\|^{2n}}\sum_{i=1}^{d}\frac{\|f_{i}(x)\|^{2}+\|x_{i}\|^{2}}{2}$	(60)
	$\displaystyle\leq d^{2}(4n^{2}+n)L+\frac{n\|x\|^{2n}+n\|x\|^{2n-2}\|f(x)\|^{2}}{1+\|x\|^{2n}}$
	$\displaystyle\leq d^{2}(4n^{2}+n)L+n(L^{2}+1)+n\|f(0)\|^{2}+2nL\|f(0)\|,\ \forall x\in\mathbb{R}^{d}.$

On the Convergence Analysis of Yau-Yau Nonlinear Filtering Algorithm: from a Probabilistic Perspective

Abstract

keywords:

pacs:

1 Introduction

2 Preliminaries

3 Main Results

Theorem 1.

A Sketch of the Proof of Theorem 25.

4 Estimation of the density outside the ball BRB_{R}

Theorem 2.

Proof of Theorem 2.

5 Approximation of σ​(t,x)\sigma(t,x) by the IBV problem in BRB_{R}

Theorem 3.

Proof of Theorem 3.

6 Regularity of the Approximated Function uk​(t,x)u_{k}(t,x)

Theorem 4.

Lemma 1.

Proof of Lemma 1.

Proof of Theorem 4.

7 Convergence Analysis of the Time Discretization Scheme

Theorem 5.

Proof of Theorem 5.

8 Conclusion

Declarations

Funding

Conflict of interest/Competing interests

Ethics approval and consent to participate

Consent for publication

Data, Materials and/or Code availability

Author contribution

Appendix A Regularity Results of Parabolic Partial Differential Equation and Stochastic Evolution Equation

Theorem 6.

Remark 1.

Proof.

Theorem 7.

Proof.

References

4 Estimation of the density outside the ball $B_{R}$

5 Approximation of $\sigma(t,x)$ by the IBV problem in $B_{R}$

6 Regularity of the Approximated Function $u_{k}(t,x)$