Linear Quadratic Control of Backward Stochastic Differential Equation with Partial Information ^†^†thanks: This work supported by the National Natural Science Foundations of China under Grants 61821004, 61633015, 61877062, and 61977043.

Guangchen Wang, Wencan Wang, Zhiguo Yan School of Control Science and Engineering, Shandong University, Jinan 250061, PR China, E-mail: [email protected]School of Control Science and Engineering, Shandong University, Jinan 250061, PR China, E-mail: [email protected]School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China, E-mail:[email protected]

Abstract: In this paper, we study an optimal control problem of linear backward stochastic differential equation (BSDE) with quadratic cost functional under partial information. This problem is solved completely and explicitly by using a stochastic maximum principle and a decoupling technique. By using the maximum principle, a stochastic Hamiltonian system, which is a forward-backward stochastic differential equation (FBSDE) with filtering, is obtained. By decoupling the stochastic Hamiltonian system, three Riccati equations, a BSDE with filtering, and a stochastic differential equation (SDE) with filtering are derived. We then get an optimal control with a feedback representation. An explicit formula for the corresponding optimal cost is also established. As illustrative examples, we consider two special scalar-valued control problems and give some numerical simulations.

Keywords: Linear quadratic optimal control; backward stochastic differential equation; filtering; Ricatti equation; feedback representation.

Mathematics Subject Classification: 93E20, 60H10

1 Introduction

A BSDE is an Itô SDE for which a random terminal rather than an initial condition on state has been specified. Bismut [1] first introduced a linear BSDE, which is an adjoint equation of stochastic optimal control problem. Pardoux and Peng [2] extended the linear BSDE to a general case. Since then, there has been considerable attention on related topics and their applications among researchers in mathematical finance and stochastic optimal control. See for example, El Karoui et al. [3], Ma and Yong [4], Kohlmann and Zhou [5].

Since BSDE stems from stochastic control theory, it is very natural and appealing to investigate optimal control problem of BSDE. Moreover, controlled BSDE is expected to have wide and important applications in various fields, especially in mathematical finance. In financial investment, a European contingent claim $\xi$ , which is a random variable, can be thought as a contract to be guaranteed at maturity $T$ . Peng [6] and Dokuchaev and Zhou [7] derived local and global stochastic maximum principles of optimality for BSDEs, respectively. Linear quadratic (LQ) optimal control problems of BSDEs have also been investigated. Lim and Zhou [8] discussed an LQ control problem of BSDE with a general setting and gave a feedback representation of the optimal control. Li et al. [9] extended the results in [8] to the case with mean-field term. Huang et al. [10] and Du et al. [11] considered LQ backward mean-field games. Du and Wu [12] concerned a stackelberg game for mean field linear BSDE with quadratic cost functionals.

In this paper, we investigate an LQ control problem of BSDE with partial information, which will be referred as a stochastic backward LQ control problem. We are devoted to deriving the optimal control with a feedback representation and establishing an explicit formula for the corresponding optimal cost. Note that the mentioned papers above are concentrated on the complete information case. The motivation of studying stochastic control problems with partial information arises naturally from the area of financial economics. In a portfolio and consumption problem, let $\mathbb{F}\equiv\{\mathcal{F}_{t}\}_{t\geq 0}$ denote the flow of information generated by all market noises. In reality, the information available to an agent maybe less than the one produced by the market noises, that is, $\mathcal{G}_{t}\subseteq\mathcal{F}_{t}$ , where $\mathbb{G}\equiv\{\mathcal{G}_{t}\}_{t\geq 0}$ is the information available to the agent. There are considerable literatures on related topics [13, 14, 15, 16, 17, 18]. In particular, Huang et al. [15] derived a necessary condition for optimality of BSDE with partial information and applied their results to two classes of LQ problems. Wang et al. [16] and Wang et al. [17] concerned LQ problems with partially observable information driven by FBSDE and mean field FBSDE, respectively. LQ non-zero sum stochastic differential game of BSDE is considered in Wang et al. [18]. They obtained feedback Nash equilibrium points by FBSDE and Riccati equation under asymmetric information.

Our work distinguishes itself from existing literatures in the following aspects. (i) Both the generator of dynamic system and the cost functional contain diffusion terms $Z_{1}$ and $Z_{2}$ . Moreover, our results are obtained under some usual conditions (see Assumptions $A1$ and $A2$ in section 2). In the literatures on this topic, diffusion terms $Z_{1}$ and $Z_{2}$ are usually assumed not to be contained into the generator (see [15], [17]), or there are some additional conditions to ensure the solvability of Riccati equation (see [16], [18]). (ii) Sufficient and necessary conditions of optimality are established, which provide an expression for optimal control via the solution of stochastic Hamiltonian system. (iii) Explicit representations for optimal control in terms of three Riccati equations, a BSDE with filtering, and an SDE with filtering are obtained, as well as the associated optimal cost. The derivation of associated Riccati equations is extremely different from [8] and [9], since the stochastic Hamiltonian system is an FBSDE with filtering. Moreover, the uniqueness and existence of solution to BSDE (3.5) is first obtained, which is important in deriving explicit representations for optimal control and associated optimal cost. (iv) Last but not least, we consider two special scalar-valued control problems of BSDEs with partial information. In the case of $H=N_{1}=0$ , we obtain explicit solutions of the stochastic Hamiltonian system, as well as related Riccati equations. In the case of $C_{2}=0$ , we give some numerical simulations to illustrate our theoretical results.

The rest of this paper is organized as follows. In Section 2, we formulate the stochastic backward LQ control problem and give some preliminary results. Section 3 aims to decouple the associated stochastic Hamiltonian system and derive some Riccati equations. In Section 4, we give explicit representations of optimal control and the associated optimal cost. Section 5 is devoted to solving two special scalar-valued control problems and giving some numerical simulations. Finally, we conclude this paper.

2 Preliminaries

Let $(\Omega,\mathcal{F},\mathbb{F},\mathbb{P})$ be a complete filtered probability space and let $T>0$ be a fixed time horizon. Let $\{(W_{1t},W_{2t}):0\leq t\leq T\}$ be a $\mathbb{R}^{2}$ -valued standard Wiener process, defined on $(\Omega,\mathcal{F},\mathbb{F},\mathbb{P})$ . $\mathbb{F}\equiv\{\mathcal{F}_{t}\}_{t\geq 0}$ is a natural filtration of $(W_{1},W_{2})$ augmented by all $\mathbb{P}$ -null sets. Let $\mathcal{F}_{t}^{\beta}=\sigma\{\beta_{s},\ 0\leq s\leq t\}$ be the filtration generated by a stochastic process $\beta$ . Let $\mathbb{R}^{n\times m}$ be the set of all $n\times m$ matrices and $\mathbb{S}^{n}$ be the set of all $n\times n$ symmetric matrices. For a matrix $M\in\mathbb{R}^{n}$ , let $M^{\top}$ be its transpose. The inner product $\langle\cdot,\cdot\rangle$ on $\mathbb{R}^{n\times m}$ is defined by $\langle M,N\rangle\mapsto tr(M^{\top}N)$ with an induced norm $|M|=\sqrt{tr(M^{\top}M)}$ . In particular, we denote by $\mathbb{S}_{+}^{n}$ ( $\widehat{\mathbb{S}}_{+}^{n}$ ) the set of all $n\times n$ (uniformly) positive definite matrices. For any Euclidean space $M$ , we adopt the following notations:
$\mathcal{L}_{\mathcal{F}_{T}}^{2}(\Omega;M)=\Big{\{}\zeta:\Omega\to M|\zeta$ is an $\mathcal{F}_{T}$ -measurable random variable, $\mathbb{E}[|\zeta|^{2}]<\infty$ };
$\mathcal{L}^{\infty}(0,T;M)=\Big{\{}v:[0,T]\to M|v$ is a bounded function};
$\mathcal{L}_{\mathcal{F}}^{2}(0,T;M)=\Big{\{}v:[0,T]\times\Omega\to M|v$ is an $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted stochastic process, $\mathbb{E}\left[\int_{0}^{T}|v_{t}|^{2}dt\right]<\infty\Big{\}};$
$\mathcal{S}_{\mathcal{F}}^{2}(0,T;M)=\Big{\{}v:[0,T]\times\Omega\to M|v$ is an $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted stochastic process and has continuous paths, $\mathbb{E}\left[\sup_{t\in[0,T]}|v_{t}|^{2}\right]<\infty\Big{\}}$ .

Consider a controlled linear BSDE

\left\{\begin{aligned} dY_{t}=&\big{(}A_{t}Y_{t}+B_{t}v_{t}+C_{1t}Z_{1t}+C_{2t}Z_{2t}\big{)}dt+Z_{1t}dW_{1t}+Z_{2t}dW_{2t},\quad t\in{[0,T]},\\ Y_{T}=&\ \zeta,\end{aligned}\right.

(2.1)

where $\zeta\in L_{\mathcal{F}_{T}}^{2}(\Omega;\mathbb{R}^{n})$ and $v$ , valued in $\mathbb{R}^{m}$ , is a control process. Introduce an admissible control set
$\mathcal{V}_{ad}[0,T]=\Big{\{}v:[0,T]\times\Omega\to\mathbb{R}^{m}|v\ is\ \{\mathcal{F}_{t}^{W_{1}}\}_{t\geq 0}-$ adapted, $\mathbb{E}\left[\int_{0}^{T}|v_{t}|^{2}dt\right]<\infty\Big{\}}.$
Any $v\in\mathcal{V}_{ad}[0,T]$ is called an admissible control.
Assumption $A1$ : The coefficients of dynamic system satisfy

A,C_{1},C_{2}\in\mathcal{L}^{\infty}(0,T;\mathbb{R}^{n\times n}),\ B\in\mathcal{L}^{\infty}(0,T;\mathbb{R}^{n\times m}).

Under Assumption $A1$ , dynamic system (2.1) admits a unique solution pair $(Y,Z_{1},Z_{2})\in\mathcal{S}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ , which is called the corresponding state process, for any $v\in\mathcal{V}_{ad}[0,T]$ (see Pardoux and Peng [2], Yong and Zhou [19]). We introduce a quadratic cost functional

\displaystyle J(v)=

\displaystyle\frac{1}{2}\mathbb{E}\Bigg{[}Y_{0}^{\top}GY_{0}+\int_{0}^{T}\Big{(}Y_{t}^{\top}H_{t}Y_{t}+v_{t}^{\top}R_{t}v_{t}+Z_{1t}^{\top}N_{1t}Z_{1t}+Z_{2t}^{\top}N_{2t}Z_{2t}\Big{)}dt\Bigg{]}.

(2.2)

Assumption $A2$ : The weighting matrices in cost functional satisfy

\displaystyle H,N_{1},N_{2}\in\mathcal{L}^{\infty}(0,T;\mathbb{S}_{+}^{n}),\ R\in\mathcal{L}^{\infty}(0,T;\widehat{\mathbb{S}}_{+}^{m}),G\in\mathbb{S}_{+}^{n}.

Our stochastic backward LQ control problem can be stated as follows.
Problem BLQ. Find a $v^{*}\in\mathcal{V}_{ad}[0,T]$ such that

J(v^{*})=\inf_{v\in\mathcal{V}_{ad}[0,T]}J(v).

(2.3)

Any $v^{*}\in\mathcal{V}_{ad}[0,T]$ satisfying (2.3) is called an optimal control, and the state process $(Y^{*},Z_{1}^{*},Z_{2}^{*})$ is called an optimal state process. Under Assumptions $A1$ and $A2$ , Problem BLQ is uniquely solvable for any terminal state $\zeta\in\mathcal{L}_{\mathcal{F}_{T}}^{2}(\Omega;\mathbb{R}^{n})$ (see Li et al. [9]). We suppressed the time argument in the sequel of this paper wherever necessary, for the sake of notation simplicity. The following theorem is a necessary condition of optimality, which is easy to be obtained from Theorem 3.1 in Huang et al. [15].

Theorem 2.1.

Under Assumptions $A1-A2$ , if $v^{*}$ is an optimal control of Problem BLQ and $(Y^{*},Z_{1}^{*},Z_{2}^{*})$ is the corresponding optimal state process, then

\left\{\begin{aligned} dX^{*}=&-\left(A^{\top}X^{*}+HY^{*}\right)dt-\left(C_{1}^{\top}X^{*}+N_{1}Z_{1}^{*}\right)dW_{1}-\left(C_{2}^{\top}X^{*}+N_{2}Z_{2}^{*}\right)dW_{2},\\ X_{0}^{*}=&-GY_{0}^{*}\end{aligned}\right.

(2.4)

admits a unique solution such that

\mathbb{E}\left[R_{t}v_{t}^{*}+B_{t}^{\top}X_{t}^{*}|\mathcal{F}^{W_{1}}\right]=0,\ t\in[0,T],\ a.s..

According to the above analysis, we end up with a stochastic Hamiltonian system

\left\{\begin{aligned} &dY=\left(AY+Bv+C_{1}Z_{1}+C_{2}Z_{2}\right)dt+Z_{1}dW_{1}+Z_{2}dW_{2},\\ &dX=-\left(A^{\top}X+HY\right)dt-\left(C_{1}^{\top}X+N_{1}Z_{1}\right)dW_{1}-\left(C_{2}^{\top}X+N_{2}Z_{2}\right)dW_{2},\\ &Y=\zeta,\ \ \ \ X_{0}=-GY_{0},\\ &\mathbb{E}[R_{t}v_{t}+B_{t}^{\top}X_{t}|\mathcal{F}_{t}^{W_{1}}]=0.\end{aligned}\right.

(2.5)

This is a coupled FBSDE with filtering. Note that the coupling comes from the last equation in (2.5), which is also called a stationarity condition. We point out that in our setting, the stationarity condition involves a conditional expectation, which makes the decoupling of this stochastic Hamiltonian system different and difficult. We now prove the sufficiency of the above result.

Theorem 2.2.

Let Assumption $A1-A2$ hold. If $(X^{*},Y^{*},Z_{1}^{*},Z_{2}^{*},v^{*})$ is an adapted solution to stochastic Hamiltonian system (2.5), then $v^{*}$ is an optimal control.

Proof.

For any $v\in\mathcal{V}_{ad}[0,T]$ , let $({Y},{Z_{1}},{Z_{2}})$ be the corresponding state process. Let $(\widetilde{Y},\widetilde{Z_{1}},\widetilde{Z_{2}})$ satisfies

\left\{\begin{aligned} d\widetilde{Y}=&\left[A\widetilde{Y}+B(v-v^{*})+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}}\right]dt+\widetilde{Z_{1}}dW_{1}+\widetilde{Z_{2}}dW_{2},\\ \widetilde{Y}_{T}=&\ 0.\end{aligned}\right.

According to the existence and uniqueness of solution to BSDE, we have $\widetilde{Y}=Y-Y^{*},\widetilde{Z_{1}}=Z_{1}-Z_{1}^{*},\widetilde{Z_{2}}=Z_{2}-Z_{2}^{*}$ . With the notation, we derive

\displaystyle J(v)-J(v^{*})=\mathbb{E}\bigg{[}Y_{0}^{*\top}G\widetilde{Y}_{0}+\int_{0}^{T}\Big{(}{Y^{*}}^{\top}H\widetilde{Y}+{v^{*}}^{\top}R(v-v^{*})+{Z_{1}^{*}}^{\top}N_{1}\widetilde{Z}_{1}+{Z_{2}^{*}}^{\top}N_{2}\widetilde{Z}_{2}\Big{)}dt\bigg{]}+\widetilde{J},

where

\displaystyle\widetilde{J}=

\displaystyle\frac{1}{2}\mathbb{E}\bigg{[}\widetilde{Y_{0}}^{\top}G\widetilde{Y}_{0}+\int_{0}^{T}\Big{(}\widetilde{Y}^{\top}H\widetilde{Y}+(v-v^{*})^{\top}R(v-v^{*})+\widetilde{Z_{1}}^{\top}N_{1}\widetilde{Z}_{1}+\widetilde{Z_{2}}^{\top}N_{2}\widetilde{Z}_{2}\Big{)}dt\bigg{]}.

It is easy to see that $\widetilde{J}\geq 0$ under Assumption $A2$ . Further,

	$\displaystyle\ \mathbb{E}\left[Y_{0}^{*\top}G\widetilde{Y}_{0}\right]=$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{0}^{T}\Big{(}\langle A\widetilde{Y}+B(v-v^{})+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}},X^{}\rangle-\langle\widetilde{Y},A^{\top}X^{}+HY^{}\rangle$
		$\displaystyle-\langle\widetilde{Z_{1}},C_{1}^{\top}X^{}+N_{1}Z_{1}^{}\rangle-\langle\widetilde{Z_{2}},C_{2}^{\top}X^{}+N_{2}Z_{2}^{}\rangle\Big{)}dt\bigg{]}$
	$\displaystyle=$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{0}^{T}\Big{(}\langle v-v^{},B^{\top}X^{}\rangle-\langle\widetilde{Y},HY^{}\rangle-\langle\widetilde{Z_{1}},N_{1}Z_{1}^{}\rangle-\langle\widetilde{Z_{2}},N_{2}Z_{2}^{*}\rangle\Big{)}dt\bigg{]}.$

Thus, we have

\displaystyle J(v)-J(v^{*})=\mathbb{E}\left[\int_{0}^{T}\langle v-v^{*},Rv^{*}+B^{\top}X^{*}\rangle dt\right]+\widetilde{J}\geq 0.

Then, $v^{*}$ is an optimal control. ∎

3 Decoupling stochastic Hamiltonian system (2.5)

In this section, we use the decoupling method for general FBSDE introduced in [4] to solve stochastic Hamiltonian system (2.5), which is an FBSDE with filtering. Different from the results in [8], we obtain three Riccati equations, an BSDE with filtering and an SDE with filtering. For simplicity of notation, we denote $\widehat{\beta}_{t}=\mathbb{E}[\beta_{t}|\mathcal{F}_{t}^{W_{1}}]$ . To be precise, we assume that

Y=\Upsilon\widehat{X}+\varphi,

(3.1)

where $\Upsilon$ is a differential and deterministic matrix-valued function with a terminal condition $\Upsilon_{T}=0$ , and $\varphi$ is a stochastic process satisfying the BSDE

\left\{\begin{aligned} d\varphi=&\ \lambda dt+\eta_{1}dW_{1}+\eta_{2}dW_{2},\\ \varphi_{T}=&\ \zeta,\end{aligned}\right.

for $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted processes $\lambda$ , $\eta_{1}$ and $\eta_{2}$ . According to Theorem 2.1 in Wang et al. [20] (see also Theorem 5.7 in Xiong [21] and Theorem 8.1 in Liptser and Shiryayev [22] ), we have

\left\{\begin{aligned} d\widehat{X}=&-\left(A^{\top}\widehat{X}+H\widehat{Y}\right)dt-\left(C_{1}^{\top}\widehat{X}+N_{1}\widehat{Z}_{1}\right)dW_{1},\\ \widehat{X}_{0}=&-G\widehat{Y}_{0}.\end{aligned}\right.

Applying Itô formula to (3.1), we get

	$\displaystyle 0=$	$\displaystyle\ dY-\dot{\Upsilon}\widehat{X}dt-\Upsilon d\widehat{X}-d\varphi$
	$\displaystyle=$	$\displaystyle\left(AY+Bv+C_{1}Z_{1}+C_{2}Z_{2}\right)dt+Z_{1}dW_{1}+Z_{2}dW_{2}-\dot{\Upsilon}\widehat{X}dt+\Upsilon\left(A^{\top}\widehat{X}+H\widehat{Y}\right)dt$
		$\displaystyle+\Upsilon(C_{1}^{\top}\widehat{X}+N_{1}\widehat{Z}_{1})dW_{1}-\lambda dt-\eta_{1}dW_{1}-\eta_{2}dW_{2}.$

This implies

\left\{\begin{aligned} &AY-BR^{-1}B^{\top}\widehat{X}+C_{1}Z_{1}+C_{2}Z_{2}-\dot{\Upsilon}\widehat{X}+\Upsilon\left(A^{\top}\widehat{X}+H\widehat{Y}\right)-\lambda=0,\\ &Z_{1}+\Upsilon(C_{1}^{\top}\widehat{X}+N_{1}\widehat{Z_{1}})-\eta_{1}=0,\\ &Z_{2}-\eta_{2}=0.\end{aligned}\right.

(3.2)

Assuming that $I+\Upsilon N_{1}$ is invertible, we have

\left\{\begin{aligned} &Z_{1}=\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{X}),\\ &Z_{2}=\eta_{2}.\end{aligned}\right.

(3.3)

Substituting (3.1) and (3.3) into the first equation in (3.2), we obtain

	$\displaystyle\ A(\Upsilon\widehat{X}+\varphi)-BR^{-1}B^{\top}\widehat{X}+C_{1}(\eta_{1}-\widehat{\eta}_{1})+C_{1}(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{X})$
	$\displaystyle+C_{2}\eta_{2}-\dot{\Upsilon}\widehat{X}+\Upsilon A^{\top}\widehat{X}+\Upsilon H(\Upsilon\widehat{X}+\widehat{\varphi})-\lambda=0.$

Then $\Upsilon$ satisfies a Riccati equation

\left\{\begin{aligned} &\dot{\Upsilon}-\Upsilon A^{\top}-A\Upsilon-\Upsilon H\Upsilon+BR^{-1}B^{\top}+C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon C_{1}^{\top}=0,\\ &\Upsilon_{T}=0,\end{aligned}\right.

(3.4)

and $\varphi$ satisfies a BSDE

\left\{\begin{aligned} d\varphi=&\Big{[}A\varphi+\Upsilon H\widehat{\varphi}+C_{1}(\eta_{1}-\widehat{\eta}_{1})+C_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}+C_{2}\eta_{2}\Big{]}dt\\ &+\eta_{1}dW_{1}+\eta_{2}dW_{2},\\ \varphi_{T}=&\ \zeta.\end{aligned}\right.

(3.5)

Riccati equation (3.4) admits a unique solution $\Upsilon\in\mathcal{L}^{\infty}(0,T;\mathbb{S}_{+}^{n})$ under Assumptions $A1-A2$ (see [9, 8]). Note that (3.5) is a BSDE with filtering, for which the solvability has not been given in literatures before. We will specified this problem in Section 4. In order to give the optimal control with a feedback representation, we conjecture that

X=-\Gamma_{1}(Y-\widehat{Y})-\Gamma_{2}\widehat{Y}-\psi,

(3.6)

where $\Gamma_{1}$ and $\Gamma_{2}$ are differential and deterministic matrix-valued functions with initial conditions $\Gamma_{10}=G$ and $\Gamma_{20}=G$ , respectively; $\psi$ is a stochastic process satisfying an SDE

\left\{\begin{aligned} d\psi=&\ \alpha_{0}dt+\alpha_{1}dW_{1}+\alpha_{2}dW_{2},\\ \psi_{0}=&\ 0,\end{aligned}\right.

where $\alpha_{0}$ , $\alpha_{1}$ and $\alpha_{2}$ are $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted processes. Note that

\left\{\begin{aligned} d\widehat{Y}=&\left(A\widehat{Y}-BR^{-1}B^{\top}\widehat{X}+C_{1}\widehat{Z}_{1}+C_{2}\widehat{Z}_{2}\right)dt+\widehat{Z}_{1}dW_{1},\\ \widehat{Y}_{T}=&\ \widehat{\zeta},\end{aligned}\right.

where $\widehat{\zeta}=\mathbb{E}[\zeta|\mathcal{F}_{T}^{W_{1}}]$ . Hence,

\left\{\begin{aligned} d(Y-\widehat{Y})=&\left[A(Y-\widehat{Y})+C_{1}(Z_{1}-\widehat{Z}_{1})+C_{2}(Z_{2}-\widehat{Z}_{2})\right]dt+(Z_{1}-\widehat{Z}_{1})dW_{1}+Z_{2}dW_{2},\\ Y_{T}-\widehat{Y}_{T}=&\ \zeta-\widehat{\zeta}.\end{aligned}\right.

Applying Itô formula to (3.6), we obtain

	$\displaystyle 0=$	$\displaystyle\ dX+\dot{\Gamma}_{1}(Y-\widehat{Y})dt+\Gamma_{1}d(Y-\widehat{Y})+\dot{\Gamma}_{2}\widehat{Y}dt+\Gamma_{2}d\widehat{Y}+d\psi$
	$\displaystyle=$	$\displaystyle-\left(A^{\top}X+HY\right)dt-\left(C_{1}^{\top}X+N_{1}Z_{1}\right)dW_{1}-\left(C_{2}^{\top}X+N_{2}Z_{2}\right)dW_{2}$
		$\displaystyle+\dot{\Gamma}_{1}(Y-\widehat{Y})dt+\Gamma_{1}\Big{[}A(Y-\widehat{Y})+C_{1}(Z_{1}-\widehat{Z}_{1})+C_{2}(Z_{2}-\widehat{Z}_{2})\Big{]}dt+\Gamma_{1}(Z_{1}-\widehat{Z}_{1})dW_{1}$
		$\displaystyle+\Gamma_{1}Z_{2}dW_{2}+\dot{\Gamma}_{2}\widehat{Y}dt+\Gamma_{2}\Big{(}A\widehat{Y}-BR^{-1}B^{\top}\widehat{X}+C_{1}\widehat{Z}_{1}+C_{2}\widehat{Z}_{2}\Big{)}dt+\Gamma_{2}\widehat{Z}_{1}dW_{1}$
		$\displaystyle+\alpha_{0}dt+\alpha_{1}dW_{1}+\alpha_{2}dW_{2}.$

It yields

\left\{\begin{aligned} &-\left(A^{\top}X+HY\right)+\dot{\Gamma}_{1}(Y-\widehat{Y})+\Gamma_{1}\Big{[}A(Y-\widehat{Y})+C_{1}(Z_{1}-\widehat{Z}_{1})+C_{2}(Z_{2}-\widehat{Z}_{2})\Big{]}\\ &+\dot{\Gamma}_{2}\widehat{Y}+\Gamma_{2}\Big{(}A\widehat{Y}-BR^{-1}B^{\top}\widehat{X}+C_{1}\widehat{Z}_{1}+C_{2}\widehat{Z}_{2}\Big{)}+\alpha_{0}=0,\\ &-(C_{1}^{\top}X+N_{1}Z_{1})+\Gamma_{1}(Z_{1}-\widehat{Z}_{1})+\Gamma_{2}\widehat{Z}_{1}+\alpha_{1}=0,\\ &-(C_{2}^{\top}X+N_{2}Z_{2})+\Gamma_{1}Z_{2}+\alpha_{2}=0.\end{aligned}\right.

Assuming that $I+\Gamma_{2}\Upsilon$ is invertible, we arrive at

\left\{\begin{aligned} \alpha_{1}=&\ (N_{1}-\Gamma_{1})(\eta_{1}-\widehat{\eta}_{1})+(N_{1}-\Gamma_{2})(I+\Upsilon N_{1})^{-1}\left[\widehat{\eta}_{1}+\Upsilon C_{1}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\right]\\ &-C_{1}^{\top}\Gamma_{1}(\varphi-\widehat{\varphi})-C_{1}^{\top}(\psi-\widehat{\psi})-C_{1}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi}),\\ \alpha_{2}=&\ (N_{2}-\Gamma_{1})\eta_{2}-C_{2}^{\top}\Gamma_{1}(\varphi-\widehat{\varphi})-C_{2}^{\top}(\psi-\widehat{\psi})-C_{2}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi}).\end{aligned}\right.

Further, it follows from (3.3) and (3.6) that

	$\displaystyle A^{\top}\Gamma_{1}(Y-\widehat{Y})+A^{\top}\Gamma_{2}\widehat{Y}+A^{\top}\psi-HY+\dot{\Gamma}_{1}(Y-\widehat{Y})+\Gamma_{1}A(Y-\widehat{Y})$
	$\displaystyle+\Gamma_{1}C_{1}(\eta_{1}-\widehat{\eta}_{1})+\Gamma_{1}C_{2}(\eta_{2}-\widehat{\eta}_{2})+\dot{\Gamma}_{2}\widehat{Y}+\Gamma_{2}A\widehat{Y}+\Gamma_{2}BR^{-1}B^{\top}(\Gamma_{2}\widehat{Y}+\widehat{\psi})$
	$\displaystyle+\Gamma_{2}C_{1}(I+\Upsilon N_{1})^{-1}\left[\widehat{\eta}_{1}+\Upsilon C_{1}^{\top}(\Gamma_{2}\widehat{Y}+\widehat{\psi})\right]+\Gamma_{2}C_{2}\widehat{\eta}_{2}+\alpha_{0}=0.$

Introduce

\left\{\begin{aligned} &\dot{\Gamma}_{1}+\Gamma_{1}A+A^{\top}\Gamma_{1}-H=0,\\ &\Gamma_{10}=G,\end{aligned}\right.

(3.7)

\left\{\begin{aligned} &\dot{\Gamma}_{2}+\Gamma_{2}A+A^{\top}\Gamma_{2}+\Gamma_{2}BR^{-1}B^{\top}\Gamma_{2}+\Gamma_{2}C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon C_{1}^{\top}\Gamma_{2}-H=0,\\ &\Gamma_{20}=G,\end{aligned}\right.

(3.8)

and

\left\{\begin{aligned} d\psi=&-\Big{[}A^{\top}\psi+\Gamma_{2}BR^{-1}B^{\top}\widehat{\psi}+\Gamma_{2}C_{1}(I+\Upsilon N_{1})^{-1}\left(\widehat{\eta}_{1}+\Upsilon C_{1}^{\top}\widehat{\psi}\right)\\ &+\Gamma_{2}C_{2}\widehat{\eta}_{2}+\Gamma_{1}C_{1}(\eta_{1}-\widehat{\eta}_{1})+\Gamma_{1}C_{2}(\eta_{2}-\widehat{\eta}_{2})\Big{]}dt\\ &+\Big{\{}(N_{1}-\Gamma_{1})(\eta_{1}-\widehat{\eta}_{1})-C_{1}^{\top}\Gamma_{1}(\varphi-\widehat{\varphi})-C_{1}^{\top}(\psi-\widehat{\psi})\\ &+(N_{1}-\Gamma_{2})(I+\Upsilon N_{1})^{-1}\left[\widehat{\eta}_{1}+\Upsilon C_{1}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\right]\\ &-C_{1}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\Big{\}}dW_{1}\\ &+\Big{\{}(N_{2}-\Gamma_{1})\eta_{2}-C_{2}^{\top}\left[\Gamma_{1}(\varphi-\widehat{\varphi})+(\psi-\widehat{\psi})\right]-C_{2}^{\top}(I+\Gamma_{2}\Upsilon)^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\Big{\}}dW_{2},\\ \psi_{0}=&\ 0.\end{aligned}\right.

(3.9)

There is a unique solution $\Gamma_{1}\in\mathcal{L}^{\infty}(0,T;\mathbb{S}_{+}^{n})$ to Riccati equation (3.7), since Assumptions $A1$ and $A2$ hold (see Yong and Zhou [19]). Corollary 4.6 in Lim and Zhou [8] implies that (3.8) admits a unique solution $\Gamma_{2}\in\mathcal{L}^{\infty}(0,T;\mathbb{S}_{+}^{n})$ . Once $\Upsilon$ , $\Gamma_{1}$ , $\Gamma_{2}$ and the solution $(\varphi,\eta_{1},\eta_{2})$ of (3.5) are known, the solvability of (3.9) will be obtained immediately.

4 Explicit representations of optimal control and optimal cost

Now we would like to give explicit formulas of optimal control and associated optimal cost in terms of Riccati equations (3.4), (3.7), (3.8), BSDE (3.5) and SDE (3.9). We first prove that (3.5) admits a unique solution. Consider a BSDE

\left\{\begin{aligned} dP=&\ g(t,P,Q_{1},Q_{2},\widehat{P},\widehat{Q_{1}},\widehat{Q_{2}})dt+Q_{1}dW_{1}+Q_{2}dW_{2},\\ P_{T}=&\ \zeta,\end{aligned}\right.

(4.1)

where $\widehat{P}_{t}=\mathbb{E}[P_{t}|\mathcal{F}_{t}^{W_{1}}],\widehat{Q}_{1t}=\mathbb{E}[Q_{1t}|\mathcal{F}_{t}^{W_{1}}],\widehat{Q}_{2t}=\mathbb{E}[Q_{2t}|\mathcal{F}_{t}^{W_{1}}]$ .
We assume that
Assumption $A3$ : There exists a constant $L$ , such that, $\mathbb{P}$ -a.s., for all $t\in[0,T]$ , $p,q_{1},q_{2},\bar{p},\bar{q_{1}},\bar{q_{2}}$ , $p^{\prime},q_{1}^{\prime},q_{2}^{\prime},\bar{p}^{\prime},\bar{q_{1}}^{\prime},\bar{q_{2}}^{\prime}\in\mathbb{R}^{n}$ ,

		$\displaystyle\left\|g(t,p,q_{1},q_{2},\bar{p},\bar{q_{1}},\bar{q_{2}})-g(t,p^{\prime},q_{1}^{\prime},q_{2}^{\prime},\bar{p}^{\prime},\bar{q_{1}}^{\prime},\bar{q_{2}}^{\prime})\right\|$
	$\displaystyle\leq$	$\displaystyle\ L\big{(}\|p-p^{\prime}\|+\|q_{1}-q_{1}^{\prime}\|+\|q_{2}-q_{2}^{\prime}\|+\|\bar{p}-\bar{p}^{\prime}\|+\|\bar{q_{1}}-\bar{q_{1}}^{\prime}\|+\|\bar{q_{2}}-\bar{q_{2}}^{\prime}\|\big{)}.$

Assumption $A4$ : $g(\cdot,0,0,0,0,0,0)\in\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ .

Lemma 4.1.

Let Assumptions $A3$ and $A4$ hold. For any $\zeta\in\mathcal{L}_{\mathcal{F}}^{2}(\Omega;\mathbb{R}^{n})$ , BSDE (4.1) admits a unique solution $(P,Q_{1},Q_{2})\in\mathcal{S}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ .

Proof.

We first introduce a norm on $\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ , which is equivalent to the canonical norm

||u||_{\delta}=\left(\mathbb{E}\left[\int_{0}^{T}|u_{t}|^{2}e^{\delta t}dt\right]\right)^{\frac{1}{2}},\ \delta>0.

The parameter $\delta$ will be specified later. For any $(p,q_{1},q_{2})\in L_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ , the following BSDE

\left\{\begin{aligned} dP=&\ g(t,P,Q_{1},Q_{2},\widehat{p},\widehat{q_{1}},\widehat{q_{2}})dt+Q_{1}dW_{1}+Q_{2}dW_{2},\\ P_{T}=&\ \zeta\end{aligned}\right.

admits a unique solution $(P,Q_{1},Q_{2})\in\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ . We then introduce a mapping $(P,Q_{1},Q_{2})=\mathbf{I}(p,q_{1},q_{2})$ : $\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})\to\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ by

\left\{\begin{aligned} dP=&\ g(t,P,Q_{1},Q_{2},\widehat{p},\widehat{q_{1}},\widehat{q_{2}})dt+Q_{1}dW_{1}+Q_{2}dW_{2},\\ P_{T}=&\ \zeta.\end{aligned}\right.

For any $(p,\ q_{1},\ q_{2})$ , $(p^{\prime},\ q_{1}^{\prime},\ q_{2}^{\prime})\in L_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ , we denote $(P,Q_{1},Q_{2})=\mathbf{I}(p,q_{1},q_{2})$ , $(P^{\prime},Q_{1}^{\prime},Q_{2}^{\prime})=\mathbf{I}(p^{\prime},q_{1}^{\prime},q_{2}^{\prime})$ , $(\widetilde{p},\ \widetilde{q_{1}},\ \widetilde{q_{2}})=(p-p^{\prime},q_{1}-q_{1}^{\prime},q_{2}-q_{2}^{\prime})$ and $(\widetilde{P},\ \widetilde{Q_{1}},\ \widetilde{Q_{2}})=(P-P^{\prime},Z_{1}-Z_{1}^{\prime},Q_{2}-Q_{2}^{\prime})$ . Applying Itô formula to $|\widetilde{P}_{t}|^{2}e^{\delta t}$ and taking conditional expectations, we get

		$\displaystyle\ \|\widetilde{P}_{t}\|^{2}+\mathbb{E}\left[\int_{t}^{T}\delta e^{\delta(s-t)}\|\widetilde{P_{s}}\|^{2}ds\Big{\|}\mathcal{F}_{t}\right]+\mathbb{E}\left[\int_{t}^{T}e^{\delta(s-t)}(\|\widetilde{Q_{1s}}\|^{2}+\|\widetilde{Q_{2s}}\|^{2})ds\Big{\|}\mathcal{F}_{t}\right]$
	$\displaystyle=$	$\displaystyle\ 2\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\langle\widetilde{P}_{s},g(s,P^{\prime},Q_{1}^{\prime},Q_{2}^{\prime},\widehat{p^{\prime}},\widehat{q_{1}^{\prime}},\widehat{q_{2}^{\prime}})-g(s,P,Q_{1},Q_{2},\widehat{p},\widehat{q_{1}},\widehat{q_{2}})\rangle ds\Big{\|}\mathcal{F}_{t}\bigg{]}$
	$\displaystyle\leq$	$\displaystyle\ 2L\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\|\widetilde{P}_{s}\|\Big{(}\|\widetilde{P}_{s}\|+\|\widetilde{Q}_{1s}\|+\|\widetilde{Q}_{2s}\|+\|\widehat{\widetilde{p}}_{s}\|+\|\widehat{\widetilde{q}}_{1s}\|+\|\widehat{\widetilde{q}}_{2s}\|\Big{)}ds\Big{\|}\mathcal{F}_{t}\bigg{]}$
	$\displaystyle\leq$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\Big{(}(2L+4L^{2}+\frac{\delta}{2})\|\widetilde{P}_{s}\|^{2}+\frac{1}{2}\|\widetilde{Q}_{1s}\|^{2}+\frac{1}{2}\|\widetilde{Q}_{2s}\|^{2}\Big{)}ds\Big{\|}\mathcal{F}\bigg{]}$
		$\displaystyle+\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\frac{6L^{2}}{\delta}\Big{(}\|\widehat{\widetilde{p}}_{s}\|^{2}+\|\widehat{\widetilde{q}}_{1s}\|^{2}+\|\widehat{\widetilde{q}}_{2s}\|^{2}\Big{)}ds\Big{\|}\mathcal{F}_{t}\bigg{]}.$

Thus we have

		$\displaystyle\left(\frac{\delta}{2}-2L-4L^{2}\right)\mathbb{E}\left[\int_{0}^{T}e^{\delta s}\|\widetilde{P}_{s}\|^{2}ds\right]+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}e^{\delta s}\Big{(}\|\widetilde{Q_{1}}_{s}\|^{2}+\|\widetilde{Q_{2}}_{s}\|^{2}\Big{)}ds\right]$
	$\displaystyle\leq$	$\displaystyle\ \frac{6L^{2}}{\delta}\mathbb{E}\left[\int_{0}^{T}e^{\delta s}\Big{(}\|\widehat{\widetilde{p}}_{s}\|^{2}+\|\widehat{\widetilde{q_{1}}}_{s}\|^{2}+\|\widehat{\widetilde{q_{2}}}_{s}\|^{2}\Big{)}ds\right].$

Taking $\delta=24L^{2}+4L+1$ , we arrive at

		$\displaystyle\mathbb{E}\left[\int_{0}^{T}e^{\delta s}\Big{(}\|\widetilde{P_{s}}\|^{2}+\|\widetilde{Q_{1s}}\|^{2}+\|\widetilde{Q_{2s}}\|^{2}\Big{)}ds\right]$
	$\displaystyle\leq$	$\displaystyle\ \frac{1}{2}\mathbb{E}\left[\int_{0}^{T}e^{\delta s}\Big{(}\|\widetilde{p_{s}}\|^{2}+\|\widetilde{q}_{1s}\|^{2}+\|\widetilde{q}_{2s}\|^{2}\Big{)}ds\right],$

which implies $||(\widetilde{P},\widetilde{Q_{1}},\widetilde{Q_{2}})||_{\delta}\leq\frac{1}{\sqrt{2}}||(\widetilde{p},\widetilde{q_{1}},\widetilde{q_{2}})||_{\delta}$ . That is, $\mathbf{I}$ is a contraction on $\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ , endowed with the norm $||\cdot||_{\delta}$ . According to the contraction mapping theorem, we know that there is a unique fixed point $(P,Q_{1},Q_{2})\in\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n+n+n})$ , such that $\mathbf{I}(P,Q_{1},Q_{2})=(P,Q_{1},Q_{2})$ , which is exactly the solution of (4.1). We now proceed to prove that $P\in\mathcal{S}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ . Using Jensen inequality, Hölder’s inequality and Burkholder-Davis-Gundy’s inequality yields

		$\displaystyle\ \mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}P_{t}\Big{\|}^{2}\right]$
	$\displaystyle\leq$	$\displaystyle\ 4\mathbb{E}\left[\|\zeta\|^{2}\right]+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}g(s,P,Q_{1},Q_{2},\widehat{Q},\widehat{Q_{1}},\widehat{Q_{2}})ds\Big{\|}^{2}\right]$
		$\displaystyle+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}Q_{1s}dW_{1s}\Big{\|}^{2}\right]+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}Q_{2s}dW_{2s}\Big{\|}^{2}\right]$
	$\displaystyle\leq$	$\displaystyle\ 4\mathbb{E}\left[\|\zeta\|^{2}\right]+4T\mathbb{E}\left[\sup_{t\in[0,T]}\left(\int_{t}^{T}\|g(s,P,Q_{1},Q_{2},\widehat{P},\widehat{Q_{1}},\widehat{Q_{2}})\|^{2}ds\right)\right]$
		$\displaystyle+16\mathbb{E}\left[\int_{0}^{T}\big{\|}Q_{1t}\big{\|}^{2}dt\right]+16\mathbb{E}\left[\int_{0}^{T}\big{\|}Q_{2t}\big{\|}^{2}dt\right]$
	$\displaystyle<$	$\displaystyle\ \infty.$

Therefore, we obtain $P\in\mathcal{S}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ . ∎

Remark 4.1.

Equation (3.5) is a linear BSDE with filtering, where the generator satisfies Assumptions $A3-A4$ . Then it follows that (3.5) admits a unique solution $(\varphi,\eta_{1},\eta_{2})\in\mathcal{S}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{L}_{\mathcal{F}}^{2}(0,T;\mathbb{R}^{n})$ .

We have the following theorem which specifies the solvability of stochastic Hamiltonian system (2.5) and gives some relations between the forward component and the backward components.

Theorem 4.1.

Under Assumptions $A1-A2$ , stochastic Hamiltonian system (2.5) admits a unique solution $(X,Y,Z_{1},Z_{2},v)$ . Moreover, we have the following relations

\left\{\begin{aligned} &Y=\Upsilon\widehat{X}+\varphi,\\ &Z_{1}=\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{X}),\\ &Z_{2}=\eta_{2},\\ &v=-R^{-1}B^{\top}\widehat{X},\\ &Y_{0}=(I+\Upsilon_{0}G)^{-1}\varphi_{0},\end{aligned}\right.

where $\Upsilon$ and $(\varphi,\eta_{1},\eta_{2})$ are solutions to (3.4) and (3.5), respectively.

Proof.

Consider the following SDE with filtering

\left\{\begin{aligned} d\bar{X}=&-\left[A^{\top}\bar{X}+H(\Upsilon\widehat{\bar{X}}+\varphi)\right]dt\\ &-\Big{[}C_{1}^{\top}\bar{X}+N_{1}(\eta_{1}-\widehat{\eta}_{1})+N_{1}(I+\Upsilon N_{1})^{-1}\big{(}\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{\bar{X}}\big{)}\Big{]}dW_{1}\\ &-\left(C_{2}^{\top}\bar{X}+N_{2}\eta_{2}\right)dW_{2},\\ \bar{X}_{0}=&-(I+G\Upsilon_{0})^{-1}G\varphi_{0},\end{aligned}\right.

(4.2)

where $\Upsilon$ and $(\varphi,\eta_{1},\eta_{2})$ are solutions of (3.4) and (3.5), respectively. According to Theorem 2.1 in Wang et al. [20], we get

\left\{\begin{aligned} d\widehat{\bar{X}}=&-\left(A^{\top}\widehat{\bar{X}}+H\Upsilon\widehat{\bar{X}}+H\widehat{\varphi}\right)dt\\ &-\Big{[}(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\widehat{\bar{X}}+N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}dW_{1},\\ \widehat{\bar{X}}_{0}=&-(I+G\Upsilon_{0})^{-1}G\varphi_{0}.\end{aligned}\right.

(4.3)

From the theory of linear SDE, (4.3) has a unique solution $\widehat{\bar{X}}$ . Then it follows that (4.2) also admits a unique solution $\bar{X}$ . We define

\bar{Y}=\Upsilon\widehat{\bar{X}}+\varphi.

By using Itô formula, $\bar{Y}$ satisfies

	$\displaystyle d\bar{Y}=$	$\displaystyle\Big{[}\Upsilon A^{\top}+A\Upsilon+\Upsilon H\Upsilon-BR^{-1}B^{\top}-C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon C_{1}^{\top}\Big{]}\widehat{\bar{X}}dt$
		$\displaystyle-\Upsilon\left(A^{\top}\widehat{\bar{X}}+H\Upsilon\widehat{\bar{X}}+H\widehat{\varphi}\right)dt-\Upsilon\Big{[}(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\widehat{\bar{X}}+N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}dW_{1}$
		$\displaystyle+\Big{[}A\varphi+\Upsilon H\widehat{\varphi}+C_{1}(\eta_{1}-\widehat{\eta}_{1})+C_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}+C_{2}\eta_{2}\Big{]}dt+\eta_{1}dW_{1}+\eta_{2}dW_{2}$
	$\displaystyle=$	$\displaystyle\Big{[}A\bar{Y}-BR^{-1}B^{\top}\widehat{\bar{X}}+C_{1}(\eta_{1}-\widehat{\eta}_{1})+C_{1}(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{\bar{X}})+C_{2}\eta_{2}\Big{]}dt$
		$\displaystyle+\left[\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{\bar{X}})\right]dW_{1}+\eta_{2}dW_{2},$

with an initial condition $Y_{0}=(I+\Upsilon_{0}G)^{-1}\varphi_{0}$ . Defining

\left\{\begin{aligned} &\bar{Z}_{1}=\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{\bar{X}}),\\ &\bar{Z}_{2}=\eta_{2},\\ &\bar{v}=-R^{-1}B^{\top}\widehat{\bar{X}}.\end{aligned}\right.

It is obvious that $(\bar{X},\bar{Y},\bar{Z}_{1},\bar{Z}_{2},\bar{v})$ is a solution to stochastic Hamiltonian system (2.5).

We now turn to prove the uniqueness. Suppose that equation (2.5) admits two solutions $(X,Y,Z_{1},Z_{2},v)$ and $(X^{\prime},Y^{\prime},Z_{1}^{\prime},Z_{2}^{\prime},v^{\prime})$ , respectively. Let $(\widetilde{X},\widetilde{Y},\widetilde{Z_{1}},\widetilde{Z_{2}},\widetilde{v})=(X-X^{\prime},Y-Y^{\prime},Z_{1}-Z_{1}^{\prime},Z_{2}-Z_{2}^{\prime},v-v^{\prime})$ . Thus $(\widetilde{X},\widetilde{Y},\widetilde{Z_{1}},\widetilde{Z_{2}},\widetilde{v})$ satisfies

\left\{\begin{aligned} &d\widetilde{Y}=\left(A\widetilde{Y}+B\widetilde{v}+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}}\right)dt+\widetilde{Z_{1}}dW_{1}+\widetilde{Z_{2}}dW_{2},\\ &d\widetilde{X}=-\left(A^{\top}\widetilde{X}+H\widetilde{Y}\right)dt-\left(C_{1}^{\top}\widetilde{X}+N_{1}\widetilde{Z_{1}}\right)dW_{1}-\left(C_{2}^{\top}\widetilde{X}+N_{2}\widetilde{Z_{2}}\right)dW_{2},\\ &\widetilde{Y}_{T}=0,\ \ \ \ \widetilde{X}_{0}=-G\widetilde{Y}_{0},\\ &\mathbb{E}[R_{t}\widetilde{v}_{t}+B_{t}^{\top}\widetilde{X}_{t}|\mathcal{F}_{t}^{W_{1}}]=0.\end{aligned}\right.

Applying Itô formula to $\widetilde{Y}^{\top}\widetilde{X}$ , we obtain

		$\displaystyle\mathbb{E}\left[\widetilde{Y}_{0}^{\top}G\widetilde{Y}_{0}\right]$
	$\displaystyle=$	$\displaystyle-\mathbb{E}\left[\int_{0}^{T}\widetilde{Y}^{\top}\left(A^{\top}\widetilde{X}+H\widetilde{Y}\right)dt\right]+\mathbb{E}\left[\int_{0}^{T}\left(A\widetilde{Y}-BR^{-1}B^{\top}\widehat{\widetilde{X}}+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}}\right)^{\top}\widetilde{X}dt\right]$
		$\displaystyle-\mathbb{E}\left[\int_{0}^{T}\widetilde{Z_{1}}^{\top}\left(C_{1}^{\top}\widetilde{X}+N_{1}\widetilde{Z_{1}}\right)dt\right]-\mathbb{E}\left[\int_{0}^{T}\widetilde{Z_{2}}^{\top}\left(C_{2}^{\top}\widetilde{X}+N_{2}\widetilde{Z_{2}}\right)dt\right]$
	$\displaystyle=$	$\displaystyle-\mathbb{E}\Big{[}\int_{0}^{T}\big{(}\widetilde{Y}^{\top}H\widetilde{Y}+\widehat{\widetilde{X}}^{\top}BR^{-1}B^{\top}\widetilde{X}+\widetilde{Z_{1}}^{\top}N_{1}\widetilde{Z_{1}}+\widetilde{Z_{2}}^{\top}N_{2}\widetilde{Z_{2}}\big{)}dt\Big{]}.$

We adopt the same procedure as in the proof of Theorem 2.2. Since $G,H,R,N_{1},N_{2}$ satisfy Assumption $A2$ , it follows that

\displaystyle\mathbb{E}\left[\int_{0}^{T}\widehat{\widetilde{X}}^{\top}BR^{-1}B^{\top}\widetilde{X}dt\right]=0.

Recalling $R$ is uniformly positive, it yields

B_{t}^{\top}\widehat{\widetilde{X}_{t}}=0,\ \ \ a.e.\ t\in[0,T],\ \ \mathbb{P}-a.s..

With the equality, $(\widetilde{Y},\widetilde{Z_{1}},\widetilde{Z_{2}})$ satisfies

\left\{\begin{aligned} &d\widetilde{Y}=\left(A\widetilde{Y}+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}}\right)dt+\widetilde{Z_{1}}dW_{1}+\widetilde{Z_{2}}dW_{2},\\ &\widetilde{Y}_{T}=0.\end{aligned}\right.

(4.4)

It is easy to see that (4.4) admits a unique solution $(\widetilde{Y},\widetilde{Z_{1}},\widetilde{Z_{2}})\equiv 0$ . Then

\left\{\begin{aligned} &d\widetilde{X}=-A^{\top}\widetilde{X}dt-C_{1}^{\top}\widetilde{X}dW_{1}-C_{2}^{\top}\widetilde{X}dW_{2},\\ &\widetilde{X}_{0}=-G\widetilde{Y}_{0}.\end{aligned}\right.

Hence it follows from the uniqueness of solution that $\widetilde{X}\equiv 0$ . The proof is completed. ∎

To summarize the above analysis, we establish the following main result.

Theorem 4.2.

Let Assumptions $A1-A2$ hold and let $\zeta\in\mathcal{L}_{\mathcal{F}}^{2}(\Omega;\mathbb{R}^{n})$ be given. Let $\Upsilon$ , $\Gamma_{1}$ , $\Gamma_{2}$ be the solutions of Riccati equations (3.4), (3.7) and (3.8), respectively. Let $(\varphi,\eta_{1},\eta_{2})$ and $\psi$ be the solutions of (3.5) and (3.9), respectively. Then the BSDE with filtering

\left\{\begin{aligned} dY=&\ \Big{(}AY+BR^{-1}B^{\top}\Gamma_{2}\widehat{Y}+BR^{-1}B^{\top}\widehat{\psi}+C_{1}Z_{1}+C_{2}Z_{2}\Big{)}dt+Z_{1}dW_{1}+Z_{2}dW_{2},\\ Y_{T}=&\ \zeta\end{aligned}\right.

admits a unique solution $(Y,Z_{1},Z_{2})$ . By defining

\left\{\begin{aligned} X=&-\Gamma_{1}(Y-\widehat{Y})-\Gamma_{2}\widehat{Y}-\psi,\\ v=&\ R^{-1}B^{\top}\Gamma_{2}\widehat{Y}+R^{-1}B^{\top}\widehat{\psi},\end{aligned}\right.

the 5-tuple $(X,Y,Z_{1},Z_{2},v)$ is an adapted solution to FBSDE (2.5) and $v$ is an optimal control of Problem BLQ. The corresponding optimal cost is

$\displaystyle J(v)=$	$\displaystyle\frac{1}{2}\mathbb{E}\left[\langle\widehat{\zeta},\Sigma_{T}\widehat{\zeta}\rangle\right]+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle H\varphi,\varphi\rangle-\langle\widehat{\varphi},H\widehat{\varphi}\rangle\Big{)}dt\right]$	(4.5)
	$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\big{\langle}\big{[}N_{1}(I+\Upsilon N_{1})^{-1}-\Sigma\big{]}\widehat{\eta}_{1},\widehat{\eta}_{1}\big{\rangle}\right]$
	$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle N_{1}(\eta_{1}-\widehat{\eta}_{1}),\eta_{1}-\widehat{\eta}_{1}\rangle+\langle N_{2}\eta_{2},\eta_{2}\rangle\Big{)}dt\right]$
	$\displaystyle-\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\widehat{\varphi},\Sigma\Big{(}C_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}}+C_{2}\widehat{\eta}_{2}\Big{)}\Big{\rangle}dt\right],$

where $\Sigma$ is the solution of

\left\{\begin{aligned} &\dot{\Sigma}+\Sigma(A+\Upsilon H)+(A+\Upsilon H)^{\top}\Sigma-H=0,\\ &\Sigma_{0}=G(I+\Upsilon_{0}G)^{-1}.\end{aligned}\right.

Proof.

We need only to prove (4.5). Substituting (3.1), (3.3) into the cost functional, we derive

	$\displaystyle J(v)=$	$\displaystyle\ \frac{1}{2}\big{\langle}G(I+\Upsilon_{0}G)^{-1}\varphi_{0},(I+\Upsilon_{0}G)^{-1}\varphi_{0}\big{\rangle}+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\Upsilon\widehat{X}+\varphi,H(\Upsilon\widehat{X}+\varphi)\Big{\rangle}dt\right]$
		$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{\langle}R^{-1}B^{\top}\widehat{X},B^{\top}\widehat{X}\Big{\rangle}dt\right]+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\langle\eta_{2},N_{2}\eta_{2}\rangle dt\right]$
		$\displaystyle+\frac{1}{2}\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{X}),N_{1}\Big{[}\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}^{\top}\widehat{X})\Big{]}\Big{\rangle}dt\bigg{]}$
	$\displaystyle=$	$\displaystyle\ \frac{1}{2}\big{\langle}G(I+\Upsilon_{0}G)^{-1}\varphi_{0},(I+\Upsilon_{0}G)^{-1}\varphi_{0}\big{\rangle}$
		$\displaystyle+\frac{1}{2}\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\widehat{X},\Big{[}\Upsilon H\Upsilon+BR^{-1}B^{\top}+C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon N_{1}\Upsilon(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\Big{]}\widehat{X}\Big{\rangle}dt\bigg{]}$
		$\displaystyle+\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\Upsilon H\varphi,\widehat{X}\Big{\rangle}dt\right]-\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1},N_{1}(I+\Upsilon N_{1})^{-1}\Upsilon C_{1}^{\top}\widehat{X}\Big{\rangle}dt\bigg{]}$
		$\displaystyle+\frac{1}{2}\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1},N_{1}\Big{[}\eta_{1}-\widehat{\eta}_{1}+(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}\Big{\rangle}dt\bigg{]}$
		$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle\eta_{2},N_{2}\eta_{2}\rangle+\langle\varphi,H\varphi\rangle\Big{)}dt\right].$

Applying Itô formula to $\langle\widehat{X},\Upsilon\widehat{X}\rangle$ , we have

		$\displaystyle\big{\langle}(I+G\Upsilon_{0})^{-1}G\varphi_{0},\Upsilon_{0}(I+G\Upsilon_{0})^{-1}G\varphi_{0}\big{\rangle}$
	$\displaystyle=$	$\displaystyle-\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\widehat{X},\Upsilon\big{[}(A+\Upsilon H)^{\top}\widehat{X}+Q\widehat{\varphi}\big{]}\Big{\rangle}dt\right]-\mathbb{E}\left[\int_{0}^{T}\Big{\langle}(A+\Upsilon H)^{\top}\widehat{X}+H\widehat{\varphi},\Upsilon\widehat{X}\Big{\rangle}dt\right]$
		$\displaystyle+\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\widehat{X},\Big{(}\Upsilon A^{\top}+A\Upsilon+\Upsilon H\Upsilon-BR^{-1}B^{\top}-C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon C_{1}^{\top}\Big{)}\widehat{X}\Big{\rangle}dt\bigg{]}$
		$\displaystyle+\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\widehat{X}+N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1},\Upsilon\Big{[}(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\widehat{X}+N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}\Big{\rangle}dt\bigg{]}$
	$\displaystyle=$	$\displaystyle-\mathbb{E}\bigg{[}\int_{0}^{T}\Big{\langle}\widehat{X},\Big{[}\Upsilon H\Upsilon+BR^{-1}B^{\top}+C_{1}(I+\Upsilon N_{1})^{-1}\Upsilon N_{1}\Upsilon(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\Big{]}\widehat{X}\Big{\rangle}dt\bigg{]}$
		$\displaystyle+\mathbb{E}\left[\int_{0}^{T}\Big{\langle}N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}},\Upsilon N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}}\Big{\rangle}dt\right]$
		$\displaystyle+2\mathbb{E}\left[\int_{0}^{T}\Big{\langle}(I+N_{1}\Upsilon)^{-1}C_{1}^{\top}\widehat{X},\Upsilon N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}}\Big{\rangle}dt\right]-2\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\Upsilon H\widehat{\varphi},\widehat{X}\Big{\rangle}dt\right].$

With the equality, we derive

	$\displaystyle J(v)=$	$\displaystyle\ \frac{1}{2}\langle(I+G\Upsilon_{0})^{-1}G\varphi_{0},\varphi_{0}\rangle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\langle H\varphi,\varphi\rangle dt\right]$
		$\displaystyle+\frac{1}{2}\mathbb{E}\bigg{[}\int_{0}^{T}\Big{(}\langle N_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1},\widehat{\eta}_{1}\rangle+\langle N_{1}(\eta_{1}-\widehat{\eta}_{1}),\eta_{1}-\widehat{\eta}_{1}\rangle+\langle N_{2}\eta_{2},\eta_{2}\rangle\Big{)}dt\bigg{]}.$

Recalling that $\varphi$ satisfies (3.5) and applying Itô formula to $\langle\widehat{\varphi},\Sigma\widehat{\varphi}\rangle$ , we have

	$\displaystyle\langle(I+G\Upsilon_{0})^{-1}G\varphi_{0},\varphi_{0}\rangle=$	$\displaystyle\ \mathbb{E}[\langle\widehat{\zeta},\Sigma_{T}\widehat{\zeta}\rangle]-\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle\widehat{\varphi},H\widehat{\varphi}\rangle+\langle\widehat{\eta_{1}},\Sigma\widehat{\eta_{1}}\rangle\Big{)}dt\right]$
		$\displaystyle-2\mathbb{E}\left[\int_{0}^{T}\Big{\langle}\widehat{\varphi},\Sigma\Big{[}C_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}}+C_{2}\widehat{\eta}_{2}\Big{]}\Big{\rangle}dt\right].$

We obtain

	$\displaystyle J(v)=$	$\displaystyle\frac{1}{2}\mathbb{E}\left[\langle\widehat{\zeta},\Sigma_{T}\widehat{\zeta}\rangle\right]+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle H\varphi,\varphi\rangle-\langle\widehat{\varphi},H\widehat{\varphi}\rangle\Big{)}dt\right]$
		$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\big{\langle}\big{[}N_{1}(I+\Upsilon N_{1})^{-1}-\Sigma\big{]}\widehat{\eta}_{1},\widehat{\eta}_{1}\big{\rangle}\right]$
		$\displaystyle+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\Big{(}\langle N_{1}(\eta_{1}-\widehat{\eta}_{1}),\eta_{1}-\widehat{\eta}_{1}\rangle+\langle N_{2}\eta_{2},\eta_{2}\rangle\Big{)}dt\right]$
		$\displaystyle-\mathbb{E}\left[\int_{0}^{T}\big{\langle}\widehat{\varphi},\Sigma\big{[}C_{1}(I+\Upsilon N_{1})^{-1}\widehat{\eta_{1}}+C_{2}\widehat{\eta}_{2}\big{]}\big{\rangle}dt\right].$

Then our claims follow. ∎

Remark 4.2.

When we consider the complete information case, i. e., $W_{2}$ disappears in (2.1). Let $\zeta$ be an $\mathcal{F}_{T}^{W_{1}}$ -measurable square integrable random variable. Let $v$ be an $\{\mathcal{F}_{t}^{W_{1}}\}_{t\geq 0}-$ adapted and square integrable stochastic process. Then Theorem 4.2 coincides with Theorem 3.2 in Lim and Zhou [8].

Remark 4.3.

In Huang et al. [23], an optimal control for Problem BLQ with feedback representation is given. We point out that their results rely on the condition that the solution of (3.5) satisfies $\eta_{2}=0$ .

5 One-dimensional case

In this section, we consider two scalar-valued backward LQ problems with partial information and give more detailed analyses. In the case of $H=N_{1}=0$ , we work out an explicit control problem and show the detailed procedure to obtain the feedback representation of optimal control using our theoretical results. In the case of $C_{2}=0$ , we give some numerical simulations to illustrate our theoretical results, since we can not obtain explicit solutions of related stochastic Hamiltonian system and Riccati equation.

5.1 Special case: $H=N_{1}=0$

Under Assumptions $A1$ and $A2$ , let all the coefficients of (2.1) and (2.2) are constants, and

\zeta=e^{(a-\frac{1}{2}b^{2}-\frac{1}{2}c^{2})T+bW_{1}+cW_{2}}.

In this case, (2.1) is given by

\left\{\begin{aligned} dY_{t}=&\left(AY_{t}+Bv_{t}+C_{1}Z_{1t}+C_{2}Z_{2t}\right)dt+Z_{1t}dW_{1t}+Z_{2t}dW_{2t},\qquad t\in{[0,T]},\\ Y_{T}=&\ \zeta.\end{aligned}\right.

The cost functional takes the form of

\displaystyle J(v)=

\displaystyle\ \frac{1}{2}\mathbb{E}\left[GY_{0}^{2}+\int_{0}^{T}(Rv^{2}+N_{2}Z_{2}^{2})dt\right].

Then Problem BLQ is stated as follows.
Problem BLQA. Find a $v^{*}\in\mathcal{V}[0,T]$ such that

J(v^{*})=\inf_{v\in\mathcal{V}[0,T]}J(v),

where the admissible control set is given by
$\mathcal{V}[0,T]=\Big{\{}v:[0,T]\times\Omega\to\mathbb{R}|v\ is\ \{\mathcal{F}_{t}^{W_{1}}\}_{t\geq 0}$ -adapted, $\mathbb{E}\left[\int_{0}^{T}v_{t}^{2}dt\right]<\infty\Big{\}}.$

The corresponding stochastic Hamiltonian system reads

\left\{\begin{aligned} &dY_{t}=\left(AY_{t}+Bv_{t}+C_{1}Z_{1t}+C_{2}Z_{2t}\right)dt+Z_{1t}dW_{1t}+Z_{2t}dW_{2t},\\ &dX_{t}=-AX_{t}dt-C_{1}X_{t}dW_{1t}-(C_{2}X_{t}+N_{2}Z_{2t})dW_{2t},\\ &Y_{T}=\zeta,\ \ \ \ X_{0}=-GY_{0},\\ &\mathbb{E}[Rv_{t}+BX_{t}|\mathcal{F}_{t}^{W_{1}}]=0.\end{aligned}\right.

(5.1)

We introduce

\left\{\begin{aligned} &\dot{\Upsilon}_{t}-(2A-C_{1}^{2})\Upsilon_{t}+\frac{B^{2}}{R}=0,\\ &\Upsilon_{T}=0,\end{aligned}\right.

and

\left\{\begin{aligned} d\varphi_{t}=&\big{(}A\varphi_{t}+C_{1}\eta_{1t}+C_{2}\eta_{2t}\big{)}dt+\eta_{1t}dW_{1t}+\eta_{2t}dW_{2t},\\ \varphi_{T}=&\ \zeta.\end{aligned}\right.

It is easy to see that

\Upsilon_{t}=\left\{\begin{array}[]{ll}\frac{B^{2}}{R(2A-C_{1}^{2})}\Big{(}1-e^{(2A-C_{1}^{2})(t-T)}\Big{)},&2A-C_{1}^{2}\neq 0,\\ \frac{B^{2}(T-t)}{R},&2A-C_{1}^{2}=0,\end{array}\right.

and

\left\{\begin{aligned} \varphi_{t}=&\exp\Big{[}(a-bC_{1}-cC_{2}-A)T\\ &+(A+bC_{1}+cC_{2}-\frac{1}{2}b^{2}-\frac{1}{2}c^{2})t+bW_{1t}+cW_{2t}\Big{]},\\ \eta_{1t}=&\ b\varphi_{t},\\ \eta_{2t}=&\ c\varphi_{t}.\end{aligned}\right.

(5.2)

Taking $t=0$ in (5.2), we have

\varphi_{0}=\exp\Big{[}(a-bC_{1}-cC_{2}-A)T\Big{]}.

Then it follows from (5.1) and (5.2) that

\left\{\begin{aligned} dX_{t}=&-AX_{t}dt-C_{1}X_{t}dW_{1t}-(C_{2}X_{t}+N_{2}\eta_{2t})dW_{2t},\\ X_{0}=&-(I+G\Upsilon_{0})^{-1}G\varphi_{0},\end{aligned}\right.

which admits a unique solution

\displaystyle X_{t}=

\displaystyle\Psi_{t}\Big{[}X_{0}-\int_{0}^{t}\Psi_{s}^{-1}C_{2}N_{2}\eta_{2s}ds-\int_{0}^{t}\Psi_{s}^{-1}N_{2}\eta_{2s}dW_{2s}\Big{]},

(5.3)

with

\Psi_{t}=\exp\left[-\big{(}A+\frac{1}{2}C_{1}^{2}+\frac{1}{2}C_{2}^{2}\big{)}t-C_{1}W_{1t}-C_{2}W_{2t}\right].

Further,

\displaystyle v_{t}=-R^{-1}B\widehat{X}_{t}=-R^{-1}BX_{0}\exp\Big{[}-\big{(}A+\frac{1}{2}C_{1}^{2}\big{)}t-C_{1}W_{1t}\Big{]}.

(5.4)

Theorem 2.2 implies that $v$ given by (5.4) is an optimal control of Problem BLQA.

In the following, we aim to derive a feedback representation of $v$ . For this end, we introduce

\left\{\begin{aligned} &\dot{\Gamma}_{1t}+2A\Gamma_{1t}=0,\\ &\Gamma_{10}=G,\end{aligned}\right.

(5.5)

\left\{\begin{aligned} &\dot{\Gamma}_{2t}+2A\Gamma_{2t}+\left(\frac{B^{2}}{R}+C_{1}^{2}\Upsilon_{t}\right)\Gamma_{2t}^{2}=0,\\ &\Gamma_{20}=G,\end{aligned}\right.

(5.6)

and

\left\{\begin{aligned} d\psi_{t}=&-\Big{[}A\psi_{t}+\Big{(}\frac{B^{2}\Gamma_{2t}}{R}+C_{1}^{2}\Gamma_{2t}\Upsilon_{t}\Big{)}\widehat{\psi}_{t}+C_{1}\Gamma_{2t}\widehat{\eta}_{1t}\\ &+C_{1}\Gamma_{1t}(\eta_{1t}-\widehat{\eta}_{1t})+C_{2}\Gamma_{1t}(\eta_{2t}-\widehat{\eta}_{2t})+C_{2}\Gamma_{2t}\widehat{\eta}_{2t}\Big{]}dt\\ &-\Big{[}\Gamma_{1t}(\eta_{1t}-\widehat{\eta}_{1t})+\Gamma_{2t}\widehat{\eta}_{1t}+C_{1}(\Gamma_{2t}\widehat{\varphi}_{t}+\widehat{\psi}_{t})\\ &+C_{1}\Gamma_{1t}(\varphi_{t}-\widehat{\varphi}_{t})+C_{1}(\psi_{t}-\widehat{\psi}_{t})\Big{]}dW_{1t}\\ &-\Big{[}(\Gamma_{1t}-N_{2})\eta_{2t}+C_{2}\Gamma_{1t}(\varphi_{t}-\widehat{\varphi}_{t})\\ &+C_{2}(1+\Gamma_{2t}\Upsilon_{t})^{-1}(\Gamma_{2t}\widehat{\varphi}_{t}+\widehat{\psi}_{t})+C_{2}(\psi_{t}-\widehat{\psi}_{t})\Big{]}dW_{2t},\\ \psi_{0}=&\ 0.\end{aligned}\right.

Solving (5.5) and (5.6), we get

\Gamma_{1t}=Ge^{-2At},

and

\Gamma_{2t}=\frac{G\exp(-2At)}{1+G\int_{0}^{t}\exp(-2As)(\frac{B^{2}}{R}+C_{1}^{2}\Upsilon_{s})ds},

respectively.
According to Theorem 2.1 in Wang et al. [20], we have

\left\{\begin{aligned} d\widehat{\psi}_{t}=&-\Big{(}\mathcal{A}_{t}\widehat{\psi}_{t}+\mathcal{B}_{t}\Big{)}dt-\Big{(}C_{1}\widehat{\psi}_{t}+\mathcal{D}_{t}\Big{)}dW_{1t},\\ \widehat{\psi}_{0}=&\ 0,\end{aligned}\right.

where

\left\{\begin{aligned} \mathcal{A}_{t}=&A+\frac{B^{2}\Gamma_{2t}}{R}+C_{1}^{2}\Gamma_{2t}\Upsilon_{t},\\ \mathcal{B}_{t}=&C_{1}\Gamma_{2t}\widehat{\eta}_{1t}+C_{2}\Gamma_{2t}\widehat{\eta}_{2t},\\ \mathcal{D}_{t}=&\Gamma_{2t}\widehat{\eta}_{1t}+C_{1}\Gamma_{2t}\widehat{\varphi}_{t}.\end{aligned}\right.

Similarly, we derive

\displaystyle\widehat{\psi}_{t}=

\displaystyle\Phi_{t}\bigg{[}-\int_{0}^{t}\Phi^{-1}_{s}\Big{(}\mathcal{B}_{s}+C_{1}\mathcal{D}_{s}\Big{)}ds-\int_{0}^{t}\Phi^{-1}_{s}\mathcal{D}_{s}dW_{1s}\bigg{]},

where

\Phi_{t}=\exp\left[-\int_{0}^{t}\big{(}\mathcal{A}_{s}+\frac{1}{2}C_{1}^{2}\big{)}ds-C_{1}W_{1t}\right].

Then Theorem 4.2 implies that (5.4) admits a feedback representation below

v_{t}=R^{-1}B\Gamma_{2t}\widehat{Y}_{t}+R^{-1}B\widehat{\psi}_{t},

where $Y$ satisfies

\left\{\begin{aligned} dY_{t}=&\ \Big{(}AY_{t}+B^{2}R^{-1}\Gamma_{2t}\widehat{Y}_{t}+B^{2}R^{-1}\widehat{\psi}_{t}+C_{1}Z_{1t}+C_{2}Z_{2t}\Big{)}dt+Z_{1t}dW_{1t}+Z_{2t}dW_{2t},\\ Y_{T}=&\ \zeta.\end{aligned}\right.

(5.7)

The corresponding optimal cost is

\displaystyle J(v)=

\displaystyle\frac{G\varphi_{0}^{2}}{2+2G\Upsilon_{0}}+\frac{1}{2}\mathbb{E}\left[\int_{0}^{T}\langle N_{2}\eta_{2t},\eta_{2t}\rangle dt\right].

Remark 5.1.

It follows from Theorem 4.1 that the solution of (5.7) is given by

\left\{\begin{aligned} &Y_{t}=\Upsilon_{t}\widehat{X}_{t}+\varphi_{t},\\ &Z_{1t}=\eta_{1t}-C_{1}\Upsilon_{t}\widehat{X}_{t},\\ &Z_{2t}=\eta_{2t}.\end{aligned}\right.

Here, $X$ and $(\varphi,\eta_{1},\eta_{2})$ are given by (5.3) and (5.2), respectively. Note that equation (5.7) is a BSDE with filtering, which is difficult to obtain the explicit solution in general.

5.2 Special case: $C_{2}=0$

In this case, (2.1) is written as

\left\{\begin{aligned} dY=&\left(AY+Bv+C_{1}Z_{1}\right)dt+Z_{1}dW_{1}+Z_{2}dW_{2},t\in{[0,T]},\\ Y_{T}=&\ \zeta.\end{aligned}\right.

Cost functional (2.2) takes the form of

\displaystyle J(v)=

\displaystyle\frac{1}{2}\mathbb{E}\Bigg{[}GY_{0}^{2}+\int_{0}^{T}\Big{(}HY^{2}+Rv^{2}+N_{1}Z_{1}^{2}+N_{2}Z_{2}^{2}\Big{)}dt\Bigg{]}.

Then Problem BLQ is formulated as follows.
Problem BLQB. Find a $v^{*}\in\mathcal{V}[0,T]$ such that

J(v^{*})=\inf_{v\in\mathcal{V}[0,T]}J(v).

The corresponding stochastic Hamiltonian system reads

\left\{\begin{aligned} &dY=\left(AY+Bv+C_{1}Z_{1}\right)dt+Z_{1}dW_{1}+Z_{2}dW_{2},\\ &dX=-\left(AX+HY\right)dt-\left(C_{1}X+N_{1}Z_{1}\right)dW_{1}-N_{2}Z_{2}dW_{2},\\ &Y=\zeta,\ \ \ \ X_{0}=-GY_{0},\\ &\mathbb{E}[R_{t}v_{t}+B_{t}X_{t}|\mathcal{F}_{t}^{W_{1}}]=0.\end{aligned}\right.

According to Theorem 4.2, the optimal control is

v_{t}=R_{t}^{-1}B_{t}\Gamma_{2t}\widehat{Y}_{t}+R_{t}^{-1}B_{t}\widehat{\psi}_{t}

with

\left\{\begin{aligned} dY=&\left[AY+B(R^{-1}B\Gamma_{2}\widehat{Y}+R^{-1}B\widehat{\psi})+C_{1}Z_{1}\right]dt+Z_{1}dW_{1}+Z_{2}dW_{2},\\ Y_{T}=&\ \zeta.\end{aligned}\right.

The corresponding Riccati equations are

\left\{\begin{aligned} &\dot{\Upsilon}-2A\Upsilon-H\Upsilon^{2}+B^{2}R^{-1}+C_{1}^{2}\Upsilon(1+\Upsilon N_{1})^{-1}=0,\\ &\Upsilon_{T}=0,\\ &\dot{\Gamma}_{1}+2A\Gamma_{1}-H=0,\\ &\Gamma_{10}=G,\\ &\dot{\Gamma}_{2}+2A\Gamma_{2}+[B^{2}R^{-1}+C_{1}^{2}\Upsilon(1+\Upsilon N_{1})^{-1}]\Gamma_{2}^{2}-H=0,\\ &\Gamma_{20}=G.\end{aligned}\right.

(5.8)

Equations (3.5) and (3.9) are reduced to

\left\{\begin{aligned} d\varphi=&\Big{[}A\varphi+\Upsilon H\widehat{\varphi}+C_{1}(\eta_{1}-\widehat{\eta}_{1})+C_{1}(1+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}dt+\eta_{1}dW_{1}+\eta_{2}dW_{2},\\ \varphi_{T}=&\ \zeta,\end{aligned}\right.

(5.9)

and

\left\{\begin{aligned} d\psi=&-\Big{\{}A\psi+\Gamma_{2}B^{2}R^{-1}\widehat{\psi}+\Gamma_{1}C_{1}(\eta_{1}-\widehat{\eta}_{1})+\Gamma_{2}C_{1}(1+\Upsilon N_{1})^{-1}\left[\widehat{\eta}_{1}+\Upsilon C_{1}\widehat{\psi}\right]\Big{\}}dt\\ &+\Big{[}(N_{1}-\Gamma_{1})(\eta_{1}-\widehat{\eta}_{1})+(N_{1}-\Gamma_{2})(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}-C_{1}(1+\Upsilon N_{1})^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\\ &-C_{1}\Gamma_{1}(\varphi-\widehat{\varphi})-C_{1}(\psi-\widehat{\psi})\Big{]}dW_{1}+(N_{2}-\Gamma_{1})\eta_{2}dW_{2},\\ \psi(0)=&\ 0,\end{aligned}\right.

respectively.

Refer to caption — Figure 1: The solutions of $\Upsilon,\Gamma_{1},\Gamma_{2}$

Note that it is hard to obtain a more explicit expression of $v$ due to the complexity of (5.8) and (5.9). In the following, we hope to give numerical solutions for this case with certain particular coefficients. Let $T=1,A=2,B=3t+2,C_{1}=t-2,G=2,H=e^{-0.05t},R=2t+1,N_{1}=t(T-t),N_{2}=2$ and $\zeta=T+sin(W_{1T})+cos(2W_{2T})$ . Applying Runge-Kutta method, we generate the dynamic simulations of $\Upsilon,\Gamma_{1}$ and $\Gamma_{2}$ , shown in Figure 1.

It seems that there is no existing literature on numerical methods of equation (5.9), which is a BSDE with filtering. Using Theorem 2.1 in Wang et al. [20] again, we get

\left\{\begin{aligned} d\widehat{\varphi}=&\Big{[}(A+\Upsilon Q)\widehat{\varphi}+C_{1}(1+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{]}dt+\widehat{\eta}_{1}dW_{1},\\ \widehat{\varphi}_{T}=&\ \widehat{\zeta},\\ d\widehat{\psi}=&-\Big{\{}\left[A+\Gamma_{2}B^{2}R^{-1}+\Gamma_{2}C_{1}^{2}\Upsilon(1+\Upsilon N_{1})^{-1}\right]\widehat{\psi}+\Gamma_{2}C_{1}(1+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}\Big{\}}dt\\ &+\Big{[}(N_{1}-\Gamma_{2})(I+\Upsilon N_{1})^{-1}\widehat{\eta}_{1}-C_{1}(I+\Upsilon N_{1})^{-1}(\Gamma_{2}\widehat{\varphi}+\widehat{\psi})\Big{]}dW_{1},\\ \widehat{\psi}_{0}=&\ 0.\end{aligned}\right.

Applying the numerical method introduced in Ma et al. [24], we generate the dynamic simulations of $\widehat{\varphi}$ and $\widehat{\eta}_{1}$ , shown in Fig. 2. For more information about numerical methods for BSDEs, please refer to Peng and Xu [25], Zhao et al. [26] and the references therein. The simulation of $\widehat{\psi}$ is also shown in Figure 2.

From Theorem 4.1 and Theorem 4.2, we have $\widehat{Y}=\Upsilon\widehat{X}+\widehat{\varphi}$ , $\widehat{X}=-\Gamma_{2}\widehat{Y}-\widehat{\psi}$ and $\widehat{Z}_{1}=(I+\Upsilon N_{1})^{-1}(\widehat{\eta}_{1}-\Upsilon C_{1}\widehat{X})$ . Then the dynamic simulations of $\widehat{Y}$ and $\widehat{Z}_{1}$ are similarly generated, shown in Figure 3. Further, from Theorem 4.2, we also generate the dynamic simulation of $v$ , which is presented in Figure 4.

6 Conclusion

We investigate an LQ control problem of BSDE with partial information, where both the generator of dynamic system and the cost functional contain diffusion terms $Z_{1}$ and $Z_{2}$ . This problem is solved completely and explicitly under some standard conditions. An feedback representation of optimal control and an explicit formula of corresponding optimal cost are given in terms of three Riccati equations, a BSDE with filtering and an SDE with filtering. Moreover, we work out two special scalar-valued control problems to illustrate our theoretical results.

Note that the coefficients in the generator of state equation and the weighting matrices in the cost functional are deterministic. If the coefficients are random, there will be an essential difficulty in solving the case. Since $\mathbb{E}[A_{t}Y_{t}|\mathcal{F}_{t}^{W_{1}}]=A_{t}\mathbb{E}[Y_{t}|\mathcal{F}_{t}^{W_{1}}]$ is no longer true if $A$ is an $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted stochastic process. We will investigate the stochastic case in future.

References

[1] J. M. Bismut, An introductory approach to duality in optimal stochastic control, SIAM Rev. 20 (1978) 62-78.
[2] E. Pardoux, S. Peng, Adapted solution of a backward stochastic differential equation, Syst. Control Lett. 14 (1) (1990) 55-61.
[3] N. El. Karoui, S. Peng, M. C. Quenez, Backward stochastic differential equations in finance, Math. Finance 7 (1997) 1-71.
[4] J. Ma, J. Yong, Forward-backward stochastic differential equations and their applications, Lecture Notes in Math, Springer-Verlag, New York, 1999.
[5] M. Kohlmann, X. Zhou, Relationship between backward stochastic differential equations and stochastic controls: a linear-quadratic approach, SIAM J. Control Optim. 38 (5) (2000) 1392-1407.
[6] S. Peng, Backward stochastic differential equations and applications to optimal control, Appl. Math. Optim. 27 (2) (1993) 125-144.
[7] N. G. Dokuchaev, X. Zhou, Stochastic controls with terminal contingent conditions, J. Math. Anal. Appl. 238 (1999) 143-165.
[8] A. E. B. Lim, X. Zhou, Linear-quadratic control of backward stochastic differential equations, SIAM J. Control Optim. 40 (2) (2001) 450-474.
[9] X. Li, J. Sun, J. Xiong, Linear quadratic optimal control problems for mean-field backward stochastic differential equations, Appl. Math. Optim. 80 (2019) 223-250.
[10] J. Huang, S. Wang, Z. Wu, Backward mean-field Linear-quadratic-gaussian (LQG) games: full and partial information, IEEE Trans. Automat. Control 60 (12) (2016) 3784-3796.
[11] K. Du, J. Huang, Z. Wu, Linear quadratic mean-field-game of backward stochastic differential systems, Math. Control Relat. Fields 8 (2018) 653-678.
[12] K. Du, Z. Wu, Linear-quadratic stackelberg game for mean-field backward stochastic differential system and application, Math. Probl. Eng. 17 (2019) 1-17.
[13] Y. Hu, B. Øksendal, Partial information linear quadratic control for jump diffusions, SIAM J. Control Optim. 47 (4) (2008) 1744-1761.
[14] Z. Wu, A maximum principle for partially observed optimal control of forward-backward stochastic control systems, Sci. China Infor. Sci. 53 (11) (2010) 2205-2214.
[15] J. Huang, G. Wang, J. Xiong, A maximum principle for partial information backward stochastic control problems with applications, SIAM J. Control Optim. 48 (4) (2009) 2106-2117.
[16] G. Wang, Z. Wu, J. Xiong, A linear-quadratic optimal control problem of forward-backward stochastic differential equations with partial information, IEEE Trans. Automat. Control 60 (11) (2015) 2904-2916.
[17] G. Wang, H. Xiao, G. Xing, An optimal control problem for mean-field forward-backward stochastic differential equation with noisy observation, Automatica 86 (2017) 104-109.
[18] G. Wang, H. Xiao, J. Xiong, A kind of LQ non-zero sum differential game of backward stochastic differential equation with asymmetric information, Automatica 97 (2018) 346-352.
[19] J. Yong, X. Zhou, Stochastic Controls: Hamiltonian Systems and HJB Equations, Springer-Verlag, New York, 1999.
[20] G. Wang, Z. Wu, J. Xiong, An Introduction to Optimal Control of FBSDE with Incomplete Information, Springer-Verlag, New York, 2018.
[21] J. Xiong, An Introduction to Stochastic Filtering Theory, Oxford University Press, London, 2008.
[22] R. S. Liptser, A. N. Shiryayev, Statistics of Random Processes, Springer-Verlag, New York, 1977.
[23] P. Huang, G. Wang, H. Zhang, A partial information linear-quadratic optimal control problem of backward stochastic differential equation with its applications, Sci. China Infor. Sci. 63 (9) (2020) 1-14.
[24] J. Ma, P. Protter, J. S. Martin, S. Torres, Numerical method for backward stochastic differential equations, Ann. Appl. Probab. 12 (2002) 302-316.
[25] S. Peng, M. Xu, Numerical algorithms for backward stochastic differential equations with 1-d brownian motion: Convergence and simulations, ESAIM Math. Model. Numer. Anal. 45 (2011) 335-360.
[26] W. Zhao, L. Chen, S. Peng, A new kind of accurate numerical method for backward stochastic differential equations, SIAM J. Sci. Comput. 28 (4) (2006) 1563-1581.

	$\displaystyle\ \mathbb{E}\left[Y_{0}^{*\top}G\widetilde{Y}_{0}\right]=$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{0}^{T}\Big{(}\langle A\widetilde{Y}+B(v-v^{})+C_{1}\widetilde{Z_{1}}+C_{2}\widetilde{Z_{2}},X^{}\rangle-\langle\widetilde{Y},A^{\top}X^{}+HY^{}\rangle$
		$\displaystyle-\langle\widetilde{Z_{1}},C_{1}^{\top}X^{}+N_{1}Z_{1}^{}\rangle-\langle\widetilde{Z_{2}},C_{2}^{\top}X^{}+N_{2}Z_{2}^{}\rangle\Big{)}dt\bigg{]}$
	$\displaystyle=$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{0}^{T}\Big{(}\langle v-v^{},B^{\top}X^{}\rangle-\langle\widetilde{Y},HY^{}\rangle-\langle\widetilde{Z_{1}},N_{1}Z_{1}^{}\rangle-\langle\widetilde{Z_{2}},N_{2}Z_{2}^{*}\rangle\Big{)}dt\bigg{]}.$

		$\displaystyle\ \|\widetilde{P}_{t}\|^{2}+\mathbb{E}\left[\int_{t}^{T}\delta e^{\delta(s-t)}\|\widetilde{P_{s}}\|^{2}ds\Big{\|}\mathcal{F}_{t}\right]+\mathbb{E}\left[\int_{t}^{T}e^{\delta(s-t)}(\|\widetilde{Q_{1s}}\|^{2}+\|\widetilde{Q_{2s}}\|^{2})ds\Big{\|}\mathcal{F}_{t}\right]$
	$\displaystyle=$	$\displaystyle\ 2\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\langle\widetilde{P}_{s},g(s,P^{\prime},Q_{1}^{\prime},Q_{2}^{\prime},\widehat{p^{\prime}},\widehat{q_{1}^{\prime}},\widehat{q_{2}^{\prime}})-g(s,P,Q_{1},Q_{2},\widehat{p},\widehat{q_{1}},\widehat{q_{2}})\rangle ds\Big{\|}\mathcal{F}_{t}\bigg{]}$
	$\displaystyle\leq$	$\displaystyle\ 2L\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\|\widetilde{P}_{s}\|\Big{(}\|\widetilde{P}_{s}\|+\|\widetilde{Q}_{1s}\|+\|\widetilde{Q}_{2s}\|+\|\widehat{\widetilde{p}}_{s}\|+\|\widehat{\widetilde{q}}_{1s}\|+\|\widehat{\widetilde{q}}_{2s}\|\Big{)}ds\Big{\|}\mathcal{F}_{t}\bigg{]}$
	$\displaystyle\leq$	$\displaystyle\ \mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\Big{(}(2L+4L^{2}+\frac{\delta}{2})\|\widetilde{P}_{s}\|^{2}+\frac{1}{2}\|\widetilde{Q}_{1s}\|^{2}+\frac{1}{2}\|\widetilde{Q}_{2s}\|^{2}\Big{)}ds\Big{\|}\mathcal{F}\bigg{]}$
		$\displaystyle+\mathbb{E}\bigg{[}\int_{t}^{T}e^{\delta(s-t)}\frac{6L^{2}}{\delta}\Big{(}\|\widehat{\widetilde{p}}_{s}\|^{2}+\|\widehat{\widetilde{q}}_{1s}\|^{2}+\|\widehat{\widetilde{q}}_{2s}\|^{2}\Big{)}ds\Big{\|}\mathcal{F}_{t}\bigg{]}.$

		$\displaystyle\ \mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}P_{t}\Big{\|}^{2}\right]$
	$\displaystyle\leq$	$\displaystyle\ 4\mathbb{E}\left[\|\zeta\|^{2}\right]+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}g(s,P,Q_{1},Q_{2},\widehat{Q},\widehat{Q_{1}},\widehat{Q_{2}})ds\Big{\|}^{2}\right]$
		$\displaystyle+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}Q_{1s}dW_{1s}\Big{\|}^{2}\right]+4\mathbb{E}\left[\sup_{t\in[0,T]}\Big{\|}\int_{t}^{T}Q_{2s}dW_{2s}\Big{\|}^{2}\right]$
	$\displaystyle\leq$	$\displaystyle\ 4\mathbb{E}\left[\|\zeta\|^{2}\right]+4T\mathbb{E}\left[\sup_{t\in[0,T]}\left(\int_{t}^{T}\|g(s,P,Q_{1},Q_{2},\widehat{P},\widehat{Q_{1}},\widehat{Q_{2}})\|^{2}ds\right)\right]$
		$\displaystyle+16\mathbb{E}\left[\int_{0}^{T}\big{\|}Q_{1t}\big{\|}^{2}dt\right]+16\mathbb{E}\left[\int_{0}^{T}\big{\|}Q_{2t}\big{\|}^{2}dt\right]$
	$\displaystyle<$	$\displaystyle\ \infty.$

Linear Quadratic Control of Backward Stochastic Differential Equation with Partial Information ††thanks: This work supported by the National Natural Science Foundations of China under Grants 61821004, 61633015, 61877062, and 61977043.

1 Introduction

2 Preliminaries

Theorem 2.1.

Theorem 2.2.

Proof.

3 Decoupling stochastic Hamiltonian system (2.5)

4 Explicit representations of optimal control and optimal cost

Lemma 4.1.

Proof.

Remark 4.1.

Theorem 4.1.

Proof.

Theorem 4.2.

Proof.

Remark 4.2.

Remark 4.3.

5 One-dimensional case

5.1 Special case: H=N1=0H=N_{1}=0

Remark 5.1.

5.2 Special case: C2=0C_{2}=0

6 Conclusion

References

Linear Quadratic Control of Backward Stochastic Differential Equation with Partial Information ^†^†thanks: This work supported by the National Natural Science Foundations of China under Grants 61821004, 61633015, 61877062, and 61977043.

5.1 Special case: $H=N_{1}=0$

5.2 Special case: $C_{2}=0$