Adaptive finite element approximation of sparse optimal control with integral fractional Laplacian

Fangyuan Wang School of Mathematics and Statistics, Shandong Normal University, Jinan, 250014, China , Qiming Wang School of Mathematical Sciences, Beijing Normal University, Zhuhai, 519087, China and Zhaojie Zhou^∗ School of Mathematics and Statistics, Shandong Normal University, Jinan, 250014, China

(Date: ⁰⁰footnotetext: ^∗Corresponding author: [email protected])

Abstract.

In this paper we present and analyze a weighted residual a posteriori error estimate for an optimal control problem. The problem involves a nondifferentiable cost functional, a state equation with an integral fractional Laplacian, and control constraints. We employ subdifferentiation in the context of nondifferentiable convex analysis to obtain first-order optimality conditions. Piecewise linear polynomials are utilized to approximate the solutions of the state and adjoint equations. The control variable is discretized using the variational discretization method. Upper and lower bounds for the a posteriori error estimate of the finite element approximation of the optimal control problem are derived. In the region where $\frac{3}{2}<\alpha<2$ , the residuals do not satisfy the $L^{2}(\Omega)$ regularity. To address this issue, an additional weight is included in the weighted residual estimator, which is based on a power of the distance from the mesh skeleton. Furthermore, we propose an h-adaptive algorithm driven by the posterior view error estimator, utilizing the D $\rm{\ddot{o}}$ rfler labeling criterion. The convergence analysis results show that the approximation sequence generated by the adaptive algorithm converges at the optimal algebraic rate. Finally, numerical experiments are conducted to validate the theoretical results.

Key words and phrases:

adaptive finite element; optimal control; sparse control; fractional Laplacian; a posteriori error estimate

Mathematics Subject Classification:

49J20,49M25,65N12,65N30,65N50

1. Introduction

In this paper, we present and analyze a weighted residual a posteriori error estimate for an optimal control problem involving a nondifferentiable cost functional, a state equation with an integral fractional Laplacian, and control constraints. For a bounded Lipschitz domain $\Omega\subset R^{d},\Omega^{c}:=R^{d}\backslash\overline{\Omega}$ , we consider

\displaystyle\min\limits_{u\in U_{ad}}J(y,u):=\frac{1}{2}\|y-y_{d}\|^{2}_{L^{2}(\Omega)}+\frac{\gamma}{2}\|u\|^{2}_{L^{2}(\Omega)}+\beta\|u\|_{L^{1}(\Omega)}

(1.1)

subject to

\displaystyle\left\{\begin{aligned} (-\Delta)^{\frac{\alpha}{2}}y&=f+u,\ &\mbox{in}\ \Omega,\\ y&=0,\ &\mbox{on}\ \Omega^{c},\end{aligned}\right.

(1.2)

and the control constraints

\displaystyle U_{ad}=\Big{\{}v\in L^{2}(\Omega)|a\leq v\leq b,\ a,b\in R\Big{\}}.

Here parameters $\gamma>0$ and $\beta>0$ . In the sequel, $y$ is state and $u$ is the control variable. The function $y_{d}\in L^{2}(\Omega)$ is referred to as desired state. To focus on the scenario of nondifferentiability, it is assumed that $a,b\in R$ satisfy the condition that $a<0<b$ . We notice that the set $U_{ad}$ , is a nonempty, bounded, closed and convex subset of $L^{2}(\Omega)$ .

The introduction of nonsmooth regularization term $\beta\|u\|_{L^{1}(\Omega)}$ in the PDE-constrained optimization problems promotes sparsity in the solutions. This allows the control variable in the optimization process to tend towards zero in regions where it has negligible impact on the cost function, therefore minimizing the cost function. Sparse optimization is widely used in many practical applications, especially in the processing and analysis of high dimensional data, such as noise processing, machine learning, face recognition, etc.

Previous research has addressed the analysis of optimal control problems with a cost term containing $L^{1}(\Omega)$ norm [1, 2, 3, 4, 5, 6, 7, 8, 20]. For instance, in [7], the authors investigated the $L^{1}(\Omega)$ control problem constrained by a linear elliptic PDE, where the objective functional incorporated a regularization technique based on the $L^{2}(\Omega)$ control cost term. The authors analyzed the optimality conditions and proposed a semismooth Newton method that achieves local convergence with superlinear speed. Building upon this work, [8] provided a priori and posteriori error estimates through finite element analysis. Furthermore, in [5], the authors considered a semilinear elliptic PDE as the state equation and analyzed second-order optimality conditions. Additionally, the authors in [6] studied the sparse control problem with a fractional diffusion equation as the state equation. They analyzed a priori error estimate for the fully discrete case using finite element methods. More recently, Ot $\rm{\acute{a}}$ rola et al. studied a sparse optimal control problem with a non-differentiable cost functional, where the state equations are Poisson’s problem and fractional diffusion equation, respectively in [9, 10]. In [9], the authors studied three different strategies for approximating the control variable, they proposed and analyzed a reliable and efficient a posteriori error estimate, and designed an adaptive strategy to achieve optimal convergence rates. In [10], the authors investigated an adaptive finite element method for sparse optimal control of fractional diffusion, taking into account the spectral definition of the fractional Laplacian operator.

In comparison to the priori error analysis of finite element approximations for PDE-constrained optimization, the design and analysis of a posteriori error estimate is not much. The initial work on reliable a posteriori error estimation for optimal control problems was presented in [11], followed by a series of related studies [12, 13, 14, 15, 16]. Residual-based a posteriori error estimates incorporating data oscillation were introduced in [17]. Later, a unified framework for the a posterior error analysis of linear quadratic optimal control problems with control constraints was established in [18], and the pure convergence of an adaptive finite element method for optimal control problems with variable divergence control was proved, that is, convergence without convergence rate. In [19], the authors rigorously proved convergence and quasi-optimality of AFEM for optimal control problem involving state and adjoint state variable.

However, to the best of our knowledge, no previous research has combined adaptive finite element methods (AFEMs) with integral fractional Laplacian sparse optimal control to address such problems. Therefore, in this paper, we focus on the adaptive finite element approximation for sparse optimal control with the integral fractional Laplacian. We outline and analyze the solution methodology for problem (1.1)-(1.2) based on the following considerations:

$\bullet$ The optimal control problem involving the fractional Laplacian operator can effectively simulate groundwater pollution [21], turbulent flow [22], and chaotic dynamics [23]. Unlike integer-order diffusion equations, the fractional Laplacian operator exhibits power-law decay, which accurately captures heavy-tailed power-law decay phenomena observed in these applications. Hence, studying fractional optimal control problem is essential.

$\bullet$ Objective functional by introducing the $L^{1}(\Omega)$ norm to control some specific physical quantities or locations, and the $L^{2}(\Omega)$ norm to maintain smoothness and continuity, can better solve the practical problems that need to control the optimal cost. The existence of $L^{1}(\Omega)$ term in the objective function requires us to derive the first-order condition using a subdifferential approach [5, 7, 8], which is different from distributed optimal control problems.

$\bullet$ Due to the non-locality, non-differentiability, and intrinsic constraints of the fractional Laplacian operator, by adopting adaptive strategies and a posteriori error estimate, we can identify singularities and refine the mesh accordingly, which can more effectively allocate computational resources and achieve higher accuracy with lower computational costs. One of the challenges in designing the a posterior error estimator is the nature of the residual, that is, it is not necessarily in $L^{2}(\Omega)$ . We refer to [26] and introduce the weighted residual estimator, where the weights are given by the power of the distance to the grid skeleton

	$\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},K):=\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(f+u_{\mathcal{T}_{h}}-(-\Delta)^{s}y_{\mathcal{T}_{h}})\\|^{2}_{L^{2}(K)},$
	$\displaystyle E^{2}_{p}(p_{\mathcal{T}_{h}},K):=\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(y_{\mathcal{T}_{h}}-y_{d}-(-\Delta)^{s}p_{\mathcal{T}_{h}})\\|^{2}_{L^{2}(K)},$

(see section 4).

$\bullet$ The adaptive finite element method is widely used, but there are not many convergence analyses of the algorithm. The optimal control problem we studied is a coupled system with nonlinear properties, which leads to the lack of orthogonality presented in [30] and brings difficulties to our convergence analysis. In order to address this issue, we refer to reference [28] and prove its quasi-orthogonality.

Recently, the only work on a posteriori error analysis for sparse optimal control constrained by fractional order equations, as in (1.1)-(1.2), is found in [10]. Compared with [10], this paper studies the integral definition of the fractional Laplacian operator, which plays an important role in the modeling of complex non-local and nonlinear phenomena such as diffusion, heat transfer, resistance and elasticity. And the main difference is that the convergence of the adaptive algorithm is also analyzed in this paper. In this paper, we use piecewise linear polynomial dispersion for the state variable and variational discretization for the control variable. We design a posterior error estimator that requires only discretization of the state variable and adjoint variable. Notably, in the $\frac{3}{2}<\alpha<2$ , the residual does not satisfy the $L^{2}(\Omega)$ -regularity. To address this issue, an additional weight based on the power of the distance from the mesh skeleton is included in the weighted residual estimator. An h-adaptive algorithm driven by the D $\rm{\ddot{o}}$ rfler marking criterion based on the a posteriori error estimator is proposed and its convergence is proved.

The organization of the paper is as follows: In section 2, we introduce the symbols used and provide a brief overview of elements in convex analysis, along with the regularity of solutions to optimal control problem. In section 3, we analyze the first-order optimality conditions for the problem. In section 4, we introduce the finite element discretization of the optimal control problem (1.1)-(1.2) and design a weighted residual estimator. The core of our work is presented in sections 4 and 5. For the discretization introduced at the beginning of section 4, we first derive upper and lower bounds for the a posteriori error estimate of the finite element approximation for the optimal control problem. An h-adaptive algorithm driven by the posterior view error estimator based on D $\rm{\ddot{o}}$ rfler labeling criterion is proposed. In section 5, we show that the sequence of approximations produced by the adaptive algorithm converges at the optimal algebraic rate. In section 6, a series of numerical examples are provided to demonstrate the effectiveness of our theoretical findings.

2. Preliminaries

In this section we introduce some preliminaries about fractional Sobolev spaces, subdifferential and fractional Laplacian. For a bounded domain $\Lambda\subset R^{d},L^{2}(\Lambda)$ denotes the Banach spaces of standard 2-th Lebesgue integrable functions on $\Lambda$ . For $s\in(0,1),$ $H^{s}(\Lambda)$ denotes the fractional Sobolev space. $H_{0}^{s}(\Lambda)$ is the subspace of $H^{s}(\Lambda)$ consisting of functions whose trace is zero on $\partial\Lambda$ . Let $(\cdot,\cdot)$ and $\|\cdot\|$ denote the inner product and norm in $L^{2}(\Lambda)$ , respectively. The seminorm $|\cdot|_{H^{s}(\Lambda)}$ and the full norm $\|\cdot\|_{H^{s}(\Lambda)}$ are denoted as follows

|y|^{2}_{H^{s}(\Lambda)}=\int\int_{\Lambda\times\Lambda}\frac{y(v)-y(w)}{|v-w|^{d+2s}}dvdw,

\|y\|^{2}_{H^{s}(\Lambda)}=\|y\|^{2}+|y|^{2}_{H^{s}(\Lambda)}.

Moreover, we introduce the following space, which will be used in the weak formulation of state equation

\widetilde{H}^{s}(\Omega)=\{v\in H^{s}(R^{d}):v=0\ \ \textrm{in}\ \ \Omega^{c}\}.

Next, we will review some concepts with respect to subdifferentials from convex analysis that will be useful in our upcoming analysis. For details, please refer to [24]. Consider a real and normed vector space $G$ . Suppose $\phi:G\rightarrow R\cup\{\infty\}$ be a convex and proper functional. Let $v\in G$ be such that $\phi(v)<\infty$ . A subgradient of $G$ at $v$ is an element $v^{*}\in G^{*}$ that satisfies

\displaystyle\langle v^{*},w-v\rangle_{G^{*},G}\leq\phi(w)-\phi(v),\ \forall w\in G.

(2.1)

Here, $\langle\cdot,\cdot\rangle_{G^{*},G}$ represents the duality pairing between $G^{*}$ and $G$ . The set of all subgradients of $\phi$ at $\bar{v}$ , denoted by $\partial\phi(\bar{v})$ , refers to the subdifferential of $\phi$ at $\bar{v}$ .

\partial\phi(\bar{v})=\{v\in L^{2}(\Omega):\phi(w)-\phi(\bar{v})\geq(v,w-\bar{v}),\ w\in L^{2}(\Omega)\}.

As $\phi$ is a convex functional, the subdifferential at any point $v$ within the effective domain of $\phi$ is not empty. Additionally, it is important to note that the subdifferential is monotone, i.e.,

\displaystyle\langle v^{*}-w^{*},v-w\rangle_{G^{*},G}\geq 0,\ \forall v^{*}\in\partial\phi(v),\ \forall w^{*}\in\partial\phi(w).

(2.2)

Finally, we introduce the definition of fractional Laplacian:

\displaystyle(-\Delta)^{\frac{\alpha}{2}}y(x):=C(d,\alpha)\ {\textrm{p.v.}}\int_{R^{d}}\frac{y(x)-y(w)}{|x-w|^{d+\alpha}}dw.

(2.3)

Here $0<\alpha<2$ , and

C(d,\alpha)=\frac{2^{\alpha}\Gamma(\frac{\alpha}{2}+\frac{d}{2})}{\pi^{d/2}\Gamma(-\frac{\alpha}{2})}

and ”p.v.” denotes the principal value of the integral:

\displaystyle{\textrm{p.v.}}\int_{R^{d}}\frac{y(x)-y(w)}{|x-w|^{d+\alpha}}dw=\lim\limits_{\epsilon\rightarrow 0}\int_{R^{d}\setminus B_{\epsilon}(v)}\frac{y(x)-y(w)}{|x-w|^{d+\alpha}}dw,

(2.4)

where $B_{\epsilon}(v)$ is a ball of radius $\epsilon$ centered at $x$ . The difference $y(x)-y(w)$ in the numerator of (2.3), which vanishes at the singularity, provides a regularization, which together with averaging of positive and negative parts allows the principal value to exist. A consequence of this definition is the mapping property (see [25]).

\displaystyle(-\Delta)^{\frac{\alpha}{2}}:H^{s}(R^{d})\rightarrow H^{s-\alpha}(R^{d}),\ s\geq\frac{\alpha}{2}.

3. Optimal control problem

The weak formulation of state equation (1.2) reads: Find $y\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ such that

\displaystyle a(y,v)=(f+u,v),\ \ \forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).

(3.1)

Here

\displaystyle a(y,v)=\frac{C(d,\alpha)}{2}\int\int_{R^{d}\times R^{d}}\frac{(y(x)-y(w))(v(x)-v(w))}{|x-w|^{d+\alpha}}dxdw.

We define

\|y\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}:=a(y,y)=\frac{C(d,\alpha)}{2}|y|^{2}_{H^{\frac{\alpha}{2}}(R^{d})}.

As the $H^{\frac{\alpha}{2}}(R^{d})$ seminorm is equivalent to the $H^{\frac{\alpha}{2}}(R^{d})$ norm on $\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ (see [29]), by Lax-Milgram theorem, the solution $y\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ exists and is unique.

For the state equation with the right hand $f$ we can define a linear and bounded solution operator $\mathcal{S}:L^{2}(\Omega)\longrightarrow\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ such that $y=\mathcal{S}f$ . Moreover, the following regularity result holds for the state equation.

Lemma 3.1.

([31]) For $f(x)+u(x)\in L^{2}(\Omega)$ , there exists a solution $y\in{H}^{{\frac{\alpha}{2}}+\sigma-\epsilon}(\Omega)$ satisfies

\displaystyle|y|_{{H}^{{\frac{\alpha}{2}}+\sigma-\epsilon}(\Omega)}\leq\frac{C(\Omega,d,\alpha)}{\epsilon^{\tau}}\|f+u\|_{L^{2}(\Omega)},\forall 0<\epsilon<{\frac{\alpha}{2}}.

Here $\sigma=\min\{\frac{\alpha}{2},\frac{1}{2}\}$ , $\tau=\frac{1}{2}$ for $1<\alpha<2$ and $\tau=\frac{1}{2}+\zeta$ for $0<\alpha\leq 1$ as well as a constant $\zeta$ depending on $\Omega$ and $d$ .

The weak formulation of the optimal control problem (1.1)-(1.2) reads:

\displaystyle\min\limits_{y\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega),\ u\in U_{ad}}J(y,u)

(3.2)

subject to

\displaystyle a(y,v)=(f+u,v),\ \ \forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).

(3.3)

Since the $J$ is strictly convex and weakly lower semicontinuous, this problem admits a unique optimal solution $(y,u)\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega)\times L^{2}(\Omega)$ .

In order to obtain optimality conditions for (3.2)-(3.3), we introduce the following adjoint state $p$ as follows:

\displaystyle a(w,p)=(y-y_{d},w),\ \ \forall w\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).

(3.4)

Set

j_{1}(u)=\frac{1}{2}\|\mathcal{S}(u+f)-y_{d}\|^{2}_{L^{2}(\Omega)}+\frac{\gamma}{2}\|u\|^{2}_{L^{2}(\Omega)}

and $j_{2}(u)=\|u\|_{L^{1}(\Omega)}$ . Then we obtain the reduced problem of (1.1):

\displaystyle\min\limits_{u\in U_{ad}}\hat{J}(u)=\min\limits_{u\in U_{ad}}j_{1}(u)+\beta j_{2}(u).

(3.5)

Although the reduced cost functional (3.5) is nonsmooth, it consists in the sum of a regular part and a convex nondifferentiable term. Thanks to the structure, optimality conditions can still be established according to the following result.

Lemma 3.2.

([27]) Let $\hat{J}(u)$ be defined as in (3.5). The element $u\in U_{ad}$ is a minimizer of $\hat{J}(u)$ over $U_{ad}$ if and only if there exists a subgradient $\lambda^{*}\in\partial\hat{J}(u)$ such that

\displaystyle(\lambda^{*},v-u)\geq 0,\ \forall v\in U_{ad}.

(3.6)

Theorem 3.1.

(Optimality conditions) If $(y,u)$ is an optimal solution to (3.5), then it satisfies the following variational inequality

\displaystyle(p+\gamma u+\beta\lambda,v-u)\geq 0,\ \forall v\in U_{ad},

(3.7)

where $p$ denotes the solution to (3.4) and $\lambda\in\partial j_{2}(u)$ .

Proof.

Since the convex functional $j_{1}(u)$ is Fr $\rm{\acute{e}}$ chet differentiable we immediately have that $\partial j_{1}(u)=j^{\prime}_{1}(u).$ In view of the fact $j_{2}(u)$ is convex, that

\partial\hat{J}(u)=j_{1}^{\prime}(u)+\beta\partial j_{2}(u).

By simple calculations, we have that

	$\displaystyle j_{1}^{\prime}(u)(v-u)$	$\displaystyle=\lim\limits_{t\to 0}\frac{1}{2t}\int_{\Omega}\Big{(}(y(u+t(v-u))-y_{d})^{2}-(y(u)-y_{d})^{2}\Big{)}dx+\lim\limits_{t\to 0}\frac{\gamma}{2t}\int_{\Omega}\left((u+t(v-u))^{2}-u^{2}\right)dx$
		$\displaystyle=\int_{\Omega}(y(u)-y_{d})y^{\prime}(u)(v-u)dx+\gamma\int_{\Omega}u(v-u)$
		$\displaystyle=(p+\gamma u,v-u).$

According to Lemma 3.2, there exists a multiplier $\lambda\in\partial j_{2}(u),$ such that

(p+\gamma u+\beta\lambda,v-u)\geq 0.

∎

For $a,b\in R$ , we introduce a projection operator $\Pi_{[a,b]}:L^{2}(\Omega)\rightarrow U_{ad}$ defined by

\displaystyle\Pi_{[a,b]}(v)=\min\{b,\max\{a,v\}\}.

(3.8)

Then we have the following projection formulas.

Theorem 3.2.

(Projection formulas) Suppose $(y,p,u,\lambda)$ are the optimal variables associated to (3.7), then we obtain

	$\displaystyle u=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p+\beta\lambda)\right),$		(3.9)
	$\displaystyle\|p\|\leq\beta,\ \mbox{in}\ \{x\in\Omega,\ u=0\},$		(3.10)
	$\displaystyle\lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p\right).$		(3.11)

It guarantees the uniqueness of the subgradient $\lambda.$

Proof.

The derivation of the formula (3.9) is standard in control theory. According to [27], we know that $\lambda\in\partial j_{2}(u)$ if and only if

\displaystyle\left\{\begin{aligned} &\lambda(x)=1,&u(x)>0,\\ &\lambda(x)=-1,&u(x)<0,\\ &\lambda(x)\in[-1,1],&u(x)=0.\end{aligned}\right.

(3.12)

By (3.9), (3.12) and $a<0<b$ , we arrive at

\displaystyle\left\{\begin{aligned} &u(x)=0\xrightarrow{(\ref{lambda3})}\lambda(x)\in[-1,1]\xrightarrow{(\ref{u})}\mid p\mid\leq\beta,\\ &u(x)<0\xrightarrow{(\ref{lambda3})}\lambda=-1\xrightarrow{(\ref{u})}p+\beta\lambda>0\Rightarrow p>\beta,\\ &u(x)>0\xrightarrow{(\ref{lambda3})}\lambda=1\xrightarrow{(\ref{u})}p+\beta\lambda<0\Rightarrow p<-\beta.\end{aligned}\right.

These three properties are equivalent to (3.10). Therefore, (3.9), (3.10), (3.12) the previous estimate allow us to deduce (3.11)

\displaystyle\left\{\begin{aligned} &\mid p\mid\leq\beta\xrightarrow{(\ref{p})}u(x)=0\xrightarrow{(\ref{lambda3})}\lambda(x)\in[-1,1]\xrightarrow{(\ref{u})}p+\beta\lambda=0\Rightarrow\lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p\right),\\ &p>\beta\Rightarrow u(x)<0\xrightarrow{(\ref{lambda3})}\lambda=-1\Rightarrow\lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p\right),\\ &p<-\beta\Rightarrow u(x)>0\xrightarrow{(\ref{lambda3})}\lambda=1\Rightarrow\lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p\right),\end{aligned}\right.

which completes the proof. ∎

At end we present the following first order optimality conditions for above optimal control problems.

Theorem 3.3.

Let $(y,u)$ be the solution of the optimal control problem (3.2)-(3.3). Then there exists an adjoint state $p$ , $\lambda\in\partial j_{2}(u)$ such that

\displaystyle\left\{\begin{aligned} &a(y,v)=(f+u,v)\ &\forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega),\\ &a(w,p)=(y-y_{d},w)\ &\forall w\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega),\\ &(p+\gamma u+\beta\lambda,v-u)\geq 0,\ &\forall v\in U_{ad},\end{aligned}\right.

(3.13)

4. Finite element approximation method and a posteriori error estimate

We begin by partitioning the domain $\Omega$ into simplices $K$ with size $h_{K}:=|K|^{\frac{1}{d}}$ , forming a conforming partition $\mathcal{T}_{h}=\{K\}$ . We then define $h_{\mathcal{T}_{h}}=\max\limits_{K\in\mathcal{T}_{h}}h_{K}$ and denote by $\mathbb{T}$ the collection of conforming and shape regular meshes that are refinements of an initial mesh $\mathcal{T}_{h_{0}}$ . For $\mathcal{T}_{h}\in\mathbb{T},$ let $\mathbb{V}_{\mathcal{T}_{h}}$ be the finite element space consisting of continuous piecewise linear functions over the triangulation $\mathcal{T}_{h}$

\displaystyle\mathbb{V}_{\mathcal{T}_{h}}=\{v_{\mathcal{T}_{h}}\in C(\bar{\Omega})\cap H_{0}^{1}(\Omega);\ v_{\mathcal{T}_{h}}|_{K}\in\mathbb{P}_{1}(K),\forall K\in\mathcal{T}_{h}\}.

For all elements $K\in\mathcal{T}_{h}$ and $k\in\mathbb{N}_{0}$ , we introduce the $k$ -th order element patch inductively by

	$\displaystyle\Omega_{h}^{0}(K):=K,\mathcal{T}_{h}^{0}(K):=\{K\}$
	$\displaystyle\Omega_{h}^{k}(K):=\mathrm{interior}(\bigcup\limits_{{K^{\prime}}\in\mathcal{T}_{h}^{k}(K)}\overline{K^{\prime}}),\ \mbox{where}\ \mathcal{T}_{h}^{k}(K):=\{K^{\prime}\in\mathcal{T}_{h}:\overline{K^{\prime}}\cap\overline{\Omega_{h}^{k-1}(K)}\neq\emptyset\}.$

The finite element approximation of the optimal control problem (3.2)-(3.3) can be characterized as

\min\limits_{(y_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}})\in\mathbb{V}_{\mathcal{T}_{h}}\times U_{ad}}J(y_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}})

subject to

\displaystyle a(y_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}})=(f+u_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}}),\ \ \forall v_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}}.

(4.1)

Here the admissible set of control $U_{ad}$ is not discretized, i.e., the so-called variational discretization approach. Similar to the continuous case we have the discrete first order optimality condition

\displaystyle\left\{\begin{aligned} &a(y_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}})=(f+u_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}}),&\forall v_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}},\\ &a(w_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}})=(y_{\mathcal{T}_{h}}-y_{d},w_{\mathcal{T}_{h}}),&\forall w_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}},\\ &(p_{\mathcal{T}_{h}}+\gamma u_{\mathcal{T}_{h}}+\beta\lambda_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}}-u_{\mathcal{T}_{h}})\geq 0,\ &\forall v_{\mathcal{T}_{h}}\in U_{ad},\end{aligned}\right.

(4.2)

where $\lambda_{\mathcal{T}_{h}}\in\partial j_{2}(u_{\mathcal{T}_{h}}).$ Next, we give the following discrete projection formula.

Lemma 4.1.

Suppose $(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}},\lambda_{\mathcal{T}_{h}})$ are the optimal variables associated to (4.2), then we obtain

	$\displaystyle u_{\mathcal{T}_{h}}=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p_{\mathcal{T}_{h}}+\beta\lambda_{\mathcal{T}_{h}})\right),$		(4.3)
	$\displaystyle\|p_{\mathcal{T}_{h}}\|\leq\beta,\ \mbox{in}\ \{x\in\Omega,\ u_{\mathcal{T}_{h}}=0\},$		(4.4)
	$\displaystyle\lambda_{\mathcal{T}_{h}}=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p_{\mathcal{T}_{h}}\right).$		(4.5)

Similar to the continuous case, we can define a discrete control-to-state mapping $\mathcal{S}_{h}:L^{2}(\Omega)\longrightarrow\mathbb{V}_{\mathcal{T}_{h}}$ . Set $\Theta(h)=\sup\limits_{f\in L^{2}(\Omega),\|f\|=1}\inf\limits_{\chi_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}}}\|\mathcal{S}f-\chi_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$ Then we have $\Theta(h)\ll 1$ for $h\in(0,h_{0})$ with $h_{0}\ll 1.$

Lemma 4.2.

Assume that $\mathcal{S}f\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ and $\mathcal{S}_{h}f\in\mathbb{V}_{\mathcal{T}_{h}}$ are the solutions of the continuous and discretised state equation with right hand term $f\in L^{2}(\Omega)$ . Then the following error estimates hold

\displaystyle\|\mathcal{S}f-\mathcal{S}_{h}f\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq C\Theta(h)\|f\|_{L^{2}(\Omega)}

and

\displaystyle\|\mathcal{S}f-\mathcal{S}_{h}f\|

\displaystyle\leq C\Theta(h)\|\mathcal{S}f-\mathcal{S}_{h}f\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

Proof.

We invoke the Galerkin orthogonality to arrive at

	$\displaystyle\\|\mathcal{S}f-\mathcal{S}_{h}f\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq a(\mathcal{S}f-\mathcal{S}_{h}f,\mathcal{S}f-\mathcal{S}_{h}f)=a(\mathcal{S}f-\mathcal{S}_{h}f,\mathcal{S}f-\chi_{\mathcal{T}_{h}}+\chi_{\mathcal{T}_{h}}-\mathcal{S}_{h}f)$
		$\displaystyle\leq C\\|\mathcal{S}f-\mathcal{S}_{h}f\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\\|\mathcal{S}f-\chi_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$

Thus we have

\displaystyle\|\mathcal{S}f-\mathcal{S}_{h}f\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq C\Theta(h)\|f\|_{L^{2}(\Omega)}.

Next, let $m=\mathcal{S}g$ be the solution of the following problem with $g(x)\in L^{2}(\Omega)$

\displaystyle a(w,m)=(g,w),\ w\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).

Then we have

\|\mathcal{S}g-\mathcal{S}_{h}g\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq C\Theta(h)\|g\|_{L^{2}(\Omega)}.

Setting $w=\mathcal{S}f-\mathcal{S}_{h}f$ , to prove the second estimate, we invoke the Galerkin orthogonality and the previous estimate to arrive at

	$\displaystyle(\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f,g)$	$\displaystyle=a(\mathcal{S}g,\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f)=a(\mathcal{S}g-\mathcal{S}_{\mathcal{T}_{h}}g+\mathcal{S}_{\mathcal{T}_{h}}g,\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f)$
		$\displaystyle=a(\mathcal{S}g-\mathcal{S}_{\mathcal{T}_{h}}g,\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f)$
		$\displaystyle\leq C\\|\mathcal{S}g-\mathcal{S}_{\mathcal{T}_{h}}g\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\\|\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq C\Theta(h)\\|g\\|_{L^{2}(\Omega)}\\|\mathcal{S}f-\mathcal{S}_{\mathcal{T}_{h}}f\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$

Consequently,

\|\mathcal{S}f-\mathcal{S}_{h}f\|\leq C\Theta(h)\|\mathcal{S}f-\mathcal{S}_{h}f\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

∎

To derive a posteriori error analysis we need to introduce the following auxiliary problems

\displaystyle\left\{\begin{aligned} a(\tilde{y},v)&=(f+u_{\mathcal{T}_{h}},v),\ \ \ \ \ \ \ \forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega),\\ a(w,\tilde{p})&=(y_{\mathcal{T}_{h}}-y_{d},w),\ \ \ \ \ \ \ \forall w\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).\end{aligned}\right.

(4.6)

Note that the residuals do not satisfy the $L^{2}(\Omega)$ -regularity for $\frac{3}{2}<\alpha<2$ . To address this issue, we require a weight function to measure the distance from the mesh skeleton. For a mesh $\mathcal{T}_{h}$ , we introduce the weight function defined in [26]

\omega_{\mathcal{T}_{h}}(x):=\inf\limits_{K\in\mathcal{T}_{h}}\inf\limits_{y\in\partial K}|x-y|.

We further define the corresponding weighted residual errors as follows

	$\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},K):=\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(f+u_{\mathcal{T}_{h}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{h}})\\|^{2}_{L^{2}(K)},$		(4.7)
	$\displaystyle E^{2}_{p}(p_{\mathcal{T}_{h}},K):=\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(y_{\mathcal{T}_{h}}-y_{d}-(-\Delta)^{\frac{\alpha}{2}}p_{\mathcal{T}_{h}})\\|^{2}_{L^{2}(K)},$		(4.8)

where

\displaystyle\widetilde{h}^{\frac{\alpha}{2}}_{K}=\left\{\begin{aligned} &h^{\frac{\alpha}{2}}_{K},\ \ \ \ \ \ \ \ \ \ \ \ &\alpha\in(0,1],\\ &h^{{\frac{\alpha}{2}}-\sigma}_{K}\omega_{\mathcal{T}_{h}}^{\sigma},\ &\alpha\in(1,2),\ \sigma:=\frac{\alpha}{2}-{\frac{1}{2}}.\end{aligned}\right.

Then on a subset $\omega\subset\Omega$ , we define the error estimators of the state and adjoint state by

\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\omega):=\sum\limits_{K\in\mathcal{T}_{h},K\subset\omega}E^{2}_{y}(y_{\mathcal{T}_{h}},K),\ \ \ \ E^{2}_{p}({\mathcal{T}_{h}},\omega):=\sum\limits_{K\in\mathcal{T}_{h},K\subset\omega}E^{2}_{p}(p_{\mathcal{T}_{h}},K).

Thus, $E_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})$ and $E_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h})$ constitute the error estimators for the state equation and the adjoint state equation on $\Omega$ with respect to $\mathcal{T}_{h}$ as follows

\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h}):=\sum\limits_{K\in\mathcal{T}_{h}}E^{2}_{y}(y_{\mathcal{T}_{h}},K),\ \ \ \ E^{2}_{p}({\mathcal{T}_{h}},\mathcal{T}_{h}):=\sum\limits_{K\in\mathcal{T}_{h}}E^{2}_{p}(p_{\mathcal{T}_{h}},K).

Moreover we also need the Scott-Zhang operator([26]) $\Pi_{\mathcal{T}_{h}}:L^{2}(\Omega)\rightarrow\mathbb{V}_{\mathcal{T}_{h}}$ that satisfy the following properties

	$\displaystyle(1):\Pi_{\mathcal{T}_{h}}v=v,\ \forall v\in\mathbb{V}_{\mathcal{T}_{h}}.$		(4.9)
	$\displaystyle(2):\\|\Pi_{\mathcal{T}_{h}}v\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\mathbb{C}_{sz}\\|v\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)},\forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).$		(4.10)
	$\displaystyle(3):\\|\widetilde{h}^{-\frac{\alpha}{2}}_{\mathcal{T}_{h}}(1-\Pi_{\mathcal{T}_{h}})v\\|\leq\mathbb{C}_{sz}\\|v\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)},\forall v\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega).$		(4.11)

Lemma 4.3.

For $0<\alpha<2$ , $f+u_{\mathcal{T}_{h}}\in L^{2}(\Omega)$ and $y_{\mathcal{T}_{h}}-y_{d}\in L^{2}(\Omega)$ the weighted residual error estimator is reliable:

\displaystyle\|\tilde{y}-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\mathbb{C}_{yrel}E_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h}),\ \|\tilde{p}-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\mathbb{C}_{prel}E_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h}).

Moreover, for $0<\alpha\leq 1$ and $\tilde{y},\ \tilde{p}\in H^{\frac{\alpha}{2}+\frac{1}{2}-\epsilon}(\Omega)\cap\widetilde{H}^{\frac{\alpha}{2}}(\Omega),0\leq\epsilon<\min\{\frac{\alpha}{2},\frac{1}{2}-\frac{\alpha}{2}\}$ , the estimator is also efficient

	$\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{yeff}\Big{(}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{H^{\frac{\alpha}{2}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}\Big{)},$
	$\displaystyle E^{2}_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h})$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{peff}\Big{(}\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|_{H^{\frac{\alpha}{2}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}\Big{)}.$

Proof.

Note that $y_{\mathcal{T}_{h}}$ and $p_{\mathcal{T}_{h}}$ are finite element approximations of $\tilde{y}$ and $\tilde{p}$ . We refer the reader to [26] for details on the proof of the upper and lower bounds in the Lemma. ∎

We define

	$\displaystyle\hat{u}=\Pi_{[a,b]}\left(-\frac{1}{\alpha}(p_{\mathcal{T}_{h}}+\beta\hat{\lambda})\right),$		(4.12)
	$\displaystyle\hat{\lambda}=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p_{\mathcal{T}_{h}}\right).$		(4.13)

Here $\hat{\lambda}\in\partial j_{2}(\hat{u})$ . $\hat{u}$ can be described similarly by

\displaystyle(p_{\mathcal{T}_{h}}+\gamma\hat{u}+\beta\hat{\lambda},v-\hat{u})\geq 0,\ \forall v\in U_{ad}.

(4.14)

Due to the variational approach is considered, we have that $\hat{u}=u_{\mathcal{T}_{h}},\ \hat{\lambda}=\lambda_{\mathcal{T}_{h}}.$ Thus a posteriori error indicators and estimators for the optimal control variable and the associated subgradient are zero, i.e.,

\displaystyle E_{u}^{2}(u_{\mathcal{T}_{h}},K):=\|\hat{u}-u_{\mathcal{T}_{h}}\|^{2}=0,\ \ \ E_{\lambda}^{2}(\lambda_{\mathcal{T}_{h}},K):=\|\hat{\lambda}-\lambda_{\mathcal{T}_{h}}\|^{2}=0.

(4.15)

In the subsequent analysis, let $C$ represent a generic constant with distinct values in different instances. We define the errors $e_{y}=y-y_{\mathcal{T}_{h}},\ e_{p}=p-p_{\mathcal{T}_{h}},e_{u}=u-u_{\mathcal{T}_{h}},\ e_{\lambda}=\lambda-\lambda_{\mathcal{T}_{h}},$ the vector $\mathbf{e}=(e_{y},e_{p},e_{u},e_{\lambda})^{T}$ , and the norm

\displaystyle\|\mathbf{e}\|_{\Omega}^{2}=\|e_{y}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\|e_{p}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\|e_{u}\|+\|e_{\lambda}\|.

(4.16)

4.1. Reliability of the error estimator $\mathcal{E}_{ocp}$

Theorem 4.1.

Let $(y,p,u,\lambda)\in\widetilde{H}^{\frac{\alpha}{2}}(\Omega)\times\widetilde{H}^{\frac{\alpha}{2}}(\Omega)\times U_{ad}\times U_{ad}$ and $(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}},\lambda_{\mathcal{T}_{h}})\in\mathbb{V}_{\mathcal{T}_{h}}\times\mathbb{V}_{\mathcal{T}_{h}}\times U_{ad}\times U_{ad}$ be the solutions of problems (3.13) and (4.2), respectively. Then the following upper bound of a posteriori error holds for $h<h_{0}\ll 1$

\displaystyle\|\mathbf{e}\|_{\Omega}^{2}\leq\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h}).

Here

\displaystyle\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})=\sum\limits_{K\in\mathcal{T}_{h},}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K),\ \mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)=C_{st}E^{2}_{y}(y_{\mathcal{T}_{h}},K)+C_{ad}E^{2}_{p}(p_{\mathcal{T}_{h}},K).

(4.17)

Proof.

We proceed in five steps.

$\underline{Step\ 1.}$ By applying the triangle inequality and Lemma 4.3, we can readily obtain

\displaystyle\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq 2\|y-\tilde{y}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+2\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

(4.18)

Moreover, by the coercivity of the bilinear form $a(\cdot,\cdot)$ , we can derive

\displaystyle\|y-\tilde{y}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\|u-u_{\mathcal{T}_{h}}\|.

This estimate combined with (4.18) imply that

\displaystyle\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq 2\|u-u_{\mathcal{T}_{h}}\|^{2}+2\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

(4.19)

$\underline{Step\ 2.}$ In a similar way, we can obtain that

	$\displaystyle\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq 2\\|p-\tilde{p}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq 2\\|y-y_{\mathcal{T}_{h}}\\|^{2}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq 2\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$

Therefore, (4.19) and the previous estimate allow us to deduce the a posteriori error estimate

\displaystyle\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq 4\|u-u_{\mathcal{T}_{h}}\|^{2}+4\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+2\|\tilde{p}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

(4.20)

$\underline{Step\ 3.}$ The goal of this step is to estimate the error $\|u-u_{\mathcal{T}_{h}}\|$ . Setting $v=u$ in (4.2) and $v=u_{\mathcal{T}_{h}}$ in (3.7) we arrive at

\displaystyle\gamma\|u-u_{\mathcal{T}_{h}}\|^{2}\leq(p-p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}}-u)+\beta(\lambda-\lambda_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}}-u).

Since $\lambda\in j_{2}(u)$ and $\lambda_{\mathcal{T}_{h}}\in j_{2}(u_{\mathcal{T}_{h}})$ , in view of (2.2), implies that

\displaystyle\beta(\lambda-\lambda_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}}-u)\leq 0.

Thus we have that

\displaystyle\gamma\|u-u_{\mathcal{T}_{h}}\|^{2}\leq(p-p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}}-u).

(4.21)

To control the right hand side of (4.21), we now invoke the auxiliary states $p_{\mathcal{T}}$ that satisfy the following equation

	$\displaystyle a(y_{\mathcal{T}},v_{\mathcal{T}_{h}})$	$\displaystyle=(f+u,v_{\mathcal{T}_{h}}),\ \ \ \forall v_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}},$
	$\displaystyle a(w_{\mathcal{T}_{h}},p_{\mathcal{T}})$	$\displaystyle=(y_{\mathcal{T}}-y_{d},w_{\mathcal{T}_{h}}),\ \ \ \forall w_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}}.$

Then (4.21) can be rewritten as

\displaystyle\gamma\|u-u_{\mathcal{T}_{h}}\|^{2}\leq(p-p_{\mathcal{T}},u_{\mathcal{T}_{h}}-u)+(p_{\mathcal{T}}-p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}}-u).

(4.22)

Note that

	$\displaystyle a(y_{\mathcal{T}}-y_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}})$	$\displaystyle=(u-u_{\mathcal{T}_{h}},v_{\mathcal{T}_{h}}),\ \ \ \forall v_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}},$		(4.23)
	$\displaystyle a(w_{\mathcal{T}_{h}},p_{\mathcal{T}}-p_{\mathcal{T}_{h}})$	$\displaystyle=(y_{\mathcal{T}}-y_{\mathcal{T}_{h}},w_{\mathcal{T}_{h}}),\ \ \ \ \forall w_{\mathcal{T}_{h}}\in\mathbb{V}_{\mathcal{T}_{h}}.$		(4.24)

Setting $v_{\mathcal{T}_{h}}=p_{\mathcal{T}}-p_{\mathcal{T}_{h}}$ and $w_{\mathcal{T}_{h}}=y_{\mathcal{T}}-y_{\mathcal{T}_{h}}$ in (4.23) yields

(u-u_{\mathcal{T}_{h}},p_{\mathcal{T}}-p_{\mathcal{T}_{h}})=a(y_{\mathcal{T}}-y_{\mathcal{T}_{h}},p_{\mathcal{T}}-p_{\mathcal{T}_{h}})=(y_{\mathcal{T}}-y_{\mathcal{T}_{h}},y_{\mathcal{T}}-y_{\mathcal{T}_{h}})\geq 0.

From (4.22) and the above equation we have

\displaystyle\|u-u_{\mathcal{T}_{h}}\|^{2}\leq\frac{1}{\gamma^{2}}\|p-p_{\mathcal{T}}\|^{2}.

(4.25)

$\underline{Step\ 4.}$ We now go to control $\|p-p_{\mathcal{T}}\|.$ To accomplish this task, we introduce the following problem

\displaystyle\left\{\begin{aligned} (-\Delta)^{s}\phi&=p_{\mathcal{T}}-p,&\mbox{in}\ \Omega,\\ \phi&=0,&\mbox{in}\ \Omega^{c}.\end{aligned}\right.

Let $\phi_{\mathcal{T}_{h}}$ be the finite element approximation of $\phi$ . Invoking Lemma 4.2, we have that

\displaystyle\|\phi-\phi_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq C\Theta(h)\|p-p_{\mathcal{T}}\|\ \mbox{and}\ \|\phi-\phi_{\mathcal{T}_{h}}\|\leq C\Theta^{2}(h)\|p-p_{\mathcal{T}}\|.

(4.26)

Note that $y_{\mathcal{T}}$ is the finite element approximation of $y$ , by Lemma 4.2, we immediately arrive at the estimate

\displaystyle\|y-y_{\mathcal{T}}\|\leq C\Theta(h)\|y-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

(4.27)

We bound $\|p-p_{\mathcal{T}}\|$ in view of the previous inequality that

	$\displaystyle\\|p-p_{\mathcal{T}}\\|^{2}$	$\displaystyle=((-\Delta)^{\frac{\alpha}{2}}\phi,p-p_{\mathcal{T}})=a(\phi,p-p_{\mathcal{T}})=a(\phi-\phi_{\mathcal{T}_{h}},p-p_{\mathcal{T}})+a(\phi_{\mathcal{T}_{h}},p-p_{\mathcal{T}})$
		$\displaystyle=a(\phi-\phi_{\mathcal{T}_{h}},p-p_{\mathcal{T}})+(\phi_{\mathcal{T}_{h}}-\phi,y-y_{\mathcal{T}})+(\phi,y-y_{\mathcal{T}})$
		$\displaystyle\leq\\|\phi-\phi_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\\|p-p_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|\phi_{\mathcal{T}_{h}}-\phi\\|\ \\|y-y_{\mathcal{T}}\\|+\\|\phi\\|\ \\|y-y_{\mathcal{T}}\\|.$

This result combined with (4.26) and (4.27) allows us to derive that

\displaystyle\|p-p_{\mathcal{T}}\|^{2}

\displaystyle\leq C\Theta(h)\left(\|p-p_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)\|p-p_{\mathcal{T}}\|+C\Theta^{3}(h)\|y-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\|p-p_{\mathcal{T}}\|.

Consequently,

\displaystyle\|p-p_{\mathcal{T}}\|\leq C\Theta(h)\left(\|p-p_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

Thus, by (4.25) we have

\|u-u_{\mathcal{T}_{h}}\|\leq\frac{C}{\gamma}\Theta(h)\left(\|p-p_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

Further, invoking the discrete state variable $y_{\mathcal{T}_{h}}$ and the discrete adjoint variable $p_{\mathcal{T}_{h}}$ in the previous inequality, we derive that

\displaystyle\|u-u_{\mathcal{T}_{h}}\|

\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\|p-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

We notice that $a(v_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}}-p_{\mathcal{T}})=(y_{\mathcal{T}_{h}}-y_{\mathcal{T}},v_{\mathcal{T}_{h}})$ and $a(y_{\mathcal{T}_{h}}-y_{\mathcal{T}},w_{\mathcal{T}_{h}})=(u_{\mathcal{T}_{h}}-u,w_{\mathcal{T}_{h}})$ , we can obtain that

	$\displaystyle\\|u-u_{\mathcal{T}_{h}}\\|$	$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
		$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|u_{\mathcal{T}_{h}}-u\\|+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
		$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|u_{\mathcal{T}_{h}}-u\\|+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).$

For $h_{0}\ll 1$ such that $\Theta(h)\ll 1,h<h_{0}$ , we can obtain

\displaystyle\|u-u_{\mathcal{T}_{h}}\|\leq\frac{C}{\gamma}\Theta(h)\left(\|p-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

(4.28)

Replacing this estimate in (4.19) and (4.20) we arrive at

\displaystyle\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\|p-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

(4.29)

and

\displaystyle\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\|p_{\mathcal{T}_{h}}-p\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y_{\mathcal{T}_{h}}-y\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\|\tilde{p}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

(4.30)

$\underline{Step\ 5.}$ Finally, we need to bound $\|\lambda-\lambda_{\mathcal{T}_{h}}\|$ . By (3.11) and (4.5) we have that

\displaystyle\|\lambda-\lambda_{\mathcal{T}_{h}}\|^{2}\leq\frac{C}{\beta^{2}}\|p-p_{\mathcal{T}_{h}}\|^{2}\leq\frac{C}{\beta^{2}}\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

Using (4.30), we can get

\displaystyle\|\lambda-\lambda_{\mathcal{T}_{h}}\|^{2}

\displaystyle\leq\frac{C}{(\beta\gamma)^{2}}\Theta^{2}(h)\left(\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\frac{C}{\beta^{2}}\|\tilde{y}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\frac{C}{\beta^{2}}\|\tilde{p}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

Thus by Lemma 4.3 we further derive

	$\displaystyle\quad\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|u-u_{\mathcal{T}_{h}}\\|^{2}+\\|\lambda-\lambda_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\frac{C}{(\gamma\beta)^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\quad+C(1+\frac{1}{\beta^{2}})\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C(1+\frac{1}{\beta^{2}})\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq\mathcal{E}^{2}_{ocp}({y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})},$

which completes the proof. ∎

Remark 4.1.

Since the control variable is implicitly discretized, the error estimators with respect to $u$ and $\lambda$ are zeros. By (4.29) and (4.30) we can derive the estimate only for state and adjoint state

\displaystyle\|\mathbf{\bar{e}}\|_{\Omega}^{2}\leq\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h}).

Here $\mathbf{\bar{e}}=(e_{y},e_{p})^{T}$ .

4.2. Efficiency of the error estimator $\mathcal{E}_{ocp}$

Theorem 4.2.

Suppose that $(y,u,p,\lambda)$ and $(y_{h},u_{h},p_{h},\lambda_{h})$ are the solutions of the optimal control problem of (3.13) and (4.2), respectively. If $\tilde{y},\ \tilde{p}\in{\widetilde{H}}^{\frac{1}{2}+{\frac{\alpha}{2}}-\epsilon}(\Omega)\cap\widetilde{H}^{\frac{\alpha}{2}}(\Omega)$ , for some parameter $0\leq\epsilon<\min\{{\frac{\alpha}{2}},\frac{1}{2}-\frac{\alpha}{2}\},$ then we have the error estimator $\mathcal{E}_{ocp}$ , defined as in (4.17) satisfied the following lower bound for $h<h_{0}\ll 1$

\displaystyle\mathcal{E}^{2}_{ocp}({y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})}

\displaystyle\leq C\|\mathbf{e}\|_{\Omega}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\|y-y_{\mathcal{T}_{h}}\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\|p-p_{\mathcal{T}_{h}}\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}.

(4.31)

Proof.

The Theorem 4.3, implies that

	$\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{yeff}\Big{(}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}\Big{)},$		(4.32)
	$\displaystyle E^{2}_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h})$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{peff}\Big{(}\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}\Big{)}.$		(4.33)

To bound $\|\tilde{y}-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$ , by (4.28), we obtain that

$\displaystyle\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq C\\|\tilde{y}-y\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq C\\|u_{\mathcal{T}_{h}}-u\\|^{2}+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$	(4.34)

We can deal with the second term of (4.32) in a similar way

	$\displaystyle\quad\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq CMh^{1-2\epsilon}\\|\tilde{y}-y\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega)}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}Mh^{1-2\epsilon}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))},$

where $M$ denotes the maximum times of an element $K$ appearing in all element patch $\Omega^{3}_{h}(K)$ . On the basis of (4.32) and the previous estimate, we immediately obtain the local efficiency of $E_{y}$

	$\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})$
	$\displaystyle\leq\mathbb{C}_{yeff}\bigg{\{}\\|p_{\mathcal{T}_{h}}-p\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y_{\mathcal{T}_{h}}-y\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}$
	$\displaystyle\quad+\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+CMh^{1-2\epsilon}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)\bigg{\}}.$

Assuming that the initial size of the mesh fulfills the following condition: $Mh_{0}^{1-2\epsilon}\Theta^{2}(h_{0})\leq C.$ For $h_{0}\ll 1$ , we can obtain

\displaystyle E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})\leq C\left(\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\|y-y_{\mathcal{T}_{h}}\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}\right).

(4.35)

Note that

	$\displaystyle\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq C\\|\tilde{p}-p\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq C\\|y_{\mathcal{T}_{h}}-y\\|^{2}+C\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$

We need to estimate $\|y-y_{\mathcal{T}_{h}}\|$ using the dual argument in the following analysis. Let $\psi$ be the solution of the following problem

\displaystyle\left\{\begin{aligned} (-\Delta)^{s}\psi&=y-y_{\mathcal{T}_{h}},&\mbox{in}\ \Omega,\\ \psi&=0,&\mbox{in}\ \Omega^{c}.\end{aligned}\right.

In an analogous way we obtain

	$\displaystyle\\|y-y_{\mathcal{T}_{h}}\\|^{2}$	$\displaystyle=((-\Delta)^{\frac{\alpha}{2}}\psi,y-y_{\mathcal{T}_{h}})=a(\psi,y-y_{\mathcal{T}_{h}})$
		$\displaystyle=a(\psi-\psi_{\mathcal{T}_{h}},y-y_{\mathcal{T}_{h}})+a(\psi_{\mathcal{T}_{h}},y-y_{\mathcal{T}_{h}})$
		$\displaystyle=a(\psi-\psi_{\mathcal{T}_{h}},y-y_{\mathcal{T}_{h}})+(\psi_{\mathcal{T}_{h}}-\psi,u-u_{\mathcal{T}_{h}})+(\psi,u-u_{\mathcal{T}_{h}})$
		$\displaystyle\leq\\|\psi-\psi_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|\psi_{\mathcal{T}_{h}}-\psi\\|\ \\|u-u_{\mathcal{T}_{h}}\\|+\\|\psi\\|\ \\|u-u_{\mathcal{T}_{h}}\\|$
		$\displaystyle\leq C\Theta(h)\\|y-y_{\mathcal{T}_{h}}\\|\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\Theta^{2}(h)\\|y-y_{\mathcal{T}_{h}}\\|\ \\|u-u_{\mathcal{T}_{h}}\\|+C\\|y-y_{\mathcal{T}_{h}}\\|\ \\|u-u_{\mathcal{T}_{h}}\\|.$

Then we can get

$\displaystyle\\|y-y_{\mathcal{T}_{h}}\\|^{2}$	$\displaystyle\leq C\Theta^{2}(h)\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\\|u-u_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq C\Theta^{2}(h)\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\leq C(1+\frac{1}{\gamma^{2}})\Theta^{2}(h)\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\frac{C}{\gamma^{2}}\Theta^{2}(h)\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$	(4.36)

Using the previous inequality, we can obtain

\displaystyle\|\tilde{p}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

\displaystyle\leq C(1+\frac{1}{\gamma^{2}})\Theta^{2}(h)\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C(1+\frac{1}{\gamma^{2}}\Theta^{2}(h))\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

and

	$\displaystyle\quad\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{p}-p\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq CMh^{1-2\epsilon}\\|\tilde{p}-p\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega)}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{T}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq CMh^{1-2\epsilon}\\|y_{\mathcal{T}_{h}}-y\\|^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{T}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq CMh^{1-2\epsilon}\Theta^{2}(h)\left((1+\frac{1}{\gamma^{2}})\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\frac{1}{\gamma^{2}}\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}.$

Thus we have

\displaystyle E^{2}_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h})\leq C\left(\|p-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\|p-p_{\mathcal{T}_{h}}\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}\right).

(4.37)

Combining the estimate (4.35) and (4.37) we derive

	$\displaystyle\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})$	$\displaystyle=E^{2}_{y}(y_{\mathcal{T}_{h}},\mathcal{T}_{h})+E^{2}_{p}(p_{\mathcal{T}_{h}},\mathcal{T}_{h})$
		$\displaystyle\leq C(\\|e_{p}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|e_{y}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)})+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}$
		$\displaystyle\leq C\\|\mathbf{e}\\|^{2}_{\Omega}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}.$

This concludes the proof. ∎

5. AFEMs and convergence analysis

$\bullet$ The optimal control $u$ , due to the sparsity term $j_{2}(u)$ in the cost functional, is sparse and has sparsely support sets within $\Omega$ .
$\bullet$ The fractional Laplacian operator $(-\Delta)^{\frac{\alpha}{2}}$ is nonlocal ([32, 33, 34]) and can lead to singularity of the state variable and the adjoint variable near the boundary ([35]), which leads to a lower convergence rate ([36, 37]).

To overcome these hurdle, adaptive mesh refinement methods can be employed. The method facilitate more comprehensive mesh refinement in areas where the solution singularity is intense, and consequently improving the numerical solution’s accuracy.

5.1. AFEMs

Utilizing the residual error estimator $\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})$ to measure local contributions, we explore an established technique for adaptive mesh refinement known as $\mathbf{SOLVE}-\mathbf{ESTIMATE}-\mathbf{MARK}-\mathbf{REFINE}$ , which employs D $\rm{\ddot{o}}$ rfler’s marking criterion to designate elements for refinement.

\star\ \mathbf{SOLVE}

: Initial mesh

{\mathcal{T}_{h_{0}}}

with mesh size

h_{0}

, constraints

a

and

b

, regularization parameter

\gamma

, sparsity parameter

\beta

. Set

k=0

and solve (4.2) to obtain

(y_{{\mathcal{T}_{h_{k}}}},p_{{\mathcal{T}_{h_{k}}}},u_{{\mathcal{T}_{h_{k}}}})=\mathbf{SOLVE}\left(\mathbb{V}_{{\mathcal{T}_{h_{k}}}}\times\mathbb{V}_{{\mathcal{T}_{h}}_{k}}\times U_{ad}\right).

\star\ \mathbf{ESTIMATE}

: Compute the local error indicator

\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}})=\sum\limits_{K\in\mathcal{T}_{h_{k}}}\left(E^{2}_{y}(y_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}})+E^{2}_{p}(p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}})\right)=\mathbf{ESTIMATE}\left(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}}\right)

defined by (4.7), (4.8) and (4.31).

\star\ \mathbf{MARK}

: Given a parameter

0<\theta<1

; Construct a minimal subset

\mathcal{M}_{k}\subset{\mathcal{T}_{h_{k}}}

such that

\mathcal{M}_{k}=\mathbf{MARK}\left\{\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{M}_{k})\right\}\geq\theta\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}}).

\star\ \mathbf{REFINE}

: We bisect all the elements

K\in\mathcal{T}_{h_{k}}

that are contained in

\mathcal{M}_{k}

with the newest-vertex bisection method and create a new mesh

\mathcal{T}_{h_{k+1}}

. Refine

\mathcal{M}_{k+1}=\mathbf{REFINE}\left(\mathcal{M}_{k}\right).

Algorithm 1 Design of the AFEMs:

In the first step of $\mathbf{SOLVE}$ , we used the following projection gradient algorithm:

\mathbf{Input}

Start with the mesh

\mathcal{T}_{h_{t}}

with mesh size

h_{t}

\mathbf{Start}

Given the initial value

u^{0}_{\mathcal{T}_{h_{t}}}

, and a tolerance

\mathrm{Tol}_{\mathrm{space}}>0

\mathbf{While}

error>\mathrm{Tol}_{\mathrm{space}}

\mathbf{1.}

Solving the state equation in (4.2) to get state variable

y_{\mathcal{T}_{h_{t}}}

;

\mathbf{2.}

Solving the adjoint state equation in (4.2) to obtain adjoint state variable

p_{\mathcal{T}_{h_{t}}}

;

\mathbf{3.}

Using (4.5) to compute the associted subgradient and control variable

\lambda_{\mathcal{T}_{h_{t}}}=\min\{1,\max\{-1,-\frac{1}{\beta}p_{\mathcal{T}_{h_{t}}}\}\},\ u^{new}_{\mathcal{T}_{h_{t}}}=\min\{b,\max\{a,-\frac{1}{\gamma}(p_{\mathcal{T}_{h_{t}}}+\beta\lambda_{\mathcal{T}_{h_{t}}})\}\}.

\mathbf{4.}

Calculate the error:

error=norm(u^{0}_{\mathcal{T}_{h_{t}}}-u^{new}_{\mathcal{T}_{h_{t}}},inf).

\mathbf{5.}

Update the control variable

u^{0}_{\mathcal{T}_{h_{t}}}=u^{new}_{\mathcal{T}_{h_{t}}}.

\mathbf{End\ While}

Algorithm 2 Projection gradient algorithm

5.2. Convergence analysis

To establish the quasi-optimality of Adaptive Finite Element Methods (AFEMs), we employ the framework proposed by Carstensen et al. in [28]. The fulfillment of several prerequisites is necessary to establish quasi-optimality in adaptive algorithms: (1) Stability, (2) Reduction, (3) Discrete reliability and (4) Quasi-orthogonality. The stability prerequisite ensures the stability of error estimate on non-refined elements, while the reduction prerequisite guarantees a reduction in error on refined elements. The discrete reliability ensures the ability of the error estimators on refined elements to effectively control the error between coarse and fine grid solutions. The quasi-orthogonality prerequisite involves providing a measure for the relationship between the error estimators and the exact errors. These requirements will be rigorously validated through a series of mathematical proofs.

Theorem 5.1.

(Stability) We use $\mathcal{T}_{h}$ to denote the refinements of $\mathcal{T}_{H}$ . For any subsets $\mathcal{U}\subset{\mathcal{T}_{h}}\cap\mathcal{T}_{H}$ , there holds

\displaystyle\left|\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)\right)^{\frac{1}{2}}-\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}\right|\leq C_{stab}\left(\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right),

where the constant $C_{stab}>0$ .

Proof.

From the definition of (4.7), (4.8) and (4.31) we can obtain

	$\displaystyle\left\|\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)\right)^{\frac{1}{2}}-\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}\right\|$
	$\displaystyle\leq\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(f+u_{\mathcal{T}_{h}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{h}})\\|_{L^{2}(\omega)}+\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(y_{\mathcal{T}_{h}}-y_{d}-(-\Delta)^{\frac{\alpha}{2}}p_{\mathcal{T}_{h}})\\|_{L^{2}(\omega)}$
	$\displaystyle\quad-\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)}-\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(y_{\mathcal{T}_{H}}-y_{d}-(-\Delta)^{\frac{\alpha}{2}}p_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)}$
	$\displaystyle\leq\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(-\Delta)^{\frac{\alpha}{2}}(y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}})\\|_{L^{2}(\omega)}+\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(-\Delta)^{\frac{\alpha}{2}}(p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})\\|_{L^{2}(\omega)}+\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(u_{\mathcal{T}_{h}}-u_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)}+\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)},$

where $\omega:=\mathrm{interior}(\bigcup\limits_{{K}\in\mathcal{U}}\overline{K}).$ Note that $u_{\mathcal{T}_{h}}=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p_{\mathcal{T}_{h}}+\beta\lambda_{\mathcal{T}_{h}})\right)$ and $u_{\mathcal{T}_{H}}=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p_{\mathcal{T}_{H}}+\beta\lambda_{\mathcal{T}_{H}})\right).$ By the Lipschitz continuity of the operator $\Pi_{[a,b]}$ , we have that

	$\displaystyle\\|u_{\mathcal{T}_{h}}-u_{\mathcal{T}_{H}}\\|$	$\displaystyle=\left\\|\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p_{\mathcal{T}_{h}}+\beta\lambda_{\mathcal{T}_{h}})\right)-\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p_{\mathcal{T}_{H}}+\beta\lambda_{\mathcal{T}_{H}})\right)\right\\|$
		$\displaystyle\leq\frac{C}{\gamma}\\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\\|+\frac{C\beta}{\gamma}\\|\lambda_{\mathcal{T}_{H}}-\lambda_{\mathcal{T}_{h}}\\|.$

An application of $\lambda_{\mathcal{T}_{h}}=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p_{\mathcal{T}_{h}}\right)$ and $\lambda_{\mathcal{T}_{H}}=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p_{\mathcal{T}_{H}}\right)$ yields

\displaystyle\|u_{\mathcal{T}_{h}}-u_{\mathcal{T}_{H}}\|\leq\frac{C}{\gamma}\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\|\leq\frac{C}{\gamma}\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

Further by the inverse estimate for fractional Laplacian ([26]) we have

\displaystyle\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(-\Delta)^{\frac{\alpha}{2}}y_{{\mathcal{T}_{h}}}\|_{L^{2}(\Omega)}\leq C\|y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\ \mbox{and}\ \|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{h}}(-\Delta)^{\frac{\alpha}{2}}p_{{\mathcal{T}_{h}}}\|_{L^{2}(\Omega)}\leq C\|p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.

This result allows us to derive that

\displaystyle\left|\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)\right)^{\frac{1}{2}}-\left(\sum\limits_{K\in\mathcal{U}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}\right|\leq C_{stab}\left(\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

∎

Theorem 5.2.

(Reduction) We use $\mathcal{T}_{h}$ to denote the refinements of $\mathcal{T}_{H}$ . Then we have

\displaystyle\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h}\setminus\mathcal{T}_{H})\leq Q_{red}\ \mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H}\setminus\mathcal{T}_{h})+C_{red}\left(\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}_{H}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right),

where the constant $C_{red}>0$ , $Q_{red}=2^{-\frac{\rho\alpha}{2d}},$ for $0<\alpha\leq 1;$ $Q_{red}=2^{-\frac{\rho(\alpha-2\sigma)}{2d}},$ for $1<\alpha<2.$ Here $0<\sigma=\frac{\alpha}{2}-\frac{1}{2}<\frac{\alpha}{2}$ and $\frac{\alpha}{2}-\sigma>0.$

Proof.

Bisection ensures that $|K^{\prime}|\leq|\frac{K}{2}|$ for any $K\in\mathcal{T}_{H}\backslash\mathcal{T}_{h}$ and its descendants $K^{\prime}\in\mathcal{T}_{h}\backslash\mathcal{T}_{H}$ with $K^{\prime}\subset K.$ Note that

\displaystyle\widetilde{h}^{\frac{\alpha}{2}}_{K^{\prime}}=(|K^{\prime}|^{\frac{1}{d}})^{\frac{\alpha}{2}}\leq(2^{-\rho}|K|)^{\frac{\alpha}{2d}}=2^{-\frac{\rho\alpha}{2d}}\widetilde{h}^{\frac{\alpha}{2}}_{K},\ \ \ \mathrm{for}\ \ 0<\alpha\leq 1,

(5.1)

we prove the Theorem with $Q_{red}=2^{-\frac{\rho\alpha}{2d}}.$ Here $\rho$ denotes the bisection time of every element $K\in\mathcal{T}_{H}$ in the refinement. By the definition of (4.7), the relationship between $\mathcal{T}_{h}$ and $\mathcal{T}_{H}$ we can get

$\displaystyle\left(\sum\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}E_{y}^{2}(y_{{\mathcal{T}_{h}}},K^{\prime})\right)^{\frac{1}{2}}$	$\displaystyle=\left(\sum\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\int_{L^{2}(K^{\prime})}\widetilde{h}^{\alpha}_{K^{\prime}}(f+u_{\mathcal{T}_{h}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{h}})^{2}\right)^{\frac{1}{2}}$
	$\displaystyle=\left(\sum\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\|K^{\prime}\|^{\frac{\alpha}{d}}\\|f+u_{\mathcal{T}_{h}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{h}}\\|^{2}_{L^{2}(K^{\prime})}\right)^{\frac{1}{2}}$
	$\displaystyle\leq 2^{-\frac{\rho\alpha}{2d}}\left(\sum\limits_{K\in\mathcal{T}_{H}\backslash{\mathcal{T}_{h}}}\|K\|^{\frac{\alpha}{d}}\\|f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{H}}\\|^{2}_{L^{2}(K)}\right)^{\frac{1}{2}}$
	$\displaystyle=2^{-\frac{\rho\alpha}{2d}}\left(\sum\limits_{K\in\mathcal{T}_{H}\backslash{\mathcal{T}_{h}}}E_{y}^{2}(y_{{\mathcal{T}_{H}}},K)\right)^{\frac{1}{2}}.$	(5.2)

Similarly,

\displaystyle\left(\sum\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}E_{p}^{2}(p_{{\mathcal{T}_{h}}},K^{\prime})\right)^{\frac{1}{2}}\leq 2^{-\frac{\rho\alpha}{2d}}\left(\sum\limits_{K\in\mathcal{T}_{H}\backslash{\mathcal{T}_{h}}}E_{p}^{2}(p_{{\mathcal{T}_{H}}},K)\right)^{\frac{1}{2}}.

Then we have

\displaystyle\left(\sum\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K^{\prime})\right)^{\frac{1}{2}}\leq 2^{-\frac{\rho\alpha}{2d}}\left(\sum\limits_{K\in\mathcal{T}_{H}\backslash{\mathcal{T}_{h}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}.

Therefore, the previous estimate allows us to deduce the reduction property on the refined elements

	$\displaystyle\quad\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)$
	$\displaystyle\leq\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},K)-\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)+\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\leq\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\left(\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(f+u_{\mathcal{T}_{h}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{h}})\\|_{L^{2}(K)}+\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(y_{\mathcal{T}_{h}}-y_{d}-(-\Delta)^{\frac{\alpha}{2}}p_{\mathcal{T}_{h}})\\|_{L^{2}(K)}\right.$
	$\displaystyle\left.\quad-\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}y_{\mathcal{T}_{H}})\\|_{L^{2}(K)}-\\|\widetilde{h}^{\frac{\alpha}{2}}_{K}(y_{\mathcal{T}_{H}}-y_{d}-(-\Delta)^{\frac{\alpha}{2}}p_{\mathcal{T}_{H}})\\|_{L^{2}(K)}\right)+\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\leq C_{stab}\left(\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\sum\limits_{K\in\mathcal{T}_{h}\backslash{\mathcal{T}_{H}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\leq C_{stab}\left(\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+2^{-\frac{\rho\alpha}{d}}\sum\limits_{K\in\mathcal{T}_{H}\backslash{\mathcal{T}_{h}}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K).$

For $1<\alpha<2,$ we note that $0<\sigma=\frac{\alpha}{2}-\frac{1}{2}<\frac{\alpha}{2}$ and $\frac{\alpha}{2}-\sigma>0.$ Moreover, $\omega:=\bigcup\limits_{K^{\prime}\in\mathcal{T}_{h}\backslash\mathcal{T}_{H}}\overline{K^{\prime}}=\bigcup\limits_{K\in\mathcal{T}_{H}\backslash\mathcal{T}_{h}}\overline{K},$ then we have

\displaystyle\widetilde{h}^{\frac{\alpha}{2}}_{K^{\prime}}=(|K^{\prime}|^{\frac{1}{d}})^{{\frac{\alpha}{2}}-\sigma}\omega_{\mathcal{T}_{h}}^{\sigma}\leq(2^{-\rho}|K|)^{\frac{\alpha-2\sigma}{2d}}\omega_{{\mathcal{T}}_{H}}^{\sigma}=2^{-\frac{\rho(\alpha-2\sigma)}{2d}}\widetilde{h}^{\frac{\alpha}{2}}_{K}.

(5.3)

Arguing as before, we prove the Theorem with $Q_{red}=2^{-\frac{\rho(\alpha-2\sigma)}{2d}}.$ ∎

Remark 5.1.

According to (5.2), we can obtain

\displaystyle\sum\limits_{K\in\mathcal{T}_{h}\setminus\mathcal{T}_{H}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)\leq 2^{-\frac{\rho\xi}{d}}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K).

(5.4)

Here $\xi=\alpha,\ 0<\alpha<1$ and $\xi=\alpha-2\sigma,\ 1<\alpha<2.$ Further we can derive

	$\displaystyle(1-2^{-\frac{\rho\xi}{d}})\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)$	$\displaystyle\leq\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)-\sum\limits_{K\in\mathcal{T}_{h}\setminus\mathcal{T}_{H}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)$
		$\displaystyle=\sum\limits_{K\in\mathcal{T}_{H}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)-\sum\limits_{K\in\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K).$

Thus, it implies that

\displaystyle\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)\leq\frac{1}{1-2^{-\frac{\rho\xi}{d}}}\left(\sum\limits_{K\in\mathcal{T}_{H}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)-\sum\limits_{K\in\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)\right).

(5.5)

Similar arguments can be applied to $\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{p}^{2}(p_{\mathcal{T}_{H}},K)$ . Using the definition of (4.17), we can thus arrive at the estimate

\displaystyle\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H}\setminus\mathcal{T}_{h})\leq\frac{1}{1-2^{-\frac{\rho\xi}{d}}}\left(\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H})-\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{h})\right).

(5.6)

Lemma 5.1.

Set $\breve{y}=\mathcal{S}_{\mathcal{T}_{h}}(f+u_{\mathcal{T}_{H}})$ and $\breve{p}=\mathcal{S}^{*}_{\mathcal{T}_{h}}(\mathcal{S}_{\mathcal{T}_{H}}(f+u_{\mathcal{T}_{H}})-y_{d})$ . Then the following estimates hold

	$\displaystyle\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K),$
	$\displaystyle\\|p_{\mathcal{T}_{H}}-\breve{p}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq$	$\displaystyle\mathbb{C}_{paux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{p}^{2}(p_{\mathcal{T}_{H}},K).$

Proof.

By the coercivity and Galerkin orthogonality we derive

$\displaystyle\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq$	$\displaystyle a(\breve{y}-{y}_{\mathcal{T}_{H}},\breve{y}-{y}_{\mathcal{T}_{H}})$
	$\displaystyle\leq$	$\displaystyle a(\breve{y}-{y}_{\mathcal{T}_{H}},(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})$
	$\displaystyle=$	$\displaystyle(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}{y}_{\mathcal{T}_{H}},(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})).$

Assume $\omega:=\mathrm{interior}(\bigcup\limits_{{K}\in\mathcal{T}_{H}\cap\mathcal{T}_{h}}\bar{K}).$ We obtained by applying (4.9) and (4.11) in the above estimation

	$\displaystyle\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq C\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{H}}(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\Omega\setminus\omega)}\\|\widetilde{h}^{-{\frac{\alpha}{2}}}_{\mathcal{T}_{H}}(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\Omega\setminus\omega)}$
		$\displaystyle\quad+C\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{H}}(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)}\\|\widetilde{h}^{-{\frac{\alpha}{2}}}_{\mathcal{T}_{H}}(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\omega)}$
		$\displaystyle\leq C\\|\widetilde{h}^{\frac{\alpha}{2}}_{\mathcal{T}_{H}}(f+u_{\mathcal{T}_{H}}-(-\Delta)^{\frac{\alpha}{2}}{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\Omega\setminus\omega)}\\|\widetilde{h}^{-{\frac{\alpha}{2}}}_{\mathcal{T}_{H}}(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\Omega\setminus\omega)}$
		$\displaystyle\leq\left(\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}\\|\widetilde{h}^{-\frac{\alpha}{2}}_{\mathcal{T}_{H}}(1-\Pi_{\mathcal{T}_{H}})(\breve{y}-{y}_{\mathcal{T}_{H}})\\|_{L^{2}(\Omega\setminus\omega)}$
		$\displaystyle\leq\left(\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)\right)^{\frac{1}{2}}\\|\breve{y}-{y}_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)},$

which yields the first result. The second result can be derived in an analogous way. ∎

Theorem 5.3.

(Discrete reliability) We use $\mathcal{T}_{H}$ to denote the refinements of $\mathcal{T}_{h}$ . There holds

\displaystyle\quad\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\leq\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H}\setminus\mathcal{T}_{h}).

Proof.

Taking $y_{\mathcal{T}_{h}}$ and $u_{\mathcal{T}_{h}}$ as the continuous solutions and $y_{\mathcal{T}_{H}}$ and $u_{\mathcal{T}_{H}}$ as its approximation, respectively. It can be deduced from the coercivity of $a(\cdot,\cdot)$ , Galerkin orthogonality and Lemma 5.1 that

	$\displaystyle\\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq\\|{y}_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|\breve{y}-{y}_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$
		$\displaystyle\leq\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)+\\|u_{\mathcal{T}_{H}}-u_{\mathcal{T}_{h}}\\|^{2}.$

By (4.28), we can obtain

\displaystyle\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}

\displaystyle\leq\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)+\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).

Similar arguments can be applied to bound $\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$ . Using the previous inequality, we can thus arrive at the estimate

	$\displaystyle\\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$	$\displaystyle\leq\mathbb{C}_{paux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{p}^{2}(p_{\mathcal{T}_{H}},K)+\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)$
		$\displaystyle\quad+\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).$

Combining the above estimates yields

	$\displaystyle\\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq\mathbb{C}_{yaux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)+\mathbb{C}_{paux}\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{p}^{2}(p_{\mathcal{T}_{H}},K)$
		$\displaystyle\leq\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H}\setminus\mathcal{T}_{h}).$

∎

The optimal control system is a coupled system with nonlinear characteristics. These nonlinear characteristics lead to a lack of support for orthogonality when attempting to prove a contraction. Therefore, we need to prove quasi-orthogonality next.

Let $(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}})$ be the solution associated to the discrete problem (4.1) with respect to ${\mathcal{T}_{h_{k}}}$ and $(y_{\mathcal{T}_{h_{k+1}}},p_{\mathcal{T}_{h_{k+1}}})$ be the solution associated to the discrete problem (4.1) with respect to ${\mathcal{T}_{h_{k+1}}}$ . We assume ${\mathcal{T}_{h_{k+1}}}$ is a refinement of ${\mathcal{T}_{h_{k}}}$ , and define the following norm

\displaystyle\|e_{\mathcal{T}_{h_{k}}}\|_{\Omega}^{2}:=\|y-y_{\mathcal{T}_{h_{k}}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p-p_{\mathcal{T}_{h_{k}}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}

(5.7)

and

\displaystyle\|r_{\mathcal{T}_{h_{k}}}\|_{\Omega}^{2}:=\|y_{\mathcal{T}_{h_{k}}}-y_{\mathcal{T}_{h_{k+1}}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\|p_{\mathcal{T}_{h_{k}}}-p_{\mathcal{T}_{h_{k+1}}}\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)},

(5.8)

where $(y,p)$ is the optimal solution of the problem (3.2)-(3.3). Then the following relation is satisfied

\displaystyle\|e_{\mathcal{T}_{h_{k+1}}}\|_{\Omega}^{2}=\|e_{\mathcal{T}_{h_{k}}}\|_{\Omega}^{2}-\|r_{\mathcal{T}_{h_{k}}}\|_{\Omega}^{2}+2a(y-y_{\mathcal{T}_{h_{k+1}}},y_{\mathcal{T}_{h_{k}}}-y_{\mathcal{T}_{h_{k+1}}})+2a(p-p_{\mathcal{T}_{h_{k+1}}},p_{\mathcal{T}_{h_{k}}}-p_{\mathcal{T}_{h_{k+1}}}).

(5.9)

Theorem 5.4.

(Quasi-orthogonality) By the above definitions, there holds

\displaystyle\sum\limits_{k=l}^{N}\left\{\|r_{k}\|^{2}_{\Omega}-2C\Theta^{2}(h_{0})\left(\|y-y_{\mathcal{T}_{h_{k}}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\|p-p_{\mathcal{T}_{h_{k}}}\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)\right\}\leq C_{orth}\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{l}}},p_{\mathcal{T}_{h_{l}}},\mathcal{T}_{h_{l}}).

Here, $h_{0}\ll 1$ , for all $l,N\in\mathbb{N}_{0},$ the constant $C_{orth}>0$ is depend on $\Omega,d,\alpha,$ and the $\gamma$ -shape regularity of the initial triangulation ${\mathcal{T}_{h_{0}}}$ .

Proof.

At first we prove the case $k\geq 1.$ For convenience, we use $(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},u_{\mathcal{T}_{H}},\lambda_{\mathcal{T}_{H}}),\ (y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},u_{\mathcal{T}_{h}},\lambda_{\mathcal{T}_{h}})$ to denote $(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},u_{\mathcal{T}_{h_{k}}},\lambda_{\mathcal{T}_{h_{k}}})$ and $(y_{\mathcal{T}_{h_{k+1}}},p_{\mathcal{T}_{h_{k+1}}},u_{\mathcal{T}_{h_{k+1}}},\lambda_{\mathcal{T}_{h_{k+1}}})$ . So it suffices to proceed in five steps.

$\underline{Step\ 1.}$ Since $y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\in V_{\mathcal{T}_{h}}$ , we have that

	$\displaystyle a(y-y_{\mathcal{T}_{h}},y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}})$	$\displaystyle=(u-u_{\mathcal{T}_{h}},y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}})$
		$\displaystyle\leq\frac{1}{2}\\|u-u_{\mathcal{T}_{h}}\\|^{2}+\frac{1}{2}\\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\\|^{2}.$		(5.10)

To control the right hand side of (5.2), we utilize the auxiliary state $\breve{y}$ , defined as $\breve{y}=S_{\mathcal{T}_{h}}(f+u_{\mathcal{T}_{H}})$ , the control $u$ defined in (3.13) and combine (4.28), Lemma 4.2 to obtain

	$\displaystyle\quad a(y-y_{\mathcal{T}_{h}},y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}})$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\\|y_{\mathcal{T}_{H}}-\breve{y}\\|^{2}+C\\|\breve{y}-y_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+C\\|u_{\mathcal{T}_{H}}-u_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+C\\|u_{\mathcal{T}_{H}}-u\\|^{2}+C\\|u-u_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\frac{C}{\gamma^{2}}\Theta^{2}(H)\left(\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).$

$\underline{Step\ 2.}$ The goal of this step is to bound $a(p-p_{\mathcal{T}_{h}},p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})$ . Similarly, Since $p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\in V_{\mathcal{T}_{h}}$ , we have that

	$\displaystyle a(p-p_{\mathcal{T}_{h}},p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})$	$\displaystyle=(y-y_{\mathcal{T}_{h}},p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})$
		$\displaystyle\leq\frac{1}{2}\\|y-y_{\mathcal{T}_{h}}\\|^{2}+\frac{1}{2}\\|p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}}\\|^{2}.$

In an analogous way, we utilize the auxiliary adjoint state $\breve{p}$ , defined as $\breve{p}=\mathcal{S}^{*}_{\mathcal{T}_{h}}(\mathcal{S}_{\mathcal{T}_{H}}(f+u_{\mathcal{T}_{H}})-y_{d})$ , the control $u$ defined in (3.13) and combine (4.29), Lemma 4.2, (4.2) to obtain

	$\displaystyle\quad a(p-p_{\mathcal{T}_{h}},p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\\|p_{\mathcal{T}_{H}}-\breve{p}\\|^{2}+C\\|\breve{p}-p_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|p_{\mathcal{T}_{H}}-\breve{p}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+C\\|y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|p_{\mathcal{T}_{H}}-\breve{p}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}$
	$\displaystyle\quad+\frac{C}{\gamma^{2}}\Theta^{2}(H)\left(\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}.$

$\underline{Step\ 3.}$ By the relation (5.9) and Lemma 5.1, combining above estimates leads to

	$\displaystyle\quad\\|e_{k+1}\\|_{\Omega}^{2}-\\|e_{k}\\|_{\Omega}^{2}+\\|r_{k}\\|_{\Omega}^{2}$
	$\displaystyle=2a(y-y_{\mathcal{T}_{h}},y_{\mathcal{T}_{H}}-y_{\mathcal{T}_{h}})+2a(p-p_{\mathcal{T}_{h}},p_{\mathcal{T}_{H}}-p_{\mathcal{T}_{h}})$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\frac{C}{\gamma^{2}}\Theta^{2}(H)\left(\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\quad+C\Theta^{2}(H)\left(\\|y_{\mathcal{T}_{H}}-\breve{y}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p_{\mathcal{T}_{H}}-\breve{p}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\frac{C}{\gamma^{2}}\Theta^{2}(H)\left(\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\quad+C\Theta^{2}(H)\left(\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{y}^{2}(y_{\mathcal{T}_{H}},K)+\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}E_{p}^{2}(p_{\mathcal{T}_{H}},K)\right).$

Further, a simple application of the triangle inequality reveal that

	$\displaystyle\quad\\|r_{k}\\|_{\Omega}^{2}$
	$\displaystyle\leq(\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)})-(\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)})$
	$\displaystyle\quad+\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\quad+\frac{C}{\gamma^{2}}\Theta^{2}(H)\left(\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\leq\left(1+\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\quad-\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\leq\left(1+\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{H}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\Theta^{2}(H)\sum\limits_{K\in\mathcal{T}_{H}\setminus\mathcal{T}_{h}}\mathcal{E}_{K}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},K)$
	$\displaystyle\quad-\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right),$

provided $h_{0}\ll 1.$ We apply the Lemma 5.2 to conclude that

	$\displaystyle\quad\\|r_{k}\\|_{\Omega}^{2}-2\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\left(\\|y-y_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)$
	$\displaystyle\leq\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{H}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)-\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)$
	$\displaystyle\quad+\frac{C\Theta^{2}(h_{0})}{1-2^{-\frac{\rho\xi}{d}}}\left(\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{H})-\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{H}},p_{\mathcal{T}_{H}},\mathcal{T}_{h})\right),$

where $h_{0}\ll 1.$

To conclude the previous estimate combined with Theorem 4.1 and Remark 5.1 leads to the general quasi-orthogonality as follows

	$\displaystyle\quad\sum\limits_{k=l}^{N}\left\{\\|r_{k}\\|^{2}_{\Omega}-2\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\left(\\|y-y_{\mathcal{T}_{h_{k}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{h_{k}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)\right\}$
	$\displaystyle\leq\sum\limits_{k=l}^{N}\left\{\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{h_{k}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}-\\|y-y_{\mathcal{T}_{h_{k+1}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)-\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|p-p_{\mathcal{T}_{h_{k}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}-\\|p-p_{\mathcal{T}_{h_{k+1}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)\right.$
	$\displaystyle\quad\left.+\frac{C\Theta^{2}(h_{0})}{1-2^{-\frac{\rho\xi}{d}}}\left(\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k}})-\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{k}}},p_{\mathcal{T}_{h_{k}}},\mathcal{T}_{h_{k+1}})\right)\right\}$
	$\displaystyle\leq\left(1-\frac{C}{\gamma^{2}}\Theta^{2}(h_{0})\right)\left(\\|y-y_{\mathcal{T}_{h_{l}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|p-p_{\mathcal{T}_{h_{l}}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}\right)+\frac{C\Theta^{2}(h_{0})}{1-2^{-\frac{\rho\xi}{d}}}\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{l}}},p_{\mathcal{T}_{h_{l}}},\mathcal{T}_{h_{l}})$
	$\displaystyle\leq C_{orth}\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{l}}},p_{\mathcal{T}_{h_{l}}},\mathcal{T}_{h_{l}}).$

This concludes the proof. ∎

For each $t>0$ , if there exists constants $C_{ar},\ Q_{ar}>0$ such that

\displaystyle C_{ar}\mathbb{A}_{t}(v)\leq\sup\limits_{\ell\in\mathbb{N}_{0}}(\#{{\mathcal{T}_{h_{l}}}})^{t}\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{l}}},p_{\mathcal{T}_{h_{l}}},\mathcal{T}_{h_{l}})<Q_{ar}\mathbb{A}_{t}(v),

(5.11)

we say that the adaptive Algorithm 2 is rate optimal with respect to the error estimator, where

\displaystyle\mathbb{A}_{t}(v)=\sup\limits_{N\in\mathbb{N}_{0}}(N+1)^{t}\mathop{\min}\limits_{\mathcal{T}_{h}\in\mathbb{T}\atop\#{\mathcal{T}_{h}}-\#{{\mathcal{T}_{h_{0}}}}\leq N}\mathcal{E}_{ocp}^{2}(y_{\mathcal{T}_{h_{l}}},p_{\mathcal{T}_{h_{l}}},\mathcal{T}_{h_{l}}).

According to [28], through the proof of the above Theorem we indeed also verify that Algorithm 1 can reach the optimal convergence order in the sense of (5.11).

6. Numerical results

In this section, three numerical experiments are presented, and the exact solutions in the circle domain of the first example are given. The solutions in the square domain of the second and third examples are not known. We proceed by establishing fixed values for the optimal state and adjoint state variable, by employing the projection formulas $u=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(p+\beta\lambda)\right)$ and $\lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}p\right)$ .

Example 6.1.

We set $\Omega=B(0,1)$ , $a=-0.5$ , $b=0.5$ , $c=3$ and the exact solutions are as follows:

	$\displaystyle y$	$\displaystyle=\frac{2^{-\alpha}(1-\|x\|^{2})^{\frac{\alpha}{2}}}{\Gamma(1+\frac{\alpha}{2})^{2}},\ p=cy,$
	$\displaystyle u$	$\displaystyle=\Pi_{[a,b]}\left(-\frac{1}{\gamma}(cy+\beta\lambda)\right),\ \lambda=\Pi_{[-1,1]}\left(-\frac{1}{\beta}cy\right).$

Figure 6.1 shows the initial mesh and the final refinement mesh with $\alpha=0.5,\ \theta=0.7$ . Since the exact solutions of the state variable and adjoint variable exhibit smoothdness within the unit circle, with singularities $\partial\Omega$ on the boundary, so the mesh is refined mainly in the region close to the boundary.

Refer to caption — Figure 6.1. The initial mesh (left) and the final refinement mesh (right) with $\alpha=0.5,\theta=0.7$ on the circle.

In Figure 6.2, the computational rates of convergence for the computable error estimators and indicators $\mathcal{E}_{ocp},E_{y}$ and $E_{p}$ for $\alpha=0.5$ and $\alpha=1.5$ are presented, respectively. It can be observed that, in both cases, each contribution decays with the optimal rate $N^{-\frac{1}{2}}$ .

We set $\alpha=0.5$ and the parameter $\theta=0.5,\ 0.7,\ 1$ that governs the module $\mathbf{MARK}$ . The left plot of Figure 6.3 illustrates the convergence orders of the error estimators $\mathcal{E}_{ocp}$ and error indicators $E_{y},\ E_{p}$ under different values of $\theta$ . On the right plot of Figure 6.3, the convergence orders of the errors for the state and adjoint variables in $\|\cdot\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$ and the effectivity indices which are given by $\mathcal{E}_{ocp}/\|e\|_{\Omega}$ are presented for different $\theta$ values. From the Figure 6.3, it is observed that when $\theta=1$ , indicating uniform refinement, the displayed convergence rates do not reach optimality. However, for $\theta<1,$ the convergence rates of the error eatimators $\mathcal{E}_{ocp}$ and errors clearly converge to $N^{-\frac{1}{2}}$ . Thus, our theoretical analysis is effectively verified.

Next, we consider the effect of changing the regularization parameter $\gamma$ on the system with $\alpha=1.5,\ \theta=0.5$ and $\beta=1$ . Specifically, we examine the cases where $\gamma$ takes the values of

\gamma\in\{10^{0},\ 10^{-1},\ 10^{-2},\ 10^{-3},\ 10^{-4}\}.

It can be seen from the Figure 6.4 that the error estimators $\mathcal{E}_{ocp}$ , the error indicators $E_{y},\ E_{p},$ errors of state and adjoint variable in $\|\cdot\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$ can reach the optimal convergence order for all the values of the parameter $\gamma$ considered.

In Figures 6.5-6.6, we show the profiles of the numerical solutions for the control and state when $\alpha=1.5,\ \theta=0.5$ , respectively. It can be seen that as $\gamma$ decreases, the $L^{1}$ term dominates and the numerical solutions of the control become sparsier.

Example 6.2.

In the second example we consider an optimal control problem with $f=-6,\ y_{d}=1$ . We set $\Omega=(-1,1)^{2}$ , $\gamma\in\{10^{0},\ 10^{-1},\ 10^{-2}\}$ , $\beta=1$ , $a=-0.3$ , $b=0.3$ , respectively.

In Figure 6.7 we show the initial mesh and the final refinement mesh with $\alpha=0.5,\ \theta=0.7$ . The primary refinement behavior is observed to occur exclusively along the boundaries of the entire square domain. This observation suggests that the estimators effectively capture the singularities of the exact solution along the entire boundary, thus guiding the mesh refinement process.

For $\alpha=0.5$ and $\alpha=1.5$ , the Figure 6.8 shows that the AFEM proposed in Section 6 delivers optimal experimental rates of convergence for the error estimators $\mathcal{E}_{ocp}$ , the error indicators $E_{y},\ E_{p}.$ The results obtained empirically are consistent with those of the previous example. The convergence rates of the estimators and indicators are $N^{-\frac{1}{4}}$ for uniform refinement, while adaptive refinement leads to optimal convergence rates of $N^{-\frac{1}{2}}$ .

For all choices of the parameter $\gamma$ considered, the Figure 6.9 shows the decrease of the total error estimators $\mathcal{E}_{ocp}$ and the error indicators $E_{y},\ E_{p}$ with respect to the number of degrees of freedom (Dofs). In all the values of the parameter $\gamma$ cases the optimal rate $N^{-\frac{1}{2}}$ is achieved.

In Figures 6.10-6.11, we show the profiles of the numerical solutions for the control when $\alpha=0.5,\ \theta=0.7$ and $\alpha=1.5,\ \theta=0.5$ , respectively. As $\gamma$ decreases, the numerical solutions of the control become sparsier.

Example 6.3.

In the third example we consider an optimal control problem with $f=6\sin(4y)\cos(4x)e^{x},\ y_{d}=-4\sin(4y)\cos(4x)e^{x}$ . We set $\Omega=(-1,1)^{2}$ , $\gamma=0.1$ , $\beta=1$ , $a=-0.3$ , $b=0.3$ , respectively.

In Figure 6.12 we show the initial mesh and the refinement mesh after 13 adaptive steps with $\alpha=0.5,\ \theta=0.7$ . We observe that the mesh nodes are distributed around the domain where the solutions have a large gradient as well as at the boundarys. In Figure 6.13, the convergence rates of error estimators and indicators for $\alpha=0.5$ and $\alpha=1.5$ are presented, respectively. It can be observed that, in both cases, each contribution decays with the optimal convergence rate $N^{-\frac{1}{2}}$ . In Figure 6.14, the profiles of the numerical control and state with $\alpha=0.5,\ \theta=0.7$ are provided.

7. Conclusion

In this paper, we present and analyze a weighted residual a posteriori error estimate for an optimal control problem. The problem involves a cost functional that is nondifferentiable, a state equation with an integral fractional Laplacian, and control constraints. We provide first-order optimality conditions and derive upper and lower bounds on the a posteriori error estimates for the finite element approximation of the optimal control problem. Moreover, we demonstrate that the approximation sequence generated by the adaptive algorithm converges at the optimal algebraic rate. Finally, we validate the theoretical findings through numerical experiments.

Acknowledgements

The work was supported by the National Natural Science Foundation of China under Grant No. 11971276 and 12171287.

References

[1] Clason, C., Kunisch, K.: A duality-based approach to ellipic control problems in nonreflexive Banach spaces. ESAIM Control Optim. Calc. Var. 17, 243-266 (2011)
[2] Casas, E., Herzog, R., Wachsmuth, G.: Approximation of sparse controls in semilinear equations by piecewise linear functions, Numer. Math. 122, 645-669 (2012)
[3] Casas, E.: A review on sparse solutions in optimal control of partial differential equations. SeMA Journal, 74, 319-344 (2017)
[4] Casas, E. and Kunisch, K.: Stabilization by sparse controls for a class of semilinear parabolic equations. SIAM J. Control Optim., 55, 512-532 (2017)
[5] Casas, E., Herzog, R., Wachsmuth, G.: Optimality conditions and error analysis of semilinear elliptic control problems with $L^{1}$ cost functional. SIAM J. Optim., 22, 795-820 (2012)
[6] Ot $\rm{\acute{a}}$ rola, E., Salgado, A.J.: Sparse optimal control for fractional diffffusion. Comput. Methods Appl. Math., 18(1), 95-110 (2018)
[7] Stadler, G.: Elliptic optimal control problems with $L^{1}$ -control cost and applications for the placement of control devices. Comput. Optim. Appl., 44, 159-181 (2009)
[8] Wachsmuth, G., Wachsmuth, D.: Convergence and regularization results for optimal control problems with sparsity functional. ESAIM Control Optim. Calc. Var., 17, 858-866 (2011)
[9] Allendes, A., Fuica, F., Ot $\rm{\acute{a}}$ rola, E.: Adaptive finite element methods for sparse PDE-constrained optimization. IMA J. Numer. Anal., 40(3), 2106-2142 (2020)
[10] Ot $\rm{\acute{a}}$ rola, E.: An adaptive finite element method for the sparse optimal control of fractional diffusion. Numer. Meth. Part. D. E., 36(2), 302-328 (2019)
[11] Liu, W.B., Yan, N.N.: A posteriori error analysis for convex distributed optimal control problems. Adv. Comput. Math. 15(1-4), 285-309 (2001)
[12] Liu, W.B., Yan, N.N.: A posteriori error estimates for convex boundary control problems. SIAM J. Numer. Anal. 39(1), 73-99 (2001)
[13] Li, R., Liu, W.B., Ma, H.P., Tang, T.: Adaptive finite element approximation for distributed elliptic optimal control problems. SIAM J. Control Optim. 41(5), 1321-1349 (2002)
[14] Liu, W.B., Yan, N.N.: A posteriori error estimates for optimal problems governed by Stokes equations. SIAM J. Numer. Anal. 40, 1850-1869 (2003)
[15] Liu, W.B., Yan, N.N.: A posteriori error estimates for optimal control problems governed by parabolic equations. Numer. Math. 93, 497-521 (2003)
[16] Liu, W.B., Yan, N.N.: Adaptive Finite Element Methods for Optimal Control Governed by PDEs. Science Press, Beijing (2008)
[17] Hinterm $\rm{\ddot{u}}$ ller, M., Hoppe, R.H.W.: Goal-oriented adaptivity in control constrained optimal control of partial differential equations. SIAM J. Control Optim. 47(4), 1721-1743 (2008)
[18] Kohls, K., R $\rm{\ddot{o}}$ sch, A., Siebert, K.G.: A posteriori error analysis of optimal control problems with control constraints. SIAM J. Control Optim. 52, 1832-1861 (2014)
[19] Gong, W., Yan, N.N.: Adaptive finite element method for elliptic optimal control problems: convergence and optimality. Numer. Math. 135, 1121-1170 (2017)
[20] Leng, H.T., Chen, Y.P., Huang, Y.Q.: Equivalent a posteriori error estimates for elliptic optimal control problems with $L^{1}$ -control cost, Comput. Math. Appl. 77(2), 342-356 (2019)
[21] Benson, D.A., Wheatcraft, S., Meerschaert. M.: The fractional-order governing equation of Lévy motion. Water Resour. Res., 36(6), 1413-1424, 2000.
[22] Shlesinger, M.F., West, B.J., Klafter. J.: Lévy dynamics of enhanced diffusion: application to turbulence. Phys. Rev. Lett., 58(11), 1100-1103, 1987.
[23] Zaslavsky, G.M., Stevens, D., Weitzner, H.: Self-similar transport in incomplete chaos. Phys. Rev. E., 48(3), 1683-1694 (1993)
[24] Schirotzek, W.: Nonsmooth analysis. Universitext. Springer, Berlin, 2007.
[25] Bonito, A., Borthagaray, J.P., Nochetto, R.H., Ot $\rm{\acute{a}}$ rola, E., Salgado, A.J.: Numerical methods for fractional diffusion, Comput. Vis. Sci. 19, 19-46 (2018)
[26] Faustmann, M., Melenk, J.M., Praetorius, D.: Quasi-optimal convergence rate for an adaptive method for the integral fractional Laplacian. Math. Comp. 90(330), 1557-1587 (2021)
[27] Ioffe, A.D., Tichomirov, V.M.: Theorie der Extremalaufgaben. VEB Deutscher Verlag der Wissenschaften, Berlin (1979).
[28] Carstensen, C., Feischl, M., Praetorius, D.: Axioms of adaptivity. Comput. Math. Appl. 67(6), 1195-1253 (2014)
[29] Acosta, G., Borthagaray, J.P.: A fractional laplace equation: regularity of solutions and finite element approximations. SIAM J. Numer. Anal., 55(2), 472-495 (2017)
[30] Cascon, J.M., Kreuzer, C., Nochetto, R.H., Siebert, K.G.: Quasi-optimal convergence rate for an adaptive finite element method, SIAM J. Numer. Anal. 46, 2524-2550 (2008)
[31] Borthagaray, J.P., Leykekhman, D., Nochetto, R.H.: Local energy estimates for the fractional Laplacian. SIAM J. Numer. Anal. 59(4), 1918-1947 (2021)
[32] Cabr $\rm{\acute{e}}$ , X., Tan, J.: Positive solutions of nonlinear problems involving the square root of the Laplacian. Adv. Math., 224(5), 2052-2093 (2010)
[33] Caffarelli L., Silvestre, L.: An extension problem related to the fractional Laplacian. Comm. Part. Diff. Eqs., 32(7-9), 1245-1260 (2007)
[34] Caffarelli, L.A., Stinga, P.R.: Fractional elliptic equations, Caccioppoli estimates and regularity. Ann. Inst. H. Poincar $\rm{\acute{e}}$ Anal. Non Lin $\rm{\acute{e}}$ aire, 33(3), 767-807 (2016)
[35] Capella, A., D $\rm{\acute{a}}$ vila, J., Dupaigne, L., Sire, Y.: Regularity of radial extremal solutions for some non-local semilinear equations. Comm. Part. Diff. Eqs., 36(8), 1353-1384 (2011)
[36] Banjai, L., Melenk, J.M., Nochetto, R.H., Ot $\rm{\acute{a}}$ rola, E., Salgado, A.J., Schwab. Ch.: Tensor FEM for spectral fractional diffusion. Found. Comput. Math., 19, 901-962 (2019)
[37] Nochetto, R.H., Ot $\rm{\acute{a}}$ rola, E., Salgado, A.J.: A PDE approach to fractional diffusion in general domains: a priori error analysis. Found. Comput. Math., 15(3), 733-791 (2015)

…

	$\displaystyle\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq 2\\|p-\tilde{p}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq 2\\|y-y_{\mathcal{T}_{h}}\\|^{2}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
		$\displaystyle\leq 2\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+2\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$

	$\displaystyle\\|u-u_{\mathcal{T}_{h}}\\|$	$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p_{\mathcal{T}_{h}}-p_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
		$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|y_{\mathcal{T}_{h}}-y_{\mathcal{T}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|u_{\mathcal{T}_{h}}-u\\|+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
		$\displaystyle\leq\frac{C}{\gamma}\Theta(h)\left(\\|u_{\mathcal{T}_{h}}-u\\|+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right).$

	$\displaystyle\quad\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}^{2}+\\|u-u_{\mathcal{T}_{h}}\\|^{2}+\\|\lambda-\lambda_{\mathcal{T}_{h}}\\|^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+\frac{C}{(\gamma\beta)^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)$
	$\displaystyle\quad+C(1+\frac{1}{\beta^{2}})\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C(1+\frac{1}{\beta^{2}})\\|\tilde{p}-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq\mathcal{E}^{2}_{ocp}({y_{\mathcal{T}_{h}},p_{\mathcal{T}_{h}},\mathcal{T}_{h})},$

$\displaystyle\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$	$\displaystyle\leq C\\|\tilde{y}-y\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq C\\|u_{\mathcal{T}_{h}}-u\\|^{2}+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}.$	(4.34)

	$\displaystyle\quad\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|\tilde{y}-y\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq CMh^{1-2\epsilon}\\|\tilde{y}-y\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega)}^{2}+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))}^{2}$
	$\displaystyle\leq\frac{C}{\gamma^{2}}Mh^{1-2\epsilon}\Theta^{2}(h)\left(\\|p-p_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}+\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{\widetilde{H}^{\frac{\alpha}{2}}(\Omega)}\right)+C\sum\limits_{K\in\mathcal{T}_{h}}h_{K}^{1-2\epsilon}\\|y-y_{\mathcal{T}_{h}}\\|^{2}_{H^{{\frac{\alpha}{2}}+\frac{1}{2}-\epsilon}(\Omega^{3}_{h}(K))},$

Adaptive finite element approximation of sparse optimal control with integral fractional Laplacian

Abstract.

Key words and phrases:

Mathematics Subject Classification:

1. Introduction

2. Preliminaries

3. Optimal control problem

Lemma 3.1.

Lemma 3.2.

Theorem 3.1.

Proof.

Theorem 3.2.

Proof.

Theorem 3.3.

4. Finite element approximation method and a posteriori error estimate

Lemma 4.1.

Lemma 4.2.

Proof.

Lemma 4.3.

Proof.

4.1. Reliability of the error estimator ℰo​c​p\mathcal{E}_{ocp}

Theorem 4.1.

Proof.

Remark 4.1.

4.2. Efficiency of the error estimator ℰo​c​p\mathcal{E}_{ocp}

Theorem 4.2.

Proof.

5. AFEMs and convergence analysis

5.1. AFEMs

5.2. Convergence analysis

Theorem 5.1.

Proof.

Theorem 5.2.

Proof.

Remark 5.1.

Lemma 5.1.

Proof.

Theorem 5.3.

Proof.

Theorem 5.4.

Proof.

6. Numerical results

Example 6.1.

Example 6.2.

Example 6.3.

7. Conclusion

Acknowledgements

References

4.1. Reliability of the error estimator $\mathcal{E}_{ocp}$

4.2. Efficiency of the error estimator $\mathcal{E}_{ocp}$