This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: 🖂  Enrique Otárola
[email protected]
22institutetext: 1 Departamento de Matemática, Universidad Técnica Federico Santa María, Av. España 1680, Valparaíso, Chile.

A Posteriori Error Estimates for an Optimal Control Problem with a Bilinear State Equation

Francisco Fuica1    Enrique Otárola1
(Received: date / Accepted: date)
Abstract

We propose and analyze a posteriori error estimators for an optimal control problem that involves an elliptic partial differential equation as state equation and a control variable that enters the state equation as a coefficient; pointwise constraints on the control variable are considered as well. We consider two different strategies to approximate optimal variables: a fully discrete scheme in which the admissible control set is discretized with piecewise constant functions and a semi-discrete scheme where the admissible control set is not discretized; the latter scheme being based on the so-called variational discretization approach. We design, for each solution technique, an a posteriori error estimator and show, in two and three dimensional Lipschitz polygonal/polyhedral domains (not necessarily convex), that the proposed error estimator is reliable and efficient. We design, based on the devised estimators, adaptive strategies that deliver optimal experimental rates of convergence for the performed numerical examples.

Keywords:
optimal control problems bilinear equations finite elements a posteriori error estimates adaptive finite element methods
MSC:
49M25 65N15 65N30 65N50
journal: JOTA

1 Introduction

The development and study of discretization techniques, based on finite elements, for distributed control–constrained linear–quadratic elliptic optimal control problems have been widely studied in the literature; see MR2843956; MR2516528 for an extensive list of references. These discretization techniques are mainly divided into two categories, which rely on the discretization of the state and adjoint equations; they differ on whether or not the admissible control set is also discretized. In contrast to these advances, the study of solution techniques for optimal control problems where the control variable enters the state equation as a coefficient is not as developed. One of the main sources of difficulty within these type of problems is that the solution of the state equation depends nonlinearly on the control variable MR2536007. Consequently, uniqueness of solutions cannot be guaranteed.

In this work, we will focus on the development and analysis of efficient solution techniques for the optimal control problem (2)–(4), which incorporates the control variable as a coefficient in the state equation; the control variable is not a source term. We immediately mention that this problem can be interpreted as a particular instance of parameter estimation. To the best of our knowledge, the first work that provides an analysis for suitable finite element discretizations for problem (2)–(4) is MR2536007. In this work, the authors propose, on convex polygonal/polyhedral domains and quasi–uniform meshes, two fully discrete schemes that discretize the admissible control set with piecewise constant and piecewise linear functions; the state and adjoint equations are discretized with piecewise linear functions. Estimates for the error committed within the approximation of a control variable are derived in (MR2536007, Corollaries 5.6 and 5.10). In addition, an error estimate for a post-processing strategy is obtained in (MR2536007, Theorem 5.18). The results obtained in MR2536007 were later extended to mixed and stabilized finite element methods in MR3103238 and MR3693332, respectively.

A particular class of numerical methods that has proven a competitive performance when are used to approximate solutions to PDE–constrained optimization problems, and the ones we shall consider in this work, are adaptive finite element methods (AFEMs). AFEMs are iterative methods recognizable by their capability to improve the quality of a discrete approximation to a corresponding PDE while keeping an efficient distribution of computational resources. A crucial component of an AFEM is an a posteriori error estimator, which is a computable quantity, depending on the problem data and discrete solution, that provides local information about the quality of the approximate solution. The a posteriori error analysis for control–constrained linear–quadratic optimal control problems has achieved several advances in recent years. We refer the interested reader to MR1887737; MR1780911; MR2434065; MR3212590; MR3621827; MR4122501 for a discussion. As opposed to these advances, the analysis of AFEMs for optimal control problems involving nonlinear or bilinear equations is rather scarce. To the best of our knowledge, the work MR2680928 appears to be the first that provides a posteriori error estimates for (2)–(4). In this work, the authors develop a posteriori error estimators for two fully discrete approximation schemes of (2)–(4) and obtain global reliability estimates (MR2680928, Theorems 4.1 and 4.3) and global efficiency results (MR2680928, Lemmas 4.2 and 4.3). We also mention the work MR3174031, where a posteriori error estimates for a parabolic version of (2)–(4) have been analyzed; an efficiency analysis, however, was not provided. We conclude this paragraph by mentioning the work MR2373479, where the authors provide, on the basis of a posteriori error estimators, upper bounds for discretization errors with respect to a cost functional and with respect to a given quantity of interest; the latter being an arbitrary functional depending on the control and the state variables. In our work, we derive upper and lower bounds for the approximation error when is measured in an energy norm; see below for a discussion.

In the present manuscript, we consider two different strategies to discretize the optimal control problem (2)–(4): a semi-discrete scheme, based on the so-called variational discretization approach MR2122182, in which the admissible control set is not discretized, and a fully discrete scheme, where control variables are approximated by using piecewise constant functions. We devise, for each one of the aforementioned schemes, a residual–based a posteriori error estimator. For the fully discrete scheme the error estimator is formed by the sum of three contributions: two of them are related to the discretization of the state and adjoint equations while the remaining one is related to the discretization of the admissible control set. In contrast, the error estimator for the variational discretization approach is formed by only two contributions that are related to the discretization of the state and adjoint equations. In two and three dimensional Lipschitz polygonal/polyhedral domains (not necessarily convex), we obtain reliability and efficiency estimates.

In what follows we list what, we believe, are the main contributions of our work:

  • For the fully and semi-discrete schemes that we consider, we devise a posteriori error estimators; both being different from the ones in MR2680928 and MR2373479.

  • For the aforementioned solution techniques, we prove that the corresponding local error indicators associated to the discretization of the state and adjoint equations are locally efficient. This analysis improves the global one in (MR2680928, Lemmas 4.2 and 4.3). We also prove that the total error indicator associated to the variational discretization approach is locally efficient (cf. Theorem LABEL:thm:global_eff_var); the one associated to the fully discrete scheme, as customary, being globally efficient (cf. Theorem LABEL:thm:global_eff).

  • We design a simple adaptive loop that delivers optimal experimental rates of convergence for all the involved individual contributions of the corresponding error. The loop based on the a posteriori error indicators devised for the variational discretization approach delivers quadratic rates of convergence for the error approximation of a control variable. This substantially improves the approximation properties that can be achieved by the considered fully discrete scheme. The indicators devised for the latter scheme tend to refine the involved meshes in regions where the restrictions of the control variable become active. These DOFs seem not necessary for an accurate approximation of control variables. An scheme based on piecewise linear approximation of the admissible control set would suffer the same limitations in terms of avoidable refinement MR4122501.

The rest of the paper is organized as follows. In section 2 we introduce the optimal control problem under consideration and set notation. Basic results for the state equation as well as basic a posteriori error estimates are reviewed in section 3. In section 4 we review the existence of solutions for the optimal control problem as well as first and second order optimality conditions. The crucial part of our work are sections 5 and LABEL:sec:a_posteriori_semi, where we design and analyze a posteriori error estimators for the fully and semi-discrete schemes, respectively. Finally, in section LABEL:sec:numerical_ex we present numerical examples in two and three dimensional domains that illustrate the theory and reveal a competitive performance of the devised AFEMs.

2 The Problem and Notation

Let us precisely introduce the optimal control problem that will be considered in our work, set notation, and describe the setting we shall operate with.

2.1 Presentation of the Problem

In this work we are interested in the design and analysis of a posteriori error estimates for an optimal control problem governed by an elliptic partial differential equation (PDE) as state equation. Our main source of difficulty here is that the control variable enters the state equation as a coefficient; control constraints are also considered. Let us make this discussion precise. Let Ωd\Omega\subset\mathbb{R}^{d}, with d{2,3}d\in\{2,3\}, be an open and bounded polygonal/polyhedral domain with Lipschitz boundary Ω\partial\Omega (MR2424078, Chapter 4). Given a desired state yΩL2(Ω)y_{\Omega}\in L^{2}(\Omega) and a regularization parameter α>0\alpha>0, let us introduce the cost functional

J(y,u):=12yyΩL2(Ω)2+α2uL2(Ω)2.J(y,u):=\frac{1}{2}\|y-y_{\Omega}\|_{L^{2}(\Omega)}^{2}+\frac{\alpha}{2}\|u\|_{L^{2}(\Omega)}^{2}. (1)

We are thus interested in the following optimal control problem: Find

minJ(y,u)\min{J(y,u)} (2)

subject to the elliptic PDE

Δy+uy=f in Ω,y=0 on Ω,-\Delta y+uy=f\text{ in }\Omega,\qquad y=0\text{ on }\partial\Omega, (3)

where fL2(Ω)f\in L^{2}(\Omega) denotes an external source, and the control constraints

u𝕌ad,𝕌ad:={vL2(Ω):0<avb a.e. in Ω}.u\in\mathbb{U}_{ad},\qquad\mathbb{U}_{ad}:=\{v\in L^{2}(\Omega):0<\texttt{a}\leq v\leq\texttt{b}\text{ a.e. in }\Omega\}. (4)

Here, a,b+\texttt{a},\texttt{b}\in\mathbb{R}^{+} satisfy a<b\texttt{a}<\texttt{b}.

2.2 Notation

Let us set notation and describe the setting we shall operate with. Throughout this work d{2,3}d\in\{2,3\} and Ωd\Omega\subset\mathbb{R}^{d} is an open and bounded polygonal/polyhedral domain with Lipschitz boundary Ω\partial\Omega (MR2424078, Chapter 4). Notice that we do not assume that Ω\Omega is convex. If 𝒳\mathcal{X} and 𝒴\mathcal{Y} are normed vector spaces, we write 𝒳𝒴\mathcal{X}\hookrightarrow\mathcal{Y} to denote that 𝒳\mathcal{X} is continuously embedded in 𝒴\mathcal{Y}. We denote by 𝒳\|\cdot\|_{\mathcal{X}} the norm of 𝒳\mathcal{X}. The relation aba\lesssim b indicates that aCba\leq Cb, with a positive constant that depends neither on aa, bb nor on the involved discretization parameters. The value of CC might change at each occurrence.

3 The State Equation

In this section, we briefly review some results related to the well-posedness of problem (3). Additionally, we present a posteriori error estimates for a specific finite element setting.

3.1 Weak Formulation

Let 𝔣\mathfrak{f} be a given forcing term in L2(Ω)L^{2}(\Omega) and 𝔲\mathfrak{u} be an arbitrary function in 𝕌ad\mathbb{U}_{ad}. With this setting at hand, we introduce the following weak problem:

zH01(Ω):(z,v)L2(Ω)+(𝔲z,v)L2(Ω)=(𝔣,v)L2(Ω)vH01(Ω).z\in H_{0}^{1}(\Omega):\quad(\nabla z,\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}z,v)_{L^{2}(\Omega)}=(\mathfrak{f},v)_{L^{2}(\Omega)}\quad\forall v\in H_{0}^{1}(\Omega). (5)

Lax–Milgram Theorem immediately yields the well-posedness of problem (5). In particular, we have the following stability estimate zL2(Ω)𝔣L2(Ω).\|\nabla z\|_{L^{2}(\Omega)}\lesssim\|\mathfrak{f}\|_{L^{2}(\Omega)}.

3.2 Finite Element Approximation

In this section, we introduce a basic finite element approximation for the weak problem (5) and review basic a posteriori error estimates. To accomplish this task, we first introduce some terminology and further basic ingredients.

We denote by 𝒯={T}\mathscr{T}=\{T\} a conforming partition of Ω¯\overline{\Omega} into simplices TT with size hT=diam(T)h_{T}=\text{diam}(T) and define h𝒯:=maxT𝒯hTh_{\mathscr{T}}:=\max_{T\in\mathscr{T}}h_{T}. We denote by 𝒮\mathscr{S} the set of internal (d1)(d-1)-dimensional interelement boundaries SS of 𝒯\mathscr{T}. For T𝒯T\in\mathscr{T}, we let 𝒮T\mathscr{S}_{T} denote the subset of 𝒮\mathscr{S} which contains the sides of the element TT. We denote by 𝒩S𝒯{\color[rgb]{0,0,0}\mathcal{N}_{S}\subset\mathscr{T}} the subset that contains the two elements that have SS as a side, namely, 𝒩S={T+,T}\mathcal{N}_{S}=\{T^{+},T^{-}\}, where T+,T𝒯T^{+},T^{-}\in\mathscr{T} are such that S=T+TS=T^{+}\cap T^{-}. For T𝒯T\in\mathscr{T}, we define the star associated with the element TT as

𝒩T:={T𝒯:𝒮T𝒮T}.\mathcal{N}_{T}:=\left\{T^{\prime}\in\mathscr{T}:\mathscr{S}_{T}\cap\mathscr{S}_{T^{\prime}}\neq\emptyset\right\}. (6)

In an abuse of notation, below we denote by 𝒩T\mathcal{N}_{T} either the set itself or the union of its elements.

We define, for T𝒯T\in\mathscr{T}, the shape coefficient σT\sigma_{T} of TT as the ratio of the diameter, i.e., diam(T)=hT\mathrm{diam}(T)=h_{T}, and the inball diameter of TT, i.e., 2sup{r>0:Br(x)T for xT}2\sup\{r>0:B_{r}(x)\subset T\textrm{ for }x\in T\}. The shape coefficient of a triangulation 𝒯\mathscr{T} corresponds to the quantity σ𝒯:=max{T:T𝒯}\sigma_{\mathscr{T}}:=\max\{T:T\in\mathscr{T}\}. A sequence of triangulations 𝕋={𝒯}\mathbb{T}=\{\mathscr{T}\}, that is obtained by subsequent refinements of an initial mesh 𝒯0\mathscr{T}_{0}, is shape regular if sup{σ𝒯:𝒯𝕋}C\sup\{\sigma_{\mathscr{T}}:\mathscr{T}\in\mathbb{T}\}\leq C; see (Nochetto_etal2009, section 3.2.1) for details.

Given a mesh 𝒯𝕋\mathscr{T}\in\mathbb{T}, we define the finite element space of continuous piecewise linear functions as

𝕍(𝒯):={v𝒯C(Ω¯):v𝒯|T1(T)T𝒯}H01(Ω).\mathbb{V}(\mathscr{T}):=\{v_{\mathscr{T}}\in C(\overline{\Omega}):v_{\mathscr{T}}|_{T}\in\mathbb{P}_{1}(T)\ \forall T\in\mathscr{T}\}\cap H_{0}^{1}(\Omega). (7)

Given a discrete function v𝒯𝕍(𝒯)v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}), we define, for any internal side S𝒮S\in\mathscr{S}, the jump or interelement residual v𝒯𝝂\llbracket\nabla v_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket by

v𝒯𝝂:=𝝂+v𝒯|T++𝝂v𝒯|T,\llbracket\nabla v_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket:=\boldsymbol{\nu}^{+}\cdot\nabla v_{\mathscr{T}}|_{T^{+}}+\boldsymbol{\nu}^{-}\cdot\nabla v_{\mathscr{T}}|_{T^{-}},

where 𝝂+,𝝂\boldsymbol{\nu}^{+},\boldsymbol{\nu}^{-} denote the unit normals to SS pointing towards T+T^{+}, T𝒯T^{-}\in\mathscr{T}, respectively. Here, T+T^{+}, T𝒯T^{-}\in\mathscr{T} are such that T+TT^{+}\neq T^{-} and T+T=S\partial T^{+}\cap\partial T^{-}=S.

With these ingredients at hand, we introduce a Galerkin approximation to problem (5) as follows:

z𝒯𝕍(𝒯):(z𝒯,v𝒯)L2(Ω)+(𝔲z𝒯,v𝒯)L2(Ω)=(𝔣,v𝒯)L2(Ω)z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla z_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(\mathfrak{u}z_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(\mathfrak{f},v_{\mathscr{T}})_{L^{2}(\Omega)} (8)

for all v𝒯𝕍(𝒯)v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}). Here, 𝔣L2(Ω)\mathfrak{f}\in L^{2}(\Omega) and 𝔲𝕌ad\mathfrak{u}\in\mathbb{U}_{ad}. The existence and uniqueness of a solution z𝒯𝕍(𝒯)z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) of problem (8) is standard. In particular, we have the stability estimate z𝒯L2(Ω)𝔣L2(Ω)\|\nabla z_{\mathscr{T}}\|_{L^{2}(\Omega)}\lesssim\|\mathfrak{f}\|_{L^{2}(\Omega)}.

3.3 An a Posteriori Error Estimate for the State Equation

We introduce the following local error indicators and a posteriori error estimator associated to the discretization (8) of problem (5):

T2:=hT2𝔣𝔲z𝒯L2(T)2+hTz𝒯𝝂L2(TΩ)2,𝒯2:=T𝒯T2.\mathcal{E}_{T}^{2}:=h_{T}^{2}\|\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}}\|_{L^{2}(T)}^{2}+h_{T}\|\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket\|_{L^{2}(\partial T\setminus\partial\Omega)}^{2},\qquad\mathcal{E}_{\mathscr{T}}^{2}:=\sum_{T\in\mathscr{T}}\mathcal{E}_{T}^{2}.

We present the following global reliability result.

Theorem 3.1 (global reliability of \mathcal{E})

Let 𝔣L2(Ω)\mathfrak{f}\in L^{2}(\Omega) and 𝔲𝕌ad\mathfrak{u}\in\mathbb{U}_{ad} be given. Let zH01(Ω)z\in H_{0}^{1}(\Omega) be the unique solution to problem (5) and let z𝒯𝕍(𝒯)z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) be its finite element approximation obtained as the solution to (8). We thus have

(zz𝒯)L2(Ω)𝒯,\|\nabla(z-z_{\mathscr{T}})\|_{L^{2}(\Omega)}\lesssim\mathcal{E}_{\mathscr{T}},

with a hidden constant that is independent of zz, z𝒯z_{\mathscr{T}}, the size of the elements in 𝒯\mathscr{T}, and #𝒯\#\mathscr{T} but depends on the shape coefficient of the triangulation 𝒯\mathscr{T}, i.e., σ𝒯\sigma_{\mathscr{T}}, and the dimension dd.

Proof

Since zz solves (5), we invoke Galerkin orthogonality and an elementwise integration by parts formula to arrive at

((zz𝒯),v)L2(Ω)+(𝔲(zz𝒯),v)L2(Ω)=T𝒯T(𝔣𝔲z𝒯)(vI𝒯v)dx+S𝒮Sz𝒯𝝂(vI𝒯v)dx.(\nabla(z-z_{\mathscr{T}}),\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}(z-z_{\mathscr{T}}),v)_{L^{2}(\Omega)}\\ =\sum_{T\in\mathscr{T}}\int_{T}(\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}})(v-I_{\mathscr{T}}v)\mathrm{d}x+\sum_{S\in\mathscr{S}}\int_{S}\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket(v-I_{\mathscr{T}}v)\mathrm{d}x.

Here, vH01(Ω)v\in H_{0}^{1}(\Omega) and I𝒯:L1(Ω)𝕍(𝒯)I_{\mathscr{T}}:L^{1}(\Omega)\rightarrow\mathbb{V}(\mathscr{T}) denotes the Clément interpolation operator MR2373954; MR0520174. Standard approximation properties for I𝒯I_{\mathscr{T}} and the finite overlapping property of stars allow us to derive

((zz𝒯),v)L2(Ω)+(𝔲(zz𝒯),v)L2(Ω)[T𝒯hT2𝔣𝔲z𝒯L2(T)2+hTz𝒯𝝂L2(TΩ)2]12vL2(Ω).(\nabla(z-z_{\mathscr{T}}),\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}(z-z_{\mathscr{T}}),v)_{L^{2}(\Omega)}\\ \lesssim\left[\sum_{T\in\mathscr{T}}h_{T}^{2}\|\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}}\|_{L^{2}(T)}^{2}+h_{T}\|\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket\|_{L^{2}(\partial T\setminus\partial\Omega)}^{2}\right]^{\tfrac{1}{2}}\|\nabla v\|_{L^{2}(\Omega)}.

Set v=zz𝒯H01(Ω)v=z-z_{\mathscr{T}}\in H_{0}^{1}(\Omega) and use the fact that 𝔲>0\mathfrak{u}>0 to conclude. ∎

4 The Optimal Control Problem

In this section, we follow (MR2536007, section 2) and introduce a weak formulation for the optimal control problem (2)–(4). In addition, we review first and second order optimality conditions and introduce finite element discretization schemes.

4.1 Weak Formulation and Existence of a Solution

Let JJ be the cost functional defined in (1). We consider the following weak version of the optimization problem (2)–(4): Find

min{J(y,u):(y,u)H01(Ω)×𝕌ad}\min\{J(y,u):(y,u)\in H_{0}^{1}(\Omega)\times\mathbb{U}_{ad}\} (9)

subject to the state equation

(y,v)L2(Ω)+(uy,v)L2(Ω)=(f,v)L2(Ω)vH01(Ω).(\nabla y,\nabla v)_{L^{2}(\Omega)}+(uy,v)_{L^{2}(\Omega)}=(f,v)_{L^{2}(\Omega)}\quad\forall v\in H_{0}^{1}(\Omega). (10)

The existence of an optimal solution (y¯,u¯)H01(Ω)×𝕌ad(\bar{y},\bar{u})\in H_{0}^{1}(\Omega)\times\mathbb{U}_{ad} for problem (9)–(10) follows standard arguments; see (MR2536007, Proposition 2.3).

4.2 Optimality Conditions

Due to the fact that the optimal control problem (9)–(10) is not convex, we discuss optimality conditions under the framework of local solutions in L2(Ω)L^{2}(\Omega). To be precise, a control u¯𝕌ad\bar{u}\in\mathbb{U}_{ad} is said to be locally optimal in L2(Ω)L^{2}(\Omega) for (9)–(10) if there exists a constant δ>0\delta>0 such that J(y¯,u¯)J(y,u)J(\bar{y},\bar{u})\leq J(y,u) for all u𝕌adu\in\mathbb{U}_{ad} such that uu¯L2(Ω)δ\|u-\bar{u}\|_{L^{2}(\Omega)}\leq\delta. Here, y¯\bar{y} and yy denote the states associated to u¯\bar{u} and uu, respectively.

Let us introduce the set 𝒰:={uL(Ω):c>0 such that u(x)>c>0 a.e. xΩ}\mathcal{U}:=\{u\in L^{\infty}(\Omega):\exists c>0\text{ such that }u(x)>c>0\text{ a.e. }x\in\Omega\}. We immediately notice that 𝕌ad𝒰\mathbb{U}_{ad}\subset\mathcal{U}. Having defined 𝒰\mathcal{U}, we introduce the control-to-state map 𝒮\mathcal{S} as follows: given a control u𝒰u\in\mathcal{U}, 𝒮\mathcal{S} associates to it a unique state y=𝒮uH01(Ω)y=\mathcal{S}u\in H_{0}^{1}(\Omega) solving (10). With these ingredients at hand, we define the reduced cost functional j:𝒰0+j:\mathcal{U}\to\mathbb{R}_{0}^{+} by

j(u)=J(𝒮u,u):=12𝒮uyΩL2(Ω)2+α2uL2(Ω)2.j(u)=J(\mathcal{S}u,u):=\frac{1}{2}\|\mathcal{S}u-y_{\Omega}\|_{L^{2}(\Omega)}^{2}+\frac{{\color[rgb]{0,0,0}\alpha}}{2}\|u\|_{L^{2}(\Omega)}^{2}.

We are now in position to formulate first order optimality conditions: if u¯\bar{u} is locally optimal for problem (9)–(10), then (MR2536007, Proposition 2.10)

j(u¯)(uu¯)0u𝕌ad.j^{\prime}(\bar{u})(u-\bar{u})\geq 0\quad\forall u\in\mathbb{U}_{ad}. (11)

Here, j(u¯)j^{\prime}(\bar{u}) denotes the Gateâux derivative of the functional jj at u¯\bar{u} in the direction uu¯u-\bar{u}. We notice that, for u,v𝕌adu,v\in\mathbb{U}_{ad}, j(u)v=(αuyp,v)L2(Ω)j^{\prime}(u)v=(\alpha u-yp,v)_{L^{2}(\Omega)} (MR2536007, equation (2.6)) and immediately comment that 𝒮\mathcal{S} and jj are not Fréchet differentiable with respect to the L2(Ω)L^{2}(\Omega)-topology (MR2536007, Remark 2.8). To investigate the inequality (11), we introduce the adjoint variable pH01(Ω)p\in H_{0}^{1}(\Omega) as the unique solution to the adjoint equation

(w,p)L2(Ω)+(up,w)L2(Ω)=(yyΩ,w)L2(Ω)wH01(Ω),(\nabla w,\nabla p)_{L^{2}(\Omega)}+(up,w)_{L^{2}(\Omega)}=(y-y_{\Omega},w)_{L^{2}(\Omega)}\quad\forall w\in H_{0}^{1}(\Omega), (12)

where y=𝒮uy=\mathcal{S}u solves (10). Observe that problem (12) is well-posed.

With the previous ingredients at hand, we reformulate first order optimality conditions as follows; see (MR2536007, Proposition 2.10 and equation (2.6)).

Theorem 4.1 (first order optimality conditions)

Every locally optimal control u¯𝕌ad\bar{u}\in\mathbb{U}_{ad} for problem (9)–(10) satisfies, together with the state y¯H01(Ω)\bar{y}\in H_{0}^{1}(\Omega) and the adjoint state p¯H01(Ω)\bar{p}\in H_{0}^{1}(\Omega), the variational inequality

(αu¯y¯p¯,uu¯)L2(Ω)0u𝕌ad.(\alpha\bar{u}-\bar{y}\bar{p},u-\bar{u})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad}. (13)

Here, p¯\bar{p} denotes the solution to (12) with yy replaced by y¯=𝒮u¯\bar{y}=\mathcal{S}\bar{u}.

Let us now introduce the projection operator Π[a,b]:L1(Ω)𝕌ad\Pi_{[\texttt{a},\texttt{b}]}:L^{1}(\Omega)\rightarrow\mathbb{U}_{ad} as

Π[a,b](v):=min{b,max{v,a}} a.e. in Ω.\Pi_{[\texttt{a},\texttt{b}]}(v):=\min\{\texttt{b},\max\{v,\texttt{a}\}\}\textrm{ a.e. in }\Omega. (14)

This operator allows us to present the following projection formula (MR2536007, equation (2.7)): If u¯\bar{u} denotes a locally optimal control for (9)–(10), then

u¯(x):=Π[a,b](α1y¯(x)p¯(x)) a.e.xΩ.\bar{u}(x):=\Pi_{[\texttt{a},\texttt{b}]}(\alpha^{-1}\bar{y}(x)\bar{p}(x))\textrm{ a.e.}\leavevmode\nobreak\ x\in\Omega. (15)

Let u¯𝕌ad\bar{u}\in\mathbb{U}_{ad} be a control that satisfies the first order necessary optimality condition (13). In what follows, we will assume that there exists a constant μ>0\mu>0 such that

j′′(u¯)v2μvL2(Ω)2vL(Ω);j^{\prime\prime}(\bar{u})v^{2}\geq\mu\|v\|_{L^{2}(\Omega)}^{2}\quad\forall v\in L^{\infty}(\Omega); (16)

see (MR2536007, Assumption 2.20). Here, for each vL(Ω)v\in L^{\infty}(\Omega), we have that

j′′(u¯)v2:=𝒮(u¯)vL2(Ω)2+(y¯yΩ,𝒮′′(u¯)v2)L2(Ω)+αvL2(Ω)2,j^{\prime\prime}(\bar{u})v^{2}:=\|\mathcal{S}^{\prime}(\bar{u})v\|_{L^{2}(\Omega)}^{2}+(\bar{y}-y_{\Omega},\mathcal{S}^{\prime\prime}(\bar{u})v^{2})_{L^{2}(\Omega)}+\alpha\|v\|_{L^{2}(\Omega)}^{2},

where 𝒮(u¯)v\mathcal{S}^{\prime}(\bar{u})v and 𝒮′′(u¯)v2\mathcal{S}^{\prime\prime}(\bar{u})v^{2} are defined as in (MR2536007, Lemma 2.9). We notice that assumption (16) is fulfilled if 𝒮u¯yΩL2(Ω)\|\mathcal{S}\bar{u}-y_{\Omega}\|_{L^{2}(\Omega)} is sufficiently small or α>0\alpha>0 is sufficiently large; see (MR2536007, Remark 2.21) for details.

The following result states that every control u¯\bar{u} that satisfies (13) and (16) is a local solution for problem (9)–(10); see (MR2536007, Theorem 2.24).

Theorem 4.2 (local optimality)

Let u¯𝕌ad\bar{u}\in\mathbb{U}_{ad} be a local solution to (9)–(10) satisfying the necessary and sufficient optimality conditions (13) and (16). Then, there exist positive constants δ,σ>0\delta,\sigma>0 such that

j(u)j(u¯)+σuu¯L2(Ω)2u𝕌adBδ(u¯),j(u)\geq j(\bar{u})+\sigma\|u-\bar{u}\|_{L^{2}(\Omega)}^{2}\quad\forall u\in\mathbb{U}_{ad}\cap B_{\delta}(\bar{u}),

where Bδ(u¯)B_{\delta}(\bar{u}) denotes the closed ball in L2(Ω)L^{2}(\Omega) with center at u¯\bar{u} and radius δ\delta.

We conclude this section with the following estimate (MR2536007, Proposition 2.22): Let u,v𝕌adu,v\in\mathbb{U}_{ad} and wL(Ω)w\in L^{\infty}(\Omega). Then, there exists >0\mathfrak{C}>0, depending on fL2(Ω)\|f\|_{L^{2}(\Omega)} and yΩL2(Ω)\|y_{\Omega}\|_{L^{2}(\Omega)}, such that

|j′′(u)w2j′′(v)w2|uvL2(Ω)wL2(Ω)2.|j^{\prime\prime}(u)w^{2}-j^{\prime\prime}(v)w^{2}|\leq\mathfrak{C}\|u-v\|_{L^{2}(\Omega)}\|w\|_{L^{2}(\Omega)}^{2}. (17)

4.3 Finite Element Approximation

In this section, we introduce two finite element discretization schemes for our optimal control problem.

4.3.1 The Fully Discrete Scheme

To approximate a control variable, we introduce the space of piecewise constant functions

𝕌(𝒯):={u𝒯L(Ω):u𝒯|T0(T)T𝒯}\mathbb{U}(\mathscr{T}):=\{u_{\mathscr{T}}\in L^{\infty}(\Omega):u_{\mathscr{T}}|_{T}\in\mathbb{P}_{0}(T)\ \forall T\in\mathscr{T}\}

and define the discrete admissible set 𝕌ad(𝒯):=𝕌(𝒯)𝕌ad\mathbb{U}_{ad}(\mathscr{T}):=\mathbb{U}(\mathscr{T})\cap\mathbb{U}_{ad}. The state and adjoint state variables, associated to a locally optimal control, are discretized by using the finite element space 𝕍(𝒯)\mathbb{V}(\mathscr{T}) defined in (7). With this setting at hand, the fully discrete scheme reads as follows: Find minJ(y𝒯,u𝒯)\min J(y_{\mathscr{T}},u_{\mathscr{T}}) subject to the discrete state equation

y𝒯𝕍(𝒯):(y𝒯,v𝒯)L2(Ω)+(u𝒯y𝒯,v𝒯)L2(Ω)=(f,v𝒯)L2(Ω)y_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla y_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(u_{\mathscr{T}}y_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(f,v_{\mathscr{T}})_{L^{2}(\Omega)} (18)

for all v𝒯𝕍(𝒯)v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) and the discrete constraints u𝒯𝕌ad(𝒯)u_{\mathscr{T}}\in\mathbb{U}_{ad}(\mathscr{T}). The fully discrete scheme admits at least a solution; see (MR2536007, Section 3) for details. In addition, if u¯𝒯\bar{u}_{\mathscr{T}} denotes a discrete local solution, then

(αu¯𝒯y¯𝒯p¯𝒯,u𝒯u¯𝒯)L2(Ω)0u𝒯𝕌ad(𝒯),(\alpha\bar{u}_{\mathscr{T}}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u_{\mathscr{T}}-\bar{u}_{\mathscr{T}})_{L^{2}(\Omega)}\geq 0\quad\forall u_{\mathscr{T}}\in\mathbb{U}_{ad}(\mathscr{T}),

where p¯𝒯𝕍(𝒯)\bar{p}_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) is such that

(w𝒯,p¯𝒯)L2(Ω)+(u¯𝒯p¯𝒯,w𝒯)L2(Ω)=(y¯𝒯yΩ,w𝒯)L2(Ω)(\nabla w_{\mathscr{T}},\nabla\bar{p}_{\mathscr{T}})_{L^{2}(\Omega)}+(\bar{u}_{\mathscr{T}}\bar{p}_{\mathscr{T}},w_{\mathscr{T}})_{L^{2}(\Omega)}=(\bar{y}_{\mathscr{T}}-y_{\Omega},w_{\mathscr{T}})_{L^{2}(\Omega)} (19)

for all w𝒯𝕍(𝒯)w_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}); see (MR2536007, equations (3.6) and (3.7)).

4.3.2 The Semi-discrete Scheme

In this section, we introduce the so-called variational discretization approach for (9)–(10). This scheme discretizes only the state space; the control space 𝕌ad\mathbb{U}_{ad} is not discretized. The scheme induces a discretization of an optimal control variable by projecting, in view of the operator introduced in (14), an optimal discrete adjoint state into 𝕌ad\mathbb{U}_{ad}. The semi-discrete scheme is defined as follows: Find minJ(y𝒯,𝗎)\min J(y_{\mathscr{T}},\mathsf{u}) subject to the discrete state equation

y𝒯𝕍(𝒯):(y𝒯,v𝒯)L2(Ω)+(𝗎y𝒯,v𝒯)L2(Ω)=(f,v𝒯)L2(Ω)y_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla y_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(\mathsf{u}y_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(f,v_{\mathscr{T}})_{L^{2}(\Omega)} (20)

for all v𝒯𝕍(𝒯)v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) and the constraints 𝗎𝕌ad\mathsf{u}\in\mathbb{U}_{ad}. As in the fully discrete case, this problem admits at least a solution and, if 𝗎¯\bar{\mathsf{u}} denotes a local solution, then

(α𝗎¯y¯𝒯p¯𝒯,u𝗎¯)L2(Ω)0u𝕌ad,(\alpha\bar{\mathsf{u}}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u-\bar{\mathsf{u}})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad},

where p¯𝒯𝕍(𝒯)\bar{p}_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}) solves

(w𝒯,p¯𝒯)L2(Ω)+(𝗎¯p¯𝒯,w𝒯)L2(Ω)=(y¯𝒯yΩ,w𝒯)L2(Ω)(\nabla w_{\mathscr{T}},\nabla\bar{p}_{\mathscr{T}})_{L^{2}(\Omega)}+(\bar{\mathsf{u}}\bar{p}_{\mathscr{T}},w_{\mathscr{T}})_{L^{2}(\Omega)}=(\bar{y}_{\mathscr{T}}-y_{\Omega},w_{\mathscr{T}})_{L^{2}(\Omega)} (21)

for all w𝒯𝕍(𝒯)w_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}). Here, y¯𝒯=y¯𝒯(𝗎¯)\bar{y}_{\mathscr{T}}=\bar{y}_{\mathscr{T}}(\bar{\mathsf{u}}) solves (20) with 𝗎=𝗎¯\mathsf{u}=\bar{\mathsf{u}}.

5 A Posteriori Error Analysis for the Fully Discrete Scheme

In this section, we devise and analyze an a posteriori error estimator for the fully discrete scheme. The error estimator will be formed by the sum of three contributions: two contributions related to the discretization of the state and adjoint equations and a one contribution associated to the discretization of the admissible control set 𝕌ad\mathbb{U}_{ad}.

To begin with our studies we introduce, on the basis of the projection operator Π[a,b]\Pi_{[\texttt{a},\texttt{b}]} defined in (14), the auxiliary variable

u~:=Π[a,b](α1y¯𝒯p¯𝒯).\tilde{u}:=\Pi_{[\texttt{a},\texttt{b}]}\left(\alpha^{-1}\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}}\right). (22)

A key property in favor of the definition of u~\tilde{u} is that it satisfies the following variational inequality (Troltzsch, Lemma 2.26):

(αu~y¯𝒯p¯𝒯,uu~)L2(Ω)0u𝕌ad.(\alpha\tilde{u}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u-\tilde{u})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad}. (23)

With the variable u~\tilde{u} at hand, we present the following result which is instrumental for our a posteriori error analysis.

Theorem 5.1 (auxiliary estimate)

Let u¯𝕌ad\bar{u}\in\mathbb{U}_{ad} be a local solution to (9)–(10) satisfying the sufficient second order optimality condition (16). Let u¯𝒯\bar{u}_{\mathscr{T}} be a local minimum of the fully discrete optimal control problem with y¯𝒯\bar{y}_{\mathscr{T}} and p¯𝒯\bar{p}_{\mathscr{T}} being the corresponding state and adjoint state, respectively. If y¯𝒯\bar{y}_{\mathscr{T}} and p¯𝒯\bar{p}_{\mathscr{T}} satisfy, on the mesh 𝒯\mathscr{T}, the bound

y¯p¯y¯𝒯p¯𝒯L2(Ω)αμ(2)1,\|\bar{y}\bar{p}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}}\|_{L^{2}(\Omega)}\leq\alpha\mu(2\mathfrak{C})^{-1}, (24)

then

μ2u¯u~L2(Ω)2(j(u~)j(u¯))(u~u¯).\frac{\mu}{2}\|\bar{u}-\tilde{u}\|_{L^{2}(\Omega)}^{2}\leq(j^{\prime}(\tilde{u})-j^{\prime}(\bar{u}))(\tilde{u}-\bar{u}). (25)

The constants μ\mu and \mathfrak{C} are given as in (16) and (17), respectively.