∎

¹¹institutetext: 🖂 Enrique Otárola
[email protected] ²²institutetext: ¹ Departamento de Matemática, Universidad Técnica Federico Santa María, Av. España 1680, Valparaíso, Chile.

A Posteriori Error Estimates for an Optimal Control Problem with a Bilinear State Equation

Francisco Fuica¹ Enrique Otárola¹

(Received: date / Accepted: date)

Abstract

We propose and analyze a posteriori error estimators for an optimal control problem that involves an elliptic partial differential equation as state equation and a control variable that enters the state equation as a coefficient; pointwise constraints on the control variable are considered as well. We consider two different strategies to approximate optimal variables: a fully discrete scheme in which the admissible control set is discretized with piecewise constant functions and a semi-discrete scheme where the admissible control set is not discretized; the latter scheme being based on the so-called variational discretization approach. We design, for each solution technique, an a posteriori error estimator and show, in two and three dimensional Lipschitz polygonal/polyhedral domains (not necessarily convex), that the proposed error estimator is reliable and efficient. We design, based on the devised estimators, adaptive strategies that deliver optimal experimental rates of convergence for the performed numerical examples.

Keywords:

optimal control problems bilinear equations finite elements a posteriori error estimates adaptive finite element methods

MSC:

49M25 65N15 65N30 65N50

^†^†journal: JOTA

1 Introduction

The development and study of discretization techniques, based on finite elements, for distributed control–constrained linear–quadratic elliptic optimal control problems have been widely studied in the literature; see MR2843956; MR2516528 for an extensive list of references. These discretization techniques are mainly divided into two categories, which rely on the discretization of the state and adjoint equations; they differ on whether or not the admissible control set is also discretized. In contrast to these advances, the study of solution techniques for optimal control problems where the control variable enters the state equation as a coefficient is not as developed. One of the main sources of difficulty within these type of problems is that the solution of the state equation depends nonlinearly on the control variable MR2536007. Consequently, uniqueness of solutions cannot be guaranteed.

In this work, we will focus on the development and analysis of efficient solution techniques for the optimal control problem (2)–(4), which incorporates the control variable as a coefficient in the state equation; the control variable is not a source term. We immediately mention that this problem can be interpreted as a particular instance of parameter estimation. To the best of our knowledge, the first work that provides an analysis for suitable finite element discretizations for problem (2)–(4) is MR2536007. In this work, the authors propose, on convex polygonal/polyhedral domains and quasi–uniform meshes, two fully discrete schemes that discretize the admissible control set with piecewise constant and piecewise linear functions; the state and adjoint equations are discretized with piecewise linear functions. Estimates for the error committed within the approximation of a control variable are derived in (MR2536007, Corollaries 5.6 and 5.10). In addition, an error estimate for a post-processing strategy is obtained in (MR2536007, Theorem 5.18). The results obtained in MR2536007 were later extended to mixed and stabilized finite element methods in MR3103238 and MR3693332, respectively.

A particular class of numerical methods that has proven a competitive performance when are used to approximate solutions to PDE–constrained optimization problems, and the ones we shall consider in this work, are adaptive finite element methods (AFEMs). AFEMs are iterative methods recognizable by their capability to improve the quality of a discrete approximation to a corresponding PDE while keeping an efficient distribution of computational resources. A crucial component of an AFEM is an a posteriori error estimator, which is a computable quantity, depending on the problem data and discrete solution, that provides local information about the quality of the approximate solution. The a posteriori error analysis for control–constrained linear–quadratic optimal control problems has achieved several advances in recent years. We refer the interested reader to MR1887737; MR1780911; MR2434065; MR3212590; MR3621827; MR4122501 for a discussion. As opposed to these advances, the analysis of AFEMs for optimal control problems involving nonlinear or bilinear equations is rather scarce. To the best of our knowledge, the work MR2680928 appears to be the first that provides a posteriori error estimates for (2)–(4). In this work, the authors develop a posteriori error estimators for two fully discrete approximation schemes of (2)–(4) and obtain global reliability estimates (MR2680928, Theorems 4.1 and 4.3) and global efficiency results (MR2680928, Lemmas 4.2 and 4.3). We also mention the work MR3174031, where a posteriori error estimates for a parabolic version of (2)–(4) have been analyzed; an efficiency analysis, however, was not provided. We conclude this paragraph by mentioning the work MR2373479, where the authors provide, on the basis of a posteriori error estimators, upper bounds for discretization errors with respect to a cost functional and with respect to a given quantity of interest; the latter being an arbitrary functional depending on the control and the state variables. In our work, we derive upper and lower bounds for the approximation error when is measured in an energy norm; see below for a discussion.

In the present manuscript, we consider two different strategies to discretize the optimal control problem (2)–(4): a semi-discrete scheme, based on the so-called variational discretization approach MR2122182, in which the admissible control set is not discretized, and a fully discrete scheme, where control variables are approximated by using piecewise constant functions. We devise, for each one of the aforementioned schemes, a residual–based a posteriori error estimator. For the fully discrete scheme the error estimator is formed by the sum of three contributions: two of them are related to the discretization of the state and adjoint equations while the remaining one is related to the discretization of the admissible control set. In contrast, the error estimator for the variational discretization approach is formed by only two contributions that are related to the discretization of the state and adjoint equations. In two and three dimensional Lipschitz polygonal/polyhedral domains (not necessarily convex), we obtain reliability and efficiency estimates.

In what follows we list what, we believe, are the main contributions of our work:

•

For the fully and semi-discrete schemes that we consider, we devise a posteriori error estimators; both being different from the ones in MR2680928 and MR2373479.
•

For the aforementioned solution techniques, we prove that the corresponding local error indicators associated to the discretization of the state and adjoint equations are locally efficient. This analysis improves the global one in (MR2680928, Lemmas 4.2 and 4.3). We also prove that the total error indicator associated to the variational discretization approach is locally efficient (cf. Theorem LABEL:thm:global_eff_var); the one associated to the fully discrete scheme, as customary, being globally efficient (cf. Theorem LABEL:thm:global_eff).
•

We design a simple adaptive loop that delivers optimal experimental rates of convergence for all the involved individual contributions of the corresponding error. The loop based on the a posteriori error indicators devised for the variational discretization approach delivers quadratic rates of convergence for the error approximation of a control variable. This substantially improves the approximation properties that can be achieved by the considered fully discrete scheme. The indicators devised for the latter scheme tend to refine the involved meshes in regions where the restrictions of the control variable become active. These DOFs seem not necessary for an accurate approximation of control variables. An scheme based on piecewise linear approximation of the admissible control set would suffer the same limitations in terms of avoidable refinement MR4122501.

The rest of the paper is organized as follows. In section 2 we introduce the optimal control problem under consideration and set notation. Basic results for the state equation as well as basic a posteriori error estimates are reviewed in section 3. In section 4 we review the existence of solutions for the optimal control problem as well as first and second order optimality conditions. The crucial part of our work are sections 5 and LABEL:sec:a_posteriori_semi, where we design and analyze a posteriori error estimators for the fully and semi-discrete schemes, respectively. Finally, in section LABEL:sec:numerical_ex we present numerical examples in two and three dimensional domains that illustrate the theory and reveal a competitive performance of the devised AFEMs.

2 The Problem and Notation

Let us precisely introduce the optimal control problem that will be considered in our work, set notation, and describe the setting we shall operate with.

2.1 Presentation of the Problem

In this work we are interested in the design and analysis of a posteriori error estimates for an optimal control problem governed by an elliptic partial differential equation (PDE) as state equation. Our main source of difficulty here is that the control variable enters the state equation as a coefficient; control constraints are also considered. Let us make this discussion precise. Let $\Omega\subset\mathbb{R}^{d}$ , with $d\in\{2,3\}$ , be an open and bounded polygonal/polyhedral domain with Lipschitz boundary $\partial\Omega$ (MR2424078, Chapter 4). Given a desired state $y_{\Omega}\in L^{2}(\Omega)$ and a regularization parameter $\alpha>0$ , let us introduce the cost functional

J(y,u):=\frac{1}{2}\|y-y_{\Omega}\|_{L^{2}(\Omega)}^{2}+\frac{\alpha}{2}\|u\|_{L^{2}(\Omega)}^{2}.

(1)

We are thus interested in the following optimal control problem: Find

\min{J(y,u)}

(2)

subject to the elliptic PDE

-\Delta y+uy=f\text{ in }\Omega,\qquad y=0\text{ on }\partial\Omega,

(3)

where $f\in L^{2}(\Omega)$ denotes an external source, and the control constraints

u\in\mathbb{U}_{ad},\qquad\mathbb{U}_{ad}:=\{v\in L^{2}(\Omega):0<\texttt{a}\leq v\leq\texttt{b}\text{ a.e. in }\Omega\}.

(4)

Here, $\texttt{a},\texttt{b}\in\mathbb{R}^{+}$ satisfy $\texttt{a}<\texttt{b}$ .

2.2 Notation

Let us set notation and describe the setting we shall operate with. Throughout this work $d\in\{2,3\}$ and $\Omega\subset\mathbb{R}^{d}$ is an open and bounded polygonal/polyhedral domain with Lipschitz boundary $\partial\Omega$ (MR2424078, Chapter 4). Notice that we do not assume that $\Omega$ is convex. If $\mathcal{X}$ and $\mathcal{Y}$ are normed vector spaces, we write $\mathcal{X}\hookrightarrow\mathcal{Y}$ to denote that $\mathcal{X}$ is continuously embedded in $\mathcal{Y}$ . We denote by $\|\cdot\|_{\mathcal{X}}$ the norm of $\mathcal{X}$ . The relation $a\lesssim b$ indicates that $a\leq Cb$ , with a positive constant that depends neither on $a$ , $b$ nor on the involved discretization parameters. The value of $C$ might change at each occurrence.

3 The State Equation

In this section, we briefly review some results related to the well-posedness of problem (3). Additionally, we present a posteriori error estimates for a specific finite element setting.

3.1 Weak Formulation

Let $\mathfrak{f}$ be a given forcing term in $L^{2}(\Omega)$ and $\mathfrak{u}$ be an arbitrary function in $\mathbb{U}_{ad}$ . With this setting at hand, we introduce the following weak problem:

z\in H_{0}^{1}(\Omega):\quad(\nabla z,\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}z,v)_{L^{2}(\Omega)}=(\mathfrak{f},v)_{L^{2}(\Omega)}\quad\forall v\in H_{0}^{1}(\Omega).

(5)

Lax–Milgram Theorem immediately yields the well-posedness of problem (5). In particular, we have the following stability estimate $\|\nabla z\|_{L^{2}(\Omega)}\lesssim\|\mathfrak{f}\|_{L^{2}(\Omega)}.$

3.2 Finite Element Approximation

In this section, we introduce a basic finite element approximation for the weak problem (5) and review basic a posteriori error estimates. To accomplish this task, we first introduce some terminology and further basic ingredients.

We denote by $\mathscr{T}=\{T\}$ a conforming partition of $\overline{\Omega}$ into simplices $T$ with size $h_{T}=\text{diam}(T)$ and define $h_{\mathscr{T}}:=\max_{T\in\mathscr{T}}h_{T}$ . We denote by $\mathscr{S}$ the set of internal $(d-1)$ -dimensional interelement boundaries $S$ of $\mathscr{T}$ . For $T\in\mathscr{T}$ , we let $\mathscr{S}_{T}$ denote the subset of $\mathscr{S}$ which contains the sides of the element $T$ . We denote by ${\color[rgb]{0,0,0}\mathcal{N}_{S}\subset\mathscr{T}}$ the subset that contains the two elements that have $S$ as a side, namely, $\mathcal{N}_{S}=\{T^{+},T^{-}\}$ , where $T^{+},T^{-}\in\mathscr{T}$ are such that $S=T^{+}\cap T^{-}$ . For $T\in\mathscr{T}$ , we define the star associated with the element $T$ as

\mathcal{N}_{T}:=\left\{T^{\prime}\in\mathscr{T}:\mathscr{S}_{T}\cap\mathscr{S}_{T^{\prime}}\neq\emptyset\right\}.

(6)

In an abuse of notation, below we denote by $\mathcal{N}_{T}$ either the set itself or the union of its elements.

We define, for $T\in\mathscr{T}$ , the shape coefficient $\sigma_{T}$ of $T$ as the ratio of the diameter, i.e., $\mathrm{diam}(T)=h_{T}$ , and the inball diameter of $T$ , i.e., $2\sup\{r>0:B_{r}(x)\subset T\textrm{ for }x\in T\}$ . The shape coefficient of a triangulation $\mathscr{T}$ corresponds to the quantity $\sigma_{\mathscr{T}}:=\max\{T:T\in\mathscr{T}\}$ . A sequence of triangulations $\mathbb{T}=\{\mathscr{T}\}$ , that is obtained by subsequent refinements of an initial mesh $\mathscr{T}_{0}$ , is shape regular if $\sup\{\sigma_{\mathscr{T}}:\mathscr{T}\in\mathbb{T}\}\leq C$ ; see (Nochetto_etal2009, section 3.2.1) for details.

Given a mesh $\mathscr{T}\in\mathbb{T}$ , we define the finite element space of continuous piecewise linear functions as

\mathbb{V}(\mathscr{T}):=\{v_{\mathscr{T}}\in C(\overline{\Omega}):v_{\mathscr{T}}|_{T}\in\mathbb{P}_{1}(T)\ \forall T\in\mathscr{T}\}\cap H_{0}^{1}(\Omega).

(7)

Given a discrete function $v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ , we define, for any internal side $S\in\mathscr{S}$ , the jump or interelement residual $\llbracket\nabla v_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket$ by

\llbracket\nabla v_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket:=\boldsymbol{\nu}^{+}\cdot\nabla v_{\mathscr{T}}|_{T^{+}}+\boldsymbol{\nu}^{-}\cdot\nabla v_{\mathscr{T}}|_{T^{-}},

where $\boldsymbol{\nu}^{+},\boldsymbol{\nu}^{-}$ denote the unit normals to $S$ pointing towards $T^{+}$ , $T^{-}\in\mathscr{T}$ , respectively. Here, $T^{+}$ , $T^{-}\in\mathscr{T}$ are such that $T^{+}\neq T^{-}$ and $\partial T^{+}\cap\partial T^{-}=S$ .

With these ingredients at hand, we introduce a Galerkin approximation to problem (5) as follows:

z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla z_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(\mathfrak{u}z_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(\mathfrak{f},v_{\mathscr{T}})_{L^{2}(\Omega)}

(8)

for all $v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ . Here, $\mathfrak{f}\in L^{2}(\Omega)$ and $\mathfrak{u}\in\mathbb{U}_{ad}$ . The existence and uniqueness of a solution $z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ of problem (8) is standard. In particular, we have the stability estimate $\|\nabla z_{\mathscr{T}}\|_{L^{2}(\Omega)}\lesssim\|\mathfrak{f}\|_{L^{2}(\Omega)}$ .

3.3 An a Posteriori Error Estimate for the State Equation

We introduce the following local error indicators and a posteriori error estimator associated to the discretization (8) of problem (5):

\mathcal{E}_{T}^{2}:=h_{T}^{2}\|\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}}\|_{L^{2}(T)}^{2}+h_{T}\|\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket\|_{L^{2}(\partial T\setminus\partial\Omega)}^{2},\qquad\mathcal{E}_{\mathscr{T}}^{2}:=\sum_{T\in\mathscr{T}}\mathcal{E}_{T}^{2}.

We present the following global reliability result.

Theorem 3.1 (global reliability of $\mathcal{E}$ )

Let $\mathfrak{f}\in L^{2}(\Omega)$ and $\mathfrak{u}\in\mathbb{U}_{ad}$ be given. Let $z\in H_{0}^{1}(\Omega)$ be the unique solution to problem (5) and let $z_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ be its finite element approximation obtained as the solution to (8). We thus have

\|\nabla(z-z_{\mathscr{T}})\|_{L^{2}(\Omega)}\lesssim\mathcal{E}_{\mathscr{T}},

with a hidden constant that is independent of $z$ , $z_{\mathscr{T}}$ , the size of the elements in $\mathscr{T}$ , and $\#\mathscr{T}$ but depends on the shape coefficient of the triangulation $\mathscr{T}$ , i.e., $\sigma_{\mathscr{T}}$ , and the dimension $d$ .

Proof

Since $z$ solves (5), we invoke Galerkin orthogonality and an elementwise integration by parts formula to arrive at

(\nabla(z-z_{\mathscr{T}}),\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}(z-z_{\mathscr{T}}),v)_{L^{2}(\Omega)}\\ =\sum_{T\in\mathscr{T}}\int_{T}(\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}})(v-I_{\mathscr{T}}v)\mathrm{d}x+\sum_{S\in\mathscr{S}}\int_{S}\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket(v-I_{\mathscr{T}}v)\mathrm{d}x.

Here, $v\in H_{0}^{1}(\Omega)$ and $I_{\mathscr{T}}:L^{1}(\Omega)\rightarrow\mathbb{V}(\mathscr{T})$ denotes the Clément interpolation operator MR2373954; MR0520174. Standard approximation properties for $I_{\mathscr{T}}$ and the finite overlapping property of stars allow us to derive

(\nabla(z-z_{\mathscr{T}}),\nabla v)_{L^{2}(\Omega)}+(\mathfrak{u}(z-z_{\mathscr{T}}),v)_{L^{2}(\Omega)}\\ \lesssim\left[\sum_{T\in\mathscr{T}}h_{T}^{2}\|\mathfrak{f}-\mathfrak{u}z_{\mathscr{T}}\|_{L^{2}(T)}^{2}+h_{T}\|\llbracket\nabla z_{\mathscr{T}}\cdot\boldsymbol{\nu}\rrbracket\|_{L^{2}(\partial T\setminus\partial\Omega)}^{2}\right]^{\tfrac{1}{2}}\|\nabla v\|_{L^{2}(\Omega)}.

Set $v=z-z_{\mathscr{T}}\in H_{0}^{1}(\Omega)$ and use the fact that $\mathfrak{u}>0$ to conclude. ∎

4 The Optimal Control Problem

In this section, we follow (MR2536007, section 2) and introduce a weak formulation for the optimal control problem (2)–(4). In addition, we review first and second order optimality conditions and introduce finite element discretization schemes.

4.1 Weak Formulation and Existence of a Solution

Let $J$ be the cost functional defined in (1). We consider the following weak version of the optimization problem (2)–(4): Find

\min\{J(y,u):(y,u)\in H_{0}^{1}(\Omega)\times\mathbb{U}_{ad}\}

(9)

subject to the state equation

(\nabla y,\nabla v)_{L^{2}(\Omega)}+(uy,v)_{L^{2}(\Omega)}=(f,v)_{L^{2}(\Omega)}\quad\forall v\in H_{0}^{1}(\Omega).

(10)

The existence of an optimal solution $(\bar{y},\bar{u})\in H_{0}^{1}(\Omega)\times\mathbb{U}_{ad}$ for problem (9)–(10) follows standard arguments; see (MR2536007, Proposition 2.3).

4.2 Optimality Conditions

Due to the fact that the optimal control problem (9)–(10) is not convex, we discuss optimality conditions under the framework of local solutions in $L^{2}(\Omega)$ . To be precise, a control $\bar{u}\in\mathbb{U}_{ad}$ is said to be locally optimal in $L^{2}(\Omega)$ for (9)–(10) if there exists a constant $\delta>0$ such that $J(\bar{y},\bar{u})\leq J(y,u)$ for all $u\in\mathbb{U}_{ad}$ such that $\|u-\bar{u}\|_{L^{2}(\Omega)}\leq\delta$ . Here, $\bar{y}$ and $y$ denote the states associated to $\bar{u}$ and $u$ , respectively.

Let us introduce the set $\mathcal{U}:=\{u\in L^{\infty}(\Omega):\exists c>0\text{ such that }u(x)>c>0\text{ a.e. }x\in\Omega\}$ . We immediately notice that $\mathbb{U}_{ad}\subset\mathcal{U}$ . Having defined $\mathcal{U}$ , we introduce the control-to-state map $\mathcal{S}$ as follows: given a control $u\in\mathcal{U}$ , $\mathcal{S}$ associates to it a unique state $y=\mathcal{S}u\in H_{0}^{1}(\Omega)$ solving (10). With these ingredients at hand, we define the reduced cost functional $j:\mathcal{U}\to\mathbb{R}_{0}^{+}$ by

j(u)=J(\mathcal{S}u,u):=\frac{1}{2}\|\mathcal{S}u-y_{\Omega}\|_{L^{2}(\Omega)}^{2}+\frac{{\color[rgb]{0,0,0}\alpha}}{2}\|u\|_{L^{2}(\Omega)}^{2}.

We are now in position to formulate first order optimality conditions: if $\bar{u}$ is locally optimal for problem (9)–(10), then (MR2536007, Proposition 2.10)

j^{\prime}(\bar{u})(u-\bar{u})\geq 0\quad\forall u\in\mathbb{U}_{ad}.

(11)

Here, $j^{\prime}(\bar{u})$ denotes the Gateâux derivative of the functional $j$ at $\bar{u}$ in the direction $u-\bar{u}$ . We notice that, for $u,v\in\mathbb{U}_{ad}$ , $j^{\prime}(u)v=(\alpha u-yp,v)_{L^{2}(\Omega)}$ (MR2536007, equation (2.6)) and immediately comment that $\mathcal{S}$ and $j$ are not Fréchet differentiable with respect to the $L^{2}(\Omega)$ -topology (MR2536007, Remark 2.8). To investigate the inequality (11), we introduce the adjoint variable $p\in H_{0}^{1}(\Omega)$ as the unique solution to the adjoint equation

(\nabla w,\nabla p)_{L^{2}(\Omega)}+(up,w)_{L^{2}(\Omega)}=(y-y_{\Omega},w)_{L^{2}(\Omega)}\quad\forall w\in H_{0}^{1}(\Omega),

(12)

where $y=\mathcal{S}u$ solves (10). Observe that problem (12) is well-posed.

With the previous ingredients at hand, we reformulate first order optimality conditions as follows; see (MR2536007, Proposition 2.10 and equation (2.6)).

Theorem 4.1 (first order optimality conditions)

Every locally optimal control $\bar{u}\in\mathbb{U}_{ad}$ for problem (9)–(10) satisfies, together with the state $\bar{y}\in H_{0}^{1}(\Omega)$ and the adjoint state $\bar{p}\in H_{0}^{1}(\Omega)$ , the variational inequality

(\alpha\bar{u}-\bar{y}\bar{p},u-\bar{u})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad}.

(13)

Here, $\bar{p}$ denotes the solution to (12) with $y$ replaced by $\bar{y}=\mathcal{S}\bar{u}$ .

Let us now introduce the projection operator $\Pi_{[\texttt{a},\texttt{b}]}:L^{1}(\Omega)\rightarrow\mathbb{U}_{ad}$ as

\Pi_{[\texttt{a},\texttt{b}]}(v):=\min\{\texttt{b},\max\{v,\texttt{a}\}\}\textrm{ a.e. in }\Omega.

(14)

This operator allows us to present the following projection formula (MR2536007, equation (2.7)): If $\bar{u}$ denotes a locally optimal control for (9)–(10), then

\bar{u}(x):=\Pi_{[\texttt{a},\texttt{b}]}(\alpha^{-1}\bar{y}(x)\bar{p}(x))\textrm{ a.e.}\leavevmode\nobreak\ x\in\Omega.

(15)

Let $\bar{u}\in\mathbb{U}_{ad}$ be a control that satisfies the first order necessary optimality condition (13). In what follows, we will assume that there exists a constant $\mu>0$ such that

j^{\prime\prime}(\bar{u})v^{2}\geq\mu\|v\|_{L^{2}(\Omega)}^{2}\quad\forall v\in L^{\infty}(\Omega);

(16)

see (MR2536007, Assumption 2.20). Here, for each $v\in L^{\infty}(\Omega)$ , we have that

j^{\prime\prime}(\bar{u})v^{2}:=\|\mathcal{S}^{\prime}(\bar{u})v\|_{L^{2}(\Omega)}^{2}+(\bar{y}-y_{\Omega},\mathcal{S}^{\prime\prime}(\bar{u})v^{2})_{L^{2}(\Omega)}+\alpha\|v\|_{L^{2}(\Omega)}^{2},

where $\mathcal{S}^{\prime}(\bar{u})v$ and $\mathcal{S}^{\prime\prime}(\bar{u})v^{2}$ are defined as in (MR2536007, Lemma 2.9). We notice that assumption (16) is fulfilled if $\|\mathcal{S}\bar{u}-y_{\Omega}\|_{L^{2}(\Omega)}$ is sufficiently small or $\alpha>0$ is sufficiently large; see (MR2536007, Remark 2.21) for details.

The following result states that every control $\bar{u}$ that satisfies (13) and (16) is a local solution for problem (9)–(10); see (MR2536007, Theorem 2.24).

Theorem 4.2 (local optimality)

Let $\bar{u}\in\mathbb{U}_{ad}$ be a local solution to (9)–(10) satisfying the necessary and sufficient optimality conditions (13) and (16). Then, there exist positive constants $\delta,\sigma>0$ such that

j(u)\geq j(\bar{u})+\sigma\|u-\bar{u}\|_{L^{2}(\Omega)}^{2}\quad\forall u\in\mathbb{U}_{ad}\cap B_{\delta}(\bar{u}),

where $B_{\delta}(\bar{u})$ denotes the closed ball in $L^{2}(\Omega)$ with center at $\bar{u}$ and radius $\delta$ .

We conclude this section with the following estimate (MR2536007, Proposition 2.22): Let $u,v\in\mathbb{U}_{ad}$ and $w\in L^{\infty}(\Omega)$ . Then, there exists $\mathfrak{C}>0$ , depending on $\|f\|_{L^{2}(\Omega)}$ and $\|y_{\Omega}\|_{L^{2}(\Omega)}$ , such that

|j^{\prime\prime}(u)w^{2}-j^{\prime\prime}(v)w^{2}|\leq\mathfrak{C}\|u-v\|_{L^{2}(\Omega)}\|w\|_{L^{2}(\Omega)}^{2}.

(17)

4.3 Finite Element Approximation

In this section, we introduce two finite element discretization schemes for our optimal control problem.

4.3.1 The Fully Discrete Scheme

To approximate a control variable, we introduce the space of piecewise constant functions

\mathbb{U}(\mathscr{T}):=\{u_{\mathscr{T}}\in L^{\infty}(\Omega):u_{\mathscr{T}}|_{T}\in\mathbb{P}_{0}(T)\ \forall T\in\mathscr{T}\}

and define the discrete admissible set $\mathbb{U}_{ad}(\mathscr{T}):=\mathbb{U}(\mathscr{T})\cap\mathbb{U}_{ad}$ . The state and adjoint state variables, associated to a locally optimal control, are discretized by using the finite element space $\mathbb{V}(\mathscr{T})$ defined in (7). With this setting at hand, the fully discrete scheme reads as follows: Find $\min J(y_{\mathscr{T}},u_{\mathscr{T}})$ subject to the discrete state equation

y_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla y_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(u_{\mathscr{T}}y_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(f,v_{\mathscr{T}})_{L^{2}(\Omega)}

(18)

for all $v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ and the discrete constraints $u_{\mathscr{T}}\in\mathbb{U}_{ad}(\mathscr{T})$ . The fully discrete scheme admits at least a solution; see (MR2536007, Section 3) for details. In addition, if $\bar{u}_{\mathscr{T}}$ denotes a discrete local solution, then

(\alpha\bar{u}_{\mathscr{T}}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u_{\mathscr{T}}-\bar{u}_{\mathscr{T}})_{L^{2}(\Omega)}\geq 0\quad\forall u_{\mathscr{T}}\in\mathbb{U}_{ad}(\mathscr{T}),

where $\bar{p}_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ is such that

(\nabla w_{\mathscr{T}},\nabla\bar{p}_{\mathscr{T}})_{L^{2}(\Omega)}+(\bar{u}_{\mathscr{T}}\bar{p}_{\mathscr{T}},w_{\mathscr{T}})_{L^{2}(\Omega)}=(\bar{y}_{\mathscr{T}}-y_{\Omega},w_{\mathscr{T}})_{L^{2}(\Omega)}

(19)

for all $w_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ ; see (MR2536007, equations (3.6) and (3.7)).

4.3.2 The Semi-discrete Scheme

In this section, we introduce the so-called variational discretization approach for (9)–(10). This scheme discretizes only the state space; the control space $\mathbb{U}_{ad}$ is not discretized. The scheme induces a discretization of an optimal control variable by projecting, in view of the operator introduced in (14), an optimal discrete adjoint state into $\mathbb{U}_{ad}$ . The semi-discrete scheme is defined as follows: Find $\min J(y_{\mathscr{T}},\mathsf{u})$ subject to the discrete state equation

y_{\mathscr{T}}\in\mathbb{V}(\mathscr{T}):\quad(\nabla y_{\mathscr{T}},\nabla v_{\mathscr{T}})_{L^{2}(\Omega)}+(\mathsf{u}y_{\mathscr{T}},v_{\mathscr{T}})_{L^{2}(\Omega)}=(f,v_{\mathscr{T}})_{L^{2}(\Omega)}

(20)

for all $v_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ and the constraints $\mathsf{u}\in\mathbb{U}_{ad}$ . As in the fully discrete case, this problem admits at least a solution and, if $\bar{\mathsf{u}}$ denotes a local solution, then

(\alpha\bar{\mathsf{u}}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u-\bar{\mathsf{u}})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad},

where $\bar{p}_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ solves

(\nabla w_{\mathscr{T}},\nabla\bar{p}_{\mathscr{T}})_{L^{2}(\Omega)}+(\bar{\mathsf{u}}\bar{p}_{\mathscr{T}},w_{\mathscr{T}})_{L^{2}(\Omega)}=(\bar{y}_{\mathscr{T}}-y_{\Omega},w_{\mathscr{T}})_{L^{2}(\Omega)}

(21)

for all $w_{\mathscr{T}}\in\mathbb{V}(\mathscr{T})$ . Here, $\bar{y}_{\mathscr{T}}=\bar{y}_{\mathscr{T}}(\bar{\mathsf{u}})$ solves (20) with $\mathsf{u}=\bar{\mathsf{u}}$ .

5 A Posteriori Error Analysis for the Fully Discrete Scheme

In this section, we devise and analyze an a posteriori error estimator for the fully discrete scheme. The error estimator will be formed by the sum of three contributions: two contributions related to the discretization of the state and adjoint equations and a one contribution associated to the discretization of the admissible control set $\mathbb{U}_{ad}$ .

To begin with our studies we introduce, on the basis of the projection operator $\Pi_{[\texttt{a},\texttt{b}]}$ defined in (14), the auxiliary variable

\tilde{u}:=\Pi_{[\texttt{a},\texttt{b}]}\left(\alpha^{-1}\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}}\right).

(22)

A key property in favor of the definition of $\tilde{u}$ is that it satisfies the following variational inequality (Troltzsch, Lemma 2.26):

(\alpha\tilde{u}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}},u-\tilde{u})_{L^{2}(\Omega)}\geq 0\quad\forall u\in\mathbb{U}_{ad}.

(23)

With the variable $\tilde{u}$ at hand, we present the following result which is instrumental for our a posteriori error analysis.

Theorem 5.1 (auxiliary estimate)

Let $\bar{u}\in\mathbb{U}_{ad}$ be a local solution to (9)–(10) satisfying the sufficient second order optimality condition (16). Let $\bar{u}_{\mathscr{T}}$ be a local minimum of the fully discrete optimal control problem with $\bar{y}_{\mathscr{T}}$ and $\bar{p}_{\mathscr{T}}$ being the corresponding state and adjoint state, respectively. If $\bar{y}_{\mathscr{T}}$ and $\bar{p}_{\mathscr{T}}$ satisfy, on the mesh $\mathscr{T}$ , the bound

\|\bar{y}\bar{p}-\bar{y}_{\mathscr{T}}\bar{p}_{\mathscr{T}}\|_{L^{2}(\Omega)}\leq\alpha\mu(2\mathfrak{C})^{-1},

(24)

then

\frac{\mu}{2}\|\bar{u}-\tilde{u}\|_{L^{2}(\Omega)}^{2}\leq(j^{\prime}(\tilde{u})-j^{\prime}(\bar{u}))(\tilde{u}-\bar{u}).

(25)

The constants $\mu$ and $\mathfrak{C}$ are given as in (16) and (17), respectively.