This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: D.T.V. An 22institutetext: Department of Mathematics and Informatics, Thai Nguyen University of Sciences, Thai Nguyen city, Vietnam
22email: [email protected]
33institutetext: N.D. Yen 44institutetext: Institute of Mathematics, Vietnam Academy of Science and Technology, Hanoi, Vietnam
44email: [email protected]

Optimality conditions based on the Fréchet second-order subdifferential

D.T.V. An    N.D. Yen
(Received: date / Accepted: date)
Abstract

This paper focuses on second-order necessary optimality conditions for constrained optimization problems on Banach spaces. For problems in the classical setting, where the objective function is C2C^{2}-smooth, we show that strengthened second-order necessary optimality conditions are valid if the constraint set is generalized polyhedral convex. For problems in a new setting, where the objective function is just assumed to be C1C^{1}-smooth and the constraint set is generalized polyhedral convex, we establish sharp second-order necessary optimality conditions based on the Fréchet second-order subdifferential of the objective function and the second-order tangent set to the constraint set. Three examples are given to show that the used hypotheses are essential for the new theorems. Our second-order necessary optimality conditions refine and extend several existing results.

Keywords:
Constrained optimization problems on Banach spaces second-order necessary optimality conditions Fréchet second-order subdifferential second-order tangent set generalized polyhedral convex set.
MSC:
49K27 49J53 90C30 90C46 90C20

1 Introduction

It is well-known that second-order optimality conditions are fundamental results in nonlinear mathematical programming Ben-Tal1980 ; Ben-Tal1982 ; Bonnans_Shapiro_2000 ; L_Y_2008 ; McCormick1967 ; Penot1994 ; Penot1999 ; Polyak ; Ruszczynski2006 , which have numerous applications in stability and sensitivity analysis, as well as in numerical methods for optimization problems. The need of generalizing these conditions to broader settings continues to attract attention of many researchers; see, e.g., ChieuLeeYen2017 ; HSN_1984 ; Huy_Tuyen and the references therein.

In classical second-order optimality conditions, the objective function of the finite-dimensional optimization problem in question is assumed to be twice continuously differentiable (a C2C^{2}-smooth function for short). If the objective function is continuously Fréchet differentiable and the gradient mapping is locally Lipschitz, then one has deal with a C1,1C^{1,1}-smooth problem. Second-order optimality conditions for finite-dimensional C1,1C^{1,1}- smooth optimization problems have been obtained by Hiriart-Urruty et al. HSN_1984 , Huy and Tuyen Huy_Tuyen .

If the objective function of an optimization problem is continuously Fréchet differentiable and the gradient mapping is merely continuous, then one has deal with a C1C^{1}-smooth problem. The class of C1C^{1}-smooth optimization problems is much larger than that of C1,1C^{1,1}- smooth optimization problems. As far as we know, the tools employed in HSN_1984 ; Huy_Tuyen are no longer suitable for C1C^{1}-smooth problems. To describe locally optimal solutions of C1C^{1}-smooth unconstrained minimization problems in a Banach space setting, Chieu et al. ChieuLeeYen2017 have explored the possibility of using the Fréchet second-order subdifferential and the limiting second-order subdifferential, which can be viewed as generalized Hessians of extended-real-valued functions. These concepts are due to Mordukhovich Mordukhovich_1992 ; Mordukhovich_2006a . The limiting second-order subdifferential has many applications in stability analysis of optimization problems; see, e.g., Mo_Ro_SIOPT2012 ; MRS_SIOPT2013 ; Poli_Roc_1998 and the references therein. As shown in ChieuChuongYaoYen2011 ; ChieuHuy2011 , the Fréchet second-order subdifferential is very useful in characterizing convexity of extended-real-valued functions. The authors of ChieuLeeYen2017 have shown that the Fréchet second-order subdifferential is suitable for presenting second-order necessary optimality conditions (ChieuLeeYen2017, , Theorems 3.1 and 3.3), while the limiting second-order subdifferential works well for second-order sufficient optimality conditions (ChieuLeeYen2017, , Theorem 4.7 and Corollary 4.8). Consulting a preprint version of ChieuLeeYen2017 , which appeared in 2013, Dai LVD2014 has extended the finite-dimensional version of (ChieuLeeYen2017, , Theorem 3.3) to the case of C1C^{1}-smooth optimization problems whose constraint sets are described by linear equalities.

Our interest in knowing deeper the role of second-order tangent sets in second-order optimality conditions mainly comes from the book of Bonnans and Shapiro Bonnans_Shapiro_2000 and Theorem 3.45 in the book by Ruszczynski Ruszczynski2006 . When the second-order derivative of the C2C^{2}-smooth objective function is replaced by the Fréchet second-order subdifferential or the limiting second-order subdifferential, nontrivial questions arise if one wants to have second-order optimality conditions based on second-order tangent sets. Since optimization problems with polyhedral convex constraint sets or generalized polyhedral convex constraint sets will be encountered frequently in our investigations, we remark that they are of great importance in optimization theory (see for example MRS_SIOPT2013 , where full stability of the local minimizers of such problems was characterized). An extended-real-valued function defined on a Banach space is said to be a generalized polyhedral convex function if its epigraph is a generalized polyhedral convex set. The interested reader is referred to (Luan_Yen, , pp. 71–77) and Luan_Yao for more comments on the role of generalized polyhedral convex sets and generalized polyhedral convex functions.

The main goal of this paper is to clarify the applicability of the Fréchet second-order subdifferential to establishing second-order optimality conditions for constrained minimization problems. For problems in the classical setting, where the objective function is C2C^{2}-smooth, we show that strengthened second-order necessary optimality conditions are valid if the constraint set is generalized polyhedral convex. For problems in a new setting, where the objective function is just assumed to be C1C^{1}-smooth and the constraint set is generalized polyhedral convex, we establish sharp second-order necessary optimality conditions based on the Fréchet second-order subdifferential of the objective function and the second-order tangent set to the constraint set. Our second-order necessary optimality conditions refine and extend several existing results. We will give three examples to show that the used hypotheses are essential for the new theorems.

The paper organization is as follows. Section 2 presents some basic definitions and auxiliary results. Section 3 is devoted to second-order optimality conditions for constrained optimization problems, where the objective function is C2C^{2}-smooth. Section 4 studies the possibility of using the Fréchet second-order subdifferential in second-order necessary optimality conditions for constrained optimization problems, where the objective function is C1C^{1}-smooth.

2 Preliminaries

Let XX be a Banach space over the reals with the dual and the second dual being denoted, respectively, by XX^{*} and XX^{**}. As usual, for a subset ΩX\Omega\subset X, we denote its convex hull (resp., interior, and boundary) by convΩ{\rm conv}\,\Omega (resp., intΩ{\rm int}\Omega, and Ω\partial\Omega). One says that a nonempty subset KXK\subset X is a cone if tKKtK\subset K for any t>0.t>0. Following Luan_Yao_Yen , we abbreviate the smallest convex cone containing Ω\Omega to cone Ω\Omega. Then, coneΩ={txt>0,xconvΩ}.{\rm cone}\,\Omega=\{tx\mid t>0,\,x\in{\rm conv}\,\Omega\}. The polar to a cone KXK\subset X is K:={xXx,x0,xK}K^{*}:=\{x^{*}\in X^{*}\mid\langle x^{*},x\rangle\leq 0,\ \forall x\in K\}. If AA is a matrix, then we denote its transpose by ATA^{T}. The set of positive integers is denoted by \mathbb{N}.

The forthcoming subsection recalls the definitions of contingent cone and second-order tangent set.

2.1 Second-order tangent sets

Definition 1

(See, e.g., (Ruszczynski2006, , Definition 3.11)) A direction vv is called tangent to the set CXC\subset X at a point x¯C\bar{x}\in C if there exist sequences of points xkCx_{k}\in C and scalar τk>0\tau_{k}>0, kk\in\mathbb{N}, such that τk0\tau_{k}\rightarrow 0 and v=limk[τk1(xkx¯)].v=\lim\limits_{k\rightarrow\infty}\big{[}\tau_{k}^{-1}(x_{k}-\bar{x})\big{]}.

The set of all tangent directions to CC at a point x¯C\bar{x}\in C, denoted by TC(x¯)T_{C}(\bar{x}), is called the contingent cone or the Bouligand-Severi tangent cone (Mordukhovich_2006a, , Chapter 1) to CC at x¯\bar{x}. From the definition it follows that vTC(x¯)v\in T_{C}(\bar{x}) if and only if there exist a sequence {τk}\{\tau_{k}\} of positive scalars and a sequence of vectors {vk}\{v_{k}\} with τk0\tau_{k}\to 0 and vkvv_{k}\to v as kk\to\infty such that xk:=x¯+τkvkx_{k}:=\bar{x}+\tau_{k}v_{k} belongs to CC for all kk\in\mathbb{N}.

Definition 2

(See, e.g., (Ruszczynski2006, , Definition 3.41)) A vector ww is called a second order tangent direction to a set CXC\subset X at a point x¯C\bar{x}\in C and in a tangent direction vv, if there exist a sequence of scalars τk>0\tau_{k}>0 and a sequence of points xkCx^{k}\in C such that τk0\tau_{k}\rightarrow 0 and

w=limkxkx¯τkvτk22.w=\lim_{k\rightarrow\infty}\frac{x^{k}-\bar{x}-\tau_{k}v}{\frac{\tau_{k}^{2}}{2}}. (2.1)

The set of all second-order tangent directions to CC at a point x¯C\bar{x}\in C in a tangent direction vv, denoted by TC2(x¯,v)T_{C}^{2}(\bar{x},v), is said to be the second-order tangent set to CC at x¯\bar{x} in direction vv. Note that the equality (2.1) can be rewritten as

xk=x¯+τkv+τk22w+o(τk2).x^{k}=\bar{x}+\tau_{k}v+\frac{\tau_{k}^{2}}{2}w+\mathnormal{o}(\tau_{k}^{2}).

Thus, wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v) if and only if there exist a sequence {τk}\{\tau_{k}\} of positive scalars and a sequence of vectors {wk}\{w_{k}\} with τk0\tau_{k}\to 0 and wkww_{k}\to w as kk\to\infty such that xk:=x¯+τkv+τk22wkx_{k}:=\bar{x}+\tau_{k}v+\frac{\tau_{k}^{2}}{2}w_{k} belongs to CC for all kk\in\mathbb{N}.

In the next subsection, we recall the definition of the generalized polyhedral convex set from Bonnans_Shapiro_2000 and establish some auxiliary results.

2.2 Generalized polyhedral convex sets

Definition 3

(See (Bonnans_Shapiro_2000, , p. 133) and (Luan_Yao_Yen, , Definition 2.1)) A subset DXD\subset X is said to be a generalized polyhedral convex set if there exist xiXx_{i}^{*}\in X^{*}, αi,\alpha_{i}\in\mathbb{R}, i=1,2,,pi=1,2,...,p, and a closed affine subspace LXL\subset X, such that

D={xXxL,xi,xαi,i=1,2,,p}.\displaystyle D=\{x\in X\mid x\in L,\ \langle x_{i}^{*},x\rangle\leq\alpha_{i},\ i=1,2,...,p\}. (2.2)

If DD can be represented in the form of (2.2) with L=XL=X, then we say that it is a polyhedral convex set.

From Definition 3 it follows that every generalized polyhedral convex set is a closed set. If XX is finite-dimensional, a subset DXD\subset X is a generalized polyhedral convex set if and only if it is a polyhedral convex set; see (Luan_Yao_Yen, , p. 541).

Let DD be given as in (2.2). According to (Bonnans_Shapiro_2000, , Remark 2.196), there exists a continuous surjective linear mapping AA from XX to a Banach space YY and a vector yYy\in Y such that L={xXAx=y}L=\{x\in X\mid Ax=y\}. Hence,

D={xXAx=y,xi,xαi,i=1,2,,p}.\displaystyle D=\big{\{}x\in X\mid Ax=y,\ \langle x_{i}^{*},x\rangle\leq\alpha_{i},\ i=1,2,...,p\big{\}}. (2.3)

Put I={1,2,..,p}I=\{1,2,..,p\} and, for any xDx\in D, let I(x):={iIxi,x=αi}I(x):=\{i\in I\mid\langle x_{i}^{*},x\rangle=\alpha_{i}\}.

The first assertion of the next proposition can be found in Ban_Mordukhovich_Song_2011 . The second assertion extends the result in (Ruszczynski2006, , Lemma 3.43) to an infinite-dimensional spaces setting.

Proposition 1

Let DD be a generalized polyhedral convex set in a Banach space XX. The contingent cones and the second-order tangent sets to DD are represented as follows:

  • (i)

    TD(x¯)={vXAv=0,xi,v0,iI(x¯)}T_{D}(\bar{x})=\{v\in X\mid Av=0,\;\langle x_{i}^{*},v\rangle\leq 0,\;i\in I(\bar{x})\} for any x¯D\bar{x}\in D;

  • (ii)

    TD2(x¯,v)=TTD(x¯)(v)T^{2}_{D}(\bar{x},v)=T_{T_{D}(\bar{x})}(v) for any x¯D\bar{x}\in D and vTD(x¯)v\in T_{D}(\bar{x}).

Proof

(i) To show that

TD(x¯){vXAv=0,xi,v0,iI(x¯)},\displaystyle T_{D}(\bar{x})\subset\{v\in X\mid Av=0,\ \langle x_{i}^{*},v\rangle\leq 0,\ i\in I(\bar{x})\}, (2.4)

take any vTD(x¯)v\in T_{D}(\bar{x}). Let τk0\tau_{k}\downarrow 0 and vkvv_{k}\rightarrow v be such that x¯+τkvkD\bar{x}+\tau_{k}v_{k}\in D for kk\in\mathbb{N}. Then, we have A(x¯+τkvk)=yA(\bar{x}+\tau_{k}v_{k})=y and xi,x¯+τkvkαi\langle x_{i}^{*},\bar{x}+\tau_{k}v_{k}\rangle\leq\alpha_{i} for all iI.i\in I. This implies that

A(τkvk)=0andxi,τkvk0(iI(x¯),k).\displaystyle A(\tau_{k}v_{k})=0\ \;\mbox{and}\ \langle x_{i}^{*},\tau_{k}v_{k}\rangle\leq 0\ \;(\forall i\in I(\bar{x}),\ \forall k\in\mathbb{N}). (2.5)

From (2.5) we have

A(vk)=0andxi,vk0(iI(x¯),k).\displaystyle A(v_{k})=0\ \;\mbox{and}\ \;\langle x_{i}^{*},v_{k}\rangle\leq 0\ \;(\forall i\in I(\bar{x}),\ \forall k\in\mathbb{N}). (2.6)

Letting kk\rightarrow\infty, from (2.6) we get A(v)=0A(v)=0 and xi,v0\langle x_{i}^{*},v\rangle\leq 0 for any iI(x¯)i\in I(\bar{x}). In other words, vv belongs to the right-hand-side of (2.4). So, the inclusion (2.4) is valid. To prove the opposite inclusion, pick any vXv\in X satisfying Av=0Av=0 and xi,v0\langle x_{i}^{*},v\rangle\leq 0 for iI(x¯).i\in I(\bar{x}). Since x¯D\bar{x}\in D, one has Ax¯=yA\bar{x}=y, xi,x¯=αi\langle x_{i}^{*},\bar{x}\rangle=\alpha_{i} for iI(x¯)i\in I(\bar{x}), and xi,x¯<αi\langle x_{i}^{*},\bar{x}\rangle<\alpha_{i} for iII(x¯).i\in I\setminus I(\bar{x}). Hence, for all t>0t>0 small enough, one has A(x¯+tv)=y,xi,x¯+tvαiA(\bar{x}+tv)=y,\ \langle x_{i}^{*},\bar{x}+tv\rangle\leq\alpha_{i} for iI(x¯)i\in I(\bar{x}) and xi,x¯+tv<αi\langle x_{i}^{*},\bar{x}+tv\rangle<\alpha_{i} for iII(x¯).i\in I\setminus I(\bar{x}). So, x¯+tvD\bar{x}+tv\in D for all t>0t>0 small enough. It follows that vTD(x¯)v\in T_{D}(\bar{x}). Thus, assertion (i) is justified.

(ii) Fix any x¯D\bar{x}\in D and vTD(x¯)v\in T_{D}(\bar{x}). By assertion (i), Av=0Av=0 and xi,v0\langle x_{i}^{*},v\rangle\leq 0 for all iI(x¯)i\in I(\bar{x}). Moreover, since

TD(x¯)={uXAu=0,xi,u0,iI(x¯)},\displaystyle T_{D}(\bar{x})=\{u\in X\mid Au=0,\;\langle x_{i}^{*},u\rangle\leq 0,\;i\in I(\bar{x})\}, (2.7)

applying the same assertion we can compute the contingent cone to the generalized polyhedral convex set TD(x¯)T_{D}(\bar{x}) at vv as follows

TTD(x¯)(v)={uXAu=0,xi,u0,iI0(v)},\displaystyle T_{T_{D}(\bar{x})}(v)=\big{\{}u\in X\mid Au=0,\ \langle x_{i}^{*},u\rangle\leq 0,\ i\in I^{0}(v)\big{\}}, (2.8)

where I0(v):={iI(x¯)xi,v=0}I^{0}(v):=\{i\in I(\bar{x})\mid\langle x_{i}^{*},v\rangle=0\}. On one hand, for any fixed vector wTD2(x¯,v)w\in T^{2}_{D}(\bar{x},v), we can find sequences τk0\tau_{k}\downarrow 0 and wkww_{k}\rightarrow w such that

x¯+τkv+τk22wkD(k).\bar{x}+\tau_{k}v+\frac{\tau^{2}_{k}}{2}w_{k}\in D\quad(\forall k\in\mathbb{N}).

By (2.3), one has A(x¯+τkv+τk22wk)=yA(\bar{x}+\tau_{k}v+\frac{\tau^{2}_{k}}{2}w_{k})=y and xi,x¯+τkv+τk22wkαi,iI.\langle x_{i}^{*},\bar{x}+\tau_{k}v+\frac{\tau^{2}_{k}}{2}w_{k}\rangle\leq\alpha_{i},\ i\in I. As x¯D\bar{x}\in D and vTD(x¯)v\in T_{D}(\bar{x}), this yields

A(τk22wk)=0andxi,τk22wk0,iI0(v).\displaystyle A\Big{(}\frac{\tau^{2}_{k}}{2}w_{k}\Big{)}=0\ \mbox{and}\ \big{\langle}x_{i}^{*},\frac{\tau^{2}_{k}}{2}w_{k}\big{\rangle}\leq 0,\ \forall i\in I^{0}(v). (2.9)

Since τk>0\tau_{k}>0, (2.9) implies that A(wk)=0A\left(w_{k}\right)=0 and xi,wk0\langle x_{i}^{*},w_{k}\rangle\leq 0 for all iI0(v).i\in I^{0}(v). Letting kk\rightarrow\infty, we obtain A(w)=0A\left(w\right)=0 and xi,w0\langle x_{i}^{*},w\rangle\leq 0 for all iI0(v).i\in I^{0}(v). Therefore, by (2.8) we can assert that wTTD(x¯)(v).w\in T_{T_{D}(\bar{x})}(v). On the other hand, taking any wTTD(x¯)(v)w\in T_{T_{D}(\bar{x})}(v), from (2.8) one gets Aw=0Aw=0 and xi,w0\langle x_{i}^{*},w\rangle\leq 0 for all iI0(v).i\in I^{0}(v). By the definition of I0(v)I^{0}(v), we have xi,v=0\langle x_{i}^{*},v\rangle=0 for any iI0(v)i\in I^{0}(v) and xi,v<0\langle x_{i}^{*},v\rangle<0 for any iI(x¯)I0(v)i\in I(\bar{x})\setminus I^{0}(v). Moreover, since x¯D\bar{x}\in D, it holds that Ax¯=yA\bar{x}=y, xi,x¯=αi\langle x_{i}^{*},\bar{x}\rangle=\alpha_{i} for iI(x¯)i\in I(\bar{x}), and xi,x¯<αi\langle x_{i}^{*},\bar{x}\rangle<\alpha_{i} for iII(x¯).i\in I\setminus I(\bar{x}). So, for every t>0t>0 sufficiently small, one has A(x¯+tv+t22w)=y,xi,x¯+tv+t22wαiA(\bar{x}+tv+\frac{t^{2}}{2}w)=y,\ \langle x_{i}^{*},\bar{x}+tv+\frac{t^{2}}{2}w\rangle\leq\alpha_{i} for all iI0(v)i\in I^{0}(v) and xi,x¯+tv+t22w<αi\langle x_{i}^{*},\bar{x}+tv+\frac{t^{2}}{2}w\rangle<\alpha_{i} for all iII0(v).i\in I\setminus I^{0}(v). This yields x¯+tv+t22wD\bar{x}+tv+\frac{t^{2}}{2}w\in D for every t>0t>0 sufficiently small. Hence, wTD2(x¯,v).w\in T^{2}_{D}(\bar{x},v). We have thus proved the equality stated in assertion (ii). \hfill\Box

Remark 1

If DXD\subset X is a generalized polyhedral convex set then, for any x¯D\bar{x}\in D and vTD(x¯)v\in T_{D}(\bar{x}), one has TD(x¯)TD2(x¯,v)T_{D}(\bar{x})\subset T^{2}_{D}(\bar{x},v), and the inclusion can be strict. We can justify this observation by representing DD in the form (2.3) and applying some formulas established in the proof of Proposition 1. Indeed, since I0(v)I(x¯)I^{0}(v)\subset I(\bar{x}), from (2.7), (2.8), and the equality TD2(x¯,v)=TTD(x¯)(v)T^{2}_{D}(\bar{x},v)=T_{T_{D}(\bar{x})}(v), one can deduce that TD(x¯)TD2(x¯,v)T_{D}(\bar{x})\subset T^{2}_{D}(\bar{x},v). When I0(v)I^{0}(v) is a proper subset of I(x¯)I(\bar{x}), the last inclusion can be strict. To have an example, one can choose

D={x=(x1,x2)2x10,x20},D=\big{\{}x=(x_{1},x_{2})\in\mathbb{R}^{2}\mid x_{1}\geq 0,x_{2}\geq 0\big{\}},

x¯=(0,0)\bar{x}=(0,0), v=(1,0)v=(1,0), then use (2.8) and the equality TD2(x¯,v)=TTD(x¯)(v)T^{2}_{D}(\bar{x},v)=T_{T_{D}(\bar{x})}(v) to show that TD2(x¯,v)={w=(w1,w2)2w20}T^{2}_{D}(\bar{x},v)=\big{\{}w=(w_{1},w_{2})\in\mathbb{R}^{2}\mid w_{2}\geq 0\big{\}}, while

TD(x¯)={u=(u1,u2)2u10,u20}.T_{D}(\bar{x})=\big{\{}u=(u_{1},u_{2})\in\mathbb{R}^{2}\mid u_{1}\geq 0,u_{2}\geq 0\big{\}}.

As a preparation for getting optimality conditions based on the Fréchet second-order subdifferential, we now recall the later concept and some related constructions.

2.3 Constructions from generalized differentiation

Definition 4

(See (Mordukhovich_2006a, , p. 4 )) Let Ω\Omega be a nonempty subset of X.X. The Fréchet normal cone to Ω\Omega at xΩx\in\Omega is given by

N^Ω(x):={xXlim supuΩxx,uxux0},\displaystyle\widehat{N}_{\Omega}(x):=\Big{\{}x^{*}\in X^{*}\mid\limsup\limits_{u\xrightarrow{\Omega}x}\dfrac{\langle x^{*},u-x\rangle}{\|u-x\|}\leq 0\Big{\}},

where uΩxu\xrightarrow{\Omega}x means that uxu\rightarrow x and uΩu\in\Omega. If xΩx\not\in\Omega, we put N^Ω(x)=\widehat{N}_{\Omega}(x)=\emptyset.

If Ω\Omega is convex, one has

N^Ω(x)=NΩ(x):={xXx,ux0,uΩ},\widehat{N}_{\Omega}(x)=N_{\Omega}(x):=\big{\{}x^{*}\in X^{*}\mid\langle x^{*},u-x\rangle\leq 0,\ \forall u\in\Omega\big{\}},

i.e., N^Ω(x)\widehat{N}_{\Omega}(x) coincides with the normal cone in the sense of convex analysis. In that case, [TΩ(x)]=NΩ(x)[T_{\Omega}(x)]^{*}=N_{\Omega}(x) and [NΩ(x)]=TΩ(x)[N_{\Omega}(x)]^{*}=T_{\Omega}(x), where

[NΩ(x)]:={xXx,x0,xNΩ(x)}.[N_{\Omega}(x)]^{*}:=\{x\in X\mid\langle x^{*},x\rangle\leq 0,\ \forall x^{*}\in N_{\Omega}(x)\}.

Given a set-valued map F:XYF:X\rightrightarrows Y between Banach spaces, one defines the graph of FF by gphF={(x,y)X×YyF(x)}.{\rm{gph}}\,F=\{(x,y)\in X\times Y\mid y\in F(x)\}. The product space X×YX\times Y is equipped with the norm (x,y):=x+y\|(x,y)\|:=\|x\|+\|y\|.

Definition 5

(See (Mordukhovich_2006a, , p. 40)) The Fréchet coderivative of FF at z¯=(x¯,y¯)\bar{z}=(\bar{x},\bar{y}) in gphF{\rm{gph}}\,F is the multifunction D^F(x¯,y¯):YX\widehat{D}^{*}F(\bar{x},\bar{y}):Y^{*}\rightrightarrows X^{*} given by

D^F(z¯)(y)={xX(x,y)N^gphF(z¯)},yY.\displaystyle\widehat{D}^{*}F(\bar{z})(y^{*})=\!\left\{x^{*}\in X^{*}\mid(x^{*},-y^{*})\!\in\!\widehat{N}_{{\rm gph}\,F}(\bar{z})\right\},\,\forall y^{*}\in Y^{*}.

If (x¯,y¯)gphF(\bar{x},\bar{y})\notin{\rm{gph}}\,F, one puts D^F(z¯)(y)=\widehat{D}^{*}F(\bar{z})(y^{*})=\emptyset for any yYy^{*}\in Y^{*}.

If F(x)={f(x)}F(x)=\{f(x)\} for all xXx\in X, where f:XYf:X\to Y is a single-valued map, we will write D^f(x¯)(y)\widehat{D}f(\bar{x})(y^{*}) instead of D^F(x¯,f(x¯))(y)\widehat{D}^{*}F(\bar{x},f(\bar{x}))(y^{*}).

Proposition 2

(See (Mordukhovich_2006a, , Theorem 1.38)) Let f:XYf:X\rightarrow Y be a Fréchet differentiable function at x¯\bar{x}. Then D^f(x¯)(y)={f(x¯)y}\widehat{D}f(\bar{x})(y^{*})=\{\nabla f(\bar{x})^{*}y^{*}\} for every yY,y^{*}\in Y^{*}, where f(x¯)\nabla f(\bar{x})^{*} is the adjoint operator of f(x¯).\nabla f(\bar{x}).

Consider a function f:X¯f:X\rightarrow\overline{\mathbb{R}}, where ¯=[,+]\overline{\mathbb{R}}=[-\infty,+\infty] is the extended real line. The epigraph of ff is given by epif={(x,α)X×αf(x)}.{\rm{epi}}\,f=\{(x,\alpha)\in X\times\mathbb{R}\mid\alpha\geq f(x)\}.

Definition 6

(See (Mordukhovich_2006a, , Chapter 1)) Let f:X¯f:X\rightarrow\overline{\mathbb{R}} be a function defined on a Banach space. Suppose that x¯X\bar{x}\in X and |f(x¯)|<.|f(\bar{x})|<\infty. One calls the set

^f(x¯):={xX(x,1)N^epif((x¯,f(x¯)))}\displaystyle\widehat{\partial}f(\bar{x}):=\left\{x^{*}\in X^{*}\mid(x^{*},-1)\in\widehat{N}_{{\rm epi}\,f}((\bar{x},f(\bar{x})))\right\}

the Fréchet subdifferential of ff at x¯\bar{x}. If |f(x¯)|=|f(\bar{x})|=\infty, one puts ^f(x¯)=\widehat{\partial}f(\bar{x})=\emptyset.

Definition 7

(See (Mordukhovich_2006a, , p. 122)) Let f:X¯f:X\rightarrow\overline{\mathbb{R}} be a function with a finite value at x¯.\bar{x}. For any y¯^f(x¯)\bar{y}\in\widehat{\partial}f(\bar{x}), the map ^2f(x¯,y¯):XX\widehat{\partial}^{2}f(\bar{x},\bar{y}):X^{**}\rightrightarrows X^{*} with the values

^2f(x¯,y¯)(u):=(D^^f)(x¯,y¯)(u)(uX)\displaystyle\widehat{\partial}^{2}f(\bar{x},\bar{y})(u):=(\widehat{D}^{*}\widehat{\partial}f)(\bar{x},\bar{y})(u)\quad(u\in X^{**})

is said to be the Fréchet second-order subdifferential of ff at x¯\bar{x} relative to y¯.\bar{y}.

If ^f(x¯)\widehat{\partial}f(\bar{x}) is a singleton, the symbol y¯\bar{y} in the notation ^2f(x¯,y¯)(u)\widehat{\partial}^{2}f(\bar{x},\bar{y})(u) will be omitted. If f:X¯f:X\rightarrow\overline{\mathbb{R}} is Fréchet differentiable in an open neighborhood of x¯\bar{x}, then ^f(x¯)={f(x¯)}\widehat{\partial}f(\bar{x})=\{\nabla f(\bar{x})\}. Moreover, if the operator f:XX\nabla f:X\rightarrow X^{*} is Fréchet differentiable at x¯\bar{x} with the second-order derivative 2f(x¯):=(f())(x¯)\nabla^{2}f(\bar{x}):=\nabla(\nabla f(\cdot))(\bar{x}), then 2f(x¯)\nabla^{2}f(\bar{x}) maps XX^{**} to XX^{*}. By Proposition 2, ^2f(x¯)(u)={2f(x¯)u}\widehat{\partial}^{2}f(\bar{x})(u)=\{\nabla^{2}f(\bar{x})^{*}u\} for every uXu\in X^{**}. When XX is finite-dimensional and ff is C2C^{2}-smooth in an open neighborhood of x¯\bar{x}, then 2f(x¯)\nabla^{2}f(\bar{x}) is identified with the Hessian matrix of ff at x¯\bar{x} for which one has 2f(x¯)=2f(x¯)\nabla^{2}f(\bar{x})^{*}=\nabla^{2}f(\bar{x}) by Clairaut’s rule.

The forthcoming subsection presents two lemmas which will be used repeatedly in the sequel.

2.4 Auxiliary results

Lemma 1

Let C={xXAx=y,xi,xαi,i=1,2,,p},C=\big{\{}x\in X\mid Ax=y,\ \langle x_{i}^{*},x\rangle\leq\alpha_{i},\ i=1,2,...,p\big{\}}, where AA, yy, xi,x_{i}^{*}, and αi\alpha_{i} for i=1,,pi=1,\dots,p are the same as in (2.3), be a generalized polyhedral convex set. For any vTC(x¯)v\in T_{C}(\bar{x}) with vTC(x¯)-v\in T_{C}(\bar{x}), it holds that

TC2(x¯,v)=TC2(x¯,v).\displaystyle T^{2}_{C}(\bar{x},-v)=T^{2}_{C}(\bar{x},v). (2.10)
Proof

By Proposition 1, TC2(x¯,v)=TTC(x¯)(v)T_{C}^{2}(\bar{x},v)=T_{T_{C}(\bar{x})}(v) and TC2(x¯,v)=TTC(x¯)(v).T_{C}^{2}(\bar{x},-v)=T_{T_{C}(\bar{x})}(-v). Moreover, one has TTC(x¯)(v)=[NTC(x¯)(v)]T_{T_{C}(\bar{x})}(v)=[N_{T_{C}(\bar{x})}(v)]^{*} and TTC(x¯)(v)=[NTC(x¯)(v)].T_{T_{C}(\bar{x})}(-v)=[N_{T_{C}(\bar{x})}(-v)]^{*}. Therefore,

TC2(x¯,v)=[NTC(x¯)(v)]andTC2(x¯,v)=[NTC(x¯)(v)].\displaystyle T_{C}^{2}(\bar{x},v)=[N_{T_{C}(\bar{x})}(v)]^{*}\quad{\rm and}\quad T_{C}^{2}(\bar{x},-v)=[N_{T_{C}(\bar{x})}(-v)]^{*}. (2.11)

On one hand, by (Luan_Yao_Yen, , Proposition 4.2), NC(x¯)=cone{xiiI(x¯)}+(kerA),N_{C}(\bar{x})={\rm cone}\,\big{\{}x_{i}^{*}\mid i\in I(\bar{x})\big{\}}+({\rm ker}\,A)^{\intercal}, where I(x¯)={iIxi,x¯=αi}I(\bar{x})=\{i\in I\mid\langle x_{i}^{*},\bar{x}\rangle=\alpha_{i}\} and

(kerA)={xXx,x=0,xkerA}.({\rm ker}\,A)^{\intercal}=\{x^{*}\in X^{*}\mid\langle x^{*},x\rangle=0,\ \forall x\in{\rm ker}\,A\}.

On the other hand, according to Proposition 1,

TC(x¯)={vXAv=0,xi,v0,iI(x¯)}.\displaystyle T_{C}(\bar{x})=\{v\in X\mid Av=0,\ \langle x_{i}^{*},v\rangle\leq 0,\ i\in I(\bar{x})\}.

So, vTC(x¯)v\in T_{C}(\bar{x}) and vTC(x¯)-v\in T_{C}(\bar{x}) if and only if Av=0,Av=0, xi,v0,\langle x_{i}^{*},v\rangle\leq 0, and xi,v0\langle x_{i}^{*},-v\rangle\leq 0 for all iI(x¯).i\in I(\bar{x}). This means that Av=0Av=0 and xi,v=0\langle x_{i}^{*},v\rangle=0 for all iI(x¯).i\in I(\bar{x}). Putting I0(u)={iI(x¯)xi,u=0}I^{0}(u)=\{i\in I(\bar{x})\mid\langle x_{i}^{*},u\rangle=0\} for every uTC(x¯),u\in T_{C}(\bar{x}), we see that I0(v)=I(x¯)=I0(v)I^{0}(v)=I(\bar{x})=I^{0}(-v). So, thanks to (Luan_Yao_Yen, , Proposition 4.2), we have

NTC(x¯)(v)=cone{xiiI0(v)}+(kerA)\displaystyle N_{T_{C}(\bar{x})}(v)={\rm cone}\,\{x_{i}^{*}\mid i\in I^{0}(v)\}+({\rm ker}\,A)^{\intercal}

and NTC(x¯)(v)=cone{xiiI0(v)}+(kerA)N_{T_{C}(\bar{x})}(-v)={\rm cone}\,\{x_{i}^{*}\mid i\in I^{0}(v)\}+({\rm ker}\,A)^{\intercal}. Thus, by (2.11) we get

TC2(x¯,v)=[NTC(x¯)(v)]=[NTC(x¯)(v)]=TC2(x¯,v).\displaystyle T_{C}^{2}(\bar{x},-v)=[N_{T_{C}(\bar{x})}(-v)]^{*}=[N_{T_{C}(\bar{x})}(v)]^{*}=T_{C}^{2}(\bar{x},v).

This justifies (2.10) and completes the proof. \hfill\Box

Consider the problem

min{f(x)xC},\min\{f(x)\mid x\in C\}, (P)

where f:Xf:X\rightarrow\mathbb{R} is a Fréchet differentiable function and CC is a nonempty subset of XX.

Lemma 2

Suppose that x¯\bar{x} is a local minimum of (P), where CC is a generalized polyhedral convex set. Then, f(x¯),v0\langle\nabla f(\bar{x}),v\rangle\geq 0 for every vTC(x¯).v\in T_{C}(\bar{x}). Moreover, if vTC(x¯)v\in T_{C}(\bar{x}) is such that f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0, then

f(x¯),w0for allwTC2(x¯,v).\displaystyle\langle\nabla f(\bar{x}),w\rangle\geq 0\ \;\mbox{for all}\ \,w\in T_{C}^{2}(\bar{x},v). (2.12)
Proof

The first assertion is a special case of the result recalled in Theorem 3.1 below. Let vTC(x¯)v\in T_{C}(\bar{x}) be such that f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0. To get (2.12), fix any wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v). By Proposition 1 we have TC2(x¯,v)=TTC(x¯)(v).T_{C}^{2}(\bar{x},v)=T_{T_{C}(\bar{x})}(v). Moreover, since CC is a generalized polyhedral convex set, TC(x¯)T_{C}(\bar{x}) is a generalized polyhedral convex cone by (Luan_Yao_Yen, , Proposition 2.22). So, applying (Luan_Yao_Yen, , Proposition 2.22), one has TTC(x¯)(v)=cone(TC(x¯)v).T_{T_{C}(\bar{x})}(v)={\rm cone}\,(T_{C}(\bar{x})-v). Thus, the representation w=λ(vv)w=\lambda(v^{\prime}-v) holds for some vTC(x¯)v^{\prime}\in T_{C}(\bar{x}) and λ>0\lambda>0. Therefore,

f(x¯),w=λf(x¯),vλf(x¯),v.\langle\nabla f(\bar{x}),w\rangle=\lambda\langle\nabla f(\bar{x}),v^{\prime}\rangle-\lambda\langle\nabla f(\bar{x}),v\rangle.

As f(x¯),v0\langle\nabla f(\bar{x}),v^{\prime}\rangle\geq 0 for any vTC(x¯)v^{\prime}\in T_{C}(\bar{x}) by the first assertion and f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0 by our assumption, this implies (2.12). \hfill\Box

3 Problems in the classical setting

In this section, we focus on second-order optimality conditions for problem (P) under the assumption that ff is twice continuously differentiable on XX (i.e., ff is a C2C^{2}-smooth function). By abuse of terminology, we call this (P) a problem in the classical setting.

The next first-order and second-order necessary optimality conditions are known results. The proofs in a finite-dimensional setting given in (Ruszczynski2006, , p. 114 and p. 144) are also valid for the infinite-dimensional setting adopted in the present paper. For the first statement, it suffices to assume that ff is Fréchet differentiable at x¯\bar{x}.

Theorem 3.1

(See, e.g., (Ruszczynski2006, , Theorem 3.24)) If x¯\bar{x} is a local minimum of (P), then

f(x¯),v0for allvTC(x¯).\displaystyle\langle\nabla f(\bar{x}),v\rangle\geq 0\ \;\mbox{for all}\ \,v\in T_{C}(\bar{x}). (3.1)
Theorem 3.2

(See, e.g., (Ruszczynski2006, , Theorem 3.45)) Assume that x¯\bar{x} is a local minimum of (P). Then (3.1) holds and, for every vTC(x¯)v\in T_{C}(\bar{x}) satisfying f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0, one has

f(x¯),w+2f(x¯)v,v0for allwTC2(x¯,v).\displaystyle\langle\nabla f(\bar{x}),w\rangle+\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0\ \;\mbox{for all}\ \,w\in T_{C}^{2}(\bar{x},v). (3.2)

Clearly, the simultaneous fulfillment of the inequalities f(x¯),w0\langle\nabla f(\bar{x}),w\rangle\geq 0 and 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 yields the inequality f(x¯),w+2f(x¯)v,v0\langle\nabla f(\bar{x}),w\rangle+\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 in (3.2). Hence, it is reasonable to raise the next question.

Question 1: When Theorem 3.2 can be stated in the following stronger form: “If x¯\bar{x} is a local minimum of (P), then (3.1) holds and the conditions

(c1)

f(x¯),w0\langle\nabla f(\bar{x}),w\rangle\geq 0 for all wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v), where vTC(x¯)v\in T_{C}(\bar{x}) is such that f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0 (i.e., vv is a critical direction),

(c2)

2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 for all vTC(x¯)v\in T_{C}(\bar{x}) satisfying f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0

are fulfilled.”?

If CC is a generalized polyhedral convex set, we can answer the above question as follows.

Theorem 3.3

Let CC be a generalized polyhedral convex set in a Banach space XX. If x¯\bar{x} is a local minimum of (P), then (3.1) holds and the conditions (c1) and (c2) are fulfilled.

Proof

To obtain (c1), pick an arbitrary vector wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v), where vTC(x¯)v\in T_{C}(\bar{x}) and f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0. Applying Lemma 2, we have f(x¯),w0\langle\nabla f(\bar{x}),w\rangle\geq 0.

To prove (c2), take any vTC(x¯)v\in T_{C}(\bar{x}) with f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0. If v=0v=0, then the inequality 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 is obvious. Now, assume that v0v\neq 0. On one hand, since CC is a generalized polyhedral convex set, Proposition 2.22 from Luan_Yao_Yen guarantees that

TC(x¯)=cone(Cx)={λ(xx¯)λ>0,xC}.\displaystyle T_{C}(\bar{x})={\rm cone}\,(C-x)=\{\lambda(x-\bar{x})\mid\lambda>0,\ x\in C\}.

Hence, we have v=λ0(yx¯)v=\lambda_{0}(y-\bar{x}) for some yCy\in C, yx¯y\not=\bar{x}, and λ0>0\lambda_{0}>0. On the other hand, as x¯\bar{x} is a local minimum of (P), there exists ε>0\varepsilon>0 such that f(x¯)f(x)f(\bar{x})\leq f(x) for every xCx\in C with xx¯ε.||x-\bar{x}||\leq\varepsilon. Put λ¯=min{λ0,ε(λ0yx¯)1}\bar{\lambda}=\min\{\lambda_{0},\varepsilon(\lambda_{0}||y-\bar{x}||)^{-1}\}. Then, λ¯>0\bar{\lambda}>0 and we have x¯+λvC\bar{x}+\lambda v\in C and (x¯+λv)x¯ε||(\bar{x}+\lambda v)-\bar{x}||\leq\varepsilon for all λ(0,λ¯]\lambda\in(0,\bar{\lambda}]. Therefore,

f(x¯)f(x¯+λv)\displaystyle f(\bar{x})\leq f(\bar{x}+\lambda v) =f(x¯)+λf(x¯),v+λ222f(x¯)v,v+o(λ2)\displaystyle=f(\bar{x})+\lambda\langle\nabla f(\bar{x}),v\rangle+\frac{\lambda^{2}}{2}\langle\nabla^{2}f(\bar{x})v,v\rangle+o(\lambda^{2})
=f(x¯)+λ222f(x¯)v,v+o(λ2).\displaystyle=f(\bar{x})+\frac{\lambda^{2}}{2}\langle\nabla^{2}f(\bar{x})v,v\rangle+o(\lambda^{2}).

It follows that λ222f(x¯)v,v+o(λ2)0\frac{\lambda^{2}}{2}\langle\nabla^{2}f(\bar{x})v,v\rangle+o(\lambda^{2})\geq 0 for all λ(0,λ¯]\lambda\in(0,\bar{\lambda}]. Dividing both sides of the last inequality by λ22\frac{\lambda^{2}}{2} and taking the limit as λ0+\lambda\to 0^{+}, we get 2f(x¯)v,v0,\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0, as desired. \hfill\Box

Remark 2

In the setting of Theorem 3.3, one has TC(x¯)TC2(x¯,v)T_{C}(\bar{x})\subset T^{2}_{C}(\bar{x},v) for any vTC(x¯)v\in T_{C}(\bar{x}). Since the inclusion of sets can be strict (see Remark 1), the property (c1) asserted by Theorem 3.3 is more stringent than the first-order necessary condition in (3.1) which reads as follows: f(x¯),u0\langle\nabla f(\bar{x}),u\rangle\geq 0 for every uTC(x¯)u\in T_{C}(\bar{x}).

As an application of Theorem 3.3, we now specialize it to the case of quadratic programming problems on Banach spaces with generalized polyhedral convex constraint sets. Note that the later problems have been considered, for example, in Bonnans_Shapiro_2000 and Yen_Yang_2018 . One calls (P) a quadratic programming problem on a generalized polyhedral convex set if CXC\subset X is a generalized polyhedral convex set and f(x)=12Mx,x+q,x+αf(x)=\frac{1}{2}\langle Mx,x\rangle+\langle q,x\rangle+\alpha, where M:XXM:X\to X^{*} is a bounded linear operator, qXq\in X^{*}, and α\alpha\in\mathbb{R}. It is assumed that MM is symmetric in the sense that Mx,y=My,x\langle Mx,y\rangle=\langle My,x\rangle for all x,yXx,y\in X. Since f(x)=Mx+q\nabla f(x)=Mx+q and 2f(x)v=Mv\nabla^{2}f(x)v=Mv for all x,vXx,v\in X, the next statement follows directly from Theorem 3.3.

Theorem 3.4

Assume that (P) be a quadratic programming problem given by a generalized polyhedral convex set CXC\subset X and a linear-quadratic function f(x)=12Mx,x+q,x+αf(x)=\frac{1}{2}\langle Mx,x\rangle+\langle q,x\rangle+\alpha with MM being symmetric. If x¯\bar{x} is a local minimum of this problem (P), then the following conditions are satisfied:

(c0)

Mx¯+q,v0\langle M\bar{x}+q,v\rangle\geq 0 for all vTC(x¯)v\in T_{C}(\bar{x});

(c1’)

Mx¯+q,w0\langle M\bar{x}+q,w\rangle\geq 0 for all wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v), where vTC(x¯)v\in T_{C}(\bar{x}) is such that Mx¯+q,v=0\langle M\bar{x}+q,v\rangle=0,

(c2’)

Mv,v0\langle Mv,v\rangle\geq 0 for all vTC(x¯)v\in T_{C}(\bar{x}) satisfying Mx¯+q,v=0\langle M\bar{x}+q,v\rangle=0.

According to the Majthay-Contesse theorem (see (Lee_Tam_Yen, , Theorem 3.4)), second-order necessary optimality conditions for finite-dimensional quadratic programs are also sufficient ones. Thus, it is of interest to know whether a similar assertion remains true for the second-order necessary optimality conditions in Theorem 3.4, or not.

Question 2: Under the assumptions of Theorem 3.4, if x¯C\bar{x}\in C is such that the conditions (c0), (c1’), and (c2’) are fulfilled, then x¯\bar{x} is a local minimum of (P)?

Turning our attention back to Theorem 3.3, observe that if CC is not a generalized polyhedral convex set, then the assertions of that theorem may not hold anymore. This means that, in general, the pair of conditions (c1) and (c2) is much stronger than condition (3.2).

To clarify the above observation, we first consider an example where CC is a compact convex set in 2\mathbb{R}^{2}, which is given by a simple inequality.

Example 1

(See (LVD2014, , Example 2, p. 20)) Consider problem (P) where X=2X=\mathbb{R}^{2}, f(x)=2x12x22f(x)=-2x_{1}^{2}-x_{2}^{2} for all x=(x1,x2)x=(x_{1},x_{2}), and

C={x=(x1,x2)g(x)=2x12+3x2260}.C=\big{\{}x=(x_{1},x_{2})\mid g(x)=2x_{1}^{2}+3x_{2}^{2}-6\leq 0\big{\}}.

Since ff is continuous and CC is compact, (P) has a global solution. As ff is Fréchet differentiable, by a well known necessary optimality condition (see the proof of Theorem 5.1 in Mordukhovich_2006b ) which is a dual form of the condition recalled in Theorem 3.1, if x¯=(x¯1,x¯2)\bar{x}=(\bar{x}_{1},\bar{x}_{2}) is a solution of (P) then

0f(x¯)+N^C(x¯).\displaystyle 0\in\nabla f(\bar{x})+\widehat{N}_{C}(\bar{x}). (3.3)

On one hand, f(x¯)=(4x¯1,2x¯2)T\nabla f(\bar{x})=(-4\bar{x}_{1},-2\bar{x}_{2})^{T}. On the other hand, as CC is a convex set, N^C(x¯)\widehat{N}_{C}(\bar{x}) coincides with the normal cone to CC at x¯\bar{x} in the sense of convex analysis. Hence, by (IoffeTihomirov, , p. 206) we have N^C(x¯)={λg(x¯)=λ(4x¯1,6x¯2)Tλ0}\widehat{N}_{C}(\bar{x})=\{\lambda\nabla g(\bar{x})=\lambda(4\bar{x}_{1},6\bar{x}_{2})^{T}\mid\lambda\geq 0\} whenever x¯C\bar{x}\in\partial C. Therefore, if x¯C\bar{x}\in\partial C, then (3.3) is equivalent to the existence of λ0\lambda\geq 0 satisfying

{4x¯1+4λx¯1=02x¯2+6λx¯2=0.\begin{cases}-4\bar{x}_{1}+4\lambda\bar{x}_{1}=0\\ -2\bar{x}_{2}+6\lambda\bar{x}_{2}=0.\end{cases}

From this condition, we get four critical points x¯1=(3,0)T\bar{x}^{1}=(\sqrt{3},0)^{T}, x¯2=(3,0)T\bar{x}^{2}=(-\sqrt{3},0)^{T}, x¯3=(0,2)T\bar{x}^{3}=(0,-\sqrt{2})^{T}, x¯4=(0,2)T\bar{x}^{4}=(0,\sqrt{2})^{T}. If x¯intC\bar{x}\in{\rm int}C, then (3.3) is equivalent to the condition f(x¯)=0\nabla f(\bar{x})=0, which gives the fifth critical point x¯5=(0,0)T\bar{x}^{5}=(0,0)^{T}. Comparing the values of ff at these five points, we conclude that x¯1=(3,0)T\bar{x}^{1}=(\sqrt{3},0)^{T} and x¯2=(3,0)T\bar{x}^{2}=(-\sqrt{3},0)^{T} are the global minima of (P). Obviously, there exists x02x^{0}\in\mathbb{R}^{2} such that g(x¯1),x0<0.\langle\nabla g(\bar{x}^{1}),x^{0}\rangle<0. This means that the regularity condition in (Ruszczynski2006, , Lemma 3.16) is satisfied. So, according to (Ruszczynski2006, , formula (3.29), p. 115), one has

TC(x¯1)\displaystyle T_{C}(\bar{x}^{1}) ={v2g(x¯1),v0}\displaystyle=\{v\in\mathbb{R}^{2}\mid\langle\nabla g(\bar{x}^{1}),v\rangle\leq 0\}
={v=(v1,v2)2v10,v2}.\displaystyle=\{v=(v_{1},v_{2})\in\mathbb{R}^{2}\mid v_{1}\leq 0,\ v_{2}\in\mathbb{R}\}.

Since f(x¯1)=(43,0)T\nabla f(\bar{x}^{1})=\left(-4\sqrt{3},0\right)^{T}, fixing any v=(0,v2)TTC(x¯1)v=(0,v_{2})^{T}\in T_{C}(\bar{x}^{1}), we have f(x¯1),v=0\langle\nabla f(\bar{x}^{1}),v\rangle=0. Moreover, by (Ruszczynski2006, , Lemma 3.44),

TC2(x¯1,v)\displaystyle T^{2}_{C}(\bar{x}^{1},v) ={w=(w1,w2)2g(x¯1),w2g(x¯1)v,v}\displaystyle=\{w=(w_{1},w_{2})\in\mathbb{R}^{2}\mid\langle\nabla g(\bar{x}^{1}),w\rangle\leq-\langle\nabla^{2}g(\bar{x}^{1})v,v\rangle\}
={w=(w1,w2)2w16v2243}.\displaystyle=\Big{\{}w=(w_{1},w_{2})\in\mathbb{R}^{2}\mid w_{1}\leq\dfrac{-6v_{2}^{2}}{4\sqrt{3}}\Big{\}}.

It follows that f(x¯1),w=43w10\langle\nabla f(\bar{x}^{1}),w\rangle=-4\sqrt{3}w_{1}\geq 0 for every wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v). Hence, condition (c1) in Theorem 3.3 is satisfied. Since 2f(x¯1)v,v=2v22\langle\nabla^{2}f(\bar{x}^{1})v,v\rangle=-2v_{2}^{2}, the requirement 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 in condition (c2) is violated if v20v_{2}\neq 0. Thus, the pair of conditions (c1) and (c2) does not hold, while condition (3.2) is fulfilled.

Next, let us consider an example where CC is a nonconvex compact set given by an equality.

Example 2

(See (LVD2014, , Example 1, p. 29)) Consider problem (P) and suppose that f(x)=x12x22f(x)=-x_{1}^{2}-x_{2}^{2} for x=(x1,x2)2x=(x_{1},x_{2})\in\mathbb{R}^{2},

C={x=(x1,x2)2h(x)=x12+2x221=0}.C=\big{\{}x=(x_{1},x_{2})\in\mathbb{R}^{2}\mid h(x)=x_{1}^{2}+2x_{2}^{2}-1=0\big{\}}.

As it has been shown in (LVD2014, , p. 29), x¯1=(1,0)T\bar{x}^{1}=(1,0)^{T} and x¯2=(1,0)T\bar{x}^{2}=(-1,0)^{T} are the global solutions of this problem. According to (Ruszczynski2006, , Formula (3.29), p. 115),

TC(x¯2)={v=(v1,v2)2v1=0}.\displaystyle T_{C}(\bar{x}^{2})=\{v=(v_{1},v_{2})\in\mathbb{R}^{2}\mid v_{1}=0\}.

Fixing any v=(0,v2)TTC(x¯2)v=(0,v_{2})^{T}\in T_{C}(\bar{x}^{2}), we have f(x¯2),v=0\langle\nabla f(\bar{x}^{2}),v\rangle=0. By (Ruszczynski2006, , Lemma 3.44),

TC2(x¯2,v)\displaystyle T^{2}_{C}(\bar{x}^{2},v) ={w=(w1,w2)2h(x¯2),w=2h(x¯2)v,v}\displaystyle=\{w=(w_{1},w_{2})\in\mathbb{R}^{2}\mid\langle\nabla h(\bar{x}^{2}),w\rangle=-\langle\nabla^{2}h(\bar{x}^{2})v,v\rangle\}
={w=(w1,w2)2w1=2v22}.\displaystyle=\{w=(w_{1},w_{2})\in\mathbb{R}^{2}\mid w_{1}=2v_{2}^{2}\}.

Since f(x¯2),w=2w1=4v220\langle\nabla f(\bar{x}^{2}),w\rangle=2w_{1}=4v_{2}^{2}\geq 0 for all wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v), condition (c1) in Theorem 3.3 is satisfied. Meanwhile, since 2f(x¯2)v,v=2v220\langle\nabla^{2}f(\bar{x}^{2})v,v\rangle=-2v_{2}^{2}\leq 0, the inequality 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0 in condition (c2) is violated if v20v_{2}\neq 0. Thus, the conditions (c1) and (c2) do not hold simultaneously, while condition (3.2) is fulfilled.

4 Problems in a new setting

The following second-order necessary optimality condition for (P) is one of the main results of this paper. It is based on the Fréchet second-order subdifferential of ff and the second-order tangent set to CC, which is assumed to be a convex set of a special type. Unlike the situation in Theorem 3.3 where ff was assumed to be a C2C^{2}-smooth function, in the next theorem and throughout this section we just assume that ff is a C1C^{1}-smooth function.

Theorem 4.1

(Second-order necessary optimality condition) Assume that x¯\bar{x} is a locally optimal solution of (P), where CC is a generalized polyhedral convex set. Suppose that there exists a constant >0\ell>0 such that

f(x)f(x¯)xx¯\displaystyle||\nabla f(x)-\nabla f(\bar{x})||\leq\ell||x-\bar{x}|| (4.1)

for every xx in some neighborhood of x¯\bar{x}. Consider the restricted second-order subdifferential ^2f(x¯):XX\widehat{\partial}^{2}f(\bar{x}):X\rightrightarrows X^{*}, where XX is canonically embedded in XX^{**}. Then, (3.1) is valid and, for each vTC(x¯)v\in T_{C}(\bar{x}) such that vTC(x¯)-v\in T_{C}(\bar{x}) and f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0, one has

f(x¯),w0\displaystyle\langle\nabla f(\bar{x}),w\rangle\geq 0 (4.2)

and

z,v0\displaystyle\langle z,v\rangle\geq 0 (4.3)

for any wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v) and z^2f(x¯)(v).z\in\widehat{\partial}^{2}f(\bar{x})(v).

Proof

Let x¯\bar{x} be such a locally optimal solution of (P) that (4.1) is valid for all xx in a neighborhood UU of x¯\bar{x}, where \ell is a positive constant. Let vTC(x¯)v\in T_{C}(\bar{x}) be such that vTC(x¯)-v\in T_{C}(\bar{x}) and f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0. Suppose that wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v) and z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v) are given arbitrarily. Since CC is a generalized polyhedral convex set, by Lemma 2 we have (4.2). It remains to prove (4.3). To obtain a contraction, suppose that

z,v<0.\displaystyle\langle z,v\rangle<0. (4.4)

By the definition of Fréchet second-order subdifferential, from z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v) we get zD^f()(x¯)(v)z\in\widehat{D}^{*}\nabla f(\cdot)(\bar{x})(v) or, equivalently, (z,v)N^gphf()((x¯,f(x¯))).(z,-v)\in\widehat{N}_{\textrm{gph}\nabla f(\cdot)}((\bar{x},\nabla f(\bar{x}))). So, one has

lim supxx¯(z,v),(x,f(x))(x¯,f(x¯))xx¯+f(x)f(x¯)0.\displaystyle\limsup\limits_{x\rightarrow\bar{x}}\frac{\langle(z,-v),\big{(}x,\nabla f(x)\big{)}-(\bar{x},\nabla f(\bar{x}))\rangle}{\|x-\bar{x}\|+\|\nabla f(x)-\nabla f(\bar{x})\|}\leq 0. (4.5)

Recall that every vector uXu\in X can be regarded as an element of XX^{**} by setting u,x=x,u\langle u,x^{*}\rangle=\langle x^{*},u\rangle for all xXx^{*}\in X^{*}. Hence u,f(x)=f(x),u\langle u,\nabla f(x)\rangle=\langle\nabla f(x),u\rangle for all u,xXu,x\in X. Since f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0, from (4.5) we obtain

lim supxx¯z,xx¯f(x),vxx¯+f(x)f(x¯)0.\displaystyle\limsup\limits_{x\rightarrow\bar{x}}\frac{\langle z,x-\bar{x}\rangle-\langle\nabla f(x),v\rangle}{\|x-\bar{x}\|+\|\nabla f(x)-\nabla f(\bar{x})\|}\leq 0. (4.6)

Moreover, as CC is a generalized polyhedral convex set, there exists k¯\bar{k}\in\mathbb{N} such that xk:=x¯1kvx^{k}:=\bar{x}-\frac{1}{k}v belongs to CC for all kk¯k\geq\bar{k}.

Since x¯\bar{x} is a local solution of (P) and limkxk=x¯\displaystyle\lim_{k\to\infty}x^{k}=\bar{x}, there is no loss of generality in assuming that

f(xk)f(x¯),kk¯.\displaystyle f(x^{k})\geq f(\bar{x}),\ \,\forall k\geq\bar{k}. (4.7)

For each kk¯k\geq\bar{k}, by the classical mean value theorem one can find a vector

ξk(x¯,xk):={(1τ)x¯+τxk|τ(0,1)}\xi^{k}\in(\bar{x},x^{k}):=\{(1-\tau)\bar{x}+\tau x^{k}\ |\ \tau\in(0,1)\}

such that f(xk)f(x¯)=f(ξk),xkx¯.f(x^{k})-f(\bar{x})=\langle\nabla f(\xi^{k}),x^{k}-\bar{x}\rangle. Since xk=x¯1kvx^{k}=\bar{x}-\frac{1}{k}v, combining this with (4.7) yields 1kf(ξk),v0.-\frac{1}{k}\langle\nabla f(\xi_{k}),v\rangle\geq 0. It follows that

f(ξk),v0(kk¯).\displaystyle\langle\nabla f(\xi_{k}),v\rangle\leq 0\quad(\forall k\geq\bar{k}). (4.8)

From (4.6) we can deduce that

lim supkz,ξkx¯f(ξk),vξkx¯+f(ξk)f(x¯)0.\displaystyle\limsup\limits_{{k}\rightarrow\infty}\frac{\langle z,\xi_{k}-\bar{x}\rangle-\langle\nabla f(\xi_{k}),v\rangle}{\|\xi_{k}-\bar{x}\|+\|\nabla f(\xi_{k})-\nabla f(\bar{x})\|}\leq 0.

Noting that ξk=x¯tkv\xi_{k}=\bar{x}-t_{k}v for some tk(0,1k)t_{k}\in\left(0,\frac{1}{k}\right), from this one gets

lim supkΔk0,\displaystyle\limsup\limits_{{k}\rightarrow\infty}\Delta_{k}\leq 0, (4.9)

where

Δk:=tkz,vf(ξk),vtkv||+f(ξk)f(x¯).\Delta_{k}:=\frac{-t_{k}\langle z,v\rangle-\langle\nabla f(\xi_{k}),v\rangle}{\|-t_{k}v||+\|\nabla f(\xi_{k})-\nabla f(\bar{x})\|}.

Clearly,

Δk=z,vtk1f(ξk),vv||+tk1f(ξk)f(x¯).\displaystyle\Delta_{k}=\frac{-\langle z,v\rangle-t_{k}^{-1}\langle\nabla f(\xi_{k}),v\rangle}{\|v||+t_{k}^{-1}\|\nabla f(\xi_{k})-\nabla f(\bar{x})\|}.

Hence, by (4.8) one has

Δkz,vv||+tk1f(ξk)f(x¯).\Delta_{k}\geq\frac{-\langle z,v\rangle}{\|v||+t_{k}^{-1}\|\nabla f(\xi_{k})-\nabla f(\bar{x})\|}.

On one hand, using (4.1) we obtain

f(ξk)f(x¯)ξkx¯=tkv,\displaystyle||\nabla f(\xi_{k})-\nabla f(\bar{x})||\leq\ell||\xi_{k}-\bar{x}||=\ell t_{k}||v||,

provided that kk is large enough. On the other hand, by virtue of (4.4) we have z,v>0-\langle z,v\rangle>0. Consequently, for large enough indexes kk, it holds that

Δkz,v(1+)v.\Delta_{k}\geq\frac{-\langle z,v\rangle}{(1+\ell)\|v\|}.

So, we get lim supkΔk>0\limsup\limits_{{k}\rightarrow\infty}\Delta_{k}>0, which contradicts (4.9).

The proof is complete. \hfill\Box

Remark 3

To compare Theorem 4.1 with Theorem 3.3, assume for a while that ff is C2C^{2}-smooth. Let x¯\bar{x} be a locally optimal solution of (P), where CC is a generalized polyhedral convex set. Then, applying the mean-value theorem for vector-valued functions (see (IoffeTihomirov, , p. 27)) to the gradient mapping f():XX\nabla f(\cdot):X\to X^{*}, one can show that there exists a constant >0\ell>0 such that (4.1) holds for every xx in some neighborhood of x¯\bar{x}. Since ^2f(x¯)(u)={2f(x¯)u}\widehat{\partial}^{2}f(\bar{x})(u)=\{\nabla^{2}f(\bar{x})^{*}u\} for every uu in the space XX, which is canonically embedded in XX^{**}, inequality (4.3) means that 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})^{*}v,v\rangle\geq 0. Hence, v,2f(x¯)v0\langle v,\nabla^{2}f(\bar{x})v\rangle\geq 0. By the definition of the canonical embedding of XX in XX^{**}, the latter means that 2f(x¯)v,v0\langle\nabla^{2}f(\bar{x})v,v\rangle\geq 0. Therefore, the assertions of Theorem 4.1 coincide with those of Theorem 3.3, provided that the critical direction vv satisfies the condition vTC(x¯)-v\in T_{C}(\bar{x}). Thus, in comparison with Theorem 3.3, although Theorem 4.1 helps us to treat optimization problems with objective functions from a larger class, it does not provide a complete extension for the former theorem.

When C=XC=X, (P) becomes the unconstrained optimization problem

min{f(x)xX}\min\{f(x)\mid x\in X\} (P1)

with f:Xf:X\rightarrow\mathbb{R} being a C1C^{1}-smooth function. From Theorem 4.1 one can easily derive the following second-order optimality condition for (P1), which is due to Chieu et al. ChieuLeeYen2017 .

Theorem 4.2

(See (ChieuLeeYen2017, , Theorem 3.3)) Suppose that x¯\bar{x} is a local solution of (P1) and there exists >0\ell>0 such that f(x)f(x¯)xx¯||\nabla f(x)-\nabla f(\bar{x})||\leq\ell||x-\bar{x}|| for every xx in some neighborhood of x¯\bar{x}. Then f(x¯)=0\nabla f(\bar{x})=0 and the second-order subdifferential ^2f(x¯):XX\widehat{\partial}^{2}f(\bar{x}):X\rightrightarrows X^{*}, where XX is canonically embedded in XX^{**}, is positive semi-definite, i.e., z,u0\langle z,u\rangle\geq 0 for any uXu\in X and z^2f(x¯)(u).z\in\widehat{\partial}^{2}f(\bar{x})(u).

Dai (LVD2014, , Chapter 3) has extended the finite-dimensional version of Theorem 4.2 to case of constrained C1C^{1}-smooth optimization problems of the form

min{f(x)h(x)=0}\min\{f(x)\mid h(x)=0\} (P2)

with h(x)=Ax+bh(x)=Ax+b, where Ap×nA\in\mathbb{R}^{p\times n} is a given matrix and bpb\in\mathbb{R}^{p} is a given vector. In this case, one has C={xnAx+b=0}C=\{x\in\mathbb{R}^{n}\mid Ax+b=0\}. Thus, CC is a special polyhedral convex set in n\mathbb{R}^{n}. The Lagrange function associated with (P2) is defined by setting L(x,μ)=f(x)+μ,h(x)L(x,\mu)=f(x)+\langle\mu,h(x)\rangle for (x,μ)n×p(x,\mu)\in\mathbb{R}^{n}\times\mathbb{R}^{p}.

Theorem 4.3

(See (LVD2014, , Theorem 3.3)) Suppose that x¯\bar{x} is a local solution of (P2) and μ¯p\bar{\mu}\in\mathbb{R}^{p} is a Lagrange multiplier corresponding to x¯\bar{x}, that is,

xL(x¯,μ¯)=f(x¯)+ATμ¯=0.\nabla_{x}L(\bar{x},\bar{\mu})=\nabla f(\bar{x})+A^{T}\bar{\mu}=0. (4.10)

Suppose that, in addition, there exists a constant >0\ell>0 and a neighborhood UU of x¯\bar{x} such that f(x)f(x¯)xx¯||\nabla f(x)-\nabla f(\bar{x})||\leq\ell||x-\bar{x}|| for all xUx\in U. Then, for any vnv\in\mathbb{R}^{n} with Av=0Av=0, one has z,v0\langle z,v\rangle\geq 0 for any z^2L(,μ¯)(x¯)(v)z\in\widehat{\partial}^{2}L(\cdot,\bar{\mu})(\bar{x})(v).

Theorem 4.1 is a generalization of Theorem 4.3. Indeed, the existence of μ¯p\bar{\mu}\in\mathbb{R}^{p} satisfying (4.10) follows from the necessary condition in (3.1) and Farkas’ Lemma (see, e.g., (Rockafellar_1970, , p. 200)). On one hand, since xL(x,μ)=f(x)+ATμ\nabla_{x}L(x,\mu)=\nabla f(x)+A^{T}\mu for every (x,μ)n×p(x,\mu)\in\mathbb{R}^{n}\times\mathbb{R}^{p}, one has ^2L(,μ¯)(x¯)()=^2f(x¯)()\widehat{\partial}^{2}L(\cdot,\bar{\mu})(\bar{x})(\cdot)=\widehat{\partial}^{2}f(\bar{x})(\cdot). Hence, the inclusion z^2L(,μ¯)(x¯)(v)z\in\widehat{\partial}^{2}L(\cdot,\bar{\mu})(\bar{x})(v) is equivalent to saying that z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v). On the other hand, as TC(x¯)={unAu=0}T_{C}(\bar{x})=\{u\in\mathbb{R}^{n}\mid Au=0\}, the condition Av=0Av=0 implies that vTC(x¯)v\in T_{C}(\bar{x}) and vTC(x¯)-v\in T_{C}(\bar{x}). Moreover, from (3.1) one deduces that f(x¯),v=0\langle\nabla f(\bar{x}),v\rangle=0. Therefore, its follows from (4.3) that z,v0\langle z,v\rangle\geq 0 for any z^2L(,μ¯)(x¯)(v)z\in\widehat{\partial}^{2}L(\cdot,\bar{\mu})(\bar{x})(v).

Theorem 4.1 asserts that inequality (4.3) holds for any z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v) if the critical direction vv satisfies the additional condition vTC(x¯)-v\in T_{C}(\bar{x}). The following example will show that the last condition is essential for the validity of the assertion.

Example 3

Let n=1n=1, C=+C=\mathbb{R}_{+}, g(x)=xg(x)=-x for x0x\leq 0 and g(x)=x2g(x)=x^{2} for x0x\geq 0. Define f(x)=0xg(t)𝑑tf(x)=\displaystyle\int_{0}^{x}g(t)dt for all xx\in\mathbb{R}, where the integration is Riemannian. Since g()g(\cdot) is continuous on \mathbb{R}, ff is a C1C^{1}-smooth function and f(x)=g(x)\nabla f(x)=g(x) for xx\in\mathbb{R}. Note that f(x)=12x2f(x)=-\frac{1}{2}x^{2} for x0x\leq 0, f(x)=13x3f(x)=\frac{1}{3}x^{3} for x0x\geq 0. Consider the point x¯:=0\bar{x}:=0, which is the unique global solution of (P). Clearly, ff satisfies condition (4.1) for every x(1,1)x\in(-1,1) with =1\ell=1. On one hand, by Proposition 1 we have TC(x¯)=+T_{C}(\bar{x})=\mathbb{R_{+}} and

TC2(x¯,v)=TTC(x¯)(v)={ifv>0,+ifv=0.\displaystyle T^{2}_{C}(\bar{x},v)=T_{T_{C}(\bar{x})}(v)=\begin{cases}\mathbb{R}&\mbox{if}\ v>0,\\ \mathbb{R_{+}}&\mbox{if}\ v=0.\end{cases}

On the other hand, using the definition of the second-order subdifferential, we have

z^2f(x¯)(v)zD^f()(x¯)(v)(z,v)N^gphf()((x¯,f(x¯)))lim supxx¯(z,v),(x,f(x))(x¯,f(x¯))|xx¯|+|f(x)f(x¯)|0.\displaystyle\begin{array}[]{rcl}z\in\widehat{\partial}^{2}f(\bar{x})(v)&\Leftrightarrow&z\in\widehat{D}^{*}\nabla f(\cdot)(\bar{x})(v)\\ &\ \Leftrightarrow&(z,-v)\in\widehat{N}_{\textrm{gph}\nabla f(\cdot)}((\bar{x},\nabla f(\bar{x})))\\ &\Leftrightarrow&\limsup\limits_{x\to\ \bar{x}}\dfrac{\langle(z,-v),(x,\nabla f(x))-(\bar{x},\nabla f(\bar{x}))\rangle}{|x-\bar{x}|+|\nabla f(x)-\nabla f(\bar{x})|}\leq 0.\end{array}

Since x¯=0\bar{x}=0 and f(x¯)=0\nabla f(\bar{x})=0, the last inequality is equivalent to

lim supx0zxvf(x)|x|+|f(x)|0.\displaystyle\limsup\limits_{x\to 0}\dfrac{zx-v\nabla f(x)}{|x|+|\nabla f(x)|}\leq 0. (4.11)

From (4.11) one has

0lim supx0+zxvx2x+x2=lim supx0+zvx1+x=z\displaystyle 0\geq\limsup\limits_{x\to 0^{+}}\dfrac{zx-vx^{2}}{x+x^{2}}=\limsup\limits_{x\to 0^{+}}\dfrac{z-vx}{1+x}=z

and

0lim supx0zx+vx2x=(z+v)2.\displaystyle 0\geq\limsup\limits_{x\to 0^{-}}\dfrac{zx+vx}{-2x}=\dfrac{-(z+v)}{2}.

It follows that

z0andz+v0.\displaystyle z\leq 0\quad\mbox{and}\quad z+v\geq 0. (4.12)

Conversely, if (4.12) is satisfied, then (4.11) holds. Consequently, the inclusion z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v) means that vz0-v\leq z\leq 0. So, choosing v=1v=1 and z=1z=-1, one has vTC(x¯)v\in T_{C}(\bar{x}), f(x¯)v=0\nabla f(\bar{x})v=0, and z^2f(x¯)(v)z\in\widehat{\partial}^{2}f(\bar{x})(v). Clearly, (4.2) holds for any wTC2(x¯,v)w\in T_{C}^{2}(\bar{x},v) because f(x¯)=0\nabla f(\bar{x})=0. However, (4.3) is violated as zv=1zv=-1. Note that vTC(x¯)-v\notin T_{C}(\bar{x}).

Acknowledgements. This research was supported by Vietnam Institute for Advanced Study in Mathematics (VIASM). Duong Thi Viet An was also supported by the Simons Foundation Grant Targeted for Institute of Mathematics, Vietnam Academy of Science and Technology.

References

  • (1) Ban, L., Mordukhovich, B.S., Song, W.: Lipschitzian stability of parametric variational inequalities over generalized polyhedra in Banach spaces. Nonlinear Anal. 74, 441–461 (2011)
  • (2) Ben-Tal, A.: Second-order and related extremality conditions in nonlinear programming. J. Optim. Theory Appl. 31, 143–165 (1980)
  • (3) Ben-Tal, A., Zowe, J.: Necessary and sufficient optimality conditions for a class of nonsmooth minimization problems. Math. Programming 24, 70–91 (1982)
  • (4) Bonnans, J.F., Shapiro, A.: Perturbation Analysis of Optimization Problems. Springer, New York (2000)
  • (5) Chieu, N.H., Chuong, T.D., Yao, J.-C., Yen, N.D.: Characterizing convexity of a function by its Fréchet and limiting second-order subdifferentials. Set-Valued Var. Anal. 19, 75–96 (2011)
  • (6) Chieu, N.H., Huy, N.Q.: Second-order subdifferentials and convexity of real-valued functions. Nonlinear Anal. 74, 154–160 (2011)
  • (7) Chieu, N.H., Lee, G.M., Yen, N.D.: Second-order subdifferentials and optimality conditions for C1C^{1}-smooth optimization problems. Appl. Anal. Optim. 1, 461–476 (2017)
  • (8) Dai, L.V.: Necessary and Sufficient Optimality Conditions with Lagrange Multipliers. Undergraduate Thesis, University of Science, Vietnam National University (2014)
  • (9) Hiriart-Urruty, J.-B., Strodiot, J.-J., Nguyen, V.H: Generalized Hessian matrix and second-order optimality conditions for problems with C1,1C^{1,1} data. Appl. Math. Optim. 11, 43–56 (1984)
  • (10) Ioffe, A.D., Tihomirov, V.M.: Theory of Extremal Problems. Amsterdam, North-Holland (1979)
  • (11) Huy, N.Q., Tuyen, N.V.: New second-order optimality conditions for a class of differentiable optimization problems. J. Optim. Theory Appl. 171, 27–44 (2016)
  • (12) Lee, G.M., Tam, N.N., Yen, N.D.: Quadratic Programming and Affine Variational Inequalities. A Qualitative Study. Springer-Verlag, New York (2005)
  • (13) Luan, N.N., Yao, J-.C.: Generalized polyhedral convex optimization problems. J. Global Optim. 75, 789–811 (2019)
  • (14) Luan, N.N., Yao, J-.C., Yen, N.D.: On some generalized polyhedral convex constructions. Numer. Funct. Anal. Optim. 39, 537–570 (2018)
  • (15) Luan, N.N., Yen, N.D.: A representation of generalized convex polyhedra and applications. Optimization 69, 471–492 (2020)
  • (16) Luenberger, D.G., Ye, Y.: Linear and Nonlinear Programming. Springer, New York (2008)
  • (17) McCormick, Garth P.: Second order conditions for constrained minima. SIAM J. Appl. Math. 15, 641–652 (1967)
  • (18) Mordukhovich, B.S.: Sensitivity analysis in nonsmooth optimization. In: Field, D.A., Komkov, V. (eds.) Theoretical Aspects of Industrial Design Field, pp. 32–46. SIAM, Philadelphia (1992)
  • (19) Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation, Volume I: Basic Theory. Springer, Berlin (2006)
  • (20) Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation, Volume II: Applications. Springer, Berlin (2006)
  • (21) Mordukhovich, B.S., Rockafellar, R.T.: Second-order subdifferential calculus with applications to tilt stability in optimization. SIAM J. Optim. 22, 953–986 (2012)
  • (22) Mordukhovich, B.S., Rockafellar, R.T., Sarabi, M.E.: Characterizations of full stability in constrained optimization. SIAM J. Optim. 23, 1810–1849 (2013)
  • (23) Penot, J-.P.: Optimality conditions in mathematical programming and composite optimization. Math. Programming 67, 225–245 (1994)
  • (24) Penot, J-.P.: Second-order conditions for optimization problems with constraints. SIAM J. Control Optim. 37, 303–318 (1999)
  • (25) Poliquin, R.A., Rockafellar, R.T.: Tilt stability of a local minimum. SIAM J. Optim. 8, 287–299 (1998)
  • (26) Polyak, B.T.: Introduction to Optimization. Revised version. Optimization Software, Inc., New York (2010)
  • (27) Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton, New Jersey (1970)
  • (28) Ruszczynski, A.: Nonlinear Optimization. Princeton University Press, New Jersey (2006)
  • (29) Yen, N.D., Yang, X.: Affine variational inequalities on normed spaces. J. Optim. Theory Appl. 178, 36–55 (2018)