This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Nguyen Duy Cuong 22institutetext: Department of Mathematics, College of Natural Sciences, Can Tho University, Can Tho, Vietnam
22email: [email protected], ORCID: 0000-0003-2579-3601
33institutetext: A. Y. Kruger (corresponding author) 44institutetext: Optimization Research Group, Faculty of Mathematics and Statistics, Ton Duc Thang University, Ho Chi Minh City, Vietnam
44email: [email protected], ORCID: 0000-0002-7861-7380
55institutetext: Nguyen Hieu Thao 66institutetext: School of Science, Engineering and Technology, RMIT University Vietnam, Ho Chi Minh City, Vietnam
66email: [email protected], ORCID: 0000-0002-1455-9169

Extremality of families of sets and set-valued optimization

Nguyen Duy Cuong    Alexander Y. Kruger    Nguyen Hieu Thao
Abstract

The paper explores a new extremality model involving collections of arbitrary families of sets. We demonstrate its applicability to set-valued optimization problems with general preferences, weakening the assumptions of the known results and streamlining their proofs.

Keywords:
extremal principle separation optimality conditions set-valued optimization
MSC:
49J52 49J53 49K40 90C30 90C46

1 Introduction

The paper explores another extension of the extremal principle introduced recently in CuoKruTha24 . The conventional extremal principle KruMor80 ; MorSha96 ; Mor06.1 covers a wide range of problems in optimization and variational analysis as demonstrated, e.g., in the books BorZhu05 ; Mor06.1 ; Mor06.2 . At the same time, there exist problems, mainly in multiobjective and set-valued optimization described via closed preference relations, that cannot be covered within the framework of linear translations (as in the conventional principle). The first example of this kind was identified in Zhu Zhu00 . Fortunately, such problems can be handled with the help of a more flexible extended version of the extremal principle using nonlinear perturbations (deformations) of the sets defined by set-valued mappings. Such an extension was developed in Mordukhovich et al. MorTreZhu03 (see also BorZhu05 ; Mor06.2 ) and applied by other authors to various multiobjective problems ZheNg06 ; LiNgZhe07 ; Bao14.2 .

The model in MorTreZhu03 exploits non-intersection of perturbations of given sets Ω1,,Ωn\Omega_{1},\ldots,\Omega_{n}. The perturbations are chosen from the respective families of sets Ξi:={Si(s)sMi}\Xi_{i}:=\{S_{i}(s)\mid s\in M_{i}\} (i=1,,n)(i=1,\ldots,n) determined by given set-valued mappings Si:MiXS_{i}:M_{i}\rightrightarrows X (i=1,,n)(i=1,\ldots,n) on metric spaces. In the particular case of linear translations, i.e., when, for all i=1,,ni=1,\ldots,n, (Mi,di)=(X,d)(M_{i},d_{i})=(X,d) and Si(a)=ΩiaS_{i}(a)=\Omega_{i}-a (aX)(a\in X), the model reduces to the conventional extremal principle. It was shown by examples in MorTreZhu03 ; Mor06.2 that the framework of set-valued perturbations is richer than that of linear translations. With minor modifications in the proof, the conventional extremal principle was extended to the set-valued setting producing a more advanced model.

This model has been refined in CuoKruTha24 , making it more flexible and, at the same time, simpler. The refined model studies extremality and stationarity of nonempty families of arbitrary sets Ξi\Xi_{i} (i=1,,n)(i=1,\ldots,n) and is applicable to a wider range of variational problems. The next definition and theorem are simplified versions of (CuoKruTha24, , Definition 3.2 and Theorem 3.4), respectively.

Definition 1 (Extremality and stationarity: families of sets).

Let Ξ1,,Ξn\Xi_{1},\ldots,\Xi_{n} be families of subsets of a normed space XX, and x¯X\bar{x}\in X. The collection {Ξ1,,Ξn}\{\Xi_{1},\ldots,\Xi_{n}\} is

  1. (i)

    extremal at x¯\bar{x} if there is a ρ(0,+]\rho\in(0,+\infty] such that, for any ε>0\varepsilon>0, there exist AiΞiA_{i}\in\Xi_{i} (i=1,,n)(i=1,\ldots,n) such that max1ind(x¯,Ai)<ε\max_{1\leq i\leq n}d(\bar{x},A_{i})<\varepsilon and i=1nAiBρ(x¯)=\bigcap_{i=1}^{n}A_{i}\cap B_{\rho}(\bar{x})=\emptyset.

  2. (ii)

    stationary at x¯\bar{x} if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon) and AiΞiA_{i}\in\Xi_{i} (i=1,,n)(i=1,\ldots,n) such that max1ind(x¯,Ai)<ερ\max_{1\leq i\leq n}d(\bar{x},A_{i})<\varepsilon\rho and i=1nAiBρ(x¯)=\bigcap_{i=1}^{n}A_{i}\cap B_{\rho}(\bar{x})=\emptyset.

  3. (iii)

    approximately stationary at x¯\bar{x} if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon), AiΞiA_{i}\in\Xi_{i} and xiBε(x¯)x_{i}\in B_{\varepsilon}(\bar{x}) (i=1,,n)(i=1,\ldots,n) such that max1ind(xi,Ai)<ερ\max_{1\leq i\leq n}d(x_{i},A_{i})<\varepsilon\rho and i=1n(Aixi)(ρ𝔹)=\bigcap_{i=1}^{n}(A_{i}-x_{i})\cap(\rho\mathbb{B})=\emptyset.

Theorem 1.1.

Let Ξ1,,Ξn\Xi_{1},\ldots,\Xi_{n} be families of closed subsets of a Banach space XX, and x¯X\bar{x}\in X. If {Ξ1,,Ξn}\{\Xi_{1},\ldots,\Xi_{n}\} is approximately stationary at x¯\bar{x}, then, for any ε>0\varepsilon>0, there exist AiΞiA_{i}\in\Xi_{i}, xiAiBε(x¯)x_{i}\in A_{i}\cap B_{\varepsilon}(\bar{x}), and xiNAiC(xi)x_{i}^{*}\in N^{C}_{A_{i}}(x_{i}) (i=1,,n)(i=1,\ldots,n) such that

i=1nxi<εandi=1nxi=1.\displaystyle\Big{\|}\sum_{i=1}^{n}x_{i}^{*}\Big{\|}<\varepsilon\quad\mbox{and}\quad\sum_{i=1}^{n}\|x_{i}^{*}\|=1.

If XX is Asplund, then NCN^{C} in the above assertion can be replaced by NFN^{F}.

The symbols NCN^{C} and NFN^{F} in the above theorem denote, respectively, the Clarke and Fréchet normal cones. Recall that a Banach space is Asplund if every continuous convex function on an open convex set is Fréchet differentiable on a dense subset Phe93 , or equivalently, if the dual of each its separable subspace is separable. We refer the reader to Phe93 ; Mor06.1 for discussions about and characterizations of Asplund spaces. All reflexive, particularly, all finite dimensional Banach spaces are Asplund. Most assertions involving Fréchet normals, subdifferentials and coderivatives are only valid in Asplund spaces; see MorSha96 .

Remark 1.
  1. (i)

    Part ii of Definition 1 is the explicit form of (CuoKruTha24, , Definition 3.2 (iii)), while part iii is a particular case of (CuoKruTha24, , Definition 3.2 (ii)) with Ω1==Ωn:=X\Omega_{1}=\ldots=\Omega_{n}:=X, thus, representing the weakest version of the property in (CuoKruTha24, , Definition 3.2 (ii)).

  2. (ii)

    It is easy to see that i \Rightarrow\ ii \Rightarrow\ iii in Definition 1. Hence, the necessary conditions in Theorem 1.1 are also valid for the stationarity and extremality.

  3. (iii)

    Theorem 1.1 shows that approximate stationarity of a given collection of families of closed sets implies its fuzzy (up to ε\varepsilon) separation. Note that, unlike the general model discussed in BuiKru19 , not only the points xix_{i} and xix_{i}^{*} (i=1,,n)(i=1,\ldots,n) usually involved in fuzzy separation statements depend on ε\varepsilon, but also the sets A1,,AnA_{1},\ldots,A_{n}.

  4. (iv)

    For each i=1,,ni=1,\ldots,n, the sets AiA_{i} making the family Ξi\Xi_{i} in Definition 1 can be considered as perturbations of some given set Ωi\Omega_{i}. With this interpretation in mind, Definition 1i covers the definition of extremality with respect to set-valued perturbations in MorTreZhu03 ; Mor06.2 . Note that the “perturbation” sets in MorTreZhu03 ; Mor06.2 are rather loosely connected with the given sets.

Example 1.

Let Ξ1\Xi_{1} consist of a single one-point set {0}\{0\}\subset\mathbb{R}, and Ξ2\Xi_{2} be a family of singletons {1/n}\{1/n\} for nn\in\mathbb{N}. It is easy to see that {Ξ1,Ξ2}\{\Xi_{1},\Xi_{2}\} is extremal at 0 in the sense of Definition 1i (even with ρ=+\rho=+\infty). The subsets of Ξ1,Ξ2\Xi_{1},\Xi_{2} may be considered as “perturbations” of the sets Ω1=Ω2:=\Omega_{1}=\Omega_{2}:=\mathbb{R} in the sense of MorTreZhu03 ; Mor06.2 . The pair {Ω1,Ω2}\{\Omega_{1},\Omega_{2}\} is clearly not extremal at 0 in the conventional sense of KruMor80 ; Mor06.1 .

The more general (and simpler) model in Definition 1 and Theorem 1.1 is capable of treating a wider range of applications. In this paper, we demonstrate the applicability of Theorem 1.1 to set-valued optimization problems with general preference relations. This allows us to expand the range of set-valued optimization models studied in earlier publications, weaken their assumptions and streamline the proofs.

We study extremality/stationarity properties of the triple {F,Ω,Ξ}\{F,\Omega,\Xi\}, where F:XYF:X\rightrightarrows Y is a set-valued mapping between normed spaces, Ω\Omega is a subset of XX, and Ξ\Xi is a nonempty family of subsets of YY. The latter family may, in particular be determined by an abstract level-set mapping defining a preference relation on YY.

Members of Ξ\Xi do not have to be simply translations (deformations) of a fixed set (ordering cone). Extremality/stationarity properties of the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} reduce to the corresponding properties of the two special families of subsets of X×YX\times Y which involve gphF{\rm gph}\,F, and products of Ω\Omega and members of Ξ\Xi. The properties are illustrated by examples. Application of Theorem 1.1 yields necessary conditions for approximate stationarity and, hence, also stationarity and extremality. Natural qualification conditions in terms of Clarke or Fréchet coderivatives and normal cones are provided, which allow one to write down the necessary conditions in the form of an abstract multiplier rule. The statements cover the corresponding results in MorTreZhu03 ; ZheNg05.2 ; ZheNg06 .

Requirements on preference relations defined by level-set mappings, making them meaningful in optimization and applications, are discussed. A certain subset of properties, which are satisfied by most conventional and many other preference relations, is established. The properties are shown to be in general weaker than those used in Mor06.2 ; Bao14.2 ; KhaTamZal15 , but still sufficient for the corresponding set-valued optimization problems to fall within the theory developed in the current paper. Several multiplier rules for problems with a single set-valued mapping, and then with multiple set-valued mappings are formulated.

The structure of the paper is as follows. Section 2 recalls some definitions and facts used throughout the paper. The applicability of Theorem 1.1 is illustrated in Sections 35 considering set-valued optimization problems with general preference relations. A model with a single set-valued mapping is studied in Section 3. A particular case of this model when the family Ξ\Xi is determined by an abstract level-set mapping is considered in Section 4. A more general model with multiple set-valued mappings is briefly discussed in Section 5.

2 Preliminaries

Our basic notation is standard, see, e.g., Mor06.1 ; RocWet98 ; DonRoc14 ; Iof17 . Throughout the paper, if not explicitly stated otherwise, XX and YY are normed spaces. Products of normed spaces are assumed to be equipped with the maximum norm. The topological dual of a normed space XX is denoted by XX^{*}, while ,\langle\cdot,\cdot\rangle denotes the bilinear form defining the pairing between the two spaces. The open ball with center xx and radius δ>0\delta>0 is denoted by Bδ(x)B_{\delta}(x). If (x,y)X×Y(x,y)\in X\times Y, we write Bε(x,y)B_{\varepsilon}(x,y) instead of Bε((x,y))B_{\varepsilon}((x,y)). The open unit ball is denoted by 𝔹\mathbb{B} with a subscript indicating the space, e.g., 𝔹X\mathbb{B}_{X} and 𝔹X\mathbb{B}_{X^{*}}. Symbols \mathbb{R} and \mathbb{N} stand for the real line and the set of all positive integers, respectively.

The interior and closure of a set Ω\Omega are denoted by intΩ{\rm int}\,\Omega and clΩ{\rm cl}\,\Omega, respectively. The distance from a point xXx\in X to a subset ΩX\Omega\subset X is defined by d(x,Ω):=infuΩuxd(x,\Omega):=\inf_{u\in\Omega}\|u-x\|, and we use the convention d(x,)=+d(x,\emptyset)=+\infty.

Given a subset Ω\Omega of a normed space XX and a point x¯Ω\bar{x}\in\Omega, the sets (cf. Kru03 ; Cla83 )

NΩF(x¯):={xXlim supΩxx¯,xx¯x,xx¯xx¯0},\displaystyle N_{\Omega}^{F}(\bar{x}):=\Big{\{}x^{\ast}\in X^{\ast}\mid\limsup_{\Omega\ni x{\rightarrow}\bar{x},\;x\neq\bar{x}}\frac{\langle x^{\ast},x-\bar{x}}{\|x-\bar{x}\|}\leq 0\Big{\}}, (1)
NΩC(x¯):={xXx,z0for allzTΩC(x¯)}\displaystyle N_{\Omega}^{C}(\bar{x}):=\left\{x^{\ast}\in X^{\ast}\mid\left\langle x^{\ast},z\right\rangle\leq 0\quad\mbox{for all}\quad z\in T_{\Omega}^{C}(\bar{x})\right\} (2)

are the Fréchet and Clarke normal cones to Ω\Omega at x¯\bar{x}, where TΩC(x¯)T_{\Omega}^{C}(\bar{x}) stands for the Clarke tangent cone to Ω\Omega at x¯\bar{x}:

TΩC(x¯):={zXxkx¯,xkΩ,tk0,zkzsuch thatxk+tkzkΩfor allk}.T_{\Omega}^{C}(\bar{x}):=\big{\{}z\in X\mid\forall x_{k}{\rightarrow}\bar{x},\;x_{k}\in\Omega,\;\forall t_{k}\downarrow 0,\;\exists z_{k}\to z\\ \mbox{such that}\quad x_{k}+t_{k}z_{k}\in\Omega\quad\mbox{for all}\quad k\in\mathbb{N}\big{\}}.

The sets (1) and (2) are nonempty closed convex cones satisfying NΩF(x¯)NΩC(x¯)N_{\Omega}^{F}(\bar{x})\subset N_{\Omega}^{C}(\bar{x}). If Ω\Omega is a convex set, they reduce to the normal cone in the sense of convex analysis:

NΩ(x¯):={xXx,xx¯0for allxΩ}.\displaystyle N_{\Omega}(\bar{x}):=\left\{x^{*}\in X^{*}\mid\langle x^{*},x-\bar{x}\rangle\leq 0\quad\mbox{for all}\quad x\in\Omega\right\}.

By convention, we set NΩF(x¯)=NΩC(x¯):=N_{\Omega}^{F}(\bar{x})=N_{\Omega}^{C}(\bar{x}):=\emptyset if x¯Ω\bar{x}\notin\Omega.

A set-valued mapping F:XYF:X\rightrightarrows Y between two sets XX and YY is a mapping, which assigns to every xXx\in X a (possibly empty) subset F(x)F(x) of YY. We use the notations gphF:={(x,y)X×YyF(x)}{\rm gph}\,F:=\{(x,y)\in X\times Y\mid y\in F(x)\} and domF:={xXF(x)}{\rm dom}\,\>F:=\{x\in X\mid F(x)\neq\emptyset\} for the graph and the domain of FF, respectively, and F1:YXF^{-1}:Y\rightrightarrows X for the inverse of FF. This inverse (which always exists with possibly empty values at some yy) is defined by F1(y):={xXyF(x)}F^{-1}(y):=\{x\in X\mid y\in F(x)\}, yYy\in Y. Obviously domF1=F(X){\rm dom}\,F^{-1}=F(X).

If XX and YY are normed spaces, the Clarke coderivative DCF(x,y)D^{*C}F(x,y) of FF at (x,y)gphF(x,y)\in{\rm gph}\,F is a set-valued mapping defined by

DCF(x,y)(y):={xX(x,y)NgphFC(x,y)},yY.\displaystyle D^{*C}F(x,y)(y^{*}):=\{x^{*}\in X^{*}\mid(x^{*},-y^{*})\in N^{C}_{{\rm gph}\,F}(x,y)\},\quad y^{*}\in Y^{*}. (3)

Replacing the Clarke normal cone in (3) by the Fréchet one, we obtain the definition of the Fréchet coderivative.

Definition 2 (Aubin property).

A mapping F:XYF:X\rightrightarrows Y between metric spaces has the Aubin property at (x¯,y¯)gphF(\bar{x},\bar{y})\in{\rm gph}\,F if there exist τ>0\tau>0 and δ>0\delta>0 such that

d(y,F(x))τd(x,x)for allx,xBδ(x¯),yF(x)Bδ(y¯).\displaystyle d(y,F(x))\leq\tau d(x,x^{\prime})\quad\mbox{for all}\quad x,x^{\prime}\in B_{\delta}(\bar{x}),\;y\in F(x^{\prime})\cap B_{\delta}(\bar{y}). (4)

The number τ\tau is called the modulus of the Aubin property at (x¯,y¯)(\bar{x},\bar{y}).

Aubin property is among the most widely used properties of set-valued mappings in variational analysis (see, e.g., AubFra90 ; RocWet98 ; Mor06.1 ; DonRoc14 ; Iof17 ). It is known, in particular, to be equivalent to the metric regularity of the inverse mapping. It also yields estimates for the normals to the graph of the (given) mapping.

Lemma 1.

Let XX and YY be normed spaces, F:XYF:X\rightrightarrows Y, and (x¯,y¯)gphF(\bar{x},\bar{y})\in{\rm gph}\,F.

  1. (i)

    If FF has the Aubin property at (x¯,y¯)(\bar{x},\bar{y}) with modulus τ>0\tau>0, then there is a δ>0\delta>0 such that

    xτyfor all(x,y)gphFBδ(x¯,y¯),(x,y)NgphFF(x,y).\displaystyle\|x^{*}\|\leq\tau\|y^{*}\|\quad\mbox{for all}\quad(x,y)\in{\rm gph}\,F\cap B_{\delta}(\bar{x},\bar{y}),\;(x^{*},y^{*})\in N^{F}_{{\rm gph}\,F}(x,y). (5)
  2. (ii)

    If dimY<+\dim Y<+\infty, then NFN^{F} in the above assertion can be replaced by NCN^{C}.

Proof.
  1. (i)

    is well known; see, e.g., (Mor06.1, , Theorem 1.43(i)). (The latter theorem is formulated in Mor06.1 in the Banach space setting, but the proof is valid in arbitrary normed spaces.)

  2. (ii)

    Suppose dimY<+\dim Y<+\infty, and FF has the Aubin property at (x¯,y¯)(\bar{x},\bar{y}) with modulus τ>0\tau>0, i.e., condition (4) is satisfied for some δ>0\delta>0. Let (x,y)Bδ(x¯,y¯)gphF(x,y)\in B_{\delta}(\bar{x},\bar{y})\cap{\rm gph}\,F and (x,y)NgphFC(x,y)(x^{*},y^{*})\in N_{{\rm gph}\,F}^{C}(x,y). Take any sequences (xk,yk)gphF(x_{k},y_{k})\in{\rm gph}\,F and tk>0t_{k}>0 such that (xk,yk)(x,y)(x_{k},y_{k})\to(x,y) and tk0t_{k}\downarrow 0 as k+k\to+\infty. Fix an arbitrary uXu\in X. Without lost generality, we can assume that xk,xk+tkuBδ(x¯)x_{k},x_{k}+t_{k}u\in B_{\delta}(\bar{x}) and ykBδ(y¯)y_{k}\in B_{\delta}(\bar{y}) for all kk\in\mathbb{N}. By (4), for each kk\in\mathbb{N}, there exists a point ykF(xk+tku)y_{k}^{\prime}\in F(x_{k}+t_{k}u) such that ykykτtku\|y_{k}^{\prime}-y_{k}\|\leq\tau t_{k}\|u\|. Set vk:=(ykyk)/tkv_{k}:=(y_{k}^{\prime}-y_{k})/t_{k}. Then vkτu\|v_{k}\|\leq\tau\|u\|. Passing to subsequences, we can suppose that vkvYv_{k}\to v\in Y. Observe that (u,vk)(u,v)(u,v_{k})\to(u,v) as k+k\to+\infty, and (xk,yk)+tk(u,vk)gphF(x_{k},y_{k})+t_{k}(u,v_{k})\in{\rm gph}\,F for each kk\in\mathbb{N}. Thus, (u,v)TgphFC(x,y)(u,v)\in T_{{\rm gph}\,F}^{C}(x,y), and vτu\|v\|\leq\tau\|u\|. By the definition of the Clarke normal cone, we have x,uy,vτyu\left\langle x^{*},u\right\rangle\leq-\left\langle y^{*},v\right\rangle\leq\tau\|y^{*}\|\|u\|. Since vector uu is arbitrary, it follows that xτy\|x^{*}\|\leq\tau\|y^{*}\|.

3 Set-valued optimization: a single mapping

Let XX and YY be normed spaces, ΩX\Omega\subset X, F:XYF:X\rightrightarrows Y, x¯Ω\bar{x}\in\Omega and y¯F(x¯)\bar{y}\in F(\bar{x}). To model the setting of Definition 1, we consider a nonempty family Ξ\Xi of subsets of YY, and define two families of subsets of X×YX\times Y:

Ξ1:={gphF}andΞ2:={Ω×AAΞ}.\displaystyle\Xi_{1}:=\{{\rm gph}\,F\}\quad\text{and}\quad\Xi_{2}:=\{\Omega\times A\mid A\in\Xi\}. (6)

The first family consists of the single set gphF{\rm gph}\,F, and the first component of each member of the second family is always the given set Ω\Omega; only the second component varies.

To emphasize the structure of the pair (6), when referring to the corresponding properties in Definition 1, we will talk about extremality/stationarity of the triple {F,Ω,Ξ}\{F,\Omega,\Xi\}.

Definition 3.

The triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is extremal (resp., stationary, approximately stationary) at (x¯,y¯)(\bar{x},\bar{y}) if the pair (6) is extremal (resp., stationary, approximately stationary) at (x¯,y¯)(\bar{x},\bar{y}).

The next proposition is a direct consequence of Definitions 1 and 3.

Proposition 1.

The triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is

  1. (i)

    extremal at (x¯,y¯)(\bar{x},\bar{y}) if and only if there is a ρ(0,+]\rho\in(0,+\infty] such that, for any ε>0\varepsilon>0, there exists an AΞA\in\Xi such that d(y¯,A)<εd(\bar{y},A)<\varepsilon, and

    F(ΩBρ(x¯))ABρ(y¯)=;\displaystyle F(\Omega\cap B_{\rho}(\bar{x}))\cap A\cap B_{\rho}(\bar{y})=\emptyset; (7)
  2. (ii)

    stationary at (x¯,y¯)(\bar{x},\bar{y}) if and only if for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon) and an AΞA\in\Xi such that d(y¯,A)<ερd(\bar{y},A)<\varepsilon\rho, and condition (7) is satisfied;

  3. (iii)

    approximately stationary at (x¯,y¯)(\bar{x},\bar{y}) if and only if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon), an AΞA\in\Xi, and (x1,y1),(x2,y2)Bε(x¯,y¯)(x_{1},y_{1}),(x_{2},y_{2})\in B_{\varepsilon}(\bar{x},\bar{y}) such that d((x1,y1),gphF)<ερd((x_{1},y_{1}),{\rm gph}\,F)<\varepsilon\rho, d(x2,Ω)<ερd(x_{2},\Omega)<\varepsilon\rho, d(y2,A)<ερd(y_{2},A)<\varepsilon\rho, and

    F(x1+(Ωx2)(ρ𝔹X))(y1+(Ay2)(ρ𝔹Y))=.\displaystyle F(x_{1}+(\Omega-x_{2})\cap(\rho\mathbb{B}_{X}))\cap(y_{1}+(A-y_{2})\cap(\rho\mathbb{B}_{Y}))=\emptyset.

The next example illustrates relations between the properties in Proposition 1.

Example 2.

Let X=Y=Ω:=X=Y=\Omega:=\mathbb{R}, Ξ:={(,t]t}\Xi:=\{(-\infty,t]\mid t\in\mathbb{R}\}, and F1,F2,F3,F4:F_{1},F_{2},F_{3},F_{4}:\mathbb{R}\rightrightarrows\mathbb{R} be given by

F1(x)\displaystyle F_{1}(x) :=[0,+)for allx,F2(x):={[x+1,+)if x<1,[0,+)if x1,\displaystyle:=[0,+\infty)\quad\mbox{for all}\;x\in\mathbb{R},\quad\;F_{2}(x):=\begin{cases}[x+1,+\infty)&\mbox{if }x<-1,\\ [0,+\infty)&\mbox{if }x\geq-1,\end{cases}
F3(x)\displaystyle F_{3}(x) :=[x2,+)for allx,F4(x):={[x,+)if x<0,[x2,+)if x0.\displaystyle:=[-x^{2},+\infty)\;\mbox{for all}\;x\in\mathbb{R},\quad F_{4}(x):=\begin{cases}[x,+\infty)&\mbox{if }x<0,\\ [-x^{2},+\infty)&\mbox{if }x\geq 0.\end{cases}

Then 0Fi(0)0\in F_{i}(0) for all i=1,2,3,4i=1,2,3,4. The following assertions hold true:

  1. (i)

    {F1,,Ξ}\{F_{1},\mathbb{R},\Xi\} is extremal at (0,0)(0,0) with ρ=+\rho=+\infty;

  2. (ii)

    {F2,,Ξ}\{F_{2},\mathbb{R},\Xi\} is extremal at (0,0)(0,0) with some ρ(0,+)\rho\in(0,+\infty) but not with ρ=+\rho=+\infty;

  3. (iii)

    {F3,,Ξ}\{F_{3},\mathbb{R},\Xi\} is stationary but not extremal at (0,0)(0,0);

  4. (iv)

    {F4,,Ξ}\{F_{4},\mathbb{R},\Xi\} is approximately stationary at (0,0)(0,0) but not stationary at (0,0)(0,0).

The assertions are straightforward. We only prove assertion (iv). Let ε(0,1)\varepsilon\in(0,1). Choose any ρ(0,ε)\rho\in(0,\varepsilon) and t(ερ,ρ)t\in(\varepsilon\rho,\rho). Then tρ𝔹-t\in\rho\mathbb{B}_{\mathbb{R}}, tF4(t)-t\in F_{4}(-t) and tA-t\in A for any A:=(,η]ΞA:=(-\infty,-\eta]\in\Xi with d(0,A)<ερd(0,A)<\varepsilon\rho (i.e., for any η<ερ\eta<\varepsilon\rho). By Proposition 1 (ii), {F4,,Ξ}\{F_{4},\mathbb{R},\Xi\} is not stationary at (0,0)(0,0).

Let ε>0\varepsilon>0. Choose a ρ(0,min{ε,1}/3)\rho\in(0,\min\{\varepsilon,1\}/3) and points (x1,y1):=(ρ,ρ2)gphF4(ε𝔹2)(x_{1},y_{1}):=(\rho,-\rho^{2})\in{\rm gph}\,F_{4}\cap(\varepsilon\mathbb{B}_{\mathbb{R}^{2}}), x2:=0ε𝔹x_{2}:=0\in\varepsilon\mathbb{B}_{\mathbb{R}}, y2:=0ε𝔹y_{2}:=0\in\varepsilon\mathbb{B}_{\mathbb{R}}. Observe that A:=(,3ρ2]ΞA:=(-\infty,-3\rho^{2}]\in\Xi satisfies d(0,A)<ερd(0,A)<\varepsilon\rho. Then

F4(x1+(ρ𝔹))=F4(0,2ρ)=(4ρ2,+),\displaystyle F_{4}(x_{1}+(\rho\mathbb{B}_{\mathbb{R}}))=F_{4}(0,2\rho)=(-4\rho^{2},+\infty),
y1+(Ay2)(ρ𝔹)=ρ2+(ρ,3ρ2]=(ρ2ρ,4ρ2],\displaystyle y_{1}+(A-y_{2})\cap(\rho\mathbb{B}_{\mathbb{R}})=-\rho^{2}+(-\rho,-3\rho^{2}]=(-\rho^{2}-\rho,-4\rho^{2}],

and consequently, F4(x1+(ρ𝔹))(y1+(Ay2)(ρ𝔹))=F_{4}(x_{1}+(\rho\mathbb{B}_{\mathbb{R}}))\cap(y_{1}+(A-y_{2})\cap(\rho\mathbb{B}_{\mathbb{R}}))=\emptyset. By Proposition 1 (iii), {F4,,Ξ}\{F_{4},\mathbb{R},\Xi\} is approximately stationary at (0,0)(0,0).

The following example shows that the family of sets Ξ\Xi plays an important role in determining the properties.

Example 3.

Let X=Y=Ω:=X=Y=\Omega:=\mathbb{R} and F:F:\mathbb{R}\rightrightarrows\mathbb{R} be given by

F(x):={{x2}if x is rational,{x}otherwise.\displaystyle F(x):=\begin{cases}\{x\sqrt{2}\}&\mbox{if }x\text{ is rational},\\ \{x\}&\mbox{otherwise}.\end{cases}

Then 0F(0)0\in F(0).

Let Ξ:={(,t]t}\Xi:=\{(-\infty,t]\mid t\in\mathbb{R}\} and ε(0,1)\varepsilon\in(0,1). Choose any ρ(0,ε)\rho\in(0,\varepsilon), (x1,y1)gphF(x_{1},y_{1})\in{\rm gph}\,F, x2x_{2}\in\mathbb{R}, y2(ε,0]y_{2}\in(-\varepsilon,0] and A:=(,t]ΞA:=(-\infty,-t]\in\Xi with d(y2,A)<ερd(y_{2},A)<\varepsilon\rho (i.e., t<ερy2t<\varepsilon\rho-y_{2}). Then x1+(Ωx2)(ρ𝔹X)=Bρ(x1)x_{1}+(\Omega-x_{2})\cap(\rho\mathbb{B}_{X})=B_{\rho}(x_{1}), Ay2=(,τ]A-y_{2}=(-\infty,-\tau], where τ:=y2+t<ερ\tau:=y_{2}+t<\varepsilon\rho, and consequently, y1+(Ay2)(ρ𝔹Y)(y1ρ,y1ερ)y_{1}+(A-y_{2})\cap(\rho\mathbb{B}_{Y})\supset(y_{1}-\rho,y_{1}-\varepsilon\rho). We next show that F(Bρ(x1))(y1ρ,y1ερ)F(B_{\rho}(x_{1}))\cap(y_{1}-\rho,y_{1}-\varepsilon\rho)\neq\emptyset. If x1x_{1} is rational, then y1=x12y_{1}=x_{1}\sqrt{2}, and choosing a rational number x^(x1ρ/2,x1ερ/2)Bρ(x1)\hat{x}\in(x_{1}-\rho/\sqrt{2},x_{1}-\varepsilon\rho/\sqrt{2})\subset B_{\rho}(x_{1}), we get y^:=x^2F(x^)(y1ρ,y1ερ)\hat{y}:=\hat{x}\sqrt{2}\in F(\hat{x})\cap(y_{1}-\rho,y_{1}-\varepsilon\rho). If x1x_{1} is irrational, then y1=x1y_{1}=x_{1}, and choosing an irrational number x^(x1ρ,x1ερ)Bρ(x1)\hat{x}\in(x_{1}-\rho,x_{1}-\varepsilon\rho)\subset B_{\rho}(x_{1}), we get y^:=x^F(x^)(y1ρ,y1ερ)\hat{y}:=\hat{x}\in F(\hat{x})\cap(y_{1}-\rho,y_{1}-\varepsilon\rho). By Proposition 1 (iii), {F,,Ξ}\{F,\mathbb{R},\Xi\} is not approximately stationary at (0,0)(0,0).

Let Ξ:={{1/n}n}\Xi:=\{\{-1/n\}\mid n\in\mathbb{N}\}. Since F()F(\mathbb{R}) only contains irrational numbers, we have F()A=F(\mathbb{R})\cap A=\emptyset for all AΞA\in\Xi. By Proposition 1 (i), {F,,Ξ}\{F,\mathbb{R},\Xi\} is extremal at (0,0)(0,0) (with ρ=+\rho=+\infty).

Application of Theorem 1.1 yields necessary conditions for approximate stationarity and, hence, also stationarity and extremality.

Theorem 3.1.

Let XX and YY be Banach spaces, the sets Ω\Omega, gphF{\rm gph}\,F and all members of Ξ\Xi be closed. If the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}), then, for any ε>0\varepsilon>0, there exist (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), AΞA\in\Xi, y2ABε(y¯)y_{2}\in A\cap B_{\varepsilon}(\bar{y}), (x1,y1)NgphFC(x1,y1)(x_{1}^{*},y_{1}^{*})\in N^{C}_{{\rm gph}\,F}(x_{1},y_{1}), x2NΩC(x2)x_{2}^{*}\in N^{C}_{\Omega}(x_{2}) and y2NAC(y2)y_{2}^{*}\in N^{C}_{A}(y_{2}) such that

(x1,y1)+(x2,y2)<εand(x1,y1)+(x2,y2)=1.\displaystyle\|(x_{1}^{*},y_{1}^{*})+(x_{2}^{*},y_{2}^{*})\|<\varepsilon\quad\mbox{and}\quad\|(x_{1}^{*},y_{1}^{*})\|+\|(x_{2}^{*},y_{2}^{*})\|=1.

If XX is Asplund, then NCN^{C} in the above assertion can be replaced by NFN^{F}.

The normalization condition (x1,y1)+(x2,y2)=1\|(x_{1}^{*},y_{1}^{*})\|+\|(x_{2}^{*},y_{2}^{*})\|=1 in Theorem 3.1 ensures that normal vectors (x1,y1)(x_{1}^{*},y_{1}^{*}) to gphF{\rm gph}\,F remain sufficiently large when ε0\varepsilon\downarrow 0, i.e., x1x_{1}^{*} and y1y_{1}^{*} cannot go to 0 simultaneously. The case when vectors y1y_{1}^{*} are bounded away from 0 (hence, one can assume y1=1\|y_{1}^{*}\|=1) is of special interest as it leads to a proper multiplier rule. A closer look at the alternative: either y1y_{1}^{*} are bounded away from 0 as ε0\varepsilon\downarrow 0, or they are not (hence, vectors x1x_{1}^{*} remain large), allows one to formulate the following consequence of Theorem 3.1.

Corollary 1.

Let XX and YY be Banach spaces, the sets Ω\Omega, gphF{\rm gph}\,F and all members of Ξ\Xi be closed. If the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}), then one of the following assertions holds true:

  1. (i)

    there is an M>0M>0 such that, for any ε>0\varepsilon>0, there exist (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), AΞA\in\Xi, y2ABε(y¯)y_{2}\in A\cap B_{\varepsilon}(\bar{y}), and yNAC(y2)+ε𝔹Yy^{*}\in N^{C}_{A}(y_{2})+\varepsilon\mathbb{B}_{Y^{*}} such that y=1\|y^{*}\|=1 and

    0DCF(x1,y1)(y)+NΩC(x2)(M𝔹X)+ε𝔹X;\displaystyle 0\in D^{*C}F(x_{1},y_{1})(y^{*})+N^{C}_{\Omega}(x_{2})\cap(M\mathbb{B}_{X^{*}})+\varepsilon\mathbb{B}_{X^{*}}; (8)
  2. (ii)

    for any ε>0\varepsilon>0, there exist (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), x1DCF(x1,y1)(ε𝔹Y)x_{1}^{*}\in D^{*C}F(x_{1},y_{1})(\varepsilon\mathbb{B}_{Y^{*}}) and x2NΩC(x2)x_{2}^{*}\in N^{C}_{\Omega}(x_{2}) such that

    x1+x2<εandx1+x2=1.\displaystyle\|x_{1}^{*}+x_{2}^{*}\|<\varepsilon\quad\mbox{and}\quad\|x_{1}^{*}\|+\|x_{2}^{*}\|=1.

If XX is Asplund, then NCN^{C} and DCD^{*C} in the above assertions can be replaced by NFN^{F} and DFD^{*F}, respectively.

Proof.

Let the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} be approximately stationary at (x¯,y¯)(\bar{x},\bar{y}). By Theorem 3.1, for any jj\in\mathbb{N}, there exist (x1j,y1j)gphFB1/j(x¯,y¯)(x_{1j},y_{1j})\in{\rm gph}\,F\cap B_{1/j}(\bar{x},\bar{y}), x2jΩB1/j(x¯)x_{2j}\in\Omega\cap B_{1/j}(\bar{x}), AjΞ{A_{j}}\in\Xi, y2jAjB1/j(y¯)y_{2j}\in A_{j}\cap B_{1/j}(\bar{y}), (x1j,y1j)NgphFC(x1j,y1j)(x_{1j}^{*},y_{1j}^{*})\in N^{C}_{{\rm gph}\,F}(x_{1j},y_{1j}), x2jNΩC(x2j)x_{2j}^{*}\in N^{C}_{\Omega}(x_{2j}) and y2jNAjC(y2j)y^{*}_{2j}\in N^{C}_{A_{j}}(y_{2j}) such that (x1j,y1j)+(x2j,y2j)=1\|(x_{1j}^{*},y_{1j}^{*})\|+\|(x_{2j}^{*},y_{2j}^{*})\|=1 and (x1j,y1j)+(x2j,y2j)<1/j\|(x_{1j}^{*},y_{1j}^{*})+(x_{2j}^{*},y_{2j}^{*})\|<1/j. We consider two cases.

Case 1. lim supj+y1j>α>0\limsup_{j\to+\infty}\|y_{1j}^{*}\|>\alpha>0. Note that α<1\alpha<1. Set M:=1/αM:=1/\alpha. Let ε>0\varepsilon>0. Choose a number jj\in\mathbb{N} so that j1<αεj^{-1}<\alpha\varepsilon and y1j>α\|y_{1j}^{*}\|>\alpha. Set x1:=x1jx_{1}:=x_{1j}, y1:=y1jy_{1}:=y_{1j}, x2:=x2jx_{2}:=x_{2j}, A:=AjA:=A_{j}, y2:=y2jy_{2}:=y_{2j}, y:=y1j/y1jy^{*}:=-y_{1j}^{*}/\|y_{1j}^{*}\|, x1:=x1j/y1jx_{1}^{*}:=x_{1j}^{*}/\|y_{1j}^{*}\|, x2:=x2j/y1jx_{2}^{*}:=x_{2j}^{*}/\|y_{1j}^{*}\| and y2:=y2j/y1jy_{2}^{*}:=y_{2j}^{*}/\|y_{1j}^{*}\|. Then (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), y2ABε(y¯)y_{2}\in A\cap B_{\varepsilon}(\bar{y}), x1DCF(x1,y1)(y)x_{1}^{*}\in D^{*C}F(x_{1},y_{1})(y^{*}), x2NΩC(x2)x_{2}^{*}\in N^{C}_{\Omega}(x_{2}), y=1{\|y^{*}\|=1}, y2NAC(y2)y^{*}_{2}\in N^{C}_{A}(y_{2}), x2<1/α=M\|x_{2}^{*}\|<1/\alpha=M. Furthermore, yy2=y1j+y2j/y1j<1/(αj)<ε\|y^{*}-y_{2}^{*}\|=\|y_{1j}^{*}+y_{2j}^{*}\|/\|y_{1j}^{*}\|<1/(\alpha j)<\varepsilon, hence, yNAC(y2)+ε𝔹Yy^{*}\in N^{C}_{A}(y_{2})+\varepsilon\mathbb{B}_{Y^{*}}; and x1+x2=x1j+x2j/y1j<1/(αj)<ε\|x_{1}^{*}+x_{2}^{*}\|=\|x_{1j}^{*}+x_{2j}^{*}\|/\|y_{1j}^{*}\|<1/(\alpha j)<\varepsilon, hence, condition (8) is satisfied. Thus, assertion (i) holds true.

Case 2. limj+y1j=0\lim_{j\to+\infty}\|y_{1j}^{*}\|=0. Then y2j0y_{2j}^{*}\to 0, x1j+x2j0x_{1j}^{*}+x_{2j}^{*}\to 0 and 1x1j+x2j11\geq\|x_{1j}^{*}\|+\|x_{2j}^{*}\|\to 1 as j+j\to+\infty. Let ε>0\varepsilon>0. Choose a number jj\in\mathbb{N} so that x1j+x2j>0\|x_{1j}^{*}\|+\|x_{2j}^{*}\|>0 and max{j1,y1j,x1j+x2j/(x1j+x2j)}<ε\max\{j^{-1},\|y_{1j}^{*}\|,\|x_{1j}^{*}+x_{2j}^{*}\|/(\|x_{1j}^{*}\|+\|x_{2j}^{*}\|)\}<\varepsilon. Set x1:=x1jx_{1}:=x_{1j}, y1:=y1jy_{1}:=y_{1j}, x2:=x2jx_{2}:=x_{2j}, x1:=x1j/(x1j+x2j)x_{1}^{*}:=x_{1j}^{*}/(\|x_{1j}^{*}\|+\|x_{2j}^{*}\|) and x2:=x2j/(x1j+x2j)x_{2}^{*}:=x_{2j}^{*}/(\|x_{1j}^{*}\|+\|x_{2j}^{*}\|). Then (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), x1DCF(x1,y1)(ε𝔹Y)x_{1}^{*}\in D^{*C}F(x_{1},y_{1})(\varepsilon\mathbb{B}_{Y^{*}}), x2NΩC(x2)x_{2}^{*}\in N^{C}_{\Omega}(x_{2}), x1+x2=1\|x_{1}^{*}\|+\|x_{2}^{*}\|=1, and x1+x2<ε\|x_{1}^{*}+x_{2}^{*}\|<\varepsilon. Thus, assertion (ii) holds true.

If XX is Asplund, then NCN^{C} and DCD^{*C} in the above arguments can be replaced by NFN^{F} and DFD^{*F}, respectively.

Remark 2.
  1. (i)

    Part (i) of Corollary 1 gives a kind of multiplier rule with yy^{*} playing the role of the vector of multipliers.

  2. (ii)

    Part (ii) corresponds to ‘singular’ behaviour of FF on Ω\Omega. It involves ‘horizontal’ normals to the graph of FF; the yy^{*} component vanishes, and consequently, Ξ\Xi plays no role.

The following condition is the negation of the condition in Corollary 1 (ii).

    1. (QC)C(QC)_{C}

      there is an ε>0\varepsilon>0 such that x1+x2ε\|x_{1}^{*}+x_{2}^{*}\|\geq\varepsilon for all (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), x1DCF(x1,y1)(ε𝔹Y)x_{1}^{*}\in D^{*C}F(x_{1},y_{1})(\varepsilon\mathbb{B}_{Y^{*}}) and x2NΩC(x2))x_{2}^{*}\in N^{C}_{\Omega}(x_{2})) such that x1+x2=1\|x_{1}^{*}\|+\|x_{2}^{*}\|=1.

It excludes the singular behavior mentioned in Remark 2 (ii) and serves as a qualification condition ensuring that only the condition in part (i) of Corollary 1 is possible. We denote by (QC)F(QC)_{F} the analogue of (QC)C(QC)_{C} with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively.

Corollary 2.

Let XX and YY be Banach spaces, Ω\Omega, gphF{\rm gph}\,F and all members of Ξ\Xi be closed. Suppose that the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}). If condition (QC)C(QC)_{C} is satisfied, then assertion (i) in Corollary 1 holds true.

If XX is Asplund and condition (QC)F(QC)_{F} is satisfied, then assertion (i) in Corollary 1 holds true with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively.

The next proposition provides two typical sufficient conditions for the fulfillment of conditions (QC)C(QC)_{C} and (QC)F(QC)_{F}.

Proposition 2.

Let XX and YY be normed spaces.

  1. (i)

    If FF has the Aubin property at (x¯,y¯)(\bar{x},\bar{y}), then (QC)F(QC)_{F} is satisfied. If, additionally, dimY<+\dim Y<+\infty, then (QC)C(QC)_{C} is satisfied too.

  2. (ii)

    If x¯intΩ\bar{x}\in{\rm int}\,\Omega, then both (QC)C(QC)_{C} and (QC)F(QC)_{F} are satisfied.

Proof.
  1. (i)

    If FF has the Aubin property at (x¯,y¯)(\bar{x},\bar{y}), then, by Lemma 1, condition (5) is satisfied with some τ>0\tau>0 and δ>0\delta>0, and, if dimY<+\dim Y<+\infty, then the latter condition is also satisfied with NCN^{C} in place of NFN^{F}. Hence, (QC)F(QC)_{F} is satisfied with ε:=1/(2τ+1)\varepsilon:=1/(2\tau+1), as well as (QC)C(QC)_{C} if dimY<+\dim Y<+\infty. Indeed, if (x1,y1)gphFBδ(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\delta}(\bar{x},\bar{y}) and x1DF(x1,y1)(y1)x_{1}^{*}\in D^{*}F(x_{1},y_{1})(y_{1}^{*}) (where DD^{*} stands for either DCD^{*C} or DFD^{*F}), x2Xx_{2}^{*}\in X^{*}, x1+x2=1\|x_{1}^{*}\|+\|x_{2}^{*}\|=1 and y1<ε\|y_{1}^{*}\|<\varepsilon, then x1+x2x2x1=12x1>12τε=ε.\|x_{1}^{*}+x_{2}^{*}\|\geq\|x_{2}^{*}\|-\|x_{1}^{*}\|=1-2\|x_{1}^{*}\|>1-2\tau\varepsilon=\varepsilon.

  2. (ii)

    If x¯intΩ\bar{x}\in{\rm int}\,\Omega, then NΩC(x2)=NΩF(x2)={0}N^{C}_{\Omega}(x_{2})=N^{F}_{\Omega}(x_{2})=\{0\} for all x2x_{2} near x¯\bar{x}, and consequently, for any normal vector x2x_{2}^{*} to Ω\Omega at x2x_{2} and any x1Xx_{1}^{*}\in X^{*}, condition x1+x2=1\|x_{1}^{*}\|+\|x_{2}^{*}\|=1 yields x1+x2=1\|x_{1}^{*}+x_{2}^{*}\|=1. Hence, both (QC)C(QC)_{C} and (QC)F(QC)_{F} are satisfied with any sufficiently small ε\varepsilon.

Remark 3.

In view of Proposition 2, each of the Corollaries 1 and 2 covers (ZheNg05.2, , Theorems 3.1 and 4.1) and (ZheNg06, , Theorem 3.1 and Corollary 3.1), which give dual necessary conditions for Pareto optimality, as well as (MorTreZhu03, , Proposition 5.1) for a vector optimization problem with a general preference relation.

The next example illustrates the verification of the necessary conditions for approximate stationarity in Corollaries 1 and 2 for the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} , where F4F_{4} is defined in Example 2.

Example 4.

Let X=Y=Ω:=X=Y=\Omega:=\mathbb{R}, and F4F_{4} and Ξ\Xi be as in Example 2. Thus, the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}), and the conclusions of Corollary 1 must hold true. Moreover, the assumptions in both parts of Proposition 2 are satisfied, and consequently, condition (QC)F(QC)_{F} holds true. By Corollary 2, assertion (i) in Corollary 1 holds true with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively. We now verify this assertion.

Let M>0M>0 and ε>0\varepsilon>0. Choose a t(0,min{ε,1})t\in(0,\min\{\varepsilon,1\}). Set (x1,y1):=(t/2,t2/4)gphF4(ε𝔹2)(x_{1},y_{1}):=(t/2,-t^{2}/4)\in{\rm gph}\,F_{4}\cap(\varepsilon\mathbb{B}_{\mathbb{R}^{2}}), x2:=0Ω(ε𝔹)x_{2}:=0\in\Omega\cap{(\varepsilon\mathbb{B}_{\mathbb{R}})}, A:=(,t]ΞA:=(-\infty,-t]\in\Xi, y2:=tA(ε𝔹)y_{2}:=-t\in A\cap{(\varepsilon\mathbb{B}_{\mathbb{R}})} and y:=1y^{*}:=1. Thus, NΩF(x2)={0}N^{F}_{\Omega}(x_{2})=\{0\}, yNAF(y2)y^{*}\in N^{F}_{A}(y_{2}) and DFF4(x1,y1)(y)={t}D^{*F}F_{4}(x_{1},y_{1})(y^{*})=\{-t\}. Hence,

DFF4(x1,y1)(y)+NΩF(x2)(M𝔹)={t}ε𝔹,\displaystyle D^{*F}F_{4}(x_{1},y_{1})(y^{*})+N^{F}_{\Omega}(x_{2})\cap(M\mathbb{B}_{\mathbb{R}})=\{-t\}\in\varepsilon\mathbb{B}_{\mathbb{R}},

i.e., assertion (i) in Corollary 1 holds true (in terms of Fréchet normals and coderivatives).

4 Abstract level-set mapping

We now consider a particular case of the model in Sections 3 when the family Ξ\Xi is determined by an abstract level-set mapping L:YYL:Y\rightrightarrows Y. The latter mapping defines a preference relation \prec on YY: vyv\prec y if and only if vL(y)v\in L(y); see, e.g., (KhaTamZal15, , p. 67).

Given a point yYy\in Y, we employ below the following notation:

L(y):=L(y){y},L(y):=L(y){y}.\displaystyle L^{\circ}(y):=L(y)\setminus\{y\},\quad L^{-}(y):=L(y)\cup\{y\}. (9)

Certain requirements are usually imposed on LL in order to make the corresponding preference relation meaningful in optimization and applications; see, e.g., Zhu00 ; MorTreZhu03 ; Mor06.2 ; KhaTamZal15 . In this section, we discuss the following properties of LL at or near the reference point y¯\bar{y}:

  1. (O1)

    lim infL(y¯)yy¯d(y¯,L(y))=0\liminf\limits_{L^{\circ}(\bar{y})\ni y\to\bar{y}}d(\bar{y},L(y))=0;

  2. (O2)

    y¯clL(y¯)\bar{y}\in{\rm cl}\,L^{\circ}(\bar{y});

  3. (O3)

    y¯L(y¯)\bar{y}\notin L(\bar{y});

  4. (O4)

    yclL(y)y\in{\rm cl}\,L(y) for all yy near y¯\bar{y};

  5. (O5)

    if yL(y¯)y\in L^{\circ}(\bar{y}) and vclL(y)v\in{\rm cl}\,L(y), then vL(y¯)v\in L^{\circ}(\bar{y});

  6. (O6)

    if yL(y¯)y\in L(\bar{y}) and vclL(y)v\in{\rm cl}\,L(y), then vL(y¯)v\in L(\bar{y}).

Some characterizations of the properties and relations between them are collected in the next proposition.

Proposition 3.

Let L:YYL:Y\rightrightarrows Y, and LL^{\circ} be given by (9). The following assertions hold true.

  1. (i)

    (O1) \Leftrightarrow\ {yYd(y¯,L(y))<ε}L(y¯)Bε(y¯)\{y\in Y\mid d(\bar{y},L(y))<\varepsilon\}\cap L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})\neq\emptyset for all ε>0\varepsilon>0.

  2. (ii)

    (O1) \Rightarrow\ (O2).

  3. (iii)

    (O3) \Leftrightarrow\ [L(y¯)=L(y¯)][L(\bar{y})=L^{\circ}(\bar{y})].

  4. (iv)

    (O2) & (O4) \Rightarrow\ (O1).

  5. (v)

    (O3) & (O4) \Rightarrow\ (O2).

  6. (vi)

    (O3) \Rightarrow\ [(O5)(O6)][\eqref{O5}\Leftrightarrow\eqref{O6}].

Proof.
  1. (i)

    (O1)  \Leftrightarrow\ infyL(y¯)Bε(y¯)d(y¯,L(y))=0\inf_{y\in L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})}d(\bar{y},L(y))=0 for any ε>0\varepsilon>0\Rightarrow\  for any ε>0\varepsilon>0, there is a yL(y¯)Bε(y¯)y\in L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y}) such that d(y¯,L(y))<εd(\bar{y},L(y))<\varepsilon. This proves the ‘\Rightarrow’ implication. Conversely, let δ:=infyL(y¯)Bε(y¯)d(y¯,L(y))>0\delta:=\inf_{y\in L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})}d(\bar{y},L(y))>0 for some ε>0\varepsilon>0. Then {yYd(y¯,L(y))<δ}L(y¯)Bε(y¯)=\{y\in Y\mid d(\bar{y},L(y))<\delta\}\cap L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})=\emptyset, and consequently, {yYd(y¯,L(y))<ε}L(y¯)Bε(y¯)=\{y\in Y\mid d(\bar{y},L(y))<\varepsilon^{\prime}\}\cap L^{\circ}(\bar{y})\cap B_{\varepsilon^{\prime}}(\bar{y})=\emptyset, where ε:=min{ε,δ}\varepsilon^{\prime}:=\min\{\varepsilon,\delta\}. The implication ‘\Leftarrow’ follows.

  2. (ii)

    (O1)  \Rightarrow\  there exists a sequence {yk}L(y¯)\{y_{k}\}\subset L^{\circ}(\bar{y}) with yky¯y_{k}\to\bar{y}\Leftrightarrow\  (O2).

  3. (iii)

    The assertion is a consequence of the definition of LL^{\circ} in (9).

  4. (iv)

    Suppose conditions (O2) and (O4) are satisfied. Let ε>0\varepsilon>0. Thanks to (O4), we can choose a ξ(0,ε)\xi\in(0,\varepsilon) such that yclL(y)y\in{\rm cl}\,L(y) for all yBξ(y¯)y\in B_{\xi}(\bar{y}). If yBξ(y¯)y\in B_{\xi}(\bar{y}), then d(y¯,L(y))=d(y¯,clL(y))yy¯<ξd(\bar{y},L(y))=d(\bar{y},{\rm cl}\,L(y))\leq\|y-\bar{y}\|<\xi. Thus, Bξ(y¯){yYd(y¯,L(y))<ξ}B_{\xi}(\bar{y})\subset\{y\in Y\mid d(\bar{y},L(y))<\xi\}. Thanks to (O2), we have

    {yYd(y¯,L(y))<ε}L(y¯)Bε(y¯)L(y¯)Bξ(y¯).\{y\in Y\mid d(\bar{y},L(y))<\varepsilon\}\cap L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})\supset L^{\circ}(\bar{y})\cap B_{\xi}(\bar{y})\neq\emptyset.

    Since ε\varepsilon is an arbitrary positive number, in view of i, this proves (O1).

  5. (v)

    (O4)  \Rightarrow\ y¯clL(y¯)\bar{y}\in{\rm cl}\,L(\bar{y}). The conclusion follows thanks to (iii).

  6. (vi)

    The assertion is a consequence of (iii).

Remark 4.

Properties (O4) and (O6) are components of the definition of closed preference relation (see (Mor06.2, , Definition 5.55), (Bao14.2, , p. 583), (KhaTamZal15, , p. 68)) widely used in vector and set-valued optimization. They are called, respectively, local satiation (around y¯\bar{y}) and almost transitivity. Note that the latter property is actually stronger than the conventional transitivity. It is not satisfied for the preference defined by the lexicographical order (see (Mor06.2, , Example 5.57)) and some other natural preference relations important in vector optimization and its applications including those to welfare economics (see (KhaTamZal15, , Sect. 15.3)). Closed preference relations are additionally assumed in Mor06.2 ; Bao14.2 ; KhaTamZal15 to be nonreflexive, thus, satisfying, in particular, property (O3). In view of Proposition 3, if a preference relation satisfies properties (O3), (O4) and (O6), it also satisfies properties (O1), (O2) and (O5). In this section, we employ the weaker properties (O1) and (O5), which are satisfied by most conventional and many other preference relations. This makes our model applicable to a wider range of multiobjective and set-valued optimization problems compared to those studied in Mor06.2 ; Bao14.2 ; KhaTamZal15 .

The next two examples illustrate some characterizations of the level-set mapping.

Example 5.

Let L(y):={y}L(y):=\{y\} for all yYy\in Y. Then L(y)=L^{\circ}(y)=\emptyset. Thus, properties (O4) and (O5) are obviously satisfied, while properties (O1) and (O2) are violated.

Example 6.

Let L:22L:\mathbb{R}^{2}\rightrightarrows\mathbb{R}^{2} be defined by

L(y1,y2):={{(v1,v2)2v1<y1,v2<y2}if (y1,y2)(0,0),{(0,0)}otherwise.\displaystyle L(y_{1},y_{2}):=\begin{cases}\{(v_{1},v_{2})\in\mathbb{R}^{2}\mid v_{1}<y_{1},\;v_{2}<y_{2}\}&\text{if }(y_{1},y_{2})\neq(0,0),\\ \{(0,0)\}&\text{otherwise}.\end{cases}

Let y¯:=(0,0)\bar{y}:=(0,0). Then L(y1,y2)=L(y1,y2)L^{\circ}(y_{1},y_{2})=L(y_{1},y_{2}) if (y1,y2)y¯(y_{1},y_{2})\neq\bar{y} and L(y¯)=L^{\circ}(\bar{y})=\emptyset. As in Example 5, properties (O4) and (O5) are satisfied, while properties (O1) and (O2) are violated.

We are going to employ in our model the ‘localized’ family of sets

Ξδ:={clL(y)yL(y¯)Bδ(y¯)}.\displaystyle\Xi^{\delta}:=\{{\rm cl}\,L(y)\mid y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y})\}. (10)

Note that members of Ξδ\Xi^{\delta} are not simply translations (deformations) of the fixed set L(y¯)L(\bar{y}) (or L(y¯)L^{\circ}(\bar{y})); they are defined by sets L(y)L(y) where yy does not have to be equal to y¯\bar{y}.

Remark 5.

Given a set KK containing y¯\bar{y}, one can naturally define the level-set mapping by L(y)=yy¯+KL(y)=y-\bar{y}+K for all yYy\in Y. Then (10) defines the family of perturbations as the traditional collection of translations of clK{\rm cl}\,K, i.e., Ξδ={yy¯+clKyKBδ(y¯)}\Xi^{\delta}=\{y-\bar{y}+{\rm cl}\,K\mid y\in K\cap B_{\delta}(\bar{y})\}.

In the current setting, the properties in Proposition 1 take the following form.

Proposition 4.

Let δ>0\delta>0, and Ξδ\Xi^{\delta} be given by (10). The triple {F,Ω,Ξδ}\{F,\Omega,\Xi^{\delta}\} is

  1. (i)

    extremal at (x¯,y¯)(\bar{x},\bar{y}) if and only if there is a ρ(0,+]\rho\in(0,+\infty] such that, for any ε>0\varepsilon>0, there exists a yL(y¯)Bδ(y¯)y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y}) such that d(y¯,L(y))<εd(\bar{y},L(y))<\varepsilon, and

    F(ΩBρ(x¯))clL(y)Bρ(y¯)=;\displaystyle F(\Omega\cap B_{\rho}(\bar{x}))\cap{\rm cl}\,L(y)\cap B_{\rho}(\bar{y})=\emptyset; (11)
  2. (ii)

    stationary at (x¯,y¯)(\bar{x},\bar{y}) if and only if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon) and a yL(y¯)Bδ(y¯)y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y}) such that d(y¯,L(y))<ερd(\bar{y},L(y))<\varepsilon\rho, and condition (11) is satisfied;

  3. (iii)

    approximately stationary at (x¯,y¯)(\bar{x},\bar{y}) if and only if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon), a yL(y¯)Bδ(y¯)y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y}), and (x1,y1),(x2,y2)Bε(x¯,y¯)(x_{1},y_{1}),(x_{2},y_{2})\in B_{\varepsilon}(\bar{x},\bar{y}) such that d((x1,y1),gphF)<ερd((x_{1},y_{1}),{\rm gph}\,F)<\varepsilon\rho, d(x2,Ω)<ερd(x_{2},\Omega)<\varepsilon\rho, d(y2,L(y))<ερd(y_{2},L(y))<\varepsilon\rho, and

    F(x1+(Ωx2)(ρ𝔹X))(y1+(clL(y)y2)(ρ𝔹Y))=.\displaystyle F(x_{1}+(\Omega-x_{2})\cap(\rho\mathbb{B}_{X}))\cap(y_{1}+({\rm cl}\,L(y)-y_{2})\cap(\rho\mathbb{B}_{Y}))=\emptyset.

The statements of Theorem 3.1 and its corollaries can be easily adjusted to the current setting. For instance, Corollary 2 can be reformulated as follows.

Corollary 3.

Let XX and YY be Banach spaces, Ω\Omega and gphF{\rm gph}\,F be closed, δ>0\delta>0, and Ξδ\Xi^{\delta} be given by (10). Suppose that the triple {F,Ω,Ξδ}\{F,\Omega,\Xi^{\delta}\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}). If condition (QC)C(QC)_{C} is satisfied, then there is an M>0M>0 such that, for any ε>0\varepsilon>0, there exist (x1,y1)gphFBε(x¯,y¯)(x_{1},y_{1})\in{\rm gph}\,F\cap B_{\varepsilon}(\bar{x},\bar{y}), x2ΩBε(x¯)x_{2}\in\Omega\cap B_{\varepsilon}(\bar{x}), yL(y¯)Bδ(y¯)y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y}), y2clL(y)Bε(y¯)y_{2}\in{\rm cl}\,L(y)\cap B_{\varepsilon}(\bar{y}), and yNclL(y)C(y2)+ε𝔹Yy^{*}\in N^{C}_{{\rm cl}\,L(y)}(y_{2})+\varepsilon\mathbb{B}_{Y^{*}} such that y=1\|y^{*}\|=1, and condition (8) holds true.

If XX is Asplund and condition (QC)F(QC)_{F} is satisfied, then the above assertion holds true with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively.

The properties in Definition 3 are rather general. They cover various optimality and stationarity concepts in vector and set-valued optimization. With Ω\Omega, FF and LL as above, and points x¯Ω\bar{x}\in\Omega and y¯F(x¯)\bar{y}\in F(\bar{x}), the next definition seems reasonable.

Definition 4.

The point (x¯,y¯)(\bar{x},\bar{y}) is extremal for FF on Ω\Omega if there is a ρ(0,+]\rho\in(0,+\infty] such that

F(ΩBρ(x¯))L(y¯)Bρ(y¯)=.\displaystyle F(\Omega\cap B_{\rho}(\bar{x}))\cap L^{\circ}(\bar{y})\cap B_{\rho}(\bar{y})=\emptyset. (12)

Definition 4 covers both local (ρ<+\rho<+\infty) and global (ρ=+\rho=+\infty) extremality. The above concept is applicable, in particular, to solutions of the following set-valued minimization problem with respect to the preference determined by LL:

minimize F(x)subject to xΩ.\displaystyle\text{minimize }\;F(x)\quad\text{subject to }\;x\in\Omega. (PP)
Remark 6.
  1. (i)

    The concept in Definition 4 is broader than just (local) minimality as FF is not assumed to be an objective mapping of an optimization problem. It can, for instance, be involved in modeling constraints.

  2. (ii)

    The property in Definition 4 is similar to the one in the definition of fully localized minimizer in (BaoMor10, , Definition 3.1) (see also (KhaTamZal15, , p. 68)). The latter definition uses the larger set clL(y¯){y¯}{\rm cl}\,L(\bar{y})\setminus\{\bar{y}\} in place of L(y¯)L^{\circ}(\bar{y}) in (12). It is not difficult to check that the two properties are equivalent (when ρ<+\rho<+\infty). Unlike many solution concepts in vector optimization, the above definition involves “image localization” (hence, is in general weaker). It has proved to be useful when studying locally optimal allocations of welfare economics; cf. BaoMor10 ; KhaTamZal15 .

We next show that, under some mild assumptions on the level-set mapping LL, the extremality in the sense of Definition 4 can be treated in the framework of the extremality in the sense of Definition 3 (or its characterization in Proposition 4 (i)).

Proposition 5.

Let x¯Ω\bar{x}\in\Omega, y¯F(x¯)\bar{y}\in F(\bar{x}), δ>0\delta>0, and Ξδ\Xi^{\delta} be given by (10). Suppose LL satisfies conditions (O1) and (O5). If (x¯,y¯)(\bar{x},\bar{y}) is extremal for FF on Ω\Omega, then the triple {F,Ω,Ξδ}\{F,\Omega,\Xi^{\delta}\} is extremal at (x¯,y¯)(\bar{x},\bar{y}).

Proof.

In view of (O1), it follows from Proposition 3 (i) that

{yYd(y¯,L(y))<ε}L(y¯)Bε(y¯)for allε>0.\displaystyle\{y\in Y\mid d(\bar{y},L(y))<\varepsilon\}\cap L^{\circ}(\bar{y})\cap B_{\varepsilon}(\bar{y})\neq\emptyset\quad\mbox{for all}\quad\varepsilon>0. (13)

Suppose {F,Ω,Ξδ}\{F,\Omega,\Xi^{\delta}\} is not extremal at (x¯,y¯)(\bar{x},\bar{y}). Let ρ(0,+]\rho\in(0,+\infty]. By Proposition 4 (i), there exists an ε>0\varepsilon>0 such that, for any yL(y¯)Bδ(y¯)y\in L^{-}(\bar{y})\cap B_{\delta}(\bar{y}) with d(y¯,L(y))<εd(\bar{y},L(y))<\varepsilon, it holds

F(ΩBρ(x¯))clL(y)Bρ(y¯).\displaystyle F(\Omega\cap B_{\rho}(\bar{x}))\cap{\rm cl}\,L(y)\cap B_{\rho}(\bar{y})\neq\emptyset. (14)

In view of (13), there is a point yL(y¯)Bδ(y¯)L(y¯)Bδ(y¯)y\in L^{\circ}(\bar{y})\cap B_{\delta}(\bar{y})\subset L^{-}(\bar{y})\cap B_{\delta}(\bar{y}) with d(y¯,L(y))<εd(\bar{y},L(y))<\varepsilon, and we can choose a point y^\hat{y} belonging to the set in (14). Thus, yL(y¯)y\in L^{\circ}(\bar{y}) and y^clL(y)\hat{y}\in{\rm cl}\,L(y). Thanks to (O5), we have y^L(y¯)\hat{y}\in L^{\circ}(\bar{y}), and consequently, y^F(ΩBρ(x¯))L(y¯)Bρ(y¯)\hat{y}\in F(\Omega\cap B_{\rho}(\bar{x}))\cap L^{\circ}(\bar{y})\cap B_{\rho}(\bar{y}). Since ρ(0,+]\rho\in(0,+\infty] is arbitrary, (x¯,y¯)(\bar{x},\bar{y}) is not extremal for FF on Ω\Omega.

Thanks to Proposition 5, if the level-set mapping LL satisfies conditions (O1) and (O5), then extremal points of problem (PP) satisfy the necessary conditions in Theorem 3.1 and its corollaries.

5 Set-valued optimization: multiple mappings

It is not difficult to upgrade the model used in Definition 3 and the subsequent statements to make it directly applicable to constraint optimization problems: instead of a single mapping F:XYF:X\rightrightarrows Y with y¯F(x¯)\bar{y}\in F(\bar{x}) for some x¯ΩX\bar{x}\in\Omega\subset X and a single family Ξ\Xi of subsets of YY, one can consider finite collections of mappings Fi:XYiF_{i}:X\rightrightarrows Y_{i} between normed spaces together with points y¯iFi(x¯)\bar{y}_{i}\in F_{i}(\bar{x}), and nonempty families Ξi\Xi_{i} of subsets of YiY_{i} (i=1,,n)(i=1,\ldots,n).

This more general setting can be viewed as a structured particular case of the set-valued optimization model considered in Section 3 if one sets

Y:=Y1××Yn,F:=(F1,,Fn),y¯:=(y¯1,,y¯n) and Ξ:=Ξ1××Ξn.\displaystyle Y:=Y_{1}\times\ldots\times Y_{n},\quad F:=(F_{1},\ldots,F_{n}),\quad\bar{y}:=(\bar{y}_{1},\ldots,\bar{y}_{n})\and\Xi:=\Xi_{1}\times\ldots\times\Xi_{n}.

Thus, y¯F(x¯)\bar{y}\in F(\bar{x}), and AΞA\in\Xi means that A=A1××AnA=A_{1}\times\ldots\times A_{n} and AiΞiA_{i}\in\Xi_{i} (i=1,,n)(i=1,\ldots,n). To shorten the notation, we keep talking in this section about extremality/stationarity of the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} at (x¯,y¯)(\bar{x},\bar{y}).

Definition 5.

The triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is extremal (resp., stationary, approximately stationary) at (x¯,y¯)(\bar{x},\bar{y}) if the collection of n+1n+1 families of sets:

Ξ^i:={Ωi}(i=1,,n) and Ξ^n+1:={Ω×AAΞ}.\displaystyle\widehat{\Xi}_{i}:=\{\Omega_{i}\}\;\;(i=1,\ldots,n)\and\widehat{\Xi}_{n+1}:=\{\Omega\times A\mid A\in\Xi\}.

is extremal (resp., stationary, approximately stationary) at (x¯,y¯)(\bar{x},\bar{y}), where Ωi:={(x,y1,,yn)X×Y1××YnyiFi(x)}\Omega_{i}:=\{(x,y_{1},\ldots,y_{n})\in X\times Y_{1}\times\ldots\times Y_{n}\mid y_{i}\in F_{i}(x)\} (i=1,,n)(i=1,\ldots,n).

With the notation introduced above, Definitions 1 and 5 lead to characterizations of the extremality and stationarity of the triple {F,Ω,Ξ}\{F,\Omega,\Xi\} given in parts (i) and (ii) of Proposition 1. The corresponding characterization of the approximate stationarity is a little different. It is formulated in the next proposition.

Proposition 6.

The triple {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}) if and only if, for any ε>0\varepsilon>0, there exist a ρ(0,ε)\rho\in(0,\varepsilon), AiΞiA_{i}\in\Xi_{i} (i=1,,n)(i=1,\ldots,n), xiBε(x¯)x_{i}\in B_{\varepsilon}(\bar{x}) (i=1,,n+1)(i=1,\ldots,n+1), and yi,viBε(y¯i)y_{i},v_{i}\in B_{\varepsilon}(\bar{y}_{i}) (i=1,,n)(i=1,\ldots,n) such that d((xi,yi),gphFi)<ερd((x_{i},y_{i}),{\rm gph}\,F_{i})<\varepsilon\rho, d(vi,Ai)<ερd(v_{i},A_{i})<\varepsilon\rho (i=1,,n)(i=1,\ldots,n), d(xn+1,Ω)<ερd(x_{n+1},\Omega)<\varepsilon\rho and, for each xΩBρ(xn+1)x\in\Omega\cap B_{\rho}(x_{n+1}), there is an i{1,,n}i\in\{1,\ldots,n\} such that

Fi(xi+xxn+1)(yi+(Aivi)(ρ𝔹Y))=.\displaystyle F_{i}(x_{i}+x-x_{n+1})\cap\big{(}y_{i}+(A_{i}-v_{i})\cap(\rho\mathbb{B}_{Y})\big{)}=\emptyset.

Application of Theorem 1.1 in the current setting produces necessary conditions for approximate stationarity and, hence, also stationarity and extremality extending Theorem 3.1 and its corollaries. Condition (QC)C({QC})_{C} can be extended as follows:

    1. (QC^)C(\widehat{QC})_{C}

      there is an ε>0\varepsilon>0 such that i=1n+1xiε\big{\|}\sum_{i=1}^{n+1}x_{i}^{*}\big{\|}\geq\varepsilon for all (xi,yi)gphFiBε(x¯,y¯i)(x_{i},y_{i})\in{\rm gph}\,F_{i}\cap B_{\varepsilon}(\bar{x},\bar{y}_{i}), xiDCFi(xi,yi)(ε𝔹Yi)x_{i}^{*}\in D^{*C}F_{i}(x_{i},y_{i})(\varepsilon\mathbb{B}_{Y_{i}^{*}}) (i=1,,n)(i=1,\ldots,n), xn+1ΩBε(x¯)x_{n+1}\in\Omega\cap B_{\varepsilon}(\bar{x}) and xn+1NΩC(xn+1)x_{n+1}^{*}\in N^{C}_{\Omega}(x_{n+1}) such that i=1n+1xi=1\sum_{i=1}^{n+1}\|x_{i}^{*}\|=1,

while the corresponding extension (QC^)F(\widehat{QC})_{F} of condition (QC)F({QC})_{F} is obtained by replacing NCN^{C} and DCD^{*C} in (QC^)C(\widehat{QC})_{C} by NFN^{F} and DFD^{*F}, respectively. An extension of Corollary 3 takes the following form.

Theorem 5.1.

Let XX, Y1,,YnY_{1},\ldots,Y_{n} be Banach spaces, Ω\Omega and, for each i=1,,ni=1,\ldots,n, the graph gphFi{\rm gph}\,F_{i} and all members of Ξi\Xi_{i} be closed. Suppose {F,Ω,Ξ}\{F,\Omega,\Xi\} is approximately stationary at (x¯,y¯)(\bar{x},\bar{y}). If condition (QC^)C(\widehat{QC})_{C} is satisfied, then there is an M>0M>0 such that, for any ε>0\varepsilon>0, there exist (xi,yi)gphFiBε(x¯,y¯i)(x_{i},y_{i})\in{\rm gph}\,F_{i}\cap B_{\varepsilon}(\bar{x},\bar{y}_{i}), AiΞiA_{i}\in\Xi_{i}, viAiBε(y¯i)v_{i}\in A_{i}\cap B_{\varepsilon}(\bar{y}_{i}), yiNAiC(vi)+ε𝔹Yiy^{*}_{i}\in N^{C}_{A_{i}}(v_{i})+\varepsilon\mathbb{B}_{Y_{i}^{*}} (i=1,,n)(i=1,\ldots,n), and xn+1ΩBε(x¯)x_{n+1}\in\Omega\cap B_{\varepsilon}(\bar{x}) such that i=1nyi=1\sum_{i=1}^{n}\|y_{i}^{*}\|=1 and

0i=1nDCFi(xi,yi)(yi)+NΩC(xn+1)(M𝔹X)+ε𝔹X.\displaystyle 0\in\sum_{i=1}^{n}D^{*C}F_{i}(x_{i},y_{i})(y_{i}^{*})+N^{C}_{\Omega}(x_{n+1})\cap(M\mathbb{B}_{X^{*}})+\varepsilon\mathbb{B}_{X^{*}}.

If XX is Asplund and condition (QC^)F(\widehat{QC})_{F} is satisfied, then the above assertion holds true with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively.

Remark 7.
  1. (i)

    Proposition 2 (with F=(F1,,Fn)F=(F_{1},\ldots,F_{n}) in part (i)) gives two typical sufficient conditions for the fulfillment of conditions (QC^)C(\widehat{QC})_{C} and (QC^)F(\widehat{QC})_{F}.

  2. (ii)

    Theorem 5.1 covers (ZheNg06, , Theorems 3.1 and 3.2). In view of the previous item, it also covers (ZheNg06, , Corollary 3.2).

  3. (iii)

    Theorem 5.1 is a consequence of the dual necessary conditions for approximate stationarity of a collection of sets in Theorem 1.1. The latter theorem can be extended to cover a more general quantitative notion of approximate α\alpha-stationarity (with a fixed α>0\alpha>0), leading to corresponding extensions of Theorem 5.1 and its corollaries covering, in particular, dual conditions for ε\varepsilon-Pareto optimality in (ZheNg11, , Theorems 4.3 and 4.5).

Employing the multiple-mapping model studied in this section, one can consider a more general than (PP) optimization problem with set-valued constraints:

minimize F0(x)subject to Fi(x)Ki(i=1,,n),xΩ,\displaystyle\text{minimize }\;F_{0}(x)\quad\text{subject to }\;F_{i}(x)\cap K_{i}\neq\emptyset\;(i=1,\ldots,n),\;\;x\in\Omega, (𝒫\mathcal{P})

where Fi:XYiF_{i}:X\rightrightarrows Y_{i} (i=0,,n)(i=0,\ldots,n) are mappings between normed spaces, ΩX\Omega\subset X, KiYiK_{i}\subset Y_{i} (i=1,,n)(i=1,\ldots,n), and Y0Y_{0} is equipped with a level-set mapping LL. The “functional” constraints in (𝒫\mathcal{P}) can model a system of equalities and inequalities as well as more general operator-type constraints.

Using the set of admissible solutions

Ω^:={xΩFi(x)Ki,i=1,,n}\widehat{\Omega}:=\{x\in\Omega\mid F_{i}(x)\cap K_{i}\neq\emptyset,\;i=1,\ldots,n\},

we say that a point (x¯,y¯0)X×Y0(\bar{x},\bar{y}_{0})\in X\times Y_{0} is extremal in problem (𝒫\mathcal{P}) if it is extremal for F0F_{0} on Ω^\widehat{\Omega}. This means, in particular, that x¯Ω\bar{x}\in\Omega, y¯0F0(x¯)\bar{y}_{0}\in F_{0}(\bar{x}), and there exist y¯iFi(x¯)Ki\bar{y}_{i}\in F_{i}(\bar{x})\cap K_{i} (i=1,,n)(i=1,\ldots,n).

We are going to employ the model studied in the first part of this section with n+1n+1 objects in place of nn. There are n+1n+1 mappings F0,,FnF_{0},\ldots,F_{n} and nn sets K1,,KnK_{1},\ldots,K_{n} in (𝒫\mathcal{P}). As in (10), we define Ξ0δ:={clL(y)yLδ(y¯0)}\Xi_{0}^{\delta}:=\{{\rm cl}\,L(y)\mid y\in L_{\delta}(\bar{y}_{0})\} (δ>0)(\delta>0), where Lδ(y¯0)=(L(y¯0)Bδ(y¯)){y¯}L_{\delta}(\bar{y}_{0})=(L(\bar{y}_{0})\cap B_{\delta}(\bar{y}))\cup\{\bar{y}\}. Now, set

Y:=Y0××Yn,F:=(F0,,Fn),y¯:=(y¯0,,y¯n) and Ξδ:=Ξ0δ×K1××Kn.\displaystyle Y:=Y_{0}\times\ldots\times Y_{n},\;\;F:=(F_{0},\ldots,F_{n}),\;\;\bar{y}:=(\bar{y}_{0},\ldots,\bar{y}_{n})\and\Xi^{\delta}:=\Xi_{0}^{\delta}\times K_{1}\times\ldots\times K_{n}.

Using the same arguments, one can prove the next extension of Proposition 5.

Proposition 7.

Let x¯Ω\bar{x}\in\Omega, y¯0F0(x¯)\bar{y}_{0}\in F_{0}(\bar{x}), y¯iFi(x¯)Ki\bar{y}_{i}\in F_{i}(\bar{x})\cap K_{i} (i=1,,n)(i=1,\ldots,n), δ>0\delta>0, and FF, y¯\bar{y} and Ξδ\Xi^{\delta} be defined as above. Suppose LL satisfies conditions (O1) and (O5). If (x¯,y¯0)(\bar{x},\bar{y}_{0}) is extremal in problem (𝒫\mathcal{P}), then {F,Ω,Ξδ}\{F,\Omega,\Xi^{\delta}\} is extremal at (x¯,y¯)(\bar{x},\bar{y}).

Condition (QC^)C(\widehat{QC})_{C} in the current setting is reformulated as follows:

    1. (QC^)C(\widehat{QC})_{C}^{\prime}

      there is an ε>0\varepsilon>0 such that i=0n+1xiε\big{\|}\sum_{i=0}^{n+1}x_{i}^{*}\big{\|}\geq\varepsilon for all (xi,yi)gphFiBε(x¯,y¯i)(x_{i},y_{i})\in{\rm gph}\,F_{i}\cap B_{\varepsilon}(\bar{x},\bar{y}_{i}), xiDCFi(xi,yi)(ε𝔹Yi)x_{i}^{*}\in D^{*C}F_{i}(x_{i},y_{i})(\varepsilon\mathbb{B}_{Y_{i}^{*}}) (i=0,,n)(i=0,\ldots,n), xn+1ΩBε(x¯)x_{n+1}\in\Omega\cap B_{\varepsilon}(\bar{x}) and xn+1NΩC(xn+1)x_{n+1}^{*}\in N^{C}_{\Omega}(x_{n+1}) such that i=0n+1xi=1\sum_{i=0}^{n+1}\|x_{i}^{*}\|=1,

while the corresponding reformulation (QC^)F(\widehat{QC})_{F}^{\prime} of condition (QC^)F(\widehat{QC})_{F} is obtained by replacing NCN^{C} and DCD^{*C} in (QC^)C(\widehat{QC})_{C}^{\prime} by NFN^{F} and DFD^{*F}, respectively. In view of Proposition 7, Theorem 5.1 yields the following statement.

Corollary 4.

Let XX, Y0,,YnY_{0},\ldots,Y_{n} be Banach spaces, the sets Ω\Omega, gphFi{\rm gph}\,F_{i} (i=0,,n)(i=0,\ldots,n) and KiK_{i} (i=1,,n)(i=1,\ldots,n) be closed, and δ>0\delta>0. Suppose LL satisfies conditions (O1) and (O5). If (x¯,y¯0)(\bar{x},\bar{y}_{0}) is extremal in problem (𝒫\mathcal{P}) and condition (QC^)C(\widehat{QC})_{C}^{\prime} is satisfied, then there is an M>0M>0 such that, for any ε>0\varepsilon>0, there exist (xi,yi)gphFiBε(x¯,y¯i)(x_{i},y_{i})\in{\rm gph}\,F_{i}\cap B_{\varepsilon}(\bar{x},\bar{y}_{i}) (i=0,,n)(i=0,\ldots,n), xn+1ΩBε(x¯)x_{n+1}\in\Omega\cap B_{\varepsilon}(\bar{x}), yBδ(y¯0)y\in B_{\delta}(\bar{y}_{0}), v0clL(y)Bε(y¯0)v_{0}\in{\rm cl}\,L(y)\cap B_{\varepsilon}(\bar{y}_{0}), y0NclL(y)C(v0)+ε𝔹Y0y_{0}^{*}\in N^{C}_{{\rm cl}\,L(y)}(v_{0})+\varepsilon\mathbb{B}_{Y_{0}^{*}}, viKiBε(y¯i)v_{i}\in K_{i}\cap B_{\varepsilon}(\bar{y}_{i}) and yiNKiC(vi)+ε𝔹Yiy_{i}^{*}\in N^{C}_{K_{i}}(v_{i})+\varepsilon\mathbb{B}_{Y_{i}^{*}} (i=1,,n)(i=1,\ldots,n) such that i=0nyi=1\sum_{i=0}^{n}\|y_{i}^{*}\|=1 and

0i=0nDCF(xi,yi)(yi)+NΩC(xn+1)(M𝔹X)+ε𝔹X.\displaystyle 0\in\sum_{i=0}^{n}D^{*C}F(x_{i},y_{i})(y_{i}^{*})+N^{C}_{\Omega}(x_{n+1})\cap(M\mathbb{B}_{X^{*}})+\varepsilon\mathbb{B}_{X^{*}}.

If XX is Asplund and condition (QC^)F(\widehat{QC})_{F}^{\prime} is satisfied, then the above assertion holds true with NFN^{F} and DFD^{*F} in place of NCN^{C} and DCD^{*C}, respectively.

Declarations

Funding. Nguyen Duy Cuong is supported by Vietnam National Program for the Development of Mathematics 2021-2030 under grant number B2023-CTT-09.

Conflict of interest. The authors have no competing interests to declare that are relevant to the content of this article.

Data availability. Data sharing is not applicable to this article as no datasets have been generated or analysed during the current study.

Acknowledgments

A part of the work was done during Alexander Kruger’s stay at the Vietnam Institute for Advanced Study in Mathematics in Hanoi. He is grateful to the Institute for its hospitality and supportive environment.

References

  • (1) J.-P. Aubin and H. Frankowska. Set-Valued Analysis. Birkhäuser Boston Inc., Boston, MA, 1990. doi:10.1007/978-0-8176-4848-0.
  • (2) T. Q. Bao. Extremal systems for sets and multifunctions in multiobjective optimization with variable ordering structures. Vietnam J. Math., 42(4):579–593, 2014. doi:10.1007/s10013-014-0099-6.
  • (3) T. Q. Bao and B. S. Mordukhovich. Set-valued optimization in welfare economics. In Advances in Mathematical Economics, volume 13 of Adv. Math. Econ., pages 113–153. Springer, Tokyo, 2010. doi:10.1007/978-4-431-99490-9_5.
  • (4) J. M. Borwein and Q. J. Zhu. Techniques of Variational Analysis. Springer, New York, 2005.
  • (5) H. T. Bui and A. Y. Kruger. Extremality, stationarity and generalized separation of collections of sets. J. Optim. Theory Appl., 182(1):211–264, 2019. doi:10.1007/s10957-018-01458-8.
  • (6) F. H. Clarke. Optimization and Nonsmooth Analysis. John Wiley & Sons Inc., New York, 1983. doi:10.1137/1.9781611971309.
  • (7) N. D. Cuong, A. Y. Kruger, and N. H. Thao. Extremality of families of sets. Optimization, 2024. doi:10.1080/02331934.2024.2385656.
  • (8) A. L. Dontchev and R. T. Rockafellar. Implicit Functions and Solution Mappings. A View from Variational Analysis. Springer Series in Operations Research and Financial Engineering. Springer, New York, 2 edition, 2014. doi:10.1007/978-1-4939-1037-3.
  • (9) A. D. Ioffe. Variational Analysis of Regular Mappings. Theory and Applications. Springer Monographs in Mathematics. Springer, 2017. doi:10.1007/978-3-319-64277-2.
  • (10) A. A. Khan, C. Tammer, and C. Zălinescu. Set-Valued Optimization: An Introduction with Applications. Vector Optimization. Springer, Heidelberg, 2015. doi:10.1007/978-3-642-54265-7.
  • (11) A. Y. Kruger. On Fréchet subdifferentials. J. Math. Sci. (N.Y.), 116(3):3325–3358, 2003. doi:10.1023/A:1023673105317.
  • (12) A. Y. Kruger and B. S. Mordukhovich. Extremal points and the Euler equation in nonsmooth optimization problems. Dokl. Akad. Nauk BSSR, 24(8):684–687, 1980. In Russian. Available from: https://asterius.federation.edu.au/akruger/research/publications.html.
  • (13) G. Li, K. F. Ng, and X. Y. Zheng. Unified approach to some geometric results in variational analysis. J. Funct. Anal., 248(2):317–343, 2007.
  • (14) B. S. Mordukhovich. Variational Analysis and Generalized Differentiation. I: Basic Theory, volume 330 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer, Berlin, 2006. doi:10.1007/3-540-31247-1.
  • (15) B. S. Mordukhovich. Variational Analysis and Generalized Differentiation. II: Applications, volume 331 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer, Berlin, 2006.
  • (16) B. S. Mordukhovich and Y. Shao. Extremal characterizations of Asplund spaces. Proc. Amer. Math. Soc., 124(1):197–205, 1996.
  • (17) B. S. Mordukhovich, J. S. Treiman, and Q. J. Zhu. An extended extremal principle with applications to multiobjective optimization. SIAM J. Optim., 14(2):359–379, 2003.
  • (18) R. R. Phelps. Convex Functions, Monotone Operators and Differentiability, volume 1364 of Lecture Notes in Mathematics. Springer-Verlag, Berlin, 2nd edition, 1993. doi:10.1007/978-3-662-21569-2.
  • (19) R. T. Rockafellar and R. J.-B. Wets. Variational Analysis. Springer, Berlin, 1998.
  • (20) X. Y. Zheng and K. F. Ng. The Fermat rule for multifunctions on Banach spaces. Math. Program., 104(1):69–90, 2005. doi:10.1007/s10107-004-0569-9.
  • (21) X. Y. Zheng and K. F. Ng. The Lagrange multiplier rule for multifunctions in Banach spaces. SIAM J. Optim., 17(4):1154–1175, 2006. doi:10.1137/060651860.
  • (22) X. Y. Zheng and K. F. Ng. A unified separation theorem for closed sets in a Banach space and optimality conditions for vector optimization. SIAM J. Optim., 21(3):886–911, 2011. doi:10.1137/100811155.
  • (23) Q. J. Zhu. Hamiltonian necessary conditions for a multiobjective optimal control problem with endpoint constraints. SIAM J. Control Optim., 39(1):97–112, 2000.