Relaxed constant positive linear dependence constraint qualification and its application to bilevel programs
Mengwei Xu and Jane J. Ye
School of Mathematics, Tianjin University, Tianjin, 300072, China.
E-mail: [email protected]. The
research of this author was supported by the National Natural
Science Foundation of China under Projects No. 11601376Corresponding author. Department of Mathematics
and Statistics, University of Victoria, Victoria, B.C., Canada V8W 2Y2. E-mail: [email protected].
The research of this author was partially supported by NSERC.
Abstract. Relaxed constant positive linear dependence constraint qualification (RCPLD) for a system of smooth equalities and inequalities is a constraint qualification that is weaker than the usual constraint qualifications such as Mangasarian Fromovitz constraint qualification and the linear constraint qualification. Moreover RCPLD is known to induce an error bound property. In this paper we extend RCPLD to a very general feasibility system which may include Lipschitz continuous inequality constraints, complementarity constraints and abstract constraints. We show that this RCPLD for the general system is a constraint qualification for the optimality condition in terms of limiting subdifferential and limiting normal cone and it is a sufficient condition for the error bound property under the strict complementarity condition for the complementarity system and Clarke regularity conditions for the inequality constraints and the abstract constraint set. Moreover we introduce and study some sufficient conditions for RCPLD including the relaxed constant rank constraint qualification (RCRCQ). Finally we apply our results to the bilevel program.
A constraint qualification is a condition imposed on the constraint region of a mathematical program under which the
Karush-Kuhn-Tucker (KKT) condition holds at any local optimal solution. Other than guaranteeing KKT condition holding at all local optimal solutions, some constraint qualifications also lead to existence of error bounds to the feasible region and hence play a key role in
convergence analysis of certain computational methods. Hence studying constraint qualifications is essential in both theoretical and numerical points of view.
For smooth nonlinear programs with equality and inequality constraints, the classical constraint qualifications are the linear independence constraint qualification (LICQ),
Mangasarian-Fromovitz constraint qualification (MFCQ), and the linear constraint qualification (LCQ), i.e., all functions in the equality and inequality constraints are affine. These three classical constraint qualifications may be too restrictive for many problems.
Janin [16] relaxed LICQ and proposed the constant rank constraint qualification (CRCQ),
which neither implies nor is implied by MFCQ. The concept of CRCQ was weakened to the relaxed constant rank constraint qualification (RCRCQ) which is shown to be a constraint qualification by Minchenko and Stakhovshi [19].
Qi and Wei [26] introduced the concept of the constant positive linear dependence (CPLD) condition which is weaker than both CRCQ and MFCQ. CPLD was shown to be a constraint qualification by Andreani, Martínez and Schuverdt in [2].
In [1], Andreani, Haeser, Schuverdt and Silva introduced the following relaxed version of CPLD for a system of smooth equality and inequality constraints:
where are smooth at , a feasible solution. is said to satisfy the relaxed constant positive linear dependence constraint qualification if there exists , a neighborhood of such that
(i)
has the same rank for every .
(ii)
Let be such that is a basis for span. For every , if is positive linearly dependent, i.e.,
there exist scalars , , , not all zero such that
then is linearly dependent for every .
It is easy to see that in the case where either LCQ or MFCQ holds, RCPLD also holds. Hence RCPLD is weaker than LCQ and MFCQ.
In [1], the authors not only showed that RCPLD is a constraint qualification but also proved that if all functions have second-order derivatives at all points near the point , then RCPLD is a sufficient condition for the error bound property: , a neighborhood of such that
where , , , is the distance from to set , denotes any norm and , where the maximum is taken component-wise. Moreover as an open question, in [1], a question was asked on whether or not it was possible to prove the error bound property without imposing the second-order differentiability of all functions. In Guo, Zhang and Lin [14], it was shown that the error bound property holds under RCPLD without imposing the second order differentiability of all functions. Other than using it as a constraint qualification to ensure the KKT condition holds, RCPLD
is also used in the convergence analysis of the augmented Lagrangian method to obtain a KKT point (see e.g., [1, 3, 15, 28]).
Recently, Guo and Ye [13, Corollary 3] extended RCPLD to the case where there is an extra abstract constraint set and showed that it is still a constraint qualification. In [11, Definition 4.3], a version of RCPLD called MPEC RCPLD was introduced for the mathematical programs with
equilibrium constraints (MPEC) and was shown in [10, Corollary 4.1] that it is a constraint qualification for M-stationary conditions. Moreover, it was shown in [14, Theorem 5.1] that the RCPLD for MPECs ensures the error bound property under the assumption of strict complementarity.
In this paper, we extend RCPLD to the following very general feasibility system:
(1.1)
where is the th dimensional complementarity set,
with
closed, , ,
the functions , are locally Lipschitz continuous and , , are continuously differentiable at , a feasible solution.
Denote the feasible region of system by .
For a feasible point , we define the following index sets:
Definition 1.1 (RCPLD for the nonsmooth system (1.1))
We say that the relaxed constant positive linear dependence constraint qualification holds at for system (1.1) if
the following conditions hold:
(i)
The vectors
have the same rank
for all in a neighbourhood of .
(ii)
Let , , be such that the set of vectors
is a basis for
For any index sets , ,
if there exists a nonzero vector satisfying for ,
and , , for such that
then for all sufficiently large, the set of vectors
(1.2)
where , , , , and ,
is linearly dependent for all sequences satisfying , ,
as .
Remark 1.1
In Definition 1.1 (ii), we use sequences instead of neighborhoods. Although we could also use a neighborhood in the definition equivalently, it is more convenient to use the sequential form since if the point is an isolated non-differentiable point, i.e., there exists a neighborhood around where is differentiable, then can be taken as the gradient . Since a Lipschitz continuous function is differentiable almost everywhere and so such points are abundant.
In the case where the rank of is equal to , it is easy to see that RCPLD holds automatically.
What is more, in Theorem 4.1 we will show that the error bound property holds in this case.
Note that Definition 1.1 is weaker than the one defined in Guo and Ye [13, Corollary 3] for the system containing only smooth equality and inequality constraints and one abstract constraint, in which the stronger condition is linearly dependent for every required.
If the set of vectors in (1.2) is replaced by the following set of vectors
then since ,
the resulting condition would be stronger than the RCPLD defined in Definition 1.1. We illustrate this by Example 4.1.
In this paper we show that RCPLD for the nonsmooth feasibility system is a constraint qualification for any optimization problem with a Lipschitz objective function and the constraints described as in the feasibility system (1.1). Moreover with some extra conditions, we will show that RCPLD is a
sufficient condition for the following error bound property: , a neighborhood of such that
(1.3)
where .
One of the motivations to extend the concept of RCPLD to the nonsmooth system (1.1) is to study the constraint qualification and optimality condition for the following bilevel program:
where denotes the solution set of the lower level program
and ,
is locally Lipschitz continuous, and are continuously differentiable,
, are continuously differentiable and twice continuously differentiable with respect to variable .
Bilevel programs naturally fall in the domain of global optimization since in the lower level problem, the global optimal solution is always required in either optimality conditions or numerical algorithms. A popular approach to deal with (BP) is to replace the set of global solutions by the KKT optimality conditions of the lower level problem. This reformulation is based on the fact that if the lower level problem is a convex program for each fixed and certain constraint qualification holds, then if and only if its KKT optimality condition holds. But due to the introduction of multipliers for the lower level problem as extra variables, the resulting reformulation may not be equivalent to the original bilevel program even in the case where is convex but the multipliers are not unique. For the discussion of this issue and the recent new results, the reader is referred to recent paper [31] and the reference within for more discussions.
If the lower level problem is not a convex program for each fixed , the KKT condition for the lower level problem is usually only a necessary but not sufficient condition for optimality and moreover, as pointed out by Mirrlees in [20], such a reformulation by the KKT condition may miss out the true optimal solution of the original bilevel program.
Instead of using as a constraint in (BP), Outrata [25] proposed to replace it with where is the value function of the lower level program for a numerical consideration. This so-called value function approach was further used in Ye and Zhu [32] and later in other papers such as [7, 8, 21, 22, 23, 24] to derive various necessary optimality conditions under the partial calmness condition. The partial calmness condition for the value function reformulation of the bilevel program, however, is a very strong assumption. To derive a necessary optimality condition under weaker assumptions, Ye and Zhu [33] proposed the following combined program where both the value function constraint and the KKT condition of the lower level program are used as constraints:
As discussed in [33], such a reformulation can avoid some difficulties caused by using the value function or the classical KKT approach alone. In [30], necessary optimality conditions in the form of Mordukhovich (M-)
stationary condition for (CP) is studied under the partial calmness/weak calmness condition. These conditions, however, may not be easy to verify.
Note that the inclusion of the value function makes the problem nonsmooth since the value function is usually nonsmooth but under some reasonable assumptions on the lower level problem, the value function is Lipschitz continuous. Hence the feasible set of (CP) is a special case of the general feasibility system (1.1). However as it was shown in Ye and Zhu [33, Proposition 1.3], the no nonzero abnormal multiplier constraint qualification (NNAMCQ) as defined in Definition 4.1 never holds for (CP). Being able to deal with nonsmooth inequality constraints in RCPLD would allow us to present verifiable constraint qualifications and exact penalty for the reformulation of the bilevel program in which the value function is used, such as (CP).
The rest of the paper is organized as follows. In Section 2, we present basic definitions
as well as some preliminaries which will be used in this paper. In Section 3, we show that RCPLD is a constraint qualification and a sufficient condition for error bound properties under certain regularity conditions. We introduce some sufficient conditions for RCPLD, which are easier to verify in Section 4. In Section 5, we apply RCPLD to the bilevel program.
We adopt the following standard notation in this paper. For any two vectors and , we denote by either or their inner product. Given a differentiable function , we denote its
gradient vector by . For a differentiable mapping with and a vector , we denote by
the Jacobian matrix of at . For a set , we denote by int , co , , bd and the interior, the convex hull, the closure, the boundary of and the distance from to , respectively.
We denote by the cardinality of index set . For any vector and a given index set , we use to denote the vector in with components . For a matrix , denotes its transpose, denotes its rank. We denote by the open ball center at with radius and by the open unit ball center at the origin. Unless otherwise specified we denote by any norm in the finite dimensional space.
2 Background and preliminaries
In this section, we present some background materials on variational analysis which will be used throughout the paper.
Detailed discussions on these
subjects
can be found in [4, 5, 21, 27].
Given a set , its Fréchet/regular normal cone at is defined by
and the limiting/Mordukhovich/basic normal cone to at point is defined by
A set is Clarke regular at if it is locally closed at and [27, Definition 6.4]. Note that for with
closed, , by [27, Proposition 6.41], is regular at if and only if each is regular at .
The exact expressions for the limiting normal cone of the complementarity set present below will be useful. In this paper we denote by the complementarity set and when , .
Proposition 2.1
(See e.g. [29, Proposition 3.7])
For any lying in the complementarity set , the limiting normal cone to at is
Let be a lower semicontinuous function and finite at .
We define the Fréchet/regular subdifferential
([27, Definition 8.3])
of at
as
and the limiting/Mordukhovich/basic subdifferential of at
as
Let be Lipschitz continuous at . We say that is subdifferentially/Clarke regular at
provided that [27, Corollary 8.11].
The following proposition collects some useful properties and calculus rules of the limiting subdifferential.
Proposition 2.2
(i)
[27, Exercise 10.10] and [21, Theorem 2.33] Let be proper lower semi-continuous around and finite at , and be nonnegative scalars. Assume that at least one of them is Lipschitz continuous around . Then
(ii)
[17, Theorem 2.5, Remark (2)] and [21, Theorem 3.41]Let be Lipschitz continuous around and be Lipschitz continuous near . Then the composite function is Lipschitz continuous around and
(iii)
[21, Theorem 3.46 and Proposition 1.113] Let be Lipschitz continuous around and
and
Then and are Lipschitz continuous
around and
where and ,
and the first inclusion holds as an equation if each is subdifferentially regular at .
Taking into account that all norms in a finite dimensional space are equivalent, the following formula for distance function to the complementarity set can be used in the error bound property (1.3).
Proposition 2.3
(see e.g. [18]) When the norm is chosen to be the -norm in the distance function , for any ,
When the norm is chosen to be the -norm in the distance function , for any ,
In order to calculate , we define the following index sets for each :
(2.2)
From the calculus rules in Proposition 2.2,
we have the following estimate for the limiting subdifferential of .
Lemma 2.1
Assume that the functions , are locally Lipschitz continuous and , , are continuously differentiable around .
is a Lipschitz continuous function and for any , there exist ,
and , satisfying
(2.3)
such that
Proof. For , let and . From the chain rule in Proposition 2.2(ii), we have
We divide the analysis into three cases.
Case 1:
. In this case we have and hence .
By the chain rule we have if and if . If ,
then and hence .
Case 2:
. Similarly as in Case 1, we can show .
Case 3:
. In this case, . If , then and
. Since , by Proposition 2.2(iii), . If , then and . It follows by Proposition 2.2(iii) that
. If , then , . If , then . If , then
. Hence
Summarizing the above cases, we have that for any ,
where holds.
Since , and
are all Lipschitz continuous around , by the calculus rules in Proposition 2.2 and (2.3),
there exist parameters ,
and , satisfying (2.3) such that
We will need the following result which is an extension of Carathéodory’s lemma.
Lemma 2.2
[1, Lemma 1]
If with for every , is linearly independent and , , then there exist and scalars for every such that
(i) with for every ,
(ii) is linearly independent.
3 RCPLD as a constraint qualification and a sufficient condition for error bounds
We first show that the RCPLD introduced in Definition 1.1 is a constraint qualification for any optimization problem in the form
where is Lipschitz continuous around any local optimal solution and is the feasible region defined by the system (1.1).
Theorem 3.1
Let be a local minimizer of the optimization problem .
Suppose that RCPLD holds at .
Then is an M-stationary point, i.e., there exist
, , , , , and , such that
Proof.
Step 1: Note that a point is feasible for problem () if and only if
For each , we consider the following penalized problem:
where denotes the 2-norm, is such that for all feasible point .
Since the feasible region is compact and the objective function is continuous, there exists an optimal solution of the problem . Taking subsequence if necessary, we assume that and thus . Moreover by the optimality of , we have
(3.1)
From the boundedness of , we have that
as . It follows that is a feasible point of the problem . Condition yields
Taking limit as , we obtain
which means that . Thus the sequence converges to .
Step 2: Since for sufficiently large , is an interior point of , by the necessary optimality condition in terms of limiting subdifferential and the nonsmooth calculus rule, there exist , , such that
(3.2)
Suppose that there is a subsequence of such that all equal to zero in this subsequence.
Since is
Lipschitz continuous, its limiting subdifferential is compact and so the sequences
is compact. Taking subsequence if necessary, we assume and
. Taking limit as in , we have by the outer semi-continuity of the limiting subdifferential and thus is a stationary point of problem () automatically. So without loss of generality, we may assume for all sufficiently large .
By Lemma 2.1, since there exist
,
,
, and , such that and
By the continuity of , it is easy to see that for sufficiently large . Similarly, by the continuity of and , and
for sufficiently large .
Let if , if , and if .
Then we have that .
Since , it follows that
Since , we have . We denote by with .
Let , and be such that is a basis for .
Then by RCPLD (i),
is linearly independent and thus is a basis of .
Hence there exist
,
such that
(3.3)
where ,
or if .
Since , applying Carathéodory’s lemma in Lemma 2.2 to (3.3), we obtain subsets
and
with , ,
, and or if
such that
(3.4)
and the set of vectors is linearly independent.
Since the index sets are all finite, for every large , we may assume , , and without loss of generality. Hence the set of vectors
is linearly independent.
From , the condition reduces to
(3.5)
where , ,
if , or if .
Step 3: We now prove that the sequence must be bounded.
To the contrary, assume that it is unbounded. Let . Then there exists a subsequence such that and
where
, and , . Since and , it follows from the outer semicontinuity of the limiting normal cone that and for each .
It is easy to see that if and .
Without loss of generality, assume that
and
. Then by the outer semi-continuity of the limiting subdifferential, we have and .
Dividing on both sides of and taking limits with , we have
By RCPLD(ii), since the vectors , are not all equal to zero, , , the set of vectors
must be linearly dependent. Since for each , this is a contradiction to the conclusion in step 2.
The contradiction proves that is bounded.
Without loss of generality, assume that as . It is easy to see that ,
.
Taking limits for a subsequence in , we derive that is an M-stationary point of .
We now perform the next task of proving that RCPLD is a sufficient condition for the error bound property under extra regularity conditions. First we prove the following result which shows that RCPLD is persistent locally.
Proposition 3.1
Assume that holds at a feasible point of the system
(3.6)
Then holds at every point belongs to a sufficiently small neighborhood of .
Proof. Consider any sequence as .
It is obvious that
(i) holds at each when is sufficiently large.
Assume that with is a basis for span. Then is a basis of span for any large .
We now show that RCPLD (ii) holds at for any sufficiently large .
To contrary, assume that RCPLD(ii) does not hold at each point of a subsequence . Then there exist an index set , , and a nonzero vector with , such that
(3.7)
but the set of vectors
where ,
is linearly independent for some
sequences
with ,
, as .
Since is a finite set, we may consider a subsequence such that for every large .
Let . Suppose there exists a subsequence such that
with .
Without loss of generality, assume that
and we assume , .
Since as ,
we also have as .
By the diagonalization law, there exists a sequence converging to such that for each , , , ,
and is linearly independent for all large and .
Dividing by in both sides of
and letting , we have that and
From RCPLD (ii),
the set of vectors
must be linearly dependent, which is a contradiction.
The contradiction shows that holds at for sufficiently large.
Theorem 3.2
Assume the holds at a feasible point of the system ,
are subdifferentially regular and is Clarke regular around ,
then there exist and such that
where denotes the set of feasible points satisfying system (3.6).
Proof. If is an interior point of , then for all and hence the result holds automatically. Now assume that lies in the boundary of . Let .
We rewrite the feasible set by
Assume for a contradiction that there exists such that
Obviously . Let be the projector of to . Then and .
For each , is an optimal solution of the following problem:
where denotes the 2-norm.
Since the RCPLD persists in a neighborhood of , RCPLD holds at for sufficiently large. From Theorem 3.1, is a limiting stationary point of . By the optimality condition,
there exist parameters , for and such that
(3.8)
Let , and .
Assume that with is a basis for span.
From Lemma 2.2, we obtain
and
with
, for and ,
such that
(3.9)
with bounded multipliers from Theorem 3.1, for , .
Let for sufficiently .
From the subdifferentially regularity of , for sufficiently large ,
Similarly, .
Furthermore, for each ,
Then from , we have
This means
which is a contradiction.
Therefore the error bound property holds.
The error bound property for the general system (1.1) can be now obtained from Theorem 3.2. It extends [14, Theorem 5.1] to allow nonsmooth inequality constraints and abstract constraints.
Corollary 3.1
Assume the holds at ,
are subdifferentially regular and is Clarke regular around , the constraint satisfies the strict complementarity condition at ,
then there exist and such that
(3.10)
where and denotes the set of feasible points satisfying system (1.1).
Proof. Since the strict complementarity holds at , we have . Hence for all sufficiently close to , we can represent it as a solution to the system
Then the error bound property follows by Theorem 3.2 and the equivalence of the finite dimensional norm.
4 Sufficient conditions for RCPLD
Note that although the RCPLD is a weak condition, it may not be easy to verify. In this section we investigate sufficient conditions for RCPLD which are stronger but easier to verify.
It is easy to see that the following well-known constraint qualification implies RCPLD.
Definition 4.1
Let . We say that the no nonzero abnormal multiplier constraint qualification (NNAMCQ) holds at if there is no nonzero vector
satisfying for ,
and , , for such that
It is obvious that when the rank of is equal to , RCPLD holds automatically. This condition is easy to verify and moreover it is not just a constraint qualification but also a sufficient condition for error bounds without imposing any regularity conditions.
Theorem 4.1
For a feasible point ,
suppose that
the rank of is equal to . Then the error bound property (1.3) holds at .
Proof.
Around the point , we can equivalently formulate the complementarity system as
Hence the constraints
can be treated as equality constraints. We denote by .
If is an interior point of , then for all and hence the result holds automatically. Now assume that lies in the boundary of .
We rewrite the feasible set by
To a contrary, assume that there exists such that
(4.1)
For any , and by the Taylor expansion,
(4.2)
From , , which implies that . Taking subsequence if necessary, let . Dividing the both sides of (4.2) by and letting , we have
Similarly from the above discussion, we have
This means that is linearly independent with all vectors in . Since by assumption the rank of is , this is impossible since these vectors lie in . The proof of the theorem is therefore complete.
In the following definition we extend the well-known concept of RCRCQ (see [19] for the smooth equality and inequality systems and [11, Definition 3.4] for system with complementarity constraints) to our general system (1.1). RCRCQ is stronger than the RCPLD but may be easier to verify.
Definition 4.2
Let .
We say that the relaxed constant rank constraint qualification holds at if for all sufficiently large ,
any index sets , , , and any vectors , (), with and , the set of vectors
and the set of vectors
where ,
,
have the same rank for all sequences satisfying , , as , , .
Proposition 4.1
RCRCQ implies RCPLD.
Proof. Let satisfy RCRCQ. Taking , we have that RCPLD (i) holds. We now show the RCPLD (ii) holds at . Let , , be such that the set of vectors
is a basis for
Let , be the index sets such that
there exists a nonzero vector satisfying for ,
and , , for satisfying
Then the vectors
where and ,
is linearly dependent. Since
is a basis for
it follows that
is linearly dependent as well.
By RCRCQ, it follows that
is linearly dependent
for all sequences satisfying , , , , , .
From RCPLD (i), is a basis of and thus
is linearly dependent. Since for any sufficiently large , RCPLD (ii) holds.
Definition 4.3
We say that the Linear Constraint Qualification (LCQ) holds if
all functions are linear and the set is the union of finitely many polyhedral sets.
We now show that LCQ implies RCRCQ.
Proposition 4.2
LCQ implies RCRCQ holds at each .
Proof.
Since is the union of finitely many polyhedral convex sets, for all and sufficiently large , we have
if and . Moreover all functions are linear. Therefore the matrix is a constant matrix for all , , and therefore RCRCQ holds.
Example 4.1
Consider the nonsmooth system:
where is the graph of a continuous function as shown in Fig.1 and
. We set when , for .
Between any two points of the form and , or and , the graph of describes the edge of an isosceles triangle whose
apex is located at or .
Consider the feasible solution .
Figure 1: Feasible region
We have ,
Since is symmetrical, we only give the expression of the normal cone for the case when and , for ,
and
Also we have .
Hence .
Let being an element in the normal cone . For any with , , with , we have
if but . This shows that RCRCQ fails at .
Since the condition
(4.5)
holds when
, , . Hence the NNAMCQ fails at .
We now verify that RCPLD holds. Actually it is easy to see that
holds with not all equal to zero if and only if and
either (a) or (b) below holds:
(a)
,
.
(b)
,
.
In either case (a) or (b), the set . For any ,
denote by where , and . We also denote by .
Note that at any such that and is large enough, is differentiable and .
Since the matrix
is not full rank,
the set of vectors , , is always linearly dependent and thus RCPLD holds at .
Note that
if for large ,
it is easy to see that the rank of the matrix
equals to and thus
the set of vectors , is linearly independent. Hence as pointed out in Remark 1.1, our definition for RCPLD is weaker than the condition that the set of vectors , is linearly dependent.
5 Applications to bilevel programs
In this section we apply RCPLD to the combined program (CP).
Throughout this section we assume that the value function is Lipschitz continuous at the point of interest. For the case where is independent of , we can use Danskin’s theorem and for the general case one can use Proposition 5.2 which is a special case of [4, Theorem 6.5.2].
Other weaker sufficient conditions for Lipschitz continuity of the value function as well as the estimates for its subdifferentials can be found for examples in [12].
Proposition 5.1 (Danskin’s Theorem)
([5, page 99] or [6]) Let be a compact set and be a function defined on that is continuously differentiable at . Then the value function
is Lipschitz continuous near and its Clarke subdifferential is
,
where is the Clarke subdifferential of at .
Proposition 5.2
Assume that the set-valued map is uniformly bounded around , i.e., there exists a neighborhood of such that the set is bounded. Suppose that MFCQ holds at for all . Then the valued function is Lipschitz continuous near and
where
where .
Note that if in addition to the assumptions of Proposition 5.2, is inner semicontinuous at for some , then the union sign can be omitted in Proposition 5.2; see [21, Corollary 1.109].
In the case where the LICQ holds at each , the set of multipliers is a singleton and by
[9, Corollary 5.4], the inclusion becomes an equality in the above proposition
and is Clarke regular around . In this case, LICQ for the lower level problem holds at every , for all near due to the outer semi-continuity of the solution mapping . Thus we have
By Carathéodory’s theorem, the convex set can be represented by not more than elements. It follows that for any , there exist , such that
.
Given a feasible vector of the problem (CP), define the following index sets:
Theorem 3.1 can now be applied to the problem (CP) to obtain the M-stationary condition under RCPLD at any local optimal solution.
Theorem 5.1
Let be a local solution of and suppose that the value function is Lipschitz continuous at .
If the RCPLD holds at , then
is an M-stationary point of problem (CP).
In the rest of this section, we apply the sufficient conditions for RCPLD which are introduced in Section 4 to the bilevel programs. The following theorem follows immediately from Theorem 4.1. The condition is easy to verify since the nonsmooth constraint is not needed in the verification.
Theorem 5.2
Let be a local solution of and suppose that the value function is Lipschitz continuous at .
If the rank of the matrix
is equal to ,
then
RCPLD holds and
is an M-stationary point of problem (CP). Moreover is a local optimal solution of the penalized problem for some :
where .
Proof. Let .
It is easy to see that
where is the identity matrix of size .
From Theorem 4.1, the error bound property holds for the problem (CP), i.e., there exist and such that
where denotes the feasible region of problem (CP).
It follows from Clarke’s exact penalty principle [4, Proposition 2.4.3] that
the problem is exact with with being the Lipschitz constant of the function .
Corollary 5.1
Let be a local solution of . Suppose that the value function is Lipschitz continuous at and LICQ holds at .
Suppose
the matrix
has full column rank . Then is an M-stationary point of problem (CP) and a local optimal solution of the penalized problem for some .
Proof. Since LICQ holds at , the rank of the matrix
equals to . Then and thus the conclusions in Theorem 5.2 hold.
The following example illustrates the application of Theorem 5.2 and Corollary 5.1.
Example 5.1
Consider the following bilevel program:
where is a Lipschitz continuous function.
It is easy to see that the solution set for the lower level program is
and the value function is
In fact since the lower level feasible set is uniformly bounded whenever and LICQ holds at each , , we can conclude that the value function is Lipschitz continuous without actually calculating it.
For any , the KKT condition for the lower level problem is
Hence the combined problem can be written as follows:
It is easy to see that the feasible region of the bilevel problem contains three points: . From calculation,
Suppose that the optimal solution of the bilevel problem is and the one for the combined program is .
The index sets and is empty. The rank of the matrix
is equal to . Hence by Corollary 5.1, is an M-stationary point of problem (CP).
Suppose that
the optimal solution of the bilevel problem is and the one for the combined program is .
The index sets and both are empty. The rank of the matrix
is equal to . Hence by Theorem 5.2, is an M-stationary point of problem (CP).
Moreover
and are local solutions of the penalized problem for some :
Since RCRCQ is a stronger condition for RCPLD, the following result follows from
Theorem 3.1 and Propositions 5.1 and 5.2.
Theorem 5.3
Let be a local solution of . Suppose that either is independent of with compact or the set-valued map is uniformly bounded around and MFCQ holds at for all .
Given
index sets and , , denote the matrix
where denotes the matrix with as its rows, and is the vector such that the -th component is one and others are zero.
Assume that the matrix where has the same rank with the matrix for all sequences satisfying , .
Then for hold at and is an M-stationary point of problem (CP).
The following example illustrate the result.
Example 5.2
Consider the following bilevel program:
where is a Lipschitz continuous function.
It is easy to see that the solution set for the lower level program is
and the value function
In fact we do not need the above explicit representation of the value function. Indeed,
since the constraint set is compact, by Danskin’s theorem, for any around the Clarke subdifferential of the value function is equal to
The combined problem can be written as follows:
It is easy to see that the bilevel program only has three kinds of optimal solutions, with , with and with .
We now verify that RCRCQ holds in the following three cases.
(a) Consider the optimal solution with , the corresponding solution for the combined program is .
Obviously,
the index sets and both .
For , denote by
where . It is easy to see that for any , and any and . Hence
RCRCQ holds by Theorem 5.3.
(b) Consider the optimal solution with , the corresponding solution for the combined program is with and .
The index sets and , and for any around , , .
For , denote by
It is easy to see that or any and any and . Hence
RCRCQ holds from Theorem 5.3.
(c) Consider the optimal solution with , the corresponding solution for the combined program is with and . Similarly as in case (b), RCRCQ holds.
Acknowlegement
The authors would like to thank the anonymous referees for their helpful suggestions and comments which help us to improve the presentation of the paper.
References
[1] R. Andreani, G. Haeser, M.L. Schuverdt and P.J.S. Silva, A relaxed constant positive linear dependence constraint qualification and applications, Math. Program., 135(2012), 255–273.
[2] R. Andreani, J.M. Martínez and M.L. Schuverdt, On the relation between constant positive linear dependence condition and quasinormality constraint qualification, J. Optim. Theory Appl., 125(2005), 473–483.
[3] X. Chen, L. Guo, Z. Lu and J.J. Ye, An augmented Lagrangian method for non-Lipschitz nonconvex programming, SIAM J. Numer. Anal., 55(2017), 168–193.
[4] F.H. Clarke, Optimization and Nonsmooth Analysis, Wiley-Interscience, New York, 1983.
[5] F.H. Clarke, Yu.S. Ledyaev, R.J. Stern and P.R. Wolenski, Nonsmooth Analysis and Control Theory, Springer, New York, 1998.
[6] J.M. Danskin, The Theory of Max-Min and its Applications to Weapons Allocation Problems, Springer, New York, 1967.
[7] S. Dempe, J. Dutta and B.S. Mordukhovich, New necessary optimality conditions in optimistic bilevel programming, Optimization, 56 (2007), 577-604.
[8] S. Dempe and A.B. Zemkoho, The bilevel programming problems: reformulations, constraint qualifications and optimality conditions, Math. Program., 138 (2013), 447-473.
[9] J. Gauvin and F. Dubeau, Differential properties of the marginal function in mathematical programming, Math. Program. Study, 19(1982), 101–119.
[10] L. Guo and G.-H. Lin, Notes on some constraint qualifications for mathematical programs with equilibrium constraints, J. Optim. Theory Appl., 156(2013), 600–616.
[11] L. Guo, G.-H. Lin and J.J. Ye, Second-order optimality conditions for mathematical programs with equilibrium constraints, J. Optim. Theory Appl., 158(2013), 33–64.
[12] L. Guo, G.-H., Lin, J.J. Ye and J. Zhang, Sensitivity analysis of the value function for parametric mathematical programs with equilibrium constraints, SIAM J. Optim., 24(2014), 1206–1237.
[13] L. Guo and J.J. Ye, Necessary optimality conditions and exact penalization for non-Lipschitz nonlinear programs, Math. Program., 168(2018), 571–598.
[14] L. Guo, J. Zhang and G.-H. Lin, New results on constraint qualifications for nonlinear extremum problems and extensions, J. Optim. Theory Appl., 163(2014), 737–754.
[15] A.F. Izmailov, M.V. Solodov and E.I. Uskov, Global convergence of augmented Lagrangian methods applied to optimization problems with degenerate constraints, including problems with complementarity constraints, SIAM J. Optim., 22(2012), 1579–1606.
[16] R. Janin, Direction derivative of the marginal function in nonlinear programming, Math. Program. Study, 21(1984), 110–126.
[17] A. Jourani and L. Thibault, The approximate subdifferential of composite functions, Bull. Austral. Math. Soc., 47(1993), 443–455.
[18] C. Kanzow and A. Schwartz, Mathematical programs with equilibrium constraints: enhanced Fritz John-conditions, new constraint qualifications, and improved exact penalty results, SIAM J. Optim., 20(2010), 2730–2753.
[19] L. Minchenko and S. Stakhovski, On relaxed constant rank regularity condition in mathematical programming, Optim., 60(2011), 429–440.
[20] J. Mirrlees, The theory of moral hazard and unobservable behaviour: Part I, Rev. Econ. Stud., 66(1999), 3–22.
[22] B.S. Mordukhovich, Variational Analysis and Applications, Springer, New York, 2018.
[23] B.S. Mordukhovich, Bilevel optimization and variational analysis, in Advances in Bilevel Optimization, S. Dempe and A. Zemkoho, eds., arxiv:1907.06140, 2019.
[24]B.S. Mordukhovich, N. M. Nam and H. M. Phan, Variational analysis of marginal functions with applications to bilevel programming,
J. Optim. Theory Appl., 152 (2012), pp. 557-586.
[25] J.V. Outrata, On the numerical solution of a class of Stackelberg problems, Z. Oper. Res., 34(1990), 255–277.
[26] L. Qi and Z. Wei, On the constant positive linear dependence condition and its application to SQP methods, SIAM J. Optim., 10(2000), 963–981.
[28] X. Wang and Y. Yuan, An augmented Lagrangian trust region method for equality constrained optimization, Optim. Methods soft., 30(2015), 559–582.
[29] J.J. Ye, Constraint qualifications and necessary optimality conditions for optimization problems with variational inequality constraints, SIAM J. Optim., 10(2000), 943–962.
[30]J.J. Ye, Necessary optimality conditions for multiobjective bilevel programs, Math. Opera. Res., 36(2011), 165–184.
[31] J.J. Ye, Constraint qualifications and optimality conditions in bilevel optimization, in Advances in Bilevel Optimization, S. Dempe and A. Zemkoho, eds., arxiv:1910.04072, 2019.
[32] J.J. Ye and D. Zhu, Optimality conditions for bilevel programming problems, Optim., 33(1995), 9–27.
[33] J.J. Ye and D. Zhu, New necessary optimality conditions for bilevel programs by combining the MPEC and value function approaches, SIAM J. Optim., 20(2010), 1885–1905.