This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\UseRawInputEncoding

Multiparametric analysis of conic linear optimization based on lift-and-project procedure thanks: Supported by the National Natural Science Foundation of China (11871118,12271061).

Zi-zong Yan  Xiang-jun Li  and Jin-hai Guo School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, China([email protected]).School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, China([email protected]).School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, China([email protected]).
Abstract

We study the application of the lift-and-project procedure to the multiparametric analysis of conic linear optimization (CLO) problems. We first introduce the concept of a pair of primal and dual conic representable sets, and define the set-valued mappings between them. We then explore a novel duality of mpCLOs, that allows us to generalize and treat the previous results for parametric analysis in a unified framework. In particular, we discuss the behavior of the optimal partition of a conic representable set. This leads to invariant region decomposition of a conic representable set, which is more general than the results in the literature. Finally, we study the properties of the optimal objective values as a function of parametric vectors. All the results are corroborated by examples with correlations.

Keywords: Multiparametric conic linear optimization, conic representable set, lift-and-project, optimal partition, duality, multiparametric KKT property

AMS subject classifications. Primary: 90C31; Secondary: 90C25, 90C05, 90C22, 90C46

1 Introduction

We are interested in the following multiparametric conic linear optimization (mpCLO) problems with two independent vectors of parameters u,vru,v\in\mathbb{R}^{r}

p(u)=minc+MTu,xs.t.Ax=b,xK\begin{array}[]{lll}p^{*}(u)=&\min&\langle c+M^{T}u,x\rangle\\ &s.t.&Ax=b,\\ &&x\in K\end{array} (P-lift)

and

d(v)=mind+MTv,ys.t.By=a,yK\begin{array}[]{lll}d^{*}(v)=&\min&\langle d+M^{T}v,y\rangle\\ &s.t.&By=a,\\ &&y\in K^{*}\end{array} (D-lift)

for given vectors ala\in\mathbb{R}^{l}, bmb\in\mathbb{R}^{m}, c,dqc,d\in\mathbb{R}^{q} and matrices Am×qA\in\mathbb{R}^{m\times q}, Bl×qB\in\mathbb{R}^{l\times q}, Mr×qM\in\mathbb{R}^{r\times q}, where KqK\subset\mathbb{R}^{q} is a pointed, closed, convex, solid (with a non-empty interior) cone (for this formulation of a primal-dual pair and its properties, Ref. [NN94]); and KK^{*} is the dual of KK under the standard inner-product, that is,

K={yq|y,x0,xK}.K^{*}=\{y\in\mathbb{R}^{q}|\langle y,x\rangle\geq 0,\forall x\in K\}.

The minimum values of the objective functions are denoted by p(u)p^{*}(u) and d(v)d^{*}(v), respectively. These two problems are multiparametric linear programming (mpLP) if KK is a nonnegative orthant; and multiparametric semidefinite programming (mpSDP) if KK is the cone of symmetric positive semidefinite matrices (for example, Refs. [AL12, BN01, BPT13, Nem07]).

Two technical claims are assumed throughout this paper.

Assumption 1. The three range spaces R(AT),R(BT),R(MT)R(A^{T}),R(B^{T}),R(M^{T}) are orthogonal to each other, and their direct sum is equal to the entire space RqR^{q}.

Assumption 2. MMT=IrMM^{T}=I_{r}, a r×rr\times r unit matrix.

Assumption 1 is necessary for (P-lift) and (D-lift) because it guarantees that the perturbation of the objective function is independent of the constraints; More importantly, it guarantees that the (nonstandard) Lagrange dual of one is a projection of the other. We comment more on this in Subsections 2.2 and LABEL:otjg1.

1.1 Motivations

The original motivation for this study is the inductive proof of the strong duality theorem of CLOs (see Theorem LABEL:dualityt1 and its proof at the end of Subsection LABEL:otjg1). Indeed, such a proof requires that a pair of primal-dual optimal solutions remains the same in the lift-and-project procedure, in which the complement slackness property always holds. More precisely, a pair of optimal solutions (x(u),y(v))(x^{*}(u),y^{*}(v)) of (P-lift) and (D-lift) satisfies the following equation

x(u),y(v)=0,\langle x^{*}(u),y^{*}(v)\rangle=0, (1)

(see Theorem LABEL:interior2yy). A surprising aspect of this result is that it holds for some vector pairs (u,v)r×r(u,v)\in\mathbb{R}^{r}\times\mathbb{R}^{r}, under Slater’s type condition.

We exploit the special structure of the CLO problem to obtain a new algebraic representation of the Lagrangian dual problem (see Subsection 2.2). Such a dual, called a nonstandard dual, results in a unified representation of the feasible sets. Surprisingly, for (P-lift) and (D-lift), the nonstandard dual of one is a projection of the other; and their projections also satisfy the complement slackness property

x¯(v),y¯(u)=0\langle\bar{x}^{*}(v),\bar{y}^{*}(u)\rangle=0 (2)

under the same Slater’s type condition, where (x¯(v),y¯(u))(\bar{x}^{*}(v),\bar{y}^{*}(u)) denotes a pair of optimal solutions of the projections (see Theorem LABEL:interior2yy).

These characterization results directly motivate us to explore in detail the relationship between the following two conic (linear inequality) representable sets,

ΘP\displaystyle\varTheta_{P} ={vr|d+MTv+BTw1Kforsomew1l},\displaystyle=\{v\in\mathbb{R}^{r}|d+M^{T}v+B^{T}w^{1}\in K\ for\ some\ w^{1}\in\mathbb{R}^{l}\}, (3a)
ΘD\displaystyle\varTheta_{D} ={ur|c+MTu+ATw2Kforsomew2m},\displaystyle=\{u\in\mathbb{R}^{r}|c+M^{T}u+A^{T}w^{2}\in K^{*}\ for\ some\ w^{2}\in\mathbb{R}^{m}\}, (3b)

(see Subsection LABEL:jzyss). This relationship helps us present a geometric framework that unifies and extends some properties of parametric LPs and SDPs to the case of mpCLOs.

1.2 Related works

Prior to parametric analysis in recent years, the actual invariancy region played an important role in the development of parametric LPs and SDPs. Adler and Monteiro [AD92] investigated the parametric analysis of LPs using the optimal partition approach, in which they identified the range of a single-parameter where the optimal partition remained invariant. Other parametric analysis treatments for LPs based on the optimal partition approach were reported in previous studies [Ber97, Deh07, GG08, Gre94, Ha10, RT05, JR03]. The actual invariancy region has been studied extensively in the SDP setting (Refs. [GS99, MT20]), the second-order conic optimization (Refs. [MT21]), and more generally in CLO (Refs. [Yil04]).

Gass and Saaty [GS55] proposed the first method for solving parametric LPs, and Gal and Nedoma [GN72] generalized this method. Recently, there has been growing interest in multiparametric optimizations arising from process engineering, such as process design, optimization, and control. The survey by Pistikopoulos et al.[Pis12] contributes to recent theoretical and algorithmic advances, and applications in the areas of multi-parametric programming, specifically explicit/multi-parametric model predictive control. Thus far, various types of invariances in parametric/multiparametric problems have been used, mainly in single-parametric or bi-parametric analyses. To the best of our knowledge, almost all approaches to multiparametric LPs reported in the literature (Refs [Bor03, Fil97, Fil04, Gal95, GG97, Sch87, WW90]) use bases to obtain a description of the invariancy regions.

1.3 Contributions

This study investigates multiparametric optimization in general CLO problems in which either the objective function or the right-hand side is perturbed along many fixed directions. First, we establish the connection by showing that the conic representable set defined by (3a) or (3b) is viewed as a projection of the feasible set of an appropriately defined mpCLO problem, although this connection is not easily identifiable. We then develop classical duality theory using set-valued mappings to relate the two conic representable sets. This treatment makes it possible to combine some known, yet scattered, results and derive new ones. All these results can be used to develop the optimal partition approach given in [AD92] for parametric LPs and [GG08] for parametric SDPs.

The main contributions of this study are as follows:

(1) Characterization of the relationship between the primal and dual conic representable sets.

(2) Presentation of a novel duality in CLO.

(3) Identification of optimal partitions and development of parametric analysis technique.

In the first category, we define set-valued mappings between the primal and dual conic representable sets, which provide an important and useful tool. This tool plays a critical role in the analysis.

In the second category, we develop the classical duality theory of CLO. In addition to the complementary slackness properties mentioned earlier, we present the weak and strong duality properties for the pair of almost primal-dual problems (P-lift) and (D-lift). We show that the sum of their objective optimal values is a bilinear function with respect to the vectors of parameters uu and vv (see Corollary LABEL:sumtwop21). The corresponding multiparametric KKT (mpKKT) conditions are also given in Theorem LABEL:interior2yy.

In the third category, along with invariant region decomposition, our results capture and generalize the SDP cases of Mohammad-Nezhad and Terlaky [MT20]. We develop the concepts of the nonlinearity region and the transition point for the optimal partition to conic representable sets by means of set-valued mappings, and provide some sufficient conditions for the existence of a nonlinearity region and a transition point. Such concepts are very useful for the analysis of an mpCLO problem because the nonlinearity region can be regarded as a stability region and its identification has a significant influence on the post-optimal analysis of SDPs.

In addition, we study the behavior of the optimal value function in its domain; and partially answer the open question proposed by Hauenstein et al. [Hau19].

1.4 Organization of the paper

In the next section, we review some useful results from convex analysis and the duality theory of CLO, and introduce nonstandard Lagrange dual and lift-and-project procedures. In Section 3, we define the set-valued mappings between the aforementioned conic representable sets and develop duality theory in CLOs. In Section 4, some fundamental concepts are introduced and several examples are presented to illustrate these concepts. The extension of the corresponding approach to the multiparametric analysis of CLOs is discussed in Section 5. The identification of the optimal partitions of a conic representable set is discussed and the behavior of the optimal value function in its domain is studied in this section. Finally, in Section 6, we conclude the paper with some remarks.

2 Preliminaries

The aim of this section is twofold. First, we introduce the notation used in this study. However, we state some useful tools and results that facilitate the subsequent proofs.

2.1 Notation

First, we review some useful facts regarding convex sets and cones. A standard reference for convex analysis is a book by Rockafellar [Roc70].

The (topological) boundary of a set CqC\in\mathbb{R}^{q} is denoted as C\partial C and defined as C=cl(C)\int(C)\partial C=\operatorname{cl}(C)\backslash\operatorname{int}(C), where cl(CC) and int(CC) denote the closure and interior of CC, respectively. dim(C)\operatorname{dim}(C) denotes the affine dimension of CC. A set CC is called simply connected if for any two points x,yCx,y\in C, there is a continuous curve ΓC\Gamma\subset C connecting xx and yy. A singleton set is referred to as simply connected set. A set CC is called convex if for any x,yCx,y\in C, the linear segment [x,y]={αx+(1α)y|α[0,1]}[x,y]=\{\alpha x+(1-\alpha)y|\alpha\in[0,1]\} is contained in CC. The convex hull of CC, denoted as conv(C)\operatorname{conv}(C), is a set of all the convex combinations from CC.

Let CC be a nonempty convex set in q\mathbb{R}^{q}. The vector 0hq0\neq h\in\mathbb{R}^{q} is called the recession direction of CC if c+λhCc+\lambda h\in C for all cCc\in C and λ>0\lambda>0. The set of all recession directions of CC is called the recession cone of CC, and is denoted as 0+(C)0^{+}(C). Given a boundary point x¯\bar{x} of CC, its normal cone Normal(C,x¯)\operatorname{Normal}(C,\bar{x}) is defined as

Normal(C,x¯)={cq|c,x¯c,xforallxC}.\operatorname{Normal}(C,\bar{x})=\{c\in\mathbb{R}^{q}|\langle c,\bar{x}\rangle\geq\langle c,x\rangle\ \operatorname{for\ all}\ x\in C\}.

A point x¯\bar{x} is a vertex of CC if its normal cone is full-dimensional.

Mapping Φ(ξ):rr\Phi(\xi):\mathbb{R}^{r}\rightrightarrows\mathbb{R}^{r} is called set-valued mapping if it assigns a subset of r\mathbb{R}^{r} to each element of r\mathbb{R}^{r}. The following set

Φ(C)={Φ(ξ)|ξC}\Phi(C)=\{\Phi(\xi)|\xi\in C\}

denotes the image of set CrC\in\mathbb{R}^{r} under set-valued mapping Φ\Phi.

The notations R(A)={Ax|xq}R(A)=\{Ax|x\in\mathbb{R}^{q}\} and N(A)={x|Ax=0,xq}N(A)=\{x|Ax=0,x\in\mathbb{R}^{q}\} denote the range and kernel space of the matrix AA, respectively.

Let UU be an open subset of r\mathbb{R}^{r}. We say that the mapping f:Uf:U\rightarrow\mathbb{R} is Ga^\hat{a}teaux differentiable at a point uUu\in U in a direction hrh\in\mathbb{R}^{r} if there exists f(u,h)f^{\prime}(u,h)\in\mathbb{R} such that

f(u,h)=limt0+1t(f(u+th)f(u)),f^{\prime}(u,h)=\lim\limits_{t\rightarrow 0+}\frac{1}{t}(f(u+th)-f(u)),

where f(u,h)f^{\prime}(u,h) is called the directional derivative of ff at uu in the direction hh. The mapping ff is called Ga^\hat{a}teaux differentiable at uu, provided that f(u,h)f^{\prime}(u,h) exists for all hrh\in\mathbb{R}^{r} and the mapping f(u,):rf^{\prime}(u,\cdot):\mathbb{R}^{r}\rightarrow\mathbb{R} is a continuous linear operator.

2.2 The nonstandard dual

Recall that (P-lift), without perturbation is the typical form of the CLO

minc,xs.t.Ax=b,xK,\begin{array}[]{ll}\min&\langle c,x\rangle\\ s.t.&Ax=b,\\ &x\in K,\end{array} (P)

and its Lagrangian dual problem is expressed as follows:

maxi=1mbiwis.t.ATwKc,wm.\begin{array}[]{ll}\max&\sum\limits_{i=1}^{m}b_{i}w_{i}\\ s.t.&A^{T}w\leq_{K^{*}}c,\\ &w\in\mathbb{R}^{m}.\end{array} (D)

For clarity and elegance (another crucial reason is given in the next section), we use an equivalent form instead of (D). In other words, (D) is rewritten as a new objective function with new constraints. This can be implemented through the following process.

Left multiplying both sides of the equality constraint in (D) by the matrix BB and MM, respectively, one has By=BcBy=Bc and My=McMy=Mc if y=cATwy=c-A^{T}w. Conversely, if yKy\in K^{*} satisfies both By=BcBy=Bc and My=McMy=Mc, wmw\in\mathbb{R}^{m} exists, such as y=cATwy=c-A^{T}w. Thus, the feasible set of (D) can be expressed equivalently as in {yK|By=a,My=Mc}\{y\in K^{*}|By=a,My=Mc\}. On the other hand, if a pair of vectors (x;w)(x;w) is feasible, and if b=Adb=Ad and y=cATwy=c-A^{T}w,

d,cy=d,ATw=Ad,w=bTw.\langle d,c-y\rangle=\langle d,A^{T}w\rangle=\langle Ad,w\rangle=b^{T}w. (4)

That is, (D) can be rewritten as follows:

maxd,cys.t.By=a,My=Mc,yK,\begin{array}[]{ll}\max&\langle d,c-y\rangle\\ s.t.&By=a,\\ &My=Mc,\\ &y\in K^{*},\end{array} (D-non)

where a=Bca=Bc. This is essentially the dual of (P), where yy denotes the slackness vector variable in (D). Therefore, (D) and (D-non) are the standard dual and nonstandard dual of (P), respectively.

For the primal and dual programs (P) and (D), the weak duality property holds. Then, from the equality (4), the following result holds:

Corollary 2.1.

If b=Adb=Ad and a=Bca=Bc, then for any primal feasible solution xx of (P) and any dual feasible solution yy of (D-non), the weak duality property holds, i.e.,

c,xd,cy.\langle c,x\rangle\geq\langle d,c-y\rangle. (5)

The equality holds if and only if (x,y)(x,y) is a pair of optimal solutions.

If the primal and dual programs have optimal solutions and the duality gap is zero, that is, the equality (5) holds, then the Karush-Kuhn-Tucker (KKT) conditions for the primal-dual CLO pair are

Ax=b,xK,\displaystyle Ax=b,\ x\in K, (6a)
By=a,My=Mc,yK,\displaystyle By=a,\ My=Mc,\ y\in K^{*}, (6b)
x,y=0.\displaystyle\langle x,y\rangle=0. (6c)

Conversely, if (x,y)q×q(x^{*},y^{*})\in\mathbb{R}^{q}\times\mathbb{R}^{q} is a pair of solutions of system (6a)-(6c), then (x,y)(x^{*},y^{*}) is a pair of optimal solutions of (P) and (D-non).