This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Solving sparse separable bilinear programs using lifted bilinear cover inequalities

Xiaoyi Gu [email protected], H. Milton Stewart School of Industrial & Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332. Santanu S. Dey [email protected], H. Milton Stewart School of Industrial & Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332. Jean-Philippe P. Richard [email protected], Department of Industrial and Systems Engineering, University of Minnesota.
Abstract

Recently, in [12], we proposed a class of inequalities called lifted bilinear cover inequalities, which are second-order cone representable convex inequalities, and are valid for a set described by a separable bilinear constraint together with bounds on variables. In this paper, we study the computational potential of these inequalities for separable bilinear optimization problems. We first prove that the semi-definite programming relaxation provides no benefit over the McCormick relaxation for such problems. We then design a simple randomized separation heuristic for lifted bilinear cover inequalities. In our computational experiments, we separate many rounds of these inequalities starting from McCormick’s relaxation of instances where each constraint is a separable bilinear constraint set. We demonstrate that there is a significant improvement in the performance of a state-of-the-art global solver in terms of gap closed, when these inequalities are added at the root node compared to when they are not.

1 Introduction.

Lifting is a technique used to generate cutting planes for a set from a seed inequality valid for a restriction of this set. Lifting was first studied in the context of mixed integer linear programming (MILP); see [20] for a review. Cover inequalities ([24, 19, 1, 2, 15]) are valid for 0-1 knapsack sets with coefficients satisfying the minimal cover property. The inequalities obtained after lifting cover inequalities are called lifted cover inequalities; see  [15, 28, 25, 26, 13, 14]. They form a family of strong valid inequalities for general 0-1 knapsack sets that is very important for state-of-the-art MILP solvers; see  [3].

Inspired by the success of these inequalities we introduced in [12] a class of inequalities we call lifted bilinear cover inequalities for separable bilinear constraints. These second-order cone representable (SOCR) convex inequalities are derived using lifting and are valid for a set described by a separable bilinear constraint together with bounds on variables. In Section 1.1 below, we give a detailed description of lifted bilinear cover inequalities and the separable bilinear sets they are valid for.

In this paper, we study the computational potential of these inequalities. We are inspired by the classical paper of [7]. This paper is one of the first to highlight the computational importance of lifted cover inequalities in MILP. Specifically, this paper considered sparse instances – since most practical instances are sparse. Here, sparsity means that the support of each constraint is significantly smaller than the number of variables in the entire problem; see [8, 9] for discussions and results regarding sparsity of problems and cutting-planes. Similarly, we generate sparse separable bilinear instances to test lifted bilinear cover inequalities. We numerically show that a significant improvement in the performance (in terms of gap closed) of a global solver can be observed when these inequalities are added at the root node compared to when they are not.

1.1 Lifted bilinear cover inequalities for separable bilinear programs.

For a positive integer nn, we use the notation [n][n] to describe the set {1,,n}\{1,\dots,n\}.

In [12], we derive inequalities that can be applied to improve convex relaxations of the feasible region of separable bilinear programs, which we call inequalities lifted bilinear cover inequalities.

Definition 1.

A separable bilinear program is an optimization problem of the form

min i[n](cixxi+ciyyi)\displaystyle\sum_{i\in[n]}\left(c_{i}^{x}x_{i}+c_{i}^{y}y_{i}\right) (1)
s.t. i[n]aijxiyidj,\displaystyle\sum_{i\in[n]}a_{i}^{j}x_{i}y_{i}\geq d_{j},\quad j[m],\displaystyle\forall j\in[m],
xi,yi[0,1],\displaystyle x_{i},y_{i}\in[0,1],\quad i[n].\displaystyle\forall i\in[n].

Each bilinear constraint in (1) is in the form of a separable bilinear set, defined as follows.

Definition 2.

A set SS is called a separable bilinear set if it is of the form

S:={(x,y)[0,1]n×[0,1]n|i[n]aixiyid},\displaystyle S:=\left\{\ (x,y)\in[0,1]^{n}\times[0,1]^{n}\ \Bigm{|}\ \sum_{i\in[n]}a_{i}x_{i}y_{i}\geq d\ \right\},

where dd\in\mathbb{R} and aia_{i}\in\mathbb{R} for all i[n]i\in[n].

For each i[n]i\in[n], variables xix_{i} and yiy_{i} in SS appear in only one term in the left-hand-side. The convex hull of the set SS has been studied in [11, 21] and the convex hull of the relaxation of SS obtained by dropping the upper bounds on the variables is presented in [23] for the case where all coefficients aia_{i} are non-negative. These convex hull results (especially the exact ones presented in [11] and [21]) have limited computational usefulness, since the description of the convex hull is exponential in the number of variables. This motivates the derivation of families of cutting planes for SS.

In [12] we proposed using minimal covers to generate seed inequalities we call bilinear cover inequalities and then perform lifting to obtain valid inequalities for separable bilinear sets. We next briefly introduce the main results of [12]. We start by introducing the notions of minimal cover and minimal cover yielding partition.

Definition 3.

A set {ai|i[k]}\{a_{i}\in\mathbb{R}\ |\ i\in[k]\}, with k+k\in\mathbb{Z}_{+} being a positive integer, is said to form a minimal cover of dd\in\mathbb{R} if

  1. (i)

    ai>0a_{i}>0 for all i[k]i\in[k], d>0d>0,

  2. (ii)

    i[k]ai>d\sum_{i\in[k]}a_{i}>d,

  3. (iii)

    iKaid\sum_{i\in K}a_{i}\leq d, K[k]\forall K\subsetneq[k].

For a separable bilinear set SS, we say that a partition Λ={I,J0,J1}\Lambda=\{I,J_{0},J_{1}\} of [n][n], where II\neq\emptyset, is a minimal cover yielding partition if: {ai|iI}\{a_{i}\ |\ i\in I\} forms a minimal cover of dΛ:=diJ1aid^{\Lambda}:=d-\sum_{i\in J_{1}}a_{i}. For a minimal cover yielding partition, we let J0+:={iJ0|ai>0}J_{0}^{+}:=\{i\in J_{0}\ |\ a_{i}>0\} and J0:={iJ0|ai<0}J_{0}^{-}:=\{i\in J_{0}\ |\ a_{i}<0\}. We define J1+J_{1}^{+} and J1J_{1}^{-} similarly.

Remark 1.

When k2k\geq 2, conditions (ii) and (iii) in the definition of minimal cover imply condition (i). For example, if ai0a_{i}\leq 0 for some i[k]i\in[k], then (ii) implies j[k]{i}aj>d\sum_{j\in[k]\setminus\{i\}}a_{j}>d, contradicting (iii). Now (iii) together with ai>0a_{i}>0 for i[k]i\in[k] implies that d>0d>0.

Notation 1.

Assuming that {ai|i[k]}\{a_{i}\ |i\in[k]\} forms a minimal cover of dΛ{d^{\Lambda}}, we use

  1. 1.

    Δ:=i[k]aidΛ\Delta:=\sum_{i\in[k]}a_{i}-{d^{\Lambda}},

  2. 2.

    di:=dΛj[k]\{i}aj=aiΔd_{i}:={d^{\Lambda}}-\sum_{j\in[k]\backslash\{i\}}a_{j}=a_{i}-\Delta,

  3. 3.

    I>:={i[k]|ai>Δ}I^{>}:=\{i\in[k]\ |\ a_{i}>\Delta\},

  4. 4.

    when I>I^{>}\neq\emptyset, let ı0\i 0 denote any index in I>I^{>} such that aı0=min{ai|iI>}a_{\i 0}=\min\{a_{i}\ |\ i\in I^{>}\}. We say that i0i_{0} does not exist when I>=I^{>}=\emptyset.

The process to generate lifted bilinear cover inequalities for SS from the minimal cover yielding partition is to (i) first fix xi=yi=0x_{i}=y_{i}=0 for iJ0i\in J_{0} and xi=yi=1x_{i}=y_{i}=1 for iJ1i\in J_{1}, (ii) generate valid seed inequality for the restricted region, and (iii) finally lift the seed inequality to obtain a valid cut for SS.

The following result from [12] establishes the existence of minimal cover yielding partitions in many instances of SS.

Theorem 1.

For a nonempty separable bilinear set SS, either there exists at least one minimal cover yielding partition or conv(S)\mathrm{conv}(S) is polyhedral.

The seed inequality used for lifting is presented in Theorem 2, which also discusses its strength.

Theorem 2.

For a separable bilinear set SS as in Definition 2 where {ai|i[n]}\{a_{i}\ |\ i\in[n]\} forms a minimal cover of dd, the following bilinear cover inequality is valid:

i[n]aiaidi(xiyi1)1.\displaystyle\sum_{i\in[n]}\frac{\sqrt{a_{i}}}{\sqrt{a_{i}}-\sqrt{d_{i}}}\left(\sqrt{x_{i}y_{i}}-1\right)\geq-1. (2)

Further, the set R:={(x,y)+2n|(2)}R:=\{(x,y)\in\mathbb{R}^{2n}_{+}\ |\ (\ref{eq:bilincoverineq})\} is such that (4R)[0, 1]2nconv(S)R[0, 1]2n.(4\cdot R)\cap[0,\ 1]^{2n}\subseteq\textup{conv}(S)\subseteq R\cap[0,\ 1]^{2n}.

With the seed bilinear cover inequality, the following lifted bilinear cover inequality was proposed, which is valid for the whole separable bilinear set.

Theorem 3.

Consider a separable bilinear set SS as in Definition 2. Let Λ={I,J0,J1}\Lambda=\{I,J_{0},J_{1}\} be a minimal cover yielding partition, let Δ,aı0,di,l+,l\Delta,a_{\i 0},d_{i},l_{+},l_{-} be defined as in Notation 1, and let J0+J_{0}^{+}, J0J_{0}^{-}, J1+J_{1}^{+}, J1J_{1}^{-} be as in Definition 3. Then inequality

iIaiaidi(xiyi1)+iIγi(xi,yi)1,\displaystyle\sum_{i\in I}\frac{\sqrt{a_{i}}}{\sqrt{a_{i}}-\sqrt{d_{i}}}\left(\sqrt{x_{i}y_{i}}-1\right)+\sum_{i\notin I}\gamma_{i}(x_{i},y_{i})\geq-1, (3)

is valid for SS where γi:2\gamma_{i}:\mathbb{R}^{2}\rightarrow\mathbb{R} for i[n]Ii\in[n]\setminus I are the concave functions:

  1. (i)

    γi(x,y)=l+aimin{x,y}\gamma_{i}(x,y)=l_{+}a_{i}\min\{x,y\} for iJ0+i\in J_{0}^{+};

  2. (ii)

    γi(x,y)=l+aimin{2xy,1}\gamma_{i}(x,y)=-l_{+}a_{i}\min\{2-x-y,1\} for iJ1i\in J_{1}^{-};

  3. (iii)

    γi(x,y)=min{lai(x+y1),l+ai(x+y1)+l+Δ1,0}\gamma_{i}(x,y)=\min\{l_{-}a_{i}(x+y-1),l_{+}a_{i}(x+y-1)+l_{+}\Delta-1,0\} for iJ0i\in J_{0}^{-};

  4. (iv)

    γi(x,y)=min{g~i(x,y),h~i(x,y),gi(x,y),hi(x,y)},\gamma_{i}(x,y)=\min\{\tilde{g}_{i}(x,y),\tilde{h}_{i}(x,y),g_{i}(x,y),h_{i}(x,y)\}, for iJ1+i\in J_{1}^{+} with aiaı0a_{i}\geq a_{\i 0} when I>I^{>}\neq\emptyset, and γi(x,y)=min{g~i(x,y),h~i(x,y)}\gamma_{i}(x,y)=\min\{\tilde{g}_{i}(x,y),\tilde{h}_{i}(x,y)\} in all other cases where iJ1+i\in J_{1}^{+}, with

    g~i(x,y)\displaystyle\tilde{g}_{i}(x,y) =l+ai(min{x,y}1)+l+Δ1\displaystyle=l_{+}a_{i}(\min\{x,y\}-1)+l_{+}\Delta-1
    h~i(x,y)\displaystyle\tilde{h}_{i}(x,y) =lai(min{x,y}1)\displaystyle=l_{-}a_{i}(\min\{x,y\}-1)
    gi(x,y)\displaystyle g_{i}(x,y) =aiΔail+xyl+(aiΔ)1\displaystyle=\sqrt{a_{i}-\Delta}\sqrt{a_{i}}l_{+}\sqrt{xy}-l_{+}(a_{i}-\Delta)-1
    hi(x,y)\displaystyle h_{i}(x,y) =aiaidi(xy1)\displaystyle=\frac{\sqrt{a_{i}}}{\sqrt{a_{i}}-\sqrt{d_{i}}}(\sqrt{xy}-1)

with l=1Δl_{-}=\frac{1}{\Delta} and l+=ai0+di0Δdi0l_{+}=\frac{\sqrt{a_{i_{0}}}+\sqrt{d_{i_{0}}}}{\Delta\sqrt{d_{i_{0}}}} if i0i_{0} exists and l+=1Δl_{+}=\frac{1}{\Delta} otherwise.

The lifted bilinear cover inequality (3) is convex and second-order cone representable (SOCR), making it easy to incorporate in relaxations, given the enormous progress in SOC solvers.

1.2 Main contributions.

While the lifted bilinear cover inequality was derived in [12], its computational usefulness has not been evaluated. In this paper, we utilize lifted bilinear cover inequalities in a computational study and illustrate their benefit.

We expand on our main contributions next:

  • Instead of using lifted bilinear cover inequalities, we could consider using the semidefinite programming (SDP) relaxation for strengthening the McCormick relaxation. In general, it is known that combining the McCormick relaxation with the SDP relaxation produces good bounds; see [5] for instance. We show that, surprisingly, for the class of separable instances, the SDP relaxation gives a bound equal to that of the McCormick relaxation. We shall see in the later sections that the lifted bilinear cover inequalities are able to close significant root gap over the McCormick relaxation – thus showing that lifted bilinear cover inequalities are important in solving separable bilinear instances where SDP relaxations are of limited use.

  • We design a simple randomized separation heuristic for lifted bilinear cover inequalities. We use this separation heuristic to add many rounds of cuts starting from a natural relaxation of the problem that uses McCormick inequalities. We then solve the instances using a commercial global solver with and without these inequalities added to the root node and compare the results obtained.

  • We discover that the inequalities separated using the simple heuristic provide a major performance boost to a commercial global solver in terms of overall gap closed on sparse test instances.

The rest of the paper is organized as follows. In Section 2, we compare the SDP and McCormick relaxations for separable programs. In Section 3, we present our separation heuristic. In Section 4, we describe the procedure we use to generate synthetic test instances and discuss various other parameters regarding the use of the separation heuristic on these instances. In Section 5, we present detailed results of our experiments. We give concluding remarks in Section 6.

2 Comparing SDP and McCormick relaxations of separable programs.

We show that, for separable bilinear programs, the relaxation obtained by adding the traditional SDP constraint to the McCormick relaxation gives the same bound as that given by just the McCormick relaxation. In fact, we prove this result in the following slightly more general seting. Consider the following quadratically constrained quadratic program (QCQP):

min i[n]ai0xiyi+i[n]cix,0xi+i[n]ciy,0yi\displaystyle\sum_{i\in[n]}a^{0}_{i}x_{i}y_{i}+\sum_{i\in[n]}c^{x,0}_{i}x_{i}+\sum_{i\in[n]}c^{y,0}_{i}y_{i} (4a)
s.t. i[n]aijxiyi+i[n]cix,jxi+i[n]ciy,jyibj,j[m],\displaystyle\sum_{i\in[n]}a^{j}_{i}x_{i}y_{i}+\sum_{i\in[n]}c^{x,j}_{i}x_{i}+\sum_{i\in[n]}c^{y,j}_{i}y_{i}\geq b_{j},\quad\forall j\in[m], (4b)
x,y[0,1]n.\displaystyle x,y\in[0,1]^{n}. (4c)

We consider next two relaxations of (4). In both of these relaxations, we introduce variables wi,jw_{i,j} to represent the products xixjx_{i}x_{j}, variables wi,n+jw_{i,n+j} and wn+i,jw_{n+i,j} to represent the products xiyjx_{i}y_{j} and yixjy_{i}x_{j}, respectively, and variables wn+i,n+jw_{n+i,n+j} to represent the products yiyjy_{i}y_{j}. Clearly, only variables wi,n+iw_{i,n+i} are needed to relax (4b). The first relaxation, which we call McCormick relaxation, is obtained by using McCormick inequalities to approximate the relationships between variables ww, xx, and yy:

zMc:=\displaystyle z_{Mc}:=\ min i[n]ai0wi,n+i+i[n]cix,0xi+i[n]ciy,0yi\displaystyle\sum_{i\in[n]}a^{0}_{i}w_{i,n+i}+\sum_{i\in[n]}c^{x,0}_{i}x_{i}+\sum_{i\in[n]}c^{y,0}_{i}y_{i} (5a)
s.t. i[n]aijwi,n+i+i[n]cix,jxi+i[n]ciy,jyibj,\displaystyle\sum_{i\in[n]}a^{j}_{i}w_{i,n+i}+\sum_{i\in[n]}c^{x,j}_{i}x_{i}+\sum_{i\in[n]}c^{y,j}_{i}y_{i}\geq b_{j}, j[m],\displaystyle\forall j\in[m], (5b)
xi=ui,\displaystyle x_{i}=u_{i}, i[n],\displaystyle\forall i\in[n], (5c)
yi=un+i,\displaystyle y_{i}=u_{n+i}, i[n],\displaystyle\forall i\in[n], (5d)
u[0,1]2n,\displaystyle u\in[0,1]^{2n}, (5e)
max{0,ui+uk1}wi,kmin{ui,uk},\displaystyle\textup{max}\{0,u_{i}+u_{k}-1\}\leq w_{i,k}\leq\textup{min}\{u_{i},u_{k}\}, (i,k)[2n]×[2n].\displaystyle\forall(i,k)\in[2n]\times[2n]. (5f)

The second relaxation, which we call McCormick+SDP relaxation, is obtained from the McCormick relaxation by including the traditional SDP relaxation of the property that WW is a rank-1 matrix:

zMS:=\displaystyle z_{MS}:=\ min i[n]ai0wi,n+i+i[n]cix,0xi+i[n]ciy,0yi\displaystyle\sum_{i\in[n]}a^{0}_{i}w_{i,n+i}+\sum_{i\in[n]}c^{x,0}_{i}x_{i}+\sum_{i\in[n]}c^{y,0}_{i}y_{i} (6a)
s.t. (5b),(5c),(5d),(5e),(5f)\displaystyle(\ref{eq:eq1}),(\ref{eq:eq2}),(\ref{eq:eq3}),(\ref{eq:eq4}),(\ref{eq:eq5}) (6b)
W:=[1uuw]0,\displaystyle W:=\left[\begin{array}[]{cc}1&\ u^{\top}\\ u&\ w\end{array}\right]\succeq 0, (6e)

where W0W\succeq 0 denotes that WW is positive semi-definite.

Proposition 1.

For the optimization problem (4), zMc=zMSz_{Mc}=z_{MS}, or both (5) and (6) are infeasible.

Proof.

Note that both (5) and (6) are bounded, as all variables are bounded by [0,1][0,1]. Therefore, if either problem is feasible, then the feasible region is compact and thus a corresponding optimal solution must exist (objective function is linear). Since the feasible region of (6) is contained in that of (5), we only need to prove that if (5) is feasible, then (6) is also feasible and that zMczMSz_{Mc}\leq z_{MS}.

Let (x,y,u,w)(x^{*},y^{*},u^{*},w^{*}) be an optimal solution corresponding to (5).

Then, construct the matrix W(2n+1)×(2n+1)W^{*}\in\mathbb{R}^{(2n+1)\times(2n+1)} as follows:

Wik={1,if i=k=1,ui1,if k=1,i2,uk1,if i=1,k2,ui1,if i=k>1,ui1uk1,if ik,i>1,k>1,|ik|n,wmin{i1,k1},max{i1,k1},if ik,i>1,k>1,|ik|=n.\displaystyle W^{*}_{ik}=\left\{\begin{array}[]{ll}1,&\quad\textup{if }i=k=1,\\ u^{*}_{i-1},&\quad\textup{if }k=1,i\geq 2,\\ u^{*}_{k-1},&\quad\textup{if }i=1,k\geq 2,\\ u^{*}_{i-1},&\quad\textup{if }i=k>1,\\ u^{*}_{i-1}u^{*}_{k-1},&\quad\textup{if }i\neq k,i>1,k>1,|i-k|\neq n,\\ w^{*}_{\textup{min}\{i-1,k-1\},\textup{max}\{i-1,k-1\}},&\quad\textup{if }i\neq k,i>1,k>1,|i-k|=n.\end{array}\right.

It is easy to verify that (x,y,u,W)(x^{*},y^{*},u^{*},W^{*}) satisfy all the linear constraints corresponding to zMSz_{MS}. It remains to show that W0W^{*}\succeq 0. This would show that there exists a feasible solution to (6) with objective function value equal to that of zMcz_{Mc}, completing the proof.

Construct the matrix G(2n+1)×(2n+1)G\in\mathbb{R}^{(2n+1)\times(2n+1)} as follows:

Gik={1,if i=k=1,ui1,if k=1,uk1,if i=1,ui1uk1,if jk,i>1,k>1.\displaystyle G_{ik}=\left\{\begin{array}[]{ll}1,&\quad\textup{if }i=k=1,\\ u^{*}_{i-1},&\quad\textup{if }k=1,\\ u^{*}_{k-1},&\quad\textup{if }i=1,\\ u^{*}_{i-1}u^{*}_{k-1},&\quad\textup{if }j\neq k,i>1,k>1.\end{array}\right.

Clearly, G0G\succeq 0 since G=[1u][1(u)]G=\left[\begin{array}[]{c}1\\ u^{*}\end{array}\right][1\quad(u^{*})^{\top}]. So it is sufficient to show that WG0W^{*}-G\succeq 0. We compute that

(WG)ik={0,if i=1 or k=1,ui1(ui1)2,if i=k>1,0,if ik,i>1,k>1,|ik|n,wmin{i1,k1},max{i1,k1}ui1uk1,if ik,i>1,k>1,|ik|=n.\displaystyle(W^{*}-G)_{ik}=\left\{\begin{array}[]{ll}0,&\quad\textup{if }i=1\textup{ or }k=1,\\ u^{*}_{i-1}-(u^{*}_{i-1})^{2},&\quad\textup{if }i=k>1,\\ 0,&\quad\textup{if }i\neq k,i>1,k>1,|i-k|\neq n,\\ w^{*}_{\textup{min}\{i-1,k-1\},\textup{max}\{i-1,k-1\}}-u^{*}_{i-1}u^{*}_{k-1},&\quad\textup{if }i\neq k,i>1,k>1,|i-k|=n.\end{array}\right.

In words, the first row and column are all zero. For each of the remaining rows, there is exactly one non-zero off-diagonal term. Therefore (without loss of generality) by the Gershgorin circle theorem (see [16]) it is sufficient to prove that

ui1(ui1)2|wi1,n+i1ui1ui+n1|,i{2,,n+1},u^{*}_{i-1}-(u^{*}_{i-1})^{2}\geq|w^{*}_{i-1,n+i-1}-u^{*}_{i-1}u^{*}_{i+n-1}|,\quad\forall i\in\{2,\dots,n+1\},

that is, it is sufficient to prove that

xi(xi)2|wi,n+ixiyi|,i[n]x^{*}_{i}-(x^{*}_{i})^{2}\geq|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|,\quad\forall i\in[n]

where min{xi,yi}wi,n+imax{xi+yi1,0}\textup{min}\{x^{*}_{i},y^{*}_{i}\}\geq w^{*}_{i,n+i}\geq\textup{max}\{x^{*}_{i}+y^{*}_{i}-1,0\} for i[n]i\in[n]. This can be done by considering the four possible extreme values of ww^{*}:

  • Case 1: wi,n+i=0w^{*}_{i,n+i}=0. In this case, xi+yi1x^{*}_{i}+y^{*}_{i}\leq 1. Then

    1xiyixi(1xi)xiyixi(xi)2xiyi=|wi,n+ixiyi|.\displaystyle{1-x^{*}_{i}\geq y^{*}_{i}\quad\Rightarrow\quad x^{*}_{i}(1-x^{*}_{i})\geq x^{*}_{i}y^{*}_{i}\quad\Rightarrow\quad x^{*}_{i}-(x^{*}_{i})^{2}\geq x^{*}_{i}y^{*}_{i}=|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|.}
  • Case 2: wi,n+i=xi+yi10w^{*}_{i,n+i}=x^{*}_{i}+y^{*}_{i}-1\geq 0. In this case, xi+yi1x^{*}_{i}+y^{*}_{i}\geq 1. First, observe that wi,n+ixiyi=xi+yi1xiyi=(1+yi)(1xi)0w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}=x^{*}_{i}+y^{*}_{i}-1-x^{*}_{i}y^{*}_{i}=(-1+y^{*}_{i})(1-x^{*}_{i})\leq 0. Thus, |wi,n+ixiyi|=(1+yi)(1xi)|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|=-(-1+y^{*}_{i})(1-x^{*}_{i}). Then,

    xi1yi\displaystyle x^{*}_{i}\geq 1-y^{*}_{i} \displaystyle\quad\Rightarrow\quad xi(1xi)(1+yi)(1xi)\displaystyle x^{*}_{i}(1-x^{*}_{i})\geq-(-1+y^{*}_{i})(1-x^{*}_{i})
    \displaystyle\quad\Rightarrow\quad xi(xi)2(1+yi)(1xi)=|wi,n+ixiyi|.\displaystyle x^{*}_{i}-(x^{*}_{i})^{2}\geq-(-1+y^{*}_{i})(1-x^{*}_{i})=|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|.
  • Case 3: wi,n+i=xiw^{*}_{i,n+i}=x^{*}_{i}. In this case, xiyix_{i}^{*}\leq y_{i}^{*}. Then,

    xiyi(xi)2xiyixi(xi)2xixiyi=|wi,n+ixiyi|.\displaystyle{x^{*}_{i}\leq y^{*}_{i}\quad\Rightarrow\quad(x^{*}_{i})^{2}\leq x^{*}_{i}y^{*}_{i}\quad\Rightarrow\quad x^{*}_{i}-(x^{*}_{i})^{2}\geq x^{*}_{i}-x^{*}_{i}y^{*}_{i}=|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|.}
  • Case 4: wi,n+i=yiw^{*}_{i,n+i}=y^{*}_{i}. In this case, xiyix_{i}^{*}\geq y_{i}^{*}. Then,

    xiyixi(1xi)yi(1xi)xi(xi)2yixiyi=|wi,n+ixiyi|.\displaystyle{x^{*}_{i}\geq y^{*}_{i}\quad\Rightarrow\quad x^{*}_{i}(1-x^{*}_{i})\geq y^{*}_{i}(1-x^{*}_{i})\quad\Rightarrow\quad x^{*}_{i}-(x^{*}_{i})^{2}\geq y^{*}_{i}-x^{*}_{i}y^{*}_{i}=|w^{*}_{i,n+i}-x^{*}_{i}y^{*}_{i}|.}

3 Improved relaxation through a heuristic separation algorithm.

In this section, we describe a procedure that, starting from the McCormick relaxation of the problem, produces an improved relaxation by adding a selection of lifted bilinear cover inequalities. This procedure, whose details are presented in Algorithm 1, solves a convex relaxation of the problem at each step and separates the obtained optimal relaxation solution using a heuristic separation procedure. The steps are repeated until either a preset time limit is reached, a preset iteration limit is reached, or the improvement in relaxation bounds between iterations becomes too small.

Algorithm 1 Improving Root Relaxation with Lifted Bilinear Cover Cuts
0:  bilinear programming problem; non-convex nonlinear solver globalsolver; convex second-order cone programming (SOCP) solver convexsolver;parameters: minimum improvement threshold ϵz\epsilon_{z}, iteration number limit TT, heuristic time limit TheuT_{heu}, parameters for heuristic and solver
1:  generate McCormick relaxation (5)
2:  solve (5) using convexsolver;obtain optimal solution (x0,y0)(x^{0},y^{0}) and lower bound z0z^{0}; set t=1t=1
3:  while (heuristic time limit TheuT_{heu} not reached) do
4:     generate bilinear cover cuts using Algorithm 2 and add them to the problem
5:     if (violated bilinear cover cuts were not generated) break
6:     solve the improved relaxation with convexsolver;obtain optimal solution (xt,yt)(x^{t},y^{t}) and lower bound ztz^{t}
7:     if |ztzt1|/|zt1|<ϵz|z^{t}-z^{t-1}|/|z^{t-1}|<\epsilon_{z} (improvement is too small), break
8:     tt+1t\leftarrow t+1
9:     if t>Tt>T (iteration limit is reached) break
10:  end while
11:  solve the problem using globalsolver

The workhorse of the procedure described above is a heuristic separation algorithm that, given a relaxation solution, seeks to generate a violated lifted bilinear cover cut from each of the bilinear constraints of the problem. The heuristic, which we explain next, is formally presented in Algorithm 2.

Algorithm 2 Heuristic Separation Algorithm for Lifted Bilinear Cover Cuts
0:  bilinear programming problem with relaxation solution (xt,yt)(x^{t},y^{t});parameters:approximation threshold ϵ\epsilon, search iteration limit SjS_{j} for j[m]j\in[m]
1:  for j=1:mj=1:m do
2:     if heuristic time limit TheuT_{heu} reached, break
3:     if i=1naijxiyidj0\sum_{i=1}^{n}a_{i}^{j}x_{i}y_{i}-d_{j}\geq 0 (bilinear constraint jj not violated), continue
4:     for i=1:ni=1:n do
5:        if aij=0a_{i}^{j}=0, set label lijl_{i}^{j} as inactive ((xi,yi)(x_{i},y_{i}) is not in the bilinear cover cut for row jj)
6:        else if xiyi<ϵx_{i}y_{i}<\epsilon, set lijl_{i}^{j} as J0J_{0}
7:        else if xiyi>1ϵx_{i}y_{i}>1-\epsilon, set lijl_{i}^{j} as J1J_{1}
8:        else if aij>0a_{i}^{j}>0, set lijl_{i}^{j} as II
9:        else (i.e., aij<0a_{i}^{j}<0), set lijl_{i}^{j} as J1J_{1} or J0J_{0} with probability xiyix_{i}y_{i} and 1xiyi1-x_{i}y_{i} respectively
10:     end for
11:     if partition Λ\Lambda with labels ll is not a minimal cover yielding partition for row jj then
12:        for s=1:Sjs=1:S_{j} do
13:           (check reason for failure)
14:           if djΛ=djiJ1aij0d_{j}^{\Lambda}=d_{j}-\sum_{i\in J_{1}}a_{i}^{j}\leq 0 (right-hand-side is not positive):select randomly i{i|aij>0,lij=J1}{i|aij<0,lij=J0}i^{*}\in\{i\ |\ a_{i}^{j}>0,l_{i}^{j}=J_{1}\}\cup\{i\ |\ a_{i}^{j}<0,l_{i}^{j}=J_{0}\} and flip lijl_{i^{*}}^{j} from J1J_{1} to II (in the first case) or from J0J_{0} to J1J_{1} (in the second case); break if not found
15:           else if Δj=IaijdjΛ0\Delta_{j}=\sum_{I}a_{i}^{j}-d_{j}^{\Lambda}\leq 0 (Δj\Delta_{j} is not positive, i.e., not a cover):select randomly i{i|aij>0,lij=J0}{i|aij<0,lij=J1}i^{*}\in\{i\ |\ a_{i}^{j}>0,l_{i}^{j}=J_{0}\}\cup\{i\ |\ a_{i}^{j}<0,l_{i}^{j}=J_{1}\} and flip lijl_{i^{*}}^{j} from J0J_{0} to J1J_{1} or from J1J_{1} to J0J_{0}; break if not found
16:           else (iI,aij<Δj\exists i\in I,a_{i}^{j}<\Delta_{j}, i.e., not a minimal cover):find iargminIaiji^{*}\in\textup{argmin}_{I}{a_{i}^{j}} and set lijl_{i^{*}}^{j} as J1J_{1}
17:           if partition Λ\Lambda with adjusted labels ll is a minimal cover yielding partition, break
18:        end for
19:     end if
20:     if lifted bilinear cover cut is generated and violated by (xt,yt)(x^{t},y^{t}): add the lifted bilinear cover cut to the problem
21:  end for

The simple randomized heuristic that we implement can be intuitively described as “guess a reasonable partition (I,J0,J1)(I,J_{0},J_{1})” and “randomly adjust it if it does not yield a violated cut”. Given a solution to separate, it considers each constraint jj of the bilinear problem in sequence. For separating from a given row jj, the heuristic must decide for each variable, whether it is in the set II, J0J_{0}, or J1J_{1}. The resulting cut is entirely determined based on this choice. Formally, the following steps are performed:

  • The heuristic starts with an intuitive guess (steps 4-10), where the label lijl_{i}^{j} for variable ii is assigned to J0J_{0} if xiyix_{i}y_{i} is close to 0 and to J1J_{1} if xiyix_{i}y_{i} close to 11. For all other indices ii, lijl_{i}^{j} is set to II if aij>0a^{j}_{i}>0, and randomly to J0J_{0} or J1J_{1} otherwise.

  • After generating the initial guess, if (I,J0,J1)(I,J_{0},J_{1}) fails to be a minimal cover yielding partition, the heuristic performs several attempts at randomly adjusting the partition (steps 12-19), where labels lijl_{i}^{j} are flipped based on different cases of failure (steps 14, 15, 16 respectively). We allow the number SjS_{j} of attempts taken to depend on the row jj, as we expect separation to be harder to perform in denser rows.

  • If a partition (I,J0,J1)(I,J_{0},J_{1}) is found that corresponds to a violated lifted cover, the associated inequality is added, and we proceed to attempting to generate a cut from the following problem row.

Algorithm 2 is a basic guess-and-adjust heuristic, where the adjust step shares similarity to the WALK SAT approach of [22]. Nevertheless, our experiments presented in the later sections will illustrate the power of this basic heuristic, thereby also demonstrating the potential of the lifted bilinear cover cuts.

4 Experimental setup.

In this section, we present the way we setup our experiments, including instance generation, environment, and various other parameters.

4.1 Randomly generated instances

We consider instances of the form

min i[n](cixxi+ciyyi)\displaystyle\sum_{i\in[n]}\left(c_{i}^{x}x_{i}+c_{i}^{y}y_{i}\right)
s.t. i[n]aijxiyidj,\displaystyle\sum_{i\in[n]}a_{i}^{j}x_{i}y_{i}\geq d_{j}, j[m],\displaystyle\forall j\in[m],
xi,yi[0,1],\displaystyle x_{i},y_{i}\in[0,1], i[n].\displaystyle\forall i\in[n].

Inspired by the classical paper [7] and by the fact most instances in practice are sparse, we generate sparse instances of the above model.

We generate test instances based on three parameters: the number of rows (i.e., bilinear constraints) mm, the number of variables nn, and the density parameter pp. Moreover, we generate two groups of instances, with the first group having non-negative coefficients aija_{i}^{j} (which we call non-negative instances), and the second group having both non-negative and non-positive coefficients aija_{i}^{j} (which we call mixed-signs instances).

After fixing the three parameters mm, nn, and pp, we randomly generate the objective coefficients cc, the bilinear coefficients aa and the right-hand-sides dd. The coefficients cixc_{i}^{x} and ciyc_{i}^{y} are generated independently from a uniform distribution on [0,1][0,1]; aija_{i}^{j} is set to be non-zero with probability pp, and when non-zero, it is generated from a uniform distribution on [0,1][0,1] for non-negative instances, or from a uniform distribution on [1,1][-1,1] for mixed-signs instances. For djd_{j}, we let sj:=i[n]aijs_{j}:=\sum_{i\in[n]}a_{i}^{j} and we set dj=rjsjd_{j}=r_{j}s_{j}, where rjr_{j} is drawn from a uniform distribution on [0,1][0,1] when sj>0s_{j}>0 and rjr_{j} is drawn from a uniform distribution on [1,2][1,2] otherwise.

For our experimental design, we chose m,n[100,250,500]m,n\in[100,250,500] and p[0.01,0.02,0.05]p\in[0.01,0.02,0.05]. To make the problem “reasonably” sparse (i.e., neither too sparse nor too dense), we only select those combinations where np[5,20]np\in[5,20] where npnp can be understood as the expected number of non-zero terms for each constraint. There is a total of 1515 different settings of mm, nn, and pp that satisfy this requirement. For each setting, we randomly generate ten different instances with non-negative coefficients and ten instances with mixed signs.

4.2 Environment.

The experiments were implemented in Python 3.9.7 with Gurobi 9.5.0 for both convex SOCP solver convexsolver and non-convex nonlinear solver globalsolver, with parameter NonConvex=2 for the latter. All experiments were run in parallel on the high-performance computational cluster of ISyE, Georgia Tech, which contains roughly 2,340 cores of x86-64 processing with over 28.9TB of memory spread across the systems. Each task was processed on a dedicated core with 8GB of memory.

4.3 Testing.

We compare two settings for evaluating the efficacy of the lifted bilinear cover inequalities:

  • We separate the lifted bilinear cover inequalities as described in the previous section. Then we give as input to Gurobi the separable bilinear optimization problem augmented with these separated cuts.

  • We directly input the separable bilinear optimization problem to Gurobi.

In both the above settings, we use the default parameters of Gurobi.

For non-negative instances, we also add a comparison to the valid inequality described in [23]:

i[n]aijxiyidj,\sum_{i\in[n]}\sqrt{a_{i}^{j}x_{i}y_{i}}\geq\sqrt{d_{j}}, (10)

for the jj-th row, where the aija^{j}_{i}s are non-negative. Together with nonnegativity constraints, inequality (10) gives the convex hull of the jj-th row if there is no upper bound on the variables. For testing these cuts, we add them to the problem (effectively bolstering the McCormick relaxation at the root node) and then solve the resulting problem with default Gurobi.

We note that there is no obvious way to use the cuts (10) for mixed-signs instances, as there is no easy way to complement the variables in mixed-signs instances so as to produce equivalent bilinear instances with non-negative coefficients.

In Algorithm 1, we set both the heuristic time limit and the time limit for Gurobi non-linear non-convex solver to 1,800 seconds (half an hour). While it might seem that our algorithm has an unfair advantage with extra time for heuristic, we will see later that the difference is negligible as the time spent on the separation heuristic is significantly smaller compared to the time for solving the non-convex problem.

Moreover, for our heuristic, we set ϵz=5×103\epsilon_{z}=5\times 10^{-3}, ϵ=102\epsilon=10^{-2}, T=10npT=10\cdot np and Sj=10#(aij|aij0)S_{j}=10\cdot\#(a_{i}^{j}\ |\ a_{i}^{j}\neq 0), where #(aij|aij0)\#(a_{i}^{j}\ |\ a_{i}^{j}\neq 0) represents the number of non-zero coefficients in the jj-th constraint.

The main measure we use for comparison is the quality of the lower bound, which is measured by relative gap closed ρ\rho. Using zMcz_{Mc} as the value of McCormick relaxation, and zoptz_{opt} as the best primal value (best solution found among both settings), we define the relative gap closed in percentage as

ρ:=zzMczoptzMc×100%,\rho:=\frac{z^{*}-z_{Mc}}{z_{opt}-z_{Mc}}\times 100\%,

where zz^{*} is the final lower bound achieved by the corresponding method.

Similarly, we consider the relative gap improvement Δρ\Delta\rho to measure the gain by using lifted bilinear cover cuts, which is defined as

Δρ:=zBCzGRBzoptzMc×100%=ρBCρGRB,\Delta\rho:=\frac{z^{*}_{BC}-z^{*}_{GRB}}{z_{opt}-z_{Mc}}\times 100\%=\rho_{BC}-\rho_{GRB},

where zBCz^{*}_{BC} is the final lower bound achieved by Gurobi for the model where the lifted bilinear cover cuts are added, whereas zGRBz^{*}_{GRB} is the lower bound returned by the default Gurobi non-convex solver without any lifted bilinear cover cuts.

5 Evaluation.

In this section, we evaluate the performance of lifted bilinear cover cuts separated with the heuristic, compared to the default Gurobi solver. Our main focus is the relative gap closed ρ\rho and the relative gap improvement Δρ\Delta\rho defined in the previous section. We will however also discuss the number and the quality of bilinear cover cuts produced as well as associated computation times.

5.1 Non-negative instances.

5.1.1 Relative gap closed at the end of the time limit.

We begin by presenting in Figure 1 results regarding the relative gap closed (ρ\rho) for non-negative instances.

Refer to caption
Figure 1: Relative gap closed ρ\rho (%) for non-negative instances: ρGRB\rho_{GRB} is the gap closed by Gurobi on the separable bilinear instances, and ρBC\rho_{BC} is the gap closed by Gurobi on the separable bilinear instances when lifted bilinear cover cuts are added at the root node.

As can be seen in the figure, with the bilinear cover cuts applied at the root node, Gurobi closes more gap than the default method, resulting in a final lower bound much closer to the primal value. The improvement reduces when the density pp increases, which is anticipated as it becomes more difficult to generate violated lifted bilinear cover cuts using our separation heuristic.

In Figure 2, we take a closer look at the relative gap improvement Δρ\Delta\rho. Not only is the average Δρ\Delta\rho (as shown in the figure) positive, it is in fact positive for most instances.

Refer to caption
Figure 2: Relative gap improvement Δρ\Delta\rho (%) for non-negative instances.

From analyzing these two figures, we draw the conclusion that lifted bilinear cover cuts provide a stable and robust performance boost for sparse non-negative instances.

5.1.2 Lifted bilinear cover cuts - number and root gap closed.

Next, we investigate the number and the quality of generated lifted bilinear cover cuts. In Figure 3, we present the number of cuts generated. In Figure 4, we display the relative gap closed solely at the root node by lifted bilinear cover cuts separated using the heuristics, which we compute as

ρHeu:=zrootBCzMczoptzMc.\rho_{Heu}:=\frac{z_{rootBC}-z_{Mc}}{z_{opt}-z_{Mc}}.

We observe that the number of separated lifted bilinear cover cuts reduces when pp increases. Further, as can be anticipated, the number of cuts generated increases with the number of constraints.

The results in Figure 4 illustrate the strength of lifted bilinear cover cuts over non-negative instances, where much of the gap can be closed by solving the convex relaxation obtained after applying the separation heuristic, which is significantly cheaper than solving the original problem. Interestingly, as far as the root gap closed, the performance of these cuts appears to be independent of the sparsity level pp: it is approximately 60%60\% for most classes of instances.

Refer to caption
Figure 3: Number of lifted bilinear cover cuts nBCn_{BC} for non-negative instances.
Refer to caption
Figure 4: Relative gap closed at root node by lifted bilinear cover cuts (ρHeu\rho_{Heu}) for non-negative instances.

5.1.3 Time.

We consider now the time taken by the heuristic to separate cuts at the root node. As mentioned earlier, we impose a time restriction of 30 minutes for this step. As it turns out, the heuristic (including cut separation and solution of the SOCR convex relaxations) takes only a few seconds; see results in Figure 5.

The time taken by the heuristic increases as the size of the instances becomes larger, though it is still relatively negligible compared to the time spent by the non-convex solver.

Refer to caption
Figure 5: Heuristic CPU time (s) for non-negative instances.

Next we present the total times in Figure 6. The difference in time is nearly non-existent, as the root node heuristic takes very little time and the instances remained unsolved after 30 minutes of Gurobi’s run time in both settings. The similarity in overall time supports the fairness of the comparison of the two settings and confirms the effectiveness of the lifted bilinear cover cuts.

Refer to caption
Figure 6: Total CPU time (s) for non-negative instances: tBCt_{BC} is the time for the case where lifted bilinear cover cuts are added at the root node, and tGRBt_{GRB} is the time for the other setting.

5.1.4 Comparison with the cuts (10).

We next compare the lifted bilinear cover cuts with the cuts (10) described in [23]. Because a single one of these cuts can be generated from each row, no involved separation is required. The results are presented in Figure 7 and Figure 8.

In Figure 7, we present the relative gap closed results at the root node. While it has already been shown from previous sections that the lifted bilinear cover cuts could close a significant portion of gap at the root node, we note that (10) closes a significantly smaller portion of the gap at root node.

In Figure 8, the final relative gap closed results are presented. we observe a similar performance between default Gurobi (ρGRB\rho_{GRB}) and the method applying (10) (ρMT\rho_{MT}), with lifted bilinear cover cuts closing more gap (ρBC\rho_{BC}).

Combining the above observations, it is clear that the lifted bilinear cover cuts are able to close more gap at the root node and provide a more robust and stable performance boost overall, compared against both default Gurobi as well as cuts (10).

Refer to caption
Figure 7: Relative gap closed at root node by lifted bilinear cover cuts (ρHeu\rho_{Heu}) or by (10) (ρMTroot\rho_{MTroot}) for non-negative instances.
Refer to caption
Figure 8: Relative gap closed ρ\rho (%): ρGRB\rho_{GRB} is the gap closed by Gurobi, ρBC\rho_{BC} is the gap closed by Gurobi when lifted bilinear cover cuts are added at the root node, and ρMT\rho_{MT} is the gap closed by Gurobi when cuts (10) are added at the root node

5.2 Mixed-signs instances.

We now focus on mixed-signs instances. We expect that such instances are inherently harder compared to non-negative instances, since it is more difficult to generate lifted bilinear cover cuts for them. Nevertheless, we next show that our heuristic still works well in separating lifted bilinear cover cuts in this case.

5.2.1 Relative gap at the end of the time limit.

We first present in Figure 9 the results regarding the relative gap closed ρ\rho.

Refer to caption
Figure 9: Relative gap closed ρ\rho (%) for mixed-signs instances: ρGRB\rho_{GRB} is the gap closed by Gurobi on the separable bilinear instances, and ρBC\rho_{BC} is the gap closed by Gurobi on the separable bilinear instances when lifted bilinear cover cuts are added at the root node.

For the more difficult mixed-signs instances, our algorithm with root node heuristic again showed improvement, which was especially marked for the sparser instances.

Refer to caption
Figure 10: Relative gap improvement Δρ\Delta\rho (%) for mixed-signs instances.

A closer look at Figure 10 confirms the benefits, as we can observe a positive Δρ\Delta\rho across every instance, which is especially noticeable for sparse settings (i.e., small values of pp).

Therefore, for the more difficult mixed-signs instances, Figure 9 and Figure 10 indicate that our root node separation heuristic retains its strength by providing robust improvements across instances.

5.2.2 Lifted bilinear cover cuts - number and root gap closed.

We now turn to the number of lifted bilinear cover cuts added; see Figure 11. Figure 12 presents the gap closed by these inequalities at the root node.

Refer to caption
Figure 11: Number of lifted bilinear cover cuts nBCn_{BC} for mixed-signs instances.
Refer to caption
Figure 12: Relative gap closed at root node by lifted bilinear cover cuts (ρHeu\rho_{Heu}) for mixed-signs instances.

As shown in the figures, our algorithm excels at separating lifted bilinear cover cuts for sparser problems, as more cuts were generated for smaller pp while closing more gap (i.e., larger ρHeu\rho_{Heu}). Comparing Figure 3 and Figure 11, it appears (as we expected) that negative coefficients in bilinear rows make separation harder, as evidenced by fewer cuts being generated for mixed-signs instances and the root gap closed also reducing for mixed-signs instances.

5.2.3 Time.

We now focus on total computational time expanded. Figure 13 presents the time for the root node separation heuristic whereas Figure 14 presents the overall time results for mixed-signs instances.

Refer to caption
Figure 13: Heuristic CPU time (s) for mixed-signs instances.
Refer to caption
Figure 14: Total CPU time(s) for mixed-signs instances.

These numbers are very similar to the case of non-negative instances: the heuristic itself takes very little time and most of the time is consumed by the non-convex solver.

5.3 Overall Evaluation.

The results for both non-negative instances and mixed-signs instances indicate that with minimal consumption of cpu time, the heuristic we propose is capable of separating very good lifted bilinear cover cuts, thus leading to a stable and robust performance improvement across instances, as measured in the relative gap closed ρ\rho as well as relative gap improvement Δρ\Delta\rho. Such improvement in performance is especially noticeable when the instances are sparse.

Overall, we believe the strength and potential of the lifted bilinear cover cuts were demonstrated by our experiments, which were most conclusive for sparser problems.

6 Conclusions and Future Directions.

In this paper, we designed a simple heuristic to separate the lifted bilinear cover cuts introduced in [12]. In the computational experiments, the cuts demonstrate major and robust improvement for sparse problems, illustrating the potential of these cuts in situations were SDP-based formulations might not be as helpful.

We envision multiple future directions that will better uncover and utilize the potential of the lifted bilinear cover inequalities, as well as achieving better results:

  • Complexity of separation: We developed a heuristic for separation since we could not design a polynomial-time exact separation routine. We conjecture that the separation problem is NP-hard although proving this result appears nontrivial.

  • Improving the heuristic for cutting planes: While the heuristic we developed is quite simple, it uses randomness. It would be interesting to devise a deterministic heuristic and explore other more sophisticated variants of the heuristic. Another direction is to consider separating cuts from aggregations of constraints, an approach that has proven to be useful for MILPs (see [17, 4]) and for QCQPs (see [27, 6, 18, 10]).

  • Integrating the bilinear cover cuts within the solver: The procedure described in the paper only applies lifted bilinear cover cuts at the root node, and its potential cannot be fully realized without integration within a global solver. It is clear that the method could be applied at children nodes during the branch-and-bound process; evaluating the benefits this provides is an open question.

  • Other problem types: It is easy to see that any quadratic constraint can be relaxed (at the expense of an increase in the number of variables) into a separable bilinear constraint. An interesting direction of research is to determine if adding these lifted bilinear cover cuts to such relaxation is useful in the solution of nonconvex QCQPs.

7 Acknowledgments

Santanu S. Dey gratefully acknowledges the support by ONR under grant N000141912323.

References

  • [1] Egon Balas. Facets of the knapsack polytope. Mathematical Programming, 8(1):146–164, 1975.
  • [2] Egon Balas and Eitan Zemel. Facets of the knapsack polytope from minimal covers. SIAM Journal on Applied Mathematics, 34(1):119–148, 1978.
  • [3] Robert Bixby and Edward Rothberg. Progress in computational mixed integer programming–a look back from the other side of the tipping point. Annals of Operations Research, 149(1):37, 2007.
  • [4] Merve Bodur, Alberto Del Pia, Santanu S Dey, Marco Molinaro, and Sebastian Pokutta. Aggregation-based cutting-planes for packing and covering integer programs. Mathematical Programming, 171(1):331–359, 2018.
  • [5] Samuel Burer. A gentle, geometric introduction to copositive optimization. Mathematical Programming, 151(1):89–116, 2015.
  • [6] Samuel Burer and Fatma Kilinç-Karzan. How to convexify the intersection of a second order cone and a nonconvex quadratic. Mathematical Programming, 162(1):393–429, 2017.
  • [7] Harlan Crowder, Ellis L Johnson, and Manfred Padberg. Solving large-scale zero-one linear programming problems. Operations Research, 31(5):803–834, 1983.
  • [8] Santanu S Dey, Marco Molinaro, and Qianyi Wang. Approximating polyhedra with sparse inequalities. Mathematical Programming, 154(1):329–352, 2015.
  • [9] Santanu S Dey, Marco Molinaro, and Qianyi Wang. Analysis of sparse cutting planes for sparse milps with applications to stochastic milps. Mathematics of Operations Research, 43(1):304–332, 2018.
  • [10] Santanu S Dey, Gonzalo Munoz, and Felipe Serrano. On obtaining the convex hull of quadratic inequalities via aggregations. SIAM Journal on Optimization, 32(2):659–686, 2022.
  • [11] Santanu S. Dey, Asteroide Santana, and Yang Wang. New SOCP relaxation and branching rule for bipartite bilinear programs. Optimization and Engineering, 20(2):307–336, 2019.
  • [12] Xiaoyi Gu, Santanu S Dey, and Jean-Philippe P Richard. Lifting convex inequalities for bipartite bilinear programs. Mathematical Programming, pages 1–33, 2022.
  • [13] Zonghao Gu, George L. Nemhauser, and Martin W. P. Savelsbergh. Lifted flow cover inequalities for mixed 0-1 integer programs. Mathematical Programming, 85(3):439–467, 1999.
  • [14] Zonghao Gu, George L. Nemhauser, and Martin W. P. Savelsbergh. Sequence independent lifting in mixed integer programming. Journal of Combinatorial Optimization, 4(1):109–129, 2000.
  • [15] Peter L. Hammer, Ellis L. Johnson, and Uri N. Peled. Facet of regular 0–1 polytopes. Mathematical Programming, 8(1):179–206, 1975.
  • [16] Roger A Horn and Charles R Johnson. Matrix analysis. Cambridge university press, 2012.
  • [17] Hugues Marchand and Laurence A Wolsey. Aggregation and mixed integer rounding to solve MIPs. Operations Research, 49(3):363–371, 2001.
  • [18] Sina Modaresi and Juan Pablo Vielma. Convex hull of two quadratic or a conic quadratic and a quadratic inequality. Mathematical Programming, 164(1):383–409, 2017.
  • [19] Manfred W. Padberg. A note on zero-one programming. Operations Research, 23(4):833–837, 1975.
  • [20] Jean-Philippe P. Richard. Lifting techniques for mixed integer programming. Wiley Encyclopedia of Operations Research and Management Science, 2010.
  • [21] Asteroide Santana and Santanu S. Dey. The convex hull of a quadratic constraint over a polytope. SIAM Journal on Optimization, 30(4):2983–2997, 2020.
  • [22] T Schoning. A probabilistic algorithm for k-SAT and constraint satisfaction problems. In 40th Annual Symposium on Foundations of Computer Science (Cat. No. 99CB37039), pages 410–414. IEEE, 1999.
  • [23] Mohit Tawarmalani, Jean-Philippe P. Richard, and Kwanghun Chung. Strong valid inequalities for orthogonal disjunctions and bilinear covering sets. Mathematical Programming, 124(1-2):481–512, 2010.
  • [24] Laurence A. Wolsey. Faces for a linear inequality in 0–1 variables. Mathematical Programming, 8(1):165–178, 1975.
  • [25] Laurence A. Wolsey. Facets and strong valid inequalities for integer programs. Operations Research, 24(2):367–372, 1976.
  • [26] Laurence A. Wolsey. Valid inequalities and superadditivity for 0–1 integer programs. Mathematics of Operations Research, 2(1):66–77, 1977.
  • [27] Uğur Yildiran. Convex hull of two quadratic constraints is an LMI set. IMA Journal of Mathematical Control and Information, 26(4):417–450, 2009.
  • [28] Eitan Zemel. Lifting the facets of zero–one polytopes. Mathematical Programming, 15(1):268–277, 1978.