Short-step Methods Are Not Strongly Polynomial-Time
Abstract
Short-step methods are an important class of algorithms for solving convex constrained optimization problems. In this short paper, we show that under very mild assumptions on the self-concordant barrier and the width of the -neighbourhood, any short-step interior-point method is not strongly polynomial-time.
1 Introduction
An algorithm for solving linear programming problems is said to be strongly polynomial-time if the required number of arithmetic operations is a polynomial in the numbers of variables and constraints, independent of the bit-length for encoding the problem instance. A major open problem in optimization and computer science is whether there exists a strongly polynomial-time algorithm for solving linear programming problems.
This paper is concerned with interior-point methods [17], which are an important class of algorithms for solving convex constrained optimization problems and enjoy great success in both theory and practice [9, 13, 17, 19]. In particular, we focus on the sub-class of path-following interior-point methods [17]. These methods are built upon the concept of self-concordant barriers [17]. A self-concordant barrier allows us to define a curve, called the central path, leading from a certain center of the feasible region to an optimal solution. Roughly speaking, the principle of path-following interior-point methods is to approximately trace the central path to move towards the optimal solution. The geometry of the central path therefore plays an important role in the studies of path-following interior-point methods.
By constructing a family of linear programming problems with ill-behaved central paths, it has been shown in [7] that the number of iterations of any path-following interior-point method is lower bounded by , where is the number of constraints. Based on another family of linear programming problems, the recent paper [3] improved the lower bound to , see also [4]. This lower bound applies to and thus denies the strong polynomiality of a large class of path-following interior-point methods, including the short-step methods associated with the logarithmic barrier [10, 15], the long-step methods [11] and the predictor-corrector methods [14]. More instances of ill-behaved central paths are presented in [6, 8].
One way to generalize the short-step methods is to replace the logarithmic barrier by an arbitrary self-concordant barrier [17], such as the volumetric barrier [20], universal barrier [13, 17], entropic barrier [5] and Lee-Sidford barrier [12]. Such an idea has led to improved algorithms for solving linear programming problems, see [12, 20] for example.
In view of the above-mentioned results, it is interesting to ask whether the short-step method associated with a general self-concordant barrier is strongly polynomial-time. The purpose of this short paper is to settle this question negatively, see Corollary 1. Our study is also motivated by the paper [2], which proved that for a family of linear programming problems, slightly different from the one in [3], a certain limit of the central path associated with the entropic barrier coincides with the central path associated with the logarithmic barrier. This suggests that the exponential lower bound in [3] for short-step methods associated with the logarithmic barrier could possibly be generalized to short-step methods associated with a general self-concordant barrier. Our main result Corollary 1 partly fills this gap.
2 Preliminaries
Consider the linear programming problem
(P) |
where , and . Denote the feasible region of problem (P) and its interior by and , respectively. We assume that is bounded and that is non-empty. For simplicity, we denote the optimality gap of a feasible point by
The notion of self-concordant barriers plays vital role in the study of interior-point methods. Let be a proper convex domain, i.e., a convex set with non-empty interior and containing no 1-dimensional affine subspace. A function is said to be a barrier on if . A three times continuously differentiable convex function is said to be self-concordant on if for any and ,
If additionally satisfies that for any and ,
then is said to be -self-concordant on . This paper focuses on the proper convex domain , which is a polytope defined by the linear inequalities . A standard self-concordant barrier on is the logarithmic barrier
Given a self-concordant barrier on , we consider the problem
(Pμ) |
From [16, Section 5.3.4], for any , there exists a unique minimizer to problem (Pμ). We call the -analytic center of problem (P) associated with the self-concordant barrier . The central path of problem (P) associated with is then defined as (the image of) the curve for . By [16, Theorem 5.3.10], converges to an optimal solution to problem (P) as .
By [16, Theorem 5.1.6], is positive definite for any . This allows us to define a norm
and its dual norm
For any and , the -neighbourhood of problem (P) is defined as
Note that is the gradient of the objective function of problem (Pμ). The -neighbourhood is important and a natural choice for the design of path-following interior-point methods since the Newton’s method applied to problem (Pμ) converges quadratically to the optimal solution [16, Theorem 5.2.2]. We write
which is also called the -neighbourhood of problem (P). By the optimality condition of problem (Pμ), we can see that for any and .
A short-step method associated with is defined as an algorithm that generates a sequence of iterates such that the polygonal (i.e., continuous piecewise linear) curve formed using the sequence is contained in the -neighbourhood of problem (P) for some , where is the iteration counter, or more precisely,
Another popular choice of the neighbourhood for path-following algorithms for solving linear programming problems is the so-called wide neighbourhood [21]:
Similarly to the -neighbourhood, we write
3 Main Results
Our non-strong polynomiality result is based on the following family of linear programs introduced in [3]:
where is a real number and is an integer. The notation follows from [3] and signifies that the central path of this linear program is long and winding.
To distinguish the properties of problem from those of a general linear program, we introduce an extra subscript to the notations. For instances, the feasible region, the -analytic center associated with a self-concordant barrier and the wide neighbourhood of problem are denoted as , and , respectively.
We can now present the main results of this paper, whose proofs are deferred to Section 4. The main contribution of this paper is the non-strong polynomiality of short-step methods.
Corollary 1 (Non-strong Polynomiality of Short-step Methods).
We should emphasize that although the self-concordance parameter in Corollary 1 is assumed to be independent of , it could possibly depend on the numbers of variables and constraints of problem . This subsumes almost all known barriers. Furthermore, the constants in Corollary 1 are simplified for readability. They are not optimal and can be made slightly better (see Corollary 2). Finally, we note that the requirement on the initial optimality gap is only very mild. Indeed, it is customary for interior-point methods to start at the -analytic center . Using Theorem 2, we can check that , see also the proof of Lemma 2.
The cruxes of the proof of Corollary 1 are the following theorems. The first shows a certain equivalence among the central paths of all self-concordant barriers. Geometrically speaking, it guarantees that when restricted to a constant-cost slice, all central paths are approximately equally close to (or away from) the boundary of the feasible region.
Theorem 1 (Equivalence of Central Paths).
Consider the linear program (P). Let and be - and -self-concordant barriers on the feasible region . Let be such that . Then, for any ,
The second one bounds the optimality gap of analytic centers.
Theorem 2 (Optimality Gap of Analytic Center).
Consider the linear program (P). Let be a -self-concordant barrier on the feasible region . Then, for any ,
where is the radius of the largest ball contained in .
4 Proofs
In this section, we prove the main results. Throughout this section, given a vector and a positive definite matrix , we let . Also, for any , we let .
4.1 Proof of Theorem 1
We first collect a simple lemma about linear optimization over ellipsoids, whose proof uses only elementary optimality arguments and is thus omitted.
Lemma 1.
Let , and be a positive definite matrix. Then,
We can now prove Theorem 1.
Proof of Theorem 1.
Let . By [16, Theorem 5.1.5], we have
Combining [16, Theorem 5.3.8] with these inclusions yields
and
which implies, respectively,
Using Lemma 1, for any ,
(1) |
and
(2) |
where is the -row of . Also, since for any , Lemma 1 implies that
It follows from (1), (2) and that
Using the same arguments, we also obtain
Thus, for any ,
which completes the proof. ∎
4.2 Proof of Theorem 2
We next prove Theorem 2.
Proof of Theorem 2.
The upper bound follows directly from [16, Theorem 5.3.10]. Therefore, we prove only the lower bound. Two cases are discussed separately:
and |
We first consider the case of
By [16, Lemma 5.1.5 and Theorem 5.2.1], we have
where is the -analytic center, i.e., the unique optimal solution to the problem
Solving the quadratic inequality, we get
(3) |
Using Lemma 1 and inequality (3), we have that
(4) |
Next, since , Lemma 1 implies
(5) |
Combining the inequalities (4) and (5), we obtain
(6) |
On the other hand, by supposition, the feasible region contains a ball of radius . By [16, Theorem 5.3.9], it is contained in the ellipsoid . Hence,
where the left-hand side is a ball with center of radius . Using Lemma 1, we get
which, upon substitution into (6), yields
Then, we consider the case of
Using the optimality condition of problem (Pμ), we get . Therefore,
(7) |
Since , by Lemma 1,
which, together with (7), yields
This completes the proof. ∎
4.3 Some Extra Tools
The following proposition compares the slack of the -analytic center with that of any point in its -neighbourhood.
Proposition 1.
Proof.
The next proposition compares the optimality gap of the -analytic center with that of any point in its -neighbourhood.
Proposition 2.
Consider the linear program (P). Let , , be a self-concordant function on and . Then,
The proof of Proposition 2 uses the same arguments as in the proof of Proposition 1 and is therefore omitted.
We will need to compare the -analytic center associated with and the -analytic center associated with the logarithmic barrier along the constant-cost slices of .
Lemma 2.
Proof.
We first prove the existence of satisfying . Since and are both continuous and strictly increasing functions attaining the minimum value at , it suffices to show that for any with
there exists such that . To show this, we note that the feasible region contains a hypercube of side length , which in turn contains a ball of radius . Also, the cost vector of problem is and hence . Furthermore, it is well-known that the self-concordance parameter of the logarithmic barrier on a polytope is precisely the number of linear inequalities defining the polytope, see [16, Section 5.3] for example. Therefore, is -self-concordant on . Theorem 2 then implies that for a large enough ,
Therefore, there exists a unique with .
We will also need the following remarkable result from [3, Theorem 29].
Some remarks are in order. First, the original statement of [3, Theorem 29] concerns iterates of a longer vector consisting of primal, dual and slack variables. But Theorem 3 concerns only the primal variables and hence is seemingly stronger than [3, Theorem 29], because, for a polygonal curve in with pieces, its projection to a lower dimensional space could possibly have less than pieces in general. However, upon a close inspection of the proof of [3, Theorem 29] and [3, Section 6.2], we find that Theorem 3 holds for the linear program . Using the language of [3], the reason is that for the tropical central path of the linear program , even the projection onto the primal variables is already a polygonal curve with a huge number of pieces. Second, in the original statement of [3, Theorem 29], the requirement on is that , and the conclusion is that . By considering a shorter part of the tropical central path of , we find that replacing the requirement on with the weaker bound of leads the weaker conclusion of , see [3, Section 4.3].
4.4 Proof of Corollary 1
We now have enough tools at our disposal to prove Corollary 1. In fact, we will prove the following version with slightly better constants, from which Corollary 1 follows.
Corollary 2 (Precise Version of Corollary 1).
Consider for a sufficiently large . Let , be a -self-concordant barrier on with independent of and be a sequence of iterates generated by a short-step method associated with . Suppose that
where is a constant depending only on defined in (8). Then, .
Proof.
Let
and
Note that .
Reduction
Without loss of generality, we can assume that for any ,
(9) |
Indeed, if this is not the case, because of the continuity of and the assumption that
we can choose a sub-curve (connected subset) of that satisfies all the assumptions of Corollary 2 as well as assumption (9).
Our goal is to show that and that and , for some and . If this can be proved, the desired conclusion would then follow from Theorem 3.
Proving
Let . Then, for some . By assumption (9) and Proposition 2, we have
Using Lemma 2, there exists a unique such that
By Proposition 1 and Theorem 1, for any ,
(10) |
From the definition of the logarithmic barrier and the optimality conditions of problem (Pμ), there exists such that and
Averaging these equalities and then using Proposition 1, Theorem 1 and the fact that is -self-concordant on , we get
(11) |
Hence, using (10) and (11), for any ,
(12) | ||||
which implies that with
(13) |
Proving and for some and
Acknowledgments
The authors thank Defeng Sun, Kim-Chuan Toh and Stephen Wright for their helpful discussions.
References
- [1] I. Adler and S. Cosares. A strongly polynomial algorithm for a special class of linear programs. Operations Research, 39(6):955–960, 1991.
- [2] X. Allamigeon, A. Aznag, S. Gaubert, and Y. Hamdi. The tropicalization of the entropic barrier. arXiv preprint arXiv:2010.10205, 2020.
- [3] X. Allamigeon, P. Benchimol, S. Gaubert, and M. Joswig. Log-barrier interior point methods are not strongly polynomial. SIAM Journal on Applied Algebra and Geometry, 2(1):140–178, 2018.
- [4] X. Allamigeon, P. Benchimol, S. Gaubert, and M. Joswig. What tropical geometry tells us about the complexity of linear programming. SIAM Review, 63(1):123–164, 2021.
- [5] S. Bubeck and R. Eldan. The entropic barrier: a simple and optimal universal self-concordant barrier. In Conference on Learning Theory, pages 279–279, 2015.
- [6] A. Deza, T. Terlaky, and Y. Zinchenko. Polytopes and arrangements: Diameter and curvature. Operations Research Letters, 36(2):215–222, 2008.
- [7] A. Deza, T. Terlaky, and Y. Zinchenko. Central path curvature and iteration-complexity for redundant Klee-Minty cubes. In Advances in Applied Mathematics and Global Optimization, pages 223–256. Springer, 2009.
- [8] J. C. Gilbert, C. C. Gonzaga, and E. Karas. Examples of ill-behaved central paths in convex optimization. Mathematical Programming, 103(1):63–94, 2004.
- [9] J. Gondzio. Interior point methods 25 years later. European Journal of Operational Research, 218(3):587–601, 2012.
- [10] M. Kojima, S. Mizuno, and A. Yoshise. A polynomial-time algorithm for a class of linear complementarity problems. Mathematical Programming, 44(1):1–26, 1989.
- [11] M. Kojima, S. Mizuno, and A. Yoshise. A primal-dual interior point algorithm for linear programming. In Progress in Mathematical Programming, pages 29–47. Springer, 1989.
- [12] Y. T. Lee and A. Sidford. Solving linear programs with linear system solves. arXiv preprint arXiv:1910.08033, 2019.
- [13] Y. T. Lee and M.-C. Yue. Universal barrier is -self-concordant. Mathematics of Operations Research, 46(3):1129–1148, 2021.
- [14] S. Mizuno, M. J. Todd, and Y. Ye. On adaptive-step primal-dual interior-point algorithms for linear programming. Mathematics of Operations Research, 18(4):964–981, 1993.
- [15] R. Monteiro and I. Adler. Interior path following primal-dual algorithms. Part I: Linear programming. Mathematical Programming, 44(1):27–41, 1989.
- [16] Y. Nesterov. Lectures on Convex Optimization. Springer, 2018.
- [17] Y. Nesterov and A. Nemirovskii. Interior-point Polynomial Algorithms in Convex Programming. SIAM, 1994.
- [18] É. Tardos. A strongly polynomial algorithm to solve combinatorial linear programs. Operations Research, 34(2):250–256, 1986.
- [19] K.-C. Toh, M. J. Todd, and R. H. Tütüncü. SDPT3 – A MATLAB software package for semidefinite programming, Version 1.3. Optimization Methods and Software, 11(1-4):545–581, 1999.
- [20] P. M. Vaidya. A new algorithm for minimizing convex functions over convex sets. In 30th Annual Symposium on Foundations of Computer Science, pages 338–343. IEEE Computer Society, 1989.
- [21] S. J. Wright. Primal-dual Interior-point Methods. SIAM, 1997.
- [22] Y. Ye. The simplex and policy-iteration methods are strongly polynomial for the Markov decision problem with a fixed discount rate. Mathematics of Operations Research, 36(4):593–603, 2011.