On the Projection-Based Convexification of Some Spectral Sets
Abstract
Given a finite-dimensional real inner-product space and a closed convex cone , we call a spectral map if forms a generalized Fan-Theobald-von Neumann (FTvN) systemย [5]. Common examples of include the eigenvalue map, the singular-value map and the characteristic map of complete and isometric hyperbolic polynomials. We call a spectral set if for some . We provide projection-based characterizations of (i.e., the closed convex hull of ) under two settings, namely, when has no invariance property and when has certain invariance properties. In the former setting, our approach is based on characterizing the bi-polar set of , which allows us to judiciously exploit the properties of via convex dualities. In the latter setting, our results complement the existing characterization of inย [7], and unify and extend the related results inย [8] established for certain special cases of and .
1 Introduction
Let be a finite-dimensional real inner-product space and be a closed and convex cone. Consider a function that satisfies the following two properties:
-
(P1)
For all , we have .
-
(P2)
For all and , there exists such that and
(In particular, is surjective with range .) We call a spectral map. Given and a nonempty set , we can define the following spectral set (associated with and ):
(1.1) |
The aim of this paper is to provide projection-based characterizations of under different structural assumptions of , and . To avoid triviality, throughout this paper we assume the set to be feasible, namely .
The definition above suggests that the triple is a generalized version of the (finite dimensional) FTvN system, which was initially proposed in the seminal workย [5]. Specifically, the original definition of the FTvN system requires the spectral map to be isometric, namely, for all , where is induced by the inner product on . In this work, we relax this requirement to requiring the range of (namely, ) to be a closed and convex cone, which is a consequence of the isometry property as well asย (P1) andย (P2). The FTvN system subsumes two fairly general systems, namely the normal decomposition systemย [9] and the system induced by complete and isometric (c.i.) hyperbolic polynomialsย [1], both of which include many important special cases. To elaborate the latter, let be a degree- homogeneous polynomial that is hyperbolic with respect to (w.r.t.) some direction , namely, and for all , the univariate polynomial has only real roots, which are denoted by . Inย [1], it was shown that if is complete and isometric, then the characteristic map of (w.r.t.ย ), namely, for , is indeed an isometric spectral map. A notable special case of such a characteristic map is the eigenvalue map on a Euclidean Jordan algebra of rank , which in turn includes the eigenvalue map on the vector space of real symmetric matrices as a special case. (The eigenvalue map on is given by , where are the eigenvalues of .) Beyond the system induced by c.i.ย hyperbolic polynomials, the FTvN system also includes the system induced by the singular-value map for . Here for any with singular values . In addition, when , examples of the FTvN system include the systems induced by the reordering, absolute-value and absolute-reordering maps. For details and more examples, we refer readers toย [4, Sectionย 4].
The spectral set appears as the spectral constraints in many optimization problems, where can take various forms, including the absolute-reordering map on ย [8], the eigenvalue map on ย [3, 10] or more generally on a Euclidean Jordan algebra of rank ย [6], and the singular-value map on ย [10]. Recently, some worksย [5, 6] pioneered the study of the optimization problems with spectral constraints defined by the isometric spectral map in the FTvN system, which unifies and generalizes all the aforementioned forms of . Indeed, optimization problems of this form not only possess great generality, but also enjoy many attractive properties for algorithmic development.
Despite the important role that plays in optimization, to our knowledge, the study of has mainly focused on the setting where has certain invariance propertiesย [9, 8, 7]. Indeed, the proper notion of invariance depends on the spectral map . For example, if is the eigenvalue (resp.ย singular-value) map, then the corresponding notion of invariance is the permutation- (resp.ย permutation- and sign-) invariance. More generally, in the normal decomposition system, the invariance of is defined w.r.t.ย some closed group of orthogonal transformations on ย [9], and in the FTvN system, the invariance is defined via a -compatible spectral map on ย [4] (see Definitionsย 2.1 andย 2.2 for details). When is invariant (in some proper sense), from the seminal worksย [9, 8, 7], we know that is closed and convex if and only if is closed and convexย [9, 7], and furthermore, ย [8, 7]. On the other hand, there also exist simple examples demonstrating that for a general set without any invariance property, may not be convex even if is. This leads to the main question that we aim to address in this work:
How to characterize when has no invariance property?
The motivation behind this question is not only due to its natural theoretical interest, but also comes from the fact that non-invariant instances of frequently appear in the spectral constraints of optimization problems, and one of the most common examples is the H-polyhedron defined by general and ย [3]. In these problems, the spectral constraint sets may be non-convex, and a natural step to obtain the convex relaxations of these problems is to characterize . Unfortunately, the proof techniques in prior works (see e.g.,ย [8, 7]) leading to for invariant cannot be easily extended to the case where is non-invariant. As such, new techniques need to be developed to tackle the question posed above.
Apart from addressing the main question above, a side goal of this work is to establish alternative characterizations of when is invariant in the context of FTvN system (cf.ย Definitionsย 2.1 andย 2.2). Although the result inย [7] is simple and elegant, sometimes is complicated and difficult to describe, even though itself admits a simple description (see Remarkย 2.5). Therefore, we aim to characterize in certain ways that do not involve , and hopefully in some cases, these new characterizations admit simpler descriptions compared to .
Main contributions. Motivated by the goals above, in this work, we provide projection-based characterizations of under different structural assumptions of , and . Our main results can be summarized in two parts.
First, we derive projection-based characterizations of , where is defined by a class of sets that do not have any invariance property, but instead satisfy some other relatively mild assumptions (see Theoremย 2.1 and Corollaryย 2.1 for details). Our approach is based on a simple idea, namely characterizing the bipolar set of . Despite its simplicity, this idea allows us to judiciously exploitย (P1) andย (P2) through convex dualities. To our knowledge, our approach is the first one for characterizing without leveraging the invariance properties of .
Second, we consider the setting where a -compatible spectral map exists on and is -invariant (cf.ย Definitionsย 2.1 andย 2.2). (Indeed, this notion of invariance is fairly general, and includes many common notions of invariance as special cases, e.g., permutation- and sign-invariance.) Under this setting, we derive a projection-based characterization of , which is complementary to the characterization ย [7]. We demonstrate that in some cases, our characterization admits a simpler description compared to (cf.ย Remarkย 2.5). In addition, our result unifies and extends the related results inย [8] established for certain special cases of and .
Notations.ย For any set , denote its convex hull, affine hull and relative interior by , and , respectively. For a nonempty cone , define its polar . Let be the norm induced by the inner product on . Also, for , define and for . We define and let be the vector with entries of arranged in non-increasing order. Also, let be the nonnegative orthant, and define and . Given real vector spaces and , the function is called positively homogeneous (p.h.) if for all and . Given a closed convex function , define .
2 Main Results
For some results below, we need to make the following additional assumption about . To that end, let denote the recession subspace of , i.e., .
-
(P3)
For any , there exists such that , for all .
Remark 2.1
Theorem 2.1
-
Proof
See Sectionย 3.
Remark 2.2
(Membership Oracle of ) Note that the description of inย (2.1) is based on projection. Therefore, given , checking amounts to a convex feasibility problem in . Under certain assumptions, this feasibility problem can be solved in polynomial-time using the ellipsoid method, so long as the separation oracles of the sets , and can be computed in polynomial time. Similar remarks also apply to other projection-based characterizations of to be presented later.
From Theoremย 2.1, we can easily obtain the following corollary that characterizes when is (potentially) non-convex or non-closed. Indeed, this corollary can be viewed as a generalization of Theoremย 2.1.
Corollary 2.1
The proof of Corollaryย 2.1 is immediate from the following lemma.
Lemma 2.1
Let be given in Corollaryย 2.1 and define Then .
-
Proof
See Sectionย 3.
Proof of Corollaryย 2.1. Since is closed, convex and feasible, based on Lemmaย 2.1, we can invoke Theoremย 2.1 to characterize .
Next, let us turn our focus to the case where has certain invariance properties. We shall define the invariance of in a general way via the so-called -compatible spectral map on , which is in line withย [4, Sectionย 10].
Definition 2.1 (-Compatible Spectral Map)
Definition 2.2 (-Invariant Set)
Let be a spectral map. A set is called -invariant if for any , , where
(2.3) |
Remark 2.3
Several remarks are in order. First, note that the condition on is equivalent to that for all . Therefore, the compatibility of with is essentially its compatibility with , i.e., the range of . Second, from Definitionย 2.1, it is easy to see that , and hence for all . Third, note that given a spectral map , a -compatible spectral map may not necessarily exist. A typical example of such a is the characteristic map of a complete and isometric hyperbolic polynomial, in which case is a closed convex cone without additional structures (seeย [4, Sectionย 10] for a discussion). That said, a -compatible spectral map does exist for some special instances of . For example, for , for and for . Indeed, in these examples, a set is -invariant if it is permutation-, sign- and permutation- and sign-invariant, respectively.
Proposition 2.1
Given a spectral map , let be a -compatible spectral map, and be a -invariant set. Define , then . Moreover, if is also continuous and p.h., then .
-
Proof
See Sectionย 3.
Remark 2.4
Two remarks are in order. First, the result can be found inย [7, Theoremย 3.2(b)], but stated under the stronger assumption that is isometric. Note that this assumption implies that is continuous and p.h., but the reverse may not be true. In fact, our proof of Propositionย 2.1 shares similar ideas with that ofย [7, Theoremย 3.2(b)], but appears to be slightly shorter. Second, inย [8, Sectionย 3.1], it was shown that for two special cases, namely i) is the eigenvalue map on and is permutation-invariant and ii) is the singular-value map on and is permutation- and sign-invariant. By taking closure, it is easy to see that in these two cases. This result is subsumed by the general result in Propositionย 2.1, which applies to any continuous and p.h.ย that admits a compatible spectral map , and any -invariant set .
Next, let us present a projection-based characterization of . This characterization requires to be isometric, namely, for all . Common examples of include those mentioned at the end of Remarkย 2.3. The advantages of such a projection-based characterization can be seen in Remarkย 2.5 and Corollaryย 2.2.
Proposition 2.2
Let , and be given in Propositionย 2.1. Moreover, let be isometric. Define and for any , define
(2.4) |
-
(i)
For any satisfying , we have .
-
(ii)
If is continuous and p.h., then for any satisfying that , we have .
-
Proof
See Sectionย 3.
Remark 2.5
Propositionย 2.2(ii) provides another characterization of , namely with . Note that in some cases, may be simpler to describe than , as given in Propositionย 2.1. For example, let , and for some . Note that can be rather complicated to describe (cf.ย [8, Section 3]). In contrast, , which is the intersection of the unit -ball with linear inequalities. Since is convex and compact, we have . Additionally, has a simple description, i.e., . Overall, admits a simple description. Lastly, since is compact, we have .
Remark 2.6
(Connection to Results inย [8]) Inย [8, Sectionย 1], it was shown that if is permutation-, sign- and permutation- and sign-invariant, and , and , respectively, then for any satisfying ,
(2.5) |
(In fact, was stated algebraically in terms of (weak) majorization and entry-wise inequality inย [8].) Since in all cases above, we can take in Propositionย 2.2, and becomes -compatible. By taking closure on both sides ofย (2.5), it is clear that the resulting characterization of is subsumed by the more general result in Propositionย 2.2(ii). As a noteworthy point, the proof techniques inย [8] are rather different from those in our proof of Propositionย 2.2. Specifically,ย (2.5) was proved separately for each of the three cases above inย [8], and the proof in each case made use of the special structures of and . In contrast, our proof of Propositionย 2.2 only uses some general properties of (i.e., isometry,ย (P1) andย (P2)) and (i.e., closedness and convexity), and acts as a unified proof for all the three cases above.
As the last result in this section, we present an extension of Propositionย 2.2. Indeed, if a -compact spectral map exists, the projection-based characterization in Propositionย 2.2 can be extended to a much broader setting, where is only required to be feasible (but not necessarily -invariant).
Corollary 2.2
-
Proof
Define , which is -invariant. We claim that . Indeed, it is clear that if , then . On the other hand, if , then there exists such that , or equivalently, . Since , we have and , and hence . As a result, we have . Now, by Propositionย 2.2(ii), we have for any satisfying that . Since , we complete the proof.
3 Proofs of Results in Sectionย 2
We start by introducing some preliminary convex analytic facts, which can be found in Rockafellarย [11, Sectionsย 12โ14]. For any nonempty set , define its support function for . It is clear that is proper, closed, convex and p.h. In addition, for any , we have for all . We also denote the indicator function of by , such that for and otherwise. For any proper function , define its Fenchel conjugate
(3.1) |
It is clear that , and if is closed and convex, we also have . Let denote the polar set of , which is defined as
(3.2) |
We define , which we call the bipolar set of . An important fact about is that
(3.3) |
Next, we mention two implications ofย (P1) andย (P2). First, note thatย (P1) implies that , and hence if and only if . Second, (P1) andย (P2) straightforwardly imply the following lemma.
Lemma 3.1 ([6, Proposition 3.3]; see alsoย [5, Corollaryย 3.3])
For any and any nonempty set , we have
(3.4) |
We first prove Theoremย 2.1. Our proof leverages the following lemma.
Lemma 3.2
Let be closed and convex, and define . If is polyhedral or , then for any , we have
-
Proof
Indeed, by definition,
(3.5) Since we can write , fromย Lemmaย 3.1, we have
(3.6) Since and the Fenchel conjugate of the function is for we can write down the Fenchel dual problem ofย (3.6) as follows:
(3.7) Note that since is convex, we always have (since ). In addition, if , then (otherwise, since , we have and hence , contradicting ). Now, using classical results on Fenchel duality (see e.g.,ย [11, Theorem 31.1]), if is polyhedral or , we know that strong duality holds betweenย (3.6) andย (3.7), and the infimum inย (3.7) is attained. Consequently, fromย (3.5), we know that if and only if
Proof of Theoremย 2.1. By definition, we have , and by Lemmaย 3.2 andย Lemmaย 3.1, we have
(3.8) | ||||
(3.9) |
where . The Lagrange dual problem ofย (3.9) reads
(3.10) |
Inย (3.10), note that if and otherwise. In addition, we have if and otherwise. To see this, note that if , since is closed and convex, we have (II) = ; otherwise, if , then . Based on the discussions above, we can writeย (3.10) in the following form:
(3.11) |
We aim to show that strong duality holds betweenย (3.9) andย (3.11), and the problem inย (3.11) has an optimal solution. To that end, first notice that since , we have . (To see this, let and note that , and hence . Byย [11, Theoremย 6.5], we have , and hence . Also, since is closed, we clearly have .) Since is bounded, is convex and compact, and hence . Now, if is polyhedral, then the problem inย (3.9) clearly has a Slater point (cf.ย [2, Propositionย 5.3.6]). Otherwise, since may have empty interior, we invokeย [12, Theoremย 18(b)] and verify the generalized Slater condition: for any and , there exist , and such that
(3.12) |
(cf.ย [12, Eqn.ย (8.13)]). It is easy to see that for any and , if , and (since ), the two conditions inย (3.12) are satisfied. To summarize, in both cases (i.e., is polyhedral or not), the Slater condition is satisfied. As a result, we have
(3.13) |
Based onย (3.3) andย (3.13), we know that
If , then and . We then have for all (since is convex) and
(3.14) |
This proves the part (i) of Theoremย 2.1. Now, let and satisfyย (P3). Take , and let be given inย (P3). Define
(3.15) |
Since , we have , and hence is nonempty and bounded if and only if is. Since , by part (i) of Theoremย 2.1, we have
(3.16) | ||||
(3.17) | ||||
(3.18) |
Since , we complete the proof.
Proof of Lemmaย 2.1. Since , we have . Thus to show , it suffices to show that . Suppose there exists such that , then there exists such that , where the last equality follows from Lemmaย 3.1. On the other hand, fromย (P1), we have , where the second inequality is due to . This leads to a contradiction.
The proofs of Propositionsย 2.1 andย 2.2 require the following lemma. Most of the results in this lemma can be found inย [4], however, we restate or reprove them with weaker assumptions on the spectral maps and .
Lemma 3.3
Let , , be given in Propositionย 2.1, and . Then we have the following:
-
(a)
If is p.h.ย and is convex, then is convex.
-
(b)
If is isometric, then it is p.h.ย and continuous.
-
(c)
If is isometric, then both and are -invariant.
-
(d)
If is continuous, then .
-
(e)
For all and , we have and consequently, .
-
(f)
If is isometric, then for all , we have .
-
Proof
We only proveย (d) andย (f), since the proofs of the other parts directly follow from those inย [4]. To showย (d), it suffices to show that . Take any such that . Then there exists such that . In addition, since , we have Note that for all , there exists such that and hence . Also, since is -invariant, . Thus , which implies that . Thus we have , which is a contradiction. Next, we showย (f). We only prove the reverse direction, since the proof of the other direction can be found inย [4]. Let , then , where for , , and . Byย (b), we know is p.h.ย and byย (e), we have for all , . Since , we have for all , which amounts to .
Proof of Propositionย 2.1. To show , it suffices to show that . Suppose there exists such that , using the same reasoning as in the proof of Lemmaย 2.1, there exists such that . On the other hand, fromย (P1), we have
where the last step follows from and (since is -invariant). This leads to a contradiction. In addition, if is continuous and p.h., then is closed and convex (cf.ย Lemmaย 3.3(a)). Thus .
Proof of Propositionย 2.2. To showย (i), we first show that for . Take any . Since and there exists such that , by Lemmaย 3.3(f), we have . In addition, since is -invariant, by Lemmaย 3.3(c), is -invariant. As a result, since , we have and hence . This implies that , or equivalently, . Next, we show for . Let , so that . Write , where for , , and . Since is -invariant, we have . Also, since is isometric, by Lemmaย 3.3(b) and (e), we have , where . This shows that . To showย (ii), let , and we know that from Propositionย 2.1. Hence it suffices to show that . We first show that for . Using similar reasoning as in the proof ofย (i), we know that is -invariant and for any , there exists such that , implying that . Since is continuous, is closed, and hence . Next, we show that for , but this directly follows fromย (i) and Lemmaย 3.3(d).
4 Concluding Remarks
In this work, we have provided projection-based characterizations of when has no invariance property (cf.ย Theoremย 2.1 and Corollaryย 2.1) and when has certain invariance properties (cf.ย Propositionย 2.2 and Corollaryย 2.2). One may naturally wonder if there exist any connections between these two sets of results. We start the discussion with a conjecture: for any , . If this conjecture is true, then under certain assumptions, we can derive Propositionย 2.2(ii) by leveraging Theoremย 2.1, instead of Propositionย 2.1. Specifically, let and be given in Propositionย 2.1. If is -invariant, then we have . By Theoremย 2.1, if is nonempty and bounded or is polyhedral, then we have , where . Now, by the proof of Propositionย 2.1(ii), we know that for all satisfying that , and this establishes Propositionย 2.1(ii). However, note that the approach of deriving Propositionย 2.2(ii) using Theoremย 2.1 requires additional assumptions on or itself, and hence appears to be more restrictive than the original approach that leverages Propositionย 2.1.
Acknowledgment. The author thanks Casey Garner, M.ย S.ย Gowda, Bruno Lurenco, Weijun Xie and Shuzhong Zhang for inspiring and helpful discussions during the preparation of this work.
References
- [1] Bauschke, H.H., Gรผler, O., Lewis, A.S., Sendov, H.S.: Hyperbolic polynomials and convex analysis. Canad. J. Math. 53(3), 470โ488 (2001)
- [2] Bertsekas, D.P.: Convex Optimization Theory. Athena Scitific (2009)
- [3] Garner, C., Lerman, G., Zhang, S.: Spectrally constrained optimization. arXiv:2307.04069 (2023)
- [4] Gowda, M., Jeong, J.: Commutativity, majorization, and reduction in FanโTheobaldโvon Neumann systems. Results Math 78 (2023)
- [5] Gowda, M.S.: Optimizing certain combinations of spectral and lineardistance functions over spectral sets. arXiv:1902.06640 (2019)
- [6] Ito, M., Lourenรงo, B.F.: Eigenvalue programming beyond matrices. arXiv:2311.04637 (2023)
- [7] Jeong, J., Gowda, M.: Transfer principles, fenchel conjugate and subdifferential formulas in Fan-Theobald-von Neumann systems. J. Optim. Thoery Appl. 202, 1242โ1267 (2024)
- [8] Kim, J., Tawarmalani, M., Richard, J.P.P.: Convexification of permutation-invariant sets and an application to sparse principal component analysis. Math. Oper. Res. 47(4), 2547โ2584 (2022)
- [9] Lewis, A.S.: Group invariance and convex matrix analysis. SIAM J. Matrix Anal. Appl. 17(4), 927โ949 (1996)
- [10] Li, Y., Xie, W.: On the partial convexification for low-rank spectral optimization: Rank bounds and algorithms. arXiv:2305.07638 (2023)
- [11] Rockafellar, R.T.: Convex analysis. Princeton University Press (1970)
- [12] Rockafellar, R.T.: Conjugate duality and optimization. SIAM (1974)