
Improved bounds in Weaver’s ${\rm KS}_r$ conjecture for high rank positive semidefinite matrices

Zhiqiang Xu LSEC, Inst. Comp. Math., Academy of Mathematics and System Science, Chinese Academy of Sciences, Beijing, 100091, China
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
[email protected]
Zili Xu LSEC, Inst. Comp. Math., Academy of Mathematics and System Science, Chinese Academy of Sciences, Beijing, 100091, China
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
[email protected]
 and  Ziheng Zhu LSEC, Inst. Comp. Math., Academy of Mathematics and System Science, Chinese Academy of Sciences, Beijing, 100091, China
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
[email protected]
Abstract.

Recently, Marcus, Spielman and Srivastava proved Weaver’s ${\rm KS}_r$ conjecture, which gives a positive solution to the Kadison-Singer problem. In [Coh16, Brä18], Cohen and Brändén independently extended this result to obtain the arbitrary-rank version of Weaver’s ${\rm KS}_r$ conjecture. In this paper, we present a new bound in Weaver’s ${\rm KS}_r$ conjecture for the arbitrary-rank case. To do so, we introduce the definition of $(k,m)$-characteristic polynomials and employ it to improve the previous estimate on the largest root of mixed characteristic polynomials. For the rank-one case, our bound agrees with the Bownik-Casazza-Marcus-Speegle bound when $r=2$ [BCMS19] and with the Ravichandran-Leake bound when $r>2$ [RL20]. For the higher-rank case, we sharpen the previous bounds of Cohen and of Brändén.

1. Introduction

1.1. The Kadison-Singer problem

The Kadison-Singer problem, posed by Richard Kadison and Isadore Singer in 1959 [KS59], is a fundamental problem that relates to a dozen areas of research in pure mathematics, applied mathematics and engineering. Basically, it asks whether each pure state on the diagonal subalgebra $l^\infty(\mathbb{N})$ of $\mathcal{B}(l^2(\mathbb{N}))$ has a unique extension. This problem was known to be equivalent to a large number of problems in analysis, such as the Anderson Paving Conjecture [And79, And81], the Bourgain-Tzafriri Restricted Invertibility Conjecture [BT89, CT06], the Feichtinger Conjecture [BS06, CCLV05, Grö03] and the Weaver Conjecture [Wea04].

In a seminal work [MSS15b], Marcus, Spielman and Srivastava resolved the Kadison-Singer problem by proving Weaver’s ${\rm KS}_r$ conjecture. The case $r=2$ of Weaver’s ${\rm KS}_r$ conjecture can be stated as follows.

Conjecture 1.1.

(${\rm KS}_2$) There exist universal constants $\eta \geq 2$ and $\theta > 0$ such that the following holds. Let $\mathbf{u}_1,\ldots,\mathbf{u}_m \in \mathbb{C}^d$ satisfy $\|\mathbf{u}_i\| \leq 1$ for all $i$ and

(1.1) $\sum_{i=1}^m |\langle\mathbf{u},\mathbf{u}_i\rangle|^2 = \eta$

for every unit vector $\mathbf{u}\in\mathbb{C}^d$. Then there exists a partition $S_1, S_2$ of $[m]:=\{1,\ldots,m\}$ such that

(1.2) $\sum_{i\in S_j} |\langle\mathbf{u},\mathbf{u}_i\rangle|^2 \leq \eta - \theta$

for every unit vector $\mathbf{u}\in\mathbb{C}^d$ and each $j\in\{1,2\}$.

The following theorem plays an important role in their proof of Weaver’s ${\rm KS}_r$ conjecture.

Theorem 1.2.

(see [MSS15b, Theorem 1.4]) Let $\varepsilon>0$ and let $\mathbf{W}_1,\ldots,\mathbf{W}_m$ be independent random positive semidefinite Hermitian matrices in $\mathbb{C}^{d\times d}$ of rank 1 with finite support. If

$\sum_{i=1}^m \mathbb{E}\mathbf{W}_i = \mathbf{I}_d$

and

$\operatorname{tr}(\mathbb{E}\mathbf{W}_i)\leq\varepsilon \quad \text{for all } i\in[m],$

then

(1.3) $\mathbb{P}\Big[\big\|\sum_{i=1}^m \mathbf{W}_i\big\| \leq (1+\sqrt{\varepsilon})^2\Big] > 0.$

Theorem 1.3 follows directly from Theorem 1.2 and implies a positive solution to Weaver’s ${\rm KS}_r$ conjecture.

Theorem 1.3.

(see [MSS15b, Corollary 1.5]) Let $r\geq 2$ be an integer. Assume that $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$ are positive semidefinite Hermitian matrices of rank at most $1$ such that

$\mathbf{X}_1+\cdots+\mathbf{X}_m=\mathbf{I}_d.$

Let $\varepsilon := \max_{1\leq i\leq m}\operatorname{tr}(\mathbf{X}_i)$. Then there exists a partition $S_1\cup\cdots\cup S_r=[m]$ such that

(1.4) $\big\|\sum_{i\in S_j}\mathbf{X}_i\big\| \leq \frac{1}{r}\cdot(1+\sqrt{r\varepsilon})^2 \quad \text{for all } j\in[r].$

In particular, when $r=2$, if we set $\mathbf{X}_i=\frac{1}{\eta}\mathbf{u}_i\mathbf{u}_i^*$ and $\varepsilon=\frac{1}{\eta}$, then Theorem 1.3 implies Conjecture 1.1 with any constant $\eta>(2+\sqrt{2})^2\approx 11.6569$.
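To see where the threshold $(2+\sqrt{2})^2=6+4\sqrt{2}$ comes from, note that with $\varepsilon=1/\eta$ the partitioned norm is bounded by $\frac{\eta}{2}(1+\sqrt{2/\eta})^2$, which is strictly below $\eta$ exactly when $\eta>(2+\sqrt{2})^2$. A short numeric sanity check (the function name `mss_bound` is our own):

```python
import math

def mss_bound(r, eps):
    """Right-hand side of (1.4): (1/r) * (1 + sqrt(r*eps))^2."""
    return (1.0 + math.sqrt(r * eps)) ** 2 / r

# Threshold in Conjecture 1.1 via Theorem 1.3: X_i = u_i u_i^*/eta, eps = 1/eta.
threshold = (2 + math.sqrt(2)) ** 2
assert abs(threshold - (6 + 4 * math.sqrt(2))) < 1e-12

# Just above the threshold the partitioned norm bound beats eta ...
eta = threshold + 1e-6
assert eta * mss_bound(2, 1 / eta) < eta
# ... and just below it the bound fails to beat eta.
eta = threshold - 1e-6
assert eta * mss_bound(2, 1 / eta) > eta
```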

1.2. Related work

Here we briefly review previous improvements of Theorem 1.3.

1.2.1. The rank-one case

When $r=2$, Bownik, Casazza, Marcus and Speegle [BCMS19] improved the upper bound in (1.4) to $\frac{1}{2}\cdot(\sqrt{1-2\varepsilon}+\sqrt{2\varepsilon})^2$ when $\varepsilon\leq\frac{1}{4}$. This bound implies the same result as in Conjecture 1.1, but with any constant $\eta>4$. To our knowledge, this is the best known estimate on the constant $\eta$ in Conjecture 1.1.

Ravichandran and Leake [RL20] adapted the method of interlacing families to directly prove Anderson’s paving conjecture, which is well known to be equivalent to Weaver’s ${\rm KS}_r$ conjecture. They showed that, for any integer $r\geq 2$ and any real number $0<\varepsilon\leq\frac{(r-1)^2}{r^2}$, if $\mathbf{A}\in\mathbb{C}^{m\times m}$ is a positive semidefinite matrix satisfying $\mathbf{0}\preceq\mathbf{A}\preceq\mathbf{I}_m$ and $\mathbf{A}(i,i)\leq\varepsilon$ for all $i$, then there exists a partition $S_1\cup\cdots\cup S_r=[m]$ such that

(1.5) $\|\mathbf{A}(S_j)\| \leq \frac{1}{r}\cdot\big(\sqrt{1-\frac{r\varepsilon}{r-1}}+\sqrt{r\varepsilon}\big)^2 \quad \text{for all } j\in[r].$

Here, $\mathbf{A}(S)$ denotes the submatrix of $\mathbf{A}$ with rows and columns indexed by $S\subset[m]$. Their result implies that the upper bound in (1.4) can be improved to

(1.6) $\big\|\sum_{i\in S_j}\mathbf{X}_i\big\| \leq \frac{1}{r}\cdot\big(\sqrt{1-\frac{r\varepsilon}{r-1}}+\sqrt{r\varepsilon}\big)^2 \quad \text{for all } j\in[r]$

when $\varepsilon\leq\frac{(r-1)^2}{r^2}$. To see this, write $\mathbf{X}_i=\mathbf{u}_i\mathbf{u}_i^*$ for each $i\in[m]$ and set $\mathbf{A}=\mathbf{U}^*\mathbf{U}$, where $\mathbf{U}=[\mathbf{u}_1,\ldots,\mathbf{u}_m]\in\mathbb{C}^{d\times m}$. Then (1.6) immediately follows since

$\|\mathbf{A}(S_j)\| = \big\|\sum_{i\in S_j}\mathbf{X}_i\big\|$

for each $j\in[r]$. When $r=2$, Ravichandran and Leake’s bound in (1.6) coincides with the estimate of Bownik et al. from [BCMS19]. In [AB20], Alishahi and Barzegar extended Ravichandran and Leake’s result to the setting of real stable polynomials and studied the paving property for strongly Rayleigh processes.

1.2.2. The higher-rank case

In [Coh16], Cohen showed that Theorem 1.3 holds for matrices of higher rank, and that the upper bound in (1.4) still holds in this case. Brändén also removed the rank constraint in [Brä18] and extended Theorem 1.3 to the realm of hyperbolic polynomials. For each $\varepsilon>0$ and each integer $k>2$, let $\delta_{\varepsilon,k}$ be defined as

(1.7) $\delta_{\varepsilon,k} := \begin{cases} (\sqrt{1-\frac{\varepsilon}{k}}+\sqrt{\varepsilon})^2, & \text{if } \varepsilon\leq\frac{k}{k+1},\\ 2+2\varepsilon(1-\frac{1}{k}), & \text{otherwise.}\end{cases}$

In the setting of Theorem 1.3, Brändén proved that, if the rank of each $\mathbf{X}_i$ is at most $k$, then the upper bound in (1.4) can be improved to $\frac{1}{r}\cdot\delta_{r\varepsilon,kr}$ when $kr>2$. For the case where $k=1$ and $r=2$, Brändén also obtained the same bound as that of [BCMS19]. One can check that Ravichandran and Leake’s bound (1.6) is better than Brändén’s bound when $k=1$ and $r>2$, but Brändén’s bound is more general since it is available for $k>1$.
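The comparison in the last sentence can be checked numerically: for $k=1$, Brändén’s bound is $\frac{1}{r}(\sqrt{1-\varepsilon}+\sqrt{r\varepsilon})^2$, and since $\frac{r\varepsilon}{r-1}>\varepsilon$, the Ravichandran-Leake bound (1.6) is strictly smaller. A sketch (function names are our own; the $\varepsilon$ values stay inside both regimes of validity):

```python
import math

def rl_bound(r, eps):
    """Ravichandran-Leake bound (1.6), valid for eps <= (r-1)^2/r^2."""
    return (math.sqrt(1 - r * eps / (r - 1)) + math.sqrt(r * eps)) ** 2 / r

def branden_bound_k1(r, eps):
    """(1/r) * delta_{r*eps, r}, in the regime r*eps <= r/(r+1)."""
    return (math.sqrt(1 - eps) + math.sqrt(r * eps)) ** 2 / r

for r in (3, 4, 5):
    for eps in (0.01, 0.05, 0.1):
        assert eps <= min((r - 1) ** 2 / r ** 2, 1 / (r + 1))
        # RL is strictly better (smaller) when k = 1 and r > 2.
        assert rl_bound(r, eps) < branden_bound_k1(r, eps)
```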

1.3. Our contribution

In this paper we focus on extending Theorem 1.2 and Theorem 1.3 to the higher-rank case. We first present the following theorem, which improves the previous bounds from (1.3) and from [Brä18, Theorem 6.1].

Theorem 1.4.

Let $k\geq 2$ be an integer and let $\varepsilon\in(0,\frac{(k-1)^2}{k}]$. Let $\mathbf{W}_1,\ldots,\mathbf{W}_m$ be independent random positive semidefinite Hermitian matrices in $\mathbb{C}^{d\times d}$ with finite support. Suppose that $\sum_{i=1}^m \mathbb{E}\mathbf{W}_i=\mathbf{I}_d$ and

$\operatorname{tr}(\mathbb{E}\mathbf{W}_i)\leq\varepsilon, \quad \operatorname{rank}(\mathbb{E}\mathbf{W}_i)\leq k \quad \text{for all } i\in[m].$

Then,

$\mathbb{P}\Big[\big\|\sum_{i=1}^m \mathbf{W}_i\big\| \leq \big(\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\big)^2\Big] > 0.$

We immediately obtain the following corollary by an argument similar to that of [MSS15b, Corollary 1.5].

Corollary 1.5.

Let $r\geq 2$ and $k\geq 1$ be integers. Assume that $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$ are positive semidefinite Hermitian matrices of rank at most $k$ such that

(1.8) $\mathbf{X}_1+\cdots+\mathbf{X}_m=\mathbf{I}_d.$

Let $\varepsilon := \max_{1\leq i\leq m}\operatorname{tr}(\mathbf{X}_i)$. If $\varepsilon\leq\frac{(kr-1)^2}{kr^2}$, then there exists a partition $S_1\cup\cdots\cup S_r=[m]$ such that

(1.9) $\big\|\sum_{i\in S_j}\mathbf{X}_i\big\| \leq \frac{1}{r}\cdot\big(\sqrt{1-\frac{r\varepsilon}{kr-1}}+\sqrt{r\varepsilon}\big)^2 \quad \text{for all } j\in[r].$
Proof.

For each $i\in[m]$, let $\mathbf{W}_i$ be a random matrix that takes each of the following matrices of size $rd\times rd$ with equal probability:

$\mathbf{W}_{i,1} := \operatorname{diag}(r\mathbf{X}_i, 0, \ldots, 0),\quad \mathbf{W}_{i,2} := \operatorname{diag}(0, r\mathbf{X}_i, \ldots, 0),\quad \ldots,\quad \mathbf{W}_{i,r} := \operatorname{diag}(0, 0, \ldots, r\mathbf{X}_i).$

A simple calculation shows that

$\mathbb{E}\mathbf{W}_i = \operatorname{diag}(\mathbf{X}_i, \mathbf{X}_i, \ldots, \mathbf{X}_i) \in \mathbb{C}^{rd\times rd} \quad \text{for all } i\in[m].$

This gives

$\operatorname{tr}(\mathbb{E}\mathbf{W}_i)\leq r\varepsilon \quad\text{and}\quad \operatorname{rank}(\mathbb{E}\mathbf{W}_i)\leq kr \quad \text{for all } i\in[m].$

We also have $\sum_{i=1}^m \mathbb{E}\mathbf{W}_i=\mathbf{I}_{rd}$. By Theorem 1.4, we obtain

(1.10) $\mathbb{P}\Big[\big\|\sum_{i=1}^m \mathbf{W}_i\big\| \leq \big(\sqrt{1-\frac{r\varepsilon}{kr-1}}+\sqrt{r\varepsilon}\big)^2\Big] > 0.$

Hence, for each $i\in[m]$ there exists $j_i\in[r]$ such that

$\big\|\sum_{i=1}^m \mathbf{W}_{i,j_i}\big\| \leq \big(\sqrt{1-\frac{r\varepsilon}{kr-1}}+\sqrt{r\varepsilon}\big)^2.$

For each $j\in[r]$, set $S_j := \{i\in[m] : j_i=j\}$. Then $S_1,\ldots,S_r$ form a partition of $[m]$, and the above inequality, together with the block structure of the $\mathbf{W}_{i,j_i}$, gives

$\big\|\sum_{i\in S_j}\mathbf{X}_i\big\| \leq \frac{1}{r}\cdot\big(\sqrt{1-\frac{r\varepsilon}{kr-1}}+\sqrt{r\varepsilon}\big)^2 \quad \text{for all } j\in[r].$ ∎
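The lifting used in the proof is easy to verify numerically. The sketch below (our own illustration, with hypothetical function names) builds the block matrices $\mathbf{W}_{i,j}$ from a given $\mathbf{X}_i$ and checks that $\mathbb{E}\mathbf{W}_i=\operatorname{diag}(\mathbf{X}_i,\ldots,\mathbf{X}_i)$ with $\operatorname{tr}(\mathbb{E}\mathbf{W}_i)=r\operatorname{tr}(\mathbf{X}_i)$:

```python
import numpy as np

def lift_blocks(X, r):
    """Return W_{i,1}, ..., W_{i,r}: rd x rd matrices with r*X in diagonal slot j."""
    d = X.shape[0]
    blocks = []
    for j in range(r):
        W = np.zeros((r * d, r * d), dtype=complex)
        W[j * d:(j + 1) * d, j * d:(j + 1) * d] = r * X
        blocks.append(W)
    return blocks

rng = np.random.default_rng(0)
d, r = 3, 4
B = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))
X = B @ B.conj().T              # a generic positive semidefinite matrix
Ws = lift_blocks(X, r)

EW = sum(Ws) / r                # uniform expectation over the r outcomes
assert np.allclose(EW, np.kron(np.eye(r), X))     # diag(X, ..., X)
assert np.isclose(np.trace(EW), r * np.trace(X))
```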

Remark 1.6.

Motivated by the argument in [FY19, Proposition 2.2], we can relax the condition (1.8) in Corollary 1.5 to $\sum_{i=1}^m \mathbf{X}_i \preceq \mathbf{I}_d$. More specifically, we can find rank-one matrices $\{\mathbf{v}_j\mathbf{v}_j^*\}_{1\leq j\leq M}$ with trace at most $\varepsilon$ such that $\sum_{i=1}^m \mathbf{X}_i + \sum_{j=1}^M \mathbf{v}_j\mathbf{v}_j^* = \mathbf{I}_d$. By Corollary 1.5, there then exists a partition of $[m+M]$ satisfying (1.9). Restricting each subset of this partition to $[m]$ yields the desired partition of $[m]$.

Our bound in (1.9) coincides with that of [RL20] for each $r\geq 2$ when $k=1$. In particular, our bound is also the same as that of [BCMS19] when $r=2$ and $k=1$. For the case $k\geq 2$, our bound (1.9) slightly improves Brändén’s bound, i.e., $\frac{1}{r}\cdot\delta_{r\varepsilon,kr} = \frac{1}{r}\cdot(\sqrt{1-\frac{\varepsilon}{k}}+\sqrt{r\varepsilon})^2$. We summarize the related works in Table 1.
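Since $\frac{r\varepsilon}{kr-1}>\frac{\varepsilon}{k}$ for all $r\geq 2$, the first square root in (1.9) is smaller than in Brändén’s bound, so the improvement, while slight, is strict. A quick numerical comparison (our own function names; $\varepsilon$ kept inside the first branch of (1.7)):

```python
import math

def our_bound(r, k, eps):
    """Bound (1.9), valid for eps <= (kr-1)^2/(k r^2)."""
    return (math.sqrt(1 - r * eps / (k * r - 1)) + math.sqrt(r * eps)) ** 2 / r

def branden_bound(r, k, eps):
    """(1/r) * delta_{r*eps, kr} = (1/r)(sqrt(1 - eps/k) + sqrt(r*eps))^2."""
    return (math.sqrt(1 - eps / k) + math.sqrt(r * eps)) ** 2 / r

for r in (2, 3, 4):
    for k in (2, 3):
        for eps in (0.05, 0.1, 0.2):
            assert eps <= k / (k * r + 1)            # Brändén's first regime
            assert our_bound(r, k, eps) < branden_bound(r, k, eps)
```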

Table 1. The estimate on the paving bound in Corollary 1.5

The values of $k$ and $r$ | Paving bound in (1.9)

$k=1$, $r=2$ | $\frac{1}{2}\cdot(\sqrt{1-2\varepsilon}+\sqrt{2\varepsilon})^2$ for $\varepsilon\leq\frac{1}{4}$ [BCMS19, RL20, Brä18]

$k=1$, $r\geq 2$ | $\frac{1}{r}\cdot(1+\sqrt{r\varepsilon})^2$ [MSS15b]
 | $\frac{1}{r}\cdot(\sqrt{1-\frac{r\varepsilon}{r-1}}+\sqrt{r\varepsilon})^2$ for $\varepsilon\leq\frac{(r-1)^2}{r^2}$ [RL20]

$k\geq 1$, $r\geq 2$ | $\frac{1}{r}\cdot(1+\sqrt{r\varepsilon})^2$ [Coh16]
 | $\frac{1}{r}\cdot(\sqrt{1-\frac{\varepsilon}{k}}+\sqrt{r\varepsilon})^2$ for $\varepsilon\leq\frac{k}{kr+1}$ [Brä18]
 | $\frac{1}{r}\cdot(2+2r\varepsilon(1-\frac{1}{kr}))$ for $\varepsilon>\frac{k}{kr+1}$ [Brä18]
 | $\frac{1}{r}\cdot(\sqrt{1-\frac{r\varepsilon}{kr-1}}+\sqrt{r\varepsilon})^2$ for $\varepsilon\leq\frac{(kr-1)^2}{kr^2}$ (Corollary 1.5)

We next provide an application of Corollary 1.5, which gives a simultaneous paving bound for multiple positive semidefinite Hermitian matrices. In [RS19], Ravichandran and Srivastava proved a simultaneous paving bound for a tuple of zero-diagonal Hermitian matrices; the following corollary thus serves as a counterpart of [RS19, Theorem 1.1]. The result also coincides with [RL20, Theorem 2] when paving just one matrix.

Corollary 1.7.

Let $r\geq 2$ and $k\geq 1$ be integers. Assume that $\mathbf{A}_1,\ldots,\mathbf{A}_k\in\mathbb{C}^{m\times m}$ are positive semidefinite Hermitian matrices satisfying $\mathbf{0}\preceq\mathbf{A}_i\preceq\mathbf{I}_m$ for $1\leq i\leq k$. Let $\alpha := \max_{1\leq i\leq k}\max_{1\leq l\leq m}\mathbf{A}_i(l,l)$. If $\alpha\leq\frac{(kr-1)^2}{k^2r^2}$, then there exists a partition $S_1\cup\cdots\cup S_r=[m]$ such that

$\|\mathbf{A}_i(S_j)\| \leq \big(\sqrt{\frac{1}{r}-\frac{k\alpha}{kr-1}}+\sqrt{k\alpha}\big)^2 \quad \text{for all } i\in[k] \text{ and } j\in[r].$
Proof.

For $1\leq i\leq k$, choose vectors $\{\mathbf{u}_{i,l}\}_{l\in[m]}\subset\mathbb{C}^d$ such that $\mathbf{A}_i=(\langle\mathbf{u}_{i,l_1},\mathbf{u}_{i,l_2}\rangle)_{1\leq l_1,l_2\leq m}$. Then we have $\sum_{l=1}^m \mathbf{u}_{i,l}\mathbf{u}_{i,l}^* \preceq \mathbf{I}_d$ and $\|\mathbf{u}_{i,l}\|^2\leq\alpha$ for $1\leq l\leq m$. For $1\leq l\leq m$, define the block diagonal matrix

$\mathbf{X}_l := \operatorname{diag}(\mathbf{u}_{1,l}\mathbf{u}_{1,l}^*, \mathbf{u}_{2,l}\mathbf{u}_{2,l}^*, \ldots, \mathbf{u}_{k,l}\mathbf{u}_{k,l}^*) \in \mathbb{C}^{kd\times kd}.$

Then $\max_{1\leq l\leq m}\operatorname{tr}(\mathbf{X}_l)\leq k\alpha\leq\frac{(kr-1)^2}{kr^2}$. Note that $\sum_{l=1}^m \mathbf{X}_l \preceq \mathbf{I}_{kd}$. By Corollary 1.5 (combined with Remark 1.6), there exists a partition $S_1\cup\cdots\cup S_r=[m]$ such that

(1.11) $\big\|\sum_{l\in S_j}\mathbf{X}_l\big\| \leq \frac{1}{r}\cdot\big(\sqrt{1-\frac{rk\alpha}{kr-1}}+\sqrt{rk\alpha}\big)^2 \quad \text{for all } j\in[r].$

Note that

$\sum_{l\in S_j}\mathbf{X}_l = \operatorname{diag}\Big(\sum_{l\in S_j}\mathbf{u}_{1,l}\mathbf{u}_{1,l}^*, \sum_{l\in S_j}\mathbf{u}_{2,l}\mathbf{u}_{2,l}^*, \ldots, \sum_{l\in S_j}\mathbf{u}_{k,l}\mathbf{u}_{k,l}^*\Big).$

Then we have

(1.12) $\big\|\sum_{l\in S_j}\mathbf{X}_l\big\| = \max_{1\leq i\leq k}\big\|\sum_{l\in S_j}\mathbf{u}_{i,l}\mathbf{u}_{i,l}^*\big\| = \max_{1\leq i\leq k}\|\mathbf{A}_i(S_j)\|.$

Combining (1.11) and (1.12), we arrive at the conclusion. ∎
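The first step of the proof, extracting vectors $\mathbf{u}_{i,l}$ with $\mathbf{A}_i$ as their Gram matrix, can be realized by any factorization $\mathbf{A}_i=\mathbf{U}_i^*\mathbf{U}_i$. A sketch of one such construction (our own, via an eigendecomposition; the clipping of tiny negative eigenvalues is a numerical guard):

```python
import numpy as np

def gram_vectors(A):
    """Columns of U are vectors u_1, ..., u_m with A = U^* U."""
    w, V = np.linalg.eigh(A)
    w = np.clip(w, 0.0, None)            # guard against tiny negative eigenvalues
    return np.diag(np.sqrt(w)) @ V.conj().T

rng = np.random.default_rng(1)
m = 5
B = rng.standard_normal((m, m))
A = B @ B.T
A = A / (np.linalg.norm(A, 2) + 1.0)     # enforce 0 <= A <= I strictly
U = gram_vectors(A)

assert np.allclose(U.conj().T @ U, A)
# sum_l u_l u_l^* = U U^* <= I  and  ||u_l||^2 = A(l, l)
assert np.linalg.norm(U @ U.conj().T, 2) <= 1 + 1e-9
assert np.allclose(np.einsum('dl,dl->l', U.conj(), U).real, np.diag(A))
```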

1.4. Our techniques

To introduce our techniques, let us briefly recall the proof of Theorem 1.2 and how [Coh16] and [Brä18] extended Theorem 1.2 to the higher-rank case.

For the rank-one case, let $\{\mathbf{W}_i\}_{1\leq i\leq m}$ be as defined in Theorem 1.2. For $1\leq i\leq m$, let the support of $\mathbf{W}_i$ be $W_i := \{\mathbf{W}_{i,1},\ldots,\mathbf{W}_{i,l_i}\}$. In [MSS15b], Marcus, Spielman and Srivastava showed that the characteristic polynomials of $\{\sum_{i=1}^m \mathbf{W}_{i,j_i} : 1\leq j_i\leq l_i,\ i=1,\ldots,m\}$ form a so-called interlacing family. This implies that there exists a polynomial in this family whose largest root is at most that of the expected characteristic polynomial of $\sum_{i=1}^m \mathbf{W}_i$. Hence, it is enough to estimate the largest root of this expected characteristic polynomial, which is referred to as the mixed characteristic polynomial:

Definition 1.8.

(see [MSS15b]) Given $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$, the mixed characteristic polynomial of $\mathbf{X}_1,\ldots,\mathbf{X}_m$ is defined as

(1.13) $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m](x) := \prod_{i=1}^m (1-\partial_{z_i}) \det\big[x\cdot\mathbf{I}_d + \sum_{i=1}^m z_i\mathbf{X}_i\big]\Big|_{z_1=\cdots=z_m=0}.$

Assume that $\mathbf{W}_1,\ldots,\mathbf{W}_m$ are independent random matrices of rank one in $\mathbb{C}^{d\times d}$ satisfying $\mathbb{E}\mathbf{W}_i=\mathbf{X}_i$ for each $i\in[m]$. Marcus, Spielman and Srivastava [MSS15b, Theorem 4.1] showed that

(1.14) $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m](x) = \mathbb{E}\det\big[x\cdot\mathbf{I}_d - \sum_{i=1}^m \mathbf{W}_i\big].$

Based on the above formula, they employed a barrier function argument developed in [BSS12] to estimate the largest root of the mixed characteristic polynomial.
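Definition 1.8 is easy to evaluate with a computer algebra system, which also gives a quick sanity check of (1.14): for deterministic rank-one $\mathbf{X}_i$ (each $\mathbf{W}_i=\mathbf{X}_i$ with probability one), $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m](x)$ must equal $\det[x\mathbf{I}_d-\sum_i\mathbf{X}_i]$. A sketch using sympy (the helper name is our own):

```python
import sympy as sp

def mixed_char_poly(Xs, x):
    """mu[X_1, ..., X_m](x) as in (1.13), by direct differentiation."""
    d = Xs[0].shape[0]
    zs = sp.symbols(f'z0:{len(Xs)}')
    M = x * sp.eye(d) + sum((z * X for z, X in zip(zs, Xs)), sp.zeros(d, d))
    p = M.det()
    for z in zs:                        # apply prod_i (1 - d/dz_i)
        p = sp.expand(p - sp.diff(p, z))
    return sp.expand(p.subs({z: 0 for z in zs}))

x = sp.symbols('x')
# rank-one matrices X_i = u_i u_i^T summing to I_2
u1, u2 = sp.Matrix([1, 0]), sp.Matrix([0, 1])
mu = mixed_char_poly([u1 * u1.T, u2 * u2.T], x)
assert sp.expand(mu - (x - 1) ** 2) == 0   # char poly of X_1 + X_2 = I_2
```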

For the higher-rank case, instead of studying the characteristic polynomials of $\{\sum_{i=1}^m \mathbf{W}_{i,j_i} : 1\leq j_i\leq l_i,\ i=1,\ldots,m\}$, the authors of [Coh16] and [Brä18] concentrated on the mixed characteristic polynomials of $\mathbf{W}_{1,j_1},\ldots,\mathbf{W}_{m,j_m}$. They showed that this family of polynomials also forms an interlacing family. Furthermore, they proved that, for any positive semidefinite Hermitian matrices $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$, the operator norm of $\sum_{i=1}^m \mathbf{X}_i$ is upper bounded by the largest root of $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m]$, i.e.

$\big\|\sum_{i=1}^m \mathbf{X}_i\big\| \leq \operatorname{maxroot}\ \mu[\mathbf{X}_1,\ldots,\mathbf{X}_m].$

Hence, the original problem is reduced to estimating the largest root of the expectation of the mixed characteristic polynomials, which can be done with a similar barrier function argument.

To prove Theorem 1.4, we follow the above framework of [Coh16] and [Brä18]. Our main technical contribution is a new formula for the mixed characteristic polynomial (see Theorem 3.8). Based on this new formula and the barrier function method, we obtain an improved estimate on the largest root of mixed characteristic polynomials, stated as follows.

Theorem 1.9.

Assume that $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$ are positive semidefinite Hermitian matrices of rank at most $k$ such that $\sum_{i=1}^m \mathbf{X}_i \preceq \mathbf{I}_d$. Let $\varepsilon := \max_{1\leq i\leq m}\operatorname{tr}(\mathbf{X}_i)$. If $\varepsilon\leq\frac{(k-1)^2}{k}$, then we have

(1.15) $\operatorname{maxroot}\ \mu[\mathbf{X}_1,\ldots,\mathbf{X}_m] \leq \big(\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\big)^2.$

1.5. Organization

This paper is organized as follows. After introducing some useful notation and lemmas in Section 2, we introduce the definition of $(k,m)$-characteristic polynomials and show their connection with mixed characteristic polynomials in Section 3. In Section 4, we use the barrier function method to prove Theorem 1.9. The proof of Theorem 1.4 is presented in Section 5.

2. Preliminaries

2.1. Notations

We first introduce some notation. For a vector $\mathbf{x}\in\mathbb{C}^m$, we let $\|\mathbf{x}\|$ denote its Euclidean 2-norm. For a matrix $\mathbf{B}\in\mathbb{C}^{m\times m}$, we use $\|\mathbf{B}\|=\max_{\|\mathbf{x}\|=1}\|\mathbf{B}\mathbf{x}\|$ to denote its operator norm. We write $\partial_{z_i}$ for the partial derivative $\partial/\partial z_i$. For each $t\in[m]$, let $\mathbf{e}_t\in\mathbb{R}^m$ denote the vector whose $t$-th entry equals $1$ and whose remaining entries equal $0$. For a polynomial $p\in\mathbb{R}[z]$ with real roots, we use $\operatorname{maxroot}\ p$ and $\operatorname{minroot}\ p$ to denote the maximum and minimum root of $p$, respectively.

For a positive integer $m$, we use $[m]$ to denote the set $\{1,2,\ldots,m\}$. For any two positive integers $k$ and $m$, we call a tuple $\mathcal{S}=(S_1,\ldots,S_k)$ of subsets of $[m]$ a $k$-partition of $[m]$ if $S_1,\ldots,S_k$ are disjoint and $S_1\cup\cdots\cup S_k=[m]$. We use the notation $\mathcal{P}_k(m)$ to denote the set of all $k$-partitions of $[m]$.
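Since an ordered $k$-partition assigns each element of $[m]$ to exactly one of the $k$ blocks, $|\mathcal{P}_k(m)|=k^m$. A small enumeration sketch (our own helper name):

```python
from itertools import product

def k_partitions(m, k):
    """All ordered k-partitions (S_1, ..., S_k) of [m] = {1, ..., m}."""
    for labels in product(range(k), repeat=m):
        yield tuple(frozenset(i + 1 for i in range(m) if labels[i] == j)
                    for j in range(k))

parts = list(k_partitions(3, 2))
assert len(parts) == 2 ** 3                      # |P_k(m)| = k^m
for S in parts:
    assert set().union(*S) == {1, 2, 3}          # blocks cover [m]
    assert sum(len(b) for b in S) == 3           # and are disjoint
```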

Given a matrix $\mathbf{A}\in\mathbb{C}^{m\times m}$ and a subset $S\subset[m]$, we use $\mathbf{A}(S)$ to denote the principal submatrix of $\mathbf{A}$ whose rows and columns are indexed by $S$. Let $k\geq 1$ be an integer. Given a matrix $\mathbf{A}\in\mathbb{C}^{km\times km}$ and a tuple $\mathcal{S}=(S_1,\ldots,S_k)$ of subsets of $[m]$, we use $\mathbf{A}(\mathcal{S})$ to denote the principal submatrix $\mathbf{A}(S_1\cup(m+S_2)\cup\cdots\cup((k-1)m+S_k))$. For example, let $m=4$, $k=3$ and let $S_1=\{1,2\}$, $S_2=\{2,3,4\}$, $S_3=\{3\}$. If we set $\mathcal{S}=(S_1,S_2,S_3)$, then for a matrix $\mathbf{A}\in\mathbb{C}^{12\times 12}$ the principal submatrix $\mathbf{A}(\mathcal{S})\in\mathbb{C}^{6\times 6}$ is composed of the shaded part in Figure 1.

Figure 1. The principal submatrix $\mathbf{A}(\mathcal{S})\in\mathbb{C}^{6\times 6}$ for $m=4$, $k=3$, $\mathcal{S}=(\{1,2\},\{2,3,4\},\{3\})$, and $\mathbf{A}\in\mathbb{C}^{12\times 12}$.
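In 0-based array indexing, the index set for $\mathbf{A}(\mathcal{S})$ becomes $\{(j-1)m+i-1 : i\in S_j\}$; the example of Figure 1 can be reproduced with numpy as follows (a sketch, with our own function name):

```python
import numpy as np

def principal_submatrix(A, S_tuple, m):
    """A(S) for S = (S_1, ..., S_k), with 1-based S_j as in the text."""
    idx = [b * m + (i - 1) for b, S in enumerate(S_tuple) for i in sorted(S)]
    return A[np.ix_(idx, idx)]

m, k = 4, 3
A = np.arange(144).reshape(12, 12)      # any 12 x 12 matrix
sub = principal_submatrix(A, ({1, 2}, {2, 3, 4}, {3}), m)
assert sub.shape == (6, 6)
# Rows/columns selected: {1,2} U (4+{2,3,4}) U (8+{3}) -> 0-based [0,1,5,6,7,10]
assert sub[5, 5] == A[10, 10]
```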

2.2. Interlacing families

The method of interlacing families is a powerful tool to show the existence of certain combinatorial objects. Marcus, Spielman and Srivastava employed this tool to prove the existence of bipartite Ramanujan graphs of all sizes and degrees, to solve the Kadison-Singer problem, and to prove a sharper restricted invertibility theorem [MSS15a, MSS15b, MSS17, MSS18].

Here we recall the definition and related results of interlacing families from [MSS15b]. Throughout this paper, we say that a univariate polynomial is real-rooted if all of its coefficients and roots are real.

Definition 2.1.

(see [MSS15b, Definition 3.1]) We say a real-rooted polynomial $g(x)=\alpha_0\prod_{i=1}^{n-1}(x-\alpha_i)$ interlaces a real-rooted polynomial $f(x)=\beta_0\prod_{i=1}^n(x-\beta_i)$ if $\beta_1\leq\alpha_1\leq\beta_2\leq\alpha_2\leq\cdots\leq\alpha_{n-1}\leq\beta_n$. For polynomials $f_1,\ldots,f_k$, if there exists a polynomial $g$ that interlaces $f_i$ for each $i$, then we say that $f_1,\ldots,f_k$ have a common interlacing.

We next introduce the definition of interlacing families.

Definition 2.2.

(see [MSS15b, Definition 3.3]) Assume that $S_1,\ldots,S_m$ are finite sets. For every assignment $(s_1,\ldots,s_m)\in S_1\times\cdots\times S_m$, let $f_{s_1,\ldots,s_m}(x)$ be a real-rooted degree $n$ polynomial with positive leading coefficient. For each $k<m$ and each partial assignment $(s_1,\ldots,s_k)\in S_1\times\cdots\times S_k$, we define

$f_{s_1,\ldots,s_k} := \sum_{s_{k+1}\in S_{k+1},\ldots,s_m\in S_m} f_{s_1,\ldots,s_k,s_{k+1},\ldots,s_m}.$

We also define

$f_\emptyset := \sum_{s_1\in S_1,\ldots,s_m\in S_m} f_{s_1,\ldots,s_m}.$

We say that the polynomials $\{f_{s_1,\ldots,s_m} : (s_1,\ldots,s_m)\in S_1\times\cdots\times S_m\}$ form an interlacing family if for all integers $k=0,\ldots,m-1$ and all partial assignments $(s_1,\ldots,s_k)\in S_1\times\cdots\times S_k$, the polynomials $\{f_{s_1,\ldots,s_k,t}\}_{t\in S_{k+1}}$ have a common interlacing.

We state the main property of interlacing families as the following lemma.

Lemma 2.3.

(see [MSS15b, Theorem 3.4]) Assume that $S_1,\ldots,S_m$ are finite sets and let $\{f_{s_1,\ldots,s_m} : (s_1,\ldots,s_m)\in S_1\times\cdots\times S_m\}$ be an interlacing family. Then there exists some $(s'_1,\ldots,s'_m)\in S_1\times\cdots\times S_m$ such that the largest root of $f_{s'_1,\ldots,s'_m}$ is upper bounded by the largest root of $f_\emptyset$.
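A tiny numeric illustration of Lemma 2.3 with $m=1$ (our own example): $g(x)=x-2$ interlaces both $f_1=(x-1)(x-3)$ and $f_2=(x-2)(x-4)$, so $f_1,f_2$ have a common interlacing, and indeed one of them has its largest root bounded by that of $f_\emptyset=f_1+f_2$:

```python
import numpy as np

f1 = np.array([1, -4, 3])      # (x - 1)(x - 3)
f2 = np.array([1, -6, 8])      # (x - 2)(x - 4)
f_sum = f1 + f2                # 2x^2 - 10x + 11

maxroot = lambda p: max(np.roots(p).real)
# The common interlacer g(x) = x - 2 sits between the roots of each f_i.
assert 1 <= 2 <= 3 and 2 <= 2 <= 4
# Conclusion of Lemma 2.3: some f_i has maxroot <= maxroot(f1 + f2).
assert min(maxroot(f1), maxroot(f2)) <= maxroot(f_sum)
```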

2.3. Real stable polynomials

Throughout our analysis, we exploit the notion of real stable polynomials, which can be viewed as a multivariate generalization of real-rooted polynomials. For more details, see [BB10, Wag11].

We first introduce the definition of real stable polynomials.

Definition 2.4.

A polynomial $p\in\mathbb{R}[z_1,\ldots,z_m]$ is real stable if $p(z_1,\ldots,z_m)\neq 0$ for all $(z_1,\ldots,z_m)\in\mathbb{C}^m$ with $\mathbf{Im}(z_i)>0$ for all $i\in[m]$.

To show that the polynomials we are concerned with in this paper are real stable, we need the following lemma.

Lemma 2.5.

(see [BB08, Proposition 2.4]) For any positive semidefinite Hermitian matrices $\mathbf{A}_1,\ldots,\mathbf{A}_m\in\mathbb{C}^{d\times d}$ and any Hermitian matrix $\mathbf{B}\in\mathbb{C}^{d\times d}$, the polynomial

(2.1) $\det[\mathbf{A}_1 z_1+\cdots+\mathbf{A}_m z_m+\mathbf{B}] \in \mathbb{R}[z_1,\ldots,z_m]$

is either real stable or identically zero.

Real stability is preserved under certain transformations. In our proof, we use the following real stability preservers to reduce a multivariate real stable polynomial to a univariate real-rooted one.

Lemma 2.6.

(see [Wag11, Lemma 2.4]) If $p\in\mathbb{R}[z_1,\ldots,z_m]$ is real stable and $a\in\mathbb{R}$, then the following polynomials are also real stable:

  • $p(a,z_2,\ldots,z_m)\in\mathbb{R}[z_2,\ldots,z_m]$,

  • $p(z_1,\ldots,z_m)|_{z_i=z_j}$, for all $\{i,j\}\subset[m]$,

  • $\partial_{z_i} p(z_1,\ldots,z_m)$, for all $i\in[m]$.

2.4. The mixed characteristic polynomial

The mixed characteristic polynomial plays a key role in the proof of Theorem 1.2. In this subsection we introduce some basic properties of mixed characteristic polynomials which are useful in our proof of Theorem 1.4.

Lemma 2.7.

(see [Brä18, Theorem 5.2] and [Coh16]) Assume that $\mathbf{X}_1,\ldots,\mathbf{X}_m\in\mathbb{C}^{d\times d}$ are positive semidefinite Hermitian matrices. Then we have

(2.2) $\big\|\sum_{i=1}^m \mathbf{X}_i\big\| \leq \operatorname{maxroot}\ \mu[\mathbf{X}_1,\ldots,\mathbf{X}_m],$

where $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m]$ is defined in (1.13).

Lemma 2.8.

(see [Brä18, Theorem 3.5] and [Coh16]) Assume that $\mathbf{W}_1,\ldots,\mathbf{W}_m$ are independent random positive semidefinite Hermitian matrices in $\mathbb{C}^{d\times d}$ with finite support. For each $i\in[m]$, let $W_i := \{\mathbf{W}_{i,1},\ldots,\mathbf{W}_{i,l_i}\}$ be the support of $\mathbf{W}_i$. Then the mixed characteristic polynomials

(2.3) $\mu[\mathbf{W}_{1,j_1},\ldots,\mathbf{W}_{m,j_m}](x), \quad 1\leq j_i\leq l_i,\ i=1,\ldots,m,$

form an interlacing family.

It is also pointed out by Brändén in [Brä18] that the mixed characteristic polynomial $\mu[\mathbf{X}_1,\ldots,\mathbf{X}_m](x)$ is affine linear in each $\mathbf{X}_i$, i.e., for all $\alpha\in\mathbb{R}$ and all $i\in[m]$:

(2.4) $\mu[\mathbf{X}_1,\ldots,(1-\alpha)\mathbf{X}_i+\alpha\mathbf{X}'_i,\ldots,\mathbf{X}_m](x) = (1-\alpha)\cdot\mu[\mathbf{X}_1,\ldots,\mathbf{X}_i,\ldots,\mathbf{X}_m](x) + \alpha\cdot\mu[\mathbf{X}_1,\ldots,\mathbf{X}'_i,\ldots,\mathbf{X}_m](x),$

where $\mathbf{X}_1,\ldots,\mathbf{X}_m,\mathbf{X}'_i\in\mathbb{C}^{d\times d}$. Hence, if $\mathbf{W}_1,\ldots,\mathbf{W}_m$ are independent random matrices with finite support, then we have

(2.5) $\mathbb{E}\ \mu[\mathbf{W}_1,\ldots,\mathbf{W}_m](x) = \mu[\mathbb{E}\mathbf{W}_1,\ldots,\mathbb{E}\mathbf{W}_m](x).$
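The affine linearity (2.4), and hence (2.5), can be verified symbolically from Definition 1.8. The sketch below (our own direct implementation of (1.13), with hypothetical matrices) checks (2.4) on small integer matrices:

```python
import sympy as sp

def mu(Xs, x):
    """Mixed characteristic polynomial (1.13), by direct differentiation."""
    d = Xs[0].shape[0]
    zs = sp.symbols(f'z0:{len(Xs)}')
    p = (x * sp.eye(d) + sum((z * X for z, X in zip(zs, Xs)), sp.zeros(d, d))).det()
    for z in zs:
        p = p - sp.diff(p, z)
    return sp.expand(p.subs({z: 0 for z in zs}))

x, a = sp.symbols('x alpha')
X1 = sp.Matrix([[2, 1], [1, 1]])
X2 = sp.Matrix([[1, 0], [0, 3]])
Y = sp.Matrix([[1, -1], [-1, 2]])     # candidate replacement for X2

lhs = mu([X1, (1 - a) * X2 + a * Y], x)
rhs = (1 - a) * mu([X1, X2], x) + a * mu([X1, Y], x)
assert sp.expand(lhs - rhs) == 0      # affine linearity (2.4)
```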

3. A new formula for mixed characteristic polynomials

For positive integers $k$ and $m$, we first introduce the $(k,m)$-determinant and the $(k,m)$-characteristic polynomial of a matrix $\mathbf{A}\in\mathbb{C}^{km\times km}$. Then we show some connections between $(k,m)$-characteristic polynomials and mixed characteristic polynomials. Recall that we use $\mathcal{P}_k(m)$ to denote the set of all $k$-partitions of $[m]$. Also recall that for each $\mathbf{A}\in\mathbb{C}^{km\times km}$ and each tuple $\mathcal{S}=(S_1,\ldots,S_k)$ of subsets of $[m]$, we use $\mathbf{A}(\mathcal{S})$ to denote the principal submatrix of $\mathbf{A}$ with rows and columns indexed by $S_1\cup(m+S_2)\cup\cdots\cup((k-1)m+S_k)$.

Definition 3.1.

Let $k,m$ be two positive integers. For any matrix $\mathbf{A}\in\mathbb{C}^{km\times km}$, the $(k,m)$-determinant $D_{k,m}[\mathbf{A}]$ of $\mathbf{A}$ is defined as

$D_{k,m}[\mathbf{A}] := \sum_{\mathcal{S}\in\mathcal{P}_k(m)} \det[\mathbf{A}(\mathcal{S})].$

The $(k,m)$-characteristic polynomial $\psi_{k,m}[\mathbf{A}](x)$ of $\mathbf{A}$ is defined as

(3.1) $\psi_{k,m}[\mathbf{A}](x) := \frac{1}{k^m}\cdot D_{k,m}[x\cdot\mathbf{I}_{km}-\mathbf{A}].$
Remark 3.2.

A simple calculation shows that ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) is a monic polynomial of degree mm. For k=1k=1, we have D1,m[𝐀]=det[𝐀]D_{1,m}[\mathbf{A}]=\det[\mathbf{A}], and ψ1,m[𝐀](x)\psi_{1,m}[\mathbf{A}](x) is the regular characteristic polynomial of 𝐀m×m\mathbf{A}\in\mathbb{C}^{m\times m}. If we take m=1m=1, then ψk,1[𝐀](x)\psi_{k,1}[\mathbf{A}](x) is a multiple of the (k1)(k-1)-th order derivative of the characteristic polynomial of 𝐀k×k\mathbf{A}\in\mathbb{C}^{k\times k} (see Proposition 3.3).
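For concreteness, Definition 3.1 can be checked by brute force for small k and m. The following sketch (Python with numpy; the helper names are ours, not from the paper) enumerates the k^m ordered k-partitions of [m], assembles ψ_{k,m}[A](x) from the characteristic polynomials of the principal submatrices A(𝒮), and confirms the observations of Remark 3.2: ψ_{k,m}[A] is monic of degree m, and k = 1 recovers the usual characteristic polynomial.

```python
import itertools
import numpy as np

def partitions(k, m):
    # Ordered k-partitions of [m]: each index is assigned to one of k blocks
    # (blocks may be empty), giving k**m partitions in total.
    for assignment in itertools.product(range(k), repeat=m):
        blocks = [[] for _ in range(k)]
        for i, j in enumerate(assignment):
            blocks[j].append(i)
        yield blocks

def psi(A, k, m):
    # Coefficients (highest degree first) of
    #   psi_{k,m}[A](x) = k^{-m} * sum_S det[x I_m - A(S)],
    # where A(S) uses rows/columns S_1 ∪ (m + S_2) ∪ ... ∪ ((k-1)m + S_k).
    coeffs = np.zeros(m + 1)
    for blocks in partitions(k, m):
        idx = [j * m + i for j, S in enumerate(blocks) for i in S]
        coeffs += np.poly(A[np.ix_(idx, idx)])  # char. poly of A(S)
    return coeffs / k**m

rng = np.random.default_rng(0)
k, m = 2, 3
B = rng.normal(size=(k * m, k * m))
A = (B + B.T) / 2                  # a Hermitian (real symmetric) example

p = psi(A, k, m)                   # monic of degree m
p1 = psi(A[:m, :m], 1, m)          # k = 1: ordinary characteristic polynomial
```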

We next introduce some connections among Dk,m[𝐀],ψk,m[𝐀](x)D_{k,m}[\mathbf{A}],\psi_{k,m}[\mathbf{A}](x) and other generalizations of the determinant and the characteristic polynomial in the literature.

  1. 1.

    (Borcea-Brändén [BB08]) For a kk-tuple (𝐀1,,𝐀k)(\mathbf{A}_{1},\ldots,\mathbf{A}_{k}) of matrices in m×m\mathbb{C}^{m\times m}, the mixed determinant, first introduced by Borcea and Brändén in [BB08], is defined as

    D[𝐀1,,𝐀k]:=(S1,,Sk)𝒫k(m)i=1kdet[𝐀i(Si)].D[\mathbf{A}_{1},\ldots,\mathbf{A}_{k}]:=\sum_{(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\prod_{i=1}^{k}\det[\mathbf{A}_{i}(S_{i})].

    Note that D(x𝐈m,𝐁)=det(x𝐈m𝐁)D(x\cdot\mathbf{I}_{m},-\mathbf{B})=\det(x\cdot\mathbf{I}_{m}-\mathbf{B}) is the regular characteristic polynomial of 𝐁m×m\mathbf{B}\in\mathbb{C}^{m\times m}. In [BB08], Borcea and Brändén used the notion of the mixed determinant to prove a stronger version of Johnson’s Conjecture. We remark here that the (k,m)(k,m)-determinant introduced in Definition 3.1 can be viewed as a generalization of the mixed determinant by the following identity:

    (3.2) Dk,m[𝐀]=D[𝐀1,,𝐀k],D_{k,m}[\mathbf{A}]\,\,=\,\,D[\mathbf{A}_{1},\ldots,\mathbf{A}_{k}],

    where 𝐀=diag(𝐀1,𝐀2,,𝐀k)km×km\mathbf{A}={\rm diag}(\mathbf{A}_{1},\mathbf{A}_{2},\ldots,\mathbf{A}_{k})\in\mathbb{C}^{km\times km} is a block diagonal matrix.

  2. 2.

    (Ravichandran-Leake [RL20]) For a matrix 𝐁m×m\mathbf{B}\in\mathbb{C}^{m\times m}, the kk-characteristic polynomial of the matrix 𝐁\mathbf{B} is defined as (see [RL20, Proposition 2])

    (3.3) χk[𝐁](x):=D[x𝐈m𝐁,,x𝐈m𝐁k].\chi_{k}[\mathbf{B}](x):=D[\underbrace{x\cdot\mathbf{I}_{m}-\mathbf{B},\ldots,x\cdot\mathbf{I}_{m}-\mathbf{B}}_{k}].

    Ravichandran and Leake used the kk-characteristic polynomial to prove Anderson’s paving formulation of the Kadison-Singer problem [RL20]. It is easy to see the relationship between χk\chi_{k} and ψk,m\psi_{k,m}:

    (3.4) χk[𝐁](x)=kmψk,m[diag(𝐁,,𝐁k)](x).\chi_{k}[\mathbf{B}](x)=k^{m}\cdot\psi_{k,m}[{\rm diag}(\underbrace{\mathbf{B},\ldots,\mathbf{B}}_{k})](x).
  3. 3.

    (Ravichandran-Srivastava [RS19]) For a kk-tuple (𝐀1,,𝐀k)(\mathbf{A}_{1},\ldots,\mathbf{A}_{k}) of matrices in m×m\mathbb{C}^{m\times m}, the mixed determinantal polynomial is defined as

    (3.5) χ[𝐀1,,𝐀k](x):=1kmD[x𝐈m𝐀1,,x𝐈m𝐀k].\chi[\mathbf{A}_{1},\ldots,\mathbf{A}_{k}](x):=\frac{1}{k^{m}}\cdot D[x\cdot\mathbf{I}_{m}-\mathbf{A}_{1},\ldots,x\cdot\mathbf{I}_{m}-\mathbf{A}_{k}].

    Ravichandran and Srivastava used the mixed determinantal polynomial to provide a simultaneous paving of a kk-tuple of zero-diagonal Hermitian matrices [RS19]. According to (3.2), we immediately have the following identity:

    (3.6) χ[𝐀1,,𝐀k](x)=ψk,m[diag(𝐀1,,𝐀k)](x).\chi[\mathbf{A}_{1},\ldots,\mathbf{A}_{k}](x)=\psi_{k,m}[{\rm diag}(\mathbf{A}_{1},\ldots,\mathbf{A}_{k})](x).

    Hence, the (k,m)(k,m)-characteristic polynomial can be considered as a generalization of the mixed determinantal polynomial.

We next provide several basic properties of the (k,m)(k,m)-characteristic polynomial. The first one gives an alternative expression for the (k,m)(k,m)-characteristic polynomial.

Proposition 3.3.

Let k,mk,m be two positive integers. Let

𝐙k:=diag(𝐳,,𝐳k)and𝐳:=(z1,,zm).\mathbf{Z}_{k}:={\rm diag}(\underbrace{\mathbf{z},\ldots,\mathbf{z}}_{k})\quad{\rm and}\quad\mathbf{z}:=(z_{1},\ldots,z_{m}).

Then, for any matrix 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km}, we have

(3.7) ψk,m[𝐀](x)=1(k!)mi=1mzik1det[𝐙k𝐀]|z1==zm=x.\psi_{k,m}[\mathbf{A}](x)=\frac{1}{(k!)^{m}}\cdot\prod\limits_{i=1}^{m}\partial_{z_{i}}^{k-1}\cdot\det[\mathbf{Z}_{k}-\mathbf{A}]|_{z_{1}=\cdots=z_{m}=x}.
Remark 3.4.

Proposition 3.3 can be considered as a generalization of [RL20, Proposition 1] and [RS19, Proposition 2.3], which express the kk-characteristic polynomial and the mixed determinantal polynomial, respectively, as differential formulas. In [RL20], Ravichandran and Leake showed that

\chi_{k}[\mathbf{B}](x)=\frac{1}{((k-1)!)^{m}}\cdot\prod_{i=1}^{m}\partial_{z_{i}}^{k-1}\det[\mathbf{Z}-\mathbf{B}]^{k}\,|_{z_{1}=\cdots=z_{m}=x}

for matrix 𝐁m×m\mathbf{B}\in\mathbb{C}^{m\times m}, where 𝐙=diag(z1,,zm)\mathbf{Z}={\rm diag}(z_{1},\ldots,z_{m}). In [RS19], Ravichandran and Srivastava showed that

\chi[\mathbf{A}_{1},\ldots,\mathbf{A}_{k}](x)=\frac{1}{(k!)^{m}}\prod_{i=1}^{m}\partial_{z_{i}}^{k-1}\prod_{j=1}^{k}\det[\mathbf{Z}-\mathbf{A}_{j}]\,|_{z_{1}=\cdots=z_{m}=x}

for matrices 𝐀1,,𝐀km×m\mathbf{A}_{1},\ldots,\mathbf{A}_{k}\in\mathbb{C}^{m\times m}. These two formulas correspond to the case where 𝐀\mathbf{A} is a block diagonal matrix in Proposition 3.3.

Proof of Proposition 3.3.

To prove the conclusion, according to (3.1), it is enough to show that

1kmDk,m[x𝐈km𝐀]=1(k!)mi=1mzik1det[𝐙k𝐀]|z1==zm=x.\frac{1}{k^{m}}\cdot D_{k,m}[x\cdot\mathbf{I}_{km}-\mathbf{A}]=\frac{1}{(k!)^{m}}\cdot\prod\limits_{i=1}^{m}\partial_{z_{i}}^{k-1}\cdot\det[\mathbf{Z}_{k}-\mathbf{A}]|_{z_{1}=\cdots=z_{m}=x}.

A simple calculation shows that the following algebraic identity holds for any polynomial f(x1,,xk)f(x_{1},\ldots,x_{k}):

(3.8) j=1kxjf(x1,,xk)|x1==xk=x=xf(x,,x).\sum\limits_{j=1}^{k}\partial_{x_{j}}f(x_{1},\ldots,x_{k})\ |_{x_{1}=\cdots=x_{k}=x}=\partial_{x}f(x,\ldots,x).

For ρ(x1,,xk):=jSxj\rho(x_{1},\ldots,x_{k}):=\prod_{j\in S}x_{j} with S[k]S\subset[k], we have

(k1)!j=1k(ljxl)ρ|x1==xk=x=xk1ρ(x,,x),(k-1)!\cdot\sum\limits_{j=1}^{k}(\prod_{l\neq j}\partial_{x_{l}})\rho\ |_{x_{1}=\cdots=x_{k}=x}=\partial_{x}^{k-1}\rho(x,\ldots,x),

which implies

(3.9) (k1)!j=1k(ljxl)f|x1==xk=x=xk1f(x,,x),(k-1)!\cdot\sum\limits_{j=1}^{k}(\prod_{l\neq j}\partial_{x_{l}})f\ |_{x_{1}=\cdots=x_{k}=x}=\partial_{x}^{k-1}f(x,\ldots,x),

where f(x1,,xk)f(x_{1},\ldots,x_{k}) is any polynomial of degree at most 11 in each xjx_{j}; this follows by linearity, since such an ff is a linear combination of monomials of the form ρ\rho. Now we define the polynomial

(3.10) pt(z1,,zm):=(i=1tzik1)det[𝐙k𝐀],p_{t}(z_{1},\ldots,z_{m}):=(\prod\limits_{i=1}^{t}\partial_{z_{i}}^{k-1})\det[\mathbf{Z}_{k}-\mathbf{A}],

for each t[m]t\in[m]. We set

g(z1,1,,zm,1,,z1,k,,zm,k):=det[𝐙𝐀],g(z_{1,1},\ldots,z_{m,1},\ldots,z_{1,k},\ldots,z_{m,k}):=\det[\mathbf{Z}-\mathbf{A}],

where

𝐙:=diag(z1,1,,zm,1,,z1,k,,zm,k).\mathbf{Z}:={\rm diag}(z_{1,1},\ldots,z_{m,1},\ldots,z_{1,k},\ldots,z_{m,k}).

For each t[m]t\in[m], we use (3.9) to obtain that

(3.11) p_{t}(z_{1},\ldots,z_{m})=((k-1)!)^{t}\prod_{i=1}^{t}\Big(\sum\limits_{j=1}^{k}\big(\prod_{l\neq j}\partial_{z_{i,l}}\big)\Big)g(z_{1,1},\ldots,z_{m,1},\ldots,z_{1,k},\ldots,z_{m,k})\ |_{z_{s,l}=z_{s},\ \forall s\in[m],\ \forall l\in[k]}.

A simple calculation shows that

(3.12) i=1t(j=1k(ljzi,l))=𝒮=(S1,,Sk)𝒫k(t)j=1kiSjljzi,l.\prod_{i=1}^{t}(\sum\limits_{j=1}^{k}(\prod_{l\neq j}\partial_{z_{i,l}}))=\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(t)}\prod_{j=1}^{k}\prod_{i\in S_{j}}\prod_{l\neq j}\partial_{z_{i,l}}.

Also note that, for each 𝒮=(S1,,Sk)𝒫k(t)\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(t), we have

(3.13) (j=1kiSjljzi,l)g=det[(𝐙𝐀)(𝒯t,𝒮)],(\prod_{j=1}^{k}\prod_{i\in S_{j}}\prod_{l\neq j}\partial_{z_{i,l}})\cdot g=\det[(\mathbf{Z}-\mathbf{A})(\mathcal{T}_{t,\mathcal{S}})],

where 𝒯t,𝒮\mathcal{T}_{t,\mathcal{S}} is defined as

(3.14) 𝒯t,𝒮:=(S1{t+1,t+2,,m},,Sk{t+1,t+2,,m})[m]k.\mathcal{T}_{t,\mathcal{S}}:=(S_{1}\cup\{t+1,t+2,\ldots,m\},\ldots,S_{k}\cup\{t+1,t+2,\ldots,m\})\in[m]^{k}.

Combining (3.11), (3.12) and (3.13), we obtain that, for each t[m]t\in[m],

(3.15) \displaystyle p_{t}(z_{1},\ldots,z_{m}) \displaystyle=((k-1)!)^{t}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(t)}\det[(\mathbf{Z}-\mathbf{A})(\mathcal{T}_{t,\mathcal{S}})]\ |_{z_{i,j}=z_{i},\ \forall i\in[m],\ \forall j\in[k]}
=((k1)!)t𝒮=(S1,,Sk)𝒫k(t)det[(𝐙k𝐀)(𝒯t,𝒮)].\displaystyle=((k-1)!)^{t}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(t)}\det[(\mathbf{Z}_{k}-\mathbf{A})(\mathcal{T}_{t,\mathcal{S}})].

In particular, when t=mt=m, we have

(3.16) pm(z1,,zm)=((k1)!)m𝒮=(S1,,Sk)𝒫k(m)det[(𝐙k𝐀)(𝒮)].\displaystyle p_{m}(z_{1},\ldots,z_{m})=((k-1)!)^{m}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[(\mathbf{Z}_{k}-\mathbf{A})(\mathcal{S})].

Combining (3.10) and (3.16), we obtain that

1(k!)m(i=1mzik1)det[𝐙k𝐀]|z1==zm=x\displaystyle\quad\frac{1}{(k!)^{m}}\cdot(\prod\limits_{i=1}^{m}\partial_{z_{i}}^{k-1})\det[\mathbf{Z}_{k}-\mathbf{A}]\ |_{z_{1}=\cdots=z_{m}=x}
=1km𝒮=(S1,,Sk)𝒫k(m)det[(𝐙k𝐀)(𝒮)]|z1==zm=x\displaystyle=\frac{1}{k^{m}}\cdot\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[(\mathbf{Z}_{k}-\mathbf{A})(\mathcal{S})]\ |_{z_{1}=\cdots=z_{m}=x}
=1km𝒮=(S1,,Sk)𝒫k(m)det[x𝐈m𝐀(𝒮)]\displaystyle=\frac{1}{k^{m}}\cdot\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[x\cdot\mathbf{I}_{m}-\mathbf{A}(\mathcal{S})]
=1kmDk,m[x𝐈km𝐀]=ψk,m[𝐀](x).\displaystyle=\frac{1}{k^{m}}\cdot D_{k,m}[x\cdot\mathbf{I}_{km}-\mathbf{A}]=\psi_{k,m}[\mathbf{A}](x). ∎
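As a sanity check on Proposition 3.3, the identity (3.7) can be verified symbolically for small parameters. The sketch below (Python with sympy; the names are ours, not from the paper) takes k = m = 2, applies the differential operator to det[Z_k − A], and compares the result with the partition sum from Definition 3.1.

```python
import itertools
import sympy as sp

k, m = 2, 2
x, z1, z2 = sp.symbols('x z1 z2')

# A fixed symmetric (hence Hermitian) 4x4 integer matrix.
A = sp.Matrix([[2, 1, 0, 1],
               [1, 3, 1, 0],
               [0, 1, 1, 2],
               [1, 0, 2, 4]])

# Left-hand side of (3.7): apply prod_i d^{k-1}/dz_i^{k-1} to det[Z_k - A],
# set z_1 = z_2 = x, and divide by (k!)^m.
Zk = sp.diag(z1, z2, z1, z2)          # Z_k = diag(z, z) with z = (z1, z2)
p = (Zk - A).det()
lhs = sp.diff(p, z1, k - 1, z2, k - 1).subs({z1: x, z2: x}) / sp.factorial(k)**m

# Right-hand side: psi_{k,m}[A](x) as the partition sum of Definition 3.1.
rhs = sp.Integer(0)
for assignment in itertools.product(range(k), repeat=m):
    idx = [j * m + i for i, j in enumerate(assignment)]  # S_1 ∪ (m + S_2)
    sub = A[idx, idx]
    rhs += (x * sp.eye(m) - sub).det()
rhs = sp.expand(rhs / k**m)

difference = sp.expand(lhs - rhs)     # should be identically zero
```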

The next proposition shows that ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) is real-rooted if 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} is Hermitian.

Proposition 3.5.

Let k,mk,m be two positive integers. For any Hermitian matrix 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km}, the (k,m)(k,m)-characteristic polynomial ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) is real-rooted.

Proof.

Since 𝐀\mathbf{A} is Hermitian, by Lemma 2.5, we see that the polynomial

det[𝐙k𝐀][z1,,zm]\det[\mathbf{Z}_{k}-\mathbf{A}]\in\mathbb{R}[z_{1},\ldots,z_{m}]

is either real stable or identically zero. If it is identically zero, then we are done, so we assume that det[𝐙k𝐀]0\det[\mathbf{Z}_{k}-\mathbf{A}]\not\equiv 0. Lemma 2.6 shows that applying the differential operator i=1mzik1\prod\limits_{i=1}^{m}\partial_{z_{i}}^{k-1} and then setting all variables equal to xx preserves real stability. Then, by Proposition 3.3, ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) is a univariate real stable polynomial and hence real-rooted. ∎

We next present several properties about the roots of the (k,m)(k,m)-characteristic polynomial ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) of a Hermitian matrix 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km}.

Proposition 3.6.

Let k,mk,m be two positive integers. Assume that 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} is a Hermitian matrix. Then we have the following results:

  1. (i)

    The sum of the roots of ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) equals tr(𝐀)k\frac{\operatorname{tr}(\mathbf{A})}{k}.

  2. (ii)

    For any vector 𝐯km\mathbf{v}\in\mathbb{C}^{km} we have

    (3.17) maxrootψk,m[𝐀]maxrootψk,m[𝐀+𝐯𝐯].\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}]\leq\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A+\mathbf{v}\mathbf{v}^{*}}].
  3. (iii)

    If 𝐀\mathbf{A} is positive semidefinite, then we have

    (3.18) 𝐀kmaxrootψk,m[𝐀].\|\mathbf{A}\|\leq k\cdot\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}].
Proof.

(i) We use α\alpha to denote the sum of the roots of ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x). Since ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) is a monic real-rooted polynomial of degree mm, by Vieta’s formulas α\alpha equals the negative of the coefficient of xm1x^{m-1} in ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x). Note that, for each kk-partition 𝒮\mathcal{S} of [m][m], the characteristic polynomial of 𝐀(𝒮)\mathbf{A}(\mathcal{S}) is of the form

xmtr(𝐀(𝒮))xm1+lower order terms.x^{m}-\operatorname{tr}(\mathbf{A}(\mathcal{S}))\cdot x^{m-1}+\ \text{lower order terms.}

According to (3.1), we have

(3.19) α=1km𝒮𝒫k(m)tr(𝐀(𝒮)).\alpha=\frac{1}{k^{m}}\cdot\sum\limits_{\mathcal{S}\in\mathcal{P}_{k}(m)}\operatorname{tr}(\mathbf{A}(\mathcal{S})).

Observing that each diagonal entry of 𝐀\mathbf{A} appears in exactly km1k^{m-1} distinct kk-partitions of [m][m], we can rewrite equation (3.19) as

α=1kmkm1tr(𝐀)=tr(𝐀)k,\alpha=\frac{1}{k^{m}}\cdot k^{m-1}\cdot\operatorname{tr}(\mathbf{A})=\frac{\operatorname{tr}(\mathbf{A})}{k},

which gives the desired result.

(ii) For each tt\in\mathbb{R} we set

pt(x):=ψk,m[𝐀+t𝐯𝐯](x).p_{t}(x):=\psi_{k,m}[\mathbf{A}+t\cdot\mathbf{v}\mathbf{v}^{*}](x).

According to Proposition 3.5, pt(x)p_{t}(x) is real-rooted for each tt\in\mathbb{R}. Define f:f:\mathbb{R}\to\mathbb{R} as

f(t):=maxrootpt(x).f(t):=\operatorname{maxroot}\ p_{t}(x).

Since the maximal root of a real-rooted polynomial is continuous in its coefficients, we obtain that ff is a continuous function. Also note that

(3.20) p0(x)=ψk,m[𝐀](x),p1(x)=ψk,m[𝐀+𝐯𝐯](x).p_{0}(x)=\psi_{k,m}[\mathbf{A}](x),\ p_{1}(x)=\psi_{k,m}[\mathbf{A}+\mathbf{v}\mathbf{v}^{*}](x).

Hence, it is enough to show that f(t)f(t) is monotone increasing in tt, which implies f(0)f(1)f(0)\leq f(1) and hence (3.17).

According to Definition 3.1 we have

pt(x)=1km𝒮𝒫k(m)det[x𝐈m𝐀(𝒮)t𝐯𝒮𝐯𝒮],p_{t}(x)=\frac{1}{k^{m}}\cdot\sum\limits_{\mathcal{S}\in\mathcal{P}_{k}(m)}\det[x\cdot\mathbf{I}_{m}-\mathbf{A}(\mathcal{S})-t\cdot\mathbf{v}_{\mathcal{S}}\mathbf{v}_{\mathcal{S}}^{*}],

where 𝐯𝒮\mathbf{v}_{\mathcal{S}} denotes the subvector of 𝐯\mathbf{v} by extracting the entries of 𝐯\mathbf{v} indexed by S1(m+S2)((k1)m+Sk)S_{1}\cup(m+S_{2})\cup\cdots\cup((k-1)\cdot m+S_{k}). Note that, for each 𝒮𝒫k(m)\mathcal{S}\in\mathcal{P}_{k}(m), det[x𝐈m𝐀(𝒮)t𝐯𝒮𝐯𝒮]\det[x\cdot\mathbf{I}_{m}-\mathbf{A}(\mathcal{S})-t\cdot\mathbf{v}_{\mathcal{S}}\mathbf{v}_{\mathcal{S}}^{*}] is a polynomial in tt of degree at most one, which implies that we can write pt(x)p_{t}(x) in the form of

(3.21) pt(x)=(1t)p0(x)+tp1(x).p_{t}(x)=(1-t)\cdot p_{0}(x)+t\cdot p_{1}(x).

We next prove by contradiction that ff is monotone. Assume that ff is not monotone. Since a continuous function on the real line that is not monotone cannot be injective, there exist t1,s1t_{1},s_{1}\in\mathbb{R} such that t1<s1t_{1}<s_{1} and f(t1)=f(s1)=z1f(t_{1})=f(s_{1})=z_{1}. We use (3.21) to obtain

(1t1)p0(z1)+t1p1(z1)=(1s1)p0(z1)+s1p1(z1)=0,(1-t_{1})\cdot p_{0}(z_{1})+t_{1}\cdot p_{1}(z_{1})=(1-s_{1})\cdot p_{0}(z_{1})+s_{1}\cdot p_{1}(z_{1})=0,

which implies

(3.22) p0(z1)=p1(z1)=0.p_{0}(z_{1})=p_{1}(z_{1})=0.

Then we substitute (3.22) into (3.21) and obtain pt(z1)=0p_{t}(z_{1})=0 for any tt\in\mathbb{R}. By the definition of ff, this means that f(t)z1f(t)\geq z_{1} for any tt\in\mathbb{R}. Let zmax:=maxt1ts1f(t)z_{max}:=\max\limits_{t_{1}\leq t\leq s_{1}}f(t). Since ff is not monotone, we have zmax>z1z_{max}>z_{1}. Set z2=12(z1+zmax)>z1z_{2}=\frac{1}{2}\cdot(z_{1}+z_{max})>z_{1}. By continuity, there exist t2,s2[t1,s1]t_{2},s_{2}\in[t_{1},s_{1}] such that t2<s2t_{2}<s_{2} and f(t2)=f(s2)=z2f(t_{2})=f(s_{2})=z_{2}. Then the preceding argument shows that f(t)z2f(t)\geq z_{2} for any tt\in\mathbb{R}, which contradicts f(t1)=f(s1)=z1<z2f(t_{1})=f(s_{1})=z_{1}<z_{2}. Hence, ff is monotone.

We next show that ff is monotone increasing. Let αt\alpha_{t} denote the sum of the roots of pt(x)p_{t}(x). It follows from (i) that

αt=tr(𝐀)k+𝐯𝐯kt.\alpha_{t}=\frac{\operatorname{tr}(\mathbf{A})}{k}+\frac{\mathbf{v}^{*}\mathbf{v}}{k}\cdot t.

Hence, when t+t\to+\infty, we have αt+\alpha_{t}\to+\infty. Since αtmmaxrootpt(x)\alpha_{t}\leq m\cdot\operatorname{maxroot}\ p_{t}(x), we have

f(t)=maxrootpt(x)+f(t)=\operatorname{maxroot}\ p_{t}(x)\to+\infty

as t+t\to+\infty. Since ff is monotone and unbounded above, it must be monotone increasing in tt, which implies f(1)f(0)f(1)\geq f(0). We arrive at the conclusion.

(iii) By the spectral decomposition, we can write

𝐀=i=1kmλi(𝐀)𝐯i𝐯i,\mathbf{A}=\sum\limits_{i=1}^{km}\lambda_{i}(\mathbf{A})\mathbf{v}_{i}\mathbf{v}_{i}^{*},

where 𝐯ikm\mathbf{v}_{i}\in\mathbb{C}^{km} is the unit-norm eigenvector of 𝐀\mathbf{A} corresponding to the ii-th largest eigenvalue λi(𝐀)\lambda_{i}(\mathbf{A}). Since 𝐀\mathbf{A} is positive semidefinite, we have λi(𝐀)0\lambda_{i}(\mathbf{A})\geq 0 for each i[km]i\in[km]. According to (3.17), we have

(3.23) maxrootψk,m[λ1(𝐀)𝐯1𝐯1]maxrootψk,m[𝐀].\operatorname{maxroot}\ \psi_{k,m}[\lambda_{1}(\mathbf{A})\mathbf{v}_{1}\mathbf{v}_{1}^{*}]\leq\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}].

A simple calculation shows that

ψk,m[λ1(𝐀)𝐯1𝐯1](x)=xm1(x1kλ1(𝐀)𝐯1𝐯1),\psi_{k,m}[\lambda_{1}(\mathbf{A})\mathbf{v}_{1}\mathbf{v}_{1}^{*}](x)=x^{m-1}(x-\frac{1}{k}\cdot\lambda_{1}(\mathbf{A})\mathbf{v}_{1}^{*}\mathbf{v}_{1}),

which implies

(3.24) maxrootψk,m[λ1(𝐀)𝐯1𝐯1]=1kλ1(𝐀).\operatorname{maxroot}\ \psi_{k,m}[\lambda_{1}(\mathbf{A})\mathbf{v}_{1}\mathbf{v}_{1}^{*}]=\frac{1}{k}\cdot\lambda_{1}(\mathbf{A}).

Since 𝐀\mathbf{A} is positive semidefinite, its operator norm is exactly λ1(𝐀)\lambda_{1}(\mathbf{A}). Hence, combining (3.23) with (3.24), we arrive at

𝐀kmaxrootψk,m[𝐀].\|\mathbf{A}\|\leq k\cdot\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}]. ∎
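Claims (i) and (iii) are easy to probe numerically. The sketch below (Python with numpy; the names are ours, not from the paper) builds ψ_{k,m}[A] by brute force over partitions for a random positive semidefinite A and checks the trace identity and the norm bound (3.18).

```python
import itertools
import numpy as np

def psi_coeffs(A, k, m):
    # psi_{k,m}[A](x) = k^{-m} * sum over ordered k-partitions S of [m] of
    # det[x I_m - A(S)], as a coefficient vector (highest degree first).
    coeffs = np.zeros(m + 1)
    for assignment in itertools.product(range(k), repeat=m):
        idx = [j * m + i for i, j in enumerate(assignment)]
        coeffs += np.poly(A[np.ix_(idx, idx)])
    return coeffs / k**m

rng = np.random.default_rng(1)
k, m = 2, 3
B = rng.normal(size=(k * m, k * m))
A = B @ B.T                                # positive semidefinite Hermitian

roots = np.roots(psi_coeffs(A, k, m))      # real by Proposition 3.5
sum_of_roots = roots.sum().real            # claim (i): equals tr(A)/k
max_root = roots.real.max()
norm_A = np.linalg.norm(A, 2)              # claim (iii): ||A|| <= k * max_root
```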

Remark 3.7.

In [RS19, Theorem 1.9], Ravichandran and Srivastava proved that

(3.25) maxrootdet[x𝐈km𝐀]kmaxrootψk,m[𝐀],\operatorname{maxroot}\ \det[x\cdot\mathbf{I}_{km}-\mathbf{A}]\leq k\cdot\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}],

where 𝐀=diag(𝐀1,,𝐀k)km×km\mathbf{A}={\rm diag}(\mathbf{A}_{1},\ldots,\mathbf{A}_{k})\in\mathbb{C}^{km\times km} is a block diagonal matrix with 𝐀im×m\mathbf{A}_{i}\in\mathbb{C}^{m\times m} being zero-diagonal Hermitian. Recall that (3.18) requires that 𝐀\mathbf{A} is positive semidefinite. Motivated by these results, we conjecture that (3.18) holds provided 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} is Hermitian with non-negative diagonal entries.

Inspired by [RS19, Lemma 5.5], we next show a connection between the (k,m)(k,m)-characteristic polynomial and the mixed characteristic polynomial, which is the main result of this section.

Theorem 3.8.

Let 𝐗1,,𝐗md×d\mathbf{X}_{1},\ldots,\mathbf{X}_{m}\in\mathbb{C}^{d\times d} be matrices of rank at most kk. Suppose that {𝐮i,j}i[m],j[k]d\{\mathbf{u}_{i,j}\}_{i\in[m],j\in[k]}\subset\mathbb{C}^{d} and {𝐯i,j}i[m],j[k]d\{\mathbf{v}_{i,j}\}_{i\in[m],j\in[k]}\subset\mathbb{C}^{d} satisfy

𝐗i=j=1k𝐮i,j𝐯i,jfor all i[m].\mathbf{X}_{i}=\sum\limits_{j=1}^{k}\mathbf{u}_{i,j}\mathbf{v}_{i,j}^{*}\quad\text{for all $i\in[m]$.}

We set 𝐔j:=[𝐮1,j,,𝐮m,j]d×m,𝐕j:=[𝐯1,j,,𝐯m,j]d×m\mathbf{U}_{j}:=[\mathbf{u}_{1,j},\ldots,\mathbf{u}_{m,j}]\in\mathbb{C}^{d\times m},\mathbf{V}_{j}:=[\mathbf{v}_{1,j},\ldots,\mathbf{v}_{m,j}]\in\mathbb{C}^{d\times m} for all j[k]j\in[k] and

(3.26) 𝐀:=(𝐕1𝐕k)(𝐔1,,𝐔k)km×km.\mathbf{A}:=\left(\begin{matrix}\mathbf{V}_{1}^{*}\\ \vdots\\ \mathbf{V}_{k}^{*}\end{matrix}\right)(\mathbf{U}_{1},\ldots,\mathbf{U}_{k})\in\mathbb{C}^{km\times km}.

Then, we have

μ[𝐗1,,𝐗m](x)=xdmψk,m[k𝐀](x).\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}](x)=x^{d-m}\cdot\psi_{k,m}[k\cdot\mathbf{A}](x).
Proof.

For each i[m]i\in[m], let 𝐖i\mathbf{W}_{i} be a random rank-one matrix taking values in

\{k\cdot\mathbf{u}_{i,1}\mathbf{v}_{i,1}^{*},\ldots,k\cdot\mathbf{u}_{i,k}\mathbf{v}_{i,k}^{*}\}\subset\mathbb{C}^{d\times d}

with equal probability. Then, we have

𝔼𝐖i=1kj=1kk𝐮i,j𝐯i,j=𝐗i.\mathbb{E}\ \mathbf{W}_{i}=\frac{1}{k}\cdot\sum\limits_{j=1}^{k}k\cdot\mathbf{u}_{i,j}\mathbf{v}_{i,j}^{*}=\mathbf{X}_{i}.

For each subset S[m]S\subset[m] and each j[k]j\in[k], we let 𝐔j,S,𝐕j,Sd×|S|\mathbf{U}_{j,S},\mathbf{V}_{j,S}\in\mathbb{C}^{d\times|S|} denote the submatrix of 𝐔j\mathbf{U}_{j} and 𝐕j\mathbf{V}_{j}, respectively, obtained by extracting the columns indexed by SS. Now a simple calculation shows

μ[𝐗1,,𝐗m](x)\displaystyle\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}](x) =𝔼det[x𝐈di=1m𝐖i]\displaystyle=\mathbb{E}\ \det[x\cdot\mathbf{I}_{d}-\sum\limits_{i=1}^{m}\mathbf{W}_{i}]
=1km𝒮=(S1,,Sk)𝒫k(m)det[x𝐈dkj=1kiSj𝐮i,j𝐯i,j]\displaystyle=\frac{1}{k^{m}}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[x\cdot\mathbf{I}_{d}-k\cdot\sum\limits_{j=1}^{k}\sum\limits_{i\in S_{j}}\mathbf{u}_{i,j}\mathbf{v}_{i,j}^{*}]
=1km𝒮=(S1,,Sk)𝒫k(m)det[x𝐈dkj=1k𝐔j,Sj𝐕j,Sj].\displaystyle=\frac{1}{k^{m}}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[x\cdot\mathbf{I}_{d}-k\cdot\sum\limits_{j=1}^{k}\mathbf{U}_{j,S_{j}}\mathbf{V}_{j,S_{j}}^{*}].

Note that for each 𝒮=(S1,,Sk)𝒫k(m)\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m) we have

det[x𝐈dkj=1k𝐔j,Sj𝐕j,Sj]\displaystyle\det[x\cdot\mathbf{I}_{d}-k\cdot\sum\limits_{j=1}^{k}\mathbf{U}_{j,S_{j}}\mathbf{V}_{j,S_{j}}^{*}] =det[x𝐈dk(𝐔1,S1,,𝐔k,Sk)(𝐕1,S1𝐕k,Sk)]\displaystyle=\det[x\cdot\mathbf{I}_{d}-k\cdot(\mathbf{U}_{1,S_{1}},\ldots,\mathbf{U}_{k,S_{k}})\left(\begin{matrix}\mathbf{V}_{1,S_{1}}^{*}\\ \vdots\\ \mathbf{V}_{k,S_{k}}^{*}\end{matrix}\right)]
=xdmdet[x𝐈mk(𝐕1,S1𝐕k,Sk)(𝐔1,S1,,𝐔k,Sk)]\displaystyle=x^{d-m}\cdot\det[x\cdot\mathbf{I}_{m}-k\cdot\left(\begin{matrix}\mathbf{V}_{1,S_{1}}^{*}\\ \vdots\\ \mathbf{V}_{k,S_{k}}^{*}\end{matrix}\right)(\mathbf{U}_{1,S_{1}},\ldots,\mathbf{U}_{k,S_{k}})]
=xdmdet[x𝐈mk𝐀(𝒮)].\displaystyle=x^{d-m}\cdot\det[x\cdot\mathbf{I}_{m}-k\cdot\mathbf{A}(\mathcal{S})].

Hence, we arrive at

μ[𝐗1,,𝐗m](x)\displaystyle\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}](x) =xdm1km𝒮=(S1,,Sk)𝒫k(m)det[x𝐈mk𝐀(𝒮)]\displaystyle=x^{d-m}\cdot\frac{1}{k^{m}}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(m)}\det[x\cdot\mathbf{I}_{m}-k\cdot\mathbf{A}(\mathcal{S})]
=xdmψk,m[k𝐀](x).\displaystyle=x^{d-m}\cdot\psi_{k,m}[k\cdot\mathbf{A}](x). ∎
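Theorem 3.8 can likewise be verified numerically in the positive semidefinite case v_{i,j} = u_{i,j}. The sketch below (Python with numpy; the names are ours, not from the paper) computes μ[X_1, …, X_m] as the average over the k^m choices of rank-one summands and compares its coefficients with those of x^{d−m}·ψ_{k,m}[kA].

```python
import itertools
import numpy as np

rng = np.random.default_rng(2)
k, m, d = 2, 2, 3
u = rng.normal(size=(m, k, d))             # vectors u_{i,j}; take v_{i,j} = u_{i,j}

# Columns ordered u_{1,1},...,u_{m,1},...,u_{1,k},...,u_{m,k}, so that
# A = U^* U matches the matrix in (3.26) with V_j = U_j.
Umat = np.hstack([u[:, j, :].T for j in range(k)])   # d x km
A = Umat.T @ Umat                                    # km x km Gram matrix

# Left side: mu[X_1,...,X_m](x) = E det[x I_d - sum_i W_i], where W_i is
# uniform over {k u_{i,j} u_{i,j}^* : j in [k]}; np.poly gives the monic
# coefficient vector (highest degree first).
mu = np.zeros(d + 1)
for js in itertools.product(range(k), repeat=m):
    W = sum(k * np.outer(u[i, js[i]], u[i, js[i]]) for i in range(m))
    mu += np.poly(W)
mu /= k**m

# Right side: x^(d-m) * psi_{k,m}[kA](x), with psi as in Definition 3.1.
psi = np.zeros(m + 1)
for assignment in itertools.product(range(k), repeat=m):
    idx = [j * m + i for i, j in enumerate(assignment)]
    psi += np.poly(k * A[np.ix_(idx, idx)])
psi /= k**m
rhs = np.concatenate([psi, np.zeros(d - m)])
```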

Remark 3.9.

Note that rank(𝐀)d\operatorname{rank}(\mathbf{A})\leq d, where 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} is the matrix defined in (3.26). We next assume that 𝐗1,,𝐗md×d\mathbf{X}_{1},\ldots,\mathbf{X}_{m}\in\mathbb{C}^{d\times d} are positive semidefinite. Combining Proposition 3.5 and Theorem 3.8, we obtain that μ[𝐗1,,𝐗m]\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}] is real-rooted, which coincides with the well-known result in [MSS15b, Corollary 4.4]. Note that

ψk,m[k𝐀](x)=kmψk,m[𝐀](xk).\psi_{k,m}[k\cdot\mathbf{A}](x)=k^{m}\cdot\psi_{k,m}[\mathbf{A}](\frac{x}{k}).

It immediately follows from Theorem 3.8 that

(3.27) maxrootμ[𝐗1,,𝐗m]=maxrootψk,m[k𝐀]=kmaxrootψk,m[𝐀].\operatorname{maxroot}\ \mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}]=\operatorname{maxroot}\ \psi_{k,m}[k\cdot\mathbf{A}]=k\cdot\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}].

Also note that

(3.28) i=1m𝐗i=𝐀.\bigg{\|}\sum_{i=1}^{m}\mathbf{X}_{i}\bigg{\|}\,\,=\,\,\|\mathbf{A}\|.

Combining Proposition 3.6 (iii), (3.27) and (3.28), we obtain that

(3.29) i=1m𝐗imaxrootμ[𝐗1,,𝐗m],\bigg{\|}\sum\limits_{i=1}^{m}\mathbf{X}_{i}\bigg{\|}\leq\operatorname{maxroot}\ \mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}],

which recovers the result in Proposition 2.7.

4. Proof of Theorem 1.9

The aim of this section is to prove Theorem 1.9, which gives an upper bound on the largest root of μ[𝐗1,,𝐗m]\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}]. To do that, we first present an upper bound on the largest root of ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x), and then employ the connection between μ[𝐗1,,𝐗m]\mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}] and ψk,m[𝐀](x)\psi_{k,m}[\mathbf{A}](x) (see equation (3.27)) to prove Theorem 1.9.

Theorem 4.1.

Let k2k\geq 2 and m1m\geq 1 be two positive integers. Assume that 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} is a positive semidefinite Hermitian matrix such that 𝟎𝐀𝐈km\mathbf{0}\preceq\mathbf{A}\preceq\mathbf{I}_{km}. Set

ε:=max1imj=1k𝐀(i+(j1)m,i+(j1)m).\varepsilon:=\max\limits_{1\leq i\leq m}\sum\limits_{j=1}^{k}\mathbf{A}(i+(j-1)m,i+(j-1)m).

If ε(k1)2k\varepsilon\leq\frac{(k-1)^{2}}{k}, then we have

(4.1) maxrootψk,m[𝐀]1k(1εk1+ε)2.\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}]\leq\frac{1}{k}\cdot\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.
Remark 4.2.

Theorem 4.1 generalizes Ravichandran and Leake’s result in [RL20, Theorem 9], which gives an upper bound on the largest root of the kk-characteristic polynomial χk[𝐁]=kmψk,m[diag(𝐁,,𝐁k)]\chi_{k}[\mathbf{B}]=k^{m}\psi_{k,m}[{\rm diag}(\underbrace{\mathbf{B},\ldots,\mathbf{B}}_{k})] for a matrix 𝐁m×m\mathbf{B}\in\mathbb{C}^{m\times m}. In particular, they showed that

(4.2) maxrootχk[𝐁]1k(1kαk1+kα)2\operatorname{maxroot}\ \chi_{k}[\mathbf{B}]\leq\frac{1}{k}\cdot\bigg{(}\sqrt{1-\frac{k\alpha}{k-1}}+\sqrt{k\alpha}\bigg{)}^{2}

provided α:=max1im𝐁(i,i)(k1)2k2\alpha:=\max_{1\leq i\leq m}\mathbf{B}(i,i)\leq\frac{(k-1)^{2}}{k^{2}}.
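The bound (4.1) can be probed numerically. The sketch below (Python with numpy; the names are ours, not from the paper) draws a random A = U*U with 0 ⪯ A ⪯ I_{km}, rescales it so that ε ≤ (k−1)²/k, and checks the claimed inequality for the largest root of ψ_{k,m}[A], computed by brute force over partitions.

```python
import itertools
import numpy as np

def max_root_psi(A, k, m):
    # Largest root of psi_{k,m}[A], via the partition sum of Definition 3.1.
    coeffs = np.zeros(m + 1)
    for assignment in itertools.product(range(k), repeat=m):
        idx = [j * m + i for i, j in enumerate(assignment)]
        coeffs += np.poly(A[np.ix_(idx, idx)])
    return np.roots(coeffs / k**m).real.max()

rng = np.random.default_rng(3)
k, m, d = 2, 3, 4
U = rng.normal(size=(d, k * m))
U /= np.linalg.norm(U, 2) + 1e-9           # ||U|| < 1, so 0 <= A <= I_{km}
A = U.T @ U

eps = max(sum(A[i + j * m, i + j * m] for j in range(k)) for i in range(m))
while eps > (k - 1)**2 / k:                # enforce the hypothesis on eps
    A = A / 2
    eps = eps / 2
bound = (np.sqrt(1 - eps / (k - 1)) + np.sqrt(eps))**2 / k
```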

We next give a proof of Theorem 1.9. The proof of Theorem 4.1 is postponed to the end of this section.

Proof of Theorem 1.9.

Recall that 𝐗1,,𝐗md×d\mathbf{X}_{1},\ldots,\mathbf{X}_{m}\in\mathbb{C}^{d\times d} are positive semidefinite Hermitian matrices of rank at most kk such that i=1m𝐗i𝐈d\sum\limits_{i=1}^{m}\mathbf{X}_{i}\preceq\mathbf{I}_{d}. Suppose that {𝐮i,j}i[m],j[k]\{\mathbf{u}_{i,j}\}_{i\in[m],j\in[k]} are the vectors in d\mathbb{C}^{d} such that

𝐗i=j=1k𝐮i,j𝐮i,jfor all i[m].\mathbf{X}_{i}=\sum\limits_{j=1}^{k}\mathbf{u}_{i,j}\mathbf{u}_{i,j}^{*}\quad\text{for all $i\in[m]$.}

We set 𝐔:=[𝐮1,1,,𝐮m,1,,𝐮1,k,,𝐮m,k]d×km\mathbf{U}:=[\mathbf{u}_{1,1},\ldots,\mathbf{u}_{m,1},\ldots,\mathbf{u}_{1,k},\ldots,\mathbf{u}_{m,k}]\in\mathbb{C}^{d\times km} and set 𝐀=𝐔𝐔\mathbf{A}=\mathbf{U}^{*}\mathbf{U}. Note that

𝟎i=1m𝐗i=i=1mj=1k𝐮i,j𝐮i,j=𝐔𝐔𝐈d.\mathbf{0}\preceq\sum\limits_{i=1}^{m}\mathbf{X}_{i}=\sum\limits_{i=1}^{m}\sum\limits_{j=1}^{k}\mathbf{u}_{i,j}\mathbf{u}_{i,j}^{*}=\mathbf{U}\mathbf{U}^{*}\preceq\mathbf{I}_{d}.

Since 𝐔𝐔\mathbf{U}\mathbf{U}^{*} and 𝐔𝐔\mathbf{U}^{*}\mathbf{U} have the same nonzero eigenvalues, we obtain

𝟎𝐀𝐈km.\mathbf{0}\preceq\mathbf{A}\preceq\mathbf{I}_{km}.

Moreover, for each i[m]i\in[m] we have

tr(𝐗i)=tr(j=1k𝐮i,j𝐮i,j)=j=1k𝐀(i+(j1)m,i+(j1)m)ε.\operatorname{tr}(\mathbf{X}_{i})=\operatorname{tr}(\sum\limits_{j=1}^{k}\mathbf{u}_{i,j}\mathbf{u}_{i,j}^{*})=\sum\limits_{j=1}^{k}\mathbf{A}(i+(j-1)m,i+(j-1)m)\leq\varepsilon.

Since ε(k1)2/k\varepsilon\leq(k-1)^{2}/k, Theorem 4.1 gives

maxrootψk,m[𝐀]1k(1εk1+ε)2,\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}]\leq\frac{1}{k}\cdot\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2},

which implies

maxrootμ[𝐗1,,𝐗m](1εk1+ε)2.\operatorname{maxroot}\ \mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}]\leq\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.

Here, we have used (3.27), which follows from Theorem 3.8, i.e.,

maxrootμ[𝐗1,,𝐗m]=kmaxrootψk,m[𝐀].\operatorname{maxroot}\ \mu[\mathbf{X}_{1},\ldots,\mathbf{X}_{m}]=k\cdot\operatorname{maxroot}\ \psi_{k,m}[\mathbf{A}]. ∎

In the remainder of this section, we prove Theorem 4.1 by adapting the multivariate barrier function method. We first introduce the barrier function of a real stable polynomial (see also [MSS15b]).

Definition 4.3.

Let p(z1,,zm)[z1,,zm]p(z_{1},\ldots,z_{m})\in\mathbb{R}[z_{1},\ldots,z_{m}] be a multivariate polynomial. We say that a point 𝐳m\mathbf{z}\in\mathbb{R}^{m} is above the roots of p(z1,,zm)p(z_{1},\ldots,z_{m}) if p(𝐳+𝐭)0p(\mathbf{z}+\mathbf{t})\neq 0 for all 𝐭0m\mathbf{t}\in\mathbb{R}_{\geq 0}^{m}. We use 𝐀𝐛p\mathbf{Ab}_{p} to denote the set of points that are above the roots of pp.

Definition 4.4.

Let p[z1,,zm]p\in\mathbb{R}[z_{1},\ldots,z_{m}] be a real stable polynomial. The barrier function of pp at a point 𝐳𝐀𝐛p\mathbf{z}\in\mathbf{Ab}_{p} in the direction ii is defined as

(4.3) Φpi(𝐳):=zipp(𝐳).\Phi_{p}^{i}(\mathbf{z}):=\frac{\partial_{z_{i}}p}{p}(\mathbf{z}).

Here we briefly outline our proof of Theorem 4.1. According to Proposition 3.3, we have

ψk,m[𝐀](x)=1(k!)mi=1mzik1det[𝐙k𝐀]|z1==zm=x,\psi_{k,m}[\mathbf{A}](x)=\frac{1}{(k!)^{m}}\cdot\prod\limits_{i=1}^{m}\partial_{z_{i}}^{k-1}\cdot\det[\mathbf{Z}_{k}-\mathbf{A}]|_{z_{1}=\cdots=z_{m}=x},

where 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km}. For Hermitian matrix 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km}, we consider the real stable polynomial

(4.4) p0(z1,,zm):=det[𝐙k𝐀],p_{0}(z_{1},\cdots,z_{m}):=\det[\mathbf{Z}_{k}-\mathbf{A}],

where

(4.5) 𝐙k=diag(𝐳,,𝐳k),𝐳=(z1,,zm).\mathbf{Z}_{k}={\rm diag}(\underbrace{\mathbf{z},\ldots,\mathbf{z}}_{k}),\quad\mathbf{z}=(z_{1},\ldots,z_{m}).

We iteratively define the polynomials

(4.6) pt(z1,,zm):=ztk1pt1(z1,,zm),t=1,,m.p_{t}(z_{1},\ldots,z_{m}):=\partial_{z_{t}}^{k-1}p_{t-1}(z_{1},\ldots,z_{m}),\quad t=1,\ldots,m.

We start with an initial point 𝐛0:=a𝟏m\mathbf{b}_{0}:=a\mathbf{1}\in\mathbb{R}^{m} with a>1a>1 that is above the roots of p0p_{0}. For each t[m]t\in[m], we aim to find a positive number δt\delta_{t} such that 𝐛m=𝐛0t=1mδt𝐞t𝐀𝐛pm\mathbf{b}_{m}=\mathbf{b}_{0}-\sum_{t=1}^{m}\delta_{t}\mathbf{e}_{t}\in\mathbf{Ab}_{p_{m}} and each entry of 𝐛m\mathbf{b}_{m} is less than or equal to 1k(1εk1+ε)2\frac{1}{k}(\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon})^{2}, which implies the result in Theorem 4.1.

We need the following result, which describes how the barrier function of a polynomial behaves under taking derivatives.

Proposition 4.5.

[RL20, Proposition 9, Proposition 10] Let j[m]j\in[m] be any integer. Assume that p(z1,,zm)p(z_{1},\ldots,z_{m}) is a real stable polynomial of degree kk in zjz_{j}. Let 𝐚=(a1,,am)𝐀𝐛p\mathbf{a}=(a_{1},\ldots,a_{m})\in\mathbf{Ab}_{p} and let λk\lambda_{k} be the smallest root of the univariate polynomial q(z):=p(a1,,aj1,z,aj+1,,am)q(z):=p(a_{1},\cdots,a_{j-1},z,a_{j+1},\cdots,a_{m}). Let

(4.7) δ=(k1)2k(1Φpj(𝐚)1ajλk).\delta=\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\Phi_{p}^{j}(\mathbf{a})-\frac{1}{a_{j}-\lambda_{k}}}\bigg{)}.

Then, we have 𝐚δ𝐞j𝐀𝐛zjk1p\mathbf{a}-\delta\cdot\mathbf{e}_{j}\in\mathbf{Ab}_{\partial_{z_{j}}^{k-1}p} and

Φzjk1pi(𝐚δ𝐞j)Φpi(𝐚).\Phi_{\partial_{z_{j}}^{k-1}p}^{i}(\mathbf{a}-\delta\cdot\mathbf{e}_{j})\leq\Phi_{p}^{i}(\mathbf{a}).

We next recall a well-known result on the zeros of real stable polynomials (see also [RL20, Proposition 8]).

Lemma 4.6.

[Tao13, Lemma 17] Let p(z1,z2)p(z_{1},z_{2}) be a real stable polynomial of degree kk in z2z_{2}. For each (a,b)𝐀𝐛p(a,b)\in\mathbf{Ab}_{p}, denote the roots of the univariate polynomial qa(z):=p(a,z)q_{a}(z):=p(a,z) by λk(a)λ1(a)\lambda_{k}(a)\leq\cdots\leq\lambda_{1}(a). Then for each j[k]j\in[k], the map aλj(a)a\mapsto\lambda_{j}(a) defined on {a:(a,b)𝐀𝐛pfor someb}\{a\in\mathbb{R}:(a,b)\in\mathbf{Ab}_{p}\,\,\text{for some}\,\,b\in\mathbb{R}\} is non-increasing.

The following lemma is useful for our analysis.

Lemma 4.7.

Let 𝐀m×m\mathbf{A}\in\mathbb{C}^{m\times m} be a positive semidefinite Hermitian matrix. Set

(4.8) p(z1,,zm):=det[diag(z1,,zm)𝐀].p(z_{1},\ldots,z_{m}):=\det[\operatorname{diag}(z_{1},\ldots,z_{m})-\mathbf{A}].

Assume that 𝐚=(a1,,am)𝐀𝐛p\mathbf{a}=(a_{1},\ldots,a_{m})\in\mathbf{Ab}_{p}. Then, for each t[m]t\in[m], the smallest root of the univariate polynomial q(z):=p(z,,zt,at+1,,am)q(z):=p(\underbrace{z,\ldots,z}_{t},a_{t+1},\ldots,a_{m}) is non-negative.

Proof.

We can write 𝐀\mathbf{A} in the form of (𝐏𝐐𝐐𝐒)\begin{pmatrix}\mathbf{P}&\mathbf{Q}\\ \mathbf{Q}^{*}&\mathbf{S}\end{pmatrix}, where 𝐏t×t,𝐒(mt)×(mt)\mathbf{P}\in\mathbb{C}^{t\times t},\mathbf{S}\in\mathbb{C}^{(m-t)\times(m-t)} are both positive semidefinite Hermitian and 𝐐t×(mt)\mathbf{Q}\in\mathbb{C}^{t\times(m-t)}. Since 𝐚=(a1,,am)𝐀𝐛p\mathbf{a}=(a_{1},\ldots,a_{m})\in\mathbf{Ab}_{p}, we obtain that diag(a1,,am)𝐀{\rm diag}(a_{1},\ldots,a_{m})-\mathbf{A} is positive definite. Set 𝐃t:=diag(at+1,,am)\mathbf{D}_{t}:={\rm diag}(a_{t+1},\ldots,a_{m}). Noting that 𝐃t𝐒(mt)×(mt)\mathbf{D}_{t}-\mathbf{S}\in\mathbb{C}^{(m-t)\times(m-t)} is a principal submatrix of diag(a1,,am)𝐀{\rm diag}(a_{1},\ldots,a_{m})-\mathbf{A}, we obtain that 𝐃t𝐒\mathbf{D}_{t}-\mathbf{S} is positive definite. We use the Schur complement to obtain that

q(z)\displaystyle q(z) =det(z𝐈t𝐏𝐐𝐐𝐃t𝐒)\displaystyle=\det\begin{pmatrix}z\cdot\mathbf{I}_{t}-\mathbf{P}&-\mathbf{Q}\\ -\mathbf{Q}^{*}&\mathbf{D}_{t}-\mathbf{S}\end{pmatrix}
=det(𝐃t𝐒)det(z𝐈t𝐏𝐐(𝐃t𝐒)1𝐐).\displaystyle=\det(\mathbf{D}_{t}-\mathbf{S})\det\left(z\cdot\mathbf{I}_{t}-\mathbf{P}-\mathbf{Q}(\mathbf{D}_{t}-\mathbf{S})^{-1}\mathbf{Q}^{*}\right).

Hence, the roots of q(z)q(z) are the eigenvalues of the positive semidefinite matrix 𝐏+𝐐(𝐃t𝐒)1𝐐t×t\mathbf{P}+\mathbf{Q}(\mathbf{D}_{t}-\mathbf{S})^{-1}\mathbf{Q}^{*}\in\mathbb{C}^{t\times t}, which implies that all roots of q(z)q(z) are non-negative. ∎
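The Schur-complement step in this proof is easy to check numerically: the roots of q(z) coincide with the eigenvalues of \mathbf{P}+\mathbf{Q}(\mathbf{D}_t-\mathbf{S})^{-1}\mathbf{Q}^*, hence are non-negative. A minimal sketch with a random positive semidefinite \mathbf{A} and sample sizes m=5, t=2 (our choices):

```python
import numpy as np

rng = np.random.default_rng(1)
m, t = 5, 2

# random positive semidefinite A, and a point a = (a_1,...,a_m) above the
# roots of p, i.e. diag(a) - A is positive definite
B = rng.standard_normal((m, m))
A = B @ B.T
a = np.full(m, np.linalg.eigvalsh(A)[-1] + 1.0)

# Schur complement: the roots of q(z) = det(diag(z,...,z,a_{t+1},...,a_m) - A)
# are the eigenvalues of P + Q (D_t - S)^{-1} Q^*
P, Q, S = A[:t, :t], A[:t, t:], A[t:, t:]
Dt = np.diag(a[t:])
M = P + Q @ np.linalg.solve(Dt - S, Q.T)

roots = np.linalg.eigvalsh(M)
assert roots.min() >= -1e-9        # smallest root of q(z) is non-negative
```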

We next extend Lemma 4.7 to the polynomials that we will use in our proof of Theorem 4.1.

Proposition 4.8.

Let k,mk,m be two positive integers. Let 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} be a positive semidefinite Hermitian matrix. Let p0(z1,,zm)p_{0}(z_{1},\ldots,z_{m}) be defined as in (4.4). Assume that 𝐚=(a1,,am)𝐀𝐛p0\mathbf{a}=(a_{1},\ldots,a_{m})\in\mathbf{Ab}_{p_{0}}. For each t[m]t\in[m], let pt(z1,,zm)p_{t}(z_{1},\ldots,z_{m}) be defined as in (4.6). Set

qt(z):=pt1(a1,,at1,z,at+1,,am).q_{t}(z):=p_{t-1}(a_{1},\ldots,a_{t-1},z,a_{t+1},\ldots,a_{m}).

Then, for each t[m]t\in[m], the smallest root of qt(z)q_{t}(z) is non-negative.

Proof.

By equation (3.15), we can write the univariate polynomial qt(z)q_{t}(z) in the form of

qt(z)=((k1)!)(t1)𝒮=(S1,,Sk)𝒫k(t1)g𝒮(a1,,at1,z,at+1,,am),q_{t}(z)=((k-1)!)^{(t-1)}\sum\limits_{\mathcal{S}=(S_{1},\ldots,S_{k})\in\mathcal{P}_{k}(t-1)}g_{\mathcal{S}}(a_{1},\ldots,a_{t-1},z,a_{t+1},\ldots,a_{m}),

where

g𝒮(z1,,zm):=det[(𝐙k𝐀)(𝒯t1,𝒮)],g_{\mathcal{S}}(z_{1},\ldots,z_{m}):=\det[(\mathbf{Z}_{k}-\mathbf{A})(\mathcal{T}_{t-1,\mathcal{S}})],

and 𝒯t1,𝒮\mathcal{T}_{t-1,\mathcal{S}} is defined in (3.14). Since 𝐚𝐀𝐛p0\mathbf{a}\in\mathbf{Ab}_{p_{0}}, we have

diag(𝐚,,𝐚)𝐀𝟎,\operatorname{diag}(\mathbf{a},\ldots,\mathbf{a})-\mathbf{A}\succ\mathbf{0},

which implies that the principal submatrix satisfies

(diag(𝐚,,𝐚)𝐀)(𝒯t1,𝒮)𝟎.(\operatorname{diag}(\mathbf{a},\ldots,\mathbf{a})-\mathbf{A})(\mathcal{T}_{t-1,\mathcal{S}})\succ\mathbf{0}.

Then we obtain that 𝐚𝐀𝐛g𝒮\mathbf{a}\in\mathbf{Ab}_{g_{\mathcal{S}}}. Lemma 4.7 shows that the smallest root of the polynomial

g𝒮(a1,,at1,z,at+1,,am)[z]g_{\mathcal{S}}(a_{1},\ldots,a_{t-1},z,a_{t+1},\ldots,a_{m})\in\mathbb{R}[z]

is non-negative. Since qt(z)q_{t}(z) is a positive multiple of a sum of monic polynomials of the same degree, each real-rooted with smallest root non-negative, the summands share a common sign on the negative axis and cannot cancel there; hence the smallest root of qt(z)q_{t}(z) is non-negative. ∎
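The final step of this proof rests on an elementary fact: monic real polynomials of the same degree with no negative roots all have the same sign on the negative half-line, so their sum has no negative real root either. A quick numerical check with random sample polynomials (our own illustration):

```python
import numpy as np
from numpy.polynomial import polynomial as P

rng = np.random.default_rng(2)
deg, terms = 5, 4

# random monic degree-5 polynomials, each with all roots in [0, 3]
polys = [P.polyfromroots(rng.uniform(0.0, 3.0, deg)) for _ in range(terms)]
total = np.sum(polys, axis=0)      # coefficient-wise sum, still degree 5

# on (-inf, 0) every summand has sign (-1)^deg, so no cancellation:
# the sum has no negative real root
real_roots = [r.real for r in P.polyroots(total) if abs(r.imag) < 1e-6]
assert real_roots and min(real_roots) >= -1e-9
```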

We need the following lemma to provide an estimate on the value of δt\delta_{t}.

Lemma 4.9.

Let k,mk,m be two positive integers. Let 𝐀km×km\mathbf{A}\in\mathbb{C}^{km\times km} be a positive semidefinite Hermitian matrix satisfying 𝟎𝐀𝐈km\mathbf{0}\preceq\mathbf{A}\preceq\mathbf{I}_{km}. Set

ε:=max1imj=1k𝐀(i+(j1)m,i+(j1)m).\varepsilon:=\max_{1\leq i\leq m}\sum_{j=1}^{k}\mathbf{A}(i+(j-1)m,i+(j-1)m).

Let p0(z1,,zm)p_{0}(z_{1},\ldots,z_{m}) be defined as in (4.4). Then for any a>1a>1 we have

Φp0i(a𝟏)εa1+kεafor all i[m].\Phi_{p_{0}}^{i}(a\mathbf{1})\leq\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}\quad\text{for all $i\in[m]$}.
Proof.

Define the polynomial

g(z1,,zkm):=det[𝐙𝐀],g(z_{1},\ldots,z_{km}):=\det[\mathbf{Z}-\mathbf{A}],

where

𝐙:=diag(z1,,zkm).\mathbf{Z}:={\rm diag}(z_{1},\ldots,z_{km}).

By equation (3.8), for each i[m]i\in[m], we have

(4.9) Φp0i(a𝟏)=zip0p0(a𝟏)=j=1kzi+(j1)mgg|𝐙=a𝐈km.\Phi_{p_{0}}^{i}(a\mathbf{1})=\frac{\partial_{z_{i}}p_{0}}{p_{0}}(a\mathbf{1})=\sum_{j=1}^{k}\frac{\partial_{z_{i+(j-1)m}}g}{g}\bigg{|}_{\mathbf{Z}=a\mathbf{I}_{km}}.

For each i[m],j[k]i\in[m],j\in[k], we have

zi+(j1)mdet[𝐙𝐀]=det[(𝐙𝐀)i+(j1)m],\partial_{z_{i+(j-1)m}}\det[\mathbf{Z}-\mathbf{A}]=\det[(\mathbf{Z}-\mathbf{A})_{i+(j-1)m}],

where (𝐙𝐀)i+(j1)m(\mathbf{Z}-\mathbf{A})_{i+(j-1)m} denotes the principal submatrix of 𝐙𝐀\mathbf{Z}-\mathbf{A} obtained by deleting the (i+(j1)m)(i+(j-1)m)-th row and the (i+(j1)m)(i+(j-1)m)-th column. By the adjugate formula for the matrix inverse, we obtain that

(4.10) (𝐙𝐀)1=1det[𝐙𝐀]adj(𝐙𝐀).(\mathbf{Z}-\mathbf{A})^{-1}=\frac{1}{\det[\mathbf{Z}-\mathbf{A}]}\cdot\operatorname{adj}(\mathbf{Z}-\mathbf{A}).

Taking the (i+(j1)m)(i+(j-1)m)-th diagonal element of each side of (4.10), we have

𝐞i+(j1)m(𝐙𝐀)1𝐞i+(j1)m=1det[𝐙𝐀]det[(𝐙𝐀)i+(j1)m].\mathbf{e}_{i+(j-1)m}^{*}(\mathbf{Z}-\mathbf{A})^{-1}\mathbf{e}_{i+(j-1)m}=\frac{1}{\det[\mathbf{Z}-\mathbf{A}]}\cdot\det[(\mathbf{Z}-\mathbf{A})_{i+(j-1)m}].

Then for each i[m],j[k]i\in[m],j\in[k], we have

(4.11) zi+(j1)mgg=zi+(j1)mdet[𝐙𝐀]det[𝐙𝐀]=𝐞i+(j1)m(𝐙𝐀)1𝐞i+(j1)m.\frac{\partial_{z_{i+(j-1)m}}g}{g}=\frac{\partial_{z_{i+(j-1)m}}\det[\mathbf{Z}-\mathbf{A}]}{\det[\mathbf{Z}-\mathbf{A}]}=\mathbf{e}_{i+(j-1)m}^{*}(\mathbf{Z}-\mathbf{A})^{-1}\mathbf{e}_{i+(j-1)m}.

Combining (4.9) and (4.11), we obtain that

Φp0i(a𝟏)=j=1k𝐞i+(j1)m(a𝐈km𝐀)1𝐞i+(j1)m.\Phi_{p_{0}}^{i}(a\mathbf{1})=\sum\limits_{j=1}^{k}\mathbf{e}_{i+(j-1)m}^{*}(a\mathbf{I}_{km}-\mathbf{A})^{-1}\mathbf{e}_{i+(j-1)m}.

Suppose that the spectral decomposition of 𝐀\mathbf{A} is

(4.12) 𝐀=𝐔𝐃𝐔,\mathbf{A}=\mathbf{U}\mathbf{D}\mathbf{U}^{*},

where 𝐔km×km\mathbf{U}\in\mathbb{C}^{km\times km} is a unitary matrix and 𝐃=diag(λ1,,λkm)\mathbf{D}={\rm diag}(\lambda_{1},\ldots,\lambda_{km}) is a diagonal matrix of eigenvalues of 𝐀\mathbf{A}. It follows that (a𝐈km𝐀)1=𝐔(a𝐈km𝐃)1𝐔(a\mathbf{I}_{km}-\mathbf{A})^{-1}=\mathbf{U}(a\mathbf{I}_{km}-\mathbf{D})^{-1}\mathbf{U}^{*}. Thus we have

(4.13) Φp0i(a𝟏)\displaystyle\Phi_{p_{0}}^{i}(a\mathbf{1}) =j=1k𝐞i+(j1)m𝐔(a𝐈km𝐃)1𝐔𝐞i+(j1)m\displaystyle=\sum\limits_{j=1}^{k}\mathbf{e}_{i+(j-1)m}^{*}\mathbf{U}(a\mathbf{I}_{km}-\mathbf{D})^{-1}\mathbf{U}^{*}\mathbf{e}_{i+(j-1)m}
=j=1kl=1km|𝐔(i+(j1)m,l)|2aλl.\displaystyle=\sum\limits_{j=1}^{k}\sum\limits_{l=1}^{km}\frac{|\mathbf{U}(i+(j-1)m,l)|^{2}}{a-\lambda_{l}}.

Set cl:=j=1k|𝐔(i+(j1)m,l)|2c_{l}:=\sum\limits_{j=1}^{k}|\mathbf{U}(i+(j-1)m,l)|^{2} for each l[km]l\in[km]. Then we have

(4.14) Φp0i(a𝟏)=l=1kmclaλl\displaystyle\Phi_{p_{0}}^{i}(a\mathbf{1})=\sum\limits_{l=1}^{km}\frac{c_{l}}{a-\lambda_{l}} l=1km(clλla1+cl(1λl)a)\displaystyle\leq\sum\limits_{l=1}^{km}\bigg{(}\frac{c_{l}\cdot\lambda_{l}}{a-1}+\frac{c_{l}\cdot(1-\lambda_{l})}{a}\bigg{)}
=(1a11a)l=1kmλlcl+1al=1kmcl.\displaystyle=\bigg{(}\frac{1}{a-1}-\frac{1}{a}\bigg{)}\cdot\sum\limits_{l=1}^{km}\lambda_{l}c_{l}+\frac{1}{a}\cdot\sum\limits_{l=1}^{km}c_{l}.

Here, we use the following inequality:

1aλlλla1+1λla,\frac{1}{a-\lambda_{l}}\leq\frac{\lambda_{l}}{a-1}+\frac{1-\lambda_{l}}{a},

where a>1a>1 and λl[0,1]\lambda_{l}\in[0,1]. Note that

(4.15) l=1kmcl=j=1kl=1km|𝐔(i+(j1)m,l)|2=j=1k1=k,\sum_{l=1}^{km}c_{l}=\sum_{j=1}^{k}\sum_{l=1}^{km}|\mathbf{U}(i+(j-1)m,l)|^{2}=\sum_{j=1}^{k}1=k,

since each row of 𝐔\mathbf{U} is a unit vector. We use (4.12) to obtain that

𝐀(i+(j1)m,i+(j1)m)=l=1kmλl|𝐔(i+(j1)m,l)|2.\mathbf{A}(i+(j-1)m,i+(j-1)m)=\sum_{l=1}^{km}\lambda_{l}|\mathbf{U}(i+(j-1)m,l)|^{2}.

Then we have

(4.16) l=1kmλlcl\displaystyle\sum_{l=1}^{km}\lambda_{l}c_{l} =j=1kl=1kmλl|𝐔(i+(j1)m,l)|2\displaystyle=\sum_{j=1}^{k}\sum_{l=1}^{km}\lambda_{l}|\mathbf{U}(i+(j-1)m,l)|^{2}
=j=1k𝐀(i+(j1)m,i+(j1)m)\displaystyle=\sum_{j=1}^{k}\mathbf{A}(i+(j-1)m,i+(j-1)m)
ε.\displaystyle\leq\varepsilon.

Thus, combining (4.14), (4.15) and (4.16), we obtain that

Φp0i(a𝟏)ε(1a11a)+ka=εa1+kεa.\Phi_{p_{0}}^{i}(a\mathbf{1})\leq\varepsilon\cdot\bigg{(}\frac{1}{a-1}-\frac{1}{a}\bigg{)}+\frac{k}{a}=\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}. ∎
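Lemma 4.9 lends itself to a direct numerical check, since by (4.9) and (4.11) the barrier Φp0i(a𝟏)\Phi_{p_{0}}^{i}(a\mathbf{1}) is a sum of k diagonal entries of (a𝐈km𝐀)1(a\mathbf{I}_{km}-\mathbf{A})^{-1}. A sketch with a random matrix and sample values k=3, m=4, a=1.7 (our choices, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(3)
k, m = 3, 4
n = k * m

# random Hermitian A with 0 <= A <= I (rescale so eigenvalues lie in [0, 1))
B = rng.standard_normal((n, n))
A = B @ B.T
A = A / (np.linalg.eigvalsh(A)[-1] + 0.1)

# eps = max_i sum_j A(i + (j-1)m, i + (j-1)m), in 0-based indexing
eps = max(sum(A[i + j * m, i + j * m] for j in range(k)) for i in range(m))

a = 1.7
R = np.linalg.inv(a * np.eye(n) - A)
phis = [sum(R[i + j * m, i + j * m] for j in range(k)) for i in range(m)]

# the bound of Lemma 4.9: Phi_i <= eps/(a-1) + (k-eps)/a for every i
assert max(phis) <= eps / (a - 1) + (k - eps) / a + 1e-9
```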

Now we have all the materials to prove Theorem 4.1.

Proof of Theorem 4.1.

Let a>1a>1 be a constant to be chosen later. Set 𝐛0=a𝟏m\mathbf{b}_{0}=a\mathbf{1}\in{\mathbb{R}}^{m}. Let p0(z1,,zm)p_{0}(z_{1},\ldots,z_{m}) be defined as in (4.4). Since 𝐀𝐈km\mathbf{A}\preceq\mathbf{I}_{km} and a>1a>1, the point 𝐛0\mathbf{b}_{0} lies above the roots of p0p_{0}. Recall that

pt=ztk1pt1,t=1,,m.p_{t}=\partial_{z_{t}}^{k-1}p_{t-1},\quad t=1,\ldots,m.

A simple observation is that

ψk,m[𝐀](x)=1(k!)mpm(z1,,zm)|z1==zm=x.\psi_{k,m}[\mathbf{A}](x)=\frac{1}{(k!)^{m}}\cdot p_{m}(z_{1},\ldots,z_{m})|_{z_{1}=\cdots=z_{m}=x}.

Hence, to prove the conclusion, it is enough to show that 1k(1εk1+ε)2𝟏\frac{1}{k}\big{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\big{)}^{2}\cdot\mathbf{1} is above the roots of pmp_{m}. We define δ1,,δm\delta_{1},\ldots,\delta_{m} and 𝐛1,,𝐛m\mathbf{b}_{1},\ldots,\mathbf{b}_{m} inductively. For each t[m]t\in[m], let λk(t)\lambda_{k}^{(t)} be the smallest root of the univariate polynomial

(4.17) qt(z):=pt1(aδ1,,aδt1,z,a,,a),q_{t}(z):=p_{t-1}(a-\delta_{1},\ldots,a-\delta_{t-1},z,a,\ldots,a),

and set

δt:=(k1)2k(1Φpt1t(𝐛t1)1aλk(t)),\delta_{t}:=\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\Phi_{p_{t-1}}^{t}(\mathbf{b}_{t-1})-\frac{1}{a-\lambda_{k}^{(t)}}}\bigg{)},

where

𝐛t:=𝐛0j=1tδj𝐞j=(aδ1,,aδt,a,,a).\mathbf{b}_{t}:=\mathbf{b}_{0}-\sum_{j=1}^{t}\delta_{j}{\mathbf{e}}_{j}=(a-\delta_{1},\ldots,a-\delta_{t},a,\ldots,a).

By Proposition 4.5, we have 𝐛t𝐀𝐛pt\mathbf{b}_{t}\in\mathbf{Ab}_{p_{t}} and

(4.18) Φpti(𝐛t)Φpt1i(𝐛t1)for all t[m] and i[m].\Phi_{p_{t}}^{i}(\mathbf{b}_{t})\leq\Phi_{p_{t-1}}^{i}(\mathbf{b}_{t-1})\quad\text{for all $t\in[m]$ and $i\in[m]$}.

We claim that

(4.19) maxt[m]infa>1(aδt)1k(1εk1+ε)2.\max_{t\in[m]}\inf_{a>1}(a-\delta_{t})\leq\frac{1}{k}\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.

Noting that 𝐛m=a𝟏t=1mδt𝐞t\mathbf{b}_{m}=a\mathbf{1}-\sum\limits_{t=1}^{m}\delta_{t}\cdot\mathbf{e}_{t} is above the roots of pmp_{m}, we can use (4.19) to obtain the conclusion.

It remains to prove (4.19). We first show that λk(t)0\lambda_{k}^{(t)}\geq 0 for each t[m]t\in[m]. For each t[m]t\in[m] we define the univariate polynomial

(4.20) gt(z):=pt1(a,,at1,z,a,,a).g_{t}(z):=p_{t-1}\left(\underbrace{a,\ldots,a}_{t-1},z,a,\ldots,a\right).

Since 𝐚\mathbf{a} and 𝐛t1=𝐚i=1t1δi𝐞i𝐀𝐛pt1\mathbf{b}_{t-1}=\mathbf{a}-\sum\limits_{i=1}^{t-1}\delta_{i}\cdot\mathbf{e}_{i}\in\mathbf{Ab}_{p_{t-1}}, we obtain that

(4.21) λk(t)=minrootqtminrootgt0,\lambda_{k}^{(t)}=\text{minroot}\ q_{t}\geq\text{minroot}\ g_{t}\geq 0,

where the first inequality follows from Lemma 4.6 and the second inequality follows from Proposition 4.8. Then, combining (4.21), (4.18) and Lemma 4.9, for each t[m]t\in[m], we have

δt\displaystyle\delta_{t} (k1)2k(1Φpt1t(𝐛t1)1a)\displaystyle\geq\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\Phi_{p_{t-1}}^{t}(\mathbf{b}_{t-1})-\frac{1}{a}}\bigg{)}
(k1)2k(1Φp0t(𝐛0)1a)\displaystyle\geq\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\Phi_{p_{0}}^{t}(\mathbf{b}_{0})-\frac{1}{a}}\bigg{)}
(k1)2k(1εa1+kεa1a).\displaystyle\geq\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}-\frac{1}{a}}\bigg{)}.

So, for each t[m]t\in[m], we have

(4.22) infa>1(aδt)infa>1(a(k1)2k(1εa1+kεa1a)).\inf\limits_{a>1}(a-\delta_{t})\,\,\leq\,\,\inf\limits_{a>1}\bigg{(}a-\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}-\frac{1}{a}}\bigg{)}\bigg{)}.

Set α:=a(1εk1)>0\alpha:=a-(1-\frac{\varepsilon}{k-1})>0. A simple calculation shows that

a(k1)2k(1εa1+kεa1a)\displaystyle a-\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}-\frac{1}{a}}\bigg{)} =1k(α+(1εk1)εα+(1εk1)+ε)\displaystyle=\frac{1}{k}\cdot\bigg{(}\alpha+\frac{(1-\frac{\varepsilon}{k-1})\varepsilon}{\alpha}+(1-\frac{\varepsilon}{k-1})+\varepsilon\bigg{)}
1k(2(1εk1)ε+(1εk1)+ε)\displaystyle\geq\frac{1}{k}\cdot\bigg{(}2\sqrt{(1-\frac{\varepsilon}{k-1})\varepsilon}+(1-\frac{\varepsilon}{k-1})+\varepsilon\bigg{)}
=1k(1εk1+ε)2,\displaystyle=\frac{1}{k}\cdot\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2},

where the equality holds if and only if α=(1εk1)ε\alpha=\sqrt{(1-\frac{\varepsilon}{k-1})\varepsilon}, i.e.,

a=a0:=(1εk1)ε+(1εk1).a=a_{0}:=\sqrt{(1-\frac{\varepsilon}{k-1})\varepsilon}+(1-\frac{\varepsilon}{k-1}).

This implies that if a01a_{0}\geq 1, i.e., ε(k1)2/k\varepsilon\leq(k-1)^{2}/k, then

(4.23) infa>1(a(k1)2k(1εa1+kεa1a))=1k(1εk1+ε)2.\inf\limits_{a>1}\bigg{(}a-\frac{(k-1)^{2}}{k}\bigg{(}\frac{1}{\frac{\varepsilon}{a-1}+\frac{k-\varepsilon}{a}-\frac{1}{a}}\bigg{)}\bigg{)}=\frac{1}{k}\cdot\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.

Combining (4.22) and (4.23), we arrive at (4.19). ∎
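The closing optimization over a can be replayed numerically: with α=a(1εk1)\alpha=a-(1-\frac{\varepsilon}{k-1}), the right-hand side of (4.22) is minimized by the AM-GM inequality at a0a_{0}, with value matching (4.23). A sketch with sample values k=4 and ε=3/2 (our choices; any ε(k1)2/k\varepsilon\leq(k-1)^{2}/k behaves the same):

```python
import numpy as np

k, eps = 4.0, 1.5                  # sample values with eps <= (k-1)^2 / k

def f(a):
    # the function minimized over a > 1 in (4.22)
    return a - ((k - 1) ** 2 / k) / (eps / (a - 1) + (k - eps) / a - 1 / a)

u = 1 - eps / (k - 1)
a0 = np.sqrt(u * eps) + u                      # the claimed minimizer
bound = (np.sqrt(u) + np.sqrt(eps)) ** 2 / k   # the claimed infimum (4.23)

grid = np.linspace(1.001, 10.0, 20000)
assert abs(f(a0) - bound) < 1e-9               # infimum attained at a0
assert np.all(f(grid) >= bound - 1e-6)         # never undercut on a > 1
```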

5. Proof of Theorem 1.4

Following [Coh16] and [Bra¨{\rm\ddot{a}}18, Theorem 6.1], we now prove Theorem 1.4 by employing Theorem 1.9.

Proof of Theorem 1.4.

For each i[m]i\in[m], let Wi:={𝐖i,1,,𝐖i,li}W_{i}:=\{\mathbf{W}_{i,1},\ldots,\mathbf{W}_{i,l_{i}}\} be the support of 𝐖i\mathbf{W}_{i}. By Lemma 2.8, we see that the polynomials

μ[𝐖1,j1,,𝐖m,jm](x),ji[li],i=1,,m\mu[\mathbf{W}_{1,j_{1}},\ldots,\mathbf{W}_{m,j_{m}}](x),\,j_{i}\in[l_{i}],i=1,\ldots,m

form an interlacing family. Then Lemma 2.3 implies that there exist j1[l1],,jm[lm]j_{1}\in[l_{1}],\ldots,j_{m}\in[l_{m}] such that

(5.1) maxrootμ[𝐖1,j1,,𝐖m,jm]maxroot𝔼μ[𝐖1,,𝐖m].\operatorname{maxroot}\ \mu[\mathbf{W}_{1,j_{1}},\ldots,\mathbf{W}_{m,j_{m}}]\leq\operatorname{maxroot}\ \mathbb{E}\ \mu[\mathbf{W}_{1},\ldots,\mathbf{W}_{m}].

Combining (2.5) and (5.1), we obtain that

(5.2) maxrootμ[𝐖1,j1,,𝐖m,jm]maxrootμ[𝔼𝐖1,,𝔼𝐖m].\operatorname{maxroot}\ \mu[\mathbf{W}_{1,j_{1}},\ldots,\mathbf{W}_{m,j_{m}}]\leq\operatorname{maxroot}\ \mu[\mathbb{E}\mathbf{W}_{1},\ldots,\mathbb{E}\mathbf{W}_{m}].

Since

tr(𝔼𝐖i)ε,rank(𝔼𝐖i)kfor all i[m],\operatorname{tr}(\mathbb{E}\mathbf{W}_{i})\leq\varepsilon,\quad\,\operatorname{rank}(\mathbb{E}\mathbf{W}_{i})\leq k\quad\text{for all $i\in[m]$},

and i=1m𝔼𝐖i=𝐈d\sum_{i=1}^{m}\mathbb{E}\mathbf{W}_{i}=\mathbf{I}_{d}, Theorem 1.9 gives

maxrootμ[𝔼𝐖1,,𝔼𝐖m](1εk1+ε)2.\operatorname{maxroot}\ \mu[\mathbb{E}\mathbf{W}_{1},\ldots,\mathbb{E}\mathbf{W}_{m}]\leq\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.

Finally, combining Lemma 2.7 and (5.2) we arrive at

i=1m𝐖i,jimaxrootμ[𝐖1,j1,,𝐖m,jm](1εk1+ε)2.\bigg{\|}\sum_{i=1}^{m}\mathbf{W}_{i,j_{i}}\bigg{\|}\leq\operatorname{maxroot}\ \mu[\mathbf{W}_{1,j_{1}},\ldots,\mathbf{W}_{m,j_{m}}]\leq\bigg{(}\sqrt{1-\frac{\varepsilon}{k-1}}+\sqrt{\varepsilon}\bigg{)}^{2}.

This implies the desired conclusion. ∎
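As a final numerical illustration of the bound in Theorem 1.4 (the helper name weaver_bound and the sample parameters are ours, not from the paper): the bound tends to 1 as ε0\varepsilon\to 0, increases with ε\varepsilon, and equals exactly k at the threshold ε=(k1)2/k\varepsilon=(k-1)^{2}/k.

```python
import numpy as np

def weaver_bound(k, eps):
    # the bound of Theorem 1.4, valid for eps <= (k-1)^2 / k
    return (np.sqrt(1 - eps / (k - 1)) + np.sqrt(eps)) ** 2

for k in (2, 3, 5, 10):
    eps_max = (k - 1) ** 2 / k
    grid = np.linspace(1e-6, eps_max, 200)
    vals = weaver_bound(k, grid)
    assert abs(vals[0] - 1.0) < 1e-2    # tends to 1 as eps -> 0
    assert np.all(np.diff(vals) > 0)    # strictly increasing in eps
    assert abs(vals[-1] - k) < 1e-9     # equals k at eps = (k-1)^2 / k
```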

References

  • [AB20] K. Alishahi and M. Barzegar, Paving property for real stable polynomials and strongly Rayleigh processes, arXiv preprint arXiv:2006.13923, 2020.
  • [And79] J. Anderson, Extensions, restrictions, and representations of states on CC^{*}-algebras, Trans. Amer. Math. Soc., 249 (1979), 303-329. MR 0525675. Zbl 0408.46049. http://dx.doi.org/10.2307/1998793.
  • [And79] J. Anderson, Extreme points in sets of positive linear maps on ()\mathcal{B}(\mathcal{H}), J. Funct. Anal., 31 (1979), 195-217. MR 0525951. Zbl 0422.46049. http://dx.doi.org/10.1016/0022-1236(79)90061-2.
  • [And81] J. Anderson, A conjecture concerning the pure states of ()\mathcal{B}(\mathcal{H}) and a related theorem, in Topics in Modern Operator Theory (Timisoara/Herculane, 1980), Operator Theory: Adv. Appl. 2, Birkha¨{\rm\ddot{a}}user, Boston, 1981, pp. 27-43. MR 0672813. Zbl 0455.47026.
  • [BSS12] J. Batson, D. A. Spielman, and N. Srivastava, Twice-Ramanujan sparsifiers, SIAM J. Comput. 41 (2012), 1704-1721. MR 3029269. Zbl 1260.05092. http://dx.doi.org/10.1137/090772873.
  • [BB08] J. Borcea and P. Brändén, Applications of stable polynomials to mixed determinants: Johnson’s conjectures, unimodality, and symmetrized Fischer products, Duke Math. J. 143 (2008), 205-223. MR 2420507. Zbl 1151.15013. http://dx.doi.org/10.1215/00127094-2008-018.
  • [BB10] J. Borcea and P. Brändén, Multivariate Pólya-Schur classification problems in the Weyl algebra, Proceedings of the London Mathematical Society, 101(1):73-104, 2010.
  • [BT89] J. Bourgain and L. Tzafriri, Restricted invertibility of matrices and applications, in Analysis at Urbana, Vol. II (Urbana, IL, 1986-1987), London Math. Soc. Lecture Note Ser. 138, Cambridge Univ. Press, Cambridge, (1989), pp. 61-107. MR 1009186. Zbl 0698.47018. http://dx.doi.org/10.1017/CBO9781107360204.006.
  • [BCMS19] M. Bownik, P. Casazza, A. Marcus and D. Speegle, Improved bounds in Weaver and Feichtinger conjectures, Journal für die reine und angewandte Mathematik, 749 (2019), 267-293.
  • [BS06] M. Bownik and D. Speegle, The Feichtinger conjecture for wavelet frames, Gabor frames and frames of translates, Canad. J. Math. 58 (2006), no. 6, 1121-1143.
  • [Bra¨{\rm\ddot{a}}11] P. Brändén, Obstructions to determinantal representability, Advances in Mathematics, 226(2):1202-1212, 2011.
  • [Bra¨{\rm\ddot{a}}18] P. Brändén, Hyperbolic polynomials and the Kadison-Singer problem, arXiv preprint, https://arxiv.org/pdf/1809.03255, 2018.
  • [CCLV05] P. Casazza, O. Christensen, A. Lindner and R. Vershynin, Frames and the Feichtinger conjecture, Proc. Amer. Math. Soc. 133 (2005), no. 4, 1025-1033.
  • [CT06] P. G. Casazza and J. C. Tremain, The Kadison-Singer problem in mathematics and engineering, Proc. Natl. Acad. Sci. USA 103, (2006), 2032-2039. MR 2204073. Zbl 1160.46333. http://dx.doi.org/10.1073/pnas.0507888103.
  • [Coh16] M. Cohen, Improved Spectral Sparsification and Kadison-Singer for Sums of Higher-rank Matrices, 2016, from http://www.birs.ca/events/2016/5-day-workshops/16w5111/videos/watch/201608011534-Cohen.html.
  • [FY19] O. Friedland and P. Youssef. Approximating matrices and convex bodies, International Mathematics Research Notices, 2019(8):2519–2537, 2019.
  • [Gro¨{\rm\ddot{o}}03] K. Gro¨{\rm\ddot{o}}chenig, Localized frames are finite unions of Riesz sequences, Adv. Comput. Math. 18 (2003), no. 2-4, 149-157.
  • [KS59] R. V. Kadison and I. M. Singer, Extensions of pure states, Amer. J. Math., 81 (1959), 383-400. MR 0123922. Zbl 0086.09704. http://dx.doi.org/10.2307/2372748.
  • [LPR05] A. Lewis, P. Parrilo, and M. Ramana. The lax conjecture is true. Proceedings of the American Mathematical Society, 133(9): 2495-2499, 2005.
  • [MSS15a] A. Marcus, D. A. Spielman, and N. Srivastava, Interlacing families I: Bipartite Ramanujan graphs of all degrees, Ann. of Math. 182 (2015), 307-325. http://dx.doi.org/10.4007/annals.2015.182.1.7.
  • [MSS15b] A. Marcus, D. A. Spielman, and N. Srivastava, Interlacing families II: mixed characteristic polynomials and the Kadison-Singer problem, Ann. of Math. (2) 182, no. 1 (2015): 327-350.
  • [MSS17] A. W. Marcus, D. A. Spielman, and N. Srivastava, Interlacing Families III: Sharper restricted invertibility estimates, arXiv: 1712.07766, 2017, to appear in Israel J. Math.
  • [MSS18] A. W. Marcus, D. A. Spielman, and N. Srivastava, Interlacing families IV: Bipartite ramanujan graphs of all sizes, SIAM Journal on Computing, 47(6): 2488-2509, 2018.
  • [Rav20] M. Ravichandran, Principal submatrices, restricted invertibility and a quantitative Gauss-Lucas theorem, International Mathematics Research Notices, Volume 2020, Issue 15, August 2020, Pages 4809-4832, https://doi.org/10.1093/imrn/rny163.
  • [RL20] M. Ravichandran and J. Leake, Mixed determinants and the Kadison-Singer problem, Mathematische Annalen (2020) 377:511-541. https://doi.org/10.1007/s00208-020-01986-7.
  • [RS19] M. Ravichandran and N. Srivastava, Asymptotically Optimal Multi-Paving, International Mathematics Research Notices, (2019) Vol. 00, No. 0, pp. 1-33 doi:10.1093/imrn/rnz111
  • [Tao13] T. Tao, Real stable polynomials and the Kadison-Singer problem, 2013, from https://terrytao.wordpress.com/2013/11/04/real-stable-polynomials-and-the-kadison-singer-problem/#more-7109.
  • [Wag11] D. Wagner, Multivariate stable polynomials: theory and applications. Bulletin of the American Mathematical Society, 48(1):53-84, January 2011.
  • [Wea04] N. Weaver, The Kadison-Singer problem in discrepancy theory, Discrete Math. 278 (2004), 227-239. MR 2035401. Zbl 1040.46040. http://dx.doi.org/10.1016/S0012-365X(03)00253-X.