Quantum Expander Mixing Lemma and its Converse

Ning Ning, Department of Statistics, Texas A&M University.

Abstract

Expander graphs are fundamental in both computer science and mathematics, with a wide array of applications. With quantum technology reshaping our world, quantum expanders have emerged, finding numerous uses in quantum information theory. The classical expander mixing lemma plays a central role in graph theory, offering essential insights into edge distribution within graphs and aiding in the analysis of diverse network properties and algorithms. This paper establishes the quantum analogue of the classical expander mixing lemma and its converse for quantum expanders.


1 Introduction

Expander graphs are foundational in both computer science and mathematics, with diverse applications across these domains (Hoory et al., 2006; Lubotzky, 2012). Their expansion property underlies advances in algorithm design, cryptographic protocols, circuit and proof complexity, derandomization and pseudorandom generators, and the theory of error-correcting codes. Expander graphs also yield structural insights in group theory, algebra, number theory, geometry, and combinatorics. With the rise of quantum technology, quantum expanders were introduced and have found numerous applications in quantum information theory (Ambainis and Smith, 2004; Ben-Aroya et al., 2008; Hastings, 2007a; Hastings and Harrow, 2009; Aharonov et al., 2014).

Quantum expanders are the quantum extension of expander graphs, described by means of operators that map quantum states to quantum states. A general quantum state is a density matrix, which is a trace-one and positive semidefinite operator, i.e.,

\rho=\sum\limits_{v}p_{v}\left|\psi_{v}\right\rangle\left\langle\psi_{v}\right|,\qquad 0\leq p_{v}\leq 1,\qquad\sum\limits_{v}p_{v}=1,

with $\left\{\left|\psi_{v}\right\rangle\right\}$ being some orthonormal basis of a Hilbert space $\mathcal{V}$. Notice that $\rho\in L(\mathcal{V}):=\operatorname{Hom}(\mathcal{V},\mathcal{V})$. An admissible quantum transformation $F:L(\mathcal{V})\rightarrow L(\mathcal{V})$ is any transformation that can be implemented by a quantum circuit, with unitary operators and measurements. Thus, admissible quantum operators map density matrices to density matrices; see Nielsen and Chuang (2001) and Kitaev et al. (2002).

In this paper, we rigorously define quantum expanders following Hastings (2007b) and Pisier (2014). In essence, we consider a quantum operator $T_{n}:M(N_{n})\rightarrow M(N_{n})$ in the following normalized form (dividing by $d$):

\displaystyle T_{n}(\eta)=\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\eta u_{j}^{(n)*},

(1.1)

where $M(N_{n})$ stands for the space of $N_{n}\times N_{n}$ complex matrices, $\big{(}u_{1}^{(n)},\cdots,u_{d}^{(n)}\big{)}$ is a $d$-tuple of unitary matrices with $d$ being a positive integer that is independent of $n$, and the superscript $``\ast"$ represents the conjugate transpose. We have the operator norm $\|T_{n}\|=1$ and $T_{n}(I_{N_{n}})=I_{N_{n}}$, where $I_{N_{n}}\in M(N_{n})$ is the identity matrix. In the quantum context, the spectral gap is measured through the reduced spectral radius $\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}$, where $T_{n}$ is restricted to the orthogonal complement $I_{N_{n}}^{\perp}$ of the identity matrix. A quantum expander sequence is one whose reduced spectral radius is at most $1-\epsilon$ for some $\epsilon>0$ independent of $n$, as $N_{n}$ tends to infinity with $n$.

The aim of this paper is to establish the quantum counterpart of the classical expander mixing lemma and its converse. The expander mixing lemma intuitively states that the edges of certain $d$-regular graphs are evenly distributed throughout the graph. Specifically, the normalized number of edges connecting two vertex subsets $S_{1}$ and $S_{2}$ is always close to the expected normalized number of edges between them in a random $d$-regular graph, which is $\frac{1}{n}|S_{1}||S_{2}|$. This well-known result appears, for example, as Corollary 9.2.5 of Alon and Spencer (2000). The expander mixing lemma plays a pivotal role in graph theory, providing insight into the distribution of edges within graphs and facilitating the analysis of network properties and algorithms. Several variants have been proposed. For instance, Chen et al. (2013) established an operator version of the expander mixing lemma and used it iteratively for bias amplification. This approach was further employed in Jeronimo et al. (2022) to formulate the iterated operator expander mixing lemma.

We establish the expander mixing lemma for quantum expanders, termed the quantum expander mixing lemma, for the first time to the best of the authors’ knowledge. This lemma asserts that for any two orthogonal projections $P_{1},P_{2}\in M(N_{n})$, the Hilbert–Schmidt inner product of $P_{1}$ and $T_{n}P_{2}$ is always close to $\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})$ for all $n$, for $T_{n}$ in the normalized form given in (1.1). In quantum mechanics, the Hilbert–Schmidt inner product of two matrices representing quantum states serves as a measure of the similarity between these states. A large inner product indicates a high degree of similarity or correlation between the states, whereas a small inner product indicates dissimilarity or near-orthogonality.

Studying the converse of the expander mixing lemma deepens the understanding of graph properties and their implications in diverse fields. The converse has been formulated in two distinct ways. The first bounds the second-largest (in absolute value) eigenvalue of a $d$-regular graph from above, under the assumption that the normalized number of edges between any two vertex subsets $S_{1}$ and $S_{2}$ is always close to $\frac{1}{n}|S_{1}||S_{2}|$, as demonstrated in Bilu and Linial (2006). The second shows that there exist vertex subsets $S_{1}$ and $S_{2}$ for which the deviation of the normalized number of edges between them from $\frac{1}{n}|S_{1}||S_{2}|$ cannot fall below a certain threshold, as shown in Lev (2015).

In this paper, we further establish the converse of the expander mixing lemma for quantum expanders, which is the first such result to the best of the authors’ knowledge. Following the second way, we focus on lower bounds for that deviation. Specifically, our converse of the quantum expander mixing lemma states that, for all $n$, there exist two orthogonal projections $P_{1},P_{2}\in M(N_{n})$ such that the difference between the Hilbert–Schmidt inner product of $P_{1}$ and $T_{n}P_{2}$ and $\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})$ is bounded below by a certain threshold.

The rest of the paper proceeds as follows. In Section 2, we formally define expander graphs and then present the classical expander mixing lemma. In Section 3, we rigorously define quantum expanders and then state our quantum expander mixing lemma along with its converse. Technical proofs are provided in Section 4. Throughout the paper, $\|\cdot\|$ denotes the Hilbert–Schmidt norm when applied to a matrix and the operator norm when applied to an operator, with the specific meaning determined by the context.

2 Expander mixing lemma

We start by formally defining the concept of an expander graph and then state the expander mixing lemma in Theorem 2.3. There exist several equivalent definitions of expander graphs, which can be characterized through vertex, edge, or spectral expansion. We adopt the spectral perspective, defining expander graphs in terms of a certain spectral gap by means of Markov operators. We refer interested readers to Tao (2015) for further details.

We consider an undirected graph $\mathcal{G}=(V,E)$ , where $V$ is the set of vertices and $E$ is the set of edges. A graph is finite if $V$ is finite, and hence $E$ is finite. We consider $\mathcal{G}$ as a $d$ -regular graph, where each vertex of $V$ is contained in exactly $d$ edges in $E$ ; we say that $d$ is the degree of the regular graph $\mathcal{G}$ . We let $\ell^{2}(V)$ be the finite-dimensional complex Hilbert space of functions $f:V\rightarrow\mathbb{C}$ with norm and inner product given respectively as

\|f\|_{\ell^{2}(V)}:=\Big{(}\sum_{v\in V}|f(v)|^{2}\Big{)}^{1/2}\quad\text{and}\quad\langle f,g\rangle_{\ell^{2}(V)}:=\sum_{v\in V}f(v)\overline{g(v)},

where the overline notation represents the complex conjugate. We then define the adjacency operator $A:\ell^{2}(V)\rightarrow\ell^{2}(V)$ on functions $f\in\ell^{2}(V)$ as the sum of $f$ over all of the neighbours of $v$ , i.e.,

Af(v):=\sum_{w\in V:\{v,w\}\in E}f(w).

Clearly, $A$ is a linear operator and one can associate it with the adjacency matrix.

Since $\mathcal{G}$ is $d$-regular, a linear algebraic definition of expansion is possible based on the eigenvalues of the adjacency matrix $A$. For $\mathcal{G}$ being undirected, $A$ is real symmetric. It is known that for $\mathcal{G}$ being a $d$-regular graph having $n$ vertices, the spectral theorem implies that $A$ has $n$ real-valued eigenvalues $\lambda_{1}\geq\lambda_{2}\geq\ldots\geq\lambda_{n}$ and these eigenvalues are in $[-d,d]$. Because $\mathcal{G}$ is regular, the uniform distribution $u\in\mathbb{R}^{n}$, with entry $u_{i}=1/n$ for all $i=1,\ldots,n$, is its stationary distribution. Thus, $u$ is an eigenvector of $A$ with eigenvalue $\lambda_{1}=d$, i.e., $Au=du$. The spectral gap of $\mathcal{G}$ is defined to be $d-\lambda_{2}$, which measures the spectral expansion.

The normalized variants of these definitions are commonly employed and offer greater convenience when displaying certain results. Consider the matrix $\frac{1}{d}A$ , which is the Markov transition matrix of $\mathcal{G}$ , and its eigenvalues are in $[-1,1]$ . It is associated with the Markov operator $T:\ell^{2}(V)\rightarrow\ell^{2}(V)$ defined by

\displaystyle\langle T\delta_{x},\delta_{y}\rangle_{\ell^{2}(V)}=\left\{\begin{array}[]{ll}1/d&(x,y)\in E,\\ 0&(x,y)\notin E,\end{array}\right.

(2.3)

where $\delta_{x}$ is the Kronecker delta function on $x$ . Then, $T$ is self-adjoint (equal to its conjugate transpose $T^{*}$ ) and the operator norm $\|T\|=1$ . Denote $1_{V}\in\ell^{2}(V)$ for the constant function $v\mapsto 1$ , and then $T(1_{V})=1_{V}$ . We can observe the reduction in the operator norm of $T$ when it is restricted to the orthogonal complement $1_{V}^{\perp}$ .

Definition 2.1.

Define the reduced spectral radius $\rho(\mathcal{G})$ of $\mathcal{G}$ as the restricted operator norm

\displaystyle\rho(\mathcal{G}):=\big{\|}T|_{1_{V}^{\perp}}\big{\|}.

In this paper, we assume graph $\mathcal{G}$ is connected, and then the largest eigenvalue $1$ of $T$ has multiplicity $1$ . We assume $\mathcal{G}$ is non-bipartite, and then $-1$ is not an eigenvalue of $T$ . Therefore, the reduced spectral radius $\rho(\mathcal{G})$ is always smaller than $1$ . We identify $\mathcal{G}$ with the set of vertices. A sequence $(\mathcal{G}_{n})_{n=1}^{\infty}$ is a sequence of finite $d$ -regular simple connected graphs with the cardinality of the vertex set $|V_{n}|=n\rightarrow\infty$ . For a given sequence of $d$ -regular graphs $(\mathcal{G}_{n})_{n=1}^{\infty}$ , the reduced spectral radius $\rho(\mathcal{G}_{n})$ for each $n\in\mathbb{N}$ is always smaller than $1$ . The operator $\Delta:=1-T$ is sometimes known as the (combinatorial or graph) Laplacian. This is a positive semidefinite operator with at least one zero eigenvalue. To inquire whether a uniform gap exists between $\rho(\mathcal{G}_{n})$ and $1$ , here is the definition of the expander sequence.

Definition 2.2.

A graph is called an expander if there is a spectral gap of size $\epsilon>0$ in $\Delta$, in the sense that the second-smallest eigenvalue of $\Delta$ exceeds the smallest one (which is zero) by at least $\epsilon$; we quantify this here by requiring $1-\rho(\mathcal{G})\geq\epsilon$. A sequence of $d$-regular graphs $(\mathcal{G}_{n})$ is called an expander sequence if there exists $\epsilon>0$ such that $\rho(\mathcal{G}_{n})\leq 1-\epsilon$ for all $n$.

We have defined the notion of an expander sequence, which is a family of graphs instead of an individual graph. It is also commonly defined through the $(n,d,\lambda)$-graph, which is a $d$-regular graph with $n$ vertices such that all of the eigenvalues of its adjacency matrix except one have absolute value at most $\lambda$, i.e., $\max_{i\neq 1}|\lambda_{i}|\leq\lambda$. If we fix $d$ and $\lambda<d$, then $(n,d,\lambda)$-graphs form a family of expander graphs with a constant spectral gap. The expander mixing lemma states that the number of edges between two vertex subsets $S_{1}$ and $S_{2}$ is always close to the expected number of edges between them in a random $d$-regular graph, namely $\frac{d}{n}|S_{1}||S_{2}|$; see, for instance, Corollary 9.2.5 of Alon and Spencer (2000).

Theorem 2.3 (Expander mixing lemma, Alon and Spencer, 2000).

Consider the expander sequence defined in accordance with Definition 2.2. For any two subsets $S_{1},S_{2}\subseteq V$, let

e(S_{1},S_{2})=\Big{|}\big{\{}(x,y)\in S_{1}\times S_{2}:xy\in E\big{\}}\Big{|}

be the number of edges between $S_{1}$ and $S_{2}$ (counting edges contained in the intersection of $S_{1}$ and $S_{2}$ twice). Then

\displaystyle\left|\frac{1}{d}e(S_{1},S_{2})-\frac{1}{n}|S_{1}||S_{2}|\right|\leq(1-\epsilon)\sqrt{|S_{1}||S_{2}|}.
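As a sanity check of Theorem 2.3, the following minimal sketch verifies the inequality exhaustively on the Petersen graph. The choice of graph is an assumption made for illustration and is not taken from the references: the Petersen graph is $3$-regular on $n=10$ vertices with adjacency eigenvalues $3,1,-2$, so its reduced spectral radius is $2/3$ and one may take $1-\epsilon=2/3$. Vertex subsets are encoded as bitmasks, and all variable names are hypothetical.

```python
from itertools import combinations
from math import sqrt

# Petersen graph as the Kneser graph KG(5,2): vertices are the 2-element
# subsets of {0,...,4}; two vertices are adjacent iff the subsets are disjoint.
verts = [frozenset(c) for c in combinations(range(5), 2)]
n = len(verts)      # 10 vertices
d = 3               # every vertex has exactly 3 neighbours
rho = 2.0 / 3.0     # assumed known: the adjacency spectrum is {3, 1, -2}

# nbr[i] is a bitmask of the neighbours of vertex i.
nbr = [sum(1 << j for j in range(n) if not (verts[i] & verts[j]))
       for i in range(n)]
# size[m] is the number of vertices in the subset encoded by bitmask m.
size = [bin(m).count("1") for m in range(1 << n)]

max_ratio = 0.0
for s2 in range(1, 1 << n):
    # into_s2[x] = number of neighbours of x that lie in S2.
    into_s2 = [size[nbr[x] & s2] for x in range(n)]
    # e[s1] = e(S1, S2), obtained by adding the lowest vertex of S1
    # to the value already computed for S1 without that vertex.
    e = [0] * (1 << n)
    for s1 in range(1, 1 << n):
        low = (s1 & -s1).bit_length() - 1
        e[s1] = e[s1 & (s1 - 1)] + into_s2[low]
        deviation = abs(e[s1] / d - size[s1] * size[s2] / n)
        max_ratio = max(max_ratio, deviation / sqrt(size[s1] * size[s2]))

# The lemma predicts that the largest normalized deviation is at most rho.
print(max_ratio <= rho + 1e-12)
```

Here `max_ratio` is the largest value of the left-hand side divided by $\sqrt{|S_{1}||S_{2}|}$ over all pairs of non-empty subsets, so comparing it with `rho` tests the inequality for every pair at once.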

A converse to the above well-known expander mixing lemma is provided in Corollary 4 of Lev (2015).

Theorem 2.4 (Converse of expander mixing lemma, Lev, 2015).

Consider the expander sequence defined in accordance with Definition 2.2. With $e(S_{1},S_{2})$ as in Theorem 2.3, there exist non-empty subsets $S_{1},S_{2}\subseteq V$ such that

\displaystyle\left|\frac{1}{d}e(S_{1},S_{2})-\frac{1}{n}|S_{1}||S_{2}|\right|\geq\frac{(1-\epsilon)}{32\sqrt{2}(\log(2/(1-\epsilon))+4)}\sqrt{|S_{1}||S_{2}|}.

3 Main results

In this section, we formally define the concept of quantum expanders following Hastings (2007b) and Pisier (2014). Before that, we provide the motivation from classical expanders and explain how quantum expanders arise as a non-commutative version of the classical ones.

We begin with Cayley graphs. Let $G$ be a finite group generated by $S=\{s_{1},\ldots,s_{d}\}$. Suppose $S$ is symmetric, in the sense that $s\in S$ whenever $s^{-1}\in S$, and does not contain the identity $1$, to avoid loops. The Cayley graph $\operatorname{Cay}(G,S)$ is defined as the graph with vertex set $G$ and edge set $\{\{sx,x\}:x\in G,s\in S\}$. Each vertex $x\in G$ is connected to the $|S|=d$ elements $sx$ for $s\in S$, and hence $\operatorname{Cay}(G,S)$ is a $d$-regular graph. A unitary representation of $G$ on the Hilbert space $\ell^{2}(G)$ is a homomorphism $\rho:G\rightarrow U(\ell^{2}(G))$ from $G$ to the group $U(\ell^{2}(G))$ of unitary transformations on $\ell^{2}(G)$. Define the (left) regular unitary representation of $G$ as $\pi_{G}:G\rightarrow U(\ell^{2}(G))$ such that

\displaystyle(\pi_{G}(g)f)(x)=f(g^{-1}x)

for $f\in\ell^{2}(G)$ and $x,g\in G$. For $s\in S$, the operator $\pi_{G}(s)$ is a unitary induced by a permutation of the vertices of the graph.

To discuss expanders, let us assume that we are given a sequence of Cayley graphs $\{\operatorname{Cay}(G_{n},S_{n})\}_{n=1}^{\infty}$, where $S_{n}=\{s_{1}(n),\cdots,s_{d}(n)\}\subset G_{n}$ for each $n$ with $d$ being a positive integer independent of $n$. Then $\{\operatorname{Cay}(G_{n},S_{n})\}_{n=1}^{\infty}$, where

N_{n}:=|G_{n}|\rightarrow\infty\quad\text{as}\quad n\rightarrow\infty,

is an expander sequence if and only if the sequence of $d$ -tuples of unitaries

\displaystyle\Big{\{}\big{(}\pi_{G_{n}}(s_{1}(n)),\cdots,\pi_{G_{n}}(s_{d}(n))\big{)}\in U(\ell^{2}(G_{n}))^{d}\Big{\}}_{n=1}^{\infty}

satisfies, for some $\epsilon>0$,

\displaystyle\sup_{n}\left\|\Bigg{(}\frac{1}{d}\sum_{i=1}^{d}\pi_{G_{n}}(s_{i}(n))\Bigg{)}\Bigg{|}_{1_{G_{n}}^{\perp}}\right\|\leq 1-\epsilon,

where $\|\cdot\|$ is the operator norm on $\ell^{2}(G_{n})$. This is equivalent to the condition that there exists a sequence of $d$-tuples of unitaries

\displaystyle\Big{\{}\big{(}u_{1}^{(n)},\cdots,u_{d}^{(n)}\big{)}\in U(N_{n})^{d}\Big{\}}_{n=1}^{\infty}\quad\text{with}\quad u_{j}^{(n)}=\pi_{G_{n}}(s_{j}(n)),

such that, for some $\epsilon>0$, every $n\geq 1$, and every $N_{n}\times N_{n}$ diagonal complex matrix $x$,

\displaystyle\left\|\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\Big{(}x-\frac{1}{N_{n}}{\operatorname{tr}}(x)\Big{)}u_{j}^{(n)*}\right\|\leq(1-\epsilon)\Bigg{\|}x-\frac{1}{N_{n}}{\operatorname{tr}}(x)\Bigg{\|},

(3.1)

where $U(N_{n})$ stands for the set of $N_{n}\times N_{n}$ unitary matrices.

The term “quantum expander” designates tuples of unitaries $\big{(}u_{1}^{(n)},\cdots,u_{d}^{(n)}\big{)}$ that satisfy (3.1) for every $x\in M(N_{n})$, not only for diagonal $x$, where $M(N_{n})$ stands for the space of $N_{n}\times N_{n}$ complex matrices. In this light, quantum expanders can be seen as a non-commutative version of the classical ones. Identifying $M(N_{n})$ with the space $B(\ell_{2}^{N_{n}})$ of bounded operators on the $N_{n}$-dimensional Hilbert space $\ell_{2}^{N_{n}}$, and in contrast to the Markov operator $T$ defined in (2.3), in the quantum setting we consider quantum operators acting on $\ell_{2}^{N_{n}}\otimes\overline{\ell_{2}^{N_{n}}}$ of the following form for each $n$:

T_{n}=\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\otimes\overline{u}_{j}^{(n)},\qquad u_{j}^{(n)}\in U(N_{n}),

where $\overline{u}_{j}^{(n)}$ is the complex conjugate of the matrix $u_{j}^{(n)}$. Identifying as usual $\ell_{2}^{N_{n}}\otimes\overline{\ell_{2}^{N_{n}}}$ with $S_{2}^{N_{n}}$, the Hilbert space obtained by equipping $M(N_{n})$ with the Hilbert–Schmidt inner product and the corresponding norm, we can write

\displaystyle\Bigg{\|}\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\otimes\overline{u}_{j}^{(n)}:\ell_{2}^{N_{n}}\otimes\overline{\ell_{2}^{N_{n}}}\rightarrow\ell_{2}^{N_{n}}\otimes\overline{\ell_{2}^{N_{n}}}\Bigg{\|}=\left\|T_{n}:S_{2}^{N_{n}}\rightarrow S_{2}^{N_{n}}\right\|.

Then we may consider $T_{n}$ as an operator acting on $M(N_{n})$ defined by

T_{n}(\eta)=\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\eta u_{j}^{(n)*},\qquad\forall\eta\in M(N_{n}),

which satisfies that

\begin{aligned}
\Bigg\|\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\otimes\overline{u}_{j}^{(n)}\Bigg\|
&=\sup\Bigg\{\Bigg\|\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\eta u_{j}^{(n)*}\Bigg\|\;\Bigg|\;\eta\in M(N_{n}),\,\|\eta\|\leq 1\Bigg\}\\
&=\sup\Bigg\{\frac{1}{d}\Bigg|\sum_{j=1}^{d}{\operatorname{tr}}\big(u_{j}^{(n)}\eta u_{j}^{(n)*}\zeta^{*}\big)\Bigg|\;\Bigg|\;\eta,\zeta\in M(N_{n}),\,\|\eta\|\leq 1,\,\|\zeta\|\leq 1\Bigg\}.
\end{aligned}

Analogous to the classical setting, we have the operator norm $\|T_{n}\|=1$ and $T_{n}(I_{N_{n}})=I_{N_{n}}$ , where $I_{N_{n}}\in M(N_{n})$ is the identity matrix. Then we ask how much the operator norm of the quantum operator $T_{n}$ would be decreased if it is restricted to the orthogonal complement $I_{N_{n}}^{\perp}$ of the identity matrix. For that purpose, we give the quantum version of Definition 2.1.

Definition 3.1.

Define the reduced spectral radius $\rho(n)$ as the restricted operator norm

\displaystyle\rho(n):=\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}.

A quantum expander is defined in terms of $\rho(n)$: there must exist $\epsilon>0$ such that $\rho(n)\leq 1-\epsilon$ for all $n$, which is consistent with the classical expander sequence in Definition 2.2.

Definition 3.2 (Hastings, 2007b; Pisier, 2014).

Fix a positive integer $d$ and a sequence of positive integers $\{N_{n}\}$ with $N_{n}\rightarrow\infty$ as $n\rightarrow\infty$ . We say a sequence of $d$ -tuples of unitaries

\Big{\{}\big{(}u_{1}^{(n)},\cdots,u_{d}^{(n)}\big{)}\in U(N_{n})^{d}\Big{\}}_{n=1}^{\infty}

is a quantum expander sequence if their reduced spectral radius $\rho(n)$ is smaller than $1$ uniformly in $n$, that is, if there exists $\epsilon>0$ such that $\rho(n)\leq 1-\epsilon$ for all $n$.

The following two theorems are the main results of this paper.

Theorem 3.3 (Quantum expander mixing lemma).

Consider a quantum expander sequence defined in accordance with Definition 3.2. For any two orthogonal projections $P_{1},P_{2}\in M(N_{n})$, we have that for all $n$,

\displaystyle\left|\langle P_{1},T_{n}P_{2}\rangle_{{\operatorname{HS}}}-\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})\right|\leq(1-\epsilon)\sqrt{{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})},

where the Hilbert–Schmidt inner product of two matrices $A$ and $B$ is defined as usual

\langle A,B\rangle_{{\operatorname{HS}}}={\operatorname{tr}}(A^{*}B).
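To illustrate Theorem 3.3 numerically, the sketch below uses an assumed toy example that does not come from the paper: a single operator of the form of $T_{n}$ with $N=2$, $d=3$ and $(u_{1},u_{2},u_{3})=(X,Y,Z)$, the Pauli matrices. Since distinct Pauli matrices anticommute, $T(\eta)=-\eta/3$ for every traceless $\eta$, so the reduced spectral radius is $1/3$ and one may take $1-\epsilon=1/3$. The helper names are hypothetical, and the projections tested are the identity together with randomly drawn rank-one projections, so this is a spot check rather than a proof.

```python
import math
import random

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    m = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(m)) for c in range(m)]
            for r in range(m)]

def dagger(a):
    """Conjugate transpose."""
    m = len(a)
    return [[complex(a[c][r]).conjugate() for c in range(m)] for r in range(m)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def hs_inner(a, b):
    """Hilbert-Schmidt inner product <A, B> = tr(A* B)."""
    return trace(matmul(dagger(a), b))

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]
UNITARIES = [X, Y, Z]    # assumed toy example: d = 3, N = 2
N, RHO = 2, 1.0 / 3.0    # T acts as multiplication by -1/3 on traceless matrices

def T(eta):
    """T(eta) = (1/d) * sum_j u_j eta u_j^*."""
    d = len(UNITARIES)
    terms = [matmul(matmul(u, eta), dagger(u)) for u in UNITARIES]
    return [[sum(t[r][c] for t in terms) / d for c in range(N)] for r in range(N)]

def rank_one_projection(rng):
    """Projection |v><v| onto a random unit vector v in C^2."""
    v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(N)]
    norm = math.sqrt(sum(abs(x) ** 2 for x in v))
    v = [x / norm for x in v]
    return [[v[r] * v[c].conjugate() for c in range(N)] for r in range(N)]

rng = random.Random(0)
projections = [I2] + [rank_one_projection(rng) for _ in range(20)]

max_ratio = 0.0
for p1 in projections:
    for p2 in projections:
        t1, t2 = trace(p1).real, trace(p2).real
        deviation = abs(hs_inner(p1, T(p2)) - t1 * t2 / N)
        max_ratio = max(max_ratio, deviation / math.sqrt(t1 * t2))

# The lemma predicts that the largest normalized deviation is at most RHO.
print(max_ratio <= RHO + 1e-12)
```

In this example the bound is not attained: for rank-one projections onto unit vectors $a$ and $b$, the deviation equals $\big|\tfrac{1}{6}-\tfrac{1}{3}|\langle a,b\rangle|^{2}\big|\leq\tfrac{1}{6}$.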

Theorem 3.4 (Converse of quantum expander mixing lemma).

Consider a quantum expander sequence defined in accordance with Definition 3.2. There exist two universal constants $C_{1},C_{2}>0$ such that, for every $n$, there are two non-zero orthogonal projections $P_{1},P_{2}\in M(N_{n})$ with

\displaystyle\left|\langle P_{1},T_{n}P_{2}\rangle_{{\operatorname{HS}}}-\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})\right|\geq\frac{1-\epsilon}{C_{1}(-\log(1-\epsilon)+C_{2})}\sqrt{{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})}.

4 Proofs of the paper

Proof of Theorem 3.3.

Let $E:M(N_{n})\rightarrow M(N_{n})$ be the orthogonal projection onto the space $\langle I_{N_{n}}\rangle=\ker(1-T_{n})$ . Then

E(P_{2})=\frac{{\operatorname{tr}}(P_{2})}{N_{n}}I_{N_{n}}.

Therefore,

\displaystyle{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})=\langle P_{1},N_{n}E(P_{2})\rangle_{{\operatorname{HS}}}.

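By the Cauchy–Schwarz inequality, together with the identity $(T_{n}-E)P_{2}=T_{n}\big(P_{2}-E(P_{2})\big)$, where $P_{2}-E(P_{2})\in I_{N_{n}}^{\perp}$ and $\|P_{2}-E(P_{2})\|^{2}\leq\|P_{2}\|^{2}={\operatorname{tr}}(P_{2})$, we have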

\begin{aligned}
\left|\langle P_{1},T_{n}P_{2}\rangle_{{\operatorname{HS}}}-\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})\right|^{2}
&=\Big|\langle P_{1},(T_{n}-E)P_{2}\rangle_{{\operatorname{HS}}}\Big|^{2}\\
&\leq{\operatorname{tr}}(P_{1}^{*}P_{1})\,{\operatorname{tr}}\Big(\big((T_{n}-E)P_{2}\big)^{*}\big((T_{n}-E)P_{2}\big)\Big)\\
&\leq{\operatorname{tr}}(P_{1})\,\big\|T_{n}|_{I_{N_{n}}^{\perp}}\big\|^{2}\,{\operatorname{tr}}(P_{2})\\
&\leq(1-\epsilon)^{2}\,{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2}).
\end{aligned}

∎

To prove Theorem 3.4, we begin by introducing the Schatten norms. In functional analysis, the Schatten norm (also known as the Schatten–von Neumann norm) arises as a generalization of $p$-integrability and includes the trace-class norm and the Hilbert–Schmidt norm as special cases. For $p\in[1,\infty)$, define the Schatten $p$-norm of $A\in M(N_{n})$ as

\|A\|_{p}=\Big{[}{\operatorname{tr}}\big{(}(A^{*}A)^{\frac{p}{2}}\big{)}\Big{]}^{\frac{1}{p}};

for $p=\infty$ , the convention is to define $\|\cdot\|_{\infty}$ as the operator norm

\|A\|_{\infty}=\sup_{x\in\mathbb{C}^{n},\|x\|=1}\|Ax\|.

The Schatten norms are unitarily invariant in the sense that for unitaries $U,V$ and $p\in[1,\infty]$ ,

\displaystyle\|UAV\|_{p}=\|A\|_{p}.

(4.1)

Considering $q=1$ as the dual index to $p=\infty$ , the noncommutative Hölder’s inequality says that for all $p,q,r\in[1,\infty]$ such that $\frac{1}{p}+\frac{1}{q}=\frac{1}{r}$ , we have that for any matrices $A,B\in M(N_{n})$ ,

\|AB\|_{r}\leq\|A\|_{p}\|B\|_{q}.

Its special case is the well-known inequality

\displaystyle\|A\|^{2}=\|A\|_{2}^{2}\leq\|A\|_{1}\|A\|_{\infty},

(4.2)

where $\|\cdot\|_{1}$ is called the trace norm or the nuclear norm, $\|\cdot\|_{2}$ is called the Frobenius norm or the Hilbert–Schmidt norm, and $\|\cdot\|_{\infty}$ is called the spectral norm or the operator norm. Lev (2015) defined the height of a matrix (Definition 4.1) using these three norms; by (4.2), the height of any non-zero matrix is at least one.

Definition 4.1 (Lev, 2015).

Define the height of a non-zero matrix $A\in M(N_{n})$ by

\displaystyle h(A):=\frac{\sqrt{\|A\|_{1}\|A\|_{\infty}}}{\|A\|}.
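Because all three norms in Definition 4.1 depend only on the singular values of $A$, the height can be computed from them directly. The sketch below, whose function name and example spectra are illustrative assumptions, shows that an orthogonal projection has height exactly one, while a matrix with slowly decaying singular values $1,\tfrac{1}{2},\ldots,\tfrac{1}{100}$ has a larger height.

```python
from math import sqrt

def height(singular_values):
    """h(A) = sqrt(||A||_1 * ||A||_inf) / ||A||_2, from the singular values of A."""
    s = [abs(x) for x in singular_values]
    trace_norm = sum(s)                             # Schatten 1-norm
    operator_norm = max(s)                          # Schatten infinity-norm
    hilbert_schmidt = sqrt(sum(x * x for x in s))   # Schatten 2-norm
    return sqrt(trace_norm * operator_norm) / hilbert_schmidt

h_projection = height([1, 1, 1, 0, 0])              # rank-3 projection in M(5)
h_two_scales = height([4, 1])                       # equals sqrt(20 / 17)
h_harmonic = height([1 / j for j in range(1, 101)]) # slowly decaying spectrum

print(h_projection, h_two_scales, h_harmonic)
```

A flat spectrum gives the minimal height, and the height grows as the singular values spread over many scales, which is the regime that the bound $h\leq K$ in the lemmas below rules out.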

For an operator $T_{n}:M(N_{n})\rightarrow M(N_{n})$ and $p\in[1,\infty]$, we denote by $\|T_{n}\|_{p\rightarrow p}:=\sup\{\|T_{n}(A)\|_{p}:A\in M(N_{n}),\,\|A\|_{p}\leq 1\}$ the operator norm induced by the Schatten $p$-norm. The three special cases $p=1,2$ and $\infty$ give the operator norms $\|T_{n}\|_{1\rightarrow 1}$, $\|T_{n}\|_{2\rightarrow 2}$ and $\|T_{n}\|_{\infty\rightarrow\infty}$, respectively. Next, for unitaries $U_{1},U_{2},V_{1},V_{2}\in M(N_{n})$, we define a linear map as follows:

\begin{aligned}
T_{n}(U_{1},U_{2},V_{1},V_{2}):\mathbb{C}^{n}&\rightarrow\mathbb{C}^{n}\\
(\lambda_{1},\cdots,\lambda_{n})&\mapsto{\operatorname{diag}}(U_{2}T_{n}(U_{1}D_{\lambda}V_{1})V_{2}),
\end{aligned}

(4.3)

where $D_{\lambda}$ is the diagonal matrix ${\operatorname{diag}}(\lambda_{1},\cdots,\lambda_{n})$. Here, we follow the convention that when the ${\operatorname{diag}}$ operator is applied to a vector, it yields the corresponding diagonal matrix, and when applied to a matrix, it produces the vector of its diagonal entries. Then we have

\displaystyle\|T_{n}\|_{p\rightarrow p}=\sup_{U_{1},U_{2},V_{1},V_{2}}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{p\rightarrow p}.

(4.4)

The next lemma is a special case of a result on page 226 of Gillespie (1991).

Lemma 4.2 (Gillespie, 1991).

For any linear operator $T_{n}:M(N_{n})\rightarrow M(N_{n})$ , we have

\displaystyle\|T_{n}\|_{2\rightarrow 2}^{2}\leq\|T_{n}\|_{1\rightarrow 1}\|T_{n}\|_{\infty\rightarrow\infty}.

Analogous to the height of a matrix in Definition 4.1, we define the height of a linear operator; by Lemma 4.2, the height of any linear operator is at least one.

Definition 4.3.

For any linear operator $T_{n}:M(N_{n})\rightarrow M(N_{n})$ , define its height by

\displaystyle h(T_{n}):=\frac{\sqrt{\|T_{n}\|_{1\rightarrow 1}\|T_{n}\|_{\infty\rightarrow\infty}}}{\|T_{n}\|_{2\rightarrow 2}}.

Recall that in the classical setting, for an operator $T:\mathbb{C}^{n}\rightarrow\mathbb{C}^{n}$ , we have

\displaystyle\|T\|_{\ell^{1}\rightarrow\ell^{1}}=\|T^{*}\|_{\ell^{\infty}\rightarrow\ell^{\infty}}.

In the quantum setting, denote $T_{n}^{*}$ as the adjoint of $T_{n}:M(N_{n})\rightarrow M(N_{n})$ with respect to the Hilbert–Schmidt inner product. For $\lambda,\mu\in\mathbb{C}^{n}$ , we have

\begin{aligned}
\Big\langle\mu,\,{\operatorname{diag}}\big(U_{2}T_{n}(U_{1}D_{\lambda}V_{1})V_{2}\big)\Big\rangle
&=\Big\langle D_{\mu},\,U_{2}T_{n}(U_{1}D_{\lambda}V_{1})V_{2}\Big\rangle_{{\operatorname{HS}}}\\
&=\Big\langle U_{2}^{*}D_{\mu}V_{2}^{*},\,T_{n}(U_{1}D_{\lambda}V_{1})\Big\rangle_{{\operatorname{HS}}}\\
&=\Big\langle U_{1}^{*}T_{n}^{*}(U_{2}^{*}D_{\mu}V_{2}^{*})V_{1}^{*},\,D_{\lambda}\Big\rangle_{{\operatorname{HS}}}\\
&=\Big\langle{\operatorname{diag}}\big(U_{1}^{*}T_{n}^{*}(U_{2}^{*}D_{\mu}V_{2}^{*})V_{1}^{*}\big),\,\lambda\Big\rangle,
\end{aligned}

which yields that

T_{n}(U_{1},U_{2},V_{1},V_{2})^{*}=T_{n}^{*}(U_{2}^{*},U_{1}^{*},V_{2}^{*},V_{1}^{*}).

Hence, we have

\begin{aligned}
\|T_{n}\|_{2\rightarrow 2}&=\|T_{n}^{*}\|_{2\rightarrow 2},\\
\|T_{n}\|_{1\rightarrow 1}=\|T_{n}^{*}\|_{\infty\rightarrow\infty}\quad&\text{and}\quad\|T_{n}\|_{\infty\rightarrow\infty}=\|T_{n}^{*}\|_{1\rightarrow 1}.
\end{aligned}

(4.5)

By Definition 4.3,

\displaystyle h(T_{n})=h(T_{n}^{*})

(4.6)

In the classical setting, Lev (2015) defined the logarithmic diameter of a non-zero vector $z=(z_{1},\ldots,z_{n})\in\mathbb{C}^{n}$ by

\displaystyle\ell(z):=\frac{\max\{|z_{i}|:i\in\{1,\ldots,n\}\}}{\min\{|z_{i}|:i\in\{1,\ldots,n\},z_{i}\neq 0\}}.

(4.7)

In the quantum setting, considering complex matrices, we define the logarithmic diameter as follows.

Definition 4.4.

For $A\in M(N_{n})$ , let

\displaystyle 0<\lambda_{1}\leq\lambda_{2}\leq\cdots\leq\lambda_{k}

be all the non-zero (strictly positive) eigenvalues of $A^{*}A$ . We define the logarithmic diameter of $A$ by

\displaystyle\ell(A):=\sqrt{\lambda_{k}/\lambda_{1}}.
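For a diagonal matrix $D_{z}={\operatorname{diag}}(z)$, the non-zero eigenvalues of $D_{z}^{*}D_{z}$ are the non-zero values $|z_{i}|^{2}$, so Definition 4.4 reduces to the vector quantity $\ell(z)$ of Lev (2015), the ratio of the largest modulus to the smallest non-zero modulus. The sketch below checks this on one example; the function names and the vector `z` are illustrative assumptions.

```python
from math import sqrt

def log_diameter_vector(z):
    """Ratio of the largest modulus to the smallest non-zero modulus of z."""
    moduli = [abs(x) for x in z if x != 0]
    return max(moduli) / min(moduli)

def log_diameter_diagonal(z):
    """Definition 4.4 applied to A = diag(z): sqrt(lambda_k / lambda_1)."""
    # The eigenvalues of A* A for a diagonal A are |z_i|^2; keep the non-zero ones.
    eigenvalues = sorted(abs(x) ** 2 for x in z if x != 0)
    return sqrt(eigenvalues[-1] / eigenvalues[0])

z = [3 + 4j, 0, -0.5, 2j]     # moduli 5, 0, 0.5, 2
print(log_diameter_vector(z), log_diameter_diagonal(z))
```

By unitary invariance of the singular values, the same agreement holds for any matrix of the form $U D_{z} V$ with $U,V$ unitary.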

The following lemma will serve as the cornerstone for the subsequent proofs.

Lemma 4.5.

If $T_{n}:M(N_{n})\rightarrow M(N_{n})$ is a linear map with $h(T_{n})\leq K$ for a real number $K\geq 1$, then there exists a non-zero matrix $A\in M(N_{n})$ such that

\|T_{n}(A)\|>\frac{1}{4}\|T_{n}\|_{2\rightarrow 2}\|A\|\quad\text{and}\quad\ell(A)<32K^{2}+1.

Proof.

By the definition of $T_{n}(U_{1},U_{2},V_{1},V_{2})$ given in (4.3), together with (4.4), we may fix unitaries $U_{1},U_{2},V_{1},V_{2}\in M(N_{n})$ such that

\displaystyle\|T_{n}\|_{2\rightarrow 2}=\sup_{U_{1},U_{2},V_{1},V_{2}}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{2\rightarrow 2}\leq 2\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|,

(4.8)

which yields that

\displaystyle\frac{1}{\|T_{n}\|_{2\rightarrow 2}}\geq\frac{1}{2\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|}.

Then by the definition of the height of a matrix given in Definition 4.1, together with (4.4), we have

\begin{aligned}
h(T_{n}(U_{1},U_{2},V_{1},V_{2}))^{2}
&=\frac{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{1}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{\infty}}{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{2}^{2}}\\
&\leq 4\frac{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{1}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{\infty}}{\|T_{n}\|_{2\rightarrow 2}^{2}}\\
&\leq 4\frac{\|T_{n}\|_{1\rightarrow 1}\|T_{n}\|_{\infty\rightarrow\infty}}{\|T_{n}\|_{2\rightarrow 2}^{2}}.
\end{aligned}

Therefore, by the definition of the height of a linear operator given in Definition 4.3, we have

h(T_{n}(U_{1},U_{2},V_{1},V_{2}))\leq 2h(T_{n})\leq 2K.

By Lemma 1 of Lev (2015), there exists a non-zero $z\in\mathbb{C}^{n}$ such that

\displaystyle\|T_{n}(U_{1},U_{2},V_{1},V_{2})z\|>\frac{1}{2}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|\cdot\|z\|,

(4.9)

and the logarithmic diameter of $z$, defined in (4.7), satisfies

\ell(z)<8(2K)^{2}+1.

Taking the matrix

\displaystyle A=U_{1}D_{z}V_{1},

(4.10)

with $D_{z}$ being the diagonal matrix ${\operatorname{diag}}(z)$, the logarithmic diameter of $A$, given in Definition 4.4, satisfies (since the non-zero singular values of $A$ are the non-zero values $|z_{i}|$)

\ell(A)=\ell(z)<32K^{2}+1.

Note that

\begin{aligned}
\langle T_{n}(A),T_{n}(A)\rangle_{{\operatorname{HS}}}
&=\Big\langle U_{2}T_{n}(A)V_{2},\,U_{2}T_{n}(A)V_{2}\Big\rangle_{{\operatorname{HS}}}\\
&\geq\Big\langle{\operatorname{diag}}(U_{2}T_{n}(A)V_{2}),\,{\operatorname{diag}}(U_{2}T_{n}(A)V_{2})\Big\rangle.
\end{aligned}

Recall that by the definition of $T_{n}(U_{1},U_{2},V_{1},V_{2})$ given in (4.3),

{\operatorname{diag}}(U_{2}T_{n}(A)V_{2})=T_{n}(U_{1},U_{2},V_{1},V_{2})z,

using the specific form of $A$. Combining this with the inequality above and equation (4.9),

\displaystyle\langle T_{n}(A),T_{n}(A)\rangle_{{\operatorname{HS}}}\geq\|T_{n}(U_{1},U_{2},V_{1},V_{2})z\|^{2}>\frac{1}{4}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|^{2}\|z\|^{2}.

Finally, since $\|A\|=\|D_{z}\|=\|z\|$ by (4.1) and $\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|\geq\frac{1}{2}\|T_{n}\|_{2\rightarrow 2}$ by (4.8), we obtain

\displaystyle\langle T_{n}(A),T_{n}(A)\rangle_{{\operatorname{HS}}}>\frac{1}{16}\|T_{n}\|_{2\rightarrow 2}^{2}\|A\|^{2},

which completes the proof. ∎

Lemma 4.6.

If $X\in M(N_{n})$ is a matrix with $h(X)\leq K$ for $K\geq 1$ a real number, then there exists a non-zero orthogonal projection $P\in M(N_{n})$ such that

\displaystyle|\langle X,P\rangle_{{\operatorname{HS}}}|\geq\frac{1}{2\sqrt{4\log(2K)+2}}\|X\|\|P\|.

Proof.

We proceed in two steps. In the first step, we establish the desired result for a self-adjoint matrix. In the second step, we extend it to a general matrix.

Step 1. Suppose $A\in M(N_{n})$ is a self-adjoint matrix with height $h(A)\leq K$ for $K\geq 1$ a real number, where $h(A)$ is given in Definition 4.1. Since $A$ is self-adjoint, there exists a unitary $U\in M(N_{n})$ such that

\displaystyle UAU^{*}={\operatorname{diag}}(\lambda_{1},\cdots,\lambda_{n})\in M(N_{n}).

By the definition of height and the unitarily invariant property (4.1), we have

\displaystyle h(A)^{2}=\frac{\|A\|_{1}\|A\|_{\infty}}{\|A\|^{2}}=\frac{\|UAU^{*}\|_{1}\|UAU^{*}\|_{\infty}}{\|UAU^{*}\|^{2}}=h(UAU^{*})^{2}.

Next, we have

\begin{aligned}
h((\lambda_{1},\cdots,\lambda_{n})^{T})^{2}
&=\frac{\|(\lambda_{1},\cdots,\lambda_{n})^{T}\|_{1}\,\|(\lambda_{1},\cdots,\lambda_{n})^{T}\|_{\infty}}{\|(\lambda_{1},\cdots,\lambda_{n})^{T}\|^{2}}\\
&=\frac{\|\operatorname{diag}(\lambda_{1},\cdots,\lambda_{n})\|_{1}\,\|\operatorname{diag}(\lambda_{1},\cdots,\lambda_{n})\|_{\infty}}{\|\operatorname{diag}(\lambda_{1},\cdots,\lambda_{n})\|^{2}}\\
&=h(UAU^{*})^{2},
\end{aligned}

where the superscript $``T"$ denotes the transpose, and each norm is applied to a vector or a matrix according to its corresponding definition. Hence,

h((\lambda_{1},\cdots,\lambda_{n})^{T})=h(A)\leq K.
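The chain of equalities above can be checked numerically. The following is a minimal sketch, assuming that $\|\cdot\|_{1}$ , $\|\cdot\|$ and $\|\cdot\|_{\infty}$ are the unnormalized Schatten $1$ -, $2$ - and $\infty$ -norms (Definition 4.1 may use a different normalization); the helper names `schatten_norms` and `height` are illustrative and not part of the paper.

```python
import numpy as np

def schatten_norms(M):
    """Return (trace norm, Hilbert-Schmidt norm, operator norm) of M."""
    s = np.linalg.svd(M, compute_uv=False)
    return s.sum(), np.sqrt((s ** 2).sum()), s.max()

def height(M):
    """h(M) = sqrt(||M||_1 ||M||_inf) / ||M||_2 (assumed normalization)."""
    n1, n2, ninf = schatten_norms(M)
    return np.sqrt(n1 * ninf) / n2

rng = np.random.default_rng(0)
n = 5
G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
A = (G + G.conj().T) / 2            # random self-adjoint matrix
lam, V = np.linalg.eigh(A)          # A = V diag(lam) V^*, so U = V^*
U = V.conj().T

h_A = height(A)
h_conj = height(U @ A @ U.conj().T)
# Vector version: l1, l2 and sup norms of the eigenvalue vector.
h_vec = np.sqrt(np.abs(lam).sum() * np.abs(lam).max()) / np.linalg.norm(lam)
print(h_A, h_conj, h_vec)
```

Under these assumptions the three printed values agree up to floating-point error, which is the content of $h((\lambda_{1},\cdots,\lambda_{n})^{T})=h(UAU^{*})=h(A)$ .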

Then by Lemma 4 of Lev, (2015), there exists an $n$ -dimensional binary vector $\xi\in\{0,1\}^{n}$ such that

\displaystyle|\cos{((\lambda_{1},\cdots,\lambda_{n})^{T},\xi)}|\geq\frac{1}{2\sqrt{\log(2K^{2})+1}},

(4.11)

where, for non-zero vectors $u,v\in\mathbb{C}^{n}$ ,

\cos(u,v)=\langle u,v\rangle/\|u\|\|v\|.

Note that

\begin{aligned}
\|A\|&=\|(\lambda_{1},\cdots,\lambda_{n})^{T}\|,\\
\|U^{*}\operatorname{diag}(\xi_{1},\cdots,\xi_{n})U\|&=\|(\xi_{1},\cdots,\xi_{n})^{T}\|,
\end{aligned}

and, since the trace is invariant under cyclic permutations,

\begin{aligned}
\Big\langle A,\,U^{*}\operatorname{diag}(\xi_{1},\cdots,\xi_{n})U\Big\rangle_{\operatorname{HS}}
&=\operatorname{tr}\Big(A^{*}U^{*}\operatorname{diag}(\xi_{1},\cdots,\xi_{n})U\Big)\\
&=\operatorname{tr}\Big(UA^{*}U^{*}\operatorname{diag}(\xi_{1},\cdots,\xi_{n})\Big)\\
&=\operatorname{tr}\Big(\operatorname{diag}(\lambda_{1},\cdots,\lambda_{n})\,\operatorname{diag}(\xi_{1},\cdots,\xi_{n})\Big)\\
&=\Big\langle(\lambda_{1},\cdots,\lambda_{n})^{T},\,(\xi_{1},\cdots,\xi_{n})^{T}\Big\rangle.
\end{aligned}

Set

\displaystyle P=U^{*}{\operatorname{diag}}(\xi_{1},\cdots,\xi_{n})U

(4.12)

which is an orthogonal projection in $M(N_{n})$ . Then by equation (4.11), we have that

\displaystyle|\langle A,P\rangle_{{\operatorname{HS}}}|\geq\frac{1}{2\sqrt{\log(2K^{2})+1}}\|A\|\|P\|.

(4.13)
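The construction of $P$ in Step 1 can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it finds a binary vector $\xi$ by brute-force search over $\{0,1\}^{n}\setminus\{0\}$ for a small $n$ , which is not the argument of Lemma 4 of Lev, (2015), and it only checks the identities relating $\langle A,P\rangle_{\operatorname{HS}}$ to $\langle\lambda,\xi\rangle$ , not the lower bound (4.11). The names `cosine` and `best_xi` are hypothetical.

```python
import itertools
import numpy as np

rng = np.random.default_rng(1)
n = 4
G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
A = (G + G.conj().T) / 2
lam, V = np.linalg.eigh(A)           # A = V diag(lam) V^*, i.e. U = V^*

def cosine(u, v):
    """cos(u, v) = <u, v> / (||u|| ||v||) for real vectors u, v."""
    return np.vdot(u, v).real / (np.linalg.norm(u) * np.linalg.norm(v))

# Brute-force stand-in for the existence statement: best non-zero binary vector.
candidates = (
    np.array(bits, dtype=float)
    for bits in itertools.product([0, 1], repeat=n)
    if any(bits)
)
best_xi = max(candidates, key=lambda xi: abs(cosine(lam, xi)))

P = V @ np.diag(best_xi) @ V.conj().T    # P = U^* diag(xi) U
hs_inner = np.trace(A.conj().T @ P)      # <A, P>_HS = tr(A^* P)
ratio = abs(hs_inner) / (np.linalg.norm(A) * np.linalg.norm(P))
print(ratio, abs(cosine(lam, best_xi)))
```

Here `np.linalg.norm` of a matrix is the Frobenius (Hilbert-Schmidt) norm, so the two printed numbers coincide, matching the passage from (4.11) to (4.13).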

Step 2. According to the Toeplitz decomposition, each $X\in M(N_{n})$ can be written uniquely as

X=A+iB,

in which $A,B\in M(N_{n})$ are self-adjoint matrices; see Horn and Johnson, (2012). Since

\|X\|_{1}=\|X^{*}\|_{1}\quad\text{and}\quad\|X\|_{\infty}=\|X^{*}\|_{\infty},

we have

\displaystyle\|A\|_{1}=\left\|\frac{X+X^{*}}{2}\right\|_{1}\leq\frac{\|X\|_{1}+\|X^{*}\|_{1}}{2}=\|X\|_{1}\quad\text{and}\quad\|A\|_{\infty}\leq\|X\|_{\infty}.

(4.14)

Additionally, for the Hilbert-Schmidt norm, we have

\displaystyle\|X\|^{2}={\operatorname{tr}}((A+iB)(A-iB))={\operatorname{tr}}(A^{2})+{\operatorname{tr}}(B^{2})=\|A\|^{2}+\|B\|^{2}.

Replacing $X$ by $iX$ if necessary, we may assume $\|A\|^{2}\geq\|B\|^{2}$ , which gives

\|A\|\geq\frac{1}{\sqrt{2}}\|X\|.
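As a quick sanity check of the decomposition and the norm identity used here, the following sketch computes $A=(X+X^{*})/2$ and $B=(X-X^{*})/(2i)$ for a random complex matrix and applies the replacement $X\mapsto iX$ when needed. The function name `toeplitz_parts` is an illustrative choice, and the Hilbert-Schmidt norm is taken to be the Frobenius norm.

```python
import numpy as np

def toeplitz_parts(X):
    """Return self-adjoint A, B with X = A + iB."""
    A = (X + X.conj().T) / 2
    B = (X - X.conj().T) / 2j
    return A, B

rng = np.random.default_rng(2)
n = 4
X = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

A, B = toeplitz_parts(X)
if np.linalg.norm(A) < np.linalg.norm(B):
    # iX = (-B) + iA, so multiplying by i swaps the roles of A and B.
    X = 1j * X
    A, B = toeplitz_parts(X)

norm_X_sq = np.linalg.norm(X) ** 2
norm_parts_sq = np.linalg.norm(A) ** 2 + np.linalg.norm(B) ** 2
print(norm_X_sq, norm_parts_sq, np.linalg.norm(A) / np.linalg.norm(X))
```

Multiplying by $i$ leaves all the norms of $X$ unchanged, which is why the reduction to $\|A\|\geq\|B\|$ costs nothing.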

Then, by the definition of height given in Definition 4.1, equation (4.14), and the condition that $h(X)\leq K$ ,

\displaystyle h(A)=\frac{\sqrt{\|A\|_{1}\|A\|_{\infty}}}{\|A\|}\leq\frac{\sqrt{\|X\|_{1}\|X\|_{\infty}}}{\|X\|/\sqrt{2}}\leq\sqrt{2}K.

By the result of Step 1, specifically (4.13), there exists an orthogonal projection $P\in M(N_{n})$ such that

\begin{aligned}
|\langle A,P\rangle_{\operatorname{HS}}|
&\geq\frac{1}{2\sqrt{\log(2(\sqrt{2}K)^{2})+1}}\|A\|\|P\|
\geq\frac{1}{2\sqrt{\log(4K^{2})+1}}\frac{\|X\|}{\sqrt{2}}\|P\|\\
&=\frac{1}{2\sqrt{4\log(2K)+2}}\|X\|\|P\|.
\end{aligned}

Combining this with

\displaystyle|\langle X,P\rangle_{{\operatorname{HS}}}|=|\langle A,P\rangle_{{\operatorname{HS}}}+i\langle B,P\rangle_{{\operatorname{HS}}}|\geq|\langle A,P\rangle_{{\operatorname{HS}}}|

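yields the desired estimate; the last inequality holds because $\langle A,P\rangle_{{\operatorname{HS}}}$ and $\langle B,P\rangle_{{\operatorname{HS}}}$ are real when $A$ , $B$ , and $P$ are self-adjoint. ∎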

Theorem 4.7.

For any linear map $T_{n}:M(N_{n})\rightarrow M(N_{n})$ , there exists a non-zero orthogonal projection $P\in M(N_{n})$ such that

\displaystyle\frac{\|T_{n}\|_{2\rightarrow 2}}{8\sqrt{4(\log(48h^{2}(T_{n}))+2)}}<\frac{\|T_{n}P\|}{\|P\|}\leq\|T_{n}\|_{2\rightarrow 2}.

Proof.

The second inequality clearly holds by the definition of $\|T_{n}\|_{2\rightarrow 2}$ , since $P$ being a non-zero projection is a special case of a complex matrix. Now, we prove the first inequality.

Suppose $h(T_{n})=K$ for $K\geq 1$ a real number. By Lemma 4.5, there exists a non-zero matrix $A\in M(N_{n})$ such that

\displaystyle\|T_{n}(A)\|>\frac{1}{4}\|T_{n}\|_{2\rightarrow 2}\|A\|\quad\text{and}\quad\ell(A)<32K^{2}+1.

(4.15)

Furthermore, by equations (4.5) and (4.6), we have

\displaystyle\|T_{n}^{*}(A)\|\geq\frac{1}{4}\|T_{n}^{*}\|_{2\rightarrow 2}\|A\|=\frac{1}{4}\|T_{n}\|_{2\rightarrow 2}\|A\|.

(4.16)

Consider the spectral decomposition of $(A^{*}A)^{\frac{1}{2}}$ as

(A^{*}A)^{\frac{1}{2}}=\sum\limits_{i\in I}\sigma_{i}P_{i},

where $\{\sigma_{i}\}_{i\in I}$ is the set of eigenvalues of $(A^{*}A)^{\frac{1}{2}}$ and $P_{i}$ is the corresponding spectral projection. Then, for $p\in[1,\infty)$ , the Schatten $p$ -norm of $A\in M(N_{n})$ can be written as

\|A\|_{p}=\Big{[}{\operatorname{tr}}\big{(}(A^{*}A)^{\frac{p}{2}}\big{)}\Big{]}^{\frac{1}{p}}=\left(\sum_{i\in I}\sigma_{i}^{p}\right)^{\frac{1}{p}};

while for $p=\infty$ , it can be written as

\|A\|_{\infty}=\sup_{x\in\mathbb{C}^{n},\|x\|=1}\|Ax\|=\max_{i\in I}\sigma_{i}.

Now, choose $j\in I$ such that

\displaystyle\sigma_{j}=\min_{i\in I}\{\sigma_{i}\mid\sigma_{i}\neq 0\}.

By the definition of $\ell(A)$ given in Definition 4.4, which is stated in terms of the eigenvalues of $A^{*}A$ (instead of $(A^{*}A)^{\frac{1}{2}}$ ), together with equation (4.15), we obtain

\displaystyle h^{2}(A)=\frac{\|A\|_{1}\|A\|_{\infty}}{\|A\|^{2}}\leq\frac{(\sum\limits_{i\in I}\sigma_{i})\,\ell(A)\,\sigma_{j}}{\sum\limits_{i\in I}\sigma_{i}^{2}}\leq\ell(A)<36K^{2}.

Furthermore, by equation (4.15),

\begin{aligned}
h^{2}(T_{n}(A))=\frac{\|T_{n}(A)\|_{1}\|T_{n}(A)\|_{\infty}}{\|T_{n}(A)\|^{2}}
&<\frac{\|T_{n}\|_{1\rightarrow 1}\|A\|_{1}\|T_{n}\|_{\infty\rightarrow\infty}\|A\|_{\infty}}{(\frac{1}{4}\|T_{n}\|_{2\rightarrow 2}\|A\|)^{2}}\\
&=16h^{2}(T_{n})h^{2}(A).
\end{aligned}

Therefore,

h(T_{n}(A))<4h(T_{n})h(A)<24K^{2}.

By Lemma 4.6, there exists an orthogonal projection $P\in M(N_{n})$ such that

\displaystyle|\langle T_{n}^{*}A,P\rangle_{{\operatorname{HS}}}|>\frac{1}{2\sqrt{4\log(48K^{2})+2}}\|T_{n}^{*}A\|\|P\|.

Therefore, by equation (4.16),

\displaystyle|\langle A,T_{n}P\rangle_{{\operatorname{HS}}}|=|\langle T_{n}^{*}A,P\rangle_{{\operatorname{HS}}}|>\frac{1}{8\sqrt{4(\log(48K^{2})+2)}}\|T_{n}\|_{2\rightarrow 2}\|A\|\|P\|,

which yields that

\displaystyle\|T_{n}P\|>\frac{1}{8\sqrt{4(\log(48K^{2})+2)}}\|T_{n}\|_{2\rightarrow 2}\|P\|,

as desired.

∎

Theorem 4.8.

There exist two universal constants $C_{1},C_{2}>0$ and two non-zero orthogonal projections $P,Q\in M(N_{n})$ , such that for any linear map $T_{n}:M(N_{n})\rightarrow M(N_{n})$ , we have

\displaystyle\frac{\|T_{n}\|_{2\rightarrow 2}}{C_{1}(\log(h(T_{n}))+C_{2})}<\frac{\langle P,\,T_{n}Q\rangle_{{\operatorname{HS}}}}{\sqrt{{\operatorname{tr}}(P){\operatorname{tr}}(Q)}}\leq\|T_{n}\|_{2\rightarrow 2}.

Proof.

The second inequality clearly holds by the definition of $\|T_{n}\|_{2\rightarrow 2}$ , because the non-zero projections $P$ and $Q$ are specific instances of complex matrices. Now, we prove the first inequality.

Suppose $h(T_{n})=K$ for $K\geq 1$ a real number. Define

\displaystyle f(K):={8\sqrt{4(\log(48K^{2})+2)}}.

Theorem 4.7 states that there exists an orthogonal projection $P$ such that

\displaystyle\|T_{n}(P)\|>\|T_{n}\|_{2\rightarrow 2}\|P\|/f(K),

(4.17)

which yields that

\begin{aligned}
h(T_{n}(P))=\frac{\sqrt{\|T_{n}(P)\|_{1}\|T_{n}(P)\|_{\infty}}}{\|T_{n}(P)\|}
&<\frac{\sqrt{\|T_{n}\|_{1\rightarrow 1}\|P\|_{1}\|T_{n}\|_{\infty\rightarrow\infty}\|P\|_{\infty}}}{\|T_{n}\|_{2\rightarrow 2}\|P\|/f(K)}\\
&=h(T_{n})\frac{\sqrt{\|P\|_{1}\|P\|_{\infty}}}{\|P\|}f(K).
\end{aligned}

Note that, for an orthogonal projection $P$ , we have

\|P\|^{2}=\|P\|_{1}\|P\|_{\infty}.
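The identity follows because a rank- $r$ orthogonal projection has $r$ singular values equal to $1$ and the rest equal to $0$ , so $\|P\|_{1}=r$ , $\|P\|_{\infty}=1$ , and $\|P\|^{2}=r$ . A minimal numerical sketch, assuming unnormalized Schatten norms and a randomly generated projection (the construction via QR is an illustrative choice):

```python
import numpy as np

rng = np.random.default_rng(3)
n, r = 6, 2

# Orthonormal basis of a random r-dimensional subspace of C^n.
Q, _ = np.linalg.qr(rng.normal(size=(n, r)) + 1j * rng.normal(size=(n, r)))
P = Q @ Q.conj().T                  # rank-r orthogonal projection

s = np.linalg.svd(P, compute_uv=False)
trace_norm = s.sum()                # ||P||_1
hs_norm_sq = (s ** 2).sum()         # ||P||^2
op_norm = s.max()                   # ||P||_inf
print(trace_norm, op_norm, hs_norm_sq)
```

In particular $h(P)=1$ for every non-zero orthogonal projection, which is the smallest value the height can take.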

Then we have

\displaystyle h(T_{n}(P))<Kf(K).

By Lemma 4.6, there exists an orthogonal projection $Q\in M(N_{n})$ such that

\begin{aligned}
|\langle T_{n}(P),Q\rangle_{\operatorname{HS}}|
&\geq\frac{1}{2\sqrt{4\log(2Kf(K))+2}}\|T_{n}(P)\|\|Q\|\\
&>\frac{1}{2f(K)\sqrt{4\log(2Kf(K))+2}}\|T_{n}\|_{2\rightarrow 2}\|P\|\|Q\|,
\end{aligned}

where we used (4.17) in the last inequality. Note that

\displaystyle 2f(K)\sqrt{4\log(2Kf(K))+2}\leq C_{1}(\log K+C_{2})

for some $C_{1}>0$ and $C_{2}>0$ . The proof is complete. ∎
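The final bound in the proof rests on the fact that $f(K)$ grows like $\sqrt{\log K}$ , so the product $2f(K)\sqrt{4\log(2Kf(K))+2}$ grows like $\log K$ . The sketch below evaluates the ratio numerically on a grid; it assumes that $\log$ is the natural logarithm and uses the hypothetical choice $C_{2}=1$ , so the resulting value is only an estimate of an admissible $C_{1}$ on the sampled range, not a constant claimed by the paper.

```python
import math

def f(K):
    """f(K) = 8 sqrt(4 (log(48 K^2) + 2)), natural logarithm assumed."""
    return 8.0 * math.sqrt(4.0 * (math.log(48.0 * K ** 2) + 2.0))

def g(K):
    """The quantity 2 f(K) sqrt(4 log(2 K f(K)) + 2) bounded in the proof."""
    return 2.0 * f(K) * math.sqrt(4.0 * math.log(2.0 * K * f(K)) + 2.0)

C2 = 1.0                                        # hypothetical choice
Ks = [10.0 ** (k / 4) for k in range(0, 81)]    # K from 1 to 1e20
ratios = [g(K) / (math.log(K) + C2) for K in Ks]
C1_estimate = max(ratios)
print(C1_estimate, ratios[-1])
```

On this grid the ratio stays bounded, consistent with the existence of constants $C_{1},C_{2}>0$ for $K\geq 1$ .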

With Theorem 4.8, we are finally ready to establish the converse of the quantum expander mixing lemma.

Proof of Theorem 3.4.

As in the proof of Theorem 3.3, we let $E:M(N_{n})\rightarrow M(N_{n})$ be the orthogonal projection onto the space $\langle I_{N_{n}}\rangle=\ker(1-T_{n})$ . Then $T_{n}|_{I_{N_{n}}^{\perp}}$ is the restriction of $T_{n}$ onto $(1-E)$ .

According to the definition of a quantum expander sequence given in Definition 3.2, there exists $\epsilon>0$ such that

\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}\leq 1-\epsilon.

Here, we take $\epsilon$ to be the largest constant that satisfies the above inequality. Then by the definition of $\|\cdot\|_{2\rightarrow 2}$ , we have

\displaystyle\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}_{2\rightarrow 2}\geq 1-\epsilon.

Therefore,

h\big{(}T_{n}|_{I_{N_{n}}^{\perp}}\big{)}\leq\frac{\sqrt{\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}_{1\rightarrow 1}\big{\|}T_{n}|_{I_{N_{n}}^{\perp}}\big{\|}_{\infty\rightarrow\infty}}}{1-\epsilon}\leq\frac{2}{1-\epsilon}.

By Theorem 4.8, there exist two universal constants $C_{1}^{\prime},C_{2}^{\prime}>0$ and two non-zero orthogonal projections $P_{1},P_{2}\in M(N_{n})$ , such that

\displaystyle{\langle P_{1},\,(T_{n}-E)P_{2}\rangle_{{\operatorname{HS}}}}\geq\frac{1-\epsilon}{C_{1}^{\prime}(\log(\frac{2}{1-\epsilon})+C_{2}^{\prime})}\sqrt{{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})}.

Noting that

\displaystyle\langle P_{1},\,EP_{2}\rangle_{{\operatorname{HS}}}={\operatorname{tr}}\left(P_{1}^{*}\frac{{\operatorname{tr}}(P_{2})}{N_{n}}I_{N_{n}}\right)=\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2}),

we see that $P_{1}$ and $P_{2}$ satisfy

\displaystyle\left|{\langle P_{1},\,T_{n}(P_{2})\rangle_{{\operatorname{HS}}}}-\frac{1}{N_{n}}{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})\right|\geq\frac{1-\epsilon}{C_{1}(-\log(1-\epsilon)+C_{2})}\sqrt{{\operatorname{tr}}(P_{1}){\operatorname{tr}}(P_{2})},

where $C_{1},C_{2}>0$ are two universal constants. ∎
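The identity $\langle P_{1},EP_{2}\rangle_{\operatorname{HS}}=\frac{1}{N_{n}}\operatorname{tr}(P_{1})\operatorname{tr}(P_{2})$ used in the proof can be verified numerically. The sketch below assumes that $E$ acts as $E(X)=\frac{\operatorname{tr}(X)}{N_{n}}I_{N_{n}}$ , which is the orthogonal projection onto $\langle I_{N_{n}}\rangle$ with respect to the Hilbert-Schmidt inner product; the helper names are illustrative.

```python
import numpy as np

def E(X):
    """Projection onto span{I}: E(X) = tr(X) / N * I."""
    N = X.shape[0]
    return np.trace(X) / N * np.eye(N)

def hs(X, Y):
    """Hilbert-Schmidt inner product <X, Y>_HS = tr(X^* Y)."""
    return np.trace(X.conj().T @ Y)

def random_projection(N, r, rng):
    """Rank-r orthogonal projection onto a random subspace of C^N."""
    Q, _ = np.linalg.qr(rng.normal(size=(N, r)) + 1j * rng.normal(size=(N, r)))
    return Q @ Q.conj().T

rng = np.random.default_rng(4)
N, r1, r2 = 8, 3, 5
P1 = random_projection(N, r1, rng)
P2 = random_projection(N, r2, rng)

lhs = hs(P1, E(P2))
rhs = np.trace(P1) * np.trace(P2) / N
print(lhs.real, rhs.real)
```

Both printed values equal $r_{1}r_{2}/N_{n}$ up to rounding, since $\operatorname{tr}(P_{i})$ is the rank of $P_{i}$ .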

Acknowledgments

This research project was partially supported by NIH grant 1R21AI180492-01 and the Individual Research Grant at Texas A&M University. The author gives special thanks to Ryo Toyota for conversations on this subject.

References

Aharonov et al., (2014) Aharonov, D., Harrow, A. W., Landau, Z., Nagaj, D., Szegedy, M., and Vazirani, U. (2014). Local tests of global entanglement and a counterexample to the generalized area law. In 2014 IEEE 55th Annual Symposium on Foundations of Computer Science, pages 246–255. IEEE.
Alon and Spencer, (2000) Alon, N. and Spencer, J. H. (2000). The Probabilistic Method, second edition. John Wiley & Sons.
Ambainis and Smith, (2004) Ambainis, A. and Smith, A. (2004). Small pseudo-random families of matrices: Derandomizing approximate quantum encryption. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques: 7th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2004, and 8th International Workshop on Randomization and Computation, RANDOM 2004, Cambridge, MA, USA, August 22-24, 2004. Proceedings, pages 249–260. Springer.
Ben-Aroya et al., (2008) Ben-Aroya, A., Schwartz, O., and Ta-Shma, A. (2008). Quantum expanders: Motivation and constructions. In 2008 23rd Annual IEEE Conference on Computational Complexity, pages 292–303. IEEE.
Bilu and Linial, (2006) Bilu, Y. and Linial, N. (2006). Lifts, discrepancy and nearly optimal spectral gap. Combinatorica, 26(5):495–519.
Chen et al., (2013) Chen, S., Moore, C., and Russell, A. (2013). Small-bias sets for nonabelian groups: derandomizations of the Alon-Roichman theorem. In International Workshop on Approximation Algorithms for Combinatorial Optimization, pages 436–451. Springer.
Gillespie, (1991) Gillespie, T. (1991). Noncommutative variations on theorems of Marcel Riesz and others. In PAUL HALMOS Celebrating 50 Years of Mathematics, pages 221–236. Springer.
Hastings, (2007a) Hastings, M. B. (2007a). Entropy and entanglement in quantum ground states. Physical Review B, 76(3):035114.
Hastings, (2007b) Hastings, M. B. (2007b). Random unitaries give quantum expanders. Physical Review A, 76(3):032315.
Hastings and Harrow, (2009) Hastings, M. B. and Harrow, A. W. (2009). Classical and quantum tensor product expanders. arXiv preprint arXiv:0804.0011, 9(3):336–360.
Hoory et al., (2006) Hoory, S., Linial, N., and Wigderson, A. (2006). Expander graphs and their applications. Bulletin of the American Mathematical Society, 43(4):439–561.
Horn and Johnson, (2012) Horn, R. A. and Johnson, C. R. (2012). Matrix analysis. Cambridge university press.
Jeronimo et al., (2022) Jeronimo, F. G., Mittal, T., Roy, S., and Wigderson, A. (2022). Almost Ramanujan expanders from arbitrary expanders via operator amplification. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 378–388. IEEE.
Kitaev et al., (2002) Kitaev, A. Y., Shen, A., and Vyalyi, M. N. (2002). Classical and quantum computation. Number 47. American Mathematical Soc.
Lev, (2015) Lev, V. F. (2015). Discrete norms of a matrix and the converse to the expander mixing lemma. Linear Algebra and its Applications, 483:158–181.
Lubotzky, (2012) Lubotzky, A. (2012). Expander graphs in pure and applied mathematics. Bulletin of the American Mathematical Society, 49(1):113–162.
Nielsen and Chuang, (2001) Nielsen, M. A. and Chuang, I. L. (2001). Quantum computation and quantum information, volume 2. Cambridge university press Cambridge.
Pisier, (2014) Pisier, G. (2014). Quantum expanders and geometry of operator spaces. Journal of the European Mathematical Society, 16(6):1183–1219.
Tao, (2015) Tao, T. (2015). Expansion in finite simple groups of Lie type, volume 164. American Mathematical Soc.

\begin{aligned}
\Bigg\|\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\otimes\overline{u}_{j}^{(n)}\Bigg\|
&=\sup\Bigg\{\Bigg\|\frac{1}{d}\sum_{j=1}^{d}u_{j}^{(n)}\eta u_{j}^{(n)*}\Bigg\|\;\Bigg|\;\eta\in M(N_{n}),\,\|\eta\|\leq 1\Bigg\}\\
&=\sup\Bigg\{\frac{1}{d}\Bigg|\sum_{j=1}^{d}\operatorname{tr}(u_{j}^{(n)}\eta u_{j}^{(n)*}\zeta^{*})\Bigg|\;\Bigg|\;\eta,\zeta\in M(N_{n}),\,\|\eta\|\leq 1,\,\|\zeta\|\leq 1\Bigg\}.
\end{aligned}

\begin{aligned}
\left|\langle P_{1},T_{n}P_{2}\rangle_{\operatorname{HS}}-\frac{1}{N_{n}}\operatorname{tr}(P_{1})\operatorname{tr}(P_{2})\right|^{2}
&=\Big|\langle P_{1},(T_{n}-E)P_{2}\rangle_{\operatorname{HS}}\Big|^{2}\\
&\leq\operatorname{tr}(P_{1}^{*}P_{1})\operatorname{tr}\Big(\big((T_{n}-E)P_{2}\big)^{*}\big((T_{n}-E)P_{2}\big)\Big)\\
&\leq\operatorname{tr}(P_{1})\big\|T_{n}|_{I_{N_{n}}^{\perp}}\big\|^{2}\operatorname{tr}(P_{2})\\
&\leq(1-\epsilon)^{2}\operatorname{tr}(P_{1})\operatorname{tr}(P_{2}).
\end{aligned}

\displaystyle\|T_{n}\|_{2\rightarrow 2}=\|T_{n}^{*}\|_{2\rightarrow 2},

(4.5)

\displaystyle\|T_{n}\|_{1\rightarrow 1}=\|T_{n}^{*}\|_{\infty\rightarrow\infty}\quad\text{and}\quad\|T_{n}\|_{\infty\rightarrow\infty}=\|T_{n}^{*}\|_{1\rightarrow 1}.

\begin{aligned}
h(T_{n}(U_{1},U_{2},V_{1},V_{2}))^{2}
&=\frac{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{1}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{\infty}}{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{2}^{2}}\\
&\leq 4\frac{\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{1}\|T_{n}(U_{1},U_{2},V_{1},V_{2})\|_{\infty}}{\|T_{n}\|_{2\rightarrow 2}^{2}}\\
&\leq 4\frac{\|T_{n}\|_{1\rightarrow 1}\|T_{n}\|_{\infty\rightarrow\infty}}{\|T_{n}\|_{2\rightarrow 2}^{2}}.
\end{aligned}