
Quantum Implications of Huang’s Sensitivity Theorem

Scott Aaronson (Department of Computer Science, University of Texas at Austin; [email protected])    Shalev Ben-David (University of Waterloo; [email protected])    Robin Kothari (Microsoft Quantum and Microsoft Research; [email protected])    Avishay Tal (Department of Electrical Engineering and Computer Sciences, University of California at Berkeley; [email protected])
Abstract

Based on the recent breakthrough of Huang (2019), we show that for any total Boolean function $f$, the deterministic query complexity, $\mathsf{D}(f)$, is at most quartic in the quantum query complexity, $\mathsf{Q}(f)$: $\mathsf{D}(f)=O(\mathsf{Q}(f)^{4})$. This matches the known separation (up to log factors) due to Ambainis, Balodis, Belovs, Lee, Santha, and Smotrovs (2017). We also use the result to resolve the quantum analogue of the Aanderaa–Karp–Rosenberg conjecture. We show that if $f$ is a nontrivial monotone graph property of an $n$-vertex graph specified by its adjacency matrix, then $\mathsf{Q}(f)=\Omega(n)$, which is also optimal.

1 Introduction

Last year, Huang resolved a major open problem in the analysis of Boolean functions called the sensitivity conjecture [Hua19], which had been open for nearly 30 years [NS94]. Surprisingly, Huang's elegant proof takes less than two pages: truly a "proof from the book." Specifically, Huang showed that for any total Boolean function, i.e., a function $f:\{0,1\}^{n}\to\{0,1\}$, we have

$\mathsf{deg}(f)\leq\mathsf{s}(f)^{2}$, (1)

where $\mathsf{deg}(f)$ is the real degree of $f$ and $\mathsf{s}(f)$ is the (maximum) sensitivity of $f$. These and the other measures appearing in this introduction are defined in Section 2.

In this note, we describe some implications of Huang's resolution of the sensitivity conjecture for quantum query complexity. We observe that Huang actually proves a stronger claim, in which $\mathsf{s}(f)$ in Eq. 1 can be replaced by $\lambda(f)$, a spectral relaxation of sensitivity that we define later. This observation has several implications for quantum query complexity.

We use this observation to settle the optimal relation between the deterministic query complexity, $\mathsf{D}(f)$, and the quantum query complexity, $\mathsf{Q}(f)$, for total functions. We know from the seminal results of Nisan [Nis91], Nisan and Szegedy [NS94], and Beals et al. [BBC+01] that any total Boolean function $f$ satisfies (this means that for total functions, quantum query algorithms can only outperform classical query algorithms by a polynomial factor; on the other hand, for partial functions, which are defined on a subset of $\{0,1\}^{n}$, exponential and even larger speedups are possible)

$\mathsf{D}(f)=O(\mathsf{Q}(f)^{6})$. (2)

Grover's algorithm [Gro96] shows that for the or function, a quadratic separation between $\mathsf{D}$ and $\mathsf{Q}$ is possible. This was the best known quantum speedup for total functions until the work of Ambainis et al. [ABB+17], who constructed a total function $f$ with

$\mathsf{D}(f)=\widetilde{\Omega}(\mathsf{Q}(f)^{4})$. (3)

In this note, we show that the quartic separation (up to log factors) in Eq. 3 is actually the best possible:

Theorem 1.

For all Boolean functions $f:\{0,1\}^{n}\to\{0,1\}$, we have $\mathsf{D}(f)=O(\mathsf{Q}(f)^{4})$.

We deduce Theorem 1 as a corollary of a new tight quadratic relationship between $\mathsf{deg}(f)$ and $\mathsf{Q}(f)$:

Theorem 2.

For all Boolean functions $f:\{0,1\}^{n}\to\{0,1\}$, we have $\mathsf{deg}(f)=O(\mathsf{Q}(f)^{2})$.

Observe that Theorem 2 is tight for the or function on $n$ variables, whose degree is $n$ and whose quantum query complexity is $\Theta(\sqrt{n})$ [Gro96, BBBV97]. Prior to this work, the best relation between $\mathsf{deg}(f)$ and $\mathsf{Q}(f)$ was a sixth-power relation, $\mathsf{deg}(f)=O(\mathsf{Q}(f)^{6})$, which follows from Eq. 2.

As discussed earlier, our proof relies on a restatement of Huang's result (Theorem 5), showing that $\mathsf{deg}(f)\leq\lambda(f)^{2}$, where $\lambda(f)$ is the spectral relaxation of sensitivity defined in Section 3. We then show that the measure $\lambda(f)$ lower bounds the original quantum adversary method of Ambainis [Amb02], which in turn lower bounds $\mathsf{Q}(f)$.

We now show how Theorem 1 straightforwardly follows from Theorem 2 using two previously known connections between complexity measures of Boolean functions.

Proof of Theorem 1 assuming Theorem 2.

In [Mid04], Midrijanis showed that for all total functions $f$, we have

$\mathsf{D}(f)\leq\mathsf{bs}(f)\cdot\mathsf{deg}(f)$, (4)

where $\mathsf{bs}(f)$ is the block sensitivity of $f$.

Theorem 2 shows that $\mathsf{deg}(f)=O(\mathsf{Q}(f)^{2})$. Combining the relationship between block sensitivity and approximate degree from [NS94] with the results of [BBC+01], we get that $\mathsf{bs}(f)=O(\mathsf{Q}(f)^{2})$. (This can also be proved directly using the lower bound method of [BBBV97].)

Combining these three inequalities yields $\mathsf{D}(f)=O(\mathsf{Q}(f)^{4})$ for all total Boolean functions $f$. ∎

It remains to show the main result, Theorem 2, which we do in Section 3 using the proof of the sensitivity conjecture by Huang [Hua19] and the spectral adversary method in quantum query complexity [BSS03].

In Section 4, we also use Theorem 2 to prove the quantum analogue of the famous Aanderaa–Karp–Rosenberg conjecture. Briefly, this conjecture is about the minimum possible query complexity of a nontrivial monotone graph property, for graphs specified by their adjacency matrices.

There are variants of the conjecture for different models of computation. For example, the randomized variant of the Aanderaa–Karp–Rosenberg conjecture, attributed to Karp [SW86, Conjecture 1.2] and Yao [Yao77, Remark (2)], states that for all nontrivial monotone graph properties $f$, we have $\mathsf{R}(f)=\Omega(n^{2})$. Following a long line of work, the current best lower bound is $\mathsf{R}(f)=\Omega(n^{4/3}\log^{1/3}n)$, due to Chakrabarti and Khot [CK01].

The quantum version of the conjecture was raised by Buhrman, Cleve, de Wolf, and Zalka [BCdWZ99], who observed that the best one could hope for is $\mathsf{Q}(f)=\Omega(n)$, because the nontrivial monotone graph property "contains at least one edge" can be decided with $O(n)$ queries using Grover's algorithm [Gro96]. Buhrman et al. [BCdWZ99] also showed that all nontrivial monotone graph properties $f$ satisfy $\mathsf{Q}(f)=\Omega(\sqrt{n})$. The current best lower bound is $\mathsf{Q}(f)=\Omega(n^{2/3}\log^{1/6}n)$, which was credited to Yao in [MSS07]. We resolve this conjecture by showing an optimal $\Omega(n)$ lower bound.

Theorem 3.

Let $f:\{0,1\}^{\binom{n}{2}}\to\{0,1\}$ be a nontrivial monotone graph property. Then $\mathsf{Q}(f)=\Omega(n)$.

Theorem 3 follows by combining Theorem 2 with a known quadratic lower bound on the degree of monotone graph properties.

1.1 Known relations and separations

[Figure 1: a diagram relating the measures $\mathsf{D}$, $\mathsf{R}_0$, $\mathsf{Q}_E$, $\mathsf{C}$, $\mathsf{R}$, $\mathsf{RC}$, $\mathsf{bs}$, $\mathsf{s}$, $\lambda$, $\mathsf{deg}$, $\mathsf{Q}$, and $\widetilde{\mathsf{deg}}$.]
Figure 1: Relations between complexity measures. An upward line from a measure $M_1(f)$ to $M_2(f)$ denotes $M_1(f)=O(M_2(f))$ for all total functions $f$.

Table 1 summarizes the known relations and separations between the complexity measures studied in this paper (and more). This is an update to a similar table that appears in [ABK16], with the addition of $\mathsf{s}(f)$ and $\lambda(f)$. Definitions and additional details about interpreting the table can be found in [ABK16].

For all the separations claimed in the table, we provide either an example of a separating function or a citation to a result that constructs such a function. All the relationships in the table follow by combining the relationships depicted in Figure 1 and the following inequalities that hold for all total Boolean functions:

  • $\mathsf{C}(f)\leq\mathsf{bs}(f)\cdot\mathsf{s}(f)$ [Nis91]

  • $\mathsf{D}(f)\leq\mathsf{bs}(f)\cdot\mathsf{C}(f)$ [BBC+01]

  • $\mathsf{D}(f)\leq\mathsf{bs}(f)\cdot\mathsf{deg}(f)$ [Mid04]

  • $\mathsf{RC}(f)=O(\widetilde{\mathsf{deg}}(f)^{2})$ [KT16]

  • $\mathsf{R}_0(f)=O(\mathsf{R}(f)\cdot\mathsf{s}(f)\cdot\log\mathsf{RC}(f))$ [KT16]

  • $\mathsf{deg}(f)\leq\lambda(f)^{2}$ [Hua19]

  • $\mathsf{s}(f)\leq\lambda(f)^{2}$ (Lemma 15)

Table 1: Best known separations between complexity measures

      | D            | R0              | R               | C           | RC          | bs          | s           | λ            | QE             | deg            | Q              | ~deg
D     | —            | 2,2 [ABB+17]    | 2,3 [ABB+17]    | 2,2 ∧∘∨     | 2,3 ∧∘∨     | 2,3 ∧∘∨     | 3,6 [BHT17] | 4,6 [ABB+17] | 2,3 [ABB+17]   | 2,3 [GPW18]    | 4,4 [ABB+17]   | 4,6 [ABB+17]
R0    | 1,1 ⊕        | —               | 2,2 [ABB+17]    | 2,2 ∧∘∨     | 2,3 ∧∘∨     | 2,3 ∧∘∨     | 3,6 [BHT17] | 3,6 [BHT17]  | 2,3 [ABB+17]   | 2,3 [GJPW18]   | 3,4 [ABB+17]   | 4,6 [ABB+17]
R     | 1,1 ⊕        | 1,1 ⊕           | —               | 2,2 ∧∘∨     | 2,3 ∧∘∨     | 2,3 ∧∘∨     | 3,6 [BHT17] | 3,6 [BHT17]  | 3/2,3 [ABB+17] | 2,3 [GJPW18]   | 8/3,4 [Tal19]  | 4,6 [ABB+17]
C     | 1,1 ⊕        | 1,1 ⊕           | 1,2 ⊕           | —           | 2,2 [GSS13] | 2,2 [GSS13] | 2.22,5 [BHT17] | 2.22,6 [BHT17] | 1.15,3 [Amb13] | 1.63,3 [NW95] | 2,4 ∧       | 2,4 ∧
RC    | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 1,1 ⊕       | —           | 3/2,2 [GSS13] | 2,4 [Rub95] | 2,4 ∧      | 1.15,2 [Amb13] | 1.63,2 [NW95]  | 2,2 ∧          | 2,2 ∧
bs    | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 1,1 ⊕       | 1,1 ⊕       | —           | 2,4 [Rub95] | 2,4 ∧        | 1.15,2 [Amb13] | 1.63,2 [NW95]  | 2,2 ∧          | 2,2 ∧
s     | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 1,1 ⊕       | 1,1 ⊕       | 1,1 ⊕       | —           | 2,2 ∧        | 1.15,2 [Amb13] | 1.63,2 [NW95]  | 2,2 ∧          | 2,2 ∧
λ     | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 1,1 ⊕       | 1,1 ⊕       | 1,1 ⊕       | 1,1 ⊕       | —            | 1,1 ⊕          | 1,2 ⊕          | 1,1 ⊕          | 1,2 ⊕
QE    | 1,1 ⊕        | 1.33,2 nand-tree | 1.33,3 nand-tree | 2,2 ∧∘∨   | 2,3 ∧∘∨     | 2,3 ∧∘∨     | 3,6 [BHT17] | 3,6 [BHT17]  | —              | 2,3 [ABK16]    | 2,4 ∧          | 4,6 [ABK16]
deg   | 1,1 ⊕        | 1.33,2 nand-tree | 1.33,2 nand-tree | 2,2 ∧∘∨   | 2,2 ∧∘∨     | 2,2 ∧∘∨     | 2,2 ∧∘∨     | 2,2 ∧        | 1,1 ⊕          | —              | 2,2 ∧          | 2,4 ∧
Q     | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 2,2 [ABK16] | 2,3 [ABK16] | 2,3 [ABK16] | 3,6 [BHT17] | 3,6 [BHT17]  | 1,1 ⊕          | 2,3 [ABK16]    | —              | 4,6 [ABK16]
~deg  | 1,1 ⊕        | 1,1 ⊕           | 1,1 ⊕           | 2,2 [BT17]  | 2,2 [BT17]  | 2,2 [BT17]  | 2,2 [BT17]  | 2,2 [BT17]   | 1,1 ⊕          | 1,1 ⊕          | 1,1 ⊕          | —

  • An entry $a,b$ in row $M_1$ and column $M_2$ roughly means that for all total functions $f$, $M_1(f)\leq M_2(f)^{b+o(1)}$, and that there exists a function $g$ with $M_1(g)\geq M_2(g)^{a-o(1)}$ (see [ABK16] for a precise definition).

  • Alongside each entry is an example of a function that achieves the separation (or a citation to one), where ⊕ = parity, ∧ = and, ∨ = or, ∧∘∨ = and-or, and nand-tree is the balanced nand-tree function.

  • In the original table, cells have a white background if the relationship is optimal and a gray background otherwise; entries with a green background follow from Huang's result, and entries with a red background follow from this work.

1.2 Paper organization

Section 2 contains some preliminaries required to understand the proof of Theorem 2, which is proved in Section 3. Section 4 gives some background and motivation for the Aanderaa–Karp–Rosenberg conjecture and proves Theorem 3. We end with some open problems in Section 5.

Appendix A describes some properties of $\lambda(f)$, its many equivalent formulations, and its relationship with other complexity measures.

2 Preliminaries

2.1 Query complexity

Let $f:\{0,1\}^{n}\to\{0,1\}$ be a Boolean function. Let $A$ be a deterministic algorithm that computes $f(x)$ on input $x\in\{0,1\}^{n}$ by making queries to the bits of $x$. The worst-case number of queries $A$ makes (over choices of $x$) is the query complexity of $A$. The minimum query complexity of any deterministic algorithm computing $f$ is the deterministic query complexity of $f$, denoted by $\mathsf{D}(f)$.

We define the bounded-error randomized (respectively, quantum) query complexity of $f$, denoted by $\mathsf{R}(f)$ (respectively, $\mathsf{Q}(f)$), analogously. We say an algorithm $A$ computes $f$ with bounded error if $\Pr[A(x)=f(x)]\geq 2/3$ for all $x\in\{0,1\}^{n}$, where the probability is over the internal randomness of $A$. Then $\mathsf{R}(f)$ (respectively, $\mathsf{Q}(f)$) is the minimum number of queries required by any randomized (respectively, quantum) algorithm that computes $f$ with bounded error. It is clear that $\mathsf{Q}(f)\leq\mathsf{R}(f)\leq\mathsf{D}(f)$. For more details on these measures, see the survey by Buhrman and de Wolf [BdW02].

2.2 Sensitivity and block sensitivity

Let $f:\{0,1\}^{n}\to\{0,1\}$ be a Boolean function, and let $x\in\{0,1\}^{n}$ be a string. A block is a subset of $[n]$. We say that a block $B\subseteq[n]$ is sensitive for $x$ (with respect to $f$) if $f(x\oplus\mathbb{1}_B)\neq f(x)$, where $\mathbb{1}_B$ is the $n$-bit string that is $1$ on the bits in $B$ and $0$ otherwise. We say a bit $i$ is sensitive for $x$ if the block $\{i\}$ is sensitive for $x$. The maximum number of disjoint blocks that are all sensitive for $x$ is called the block sensitivity of $x$ (with respect to $f$), denoted by $\mathsf{bs}_x(f)$. The number of sensitive bits for $x$ is called the sensitivity of $x$, denoted by $\mathsf{s}_x(f)$. Clearly $\mathsf{bs}_x(f)\geq\mathsf{s}_x(f)$, since $\mathsf{s}_x(f)$ has the same definition as $\mathsf{bs}_x(f)$ except that the block size is restricted to $1$. We define $\mathsf{s}(f)=\max_{x\in\{0,1\}^{n}}\mathsf{s}_x(f)$ and $\mathsf{bs}(f)=\max_{x\in\{0,1\}^{n}}\mathsf{bs}_x(f)$.
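For small $n$, both measures can be computed by brute force, which is a useful sanity check on the definitions. The sketch below is our illustration (not part of the paper): inputs are encoded as integers, and $\mathsf{bs}_x(f)$ is computed exactly by dynamic programming over the set of still-available bits.

```python
def sensitivity(f, n):
    # s(f): max over x of the number of single-bit flips that change f
    return max(sum(f(x ^ (1 << i)) != f(x) for i in range(n))
               for x in range(1 << n))

def block_sensitivity(f, n):
    # bs(f): max over x of the max number of disjoint sensitive blocks,
    # found by DP over the mask of bits not yet used by any block.
    def bs_x(x):
        memo = {}
        def best(avail):
            if avail not in memo:
                m, B = 0, avail
                while B:  # iterate over all nonempty submasks of avail
                    if f(x ^ B) != f(x):
                        m = max(m, 1 + best(avail & ~B))
                    B = (B - 1) & avail
                memo[avail] = m
            return memo[avail]
        return best((1 << n) - 1)
    return max(bs_x(x) for x in range(1 << n))

n = 3
OR = lambda x: int(x != 0)       # the or function on n bits
print(sensitivity(OR, n))        # -> 3, attained at the all-zeros input
print(block_sensitivity(OR, n))  # -> 3
```

For or the two measures coincide; functions with $\mathsf{bs}(f)\gg\mathsf{s}(f)$, such as Rubinstein's function, require larger inputs.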

2.3 Degree measures

A polynomial $q\in\mathbb{R}[x_1,\ldots,x_n]$ is said to represent the function $f:\{0,1\}^{n}\to\{0,1\}$ if $q(x)=f(x)$ for all $x\in\{0,1\}^{n}$. A polynomial $q$ is said to $\varepsilon$-approximate $f$ if $q(x)\in[0,\varepsilon]$ for all $x\in f^{-1}(0)$ and $q(x)\in[1-\varepsilon,1]$ for all $x\in f^{-1}(1)$. The degree of $f$, denoted by $\mathsf{deg}(f)$, is the minimum degree of a polynomial representing $f$. The $\varepsilon$-approximate degree, denoted by $\widetilde{\mathsf{deg}}_\varepsilon(f)$, is the minimum degree of a polynomial $\varepsilon$-approximating $f$. We omit $\varepsilon$ when $\varepsilon=1/3$. We know that $\mathsf{D}(f)\geq\mathsf{deg}(f)$, $\mathsf{R}(f)\geq\widetilde{\mathsf{deg}}(f)$, and $\mathsf{Q}(f)\geq\widetilde{\mathsf{deg}}(f)/2$.

The degree of $f$ as a polynomial is also called the Fourier degree of $f$, which equals $\max\{|S| : \widehat{f}(S)\neq 0\}$, where $\widehat{f}(S):=\mathbf{E}_x[f(x)\cdot(-1)^{\sum_{i\in S}x_i}]$. In particular, $\mathsf{deg}(f)<n$ if and only if $f$ agrees with the parity function, $\textsc{parity}_n(x)=\oplus_{i=1}^{n}x_i$, on exactly half of the inputs.
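The Fourier characterization gives a direct way to compute $\mathsf{deg}(f)$ for small functions: compute every coefficient $\widehat{f}(S)$ (normalization does not affect which are nonzero) and take the largest $|S|$ with a nonzero value. A minimal illustrative sketch, with helper names of our choosing:

```python
def fourier_degree(f, n):
    # deg(f) = max |S| with hat f(S) != 0, where, up to normalization,
    # hat f(S) = sum_x f(x) * (-1)^{sum_{i in S} x_i}
    deg = 0
    for S in range(1 << n):
        coeff = sum(f(x) * (-1) ** bin(x & S).count("1")
                    for x in range(1 << n))
        if coeff != 0:
            deg = max(deg, bin(S).count("1"))
    return deg

n = 3
parity = lambda x: bin(x).count("1") % 2
OR = lambda x: int(x != 0)
dictator = lambda x: x & 1  # f(x) = x_1: agrees with parity_n on half the inputs
print(fourier_degree(parity, n), fourier_degree(OR, n), fourier_degree(dictator, n))
# -> 3 3 1
```

The dictator function illustrates the criterion above: it agrees with $\textsc{parity}_3$ on exactly half of the inputs, and indeed its degree is below $n$.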

3 Proof of main result (Theorem 2)

Before proving Theorem 2, which is based on Huang's proof, we reinterpret his result in terms of a new complexity measure of Boolean functions that we call $\lambda(f)$: the spectral norm of the sensitivity graph of $f$.

Definition 4 (Sensitivity Graph $G_f$, Spectral Sensitivity $\lambda(f)$).

Let $f:\{0,1\}^{n}\to\{0,1\}$ be a Boolean function. The sensitivity graph of $f$, $G_f=(V,E)$, is a subgraph of the Boolean hypercube, where $V=\{0,1\}^{n}$ and $E=\{(x,x\oplus e_i)\in V\times V : i\in[n],\ f(x)\neq f(x\oplus e_i)\}$. That is, $E$ is the set of edges between neighbors on the hypercube that have different $f$-values. Let $A_f$ be the adjacency matrix of the graph $G_f$. We define the spectral sensitivity of $f$ as $\lambda(f)=\lVert A_f\rVert$.

Note that because $A_f$ is a real symmetric matrix, $\lambda(f)$ is also the largest eigenvalue of $A_f$. Since $G_f$ is bipartite, the largest and smallest eigenvalues of $A_f$ are equal in magnitude.
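For intuition, $\lambda(f)$ is easy to compute exactly for small functions: build the $2^n\times 2^n$ adjacency matrix of $G_f$ and take its spectral norm. In the sketch below (our illustration, using NumPy), $G_{\textsc{or}}$ is a star centered at $0^n$, so $\lambda(\textsc{or}_n)=\sqrt{n}$, while for parity every hypercube edge is sensitive, so $\lambda(\textsc{parity}_n)=n$.

```python
import numpy as np

def spectral_sensitivity(f, n):
    # lambda(f): spectral norm of the adjacency matrix of the
    # sensitivity graph G_f (only sensitive hypercube edges are kept)
    N = 1 << n
    A = np.zeros((N, N))
    for x in range(N):
        for i in range(n):
            y = x ^ (1 << i)
            if f(x) != f(y):
                A[x, y] = 1
    return np.linalg.norm(A, 2)

n = 4
OR = lambda x: int(x != 0)
parity = lambda x: bin(x).count("1") % 2
print(spectral_sensitivity(OR, n))      # -> 2.0 = sqrt(4): G_f is a star
print(spectral_sensitivity(parity, n))  # -> 4.0: every edge is sensitive
```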

Huang's proof of the sensitivity conjecture can be divided into two steps:

  1. For all $f$: $\mathsf{deg}(f)\leq\lambda(f)^{2}$.

  2. For all $f$: $\lambda(f)\leq\mathsf{s}(f)$.

The second step is the simple fact that the spectral norm of an adjacency matrix is at most the maximum degree of any vertex in the graph, which equals $\mathsf{s}(f)$ in this case.

We reprove the first claim, i.e., $\mathsf{deg}(f)\leq\lambda(f)^{2}$, for completeness.

Theorem 5 ([Hua19]).

For all Boolean functions $f:\{0,1\}^{n}\to\{0,1\}$, we have $\mathsf{deg}(f)\leq\lambda(f)^{2}$.

Proof.

Without loss of generality we can assume that $\mathsf{deg}(f)=n$, since otherwise we can restrict our attention to a subcube of dimension $\mathsf{deg}(f)$ on which the degree remains the same and the top eigenvalue is at most $\lambda(f)$. Specifically, we can choose any monomial of degree $\mathsf{deg}(f)$ in the polynomial representing $f$ and set all the variables not appearing in this monomial to $0$.

For $f$ with $\mathsf{deg}(f)=n$, let $V_0=\{x\in\{0,1\}^{n} : f(x)=\textsc{parity}_n(x)\}$ and $V_1=\{x\in\{0,1\}^{n} : f(x)\neq\textsc{parity}_n(x)\}$. Since $\mathsf{deg}(f)=n$, we know that $|V_0|\neq|V_1|$, as otherwise $f$ would have zero correlation with the $n$-variate parity function, implying that $f$'s top Fourier coefficient is $0$.

We also note that any edge of the hypercube that goes between $V_0$ and $V_0$ is an edge of $G_f$, since it changes the value of $f$: for such an edge $(x,x\oplus e_i)$, we have $f(x)=\textsc{parity}_n(x)\neq\textsc{parity}_n(x\oplus e_i)=f(x\oplus e_i)$. Similarly, any edge of the hypercube that goes between $V_1$ and $V_1$ is an edge of $G_f$. Conversely, by the same reasoning, edges between $V_0$ and $V_1$ are not edges of $G_f$.

Assume without loss of generality that $|V_0|>|V_1|$. Thus $|V_0|\geq 2^{n-1}+1$. We will show that there exists a nonzero vector $v'$ supported only on the entries of $V_0$ such that $\lVert A_f\cdot v'\rVert\geq\sqrt{n}\cdot\lVert v'\rVert$.

Let $G=(V,E)$ be the complete $n$-dimensional Boolean hypercube; that is, $V=\{0,1\}^{n}$ and $E=\{(x,x\oplus e_i) : x\in\{0,1\}^{n},\ i\in[n]\}$. Take the following signing of the edges of the Boolean hypercube, defined recursively:

$B_1=\begin{pmatrix}0&1\\ 1&0\end{pmatrix}\quad\text{and}\quad B_i=\begin{pmatrix}B_{i-1}&I\\ I&-B_{i-1}\end{pmatrix}\ \text{for}\ i\in\{2,\ldots,n\}$. (5)

This gives a matrix $B_n\in\{-1,0,1\}^{V\times V}$ where $B_n(x,y)=0$ if and only if $x$ is not a neighbor of $y$ in the hypercube.

Huang showed that $B_n$ has $2^{n}/2$ eigenvalues equal to $-\sqrt{n}$ and $2^{n}/2$ eigenvalues equal to $+\sqrt{n}$. To show this, he proved by induction on $n$ that $B_n^2=n\cdot I$, and thus all eigenvalues of $B_n$ must be either $+\sqrt{n}$ or $-\sqrt{n}$. Then, observing that the trace of $B_n$ is $0$, as all diagonal entries equal $0$, we see that there must be an equal number of $+\sqrt{n}$ and $-\sqrt{n}$ eigenvalues.
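Both facts are easy to verify numerically for small $n$. The following sketch (our illustration) builds $B_n$ by the recursion in Eq. 5 and checks $B_n^2=nI$, the zero trace, and the eigenvalue counts:

```python
import numpy as np

def huang_signing(n):
    # B_n: Huang's signed adjacency matrix of the n-cube, built recursively
    B = np.array([[0.0, 1.0], [1.0, 0.0]])  # B_1
    for _ in range(2, n + 1):
        I = np.eye(B.shape[0])
        B = np.block([[B, I], [I, -B]])
    return B

n = 4
B = huang_signing(n)
assert np.allclose(B @ B, n * np.eye(2 ** n))  # B_n^2 = n I
assert abs(np.trace(B)) < 1e-9                 # trace 0
eigs = np.linalg.eigvalsh(B)                   # hence eigenvalues are +-sqrt(n),
print(int(np.sum(np.isclose(eigs, np.sqrt(n)))))  # -> 8 = 2^n / 2 of each sign
```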

Thus, the subspace of eigenvectors of $B_n$ with eigenvalue $\sqrt{n}$ has dimension $2^{n}/2$. Since $|V_1|<2^{n}/2$, there must exist a nonzero eigenvector of $B_n$ with eigenvalue $\sqrt{n}$ that vanishes on $V_1$. Fix $v$ to be any such vector.

Let $v'$ be the vector whose entries are the absolute values of the entries of $v$. We claim that $\lVert A_f\cdot v'\rVert_2\geq\sqrt{n}\cdot\lVert v'\rVert_2$. To see this, note that for every $x\in V_0$ we have

$(A_f\cdot v')_x = \sum_{y\sim x:\,f(y)\neq f(x)} v'_y = \sum_{y\sim x:\,y\in V_0} v'_y = \sum_{y\sim x} v'_y \geq \sum_{y\in\{0,1\}^{n}} |B_{x,y}v_y| \geq \Big|\sum_{y\in\{0,1\}^{n}} B_{x,y}v_y\Big| = \sqrt{n}\cdot|v_x| = \sqrt{n}\cdot v'_x$. (6)

On the other hand, for $x\in V_1$ we have $(A_f\cdot v')_x=0=v'_x$. Thus the norm of $A_f\cdot v'$ is at least $\sqrt{n}$ times the norm of $v'$, and hence $\lambda(f)=\lVert A_f\rVert\geq\sqrt{n}=\sqrt{\mathsf{deg}(f)}$. ∎
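Theorem 5 can also be checked exhaustively for small $n$. The sketch below (our illustration; helper names are ours) verifies $\mathsf{deg}(f)\leq\lambda(f)^2$ for all $2^{2^3}=256$ functions on $3$ bits:

```python
import numpy as np
from itertools import product

n = 3
N = 1 << n

def fourier_degree(vals):
    # real (Fourier) degree of f, given as a truth table of length 2^n
    return max((bin(S).count("1") for S in range(N)
                if sum(vals[x] * (-1) ** bin(x & S).count("1")
                       for x in range(N)) != 0), default=0)

def lam(vals):
    # spectral sensitivity: norm of the sensitivity-graph adjacency matrix
    A = np.zeros((N, N))
    for x in range(N):
        for i in range(n):
            y = x ^ (1 << i)
            if vals[x] != vals[y]:
                A[x, y] = 1
    return np.linalg.norm(A, 2)

for vals in product([0, 1], repeat=N):  # all 256 functions on 3 bits
    assert fourier_degree(vals) <= lam(vals) ** 2 + 1e-9
print("deg(f) <= lambda(f)^2 verified for all functions on", n, "bits")
```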

Finally, we prove that $\lambda(f)=O(\mathsf{Q}(f))$. We rely on a variant of the adversary method introduced by Barnum, Saks, and Szegedy [BSS03] (see also [SS06]).

Definition 6 (Spectral Adversary method).

Let $\{D_i\}_{i\in[n]}$ and $F$ be matrices of size $\{0,1\}^{n}\times\{0,1\}^{n}$ with entries in $\{0,1\}$ satisfying $D_i[x,y]=1$ if and only if $x_i\neq y_i$, and $F[x,y]=1$ if and only if $f(x)\neq f(y)$. Let $\Gamma$ denote a $\{0,1\}^{n}\times\{0,1\}^{n}$ nonnegative symmetric matrix such that $\Gamma\circ F=\Gamma$ (i.e., the nonzero entries of $\Gamma$ are a subset of the nonzero entries of $F$). Then $\mathsf{SA}(f)=\max_\Gamma\frac{\lVert\Gamma\rVert}{\max_{i\in[n]}\lVert\Gamma\circ D_i\rVert}$.

Barnum, Saks, and Szegedy [BSS03] proved that $\mathsf{Q}(f)=\Omega(\mathsf{SA}(f))$.

Lemma 7.

For all Boolean functions $f$, we have $\mathsf{Q}(f)=\Omega(\mathsf{SA}(f))=\Omega(\lambda(f))$.

Proof.

We prove that $\mathsf{SA}(f)\geq\lambda(f)$. Indeed, one can take $\Gamma$ to be simply the adjacency matrix of $G_f$: for $x,y\in\{0,1\}^{n}$, put $\Gamma[x,y]=1$ if and only if $y\sim x$ in the hypercube and $f(x)\neq f(y)$. We observe that $\lVert\Gamma\rVert=\lambda(f)$. On the other hand, for any $i\in[n]$, $\Gamma\circ D_i$ is the restriction of $\Gamma$ to the sensitive edges in direction $i$. The maximum degree in the graph represented by $\Gamma\circ D_i$ is $1$, hence $\lVert\Gamma\circ D_i\rVert$ is at most $1$. Thus we have

$\mathsf{SA}(f)\geq\frac{\lVert\Gamma\rVert}{\max_{i\in[n]}\lVert\Gamma\circ D_i\rVert}\geq\lambda(f)$. (7)

Combining this with $\mathsf{Q}(f)=\Omega(\mathsf{SA}(f))$ [BSS03], we get $\mathsf{Q}(f)=\Omega(\mathsf{SA}(f))=\Omega(\lambda(f))$. ∎
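The two facts used in the proof, $\lVert\Gamma\rVert=\lambda(f)$ and $\lVert\Gamma\circ D_i\rVert\leq 1$, can be checked numerically. A sketch (our illustration) with $\Gamma=A_f$ for the or function on $3$ bits:

```python
import numpy as np

n = 3
N = 1 << n
OR = lambda x: int(x != 0)

# Gamma = A_f: adjacency matrix of the sensitivity graph of OR
G = np.zeros((N, N))
for x in range(N):
    for i in range(n):
        y = x ^ (1 << i)
        if OR(x) != OR(y):
            G[x, y] = 1

# Gamma o D_i keeps only the sensitive edges in direction i: a partial
# matching, so each spectral norm is at most 1.
norms_i = []
for i in range(n):
    Di = np.array([[float((x >> i) & 1 != (y >> i) & 1) for y in range(N)]
                   for x in range(N)])
    norms_i.append(np.linalg.norm(G * Di, 2))

ratio = np.linalg.norm(G, 2) / max(norms_i)
print(ratio)  # ~ sqrt(3) = lambda(OR_3), a lower bound on SA(OR_3)
```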

From Theorem 5 and Lemma 7 we immediately get Theorem 2.

4 Monotone graph properties

The Aanderaa–Karp–Rosenberg conjectures are a collection of conjectures related to the query complexity of deciding whether an input graph specified by its adjacency matrix satisfies a given property in various models of computation.

Specifically, let the input be an $n$-vertex undirected simple graph specified by its adjacency matrix. This means we can query any unordered pair $\{i,j\}$ with $i,j\in[n]$ and learn whether there is an edge between vertices $i$ and $j$. Note that the input size is $\binom{n}{2}=\Theta(n^{2})$.

A function $f$ on $\binom{n}{2}$ variables is a graph property if it treats the input as a graph and not merely as a string of length $\binom{n}{2}$: the function must be invariant under permutations of the vertices of the graph. In other words, the function can depend only on the isomorphism class of the graph, not on the specific labels of the vertices. A function $f$ is monotone (increasing) if for all $x,y$, $x\leq y\implies f(x)\leq f(y)$, where $x\leq y$ means $x_i\leq y_i$ for all $i$. For a monotone function, flipping a $0$ in the input to a $1$ cannot change the function value from $1$ to $0$. In the context of graph properties, if the input graph has a certain monotone graph property, then adding more edges cannot destroy the property.

Examples of monotone graph properties include "$G$ is connected," "$G$ contains a clique of size $k$," "$G$ contains a Hamiltonian cycle," "$G$ has chromatic number greater than $k$," "$G$ is not planar," and "$G$ has diameter at most $k$." Many commonly encountered graph properties (or their negations) are monotone graph properties. Finally, we say a function $f:\{0,1\}^{n}\to\{0,1\}$ is nontrivial if there exist inputs $x$ and $y$ such that $f(x)\neq f(y)$.

The deterministic Aanderaa–Karp–Rosenberg conjecture, also called the evasiveness conjecture (a function $f$ is called evasive if its deterministic query complexity equals its input size), states that for all nontrivial monotone graph properties $f$, $\mathsf{D}(f)=\binom{n}{2}$. This conjecture remains open to this day, although the weaker claim that $\mathsf{D}(f)=\Omega(n^{2})$ was proved over 40 years ago by Rivest and Vuillemin [RV76]. Several works have improved the constant in their lower bound, and the best current result, due to Scheidweiler and Triesch [ST13], is $\mathsf{D}(f)\geq(1/3-o(1))\cdot n^{2}$. The evasiveness conjecture has been established in several special cases, including when $n$ is prime [KSS84] and when restricted to bipartite graphs [Yao88].

The randomized Aanderaa–Karp–Rosenberg conjecture asserts that all nontrivial monotone graph properties $f$ satisfy $\mathsf{R}(f)=\Omega(n^{2})$. A sequence of increasingly stronger lower bounds, starting with $\Omega(n\log^{1/12}n)$ due to Yao [Yao91], then $\Omega(n^{5/4})$ due to King [Kin88], and $\Omega(n^{4/3})$ due to Hajnal [Haj91], has led to the current best lower bound of $\Omega(n^{4/3}\log^{1/3}n)$ due to Chakrabarti and Khot [CK01]. There are also two lower bounds, due to Friedgut, Kahn, and Wigderson [FKW02] and O'Donnell, Saks, Schramm, and Servedio [OSSS05], that are better than this bound for some graph properties.

The quantum Aanderaa–Karp–Rosenberg conjecture states that all nontrivial monotone graph properties $f$ satisfy $\mathsf{Q}(f)=\Omega(n)$. This is the best lower bound one could hope to prove, since there exist properties with $\mathsf{Q}(f)=O(n)$, such as the property of containing at least one edge. In fact, for any $\alpha\in[1,2]$ it is possible to construct a graph property with quantum query complexity $\Theta(n^{\alpha})$ using known lower bounds for the threshold function [BBC+01].

As stated in the introduction, the question was first raised by Buhrman, Cleve, de Wolf, and Zalka [BCdWZ99], who showed a lower bound of $\Omega(\sqrt{n})$. This was improved by Yao to $\Omega(n^{2/3}\log^{1/6}n)$ using the technique of [CK01] and Ambainis' adversary bound [Amb02]. Better lower bounds are known in some special cases, such as when the property is a subgraph isomorphism property, for which a lower bound of $\Omega(n^{3/4})$ is due to Kulkarni and Podder [KP16].

As stated in Theorem 3, we resolve the quantum Aanderaa–Karp–Rosenberg conjecture and show an optimal Ω(n)\Omega(n) lower bound. The proof combines Theorem 2 with a quadratic lower bound on the degree of nontrivial monotone graph properties. With some work, the original quadratic lower bound on the deterministic query complexity of nontrivial monotone graph properties by Rivest and Vuillemin [RV76] can be modified to prove a similar lower bound for degree. We were not able to find such a proof in the literature, and instead combine the following two claims to obtain the desired claim.

First, we use the result of Dodis and Khanna [DK99, Theorem 2]:

Theorem 8.

For all nontrivial monotone graph properties $f$, $\mathsf{deg}_2(f)=\Omega(n^{2})$.

Here $\mathsf{deg}_2(f)$ is the minimum degree of a polynomial representing the Boolean function $f$ over the finite field with two elements, $\mathbb{F}_2$. We combine this with a standard lemma showing that this measure lower bounds $\mathsf{deg}(f)$. A proof can be found in [O'D09, Proposition 6.23]:

Lemma 9.

For all Boolean functions $f:\{0,1\}^{n}\to\{0,1\}$, we have $\mathsf{deg}_2(f)\leq\mathsf{deg}(f)$.

Combining these with Theorem 2, we get that all nontrivial monotone graph properties $f$ satisfy $\mathsf{Q}(f)=\Omega(n)$, which is the statement of Theorem 3.
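For intuition on these two claims, $\mathsf{deg}_2(f)$ can be computed for small functions via the Möbius (ANF) transform. For instance, parity has real degree $n$ but $\mathbb{F}_2$-degree $1$ (illustrating that the gap in Lemma 9 can be large), while the nontrivial monotone graph property "contains at least one edge" on $3$ vertices has $\mathbb{F}_2$-degree $\binom{3}{2}=3$. A sketch (our illustration; function names are ours):

```python
def deg2(f, m):
    # F2-degree via the Moebius (ANF) transform: after the transform,
    # vals[S] is the coefficient of the monomial prod_{i in S} x_i over F2.
    vals = [f(x) for x in range(1 << m)]
    for i in range(m):
        for x in range(1 << m):
            if (x >> i) & 1:
                vals[x] ^= vals[x ^ (1 << i)]
    return max((bin(S).count("1") for S in range(1 << m) if vals[S]), default=0)

m = 3  # the edges of a graph on n = 3 vertices
has_edge = lambda x: int(x != 0)  # nontrivial monotone graph property
parity = lambda x: bin(x).count("1") % 2
print(deg2(has_edge, m))  # -> 3 = binom(3, 2)
print(deg2(parity, m))    # -> 1: parity is linear over F2
```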

5 Open questions

We saw that λ(f)\operatorname{\lambda}(f) lower-bounds both 𝖠𝖽𝗏(f)\operatorname{\mathsf{Adv}}(f) (and thus 𝖰(f)\operatorname{\mathsf{Q}}(f)) and the sensitivity 𝗌(f)\operatorname{\mathsf{s}}(f). One might conjecture that λ(f)\operatorname{\lambda}(f) lower-bounds all the complexity measures in Figure 1, including 𝖽𝖾𝗀~(f)\widetilde{\operatorname{\mathsf{deg}}}(f).

Conjecture 1.

For all Boolean functions f:{0,1}n{0,1}f:\{0,1\}^{n}\to\{0,1\}, we have λ(f)=O(𝖽𝖾𝗀~(f))\operatorname{\lambda}(f)=O(\widetilde{\operatorname{\mathsf{deg}}}(f)).

If Conjecture 1 were true, Theorem 5 would imply that 𝖽𝖾𝗀(f)=O(𝖽𝖾𝗀~(f)2)\operatorname{\mathsf{deg}}(f)=O(\widetilde{\operatorname{\mathsf{deg}}}(f)^{2}), settling a longstanding conjecture posed by Nisan and Szegedy [NS94]. The current best relation between the two measures is 𝖽𝖾𝗀(f)=O(𝖽𝖾𝗀~(f)6)\operatorname{\mathsf{deg}}(f)=O(\widetilde{\operatorname{\mathsf{deg}}}(f)^{6}). The following conjecture is weaker, and might be easier to tackle first.

Conjecture 2.

For all Boolean functions f:{0,1}n{0,1}f:\{0,1\}^{n}\to\{0,1\}, we have λ(f)=O(𝖽𝖾𝗀(f))\operatorname{\lambda}(f)=O(\operatorname{\mathsf{deg}}(f)).

Another longstanding open problem is to show a quadratic relation between deterministic query complexity and block sensitivity:

Conjecture 3.

For all Boolean functions f:{0,1}n{0,1}f:\{0,1\}^{n}\to\{0,1\}, we have 𝖣(f)=O(𝖻𝗌(f)2)\operatorname{\mathsf{D}}(f)=O(\operatorname{\mathsf{bs}}(f)^{2}).

If this conjecture were true, it would optimally resolve several relationships in Table 1, and would imply, for example, 𝖣(f)=O(𝖱(f)2)\operatorname{\mathsf{D}}(f)=O(\operatorname{\mathsf{R}}(f)^{2}) and 𝖣(f)=O(𝖽𝖾𝗀~(f)4)\operatorname{\mathsf{D}}(f)=O(\operatorname{\mathsf{\widetilde{deg}}}(f)^{4}).

After settling the best relation between 𝖣(f)\operatorname{\mathsf{D}}(f) and 𝖰(f)\operatorname{\mathsf{Q}}(f), the next pressing question is to settle the best relation between 𝖱(f)\operatorname{\mathsf{R}}(f) and 𝖰(f)\operatorname{\mathsf{Q}}(f). Recently, the fourth author [Tal19] showed a power 8/38/3 separation between 𝖱(f)\operatorname{\mathsf{R}}(f) and 𝖰(f)\operatorname{\mathsf{Q}}(f), while the best known relationship is the power 44 upper bound established in this work. We conjecture that both these bounds can be improved.

Conjecture 4.

For all Boolean functions f:{0,1}n{0,1}f:\{0,1\}^{n}\to\{0,1\}, we have 𝖱(f)=O(𝖰(f)3)\operatorname{\mathsf{R}}(f)=O(\operatorname{\mathsf{Q}}(f)^{3}).

Conjecture 5.

There exists a Boolean function f:{0,1}n{0,1}f:\{0,1\}^{n}\to\{0,1\} such that 𝖱(f)=Ω(𝖰(f)3)\operatorname{\mathsf{R}}(f)=\Omega(\operatorname{\mathsf{Q}}(f)^{3}).

We note that there are candidate constructions based on the work of [AA18, ABK16, Tal19] that are conjectured to satisfy 𝖱(f)𝖰(f)3o(1)\operatorname{\mathsf{R}}(f)\geq\operatorname{\mathsf{Q}}(f)^{3-o(1)}. In particular, it suffices to prove a conjectured bound on the Fourier spectrum of deterministic decision trees [Tal19] to prove Conjecture 5.

Finally, for the special case of monotone total Boolean functions ff, Beals et al. [BBC+01] already showed in 1998 that 𝖣(f)=O(𝖰(f)4)\operatorname{\mathsf{D}}(f)=O(\operatorname{\mathsf{Q}}(f)^{4}). It would be interesting to know whether this can be improved, perhaps all the way to 𝖣(f)=O(𝖰(f)2)\operatorname{\mathsf{D}}(f)=O(\operatorname{\mathsf{Q}}(f)^{2}).

References

  • [AA18] Scott Aaronson and Andris Ambainis. Forrelation: A problem that optimally separates quantum from classical computing. SIAM J. Comput., 47(3):982–1038, 2018. doi:10.1137/15M1050902.
  • [ABB+17] Andris Ambainis, Kaspars Balodis, Aleksandrs Belovs, Troy Lee, Miklos Santha, and Juris Smotrovs. Separations in query complexity based on pointer functions. Journal of the ACM, 64(5):1–24, September 2017. doi:10.1145/3106234.
  • [ABK16] Scott Aaronson, Shalev Ben-David, and Robin Kothari. Separations in query complexity using cheat sheets. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016, pages 863–876, 2016. doi:10.1145/2897518.2897644.
  • [Amb02] Andris Ambainis. Quantum lower bounds by quantum arguments. Journal of Computer and System Sciences, 64(4):750–767, June 2002. doi:10.1006/jcss.2002.1826.
  • [Amb13] Andris Ambainis. Superlinear advantage for exact quantum algorithms. In Proceedings of the 45th ACM Symposium on Theory of Computing (STOC 2013), pages 891–900, 2013. doi:10.1145/2488608.2488721.
  • [BBBV97] Charles H. Bennett, Ethan Bernstein, Gilles Brassard, and Umesh Vazirani. Strengths and weaknesses of quantum computing. SIAM Journal on Computing, 26(5):1510–1523, October 1997. doi:10.1137/S0097539796300933.
  • [BBC+01] Robert Beals, Harry Buhrman, Richard Cleve, Michele Mosca, and Ronald de Wolf. Quantum lower bounds by polynomials. Journal of the ACM, 48(4):778–797, 2001. doi:10.1145/502090.502097.
  • [BCdWZ99] Harry Buhrman, Richard Cleve, Ronald de Wolf, and Christof Zalka. Bounds for small-error and zero-error quantum algorithms. In 40th Annual Symposium on Foundations of Computer Science, pages 358–368, 1999. doi:10.1109/SFFCS.1999.814607.
  • [BDW02] Harry Buhrman and Ronald de Wolf. Complexity measures and decision tree complexity: a survey. Theoretical Computer Science, 288(1):21–43, 2002. doi:10.1016/S0304-3975(01)00144-X.
  • [BHT17] Shalev Ben-David, Pooya Hatami, and Avishay Tal. Low-sensitivity functions from unambiguous certificates. In 8th Innovations in Theoretical Computer Science Conference, ITCS 2017, volume 67 of LIPIcs, pages 28:1–28:23. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017. doi:10.4230/LIPIcs.ITCS.2017.28.
  • [BSS03] Howard Barnum, Michael E. Saks, and Mario Szegedy. Quantum query complexity and semi-definite programming. In 18th Annual IEEE Conference on Computational Complexity (Complexity 2003), pages 179–193, 2003. doi:10.1109/CCC.2003.1214419.
  • [BT17] Mark Bun and Justin Thaler. A nearly optimal lower bound on the approximate degree of AC0. In IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS 2017), pages 1–12, 2017. doi:10.1109/FOCS.2017.10.
  • [CK01] Amit Chakrabarti and Subhash Khot. Improved lower bounds on the randomized complexity of graph properties. In Automata, Languages and Programming, pages 285–296, 2001. doi:10.1007/3-540-48224-5_24.
  • [DK99] Yevgeniy Dodis and Sanjeev Khanna. Space-time tradeoffs for graph properties. In Automata, Languages and Programming, pages 291–300, 1999. doi:10.1007/3-540-48523-6_26.
  • [FKW02] Ehud Friedgut, Jeff Kahn, and Avi Wigderson. Computing graph properties by randomized subcube partitions. In Randomization and Approximation Techniques in Computer Science, pages 105–113, 2002. doi:10.1007/3-540-45726-7_9.
  • [GJPW18] Mika Göös, T. S. Jayram, Toniann Pitassi, and Thomas Watson. Randomized communication versus partition number. ACM Transactions on Computation Theory, 10(1):1–20, January 2018. doi:10.1145/3170711.
  • [GL13] Gene H. Golub and Charles F. Van Loan. Matrix computations. Johns Hopkins University Press, Baltimore, 2013. URL: https://jhupbooks.press.jhu.edu/title/matrix-computations.
  • [GPW18] Mika Göös, Toniann Pitassi, and Thomas Watson. Deterministic communication vs. partition number. SIAM Journal on Computing, 47(6):2435–2450, 2018. doi:10.1137/16M1059369.
  • [Gro96] Lov K. Grover. A fast quantum mechanical algorithm for database search. In Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, STOC 1996, pages 212–219, 1996. doi:10.1145/237814.237866.
  • [GSS13] Justin Gilmer, Michael Saks, and Srikanth Srinivasan. Composition limits and separating examples for some Boolean function complexity measures. In Proceedings of 2013 IEEE Conference on Computational Complexity (CCC 2013), pages 185–196, June 2013. doi:10.1109/CCC.2013.27.
  • [Haj91] Péter Hajnal. An Ω(n4/3)\Omega(n^{4/3}) lower bound on the randomized complexity of graph properties. Combinatorica, 11(2):131–143, June 1991. doi:10.1007/bf01206357.
  • [Hua19] Hao Huang. Induced subgraphs of hypercubes and a proof of the sensitivity conjecture. Annals of Mathematics, 190(3):949–955, 2019. doi:10.4007/annals.2019.190.3.6.
  • [Khr71] V. M. Khrapchenko. Method of determining lower bounds for the complexity of P-schemes. Mathematical Notes of the Academy of Sciences of the USSR, 10(1):474–479, 1971. doi:10.1007/bf01747074.
  • [Kin88] Valerie King. Lower bounds on the complexity of graph properties. In Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing, STOC ’88, pages 468–476. Association for Computing Machinery, 1988. doi:10.1145/62212.62258.
  • [Kou93] Elias Koutsoupias. Improvements on Khrapchenko’s theorem. Theoretical Computer Science, 116(2):399–403, August 1993. doi:10.1016/0304-3975(93)90330-V.
  • [KP16] Raghav Kulkarni and Supartha Podder. Quantum Query Complexity of Subgraph Isomorphism and Homomorphism. In 33rd Symposium on Theoretical Aspects of Computer Science (STACS 2016), volume 47 of Leibniz International Proceedings in Informatics (LIPIcs), pages 48:1–48:13. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2016. doi:10.4230/LIPIcs.STACS.2016.48.
  • [KSS84] Jeff Kahn, Michael Saks, and Dean Sturtevant. A topological approach to evasiveness. Combinatorica, 4(4):297–306, December 1984. doi:10.1007/bf02579140.
  • [KT16] Raghav Kulkarni and Avishay Tal. On fractional block sensitivity. Chicago Journal of Theoretical Computer Science, 2016(8), July 2016. doi:10.4086/cjtcs.2016.008.
  • [LLS06] Sophie Laplante, Troy Lee, and Mario Szegedy. The quantum adversary method and classical formula size lower bounds. Computational Complexity, 15(2):163–196, June 2006. doi:10.1007/s00037-006-0212-7.
  • [LNS20] Sophie Laplante, Reza Naserasr, and Anupa Sunny. Sensitivity lower bounds from linear dependencies. Technical Report TR20-002, Electronic Colloquium on Computational Complexity (ECCC), January 2020. URL: https://eccc.weizmann.ac.il/report/2020/002/.
  • [Mid04] Gatis Midrijanis. Exact quantum query complexity for total Boolean functions. arXiv preprint quant-ph/0403168, 2004. arXiv:quant-ph/0403168.
  • [MSS07] Frédéric Magniez, Miklos Santha, and Mario Szegedy. Quantum algorithms for the triangle problem. SIAM Journal on Computing, 37(2):413–424, 2007. doi:10.1137/050643684.
  • [Nis91] Noam Nisan. CREW prams and decision trees. SIAM J. Comput., 20(6):999–1007, 1991. doi:10.1137/0220062.
  • [NS94] Noam Nisan and Mario Szegedy. On the degree of Boolean functions as real polynomials. Computational Complexity, 4(4):301–313, December 1994. doi:10.1007/BF01263419.
  • [NW95] Noam Nisan and Avi Wigderson. On rank vs. communication complexity. Combinatorica, 15(4):557–565, 1995. doi:10.1007/BF01192527.
  • [O’D09] Ryan O’Donnell. Analysis of Boolean Functions. Cambridge University Press, 2009. doi:10.1017/cbo9781139814782.
  • [OSSS05] Ryan O’Donnell, Michael Saks, Oded Schramm, and Rocco A. Servedio. Every decision tree has an influential variable. In 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS’05), pages 31–39, 2005. doi:10.1109/SFCS.2005.34.
  • [Rub95] David Rubinstein. Sensitivity vs. block sensitivity of Boolean functions. Combinatorica, 15(2):297–299, 1995. doi:10.1007/BF01200762.
  • [RV76] Ronald L. Rivest and Jean Vuillemin. On recognizing graph properties from adjacency matrices. Theoretical Computer Science, 3(3):371–384, 1976. doi:10.1016/0304-3975(76)90053-0.
  • [SS06] Robert Spalek and Mario Szegedy. All quantum adversary methods are equivalent. Theory of Computing, 2(1):1–18, 2006. doi:10.4086/toc.2006.v002a001.
  • [ST13] Robert Scheidweiler and Eberhard Triesch. A lower bound for the complexity of monotone graph properties. SIAM Journal on Discrete Mathematics, 27(1):257–265, 2013. doi:10.1137/120888703.
  • [SW86] Michael Saks and Avi Wigderson. Probabilistic Boolean decision trees and the complexity of evaluating game trees. In Proceedings of the 27th Annual Symposium on Foundations of Computer Science, SFCS ’86, pages 29–38, 1986. doi:10.1109/SFCS.1986.44.
  • [Tal13] Avishay Tal. Properties and applications of Boolean function composition. In Proceedings of the 4th Conference on Innovations in Theoretical Computer Science, ITCS ’13, pages 441–454, 2013. doi:10.1145/2422436.2422485.
  • [Tal19] Avishay Tal. Towards optimal separations between quantum and randomized query complexities. arXiv, 2019. arXiv:1912.12561.
  • [Yao77] Andrew Chi-Chih Yao. Probabilistic computations: Toward a unified measure of complexity. In 18th Annual Symposium on Foundations of Computer Science (SFCS 1977), pages 222–227, 1977. doi:10.1109/SFCS.1977.24.
  • [Yao88] Andrew Chi-Chih Yao. Monotone bipartite graph properties are evasive. SIAM Journal on Computing, 17(3):517–520, 1988. doi:10.1137/0217031.
  • [Yao91] Andrew Chi-Chih Yao. Lower bounds to randomized algorithms for graph properties. Journal of Computer and System Sciences, 42(3):267–287, 1991. doi:10.1016/0022-0000(91)90003-N.

Appendix A Properties of the measure λ(f)\operatorname{\lambda}(f)

We show that the measure λ(f)\operatorname{\lambda}(f) satisfies various elegant properties. First, it can be defined in multiple ways, one of which was introduced by Koutsoupias back in 1993 [Kou93]. It also has a formulation as a special case of the quantum adversary bound, and hence can be expressed as a semidefinite program closely related to that of the quantum adversary bound. Due to this characterization, λ(f)\operatorname{\lambda}(f) can be viewed as both a maximization problem and a minimization problem. These equivalent formulations are described in Section A.1.

Second, we show that λ(f)𝗌0(f)𝗌1(f)\operatorname{\lambda}(f)\leq\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)}, which was already observed by Laplante, Lee, and Szegedy [LLS06] (though we give a slightly different proof). Finally, we show lower bounds on λ(f)\operatorname{\lambda}(f) and an optimal quadratic separation between λ(f)\operatorname{\lambda}(f) and 𝗌(f)\operatorname{\mathsf{s}}(f).

A.1 Equivalent formulations

Theorem 10.

For all Boolean functions f:{0,1}n{0,1}f\colon\{0,1\}^{n}\to\{0,1\}, we have

λ(f)=𝖪(f)=𝖠𝖽𝗏1(f)=𝖠𝖽𝗏1±(f),\operatorname{\lambda}(f)=\operatorname{\mathsf{K}}(f)=\operatorname{\mathsf{Adv}}_{1}(f)=\operatorname{\mathsf{Adv}}_{1}^{\pm}(f), (8)

where the measures 𝖪(f)\operatorname{\mathsf{K}}(f), 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f), and 𝖠𝖽𝗏1±(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f) are defined below. Furthermore, 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) itself has several equivalent formulations: 𝖠𝖽𝗏1(f)𝖲𝖠1(f)=𝖲𝖶𝖠1(f)=𝖬𝖬1(f)=𝖦𝖲𝖠1(f)\operatorname{\mathsf{Adv}}_{1}(f)\coloneqq\operatorname{\mathsf{SA}}_{1}(f)=\operatorname{\mathsf{SWA}}_{1}(f)=\operatorname{\mathsf{MM}}_{1}(f)=\operatorname{\mathsf{GSA}}_{1}(f).

We now define all these measures before proving this theorem.

Koutsoupias complexity 𝖪(f)\operatorname{\mathsf{K}}(f).

For a Boolean function ff, let Af1(0)A\subseteq f^{-1}(0), and let Bf1(1)B\subseteq f^{-1}(1). Let QQ be the matrix with rows and columns labeled by AA and BB respectively, with Q[x,y]=1Q[x,y]=1 if the Hamming distance of xx and yy is 11, and Q[x,y]=0Q[x,y]=0 otherwise. Koutsoupias [Kou93] observed that Q2\|Q\|^{2} is a lower bound on formula size, for every such choice of AA and BB. We define 𝖪(f)\operatorname{\mathsf{K}}(f) to be the maximum value of Q\|Q\| over choices of AA and BB. Thus 𝖪(f)2\operatorname{\mathsf{K}}(f)^{2} is a lower bound on the formula size of ff.
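As an illustrative sketch (ours, not from the paper), 𝖪(f)\operatorname{\mathsf{K}}(f) can be computed by brute force for small functions, taking A=f1(0)A=f^{-1}(0) and B=f1(1)B=f^{-1}(1) in full, which loses nothing, as the proof of Theorem 10 below shows:

```python
import numpy as np
from math import isclose, sqrt

def K(f, n):
    # Q[x, y] = 1 iff x in f^{-1}(0) and y in f^{-1}(1) are at Hamming distance 1
    zeros = [x for x in range(1 << n) if f(x) == 0]
    ones = [y for y in range(1 << n) if f(y) == 1]
    Q = np.array([[1.0 if bin(x ^ y).count("1") == 1 else 0.0
                   for y in ones] for x in zeros])
    return np.linalg.norm(Q, 2)  # spectral norm = largest singular value

OR = lambda x: 0 if x == 0 else 1  # OR_n as a function of a bitmask
assert isclose(K(OR, 4), sqrt(4))  # K(OR_n) = sqrt(n)
```

For ORn\mathrm{OR}_n the matrix QQ is a single row with nn ones, so 𝖪(ORn)=n\operatorname{\mathsf{K}}(\mathrm{OR}_n)=\sqrt{n}, matching the familiar adversary bound for search.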

Single-bit positive adversary 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f).

We define 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) as a version of the adversary bound where we are only allowed to put nonzero weight on input pairs (x,y)(x,y) where f(x)f(y)f(x)\neq f(y) and the Hamming distance between xx and yy is exactly 11. We will define 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) in terms of the spectral adversary version, which we also denote by 𝖲𝖠1(f)\operatorname{\mathsf{SA}}_{1}(f). 𝖠𝖽𝗏1(f)=𝖲𝖠1(f)\operatorname{\mathsf{Adv}}_{1}(f)=\operatorname{\mathsf{SA}}_{1}(f) is defined as the maximum of

Γmaxi[n]ΓDi\frac{\|\Gamma\|}{\max_{i\in[n]}\|\Gamma\circ D_{i}\|} (9)

over matrices Γ\Gamma of a special form. We require Γ\Gamma satisfy the following: (1) its entries are nonnegative reals; (2) its rows and columns are indexed by Dom(f)\operatorname{Dom}(f); (3) Γ[x,y]=0\Gamma[x,y]=0 whenever f(x)=f(y)f(x)=f(y); (4) Γ[x,y]=0\Gamma[x,y]=0 whenever the Hamming distance of xx and yy is not 11; and (5) Γ\Gamma is not all 0. In the above expression, \circ refers to the Hadamard (entrywise) product, Dom(f)\operatorname{Dom}(f) is the domain of ff, and DiD_{i} is the {0,1}\{0,1\}-valued matrix with Di[x,y]=1D_{i}[x,y]=1 if and only if xiyix_{i}\neq y_{i}.

Single-bit negative adversary 𝖠𝖽𝗏1±(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f).

We define 𝖠𝖽𝗏1±(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f) using the same definition as 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) above, except that the matrix Γ\Gamma is allowed to have negative entries. Note that since this is a relaxation of the conditions on Γ\Gamma, we clearly have 𝖠𝖽𝗏1±(f)𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f)\geq\operatorname{\mathsf{Adv}}_{1}(f).

Single-bit strong weighted adversary 𝖲𝖶𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f).

We define 𝖲𝖶𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f) as a single-bit version of the strong weighted adversary method 𝖲𝖶𝖠(f)\operatorname{\mathsf{SWA}}(f) from [SS06]. For this definition, we say a weight function w:Dom(f)×Dom(f)[0,)w\colon\operatorname{Dom}(f)\times\operatorname{Dom}(f)\to[0,\infty) is feasible if it is symmetric (i.e., w(x,y)=w(y,x)w(x,y)=w(y,x)) and if it satisfies the conditions on Γ\Gamma above (i.e., it places weight 0 on a pair (x,y)(x,y) unless both f(x)f(y)f(x)\neq f(y) and the Hamming distance between xx and yy is 11). We view such a feasible weight scheme ww as the weights on a weighted bipartite graph, where the left vertex set is f1(0)f^{-1}(0) and the right vertex set is f1(1)f^{-1}(1). We let wt(x)yw(x,y)wt(x)\coloneqq\sum_{y}w(x,y) denote the weighted degree of xx in this graph, i.e., the sum of the weights of its incident edges. Then 𝖲𝖶𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f) is defined as the maximum, over such feasible weight schemes ww, of

minx,i:w(x,xi)>0wt(x)wt(xi)w(x,xi).\min_{x,i:w(x,x^{i})>0}\frac{\sqrt{wt(x)wt(x^{i})}}{w(x,x^{i})}. (10)

Here xx ranges over Dom(f)\operatorname{Dom}(f), ii ranges over [n][n], and xix^{i} denotes the string xx with bit ii flipped.777Readers familiar with the adversary bound should note that this definition is analogous to a weighted version of Ambainis’s original adversary method; in the original method, the denominator was the geometric mean of (a) the weight of the neighbors of xx which disagree with xx at ii, and (b) the weight of the neighbors of xix^{i} which disagree with xix^{i} at ii; but in our case, both (a) and (b) are simply w(x,xi)w(x,x^{i}), since xix^{i} is the only string that disagrees with xx on bit ii and is connected to xx in the bipartite graph.
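To get a feel for the definition, here is a numeric sketch (ours): for the weight scheme w(x,y)=v[x]v[y]Af[x,y]w(x,y)=v[x]v[y]A_{f}[x,y] built from a principal eigenvector vv of AfA_{f}, every ratio appearing in the objective equals Af\|A_{f}\|. We check this for OR3\mathrm{OR}_3 with numpy.

```python
import numpy as np

n, N = 3, 1 << 3
f = lambda x: 0 if x == 0 else 1                      # OR_3
Af = np.array([[1.0 if f(x) != f(y) and bin(x ^ y).count("1") == 1 else 0.0
                for y in range(N)] for x in range(N)])
lam, V = np.linalg.eigh(Af)                           # eigenvalues in ascending order
v = np.abs(V[:, -1])                                  # principal eigenvector, nonnegative
w = Af * np.outer(v, v)                               # w(x,y) = v[x] v[y] A_f[x,y]
wt = w.sum(axis=1)                                    # weighted degrees wt(x)
for x in range(N):
    for i in range(n):
        y = x ^ (1 << i)
        if w[x, y] > 1e-12:
            # every ratio in the SWA_1 objective equals ||A_f|| = sqrt(3)
            assert np.isclose(np.sqrt(wt[x] * wt[y]) / w[x, y], lam[-1])
```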

Single-bit minimax adversary 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f).

Unlike the other forms, we define 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f) as a minimization problem rather than a maximization problem. We say a weight function w:Dom(f)×[n][0,)w\colon\operatorname{Dom}(f)\times[n]\to[0,\infty) is feasible if for all x,yDom(f)x,y\in\operatorname{Dom}(f) with f(x)f(y)f(x)\neq f(y) and Hamming distance 11, we have w(x,i)w(y,i)1w(x,i)w(y,i)\geq 1, where ii is the bit on which xx and yy disagree. 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f) is defined as the minimum, over such feasible weight schemes ww, of

maxxDom(f)i[n]w(x,i).\max_{x\in\operatorname{Dom}(f)}\sum_{i\in[n]}w(x,i). (11)
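For example (our sketch, not from the paper): for ORn\mathrm{OR}_n the only constrained pairs are (0n,ei)(0^{n},e_{i}), so balancing w(0n,i)w(ei,i)=1w(0^{n},i)\,w(e_{i},i)=1 while equalizing the row sums yields a feasible scheme of value n\sqrt{n}, matching λ(ORn)\operatorname{\lambda}(\mathrm{OR}_n).

```python
from math import isclose, sqrt

def mm1_value_OR(n):
    # weights: only x = 0^n and x = e_i carry weight, on coordinate i
    w = {}
    for i in range(n):
        w[(0, i)] = 1 / sqrt(n)
        w[(1 << i, i)] = sqrt(n)
    # feasibility: w(x,i) * w(x^e_i, i) >= 1 on every sensitive pair
    for i in range(n):
        assert w[(0, i)] * w[(1 << i, i)] >= 1 - 1e-12
    # objective: max over x of sum_i w(x, i)
    return max(sum(w.get((x, i), 0.0) for i in range(n)) for x in range(1 << n))

assert isclose(mm1_value_OR(4), 2.0)  # = sqrt(4) = lambda(OR_4)
```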

Semidefinite program version 𝖦𝖲𝖠1(f)\operatorname{\mathsf{GSA}}_{1}(f).

We define 𝖦𝖲𝖠1(f)\operatorname{\mathsf{GSA}}_{1}(f) to be the optimal value of the following semidefinite program.

maximizeZ,Afsubject toΔ is diagonaltrΔ=1ΔZDi0i[n]Z0\begin{array}[]{lll}\text{maximize}&\langle Z,A_{f}\rangle&\\ \text{subject to}&\Delta\mbox{ is diagonal}&\\ &\operatorname{tr}\Delta=1&\\ &\Delta-Z\circ D_{i}\succeq 0&\forall i\in[n]\\ &Z\geq 0&\end{array} (12)

Here ZZ and Δ\Delta are variable matrices with rows and columns indexed by Dom(f)\operatorname{Dom}(f), AfA_{f} is the {0,1}\{0,1\}-matrix with Af[x,y]=1A_{f}[x,y]=1 if and only if both f(x)f(y)f(x)\neq f(y) and (x,y)(x,y) have Hamming distance 11, and DiD_{i} is the {0,1}\{0,1\}-matrix with Di[x,y]=1D_{i}[x,y]=1 if and only if xiyix_{i}\neq y_{i}.
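A numeric sketch (ours) of a feasible solution: taking vv to be a principal eigenvector of AfA_{f}, the pair Z=Af(vvT)Z=A_{f}\circ(vv^{T}), Δ=I(vvT)\Delta=I\circ(vv^{T}) satisfies all the constraints and achieves Z,Af=Af\langle Z,A_{f}\rangle=\|A_{f}\|, as the proof below makes precise. We verify this for OR3\mathrm{OR}_3.

```python
import numpy as np

n, N = 3, 1 << 3
f = lambda x: 0 if x == 0 else 1                      # OR_3
Af = np.array([[1.0 if f(x) != f(y) and bin(x ^ y).count("1") == 1 else 0.0
                for y in range(N)] for x in range(N)])
lam, V = np.linalg.eigh(Af)
v = np.abs(V[:, -1])                                  # principal eigenvector, unit norm
Z, Delta = Af * np.outer(v, v), np.diag(v * v)
assert np.isclose(np.trace(Delta), 1.0)               # tr(Delta) = 1
for i in range(n):
    Di = np.array([[1.0 if ((x ^ y) >> i) & 1 else 0.0
                    for y in range(N)] for x in range(N)])
    # PSD constraint: Delta - Z o D_i is positive semidefinite
    assert np.linalg.eigvalsh(Delta - Z * Di).min() >= -1e-9
assert np.isclose((Z * Af).sum(), lam[-1])            # objective = ||A_f|| = sqrt(3)
```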

We now prove Theorem 10.

Proof.

Recall that in the definition of 𝖪(f)\operatorname{\mathsf{K}}(f), we picked Af1(0)A\subseteq f^{-1}(0) and Bf1(1)B\subseteq f^{-1}(1) and defined the resulting matrix QQ. Since the spectral norm of a submatrix is always smaller than or equal to the spectral norm of the original matrix, we can always assume without loss of generality that A=f1(0)A=f^{-1}(0) and B=f1(1)B=f^{-1}(1). Then 𝖪(f)=Q\operatorname{\mathsf{K}}(f)=\|Q\| for the resulting matrix QQ with rows and columns indexed by f1(0)f^{-1}(0) and f1(1)f^{-1}(1) respectively. Now, recall that AfA_{f} was the adjacency matrix of the graph GfG_{f}, which has an edge between xx and yy if f(x)f(y)f(x)\neq f(y) and the Hamming distance between xx and yy is 11. The rows and columns of AfA_{f} are each indexed by Dom(f)\operatorname{Dom}(f). By rearranging them, we can make AfA_{f} block anti-diagonal with off-diagonal blocks QQ and QQ^{\dagger}. From there it follows that Af=Q\|A_{f}\|=\|Q\|, so λ(f)=𝖪(f)\operatorname{\lambda}(f)=\operatorname{\mathsf{K}}(f).

Next, recall that 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) is defined as the maximum ratio Γ/maxiΓDi\|\Gamma\|/\max_{i}\|\Gamma\circ D_{i}\| over valid choices of Γ\Gamma. Note that since Γ[x,y]\Gamma[x,y] can only be nonzero if xx and yy disagree on one bit, ΓDi\Gamma\circ D_{i} is nonzero only on pairs (x,y)(x,y) which disagree exactly on bit ii. In other words, if PiP_{i} denotes the {0,1}\{0,1\}-valued matrix with Pi[x,y]=1P_{i}[x,y]=1 if and only if xx and yy disagree on bit ii and only on ii, then ΓDi\Gamma\circ D_{i} is nonzero only in entries where PiP_{i} is 11. Now, note that PiP_{i} is a permutation matrix. Hence, by rearranging the rows and columns of ΓDi\Gamma\circ D_{i}, we can get it to be diagonal. This means ΓDi\|\Gamma\circ D_{i}\| is the maximum entry of ΓDi\Gamma\circ D_{i}, and hence maxiΓDi\max_{i}\|\Gamma\circ D_{i}\| is the maximum entry of Γ\Gamma. It follows that 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) is the maximum of Γ\|\Gamma\| over feasible matrices Γ\Gamma with max(Γ)1\max(\Gamma)\leq 1, where max(Γ)=maxij|Γij|\max(\Gamma)=\max_{ij}|\Gamma_{ij}|. This argument also holds for 𝖠𝖽𝗏1±(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f), which is the maximum of Γ\|\Gamma\| over feasible (possibly negative) matrices Γ\Gamma with max(Γ)1\max(\Gamma)\leq 1.

Next, observe that negative weights never help for maximizing Γ\|\Gamma\|: indeed, if we had Γ\Gamma with negative entries maximizing Γ\|\Gamma\|, then we would have vectors uu and vv with u2=v2=1\|u\|_{2}=\|v\|_{2}=1 and uTΓv=Γu^{T}\Gamma v=\|\Gamma\|; but then replacing uu and vv with their entry-wise absolute values, and replacing Γ\Gamma with its entry-wise absolute value Γ\Gamma^{\prime}, we clearly get that ΓΓ\|\Gamma^{\prime}\|\geq\|\Gamma\|. However, max(Γ)=max(Γ)\max(\Gamma^{\prime})=\max(\Gamma), so Γ\Gamma^{\prime} remains feasible. This means we can always take the maximizing matrix Γ\Gamma to be nonnegative, so 𝖠𝖽𝗏1±(f)=𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}^{\pm}(f)=\operatorname{\mathsf{Adv}}_{1}(f). We can similarly assume that the unit vectors uu and vv maximizing uTΓvu^{T}\Gamma v are nonnegative.

Finally, consider the maximizing matrix Γ\Gamma and the maximizing unit vectors uu and vv, all nonnegative, and satisfying max(Γ)1\max(\Gamma)\leq 1. Note that the expression uTΓvu^{T}\Gamma v is nondecreasing in the entries of Γ\Gamma, since everything is nonnegative. Hence to maximize uTΓvu^{T}\Gamma v, we can always take every nonzero entry of Γ\Gamma to be 11, since this maintains max(Γ)1\max(\Gamma)\leq 1. In other words, the matrix maximizing Γ\|\Gamma\| will always simply be AfA_{f}, and hence 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f) is always exactly equal to λ(f)\operatorname{\lambda}(f).

It remains to show that 𝖲𝖠1(f)=𝖲𝖶𝖠1(f)=𝖬𝖬1(f)=𝖦𝖲𝖠1(f)\operatorname{\mathsf{SA}}_{1}(f)=\operatorname{\mathsf{SWA}}_{1}(f)=\operatorname{\mathsf{MM}}_{1}(f)=\operatorname{\mathsf{GSA}}_{1}(f). The proof of this essentially follows the arguments in [SS06] for the regular positive adversary, though some steps are a little simpler. To start, we’ve seen that 𝖲𝖠1(f)=λ(f)\operatorname{\mathsf{SA}}_{1}(f)=\operatorname{\lambda}(f). Since AfA_{f} is symmetric, we have λ(f)=vTAfv\operatorname{\lambda}(f)=v^{T}A_{f}v for some unit vector vv, which we’ve established is nonnegative; this vector is also an eigenvector, so Afv=λ(f)vA_{f}v=\operatorname{\lambda}(f)v. Consider the weight scheme w(x,y)=v[x]v[y]Af[x,y]w(x,y)=v[x]v[y]A_{f}[x,y]. Then wt(x)=yv[x]v[y]Af[x,y]=v[x](Afv)[x]=λ(f)v[x]2wt(x)=\sum_{y}v[x]v[y]A_{f}[x,y]=v[x](A_{f}v)[x]=\operatorname{\lambda}(f)v[x]^{2}. Hence if w(x,xi)>0w(x,x^{i})>0, we have

wt(x)wt(xi)w(x,xi)=λ(f)v[x]v[xi]v[x]v[xi]Af[x,xi]=λ(f).\frac{\sqrt{wt(x)wt(x^{i})}}{w(x,x^{i})}=\frac{\operatorname{\lambda}(f)v[x]v[x^{i}]}{v[x]v[x^{i}]A_{f}[x,x^{i}]}=\operatorname{\lambda}(f). (13)

This means 𝖲𝖶𝖠1(f)𝖲𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f)\geq\operatorname{\mathsf{SA}}_{1}(f). In the other direction, let ww be a feasible weight scheme for 𝖲𝖶𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f), let Γ[x,y]=w(x,y)/wt(x)wt(y)\Gamma[x,y]=w(x,y)/\sqrt{wt(x)wt(y)}, and let v[x]=wt(x)/Wv[x]=\sqrt{wt(x)/W}, where W=xwt(x)W=\sum_{x}wt(x). Then v22=xwt(x)/W=1\|v\|_{2}^{2}=\sum_{x}wt(x)/W=1, and

vTΓv=x,ywt(x)wt(y)w(x,y)/Wwt(x)wt(y)=(1/W)x,yw(x,y)=1.v^{T}\Gamma v=\sum_{x,y}\sqrt{wt(x)wt(y)}w(x,y)/W\sqrt{wt(x)wt(y)}=(1/W)\sum_{x,y}w(x,y)=1. (14)

Hence Γ1\|\Gamma\|\geq 1. On the other hand, we have max(Γ)=maxx,yw(x,y)/wt(x)wt(y)\max(\Gamma)=\max_{x,y}w(x,y)/\sqrt{wt(x)wt(y)}. This means that the ratio Γ/max(Γ)\|\Gamma\|/\max(\Gamma) is at least minx,y:w(x,y)>0wt(x)wt(y)/w(x,y)\min_{x,y:w(x,y)>0}\sqrt{wt(x)wt(y)}/w(x,y), which for an optimal weight scheme ww equals 𝖲𝖶𝖠1(f)\operatorname{\mathsf{SWA}}_{1}(f); thus 𝖲𝖠1(f)𝖲𝖶𝖠1(f)\operatorname{\mathsf{SA}}_{1}(f)\geq\operatorname{\mathsf{SWA}}_{1}(f).

Next we examine 𝖦𝖲𝖠1(f)\operatorname{\mathsf{GSA}}_{1}(f). Consider a solution (Z,Δ)(Z,\Delta) to this semidefinite program and define Γ=ZMAf\Gamma=Z\circ M\circ A_{f}, where MM is defined as M=uuTM=uu^{T} and uu is defined by u[x]=1/Δ[x,x]u[x]=1/\sqrt{\Delta[x,x]} when Δ[x,x]>0\Delta[x,x]>0 and u[x]=0u[x]=0 otherwise. Recall that Δ\Delta is diagonal and that ΔZDi0\Delta-Z\circ D_{i}\succeq 0 for all ii. Since positive semidefinite matrices are symmetric, ZDiZ\circ D_{i} must be symmetric for all ii, so ZZ is symmetric. Moreover, the diagonal of ZDiZ\circ D_{i} is all zeros, so we must have Δ0\Delta\geq 0. Further, if Δ[x,x]=0\Delta[x,x]=0 for some xx, we must have the corresponding row and column of ZZ be all zeros. If we let Δ\Delta^{\prime} and ZZ^{\prime} be Δ\Delta and ZZ with the all-zero rows and columns deleted, then it is clear that ΔZDi0\Delta-Z\circ D_{i}\succeq 0 if and only if ΔZDi0\Delta^{\prime}-Z^{\prime}\circ D_{i}\succeq 0. Defining MM^{\prime} as MM with those rows and columns deleted and uu^{\prime} as uu with those entries deleted, we have M=u(u)T>0M^{\prime}=u^{\prime}(u^{\prime})^{T}>0. Observe that ΔZDi0\Delta^{\prime}-Z^{\prime}\circ D_{i}\succeq 0 if and only if vT(ΔZDi)v0v^{T}(\Delta^{\prime}-Z^{\prime}\circ D_{i})v\geq 0 for all vectors vv, which is if and only if (vu)T(ΔZDi)(vu)0(v\circ u^{\prime})^{T}(\Delta^{\prime}-Z^{\prime}\circ D_{i})(v\circ u^{\prime})\geq 0 for all vectors vv (since we have u>0u^{\prime}>0). This, in turn, is equivalent to M(ΔZDi)0M^{\prime}\circ(\Delta^{\prime}-Z^{\prime}\circ D_{i})\succeq 0. Since MΔ=IM^{\prime}\circ\Delta^{\prime}=I, this is equivalent to IMZDi0I-M^{\prime}\circ Z^{\prime}\circ D_{i}\succeq 0, which is in turn equivalent to IMZDi0I-M\circ Z\circ D_{i}\succeq 0. Since Z0Z\geq 0 and we are maximizing Z,Af\langle Z,A_{f}\rangle, it never helps for ZZ to have nonzero entries in places where AfA_{f} is 0. 
Hence we can assume without loss of generality that Z=ZAfZ=Z\circ A_{f}, which means the constraint becomes IΓDi0I-\Gamma\circ D_{i}\succeq 0, where we defined Γ=MZAf\Gamma=M\circ Z\circ A_{f}. We thus have ΓDi1\|\Gamma\circ D_{i}\|\leq 1. On the other hand, letting v[x]=Δ[x,x]v[x]=\sqrt{\Delta[x,x]}, we have

vTΓv=x,yv[x]v[y]M[x,y]Z[x,y]Af[x,y]=x,y:Δ[x,x],Δ[y,y]>0Z[x,y]Af[x,y]=Z,Af.v^{T}\Gamma v=\sum_{x,y}v[x]v[y]M[x,y]Z[x,y]A_{f}[x,y]=\sum_{x,y:\Delta[x,x],\Delta[y,y]>0}Z[x,y]A_{f}[x,y]=\langle Z,A_{f}\rangle. (15)

Hence 𝖲𝖠1(f)𝖦𝖲𝖠1(f)\operatorname{\mathsf{SA}}_{1}(f)\geq\operatorname{\mathsf{GSA}}_{1}(f). The reduction in the other direction works similarly: start with an adversary matrix Γ\Gamma with max(Γ)1\max(\Gamma)\leq 1, and let vv be its principal eigenvector. Then set Z=Γ(vvT)Z=\Gamma\circ(vv^{T}) and Δ=I(vvT)\Delta=I\circ(vv^{T}). Then IΓDi0I-\Gamma\circ D_{i}\succeq 0, which implies that ΔZDi0\Delta-Z\circ D_{i}\succeq 0. We also have trΔ=1\operatorname{tr}\Delta=1, Z0Z\geq 0, and Z,Af=Γ\langle Z,A_{f}\rangle=\|\Gamma\|.

Finally, we handle 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f). To do so, we first take the dual of the semidefinite program for 𝖦𝖲𝖠1(f)\operatorname{\mathsf{GSA}}_{1}(f). This dual has the form

minimizeαsubject toiRiIαIiRiDiAfRi0i[n]\begin{array}[]{lll}\text{minimize}&\alpha&\\ \text{subject to}&\sum_{i}R_{i}\circ I\leq\alpha I&\\ &\sum_{i}R_{i}\circ D_{i}\geq A_{f}&\\ &R_{i}\succeq 0&\forall i\in[n]\end{array} (16)

where the variables are α\alpha (a scalar) and matrices RiR_{i}, each with rows and columns indexed by Dom(f)\operatorname{Dom}(f). Strong duality holds because, when AfA_{f} is not all zeros, the semidefinite program defining 𝖦𝖲𝖠1(f)\operatorname{\mathsf{GSA}}_{1}(f) has a strictly feasible solution (just take ZZ to equal ϵAf\epsilon A_{f} for a small enough positive constant ϵ\epsilon, and take Δ=I/|Dom(f)|\Delta=I/|\operatorname{Dom}(f)|). This means the optimal value of the minimization problem above equals 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f). It remains to show that this optimal value also equals 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f).

Let α\alpha and {Ri}i\{R_{i}\}_{i} be a feasible solution to the semidefinite program above. Since Ri0R_{i}\succeq 0, we have Ri=XiXiTR_{i}=X_{i}X_{i}^{T} for some matrix XiX_{i}. Define w(x,i)=Ri[x,x]w(x,i)=R_{i}[x,x]. Note that we also have w(x,i)=aXi[x,a]2w(x,i)=\sum_{a}X_{i}[x,a]^{2}. Then by Cauchy–Schwarz, w(x,i)w(y,i)(aXi[x,a]Xi[y,a])2=(XiXiT)[x,y]2=Ri[x,y]2w(x,i)w(y,i)\geq\left(\sum_{a}X_{i}[x,a]X_{i}[y,a]\right)^{2}=(X_{i}X_{i}^{T})[x,y]^{2}=R_{i}[x,y]^{2}. If xx and yy are such that Af[x,y]=1A_{f}[x,y]=1, then they disagree in only one bit ii, and hence Di[x,y]=1D_{i}[x,y]=1 for that ii and Dj[x,y]=0D_{j}[x,y]=0 for all jij\neq i. Since we have iRiDiAf\sum_{i}R_{i}\circ D_{i}\geq A_{f}, we conclude that for all such pairs (x,y)(x,y), we have w(x,i)w(y,i)Ri[x,y]2Af[x,y]2=1w(x,i)w(y,i)\geq R_{i}[x,y]^{2}\geq A_{f}[x,y]^{2}=1 on the bit ii where xx and yy differ; hence the weight scheme ww is feasible. Furthermore, for any xx, iw(x,i)=iRi[x,x]αI[x,x]=α\sum_{i}w(x,i)=\sum_{i}R_{i}[x,x]\leq\alpha I[x,x]=\alpha. Hence 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f) is at most the optimal value of this semidefinite program.

In the other direction, consider a feasible weight scheme ww, and define Ri[x,y]=w(x,i)w(y,i)R_{i}[x,y]=\sqrt{w(x,i)w(y,i)}. Then Ri=uiuiTR_{i}=u_{i}u_{i}^{T}, where uiu_{i} is the vector with ui[x]=w(x,i)u_{i}[x]=\sqrt{w(x,i)}; hence Ri0R_{i}\succeq 0. Moreover, Ri0R_{i}\geq 0, and for a pair (x,y)(x,y) with Af[x,y]=1A_{f}[x,y]=1, there is some ii which is the unique bit they disagree on, and hence w(x,i)w(y,i)1w(x,i)w(y,i)\geq 1; but this means that Ri[x,y]1R_{i}[x,y]\geq 1, and so (RiDi)[x,y]1=Af[x,y](R_{i}\circ D_{i})[x,y]\geq 1=A_{f}[x,y]. Finally, iRi[x,x]=iw(x,i)\sum_{i}R_{i}[x,x]=\sum_{i}w(x,i), which means that iRiI𝖬𝖬1(f)I\sum_{i}R_{i}\circ I\leq\operatorname{\mathsf{MM}}_{1}(f)\cdot I, as desired. ∎

A.2 Upper bounds

We now show a slightly better upper bound on λ(f)\operatorname{\lambda}(f): it is upper bounded by the geometric mean of the 0-sensitivity and the 1-sensitivity, which can be smaller than 𝗌(f)\operatorname{\mathsf{s}}(f).

We provide two proofs of this. The first uses the λ(f)\operatorname{\lambda}(f) formulation together with a linear algebra argument about matrix norms. This proof is due to Laplante, Lee, and Szegedy [LLS06], who observed this fact about the measure 𝖪(f)\operatorname{\mathsf{K}}(f).

To describe this proof, we first recall some matrix norms. For a vector vnv\in{\mathbb{R}}^{n}, the pp-norm for a positive integer pp is defined as vp=(i[n]|vi|p)1/p\left\lVert v\right\rVert_{p}=(\sum_{i\in[n]}|v_{i}|^{p})^{1/p}. We also define v=maxi[n]|vi|\left\lVert v\right\rVert_{\infty}=\max_{i\in[n]}|v_{i}|. Note that v1\left\lVert v\right\rVert_{1} is simply the sum of the absolute values of the entries of the vector.

Similarly, for a matrix An×mA\in{\mathbb{R}}^{n\times m}, we define the induced pp-norm of AA to be

Ap=max{Axp:xp=1}.\left\lVert A\right\rVert_{p}=\max\{\left\lVert Ax\right\rVert_{p}:\left\lVert x\right\rVert_{p}=1\}. (17)

The spectral norm A\left\lVert A\right\rVert is the induced 22-norm A2\left\lVert A\right\rVert_{2}. The 1-norm A1\left\lVert A\right\rVert_{1} is simply the maximum sum of absolute values of entries in any column of the matrix. The \infty-norm A\left\lVert A\right\rVert_{\infty} is the maximum sum of absolute values of entries in any row of the matrix.

Lastly, we need a useful relationship between these norms sometimes called Hölder’s inequality for induced matrix norms (see [GL13, Corollary 2.3.2] for a proof):

Proposition 11.

For all matrices An×mA\in{\mathbb{R}}^{n\times m}, we have AA1A\left\lVert A\right\rVert\leq\sqrt{\left\lVert A\right\rVert_{1}\left\lVert A\right\rVert_{\infty}}.
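As a numerical sanity check of Proposition 11, the following stdlib-only Python sketch compares the spectral norm (computed by power iteration on ATAA^{T}A) against the geometric mean of the induced 1-norm and \infty-norm for a small example matrix. The matrix and helper names are illustrative choices, not from the paper.

```python
# Sanity-check Hoelder's inequality for induced matrix norms:
#   ||A|| <= sqrt(||A||_1 * ||A||_inf)
# on a small example matrix, using only the standard library.
import math
import random

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def transpose(M):
    return [list(col) for col in zip(*M)]

def spectral_norm(A, iters=500):
    # Power iteration on A^T A; returns the largest singular value of A.
    random.seed(0)
    At = transpose(A)
    v = [random.random() + 0.1 for _ in range(len(A[0]))]
    for _ in range(iters):
        w = matvec(At, matvec(A, v))
        nrm = math.sqrt(sum(x * x for x in w))
        v = [x / nrm for x in w]
    return math.sqrt(sum(x * x for x in matvec(A, v)))

def one_norm(A):    # max absolute column sum (induced 1-norm)
    return max(sum(abs(A[i][j]) for i in range(len(A))) for j in range(len(A[0])))

def inf_norm(A):    # max absolute row sum (induced infinity-norm)
    return max(sum(abs(x) for x in row) for row in A)

A = [[1.0, 2.0, 0.0],
     [0.0, 3.0, 1.0]]
assert spectral_norm(A) <= math.sqrt(one_norm(A) * inf_norm(A)) + 1e-9
```

For this matrix the spectral norm is 14\sqrt{14} while the Hölder bound gives 20\sqrt{20}, so the inequality holds with room to spare.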

We can now prove the upper bound:

Lemma 12.

For all (possibly partial) functions ff, we have λ(f)𝗌0(f)𝗌1(f)\operatorname{\lambda}(f)\leq\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)}.

Proof.

We know that λ(f)=Af\operatorname{\lambda}(f)=\left\lVert A_{f}\right\rVert and AfA_{f} is a matrix of the form (0BBT0)\Bigl{(}\begin{smallmatrix}0&B\\ B^{T}&0\end{smallmatrix}\Bigr{)} if we rearrange the rows and columns so that all 0-inputs come first, followed by all 1-inputs, since AfA_{f} only connects inputs with different ff-values. Thus we have

λ(f)=Af=BB1B=𝗌0(f)𝗌1(f),\operatorname{\lambda}(f)=\left\lVert A_{f}\right\rVert=\left\lVert B\right\rVert\leq\sqrt{\left\lVert B\right\rVert_{1}\left\lVert B\right\rVert_{\infty}}=\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)}, (18)

where we used Hölder’s inequality (Proposition 11) and the fact that the maximum row and column sum of BB are precisely 𝗌0(f)\operatorname{\mathsf{s}}_{0}(f) and 𝗌1(f)\operatorname{\mathsf{s}}_{1}(f), respectively. ∎

Our second proof of this claim uses the 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f) formulation which yields an arguably simpler proof.

Lemma 13.

For all (possibly partial) functions ff, we have 𝖠𝖽𝗏1(f)𝗌0(f)𝗌1(f)\operatorname{\mathsf{Adv}}_{1}(f)\leq\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)}.

Proof.

Using the 𝖬𝖬1(f)\operatorname{\mathsf{MM}}_{1}(f) version of 𝖠𝖽𝗏1(f)\operatorname{\mathsf{Adv}}_{1}(f), for each input xx and each bit ii that is sensitive for xx, set w(x,i)=𝗌0(f)/𝗌1(f)w(x,i)=\sqrt{\operatorname{\mathsf{s}}_{0}(f)}/\sqrt{\operatorname{\mathsf{s}}_{1}(f)} if f(x)=1f(x)=1, and set w(x,i)=𝗌1(f)/𝗌0(f)w(x,i)=\sqrt{\operatorname{\mathsf{s}}_{1}(f)}/\sqrt{\operatorname{\mathsf{s}}_{0}(f)} if f(x)=0f(x)=0; all other weights are zero. Then if xx and yy differ in a single bit ii with f(x)f(y)f(x)\neq f(y), we clearly have w(x,i)w(y,i)=1w(x,i)w(y,i)=1. On the other hand, since a 1-input has at most 𝗌1(f)\operatorname{\mathsf{s}}_{1}(f) sensitive bits, iw(x,i)𝗌1(f)𝗌0(f)/𝗌1(f)=𝗌0(f)𝗌1(f)\sum_{i}w(x,i)\leq\operatorname{\mathsf{s}}_{1}(f)\cdot\sqrt{\operatorname{\mathsf{s}}_{0}(f)}/\sqrt{\operatorname{\mathsf{s}}_{1}(f)}=\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)} for 1-inputs xx, and analogously iw(y,i)𝗌0(f)𝗌1(f)\sum_{i}w(y,i)\leq\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)} for 0-inputs yy. ∎
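The weight scheme in this proof can be checked mechanically on a small concrete function. The stdlib-only sketch below is an illustrative choice (not from the paper): it takes ff to be OR on 3 bits, so 𝗌0(f)=3\operatorname{\mathsf{s}}_{0}(f)=3 and 𝗌1(f)=1\operatorname{\mathsf{s}}_{1}(f)=1, assigns weights only to sensitive bits, and verifies both the feasibility constraint and the objective bound 𝗌0(f)𝗌1(f)\sqrt{\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f)}.

```python
# Check the weight scheme from the proof of Lemma 13 on f = OR_3,
# for which s0(f) = 3 and s1(f) = 1. Pure stdlib sketch.
import itertools
import math

n = 3
def f(x):          # OR on n bits
    return int(any(x))

def flip(x, i):
    return tuple(b ^ (1 if j == i else 0) for j, b in enumerate(x))

def sensitive_bits(x):
    return [i for i in range(n) if f(x) != f(flip(x, i))]

inputs = list(itertools.product((0, 1), repeat=n))
s0 = max(len(sensitive_bits(x)) for x in inputs if f(x) == 0)
s1 = max(len(sensitive_bits(x)) for x in inputs if f(x) == 1)

def w(x, i):
    if i not in sensitive_bits(x):
        return 0.0               # weights are zero on non-sensitive bits
    return math.sqrt(s0 / s1) if f(x) == 1 else math.sqrt(s1 / s0)

# Feasibility: w(x,i) * w(y,i) = 1 on every sensitive edge (x, y).
for x in inputs:
    for i in sensitive_bits(x):
        assert abs(w(x, i) * w(flip(x, i), i) - 1.0) < 1e-12

# Objective: total weight at every input is at most sqrt(s0 * s1).
max_total = max(sum(w(x, i) for i in range(n)) for x in inputs)
assert max_total <= math.sqrt(s0 * s1) + 1e-12
```

Here the bound is tight: both the all-zeros input and every weight-1 input collect total weight exactly 3\sqrt{3}.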

Using this better bound on λ(f)\operatorname{\lambda}(f) and Huang’s result, we also get that for all total Boolean functions ff,

𝖽𝖾𝗀(f)𝗌0(f)𝗌1(f).\operatorname{\mathsf{deg}}(f)\leq\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f). (19)

This result was also recently observed by Laplante, Naserasr, and Sunny [LNS20]. Unlike theirs, the following proof uses Huang’s theorem in a completely black-box way.

Proposition 14.

Assume that 𝖽𝖾𝗀(f)𝗌(f)2\operatorname{\mathsf{deg}}(f)\leq\operatorname{\mathsf{s}}(f)^{2} for all total Boolean functions ff. Then we also have 𝖽𝖾𝗀(f)𝗌0(f)𝗌1(f)\operatorname{\mathsf{deg}}(f)\leq\operatorname{\mathsf{s}}_{0}(f)\operatorname{\mathsf{s}}_{1}(f).

Proof.

Let 𝗌0(f)=k\operatorname{\mathsf{s}}_{0}(f)=k and 𝗌1(f)=\operatorname{\mathsf{s}}_{1}(f)=\ell. By assumption, 𝖽𝖾𝗀(f)𝗌(f)2=max{k,}2\operatorname{\mathsf{deg}}(f)\leq\operatorname{\mathsf{s}}(f)^{2}=\max\{k,\ell\}^{2}. Let andkor\textsc{and}_{k}\circ\textsc{or}_{\ell} be the AND function on kk bits composed with the OR function on \ell bits. Clearly 𝗌0(andkor)=\operatorname{\mathsf{s}}_{0}(\textsc{and}_{k}\circ\textsc{or}_{\ell})=\ell and 𝗌1(andkor)=k\operatorname{\mathsf{s}}_{1}(\textsc{and}_{k}\circ\textsc{or}_{\ell})=k. Furthermore, because this function is monotone, the sensitive bits of a 0-input are bits set to 0, and the sensitive bits of a 1-input are bits set to 1. This means that composing this function with ff yields a function whose one-sided sensitivities are upper bounded by the products of the one-sided sensitivities of the individual functions. Hence for all b{0,1}b\in\{0,1\}, we have

𝗌b(andkorf)𝗌b(andkor)𝗌b(f)k.\operatorname{\mathsf{s}}_{b}(\textsc{and}_{k}\circ\textsc{or}_{\ell}\circ f)\leq\operatorname{\mathsf{s}}_{b}(\textsc{and}_{k}\circ\textsc{or}_{\ell})\operatorname{\mathsf{s}}_{b}(f)\leq k\ell. (20)

Using the assumption on the function andkorf\textsc{and}_{k}\circ\textsc{or}_{\ell}\circ f, we get

𝖽𝖾𝗀(andkorf)(𝗌(andkorf))2(k)2.\operatorname{\mathsf{deg}}(\textsc{and}_{k}\circ\textsc{or}_{\ell}\circ f)\leq(\operatorname{\mathsf{s}}(\textsc{and}_{k}\circ\textsc{or}_{\ell}\circ f))^{2}\leq(k\ell)^{2}. (21)

Finally, it is well known that 𝖽𝖾𝗀(fg)=𝖽𝖾𝗀(f)𝖽𝖾𝗀(g)\operatorname{\mathsf{deg}}(f\circ g)=\operatorname{\mathsf{deg}}(f)\operatorname{\mathsf{deg}}(g) (see, e.g., [Tal13]), and hence 𝖽𝖾𝗀(andkorf)=k𝖽𝖾𝗀(f)\operatorname{\mathsf{deg}}(\textsc{and}_{k}\circ\textsc{or}_{\ell}\circ f)=k\ell\operatorname{\mathsf{deg}}(f), which implies 𝖽𝖾𝗀(f)k\operatorname{\mathsf{deg}}(f)\leq k\ell. ∎
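The degree multiplicativity fact used in this last step can be illustrated numerically. The stdlib-only sketch below (an illustrative script, not from the paper) computes the degree of the unique multilinear polynomial representing a Boolean function via a Möbius transform of its truth table, and checks that 𝖽𝖾𝗀(and2or2)=𝖽𝖾𝗀(and2)𝖽𝖾𝗀(or2)=4\operatorname{\mathsf{deg}}(\textsc{and}_{2}\circ\textsc{or}_{2})=\operatorname{\mathsf{deg}}(\textsc{and}_{2})\cdot\operatorname{\mathsf{deg}}(\textsc{or}_{2})=4.

```python
# Degree of the unique multilinear polynomial representing a Boolean
# function, via Moebius inversion: the coefficient of the monomial
# prod_{i in S} x_i is sum_{T subset S} (-1)^{|S|-|T|} f(1_T).
import itertools

def mobius_degree(f, n):
    deg = 0
    for S in itertools.product((0, 1), repeat=n):
        idx = [i for i in range(n) if S[i]]
        coeff = 0
        for bits in itertools.product((0, 1), repeat=len(idx)):
            T = [0] * n
            for i, b in zip(idx, bits):
                T[i] = b
            coeff += (-1) ** (len(idx) - sum(bits)) * f(tuple(T))
        if coeff != 0:
            deg = max(deg, len(idx))
    return deg

and2 = lambda x: x[0] & x[1]                 # degree 2: x1*x2
or2 = lambda x: x[0] | x[1]                  # degree 2: x1 + x2 - x1*x2
comp = lambda x: and2((or2(x[0:2]), or2(x[2:4])))  # AND_2 composed with OR_2

assert mobius_degree(and2, 2) == 2
assert mobius_degree(or2, 2) == 2
assert mobius_degree(comp, 4) == 4           # degrees multiply under composition
```

This checks a single small instance of the multiplicativity fact, not the general theorem, which is proved in [Tal13].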

A.3 Lower bounds

Finally, we prove some lower bounds on λ(f)\operatorname{\lambda}(f).

Lemma 15.

For all (possibly partial) functions ff, 𝗌(f)λ(f)2\operatorname{\mathsf{s}}(f)\leq\operatorname{\lambda}(f)^{2}.

Proof.

Consider any input xx with sensitivity 𝗌(f)\operatorname{\mathsf{s}}(f). This means xx has 𝗌(f)\operatorname{\mathsf{s}}(f) neighbors on the hypercube with different ff value. The sensitivity graph restricted to these 𝗌(f)+1\operatorname{\mathsf{s}}(f)+1 inputs is a star graph centered at xx. The spectral norm of the adjacency matrix of the star graph on k+1k+1 vertices is k\sqrt{k}. Since the spectral norm of AfA_{f} is lower bounded by that of a submatrix, we have λ(f)𝗌(f)\operatorname{\lambda}(f)\geq\sqrt{\operatorname{\mathsf{s}}(f)}. ∎

This relationship is tight for the orn\textsc{or}_{n} function, which has 𝗌(orn)=n\operatorname{\mathsf{s}}(\textsc{or}_{n})=n and λ(orn)=n\operatorname{\lambda}(\textsc{or}_{n})=\sqrt{n}. Although orn\textsc{or}_{n} has unbalanced sensitivities, with 𝗌0(orn)=n\operatorname{\mathsf{s}}_{0}(\textsc{or}_{n})=n and 𝗌1(orn)=1\operatorname{\mathsf{s}}_{1}(\textsc{or}_{n})=1, there are functions ff with 𝗌(f)=𝗌0(f)=𝗌1(f)=n\operatorname{\mathsf{s}}(f)=\operatorname{\mathsf{s}}_{0}(f)=\operatorname{\mathsf{s}}_{1}(f)=n and λ(f)=n\operatorname{\lambda}(f)=\sqrt{n}. One example of such a function is x1or(x2,,xn)x_{1}\oplus\textsc{or}(x_{2},\ldots,x_{n}). Another example of a function with a quadratic gap between 𝗌(f)\operatorname{\mathsf{s}}(f) and λ(f)\operatorname{\lambda}(f) is the function that is 1 if and only if the input string has Hamming weight 1. This function has 𝗌0(f)=n\operatorname{\mathsf{s}}_{0}(f)=n, since the all-zeros string is fully sensitive, and 𝗌1(f)=n\operatorname{\mathsf{s}}_{1}(f)=n, since every Hamming weight-1 string is also fully sensitive. But we know that this problem can be solved by Grover’s algorithm with O(n)O(\sqrt{n}) queries, and hence λ(f)=O(𝖰(f))=O(n)\operatorname{\lambda}(f)=O(\operatorname{\mathsf{Q}}(f))=O(\sqrt{n}).
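The Hamming-weight-1 example can be verified numerically for a small nn. The following stdlib-only sketch (the choice n=4n=4 is illustrative) builds the sensitivity graph of ff, estimates its spectral norm by power iteration, and checks that λ(f)\operatorname{\lambda}(f) lies between 𝗌(f)\sqrt{\operatorname{\mathsf{s}}(f)} (Lemma 15) and 𝗌(f)\operatorname{\mathsf{s}}(f).

```python
# Sensitivity graph of f(x) = [|x| = 1] on n = 4 bits: s(f) = n, but the
# spectral norm of the graph grows only like sqrt(n). Pure stdlib sketch.
import itertools
import math
import random

n = 4
inputs = list(itertools.product((0, 1), repeat=n))
f = {x: int(sum(x) == 1) for x in inputs}

# Adjacency matrix of G_f: x ~ y iff they differ in one bit and f(x) != f(y).
N = len(inputs)
A = [[0.0] * N for _ in range(N)]
for a, x in enumerate(inputs):
    for b, y in enumerate(inputs):
        if sum(u != v for u, v in zip(x, y)) == 1 and f[x] != f[y]:
            A[a][b] = 1.0

def spectral_norm(M, iters=2000):
    # Power iteration; for a symmetric matrix the returned value converges
    # to the largest eigenvalue magnitude, i.e. the spectral norm.
    random.seed(0)
    v = [random.random() + 0.1 for _ in range(len(M))]
    for _ in range(iters):
        w = [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]
        nrm = math.sqrt(sum(x * x for x in w))
        v = [x / nrm for x in w]
    return nrm

s = max(sum(A[a][b] != 0 for b in range(N)) for a in range(N))  # max sensitivity
lam = spectral_norm(A)

assert s == n                          # fully sensitive inputs exist
assert lam >= math.sqrt(s) - 1e-6      # Lemma 15: lambda(f) >= sqrt(s(f))
assert lam < s                         # lambda(f) falls strictly below s(f)
```

For n=4n=4 the computed spectral norm is 103.16\sqrt{10}\approx 3.16, already strictly between 𝗌(f)=2\sqrt{\operatorname{\mathsf{s}}(f)}=2 and 𝗌(f)=4\operatorname{\mathsf{s}}(f)=4; the gap becomes quadratic as nn grows.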

We can also lower bound Af\left\lVert A_{f}\right\rVert using the relationship between the spectral norm and the Frobenius norm. We have for all N×NN\times N matrices AA that A1NAF\left\lVert A\right\rVert\geq\frac{1}{\sqrt{N}}\left\lVert A\right\rVert_{F} [GL13, Eq. (2.3.7)], where AF2=i,j|Aij|2\left\lVert A\right\rVert^{2}_{F}=\sum_{i,j}|A_{ij}|^{2}. For the sensitivity graph of ff, 1NAfF\frac{1}{\sqrt{N}}\left\lVert A_{f}\right\rVert_{F} is just the square root of the average sensitivity, since AfF2=x𝗌x(f)\left\lVert A_{f}\right\rVert^{2}_{F}=\sum_{x}\operatorname{\mathsf{s}}_{x}(f).

Lemma 16.

For all (possibly partial) functions ff, λ(f)𝐄x[𝗌x(f)]\operatorname{\lambda}(f)\geq\sqrt{\mathop{\bf E\/}_{x}[\operatorname{\mathsf{s}}_{x}(f)]}.

This can be improved by only taking the expectation on the right over a subset of the inputs of ff, which then equals another complexity measure originally defined by Khrapchenko [Khr71]. See [Kou93] for more details.