This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A nearly-4logn4\log n depth lower bound for formulas with restriction on top

Hao Wu College of Information Engineering, Shanghai Maritime University, Shanghai, China. My email is [email protected], you can also reach me via [email protected].
Abstract

One of the major open problems in complexity theory is to demonstrate an explicit function which requires super logarithmic depth, a.k.a, the 𝐏\mathbf{P} versus 𝐍𝐂𝟏\mathbf{NC^{1}} problem. The current best depth lower bound is (3o(1))logn(3-o(1))\cdot\log n, and it is widely open how to prove a super-3logn3\log n depth lower bound. Recently Mihajlin and Sofronova (CCC’22) show if considering formulas with restriction on top, we can break the 3logn3\log n barrier. Formally, they prove there exist two functions f:{0,1}n{0,1},g:{0,1}n{0,1}nf:\{0,1\}^{n}\rightarrow\{0,1\},g:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, such that for any constant 0<α<0.40<\alpha<0.4 and constant 0<ϵ<α/20<\epsilon<\alpha/2, their XOR composition f(g(x)y)f(g(x)\oplus y) is not computable by an AND of 2(αϵ)n2^{(\alpha-\epsilon)n} formulas of size at most 2(1α/2ϵ)n2^{(1-\alpha/2-\epsilon)n}. This implies a modified version of Andreev function is not computable by any circuit of depth (3.2ϵ)logn(3.2-\epsilon)\log n with the restriction that top 0.4ϵ0.4-\epsilon layers only consist of AND gates for any small constant ϵ>0\epsilon>0. They ask whether the parameter α\alpha can be push up to nearly 11 thus implying a nearly-3.5logn3.5\log n depth lower bound.

In this paper, we provide a stronger answer to their question. We show there exist two functions f:{0,1}n{0,1},g:{0,1}n{0,1}nf:\{0,1\}^{n}\rightarrow\{0,1\},g:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, such that for any constant 0<α<2o(1)0<\alpha<2-o(1), their XOR composition f(g(x)y)f(g(x)\oplus y) is not computable by an AND of 2αn2^{\alpha n} formulas of size at most 2(1α/2o(1))n2^{(1-\alpha/2-o(1))n}. This implies a (4o(1))logn(4-o(1))\log n depth lower bound with the restriction that top 2o(1)2-o(1) layers only consist of AND gates. We prove it by observing that one crucial component in Mihajlin and Sofronova’s work, called the well-mixed set of functions, can be significantly simplified thus improved. Then with this observation and a more careful analysis, we obtain these nearly tight results.

1 Introduction

One of the major open problems in complexity theory is to demonstrate an explicit function which requires super logarithmic depth, a.k.a, the 𝐏\mathbf{P} versus 𝐍𝐂𝟏\mathbf{NC^{1}} problem. The current best depth lower bound [Hås98, Tal14, Tal16] is (3o(1))logn(3-o(1))\cdot\log n, and we still don’t even know how to obtain a lower bound strictly larger than 3logn3\log n. One promising approach to tackle this problem was suggested by Karchmer, Raz and Wigderson [KRW95], they proposed that we should understand the complexity of (block)-composition of Boolean functions. Given two functions f:{0,1}m{0,1}f:\{0,1\}^{m}\rightarrow\{0,1\}, g:{0,1}n{0,1}g:\{0,1\}^{n}\rightarrow\{0,1\}, we define their composite function fg:({0,1}n)m{0,1}f\diamond g:(\{0,1\}^{n})^{m}\rightarrow\{0,1\} as: fg(x1,,xm)=f(g(x1),,g(xm)).f\diamond g\left(x_{1},\ldots,x_{m}\right)=f\left(g\left(x_{1}\right),\ldots,g\left(x_{m}\right)\right). Given any Boolean function ff, we denote the depth complexity of ff by 𝖣(f)\mathsf{D}(f), that is the minimal depth of a circuit of AND, OR and NOT gates of fan-in 22 that computes ff. And it is easy to see the depth complexity of fgf\diamond g is upper-bounded by 𝖣(f)+𝖣(g)\mathsf{D}(f)+\mathsf{D}(g) and it is natural to ask whether the depth complexity of fgf\diamond g is far from this upper bound. Karchmer, Raz and Wigderson [KRW95] conjectured that the depth complexity of fgf\diamond g is not far from its upper bound:

Conjecture 1.1.

Given two arbitrary non-constant Boolean functions f:{0,1}m{0,1}f:\{0,1\}^{m}\rightarrow\{0,1\} and g:{0,1}n{0,1}g:\{0,1\}^{n}\rightarrow\{0,1\}, then 𝖣(fg)𝖣(f)+𝖣(g).\mathsf{D}(f\diamond g)\approx\mathsf{D}(f)+\mathsf{D}(g).

If the conjecture is proved and the “approximate equality” is instantiated with proper parameters, by an argument of iterative composition [KRW95], we will obtain an explicit function with super-logarithmic depth, which separates 𝐏\mathbf{P} from 𝐍𝐂𝟏\mathbf{NC^{1}}. Many restricted cases of KRW conjecture have been proved to be true. For example, there are composition theorems when the inner function gg satisfies certain property [Hås98, Tal14, DM18, FMT21]. There are composition theorems about universal relation [EIRS01, HW93, GMWW17, KM18, MS21, Wu23]. There are composition theorems where the composition itself is restricted such as monotone composition, semi-monotone composition [dRMN+20] and strong composition [Mei23]. There are also some variants [EIRS01, Mei20, MS21] of original conjecture with the similar effect to the 𝐏\mathbf{P} versus 𝐍𝐂𝟏\mathbf{NC^{1}} problem, but we don’t know how to prove them either. Maybe to prove the general form of KRW conjecture is far out of our reach now.

Note that we don’t even know how to prove a super-3logn3\log n depth lower bound, maybe we should consider following weaker conjecture which suffices to break the 3logn3\log n barrier in the first place.

Conjecture 1.2.

There exist two non-constant Boolean functions f,g:{0,1}n{0,1}f,g:\{0,1\}^{n}\rightarrow\{0,1\} such that 𝖣(fg)(1+ϵ)n\mathsf{D}(f\diamond g)\geq{(1+\epsilon)n} for some small constant ϵ(0,1)\epsilon\in(0,1).

Unfortunately, we don’t even know how to prove this weaker conjecture. Currently, the closest answer to Conjecture 1.2 is Meir’s strong composition theorem [Mei23], but we don’t know how to prove it for the case of standard composition. Mihajlin and Sofronova [MS22] proposed we should consider proving depth lower bound against even weaker formulas by considering restriction on top of the formulas. They managed to prove a composition theorem for formulas with restriction on top via XOR composition. The so-called XOR composition, proposed by Mihajlin and Smal [MS21], is a useful special case of standard composition. In fact, the first nontrivial composition theorem [MS21] of a universal relation and some function is proved via XOR composition.

Given two functions f:{0,1}n{0,1},g:{0,1}n{0,1}nf:\{0,1\}^{n}\rightarrow\{0,1\},g:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, their XOR composition fgf\boxplus g is defined as :

fg(x,y)=f(g(x)y)f\boxplus g(x,y)=f(g(x)\oplus y)

where \oplus denotes the bit-wise XOR of two binary strings. Mihajlin and Sofronova [MS22] proved following result.

Theorem 1.3 ([MS22]).

If we choose a function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} randomly, with probability 1o(1)1-o(1), there exists a function g:{0,1}n{0,1}ng:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, such that for any constant 0<α<0.40<\alpha<0.4 and constant 0<ϵ<α/20<\epsilon<\alpha/2, their XOR composition fgf\boxplus g is not computable by an AND (or OR) of 2(αϵ)n2^{(\alpha-\epsilon)n} formulas of size at most 2(1α/2ϵ)n2^{(1-\alpha/2-\epsilon)n}.

This implies a super-3logn3\log n depth lower bound for a modified version of the Andreev function against formulas with restriction on top.

Theorem 1.4 ([MS22]).

A modified version of Andreev function 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is not computable by any circuit of depth (3.2ϵ)logn(3.2-\epsilon)\log n with the restriction that top 0.4ϵ0.4-\epsilon layers only consist of AND(or OR) gates for any small constant ϵ>0\epsilon>0.

Such kind of super-3logn3\log n depth lower bound is unknown before their work even for such strong restriction. They asked whether their result can be improved as asked by following question.

Question 1.5 ([MS22]).

Is it possible to extend the range of parameter α\alpha in Theorem 1.3 to 0<α<10<\alpha<1?

In this paper, we give a positive answer to their question with an even better result. In fact, we extend the range of parameter α\alpha to 0<α<2o(1)0<\alpha<2-o(1) which is nearly optimal.

1.1 Our results

Our main result is an improved XOR composition theorem for formulas with restriction on top. Formally, we have following result.

Theorem 1.6.

Let 𝖫(f)\mathsf{L}(f) be the protocol size of any Boolean function ff. For most functions f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\}, there exists a function g:{0,1}n{0,1}ng:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, such that fgf\boxplus g is not computable by an AND (or OR) of 2αn2^{\alpha n} formulas of size at most 2n+log𝖫(f)αn2loglogn22^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}} for any 0<α<1+log𝖫(f)2loglognn0<\alpha<1+\frac{\log\mathsf{L}(f)-2\log\log n}{n}.

This implies a nearly-4logn4\log n depth lower bound for formulas with restriction on top.

Theorem 1.7.

A modified version of Andreev function 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is not computable by any circuit of depth (4o(1))logn(4-o(1))\log n with the restriction that top (2o(1))logn(2-o(1))\log n layers only consist of AND (or OR) gates.

Comparing to the results of Mihajlin and Sofronova [MS22], our results are nearly tight and our approach is much simpler to be described next.

Our approach.

Here we give a concise description of the proof idea of Theorem 1.6. The whole proof strategy is similar to that in [MS22], we call such strategy as a double-measurement argument, a generalized form of the double-counting argument. One crucial component in such argument is the notion of well-mixed set of functions, and our improvement is mainly due to the simplification and improvement for such well-mixed set of functions.

Let 𝒢\mathcal{G} be the set of all functions from {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n}. Now given a hard function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\}, we want to show there exists a function g𝒢g\in\mathcal{G}, such that if fgf\boxplus g can be computed by a formula ϕg=i=12αnϕg,i\phi_{g}=\bigwedge_{i=1}^{2^{\alpha n}}\phi_{g,i}, there must be a sub-formula ϕg,i\phi_{g,i} such that 𝖫(ϕg,i)\mathsf{L}(\phi_{g,i}) is large. To show this, Mihajlin and Sofronova defined a sub-additive measure μ\mu for Boolean functions of two arguments and μ(fg)\mu(f\boxplus g) is large, thus by averaging, for every gg, there exists some igi_{g} such that μ(ϕg,ig)\mu(\phi_{g,i_{g}}) is large enough. Note that for every g,igg,i_{g}, ϕg,ig\phi_{g,i_{g}} computes some function hg,igh_{g,i_{g}}, and let \mathcal{H} be the set of all such functions hg,igh_{g,i_{g}}. If the size of \mathcal{H} is large, by a standard counting argument, there must be a hard function hh\in\mathcal{H} as required. But \mathcal{H} may be a small set, to prevent this, we need to show for every hh\in\mathcal{H}, there only exists a small subset of 𝒢h𝒢\mathcal{G}_{h}\subseteq\mathcal{G} such that for every g𝒢hg\in\mathcal{G}_{h}, hg,igh_{g,i_{g}} is the same function as hh. Formally, denote the {g|g𝒢,hg,ig=h}\{g|g\in\mathcal{G},h_{g,i_{g}}=h\} by 𝒢h\mathcal{G}_{h}. Let hh_{\star} be the function such that the size of 𝒢h\mathcal{G}_{h_{\star}} is maximum among all 𝒢h\mathcal{G}_{h}, it suffices to show 𝒢h\mathcal{G}_{h_{\star}} is a small set. To this end, we need to show an up bound of measurement μ(h)\mu(h_{\star}) in another way and this is why the notion of well-mixed set is involved.

Now consider this function f,𝒢h(x,y)=g𝒢hf(g(x)y)\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}(x,y)=\bigvee_{g\in\mathcal{G}_{h_{\star}}}f(g(x)\oplus y). Let f,𝒢hx\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}^{x} be the function f,𝒢h\mathcal{M}_{f,\mathcal{G}_{h_{\star}}} with the first argument is fixed to be some x{0,1}nx\in\{0,1\}^{n}, we want to show if the set 𝒢h\mathcal{G}_{h_{\star}} is large, there are many xxs such that f,𝒢hx\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}^{x} is almost a constant function, and it eventually implies μ(h)\mu(h_{\star}) is small which contradicts the fact that μ(h)\mu(h_{\star}) is already large. This is essentially the property that Mihajlin and Sofronova wanted for 𝒢h\mathcal{G}_{h_{\star}}, or in their terms, 𝒢h\mathcal{G}_{h_{\star}} is well-mixed for function ff. In their work, Mihajlin and Sofronova used a rather complicated probabilistic method to show that property.

We will show such complication is entirely unnecessary and the well-mixed property could be obtained by a simple counting argument if you choose the function ff properly. For convenience, let N=2nN=2^{n}, now choose a hard function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} such that density(f1(1))=|f1(1)|Nδ\text{density}(f^{-1}(1))=\frac{|f^{-1}(1)|}{N}\geq\delta, typically, we set δ\delta to be 14\frac{1}{4}. Note that given any fixed xx, f,𝒢hx(y)=1\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}^{x}(y)=1 if there is a function g𝒢hg\in\mathcal{G}_{h_{\star}} such that (g(x)y)f1(1)(g(x)\oplus y)\in f^{-1}(1). Given any x{0,1}nx\in\{0,1\}^{n}, denote the set {z|g𝒢h,g(x)=z}\{z|\exists g\in\mathcal{G}_{h_{\star}},g(x)=z\} by 𝒢h(x)\mathcal{G}_{h_{\star}}(x). Similarly, given x,y{0,1}nx,y\in\{0,1\}^{n}, we denote the set {z|g𝒢h,g(x)y=z}\{z|\exists g\in\mathcal{G}_{h_{\star}},g(x)\oplus y=z\} by 𝒢h(x)y\mathcal{G}_{h_{\star}}(x)\oplus y. Given an xx, if |𝒢h(x)|>N(1δ)|\mathcal{G}_{h_{\star}}(x)|>N(1-\delta), for any fixed yy, we also have |𝒢h(x)y|>N(1δ)|\mathcal{G}_{h_{\star}}(x)\oplus y|>N(1-\delta), since for any fixed yy, zyz\oplus y is permutation function of zz. When |𝒢h(x)y|>N(1δ)|\mathcal{G}_{h_{\star}}(x)\oplus y|>N(1-\delta), (𝒢h(x)y)f1(1)(\mathcal{G}_{h_{\star}}(x)\oplus y)\cap f^{-1}(1) is not empty, this means there exists g𝒢hg\in\mathcal{G}_{h_{\star}} such that for that xx, f(g(x)y)=1f(g(x)\oplus y)=1.

Now we say xx is bad, if |𝒢h(x)|N(1δ)|\mathcal{G}_{h_{\star}}(x)|\leq N(1-\delta). If 𝒢h\mathcal{G}_{h_{\star}} is a dense subset of 𝒢\mathcal{G}, the number of bad xxs is small. Assume the size of 𝒢h\mathcal{G}_{h_{\star}} is (at least) |𝒢|(1δ)P|\mathcal{G}|\cdot(1-\delta)^{P}, then there are at most PP bad xxs. If not, the number of functions in 𝒢h\mathcal{G}_{h_{\star}} is less than

(N(1δ))PNNP=NN(1δ)P=|𝒢|(1δ)P\left(N(1-\delta)\right)^{P}\cdot N^{N-P}={N^{N}}\cdot(1-\delta)^{P}=|\mathcal{G}|\cdot(1-\delta)^{P}

which is a contradiction. This means, given any xx which is not bad, the function f,𝒢hx(y)=1\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}^{x}(y)=1 for any yy, thus f,𝒢hx\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}^{x} is a constant function. This eventually implies μ(h)\mu(h_{\star}) is small which leads to the desired contradiction. Finally, since 𝒢h\mathcal{G}_{h_{\star}} has to be a small set, the set \mathcal{H} must be a large set of functions which contains a hard function hh\in\mathcal{H} as required. See more details in Lemma 3.2 and Theorem 3.3.

Other related works.

Besides Mihajlin and Sofronova’s work, we note that Bathie and Williams [BW24] established a super-3logn3\log n depth lower bound against uniform circuits consisting of only NAND gates. Since their result is against uniform circuits, it is incomparable to ours. They also pointed out that results similar to Mihajlin and Sofronova’s work could be obtained from average-case lower bounds [KRT17] via restriction-based techniques but they didn’t provide further details. In a personal communication, Meir [Mei24] pointed out that results similar to Mihajlin and Sofronova’s work can also be obtained via techniques from communication complexity but it is not clear whether such results are as tight as ours.

1.2 Organization of the rest of the paper

The rest of the paper is organized as follows. In Section 2, we provide necessary preliminaries. In Section 3, we prove Theorem 1.6, an improved XOR composition theorem of formulas with restriction on top. In Section 4, we prove Theorem 1.7, a nearly-4logn4\log n depth lower bound for formulas with restriction on top. In Section 5, we conclude and make some discussion about future directions.

2 Preliminaries

Definition 2.1 (De Morgan formula).

A De Morgan formula ϕ\phi is a binary tree, its internal vertices are gates such as AND()\mathrm{AND}(\wedge) or OR()\mathrm{OR}(\vee), its leaves are literals such as xix_{i} or its negation ¬xi\neg x_{i}. The depth of a formula is the depth of underling tree of the formula. The size of a formula is the number of its leaves.

Definition 2.2.

The formula complexity of a boolean function f:{0,1}n{0,1},f:\{0,1\}^{n}\rightarrow\{0,1\}, denoted 𝖫(f),\mathsf{L}(f), is the size of the smallest formula that computes f.f. The depth complexity of f,f, denoted 𝖣(f),\mathsf{D}(f), is the smallest depth of a formula that computes ff.

We will need following fact, given a large set of distinct functions, there is a function with large formula size in that set.

Fact 2.3 ([Juk12], Theorem 1.23).

Let \mathcal{F} be a set of distinct Boolean functions with input length nn, then there exists a function ff\in\mathcal{F} such that 𝖫(f)log||logn+4\mathsf{L}(f)\geq\frac{\log|\mathcal{F}|}{{\log n}+4}.

Definition 2.4 (XOR composition of two functions, [MS21]).

Let f:{0,1}n{0,1},g:{0,1}n{0,1}nf:\{0,1\}^{n}\rightarrow\{0,1\},g:\{0,1\}^{n}\rightarrow\{0,1\}^{n} be two functions, their XOR composition fg:{0,1}n×{0,1}n{0,1}f\boxplus g:\{0,1\}^{n}\times\{0,1\}^{n}\rightarrow\{0,1\} is defined as follows:

fg(x,y)=f(g(x)y)f\boxplus g(x,y)=f(g(x)\oplus y)

where \oplus denotes the bit-wise XOR of two binary strings.

We recall a measure μ(h)\mu(h) of any Boolean function hh of two arguments defined in [MS22].

Definition 2.5 ([MS22]).

Let hh be a Boolean function of two arguments, given any fixed xx as the first argument, we define the function hxh^{x} by setting hx(y)=h(x,y)h^{x}(y)=h(x,y) and define

μ(h)=xX𝖫(hx).\mu(h)=\sum_{x\in X}\mathsf{L}(h^{x}).
Fact 2.6 ([MS22]).

μ(fg)2n𝖫(f)\mu(f\boxplus g)\geq 2^{n}\cdot\mathsf{L}(f) and for every xx, 𝖫(hx)𝖫(h)\mathsf{L}(h^{x})\leq\mathsf{L}(h).

The measure μ\mu is sub-additive in the following sense:

Lemma 2.7 ([MS22]).

Let h(x,y)=(g1,g2,,gk)(x,y)h(x,y)=\circ(g_{1},g_{2},\ldots,g_{k})(x,y), where \circ is \wedge or \vee. Then μ(h)μ(g1)++μ(gk).\mu(h)\leq\mu(g_{1})+\ldots+\mu(g_{k}).

Matrix representation for a function of two arguments.

For convenience, we follow a notation in [MS22] which treats a Boolean function of two arguments as a Boolean matrix.

Definition 2.8.

Set X={0,1}n,Y={0,1}nX=\{0,1\}^{n},Y=\{0,1\}^{n}, given a function h:X×Y{0,1}h:X\times Y\rightarrow\{0,1\}, define a corresponding matrix h\mathcal{M}_{h} such that

  • the rows of h\mathcal{M}_{h} are indexed by xXx\in X and the columns are indexed by yYy\in Y,

  • and h(x,y)=h(x,y)\mathcal{M}_{h}(x,y)=h(x,y) for every x,yx,y.

Similarly, given two functions f:{0,1}n{0,1},g:{0,1}n{0,1}nf:\{0,1\}^{n}\rightarrow\{0,1\},g:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, define a matrix f,g\mathcal{M}_{f,g} such that f,g(x,y)=f(g(x)y)\mathcal{M}_{f,g}(x,y)=f(g(x)\oplus y) for every x,yx,y.

Furthermore, given a function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} and a set 𝒵\mathcal{Z} of functions from {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n}, define a matrix f,𝒵\mathcal{M}_{f,\mathcal{Z}} such that for every x,yx,y,

f,𝒵(x,y)=g𝒵f(g(x)y).\mathcal{M}_{f,\mathcal{Z}}(x,y)=\bigvee_{g\in\mathcal{Z}}f(g(x)\oplus y).

Finally, given a subset of indexes AXA\subseteq X for rows in a matrix \mathcal{M}, the matrix A\mathcal{M}^{A} is a sub-matrix of \mathcal{M} restricted to rows indexed by AA.

Concentration of measure

Theorem 2.9 (Chernoff bound).

Given nn independent random variables X1,,XnX_{1},\ldots,X_{n} which distribute in {0,1}\{0,1\}, let X=i=1nXiX=\sum_{i=1}^{n}X_{i} be their sum and 𝔼[X]=μ\mathbb{E}[X]=\mu. For any constant δ\delta such that 0<δ<10<\delta<1, we have

Pr(X(1+δ)μ)eδ2μ2+δ\Pr(X\geq(1+\delta)\mu)\leq e^{-\frac{\delta^{2}\mu}{2+\delta}}

and

Pr(X(1δ)μ)eδ2μ2.\Pr(X\leq(1-\delta)\mu)\leq e^{-\frac{\delta^{2}\mu}{2}}.

By Chernoff bound, we have following fact.

Fact 2.10.

Let N=2nN=2^{n}. If we choose a function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} randomly, then Pr(|f1(1)|N<14)eΩ(N)\Pr(\frac{|f^{-1}(1)|}{N}<\frac{1}{4})\leq e^{-\Omega(N)} and Pr(|f1(0)|N<14)eΩ(N)\Pr(\frac{|f^{-1}(0)|}{N}<\frac{1}{4})\leq e^{-\Omega(N)}.

3 An improved XOR composition theorem for formulas with restriction on top

In this section, we prove Theorem 1.6. At first, let’s recall the notion of well-mixed set of functions.

Definition 3.1 (Well-mixed set of functions, [MS22]).

A set of functions 𝒢\mathcal{G} from {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n} is (Q,D,P)(Q,D,P)-well-mixed for ff if 𝒵𝒢,|𝒵|Q\forall\mathcal{Z}\subseteq\mathcal{G},|\mathcal{Z}|\geq Q, there exist a set K{0,1}n,|K|PK\subseteq\{0,1\}^{n},|K|\leq P , such that f,𝒵XK\mathcal{M}^{X\setminus K}_{f,\mathcal{Z}} has no more than DD zeroes in total where f,𝒵(x,y)=g𝒵f(g(x)y)\mathcal{M}_{f,\mathcal{Z}}(x,y)=\bigvee_{g\in\mathcal{Z}}f(g(x)\oplus y).

Now we show given any approximately balanced function ff, the set 𝒢\mathcal{G} of all functions {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n} is already a well-mixed set of functions for ff.

Lemma 3.2.

Let f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} be a function, 𝒢\mathcal{G} be the set of all functions {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n}. For convenience, let N=2nN=2^{n} and X={0,1}nX=\{0,1\}^{n}. Assume density(f1(1))=|f1(1)|Nδ\text{density}(f^{-1}(1))=\frac{|f^{-1}(1)|}{N}\geq\delta, then 𝒢\mathcal{G} is (|𝒢|(1δ)P,0,P)({|\mathcal{G}|}\cdot(1-\delta)^{P},0,P)-well-mixed for ff. Particularly, let 𝒵𝒢\mathcal{Z}\subseteq\mathcal{G} be a set of functions and density(𝒵)=|𝒵||𝒢|\text{density}(\mathcal{Z})=\frac{|\mathcal{Z}|}{|\mathcal{G}|}, there exists a set K{0,1}nK\subseteq\{0,1\}^{n} such that |K|=Plogdensity(𝒵)log(1δ)|K|=P\leq\frac{\log\text{density}(\mathcal{Z})}{\log(1-\delta)} and all entries in f,𝒵XK\mathcal{M}^{X\setminus K}_{f,\mathcal{Z}} are ones.

Proof.

Let 𝒵𝒢\mathcal{Z}\subseteq\mathcal{G} be a set of functions, given any x{0,1}nx\in\{0,1\}^{n}, denote the set {z|g𝒵,g(x)=z}\{z|\exists g\in\mathcal{Z},g(x)=z\} by 𝒵(x)\mathcal{Z}(x), and we say xx is bad if |𝒵(x)|N(1δ)|\mathcal{Z}(x)|\leq N(1-\delta). Similarly, given x,y{0,1}nx,y\in\{0,1\}^{n}, we denote the set {z|g𝒵,g(x)y=z}\{z|\exists g\in\mathcal{Z},g(x)\oplus y=z\} by 𝒵(x)y\mathcal{Z}(x)\oplus y.

Now assume there are PP bad xxs, the number of functions in 𝒵\mathcal{Z} is at most

(N(1δ))PNNP=NN(1δ)P=|𝒢|(1δ)P.\left(N(1-\delta)\right)^{P}\cdot N^{N-P}={N^{N}}\cdot(1-\delta)^{P}=|\mathcal{G}|\cdot(1-\delta)^{P}.

This implies density(𝒵)=|𝒵||𝒢|(1δ)P\text{density}(\mathcal{Z})=\frac{|\mathcal{Z}|}{|\mathcal{G}|}\leq(1-\delta)^{P}, this means Plogdensity(𝒵)log(1δ)P\leq\frac{\log\text{density}(\mathcal{Z})}{\log(1-\delta)}.

Now we show when xx is not bad, for every y{0,1}y\in\{0,1\}, f,𝒵(x,y)=g𝒵f(g(x)y)=1\mathcal{M}_{f,\mathcal{Z}}(x,y)=\bigvee_{g\in\mathcal{Z}}f(g(x)\oplus y)=1. Since |𝒵(x)|>N(1δ)|\mathcal{Z}(x)|>N(1-\delta), we have |𝒵(x)y|>N(1δ)|\mathcal{Z}(x)\oplus y|>N(1-\delta). Since |𝒵(x)y|>N(1δ)|\mathcal{Z}(x)\oplus y|>N(1-\delta), (𝒵(x)y)f1(1)(\mathcal{Z}(x)\oplus y)\cap f^{-1}(1) is not empty, there must be a g𝒵g\in\mathcal{Z} such that g(x)yf1(1)g(x)\oplus y\in f^{-1}(1), thus gZf(g(x)y)\bigvee_{g\in Z}f(g(x)\oplus y) must be 11 as required. ∎

Now we are ready to prove Theorem 1.6 rephrased as follows. It is proved via a similar idea in [MS22] and a more careful analysis.

Theorem 3.3.

Let f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\} be a function and density(f1(1))14\text{density}(f^{-1}(1))\geq\frac{1}{4}, there exists a function g:{0,1}n{0,1}ng:\{0,1\}^{n}\rightarrow\{0,1\}^{n}, such that fgf\boxplus g is not computable by an AND of 2αn2^{\alpha n} formulas of size at most 2n+log𝖫(f)αn2loglogn22^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}} for any 0<α<1+log𝖫(f)2loglognn0<\alpha<1+\frac{\log\mathsf{L}(f)-2\log\log n}{n}.

Proof of Theorem 3.3.

We prove it by contradiction. Let 𝒢\mathcal{G} be the set of all functions {0,1}n{0,1}n\{0,1\}^{n}\rightarrow\{0,1\}^{n}. Assume the contrary that for all g𝒢g\in\mathcal{G} the XOR composition fgf\boxplus g is computable by AND of small enough formulas. That is, for any g𝒢g\in\mathcal{G}, there is a formula ϕg\phi_{g} computing fgf\boxplus g and ϕg\phi_{g} is of following form

ϕg=i=12αnϕg,i\phi_{g}=\bigwedge_{i=1}^{2^{\alpha n}}\phi_{g,i}

where the size of every ϕg,i\phi_{g,i} is at most 2n+log𝖫(f)αn2loglogn22^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}}. Now let hg,ih_{g,i} be the function that ϕg,i\phi_{g,i} computes, thus 𝖫(hg,i)2n+log𝖫(f)αn2loglogn2\mathsf{L}(h_{g,i})\leq 2^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}} and fgf\boxplus g can be represented as i=12αnhg,i\bigwedge_{i=1}^{2^{\alpha n}}h_{g,i}.

Recall that for every g𝒢g\in\mathcal{G}, μ(fg)2n𝖫(f).\mu(f\boxplus g)\geq 2^{n}\cdot\mathsf{L}(f). By Lemma 2.7, there must be an ig[2αn]i_{g}\in[2^{\alpha n}] such that, the measure μ(hg,ig)\mu(h_{g,i_{g}}) is large:

μ(hg,ig)2(1α)n𝖫(f).\mu(h_{g,i_{g}})\geq 2^{(1-\alpha)n}\cdot\mathsf{L}(f).

Now we collect all such functions hg,igh_{g,i_{g}} and let \mathcal{H} be the set of all such functions hg,igh_{g,i_{g}}. We want to show that the size of \mathcal{H} is large, thus by a standard counting argument, there must be a function hh\in\mathcal{H} which requires large formulas which contradicts the hypothesis.

Given any hh\in\mathcal{H}, denote the {g|g𝒢,hg,ig=h}\{g|g\in\mathcal{G},h_{g,i_{g}}=h\} by 𝒢h\mathcal{G}_{h}. Let hh_{\star} be the function such that the size of 𝒢h\mathcal{G}_{h_{\star}} is maximum among all 𝒢h\mathcal{G}_{h}. We will prove

4logdensity(𝒢h)𝖫(h)μ(h)2(1α)n𝖫(f),-4\log\text{density}(\mathcal{G}_{h_{\star}})\cdot\mathsf{L}(h_{\star})\geq\mu(h_{\star})\geq 2^{(1-\alpha)n}\cdot\mathsf{L}(f),

before proving this, let’s show it indeed leads to the contradiction required. Recall by assumption 𝖫(h)2n+log𝖫(f)αn2loglogn2\mathsf{L}(h_{\star})\leq 2^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}}, this means

density(𝒢h)22n+log𝖫(f)αn+2loglogn42.\text{density}(\mathcal{G}_{h_{\star}})\leq 2^{-2^{\frac{n+\log\mathsf{L}(f)-\alpha n+2\log\log n-4}{2}}}.

Now we are ready to lower bound the size of |||\mathcal{H}|, that is the number of distinct functions in \mathcal{H}. Since |||𝒢h||𝒢||\mathcal{H}|\cdot|\mathcal{G}_{h_{\star}}|\geq|\mathcal{G}|,

||1density(𝒢h)22n+log𝖫(f)αn+2loglogn42.|\mathcal{H}|\geq\frac{1}{\text{density}(\mathcal{G}_{h_{\star}})}\geq 2^{2^{\frac{n+\log\mathsf{L}(f)-\alpha n+2\log\log n-4}{2}}}.

By Fact 2.3, there exists an hh\in\mathcal{H} such that

𝖫(h)\displaystyle\mathsf{L}(h) 2n+log𝖫(f)αn+2loglogn42log2n+4\displaystyle\geq\frac{2^{\frac{n+\log\mathsf{L}(f)-\alpha n+2\log\log n-4}{2}}}{{\log 2n}+4}
>2n+log𝖫(f)αn2loglogn2,when n is large enough,\displaystyle>2^{\frac{n+\log\mathsf{L}(f)-\alpha n-2\log\log n}{2}},\text{when $n$ is large enough,}

which is a contradiction to the assumption. Now we show 4logdensity(𝒢h)𝖫(h)μ(h)-4\log\text{density}(\mathcal{G}_{h_{\star}})\cdot\mathsf{L}(h_{\star})\geq\mu(h_{\star}) by considering the matrix f,𝒢h\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}. By Lemma 3.2,

  • there exists a set K{0,1}nK\subseteq\{0,1\}^{n} such that

    |K|\displaystyle|K| logdensity(𝒢h)log(11/4)\displaystyle\leq\frac{\log\text{density}(\mathcal{G}_{h_{\star}})}{\log(1-1/4)}
    4logdensity(𝒢h), since log340.415 thus 1<log340.25 .\displaystyle\leq-4\cdot\log\text{density}(\mathcal{G}_{h_{\star}}),\text{ since $\log\frac{3}{4}\approx-0.415$ thus $1<\frac{\log\frac{3}{4}}{-0.25}$ }.
  • And all entries in f,𝒢hXK\mathcal{M}^{X\setminus K}_{f,\mathcal{G}_{h_{\star}}} are ones.

Now we have following key fact about the function hh_{\star}.

Fact 3.4.

For every x,y{0,1}nx,y\in\{0,1\}^{n}, f,𝒢h(x,y)=1\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}(x,y)=1 implies h(x,y)=1h_{\star}(x,y)=1.

Proof.

Recall that f,𝒢h(x,y)=g𝒢h(fg)(x,y)\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}(x,y)=\bigvee_{g\in\mathcal{G}_{h_{\star}}}(f\boxplus g)(x,y). If f,𝒢h(x,y)=1\mathcal{M}_{f,\mathcal{G}_{h_{\star}}}(x,y)=1, there must be some g𝒢hg\in\mathcal{G}_{h_{\star}} such that (fg)(x,y)=1(f\boxplus g)(x,y)=1. Furthermore, by assumption fgf\boxplus g is simply i=12αnhg,i\bigwedge_{i=1}^{2^{\alpha n}}h_{g,i} and h=hg,ih_{\star}=h_{g,i} for some ii, thus h(x,y)h_{\star}(x,y) must be 11 as well. ∎

By Fact 3.4, given any xXKx\in X\setminus K, for every y{0,1}ny\in\{0,1\}^{n}, hx(y)=1h_{\star}^{x}(y)=1 , in other words, hxh_{\star}^{x} is a constant function and 𝖫(hx)=0\mathsf{L}(h_{\star}^{x})=0. Now we are ready to up bound μ(h)\mu(h_{\star}) and have

μ(h)\displaystyle\mu(h_{\star}) =xL(hx)=xKL(hx)+xKL(hx)\displaystyle=\sum_{x}L(h_{\star}^{x})=\sum_{x\in K}L(h_{\star}^{x})+\sum_{x\notin K}L(h_{\star}^{x})
=xKL(hx)\displaystyle=\sum_{x\in K}L(h_{\star}^{x})
|K|𝖫(h)\displaystyle\leq|K|\cdot\mathsf{L}(h_{\star})
4logdensity(𝒢h)𝖫(h).\displaystyle\leq-4\cdot\log\text{density}(\mathcal{G}_{h_{\star}})\cdot\mathsf{L}(h_{\star}).

as required. ∎

Remark 3.5.

We want to point out the AND(\wedge) gate can be replaced with OR(\vee) gate and since we use the counting argument to show there exists a hard function in the set \mathcal{H}, this result can be extended to the case of formula over full binary basis.

4 A nearly-4logn4\log n depth lower bound for formulas with restriction on top

We will prove a depth lower bound for a modified Andreev function defined in [MS22].

Definition 4.1 ([MS22]).

The modified Andreev function 𝐀𝐧𝐝𝐫:{0,1}n×{0,1}nlogn×{0,1}n×2logn{0,1}\mathbf{Andr}^{\prime}:\{0,1\}^{n}\times\{0,1\}^{n\log n}\times\{0,1\}^{n\times 2\log n}\rightarrow\{0,1\} is defined as follows:

𝐀𝐧𝐝𝐫(𝚃𝚃f,𝚃𝚃g,x1,,x2logn)=(fg)((x1),,(x2logn))\mathbf{Andr}^{\prime}(\mathtt{TT}_{f},\mathtt{TT}_{g},x_{1},\ldots,x_{2\log n})=(f\boxplus g)\left(\oplus(x_{1}),\ldots,\oplus(x_{2\log n})\right)

where 𝚃𝚃f\mathtt{TT}_{f} is a truth table of some function ff from {0,1}logn{0,1}\{0,1\}^{\log n}\rightarrow\{0,1\},𝚃𝚃g\mathtt{TT}_{g} is a truth table of some function gg from {0,1}logn{0,1}logn\{0,1\}^{\log n}\rightarrow\{0,1\}^{\log n}, for every i[2logn]i\in[2\log n], xix_{i} is a binary string of length nn and ()\oplus(\cdot) is the parity function.

Theorem 4.2.

There exist two parameters γ=o(1),ϵ=o(1)\gamma=o(1),\epsilon=o(1), for every constant α\alpha such that 0<α<2γ0<\alpha<2-\gamma, the modified Andreev function 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is not computable by an AND of nαn^{\alpha} formulas of size at most n3α/2ϵn^{3-\alpha/2-\epsilon}.

In terms of depth lower bound, the modified Andreev function 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is not computable by any circuit of depth (3+α/2ϵ)logn(3+\alpha/2-\epsilon)\log n with the restriction that top α\alpha layers only consist of AND gates where 0<α<2γ0<\alpha<2-\gamma. Choose α=2o(1)\alpha=2-o(1) properly, Theorem 1.7 follows.

Remark 4.3.

Note that the input length of the modified Andreev function 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is n=(3logn+1)nn^{\prime}=(3\log n+1)n, writing the results in terms of nn^{\prime} doesn’t change them essentially. For example, (4o(1))logn>(4o(1))logn4logn>(4o(1))logn4logn=(4o(1)loglogn+2logn)logn=(4o(1))logn(4-o(1))\log n>(4-o(1))\log\frac{n^{\prime}}{4\log n}>(4-o(1))\log\frac{n^{\prime}}{4\log n^{\prime}}=(4-o(1)-\frac{\log\log n^{\prime}+2}{\log n^{\prime}})\log{n^{\prime}}=(4-o(1))\log{n^{\prime}}. Similarly, in the restriction for top gates, AND could be replaced by OR.

Theorem 4.2 is proved via the same idea from [MS22], the only differences here are details of parameters. For completeness of this paper, we present the proof here. At first, we recall the standard notion of random restriction.

Definition 4.4 (Restriction).

Given a Boolean function f:{0,1}n{0,1}f:\{0,1\}^{n}\rightarrow\{0,1\}, a restriction ρ{0,1,}n\rho\in\{0,1,\ast\}^{n} to function ff is a vector of length nn and for every i[n]i\in[n], ρi\rho_{i} is an element from {0,1,}\{0,1,\ast\}. Define f|ρf|_{\rho} to be the function restricted according to ρ\rho as follows: if ρi\rho_{i} is \ast then the ii-th input bit of ff is unfixed thus free to be 0 or 11; otherwise the ii-th input bit of ff is fixed to be ρi\rho_{i}.

Definition 4.5 (Random restriction).

Given 0<p<10<p<1, the random restriction Rp\text{R}_{p} randomly samples restrictions as follows: every ρi\rho_{i} is sampled independently such that Pr[ρi=]=p\Pr[\rho_{i}=\ast]=p and Pr[ρi=1]=Pr[ρi=0]=1p2\Pr[\rho_{i}=1]=\Pr[\rho_{i}=0]=\frac{1-p}{2}.

In the rest of this section, we will set p=2lnlognnp=\frac{2\ln\log n}{n}. Mihajlin and Sofronova proved following useful two lemmas about random restriction of the modified Andreev function implicitly in [MS22].

Lemma 4.6 (Implicit in [MS22]).

Let 𝐀𝐧𝐝𝐫f,g\mathbf{Andr}^{\prime}_{f,g} be the 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} function hardwired with two fixed functions f,gf,g. Then with probability 1o(1)1-o(1), the random restriction Rp\text{R}_{p} will turn 𝐀𝐧𝐝𝐫f,g\mathbf{Andr}^{\prime}_{f,g} into fgf\boxplus g.

Lemma 4.7 (Implicit in [MS22]).

Let α,β\alpha,\beta be two parameters such that 0<α<2,2<β<30<\alpha<2,2<\beta<3. Let ϕ\phi be a formula of form i=1nαϕi\bigwedge_{i=1}^{n^{\alpha}}\phi_{i} where the size of each ϕi\phi_{i} is at most nβn^{\beta}. Then with probability 1o(1)1-o(1), the random restriction Rp\text{R}_{p} will turn ϕ\phi into a formula ϕ\phi^{\prime} of form i=1nαϕi\bigwedge_{i=1}^{n^{\alpha}}\phi_{i}^{\prime} where the size of each ϕi\phi_{i}^{\prime} is at most nβ2+δn^{\beta-2+\delta} for some δ=o(1)\delta=o(1).

Now we show how to prove Theorem 4.2 with above two lemmas.

Proof of Theorem 4.2.

At first choose some function ff from {0,1}logn{0,1}\{0,1\}^{\log n}\rightarrow\{0,1\} and make sure that ff is the function with maximum protocol size and |f1(1)|n14\frac{|f^{-1}(1)|}{n}\geq\frac{1}{4}. By Fact 2.3 and 2.10, we have 𝖫(f)log((1o(1))22logn)/(loglogn+4)n/2loglogn\mathsf{L}(f)\geq\log\left((1-o(1))2^{2^{\log n}}\right)/(\log\log n+4)\geq n/2\log\log n when nn is large enough. By Theorem 3.3, we have following fact.

Fact 4.8.

Let ff be the function chosen above, there exists a function g:{0,1}logn{0,1}logng:\{0,1\}^{\log n}\rightarrow\{0,1\}^{\log n}, such that fgf\boxplus g is not computable by an AND of nαn^{\alpha} formulas of size at most n1α/22logloglognlognn^{1-\alpha/2-\frac{2\log\log\log n}{\log n}} for any 0<α<24logloglognlogn0<\alpha<2-\frac{4\log\log\log n}{\log n} when nn is large enough.

Let δ\delta be the same parameter from Lemma 4.7, γ=4logloglognlogn\gamma=\frac{4\log\log\log n}{\log n} and ϵ=δ+γ/2\epsilon=\delta+\gamma/2. Choose some α\alpha such that 0<α<2γ0<\alpha<2-\gamma and set β=3α/2ϵ\beta=3-\alpha/2-\epsilon. Now assume Theorem 4.2 is false, that is 𝐀𝐧𝐝𝐫\mathbf{Andr}^{\prime} is computable by formula ϕ\phi of form i=1nαϕi\bigwedge_{i=1}^{n^{\alpha}}\phi_{i} where the size of each ϕi\phi_{i} is at most nβn^{\beta}, and so is 𝐀𝐧𝐝𝐫f,g\mathbf{Andr}^{\prime}_{f,g}. By Lemma 4.6 and 4.7, fgf\boxplus g is computable by a formula ϕ\phi^{\prime} of form i=1nβϕi\bigwedge_{i=1}^{n^{\beta}}\phi_{i}^{\prime} where the size of each ϕi\phi_{i}^{\prime} is at most

nβ2+δ=n1α/2ϵ+δ=n1α/22logloglognlognn^{\beta-2+\delta}=n^{1-\alpha/2-\epsilon+\delta}=n^{1-\alpha/2-\frac{2\log\log\log n}{\log n}}

which contradicts Fact 4.8. ∎

5 Conclusion and discussion

In this paper, we obtain a nearly-tight XOR composition theorem for formulas with restriction on top and with this composition theorem we have a nearly-4logn4\log n depth lower bound for formulas with restriction on top. Intuitively, in such the depth lower bound, we trade one unrestricted layer to nearly two layers of AND gates on top of the circuit and our trade-off is nearly optimal.

The next nature question as pointed out by Mihajlin and Sofronova [MS22] is to prove the case with 𝐀𝐂0\mathbf{AC}^{0} formula on top. But it turns out even to prove the case of depth-2 formula with unbounded fan-in is difficult. The obstacle to extending current approach to the case of depth-2 formula on top is that we don’t know how to find the sub-formula hh_{\star} such that the measure μ(h)\mu(h_{\star}) is large enough meanwhile hh_{\star} is correlated with fgf\boxplus g properly like that in Fact 3.4. This difficulty also appears in the approach via communication complexity [Mei24], since such result shares the same feature with a composition theorem of a depth-2 formula and a De Morgan formula, but we don’t even know how to prove a composition theorem of two depth-2 formulas in general. New ideas are needed, maybe we should try to prove a general composition theorem of two depth-2 formulas in the first place.

Acknowledgments

The author would like to thank Or Meir for sharing the idea about how to prove results similar to Mihajlin and Sofronova’s work via techniques from communication complexity and other helpful discussions. The author would also like to thank Pei Wu for suggesting the question of the composition of two depth-2 formulas and other helpful discussions.

References

  • [BW24] Gabriel Bathie and R Ryan Williams. Towards stronger depth lower bounds. In 15th Innovations in Theoretical Computer Science Conference (ITCS 2024). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2024.
  • [DM18] Irit Dinur and Or Meir. Toward the KRW composition conjecture: Cubic formula lower bounds via communication complexity. Comput. Complex., 27(3):375–462, 2018.
  • [dRMN+20] Susanna F. de Rezende, Or Meir, Jakob Nordström, Toniann Pitassi, and Robert Robere. KRW composition theorems via lifting. In Sandy Irani, editor, 61st IEEE Annual Symposium on Foundations of Computer Science, FOCS 2020, Durham, NC, USA, November 16-19, 2020, pages 43–49. IEEE, 2020.
  • [EIRS01] Jeff Edmonds, Russell Impagliazzo, Steven Rudich, and Jirí Sgall. Communication complexity towards lower bounds on circuit depth. Comput. Complex., 10(3):210–246, 2001.
  • [FMT21] Yuval Filmus, Or Meir, and Avishay Tal. Shrinkage under random projections, and cubic formula lower bounds for AC0 (extended abstract). In James R. Lee, editor, 12th Innovations in Theoretical Computer Science Conference, ITCS 2021, January 6-8, 2021, Virtual Conference, volume 185 of LIPIcs, pages 89:1–89:7. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
  • [GMWW17] Dmitry Gavinsky, Or Meir, Omri Weinstein, and Avi Wigderson. Toward better formula lower bounds: The composition of a function and a universal relation. SIAM J. Comput., 46(1):114–131, 2017.
  • [Hås98] Johan Håstad. The shrinkage exponent of de morgan formulas is 2. SIAM J. Comput., 27(1):48–64, 1998.
  • [HW93] Johan Håstad and Avi Wigderson. Composition of the universal relation. In ADVANCES IN COMPUTATIONAL COMPLEXITY THEORY, AMS-DIMACS, 1993.
  • [Juk12] Stasys Jukna. Boolean Function Complexity - Advances and Frontiers, volume 27 of Algorithms and combinatorics. Springer, 2012.
  • [KM18] Sajin Koroth and Or Meir. Improved composition theorems for functions and relations. Leibniz International Proceedings in Informatics, LIPIcs, 116(48):1–18, 2018.
  • [KRT17] Ilan Komargodski, Ran Raz, and Avishay Tal. Improved average-case lower bounds for de morgan formula size: Matching worst-case lower bound. SIAM Journal on Computing, 46(1):37–57, 2017.
  • [KRW95] Mauricio Karchmer, Ran Raz, and Avi Wigderson. Super-logarithmic depth lower bounds via the direct sum in communication complexity. Comput. Complex., 5(3/4):191–204, 1995.
  • [Mei20] Or Meir. Toward better depth lower bounds: Two results on the multiplexor relation. Comput. Complex., 29(1):4, 2020.
  • [Mei23] Or Meir. Toward better depth lower bounds: A krw-like theorem for strong composition. Electron. Colloquium Comput. Complex., TR23-078, 2023.
  • [Mei24] Or Meir. Personal communication, 2024.
  • [MS21] Ivan Mihajlin and Alexander Smal. Toward better depth lower bounds: The XOR-KRW conjecture. In Valentine Kabanets, editor, 36th Computational Complexity Conference, CCC 2021, July 20-23, 2021, Toronto, Ontario, Canada (Virtual Conference), volume 200 of LIPIcs, pages 38:1–38:24. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
  • [MS22] Ivan Mihajlin and Anastasia Sofronova. A better-than-3log(n) depth lower bound for de morgan formulas with restrictions on top gates. In Shachar Lovett, editor, 37th Computational Complexity Conference, CCC 2022, July 20-23, 2022, Philadelphia, PA, USA, volume 234 of LIPIcs, pages 13:1–13:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
  • [Tal14] Avishay Tal. Shrinkage of de morgan formulae by spectral techniques. In 55th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2014, Philadelphia, PA, USA, October 18-21, 2014, pages 551–560. IEEE Computer Society, 2014.
  • [Tal16] Avishay Tal. Computing requires larger formulas than approximating. Electron. Colloquium Comput. Complex., TR16-179, 2016.
  • [Wu23] Hao Wu. An improved composition theorem of a universal relation and most functions via effective restriction. Electron. Colloquium Comput. Complex., TR23-151, 2023.