A nearly- depth lower bound for formulas with restriction on top
Abstract
One of the major open problems in complexity theory is to demonstrate an explicit function which requires super logarithmic depth, a.k.a, the versus problem. The current best depth lower bound is , and it is widely open how to prove a super- depth lower bound. Recently Mihajlin and Sofronova (CCC’22) show if considering formulas with restriction on top, we can break the barrier. Formally, they prove there exist two functions , such that for any constant and constant , their XOR composition is not computable by an AND of formulas of size at most . This implies a modified version of Andreev function is not computable by any circuit of depth with the restriction that top layers only consist of AND gates for any small constant . They ask whether the parameter can be push up to nearly thus implying a nearly- depth lower bound.
In this paper, we provide a stronger answer to their question. We show there exist two functions , such that for any constant , their XOR composition is not computable by an AND of formulas of size at most . This implies a depth lower bound with the restriction that top layers only consist of AND gates. We prove it by observing that one crucial component in Mihajlin and Sofronova’s work, called the well-mixed set of functions, can be significantly simplified thus improved. Then with this observation and a more careful analysis, we obtain these nearly tight results.
1 Introduction
One of the major open problems in complexity theory is to demonstrate an explicit function which requires super logarithmic depth, a.k.a, the versus problem. The current best depth lower bound [Hås98, Tal14, Tal16] is , and we still don’t even know how to obtain a lower bound strictly larger than . One promising approach to tackle this problem was suggested by Karchmer, Raz and Wigderson [KRW95], they proposed that we should understand the complexity of (block)-composition of Boolean functions. Given two functions , , we define their composite function as: Given any Boolean function , we denote the depth complexity of by , that is the minimal depth of a circuit of AND, OR and NOT gates of fan-in that computes . And it is easy to see the depth complexity of is upper-bounded by and it is natural to ask whether the depth complexity of is far from this upper bound. Karchmer, Raz and Wigderson [KRW95] conjectured that the depth complexity of is not far from its upper bound:
Conjecture 1.1.
Given two arbitrary non-constant Boolean functions and , then
If the conjecture is proved and the “approximate equality” is instantiated with proper parameters, by an argument of iterative composition [KRW95], we will obtain an explicit function with super-logarithmic depth, which separates from . Many restricted cases of KRW conjecture have been proved to be true. For example, there are composition theorems when the inner function satisfies certain property [Hås98, Tal14, DM18, FMT21]. There are composition theorems about universal relation [EIRS01, HW93, GMWW17, KM18, MS21, Wu23]. There are composition theorems where the composition itself is restricted such as monotone composition, semi-monotone composition [dRMN+20] and strong composition [Mei23]. There are also some variants [EIRS01, Mei20, MS21] of original conjecture with the similar effect to the versus problem, but we don’t know how to prove them either. Maybe to prove the general form of KRW conjecture is far out of our reach now.
Note that we don’t even know how to prove a super- depth lower bound, maybe we should consider following weaker conjecture which suffices to break the barrier in the first place.
Conjecture 1.2.
There exist two non-constant Boolean functions such that for some small constant .
Unfortunately, we don’t even know how to prove this weaker conjecture. Currently, the closest answer to Conjecture 1.2 is Meir’s strong composition theorem [Mei23], but we don’t know how to prove it for the case of standard composition. Mihajlin and Sofronova [MS22] proposed we should consider proving depth lower bound against even weaker formulas by considering restriction on top of the formulas. They managed to prove a composition theorem for formulas with restriction on top via XOR composition. The so-called XOR composition, proposed by Mihajlin and Smal [MS21], is a useful special case of standard composition. In fact, the first nontrivial composition theorem [MS21] of a universal relation and some function is proved via XOR composition.
Given two functions , their XOR composition is defined as :
where denotes the bit-wise XOR of two binary strings. Mihajlin and Sofronova [MS22] proved following result.
Theorem 1.3 ([MS22]).
If we choose a function randomly, with probability , there exists a function , such that for any constant and constant , their XOR composition is not computable by an AND (or OR) of formulas of size at most .
This implies a super- depth lower bound for a modified version of the Andreev function against formulas with restriction on top.
Theorem 1.4 ([MS22]).
A modified version of Andreev function is not computable by any circuit of depth with the restriction that top layers only consist of AND(or OR) gates for any small constant .
Such kind of super- depth lower bound is unknown before their work even for such strong restriction. They asked whether their result can be improved as asked by following question.
In this paper, we give a positive answer to their question with an even better result. In fact, we extend the range of parameter to which is nearly optimal.
1.1 Our results
Our main result is an improved XOR composition theorem for formulas with restriction on top. Formally, we have following result.
Theorem 1.6.
Let be the protocol size of any Boolean function . For most functions , there exists a function , such that is not computable by an AND (or OR) of formulas of size at most for any .
This implies a nearly- depth lower bound for formulas with restriction on top.
Theorem 1.7.
A modified version of Andreev function is not computable by any circuit of depth with the restriction that top layers only consist of AND (or OR) gates.
Comparing to the results of Mihajlin and Sofronova [MS22], our results are nearly tight and our approach is much simpler to be described next.
Our approach.
Here we give a concise description of the proof idea of Theorem 1.6. The whole proof strategy is similar to that in [MS22], we call such strategy as a double-measurement argument, a generalized form of the double-counting argument. One crucial component in such argument is the notion of well-mixed set of functions, and our improvement is mainly due to the simplification and improvement for such well-mixed set of functions.
Let be the set of all functions from . Now given a hard function , we want to show there exists a function , such that if can be computed by a formula , there must be a sub-formula such that is large. To show this, Mihajlin and Sofronova defined a sub-additive measure for Boolean functions of two arguments and is large, thus by averaging, for every , there exists some such that is large enough. Note that for every , computes some function , and let be the set of all such functions . If the size of is large, by a standard counting argument, there must be a hard function as required. But may be a small set, to prevent this, we need to show for every , there only exists a small subset of such that for every , is the same function as . Formally, denote the by . Let be the function such that the size of is maximum among all , it suffices to show is a small set. To this end, we need to show an up bound of measurement in another way and this is why the notion of well-mixed set is involved.
Now consider this function . Let be the function with the first argument is fixed to be some , we want to show if the set is large, there are many s such that is almost a constant function, and it eventually implies is small which contradicts the fact that is already large. This is essentially the property that Mihajlin and Sofronova wanted for , or in their terms, is well-mixed for function . In their work, Mihajlin and Sofronova used a rather complicated probabilistic method to show that property.
We will show such complication is entirely unnecessary and the well-mixed property could be obtained by a simple counting argument if you choose the function properly. For convenience, let , now choose a hard function such that , typically, we set to be . Note that given any fixed , if there is a function such that . Given any , denote the set by . Similarly, given , we denote the set by . Given an , if , for any fixed , we also have , since for any fixed , is permutation function of . When , is not empty, this means there exists such that for that , .
Now we say is bad, if . If is a dense subset of , the number of bad s is small. Assume the size of is (at least) , then there are at most bad s. If not, the number of functions in is less than
which is a contradiction. This means, given any which is not bad, the function for any , thus is a constant function. This eventually implies is small which leads to the desired contradiction. Finally, since has to be a small set, the set must be a large set of functions which contains a hard function as required. See more details in Lemma 3.2 and Theorem 3.3.
Other related works.
Besides Mihajlin and Sofronova’s work, we note that Bathie and Williams [BW24] established a super- depth lower bound against uniform circuits consisting of only NAND gates. Since their result is against uniform circuits, it is incomparable to ours. They also pointed out that results similar to Mihajlin and Sofronova’s work could be obtained from average-case lower bounds [KRT17] via restriction-based techniques but they didn’t provide further details. In a personal communication, Meir [Mei24] pointed out that results similar to Mihajlin and Sofronova’s work can also be obtained via techniques from communication complexity but it is not clear whether such results are as tight as ours.
1.2 Organization of the rest of the paper
The rest of the paper is organized as follows. In Section 2, we provide necessary preliminaries. In Section 3, we prove Theorem 1.6, an improved XOR composition theorem of formulas with restriction on top. In Section 4, we prove Theorem 1.7, a nearly- depth lower bound for formulas with restriction on top. In Section 5, we conclude and make some discussion about future directions.
2 Preliminaries
Definition 2.1 (De Morgan formula).
A De Morgan formula is a binary tree, its internal vertices are gates such as or , its leaves are literals such as or its negation . The depth of a formula is the depth of underling tree of the formula. The size of a formula is the number of its leaves.
Definition 2.2.
The formula complexity of a boolean function denoted is the size of the smallest formula that computes The depth complexity of denoted is the smallest depth of a formula that computes .
We will need following fact, given a large set of distinct functions, there is a function with large formula size in that set.
Fact 2.3 ([Juk12], Theorem 1.23).
Let be a set of distinct Boolean functions with input length , then there exists a function such that .
Definition 2.4 (XOR composition of two functions, [MS21]).
Let be two functions, their XOR composition is defined as follows:
where denotes the bit-wise XOR of two binary strings.
We recall a measure of any Boolean function of two arguments defined in [MS22].
Definition 2.5 ([MS22]).
Let be a Boolean function of two arguments, given any fixed as the first argument, we define the function by setting and define
Fact 2.6 ([MS22]).
and for every , .
The measure is sub-additive in the following sense:
Lemma 2.7 ([MS22]).
Let , where is or . Then
Matrix representation for a function of two arguments.
For convenience, we follow a notation in [MS22] which treats a Boolean function of two arguments as a Boolean matrix.
Definition 2.8.
Set , given a function , define a corresponding matrix such that
-
•
the rows of are indexed by and the columns are indexed by ,
-
•
and for every .
Similarly, given two functions , define a matrix such that for every .
Furthermore, given a function and a set of functions from , define a matrix such that for every ,
Finally, given a subset of indexes for rows in a matrix , the matrix is a sub-matrix of restricted to rows indexed by .
Concentration of measure
Theorem 2.9 (Chernoff bound).
Given independent random variables which distribute in , let be their sum and . For any constant such that , we have
and
By Chernoff bound, we have following fact.
Fact 2.10.
Let . If we choose a function randomly, then and .
3 An improved XOR composition theorem for formulas with restriction on top
In this section, we prove Theorem 1.6. At first, let’s recall the notion of well-mixed set of functions.
Definition 3.1 (Well-mixed set of functions, [MS22]).
A set of functions from is -well-mixed for if , there exist a set , such that has no more than zeroes in total where .
Now we show given any approximately balanced function , the set of all functions is already a well-mixed set of functions for .
Lemma 3.2.
Let be a function, be the set of all functions . For convenience, let and . Assume , then is -well-mixed for . Particularly, let be a set of functions and , there exists a set such that and all entries in are ones.
Proof.
Let be a set of functions, given any , denote the set by , and we say is bad if . Similarly, given , we denote the set by .
Now assume there are bad s, the number of functions in is at most
This implies , this means .
Now we show when is not bad, for every , . Since , we have . Since , is not empty, there must be a such that , thus must be as required. ∎
Now we are ready to prove Theorem 1.6 rephrased as follows. It is proved via a similar idea in [MS22] and a more careful analysis.
Theorem 3.3.
Let be a function and , there exists a function , such that is not computable by an AND of formulas of size at most for any .
Proof of Theorem 3.3.
We prove it by contradiction. Let be the set of all functions . Assume the contrary that for all the XOR composition is computable by AND of small enough formulas. That is, for any , there is a formula computing and is of following form
where the size of every is at most . Now let be the function that computes, thus and can be represented as .
Recall that for every , By Lemma 2.7, there must be an such that, the measure is large:
Now we collect all such functions and let be the set of all such functions . We want to show that the size of is large, thus by a standard counting argument, there must be a function which requires large formulas which contradicts the hypothesis.
Given any , denote the by . Let be the function such that the size of is maximum among all . We will prove
before proving this, let’s show it indeed leads to the contradiction required. Recall by assumption , this means
Now we are ready to lower bound the size of , that is the number of distinct functions in . Since ,
By Fact 2.3, there exists an such that
which is a contradiction to the assumption. Now we show by considering the matrix . By Lemma 3.2,
-
•
there exists a set such that
-
•
And all entries in are ones.
Now we have following key fact about the function .
Fact 3.4.
For every , implies .
Proof.
Recall that . If , there must be some such that . Furthermore, by assumption is simply and for some , thus must be as well. ∎
By Fact 3.4, given any , for every , , in other words, is a constant function and . Now we are ready to up bound and have
as required. ∎
Remark 3.5.
We want to point out the AND() gate can be replaced with OR() gate and since we use the counting argument to show there exists a hard function in the set , this result can be extended to the case of formula over full binary basis.
4 A nearly- depth lower bound for formulas with restriction on top
We will prove a depth lower bound for a modified Andreev function defined in [MS22].
Definition 4.1 ([MS22]).
The modified Andreev function is defined as follows:
where is a truth table of some function from , is a truth table of some function from , for every , is a binary string of length and is the parity function.
Theorem 4.2.
There exist two parameters , for every constant such that , the modified Andreev function is not computable by an AND of formulas of size at most .
In terms of depth lower bound, the modified Andreev function is not computable by any circuit of depth with the restriction that top layers only consist of AND gates where . Choose properly, Theorem 1.7 follows.
Remark 4.3.
Note that the input length of the modified Andreev function is , writing the results in terms of doesn’t change them essentially. For example, . Similarly, in the restriction for top gates, AND could be replaced by OR.
Theorem 4.2 is proved via the same idea from [MS22], the only differences here are details of parameters. For completeness of this paper, we present the proof here. At first, we recall the standard notion of random restriction.
Definition 4.4 (Restriction).
Given a Boolean function , a restriction to function is a vector of length and for every , is an element from . Define to be the function restricted according to as follows: if is then the -th input bit of is unfixed thus free to be or ; otherwise the -th input bit of is fixed to be .
Definition 4.5 (Random restriction).
Given , the random restriction randomly samples restrictions as follows: every is sampled independently such that and .
In the rest of this section, we will set . Mihajlin and Sofronova proved following useful two lemmas about random restriction of the modified Andreev function implicitly in [MS22].
Lemma 4.6 (Implicit in [MS22]).
Let be the function hardwired with two fixed functions . Then with probability , the random restriction will turn into .
Lemma 4.7 (Implicit in [MS22]).
Let be two parameters such that . Let be a formula of form where the size of each is at most . Then with probability , the random restriction will turn into a formula of form where the size of each is at most for some .
Now we show how to prove Theorem 4.2 with above two lemmas.
Proof of Theorem 4.2.
At first choose some function from and make sure that is the function with maximum protocol size and . By Fact 2.3 and 2.10, we have when is large enough. By Theorem 3.3, we have following fact.
Fact 4.8.
Let be the function chosen above, there exists a function , such that is not computable by an AND of formulas of size at most for any when is large enough.
Let be the same parameter from Lemma 4.7, and . Choose some such that and set . Now assume Theorem 4.2 is false, that is is computable by formula of form where the size of each is at most , and so is . By Lemma 4.6 and 4.7, is computable by a formula of form where the size of each is at most
which contradicts Fact 4.8. ∎
5 Conclusion and discussion
In this paper, we obtain a nearly-tight XOR composition theorem for formulas with restriction on top and with this composition theorem we have a nearly- depth lower bound for formulas with restriction on top. Intuitively, in such the depth lower bound, we trade one unrestricted layer to nearly two layers of AND gates on top of the circuit and our trade-off is nearly optimal.
The next nature question as pointed out by Mihajlin and Sofronova [MS22] is to prove the case with formula on top. But it turns out even to prove the case of depth-2 formula with unbounded fan-in is difficult. The obstacle to extending current approach to the case of depth-2 formula on top is that we don’t know how to find the sub-formula such that the measure is large enough meanwhile is correlated with properly like that in Fact 3.4. This difficulty also appears in the approach via communication complexity [Mei24], since such result shares the same feature with a composition theorem of a depth-2 formula and a De Morgan formula, but we don’t even know how to prove a composition theorem of two depth-2 formulas in general. New ideas are needed, maybe we should try to prove a general composition theorem of two depth-2 formulas in the first place.
Acknowledgments
The author would like to thank Or Meir for sharing the idea about how to prove results similar to Mihajlin and Sofronova’s work via techniques from communication complexity and other helpful discussions. The author would also like to thank Pei Wu for suggesting the question of the composition of two depth-2 formulas and other helpful discussions.
References
- [BW24] Gabriel Bathie and R Ryan Williams. Towards stronger depth lower bounds. In 15th Innovations in Theoretical Computer Science Conference (ITCS 2024). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2024.
- [DM18] Irit Dinur and Or Meir. Toward the KRW composition conjecture: Cubic formula lower bounds via communication complexity. Comput. Complex., 27(3):375–462, 2018.
- [dRMN+20] Susanna F. de Rezende, Or Meir, Jakob Nordström, Toniann Pitassi, and Robert Robere. KRW composition theorems via lifting. In Sandy Irani, editor, 61st IEEE Annual Symposium on Foundations of Computer Science, FOCS 2020, Durham, NC, USA, November 16-19, 2020, pages 43–49. IEEE, 2020.
- [EIRS01] Jeff Edmonds, Russell Impagliazzo, Steven Rudich, and Jirí Sgall. Communication complexity towards lower bounds on circuit depth. Comput. Complex., 10(3):210–246, 2001.
- [FMT21] Yuval Filmus, Or Meir, and Avishay Tal. Shrinkage under random projections, and cubic formula lower bounds for AC0 (extended abstract). In James R. Lee, editor, 12th Innovations in Theoretical Computer Science Conference, ITCS 2021, January 6-8, 2021, Virtual Conference, volume 185 of LIPIcs, pages 89:1–89:7. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
- [GMWW17] Dmitry Gavinsky, Or Meir, Omri Weinstein, and Avi Wigderson. Toward better formula lower bounds: The composition of a function and a universal relation. SIAM J. Comput., 46(1):114–131, 2017.
- [Hås98] Johan Håstad. The shrinkage exponent of de morgan formulas is 2. SIAM J. Comput., 27(1):48–64, 1998.
- [HW93] Johan Håstad and Avi Wigderson. Composition of the universal relation. In ADVANCES IN COMPUTATIONAL COMPLEXITY THEORY, AMS-DIMACS, 1993.
- [Juk12] Stasys Jukna. Boolean Function Complexity - Advances and Frontiers, volume 27 of Algorithms and combinatorics. Springer, 2012.
- [KM18] Sajin Koroth and Or Meir. Improved composition theorems for functions and relations. Leibniz International Proceedings in Informatics, LIPIcs, 116(48):1–18, 2018.
- [KRT17] Ilan Komargodski, Ran Raz, and Avishay Tal. Improved average-case lower bounds for de morgan formula size: Matching worst-case lower bound. SIAM Journal on Computing, 46(1):37–57, 2017.
- [KRW95] Mauricio Karchmer, Ran Raz, and Avi Wigderson. Super-logarithmic depth lower bounds via the direct sum in communication complexity. Comput. Complex., 5(3/4):191–204, 1995.
- [Mei20] Or Meir. Toward better depth lower bounds: Two results on the multiplexor relation. Comput. Complex., 29(1):4, 2020.
- [Mei23] Or Meir. Toward better depth lower bounds: A krw-like theorem for strong composition. Electron. Colloquium Comput. Complex., TR23-078, 2023.
- [Mei24] Or Meir. Personal communication, 2024.
- [MS21] Ivan Mihajlin and Alexander Smal. Toward better depth lower bounds: The XOR-KRW conjecture. In Valentine Kabanets, editor, 36th Computational Complexity Conference, CCC 2021, July 20-23, 2021, Toronto, Ontario, Canada (Virtual Conference), volume 200 of LIPIcs, pages 38:1–38:24. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
- [MS22] Ivan Mihajlin and Anastasia Sofronova. A better-than-3log(n) depth lower bound for de morgan formulas with restrictions on top gates. In Shachar Lovett, editor, 37th Computational Complexity Conference, CCC 2022, July 20-23, 2022, Philadelphia, PA, USA, volume 234 of LIPIcs, pages 13:1–13:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
- [Tal14] Avishay Tal. Shrinkage of de morgan formulae by spectral techniques. In 55th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2014, Philadelphia, PA, USA, October 18-21, 2014, pages 551–560. IEEE Computer Society, 2014.
- [Tal16] Avishay Tal. Computing requires larger formulas than approximating. Electron. Colloquium Comput. Complex., TR16-179, 2016.
- [Wu23] Hao Wu. An improved composition theorem of a universal relation and most functions via effective restriction. Electron. Colloquium Comput. Complex., TR23-151, 2023.