Hardness of Sparse Sets and Minimal Circuit Size Problem
Abstract
We study the magnification of hardness of sparse sets in nondeterministic time complexity classes on a randomized streaming model. One of our results shows that if there exists a -sparse set in that does not have any randomized streaming algorithm with updating time, and space, then , where a -sparse set is a language that has at most strings of length . We also show that if is -hard under polynomial time truth-table reductions, then .
1 Introduction
Hardness magnification has been intensively studied in the recent years [13, 4, 10, 12]. A small lower bound such as for one problem may bring a large lower bound such as super-polynomial lower bound for another problem. This research is closely related to Minimum Circuit Size Problem (MCSP) that is to determine if a given string of length with integer can be generated by a circuit of size . For a function , is that given a string of length , determine if there is a circuit of size at most to generate . This problem has received much attention in the recent years [3, 2, 13, 8, 7, 6, 5, 4, 12, 11, 10].
Hardness magnification results are shown in a series of recent papers about MCSP [13, 4, 10, 12]. Oliveira and Santhanam [13] show that -size lower bounds for approximating with an additive error implies . Oliveira, Pich and Santhanam [12] show that for all small , -size lower bounds for approximating with factor error implies . McKay, Murray, and Williams [10] show that an lower bound on space deterministic streaming model for implies separation of P from NP.
The hardness magnification of non-uniform complexity for sparse sets is recently developed by Chen, Jin and Williams [4]. Since are of sub-exponential density for , the hardness magnification for sub-exponential density sets is more general than the hardness magnification for . They show that if there is an and a family of languages (indexed over ) such that each is a -sparse language in , and , then for all , where is the class of languages with nonuniform circuits of size bounded by function . Their result also holds for all complexity classes with .
On the other hand, it is unknown if MCSP is NP-hard. Murray and Williams [11] show that NP-completeness of MCSP implies the separation of EXP from ZPP, a long standing unsolved problem in computational complexity theory. Hitchcock and Pavan [8, 11] if MCSP is -hard under polynomial time truth-table reductions, then EXP.
Separating NEXP from BPP, and EXP from ZPP are two of major open problems in the computational complexity theory. We are motivated by further relationship about sparse sets and MCSP, and the two separations and . We develop a polynomial method on finite fields to magnify the hardness of sparse sets in nondeterministic time complexity classes over a randomized streaming model. One of our results show that if there exists a -sparse set in that does not have a randomized streaming algorithm with updating time, and space, then , where a -sparse set is a language that has at most strings of length . Our magnification result has a flexible trade off between the spareness and time complexity.
We use two functions and to control the sparseness of a tally set . Function gives an upper bound for the number of elements of in and is the gap lower bound between a string and the next string in , which satisfy . The class defines the class of all those tally sets. By choosing , and , we prove that if MCSP is -hard under polynomial time truth-table reductions, then .
1.1 Comparison with the existing results
Comparing with some existing results about sparse sets hardness magnification in this line [4], there are some new advancements in this paper.
- 1.
-
2.
Our method is conceptually simple, and easy to understand. It is a polynomial algebraic approach on finite fields.
-
3.
A flexible trade off between sparseness and time complexity is given in our paper.
Proving NP-hardness for MCSP implies [8, 11]. We consider the implication of ZPP-hardness for , and show that if MCSP is -hard for a function pair such as and , then . It seems that proving MCSP is -hard is much easier than proving MCSP is NP-hard since . According to the low-high hierarchy theory developed by Schöning [14], the class is the low class . Although may not be in the class , it is possible to be -hard.
2 Notations
Minimum Circuit Size Problem (MCSP) is that given an integer , and a binary string of length for some integer , determine if can be generated by a circuit of size . Let be the set of all natural numbers. For a language , is the set of strings in of length , and is the set of strings in of length at most . For a finite set , denote to be the number of elements in . For a string , denote to be its length. If are not empty strings, we have a coding method that converts a into a string with and converts into with . For example, for , let .
Let be the class of languages accepted by deterministic Turing machines in time . Let be the class of languages accepted by nondeterministic Turing machines in time . Define and . P/poly, which is also called PSIZE, is the class of languages that have polynomial-size circuits.
We use a polynomial method on a finite field . It is classical theory that each finite field is of size for some prime number and integer (see [9]). For a finite field , we denote to represent , where is a irreducible polynomial over field for the prime number and its degree is . The polynomial is equal to if is of size , which is a prime number. Each element of with is a polynomial with degree less than the degree of . For two elements and in , their addition is defined by , and their multiplication is defined by (see [9]). Each element in is a polynomial (), which is represented by a binary string of length .
We use field in our randomized streaming algorithm for hardness magnification . Let be a field (a field of size ) and has its . Let be a binary string of length with , and be a variable. Define to be the element in . Let be a string in and be an integer at least . Let such that each is a substring of of length for , and the substring has its length . Each is called a -segment of for . Define the polynomial , which converts a binary string into a polynomial in .
We develop a streaming algorithm that converts an input string into an element in a finite field. We give the definition to characterize the properties of the streaming algorithm developed in this paper. Our streaming algorithm is to convert an input stream into an element by selecting a random element from .
Definition 1
Let be nondecreasing functions from to . Define to be the class of languages that have one-pass streaming algorithms that has input with ( is a string and read by streaming), it satisfies
-
i.
It takes time to generate a field , which is represented by with a irreducible polynomial over of degree .
-
ii.
It takes random steps before reading the first bit from the input stream .
-
iii.
It uses space that includes the space to hold the field representation generated by the algorithm. The space for a field representation is and for the irreducible polynomial over .
-
iv.
It takes field conversions to elements in and field operations in after reading each bit.
-
v.
It runs randomized steps after reading the entire input.
3 Overview of Our Methods
In this section, we give a brief description about our methods used in this paper. Our first result is based on a polynomial method on a finite field whose size affects the hardness of magnification. The second result is a translational method for zero-error probabilistic complexity classes.
3.1 Magnify the Hardness of Sparse Sets
We have a polynomial method over finite fields. Let be -sparse language in . In order to handle an input string of size , a finite field with for some integer is selected, and is represented by , where is a irreducible polynomial over . An input is partitioned into -segments such that each is converted into an element in , and is transformed into an polynomial . A random element is chosen in the beginning of streaming algorithm before processing the input stream. The value is evaluated with the procession of input stream. The finite is large enough such that for different and of the same length, and are different polynomials due to their different coefficients derived from and , respectively. Let be the set of all with and . Set is the union of all with . The set of is . A small lower bound for the language is magnified to large lower bound for .
The size of field depends on the density of set and is . By the construction of , if , there are tuples in that are generated by via all in . For two different and of length , the intersection is bounded by the degree of . If , the number of items generated by is at most in . If , the number of items generated by is in . This enables us to convert a string of length in into some strings in of length much smaller than , make the hardness magnification possible.
3.2 Separation by ZPP-hardness of MCSP
Our another result shows that ZPP-hardness for MCSP implies . We identify a class of functions that are padding stable, which has the property if , then . The function pair and has this property. We construct a very sparse tally set that separates from , where is the zero error exponential time probabilistic class. It is based on a diagonal method that is combined with a padding design. A tally language has a zero-error -time probabilistic algorithm implies has a zero-error -time probabilistic algorithm. Adapting to the method of [11], we prove that if MCSP is -hard under polynomial time truth-table reductions, then .
4 Hardness Magnification via Streaming
In this section, we show a hardness magnification of sparse sets via a streaming algorithm. A classical algorithm to find irreducible polynomial [15] is used to construct a field that is large enough for our algorithm.
Theorem 2
[15] There is a deterministic algorithm that constructs a irreducible polynomial of degree in operations in , where is a finite field with prime number .
Definition 3
Let be a function from to . For a language , we say is -sparse if for all large integer .
4.1 Streaming Algorithm
The algorithm Streaming(.) is based on a language that is -sparse. It generates a field and evaluates with a random element in . A polynomial can be evaluated by according to the classical Horner’s algorithm. For example, .
Algorithm
Streaming()
Input: an integer , and string of the length ;
Steps:
-
1.
Select a field size such that .
-
2.
Generate an irreducible polynomial of degree over such that represents finite (by Theorem 2 with );
-
3.
Let be a random element in ;
-
4.
Let ; (Note that is the number of -segments of . See Section 2)
-
5.
Let ;
-
6.
Let ;
-
7.
Repeat
-
8.
{
-
9.
Receive the next -segment from the input stream ;
-
10.
Convert into an element in ;
-
11.
Let ;
-
12.
Let ;
}
-
13.
Until (the end of the stream);
-
14.
Output ;
End of Algorithm
Now we have our magnification algorithm. Let be a randomized Turing machine to accept a language that contains all with and . We have the following randomized streaming algorithm to accept via the randomized algorithm for .
Algorithm
Magnification()
Input integer and as a stream;
Steps: Let Streaming(); Accept if accepts;
End of Algorithm
4.2 Hardness Magnification
In this section, we derive some results about hardness magnification via sparse set. Our results show a trade off between the hardness magnification and sparseness via the streaming model.
Definition 4
For a nondecreasing function , define the class of languages that have two-side bounded error probabilistic algorithms with time complexity . Define .
Theorem 5
Assume that be nondecreasing function for the time to generate an irreducible polynomial of degree in , and be the nondecreasing function of a time upper bound for the operations () in . Let be nondecreasing functions with , , and for all large . If there is a -sparse set with and , then there is a language such that and .
Proof: Select a finite field with for an integer by line 1 of the algorithm streaming(.). For each , let be partitioned into -segments: . Let convert into an element of (See Section 2). Define polynomial . For each , let be the set , where . Define set for , and language .
Claim 1. For any with , we have .
Proof: Assume that for some with , . It is easy to see that and for all large by the algorithm Streaming(.) and the condition of in the theorem. Assume that for every . Since is the union with and , there are at most elements in by line 1 of the algorithm Streaming(.). Thus, . This brings a contradiction. Therefore, there is a to have . Since the polynomials and are of degrees at most , we have (two polynomials are equal). Thus, . This brings a contradiction because and .
Claim 2. If , then Streaming. Otherwise, with probability at most , Streaming.
Proof: For each , it generates for a random . Each determines a random path. We have that if , then , and if , then with probability at most by Claim 1.
Claim 3. .
Proof: Let . Each element in field is of length . For each (), we need to guess a string such that . It is easy to see that for all large if (See Section 2 about coding). Let . It takes at most steps to generate a irreducible polynomial for the field by our assumption.
Since , checking if takes nondeterministic steps to guess a string , deterministic steps to generate for the field , nondeterministic steps to generate a random element , and additional steps to evaluate in by following algorithm Streaming(.) and check . The polynomial in the has degree at most . Each polynomial operation ( or .) in takes at most steps. Since time by the condition of this theorem, we have .
Claim 4. If , then .
Proof: The field generated at line 2 in algorithm Streaming(.) takes time. Let be the input string. The string partitioned into -segments . Transform each into an element in in the streaming algorithm. We generate a polynomial . Given a random element , we evaluate according to the classical algorithm. Therefore, is evaluated in Streaming(.) with input .
If , then has a randomized streaming algorithm that has at most random steps after reading the input, and at most space. After reading one substring from , it takes one conversion from a substring of the input to an element of field by line 10, and at most two field operations by line 11 in the algorithm Streaming(.).
Claim 4 brings a contradiction to our assumption about the complexity of in the theorem. This proves the theorem.
Proposition 6
Let be a nondecreasing function. If for each fixed , for all large , then there is a nondecreasing unbounded function with .
Proof: Let . For each , let be the least integer such that and for all . Clearly, we have the infinite list such that . Define function such that for all . For each , we have .
Our Definition 7 is based Proposition 6. It can simplify the proof when we handle a function that is .
Definition 7
A function is if there is a nondecreasing function such that and for all large . A function is if there is a nondecreasing function such that and for all large .
Corollary 8
If there exists a -sparse language in such that does not have any randomized streaming algorithm with updating time, and space, then .
Proof: Let be an arbitrary unbounded nondecreasing function that satisfies and . Let and Let , , and .
It is easy to see that , and both and are (see Theorem 2). For any fixed , we have for all large . For all large , we have
(1) | |||||
(2) |
Clearly, these functions
satisfy the inequality of the precondition in
Theorem 5.
Assume .
With space, we have a field representation
with . Thus, each field operation takes
time by the brute force method for polynomial
addition and multiplication. We have by inequality (2). Thus, the
streaming algorithm updating time is . Therefore,
we have that has a randomized streaming algorithm with
updating time, and space. This gives a
contradiction. So,
. By
Theorem 5, there is such that
. Therefore, . Thus,
.
5 Implication of ZPP-Hardness of MCSP
In this section, we show that if MCSP is -hard, then . The conclusion still holds if is replaced by a very sparse subclass of languages.
Definition 9
For a nondecreasing function , define
the class of languages that have zero-error
probabilistic algorithms with time complexity .
Define , and
.
Definition 10
For an nondecreasing function , define to be the class of tally set such that for each , there is an integer with . For a tally language , define .
Definition 11
For two languages and , a polynomial time truth-table reduction from to is a polynomial time computable function such that for each instance for , to satisfy if and only if , where is circuit of input bits and is the characteristic function of .
Let be a type of polynomial time reductions ( represents polynomial time truth-table reductions), and be a class of languages. A language is -hard under reductions if for each , .
Definition 12
Let be an integer. Define two classes of functions with recursions: (1) , and . (2) , and .
Definition 13
For two nondecreasing functions , the pair is time constructible if can be computed in time steps.
Definition 14
Define to be the class of tally sets such that and for any two strings with , they satisfy . We call to be the density function and to be the gap function. A gap function is padding stable if for all .
Lemma 15
-
i.
Assume the gap function is padding stable. If , then .
-
ii.
For each integer , is padding stable.
Proof: Part i. Let be a string in . The next shortest string with satisfies . We have and are two consecutive neighbor strings in such that there is no other string with . We have . Since the strings in are one-one mapped from the strings in with length less than , , we have . This proves Part (i).
Lemma 16
Let and be nondecreasing unbounded functions from
to , and is time constructible. Then there exists
a time constructible increasing unbounded function
such that
.
Proof: Compute the least integer with . Let be the number of steps for the computation. Define . Assume that has been defined. We determine the function value below.
For an integer , compute and the least numbers such that . Assume the computation above takes steps. Define to be the . For each language , there are at most strings in with length at most . On the other hand, by the increasing list . Therefore, we have . Furthermore, we also have . Since is the number of steps to determine the values and . We have . Thus, can be computed in steps by spending some idle steps. Therefore, the function is time constructible.
We will use the notion to characterize extremely sparse tally sets with fast growing function such as . It is easy to see that , where is the identity function .
Lemma 17
Let and be nondecreasing unbounded functions. If function is padding stable, then there is a language such that and .
Proof:
It is based on the classical translational method. Assume . Let be a time constructible
increasing unbounded function via Lemma 16 such that
. Let and
. Let be a tally language in
, but it is not in
. Such a language can be constructed via a
standard diagonal method. Let be the list of
Turing machines such that each has time upper bound by
function . Define language such that for
each , if and only if rejects in
steps. We have by
Lemma 16.
Let . We have by Lemma 15. We have . Thus, . So, . Therefore, . We have . Thus, . This brings a contradiction.
Theorem 18
Let and be nondecreasing unbounded functions from to . Assume that is padding stable. If MCSP is -hard under polynomial time truth-table reductions, then EXP.
Proof: Assume that MCSP is -hard under polynomial time truth-table reductions, and .
Let be a language in , but by Lemma 17. Let . Clearly, every string in has the property that for some integer . This property is easy to check and we reject all strings without this property in linear time. We have . Therefore, there is a polynomial time truth-table reduction from to MCSP via a polynomial time truth-table reduction . Let polynomial be the running time for for a fixed and .
Define the language the -th bit of -th query of is equal to and . We can easily prove that is in . Therefore, (See [1]).
Therefore, there is a class of polynomial size circuits to recognize such that recognize all with in . Assume that the size of is of size at most for a fixed . For an instance for , consider the instance for . We can compute all non-adaptive queries to in time via . If , the answer from for the query is yes since can be generated as one of the instances via the circuit . If , we can use a brute force method to check if there exists a circuit of size at most to generate . It takes time. Therefore, . Thus, . This bring a contradiction as we already assume .
Corollary 19
For any integer , if MCSP is -hard under polynomial time truth-table reductions, then EXP.
Corollary 20
For any integer , if MCSP is -hard under polynomial time truth-table reductions, then EXP.
Corollary 21
If MCSP is -hard under polynomial time truth-table reductions, then EXP.
6 Conclusions
In this paper, we develop an algebraic method to magnify the hardness of sparse sets in nondeterministic classes via a randomized streaming model. It has a flexible trade off between the sparseness and time complexity. This shows connection to the major problems to prove . We also prove that if is ZPP-hard, then .
Acknowledgements: This research was supported in part by National Science Foundation Early Career Award 0845376, and Bensten Fellowship of the University of Texas Rio Grande Valley. Part of this research was conducted while the author was visiting the School of Computer Science and Technology of Hengyang Normal University in the summer of 2019 and was supported by National Natural Science Foundation of China 61772179.
References
- [1] L. Adleman. Two theorems on random polynomial time. In Proceedings of the 19th Annual IEEE Symposium on Foundations of Computer Science, pages 75–83, 1978.
- [2] E. Allender and S. Hirahara. New insights on the (non-)hardness of circuit minimization and related problems. In 42nd International Symposium on Mathematical Foundations of Computer Science, MFCS 2017, August 21-25, 2017 - Aalborg, Denmark, pages 54:1–54:14, 2017.
- [3] E. Allender, D. Holden, and V. Kabanets. The minimum oracle circuit size problem. Computational Complexity, 26(2):469–496, 2017.
- [4] L. Chen, C. Jin, and R. Williams. Hardness magnification for all sparse NP languages. Electronic Colloquium on Computational Complexity (ECCC), 26:118, 2019.
- [5] S. Hirahara, I. C. Oliveira, and R. Santhanam. Np-hardness of minimum circuit size problem for OR-AND-MOD circuits. In 33rd Computational Complexity Conference, CCC 2018, June 22-24, 2018, San Diego, CA, USA, pages 5:1–5:31, 2018.
- [6] S. Hirahara and R. Santhanam. On the average-case complexity of MCSP and its variants. In 32nd Computational Complexity Conference, CCC 2017, July 6-9, 2017, Riga, Latvia, pages 7:1–7:20, 2017.
- [7] S. Hirahara and O. Watanabe. Limits of minimum circuit size problem as oracle. In 31st Conference on Computational Complexity, CCC 2016, May 29 to June 1, 2016, Tokyo, Japan, pages 18:1–18:20, 2016.
- [8] J. M. Hitchcock and A. Pavan. On the np-completeness of the minimum circuit size problem. In 35th IARCS Annual Conference on Foundation of Software Technology and Theoretical Computer Science, FSTTCS 2015, December 16-18, 2015, Bangalore, India, pages 236–245, 2015.
- [9] T. Hungerford. Algebra. Springer-Verlag, 1974.
- [10] D. M. McKay, C. D. Murray, and R. R. Williams. Weak lower bounds on resource-bounded compression imply strong separations of complexity classes. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, STOC 2019, Phoenix, AZ, USA, June 23-26, 2019., pages 1215–1225, 2019.
- [11] C. D. Murray and R. R. Williams. On the (non) np-hardness of computing circuit complexity. Theory of Computing, 13(1):1–22, 2017.
- [12] I. C. Oliveira, J. Pich, and R. Santhanam. Hardness magnification near state-of-the-art lower bounds. In 34th Computational Complexity Conference, CCC 2019, July 18-20, 2019, New Brunswick, NJ, USA., pages 27:1–27:29, 2019.
- [13] I. C. Oliveira and R. Santhanam. Hardness magnification for natural problems. In 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, Paris, France, October 7-9, 2018, pages 65–76, 2018.
- [14] U. Schöning. A low and a high hierarchy within NP. JCSS, 27:14–28, 1983.
- [15] V. Shoup. New algorithms for finding irreducible polynomials over finite fields. In 29th Annual Symposium on Foundations of Computer Science, White Plains, New York, USA, 24-26 October 1988, pages 283–290, 1988.