On versus and Frege Systems
By Tianrong Lin
Abstract
We prove in this paper that there is a language accepted by some nondeterministic Turing machines but not by any -machines (defined later). Then we further show that is in , thus proving that . The main techniques used in this paper are lazy-diagonalization and the novel new technique developed in the author’s recent work [Lin21]. Further, since there exists some oracle such that , we then explore what mystery lies behind it and show that if and under some rational assumptions, then the set of all polynomial-time co-nondeterministic oracle Turing machines with oracle is not enumerable, thus showing that the technique of lazy-diagonalization is not applicable for the first half of the whole step to separate from . As a by-product, we reach the important result that [Lin21] once again, which is clear from the above outcome and the well-known fact that . Next, we show that the complexity class has intermediate languages, i.e., there exists a language , which is not in and not -complete. We also summarize other direct consequences implied by our main outcome, such as , and there exists no super proof system. Lastly, we show a lower bounds result for Frege proof systems, i.e., no Frege proof systems can be polynomially bounded.
Table of Contents
1. Introduction
2. Preliminaries
3. All -machines are enumerable
4. Lazy-diagonalization against All -machines
5. Proving That
6. Breaking the Relativization Barrier
7. Rich Structure of
8. Frege Systems
9. Conclusions
References
1 Introduction
It is well-known that computational complexity theory is a central subfield of theoretical computer science and mathematics, which mainly concerns the efficiency of Turing machines (i.e., algorithms), or the intrinsic complexity of computational tasks, i.e., focuses on classifying computational problems according to their resource usage, and relating these classes to each other (see e.g. [A1]). In other words, computational complexity theory specifically deals with fundamental questions such as what is feasible computation and what can and cannot be computed with a reasonable amount of computational resources in terms of time or space (memory). What’s exciting is that there are many remarkable open problems in the area of computational complexity theory, such as the famous versus problem and the important unknown relation between the complexity class and the complexity class , which deeply attract many researchers to conquer them like crazy. However, despite decades of effort, very little progress has been made on these important problems. Furthermore, as pointed out by Wigderson [Wig07], to understand the power and limits of efficient computation has led to the development of computational complexity theory, and this discipline in general, and the aforementioned famous open problems in particular, have gained prominence within the mathematics community in the past decades.
As introduced in the Wikipedia [A1], in the field of computational complexity theory, a problem is regarded as inherently difficult if its solution requires significant resources such as time and space, whatever the algorithm used. The theory formalizes this intuition, by introducing mathematical models of computing to study these problems and quantifying their computational complexity, i.e., the amount of resources needed to solve them. However, it is worth noting that other measures of complexity are also used, such as the amount of communication, which appeared in communication complexity, the number of gates in a circuit, which appeared in circuit complexity, and so forth (see e.g. [A1]). Going further, one of the roles of computational complexity theory is to determine the practical limits on what computers (the computing models) can and cannot do.
Recently, in the author’s work [Lin21], we left an important open conjecture in computational complexity untouched on, which is one of the aforementioned open problems, i.e., the versus problem (the unknown relation between complexity classes and ). Naturally, one will ask, are these two complexity classes the same? Further, also note that there is a subfield of computational complexity theory, namely, the proportional proof complexity, which was initiated by Cook and Reckhow [CR79] and is devoted to the goal of proving the conjecture
In history, the fundamental measure of time opened the door to the study of the extremely expressive time complexity class , one of the most important classical complexity classes, i.e., nondeterministic polynomial-time. Specifically, is the set of decision problems for which the problem instances, where the answer is “yes”, have proofs verifiable in polynomial time by some deterministic Turing machine, or alternatively the set of problems that can be solved in polynomial time by a nondeterministic Turing machine (see e.g. [A2]). The famous Cook-Levin theorem [Coo71, Lev73] shows that this class has complete problems, which states that the Satisfiability is -complete, i.e., Satisfiability is in and any other language in can be reduced to it in polynomial-time. At the same time, it is also worth noting that this famous and seminal result opened the door to research into the renowned rich theory of -completeness [Kar72].
On the other hand, the complexity class is formally defined as follows: For a complexity class , its complement is denoted by (see e.g. [Pap94]), i.e.,
where is a decision problem, and is the complement of (i.e., with assumption that the language is over the alphabet ). That is to say, is the complement of . Note that, the complement of a decision problem is defined as the decision problem whose answer is “yes” whenever the input is a “no” input of , and vice versa. In other words, for instance, according to the Wikipedia’s language [A3], is the set of decision problems where there exists a polynomial and a polynomial-time bounded Turing machine such that for every instance , is a no-instance if and only if: for some possible certificate of length bounded by , the Turing machine accepts the pair . To the best of our knowledge, the complexity class was introduced for the first time by Meyer and Stockmeyer [MS72] with the name “”, and Stockmeyer wrote a full paper on the polynomial hierarchy which also uses the notation ; see e.g. [Sto77].
The versus problem is deeply connected to the field of proof complexity. Here are some historical notes: In 1979, Cook and Reckhow [CR79] introduced a general definition of propositional proof system and related it to mainstream complexity theory by pointing out that such a system exists in which all tautologies have polynomial length proofs if and only if the two complexity classes and coincide; see e.g. [CN10]. We refer the reader to the reference [Coo00] for the importance of this research field and to the reference [Kra95] for the motivation of the development of this rich theory. Apart from those, Chapter of [Pap94] also contains the introductions of importance of the problem
In this paper, our main goal is to settle the above important open conjecture.
In the next few paragraphs, let us review some of the interesting background, the current status, the history, and the main goals of the field of proof complexity. On the one hand, as we all know well, proof theory is a major branch of mathematical logic and theoretical computer science with which proofs are treated as formal mathematical objects, facilitating their analysis by mathematical techniques (see e.g. [A4]). Some of the major areas of proof theory include structural proof theory, ordinal analysis, automated theorem proving, and proof complexity, and so forth. It should be pointed out that much research also focuses on applications in computer science, linguistics, and philosophy, from which we see that the impact of proof theory is very profound. Moreover, as pointed out in [Raz04] by Razborov, proof and computations are among the most fundamental concepts relevant to virtually any human intellectual activity. Both have been central to the development of mathematics for thousands of years. The effort to study these concepts themselves in a rigorous, metamathematical way initiated in the th century led to flourishing of mathematical logic and derived disciplines.
On the other hand, according to the points of view given in [Rec76], logicians have proposed a great number of systems for proving theorems, which give certain rules for constructing proofs and for associating a theorem (formula) with each proof. More importantly, these rules are much simpler to understand than the theorems. Thus, a proof gives a constructive way of understanding that a theorem is true. Specifically, a proof system is sound if every theorem is true, and it is complete if every true statement (from a certain class) is a theorem (i.e., has a proof); see e.g. [Rec76].
Indeed, the relationship between proof theory and proof complexity is inclusion. Within the area of proof theory, proof complexity is the subfield aiming to understand and analyse the computational resources that are required to prove or refute statements (see e.g. [A5]). Research in proof complexity is predominantly concerned with proving proof-length lower and upper bounds in various propositional proof systems. For example, among the major challenges of proof complexity is showing that the Frege system, the usual propositional calculus, does not admit polynomial-size proofs of all tautologies; see e.g. [A5]. Here the size of the proof is simply the number of symbols in it, and a proof is said to be of polynomial size if it is polynomial in the size of the tautology it proves. Moreover, contemporary proof complexity research draws ideas and methods from many areas in computational complexity, algorithms and mathematics. Since many important algorithms and algorithmic techniques can be cast as proof search algorithms for certain proof systems, proving lower bounds on proof sizes in these systems implies run-time lower bounds on the corresponding algorithms; see e.g. [A5].
At the same time, propositional proof complexity is an area of study that has seen a rapid development over the last two decades, which plays as important a role in the theory of feasible proofs as the role played by the complexity of Boolean circuits in the theory of efficient computations, see e.g. the celebrated work [Raz15] by Razborov. In most cases, according to [Raz15], the basic question of propositional proof complexity can be reduced to that given a mathematical statement encoded as a propositional tautology and a class of admissible mathematical proofs formalized as a propositional proof system , what is the minimal possible complexity of a -proof of ? In other words, propositional proof complexity aims to understand and analyze the computational resources required to prove propositional tautologies, in the same way that circuit complexity studies the resources required to compute Boolean functions. In the light of the point of view given in [Raz15], the task of proving lower bounds for strong proof systems like Frege or Extended Frege modulo any hardness assumption in the purely computational world may be almost as interesting; see e.g. [Raz15]. It is worth noting that, it has been observed by Razborov [Raz15] that implies lower bounds for any propositional proof system. In addition, see e.g. [Raz03] for interesting directions in propositional proof complexity.
Historically, systematic study of proof complexity began with the seminal work of Cook and Reckhow [CR79] who provided the basic definition of a propositional proof system from the perspective of computational complexity. In particular, as pointed out by the authors of [FSTW21], the work [CR79] relates the goal of propositional proof complexity to fundamental hardness questions in computational complexity such as the versus problem: establishing super-polynomial lower bounds for every propositional proof system would separate from .
Let us return back to the versus problem for this moment. As a matter of fact, the so-called versus problem is the central problem in proof complexity; see e.g. the excellent survey [Kra19]. It formalizes the question of whether or not there are efficient ways to prove the negative cases in our, and in many other similar examples. Moreover, the ultimate goal of proof complexity is to show that there is no universal propositional proof system allowing for efficient proofs of all tautologies, which is equivalent to showing that the computational complexity class is not closed under the complement; see e.g. [Kra19].
Particularly, in 1974, Cook and Reckhow showed in [CR74] that there exists a super proof system if and only if is closed under complement; that is, if and only if . It is, of course, open whether any super proof system exists. The above result has led to what is sometimes called “Cook’s program” for proving : prove superpolynomial lower bounds for proof lengths in stronger and stronger proposition proof systems, until they are established for all abstract proof systems. Cook’s program, which also can be seen as a program to prove , is an attractive and plausible approach; unfortunately, it has turned out to be quite hard to establish superpolynomial lower bounds on common proof systems, see e.g. the excellent survey on proof complexity [Bus12].
In this paper, the proof systems we consider are those familiar from textbook presentations of logic, such as the Frege systems which were mentioned previously. It can be said that the Frege proof system is a “textbook-style” propositional proof system with Modus Ponens as its only rule of inference. In fact, before our showing that , although is considered to be very likely true, researchers are not able to prove that some very basic proof systems are not polynomially bounded; see e.g. [Pud08]. In particular, the following open problem is listed as the first open problem in [Pud08]:
Prove a superpolynomial lower bound on the length of proofs for a Frege system (or prove that it is polynomially bounded).
However, despite decades of effort, very little progress has been made on the above mentioned open problem.
1.1 Main Results
In this paper, we explore and settle the aforementioned open problems. Our first main goal in this paper is to prove the following most important theorem:
Theorem 1
There is a language accepted by a nondeterministic Turing machine but by no -machines, i.e., . Further, it can be proved that . That is,
Further, since there exists some oracle for which by the result of [BGS75], and just as the result that [BGS75] led to a strong belief that problems with contradictory relativization are very hard to solve and are not amenable to current proof techniques, i.e., the solutions of such problems are beyond the current state of mathematics (see e.g. [HCCRR93, Hop84]), the conclusion also suggests that the problem of separating from is beyond the current state of mathematics. So, the next result, i.e., the following theorem, is to break the so-called Relativization Barrier:
Theorem 2
Under some rational assumptions (see Section 6 below), and if , then the set of all polynomial-time co-nondeterministic oracle Turing machine with oracle is not enumerable. Thereby, the ordinary diagonalization techniques (lazy-diagonalization) will generally not apply to the relativized versions of the versus problem.
By the well-known fact that and the Theorem 1, it immediately follows that
Corollary 3
.
Thus, we reach the important result that [Lin21] once again, which can be seen as a by-product of Theorem 1 and the well-known fact that .
Let and be the complexity classes defined by
and
respectively (the complexity classes and are defined below, see Section 2). Then, besides the above obvious corollary, we also have the following consequence of Theorem 1, whose proof can be similar to that of Theorem 1 (e.g., a point is setting the counter of tape to count up to in the proof of Theorem 4.1):
Corollary 4
.
Since (where is the class of languages accepted by deterministic Turing machines in time ; see Section 2 for big notation), this, together with Corollary 4, follows that . Furthermore, the complexity class (bounded-error probabilistic polynomial-time) aims to capture the set of decision problems efficiently solvable by polynomial-time probabilistic Turing machines (see e.g. [AB09]), and a central open problem in probabilistic complexity theory is the relation between and . However, currently researchers only know that is sandwiched between and (i.e., ) but are even unable to show that is a proper subset of ; see e.g. page 126 of [AB09]. With the consequence of Corollary 4 (i.e., ) at hand, it immediately follows that
Corollary 5
.
It is interesting that the complexity class has a rich structure if and differ. Specifically, in [Lad75], Lander constructed a language that is -intermediate under the assumption that . In fact, since by Theorem 1, we know that and symmetrically also have the following interesting outcome saying that the complexity class has the intermediate languages. To see so, let be the -intermediate language constructed in [Lad75]; then is a -intermediate language in . In other words, we will prove in detail the following important result:
Theorem 6
There are -intermediate languages, i.e., there exists language which is not in and not -complete.
By the result of [CR74], there exists a super proof system if and only if is closed under complement. Then, by Theorem 1, we clearly have the following:
Corollary 7
There exists no super proof system.
Moreover, we will settle the open problem listed as the first in [Pud08]; See the introduction in Section 1. Namely, we show the following theorem with respect to the aforementioned problem:
Theorem 8
There exists no polynomial such that for all , there is a Frege proof of of length at most . In other words, no Frege proof systems of can be polynomially bounded.
1.2 Overview
The rest of the paper is organized as follows: For the convenience of the reader, we will review some notions closely associated with our discussions and fix some notation we will use in the following context in the next Section, with some useful technical lemmas also presented in the next Section. In Section 3, we prove that all -machines are enumerable, thus showing that there exists an enumeration of all -machines. Section 4 contains the definition of our nondeterministic Turing machine, which accepts a language not in , showing the desired lower bounds. And the Section 5 is devoted to showing that the language is in fact in , proving the upper bounds required, in which we finish the proof of Theorem 1 at our own convenience. In Section 6 we will show Theorem 2 to break the so-called Relativization Barrier. In Section 7, we show that there are -intermediate languages in the complexity class . In Section 8, we prove the Theorem 8 which says that no Frege proof systems are polynomially bounded, thus answering an important open question in the area of proof complexity. Finally, we draw some conclusions in the last Section.
2 Preliminaries
In this section, we introduce some notions and notation that will be used in what follows.
Let where , i.e., the set of all positive integers. We also denote by the set of natural numbers, i.e.,
Let be an alphabet, for finite words , the concatenation of and , denoted by , is . For example, suppose that , and , , then
The length of a finite word , denoted as , is defined to be the number of symbols in it. It is clear that for finite words and ,
The big notation indicates the order of growth of some quantity as a function of or the limiting behavior of a function. For example, that is big of , i.e.,
means that there exist a positive integer and a positive constant such that
for all .
The little notation also indicates the order of growth of some quantity as a function of or the limiting behavior of a function but with different meaning. Specifically, that is little of , i.e.,
means that for any constant , there exists a positive integer such that
for all .
Throughout this paper, the computational modes used are nondeterministic Turing machines (or their variants such as nondeterministic Turing machines with oracle). We follow the standard definition of a nondeterministic Turing machine given in the standard textbook [AHU74]. Let us first introduce the precise definition of a nondeterministic Turing machine as follows:
Definition 2.1 (-tape nondeterministic Turing machine, [AHU74])
A -tape nondeterministic Turing machine (shortly, NTM) is a seven-tuple where:
-
1.
is the set of states.
-
2.
is the set of tape symbols.
-
3.
is the set of input symbols; .
-
4.
, is the blank.
-
5.
is the initial state.
-
6.
is the final (or accepting) state.
-
7.
is the next-move function, or a mapping from to subsets of
Suppose
and the nondeterministic Turing machine is in state with the th tape head scanning tape symbol for . Then in one move the nondeterministic Turing machine enters state , changes symbol to , and moves the th tape head in the direction for and .
Let be a nondeterministic Turing machine, and be an input. Then represents that is on input .
A nondeterministic Turing machine works in time (or of time complexity ), if for any input where is the input alphabet of , will halt within steps for any input .
Now, it is time for us to introduce the concept of a polynomial-time nondeterministic Turing machine as follows:
Definition 2.2 (cf. polynomial-time deterministic Turing machines in [Coo00])
Formally, a polynomial-time nondeterministic Turing machine is a nondeterministic Turing machine such that there exists , for all input of length where , will halt within steps.
By default, a word is accepted by a nondeterministic time-bounded Turing machine if there exists a computation path such that (i.e., stop in the “accepting” state) on that computation path described by . This is the “exists” accepting criterion for nondeterministic Turing machines in general, based on which the complexity class is defined:
Definition 2.3
The set of languages decided by nondeterministic Turing machines within time is denoted by . Thus,
Apart from the “exists” accepting criterion for defining the complexity class , there is another “for all” accepting criterion111Originally, for a polynomial-time nondeterministic Turing machine, it should be the “for all” rejecting criterion (i.e., all computation paths of reject), but we can exchange the rejecting state and the accepting state, i.e., treat the rejecting state as the accepting state. Thus, in this sense, we can also call that the “for all” accepting criterion. for nondeterministic Turing machines, defined as follows:
Definition 2.4 (“for all” accepting criterion)
Let be a nondeterministic Turing machine. accepts the input if and only if for all computation paths of leading to the accepting state of , i.e.,
Remark 2.1
Obviously, for a polynomial-time nondeterministic Turing machine , accepts a language when using the “exists” accepting criterion. However, if using the “for all” accepting criterion (whose synonym is the “for all” rejecting criterion), then accepts the language which is the complement of .
With the Definition 2.4 above, we can similarly define the complexity class of languages decided by nondeterministic Turing machines within time in terms of the “for all” accepting criterion:
Definition 2.5
The set of languages decided by nondeterministic Turing machines within time in terms of the “for all” accepting criterion is denoted . Namely,
if and only if there is a nondeterministic Turing machine of time complexity such that
Thus,
We can define the complexity class in a equivalent way using polynomial-time deterministic Turing machines as verifiers and “for all” witnesses, given as follows:
Definition 2.6 ([AB09], Definition 2.20)
For every , we say that if there exists a polynomial and a deterministic polynomial-time Turing machine such that for every ,
With the complexity class above, we next define the set of all -machines as follows.
Definition 2.7
For any language , the nondeterministic Turing machine is called a time-bounded -machine for the language if there exists a polynomial (), such that for every ,
For convenience, we denote by the notation the set of all -machines, i.e., the set consists of all -machines:
Remark 2.2
In regard to the relation between the time complexity of -tape nondeterministic Turing machines and that of single-tape nondeterministic Turing machines, we quote the following useful lemma, extracted from [AHU74] (see Lemma 10.1 in [AHU74]), which plays important roles in the following context:
Lemma 2.1 (Lemma 10.1 in [AHU74])
If is accepted by a -tape nondeterministic time-bounded Turing machine, then is accepted by a single-tape nondeterministic time-bounded Turing machine.
Since -machines are also polynomial-time nondeterministic Turing machines but with the “for all” accepting criterion for the input, we can similarly prove the following technical lemma:
Lemma 2.2
If is accepted by a -tape co-nondeterministic time-bounded Turing machine, then is accepted by a single-tape co-nondeterministic time-bounded Turing machine, i.e., by a single-tape time-bounded -machine.
The following theorem about efficient simulation for universal nondeterministic Turing machines is useful in proving the main result in Section 4.
Lemma 2.3 ([AB09])
There exists a Turing machine such that for every , , where denotes the Turing machine represented by . Moreover, if halts on input within steps, then halts within 222In this paper, stands for . steps, where is a constant independent of and depending only on ’s alphabet size, number of tapes, and number of states.
Other background information and notions will be given along the way in proving our main results stated in Section 1.
3 All -machines Are Enumerable
In this section, our main goal is to establish an important theorem that all -machines are in fact enumerable, which is the prerequisite for the next steps.
Following Definition 2.2, a polynomial-time nondeterministic Turing machine can be represented by a tuple of , where is the nondeterministic Turing machine itself and is the unique minimal degree of some polynomial such that will halt within steps for any input of length . We call such a positive integer the order of .
By Lemma 2.1 stated above, we only need to discuss the single-tape nondeterministic Turing machines. Thus, in the following context, by a nondeterministic Turing machine we mean a single-tape nondeterministic Turing machine. Similarly, by Lemma 2.2, we only need to discuss the single-tape -machines. Thus, in the following context, by a -machine we mean a single-tape -machine.
To obtain our main result, we must enumerate all -machines and suppose that the set of all -machines is enumerable (which needs to be proved) so that we can refer to the -th -machine in the enumeration.
To show that the set of all -machines is enumerable, we need to study whether the whole set of polynomial-time nondeterministic Turing machines is enumerable, because by Definition 2.4 or by Remark 2.1, -machines are nondeterministic Turing machines running within polynomial time but the accepting criterion for the input is the “for all” criterion, so the set can be seen as a subset of all polynomial-time nondeterministic Turing machines but with the “for all” accepting criterion.
In what follows, we first use the method presented in [AHU74], p. 407, to encode a single-tape nondeterministic Turing machine into an integer.
Without loss of generality, we can make the following assumptions about the representation of a single-tape nondeterministic Turing machine with input alphabet because that will be all we need:
-
1.
The states are named
for some , with the initial state and the accepting state.
-
2.
The input alphabet is .
-
3.
The tape alphabet is
for some , where , , and .
-
4.
The next-move function is a list of quintuples of the form,
meaning that
and is the direction, , , or , if , or , respectively. We assume this quintuple is encoded by the string
-
5.
The Turing machine itself is encoded by concatenating in any order the codes for each of the quintuples in its next-move function. Additional ’s may be prefixed to the string if desired. The result will be some string of ’s and ’s, beginning with , which we can interpret as an integer.
Next, we encode the order of to be
so that the tuple should be the concatenation of the binary string representing itself followed by the order . Now the tuple is encoded as a binary string, which can be explained as an integer. In the following context, we often use the notation to denote the shortest binary string that represents the same time-bounded nondeterministic Turing machine . We should further remark that the shortest binary string that represents the same time-bounded nondeterministic Turing machine is by concatenating in any order the codes for each of the quintuples in its next-move function followed by the order without additional prefixed to it, from which it is clear that may not be unique (since the order of codes of the quintuples may be different), but the lengths of all different are the same.
By this encoding, any integer that cannot be decoded is assumed to represent the trivial Turing machine with an empty next-move function. Every single-tape polynomial-time nondeterministic Turing machine will appear infinitely often in the enumeration since, given a polynomial-time nondeterministic Turing machine, we may prefix ’s at will to find larger and larger integers representing the same set of . We denote such a polynomial-time nondeterministic Turing machine by , where is the integer representing the tuple , i.e., is the integer value of the binary string representing the tuple .
For convenience, let us introduce an additional notation to denote a set of all polynomial-time nondeterministic Turing machines, i.e., we denote the set of all polynomial-time nondeterministic Turing machines by the notation .333The reader should not confuse and . Note that is the set of all polynomial-time nondeterministic Turing machines; however, is the set of all languages where, for any language , there is some polynomial-time nondeterministic Turing machine such that accepts . In short, .
From the above arguments, we have reached the following important result:
Lemma 3.1
There exists a one-one correspondence between and the set of all polynomial-time nondeterministic Turing machines. In other words, all polynomial-time nondeterministic Turing machines are enumerable, and all languages in are enumerable (with languages appearing multiple times).
As mentioned earlier, by Definition 2.4 or further by Remark 2.1, the set can be seen as a subset of all polynomial-time nondeterministic Turing machines with the “for all” accepting criterion. Thus, Lemma 3.1 implies that we also establish the following important theorem:
Theorem 3.2
The set is enumerable (All -machines are enumerable). In other words, all languages in are enumerable (with languages appearing multiple times).
Remark 3.1
In fact, we can establish a one-one correspondence between the set of all polynomial-time nondeterministic Turing machines and the set of all polynomial-time co-nondeterministic Turing machines. Since the set of all polynomial-time nondeterministic Turing machines is enumerable, we then deduce that the set of all polynomial-time co-nondeterministic Turing machines is enumerable as well. To see so, by the “exists” accepting criterion we know that the elements in are polynomial-time nondeterministic Turing machines with the “exists” accepting criterion, and by Definition 2.7 or by Remark 2.1 we know that the elements in are polynomial-time nondeterministic Turing machines with the “for all” accepting criterion. We then let the function
be defined as follows: For , where with the “exists” accepting criterion:
Obviously,
is the inverse of , i.e., for any , where with the “for all” accepting criterion,
It is not hard to see that such a function is a one-one correspondence between and . Thus, by Lemma 3.1, Theorem 3.2 follows. In fact, these arguments also follow from Remark 2.1.
There are also one-one correspondence between and . For any language , let the function
be defined as:
where is the complement of , and the inverse of is defined as: For any ,
It is not difficult to check that the function is indeed a one-one correspondence between and . From the fact that the set of all polynomial-time nondeterministic Turing machines is enumerable, we know that the set is enumerable. This, together with the one-one correspondence , follows the fact that the set is enumerable too.
4 Lazy Diagonalization against All -machines
In computational complexity, the most basic approach to show hierarchy theorems uses simulation and diagonalization, because the simulation and diagonalization technique [Tur37, HS65] is a standard method to prove lower bounds on uniform computing models (i.e., the Turing machine model); see for example [AHU74]. These techniques work well in deterministic time and space measures; see e.g. [For00, FS07]. However, they do not work for nondeterministic time, which is not known to be closed under complement at present — one of the main issues discussed in this paper; hence it is unclear how to define a nondeterministic machine that “does the opposite”; see e.g. [FS07].
For nondeterministic time, we can still do a simulation but can no longer negate the answer directly. In this case, we apply the lazy diagonalization to show the nondeterministic time hierarchy theorem; see e.g. [AB09]. It is worth noting that Fortnow [For11] developed a much more elegant and simple style of diagonalization to show nondeterministic time hierarchy. Generally, lazy diagonalization [AB09, FS07] is a clever application of the standard diagonalization technique [Tur37, HS65, AHU74]. The basic strategy of lazy diagonalization is that for any , on input simulates on input , accepting if accepts and rejecting if rejects. When , on input simulates on input deterministically, accepting if rejects and rejecting if accepts. Since , has enough time to perform the trivial exponential-time deterministic simulation of on input , what we are doing is deferring the diagonalization step by linking and together on larger and larger inputs until has an input large enough that it can actually “do the opposite” deterministically; see e.g. [FS07] for details.
For our goal in this section to establish the lower bounds result, we first make the following remarks, which will be referred to in what follows:
Remark 4.1
Given , for any , it is not hard for a nondeterministic Turing machine to find the number such that is sandwiched between and within time for some , which is less than in time , (see e.g. standard textbook [AB09], page ; or see Remark 4.2 below), where the function
is defined as follows:
Our diagonalizing machine (defined below) will try to flip the answer of a -machine on some input in the set
with and , where is the shortest length of the binary string representing , and is the concatenation of finite words and .
Remark 4.2
Let denote the length of a number , i.e.,
Computing requires time in the RAM model, and so computing from takes time
We need to compute , , , until , which means that . Further note that is growing faster than geometrically. Thus, the total running time is
However, in the Turing machine model with a single tape, calculating takes time , and so computing from takes time
So the total running time is
Our main goal of this section is to establish the following lower bounds theorem with the technique of lazy diagonalization from [AB09]:
Theorem 4.1
There exists a language accepted by a universal nondeterministic Turing machine , but not by any -machines. Namely, there is a language such that
Proof. By Theorem 3.2, it is convenient that let
be an effective enumeration (denoted by ) of all -machines.
The key idea will be lazy diagonalization, so named (in accordance with the point of view presented in [AB09]) because the new machine is in no hurry to diagonalize and only ensures that it flips the answer of each -machine in only one string out of a sufficiently large (exponentially large) set of strings.
Let be a five-tape nondeterministic Turing machine which operates as follows on an input string of length of .
-
1.
On input , if , reject . If for some , then decodes encoded by , including to determine , the number of tape symbols used by ; , its number of states; , its order of polynomial
and , the shortest length of , i.e.,
The fifth tape of can be used as “scratch” memory to calculate , , and .
-
2.
Then lays off on its second tape blocks of
cells each, the blocks being separated by single cell holding a marker , i.e., there are
cells in all. Each tape symbol occurring in a cell of ’s tape will be encoded as a binary number in the corresponding block of the second tape of . Initially, places its input, in binary coded form, in the blocks of tape , filling the unused blocks with the code for the blank.
-
3.
On tape , sets up a block of
cells, initialized to all ’s. Tape is used as a counter to count up to
where .
-
4.
simulates , using tape , its input tape, to determine the moves of and using tape to simulate the tape of (i.e., will place ’s input to the tape ). The moves of are counted in binary in the block of tape , and tape is used to hold the states of . If the counter on tape overflows, halts with rejecting. The specified simulation is the following:
“On input (note that ), then uses the fifth tape as “scratch” memory to compute such that
where
(by Remark 4.1, to compute can be done within time
with which is less than time
so has sufficient time) and
-
(i).
If then simulates on input 444 first erases the old content of the second tape, then lays off on its second tape (i.e., ) blocks of cells each, the blocks being separated by single cell holding a marker , i.e., there are cells in all. Each tape symbol occurring in a cell of ’s tape will be encoded as a binary number in the corresponding block of the second tape of . (where ) using nondeterminism in time and output its answer, i.e., rejects the input if and only if rejects the input . 555 Because as a -machine with the “for all” accepting criterion, rejects an input if and only if as a sequence of choices made by such that on the computation path described by , where is the time bounded of .
-
(ii).
If , then simulates on input .666Similarly, first erases the second tape of , then lays off on its second tape (i.e., ) blocks of cells each, the blocks being separated by single cell holding a marker , i.e., there are cells in all. Each tape symbol occurring in a cell of ’s tape will be encoded as a binary number in the corresponding block of the second tape of . Then, rejects if accepts in time; accepts if rejects in time.777 Note again that as a -machine with the “for all” accepting criterion, accepts an input if and only if as the sequences of choices made by such that on the computation path described by , where is the time bounded of .”
-
(i).
Part (ii) requires going through all possible branches of on input , but that is fine since the input size is (see below).
The nondeterministic Turing machine constructed above is of time complexity, say , which is currently unknown. By Lemma 2.1, is equivalent to a single-tape nondeterministic time-bounded Turing machine, and it of course accepts some language .
We claim that . Indeed, suppose for the sake of contradiction that is decided by some -machine running in steps. Then by Lemma 2.2 and Remark 2.2, we may assume that is a single-tape time-bounded -machine. Let have states and tape symbols and the shortest length of is , i.e,
Since is represented by infinitely many strings and appears infinitely often in the enumeration , was to set the counter of tape to count up to , by Lemma 2.3 to simulate
steps of (Note that at this moment the input of is where , so the length of is ), requires to perform
steps, and since
So, there exists a such that for any ,
which implies that for a sufficiently long , say , and denoted by such is , we have
Thus, on input , has sufficient time to simulate according to the requirement (i) within item above.
Furthermore, to simulate steps of , by Lemma 2.3, requires to perform
steps, and because as , and for , we have
and
where is the binomial coefficient, these yield
which implies that there exits such that for all ,
Thus, for a sufficiently long such that , and denoted by such is , we have
Thereby, on input , also has sufficient time to simulate according to the requirement (ii) within the item above.
Overall, we can find large enough such that represents the same and on inputs of length
can be simulated in less than steps to ensure the conditions (i) and (ii) within item 4 above. This means that the two steps in the description of within the item ensure, respectively, that
By our assumption and agree on all inputs for in the semi-open interval
Together with (4.1), this implies that
But the last of the above, i.e.,
contradicting (4.2), which yields that there exists no -machine in the enumeration accepting the language . Thereby,
This finishes the proof.
Remark 4.3
To make it more clearer that how does the last part of the above proof of Theorem 4.1 work, we further make the following remarks. Suppose that
then by the assumption that and agree on all inputs for in the semi-open interval
it must have that . By the simulation requirement (i) within item ,
it must have that . Similarity, by the assumption again that and agree on all inputs for in the semi-open interval
it has that
Repeating this reasoning, we finally arrive at that
But by our diagonalization requirement (ii) within item ,
and our initial condition
we must have . Thus, we get a contradiction. See Fig. 1 below.

Similarity, we can show the second possibility. Assume that
then by the assumption that and agree on all inputs for in the semi-open interval
it must have that
By the simulation requirement (i) within item ,
this means that there exists no such that on the computation path described by , where . So, it must have that
Similarly, by the assumption that and agree on all inputs for in the semi-open interval
again, it has that . Repeating this deduction, we finally arrive at that . But by our diagonalization requirement (ii) within item ,
and our initial condition
we must have . Thus, we also get a contradiction. See Fig. 2 below.

To summarize, in both cases, we have shown the language can not be accepted by . Thus, the conclusion follows immediately.
We close this section by presenting the following corollary:
Corollary 4.2
, if If and are time-constructible functions and .
Proof. Similar to the proof of Theorem 4.1.
5 Proving That
The previous section aims to establish a lower bounds result, i.e., to show that there is a language not in . We now turn our attention to the upper bounds, i.e., to show that the language accepted by the universal nondeterministic Turing machine is in fact in .
As a matter of fact, the technique to establish the desired upper bounds theorem is essentially the same as the one developed in the author’s recent work [Lin21], because the discussed issue to establish desired upper bounds is essentially the same as [Lin21]. But the work to establish such upper bounds for the first time is very difficult, and to see why; see e.g. [Lin21].
Now, for our goals, we are going to show firstly that the universal nondeterministic Turing machine runs within time for any :
Theorem 5.1
The universal nondeterministic Turing machine constructed in proof of Theorem 4.1 runs within time for any .
Proof. The simplest way to show the theorem is to prove that for any input to , there is a corresponding positive integer such that runs at most
steps, which can be done as follows.
On the one hand, if the input encodes a time-bounded -machine, then turns itself off mandatorily within
steps by the construction, so the corresponding integer is in this case (i.e., ). This holds true for all polynomial-time -machines as input with to be the order of that corresponding -machine.
But on the other hand, if the input does not encode some polynomial-time -machine, then the input is rejected, which can be done within time . In both cases we have show that for any input to , there is a corresponding positive integer such that runs at most steps. So is a nondeterministic
time-bounded Turing machine for any . Thus, is a nondeterministic
time-bounded Turing machine for any . By Lemma 2.1, there is a single-tape nondeterministic Turing machine equivalent to and runs within time
for any .
The following theorem is our main result of this section describing the upper bounds result:
Theorem 5.2
The language is in where is accepted by the universal nondeterministic Turing machine which runs within time for any .
Proof. Let us first define the family of languages
as follows:
language accepted by running within time for fixed . | |||
That is, turns itself off mandatorily when its moves made by itself | |||
during the computation exceed steps, |
which technically can be done by adding a new tape to as a counter to count up to
for a fixed , meaning that turns itself off when the counter of tape exceeds
or the counter of the newly added tape exceeds
Obviously, for each , is a truncation of .
Then by the construction of , namely, for an arbitrary input to , there is a corresponding integer such that runs at most
steps (in other words, runs at most steps for any where is the length of input, see Theorem 5.1 above), we have
Furthermore,
since for any word accepted by within steps, it surely can be accepted by within steps, i.e.,
This gives that for any fixed ,
Note further that for any fixed , is the language accepted by the nondeterministic Turing machine within time , i.e., at most steps, we thus obtain
Now, (5.1), together with (5.2) and (5.3) easily yields
as desired.
Proof of Theorem 1. It is clear that Theorem 1 follows immediately from Theorem 4.1 and Theorem 5.2. This completes the proof.
By the proof of Theorem 5.2, we also have the following corollary:
Corollary 5.3
For each fixed , it holds that
where .
Proof. It follows clearly from the relations (5.2) and (5.3).
We close this section by making the following remark:
Remark 5.1
In fact, if the notion of the universal co-nondeterministic Turing machine exists, then we can apply lazy diagonalization against polynomial-time nondeterministic Turing machines via a universal co-nondeterministic Turing machine to obtain a language but in a similar way to the above proofs (i.e., proof of Theorem 4.1 and proof of Theorem 5.2). But we do not know whether such a concept of the universal co-nondeterministic Turing machines exists or not. Simultaneously, we also suspect that a universal co-nondeterministic Turing machine is just a universal nondeterministic Turing machine with the “for all” accepting criterion.
6 Breaking the Relativization Barrier
In , Baker, Gill, and Solovay [BGS75] presented a proof of that:
Baker, Gill, and Solovay [BGS75] suggested that their results imply that ordinary diagonalization techniques are not capable of proving . Note that the above result also implies that for the same oracle ,
because if then . Further by
we have the conclusion that . Thereby, Baker, Gill, and Solovay’s above result also indirectly suggests that ordinary diagonalization techniques (lazy-diagonalization) are not capable of proving . Now, let us explore what is behind this kind of mystery.
The computation model we use in this Section is the query machines, or the oracle Turing machines, which is an extension of the multi-tape Turing machine, i.e., Turing machines that are given access to a black box or “oracle” that can magically solve the decision problem for some language
The machine has a special oracle tape on which it can write a string
and in one step gets an answer to a query of the form
which can be repeated arbitrarily often with different queries. If is a difficult language (say, one that cannot be decided in polynomial time, or is even undecidable), then this oracle gives the Turing machine additional power. We first quote its formal definition as follows:
Definition 6.1 (cf. the notion of deterministic oracle Turing machines in [AB09])
A nondeterministic oracle Turing machine is a nondeterministic Turing machine that has a special read-write tape we call ’s oracle tape and three special states , , . To execute , we specify in addition to the input a language that is used as the oracle for . Whenever during the execution enters the state , the machine moves into the state if and if , where denotes the contents of the special oracle tape. Note that, regardless of the choice of , a membership query to counts only as single computation step. If is an oracle machine, a language, and , then we denote the output of on input and with oracle by . An input is said to be accepted by a nondeterministic oracle Turing machine if there is a computation path of on input leading to the accepting state of .
In a similar way to define the notion of -machines (Definition 2.7) in Section 2, we can define the notion of -machines as follows.
Definition 6.2
A co-nondeterministic oracle Turing machine is a nondeterministic Turing machine that has a special read-write tape we call ’s oracle tape and three special states , , . To execute , we specify in addition to the input a language that is used as the oracle for . Whenever during the execution enters the state , the machine moves into the state if and if , where denotes the contents of the special oracle tape. Note that, regardless of the choice of , a membership query to counts only as single computation step. If is an oracle machine, a language, and , then we denote the output of on input and with oracle by . An input is said to be accepted by a co-nondeterministic oracle Turing machine if all computation paths of on input lead to the accepting state of .
If for every input of length , all computations of on input end in less than or equal to steps, then is said to be a time-bounded nondeterministic (co-nondeterministic) oracle Turing machine with oracle , or said to be of time complexity . The family of languages of nondeterministic time complexity with oracle is denoted by
the family of languages of co-nondeterministic time complexity with oracle is denoted by
The notation and are defined, respectively, to be the class of languages:
and
Let us denote the set of all polynomial-time co-nondeterministic oracle Turing machines with oracle by the notation .
To break the Relativization Barrier or to explore the mystery behind the implications of Baker, Gill, and Solovay’s result [BGS75], let us further make the following rational assumptions:
-
1.
the polynomial-time co-nondeterministic oracle Turing machine can be encoded to a string over ;
-
2.
there are universal nondeterministic oracle Turing machines that can simulate any other co-nondeterministic oracle Turing machine, which is similar to the behavior of universal nondeterministic Turing machine in Section 4;
-
3.
the simulation can be done within in time
where is the time complexity of the simulated co-nondeterministic oracle Turing machine.
Then, we will prove the following interesting theorems:
Theorem 6.1
Assume that the set of polynomial-time co-nondeterministic oracle Turing machines with oracle is enumerable, and further we have the above rational assumptions. Then, there exists a language accepted by a universal nondeterministic Turing machine with oracle , but not by any -machines with oracle . Namely, there is a language such that
Proof. Since the set of polynomial-time co-nondeterministic oracle Turing machines with oracle is enumerable, and apart from it, we further have the above rational assumptions, i.e., the following: (1) the polynomial-time co-nondeterministic oracle Turing machine can be encoded to a string over ; (2) there are universal nondeterministic oracle Turing machines that can simulate any other co-nondeterministic oracle Turing machine, which is similar to the behavior of universal nondeterministic Turing machine in Section 4; (3) the simulation can be done within in time
where is the time complexity of the simulated co-nondeterministic oracle Turing machine. Then, the remainder is to show that there exists a universal nondeterministic Turing machine with oracle accepting the language
in a similar way to that of Theorem 4.1.
Theorem 6.2
The universal nondeterministic Turing machine with oracle constructed in the proof of Theorem 6.1 runs within time for any . Further, the language accepted by the nondeterministic Turing machine with oracle is in fact in .
Theorem 6.3
Assume that the set of polynomial-time co-nondeterministic oracle Turing machines with oracle is enumerable. Further under the above rational assumptions, then
But now we have the fact that , so our assumption in Theorem 6.3 is not true, i.e., the set of all polynomial-time co-nondeterministic oracle Turing machines with oracle is not enumerable. So the truths behind this kind of mystery are the following theorem:
Theorem 6.4
Under some rational assumptions (i.e., the above rational assumptions), and if , then the set of all polynomial-time co-nondeterministic oracle Turing machine with oracle is not enumerable. Thereby, the ordinary diagonalization techniques (lazy-diagonalization) will generally not apply to the relativized versions of the versus problem.
7 Rich Structure of
In complexity theory, or computational complexity, problems that are in the complexity class but are neither in the class nor -complete are called -intermediate, and the class of such problems is called . The well-known Ladner’s theorem, proved in 1975 by Ladner [Lad75], is a result asserting that if the complexity classes and are different, then the class is not empty. Namely, the complexity class contains problems that are neither in nor -complete. We call such a nice theorem the rich structure of the complexity class .
Our main goal in this section is to establish a result saying that similar to the complexity class having a rich structure, the complexity class also has intermediate languages that are not in nor -complete. To do so in a simple way, we need to quote the following useful result whose proof can be found in [Lad75]:
Lemma 7.1 ([Lad75])
(Suppose that ). There is a language such that is not in and is not -complete.
Next, we need to show that the language of all tautologies is -complete:
Lemma 7.2
Let . Then the language is -complete.
Proof. It is sufficient to show that for every language , where is polynomial-time many-one reduction [Kar72]. We can modify the Cook-Levin reduction [Coo71, Lev73] from to (recall that all nondeterministic Turing machines are with input alphabet because that will be all we need, see Section 3). So for any input the Cook-Levin reduction produces a formula that is satisfiable if and only if . In other words, the formula is in if and only if , which completes the proof.
Now, we are naturally at a right point to give the proof of Theorem 6 as follows:
Proof of Theorem 6. Let , where the language is the same as in Lemma 7.1. Such a language indeed exists by Corollary 3, which states that .
Then , and we will show that is not in nor -complete. To see so, suppose that , then since , we have , a contradiction to the fact that by Lemma 7.1. Further assume that is -complete, then . By Lemma 7.2, is -complete, it is also a contradiction to the fact that is not -complete. Thereby, to summarize from the above two cases, we have shown that is -intermediate. This finishes the proof.
8 Frege Systems
The focal point of this section is Frege systems, which are very strong proof systems in propositional setting [CR79], based on axiom schemes and rules such as modus ponens. While Frege systems operate with Boolean formulas as lines, the extended Frege system EF works with Boolean circuits; see e.g. [Jer05]. Furthermore, showing lower bounds on Frege or even extended Frege systems constitutes a major open problem in proof complexity; see e.g. [BP01].
Informally, a proof system for a language is a definition of what is considered to be a proof that ; see e.g. [CR79]. The key features of a proof system are that it is sound, i.e., only formulas in have proofs, and complete, i.e., all formulas in have proofs, and that there is an algorithm with running time polynomial in to check whether is a proof that [BH19].
Let be the set of functions , where and are any finite alphabets, such that can be computed by a deterministic Turing machine in time bounded by a polynomial in the length of the input. Then, we have the following definition:
Definition 8.1 ([CR79])
If , a proof system for is a function for some alphabet and such that is onto. We say that the proof system is polynomially bounded iff there is a polynomial such that for all there is a such that and , where denotes the length of a string .
If , then is a proof of , and is a short proof of , if in addition, . Thus a proof system is polynomially bounded iff there is a bounding polynomial with respect to which every has a short proof.
In particular, Frege proof systems are proof systems for propositional logic. As a matter of fact, Frege systems are the usual “textbook” proof systems for propositional logic based on axioms and rules; see e.g. [CR79]. A Frege system composed of a finite set of axiom schemes and a finite number of rules is a possible axiom scheme. Furthermore, a Frege proof is a sequence of formulas where each formula is either a substitution instance of an axiom or can be inferred from previous formulas by a valid inference rule. At the same time, Frege systems are required to be sound and complete. However, the exact choice of the axiom schemes and rules does not matter, as any to Frege system are polynomial equivalent; see e.g.[CR79] or see Theorem 8.1 below. Thus, we can assume without loss of generality that modus ponens (see e.g. Remark 8.1 below) is the only rule of inference.
Definition 8.2 ([BG98])
A Frege proof system is an inference system for propositional logic based on
-
(1)
a language of well-formed formulas obtained from a numerable set of propositional variables and any finite propositionally complete set of connectives;
-
(2)
a finite set of axiom schemes;
-
(3)
and the rule of Modus Ponens
A proof of the formula in a Frege system is a sequence of formulas such that is and every is either an instance of an axiom scheme or it is obtained by the application of the Modus Ponens from premises and with .
Remark 8.1
The well-known inference rule of Modus Ponens is its only rule of inference:
We give a more intuitive explanation about the inference rule of Modus Ponens: For example, in the proof of Theorem 5.2 in Section 5 where we prove that , is the following:
represents the proposition . Namely,
and one of the main deductions of Theorem 5.2, besides showing that is true, is to show that implies .
The axiom schemes of a Frege proof system is the following (see e.g. [Bus99]):
More generally, a Frege system is specified by any finite complete set of Boolean connectives and finite set of axiom schemes and rule schemes, provided it is implicationally sound and implicationally complete.
Definition 8.3
The length of a Frege proof is the number of symbols in the proof. The length of a formula is the number of symbols in .
Now, we introduce the notion of -simulation between two proof systems.
Definition 8.4 ([CR79])
If and are proof system for , then -simulates provided there is a function such that , and for all .
The notion of an abstract propositional proof system is given as follows:
Definition 8.5 ([Bus99])
An abstract propositional proof system is a polynomial-time computable function such that ; i.e., the range of is the set of all Boolean tautologies. An -proof of a formula is a string such that .
8.1 Lower Bounds for Frege Systems
Let and be two arbitrary Frege systems, the following theorem indicates that these standard proof systems for the propositional calculus are about equally powerful:
Theorem 8.1 ([Rec76, CR79])
Any two Frege systems -simulate each other. Hence one Frege system is polynomially bounded if and only if all Frege systems are.
The following theorem gives a necessary and sufficient condition for the complexity class being closed under complementation. For completeness, the proof is also quoted from [CR79]:
Theorem 8.2 ([CR79])
is closed under complementation if and only if is in .
Proof. The “if” part. Suppose that the set of tautologies is in . Then every set in is reducible to the complement of the tautologies [Coo71], i.e., there is a function that can be computed by a deterministic Turing machine in time bounded by a polynomial in the length of the input such that for all strings , iff is not a tautology. Hence a nondeterministic procedure for accepting the complement of is: on input , compute , and accept if is a tautology, using the nondeterministic algorithm for tautologies assumed above. Hence the complement of is in .
The “only if” part. Suppose that is closed under complementation. The the complement of the set of tautologies is in , since to verify that a formula is not a tautology one can guess at a truth assignment and verify that it falsifies the formula.
Since nondeterministic Turing machines can simulate Frege proof (see e.g. [Bus99]) and by Theorem 8.1 we know that any two Frege systems -simulate each others, we then can present the proof of Theorem 8 as follows:
Proof of Theorem 8. We show the theorem by contradiction. Suppose that there is a polynomial such that for all , there exists a proof of of length at most (see Definition 8.3), then a nondeterministic Turing machine on input can guess of length at most and checking whether is correct in deterministic polynomial time, and we know that the set of tautologies is -complete, which, together with Theorem 8.2, implies
a contradiction to Theorem 1. This finishes the proof.
Remark 8.2
Initially, propositional proof complexity has been primarily concerned with proving lower bounds (even conditional) for the length of proofs in propositional proof systems, which is extremely interesting and well justified in its own right, with the ultimate goal of settling whether (see e.g. [CR79]). In fact, as mentioned in Section 1, such research directions are called Cook’s Program for separating and (see e.g. [Bus12]): Prove superpolynomial lower bounds for proof lengths in stronger and stronger propositional proof systems until they are established for all abstract proof systems. However, our approach is “to do the opposite”, that is, we first prove that , and then apply it to prove the lower bounds on the length of proofs in Frege proof systems.
9 Conclusions
In conclusion, we have shown that
thus resolving a very important and long-standing conjecture in computational complexity theory. Specifically, we first showed that the set of all -machines is enumerable. Then, we constructed a universal nondeterministic Turing machine to simulate all -machines and to do the lazy-diagonalization operation to create a language which is not in . Next, we applied the technique developed in the author’s recent work [Lin21] to show that the language accepted by the universal nondeterministic Turing machine is in fact in , thus establishing the desired main conclusion of this paper.
We stress again that one of the important techniques used to prove lower bounds (i.e., Theorem 4.1) is the so-called lazy-diagonalization [FS07, AB09], and the essential technique used to show the upper bounds result, i.e., the result that the language , is essentially the same as the one developed in the author’s recent work [Lin21].
We have broken the so-called Relativization Barrier. Since by the result of [BGS75], it is not hard to show that there exists some oracle such that
which suggests that (in the light of the point of view given in [BGS75]) ordinary diagonalization techniques (lazy-diagonalization) are not capable of proving ; see [BGS75]. We explored the mystery behind this phenomenon by proving that if and under some rational assumptions, then the set of all -machines with oracle is in fact not enumerable, thus showing that the ordinary diagonalization techniques (i.e., lazy-diagonalization) will generally not apply to the relativized versions of the versus problem.
As a by-product, the important result that which is shown in [Lin21], also follows. The above outcome can be seen as, in a sense, the direct or indirect consequence of the proof procedures of
Note again that by Corollary 3 we in fact have shown because if , then , a contradiction to Corollary 3. As a matter of fact and interestingly, we have also shown by Theorem 6 that there exists a -intermediate language in , which is not in nor -complete, i.e., the complexity class also has a rich structure. We have also obtained other interesting consequences, such as Corollary 4 (i.e., ), Corollary 5 (i.e., ) and Corollary 7, which states that there is no super proof system.
As mentioned in the introduction section, the problem of versus closely connects with the field of proof complexity. In particular, we have shown that no Frege proof systems are polynomially bounded (Theorem 8), thus answering an open problem in [Pud08]. As pointed out in Remark 8.2, we do the opposite direction of Cook’s program. That is, we first prove that , and then apply it to prove the lower bounds on the length of proofs in Frege proof systems. Note that one of the key points to show the Theorem 8 is that nondeterministic Turing machines can simulate Frege proofs (see [Bus99]). But we do not know whether the nondeterministic Turing machines can simulate Extended Frege proofs. For more details about Extended Frege Systems, please consult the references [CR79, CN10, Kra95].
References
- [A1] Anonymous authors. Computational complexity theory. Wikipedia, the free encyclopedia (Auguest, 2024). Available at /wiki/Computational complexity theory.
- [A2] Anonymous authors. NP (complexity). Wikipedia, the free encyclopedia (Auguest, 2024). Available at /wiki/NP (complexity).
- [A3] Anonymous authors. Co-NP. Wikipedia, the free encyclopedia (Auguest, 2024). Available at /wiki/Co-NP.
- [A4] Anonymous authors. Proof theory. Wikipedia, the free encyclopedia (Auguest, 2024). Available at /wiki/Proof theory.
- [A5] Anonymous authors. Proof complexity. Wikipedia, the free encyclopedia (Auguest, 2024). Available at /wiki/Proof complexity.
- [AB09] Sanjeev Arora and Boaz Barak. Computational Complexity: A Modern Approach. Cambridge University Press, 2009.
- [AHU74] Alfred V. Aho, John E. Hopcroft and Jeffrey D. Ullman. The Design and Analysis of Computer Algorithms. Addison–Wesley Publishing Company, Reading, California, 1974.
- [BGS75] Theodore Baker, John Gill, and Robert Solovay. Relativizations of The Question. SIAM Journal on Computing, Vol. 4, No. 4, December 1975, pp. 431–442. https://doi.org/10.1137/0204037.
- [BG98] M. L. Bonet and N. Galesi. Linear Lower Bounds and Simulations in Frege Systems with Substitutions. In: M. Nielsen, W. Thomas (eds) Computer Science Logic. CSL 1997. Lecture Notes in Computer Science, vol. 1414, Springer, Berlin, Heidelberg. pp. 115–128, 1998.
- [BH19] O. Beyersdorff and L. Hinde. Characterising tree-like Frege proofs for QBF. Information and Computation 268 (2019) 104429. https://doi.org/10.1016/j.ic.2019.05.002.
- [BP01] P. Beame and T. Pitassi. Propositional proof complexity: past, present, and future. In: G. Paun, G. Rozenberg, A. Salomaa (Eds.), Current Trends in Theoretical Computer Science: Entering the st Century, World Scientific Publishing, 2001, pp. 42–70.
- [Bus99] Samuel R. Buss. Propositional Proof Complexity: An Introduction. In: Computational Logic, edited by U. Berger and H. Schwichtenberg, Springer–Verlag, Berlin, 1999, pp. 127–178.
- [Bus12] Samuel R. Buss. Towards via proof complexity and search. Annals of Pure and Applied Logic 163 (2012) 906–917. https://doi.org/10.1016/j.apal.2011.09.009.
- [Coo71] Stephen A. Cook. The complexity of theorem-proving procedures. In: Proceedings of the Third Annual ACM Symposium on Theory of Computing, May 1971, Pages 151–158. https://doi.org/10.1145/800157.805047.
- [CR74] Stephen A. Cook and Robert A. Reckhow. On the lengths of proof in propositional calculus, preliminary version. In: Proceedings of the th Annual ACM Symposium on the Theory of Computing, 1974, pp. 135–148. https://doi.org/10.1145/800119.803893.
- [CR79] Stephen A. Cook and Robert A. Reckhow. The Relative Efficiency of Propositional Proof Systems. The Journal of Symbolic Logic, Volume 44, Number 1, March 1979, pp. 36–50. https://doi.org/10.2307/2273702.
- [Coo00] Stephen A. Cook. The P versus NP problem. April, 2000. Available at PvsNP.ps.
- [CN10] Stephen A. Cook and Phuong Nguyen. Logical Foundations of Proof Complexity. Association for Symbolic Logic, Cambridge University Press, 2010.
- [For00] Lance Fortnow. Diagonalization. Bulletin of the European Association for Theoretical Computer Science 71: 102–113 (2000).
- [For11] Lance Fortnow. A New Proof of the Nondeterministic Time Hierarchy. Computational Complexity Blog, 2011. Available at this https URL.
- [FS07] Lance Fortnow and Rahul Santhanam. Time Hierarchies: A Survey. Electronic Colloquium on Computational Complexity, Report No. 4 (2007).
- [FSTW21] Michael A. Forbes, Amir Shpilka, Iddo Tzameret and Avi Wigderson. Proof Complexity Lower Bounds from Algebraic Circuit Complexity. Theory of Computing, Volume 17 (10), 2021, pp. 1–88. https://doi.org/10.4086/toc.2021.v017a010.
- [Jer05] E. Jerabek. Weak Pigeonhole Principle, and Randomized Computation. Ph.D. thesis, Faculty of Mathematics and Physics, Charles University, Prague, 2005.
- [HS65] J. Hartmanis and R. Stearns. On the computational complexity of algorithms. Transactions of the American Mathematical Society, 117: 285–306, 1965.
- [HCCRR93] Juris Hartmanis, Richard Chang, Suresh Chari, Desh Ranjan, and Pankaj Rohatgi. Relativization: a Revisionistic Retrospective. Current Trends in Theoretical Computer Science, 1993, pp. 537–547. https://doi.org/10.1142/9789812794499_0040.
- [Hop84] John. E. Hopcroft. Turing machines. Scientific American, May 1984, pp. 86–98.
- [Kar72] Richard M. Karp. Reducibility among combinatorial problems. In: Miller R. E., Thatcher J. W., Bohlinger J. D. (eds) Complexity of Computer Computations., Plenum Press, New York, 1972, 85–103. https://doi.org/10.1007/978-1-4684-2001-2_9.
- [Kra95] J. Krajicek. Bounded Arithmetic, Propositional Logic, and Complexity Theory. Cambridge, 1995.
- [Kra19] J. Krajicek. Proof complexity. Encyclopedia of Mathematics and Its Applications, Vol. 170, Cambridge University Press, 2019.
- [Lad75] Richard E. Ladner. On the Structure of Polynomial Time Reducibility. Journal of the ACM, Vol. 22, No. 1, January 1975, pp. 155-171. https://doi.org/10.1145/321864.321877.
- [Lev73] Leonid A. Levin. Universal search problems (in Russian). Problemy Peredachi Informatsii 9 (1973), 265–266. English translation in B. A. Trakhtenbrot, A survey of Russian approaches to Perebor (brute-force search) algorithms, Annals of the History of Computing 6 (1984), 384–400.
- [Lin21] Tianrong Lin. Diagonalization of Polynomial–Time Deterministic Turing Machines via Nondeterministic Turing Machines. CoRR abs/2110.06211 (2021). Available at /abs/2110.06211
- [MS72] Albert R. Meyer and Larry J. Stockmeyer. The equivalence problem for regular expressions with squaring requires exponential space. In: IEEE 13th Annual Symposium on Switching and Automata Theory (swat 1972), pp. 125–129. https://doi.org/10.1109/SWAT.1972.29.
- [Pap94] Christos H. Papadimitriou. Computational Complexity. Addison–Wesley, 1994.
- [Pud08] Pavel Pudlák. Twelve Problems in Proof Complexity. In: E.A. Hirsch et al. (Eds.): CSR 2008. Lecture Notes in Computer Science, vol. 5010, Springer, Berlin, Heidelberg, pp. 13–27, 2008.
- [Raz03] Alexander A. Razborov. Propositional Proof Complexity. Journal of the ACM, Vol. 50, No. 1, January 2003, pp. 80–82. https://doi.org/10.1145/602382.602406.
- [Raz04] Alexander A. Razborov. Feasible proofs and computations: partnership and fusion. In: J. Diaz, J. Karhumaki, A. Lepisto, D. Sannella (eds) Automata, Langugages and Programming. ICALP 2004. Lecture Notes in Computer Science, vol. 3142, Springer, Berlin, Heidelberg, 2004, pp. 8–14.
- [Raz15] Alexander A. Razborov. Pseudorandom generators hard for -DNF resolution and polynomial calculus resolution. Annals of Mathematics 181 (2015), 415–472. https://doi.org/10.4007/annals.2015.181.2.1
- [Rec76] Robert A. Reckhow. On the lengths of proofs in the propositional calculus. Ph.D. Thesis, Department of Computer Science, University of Toronto, 1976.
- [Sto77] Larry J. Stockmeyer. The polynomial-time hierarchy. Theoretical Computer Science 3 (1977) 1–22. https://doi.org/10.1016/0304-3975(76)90061-X.
- [Tur37] Alan M. Turing. On computable numbers with an application to the entscheidnungsproblem. Proceedings of the London Mathematical Society, Volume s2-42, Issue 1, 1937, Pages 230–265. https://doi.org/10.1016/0066-4138(60)90045-8
- [Wig07] Avi Wigderson. , and Mathematics–a computational complexity perspective. Proceedings of the ICM 06, vol. 1, EMS Publishing House, Zurich, pp. 665–712, 2007.
Tianrong Lin |
National Hakka University, China |