This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Closer Look at Some Recent Proof Compression-Related Claimsthanks: Supported in part by NSF grant CCF-2006496 .

Michael C. Chavrimootoo Department of Computer Science
University of Rochester
Rochester, NY 14627, USA
Ethan Ferland Department of Computer Science
University of Rochester
Rochester, NY 14627, USA
Erin Gibson Department of Computer Science
University of Rochester
Rochester, NY 14627, USA
Ashley H. Wilson Department of Computer Science
University of Rochester
Rochester, NY 14627, USA
(December 23, 2022)
Abstract

Gordeev and Haeusler [GH19] claim that each tautology ρ\rho of minimal propositional logic can be proved with a natural deduction of size polynomial in |ρ||\rho|. This builds on work from Hudelmaier [Hud93] that found a similar result for intuitionistic propositional logic, but for which only the height of the proof was polynomially bounded, not the overall size. They arrive at this result by transforming a proof in Hudelmaier’s sequent calculus into an equivalent tree-like proof in Prawitz’s system of natural deduction, and then compressing the tree-like proof into an equivalent DAG-like proof in such a way that a polynomial bound on the height and foundation implies a polynomial bound on the overall size. Our paper, however, observes that this construction was performed only on minimal implicational logic, which we show to be weaker than the minimal propositional logic for which they claim the result (see Section 4.2). Simply extending the logic systems used to cover minimal propositional logic would not be sufficient to recover the results of the paper, as it would entirely disrupt proofs of a number of the theorems that are critical to proving the main result. Relying heavily on their aforementioned work, Gordeev and Haeusler [GH20] claim to establish NP=PSPACE{\rm NP}={\rm PSPACE}. The argument centrally depends on the polynomial bound on proof size in minimal propositional logic. Since we show that that bound has not been correctly established by them, their purported proof does not correctly establish NP=PSPACE{\rm NP}={\rm PSPACE}.

1 Introduction

This paper will discover and analyze flaws in two journal papers written by Gordeev and Haeusler [GH19, GH20]. In particular, the two works in question are “Proof Compression and NP versus PSPACE,” which was published in Studia Logica in 2019, and its follow-up, “Proof Compression and NP versus PSPACE II,” which was published in the Bulletin of the Section of Logic in 2020.

In their first paper, Gordeev and Haeusler [GH19] consider Gentzen-style Sequent Calculus, denoted as SC, and Prawitz’s Natural Deduction, denoted as ND, in order to obtain the result that a formula ρ\rho is valid in minimal propositional logic if and only if there exists a DAG-like proof of ρ\rho whose size is polynomial in |ρ|\lvert\rho\rvert. They arrive at this conclusion by first attempting to create an equivalent formal definition of minimal propositional logic, called LM\text{LM}_{\mathord{\rightarrow}}, which they attribute to the work of Hudelmaier [Hud93]. Then, using a version of natural deduction, NM\text{NM}_{\mathord{\rightarrow}}, which is derived from Prawitz’s ND [Pra06], Gordeev and Haeusler [GH19] argue that any valid formula ρ\rho has an LM\text{LM}_{\mathord{\rightarrow}} proof of length polynomial in the height and foundation of ρ\rho. They go on to show that this LM\text{LM}_{\mathord{\rightarrow}} proof can be converted into an equivalent tree-like proof in NM\text{NM}_{\mathord{\rightarrow}} and can ultimately be horizontally compressed into a DAG-like proof with size polynomially bounded in |ρ|\lvert\rho\rvert. The critical fault in this argument comes from Gordeev and Haeusler’s [GH19] incorrect definition of minimal propositional logic. The purpose of the first part of this paper is to bring to light the issues that arise from the faulty definition. Further, in Section 4.4, this paper will address how possible extensions of the incorrect definition would still be insufficient to salvage Gordeev and Haeusler’s [GH19] result.

Gordeev and Haeusler’s second paper [GH20] is heavily reliant on the first paper. This subsequent paper calls attention to the problem of deciding whether a formula is provable in minimal logic, a problem that is PSPACE{\rm PSPACE}-complete. Using this classification along with the result of their first paper, Gordeev and Haeusler claim to establish that NP=PSPACE{\rm NP}={\rm PSPACE}. Once again, there is a misconception about the equivalence of logical systems that causes significant damage to the argument. The second part of this critique will reveal the incorrect assumptions found in Gordeev and Haeusler’s second paper, which are mostly attributable to the errors of the first paper.

Note that this critique is largely focused on the first paper of Gordeev and Haeusler, but the importance of the critique is better appreciated in light of the latter paper. The result of NP=PSPACE{\rm NP}={\rm PSPACE} would be highly influential in the field of complexity theory, having hierarchical collapsing repercussions such as and NP=coNP{\rm NP}={\rm coNP}, P#P=NP{\rm P}^{\#{\rm P}}={\rm NP}, and NPNL{\rm NP}\neq{\rm NL}. Thus, the considerations of this paper will mainly cover Gordeev and Haeusler’s first work but hold the intentions of thwarting the propagation of undue results in the field of complexity theory.

2 Preliminaries

2.1 Natural Deduction

Natural deduction is a way of formalizing languages of logic that involves creating proofs (also referred to as deductions) of logical sentences by application of a set of inference rules. These inference rules represent the very natural notion of deductive reasoning, where if some set of premises are known to be true, we can infer some conclusion. The rules are defined using what can be thought of as templates of logical sentences. Instead of using individual propositional variables, the templates use variables that represent entire sentences of their own. For example, the template ABA\lor B could represent infinitely many sentences including pqp\lor q, (pq)(rq)(p\land q)\lor(r\rightarrow q), etc. The inference rules are defined with the following notation

α1α2αnβ,\frac{\alpha_{1}\quad\alpha_{2}\quad\cdots\quad\alpha_{n}}{\beta},

where β\beta can be immediately derived from the αi\alpha_{i}s, i.e., if every αi\alpha_{i} is known to be true, then β\beta is also known to be true. Application of these rules to derive some sentence ρ\rho results in a tree for which ρ\rho is the root, and each parent is related to its children by a particular inference rule. Each leaf of the tree is an assumption. Anything can be assumed, but the derived sentence only holds under the assumptions (unless they are discharged—see below).

For example, suppose we have a system of natural deduction that has the rules ABAB\frac{A\ B}{A\land B} and ABA\frac{A\land B}{A}. If we wanted to derive (pq)r(p\land q)\land r from the assumptions pp, qq, and rsr\land s, we could do so with the following derivation:

pqpqrsr(pq)r.\dfrac{\dfrac{p\quad q}{p\land q}\quad\dfrac{r\land s}{r}}{(p\land q)\land r}.

With the system as is, sentences can only be derived under a nonempty set of assumptions, so there are no tautologies (sentences that are always true; they can be derived without assumptions). This is where the notion of discharging assumptions comes in. Inference rules can be defined so that their application discharges specific assumptions, meaning that those assumptions are accounted for and are no longer assumed. This is indicated in the inference rule by writing the assumption being discharged in parentheses above the α\alpha for which it applies. We can expand the formalization of the notation for inference rules to

(γ1)α1(γk)αkαk+1αnβ,\frac{\genfrac{}{}{0.0pt}{0}{(\gamma_{1})}{\alpha_{1}}\genfrac{}{}{0.0pt}{0}{\ldots}{\ldots}\genfrac{}{}{0.0pt}{0}{(\gamma_{k})}{\alpha_{k}}\quad\genfrac{}{}{0.0pt}{0}{}{\alpha_{k+1}}\genfrac{}{}{0.0pt}{0}{}{\ldots}\genfrac{}{}{0.0pt}{0}{}{\alpha_{n}}}{\beta},

where, for each i{1,,k}i\in\{1,\ldots,k\}, whenever γi\gamma_{i} is an assumption of the subderivation leading to αi\alpha_{i}, γi\gamma_{i} is discharged. In the derivation, every rule application that discharges an assumption is labeled with a number, and that number is written above every instance of the assumption it discharges.

Returning to the earlier example, suppose we add this new inference rule that allows us to discharge any assumption:

(A)BAB.\dfrac{\genfrac{}{}{0.0pt}{0}{(A)}{B}}{A\rightarrow B}.

We can now prove the tautology (rs)(q(p((pq)r)))(r\land s)\rightarrow(q\rightarrow(p\rightarrow((p\land q)\land r))) by extending the derivation from above and discharging all the assumptions.

p(1)q(2)pqrs(3)r(pq)rp((pq)r)(1)q(p((pq)r))(2)(rs)(q(p((pq)r)))(3).\dfrac{\dfrac{\dfrac{\dfrac{\dfrac{\stackrel{{\scriptstyle\mathclap{\tiny\mbox{(1)}}}}{{p}}\quad\stackrel{{\scriptstyle\mathclap{\tiny\mbox{(2)}}}}{{q}}}{p\land q}\quad\dfrac{\stackrel{{\scriptstyle\mathclap{\tiny\mbox{(3)}}}}{{r\land s}}}{r}}{(p\land q)\land r}}{p\rightarrow((p\land q)\land r)}\tiny\text{(1)}}{q\rightarrow(p\rightarrow((p\land q)\land r))}\tiny\text{(2)}}{(r\land s)\rightarrow(q\rightarrow(p\rightarrow((p\land q)\land r)))}\tiny\text{(3)}.

We can clearly see that this is a tautology because every leaf in the tree has a number above it, so every assumption is discharged.

For a more detailed exposition of natural deduction, we refer readers to Prawitz’s monograph on the topic [Pra06].

2.2 Sequent Calculus

Sequent calculus is another approach to logical derivation. Compared to natural deduction, it has a similar notation for inference rules and a similar tree-like proof structure, but differs greatly in how it handles assumptions. Rather than single logical sentences, each node in the deduction (an α\alpha or β\beta in the notation specified in Section 2.1) is a sequent—a conditional tautology expressed as ΓΔ\Gamma\Rightarrow\Delta where Γ\Gamma is a set of antecedents and Δ\Delta is a single consequent.111 In some systems, Δ\Delta could be set of consequents rather than a singe consequent, but we will not consider such systems here. The consequent is what can be proved given that all of the antecedents are true. The sequent essentially bundles the assumptions together with the consequent, making each sequent in some sense self-contained: while the full derivation is needed to establish validity of the sequent, each individual sequent in the derivation is true on its own; there are no separate assumptions in other parts of the derivation that must be accounted for. This removes the need to discharge assumptions, and instead we can simply remove antecedents. An unconditional tautology would be any consequent for which there are no antecedents. In other words, any Δ\Delta for which Γ\Gamma is the empty set. Sequent calculi include axioms in addition to their inference rules. Rather than each leaf of the deduction tree being an assumption, each leaf must be an axiom. In both axioms and inference rules, a subset of the antecedents can be specified while the rest of the antecedents are left free using the notation Γ,α1αn\Gamma,\alpha_{1}\ldots\alpha_{n}.

As an example, consider a sequent calculus with just the axiom Γ,pp\Gamma,p\Rightarrow p and the inference rule Γ,ABΓAB\frac{\Gamma,A\Rightarrow B}{\Gamma\Rightarrow A\rightarrow B}.222This inference rule is essentially the sequent calculus analogue of the last natural deduction rule from the example in Section 2.1 that was for discharging assumptions. We could use this system to prove the unconditional tautology p(qp)p\rightarrow(q\rightarrow p) as follows:

q,pppqpp(qp).\dfrac{\dfrac{q,p\Rightarrow p}{p\Rightarrow q\rightarrow p}}{\Rightarrow p\rightarrow(q\rightarrow p)}.

For a more detailed exposition of sequent calculus, we refer readers to Appendix A of Prawitz [Pra06].

2.3 Minimal and Intuitionistic Logic

Intuitionistic logic is a logic which rejects both the law of excluded middle (p¬p\vdash p\lor\lnot p) and double negation elimination (¬¬pp\lnot\lnot p\vdash p). Minimal logic, originally introduced by Johansson [Joh37], is an intuitionistic logic that additionally rejects the principle of explosion (p\bot\vdash p).

The notions of intuitionistic and minimal logic can be applied to different syntaxes of logic. They are traditionally formalized in the context of predicate logic [Joh37, Pra06] but may also be applied in the context of propositional logic (as in [Hud93]) or even purely implicational logic (discussed in [Pog94]). Regardless of the syntactic context, the restrictions relative to classical logic remain the same, and the differences between, for example, intuitionistic propositional logic and intuitionistic predicate logic are the same as the differences between classical propositional logic and classical predicate logic (namely the inclusion of quantifiers, quantified variables, and relations).

It is worth noting that negation (¬\lnot) is generally not included as a propositional connective in intuitionistic or minimal logic. Rather, absurdity (\bot) is added as a nullary connective, where ¬p\lnot p is interpreted as a shorthand for pp\rightarrow\bot.

Prawitz formalizes systems of natural deduction for minimal, intuitionistic, and classical logic [Pra06]. The system for minimal logic uses the following inference rules:

I:\displaystyle\land I: ABAB\displaystyle\quad\frac{A\quad B}{A\land B} I:\displaystyle\lor I: AABBAB\displaystyle\quad\frac{A}{A\lor B}\quad\frac{B}{A\lor B} I:\displaystyle\mathord{\rightarrow}I: (A)BAB\displaystyle\quad\frac{\genfrac{}{}{0.0pt}{0}{(A)}{B}}{A\rightarrow B}
E:\displaystyle\land E: ABAABB\displaystyle\quad\frac{A\land B}{A}\quad\frac{A\land B}{B} E:\displaystyle\lor E: AB(A)C(B)CC\displaystyle\quad\frac{\genfrac{}{}{0.0pt}{0}{}{A\lor B}\ \genfrac{}{}{0.0pt}{0}{(A)}{C}\ \genfrac{}{}{0.0pt}{0}{(B)}{C}}{C} E:\displaystyle\mathord{\rightarrow}E: AABB.\displaystyle\quad\frac{A\quad A\rightarrow B}{B}.

These rules consist of one introduction rule and one elimination rule for each standard (binary) propositional connective. Prawitz also included introduction and elimination rules for \forall and \exists because the system was formulated in predicate logic, but we can simply take the propositional fragment (i.e., remove the rules for quantifiers) to get a system for minimal propositional logic.

Prawitz’s system for intuitionistic logic includes all of the above inference rules along with a rule for the principle of explosion, formalized as:

I:A.\bot_{I}:\quad\frac{\bot}{A}.

2.4 Hudelmaier’s Sequent Calculus

Hudelmaier presents a PSPACE{\rm PSPACE} algorithm for intuitionistic propositional logic [Hud93]. The procedure uses backward application of the rules of a sequent calculus (referred to as LG). The PSPACE{\rm PSPACE} classification is achieved because LG was constructed to have the critical property that every deduction of a sequent ss is linearly bounded in length with respect to the length of ss. This is not a linear (or even polynomial) bound on the overall size of the deduction, only the length (as some of the rules produce multiple branches).

The calculus LG includes the axioms Γ,pp\Gamma,p\Rightarrow p and Γ,p\Gamma,\bot\Rightarrow p. The latter axiom is the principle of explosion, and thus can be excluded to create a minimal propositional logic version of the calculus. LG uses 12 inference rules labeled: GI1GI1\land, GI2GI2\land, GI1GI1\lor, GI2GI2\lor, GI1GI1\mathord{\rightarrow}, GI2GI2\mathord{\rightarrow}, GEGE\land, GEGE\lor, GEPGE\mathord{\rightarrow}P, GEGE\mathord{\rightarrow}\land, GEGE\mathord{\rightarrow}\lor, GEGE\mathord{\rightarrow}\mathord{\rightarrow}. For brevity, these will not be listed here in their full form, but they use the following naming conventions: Rules in the form GIXCGIXC, where X{1,2}X\in\{1,2\} and CC is a propositional connective, are introduction rules that introduce CC in the consequent. Rules in the form GECGEC are elimination rules that introduce CC in the antecedent. The four GEGE\mathord{\rightarrow} rules introduce an implication in the antecedent, in which the antecedent of said implication includes the specified connective or, in the case of GEPGE\mathord{\rightarrow}P is just a propositional variable.

2.5 Computational Complexity

In this paper, we assume familiarity with basic concepts of complexity theory. Readers may wish to consult any standard text on the subject, e.g., [Pap94, Sip13].

The complexity class NP{\rm NP} is the class of languages that can be decided by a nondeterministic Turing machine in polynomial time. The class PSPACE{\rm PSPACE} is the class of languages that can be decided by a deterministic Turing machine using polynomial space. The class FP{\rm FP} is the set of polynomial-time computable functions. It is well-known that NPPSPACE{\rm NP}\subseteq{\rm PSPACE}, and whether NP=PSPACE{\rm NP}={\rm PSPACE} is a central open issue in theoretical computer science.

Formally, given two sets (i.e., decision problems) AA and BB, we say that AmpB(fFP)(x)[xAf(x)B]A\leq_{m}^{p}B\iff(\exists f\in{\rm FP})(\forall x)[x\in A\iff f(x)\in B], i.e., AA (polynomial-time) many-one reduces to BB. Many-one reductions are tremendously important in complexity theory, as they allow one to relate the hardness of one problem to that of another. One such way is via “complete problems.”

Formally, a PSPACE{\rm PSPACE}-complete problem BB is a PSPACE{\rm PSPACE} problem such that (APSPACE)[AmpB](\forall A\in{\rm PSPACE})[A\leq_{m}^{p}B]. Informally, PSPACE{\rm PSPACE}-complete problems are the “hardest” problems in PSPACE{\rm PSPACE}. It is well-known that polynomial-time many-one reductions are transitive and that NP{\rm NP} is closed downwards under polynomial-time many-one reductions (i.e., if AmpBA\leq_{m}^{p}B and BNPB\in{\rm NP}, then ANPA\in{\rm NP}). It follows that if some PSPACE{\rm PSPACE}-complete problem is shown to be in NP{\rm NP}, then NP=PSPACE{\rm NP}={\rm PSPACE}.

3 Overview of the Polynomial Proof-Size Argument of [GH19]

Gordeev and Haeusler [GH19] introduce a positive implicational subsystem of Hudelmaier’s LG calculus [Hud93], which they call LM\text{LM}_{\mathord{\rightarrow}}. They claim [GH19, Claim 1] that LM\text{LM}_{\mathord{\rightarrow}} is sound and complete for minimal propositional logic, and that it inherits the properties of LG [GH19, Lemma 2.2-3]. They further claim that deductions in LM\text{LM}_{\mathord{\rightarrow}} have the “semi-subformula property,” meaning that any formula occurring in an LM\text{LM}_{\mathord{\rightarrow}} deduction of ρ\rho is either a proper subformula of ρ\rho or in the form βγ\beta\rightarrow\gamma for some semi-subformula (αβ)γ(\alpha\rightarrow\beta)\rightarrow\gamma of ρ\rho [GH19, Lemma 2.1]. A proof of an upper bound on the number of semi-subformulas establishes that the foundation (number of distinct formulas occurring in a deduction) of any tree-like deduction of ρ\rho in LM\text{LM}_{\mathord{\rightarrow}} is polynomial in |ρ||\rho| (the length of the formula) [GH19, Lemma 2.4].

LM\text{LM}_{\mathord{\rightarrow}} is a sequent calculus, but the compression technique that will be used only works with natural deduction, so the authors introduce NM\text{NM}_{\mathord{\rightarrow}}: the implicational fragment of Prawitz’s ND system for minimal logic [Pra06]. They modify it to include some repetition rules and claim soundness and completeness with respect to minimal propositional logic [GH19, Claim 3, Lemma 5]. They then present a recursive transformation from any derivation in LM\text{LM}_{\mathord{\rightarrow}} to NM\text{NM}_{\mathord{\rightarrow}}, such that the height of the NM\text{NM}_{\mathord{\rightarrow}} derivation is polynomial with respect to the height of the LM\text{LM}_{\mathord{\rightarrow}} derivation, and therefore polynomial in |ρ||\rho| [GH19, Theorem 4].

While the height and foundation of the tree-like deduction have polynomial bounds, the overall size could still be exponential due to repetition of identical formulas in different branches of the tree. To get around this, the Gordeev and Haeusler introduce the notion of DAG-like deductions [GH19, Section 2]. This allows multiple parents to have the same child, so proofs of a semi-subformula α\alpha need not be repeated if α\alpha is needed in multiple places.

Next, they outline a compression algorithm to take a tree-like deduction in NM\text{NM}_{\mathord{\rightarrow}} and convert it into a DAG-like deduction [GH19, Section 3]. DAG-like deductions use a directed acyclic graph instead of a tree to represent the derivations, allowing repeated sub-derivations to be merged into one single sub-derivation that is referenced in multiple different places in the derivation. This leads to the result that any tautology of NM\text{NM}_{\mathord{\rightarrow}} has a DAG-like deduction with overall size polynomial in |ρ||\rho| [GH19, Theorem 14]. Using the soundness and completeness results from earlier, they conclude that any tautology of minimal propositional logic has a DAG-like proof in NM\text{NM}_{\mathord{\rightarrow}} with size polynomial in |ρ||\rho| [GH19, Corollary 15]. This only establishes that tautologies are provable by small DAG-like deductions. Finally, they prove the other direction: that all small DAG-like deductions in NM\text{NM}_{\mathord{\rightarrow}} do represent tautologies [GH19, Theorem 19]. Together this gives the soundness and completeness with respect to minimal propositional logic of DAG-like NM\text{NM}_{\mathord{\rightarrow}} provability [GH19, Corollary 20] and the most central result (which is Corollary 21 in the paper):

Corollary 1 ([GH19]).

A given formula ρ\rho is valid in minimal propositional logic iff there exists a DAG-like NM\text{NM}_{\mathord{\rightarrow}} proof of ρ\rho whose size is O(|ρ|4)O(|\rho|^{4})

The paper ends by addressing the complexity of verifying small DAG-like proofs in NM\text{NM}_{\mathord{\rightarrow}}, but does not reach a definitive conclusion [GH19, Section 5].

4 Errors in the Polynomial Proof-Size Argument of [GH19]

The argument that builds to Corollary 1 [GH19] fails due to a conflation of minimal implicational logic and minimal propositional logic, which are not equivalent. While both are types of minimal logic, they have a different syntax. Specifically, minimal implicational logic is a fragment of minimal propositional logic, meaning that minimal implicational logic is a subset of minimal propositional logic formed by restricting the syntax. While minimal propositional logic allows all standard propositional connectives (\land, \lor, \mathord{\rightarrow}, \bot), minimal implicational logic allows only implication (\mathord{\rightarrow}) (and sometimes \bot, depending on the formalization).

The authors note that they are considering a language of minimal logic using only propositional variables and the connective \rightarrow [GH19, Section 1]. This is minimal implicational logic, but the authors seem to be under the impression that it is equivalent to minimal propositional logic. Rather than take the difference in logic systems for granted and just assume the results don’t scale, this section will firmly establish why their proof construction and compression does not work for minimal propositional logic.

4.1 Omissions from Referenced Proof Systems

The two calculi introduced in the paper, LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}, are not novel systems—they are modifications to existing systems proposed by Hudelmaier [Hud93] and Prawitz [Pra06], respectively. Thus, proofs of their soundness and completeness with respect to minimal propositional logic are delegated to the papers from which the systems originated [GH19, Claim 1, Claim 3]. However, the modifications are substantial and equivalence is not proved, so delegating the proofs is not a valid strategy. Thus, LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} are not established to be sound and complete for minimal propositional logic.

In both cases, all rules that are not purely implicational are left out, as well as all rules including \bot. In converting from LG [Hud93] to LM\text{LM}_{\mathord{\rightarrow}}, this includes the second axiom (Γ,p\Gamma,\bot\Rightarrow p), which is correctly omitted to modify the intuitionistic logic to a minimal logic, along with GI1GI1\land, GI2GI2\land, GI1GI1\lor, GI2GI2\lor, GEGE\land, GEGE\lor, GEGE\mathord{\rightarrow}\land, and GEGE\mathord{\rightarrow}\lor, which are incorrectly omitted, resulting in a modification from minimal propositional logic to minimal implicational logic. In converting from Prawitz’s minimal logic [Pra06] to NM\text{NM}_{\mathord{\rightarrow}}, rules I\land I, E\land E, I\lor I, and E\lor E are incorrectly omitted, again resulting in a modification from minimal propositional logic to minimal implicational logic. These inference rules are needed to prove many things in minimal propositional logic, so omitting them weakens the system.

4.2 Incompleteness of LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}

As a counterexample to the completeness of LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} with respect to minimal propositional logic, consider

(pq)p.(p\land q)\rightarrow p.

This is clearly provable in minimal propositional logic, as demonstrated by the following natural deduction proof:

pq(1)p(pq)p(1)(E)(I).\displaystyle\dfrac{\dfrac{\stackrel{{\scriptstyle\mathclap{\tiny\mbox{(1)}}}}{{p\land q}}}{p}}{(p\land q)\rightarrow p}\tiny\text{(1)}\quad\genfrac{}{}{0.0pt}{0}{(\land E)}{(\mathord{\rightarrow}I)}.

Pure implication is not functionally complete and thus cannot express all the standard propositional connectives. LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}, being purely implicational systems, cannot directly express, for example, \land. This alone could be considered sufficient evidence against the completeness of LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}; if they cannot even express certain tautologies of minimal propositional logic they have no hope of proving the tautologies. However, one might argue that a system that is able to prove the same notions in a different notation could still be considered complete for minimal propositional logic. At the very least, the overall results of the paper would hold if minimal propositional logic could be polynomial-time many-one reduced to minimal implicational logic.

The functional incompleteness of implication can be resolved with the addition of \bot. For example, pqp\land q could be expressed as (p(q))(p\rightarrow(q\rightarrow\bot))\rightarrow\bot. Since each standard propositional connective could be replaced with an equivalent expression using only \rightarrow and \bot in polynomial time, we can permit this transformation with regards to time complexity results. The paper does establish that the language of minimal logic being used does not include \bot [GH19, Section 1], but since allowing \bot to take the place of any predicate in an existing axiom or rule does not impact any of their proofs, we will permit it in an attempt to recover the results of the paper.

Using only implication and \bot, (pq)p(p\land q)\rightarrow p can be expressed as:

((p(q)))p.((p\rightarrow(q\rightarrow\bot))\rightarrow\bot)\rightarrow p.

There are other ways it could be expressed, but this is the simplest and most standard. The below argument will work in exactly the same way for any alternative choice of how to represent \land.333It is worth noting that the representation must treat the (pq)(p\land q) clause as atomic, and thus cannot restructure the entire sentence to something like p(qp)p\rightarrow(q\rightarrow p). While p(qp)p\rightarrow(q\rightarrow p) also expresses a similar idea, it cannot be used for a transformation from minimal propositional logic to minimal implicational logic because it depends on the \rightarrow following the \land clause, and thus cannot be generalized.

Because the proof systems LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} have no axioms or inference rules specifically including \bot, formulas including \bot can only be derived by treating \bot as an ordinary subformula, no different from some arbitrary AA. In other words, the special meaning we have assigned to \bot as a nullary connective cannot be leveraged in derivations. Any derivation involving \bot could be modified by replacing every occurrence of \bot with some new propositional variable, and each rule application would remain valid, as pp could take the place of a variable in an inference rule just as easily as \bot could. For the same reason, any propositional variable could be replaced with \bot, as the variable cannot be broken down into smaller components to take advantage of some rule that \bot cannot. Essentially, \bot and pp are both atomic and indistinguishable from the perspective of the inference rules in LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}, so they are interchangeable. For example, (p)\bot\rightarrow(p\rightarrow\bot) is a theorem in LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} precisely because q(pq)q\rightarrow(p\rightarrow q) is also a theorem in LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}.

Thus, ((p(q)))p((p\rightarrow(q\rightarrow\bot))\rightarrow\bot)\rightarrow p is a theorem in LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} if and only if ((p(qr))r)p((p\rightarrow(q\rightarrow r))\rightarrow r)\rightarrow p is a theorem. The latter is obviously not a tautology in minimal propositional logic (or even in the more permissive classical logic—consider the case where pp is false and rr is true), and therefore is not a theorem either since LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} are both sound.

We can thus conclude that (pq)p(p\land q)\rightarrow p is a tautology in minimal propositional logic but is not a theorem of LM\text{LM}_{\mathord{\rightarrow}} or NM\text{NM}_{\mathord{\rightarrow}}, and cannot be transformed into an equivalent implicational theorem of LM\text{LM}_{\mathord{\rightarrow}} or NM\text{NM}_{\mathord{\rightarrow}}. Therefore, LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} are not complete with respect to minimal propositional logic.

4.3 Ramifications of the Errors

The paper claims that tree-like provability in NM\text{NM}_{\mathord{\rightarrow}}{} is sound and complete with respect to minimal propositional logic [GH19, Lemma 5]. We have established this to be false. It makes the same claim for DAG-like provability in NM\text{NM}_{\mathord{\rightarrow}}{} [GH19, Corollary 20], but depends on the soundness and completeness of tree-like provability [GH19, Lemma 5] to do so, so it does not hold. Corollary 1 [GH19, Corollary 21], the central result of the paper, depends on soundness and completeness of DAG-like provability [GH19, Corollary 20], and thus it also does not hold. Therefore, the paper has not proven that all (and only) tautologies of minimal propositional logic have a proof whose size is polynomial in the size of the tautology.

4.4 Challenges in Recovering from the Errors

One might try to recover the results of the paper by extending LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} to include the omitted rules and updating the transformation and compression algorithms to account for \land and \lor. This would require adding at least eight additional cases to the recursive function that converts from LM\text{LM}_{\mathord{\rightarrow}} to NM\text{NM}_{\mathord{\rightarrow}} to handle the eight missing rules of LG, and would disrupt many of the assumptions and proofs throughout the paper.

Notably, the inclusion of the rest of the rules from LG would invalidate the semi-subformula property [GH19, Lemma 2.1]. For a counterexample, consider inference rule GEGE\mathord{\rightarrow}\lor:444 The notation has been slightly modified from the original paper to maintain consistency of notation within this paper; the semantics remain identical. Note that pp is a propositional variable not occurring in AA, BB, CC, or DD.

Γ,Ap,Bp,pCDΓ,(AB)CD.\frac{\Gamma,A\rightarrow p,B\rightarrow p,p\rightarrow C\Rightarrow D}{\Gamma,(A\lor B)\rightarrow C\Rightarrow D}.

This involves multiple formulas that cannot appear in the resulting formula that they are being used to prove. Thus, even a weaker notion of the semi-subformula property (one that allows more nonproper subformulas beyond just βγ\beta\rightarrow\gamma as a semi-subformula of (αβ)γ(\alpha\rightarrow\beta)\rightarrow\gamma) could not hold, as arbitrary propositional variables can appear in a deduction in LG that are not anywhere in the formula being proved.

The semi-subformula property is crucial for proving the polynomial bound on foundation in LM\text{LM}_{\mathord{\rightarrow}} [GH19, Lemma 2.4]. The bound on foundation depends on a polynomial bound on the number of distinct semi-subformulas of any given formula, but if the semi-subformula property is lost and arbitrary formulas can appear in the deduction that are not semi-subformulas, the bound on foundation is lost as well. Without a polynomial bound on foundation in LM\text{LM}_{\mathord{\rightarrow}}, the polynomial bound on foundation for deductions in NM\text{NM}_{\mathord{\rightarrow}} [GH19, Theorem 4] would also fail to hold, as it relies on property carryover from LG. The bound on foundation for NM\text{NM}_{\mathord{\rightarrow}} is what allows the compression algorithm to get a polynomial bound in size. Only identical formulas can be merged in the DAG, so if there is no polynomial bound on foundation then the compression cannot get a polynomial bound on size [GH19, Corollary 15]. Without the polynomial bound on size, Corollary 1 [GH19, Corollary 21] is similarly lost. Thus even the substantial additions that would be required to cover the rest of minimal propositional logic are not sufficient to recover the central results of the paper.

5 Implications for the Claimed Resolution of 𝐍𝐏{\rm NP} vs. 𝐏𝐒𝐏𝐀𝐂𝐄{\rm PSPACE}

In a subsequent paper, [GH20], Gordeev and Haeusler build on their earlier work [GH19] and claim to prove that NP=PSPACE{\rm NP}={\rm PSPACE}. In their proof, they claim that the PSPACE{\rm PSPACE}-complete problem of deciding if a formula is provable in minimal logic is also in NP{\rm NP} by leveraging their polynomial upper bound on the size of proofs in minimal propositional logic [GH19]. However, as we saw in the previous section, that proof is flawed and does not actually provide a polynomial upper bound on the size of proofs in minimal propositional logic. In this section, we briefly detail some of the errors made by Gordeev and Haeusler [GH20] and conclude that the paper fails to establish that NP{\rm NP} equals PSPACE{\rm PSPACE}.

We first note that the paper cites Statman [Sta79] and Svejdar [Sve03] as sources for the PSPACE{\rm PSPACE}-completeness of the problem of deciding if a formula is provable in minimal logic (in both papers by Gordeev and Haeusler [GH19, GH20], minimal logic is actually minimal implicational logic, although they incorrectly call it minimal propositional logic). However, upon closer inspection, these papers give completeness results for the problem of deciding if a formula is provable in intuitionistic propositional logic, which is a stronger form of logic than minimal implicational logic. We are not aware of analogous PSPACE{\rm PSPACE}-complete problems with respect to minimal propositional logic. However, for the sake of argument, we entertain the notion that such results may exist and try to see where the argument made in [GH20] fails.

Theorem 2 ([GH20]).

PSPACENP{\rm PSPACE}\subseteq{\rm NP} and hence NP=PSPACE{\rm NP}={\rm PSPACE}.

In their proof of Theorem 2 (which is Theorem 2.8 in the original paper), the authors argue that the theorem follows directly from [GH19, Corollary 15]. Fortunately, they also explain how it follows, and from that we can easily spot the error, thanks to our observations from the previous section. Their argument starts as follows:

First we “guess” the existence of “short” Hudelmaier-style cutfree sequential deduction of ρ\rho that leads (by deterministic compression) to a “small” natural deduction frame D~\tilde{D} that is supposed to have fst \mathcal{F}. Then we “guess” the existence of a “cleansed” modified subdeduction that confirms in |ρ||\rho|-polynomial time the provability of ρ\rho with regard to D~,\langle\tilde{D},\mathcal{F}\rangle. — [GH20]

Although it is not explicit in their argument, ρ\rho is simply a sentence whose provability in minimal propositional logic they are trying to decide. Unfortunately, several notions in the proof are not properly defined. In order to analyze the above proof, we interpret those terms as faithfully as possible. We assume that “short” means “of height polynomial in |ρ||\rho|” and that the “guess” is a nondeterministic guess by a nondeterministic polynomial-time machine. We also assume that “small” means “of size polynomial in |ρ||\rho|.” In this critique, we do not cover the definitions “deduction frame,” “fst,” “subdeduction,” and so on. Rather, we refer interested readers to the original papers if they wish to have those definitions. However, the gist of the argument is as follows: Nondeterministically guess a deduction of polynomial height (in |ρ||\rho|) and deterministically compress it to one of polynomial size (in |ρ||\rho|). Finally guess another deduction of size polynomial in |ρ||\rho| and use both that newly guessed deduction and the compressed one to verify that ρ\rho is provable in minimal propositional logic. Unfortunately, per Section 4, there is no established polynomial upper bound on the size of proofs (i.e., deductions) in the minimal propositional logic, and so, it’s entirely possible that no nondeterministic polynomial-time machine can guess a deduction for an arbitrary (and provable in minimal propositional logic) ρ\rho. We thus observe a ripple effect, having its source in an earlier paper by Gordeev and Haeusler [GH19], that directly impacts the central result of [GH20].

6 Conclusion

Gordeev and Haeusler’s [GH19] modifications to Hudelmaier’s minimal propositional logic result in LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}}, both of which are implicational systems that are not complete with respect to minimal propositional logic. Completeness is required to prove a polynomial bound on the size of all proofs of tautologies of minimal propositional logic, and extending LM\text{LM}_{\mathord{\rightarrow}} and NM\text{NM}_{\mathord{\rightarrow}} to be complete invalidates numerous lemmas and theorems that are required for the final result. The invalid result of [GH19], which claims that any tautology of NM\text{NM}_{\mathord{\rightarrow}} has a DAG-like deduction of polynomial-bounded size, directly impacts their later paper, which uses the polynomial bound on proofs of minimal propositional logic to argue that NP=PSPACE{\rm NP}={\rm PSPACE}.

Acknowledgements

We thank Ian Clingerman, Lane A. Hemaspaandra, and Quan Luu for their helpful comments on previous versions of this article. Additionally, we thank David E. Narváez for helping us get this project underway and into its current state. Any remaining errors are the responsibility of the authors.

References

  • [GH19] L. Gordeev and E. Haeusler. Proof compression and NP versus PSPACE. Studia Logica, 107(1):53–83, 2019.
  • [GH20] L. Gordeev and E. Haeusler. Proof compression and NP versus PSPACE II. Bulletin of the Section of Logic, 49(3):213–230, 2020.
  • [Hud93] J. Hudelmaier. An O(nlogn)O(n\log{n})-space decision procedure for intuitionistic propositional logic. Journal of Logic and Computation, 3(1):63–75, 1993.
  • [Joh37] I. Johansson. Der Minimalkalkül, ein reduzierter intuitionistischer Formalismus. Compositio Mathematica, 4:119–136, 1937.
  • [Pap94] C. Papadimitriou. Computational Complexity. Addison-Wesley, 1994.
  • [Pog94] W. Pogorzelski. A minimal implicational logic. In J. Woleǹski, editor, Philosophical Logic in Poland, pages 213–216. Springer Netherlands, 1994.
  • [Pra06] D. Prawitz. Natural Deduction: A Proof-Theoretical Study. Dover, 2006.
  • [Sip13] M. Sipser. Introduction to the Theory of Computation. Cengage Learning, 3rd edition, 2013.
  • [Sta79] R. Statman. Intuitionistic propositional logic is polynomial-space complete. Theoretical Computer Science, 9(1):67–72, 1979.
  • [Sve03] V. Svejdar. On the polynomial-space completeness of intuitionistic propositional logic. Archive for Mathematical Logic, 42(7):711–716, 2003.