1639 \lmcsheadingLABEL:LastPageApr. 07, 2020Aug. 12, 2020
A limitation on the KPT interpolation
Abstract.
We prove a limitation on a variant of the KPT theorem proposed for propositional proof systems by Pich and Santhanam [7], for all proof systems that prove the disjointness of two NP sets that are hard to distinguish.
Key words and phrases:
Propositional proof complexity, interpolation, KPT theorem.For a coNP property , given , we can construct a size propositional formula with atoms and atoms such that for any , is true iff . This is just a restatement of the NP-completeness of SAT. In addition, if is defined in a suitable language of arithmetic and has a suitable logical form, the translation can be defined purely syntactically without a reference to machines or computations. This then allows to transform also a possible first-order proof of into a sequence of short propositional proofs of tautologies , ; if the original proof uses axioms of theory (essentially any sound r.e. theory) then the propositional proofs will be in a proof system associated to . Many standard proof systems are of the form for some , and this is often the most efficient way how to construct short -proofs of uniform sequences of tautologies. Although the unprovability of in does not imply lower bounds for -proofs of the tautologies, a method used in establishing the unprovability sometimes yields an insight how the lower bound could be proved. All this is a well-established part of proof complexity and the reader can find it in [4, Chpt.12] (or in references given there).
The translation is, however, not entirely faithful for formulas of a certain logical form, and this is an obstacle for transforming the conditional unprovability result for strong universal theories in [3] into conditional lower bounds for strong proof systems. To explain the problem in some detail assume has the form
(1) |
where is a p-time property and is the bit length of . The provability of in a universal can be analyzed using the KPT theorem which provides an efficient interactive algorithm for finding given (cf. [6] or [4, Sec.12.2]). The same method does not, however, work in the propositional setting. To illustrate this assume that has a proof in proof system attached to and from that we can deduce in that
(2) |
is a tautology (in addition the translation assures that all are disjoint tuples of atoms). This implies in that for all assignments and for all and all variables there is such that is true. But to get (1) (and then use the KPT analysis from [3]) we would need to show that for all there is one such that for all the formula is true. Unfortunately, to derive this one needs to use the bounded collection scheme (allowing to move the quantifier bounding before the quantifier bounding ) and this scheme is not available in universal theories under consideration, cf. [1]. The reader can find more about this issue in [3, Sec.5] or at the end of [4, Sec.12.8]; knowing this background offers my motivation for this research (which differs perhaps from that of [7]) but it is not needed to understand the argument below.
Pich and Santhanam [7] proposed a direct way how to bypass this obstacle: simply ignore it and prove a version of the KPT theorem for (some, at least) strong propositional proof systems. For such proof systems a conditional lower bound can be indeed proved, cf. [7] or [3].
[[7]] Let P be a propositional proof system. The system has KPT interpolation if there are a constant and p-time functions
such that whenever is a P-proof of a disjunction of the form
where is a -tuple of atoms and are disjoint tuples of atoms, then for all the following is valid for all of the appropriate lengths:
-
•
either for or, if is false,
-
•
for or, if is false,
-
•
, or
-
•
for .
An illuminating interpretation of the definition can be made using the interactive communication model of [5] involving Student and Teacher. Student is a p-time machine while Teacher has unlimited powers. At the beginning Student gets and the proof and computes from it his first candidate solution: index such that is — he thinks — a tautology. Teacher either approves or she provides Student with a counter-example: an assignment for which falsifies the formula. In the next round Student can use this counter-example to propose his next candidate solution, etc. Functions in the definition form a strategy for Student so that he solves the task for all and in steps in the worst case. Note that if we fixed as in ordinary interpolation then would suffice; the concept makes sense for variable only.
Unfortunately, we show in this note that this property fails for strong proof systems (above a low depth Frege system) for essentially the same reasons why ordinary feasible interpolation fails for them (cf. [4, Sec.18.7]). For a set and put . is the -depth subsystem of sequent calculus (cf. [4, Sec.3.4]).
Theorem 1.
Let P be a proof system containing . Assume that are disjoint NP sets such that:
-
(1)
Propositional formulas expressing that have p-size P-proofs.
-
(2)
For any constant , for all large enough there is a distribution on with support such that there is no size circuit for which
where samples in the probability are chosen according to .
Then P does not admit KPT interpolation.
Remarks.
-
(1)
An example of a pair of two NP sets that are conjectured to satisfy the second condition can be defined using one-way permutation (more generally an injective one-way function with output length determined by input length) and its hard bit: (resp. ) are the strings in the range of the permutation whose hard bit is (resp. ). Distribution is in this case generated by the permutation from the uniform distribution on the seed strings, i.e. it is uniform itself.
-
(2)
It is known that the hypothesis of the theorem can be fulfilled for systems such as EF, F, -F and, under stronger hypotheses about non-separability of and , also for -F above certain small depth; see the comprehensive discussion in [4, Sec.18.7].
-
(3)
The phrase that P contains means for simplicity just that: P can operate with sequents consisting of -depth formulas and all -proofs are also P-proofs. However, this is used only in Claim 1 and, in fact, it would suffice that P represents formulas and (defined below) in some other formalism and efficiently simulates modus ponens.
Proof of the theorem occupies the rest of this note.
Write for a p-time relation that witnesses and similarly for , with the length of both and p-bounded in the length of . Let and for strings of length each consider the following propositional formulas translating the predicates and (which we shall denote also and in order to ease on notation):
-
•
: is an -tuple of atoms for bits of and is an -tuple of atoms for bits of a witness associated with together with bits needed to encode as propositional formula suitable for P (e.g. as 3CNF),
-
•
: analogously for ,
-
•
where all are disjoint.
Consider the induction statement:
(3) |
and write it as a disjunction with disjuncts:
(4) |
Now replace by and by and write it propositionally:
(5) |
Note that except the -variables the disjuncts are disjoint.
Claim 1.
(5) has a p-size proof in P.
To see this note that induction (3) can be proved by simulating modus ponens (here we use that P contains ). Disjunction (5) follows from it because we assume that the disjointness of has short P-proofs, i.e. has a short proof.
Now apply the supposed KPT interpolation to (5). W.l.o.g. we shall assume (and arrange that in the construction below) that and (with witnesses and , respectively). Hence Student in the KPT computation is supposed to find for which the -th disjunct
is valid (i.e. where the induction step going from to fails). We shall show that the existence of such a KPT p-time Student allows to separate from with a non-negligible advantage violating the hypotheses of the theorem.
Take any such that (the upper bound implies that the proof in Claim 1 is of size ). For define:
Note that any string satisfies and .
Let and be the constant and the p-time functions provided the assumed KPT interpolation for P. Assume that is the most frequent value computes on inputs from (thinking of a P-proof as fixed). This maximal frequency is at least . (Here the frequency means with respect to the product of distributions on for which it is assumed that are hard to separate.)
Claim 2.
The frequency on is at least , i.e. it is at least modulo a negligible error.
Note that for any the frequency for can differ only negligibly because otherwise we could use the usual triangle inequality argument to find a non-negligible discrepancy between frequencies on and for some , and use it to separate from (on position , after fixing the rest of coordinates by averaging). Because all are disjoint, the frequency must be up to a negligible difference.
Now we describe a process that transforms the assumed successful strategy for Student into a p-time algorithm with p-size advice, separating with a non-negligible advantage.
Assume first . By averaging there are s.t. with frequency at least (the factor in the denominator allows us to forget about the “up to the negligible error” phrase) for all of the form:
Fix such and also witnesses for their membership in . These will be used as advice for the eventual algorithm.
If then fill analogously the last positions by elements of and include the relevant witnesses in the advice. W.l.o.g. we assume that the first case occurred.
We interpret this situation as reducing the Student-Teacher computation to rounds on smaller universe . Namely, given define:
(6) |
and run on . If , declare failure. Otherwise use the advice witnesses to produce a falsifying assignment for : holds.
After this first step use functions (and Claim 2 for the smaller universes) and as long as they give values always answer for Teacher using the advice strings . Eventually Student proposes value : choose the most frequent such value and proceed as in case of , further restricting domain (6) as in binary search. Repeating this at most -times the situation will be as follows:
-
(1)
The universe will shrink at most to which is at least . In fact, we shall arrange in the last step that exactly remains (by filling in more positions by elements of or , respectively, if needed) and hence the inputs before applying the last KPT function are of the form with and .
Note that Student gets to use because if he succeeded earlier it would violate Claim 2.
-
(2)
The last function has to find a gap in the induction, and this itself will violate Claim 2. In particular, the gap is either between and and then , or between and and then .
-
(3)
This process has the probability , i.e. non-negligible, of not failing in any of the rounds and hence it will not fail and will compute correctly the membership of (any) in or with a non-negligible probability. In all cases when the process fails output random bit or with equal probability.
This proves the theorem. ∎
We conclude by pointing out that the KPT theorem enters propositional proof complexity also via notions of pseudo-surjective and iterable maps in the theory of proof complexity generators, cf. [2] or [4, Sec.19.4] for detailed expositions of this subject.
Acknowledgment
I thank J. Pich (Oxford) for comments on an earlier note.
References
- [1] S. A. Cook and N. Thapen. The strength of replacement in weak arithmetic. ACM Transactions on Computational Logic, 7:4, (2006).
- [2] J. Krajíček. Dual weak pigeonhole principle, pseudo-surjective functions, and provability of circuit lower bounds. J. of Symbolic Logic, 69(1), (2004), pp.265-286.
- [3] J. Krajíček. On the proof complexity of the Nisan-Wigderson generator based on a hard function. J. of Mathematical Logic, 11(1), (2011), pp.11-27.
- [4] J. Krajíček. Proof complexity. Encyclopedia of Mathematics and Its Applications, Vol. 170, Cambridge University Press, 2019.
- [5] J. Krajíček, P. Pudlák, and J. Sgall. Interactive Computations of Optimal Solutions. in: B. Rovan (ed.): Mathematical Foundations of Computer Science (B. Bystrica, August ’90), Lecture Notes in Computer Science 452, Springer-Verlag, (1990), pp. 48-60.
- [6] J. Krajíček, P. Pudlák and G. Takeuti. Bounded arithmetic and the polynomial hierarchy. Annals of Pure and Applied Logic, 52, (1991), pp.143–153.
- [7] J. Pich and R. Santhanam. Strong Co-Nondeterministic Lower Bounds for NP Cannot be Proved Feasibly. preprint, (2020).