This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\hideLIPIcs

Karlsruhe Institute of Technology, Karlsruhe, [email protected]://orcid.org/0009-0002-7500-6848 Karlsruhe Institute of Technology, Karlsruhe, [email protected] \CopyrightGeri Gokaj and Marvin Künnemann \ccsdesc[100]Theory of computation \to Computational complexity and cryptography \to Complexity theory and logic \fundingThis research is supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - 462679611.

Acknowledgements.
The authors like to thank the reviewers for constructive feedback as well as Karl Bringmann and Nick Fischer for helpful discussions. \EventEditorsRaghu Meka \EventNoEds1 \EventLongTitle16th Innovations in Theoretical Computer Science Conference (ITCS 2025) \EventShortTitleITCS 2025 \EventAcronymITCS \EventYear2025 \EventDateJanuary 7–10, 2025 \EventLocationColumbia University, New York, NY, USA \EventLogo \SeriesVolume325 \ArticleNo55

Completeness Theorems for kk-SUM and Geometric Friends: Deciding Fragments of Linear Integer Arithmetic

Geri Gokaj    Marvin Künnemann
Abstract

In the last three decades, the kk-SUM hypothesis has emerged as a satisfying explanation of long-standing time barriers for a variety of algorithmic problems. Yet to this day, the literature knows of only few proven consequences of a refutation of this hypothesis. Taking a descriptive complexity viewpoint, we ask: What is the largest logically defined class of problems captured by the kk-SUM problem?

To this end, we introduce a class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} of problems corresponding to deciding sentences in Presburger arithmetic/linear integer arithmetic over finite subsets of integers. We establish two large fragments for which the kk-SUM problem is complete under fine-grained reductions:

  1. 1.

    The kk-SUM problem is complete for deciding the sentences with kk existential quantifiers.

  2. 2.

    The 33-SUM problem is complete for all 33-quantifier sentences of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} expressible using at most 33 linear inequalities.

Specifically, a faster-than-nk/2±o(1)n^{\lceil k/2\rceil\pm o(1)} algorithm for kk-SUM (or faster-than-n2±o(1)n^{2\pm o(1)} algorithm for 33-SUM, respectively) directly translate to polynomial speedups of a general algorithm for all sentences in the respective fragment.

Observing a barrier for proving completeness of 33-SUM for the entire class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}, we turn to the question which other – seemingly more general – problems are complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. In this direction, we establish 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}-completeness of the problem pair of Pareto Sum Verification and Hausdorff Distance under nn Translations under the LL_{\infty}/L1L_{1} norm in d\mathbb{Z}^{d}. In particular, our results invite to investigate Pareto Sum Verification as a high-dimensional generalization of 3-SUM.

keywords:
fine-grained complexity theory, descriptive complexity, presburger arithmetic, completeness results, k-SUM
category:
\relatedversion

1 Introduction

Consider a basic question in complexity theory: How can we determine for which problems an essentially quadratic-time algorithm is best possible? If a given problem AA admits an algorithm running in n2+o(1)n^{2+o(1)} time, and it is known that AA cannot be solved in time O(n2ϵ)O(n^{2-\epsilon}) for any ϵ>0\epsilon>0, then clearly the n2+o(1)n^{2+o(1)} algorithm has optimal runtime, up to subpolynomial factors. This question can be asked more generally for any k1k\geq 1 and time nk±o(1)n^{k\pm o(1)}. To this day, the theoretical computer science community is far from able to resolve this question unconditionally. However, a surge of results over recent years uses conditional lower bounds based on plausible hardness assumptions to shed some light on why some problems seemingly cannot be solved in time O(nkϵ)O(n^{k-\epsilon}) for any ϵ>0\epsilon>0. Most notably, reductions from kk-OV, kk-SUM and the weighted kk-clique problem have been used to establish nko(1)n^{k-o(1)}-time conditional lower bounds, often matching known algorithms; see [50] for a detailed survey.

In this context, the 33-SUM hypothesis is arguably the first – and particularly central – hardness assumption for conditional lower bounds. Initially introduced to explain various quadratic-time barriers observed in computational geometry [35], it has since been used to show quadratic-time hardness for a wealth of problems from various fields [52, 46, 6, 40, 29, 3, 21]. Its generalization, the kk-SUM111The kk-SUM problem asks, given sets A1,,AkA_{1},\dots,A_{k} of nn numbers, whether there exist a1A1,,akAka_{1}\in A_{1},\dots,a_{k}\in A_{k} such that i=1kai=0\sum_{i=1}^{k}a_{i}=0. The kk-SUM hypothesis states that for no ϵ>0\epsilon>0 there exists a O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon}) time algorithm that solves kk-SUM. hypothesis, has led to further conditional lower bounds beyond the quadratic-time regime [31, 4, 1, 2, 41]. For a more comprehensive overview, we refer to [50].

The centrality of the 33-SUM hypothesis for understanding quadratic-time barriers begs an interesting question: Does 33-SUM fully capture quadratic-time solvability, in the sense that it is hard for the entire class 𝖣𝖳𝖨𝖬𝖤(n2)\mathsf{DTIME}(n^{2})? Alas, Bloch, Buss, and Goldsmith [10] give evidence that we are unlikely to prove this: If 33-SUM is hard for 𝖣𝖳𝖨𝖬𝖤(n2)\mathsf{DTIME}(n^{2}) under quasilinear reductions, then 𝖯𝖭𝖯\mathsf{P}\neq\mathsf{NP}. Thus, to understand precisely the role of 33-SUM to understand quadratic-time computation, the more reasonable question to ask is:

What is the largest class 𝒞\mathcal{C} of problems such that 33-SUM is 𝒞\mathcal{C}-hard?222Note that there are different reasonable notions of reductions to consider. Rather than the quasilinear reductions used by Bloch et al., we will consider the currently more commonly used notion of fine-grained reductions; see Section 1.2 for details on the notion of completeness that we will use.

Finding a large class 𝒞\mathcal{C} for which 33-SUM is hard may be seen as giving evidence for the 33-SUM hypothesis. Furthermore, such a result may clarify the true expressive power of the 33-SUM hypothesis, much like the NP-completeness of 3-SAT highlights its central role for polynomial intractability.

1.1 Our approach

We approach our central question from a descriptive complexity perspective. This line of research has been initiated by Gao et al. [36], who establish the sparse OV problem as complete for the class of model checking first-order properties. One can interpret this result as showing that the OV problem expresses relational database queries in the sense that a truly subquadratic algorithm for OV would improve the fine-grained data complexity of such queries (see [36] for details). Related works further delineate the fine-grained hardness of model checking first-order properties and related problem classes [49, 14, 12, 7, 13, 32], see Section 1.3 for more discussion.

Towards continuing the line of research on fine-grained completeness theorems, we introduce a class of problems corresponding to deciding formulas in linear integer arithmetic over finite sets of integers. Specifically, consider the vectors

x1=(x1[1],,x1[d1]),,xk=(xk[1],,xk[dk])x_{1}=(x_{1}[1],\dots,x_{1}[d_{1}]),\dots,x_{k}=(x_{k}[1],\dots,x_{k}[d_{k}])

as quantified variables, and let t1,,tlt_{1},\dots,t_{l} be free variables. Moreover, let

X:={x1[1],,x1[d1],xk[1],,xk[dk],t1,,tl},X:=\{x_{1}[1],\dots,x_{1}[d_{1}],\dots x_{k}[1],\dots,x_{k}[d_{k}],t_{1},\dots,t_{l}\},

and let ψ\psi be a quantifier-free linear arithmetic formula over variables in XX. We consider the model-checking problem for formulas ϕ\phi in the prenex normal form

ϕ:=Q1x1Qkxk:ψ,\phi:=Q_{1}x_{1}\dots Q_{k}x_{k}:\psi,

where the quantifiers Q1,,Qk{,}Q_{1},\dots,Q_{k}\in\{\exists,\forall\} are arbitrary. Formally, for such a ϕ\phi, we define the model checking problem 𝖥𝖮𝖯(ϕ)\mathsf{FOP}_{\mathbb{Z}}(\phi) as follows333Below, we use the notation ψ[(t1,,tl)\(t1^,,tl^)]\psi[(t_{1},\dots,t_{l})\backslash(\hat{t_{1}},\dots,\hat{t_{l}})] to denote the substitution of the variables t1,,tlt_{1},\dots,t_{l} by t1^,,tl^\hat{t_{1}},\dots,\hat{t_{l}} respectively.

𝖥𝖮𝖯(ϕ):\displaystyle\mathsf{FOP}_{\mathbb{Z}}(\phi): (1)
Input: Finite sets A1d1,,Akdk and t1^,,tl^.\displaystyle\text{ Finite sets }A_{1}\subseteq\mathbb{Z}^{d_{1}},\dots,A_{k}\subseteq\mathbb{Z}^{d_{k}}\text{ and }\hat{t_{1}},\dots,\hat{t_{l}}\in\mathbb{Z}.
Problem: Does Q1x1A1QkxkAk:ψ[(t1,,tl)\(t1^,,tl^)] hold?\displaystyle\text{ Does }Q_{1}x_{1}\in A_{1}\dots Q_{k}x_{k}\in A_{k}:\psi[(t_{1},\dots,t_{l})\backslash(\hat{t_{1}},\dots,\hat{t_{l}})]\text{ hold?}

We let n:=maxi{|Ai|}n:=\max_{i}\{|A_{i}|\} denote the input size and will assume throughout the paper that all input numbers (i.e., the coordinates of the vectors in A1,,AkA_{1},\dots,A_{k} and the values t1^,,tl^\hat{t_{1}},\dots,\hat{t_{l}}) are chosen from a polynomially sized universe, i.e., {U,,U}\{-U,\dots,U\} with UncU\leq n^{c} for some cc. Let 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} be the union of all 𝖥𝖮𝖯(ϕ)\mathsf{FOP}_{\mathbb{Z}}(\phi) problems, where ϕ\phi has at least 3 quantifiers.444It is not too difficult to see that all formulas with 2 quantifiers can be model-checked in near-linear time; see the full version for details. Besides 33-SUM, a variety of interesting problems is contained in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}; we discuss a few notable examples below and in the full version.

Frequently, we will distinguish formulas in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} using their quantifier structure; e.g., 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\forall) describes the class of model checking problems 𝖥𝖮𝖯(ϕ)\mathsf{FOP}_{\mathbb{Z}}(\phi) where in ϕ\phi we have Q1=Q2=Q_{1}=Q_{2}=\exists and Q3=.Q_{3}=\forall. Furthermore, we let 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k} be the union of all 𝖥𝖮𝖯(ϕ)\mathsf{FOP}_{\mathbb{Z}}(\phi) problems, where ϕ\phi consists of precisely kk quantifiers, regardless of their quantifier structure. For a quantifier Q{,}Q\in\{\exists,\forall\}, we write QkQ^{k} for the repetition QQk times\underbrace{Q\dots Q}_{k\text{ times}}. Finally, we remark that a small subset of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} has already been studied by An et al. [7], for a discussion see Section 1.3.

1.2 Our Contributions

We seek to determine completeness results for the class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. In particular: What are the largest fragments of this class for which 3-SUM (or more generally, kk-SUM) is complete? Is there a problem that is complete for the entire class?

Intuitively, we say that a TA(n)T_{A}(n)-time solvable problem AA is (fine-grained) complete for a T𝒞(n)T_{\mathcal{C}}(n)-time solvable class of problems 𝒞\mathcal{C}, if the existence of an O(TA(n)1ϵ)O(T_{A}(n)^{1-\epsilon})-time algorithm for AA with ϵ>0\epsilon>0 implies that for all problems CC in 𝒞\mathcal{C} there exists δ>0\delta>0 such that CC can be solved in time O(T𝒞(n)1δ)O(T_{\mathcal{C}}(n)^{1-\delta}). We extend this notion to completeness of a family of problems, since strictly speaking, any (geometric) problem over d\mathbb{Z}^{d} expressible in linear integer arithmetic corresponds to a family of formulas 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} (one for each dd\in\mathbb{N}). Formally, consider a family of problems 𝒫\mathcal{P} with an associated time bound T𝒫(n)T_{\mathcal{P}}(n) and a class of problems 𝒞\mathcal{C} with an associated time bound T𝒞(n)T_{\mathcal{C}}(n); usually T𝒫(n),T𝒞(n)T_{\mathcal{P}}(n),T_{\mathcal{C}}(n) denote the running time of the fastest known algorithm solving all problems in 𝒫\mathcal{P} or 𝒞\mathcal{C}, respectively (often, we omit these time bounds, as they are clear from context).555Here, we use family and class as a purely semantic and intuitive distinction: A family consists of a small set of similar problems, and a class consists of a large and diverse variety of problems. We say that 𝒫\mathcal{P} is (fine-grained) complete for 𝒞\mathcal{C}, if

  1. 1.

    the family 𝒫\mathcal{P} is a subset of the class 𝒞\mathcal{C}, and

  2. 2.

    if for all problems PP in 𝒫\mathcal{P} there exists ϵ>0\epsilon>0 such that PP can be solved in time O(T𝒫(n)1ϵ)O(T_{\mathcal{P}}(n)^{1-\epsilon}), then for all problems CC in 𝒞\mathcal{C} there exists some δ>0\delta>0 such that we can solve CC in time O(T𝒞(n)1δ)O(T_{\mathcal{C}}(n)^{1-\delta}).

That is, a polynomial-factor improvement for solving the problems in 𝒫\mathcal{P} would lead to a polynomial-factor improvement in solving all problems in 𝒞\mathcal{C}. If a singleton family 𝒫={P}\mathcal{P}=\{P\} is fine-grained complete for 𝒞\mathcal{C}, then we also say that PP is fine-grained complete for 𝒞\mathcal{C}. We work with standard hypotheses and problems encountered in fine-grained complexity; for detailed definitions of these, we refer to the full version of this article.

1.2.1 kk-SUM is complete for the existential fragment of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}

Consider first the existential fragment of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}, i.e., formulas exhibiting only existential quantifiers. Any 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formula with kk existential quantifiers can be decided using a standard meet-in-the-middle approach, augmented by orthogonal range search, in time O~(nk/2)\tilde{O}(n^{\lceil k/2\rceil})666We use the notation O~(T):=TlogO(1)(T)\tilde{O}(T):=T\log^{O(1)}(T) to hide polylogarithmic factors., see the full version of the paper for details. Since kk-SUM is a member of 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}), this running time is optimal up to subpolynomial factors, assuming the kk-SUM Hypothesis. As our first contribution, we provide a converse reduction. Specifically, we show that a polynomially improved kk-SUM algorithm would give a polynomially improved algorithm for solving the entire class. In our language, we show that kk-SUM is fine-grained complete for formulas of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} with kk existential quantifiers.

Theorem 1.1 (kk-SUM is 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k})-complete).

Let k3k\geq 3 and assume that kk-SUM can be solved in time TkSUM(n)T_{k\mathrm{SUM}}(n). For any problem PP in 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}), there exists some cc such that PP can be solved in time O(TkSUM(n)logcn)O(T_{k\mathrm{SUM}}(n)\log^{c}n).

Thus, if there are k3k\geq 3 and ϵ>0\epsilon>0 such that we can solve kk-SUM in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon}), then we can solve all problems in 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon^{\prime}}) for any 0<ϵ<ϵ0<\epsilon^{\prime}<\epsilon. By a simple negation argument, we conclude that kk-SUM is also complete for the class of problems 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\forall^{k}).

The above theorem generalizes and unifies previous reductions from problems expressible as 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formulas to 3-SUM, using different proof ideas: Jafargholi and Viola [39, Lemma 4] give a simple randomized linear-time reduction from triangle detection in sparse graphs to 3-SUM, and a derandomization via certain combinatorial designs. Dudek, Gawrychowski, and Starikovskaya [29] study the family of 3-linear degeneracy testing (3-LDT), which constitutes a large and interesting subset of 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists): This family includes, for any α1,α2,α3,t\alpha_{1},\alpha_{2},\alpha_{3},t\in\mathbb{Z}, the 3-partite formula a1A1a2A2a3A3:α1a1+α2a2+α3a3=t\exists a_{1}\in A_{1}\exists a_{2}\in A_{2}\exists a_{3}\in A_{3}:\alpha_{1}a_{1}+\alpha_{2}a_{2}+\alpha_{3}a_{3}=t and the 1-partite formula α1,α2,α3A:α1a1+α2a2+α3a3=ta1a2a2a3a1a3\exists\alpha_{1},\alpha_{2},\alpha_{3}\in A:\alpha_{1}a_{1}+\alpha_{2}a_{2}+\alpha_{3}a_{3}=t\wedge a_{1}\neq a_{2}\wedge a_{2}\neq a_{3}\wedge a_{1}\neq a_{3}. The authors show that each such formula is either trivial or subquadratic equivalent to 33-SUM. For 3-partite formulas, a reduction to 33-SUM is essentially straightforward. For 1-partite formulas, Dudek et al. [29] use color coding.777We remark that the reverse direction, i.e., 33-SUM-hardness of non-trivial formulas, is technically much more involved and can be regarded as the main technical contribution of [29].

As further examples for reductions from 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} problems to kk-SUM, we highlight a reduction from Vector kk-SUM to kk-SUM [5] as well as a reduction from (min,+)(\min,+)-convolution to 33-SUM (see [9, 27]) based on a well-known bit-level trick due to Vassilevska Williams and Williams [52], which allows us to reduce inequalities to equalities.

Perhaps surprisingly in light of its generality and applicability, Theorem 1.1 is obtained via a very simple, deterministic reduction that combines the tricks from [5, 52]. This generality comes at the cost of polylogarithmic factors (which we do not optimize), which depend on the number of inequalities occurring in the considered formula; for the details see Section 3 and the full version of the paper.

1.2.2 Completeness for counting witnesses

We provide a certain extension of the above completeness result to the problem class of counting witnesses to existential 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas888A witness for a 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formula a1A1akAk:φ\exists a_{1}\in A_{1}\dots\exists a_{k}\in A_{k}:\varphi with t1^,,tl^\hat{t_{1}},\dots,\hat{t_{l}}\in\mathbb{Z} is a tuple (a1,,ak)A1××Ak(a_{1},\dots,a_{k})\in A_{1}\times\cdots\times A_{k} that satisfies the formula φ[(t1,,tl)\(t1^,,tl^)]\varphi[(t_{1},\dots,t_{l})\backslash(\hat{t_{1}},\dots,\hat{t_{l}})].. Counting witnesses is an important task particularly in database applications (usually referred to as model counting). Furthermore, we will make use of witness counting to decide certain quantified formulas in subsequent results detailed below. In Section 4, we will obtain the following result.

Theorem 1.2.

Let k3k\geq 3 be odd. If there is ϵ>0\epsilon>0 such that we can count the number of witnesses for kk-SUM in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon}), then for all problem PP in 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}), there is some ϵ>0\epsilon^{\prime}>0 such that we can count the number of witnesses for PP in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon^{\prime}}).

Leveraging the recent breakthrough by [22] that 3-SUM is subquadratic equivalent to counting witnesses of 3-SUM, we obtain the corollary that 3-SUM is hard even for counting witnesses of 𝖥𝖮𝖯(3)\mathsf{FOP}_{\mathbb{Z}}(\exists^{3}).

Corollary 1.3.

For all problems PP in 𝖥𝖮𝖯(3)\mathsf{FOP}_{\mathbb{Z}}(\exists^{3}), there is some ϵP>0\epsilon_{P}>0 such that we can count the number of witnesses for PP in randomized time O(n2ϵP)O(n^{2-\epsilon_{P}}) if and only if there is some ϵ>0\epsilon^{\prime}>0 such that 33-SUM can be solved in randomized time O(n2ϵ)O(n^{2-\epsilon^{\prime}}).

1.2.3 Completeness for general quantifier structures of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}

In light of our first completeness result, one might wonder whether kk-SUM is complete for deciding all kk-quantifier formulas in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}, regardless of the quantifier structure of the formulas. Note that for these general quantifier structures, a baseline algorithm with running time O~(nk1)\tilde{O}(n^{k-1}) can be achieved with a combination of brute-force and orthogonal range queries; see the full version for details.

However, by [7, Theorem 15] there exists a 𝖥𝖮𝖯(k1)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k-1}\forall)-formula ϕ\phi that cannot be solved in time O(nk1ϵ)O(n^{k-1-\epsilon})-time unless the 3-uniform hyperclique hypothesis is false (see the discussion in Section 1.3). Thus, proving that 3-SUM is complete for all 3-quantifier formulas would establish that the 3-uniform hyperclique hypothesis implies the 33-SUM hypothesis – this would be a novel tight reduction among important problems/hypotheses in fine-grained complexity theory. For k4k\geq 4, it becomes even more intricate: the conditionally optimal running time of nk1±o(1)n^{k-1\pm o(1)} for 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}\forall) formulas exceeds the conditionally optimal running time of nk2±o(1)n^{\lceil\frac{k}{2}\rceil\pm o(1)} for 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formulas.

We are nevertheless able to obtain a completeness result for general quantifier structures: Specifically, we show that if two geometric problems over d\mathbb{Z}^{d} can be solved in time O(n2ϵd)O(n^{2-\epsilon_{d}}) where ϵd>0\epsilon_{d}>0 for all dd, then each kk-quantifier formula in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} can be decided in time O(nk1ϵ)O(n^{k-1-\epsilon}) for some ϵ>0\epsilon>0. These problems are (1) a variation of the Hausdorff distance that we call Hausdorff distance under nn Translations and (2) the Pareto Sum problem; the details are covered in Section 5.

Hausdorff Distance under nn Translations

Among the most common translation-invariant distance measures for given point sets BB and CC is the Hausdorff Distance under Translation [24, 18, 19, 23, 45, 38]. To define it, we denote the directed Hausdorff distance under the LL_{\infty} metric by δH(B,C):=maxbBmincCbc\delta_{\overrightarrow{H}}(B,C):=\max_{b\in B}\min_{c\in C}\|b-c\|_{\infty}.999Since we will exclusively consider the directed Hausdorff distance under Translation, we will drop “directed” throughout the paper. The Hausdorff distance under translation δHT(B,C)\delta_{\overrightarrow{H}}^{T}(B,C) is defined as the minimum Hausdorff distance of BB and an arbitrary translation of CC, i.e.,

δHT(B,C)minτdδH(B,C+{τ})=minτdmaxbBmincCb(c+τ).\delta_{\overrightarrow{H}}^{T}(B,C)\coloneqq\min_{\tau\in\mathbb{R}^{d}}\delta_{\overrightarrow{H}}(B,C+\{\tau\})=\min_{\tau\in\mathbb{R}^{d}}\max_{b\in B}\min_{c\in C}\|b-(c+\tau)\|_{\infty}.

For d=2d=2, Bringmann et al. [18] were able to show a (|B||C|)1o(1)(|B||C|)^{1-o(1)} time lower bound based on the orthogonal vector hypothesis, and there exists a matching O~(|B||C|)\tilde{O}(|B||C|) upper bound by Chew et al. [25].

We shall establish that restricting the translation vector to be among a set of mm candidate vectors yields a central problem in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. Specifically, we define the Hausdorff distance under Translation in AA, denoted as δHT(A)(B,C)\delta_{\overrightarrow{H}}^{T(A)}(B,C), by

δHT(A)(B,C)minτAδH(B,C+{τ})=minτAmaxbBmincCb(c+τ).\delta_{\overrightarrow{H}}^{T(A)}(B,C)\coloneqq\min_{\tau\in A}\delta_{\overrightarrow{H}}(B,C+\{\tau\})=\min_{\tau\in A}\max_{b\in B}\min_{c\in C}\|b-(c+\tau)\|_{\infty}.

Correspondingly, we define the problem Hausdorff distance under mm Translations as: Given A,B,CdA,B,C\subseteq\mathbb{Z}^{d} with |A|m|A|\leq m, |B|,|C|n|B|,|C|\leq n and a distance value γ\gamma\in\mathbb{N}, determine whether δHT(A)(B,C)γ\delta_{\overrightarrow{H}}^{T(A)}(B,C)\leq\gamma. Note that this can be rewritten as a 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists)-formula, see the full version of the paper for details.

The Hausdorff distance under mm Translations occurs naturally when approximating the Hausdorff distance under translation: Specifically, common algorithms compute a set AA of |A|=f(ϵ)|A|=f(\epsilon) translations such that δHT(A)(B,C)(1+ϵ)δHT(B,C)\delta_{\overrightarrow{H}}^{T(A)}(B,C)\leq(1+\epsilon)\delta_{\overrightarrow{H}}^{T}(B,C). Generally, this problem is then solved by performing |A||A| computations of the Hausdorff distance, which yields O~(|A|n)=O~(f(ϵ)n)\tilde{O}(|A|n)=\tilde{O}(f(\epsilon)n)-time algorithms [48]. Improving over the O~(mn)\tilde{O}(mn)-time baseline for Hausdorff Distance under mm Translations would thus lead to immediate improvements for approximating the Hausdorff Distance under Translation. Our results will establish additional consequences of fast algorithms for this problem: an O(n2ϵd)O(n^{2-\epsilon_{d}})-time algorithm with ϵd>0\epsilon_{d}>0 for Hausdorff distance under nn Translations would give an algorithmic improvement for the classes of 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists)- and 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\forall)-formulas.

Verification of Pareto Sums

Our second geometric problem is a verification version of computing Pareto sums: Given point sets A,BdA,B\subseteq\mathbb{Z}^{d}, the Pareto sum CC of A,BA,B is defined as the Pareto front of their sumset A+B={a+baA,bB}A+B=\{a+b\mid a\in A,b\in B\}. Put differently, the Pareto sum of A,BA,B is a set of points CC satisfying (1) CA+BC\subseteq A+B, (2) for every aAa\in A and bBb\in B, the vector a+ba+b is dominated101010We consider the usual domination notion: A vector udu\in\mathbb{Z}^{d} is dominated by some vector vdv\in\mathbb{Z}^{d} (written uvu\leq v) if and only if in all dimensions i[d]i\in[d] it holds that u[i]v[i]u[i]\leq v[i]. by some cCc\in C and (3) there are no distinct c,cCc,c^{\prime}\in C such that cc^{\prime} dominates cc. The task of computing Pareto sums appears in various multicriteria optimization settings [8, 47, 30, 44]; fast output-sensitive algorithms (both in theory and in practice) have recently been investigated by Hespe, Sanders, Storandt, and Truschel [37].

We consider the following problem as Pareto Sum Verification: Given A,B,CdA,B,C\subseteq\mathbb{Z}^{d}, determine whether

aAbBcC:a+bc.\displaystyle\forall a\in A\forall b\in B\exists c\in C:a+b\leq c.

The complexity of Pareto Sum Verification111111We remark that our problem definition only checks a single of the three given conditions, specifically, condition (2). However, in Section 7, we will establish that the verifying all three conditions reduces to verifying this single condition. More specifically, for sets A,B,CA,B,C of size at most nn, we obtain that if we can solve Pareto Sum Verification in time T(n)T(n), then we can check whether CC is the Pareto sum of A,BA,B in time O(T(n))O(T(n)). is tightly connected to output-sensitive algorithms for Pareto Sum. Specifically, solving Pareto Sum Verification reduces to computing the Pareto sum CC when given inputs A,BA,B of size at most nn with the promise that |C|=Θ(n)|C|=\Theta(n); see Section 7 for details. The work of Hespe et al. [37] gives a practically fast O(n2)O(n^{2})-time algorithm in this case for d=2d=2; note that for d3d\geq 3, we still obtain an O~(n2)\tilde{O}(n^{2})-time algorithm via our Baseline Algorithm, which is described in the full version of the paper.

A problem pair that is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}

As a pair, these two geometric problems turn out to be fine-grained complete for the class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}.

Theorem 1.4.

There is a function ϵ(d)>0\epsilon(d)>0 such that both of the following problems can be solved in time O(n2ϵ(d))O(n^{2-\epsilon(d)})

  • Pareto Sum Verification,

  • Hausdorff distance under nn Translations,

if and only if for each problem PP in 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k} with k3k\geq 3 there exists an ϵP>0\epsilon_{P}>0 such that PP can be solved in time O(nk1ϵP)O(n^{k-1-\epsilon_{P}}).

The above theorem shows that a single pair of natural problems captures the fine-grained complexity of the expressive and diverse class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. As an illustration just how expressive this class is, we observe the following barriers:121212The first three statements follow from 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} generalizing the class PTOPTO studied in [7], see Section 1.3. The remaining statements rely on the additive structure of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}.

  1. 1.

    If there is some ϵ>0\epsilon>0 such that all problems in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\forall) (or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists)) can be solved in time O(n2ϵ)O(n^{2-\epsilon}), then OVH (and thus SETH) is false [7, Theorem 16].

  2. 2.

    If there is some ϵ>0\epsilon>0 such that all problems in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists) (or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\forall)) can be solved in time O(n2ϵ)O(n^{2-\epsilon}), then the Hitting Set Hypothesis is false [7, Theorem 12].

  3. 3.

    If for all problems PP in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\forall) (or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists)), there exists some ϵ>0\epsilon>0 such that we can solve PP in O(n2ϵ)O(n^{2-\epsilon}), then the 3-uniform Hyperclique Hypothesis is false [7, Theorem 15].

  4. 4.

    If for all problems PP in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists) (𝖥𝖮𝖯(),𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\forall),\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists), or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\forall)), there exists some ϵ>0\epsilon>0 such that we can solve PP in time O(n2ϵ)O(n^{2-\epsilon}), then the 33-SUM Hypothesis is false (Theorem 1.1 with Lemma 2.4).

Theorem 1.4 raises the question whether for any constant dimension dd, the Hausdorff distance under nn Translations admits a subquadratic reduction to Pareto Sum Verification. A positive answer would establish Pareto Sum Verification as complete for the entire class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. We elaborate on this in Section 8.

1.2.4 33-SUM is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas of low inequality dimension

Returning to our motivating question, we ask: Since it appears unlikely to prove completeness of 3-SUM for all 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas (as this requires a tight 3-uniform hyperclique lower bound for 3-SUM), can we at least identify a large fragment of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} for which 33-SUM is complete? In particular, can we extend our first result of Theorem 1.1 from existentially quantified formulas to substantially different problems in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}, displaying other quantifier structures?

Surprisingly, we are able to show that 33-SUM is complete for low-dimensional 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas, independent of their quantifier structure. To formalize this, we introduce the inequality dimension of a 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formula as the smallest number of linear inequalities required to model it. More formally, consider a 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formula ϕ=Q1x1A1,,QkxkAk:ψ\phi=Q_{1}x_{1}\in A_{1},\dots,Q_{k}x_{k}\in A_{k}:\psi with AidiA_{i}\subseteq\mathbb{Z}^{d_{i}}. The inequality dimension of ϕ\phi is the smallest number ss such that there exists a Boolean function ψ:{0,1}s{0,1}\psi^{\prime}:\{0,1\}^{s}\to\{0,1\} and (strict or non-strict) linear inequalities L1,,LsL_{1},\dots,L_{s} in the variables {xi[j]:i{1,,k},j{1,,di}}\{x_{i}[j]:i\in\{1,\dots,k\},j\in\{1,\dots,d_{i}\}\} and the free variables such that ψ(x1,,xk)\psi(x_{1},\dots,x_{k}) is equivalent to ψ(L1,,Ls)\psi^{\prime}(L_{1},\dots,L_{s}). As an example, the 3-SUM formula aAbBcC:a+b=c\exists a\in A\exists b\in B\exists c\in C:a+b=c has inequality dimension 2, as a+b=ca+b=c can be modelled as conjunction of the two linear inequalities a+bca+b\leq c and a+bca+b\geq c, whereas no single linear inequality can model a+b=ca+b=c.

We show that 33-SUM is fine-grained complete for model-checking 𝖥𝖮𝖯3\mathsf{FOP}^{3}_{\mathbb{Z}} formulas with inequality dimension at most 33. This result is our perhaps most interesting technical contribution and intuitively combines our result that 33-SUM is hard for counting 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} witnesses (Corollary 1.3) with a geometric argument, specifically, that the union of nn unit cubes in 3\mathbb{R}^{3} can be decomposed into the union of O(n)O(n) pairwise interior- and exterior-disjoint axis-parallel boxes. To this end, we extend a result from [23], which constructs pairwise interior-disjoint axis-parallel boxes, to also achieve exterior-disjointness. For more details, see the Technical Overview below and Section 6.

Theorem 1.5.

There is an algorithm deciding 33-SUM in randomized time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0, if and only if for each problem PP in 𝖥𝖮𝖯k\mathsf{FOP}^{k}_{\mathbb{Z}} with k3k\geq 3 and inequality dimension at most 33, there exists some ϵ>0\epsilon>0 such that we can solve PP in randomized time O(nk1ϵ)O(n^{k-1-\epsilon}).

Note that this fragment of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} contains a variety of interesting problems. A general example is given by comparisons of sets defined using the sumset arithmetic131313The sumset arithmetic uses the sumset operator X+YX+Y to denote the sumset {x+yxX,yY}\{x+y\mid x\in X,y\in Y\} and λX\lambda X to denote {λxxX}\{\lambda x\mid x\in X\}., which correspond to formulas of inequality dimension at most 2: E.g., checking, given sets A,B,CA,B,C\subseteq\mathbb{Z} and tt\in\mathbb{Z}, whether CC is an additive tt-approximation of the sumset A+BA+B is equivalent to verifying the conjunction of the 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists) problem141414Note that the corresponding formula is aAbBcC:(ca+b)(a+bc+t)\forall a\in A\forall b\in B\exists c\in C:(c\leq a+b)\wedge(a+b\leq c+t), which clearly has inequality dimension at most 2. A+BC+{0,,t}A+B\subseteq C+\{0,\dots,t\} and (2) the 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists) problem151515Note that the corresponding formula is cCaAbB:a+b=c\forall c\in C\exists a\in A\exists b\in B:a+b=c, which clearly has inequality dimension at most 2. CA+BC\subseteq A+B. Likewise, this extends to λ\lambda-multiplicative approximations of sumsets. Furthermore, the problems corresponding to general sumset comparisons like α1A1++αiAiαi+1Ai+1++αkAk+{,,u}\alpha_{1}A_{1}+\cdots+\alpha_{i}A_{i}\subseteq\alpha_{i+1}A_{i+1}+\cdots+\alpha_{k}A_{k}+\{-\ell,\dots,u\} have inequality dimension at most 22 as well.

Our results of Theorems 1.4 and 1.5 suggests to view Pareto Sum Verification as a geometric, high-dimensional generalization of 33-SUM. Furthermore, it remains an interesting problem to establish the highest dd such that 3-SUM is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas of inequality dimension at most dd; for a discussion see Section 8.

Further Applications

As an immediate application of our first completeness theorem, we obtain a simple proof of a n4/3o(1)n^{4/3-o(1)} lower bound for the 4-SUM problem based on the the 33-uniform hyperclique hypothesis; see the full version of the paper for details. Specifically, by Theorem 1.1, it suffices to model the 3-uniform 4-hyperclique problem as a problem in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists\exists). The resulting conditional lower bound is implicitly known in the literature, as it can alternatively be obtained by combining a 3-uniform hyperclique lower bound for 44-cycle given in [43] with a folklore reduction from 44-cycle to 44-SUM (see [39] for a deterministic reduction from 33-cycle to 33-SUM).

Theorem 1.6.

If there is some ϵ>0\epsilon>0 such that 4-SUM can be solved in time O(n43ϵ)O(n^{\frac{4}{3}-\epsilon}), then the 3-uniform hyperclique hypothesis fails.

Similarly, we can also give a simple proof for a known lower bound for 33-SUM.

Another application of our results is to establish class-based conditional bounds. As a case in point, consider the problem of computing the Pareto sum of A,BdA,B\subseteq\mathbb{Z}^{d}: Clearly, this problem can be solved in time O~(n2)\tilde{O}(n^{2}) by explicitly computing the sumset A+BA+B and computing the Pareto front using any algorithm running in near-linear time in its input, e.g. [34]. We prove the following conditional optimality results already in the case when the desired output (the Pareto sum of A,BA,B) has size Θ(n)\Theta(n).

Theorem 1.7 (Pareto Sum Computation Lower Bound).

The following conditional lower bounds hold for output-sensitive Pareto sum computation:

  1. 1.

    If there is ϵ>0\epsilon>0 such that we can compute the Pareto sum CC of A,B2A,B\subseteq\mathbb{Z}^{2}, whenever CC is of size Θ(n)\Theta(n), in time O(n2ϵ)O(n^{2-\epsilon}), then the 33-SUM hypothesis fails (thus, for any 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k} formula ϕ\phi of inequality dimension at most 3, there is ϵ>0\epsilon^{\prime}>0 such that ϕ\phi can be decided in time O(nk1ϵ)O(n^{k-1-\epsilon^{\prime}})).

  2. 2.

    If for all d2d\geq 2, there is ϵ>0\epsilon>0 such that we can compute the Pareto sum CC of A,BdA,B\subseteq\mathbb{Z}^{d}, whenever CC is of size Θ(n)\Theta(n), in time O(n2ϵ)O(n^{2-\epsilon}), then there is some ϵ>0\epsilon^{\prime}>0 such that we can decide all 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas with kk quantifiers not ending in \exists\forall\exists or \forall\exists\forall in time O(nk1ϵ)O(n^{k-1-\epsilon^{\prime}}).

Our lower bound for 2D strengthens a quadratic-time lower bound found by Funke et al. [33] based on the (min,+)(\min,+)-convolution hypothesis to hold already under the weaker (i.e., more believable) 33-SUM hypothesis. For higher dimensions, we furthermore strengthen the conditional lower bound via its connection to 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}.

We conclude with remaining open questions in Section 8.

1.3 Further Related Work

To our knowledge, the first investigation of the connection between classes of model-checking problems and central problems in fine-grained complexity was given by Williams [49], who shows that the kk-clique problem is complete for the class of existentially-quantified first order graph properties, among other results. As important follow-up work, Gao et al. [36] establish OV as complete problem for model-checking any first-order property.

Subsequent results include classification results for k\exists^{k}\forall-quantified first-order graph properties [14], fine-grained upper and lower bounds for counting witnesses of first-order properties [28], completeness theorems for multidimensional ordering properties [7] (discussed below), completeness and classification results for optimization classes [12, 13] as well as an investigation of sparsity for monochromatic graph properties [32].

We remark that An et al. [7] study completeness results for a strict subset of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas: Specifically, they introduce a class 𝖯𝖳𝖮k,d\mathsf{PTO}_{k,d} of kk-quantifier first-order sentences over inputs d\mathbb{N}^{d} (or, without loss of generality {1,,n}d\{1,\dots,n\}^{d}) that may only use comparisons of coordinates (and constants). Note that such sentences lack additive structure, and indeed the fine-grained complexity differs decisively from 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}: E.g., for 𝖯𝖳𝖮()\mathsf{PTO}(\exists\exists\exists) formulas, they establish the sparse triangle detection problem as complete, establishing a conditionally tight running time of m2ω/(ω+1)±o(1)m^{2\omega/(\omega+1)\pm o(1)}. This is in stark contrast to 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists) formulas, for which we establish 33-SUM as complete problem, yielding a conditionally optimal running time of n2±o(1)n^{2\pm o(1)}. In particular, for each 3-quantifier structure Q1Q2Q3Q_{1}Q_{2}Q_{3}, a O(n2ϵ)O(n^{2-\epsilon})-time algorithm for all 𝖥𝖮𝖯(Q1Q2Q3)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}Q_{2}Q_{3}) problems would break a corresponding hardness barrier161616Specifically, an O(n2ϵ)O(n^{2-\epsilon}) time algorithm for problems in 𝖥𝖮𝖯(),𝖥𝖮𝖯(),𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists),\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\forall),\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists), or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\forall) with ϵ>0\epsilon>0 would refute the 33-SUM hypothesis. Furthermore, an O(n2ϵ)O(n^{2-\epsilon}) time algorithm for problems in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists), 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\forall), 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists), or 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\forall) with ϵ>0\epsilon>0 would immediately yield an improvement for the MaxConv lower bound problem [27]; for details see the full version of the paper..

Since any 𝖯𝖳𝖮k,d\mathsf{PTO}_{k,d} formula is also a 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formula with the same quantifier structure, any hardness result in [7] for 𝖯𝖳𝖮(Q1,,Qk)\mathsf{PTO}(Q_{1},\dots,Q_{k}) carries over to 𝖥𝖮𝖯(Q1,,Qk)\mathsf{FOP}_{\mathbb{Z}}(Q_{1},\dots,Q_{k}). On the other hand, any of our algorithmic results for 𝖥𝖮𝖯(Q1,,Qk)\mathsf{FOP}_{\mathbb{Z}}(Q_{1},\dots,Q_{k}) transfers to its subclass 𝖯𝖳𝖮(Q1,,Qk)\mathsf{PTO}(Q_{1},\dots,Q_{k}).

2 Technical Overview

In this section, we sketch the main ideas behind our proofs.

Completeness of kk-SUM for 𝖥𝖮𝖯(k)\mathsf{FOP_{\mathbb{Z}}}(\exists^{k})

With the right ingredients, proving that kk-SUM is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas with kk existential quantifiers (Theorem 1.1) is possible via a simple approach: We observe that any 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formula ϕ\phi can be rewritten such that we may assume that ϕ\phi is a conjunction of mm inequalities. We then use a slight generalization of a bit-level trick of [52] to reduce each inequality to an equality, incurring only O(logn)O(\log n) overhead per inequality (intuitively, we need to guess the most significant bit position at which the left-hand side and the right-hand side differ). Thus, we obtain O(logmn)O(\log^{m}n) conjunctions of mm equalities; each such conjunction can be regarded as an instance of Vector kk-SUM. Using a straightforward approach for reducing Vector kk-SUM to kk-SUM given in [5], the reduction to kk-SUM follows. We give all details in Section 3 and the full version of the paper.

Counting witnesses and handling multisets

While the reduction underlying Theorem 1.1 preserves the existence of solutions, it fails to preserve the number of solutions. The challenge is that when applying the bit-level trick to reduce inequalities to equalities, we need to make sure that for each witness of a 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formula ϕ\phi, there is a unique witness in the kk-SUM instances produced by the reduction. While it is straightforward to ensure that we do not produce multiple witnesses, the subtle issue arises that distinct witnesses for ϕ\phi may be mapped to the same witness in the kk-SUM instances. It turns out that it suffices to solve a multiset version of #kk-SUM, i.e., to count all witnesses in a kk-SUM instance in which each input number may occur multiple times.

Thus, to obtain Theorem 1.2, we show a fine-grained equivalence of Multiset #kk-SUM and #kk-SUM, for all odd k3k\geq 3. This fine-grained equivalence, which we prove via a heavy-light approach, might be of independent interest.171717We remark that it is plausible that the proof of the subquadratic equivalence of 33-SUM and #33-SUM due to Chan et al. [22] could be extended to establish subquadratic equivalence with Multiset #33-SUM as well. Note, however, that a fine-grained equivalence of #kk-SUM and kk-SUM is not known for any k4k\geq 4. Combining this equivalence with an inclusion-exclusion argument, we may thus lift Theorem 1.1 to a counting version for all odd k3k\geq 3.

In the reductions below, we will make crucial use of the immediate corollary of Theorem 1.2 and [22] that for each 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists) formula ϕ\phi, there exists a subquadratic reduction from counting witnesses for ϕ\phi to 33-SUM (Corollary 1.3).

On general quantifier structures

We perform a systematic study on the different quantifier structures for k=3k=3. Due to simple negation arguments, we only have to perform a systematic study on the classes of problems 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists), 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists), 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists), 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists).

First, we state a simple lemma establishing syntactic complete problems for the classes above.

Lemma 2.1 (Syntactic Complete problems (Informal Version)).

Let Q1,Q2{,}Q_{1},Q_{2}\in\{\exists,\forall\}. We can reduce every formula of the class 𝖥𝖮𝖯(Q1Q2)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}Q_{2}\exists) to the formula

Q1a1~A1~Q2a2~A2~a3~A3~:a1~+a2~a3~.Q_{1}\tilde{a_{1}}\in\tilde{A_{1}}Q_{2}\tilde{a_{2}}\in\tilde{A_{2}}\exists\tilde{a_{3}}\in\tilde{A_{3}}:\tilde{a_{1}}+\tilde{a_{2}}\leq\tilde{a_{3}}.
On the quantifier change 𝖥𝖮𝖯()𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists)\to\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists).

We rely on the subquadratic equivalence between 33-SUM and a functional version of 33-SUM called All-ints 33-SUM, which asks to determine for every aAa\in A whether there is a solution involving aa. A randomized subquadratic equivalence was given in [51], which can be turned deterministic [42].

This equivalence allows us to use the bit-level trick to turn inequalities to equalities, despite it seemingly not interacting well with the quantifier structure \forall\exists\exists at first sight. This results in a proof of the following hardness result.

Lemma 2.2.

If 33-SUM can be solved in time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0, then all problems PP of 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists) can be solved in time O(n2ϵP)O(n^{2-\epsilon_{P}}) for an ϵP>0\epsilon_{P}>0.

On the quantifier change 𝖥𝖮𝖯()𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists)\to\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists).

As a first result for the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists), we are able to show equivalence to 33-SUM for a specific problem in this class, thus introducing a 33-SUM equivalent problem with a different quantifier structure in comparison to 33-SUM. Specifically, we consider the problem of verifying additive tt-approximation of sumsets. We are able to precisely characterize the fine-grained complexity depending on tt.

Formally, we show the following theorem.

Theorem 2.3.

Consider the Additive Sumset Approximation problem of deciding, given A,B,C,tA,B,C\subseteq\mathbb{Z},t\in\mathbb{Z}, whether

A+BC+{0,,t}.A+B\subseteq C+\{0,\dots,t\}.

This problem is

  • solvable in time O(n2δ)O(n^{2-\delta}) with δ>0\delta>0, whenever t=O(n1ϵ)t=O(n^{1-\epsilon}) for any ϵ>0,\epsilon>0,

  • not solvable in time O(n2ϵ)O(n^{2-\epsilon}), whenever t=Ω(n)t=\Omega(n) assuming the Strong 33-SUM hypothesis.

Furthermore, subquadratic hardness holds under the standard 3-SUM Hypothesis if no restriction on tt is made.

The above theorem is essentially enabling a quantifier change transforming the \exists\exists\exists quantifier structure for which 33-SUM is complete into a subquadratic equivalent problem with a quantifier structure \forall\forall\exists. Moreover, the 33-SUM hardness is a witness to the hardness of the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists).

Let us remark a few interesting aspects: The algorithmic part follows from sparse convolution techniques going back to Cole and Hariharan [26], see [16] for a recent account and also [20, 17, 15]. Specifically, whenever t=O(n1ϵ)t=O(n^{1-\epsilon}), it holds that |C+{0,,t}|=O(n2ϵ)|C+\{0,\dots,t\}|=O(n^{2-\epsilon}) and intuitively, we can use an output-sensitive convolution algorithm to compute A+BA+B and compare it to C+{0,,t}C+\{0,\dots,t\}.181818The argument is slightly more subtle, since we need to avoid computing A+BA+B if its size exceeds O(n2ϵ)O(n^{2-\epsilon}). Our result indicates that an explicit construction of C+{0,,t}C+\{0,\dots,t\} is required, since once it may get as large as Ω(n2)\Omega(n^{2}), we obtain a n2o(1)n^{2-o(1)}-time lower bound assuming the Strong 33-SUM Hypothesis.

The lower bound follows from describing the 33-SUM problem alternatively as (A+B)C(A+B)\cap C\neq\emptyset, which is equivalent to the negation of (A+B)C¯(A+B)\subseteq\bar{C}, where C¯\bar{C} denotes the complement of CC. Thus, we aim to cover the complement of CC by intervals of length tt. While this appears impossible for 33-SUM, we employ the subquadratic equivalence of 33-SUM and its convolutional version due to Patrascu [46]. This problem will deliver us the necessary structure to represent this complement with the addition of few auxilliary points.

The reverse reduction from Additive Sumset Approximation to 33-SUM follows from Theorem 1.5 (as Additive Sumset Approximation has inequality dimension 22).

On completeness results for 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k}

The above ingredients establish our completeness theorems by exhaustive search over remaining quantifiers. Specifically, by a combination of Theorem 2.3, which shows that Additive Sumset Approximation is 33-SUM hard, and a combination of Lemma 2.2 and Theorem 1.1, we get:

Lemma 2.4.

There is a function ϵ(d)>0\epsilon(d)>0 such that the Verification of Pareto Sum problem can be solved in time O(n2ϵ(d))O(n^{2-\epsilon(d)}) if and only if all problems PP in the classes

  • 𝖥𝖮𝖯(Q1Qk3)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\exists\exists\exists),𝖥𝖮𝖯(Q1Qk3),\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\forall\forall\forall),

  • 𝖥𝖮𝖯(Q1Qk3)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\forall\exists\exists),𝖥𝖮𝖯(Q1Qk3),\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\exists\forall\forall),

  • 𝖥𝖮𝖯(Q1Qk3)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\forall\forall\exists),𝖥𝖮𝖯(Q1Qk3),\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\exists\exists\forall),

where Q1,Qk3{,}Q_{1},\dots Q_{k-3}\in\{\exists,\forall\} and k3k\geq 3, can be solved in time O(nk1ϵP)O(n^{k-1-\epsilon_{P}}) for an ϵP>0\epsilon_{P}>0.

Similarly, for quantifier structures ending in \exists\forall\exists and \forall\exists\forall, we obtain the following completeness result.

Lemma 2.5.

There is a function ϵ(d)>0\epsilon(d)>0 such that the Hausdorff Distance under nn Translations problem can be solved in time O(n2ϵ(d))O(n^{2-\epsilon(d)}) if and only if all problems PP in the classes

  • 𝖥𝖮𝖯(Q1Qk3),𝖥𝖮𝖯(Q1Qk3),\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\exists\forall\exists),\mathsf{FOP}_{\mathbb{Z}}(Q_{1}\dots Q_{k-3}\forall\exists\forall),

where Q1,Qk3{,}Q_{1},\dots Q_{k-3}\in\{\exists,\forall\} and k3k\geq 3, can be solved in time O(nk1ϵP)O(n^{k-1-\epsilon_{P}}) for an ϵP>0\epsilon_{P}>0.

The combination of Lemma 2.4 and Lemma 2.5, thus suffice to prove Theorem 1.4.

The 33-SUM completeness of formulas with inequality dimension at most 33

As a first idea, one could try to solve problems of different quantifier structures by just counting witnesses. Consider in the following the example 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists).

Assume we are promised that the formula aAbBcCψ(a,b,c)\forall a\in A\forall b\in B\exists c\in C\psi(a,b,c) satisfies a kind of disjointness property, specifically that for every (a,b)A×B(a,b)\in A\times B there exists at most one cCc\in C such that ψ(a,b,c)\psi(a,b,c). Then satisfying the formula boils down to checking whether the number of witnesses (a,b,c)(a,b,c) satisfiying ψ(a,b,c)\psi(a,b,c) equals to |A||B||A|\cdot|B|.

To create this disjointness effect, we use the following geometric approach: We show that one can re-interpret the formula as aAbB:a+bcCV(c)\forall a\in A\forall b\in B:a+b\in\bigcup_{c^{\prime}\in C^{\prime}}V(c^{\prime}), where A,B,C3A,B,C^{\prime}\subseteq\mathbb{Z}^{3}, CC^{\prime} is a set of size O(n)O(n) and V(c)V(c^{\prime}) is an orthant associated to cc^{\prime}. Using an adapted variant of [23], we decompose this union of orthants in 3\mathbb{R}^{3} (which we may equivalently view as sufficiently large congruent cubes) into a set \mathcal{R} of O(n)O(n) disjoint boxes. Thus, it remains to notice that the resulting problem – i.e., for all aA,bBa\in A,b\in B is there a box RR\in\mathcal{R} such that a+ba+b is contained in RR – is a 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists) formula with the desired disjointness property, which can be handled as argued above. For the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists), we perform a slightly more involved argument. The classes 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists) and 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists) reduce to 33-SUM regardless of the inequality dimension due to Theorem 1.1 and Lemma 2.2.

3 kk-SUM is complete for existential 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas

We begin with a simple completeness theorem that kk-SUM is complete for the class of problems 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}). Since kk-SUM is indeed a 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k})-formula, it remains to show a fine-grained reduction from any 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formula to kk-SUM. The proofs in this section are deferred to the full version. As a first step towards this Theorem, we consider how to reduce a conjunction of mm linear inequalities to a vector kk-SUM instance.

Lemma 3.1.

Consider vectors a1{U,,U}d1,,ak{U,,U}dk,a_{1}\in\{-U,\dots,U\}^{d_{1}},\dots,a_{k}\in\{-U,\dots,U\}^{d_{k}}, integers S1,,Sm{U,,U},S_{1},\dots,S_{m}\in\{-U,\dots,U\}, for each i{1,,m},j{1,,k}i\in\{1,\dots,m\},j\in\{1,\dots,k\}, vectors ci,jdjc_{i,j}\in\mathbb{Z}^{d_{j}}, and a formula

ψ:=i=1m(j=1kci,jTajSi).\psi:=\bigwedge_{i=1}^{m}\left(\sum_{j=1}^{k}c_{i,j}^{T}a_{j}\geq S_{i}\right).

There exist O(1)O(1) time computable functions f1,ψ,,fk,ψ,g,ψ,Wf_{1}^{\ell,\psi},\dots,f_{k}^{\ell,\psi},g^{\ell,\psi,W} such that the following statements are equivalent

  1. 1.

    The formula i=1m(j=1kci,jTajSi)\bigwedge_{i=1}^{m}\left(\sum_{j=1}^{k}c_{i,j}^{T}a_{j}\geq S_{i}\right) holds.

  2. 2.

    There are {1,,log2(M)}m,W{1,,k}m\ell\in\{1,\dots,\lceil\log_{2}(M)\rceil\}^{m},W\in\{1,\dots,k\}^{m} such that f1,ψ(a1)++fk,ψ(ak)=g,ψ,W(S1,,Sm).f^{\ell,\psi}_{1}(a_{1})+\dots+f^{\ell,\psi}_{k}(a_{k})=g^{\ell,\psi,W}(S_{1},\dots,S_{m}).

Moreover, if the second item holds, there is a unique choice of such \ell and WW.

Essentially the above lemma enables a reduction from a conjunction of inequality checks to a conjunction of equality checks. We can now continue with our completeness theorem. See 1.1 Ater applying Lemma 3.1, it remains to reduce a conjunction of equality checks to kk-SUM. To do so, we interpret the conjunction of equalities as a Vector kk-SUM problem, which can be reduced to kk-SUM in a straightforward way [5].

4 On counting witnesses in 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}

In this section, we show reductions from counting witnesses of 𝖥𝖮𝖯(k)\mathsf{FOP}_{\mathbb{Z}}(\exists^{k}) formulas to #k\#k-SUM, specifically, we prove Theorem 1.2. To do so, we adapt the proof of Theorem 1.1 given in Section 3 to a counting version. As discussed in Section 2, this requires us to work with a multiset version of #kk-SUM. Handling multisets is thus the main challenge addressed in this section. Formally, we say that a multiset is a set AA together with a function f:Af:A\to\mathbb{N}. For aAa\in A, we abbreviate na:=f(a)n_{a}:=f(a) as the multiplicity of aa. To measure multiset sizes, we still think of each aa to have nan_{a} copies in the input, i.e. the size of AA is aAna\sum_{a\in A}n_{a}. Almost all proofs in this section are deferred to the full version of the paper.

Definition 4.1 ((U,d)(U,d)-vector Multiset #k\#k-SUM).

Let X:={U,,U}dX:=\{-U,\dots,U\}^{d}. Given kk multisets A1,,AkXA_{1},\dots,A_{k}\subseteq X and tXt\in X, we ask for the total number of kk-SUM witnesses, that is

a1++ak=t,a1A1,,akAki=1knai.\displaystyle\sum_{\begin{subarray}{c}a_{1}+\dots+a_{k}=t,\\ a_{1}\in A_{1},\dots,a_{k}\in A_{k}\end{subarray}}\prod_{i=1}^{k}n_{a_{i}}.

Furthermore, define Multiset #k\#k-SUM as (U,1)(U,1)-vector Multiset #k\#k-SUM and MM-multiplicity #k\#k-SUM as Multiset #k\#k-SUM with the additional restriction that the multiplicity of each element is limited, that is for all aA1Ak:naMa\in A_{1}\cup\dots\cup A_{k}:n_{a}\leq M holds. Lastly, #k\#k-SUM is defined as 11-Multiplicity #k\#k-SUM and (U,d)(U,d)-vector #k\#k-SUM is (U,d)(U,d)-vector Multiset #k\#k-SUM where for all aA1Ak:na=1a\in A_{1}\cup\dots\cup A_{k}:n_{a}=1 holds.

For the case of 𝖥𝖮𝖯3\mathsf{FOP}_{\mathbb{Z}}^{3} we will also introduce the #\#All-ints version of the above problems, which asks to determine, for each a1A1a_{1}\in A_{1}, the number of witnesses involving a1a_{1}.

The (deferred) proof of the following lemma is analogous to the proof of Abboud et al. [5] to reduce Vector kk-SUM to kk-SUM.

Lemma 4.2 ((U,d)(U,d)-vector Multiset #k\#k-SUM k/2\leq_{\lceil k/2\rceil} Multiset #k\#k-SUM).

If Multiset #k\#k-SUM can be solved in time T(n)T(n) then (U,d)(U,d)-vector Multiset #k\#k-SUM can be solved in time O(ndlog(U)+T(n)).O(nd\log(U)+T(n)).

Next, we give a simple approach to solve Multiset #k\#k-SUM when all multiplicities are comparably small.

Lemma 4.3 (MM-multiplicity #k\#k-SUMk/2\leq_{\lceil k/2\rceil} #k\#k-SUM).

If #k\#k-SUM can be solved in time T(n)T(n), then MM-multiplicity #k\#k-SUM can be solved in time O~(T(nMk1))\tilde{O}(T(nM^{k-1})).

For later purposes, we will need the following version of the above lemma. {observation} If #\#All-ints 33-SUM can be solved in time T(n)T(n), then we can solve #\#All-ints MM-multiplicity 33-SUM in time O~(T(nM2)).\tilde{O}(T(nM^{2})).

We can finally prove the main result of this section.

Lemma 4.4.

For odd k3k\geq 3, if there exists an algorithm for the #k\#k-SUM problem running in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon}) for an ϵ>0\epsilon>0, then there exists an algorithm for the Multiset #\#k-SUM problem running in time O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon^{\prime}}) for an ϵ>0\epsilon^{\prime}>0.

Proof 4.5.

We proceed with a heavy-light approach. Assume there exists an O(nk/2ϵ)O(n^{\lceil k/2\rceil-\epsilon}) algorithm for the #k\#k-SUM problem. Set c:=(k1)(k/2)c:=(k-1)(\lceil k/2\rceil). Firstly, we count the number of solutions (a1,,ak)A1××Ak(a_{1},\dots,a_{k})\in A_{1}\times\dots\times A_{k}, where na1,,naknϵ/cn_{a_{1}},\dots,n_{a_{k}}\leq n^{\epsilon/c} using Lemma 4.3. This takes time

O~((n(nϵ/c)k1)k/2ϵ)\displaystyle\tilde{O}\left((n\cdot(n^{\epsilon/c})^{k-1})^{\lceil k/2\rceil-\epsilon}\right) =O~((n1+ϵk/2)k/2ϵ)\displaystyle=\tilde{O}\left(\left(n^{1+\frac{\epsilon}{\lceil k/2\rceil}}\right)^{\lceil k/2\rceil-\epsilon}\right)
=O~(nk/2ϵ+ϵϵ2k/2)\displaystyle=\tilde{O}\left(n^{{\lceil k/2\rceil-\epsilon}+\epsilon-\frac{\epsilon^{2}}{\lceil k/2\rceil}}\right)
=O(nk/2ϵ),\displaystyle=O\left(n^{\lceil k/2\rceil-\epsilon^{\prime}}\right),

where ϵ>0\epsilon^{\prime}>0. It remains to calculate the number of witnesses (a1,,ak)(a_{1},\dots,a_{k}), where for at least one i{1,,k}i\in\{1,\dots,k\}, we have high multiplicity, meaning nai>nϵ/cn_{a_{i}}>n^{\epsilon/c} holds. Consider the case that a1A1a_{1}\in A_{1} is a high-multiplicity number (the case where aiAia_{i}\in A_{i} with i1i\neq 1 is a high-multiplicity number is analogous). For each high-multiplicity number a1a_{1} in A1A_{1} we do the following. Solve the (k1)(k-1)-SUM instance with sets A2,,AkA_{2},\dots,A_{k} and target ta1t-a_{1}. There are at most n1(ϵ/c)n^{1-(\epsilon/c)} many high-multiplicity numbers in A1A_{1}, and solving the (k1)(k-1)-SUM instance takes time O(n(k1)/2)O(n^{(k-1)/2}), since kk is odd. We get a total runtime of

n1ϵcO~(n(k1)/2)\displaystyle n^{1-\frac{\epsilon}{c}}\cdot\tilde{O}(n^{(k-1)/2}) =O~(n1(ϵ/c)+(k1)/2)\displaystyle=\tilde{O}(n^{1-(\epsilon/c)+(k-1)/2})
=O~(n(k+1)/2(ϵ/c))\displaystyle=\tilde{O}(n^{(k+1)/2-(\epsilon/c)})
=O(nk/2ϵ′′),\displaystyle=O(n^{\lceil k/2\rceil-\epsilon^{\prime\prime}}),

where ϵ′′>0\epsilon^{\prime\prime}>0, which concludes the proof.

See 1.2

By combining the subquadratic equivalence between 33-SUM and #3\#3-SUM due to Chan et al. [22] and the above theorem, we obtain the following corollary. See 1.3

The above proof can also be adapted for the special case k=3k=3 to count for each a1A1a_{1}\in A_{1} the number of witnesses involving a1a_{1}, by plugging in the appropriate All-ints versions; see the full version of the paper for details. Together with the equivalence between #\#All-ints 33-SUM and 33-SUM of Chan et al. [22], we get

Corollary 4.6.

For all problems PP in 𝖥𝖮𝖯(3)\mathsf{FOP}_{\mathbb{Z}}(\exists^{3}), we are able to count for each a1A1a_{1}\in A_{1} the number of witnesses involving a1a_{1} in randomized time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0, if 33-SUM can be solved in randomized time O(n2ϵ)O(n^{2-\epsilon^{\prime}}) for an ϵ>0\epsilon^{\prime}>0.

5 Completeness Theorems for General Quantifier Structures

As Theorem 1.1 establishes 33-SUM as the complete problem for the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists), we would like to similarly explore complete problems for other quantifier structures. All proofs in this section are deferred to the full version. Let us recall our main geometric problems.

Definition 5.1 (Verification of dd-dimensional Pareto Sum).

Given sets A,B,CdA,B,C\subseteq\mathbb{Z}^{d}. Does the set CC dominate A+BA+B, that is does for all aA,bBa\in A,b\in B exist a cCc\in C, with ca+bc\geq a+b ?

It is easy to see that Verification of dd-dimensional Pareto Sum is in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists).

Definition 5.2 (Hausdorff Distance under nn Translations).

Given sets A,B,CdA,B,C\subseteq\mathbb{Z}^{d} with at most nn elements and a γ\gamma\in\mathbb{N}, the Hausdorff distance under nn Translations problem asks whether the following holds:

δHT(A)(B,C)minτAδH(B,C+{τ})=minτAmaxbBmincCb(c+τ)γ.\delta_{\overrightarrow{H}}^{T(A)}(B,C)\coloneqq\min_{\tau\in A}\delta_{\overrightarrow{H}}(B,C+\{\tau\})=\min_{\tau\in A}\max_{b\in B}\min_{c\in C}\|b-(c+\tau)\|_{\infty}\leq\gamma.

We show the following result firstly, which allows us to assume without loss of generality a certain normal form.

Lemma 5.3.

A general 𝖥𝖮𝖯(Q1Q2)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}Q_{2}\exists) formula, with input set A1d1,A2d2,A3d3A_{1}\subseteq\mathbb{Z}^{d_{1}},A_{2}\subseteq\mathbb{Z}^{d_{2}},A_{3}\subseteq\mathbb{Z}^{d_{3}}, where |A1|=|A2|=|A3|=n|A_{1}|=|A_{2}|=|A_{3}|=n, can be reduced to the 𝖥𝖮𝖯(Q1Q2)\mathsf{FOP}_{\mathbb{Z}}(Q_{1}Q_{2}\exists) formula

Q1a1A1Q2a2A2a3A3:a1+a2a3Q_{1}a_{1}^{\prime}\in A_{1}^{\prime}Q_{2}a_{2}^{\prime}\in A_{2}^{\prime}\exists a_{3}^{\prime}\in A_{3}^{\prime}:a_{1}^{\prime}+a_{2}^{\prime}\leq a_{3}^{\prime}

in time O(n)O(n), where |A1|=|A2|=n|A_{1}^{\prime}|=|A_{2}^{\prime}|=n and |A3|=O(n)|A_{3}^{\prime}|=O(n).

The above lemma immediately gives us complete syntactic problems for our classes. It remains to establish connections between the different quantifier structure classes, and explore natural variants of the syntactic problems.

The syntactic complete problem for the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists) turns out to be equivalent to Hausdorff Distance under nn Translations. We obtain:

Lemma 5.4 (Hausdorff Distance under nn Translations is complete for 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists)).

There is a function ϵ(d)>0\epsilon(d)>0 such that Hausdorff Distance under nn Translations can be solved in time O(n2ϵ(d))O(n^{2-\epsilon(d)}) if and only if all problems PP in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists) can be solved in time O(n2ϵP)O(n^{2-\epsilon_{P}}) for an ϵP>0\epsilon_{P}>0.

Similarly, the Verification of Pareto Sum problem is complete for the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists).

Lemma 5.5 (Verification of Pareto Sum is complete for 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists)).

There is a function ϵ(d)>0\epsilon(d)>0 such that Verification of Pareto Sum can be solved in time O(n2ϵ(d))O(n^{2-\epsilon(d)}) if and only if all problems PP in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists) can be solved in time O(n2ϵP)O(n^{2-\epsilon_{P}}) for an ϵP>0\epsilon_{P}>0.

5.1 𝖥𝖮𝖯()𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists)\to\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists)

We continue with handling the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists). By simply making use of Corollary 4.6, one can easily prove that 33-SUM is hard for the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\exists\exists). We can also show a deterministic proof, as Corollary 4.6 makes use of the subquadratic equivalence between 33-SUM and #\#All-ints 33-SUM, which relies on randomization techniques.

See 2.2

5.2 𝖥𝖮𝖯()𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists)\to\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists)

We explore the connection between the problem Additive Sumset Approximation, which is a member of the class 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists), and the 33-SUM problem. The following theorem will play a key role to enable the discovery of the relationship between 33-SUM and other quantifier structures. See 2.3 The proof is deferred to the full version of the paper.

5.3 Completeness results for the class 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k}

We turn to combining the above insights to establish (a pair of) complete problems for the class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}. The proofs in this section are deferred to the full version of the paper.

See 2.4

See 2.5 We finally obtain our completeness theorem for the whole class 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k}. See 1.4

Essentially, these two problems capture the complexity of the class 𝖥𝖮𝖯3\mathsf{FOP}_{\mathbb{Z}}^{3} and can be seen as the most important problems in 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k}.

6 The 33-SUM problem is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas with Inequality Dimension at most 3

In this section, we show that 33-SUM problem captures an interesting subclass of 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas with arbitrary quantifier structure, namely the formulas of sufficiently small inequality dimension. Let us recall the notion of inequality dimension.

Definition 6.1 (Inequality Dimension of a Formula).

Let ϕ=Q1x1A1,,QkxkAk:ψ\phi=Q_{1}x_{1}\in A_{1},\dots,Q_{k}x_{k}\in A_{k}:\psi be a 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formula with AidiA_{i}\subseteq\mathbb{Z}^{d_{i}}.

The inequality dimension of ϕ\phi is the smallest number ss such that there exists a Boolean function ψ:{0,1}s{0,1}\psi^{\prime}:\{0,1\}^{s}\to\{0,1\} and (strict or non-strict) linear inequalities L1,,LsL_{1},\dots,L_{s} in the variables {xi[j]:i{1,,k},j{1,,di}}\{x_{i}[j]:i\in\{1,\dots,k\},j\in\{1,\dots,d_{i}\}\} and the free variables such that ψ(x1,,xk)\psi(x_{1},\dots,x_{k}) is equivalent to ψ(L1,,Ls)\psi^{\prime}(L_{1},\dots,L_{s}).

In the following, we look at the class of problems 𝖥𝖮𝖯k\mathsf{FOP}_{\mathbb{Z}}^{k} with the restriction of inequality dimension at most 33. We use the following naming convention for boxes.

Definition 6.2.

A dd-box in d\mathbb{R}^{d} is the cartesian product of dd proper intervals s1××sds_{1}\times\dots\times s_{d}, where sis_{i} is an open, closed or half-open interval. We call a cartesian product of only closed intervals a closed box and a cartesian product of only open intervals an open box.

Given a set RR of nn closed boxes (represented as 2d2d integer coordinates), and dd-dimensional points aA,bBa\in A,b\in B, we can express in 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\exists\exists) whether a+ba+b lies in one of the boxes as follows:

aAbBrR:i=1dr[i]a[i]+b[i]a[i]+b[i]r[d+i].\exists a\in A\exists b\in B\exists r\in R:\bigwedge_{i=1}^{d}r[i]\leq a[i]+b[i]\land a[i]+b[i]\leq r[d+i].

In fact, we are not limited to closed boxes, if a box is open or half open in a dimension, one can adjust the inequalities in this dimension appropriately.

In order to prove our main theorem in this section, we need to partition the union of nn unit cubes in 3\mathbb{R}^{3} into pairwise interior- and exterior-disjoint boxes. While Chew et al. [23] studied such a decomposition of unit cubes with the requirement of only interior-disjoint boxes, we need an extension of their result to guarantee disjoint exteriors.

Lemma 6.3 (Disjoint decomposition of the union of cubes in 3\mathbb{R}^{3}).

Let 𝒞\mathcal{C} be a set of nn axis-aligned congruent cubes in 3\mathbb{R}^{3}. The union of these cubes, can be decomposed into O(n)O(n) boxes whose interiors and exteriors are disjoint in time O(nlog2n)O(n\log^{2}n).

The proof is deferred to the full version.

Theorem 6.4.

There is an algorithm deciding 33-SUM in randomized time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0 if and only if for each problem PP in the classes 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\forall\forall\exists) and 𝖥𝖮𝖯()\mathsf{FOP}_{\mathbb{Z}}(\exists\forall\exists) of inequality dimension at most 33 there exists some ϵ>0\epsilon^{\prime}>0 such that we can solve PP in randomized time O(n2ϵ)O(n^{2-\epsilon^{\prime}}).

Proof 6.5.

For the first direction due to Theorem 2.3, we can reduce 33-SUM to an instance of Additive Sumset Approximation,

aAbBcC:ca+ba+bc+t,\forall a\in A\forall b\in B\exists c\in C:c\leq a+b\land a+b\leq c+t,

which has inequality dimension 2. Let us continue with the other direction. Let ϕ:=Q1aAbBcC:φ\phi:=Q_{1}a\in A\forall b\in B\exists c\in C:\varphi, where Q1{,}Q_{1}\in\{\exists,\forall\} and φ\varphi is a quantifier free linear arithmetic formula with inequality dimension 33. Let L1:=α1Ta+β1Tbγ1Tc+S1L_{1}:=\alpha_{1}^{T}a+\beta_{1}^{T}b\leq\gamma_{1}^{T}c+S_{1}, L2:=α2Ta+β2Tbγ2Tc+S2L_{2}:=\alpha_{2}^{T}a+\beta_{2}^{T}b\leq\gamma_{2}^{T}c+S_{2} and L3:=α3Ta+β3Tbγ3Tc+S3L_{3}:=\alpha_{3}^{T}a+\beta_{3}^{T}b\leq\gamma_{3}^{T}c+S_{3} after replacing the free variables. Assume that the formula φ\varphi is given in DNF, thus each co-clause has at most 33 atoms, chosen from L1,L2,L3L_{1},L_{2},L_{3} and their negations. Let

A:={(α1Taα2Taα3Ta):aA},B:={(β1Tbβ2Tbβ3Tb):bB},C:={(γ1Tc+S1γ2Tc+S2γ3Tc+S3):cC}\displaystyle A^{\prime}:=\left\{\left(\begin{array}[]{cc}\alpha_{1}^{T}a\\ \alpha_{2}^{T}a\\ \alpha_{3}^{T}a\end{array}\right):a\in A\right\},B^{\prime}:=\left\{\left(\begin{array}[]{cc}\beta_{1}^{T}b\\ \beta_{2}^{T}b\\ \beta_{3}^{T}b\end{array}\right):b\in B\right\},C^{\prime}:=\left\{\left(\begin{array}[]{cc}\gamma_{1}^{T}c+S_{1}\\ \gamma_{2}^{T}c+S_{2}\\ \gamma_{3}^{T}c+S_{3}\end{array}\right):c\in C\right\}

Thus each co-clause consists of conjunctions of a subset of the following set

{\displaystyle\{ a[0]+b[0]c[0],a[0]+b[0]c[0]+1,a[1]+b[1]c[1],\displaystyle a^{\prime}[0]+b^{\prime}[0]\leq c^{\prime}[0],a^{\prime}[0]+b^{\prime}[0]\geq c^{\prime}[0]+1,a^{\prime}[1]+b^{\prime}[1]\leq c^{\prime}[1],
a[1]+b[1]c[1]+1,a[2]+b[2]c[2],a[2]+b[2]c[2]+1}.\displaystyle a^{\prime}[1]+b^{\prime}[1]\geq c^{\prime}[1]+1,a^{\prime}[2]+b^{\prime}[2]\leq c^{\prime}[2],a^{\prime}[2]+b^{\prime}[2]\geq c^{\prime}[2]+1\}.

Let the co-clauses of φ\varphi be V1,,VhV_{1},\dots,V_{h}. Thus, we aim to decide a formula of the form:

Q1aAbBcC:i=1hViQ_{1}a^{\prime}\in A^{\prime}\forall b^{\prime}\in B^{\prime}\exists c^{\prime}\in C^{\prime}:\bigvee_{i=1}^{h}V_{i} (2)

For each co-clause ViV_{i}, i{1,,h}i\in\{1,\dots,h\} it holds that ViV_{i} is of the form

kViKLkjViJ¬Lj,\bigwedge_{k\in V_{i}^{K}}L_{k}\land\bigwedge_{j\in V_{i}^{J}}\lnot L_{j},

for some ViJ,ViK{1,2,3}V_{i}^{J},V_{i}^{K}\subseteq\{1,2,3\} and ViJViK=V_{i}^{J}\cap V_{i}^{K}=\emptyset.

Let us consider for each fixed cCc^{\prime}\in C^{\prime} the following possibly empty orthant in 3\mathbb{R}^{3}.

𝒮(Vi,c):={x3:kViKx[k]c[k]jViJx[j]c[j]+1}.\mathcal{S}(V_{i},c^{\prime}):=\{x\in\mathbb{R}^{3}:\bigwedge_{k\in V_{i}^{K}}x[k]\leq c^{\prime}[k]\land\bigwedge_{j\in V_{i}^{J}}x[j]\geq c^{\prime}[j]+1\}.

By construction, it is immediate that for a fixed cc^{\prime} and (a,b)A×B(a^{\prime},b^{\prime})\in A^{\prime}\times B^{\prime} that (a,b,c)(a^{\prime},b^{\prime},c^{\prime}) fulfill the co-clause ViV_{i} if and only if a+b𝒮(Vi,c)a^{\prime}+b^{\prime}\in\mathcal{S}(V_{i},c^{\prime}). Thus, equivalently to (2), we ask

Q1aAbBcC:i=1h(a+bS(Vi,c)).Q_{1}a^{\prime}\in A^{\prime}\forall b^{\prime}\in B^{\prime}\exists c^{\prime}\in C^{\prime}:\bigvee_{i=1}^{h}\left(a^{\prime}+b^{\prime}\in S(V_{i},c^{\prime})\right).

Having a closer look, i=1h(a+bS(Vi,c))\bigvee_{i=1}^{h}\left(a^{\prime}+b^{\prime}\in S(V_{i},c^{\prime})\right) is true if and only if a+ba^{\prime}+b^{\prime} lies in one of the orthants S(Vi,c)S(V_{i},c^{\prime}).

We argue that we may represent the orthant 𝒮(Vi,c)\mathcal{S}(V_{i},c^{\prime}) as an appropriately chosen cube in 3\mathbb{R}^{3}. To this end, let M:=2max{a1+b1+c1:aA,bB,cC}M:=2\cdot\max\{\|a\|_{1}+\|b\|_{1}+\|c\|_{1}:a^{\prime}\in A^{\prime},b^{\prime}\in B^{\prime},c^{\prime}\in C^{\prime}\} be a sufficiently large number. We can interpret 𝒮(Vi,c)\mathcal{S}(V_{i},c^{\prime}) as a cube of the type 𝒞i,c=[m0,m0]×[m1,m1]×[m2,m2]\mathcal{C}_{i,c^{\prime}}=[m_{0},m^{\prime}_{0}]\times[m_{1},m_{1}^{\prime}]\times[m_{2},m^{\prime}_{2}], where for u{0,1,2}u\in\{0,1,2\}, we define:

mu:={MuViK,uViJ,2M+c[u]uViK,c[u]+1uViJ,mu:={MuViK,uViJ,c[u]uViK,2M+c[u]+1uViJ.\displaystyle m_{u}:=\begin{cases}-M&u\not\in V_{i}^{K},u\not\in V_{i}^{J},\\ -2M+c[u]&u\in V_{i}^{K},\\ c[u]+1&u\in V_{i}^{J},\end{cases}\quad m_{u}^{\prime}:=\begin{cases}M&u\not\in V_{i}^{K},u\not\in V_{i}^{J},\\ c[u]&u\in V_{i}^{K},\\ 2M+c[u]+1&u\in V_{i}^{J}.\end{cases}

The cubes are axis-aligned and have side length 2M2M. Due to the large size of the cube we get for fixed cCc^{\prime}\in C^{\prime} that a+b𝒮(Vi,c)a^{\prime}+b^{\prime}\in\mathcal{S}(V_{i},c^{\prime}) if and only if a+ba^{\prime}+b^{\prime} lies inside the cube 𝒞i,c\mathcal{C}_{i,c^{\prime}}.

By Lemma 6.3, we can decompose the collection of cubes 𝒞i,c\mathcal{C}_{i,c^{\prime}} for i{1,,H},cCi\in\{1,\dots,H\},c^{\prime}\in C^{\prime} into l=O(n)l=O(n) disjoint boxes :={R1,,Rl}\mathcal{R}:=\{R_{1},\dots,R_{l}\} in time O(nlog2n)O(n\log^{2}n). Let us now go through a case distinction based on the first quantifier.

  • If Q1=Q_{1}=\forall, equivalent to ϕ\phi we ask

    aAbBi{1,,l}:a+b lies in Ri.\forall a^{\prime}\in A^{\prime}\forall b^{\prime}\in B^{\prime}\exists i\in\{1,\dots,l\}:a^{\prime}+b^{\prime}\text{ lies in }R_{i}.

    By replacing each i{1,,l}i\in\{1,\dots,l\} by a 6-tuple denoting the dimensions of the box RiR_{i}, we can reduce counting the number of (a,b,Ri)(a^{\prime},b^{\prime},R_{i}) with a+bRia^{\prime}+b^{\prime}\in R_{i} to 3-SUM using Corollary 1.3. Due to the disjointness of the boxes RiR_{i}, we know that no (a,b)(a^{\prime},b^{\prime}) can be in different boxes Ri,RiR_{i},R_{i^{\prime}} with iii\neq i^{\prime}.

    Thus, we can decide our original question by checking whether the number of such witnesses equals |A||B||A^{\prime}|\cdot|B^{\prime}|, concluding the fine-grained reduction to 3-SUM.

  • Assume now that Q1=Q_{1}=\exists. Thus, equivalently to ϕ\phi, we ask.

    aAbBi{1,,l}:a+b lies in Ri.\exists a^{\prime}\in A^{\prime}\forall b^{\prime}\in B^{\prime}\exists i\in\{1,\dots,l\}:a^{\prime}+b^{\prime}\text{ lies in }R_{i}.

    We can now make use of Corollary 4.6. Count for each aAa^{\prime}\in A^{\prime} the number of witnesses (a,b,Ri)(a^{\prime},b^{\prime},R_{i}) with a+bRa^{\prime}+b^{\prime}\in R^{\prime}. We claim that it remains to check whether there is some aa^{\prime} that is involved in |B||B^{\prime}| witnesses. To see this, note that due to the disjointness of the RiR_{i}’s, for any aAa^{\prime}\in A^{\prime} we have that the number of (b,Ri)(b^{\prime},R_{i}) with a+bRia^{\prime}+b^{\prime}\in R_{i} is equal to the number of bb^{\prime} such that there exists RiR_{i} with a+bRia^{\prime}+b^{\prime}\in R_{i}. Again, the desired reduction to 3-SUM follows.

We remark that, by [11], we know that the complexity of the union of orthants in d\mathbb{R}^{d} has worst case complexity O(nd/2)O(n^{\lfloor d/2\rfloor}). Thus, the above proof does not seem directly generalizable to inequality dimensions larger than 3. We can extend Theorem 6.4 to kk-quantifiers by the following theorem.

See 1.5

The above theorem gives us immediate reductions to 33-SUM for many seemingly unrelated problems of different quantifier structures and semantics.

For instance, as a direct application of the above theorem we can conclude the equivalence of the Additive Sumset Approximation problem to 33-SUM, together with Theorem 2.3.

Lemma 6.6 (Additive Sumset Approximation 2\leq_{2} 33-SUM).

If the 33-SUM problem can be solved in randomized time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0 then Additive Sumset Approximation problem can be solved in randomized time O(n2ϵ)O(n^{2-\epsilon^{\prime}}) for an ϵ>0\epsilon^{\prime}>0.

7 Application: A lower bound on the computation of Pareto Sums

In the following, we explore how the 33-SUM hardness of Verification of Pareto Sum translates to a hardness result for the problem of computing Pareto Sums. Let us first justify the naming of the Verification of Pareto Sum problem, by showing it to be subquadratic equivalent to the more natural extended version of Verification of Pareto Sum. Throughout this section, we consider dimensions d2d\geq 2.

Definition 7.1 (Verification of Pareto Sum (Extended version)).

Given sets A,B,CdA,B,C\subseteq\mathbb{Z}^{d}, do the following properties hold simultaneously:

  • (Inclusion): CA+BC\subseteq A+B,

  • (Dominance): CC dominates A+BA+B. More formally, for every aA,bBa\in A,b\in B there exists cCc\in C with ca+bc\geq a+b.

  • (Minimality): There are no c,cCc,c^{\prime}\in C with ccc\neq c^{\prime} and ccc\leq c^{\prime}.

We make use of the following lemma and its construction for the results in this section.

Lemma 7.2.

Given sets A,B,CdA,B,C\subseteq\mathbb{Z}^{d} of size at most nn, one can construct sets A~,B~,C~d\tilde{A},\tilde{B},\tilde{C}\subseteq\mathbb{Z}^{d} of size Θ(n)\Theta(n) in time O~(n)\tilde{O}(n) such that (1) A~,B~,C~\tilde{A},\tilde{B},\tilde{C} always satisfy the minimality and inclusion condition and (2) A~,B~,C~\tilde{A},\tilde{B},\tilde{C} fulfill the dominance condition if and only if A,B,CA,B,C fulfill the dominance condition.

Due to space constraints, the proof had to be deferred to the full version. Using this construction, it is not difficult to obtain the following equivalence.

Lemma 7.3.

There is an O(n2ϵ)O(n^{2-\epsilon}) time algorithm for an ϵ>0\epsilon>0 for Verification of Pareto Sum (Extended Version) if and only if there is an O(n2ϵ)O(n^{2-\epsilon^{\prime}}) time algorithm for an ϵ>0\epsilon^{\prime}>0 for Verification of Pareto Sum.

The proof can be found in the full version. Thus, for subquadratic reductions, we can restrict ourselves to the Verification of Pareto Sum problem, which essentially only checks the dominance condition.

Let us now consider the natural problem of computing the Pareto Sum.

Definition 7.4 (Pareto Sum).

Given sets A,BdA,B\subseteq\mathbb{Z}^{d}, compute a set CC\subseteq\mathbb{Z}, such that A,B,CA,B,C satisfy the Inclusion, Dominance and Minimality condition.

In the following, we argue why the lower bounds to Verification of Pareto Sum translate to lower bounds to Computation of the Pareto Sum. Formally, we prove:

Lemma 7.5.

If there is an algorithm to compute the Pareto Sum CC of sets A,BdA,B\subseteq\mathbb{Z}^{d} in time O(n2ϵ)O(n^{2-\epsilon}) for an ϵ>0\epsilon>0 even when C=Θ(n)C=\Theta(n), then one can also decide Verification of Pareto Sum of sets A,B,CA,B,C in time O(n2ϵ)O(n^{2-\epsilon^{\prime}}) for an ϵ>0\epsilon^{\prime}>0.

We conclude this section with our resulting hardness results for computing Pareto Sums. See 1.7

8 Future Work

While we exhibit a pair of problems that is complete for the class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}, one could still ask whether there is a subquadratic reduction from Hausdorff distance under nn Translations to Verification of Pareto Sum. As a result there would be a single complete problem (or rather the canonical multidimensional family of a single geometric problem) for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}.

Is Verification of Pareto Sum complete for the class 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}}?

Interestingly, previous completeness theorems [36] were able to establish a problem of quantifier structure \forall\forall\exists (the Orthogonal Vectors problem) as complete by making use of a technique in [51] that was originally used to show subcubic equivalence between All-Pairs Negative Triangle and Negative Triangle. However, a major problem we encounter is that while the third quantifier in the Orthogonal Vectors problem ranges over a sparse (intuitively: subpolynomially sized) domain (i.e., the dimensions of the vectors), the third quantifier in Pareto Sum Verification ranges over a linearly sized domain (i.e., the set CC).

Finally, we ask if our 33-SUM completeness result for arbitrary quantifier structures can be improved upon.

Can we establish a d>3d>3 such that 33-SUM is complete for 𝖥𝖮𝖯\mathsf{FOP}_{\mathbb{Z}} formulas of inequality dimension at most dd?

References

  • [1] Amir Abboud, Arturs Backurs, Karl Bringmann, and Marvin Künnemann. Fine-grained complexity of analyzing compressed data: Quantifying improvements over decompress-and-solve. In Chris Umans, editor, 58th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2017, Berkeley, CA, USA, October 15-17, 2017, pages 192–203. IEEE Computer Society, 2017. doi:10.1109/FOCS.2017.26.
  • [2] Amir Abboud, Arturs Backurs, Karl Bringmann, and Marvin Künnemann. Impossibility results for grammar-compressed linear algebra. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020. URL: https://proceedings.neurips.cc/paper/2020/hash/645e6bfdd05d1a69c5e47b20f0a91d46-Abstract.html.
  • [3] Amir Abboud, Karl Bringmann, Seri Khoury, and Or Zamir. Hardness of approximation in p via short cycle removal: cycle detection, distance oracles, and beyond. In Stefano Leonardi and Anupam Gupta, editors, STOC ’22: 54th Annual ACM SIGACT Symposium on Theory of Computing, Rome, Italy, June 20 - 24, 2022, pages 1487–1500. ACM, 2022. doi:10.1145/3519935.3520066.
  • [4] Amir Abboud and Kevin Lewi. Exact weight subgraphs and the k-sum conjecture. In Fedor V. Fomin, Rusins Freivalds, Marta Z. Kwiatkowska, and David Peleg, editors, Automata, Languages, and Programming - 40th International Colloquium, ICALP 2013, Riga, Latvia, July 8-12, 2013, Proceedings, Part I, volume 7965 of Lecture Notes in Computer Science, pages 1–12. Springer, 2013. doi:10.1007/978-3-642-39206-1\_1.
  • [5] Amir Abboud, Kevin Lewi, and Ryan Williams. Losing weight by gaining edges. In Andreas S. Schulz and Dorothea Wagner, editors, Algorithms - ESA 2014 - 22th Annual European Symposium, Wroclaw, Poland, September 8-10, 2014. Proceedings, volume 8737 of Lecture Notes in Computer Science, pages 1–12. Springer, 2014. doi:10.1007/978-3-662-44777-2\_1.
  • [6] Amir Abboud and Virginia Vassilevska Williams. Popular conjectures imply strong lower bounds for dynamic problems. In 55th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2014, Philadelphia, PA, USA, October 18-21, 2014, pages 434–443. IEEE Computer Society, 2014. doi:10.1109/FOCS.2014.53.
  • [7] Haozhe An, Mohit Gurumukhani, Russell Impagliazzo, Michael Jaber, Marvin Künnemann, and Maria Paula Parga Nina. The fine-grained complexity of multi-dimensional ordering properties. Algorithmica, 84(11):3156–3191, 2022. URL: https://doi.org/10.1007/s00453-022-01014-x, doi:10.1007/S00453-022-01014-X.
  • [8] Christian Artigues, Marie-José Huguet, Fallou Gueye, Frédéric Schettini, and Laurent Dezou. State-based accelerations and bidirectional search for bi-objective multi-modal shortest paths. Transportation Research Part C: Emerging Technologies, 27:233–259, 2013.
  • [9] Arturs Backurs, Piotr Indyk, and Ludwig Schmidt. Better approximations for tree sparsity in nearly-linear time. In Philip N. Klein, editor, Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2017, Barcelona, Spain, Hotel Porta Fira, January 16-19, pages 2215–2229. SIAM, 2017. doi:10.1137/1.9781611974782.145.
  • [10] Stephen A Bloch, Jonathan F Buss, and Judy Goldsmith. How hard are n 2-hard problems? ACM SIGACT News, 25(2):83–85, 1994.
  • [11] Jean-Daniel Boissonnat, Micha Sharir, Boaz Tagansky, and Mariette Yvinec. Voronoi diagrams in higher dimensions under certain polyhedral distance functions. Discret. Comput. Geom., 19(4):485–519, 1998. doi:10.1007/PL00009366.
  • [12] Karl Bringmann, Alejandro Cassis, Nick Fischer, and Marvin Künnemann. Fine-grained completeness for optimization in P. In Mary Wootters and Laura Sanità, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2021, August 16-18, 2021, University of Washington, Seattle, Washington, USA (Virtual Conference), volume 207 of LIPIcs, pages 9:1–9:22. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.APPROX/RANDOM.2021.9, doi:10.4230/LIPICS.APPROX/RANDOM.2021.9.
  • [13] Karl Bringmann, Alejandro Cassis, Nick Fischer, and Marvin Künnemann. A structural investigation of the approximability of polynomial-time problems. In Mikolaj Bojanczyk, Emanuela Merelli, and David P. Woodruff, editors, 49th International Colloquium on Automata, Languages, and Programming, ICALP 2022, July 4-8, 2022, Paris, France, volume 229 of LIPIcs, pages 30:1–30:20. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022. URL: https://doi.org/10.4230/LIPIcs.ICALP.2022.30, doi:10.4230/LIPICS.ICALP.2022.30.
  • [14] Karl Bringmann, Nick Fischer, and Marvin Künnemann. A fine-grained analogue of schaefer’s theorem in P: dichotomy of exists^k-forall-quantified first-order graph properties. In Amir Shpilka, editor, 34th Computational Complexity Conference, CCC 2019, July 18-20, 2019, New Brunswick, NJ, USA, volume 137 of LIPIcs, pages 31:1–31:27. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2019. URL: https://doi.org/10.4230/LIPIcs.CCC.2019.31, doi:10.4230/LIPICS.CCC.2019.31.
  • [15] Karl Bringmann, Nick Fischer, and Vasileios Nakos. Sparse nonnegative convolution is equivalent to dense nonnegative convolution. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 1711–1724. ACM, 2021. doi:10.1145/3406325.3451090.
  • [16] Karl Bringmann, Nick Fischer, and Vasileios Nakos. Deterministic and las vegas algorithms for sparse nonnegative convolution. In Joseph (Seffi) Naor and Niv Buchbinder, editors, Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, Virtual Conference / Alexandria, VA, USA, January 9 - 12, 2022, pages 3069–3090. SIAM, 2022. doi:10.1137/1.9781611977073.119.
  • [17] Karl Bringmann and Vasileios Nakos. Fast n-fold boolean convolution via additive combinatorics. In Nikhil Bansal, Emanuela Merelli, and James Worrell, editors, 48th International Colloquium on Automata, Languages, and Programming, ICALP 2021, July 12-16, 2021, Glasgow, Scotland (Virtual Conference), volume 198 of LIPIcs, pages 41:1–41:17. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.ICALP.2021.41, doi:10.4230/LIPICS.ICALP.2021.41.
  • [18] Karl Bringmann and André Nusser. Translating hausdorff is hard: Fine-grained lower bounds for hausdorff distance under translation. In Kevin Buchin and Éric Colin de Verdière, editors, 37th International Symposium on Computational Geometry, SoCG 2021, June 7-11, 2021, Buffalo, NY, USA (Virtual Conference), volume 189 of LIPIcs, pages 18:1–18:17. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.SoCG.2021.18, doi:10.4230/LIPICS.SOCG.2021.18.
  • [19] Timothy M. Chan. Minimum l_\infty hausdorff distance of point sets under translation: Generalizing klee’s measure problem. In Erin W. Chambers and Joachim Gudmundsson, editors, 39th International Symposium on Computational Geometry, SoCG 2023, June 12-15, 2023, Dallas, Texas, USA, volume 258 of LIPIcs, pages 24:1–24:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2023. URL: https://doi.org/10.4230/LIPIcs.SoCG.2023.24, doi:10.4230/LIPICS.SOCG.2023.24.
  • [20] Timothy M. Chan and Moshe Lewenstein. Clustered integer 3sum via additive combinatorics. In Rocco A. Servedio and Ronitt Rubinfeld, editors, Proceedings of the Forty-Seventh Annual ACM on Symposium on Theory of Computing, STOC 2015, Portland, OR, USA, June 14-17, 2015, pages 31–40. ACM, 2015. doi:10.1145/2746539.2746568.
  • [21] Timothy M. Chan, Virginia Vassilevska Williams, and Yinzhan Xu. Hardness for triangle problems under even more believable hypotheses: reductions from real apsp, real 3sum, and OV. In Stefano Leonardi and Anupam Gupta, editors, STOC ’22: 54th Annual ACM SIGACT Symposium on Theory of Computing, Rome, Italy, June 20 - 24, 2022, pages 1501–1514. ACM, 2022. doi:10.1145/3519935.3520032.
  • [22] Timothy M. Chan, Virginia Vassilevska Williams, and Yinzhan Xu. Fredman’s trick meets dominance product: Fine-grained complexity of unweighted apsp, 3sum counting, and more. In Barna Saha and Rocco A. Servedio, editors, Proceedings of the 55th Annual ACM Symposium on Theory of Computing, STOC 2023, Orlando, FL, USA, June 20-23, 2023, pages 419–432. ACM, 2023. doi:10.1145/3564246.3585237.
  • [23] L. Paul Chew, Dorit Dor, Alon Efrat, and Klara Kedem. Geometric pattern matching in d -dimensional space. Discret. Comput. Geom., 21(2):257–274, 1999. doi:10.1007/PL00009420.
  • [24] L Paul Chew and Klara Kedem. Improvements on geometric pattern matching problems. In Algorithm Theory—SWAT’92: Third Scandinavian Workshop on Algorithm Theory Helsinki, Finland, July 8–10, 1992 Proceedings 3, pages 318–325. Springer, 1992.
  • [25] L. Paul Chew and Klara Kedem. Improvements on geometric pattern matching problems. In Otto Nurmi and Esko Ukkonen, editors, Algorithm Theory - SWAT ’92, Third Scandinavian Workshop on Algorithm Theory, Helsinki, Finland, July 8-10, 1992, Proceedings, volume 621 of Lecture Notes in Computer Science, pages 318–325. Springer, 1992. doi:10.1007/3-540-55706-7\_28.
  • [26] Richard Cole and Ramesh Hariharan. Verifying candidate matches in sparse and wildcard matching. In John H. Reif, editor, Proceedings on 34th Annual ACM Symposium on Theory of Computing, May 19-21, 2002, Montréal, Québec, Canada, pages 592–601. ACM, 2002. doi:10.1145/509907.509992.
  • [27] Marek Cygan, Marcin Mucha, Karol Wegrzycki, and Michal Wlodarczyk. On problems equivalent to (min, +)-convolution. ACM Trans. Algorithms, 15(1):14:1–14:25, 2019. doi:10.1145/3293465.
  • [28] Holger Dell, Marc Roth, and Philip Wellnitz. Counting answers to existential questions. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 113:1–113:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2019. URL: https://doi.org/10.4230/LIPIcs.ICALP.2019.113, doi:10.4230/LIPICS.ICALP.2019.113.
  • [29] Bartlomiej Dudek, Pawel Gawrychowski, and Tatiana Starikovskaya. All non-trivial variants of 3-ldt are equivalent. In Konstantin Makarychev, Yury Makarychev, Madhur Tulsiani, Gautam Kamath, and Julia Chuzhoy, editors, Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, Chicago, IL, USA, June 22-26, 2020, pages 974–981. ACM, 2020. doi:10.1145/3357713.3384275.
  • [30] Matthias Ehrgott and Xavier Gandibleux. A survey and annotated bibliography of multiobjective combinatorial optimization. OR-spektrum, 22:425–460, 2000.
  • [31] Jeff Erickson. New lower bounds for convex hull problems in odd dimensions. SIAM J. Comput., 28(4):1198–1214, 1999. doi:10.1137/S0097539797315410.
  • [32] Nick Fischer, Marvin Künnemann, and Mirza Redzic. The effect of sparsity on k-dominating set and related first-order graph properties. In David P. Woodruff, editor, Proceedings of the 2024 ACM-SIAM Symposium on Discrete Algorithms, SODA 2024, Alexandria, VA, USA, January 7-10, 2024, pages 4704–4727. SIAM, 2024. doi:10.1137/1.9781611977912.168.
  • [33] Daniel Funke, Demian Hespe, Peter Sanders, Sabine Storandt, and Carina Truschel. Pareto sums of pareto sets: Lower bounds and algorithms. CoRR, abs/2409.10232, 2024. URL: https://doi.org/10.48550/arXiv.2409.10232, arXiv:2409.10232, doi:10.48550/ARXIV.2409.10232.
  • [34] Harold N. Gabow, Jon Louis Bentley, and Robert Endre Tarjan. Scaling and related techniques for geometry problems. In Richard A. DeMillo, editor, Proceedings of the 16th Annual ACM Symposium on Theory of Computing, April 30 - May 2, 1984, Washington, DC, USA, pages 135–143. ACM, 1984. doi:10.1145/800057.808675.
  • [35] Anka Gajentaan and Mark H. Overmars. On a class of o(n2) problems in computational geometry. Comput. Geom., 5:165–185, 1995. doi:10.1016/0925-7721(95)00022-2.
  • [36] Jiawei Gao, Russell Impagliazzo, Antonina Kolokolova, and Ryan Williams. Completeness for first-order properties on sparse structures with algorithmic applications. ACM Trans. Algorithms, 15(2):23:1–23:35, 2019. doi:10.1145/3196275.
  • [37] Demian Hespe, Peter Sanders, Sabine Storandt, and Carina Truschel. Pareto sums of pareto sets. In Inge Li Gørtz, Martin Farach-Colton, Simon J. Puglisi, and Grzegorz Herman, editors, 31st Annual European Symposium on Algorithms, ESA 2023, September 4-6, 2023, Amsterdam, The Netherlands, volume 274 of LIPIcs, pages 60:1–60:17. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2023. URL: https://doi.org/10.4230/LIPIcs.ESA.2023.60, doi:10.4230/LIPICS.ESA.2023.60.
  • [38] Daniel P. Huttenlocher and Klara Kedem. Computing the minimum hausdorff distance for point sets under translation. In Raimund Seidel, editor, Proceedings of the Sixth Annual Symposium on Computational Geometry, Berkeley, CA, USA, June 6-8, 1990, pages 340–349. ACM, 1990. doi:10.1145/98524.98599.
  • [39] Zahra Jafargholi and Emanuele Viola. 3sum, 3xor, triangles. Algorithmica, 74(1):326–343, 2016. URL: https://doi.org/10.1007/s00453-014-9946-9, doi:10.1007/S00453-014-9946-9.
  • [40] Tsvi Kopelowitz, Seth Pettie, and Ely Porat. Higher lower bounds from the 3sum conjecture. In Robert Krauthgamer, editor, Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, pages 1272–1287. SIAM, 2016. URL: https://doi.org/10.1137/1.9781611974331.ch89, doi:10.1137/1.9781611974331.CH89.
  • [41] Marvin Künnemann. A tight (non-combinatorial) conditional lower bound for klee’s measure problem in 3d. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, October 31 - November 3, 2022, pages 555–566. IEEE, 2022. doi:10.1109/FOCS54457.2022.00059.
  • [42] Andrea Lincoln, Virginia Vassilevska Williams, Joshua R. Wang, and R. Ryan Williams. Deterministic time-space trade-offs for k-sum. In Ioannis Chatzigiannakis, Michael Mitzenmacher, Yuval Rabani, and Davide Sangiorgi, editors, 43rd International Colloquium on Automata, Languages, and Programming, ICALP 2016, July 11-15, 2016, Rome, Italy, volume 55 of LIPIcs, pages 58:1–58:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2016. URL: https://doi.org/10.4230/LIPIcs.ICALP.2016.58, doi:10.4230/LIPICS.ICALP.2016.58.
  • [43] Andrea Lincoln, Virginia Vassilevska Williams, and R. Ryan Williams. Tight hardness for shortest cycles and paths in sparse graphs. In Artur Czumaj, editor, Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2018, New Orleans, LA, USA, January 7-10, 2018, pages 1236–1252. SIAM, 2018. doi:10.1137/1.9781611975031.80.
  • [44] Thibaut Lust and Daniel Tuyttens. Variable and large neighborhood search to solve the multiobjective set covering problem. Journal of Heuristics, 20:165–188, 2014.
  • [45] André Nusser. Fine-grained complexity and algorithm engineering of geometric similarity measures. PhD thesis, Saarland University, Saarbrücken, Germany, 2021. URL: https://publikationen.sulb.uni-saarland.de/handle/20.500.11880/33904.
  • [46] Mihai Puatracscu. Towards polynomial lower bounds for dynamic problems. In Leonard J. Schulman, editor, Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC 2010, Cambridge, Massachusetts, USA, 5-8 June 2010, pages 603–610. ACM, 2010. doi:10.1145/1806689.1806772.
  • [47] Britta Schulze, Kathrin Klamroth, and Michael Stiglmayr. Multi-objective unconstrained combinatorial optimization: a polynomial bound on the number of extreme supported solutions. Journal of Global Optimization, 74(3):495–522, 2019.
  • [48] Carola Wenk. Shape matching in higher dimensions. PhD thesis, Free University of Berlin, Dahlem, Germany, 2003. URL: http://www.diss.fu-berlin.de/2003/151/index.html.
  • [49] Ryan Williams. Faster decision of first-order graph properties. In Thomas A. Henzinger and Dale Miller, editors, Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), CSL-LICS ’14, Vienna, Austria, July 14 - 18, 2014, pages 80:1–80:6. ACM, 2014. doi:10.1145/2603088.2603121.
  • [50] Virginia Vassilevska Williams. On some fine-grained questions in algorithms and complexity. In Proceedings of the international congress of mathematicians: Rio de janeiro 2018, pages 3447–3487. World Scientific, 2018.
  • [51] Virginia Vassilevska Williams and Ryan Williams. Subcubic equivalences between path, matrix and triangle problems. In 51th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2010, October 23-26, 2010, Las Vegas, Nevada, USA, pages 645–654. IEEE Computer Society, 2010. doi:10.1109/FOCS.2010.67.
  • [52] Virginia Vassilevska Williams and Ryan Williams. Finding, minimizing, and counting weighted subgraphs. SIAM J. Comput., 42(3):831–854, 2013. doi:10.1137/09076619X.