Univ Lyon, CNRS, ENS de Lyon, Université Claude Bernard Lyon 1, LIP UMR5668, [email protected]://orcid.org/0000-0002-1653-5822 \CopyrightÉdouard Bonnet\ccsdesc[100]Theory of computation → Graph algorithms analysis\supplement\hideLIPIcs\EventEditorsJohn Q. Open and Joan R. Access \EventNoEds2 \EventLongTitle42nd Conference on Very Important Topics (CVIT 2016) \EventShortTitleCVIT 2016 \EventAcronymCVIT \EventYear2016 \EventDateDecember 24–27, 2016 \EventLocationLittle Whinging, United Kingdom \EventLogo \SeriesVolume42 \ArticleNo23

4 vs 7 sparse undirected unweighted Diameter is SETH-hard at time $n^{4/3}$

Édouard Bonnet

Abstract

We show, assuming the Strong Exponential Time Hypothesis, that for every $\varepsilon>0$ , approximating undirected unweighted Diameter on $n$ -vertex $m$ -edge graphs within ratio $7/4-\varepsilon$ requires $m^{4/3-o(1)}$ time, even when $m=\tilde{O}(n)$ . This is the first result that conditionally rules out a near-linear time $5/3$ -approximation for undirected Diameter.

keywords:

Diameter, inapproximability, SETH lower bounds, k-Orthogonal Vectors

1 Introduction

The diameter of a graph is the length of a longest shortest path between two of its vertices. We write Diameter for the algorithmic task of computing the diameter of an input graph. Throughout the paper, $n$ implicitly denotes the number of vertices of a graph, and $m$ , its number of edges. We will often prefix Diameter with undirected/directed to indicate whether or not edges may be oriented¹¹1In directed Diameter, we are to compute the length of a longest shortest path taken from any vertex to any vertex., and unweighted/weighted to indicate whether or not non-negative edge weights are allowed.

A fairly recent and active line of work aims to determine the best runtime for an algorithm approximating Diameter within a given ratio. First, there is an exact algorithm running in time²²2where $\tilde{O}(\cdot)$ suppresses the polylogarithmic factors $\tilde{O}(mn)$ , which computes $n$ shortest-path trees from every vertex of the graph. Secondly, there is a $2$ -approximation running in time $\tilde{O}(m)$ , which computes a shortest-path tree from an arbitrary vertex and outputs the largest distance found. There are an $\tilde{O}(m^{3/2})$ time $3/2$ -approximation for directed weighted Diameter [1, 15, 6], and for every non-negative integer $k$ , an $\tilde{O}(mn^{\frac{1}{k+1}})$ time $(2-2^{-k})$ -approximation³³3with an extra additive factor depending on the weights for undirected weighted Diameter [4]. We refer the interested reader to the survey of Rubinstein and Vassilevska Williams [16].

We will now focus on sparse graphs, for which $m=\tilde{O}(n)$ . This is because the current paper deals with conditional lower bounds on approximating Diameter, and all such results even work with that restriction. Observe that, on sparse graphs, the first result of the previous paragraph is a near-quadratic 1-approximation, while the second result is a near-linear 2-approximation. One can represent these ratio-runtime trade-offs in the two-dimensional plane. The ultimate goal of fine-grained complexity, in that particular context, is to obtain a complete curve of algorithms linking these two extreme points, matched by tight conditional lower bounds. We now present one way of deriving conditional lower bounds for polytime problems.

Lower bounds based on the Strong Exponential Time Hypothesis

The Strong Exponential Time Hypothesis (SETH, for short) asserts that for every $\varepsilon>0$ , there is an integer $k$ such that $k$ -SAT cannot be solved in time $(2-\varepsilon)^{n}$ on $n$ -variable instances [11]. At first glance, this assumption should only be useful to rule out some specific running time for NP-hard problems which, like the satisfiability problem, seems to require superpolynomial time. Such conditional lower bounds to classical [7] or parameterized algorithms [8] are overviewed in a survey [14] on the consequences of the SETH (as well as the weaker assumption ETH) on solving computationally hard problems.

Interestingly, using the SETH to rule out a given running time for a polynomial-time solvable problem took more time. In a survey of fine-grained complexity [18], Vassilevska Williams dates the first reduction (albeit used positively) from SAT to a problem in P back to 2005 [17]. We will see that this reduction to 2-Orthogonal Vectors, where one wants to find two orthogonal $0,1$ -vectors within a given list, is very relevant to the fine-grained complexity of Diameter. As it turns out, the first SETH-based lower bound for a polytime graph problem occurred almost a decade later, on the very unweighted undirected Diameter [15].

There might have been a psychological barrier in reducing a “hard” problem to an “easy” one, in order to derive a conditional lower bound. However this makes perfect sense. Let us give an apropos example. Suppose (as it is actually the case) that one can create in time $O(n)$ a list of $n$ 0,1-vectors with $n=O(2^{N/2})$ , from an $N$ -variable SAT formula, such that there is pair of orthogonal vectors in the list if and only if the formula is satisfiable. Now a truly subquadratic algorithm, that is in time $n^{2-\varepsilon}$ for some $\varepsilon>0$ , for 2-Orthogonal Vectors would enable to solve SAT in time $O(2^{(1-\varepsilon/2)N})=O((2-\delta)^{N})$ for some $\delta>0$ , contradicting the SETH. We thus say that 2-Orthogonal Vectors is SETH-hard at time $n^{2}$ , and more generally a problem $\Pi$ is SETH-hard at time $T$ if it requires time $T^{1-o(1)}$ under the SETH.

SETH lower bounds for Diameter

There is a handful of SETH-hardness results on approximating Diameter [15, 2, 3, 9, 12, 13]. Unless the SETH fails, any $3/2-\varepsilon$ -approximation for sparse undirected unweighted Diameter, with $\varepsilon>0$ , requires time $n^{2-o(1)}$ [15] (this is the above-mentioned seminal result to the fine-grained complexity within P), whereas any $5/3-\varepsilon$ -approximation requires time $n^{3/2-o(1)}$ [12] (an early version of [13]). Since a $5/3$ -approximation of Diameter running in near-linear time was consistent with the then knowledge (up until mid-August 2020, even in weighted directed graphs) Rubinstein and Vassilevska Williams [16] and Li [12] ask for such an algorithm or some lower bounds with a ratio closer to 2.

In the last few months, there were several developments on directed graphs. The author showed that, under the SETH, $7/4-\varepsilon$ -approximating sparse directed weighted Diameter requires time $n^{4/3-o(1)}$ [3]. Then Wein and Dalirrooyfard [9], and independently, Li [13] (an updated version of [12]) both show that not only this result holds on directed unweighted graphs but they generalize it in the following way: Unless the SETH fails, for every $\varepsilon>0$ and every integer $k\geqslant 4$ , $\frac{2k-1}{k}-\varepsilon$ -approximating directed unweighted Diameter requires time $n^{\frac{k}{k-1}-o(1)}$ .

Despite these advances, a near-linear time $5/3$ -approximation for the undirected Diameter may still have existed. In this paper, we rule out this possibility by showing the following (see Figure 1 for a visual summary of what is now known on approximating undirected Diameter).

Theorem 1.1.

Unless the SETH fails, for any $\varepsilon>0$ , $7/4-\varepsilon$ -approximating Diameter on undirected unweighted $n$ -vertex $\tilde{O}(n)$ -edge graphs requires $n^{4/3-o(1)}$ time.

In particular we resolve [16, Open Question 2.2.], on the existence of a near-linear time $5/3$ -approximation for undirected Diameter, by the negative.

Figure 1: Approximability of sparse undirected unweighted Diameter. Blue areas are feasible, as witnessed by algorithms at bottom-left corners (blue dots). The red regions are SETH-hard, as witnessed by reductions at top-right corners (red dots). Dotted cyan areas are not SETH-hard, unless the NSETH fails. The current landscape for the sparse undirected weighted Diameter is the same, except the middle red region is due to Backurs et al. [2] instead of [13]. The axis-parallel black curve represents the tractability frontier as foreseen by 1.2.

In light of the recent results (see in particular the paragraph on barriers to SETH-hardness), it is reasonable to conjecture that the four variants of sparse Diameter (undirected/directed unweighted/weighted) are equally approximable. More precisely, we venture the following optimistic prediction.

Conjecture 1.2.

Sparse (un)directed (un)weighted Diameter is $2-\frac{1}{k}$ -approximable in time $\tilde{O}(n^{\frac{k+1}{k}})$ for every $k\in\mathbb{N}^{+}\cup\{\infty\}$ . Unless the SETH fails, approximating sparse (un)directed (un)weighted Diameter within ratio better than $2-\frac{1}{k+1}$ requires time $n^{\frac{k+1}{k}-o(1)}$ for every $k\in\mathbb{N}^{+}$ .

1.2 is naturally equivalent to obtaining the algorithms for the directed weighted Diameter and the SETH-hardness for the undirected unweighted Diameter. Settling the conjecture would give a complete landscape of the approximability of Diameter, where if one represents the results in the two-dimensional space of approximation factor vs runtime exponent, the feasible and infeasible regions are separated by a rectilinear curve with infinitely many corners (the black curve drawn in Figure 1). In that respect, our contribution is to give the third lower bound on the curve (i.e., North-East corner) after Roditty and Vassilevska Williams gave the first [15], and Li, the second [12]. Hopefully our new ideas (together with the recent constructions in the directed case of Wein and Dalirrooyfard [9], and Li [13]) will also help in generalizing the lower bound predicted by 1.2 to every positive integer $k$ .

Barriers to SETH lower bounds

1.2 is partly prompted by intriguing results due to Li [13]. To state them, we need to recall the definition of a strengthening of SETH introduced by Carmosino et al. [5]. It is called NSETH for Nondeterministic SETH. NSETH asserts that for every $\varepsilon>0$ , there is an integer $k$ such that the $k$ -Taut problem cannot be solved in non-deterministic time $(2-\varepsilon)^{n}$ , where $k$ -Taut asks, given a $k$ -DNF formula whether every truth assignment satisfies it (in other words, if it is a tautology). Li shows, for all four variants of Diameter but the directed weighted one, that no point positioned strictly above the rectilinear black curve of Figure 1 can be shown SETH-hard, under the NSETH (and, if randomized reductions are permitted, under a stronger assumption, called NUNSETH for Non-Uniform NSETH).

1.2 is very optimistic since it predicts that every such point will be explained by an algorithm. There are many alternatives to that event. For instance NSETH could be false⁴⁴4If we are totally honest, even the weaker SETH does not gather such a wide consensus, and is false if quantum computation is allowed., or the intractability region could extend further North via a non SETH-based reduction, or via a deterministic SETH-based reduction in the directed weighted case. Besides it would require significant progress in approximating the sparse directed Diameter, when currently no algorithm running in time $n^{\frac{3}{2}-\varepsilon}$ achieves approximation factor better than 2.

The second half of 1.2 shown for every $k\geqslant 4$ on directed graphs [9, 13], and for $k=1,2,3$ on undirected graphs, is much easier to believe in.

Techniques

Like every mentioned Diameter lower bound (for more details, see the paragraphs on $k$ -OV and Diameter in the surveys [18, 16]), we reduce from $k$ -Orthogonal Vectors, where one seeks, in a given set of $N$ $0,1$ -vectors of dimension $\ell$ , $k$ vectors such that at every index, at least one of these $k$ vectors has a 0 entry. Under the SETH, $k$ -Orthogonal Vectors requires time $N^{k-o(1)}$ [17], even when $\ell$ is polylogarithmic in $N$ .

Here we will reduce from 4-Orthogonal Vectors. We thus wish to build a graph on $\tilde{O}(N^{3})$ vertices and edges with diameter 7 if there is an orthogonal quadruple (i.e., a solution to the the 4-Orthogonal Vectors instance), and diameter 4 otherwise. Following a reduction to $ST$ -Diameter⁵⁵5where one seeks the length of a longest shortest path from a vertex of $S$ to a vertex of $T$ by Backurs et al. [2] (arguably also following [15]) most of the reductions (as in [3, 9, 13]) feature layers $L_{0},L_{1},\ldots,L_{k-1},L_{k}$ , with only (forward) edges between two consecutive $L_{i}$ . The vertices within the same layer share the same number of “vector attributes” and “index attributes”. The interplay between vector and index attributes in defining the vertices and edges is made so that if there are no $k$ orthogonal vectors, then there are paths of “optimal” length $k$ between every pair in $L_{0}\times L_{k}$ , whereas if there is set $X$ of $k$ orthogonal vectors, a pair in $L_{0}\times L_{k}$ jointly encoding $X$ is suddenly very far apart (usually and ideally at distance $2k-1$ ).

Then the challenge is to make sure that, on NO-instances, the other pairs (not in $L_{0}\times L_{k}$ ) are at distance at most $k$ , without destroying the previous property. The core of our reduction is similar to our previous construction for directed weighted Diameter [3]. However we simplify and streamline it in the following way. As in the first construction of Li [12], we collapse some layers into one. We will have $L_{0}=L_{4}(=T)$ and $L_{1}=L_{3}(=C)$ , while $L_{2}$ is called $P$ . This makes the case analyses simpler (fewer kinds of pairs to consider).

At this point, we face the same issue as in [3]: There are pairs in $T\times P$ that are too far apart. On directed graphs, this can be fixed by adding parallel layers and appropriate “back” edges [3, 9] or simply “back” edges [13]. This is no longer an option. Instead we add a set $I$ of vertices with only index attributes. These vertices link the right pairs of $T\times P$ with path of length 4 (we are back to the first variation on the theme [15]). To emphasize that the situation is somewhat delicate, we observe that not all the pairs of $T\times P$ can be at distance 4, since otherwise every pair in $T\times T$ is at distance at most 6. We set $I$ at distance 3 of $T$ (by initially putting edges of weight 3). This permits to cliquify $I$ without creating $TT$ -paths of length at most 6. In turn, this puts every pair involving $I$ at distance at most 4, as well as pairs of $(C\cup P)\times P$ . Note that as long as $d(T,X)+d(T,Y)\geqslant 3$ (or $k-1$ ), one can have all the pairs of $X\times Y$ at distance 4 (or $k$ ), without creating undesired $TT$ -paths of length at most 6 (or $2k-2$ ).

We then remove the weight-3 edges between $T$ and $I$ . This involves some vertex splits transforming $T$ into $T,T^{\prime},T^{\prime\prime}$ , and a simpler echo of the idea of having the clique $I$ , with a clique $I^{\prime}$ connecting appropriately the pairs in $T\times T^{\prime\prime}$ .

Organization

In Section 2, we recall graph-theoretic notations, and give the relevant background on the Orthogonal Vectors problem. In Section 3, we present a simpler reduction with edge weights. It thus achieves the statement of Theorem 1.1 for sparse undirected weighted Diameter. In Section 4, we tune this reduction to get rid of the edge weights, and establish Theorem 1.1.

2 Preliminaries

We use standard graph-theoretic notations. If $G$ is a graph, $V(G)$ denotes its vertex set, and $E(G)$ , its edge set. We denote the edge set between $X\subseteq V(G)$ and $Y\subseteq V(G)$ by $E(X,Y)$ . If $S\subseteq V(G)$ , $G[S]$ denotes the subgraph of $G$ induced by $S$ . Weighted graphs have positive edge weights. (Throughout the paper, we will only need edges of weight 1 and 3.) We exclusively deal with undirected graphs (for which the distance function is symmetric). For $u,v\in V(G)$ , $d_{G}(u,v)$ denotes the distance between $u$ and $v$ in $G$ , that is, the number of edges in a shortest path between $u$ and $v$ . For every positive integer $r$ and every vertex $u\in V(G)$ , $N^{r}_{G}[u]$ denotes the set of vertices $v$ such that $d_{G}(u,v)\leqslant r$ . In unweighted graphs, the closed neighborhood of $u$ , denoted $N_{G}[u]$ , coincides with $N^{1}_{G}[u]$ . However in a weighted graph $N^{1}_{G}[u]$ would for instance not contain the neighbors of $u$ via an edge of weight greater than 1. This subtlety will arise only once, and we will remind the reader in due time. For every positive integer $r$ and every vertex $S\subseteq V(G)$ , $N^{r}_{G}[S]$ denotes the set of vertices $v\in V(G)$ such that $d_{G}(u,v)\leqslant r$ for some $u\in S$ . We observe that, in unweighted graphs, $N^{r}_{G}[S]$ coincides with $N_{G}[N_{G}[\cdots N_{G}[N_{G}[S]]\cdots]]$ where $N_{G}[\cdot]$ is applied $r$ times and $N_{G}[S]$ is the closed neighborhood of $S$ . We drop the subscript in the above notations, if the graph $G$ is clear from the context.

We denote by $\text{diam}(G)$ the diameter of $G$ , that is, $\max_{u,v\in V(G)}d(u,v)$ . The Diameter problem asks, given a graph $G$ , for the value of $\text{diam}(G)$ . We call $uv$ -path, a path going from vertex $u$ to vertex $v$ , and $ST$ -path (with possibly $S=T$ ), any path going from some vertex $u\in S$ to some vertex $v\in T$ .

If $\ell$ is a positive integer, $[\ell]$ denotes the set $\{1,2,\ldots,\ell\}$ . If $v$ is a vector and $i$ is a positive integer, then $v[i]$ denotes the $i$ -th coordinate of $v$ . We use $\text{maj}(a_{1},\ldots,a_{h})$ to denote the value with the largest number of occurrences in the tuple $(a_{1},\ldots,a_{h})$ .

For every fixed positive integer $k$ , the $k$ -Orthogonal Vectors ( $k$ -OV for short) problem is as follows. It asks, given a set $S$ of 0,1-vectors in $\{0,1\}^{\ell}$ , if there are $k$ vectors $v_{1},\ldots,v_{k}\in S$ such that for every $i\in[\ell]$ , $\Pi_{h\in[k]}v_{h}[i]=0$ , or equivalently, $v_{1}[i]=v_{2}[i]=\cdots=v_{k}[i]=1$ does not hold. Williams [17] showed that, assuming the SETH, $k$ -OV requires $N^{k-o(1)}$ time with $N:=|S|$ . Furthermore, using the Sparsification Lemma [10], this lower bound holds even when, say, $\ell=\lceil\log^{2}N\rceil$ . Here we will leverage this lower bound for $k=4$ . This is, in the context of the SETH-hardness of approximating Diameter, a usual opening step: For example, Roditty and Vassilevska Williams [15] uses this lower bound for $k=2$ , Li [12], for $k=3$ and general $k\geqslant 3$ , the author [3], for $k=4$ , Wein and Dalirrooyfard [9], for general $k\geqslant 5$ and $k=4$ .

3 A simpler reduction with edge weights

From any set $S$ of $N$ vectors in $\{0,1\}^{\ell}$ , we build an undirected weighted graph $G=\rho(S)$ (with edge weights 1 and 3, only) with $O(N^{3}+N^{2}\ell^{3}+\ell^{5})$ vertices and $O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10})$ edges such that if $S$ admits an orthogonal quadruple then the diameter of $G$ is (at least) 7, whereas if $S$ has no orthogonal quadruple then the diameter of $G$ is (at most) 4. We recall that 4-OV requires $N^{4-o(1)}$ time, unless the SETH fails, even when $\ell=\lceil\log^{2}N\rceil$ [17]. In that case, the graph $G$ has $O(N^{3})$ vertices and $\tilde{O}(N^{3})$ edges. Hence any algorithm approximating sparse undirected weighted Diameter within ratio better than $7/4$ in time $n^{4/3-\delta}$ , with $\delta>0$ , would refute the SETH.

3.1 Construction

We first describe the vertex set of $G$ , then its edge set, and finally check that the number of vertices and edges are as announced.

Vertex set

Every vertex of $G$ is the concatenation of a possibly empty tuple of vectors of $S$ , called vector tuple, followed by a possibly empty tuple of possibly equal indices of $[\ell]$ , called index tuple. Each coordinate of the vector tuple is called a vector field, while each coordinate of the index tuple is called an index field. The set $V(G)$ is partitioned into four sets: $T$ (for triples), $C$ (for couples), $P$ (for pairs), and $I$ (for indices). The names behind $T,C,P$ reflect the number and the nature (ordered or unordered) of the vector fields. Each of these sets comprise vertices with up to three vector fields and five index fields. They are defined in the following way.

•

$T$ : for every $\{a,b,c\}\in{S\choose 3}$ , we add vertex $(a,b,c)$ to $T$ . Thus vertices of $T$ have three vector fields and no index field.
•

$C$ : for every $\{a,b\}\in{S\choose 2}$ and $i,j,k\in[\ell]$ such that $a[i]=a[j]=a[k]=1$ and $\text{maj}(b[i],b[j],b[k])=1$ , we add vertex $(a,b,i,j,k)$ to $C$ . Thus vertices of $C$ have two vector fields and three index fields.
•

$P$ : for every $a,b\in S$ and $i,j,k\in[\ell]$ such that $a[i]=a[j]=a[k]=1$ and $b[i]=b[j]=b[k]=1$ , we add vertex $(\{a,b\},i,j,k)$ to $P$ . We will still see $a$ and $b$ as filling the two vector fields of the vertex, without a first vector field and a second vector field. Contrary to vertices of $C$ , $(\{a,b\},i,j,k)$ and $(\{b,a\},i,j,k)$ are two names for the same vertex (whereas $(a,b,i,j,k)$ and $(b,a,i,j,k)$ are two distinct vertices, whose existence implies slightly different properties). Thus vertices of $P$ also have two vector fields and three index fields. Note also that $\{a,b\}$ is a multiset, since $a$ may be equal to $b$ .
•

$I$ : for every $p_{1},p_{2},i,j,k\in[\ell]$ , we add vertex $(p_{1},p_{2},i,j,k)$ to $I$ . The chosen labels for the five index fields anticipate that, to build the edge set, it is convenient to imagine a separation after the first two index fields of the tuple. The vertices of $I$ have no vector field and five index fields.

Edge set

We will put some edges between $T$ and $C$ , $C$ and $P$ , $P$ and $I$ , and $I$ and $T$ . In addition, we put index-switching edges within $I$ and within $C$ . An index-switching edge is between two vertices of the same set ( $I$ or $C$ ) with the same vector tuple (which is always the case in $I$ ) and distinct index tuples. The only edges with a weight different than 1 are the edges between $I$ and $T$ , which all have weight 3. Thus, unless specified otherwise, an edge has weight 1.

The total list of edges is as follows.

•

We add all the index-switching edges within $I$ and $C$ . Thus $G[I]$ is a clique and $G[C]$ is a disjoint union of at most ${|S|\choose 2}$ cliques (while $G[T]$ and $G[P]$ remain independent sets). More explicitly, we have an edge between every pair of distinct vertices $(p_{1},p_{2},i,j,k)\in I$ and $(p^{\prime}_{1},p^{\prime}_{2},i^{\prime},j^{\prime},k^{\prime})\in I$ , and for every $a\neq b\in S$ between every pair of distinct vertices $(a,b,i,j,k)\in C$ and $(a,b,i^{\prime},j^{\prime},k^{\prime})\in C$ .
•

$E(T,C)$ : We add an edge between every $(a,b,c)\in T$ and $(a,b,i,j,k)\in C$ provided that there is an $h\in\{i,j,k\}$ such that $b[h]=c[h]=1$ .
•

$E(C,P)$ : We add an edge between every $(a,b,i,j,k)\in C$ and $(\{c,d\},i,j,k)\in P$ whenever $a\in\{c,d\}$ .
•

$E(T,I)$ : We add an edge of weight 3 between every $(a,b,c)\in T$ and $(p_{1},p_{2},i,j,k)\in I$ whenever $a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1$ .
•

$E(I,P)$ : We add an edge between every $(p_{1},p_{2},i,j,k)\in I$ and $(\{a,b\},i,j,k)\in P$ whenever $a[p_{1}]=b[p_{2}]=1$ or $a[p_{2}]=b[p_{1}]=1$ .

This ends the construction. See Figure 2 for an illustration.

Figure 2: The weighted construction

G

. In bold, the conditions for the existence of a vertex or of an edge. The edge in blue, and more generally every edge of

E(T,I)

, has weight 3, while all other edges have weight 1. The pairs in red recall, for vertices of the corresponding set, the length of their vector tuple followed by the length of their index tuple.

Vertex and edge count

There are $O(N^{3})$ vertices in $T$ , $O(N^{2}\ell^{3})$ , in $C\cup P$ , and $\ell^{5}$ , in $I$ , hence $O(N^{3}+N^{2}\ell^{3}+\ell^{5})=O(N^{3})$ in total. There are $O(N^{3}\ell^{3})$ edges in $E(T,C)\cup E(C,P)$ , $O(N^{2}\ell^{6})$ , in $E(C)$ , $O(N^{3}\ell^{5})$ , in $E(T,I)$ , $O(N^{2}\ell^{5})$ , in $E(I,P)$ , and $O(\ell^{10})$ in $E(I)$ , hence $O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10})=\tilde{O}(N^{3})$ edges in total. Furthermore $G$ can be built in time $\tilde{O}(N^{3})$ .

3.2 The absence of orthogonal quadruple implies diameter at most 4

Assuming that there is no orthogonal quadruple, we show that every pair of vertices of $G$ is at distance at most 4. For that we repeatedly use that, for every $a,b,c,d\in S$ , $\text{ind}(a,b,c,d):=\min\{i\in[\ell]$ $|$ $a[i]=b[i]=c[i]=d[i]=1\}$ is a well-defined index in $[\ell]$ . We only take the minimum index to have a deterministic notation, but there is nothing particular with it, and any index of the non-empty $\{i\in[\ell]$ $|$ $a[i]=b[i]=c[i]=d[i]=1\}$ would work all the same.

We first observe that every vertex is at distance at most 3 from $I$ .

Lemma 3.1.

$N^{1}[I]\supseteq I\cup P$ , $N^{2}[I]\supseteq I\cup P\cup C$ , and $N^{3}[I]=V(G)$ .

Proof 3.2.

The first and second inclusions are actually equalities but we will not need those facts. $N^{1}[I]\supseteq I\cup P$ since every $(\{a,b\},i,j,k)\in P$ is adjacent (with an edge of weight 1) to $(i,i,i,j,k)\in I$ . Then, $N^{2}[I]\supseteq N^{1}[I\cup P]\supseteq I\cup P\cup C$ since every $(a,b,i,j,k)\in C$ is adjacent to $(\{a,a\},i,j,k)\in P$ . Finally, $N^{3}[I]\supseteq N^{1}[I\cup P\cup C]=V(G)$ since every $(a,b,c)\in T$ is adjacent to $(a,b,i,i,i)\in C$ for some $i\in[\ell]$ , for otherwise $a,b,c$ is an orthogonal triple.

We now exhibit paths of length at most 4 between every pair of vertices of $G$ . For the case disjunction, initially imagine the $K_{4}$ with loops on vertices $T,C,P,I$ , where edges correspond to kinds of pairs that are left to check. The following paragraphs remove all its edges in the order: all edges incident to $I$ , all remaining edges incident to $P$ but $TP$ , all remaining edges incident to $C$ , the loop on $T$ , and finally the edge $TP$ .

Between $u\in I$ and $v\in V(G)$

As $G[I]$ is a clique and, by Lemma 3.1, $N^{3}[I]=V(G)$ , every vertex $u\in I$ is at distance at most 4 from every vertex $v\in V(G)$ .

Between $u\in P$ and $v\in P\cup C$

For every $u\in P$ , $N^{2}[u]\supset I$ and so $N^{4}[u]\supset P\cup C$ , by Lemma 3.1. In particular there is a path of length at most 4 between $u$ and any vertex $v\in P\cup C$ .

Between $u\in C$ and $v\in T\cup C$

Let $(a,b)$ be the two vector fields of $u\in C$ , $(c,d)$ be the first two vector fields of $v\in T\cup C$ , and $e$ be the third vector field of $v$ if $v\in T$ . Let $i=\text{ind}(a,b,c,d)$ , $j=\text{ind}(a,c,d,e)$ if $v\in T$ , and $j=i$ if $v\in C$ . We observe that $(a,b,i,i,j),(\{a,c\},i,i,j),(c,d,i,i,j)$ are (existing) vertices of $C$ , $P$ , and $C$ , respectively, and that $u-(a,b,i,i,j)-(\{a,c\},i,i,j)-(c,d,i,i,j)$ is a path of length 3 in $G$ . The existence of these vertices is implied by $a[i]=b[i]=c[i]=d[i]=1$ , $a[j]=c[j]=d[j]=1$ . The first edge of the path is an index-switching edge within $C$ . The existence of the other edges is implied by $a\in\{a,c\}$ , $c\in\{a,c\}$ , and the fact that the index tuple $(i,i,j)$ does not change.

Finally if $v\in C$ , then the index-switching edge $(c,d,i,i,j)-v$ completes the $uv$ -path of length 4. If instead $v\in T$ , then the edge $(c,d,i,i,j)-(c,d,e)=v$ completes the $uv$ -path of length 4. This edge exists since $d[j]=e[j]=1$ .

Between $u\in T$ and $v\in T$

Let $u=(a,b,c),v=(d,e,f)\in T$ , $i=\text{ind}(a,b,c,d)$ , $j=\text{ind}(a,b,d,e)$ and $k=\text{ind}(a,d,e,f)$ . Then $u=(a,b,c)-(a,b,i,j,k)-(\{a,d\},i,j,k)-(d,e,i,j,k)-(d,e,f)=v$ is a path of length 4 in $G$ . These vertices exist since $a$ and $d$ have value 1 on indices $i,j,k$ , $b$ , on indices $i,j$ , and $e$ , on indices $j,k$ . The first edge exists since $b[i]=c[i]=1$ , the next two edges exist for similar reasons as invoked in the previous paragraph, and the fourth edge exists since $e[k]=f[k]=1$ .

Between $u\in T$ and $v\in P$

Let $u=(a,b,c)\in T$ and $v=(\{d,e\},i,j,k)\in P$ . We set $p_{1}=\text{ind}(a,b,c,d)$ , $p_{2}=\text{ind}(a,b,c,e)$ , and exhibit a $uv$ -path of length 4 via $I$ . Indeed $u=(a,b,c)-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)=v$ is a path of length 4 in $G$ (recall that the first edge of the path has weight 3). Edge $(a,b,c)-(p_{1},p_{2},i,j,k)\in E(T,I)$ exists since $a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1$ . Edge $(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)\in E(I,P)$ exists since $d[p_{1}]=e[p_{2}]=1$ and the three last indices $(i,j,k)$ remain unchanged.

3.3 The presence of orthogonal quadruple implies diameter at least 7

Let $a,b,c,d\in S$ be an orthogonal quadruple, that is, such that there is no index $i\in[\ell]$ satisfying $a[i]=b[i]=c[i]=d[i]=1$ . We may further assume that $a,b,c,d$ are all distinct since checking for an orthogonal triple can be done in time $\tilde{O}(N^{3})$ . We will now show that there is no path $\mathcal{P}$ of length at most 6 between $u=(a,b,c)\in T$ and $v=(d,c,b)\in T$ .

Since the distance between every pair of vertices in $T\times I$ is at least 3, a $TT$ -path of length at most 6 cannot contain an edge of the clique $G[I]$ , nor more generally intersects $I$ at least twice. We thus distinguish two cases: (case A) $\mathcal{P}$ visits $I$ exactly once, and (case B) $\mathcal{P}$ remains within $T\cup C\cup P$ . Before proving that no $uv$ -path $\mathcal{P}$ of length at most 6 visits $I$ , thereby ruling out case A, we state a couple of useful observations.

Observation 3.3.

There is at most one path of length 2 between $(\{d,e\},i,j,k)\in P$ and $(a,b,c)\in T$ , namely $(\{d,e\},i,j,k)-(a,b,i,j,k)-(a,b,c)$ , which in particular implies that $a\in\{d,e\}$ .

More basically, the only neighbors of $(a,b,c)\in T$ (at distance 1, so not in $I$ ) are of the form $(a,b,i,j,k)\in C$ . We can generalize this observation to paths contained in $T\cup C$ .

Observation 3.4.

For every path within $G[T\cup C]$ , all the vertices of the path have the same first two vector fields.

Case A: $\mathcal{P}$ visiting $I$

As $\mathcal{P}$ cannot visit $I$ twice, if it visits $I$ then it has length exactly 6 and is one of the following kinds: (case 1) $T-I-T$ , (case 2) $T-C-P-I-P-C-T$ , or (case 3) $T-I-P-C-T$ (recall that the edges in $E(I,T)$ have weight 3). An important feature of such paths is that no index-switching edge can be used, thus the three last index fields (when they exist) have to remain the same.

Case 1. A path $(a,b,c)-(p_{1},p_{2},i,j,k)-(d,c,b)$ would in particular imply that $a[p_{1}]=b[p_{1}]=c[p_{1}]=d[p_{1}]=1$ , contradicting the orthogonality of $a,b,c,d$ .

Case 2. By 3.3 applied to both ends of the path, $\mathcal{P}$ is of the form $(a,b,c)-(a,b,i,j,k)-(\{a,e\},i,j,k)-(p_{1},p_{2},i,j,k)-(\{d,f\},i,j,k)-(d,c,i,j,k)-(d,c,b)$ with some $e,f\in S$ . The existence of the vertices $(a,b,i,j,k),(d,c,i,j,k)\in C$ implies that $a[i]=a[j]=a[k]=d[i]=d[j]=d[k]=1$ , and that $b$ and $c$ have value 1 on at least two indices (with multiplicity) of multiset $\{i,j,k\}$ . In particular, there is an $h\in\{i,j,k\}$ such that $a[h]=b[h]=c[h]=d[h]=1$ , a contradiction to $a,b,c,d$ being orthogonal.

Case 3. By 3.3 applied to the second half of the path, $\mathcal{P}$ has then the form $(a,b,c)-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)-(d,c,i,j,k)-(d,c,b)$ . The first three vertices yield a contradiction. Indeed, the existence of edge $(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)$ implies that $d[p_{z}]=1$ for some $z\in\{1,2\}$ , while the existence of $(a,b,c)-(p_{1},p_{2},i,j,k)$ implies that $a[p_{z}]=b[p_{z}]=c[p_{z}]=1$ .

Case B: paths $\mathcal{P}$ within $T\cup C\cup P$

We now consider paths $\mathcal{P}$ in $G[T\cup C\cup P]$ . Since $a\neq d$ , $\mathcal{P}$ has to visit $P$ , since otherwise the first vector field cannot change, by 3.4. We then observe that no shortest $uv$ -path visits $T$ a third time (one more time than the two endpoints $u$ and $v$ ). A $TT$ -path visiting $T$ a third time would contain a segment $C-T-C$ that can be shortcut into $C-C$ . Indeed, $(a,b,i,j,k)-(a,b,c)-(a,b,i^{\prime},j^{\prime},k^{\prime})$ has a chord $(a,b,i,j,k)-(a,b,i^{\prime},j^{\prime},k^{\prime})$ which is an index-switching edge of $C$ .

We further distinguish two cases: (case 1) $\mathcal{P}$ does not contain any index-switching edge, or (case 2) $\mathcal{P}$ contains at least one index-switching edge.

Case 1. In that case, $\mathcal{P}$ is of the form $T-C-P-C-T$ or $T-C-P-C-P-C-T$ . Either way, we consider the unique neighbors of $u$ and $v$ in $\mathcal{P}$ . These neighbors have to be $(a,b,i,j,k)\in C$ and $(d,c,i,j,k)\in C$ for some $i,j,k\in[\ell]$ . Indeed no index-switching edge nor return to $T$ is allowed here. Thus we conclude as in case A.2.

Case 2. We now assume that $\mathcal{P}$ contains at least one index-switching edge (of $C$ ). In that case, as $\mathcal{P}$ has length at most 6, it can visit $P$ only once. Hence $\mathcal{P}$ is of the kind $T-C-C-P-C-C-T$ , where one of the two edges $C-C$ is optional. We consider the last vertex $u^{\prime}\in C$ before visiting $P$ , and the first vertex $v^{\prime}\in C$ after visiting $P$ . There is, by design, no index-switching edge between $u^{\prime}$ and $v^{\prime}$ on path $\mathcal{P}$ . Thus by 3.4, there are $i,j,k\in[\ell]$ such that $u^{\prime}=(a,b,i,j,k)$ and $v^{\prime}=(d,c,i,j,k)$ . We then conclude as in case A.2.

4 Removing the weights

So far we showed the announced lower bound for sparse undirected weighted Diameter. We show how to tune the previous construction to get the same lower bound for sparse undirected unweighted Diameter. The weighted graph $G$ had only non-trivial edge weights in $E(T,I)$ . We now describe how to replace these weighted edges, to get an unweighted graph $G^{\prime}=\rho^{\prime}(S)$ .

4.1 Unweighted construction

We start with a short summary of the changes. We will replace $T$ by three copies $T,T^{\prime},T^{\prime\prime}$ with an induced perfect matching between $T$ and $T^{\prime}$ , and between $T^{\prime}$ and $T^{\prime\prime}$ . We link $T^{\prime\prime}$ to $I$ as we linked $T$ to $I$ , and $T$ and $C$ , and $T^{\prime}$ and $C$ , as we linked $T$ and $C$ . We finally add a set $I^{\prime}$ of vertices with empty vector tuple (like $I$ ) that we link to $T^{\prime\prime}$ and $I$ only.

Addition to the vertex set

We add three sets to $V(G)$ to get $V(G^{\prime})$ : two identical copies of $T$ , denoted by $T^{\prime}$ and $T^{\prime\prime}$ , and a set $I^{\prime}$ isomorphic to $[\ell]$ . More precisely, for every $i\in[\ell]$ , we add vertex $(i)$ to $I^{\prime}$ . Thus $I^{\prime}$ has no vector field and a unique index field. We use a subscript to distinguish the homologous vertices in $T,T^{\prime},T^{\prime\prime}$ . Vertices $(a,b,c)_{T}\in T,(a,b,c)_{T^{\prime}}\in T^{\prime},(a,b,c)_{T^{\prime\prime}}\in{T^{\prime\prime}}$ are the three vertices of $G^{\prime}$ corresponding to the same vertex $(a,b,c)$ of $G$ .

Edition of the edge set

We first remove the edges of $G$ with weight 3 (between $T$ and $I$ ). For every $\{a,b,c\}\in{S\choose 3}$ , we add the edges $(a,b,c)_{T}-(a,b,c)_{T^{\prime}}$ and $(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}$ . We also add edges between $T^{\prime}$ and $C$ , the same way we have defined edges between $T$ and $C$ . That is, $(a,b,c)_{T^{\prime}}-(a,b,i,j,k)$ is an edge if and only if $(a,b,c)_{T}-(a,b,i,j,k)$ is an edge. Let us recall that the existence of this edge (and of its endpoint in $C$ ) implies that $a,b,c$ have value 1 on indices $\{i,j,k\}$ , three times, at least twice, and at least once, respectively, and that there is an $h\in\{i,j,k\}$ such that $a[h]=b[h]=c[h]$ .

We add edges (of weight 1) between $T^{\prime\prime}$ and $I$ , the same way we defined the weight-3 edges of $G$ between $T$ and $I$ . Thus there is an edge $(a,b,c)_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k)$ whenever $a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1$ . We further add an edge between $(i)\in I^{\prime}$ and $(a,b,c)_{T^{\prime\prime}}$ whenever $a[i]=b[i]=1$ . Finally we add all the index-switching edges in $I^{\prime}$ , and we make $I$ and $I^{\prime}$ fully adjacent, that is, we turn $G^{\prime}[I^{\prime}\cup I]$ into a clique.

This finishes the edition to the unweighted construction. See Figure 3 for an illustration.

Figure 3: The unweighted construction

G^{\prime}

. In bold, the conditions for the existence of a vertex or of an edge. The pairs in red recall, for vertices of the corresponding set, the length of their vector tuple followed by the length of their index tuple.

New vertex and edge count

We added to $V(G)$ $O(N^{3})$ vertices in $T^{\prime}\cup T^{\prime\prime}$ , and $\ell$ , in $I^{\prime}$ . Thus $G^{\prime}$ has also $O(|V(G)|)=O(N^{3}+N^{2}\ell^{3}+\ell^{5})=O(N^{3})$ vertices. We added to $E(G)$ $O(N^{3}+N^{3}\ell^{3})$ edges incident to $T^{\prime}$ , and $O(N^{3}\ell+\ell^{6}+\ell^{2})$ edges incident to $I^{\prime}$ . (The edges between $T^{\prime\prime}$ and $I$ were already counted in $G$ between $T$ and $I$ .) Thus $G^{\prime}$ has $O(|E(G)|)=O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10})=\tilde{O}(N^{3})$ edges. Again $G^{\prime}$ can be computed in time $\tilde{O}(N^{3})$ .

4.2 The absence of orthogonal quadruple implies diameter at most 4

In case $S$ has no orthogonal quadruple, we use similar arguments as in $G$ , to find paths of length at most 4 between every pair of vertices in $G^{\prime}$ . We first show that $I^{\prime}$ is at distance at most 3 of every vertex of $G^{\prime}$ .

Lemma 4.1.

$N[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}$ , $N^{2}[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime}$ , and $N^{3}[I^{\prime}]=V(G^{\prime})$ .

Proof 4.2.

The inclusions are actually equalities. $N[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}$ since $I$ is fully adjacent to $I^{\prime}$ and every $(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime}$ is adjacent to some $(i)\in I^{\prime}$ , for otherwise $a,b$ is an orthogonal pair. $N^{2}[I^{\prime}]\supseteq N[I^{\prime}\cup I\cup T^{\prime\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime}$ since every $(\{a,b\},i,j,k)\in P$ is adjacent to $(i,i,i,j,k)\in I$ and every $(a,b,c)_{T^{\prime}}\in T^{\prime}$ is adjacent to $(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime}$ . Finally, $N^{3}[I^{\prime}]\supseteq N[I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime}]=V(G^{\prime})$ since every $(a,b,i,j,k)\in C$ is adjacent to $(\{a,a\},i,j,k)\in P$ and every $(a,b,c)_{T}\in T$ is adjacent to $(a,b,c)_{T^{\prime}}\in T^{\prime}$ .

We also show the following inclusions.

Lemma 4.3.

$N[I]\supset P\cup T^{\prime\prime}$ , and $N^{2}[I]\supset P\cup C\cup T^{\prime\prime}\cup T^{\prime}$ .

Proof 4.4.

$N[I]\supset P$ , $N^{2}[I]\supset C$ , and $N[T^{\prime\prime}]\supset T^{\prime}$ have all been shown in Lemma 4.1. Therefore we shall just prove that $N[I]\supset T^{\prime\prime}$ . Indeed every vertex $(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime}$ is adjacent to some $(i,i,i,i,i)\in I$ , since otherwise $a,b,c$ is an orthogonal triple.

For the case disjunction, initially imagine the $K_{7}$ with loops on vertices $T,T^{\prime},T^{\prime\prime},C,P,I,I^{\prime}$ , where edges correspond to the kinds of pairs that are left to check. The following paragraphs remove all its edges in the order: all edges incident to $I$ and to $I^{\prime}$ , all remaining edges incident to $P$ and to $T^{\prime\prime}$ but $TP$ and $TT^{\prime\prime}$ , all remaining edges incident to $C$ , all remaining edges incident to $T^{\prime}$ as well the loop on $T$ , the edge $TP$ , and finally the edge $TT^{\prime\prime}$ .

Between $u\in I\cup I^{\prime}$ and $v\in V(G^{\prime})$

As $G^{\prime}[I\cup I^{\prime}]$ is a clique and, by Lemma 4.1, $N^{3}[I^{\prime}]=V(G^{\prime})$ , then $N^{4}[u]=V(G^{\prime})$ holds for every vertex $u\in I\cup I^{\prime}$ .

Between $u\in P\cup T^{\prime\prime}$ and $v\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}$

For every $u\in P\cup T^{\prime\prime}$ , by Lemma 4.3 and the fact that $G^{\prime}[I]$ is a clique, $N^{2}[u]\supset I$ and, again by Lemma 4.3, $N^{4}[u]\supset P\cup C\cup T^{\prime\prime}\cup T^{\prime}$ . In particular there is a path of length at most 4 from $u$ to any vertex $v\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}$ .

The following two cases work as in $G$ , since $(a,b,c)_{T}$ and $(a,b,c)_{T^{\prime}}$ are twins in $G^{\prime}[T\cup T^{\prime}\cup C\cup P]$ .

Between $u\in C$ and $v\in T\cup T^{\prime}\cup C$

This holds by replacing the occurrence of $(c,d,e)$ by $(c,d,e)_{T}$ or $(c,d,e)_{T^{\prime}}$ , and every occurrence of $T$ by $T\cup T^{\prime}$ , in the paragraph Between $u\in C$ and $v\in T\cup C$ of the weighted construction.

Between $u\in T\cup T^{\prime}$ and $v\in T\cup T^{\prime}$

Again this holds by replacing occurrences of $(a,b,c)$ (resp. $(c,d,e)$ ) by $(a,b,c)_{T}$ or $(a,b,c)_{T^{\prime}}$ (resp. $(c,d,e)_{T}$ or $(c,d,e)_{T^{\prime}}$ ).

Between $u\in T$ and $v\in P$

This works as in $G$ by following three edges of weight 1 from $T$ to $I$ , instead of a single edge of weight 3. For every $u=(a,b,c)_{T}\in T$ and $v=(\{d,e\},i,j,k)\in P$ , there is a path $u=(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)=v$ in $G^{\prime}$ , with $p_{1}=\text{ind}(a,b,c,d)$ and $p_{2}=\text{ind}(a,b,c,e)$ .

Between $u\in T$ and $v\in T^{\prime\prime}$

This case is the real novelty compared to $G$ , and the reason for introducing $I^{\prime}$ . For every $u=(a,b,c)_{T}\in T$ and $v=(d,e,f)_{T^{\prime\prime}}\in T^{\prime\prime}$ , there is a path $u=(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(i)-(d,e,f)_{T^{\prime\prime}}=v$ in $G^{\prime}$ , with $i=\text{ind}(a,b,d,e)$ . The last two edges exist since $a[i]=b[i]=d[i]=e[i]=1$ .

4.3 The presence of orthogonal quadruple implies diameter at least 7

Again we assume that there is an orthogonal quadruple $a,b,c,d\in S$ such that $a,b,c,d$ are pairwise distinct. We claim that there is no path of length at most 6 in $G^{\prime}$ between $u=(a,b,c)_{T}$ and $v=(d,c,b)_{T}$ . Since the distance between $T$ and $I\cup I^{\prime}$ is at least 3, any $TT$ -path of length at most 6 visits $I\cup I^{\prime}$ at most once. For the sake of contradiction, let $\mathcal{P}$ be such a path that we further assume shortest (hence in particular chordless) and, among shortest $uv$ -paths, having the fewest edges in $E(T^{\prime},C)$ . We will show that $\mathcal{P}$ cannot visit $I^{\prime}$ , nor use any edge of $E(T^{\prime},C)$ . Finally we observe that $TT$ -paths of length at most 6 in $G^{\prime}$ respecting these two interdictions are in length-preserving one-to-one correspondence with $TT$ -paths in $G$ .

$\mathcal{P}$ cannot visit $I^{\prime}$

The only possible kind of a $TT$ -path of length at most 6 visiting $I^{\prime}$ is $T-T^{\prime}-T^{\prime\prime}-I^{\prime}-T^{\prime\prime}-T^{\prime}-T$ . This forces $\mathcal{P}$ to be of the form $(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(i)-(d,c,b)_{T^{\prime\prime}}-(d,c,b)_{T^{\prime}}-(d,c,b)_{T}$ for some $i\in[\ell]$ . However the third and fourth edges imply that there is an $i\in[\ell]$ such that $a[i]=b[i]=c[i]=d[i]=1$ , a contradiction to the orthogonality of $a,b,c,d$ .

$\mathcal{P}$ cannot use any edge of $E(T^{\prime},C)$

Assuming that $\mathcal{P}$ contains at least one edge in $E(T^{\prime},C)$ , we first show that it has to contain a subpath $C-T^{\prime}-T^{\prime\prime}-I\cup I^{\prime}$ or $I\cup I^{\prime}-T^{\prime\prime}-T^{\prime}-C$ . Let $w=(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}\in T^{\prime}\cap V(\mathcal{P})$ be a vertex of $\mathcal{P}$ with one neighbor $x\in C\cap V(\mathcal{P})$ on $\mathcal{P}$ . The other neighbor $y$ of $w$ on $\mathcal{P}$ is necessarily in $T^{\prime\prime}$ . Indeed if $y\in T$ , then $y=(a^{\prime},b^{\prime},c^{\prime})_{T}$ , and $xy\in E(G^{\prime})$ is a chord. If instead $y\in C$ , then one can replace the subpath $x-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}-y$ by $x-(a^{\prime},b^{\prime},c^{\prime})_{T}-y$ , contradicting the minimality of the number of used edges in $E(T^{\prime},C)$ (since this number decreases by 2).

Thus the only possibility is that $y\in T^{\prime\prime}$ . Then the other neighbor of $y$ on $\mathcal{P}$ (other than $w$ ) has to be in $I\cup I^{\prime}$ , since otherwise $\mathcal{P}$ is not a simple path. Hence $\mathcal{P}$ contains a subpath of the kind $C-T^{\prime}-T^{\prime\prime}-I\cup I^{\prime}$ (or the reverse, $I\cup I^{\prime}-T^{\prime\prime}-T^{\prime}-C$ ). Now we observe that $C$ is at distance at least 1 from $T$ , while $I\cup I^{\prime}$ is at distance at least 3 from $T$ . Therefore such a path $\mathcal{P}$ would have length at least 7.

Such a path $\mathcal{P}$ would also exist in $G$

We can now assume that $\mathcal{P}$ does not use any vertex of $I^{\prime}$ nor any edge of $E(T^{\prime},C)$ . Every such simple $TT$ -path (visiting $I$ at most once) also exists in the weighted graph $G$ , with the same length. To see it, we notice that if $\mathcal{P}$ contains an edge $(a^{\prime},b^{\prime},c^{\prime})_{T}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}$ , then it has to contain a subpath of the form $(a^{\prime},b^{\prime},c^{\prime})_{T}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k)\in I$ , and is emulated in $G$ by taking the weight-3 edge $(a^{\prime},b^{\prime},c^{\prime})-(p_{1},p_{2},i,j,k)$ . However we showed in the previous section that no $uv$ -path of length at most 6 exists in $G$ .

References

[1] Donald Aingworth, Chandra Chekuri, Piotr Indyk, and Rajeev Motwani. Fast Estimation of Diameter and Shortest Paths (Without Matrix Multiplication). SIAM J. Comput., 28(4):1167–1181, 1999. URL: https://doi.org/10.1137/S0097539796303421, doi:10.1137/S0097539796303421.
[2] Arturs Backurs, Liam Roditty, Gilad Segal, Virginia Vassilevska Williams, and Nicole Wein. Towards tight approximation bounds for graph diameter and eccentricities. In Ilias Diakonikolas, David Kempe, and Monika Henzinger, editors, Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, Los Angeles, CA, USA, June 25-29, 2018, pages 267–280. ACM, 2018. URL: https://doi.org/10.1145/3188745.3188950, doi:10.1145/3188745.3188950.
[3] Édouard Bonnet. Inapproximability of Diameter in super-linear time: Beyond the 5/3 ratio. CoRR, To appear at STACS 2021, abs/2008.11315, 2020. URL: https://arxiv.org/abs/2008.11315, arXiv:2008.11315.
[4] Massimo Cairo, Roberto Grossi, and Romeo Rizzi. New Bounds for Approximating Extremal Distances in Undirected Graphs. In Robert Krauthgamer, editor, Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, pages 363–376. SIAM, 2016. URL: https://doi.org/10.1137/1.9781611974331.ch27, doi:10.1137/1.9781611974331.ch27.
[5] Marco L. Carmosino, Jiawei Gao, Russell Impagliazzo, Ivan Mihajlin, Ramamohan Paturi, and Stefan Schneider. Nondeterministic extensions of the strong exponential time hypothesis and consequences for non-reducibility. In Madhu Sudan, editor, Proceedings of the 2016 ACM Conference on Innovations in Theoretical Computer Science, Cambridge, MA, USA, January 14-16, 2016, pages 261–270. ACM, 2016. URL: https://doi.org/10.1145/2840728.2840746, doi:10.1145/2840728.2840746.
[6] Shiri Chechik, Daniel H. Larkin, Liam Roditty, Grant Schoenebeck, Robert Endre Tarjan, and Virginia Vassilevska Williams. Better Approximation Algorithms for the Graph Diameter. In Chandra Chekuri, editor, Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2014, Portland, Oregon, USA, January 5-7, 2014, pages 1041–1052. SIAM, 2014. URL: https://doi.org/10.1137/1.9781611973402.78, doi:10.1137/1.9781611973402.78.
[7] Marek Cygan, Holger Dell, Daniel Lokshtanov, Dániel Marx, Jesper Nederlof, Yoshio Okamoto, Ramamohan Paturi, Saket Saurabh, and Magnus Wahlström. On Problems as Hard as CNF-SAT. ACM Trans. Algorithms, 12(3):41:1–41:24, 2016. URL: https://doi.org/10.1145/2925416, doi:10.1145/2925416.
[8] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. URL: https://doi.org/10.1007/978-3-319-21275-3, doi:10.1007/978-3-319-21275-3.
[9] Mina Dalirrooyfard and Nicole Wein. Tight Conditional Lower Bounds for Approximating Diameter in Directed Graphs. CoRR, abs/2011.03892, 2020. URL: https://arxiv.org/abs/2011.03892, arXiv:2011.03892.
[10] Russell Impagliazzo and Ramamohan Paturi. On the complexity of k-sat. J. Comput. Syst. Sci., 62(2):367–375, 2001. URL: https://doi.org/10.1006/jcss.2000.1727, doi:10.1006/jcss.2000.1727.
[11] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which Problems Have Strongly Exponential Complexity? J. Comput. Syst. Sci., 63(4):512–530, 2001. URL: https://doi.org/10.1006/jcss.2001.1774, doi:10.1006/jcss.2001.1774.
[12] Ray Li. Improved SETH-hardness of unweighted Diameter. CoRR, abs/2008.05106, 2020. URL: https://arxiv.org/abs/2008.05106, arXiv:2008.05106.
[13] Ray Li. Settling SETH vs. Approximate Sparse Directed Unweighted Diameter (up to (NU)NSETH). CoRR, abs/2008.05106, 2020. URL: https://arxiv.org/abs/2008.05106, arXiv:2008.05106.
[14] Daniel Lokshtanov, Dániel Marx, and Saket Saurabh. Lower bounds based on the Exponential Time Hypothesis. Bull. EATCS, 105:41–72, 2011. URL: http://eatcs.org/beatcs/index.php/beatcs/article/view/92.
[15] Liam Roditty and Virginia Vassilevska Williams. Fast approximation algorithms for the diameter and radius of sparse graphs. In Dan Boneh, Tim Roughgarden, and Joan Feigenbaum, editors, Symposium on Theory of Computing Conference, STOC’13, Palo Alto, CA, USA, June 1-4, 2013, pages 515–524. ACM, 2013. URL: https://doi.org/10.1145/2488608.2488673, doi:10.1145/2488608.2488673.
[16] Aviad Rubinstein and Virginia Vassilevska Williams. SETH vs Approximation. SIGACT News, 50(4):57–76, 2019. URL: https://doi.org/10.1145/3374857.3374870, doi:10.1145/3374857.3374870.
[17] Ryan Williams. A new algorithm for optimal 2-constraint satisfaction and its implications. Theor. Comput. Sci., 348(2-3):357–365, 2005. URL: https://doi.org/10.1016/j.tcs.2005.09.023, doi:10.1016/j.tcs.2005.09.023.
[18] Virginia Vassilevska Williams. On some fine-grained questions in algorithms and complexity. In Proceedings of the ICM, volume 3, pages 3431–3472. World Scientific, 2018.

4 vs 7 sparse undirected unweighted Diameter is SETH-hard at time n4/3n^{4/3}

Abstract

keywords:

category:

1 Introduction

Lower bounds based on the Strong Exponential Time Hypothesis

SETH lower bounds for Diameter

Theorem 1.1.

Conjecture 1.2.

Barriers to SETH lower bounds

Techniques

Organization

2 Preliminaries

3 A simpler reduction with edge weights

3.1 Construction

Vertex set

Edge set

Vertex and edge count

3.2 The absence of orthogonal quadruple implies diameter at most 4

Lemma 3.1.

Proof 3.2.

Between u∈Iu\in I and v∈V​(G)v\in V(G)

Between u∈Pu\in P and v∈P∪Cv\in P\cup C

Between u∈Cu\in C and v∈T∪Cv\in T\cup C

Between u∈Tu\in T and v∈Tv\in T

Between u∈Tu\in T and v∈Pv\in P

3.3 The presence of orthogonal quadruple implies diameter at least 7

Observation 3.3.

Observation 3.4.

Case A: 𝒫\mathcal{P} visiting II

Case B: paths 𝒫\mathcal{P} within T∪C∪PT\cup C\cup P

4 Removing the weights

4.1 Unweighted construction

Addition to the vertex set

Edition of the edge set

New vertex and edge count

4.2 The absence of orthogonal quadruple implies diameter at most 4

Lemma 4.1.

Proof 4.2.

Lemma 4.3.

Proof 4.4.

Between u∈I∪I′u\in I\cup I^{\prime} and v∈V​(G′)v\in V(G^{\prime})

Between u∈P∪T′′u\in P\cup T^{\prime\prime} and v∈P∪C∪T′′∪T′v\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}

Between u∈Cu\in C and v∈T∪T′∪Cv\in T\cup T^{\prime}\cup C

Between u∈T∪T′u\in T\cup T^{\prime} and v∈T∪T′v\in T\cup T^{\prime}

Between u∈Tu\in T and v∈Pv\in P

Between u∈Tu\in T and v∈T′′v\in T^{\prime\prime}

4.3 The presence of orthogonal quadruple implies diameter at least 7

𝒫\mathcal{P} cannot visit I′I^{\prime}

𝒫\mathcal{P} cannot use any edge of E​(T′,C)E(T^{\prime},C)

Such a path 𝒫\mathcal{P} would also exist in GG

References

4 vs 7 sparse undirected unweighted Diameter is SETH-hard at time $n^{4/3}$

Between $u\in I$ and $v\in V(G)$

Between $u\in P$ and $v\in P\cup C$

Between $u\in C$ and $v\in T\cup C$

Between $u\in T$ and $v\in T$

Between $u\in T$ and $v\in P$

Case A: $\mathcal{P}$ visiting $I$

Case B: paths $\mathcal{P}$ within $T\cup C\cup P$

Between $u\in I\cup I^{\prime}$ and $v\in V(G^{\prime})$

Between $u\in P\cup T^{\prime\prime}$ and $v\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}$

Between $u\in C$ and $v\in T\cup T^{\prime}\cup C$

Between $u\in T\cup T^{\prime}$ and $v\in T\cup T^{\prime}$

Between $u\in T$ and $v\in P$

Between $u\in T$ and $v\in T^{\prime\prime}$

$\mathcal{P}$ cannot visit $I^{\prime}$

$\mathcal{P}$ cannot use any edge of $E(T^{\prime},C)$

Such a path $\mathcal{P}$ would also exist in $G$