This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Univ Lyon, CNRS, ENS de Lyon, Université Claude Bernard Lyon 1, LIP UMR5668, [email protected]://orcid.org/0000-0002-1653-5822 \CopyrightÉdouard Bonnet\ccsdesc[100]Theory of computation → Graph algorithms analysis\supplement\hideLIPIcs\EventEditorsJohn Q. Open and Joan R. Access \EventNoEds2 \EventLongTitle42nd Conference on Very Important Topics (CVIT 2016) \EventShortTitleCVIT 2016 \EventAcronymCVIT \EventYear2016 \EventDateDecember 24–27, 2016 \EventLocationLittle Whinging, United Kingdom \EventLogo \SeriesVolume42 \ArticleNo23

4 vs 7 sparse undirected unweighted Diameter is SETH-hard at time n4/3n^{4/3}

Édouard Bonnet
Abstract

We show, assuming the Strong Exponential Time Hypothesis, that for every ε>0\varepsilon>0, approximating undirected unweighted Diameter on nn-vertex mm-edge graphs within ratio 7/4ε7/4-\varepsilon requires m4/3o(1)m^{4/3-o(1)} time, even when m=O~(n)m=\tilde{O}(n). This is the first result that conditionally rules out a near-linear time 5/35/3-approximation for undirected Diameter.

keywords:
Diameter, inapproximability, SETH lower bounds, k-Orthogonal Vectors
category:

1 Introduction

The diameter of a graph is the length of a longest shortest path between two of its vertices. We write Diameter for the algorithmic task of computing the diameter of an input graph. Throughout the paper, nn implicitly denotes the number of vertices of a graph, and mm, its number of edges. We will often prefix Diameter with undirected/directed to indicate whether or not edges may be oriented111In directed Diameter, we are to compute the length of a longest shortest path taken from any vertex to any vertex., and unweighted/weighted to indicate whether or not non-negative edge weights are allowed.

A fairly recent and active line of work aims to determine the best runtime for an algorithm approximating Diameter within a given ratio. First, there is an exact algorithm running in time222where O~()\tilde{O}(\cdot) suppresses the polylogarithmic factors O~(mn)\tilde{O}(mn), which computes nn shortest-path trees from every vertex of the graph. Secondly, there is a 22-approximation running in time O~(m)\tilde{O}(m), which computes a shortest-path tree from an arbitrary vertex and outputs the largest distance found. There are an O~(m3/2)\tilde{O}(m^{3/2}) time 3/23/2-approximation for directed weighted Diameter [1, 15, 6], and for every non-negative integer kk, an O~(mn1k+1)\tilde{O}(mn^{\frac{1}{k+1}}) time (22k)(2-2^{-k})-approximation333with an extra additive factor depending on the weights for undirected weighted Diameter [4]. We refer the interested reader to the survey of Rubinstein and Vassilevska Williams [16].

We will now focus on sparse graphs, for which m=O~(n)m=\tilde{O}(n). This is because the current paper deals with conditional lower bounds on approximating Diameter, and all such results even work with that restriction. Observe that, on sparse graphs, the first result of the previous paragraph is a near-quadratic 1-approximation, while the second result is a near-linear 2-approximation. One can represent these ratio-runtime trade-offs in the two-dimensional plane. The ultimate goal of fine-grained complexity, in that particular context, is to obtain a complete curve of algorithms linking these two extreme points, matched by tight conditional lower bounds. We now present one way of deriving conditional lower bounds for polytime problems.

Lower bounds based on the Strong Exponential Time Hypothesis

The Strong Exponential Time Hypothesis (SETH, for short) asserts that for every ε>0\varepsilon>0, there is an integer kk such that kk-SAT cannot be solved in time (2ε)n(2-\varepsilon)^{n} on nn-variable instances [11]. At first glance, this assumption should only be useful to rule out some specific running time for NP-hard problems which, like the satisfiability problem, seems to require superpolynomial time. Such conditional lower bounds to classical [7] or parameterized algorithms [8] are overviewed in a survey [14] on the consequences of the SETH (as well as the weaker assumption ETH) on solving computationally hard problems.

Interestingly, using the SETH to rule out a given running time for a polynomial-time solvable problem took more time. In a survey of fine-grained complexity [18], Vassilevska Williams dates the first reduction (albeit used positively) from SAT to a problem in P back to 2005 [17]. We will see that this reduction to 2-Orthogonal Vectors, where one wants to find two orthogonal 0,10,1-vectors within a given list, is very relevant to the fine-grained complexity of Diameter. As it turns out, the first SETH-based lower bound for a polytime graph problem occurred almost a decade later, on the very unweighted undirected Diameter [15].

There might have been a psychological barrier in reducing a “hard” problem to an “easy” one, in order to derive a conditional lower bound. However this makes perfect sense. Let us give an apropos example. Suppose (as it is actually the case) that one can create in time O(n)O(n) a list of nn 0,1-vectors with n=O(2N/2)n=O(2^{N/2}), from an NN-variable SAT formula, such that there is pair of orthogonal vectors in the list if and only if the formula is satisfiable. Now a truly subquadratic algorithm, that is in time n2εn^{2-\varepsilon} for some ε>0\varepsilon>0, for 2-Orthogonal Vectors would enable to solve SAT in time O(2(1ε/2)N)=O((2δ)N)O(2^{(1-\varepsilon/2)N})=O((2-\delta)^{N}) for some δ>0\delta>0, contradicting the SETH. We thus say that 2-Orthogonal Vectors is SETH-hard at time n2n^{2}, and more generally a problem Π\Pi is SETH-hard at time TT if it requires time T1o(1)T^{1-o(1)} under the SETH.

SETH lower bounds for Diameter

There is a handful of SETH-hardness results on approximating Diameter [15, 2, 3, 9, 12, 13]. Unless the SETH fails, any 3/2ε3/2-\varepsilon-approximation for sparse undirected unweighted Diameter, with ε>0\varepsilon>0, requires time n2o(1)n^{2-o(1)} [15] (this is the above-mentioned seminal result to the fine-grained complexity within P), whereas any 5/3ε5/3-\varepsilon-approximation requires time n3/2o(1)n^{3/2-o(1)} [12] (an early version of [13]). Since a 5/35/3-approximation of Diameter running in near-linear time was consistent with the then knowledge (up until mid-August 2020, even in weighted directed graphs) Rubinstein and Vassilevska Williams [16] and Li [12] ask for such an algorithm or some lower bounds with a ratio closer to 2.

In the last few months, there were several developments on directed graphs. The author showed that, under the SETH, 7/4ε7/4-\varepsilon-approximating sparse directed weighted Diameter requires time n4/3o(1)n^{4/3-o(1)} [3]. Then Wein and Dalirrooyfard [9], and independently, Li [13] (an updated version of [12]) both show that not only this result holds on directed unweighted graphs but they generalize it in the following way: Unless the SETH fails, for every ε>0\varepsilon>0 and every integer k4k\geqslant 4, 2k1kε\frac{2k-1}{k}-\varepsilon-approximating directed unweighted Diameter requires time nkk1o(1)n^{\frac{k}{k-1}-o(1)}.

Despite these advances, a near-linear time 5/35/3-approximation for the undirected Diameter may still have existed. In this paper, we rule out this possibility by showing the following (see Figure 1 for a visual summary of what is now known on approximating undirected Diameter).

Theorem 1.1.

Unless the SETH fails, for any ε>0\varepsilon>0, 7/4ε7/4-\varepsilon-approximating Diameter on undirected unweighted nn-vertex O~(n)\tilde{O}(n)-edge graphs requires n4/3o(1)n^{4/3-o(1)} time.

In particular we resolve [16, Open Question 2.2.], on the existence of a near-linear time 5/35/3-approximation for undirected Diameter, by the negative.

1k+1k\frac{k+1}{k}6/55/44/33/22132\frac{3}{2}53\frac{5}{3}74\frac{7}{4}212k12-\frac{1}{2^{k-1}}22k1k\frac{2k-1}{k}approximation factor

runtime exponent

[15, RV13][13, Li20] [15, RV13][2, CLRSTV18][4, CGR16][13]
Figure 1: Approximability of sparse undirected unweighted Diameter. Blue areas are feasible, as witnessed by algorithms at bottom-left corners (blue dots). The red regions are SETH-hard, as witnessed by reductions at top-right corners (red dots). Dotted cyan areas are not SETH-hard, unless the NSETH fails. The current landscape for the sparse undirected weighted Diameter is the same, except the middle red region is due to Backurs et al. [2] instead of [13]. The axis-parallel black curve represents the tractability frontier as foreseen by 1.2.

In light of the recent results (see in particular the paragraph on barriers to SETH-hardness), it is reasonable to conjecture that the four variants of sparse Diameter (undirected/directed unweighted/weighted) are equally approximable. More precisely, we venture the following optimistic prediction.

Conjecture 1.2.

Sparse (un)directed (un)weighted Diameter is 21k2-\frac{1}{k}-approximable in time O~(nk+1k)\tilde{O}(n^{\frac{k+1}{k}}) for every k+{}k\in\mathbb{N}^{+}\cup\{\infty\}. Unless the SETH fails, approximating sparse (un)directed (un)weighted Diameter within ratio better than 21k+12-\frac{1}{k+1} requires time nk+1ko(1)n^{\frac{k+1}{k}-o(1)} for every k+k\in\mathbb{N}^{+}.

1.2 is naturally equivalent to obtaining the algorithms for the directed weighted Diameter and the SETH-hardness for the undirected unweighted Diameter. Settling the conjecture would give a complete landscape of the approximability of Diameter, where if one represents the results in the two-dimensional space of approximation factor vs runtime exponent, the feasible and infeasible regions are separated by a rectilinear curve with infinitely many corners (the black curve drawn in Figure 1). In that respect, our contribution is to give the third lower bound on the curve (i.e., North-East corner) after Roditty and Vassilevska Williams gave the first [15], and Li, the second [12]. Hopefully our new ideas (together with the recent constructions in the directed case of Wein and Dalirrooyfard [9], and Li [13]) will also help in generalizing the lower bound predicted by 1.2 to every positive integer kk.

Barriers to SETH lower bounds

1.2 is partly prompted by intriguing results due to Li [13]. To state them, we need to recall the definition of a strengthening of SETH introduced by Carmosino et al. [5]. It is called NSETH for Nondeterministic SETH. NSETH asserts that for every ε>0\varepsilon>0, there is an integer kk such that the kk-Taut problem cannot be solved in non-deterministic time (2ε)n(2-\varepsilon)^{n}, where kk-Taut asks, given a kk-DNF formula whether every truth assignment satisfies it (in other words, if it is a tautology). Li shows, for all four variants of Diameter but the directed weighted one, that no point positioned strictly above the rectilinear black curve of Figure 1 can be shown SETH-hard, under the NSETH (and, if randomized reductions are permitted, under a stronger assumption, called NUNSETH for Non-Uniform NSETH).

1.2 is very optimistic since it predicts that every such point will be explained by an algorithm. There are many alternatives to that event. For instance NSETH could be false444If we are totally honest, even the weaker SETH does not gather such a wide consensus, and is false if quantum computation is allowed., or the intractability region could extend further North via a non SETH-based reduction, or via a deterministic SETH-based reduction in the directed weighted case. Besides it would require significant progress in approximating the sparse directed Diameter, when currently no algorithm running in time n32εn^{\frac{3}{2}-\varepsilon} achieves approximation factor better than 2.

The second half of 1.2 shown for every k4k\geqslant 4 on directed graphs [9, 13], and for k=1,2,3k=1,2,3 on undirected graphs, is much easier to believe in.

Techniques

Like every mentioned Diameter lower bound (for more details, see the paragraphs on kk-OV and Diameter in the surveys [18, 16]), we reduce from kk-Orthogonal Vectors, where one seeks, in a given set of NN 0,10,1-vectors of dimension \ell, kk vectors such that at every index, at least one of these kk vectors has a 0 entry. Under the SETH, kk-Orthogonal Vectors requires time Nko(1)N^{k-o(1)} [17], even when \ell is polylogarithmic in NN.

Here we will reduce from 4-Orthogonal Vectors. We thus wish to build a graph on O~(N3)\tilde{O}(N^{3}) vertices and edges with diameter 7 if there is an orthogonal quadruple (i.e., a solution to the the 4-Orthogonal Vectors instance), and diameter 4 otherwise. Following a reduction to STST-Diameter555where one seeks the length of a longest shortest path from a vertex of SS to a vertex of TT by Backurs et al. [2] (arguably also following [15]) most of the reductions (as in [3, 9, 13]) feature layers L0,L1,,Lk1,LkL_{0},L_{1},\ldots,L_{k-1},L_{k}, with only (forward) edges between two consecutive LiL_{i}. The vertices within the same layer share the same number of “vector attributes” and “index attributes”. The interplay between vector and index attributes in defining the vertices and edges is made so that if there are no kk orthogonal vectors, then there are paths of “optimal” length kk between every pair in L0×LkL_{0}\times L_{k}, whereas if there is set XX of kk orthogonal vectors, a pair in L0×LkL_{0}\times L_{k} jointly encoding XX is suddenly very far apart (usually and ideally at distance 2k12k-1).

Then the challenge is to make sure that, on NO-instances, the other pairs (not in L0×LkL_{0}\times L_{k}) are at distance at most kk, without destroying the previous property. The core of our reduction is similar to our previous construction for directed weighted Diameter [3]. However we simplify and streamline it in the following way. As in the first construction of Li [12], we collapse some layers into one. We will have L0=L4(=T)L_{0}=L_{4}(=T) and L1=L3(=C)L_{1}=L_{3}(=C), while L2L_{2} is called PP. This makes the case analyses simpler (fewer kinds of pairs to consider).

At this point, we face the same issue as in [3]: There are pairs in T×PT\times P that are too far apart. On directed graphs, this can be fixed by adding parallel layers and appropriate “back” edges [3, 9] or simply “back” edges [13]. This is no longer an option. Instead we add a set II of vertices with only index attributes. These vertices link the right pairs of T×PT\times P with path of length 4 (we are back to the first variation on the theme [15]). To emphasize that the situation is somewhat delicate, we observe that not all the pairs of T×PT\times P can be at distance 4, since otherwise every pair in T×TT\times T is at distance at most 6. We set II at distance 3 of TT (by initially putting edges of weight 3). This permits to cliquify II without creating TTTT-paths of length at most 6. In turn, this puts every pair involving II at distance at most 4, as well as pairs of (CP)×P(C\cup P)\times P. Note that as long as d(T,X)+d(T,Y)3d(T,X)+d(T,Y)\geqslant 3 (or k1k-1), one can have all the pairs of X×YX\times Y at distance 4 (or kk), without creating undesired TTTT-paths of length at most 6 (or 2k22k-2).

We then remove the weight-3 edges between TT and II. This involves some vertex splits transforming TT into T,T,T′′T,T^{\prime},T^{\prime\prime}, and a simpler echo of the idea of having the clique II, with a clique II^{\prime} connecting appropriately the pairs in T×T′′T\times T^{\prime\prime}.

Organization

In Section 2, we recall graph-theoretic notations, and give the relevant background on the Orthogonal Vectors problem. In Section 3, we present a simpler reduction with edge weights. It thus achieves the statement of Theorem 1.1 for sparse undirected weighted Diameter. In Section 4, we tune this reduction to get rid of the edge weights, and establish Theorem 1.1.

2 Preliminaries

We use standard graph-theoretic notations. If GG is a graph, V(G)V(G) denotes its vertex set, and E(G)E(G), its edge set. We denote the edge set between XV(G)X\subseteq V(G) and YV(G)Y\subseteq V(G) by E(X,Y)E(X,Y). If SV(G)S\subseteq V(G), G[S]G[S] denotes the subgraph of GG induced by SS. Weighted graphs have positive edge weights. (Throughout the paper, we will only need edges of weight 1 and 3.) We exclusively deal with undirected graphs (for which the distance function is symmetric). For u,vV(G)u,v\in V(G), dG(u,v)d_{G}(u,v) denotes the distance between uu and vv in GG, that is, the number of edges in a shortest path between uu and vv. For every positive integer rr and every vertex uV(G)u\in V(G), NGr[u]N^{r}_{G}[u] denotes the set of vertices vv such that dG(u,v)rd_{G}(u,v)\leqslant r. In unweighted graphs, the closed neighborhood of uu, denoted NG[u]N_{G}[u], coincides with NG1[u]N^{1}_{G}[u]. However in a weighted graph NG1[u]N^{1}_{G}[u] would for instance not contain the neighbors of uu via an edge of weight greater than 1. This subtlety will arise only once, and we will remind the reader in due time. For every positive integer rr and every vertex SV(G)S\subseteq V(G), NGr[S]N^{r}_{G}[S] denotes the set of vertices vV(G)v\in V(G) such that dG(u,v)rd_{G}(u,v)\leqslant r for some uSu\in S. We observe that, in unweighted graphs, NGr[S]N^{r}_{G}[S] coincides with NG[NG[NG[NG[S]]]]N_{G}[N_{G}[\cdots N_{G}[N_{G}[S]]\cdots]] where NG[]N_{G}[\cdot] is applied rr times and NG[S]N_{G}[S] is the closed neighborhood of SS. We drop the subscript in the above notations, if the graph GG is clear from the context.

We denote by diam(G)\text{diam}(G) the diameter of GG, that is, maxu,vV(G)d(u,v)\max_{u,v\in V(G)}d(u,v). The Diameter problem asks, given a graph GG, for the value of diam(G)\text{diam}(G). We call uvuv-path, a path going from vertex uu to vertex vv, and STST-path (with possibly S=TS=T), any path going from some vertex uSu\in S to some vertex vTv\in T.

If \ell is a positive integer, [][\ell] denotes the set {1,2,,}\{1,2,\ldots,\ell\}. If vv is a vector and ii is a positive integer, then v[i]v[i] denotes the ii-th coordinate of vv. We use maj(a1,,ah)\text{maj}(a_{1},\ldots,a_{h}) to denote the value with the largest number of occurrences in the tuple (a1,,ah)(a_{1},\ldots,a_{h}).

For every fixed positive integer kk, the kk-Orthogonal Vectors (kk-OV for short) problem is as follows. It asks, given a set SS of 0,1-vectors in {0,1}\{0,1\}^{\ell}, if there are kk vectors v1,,vkSv_{1},\ldots,v_{k}\in S such that for every i[]i\in[\ell], Πh[k]vh[i]=0\Pi_{h\in[k]}v_{h}[i]=0, or equivalently, v1[i]=v2[i]==vk[i]=1v_{1}[i]=v_{2}[i]=\cdots=v_{k}[i]=1 does not hold. Williams [17] showed that, assuming the SETH, kk-OV requires Nko(1)N^{k-o(1)} time with N:=|S|N:=|S|. Furthermore, using the Sparsification Lemma [10], this lower bound holds even when, say, =log2N\ell=\lceil\log^{2}N\rceil. Here we will leverage this lower bound for k=4k=4. This is, in the context of the SETH-hardness of approximating Diameter, a usual opening step: For example, Roditty and Vassilevska Williams [15] uses this lower bound for k=2k=2, Li [12], for k=3k=3 and general k3k\geqslant 3, the author [3], for k=4k=4, Wein and Dalirrooyfard [9], for general k5k\geqslant 5 and k=4k=4.

3 A simpler reduction with edge weights

From any set SS of NN vectors in {0,1}\{0,1\}^{\ell}, we build an undirected weighted graph G=ρ(S)G=\rho(S) (with edge weights 1 and 3, only) with O(N3+N23+5)O(N^{3}+N^{2}\ell^{3}+\ell^{5}) vertices and O(N35+N26+10)O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10}) edges such that if SS admits an orthogonal quadruple then the diameter of GG is (at least) 7, whereas if SS has no orthogonal quadruple then the diameter of GG is (at most) 4. We recall that 4-OV requires N4o(1)N^{4-o(1)} time, unless the SETH fails, even when =log2N\ell=\lceil\log^{2}N\rceil [17]. In that case, the graph GG has O(N3)O(N^{3}) vertices and O~(N3)\tilde{O}(N^{3}) edges. Hence any algorithm approximating sparse undirected weighted Diameter within ratio better than 7/47/4 in time n4/3δn^{4/3-\delta}, with δ>0\delta>0, would refute the SETH.

3.1 Construction

We first describe the vertex set of GG, then its edge set, and finally check that the number of vertices and edges are as announced.

Vertex set

Every vertex of GG is the concatenation of a possibly empty tuple of vectors of SS, called vector tuple, followed by a possibly empty tuple of possibly equal indices of [][\ell], called index tuple. Each coordinate of the vector tuple is called a vector field, while each coordinate of the index tuple is called an index field. The set V(G)V(G) is partitioned into four sets: TT (for triples), CC (for couples), PP (for pairs), and II (for indices). The names behind T,C,PT,C,P reflect the number and the nature (ordered or unordered) of the vector fields. Each of these sets comprise vertices with up to three vector fields and five index fields. They are defined in the following way.

  • TT: for every {a,b,c}(S3)\{a,b,c\}\in{S\choose 3}, we add vertex (a,b,c)(a,b,c) to TT. Thus vertices of TT have three vector fields and no index field.

  • CC: for every {a,b}(S2)\{a,b\}\in{S\choose 2} and i,j,k[]i,j,k\in[\ell] such that a[i]=a[j]=a[k]=1a[i]=a[j]=a[k]=1 and maj(b[i],b[j],b[k])=1\text{maj}(b[i],b[j],b[k])=1, we add vertex (a,b,i,j,k)(a,b,i,j,k) to CC. Thus vertices of CC have two vector fields and three index fields.

  • PP: for every a,bSa,b\in S and i,j,k[]i,j,k\in[\ell] such that a[i]=a[j]=a[k]=1a[i]=a[j]=a[k]=1 and b[i]=b[j]=b[k]=1b[i]=b[j]=b[k]=1, we add vertex ({a,b},i,j,k)(\{a,b\},i,j,k) to PP. We will still see aa and bb as filling the two vector fields of the vertex, without a first vector field and a second vector field. Contrary to vertices of CC, ({a,b},i,j,k)(\{a,b\},i,j,k) and ({b,a},i,j,k)(\{b,a\},i,j,k) are two names for the same vertex (whereas (a,b,i,j,k)(a,b,i,j,k) and (b,a,i,j,k)(b,a,i,j,k) are two distinct vertices, whose existence implies slightly different properties). Thus vertices of PP also have two vector fields and three index fields. Note also that {a,b}\{a,b\} is a multiset, since aa may be equal to bb.

  • II: for every p1,p2,i,j,k[]p_{1},p_{2},i,j,k\in[\ell], we add vertex (p1,p2,i,j,k)(p_{1},p_{2},i,j,k) to II. The chosen labels for the five index fields anticipate that, to build the edge set, it is convenient to imagine a separation after the first two index fields of the tuple. The vertices of II have no vector field and five index fields.

Edge set

We will put some edges between TT and CC, CC and PP, PP and II, and II and TT. In addition, we put index-switching edges within II and within CC. An index-switching edge is between two vertices of the same set (II or CC) with the same vector tuple (which is always the case in II) and distinct index tuples. The only edges with a weight different than 1 are the edges between II and TT, which all have weight 3. Thus, unless specified otherwise, an edge has weight 1.

The total list of edges is as follows.

  • We add all the index-switching edges within II and CC. Thus G[I]G[I] is a clique and G[C]G[C] is a disjoint union of at most (|S|2){|S|\choose 2} cliques (while G[T]G[T] and G[P]G[P] remain independent sets). More explicitly, we have an edge between every pair of distinct vertices (p1,p2,i,j,k)I(p_{1},p_{2},i,j,k)\in I and (p1,p2,i,j,k)I(p^{\prime}_{1},p^{\prime}_{2},i^{\prime},j^{\prime},k^{\prime})\in I, and for every abSa\neq b\in S between every pair of distinct vertices (a,b,i,j,k)C(a,b,i,j,k)\in C and (a,b,i,j,k)C(a,b,i^{\prime},j^{\prime},k^{\prime})\in C.

  • E(T,C)E(T,C): We add an edge between every (a,b,c)T(a,b,c)\in T and (a,b,i,j,k)C(a,b,i,j,k)\in C provided that there is an h{i,j,k}h\in\{i,j,k\} such that b[h]=c[h]=1b[h]=c[h]=1.

  • E(C,P)E(C,P): We add an edge between every (a,b,i,j,k)C(a,b,i,j,k)\in C and ({c,d},i,j,k)P(\{c,d\},i,j,k)\in P whenever a{c,d}a\in\{c,d\}.

  • E(T,I)E(T,I): We add an edge of weight 3 between every (a,b,c)T(a,b,c)\in T and (p1,p2,i,j,k)I(p_{1},p_{2},i,j,k)\in I whenever a[p1]=b[p1]=c[p1]=a[p2]=b[p2]=c[p2]=1a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1.

  • E(I,P)E(I,P): We add an edge between every (p1,p2,i,j,k)I(p_{1},p_{2},i,j,k)\in I and ({a,b},i,j,k)P(\{a,b\},i,j,k)\in P whenever a[p1]=b[p2]=1a[p_{1}]=b[p_{2}]=1 or a[p2]=b[p1]=1a[p_{2}]=b[p_{1}]=1.

This ends the construction. See Figure 2 for an illustration.

(a,b,c)(a,b,c)(a,b,i,j,k)(a,b,i,j,k)(a,b,i,j,k)(a,b,i^{\prime},j^{\prime},k^{\prime})({d,e},i,j,k)(\{d,e\},i,j,k)(p1,p2,i,j,k)(p_{1},p_{2},i,j,k)(p1,p2,i,j,k)(p^{\prime}_{1},p^{\prime}_{2},i^{\prime},j^{\prime},k^{\prime})𝐚[𝐢]=𝐚[𝐣]=𝐚[𝐤]=𝟏\mathbf{a[i]=a[j]=a[k]=1}maj(𝐛[𝐢],𝐛[𝐣],𝐛[𝐤])=𝟏\mathbf{(b[i],b[j],b[k])=1}𝐝[𝐢]=𝐝[𝐣]=𝐝[𝐤]=𝐞[𝐢]=𝐞[𝐣]=𝐞[𝐤]=𝟏\mathbf{d[i]=d[j]=d[k]=e[i]=e[j]=e[k]=1}𝐡{𝐢,𝐣,𝐤},\mathbf{\exists h\in\{i,j,k\},}𝐜[𝐡]=𝐛[𝐡]=𝟏\mathbf{c[h]=b[h]=1}𝐚{𝐝,𝐞}\mathbf{a\in\{d,e\}}𝐚[𝐩𝟏]=𝐛[𝐩𝟏]=𝐜[𝐩𝟏]=\mathbf{a[p_{1}]=b[p_{1}]=c[p_{1}]=}𝐚[𝐩𝟐]=𝐛[𝐩𝟐]=𝐜[𝐩𝟐]=𝟏\mathbf{a[p_{2}]=b[p_{2}]=c[p_{2}]=1}3𝐝[𝐩𝟏]=𝐞[𝐩𝟐]=𝟏\mathbf{d[p_{1}]=e[p_{2}]=1}or𝐝[𝐩𝟐]=𝐞[𝐩𝟏]=𝟏\mathbf{d[p_{2}]=e[p_{1}]=1}TTCCPPII(3,0)(3,0)(2,3)(2,3)(2,3)(2,3)(0,5)(0,5)
Figure 2: The weighted construction GG. In bold, the conditions for the existence of a vertex or of an edge. The edge in blue, and more generally every edge of E(T,I)E(T,I), has weight 3, while all other edges have weight 1. The pairs in red recall, for vertices of the corresponding set, the length of their vector tuple followed by the length of their index tuple.

Vertex and edge count

There are O(N3)O(N^{3}) vertices in TT, O(N23)O(N^{2}\ell^{3}), in CPC\cup P, and 5\ell^{5}, in II, hence O(N3+N23+5)=O(N3)O(N^{3}+N^{2}\ell^{3}+\ell^{5})=O(N^{3}) in total. There are O(N33)O(N^{3}\ell^{3}) edges in E(T,C)E(C,P)E(T,C)\cup E(C,P), O(N26)O(N^{2}\ell^{6}), in E(C)E(C), O(N35)O(N^{3}\ell^{5}), in E(T,I)E(T,I), O(N25)O(N^{2}\ell^{5}), in E(I,P)E(I,P), and O(10)O(\ell^{10}) in E(I)E(I), hence O(N35+N26+10)=O~(N3)O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10})=\tilde{O}(N^{3}) edges in total. Furthermore GG can be built in time O~(N3)\tilde{O}(N^{3}).

3.2 The absence of orthogonal quadruple implies diameter at most 4

Assuming that there is no orthogonal quadruple, we show that every pair of vertices of GG is at distance at most 4. For that we repeatedly use that, for every a,b,c,dSa,b,c,d\in S, ind(a,b,c,d):=min{i[]\text{ind}(a,b,c,d):=\min\{i\in[\ell] || a[i]=b[i]=c[i]=d[i]=1}a[i]=b[i]=c[i]=d[i]=1\} is a well-defined index in [][\ell]. We only take the minimum index to have a deterministic notation, but there is nothing particular with it, and any index of the non-empty {i[]\{i\in[\ell] || a[i]=b[i]=c[i]=d[i]=1}a[i]=b[i]=c[i]=d[i]=1\} would work all the same.

We first observe that every vertex is at distance at most 3 from II.

Lemma 3.1.

N1[I]IPN^{1}[I]\supseteq I\cup P, N2[I]IPCN^{2}[I]\supseteq I\cup P\cup C, and N3[I]=V(G)N^{3}[I]=V(G).

Proof 3.2.

The first and second inclusions are actually equalities but we will not need those facts. N1[I]IPN^{1}[I]\supseteq I\cup P since every ({a,b},i,j,k)P(\{a,b\},i,j,k)\in P is adjacent (with an edge of weight 1) to (i,i,i,j,k)I(i,i,i,j,k)\in I. Then, N2[I]N1[IP]IPCN^{2}[I]\supseteq N^{1}[I\cup P]\supseteq I\cup P\cup C since every (a,b,i,j,k)C(a,b,i,j,k)\in C is adjacent to ({a,a},i,j,k)P(\{a,a\},i,j,k)\in P. Finally, N3[I]N1[IPC]=V(G)N^{3}[I]\supseteq N^{1}[I\cup P\cup C]=V(G) since every (a,b,c)T(a,b,c)\in T is adjacent to (a,b,i,i,i)C(a,b,i,i,i)\in C for some i[]i\in[\ell], for otherwise a,b,ca,b,c is an orthogonal triple.

We now exhibit paths of length at most 4 between every pair of vertices of GG. For the case disjunction, initially imagine the K4K_{4} with loops on vertices T,C,P,IT,C,P,I, where edges correspond to kinds of pairs that are left to check. The following paragraphs remove all its edges in the order: all edges incident to II, all remaining edges incident to PP but TPTP, all remaining edges incident to CC, the loop on TT, and finally the edge TPTP.

Between uIu\in I and vV(G)v\in V(G)

As G[I]G[I] is a clique and, by Lemma 3.1, N3[I]=V(G)N^{3}[I]=V(G), every vertex uIu\in I is at distance at most 4 from every vertex vV(G)v\in V(G).

Between uPu\in P and vPCv\in P\cup C

For every uPu\in P, N2[u]IN^{2}[u]\supset I and so N4[u]PCN^{4}[u]\supset P\cup C, by Lemma 3.1. In particular there is a path of length at most 4 between uu and any vertex vPCv\in P\cup C.

Between uCu\in C and vTCv\in T\cup C

Let (a,b)(a,b) be the two vector fields of uCu\in C, (c,d)(c,d) be the first two vector fields of vTCv\in T\cup C, and ee be the third vector field of vv if vTv\in T. Let i=ind(a,b,c,d)i=\text{ind}(a,b,c,d), j=ind(a,c,d,e)j=\text{ind}(a,c,d,e) if vTv\in T, and j=ij=i if vCv\in C. We observe that (a,b,i,i,j),({a,c},i,i,j),(c,d,i,i,j)(a,b,i,i,j),(\{a,c\},i,i,j),(c,d,i,i,j) are (existing) vertices of CC, PP, and CC, respectively, and that u(a,b,i,i,j)({a,c},i,i,j)(c,d,i,i,j)u-(a,b,i,i,j)-(\{a,c\},i,i,j)-(c,d,i,i,j) is a path of length 3 in GG. The existence of these vertices is implied by a[i]=b[i]=c[i]=d[i]=1a[i]=b[i]=c[i]=d[i]=1, a[j]=c[j]=d[j]=1a[j]=c[j]=d[j]=1. The first edge of the path is an index-switching edge within CC. The existence of the other edges is implied by a{a,c}a\in\{a,c\}, c{a,c}c\in\{a,c\}, and the fact that the index tuple (i,i,j)(i,i,j) does not change.

Finally if vCv\in C, then the index-switching edge (c,d,i,i,j)v(c,d,i,i,j)-v completes the uvuv-path of length 4. If instead vTv\in T, then the edge (c,d,i,i,j)(c,d,e)=v(c,d,i,i,j)-(c,d,e)=v completes the uvuv-path of length 4. This edge exists since d[j]=e[j]=1d[j]=e[j]=1.

Between uTu\in T and vTv\in T

Let u=(a,b,c),v=(d,e,f)Tu=(a,b,c),v=(d,e,f)\in T, i=ind(a,b,c,d)i=\text{ind}(a,b,c,d), j=ind(a,b,d,e)j=\text{ind}(a,b,d,e) and k=ind(a,d,e,f)k=\text{ind}(a,d,e,f). Then u=(a,b,c)(a,b,i,j,k)({a,d},i,j,k)(d,e,i,j,k)(d,e,f)=vu=(a,b,c)-(a,b,i,j,k)-(\{a,d\},i,j,k)-(d,e,i,j,k)-(d,e,f)=v is a path of length 4 in GG. These vertices exist since aa and dd have value 1 on indices i,j,ki,j,k, bb, on indices i,ji,j, and ee, on indices j,kj,k. The first edge exists since b[i]=c[i]=1b[i]=c[i]=1, the next two edges exist for similar reasons as invoked in the previous paragraph, and the fourth edge exists since e[k]=f[k]=1e[k]=f[k]=1.

Between uTu\in T and vPv\in P

Let u=(a,b,c)Tu=(a,b,c)\in T and v=({d,e},i,j,k)Pv=(\{d,e\},i,j,k)\in P. We set p1=ind(a,b,c,d)p_{1}=\text{ind}(a,b,c,d), p2=ind(a,b,c,e)p_{2}=\text{ind}(a,b,c,e), and exhibit a uvuv-path of length 4 via II. Indeed u=(a,b,c)(p1,p2,i,j,k)({d,e},i,j,k)=vu=(a,b,c)-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)=v is a path of length 4 in GG (recall that the first edge of the path has weight 3). Edge (a,b,c)(p1,p2,i,j,k)E(T,I)(a,b,c)-(p_{1},p_{2},i,j,k)\in E(T,I) exists since a[p1]=b[p1]=c[p1]=a[p2]=b[p2]=c[p2]=1a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1. Edge (p1,p2,i,j,k)({d,e},i,j,k)E(I,P)(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)\in E(I,P) exists since d[p1]=e[p2]=1d[p_{1}]=e[p_{2}]=1 and the three last indices (i,j,k)(i,j,k) remain unchanged.

3.3 The presence of orthogonal quadruple implies diameter at least 7

Let a,b,c,dSa,b,c,d\in S be an orthogonal quadruple, that is, such that there is no index i[]i\in[\ell] satisfying a[i]=b[i]=c[i]=d[i]=1a[i]=b[i]=c[i]=d[i]=1. We may further assume that a,b,c,da,b,c,d are all distinct since checking for an orthogonal triple can be done in time O~(N3)\tilde{O}(N^{3}). We will now show that there is no path 𝒫\mathcal{P} of length at most 6 between u=(a,b,c)Tu=(a,b,c)\in T and v=(d,c,b)Tv=(d,c,b)\in T.

Since the distance between every pair of vertices in T×IT\times I is at least 3, a TTTT-path of length at most 6 cannot contain an edge of the clique G[I]G[I], nor more generally intersects II at least twice. We thus distinguish two cases: (case A) 𝒫\mathcal{P} visits II exactly once, and (case B) 𝒫\mathcal{P} remains within TCPT\cup C\cup P. Before proving that no uvuv-path 𝒫\mathcal{P} of length at most 6 visits II, thereby ruling out case A, we state a couple of useful observations.

Observation 3.3.

There is at most one path of length 2 between ({d,e},i,j,k)P(\{d,e\},i,j,k)\in P and (a,b,c)T(a,b,c)\in T, namely ({d,e},i,j,k)(a,b,i,j,k)(a,b,c)(\{d,e\},i,j,k)-(a,b,i,j,k)-(a,b,c), which in particular implies that a{d,e}a\in\{d,e\}.

More basically, the only neighbors of (a,b,c)T(a,b,c)\in T (at distance 1, so not in II) are of the form (a,b,i,j,k)C(a,b,i,j,k)\in C. We can generalize this observation to paths contained in TCT\cup C.

Observation 3.4.

For every path within G[TC]G[T\cup C], all the vertices of the path have the same first two vector fields.

Case A: 𝒫\mathcal{P} visiting II

As 𝒫\mathcal{P} cannot visit II twice, if it visits II then it has length exactly 6 and is one of the following kinds: (case 1) TITT-I-T, (case 2) TCPIPCTT-C-P-I-P-C-T, or (case 3) TIPCTT-I-P-C-T (recall that the edges in E(I,T)E(I,T) have weight 3). An important feature of such paths is that no index-switching edge can be used, thus the three last index fields (when they exist) have to remain the same.

Case 1. A path (a,b,c)(p1,p2,i,j,k)(d,c,b)(a,b,c)-(p_{1},p_{2},i,j,k)-(d,c,b) would in particular imply that a[p1]=b[p1]=c[p1]=d[p1]=1a[p_{1}]=b[p_{1}]=c[p_{1}]=d[p_{1}]=1, contradicting the orthogonality of a,b,c,da,b,c,d.

Case 2. By 3.3 applied to both ends of the path, 𝒫\mathcal{P} is of the form (a,b,c)(a,b,i,j,k)({a,e},i,j,k)(p1,p2,i,j,k)({d,f},i,j,k)(d,c,i,j,k)(d,c,b)(a,b,c)-(a,b,i,j,k)-(\{a,e\},i,j,k)-(p_{1},p_{2},i,j,k)-(\{d,f\},i,j,k)-(d,c,i,j,k)-(d,c,b) with some e,fSe,f\in S. The existence of the vertices (a,b,i,j,k),(d,c,i,j,k)C(a,b,i,j,k),(d,c,i,j,k)\in C implies that a[i]=a[j]=a[k]=d[i]=d[j]=d[k]=1a[i]=a[j]=a[k]=d[i]=d[j]=d[k]=1, and that bb and cc have value 1 on at least two indices (with multiplicity) of multiset {i,j,k}\{i,j,k\}. In particular, there is an h{i,j,k}h\in\{i,j,k\} such that a[h]=b[h]=c[h]=d[h]=1a[h]=b[h]=c[h]=d[h]=1, a contradiction to a,b,c,da,b,c,d being orthogonal.

Case 3. By 3.3 applied to the second half of the path, 𝒫\mathcal{P} has then the form (a,b,c)(p1,p2,i,j,k)({d,e},i,j,k)(d,c,i,j,k)(d,c,b)(a,b,c)-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)-(d,c,i,j,k)-(d,c,b). The first three vertices yield a contradiction. Indeed, the existence of edge (p1,p2,i,j,k)({d,e},i,j,k)(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k) implies that d[pz]=1d[p_{z}]=1 for some z{1,2}z\in\{1,2\}, while the existence of (a,b,c)(p1,p2,i,j,k)(a,b,c)-(p_{1},p_{2},i,j,k) implies that a[pz]=b[pz]=c[pz]=1a[p_{z}]=b[p_{z}]=c[p_{z}]=1.

Case B: paths 𝒫\mathcal{P} within TCPT\cup C\cup P

We now consider paths 𝒫\mathcal{P} in G[TCP]G[T\cup C\cup P]. Since ada\neq d, 𝒫\mathcal{P} has to visit PP, since otherwise the first vector field cannot change, by 3.4. We then observe that no shortest uvuv-path visits TT a third time (one more time than the two endpoints uu and vv). A TTTT-path visiting TT a third time would contain a segment CTCC-T-C that can be shortcut into CCC-C. Indeed, (a,b,i,j,k)(a,b,c)(a,b,i,j,k)(a,b,i,j,k)-(a,b,c)-(a,b,i^{\prime},j^{\prime},k^{\prime}) has a chord (a,b,i,j,k)(a,b,i,j,k)(a,b,i,j,k)-(a,b,i^{\prime},j^{\prime},k^{\prime}) which is an index-switching edge of CC.

We further distinguish two cases: (case 1) 𝒫\mathcal{P} does not contain any index-switching edge, or (case 2) 𝒫\mathcal{P} contains at least one index-switching edge.

Case 1. In that case, 𝒫\mathcal{P} is of the form TCPCTT-C-P-C-T or TCPCPCTT-C-P-C-P-C-T. Either way, we consider the unique neighbors of uu and vv in 𝒫\mathcal{P}. These neighbors have to be (a,b,i,j,k)C(a,b,i,j,k)\in C and (d,c,i,j,k)C(d,c,i,j,k)\in C for some i,j,k[]i,j,k\in[\ell]. Indeed no index-switching edge nor return to TT is allowed here. Thus we conclude as in case A.2.

Case 2. We now assume that 𝒫\mathcal{P} contains at least one index-switching edge (of CC). In that case, as 𝒫\mathcal{P} has length at most 6, it can visit PP only once. Hence 𝒫\mathcal{P} is of the kind TCCPCCTT-C-C-P-C-C-T, where one of the two edges CCC-C is optional. We consider the last vertex uCu^{\prime}\in C before visiting PP, and the first vertex vCv^{\prime}\in C after visiting PP. There is, by design, no index-switching edge between uu^{\prime} and vv^{\prime} on path 𝒫\mathcal{P}. Thus by 3.4, there are i,j,k[]i,j,k\in[\ell] such that u=(a,b,i,j,k)u^{\prime}=(a,b,i,j,k) and v=(d,c,i,j,k)v^{\prime}=(d,c,i,j,k). We then conclude as in case A.2.

4 Removing the weights

So far we showed the announced lower bound for sparse undirected weighted Diameter. We show how to tune the previous construction to get the same lower bound for sparse undirected unweighted Diameter. The weighted graph GG had only non-trivial edge weights in E(T,I)E(T,I). We now describe how to replace these weighted edges, to get an unweighted graph G=ρ(S)G^{\prime}=\rho^{\prime}(S).

4.1 Unweighted construction

We start with a short summary of the changes. We will replace TT by three copies T,T,T′′T,T^{\prime},T^{\prime\prime} with an induced perfect matching between TT and TT^{\prime}, and between TT^{\prime} and T′′T^{\prime\prime}. We link T′′T^{\prime\prime} to II as we linked TT to II, and TT and CC, and TT^{\prime} and CC, as we linked TT and CC. We finally add a set II^{\prime} of vertices with empty vector tuple (like II) that we link to T′′T^{\prime\prime} and II only.

Addition to the vertex set

We add three sets to V(G)V(G) to get V(G)V(G^{\prime}): two identical copies of TT, denoted by TT^{\prime} and T′′T^{\prime\prime}, and a set II^{\prime} isomorphic to [][\ell]. More precisely, for every i[]i\in[\ell], we add vertex (i)(i) to II^{\prime}. Thus II^{\prime} has no vector field and a unique index field. We use a subscript to distinguish the homologous vertices in T,T,T′′T,T^{\prime},T^{\prime\prime}. Vertices (a,b,c)TT,(a,b,c)TT,(a,b,c)T′′T′′(a,b,c)_{T}\in T,(a,b,c)_{T^{\prime}}\in T^{\prime},(a,b,c)_{T^{\prime\prime}}\in{T^{\prime\prime}} are the three vertices of GG^{\prime} corresponding to the same vertex (a,b,c)(a,b,c) of GG.

Edition of the edge set

We first remove the edges of GG with weight 3 (between TT and II). For every {a,b,c}(S3)\{a,b,c\}\in{S\choose 3}, we add the edges (a,b,c)T(a,b,c)T(a,b,c)_{T}-(a,b,c)_{T^{\prime}} and (a,b,c)T(a,b,c)T′′(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}. We also add edges between TT^{\prime} and CC, the same way we have defined edges between TT and CC. That is, (a,b,c)T(a,b,i,j,k)(a,b,c)_{T^{\prime}}-(a,b,i,j,k) is an edge if and only if (a,b,c)T(a,b,i,j,k)(a,b,c)_{T}-(a,b,i,j,k) is an edge. Let us recall that the existence of this edge (and of its endpoint in CC) implies that a,b,ca,b,c have value 1 on indices {i,j,k}\{i,j,k\}, three times, at least twice, and at least once, respectively, and that there is an h{i,j,k}h\in\{i,j,k\} such that a[h]=b[h]=c[h]a[h]=b[h]=c[h].

We add edges (of weight 1) between T′′T^{\prime\prime} and II, the same way we defined the weight-3 edges of GG between TT and II. Thus there is an edge (a,b,c)T′′(p1,p2,i,j,k)(a,b,c)_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k) whenever a[p1]=b[p1]=c[p1]=a[p2]=b[p2]=c[p2]=1a[p_{1}]=b[p_{1}]=c[p_{1}]=a[p_{2}]=b[p_{2}]=c[p_{2}]=1. We further add an edge between (i)I(i)\in I^{\prime} and (a,b,c)T′′(a,b,c)_{T^{\prime\prime}} whenever a[i]=b[i]=1a[i]=b[i]=1. Finally we add all the index-switching edges in II^{\prime}, and we make II and II^{\prime} fully adjacent, that is, we turn G[II]G^{\prime}[I^{\prime}\cup I] into a clique.

This finishes the edition to the unweighted construction. See Figure 3 for an illustration.

(a,b,c)T(a,b,c)_{T}(a,b,c)T(a,b,c)_{T^{\prime}}(a,b,c)T′′(a,b,c)_{T^{\prime\prime}}(i)(i)(i)(i^{\prime})(a,b,i,j,k)(a,b,i,j,k)(a,b,i,j,k)(a,b,i^{\prime},j^{\prime},k^{\prime})({d,e},i,j,k)(\{d,e\},i,j,k)(p1,p2,i,j,k)(p_{1},p_{2},i,j,k)(p1,p2,i,j,k)(p^{\prime}_{1},p^{\prime}_{2},i^{\prime},j^{\prime},k^{\prime})𝐚[𝐢]=𝐚[𝐣]=𝐚[𝐤]=𝟏\mathbf{a[i]=a[j]=a[k]=1}maj(𝐛[𝐢],𝐛[𝐣],𝐛[𝐤])=𝟏\mathbf{(b[i],b[j],b[k])=1}𝐝[𝐢]=𝐝[𝐣]=𝐝[𝐤]=𝐞[𝐢]=𝐞[𝐣]=𝐞[𝐤]=𝟏\mathbf{d[i]=d[j]=d[k]=e[i]=e[j]=e[k]=1}𝐡{𝐢,𝐣,𝐤},\mathbf{\exists h\in\{i,j,k\},}𝐜[𝐡]=𝐛[𝐡]=𝟏\mathbf{c[h]=b[h]=1}𝐚{𝐝,𝐞}\mathbf{a\in\{d,e\}}𝐚[𝐩𝟏]=𝐛[𝐩𝟏]=𝐜[𝐩𝟏]=\mathbf{a[p_{1}]=b[p_{1}]=c[p_{1}]=}𝐚[𝐩𝟐]=𝐛[𝐩𝟐]=𝐜[𝐩𝟐]=𝟏\mathbf{a[p_{2}]=b[p_{2}]=c[p_{2}]=1}𝐚[𝐢]=𝐛[𝐢]=𝟏\mathbf{a[i]=b[i]=1}𝐝[𝐩𝟏]=𝐞[𝐩𝟐]=𝟏\mathbf{d[p_{1}]=e[p_{2}]=1}or𝐝[𝐩𝟐]=𝐞[𝐩𝟏]=𝟏\mathbf{d[p_{2}]=e[p_{1}]=1}TTTT^{\prime}T′′T^{\prime\prime}II^{\prime}CCPPII(3,0)(3,0)(3,0)(3,0)(3,0)(3,0)(0,1)(0,1)(2,3)(2,3)(2,3)(2,3)(0,5)(0,5)
Figure 3: The unweighted construction GG^{\prime}. In bold, the conditions for the existence of a vertex or of an edge. The pairs in red recall, for vertices of the corresponding set, the length of their vector tuple followed by the length of their index tuple.

New vertex and edge count

We added to V(G)V(G) O(N3)O(N^{3}) vertices in TT′′T^{\prime}\cup T^{\prime\prime}, and \ell, in II^{\prime}. Thus GG^{\prime} has also O(|V(G)|)=O(N3+N23+5)=O(N3)O(|V(G)|)=O(N^{3}+N^{2}\ell^{3}+\ell^{5})=O(N^{3}) vertices. We added to E(G)E(G) O(N3+N33)O(N^{3}+N^{3}\ell^{3}) edges incident to TT^{\prime}, and O(N3+6+2)O(N^{3}\ell+\ell^{6}+\ell^{2}) edges incident to II^{\prime}. (The edges between T′′T^{\prime\prime} and II were already counted in GG between TT and II.) Thus GG^{\prime} has O(|E(G)|)=O(N35+N26+10)=O~(N3)O(|E(G)|)=O(N^{3}\ell^{5}+N^{2}\ell^{6}+\ell^{10})=\tilde{O}(N^{3}) edges. Again GG^{\prime} can be computed in time O~(N3)\tilde{O}(N^{3}).

4.2 The absence of orthogonal quadruple implies diameter at most 4

In case SS has no orthogonal quadruple, we use similar arguments as in GG, to find paths of length at most 4 between every pair of vertices in GG^{\prime}. We first show that II^{\prime} is at distance at most 3 of every vertex of GG^{\prime}.

Lemma 4.1.

N[I]IIT′′N[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}, N2[I]IIT′′PTN^{2}[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime}, and N3[I]=V(G)N^{3}[I^{\prime}]=V(G^{\prime}).

Proof 4.2.

The inclusions are actually equalities. N[I]IIT′′N[I^{\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime} since II is fully adjacent to II^{\prime} and every (a,b,c)T′′T′′(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime} is adjacent to some (i)I(i)\in I^{\prime}, for otherwise a,ba,b is an orthogonal pair. N2[I]N[IIT′′]IIT′′PTN^{2}[I^{\prime}]\supseteq N[I^{\prime}\cup I\cup T^{\prime\prime}]\supseteq I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime} since every ({a,b},i,j,k)P(\{a,b\},i,j,k)\in P is adjacent to (i,i,i,j,k)I(i,i,i,j,k)\in I and every (a,b,c)TT(a,b,c)_{T^{\prime}}\in T^{\prime} is adjacent to (a,b,c)T′′T′′(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime}. Finally, N3[I]N[IIT′′PT]=V(G)N^{3}[I^{\prime}]\supseteq N[I^{\prime}\cup I\cup T^{\prime\prime}\cup P\cup T^{\prime}]=V(G^{\prime}) since every (a,b,i,j,k)C(a,b,i,j,k)\in C is adjacent to ({a,a},i,j,k)P(\{a,a\},i,j,k)\in P and every (a,b,c)TT(a,b,c)_{T}\in T is adjacent to (a,b,c)TT(a,b,c)_{T^{\prime}}\in T^{\prime}.

We also show the following inclusions.

Lemma 4.3.

N[I]PT′′N[I]\supset P\cup T^{\prime\prime}, and N2[I]PCT′′TN^{2}[I]\supset P\cup C\cup T^{\prime\prime}\cup T^{\prime}.

Proof 4.4.

N[I]PN[I]\supset P, N2[I]CN^{2}[I]\supset C, and N[T′′]TN[T^{\prime\prime}]\supset T^{\prime} have all been shown in Lemma 4.1. Therefore we shall just prove that N[I]T′′N[I]\supset T^{\prime\prime}. Indeed every vertex (a,b,c)T′′T′′(a,b,c)_{T^{\prime\prime}}\in T^{\prime\prime} is adjacent to some (i,i,i,i,i)I(i,i,i,i,i)\in I, since otherwise a,b,ca,b,c is an orthogonal triple.

For the case disjunction, initially imagine the K7K_{7} with loops on vertices T,T,T′′,C,P,I,IT,T^{\prime},T^{\prime\prime},C,P,I,I^{\prime}, where edges correspond to the kinds of pairs that are left to check. The following paragraphs remove all its edges in the order: all edges incident to II and to II^{\prime}, all remaining edges incident to PP and to T′′T^{\prime\prime} but TPTP and TT′′TT^{\prime\prime}, all remaining edges incident to CC, all remaining edges incident to TT^{\prime} as well the loop on TT, the edge TPTP, and finally the edge TT′′TT^{\prime\prime}.

Between uIIu\in I\cup I^{\prime} and vV(G)v\in V(G^{\prime})

As G[II]G^{\prime}[I\cup I^{\prime}] is a clique and, by Lemma 4.1, N3[I]=V(G)N^{3}[I^{\prime}]=V(G^{\prime}), then N4[u]=V(G)N^{4}[u]=V(G^{\prime}) holds for every vertex uIIu\in I\cup I^{\prime}.

Between uPT′′u\in P\cup T^{\prime\prime} and vPCT′′Tv\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}

For every uPT′′u\in P\cup T^{\prime\prime}, by Lemma 4.3 and the fact that G[I]G^{\prime}[I] is a clique, N2[u]IN^{2}[u]\supset I and, again by Lemma 4.3, N4[u]PCT′′TN^{4}[u]\supset P\cup C\cup T^{\prime\prime}\cup T^{\prime}. In particular there is a path of length at most 4 from uu to any vertex vPCT′′Tv\in P\cup C\cup T^{\prime\prime}\cup T^{\prime}.

The following two cases work as in GG, since (a,b,c)T(a,b,c)_{T} and (a,b,c)T(a,b,c)_{T^{\prime}} are twins in G[TTCP]G^{\prime}[T\cup T^{\prime}\cup C\cup P].

Between uCu\in C and vTTCv\in T\cup T^{\prime}\cup C

This holds by replacing the occurrence of (c,d,e)(c,d,e) by (c,d,e)T(c,d,e)_{T} or (c,d,e)T(c,d,e)_{T^{\prime}}, and every occurrence of TT by TTT\cup T^{\prime}, in the paragraph Between uCu\in C and vTCv\in T\cup C of the weighted construction.

Between uTTu\in T\cup T^{\prime} and vTTv\in T\cup T^{\prime}

Again this holds by replacing occurrences of (a,b,c)(a,b,c) (resp. (c,d,e)(c,d,e)) by (a,b,c)T(a,b,c)_{T} or (a,b,c)T(a,b,c)_{T^{\prime}} (resp. (c,d,e)T(c,d,e)_{T} or (c,d,e)T(c,d,e)_{T^{\prime}}).

Between uTu\in T and vPv\in P

This works as in GG by following three edges of weight 1 from TT to II, instead of a single edge of weight 3. For every u=(a,b,c)TTu=(a,b,c)_{T}\in T and v=({d,e},i,j,k)Pv=(\{d,e\},i,j,k)\in P, there is a path u=(a,b,c)T(a,b,c)T(a,b,c)T′′(p1,p2,i,j,k)({d,e},i,j,k)=vu=(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k)-(\{d,e\},i,j,k)=v in GG^{\prime}, with p1=ind(a,b,c,d)p_{1}=\text{ind}(a,b,c,d) and p2=ind(a,b,c,e)p_{2}=\text{ind}(a,b,c,e).

Between uTu\in T and vT′′v\in T^{\prime\prime}

This case is the real novelty compared to GG, and the reason for introducing II^{\prime}. For every u=(a,b,c)TTu=(a,b,c)_{T}\in T and v=(d,e,f)T′′T′′v=(d,e,f)_{T^{\prime\prime}}\in T^{\prime\prime}, there is a path u=(a,b,c)T(a,b,c)T(a,b,c)T′′(i)(d,e,f)T′′=vu=(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(i)-(d,e,f)_{T^{\prime\prime}}=v in GG^{\prime}, with i=ind(a,b,d,e)i=\text{ind}(a,b,d,e). The last two edges exist since a[i]=b[i]=d[i]=e[i]=1a[i]=b[i]=d[i]=e[i]=1.

4.3 The presence of orthogonal quadruple implies diameter at least 7

Again we assume that there is an orthogonal quadruple a,b,c,dSa,b,c,d\in S such that a,b,c,da,b,c,d are pairwise distinct. We claim that there is no path of length at most 6 in GG^{\prime} between u=(a,b,c)Tu=(a,b,c)_{T} and v=(d,c,b)Tv=(d,c,b)_{T}. Since the distance between TT and III\cup I^{\prime} is at least 3, any TTTT-path of length at most 6 visits III\cup I^{\prime} at most once. For the sake of contradiction, let 𝒫\mathcal{P} be such a path that we further assume shortest (hence in particular chordless) and, among shortest uvuv-paths, having the fewest edges in E(T,C)E(T^{\prime},C). We will show that 𝒫\mathcal{P} cannot visit II^{\prime}, nor use any edge of E(T,C)E(T^{\prime},C). Finally we observe that TTTT-paths of length at most 6 in GG^{\prime} respecting these two interdictions are in length-preserving one-to-one correspondence with TTTT-paths in GG.

𝒫\mathcal{P} cannot visit II^{\prime}

The only possible kind of a TTTT-path of length at most 6 visiting II^{\prime} is TTT′′IT′′TTT-T^{\prime}-T^{\prime\prime}-I^{\prime}-T^{\prime\prime}-T^{\prime}-T. This forces 𝒫\mathcal{P} to be of the form (a,b,c)T(a,b,c)T(a,b,c)T′′(i)(d,c,b)T′′(d,c,b)T(d,c,b)T(a,b,c)_{T}-(a,b,c)_{T^{\prime}}-(a,b,c)_{T^{\prime\prime}}-(i)-(d,c,b)_{T^{\prime\prime}}-(d,c,b)_{T^{\prime}}-(d,c,b)_{T} for some i[]i\in[\ell]. However the third and fourth edges imply that there is an i[]i\in[\ell] such that a[i]=b[i]=c[i]=d[i]=1a[i]=b[i]=c[i]=d[i]=1, a contradiction to the orthogonality of a,b,c,da,b,c,d.

𝒫\mathcal{P} cannot use any edge of E(T,C)E(T^{\prime},C)

Assuming that 𝒫\mathcal{P} contains at least one edge in E(T,C)E(T^{\prime},C), we first show that it has to contain a subpath CTT′′IIC-T^{\prime}-T^{\prime\prime}-I\cup I^{\prime} or IIT′′TCI\cup I^{\prime}-T^{\prime\prime}-T^{\prime}-C. Let w=(a,b,c)TTV(𝒫)w=(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}\in T^{\prime}\cap V(\mathcal{P}) be a vertex of 𝒫\mathcal{P} with one neighbor xCV(𝒫)x\in C\cap V(\mathcal{P}) on 𝒫\mathcal{P}. The other neighbor yy of ww on 𝒫\mathcal{P} is necessarily in T′′T^{\prime\prime}. Indeed if yTy\in T, then y=(a,b,c)Ty=(a^{\prime},b^{\prime},c^{\prime})_{T}, and xyE(G)xy\in E(G^{\prime}) is a chord. If instead yCy\in C, then one can replace the subpath x(a,b,c)Tyx-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}-y by x(a,b,c)Tyx-(a^{\prime},b^{\prime},c^{\prime})_{T}-y, contradicting the minimality of the number of used edges in E(T,C)E(T^{\prime},C) (since this number decreases by 2).

Thus the only possibility is that yT′′y\in T^{\prime\prime}. Then the other neighbor of yy on 𝒫\mathcal{P} (other than ww) has to be in III\cup I^{\prime}, since otherwise 𝒫\mathcal{P} is not a simple path. Hence 𝒫\mathcal{P} contains a subpath of the kind CTT′′IIC-T^{\prime}-T^{\prime\prime}-I\cup I^{\prime} (or the reverse, IIT′′TCI\cup I^{\prime}-T^{\prime\prime}-T^{\prime}-C). Now we observe that CC is at distance at least 1 from TT, while III\cup I^{\prime} is at distance at least 3 from TT. Therefore such a path 𝒫\mathcal{P} would have length at least 7.

Such a path 𝒫\mathcal{P} would also exist in GG

We can now assume that 𝒫\mathcal{P} does not use any vertex of II^{\prime} nor any edge of E(T,C)E(T^{\prime},C). Every such simple TTTT-path (visiting II at most once) also exists in the weighted graph GG, with the same length. To see it, we notice that if 𝒫\mathcal{P} contains an edge (a,b,c)T(a,b,c)T(a^{\prime},b^{\prime},c^{\prime})_{T}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}, then it has to contain a subpath of the form (a,b,c)T(a,b,c)T(a,b,c)T′′(p1,p2,i,j,k)I(a^{\prime},b^{\prime},c^{\prime})_{T}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime}}-(a^{\prime},b^{\prime},c^{\prime})_{T^{\prime\prime}}-(p_{1},p_{2},i,j,k)\in I, and is emulated in GG by taking the weight-3 edge (a,b,c)(p1,p2,i,j,k)(a^{\prime},b^{\prime},c^{\prime})-(p_{1},p_{2},i,j,k). However we showed in the previous section that no uvuv-path of length at most 6 exists in GG.

References