A Survey on Parameterized Inapproximability: $k$ -Clique, $k$ -SetCover, and More

Xuandi Ren
Peking University
[email protected]

Abstract

In the past a few years, many interesting inapproximability results have been obtained from the parameterized perspective. This article surveys some of such results, with a focus on $k$ -Clique, $k$ -SetCover, and other related problems.

1 Introduction

Parameterization and Approximation are two natural ways to cope with NP-complete optimization problems. For Clique and SetCover, two very basic NP-complete problems whose parameterized version $k$ -Clique and $k$ -SetCover are also complete problems of W[1] and W[2], both approximation and parameterization have been studied extensively. However, the combining of parameterization and approximation remains unexplored until recent years.

In their breakthrough work, [CCK⁺17] showed very strong inapproximability results for $k$ -Clique and $k$ -SetCover under the hypothesis Gap-ETH. However, although maybe plausible, Gap-ETH is such a strong hypothesis that it already gives a gap in hardness of approximation. Thus it is still of great interest to prove the same lower bound under a gap-free assumption like ETH, W[1] $\neq$ FPT and so on. Although these years have witnessed many significant developments along this way, the inapproximability of $k$ -Clique and $k$ -SetCover under gap-free hypotheses is still far beyond settled.

This article surveys some recent results, all of which are beautiful, full of smart ideas, and involve delicate algebraic or combinatorial tools. We hope to extract the relationship between different problems, capture the essence of successful attempts, and convey the ideas inside those results to readers.

1.1 Organization of the Survey

This article is organized by the problems. In Section 2, we put some preliminaries, including the definition of problems and hypotheses. In Section 3, we introduce MaxCover and MinLabel, two problems which are not only important intermediate problems in proving parameterized inapproximability, but also of great interest themselves. In Section 4 and 5, we introduce recent parameterized inapproximability results of $k$ -Clique and $k$ -SetCover, respectively.

2 Preliminaries

In this section, we first introduce some concepts in FPT approximation, then briefly describe the problems discussed in this article, and the hypotheses which the results are based on.

2.1 FPT Approximation

For a parameterized optimization problem, we use $n$ to denote the input size, and the parameter $k$ usually refers to the number of elements we need to pick to obtain an optimal solution. In some problems $k$ is just the optimal solution size (e.g. $k$ -Clique), while in other problems it is not (e.g. One-Sided $k$ -Biclique). By enumerating the $k$ elements in the solution, the brute-force algorithm usually runs in time $O(n^{k})$ .

An algorithm for a maximization (respectively, minimization) problem is called $\rho$ -FPT-approximation if it runs in $f(k)n^{O(1)}$ time for some computable function $f$ , and outputs a solution of size at least $k/\rho$ (respectively, at most $k\cdot\rho$ ). Here $\rho$ is called approximation ratio. If an optimization problem admits no $f(k)$ -FPT-approximation for any computable function $f$ , we say this problem is totally FPT inapproximable.

Note that since computing a constant-size solution is trivial, for a maximization problem, we only care about $o(k)$ -FPT-approximation and the approximation ratio is measured in terms of only $k$ . However, for a minimization problem, any computable approximation ratio is non-trivial, so if totally FPT inapproximability is already established, we can also discuss approximation ratio in terms of both $k$ and $n$ .

2.2 Problems

If the input is divided into $k$ groups, and one is only allowed to pick at most one element from each group, we say this problem is colored, otherwise it is uncolored. For some problems (e.g. $k$ -Clique), the two versions are equivalent, while for some other problems (e.g. $k$ -Biclique) they are not equivalent at all. We will discuss the coloring in each problem’s section separately.

Now we list the problems considered in this article. There are some other problems (e.g. MaxCover) which are used as intermediate problems in proving hardness of approximation. We put their definitions in their separate sections since they are a bit more complicated.

•

3SAT. The input is a 3-CNF formula $\varphi$ with $m$ clauses on $n$ variables. The goal is to decide whether there is a satisfying assignment for $\varphi$ .
•

$k$ -Clique. The input is an undirected graph $G=(V,E)$ with $n$ vertices. The goal is to decide whether there is a clique of size $k$ .
•

Densest $k$ -Subgraph. The input is an undirected graph $G=(V,E)$ with $n$ vertices, and the goal is to find the maximum number of edges induced by $k$ vertices.
•

$k$ -SetCover. The input is a collection of $n$ sets $\mathcal{S}=\{S_{1},\ldots,S_{n}\}$ over universe $U$ . The goal is to decide whether there are $k$ sets in $\mathcal{S}$ , whose union is $U$ .

2.3 Hypotheses

Here we list the hypotheses which the results are based on.

W[1] $\neq$ FPT and W[2] $\neq$ FPT are arguably the most natural hypotheses in parameterized complexity, and are often used to derive FPT time lower bounds. Since $k$ -Clique and $k$ -SetCover are complete problems of W[1] and W[2], respectively, we directly use their intractability results in the statements of those two hypotheses here, and omit the definition of W-Hierarchy.

Hypothesis 2.1 (W[1] $\neq$ FPT).

$k$ -Clique cannot be solved in $f(k)n^{O(1)}$ time, for any computable function $f$ .

Hypothesis 2.2 (W[2] $\neq$ FPT).

$k$ -SetCover cannot be solved in $f(k)n^{O(1)}$ time, for any computable function $f$ .

Tighter time lower bounds like $n^{\Omega(k)}$ often involves the Exponential Time Hypothesis (ETH).

Hypothesis 2.3 (Exponential Time Hypothesis (ETH)[IP01, IPZ01, Tov84]).

3SAT cannot be solved deterministically in $2^{o(n)}$ time, where $n$ is the number of variables. Moreover, this holds even when restricted to formulae in which $m=O(n)$ , and each variable appears in at most three clauses.

There are two stronger assumptions on the intractability of 3SAT, namely, the Gap Exponential Time hypothesis (Gap-ETH) and Strong Exponential Time hypothesis (SETH). Gap-ETH is useful in proving strong inapproximability results for many parameterized problems, while SETH is used to show tight time lower bounds like $n^{k-o(1)}$ .

Hypothesis 2.4 (Gap Exponential Time Hypothesis (Gap-ETH) [Din16, MR17]).

For some constant $\varepsilon>0$ , there is no deterministic algorithm which runs in $2^{o(n)}$ time can, given a 3SAT formula on $n$ variables and $m=O(n)$ clauses, distinguish between the following two cases:

•

(Completeness) the formula is satisfiable.
•

(Soundness) any assignment violates more than $\varepsilon$ fraction of clauses.

Note that by current state-of-the-art PCP theorem, a 3SAT formula $\varphi$ on $n$ variables can be transformed into a constant gap 3SAT formula $\varphi^{\prime}$ on only $n\text{polylog}(n)$ variables [Din07]. Therefore, assuming ETH, constant gap 3SAT cannot be solved in $2^{o(n/\text{polylog}(n))}$ time. A big open problem is whether linear-sized PCP exists. If so, Gap-ETH would follow from ETH.

Hypothesis 2.5 (Strong Exponential Time Hypothesis (SETH) [IP01, IPZ01]).

For any $\varepsilon>0$ , there is an integer $k\geq 3$ such that no algorithm can solve $k$ SAT deterministically in $2^{(1-\varepsilon)n}$ time.

Gap-ETH and SETH both imply ETH. However, no formal relationship between them is known now.

There are also randomized versions of ETH, Gap-ETH and SETH, which also rule out randomized algorithms running in corresponding time. We do not separately list them here.

One last important hypothesis is the Parameterized Inapproximability Hypothesis (PIH).

Hypothesis 2.6 (Parameterized Inapproximability Hypothesis (PIH) [LRSZ20]).

For some constant $\varepsilon>0$ , there is no $(1+\varepsilon)$ factor FPT approximation algorithm for Colored Densest $k$ -Subgraph.

The factor $(1+\varepsilon)$ can be replaced by any constant and is not important.

Note that if for a graph the number of edges induced by $k$ vertices is only $\binom{\varepsilon k}{2}\approx\varepsilon^{2}\binom{k}{2}$ , it can not have a clique of size $>\varepsilon k$ . Thus, PIH implies $k$ -Clique is hard to approximate within any constant factor in FPT time. However, the reverse direction is not necessarily true (forbiddinng small clique does not imply low density of edges), and it remains an important open question that whether PIH holds if we assume $k$ -Clique is FPT inapproximable within any constant factor.

Another remark is that PIH can be implied from Gap-ETH. See Appendix A of [BGKM18] for a simple proof. However, deriving PIH from a gap-free hypothesis such as ETH is still open.

3 MaxCover and MinLabel

In this section, we introduce two intermediate problems which are important in proving hardness of $k$ -Clique and $k$ -SetCover.

The input is the same for both problems. It is a bipartite graph $G=(U\dot{\cup}W,E)$ , such that $U$ is partitioned into $U=U_{1}\dot{\cup}\ldots\dot{\cup}U_{\ell}$ and $W$ is partitioned into $W=W_{1}\dot{\cup}\ldots\dot{\cup}W_{h}$ . We refer to $U_{i}$ ’s and $W_{j}$ ’s as left super-nodes and right super-nodes, respectively, and we refer to the maximum size of left super-nodes and right super-nodes as left alphabet size and right alphabet size, and denote them as $|\Sigma_{U}|$ and $|\Sigma_{W}|$ , respectively.

We say a MaxCover or MinLabel instance has projection property if for every $i\in[\ell],j\in[h]$ , one of the following holds:

•

Every $u\in U_{i}$ has exactly one neighbor $w\in W_{j}$ .
•

There is a full bipartite graph between $U_{i}$ and $W_{j}$ .

The bipartite case just means there are no restrictions between $U_{i}$ and $W_{j}$ .

Another interesting property is called pseudo projection property, which is almost the same as projection property, except the projection direction in the first case is opposite (Every $w\in W_{j}$ has exactly one neighbor $u\in U_{i}$ ).

For convenience, in an instance $\Gamma$ and for a left super-node $U_{i},i\in[\ell]$ , we sometimes refer to the number of right super-nodes $W_{j}$ ’s, such that the edges between $U_{i}$ and $W_{j}$ do not form a full bipartite graph, as $U_{i}$ ’s degree. Similarly define it for every right super-nodes. We call the maximum degree over all $U_{i}$ ’s (respectively, over all $W_{i}$ ’s), the left degree (respectively, right degree) of $\Gamma$ .

A solution to MaxCover is a subset of vertices $S\subseteq W$ formed by picking a vertex from each $W_{j}$ (i.e. $|S\cap W_{j}|=1$ for all $j\in[h]$ ). We say a labeling $S$ covers a left super-node $U_{i}$ if there exists a vertex $u_{i}\in U_{i}$ which is a common neighbor of all vertices in $S$ . The goal in MaxCover is to find a labeling that covers the maximum fraction of left super-nodes. The value of a MaxCover instance is defined as

\frac{1}{\ell}\left(\max_{\text{labeling}~{}S}|\{i\in[\ell]|U_{i}\text{ is covered by }S\}|\right)

A solution to MinLabel is also a subset of vertices $S\subseteq W$ , but not necessarily one vertex from each $W_{j}$ . We say a multi-labeling $S$ covers a left super-node $U_{i}$ if there exists a vertex $u_{i}\in U_{i}$ which has a neighbor in $S\cap W_{j}$ for every $j\in[h]$ . The goal in MinLabel is to find a minimum-size multi-labeling $S$ that covers all the left super-nodes. The value of a MinLabel instance is defined as

\frac{1}{h}\left(\min_{\begin{subarray}{c}\text{multi-labeling }S\\ \text{ which covers every }U_{i}\end{subarray}}|S|\right)

There is a remark on the relationship between projection property and pseudo projection property. If the degree of each left super-node $U_{i}$ is bounded by some constant (it is the case when reducing from 3SAT, see Theorem 3.2), then a MaxCover instance with projection property can be reduced to a MaxCover instance with pseudo projection property, with a constant shrinking of the gap.

Theorem 3.1.

There is a reduction which, on input a MaxCover instance $\Gamma=(\bigcup_{i\in[\ell]}U_{i},\bigcup_{i\in[h]}W_{i},E)$ with projection property, and the left degree of $\Gamma$ is bounded by a constant $q$ , outputs a MaxCover instance $\Gamma^{\prime}=(\bigcup_{i\in[\ell\cdot q]}U^{\prime}_{i},\bigcup_{i\in[h+\ell]}W^{\prime}_{i},E^{\prime})$ with pseudo projection property, such that

•

(Completeness) If MaxCover $(\Gamma)=1$ , then MaxCover $(\Gamma^{\prime})=1$ .
•

(Soundness) If MaxCover $(\Gamma)<1-\varepsilon$ , then MaxCover $(\Gamma^{\prime})<1-\varepsilon/q$ .

and the right degree of $\Gamma^{\prime}$ is bounded by $q$ .

Proof.

The reduction is straightforward: for each restriction between some $U_{i},i\in[\ell]$ and $W_{j},j\in[h]$ , build a copy of $W_{j}$ on the left (there are at most $\ell\cdot q$ restrictions, thus that many copies), and the new right super-nodes are just the juxtaposition of $U$ and $W$ . Each left super-node is only responsible to check one restriction in $\Gamma$ . The edges between the left and the right $W_{i}$ parts form either a bijection or a full bipartite graph, while the edges between the left and the right $U_{i}$ parts form either an injection from $U$ to $W$ (as they do in $\Gamma$ ), or a full bipartite graph, too. It’s easy to see the new instance satisfies pseudo projection property, the right degree is $\leq q$ , and the gap is only hurt by a factor of $q$ . ∎

In the following we briefly list the inapproximability results for MaxCover and MinLabel, which will be introduced detailedly in subsequent subsections. We use $n$ to denote $|\Sigma|$ for simplicity.

Problem

Assumption

Ratio

Lower Bound

Holds Even

Reference

MaxCover

Gap-ETH

Any Constant

2^{\Omega(h)}

|\Sigma_{U}|=|\Sigma_{W}|=O(1)

\ell=O(h)

projection property

r/\ell

n^{\Omega(r)}

|\Sigma_{W}|=O(1)

projection property

[CCK⁺17]

\gamma

n^{\Omega(h)}

|\Sigma_{U}|\leq(1/\gamma)^{O(1)}

W[1]

\neq

FPT

n^{-O(\frac{1}{\sqrt{h}})}

f(h)\cdot\text{poly}(n)

[KN21]

ETH

n^{-O(\frac{1}{h^{3}})}

n^{\Omega(h)}

MinLabel

Gap-ETH

\gamma^{-1/h}

n^{\Omega(h)}

|\Sigma_{U}|\leq(1/\gamma)^{O(1)}

[CCK⁺17]

It’s worth noting that [KLM19] also showed some inapproximability results for MaxCover. However, as their parameters are specifically designed for later use of proving hardness of $k$ -SetCover, we defer their results to Section 5.3 instead of here.

3.1 Hardness Results Based on Gap-ETH

Results in this subsection are from [CCK⁺17].

A 3SAT instance can be formulated as MaxCover instance as follows. Each left super-node consists of 7 vertices, which represent the satisfying assignments of a clause. Each right super-node consists of 2 vertices, which correspond to the true/false assignment for a variable. Two vertices are linked if and only if the assignments are consistent. Therefore, Gap-ETH can be also restated as an intractability result of constant gap MaxCover:

Theorem 3.2.

Assuming Gap-ETH, there is a constant $\varepsilon>0$ such that no deterministic algorithm can distinguish between the following cases for an instance $\Gamma=(\cup_{i\in[\ell]}U_{i},\cup_{i\in[h]}W_{i},E)$ in $2^{o(h)}$ time:

•

(Completeness) MaxCover $(\Gamma)=1$ .
•

(Soundness) MaxCover $(\Gamma)<1-\varepsilon$ .

Moreover, this holds even when $|\Sigma_{U}|,|\Sigma_{W}|=O(1),\ell=\Theta(h)$ and $\Gamma$ has projection property.

Actually Gap-ETH is equivalent to the above, see Appendix E of [CCK⁺17].

Note that MaxCover can be solved in $O^{*}(|\Sigma_{U}|^{\ell})$ or $O^{*}(|\Sigma_{W}|^{h})$ time, by just enumerating vertices picked from each left super-nodes or right super-nodes. Moreover, it can even decide whether the answer is $\geq\frac{r}{\ell}$ in $O^{*}(\binom{\ell}{r}|\Sigma_{U}|^{r})=O^{*}((\ell|\Sigma_{U}|)^{r})$ or $O^{*}(|\Sigma_{W}|^{h})$ time. As shown below, these are the best possible assuming Gap-ETH.

Theorem 3.3 (Theorem 4.2 in [CCK⁺17]).

Assuming Gap-ETH, there exists constants $\delta,\rho>0$ such that for any $\ell\geq r\geq\rho$ , no algorithm can take a MaxCover instance $\Gamma$ with $\ell$ left super-nodes, distinguish between the following cases in $O_{\ell,r}(|\Gamma|^{\delta r})$ time:

•

(Completeness) MaxCover $(\Gamma)=1$ .
•

(Soundness) MaxCover $(\Gamma)<\frac{r}{\ell}$ .

This holds even when $|\Sigma_{W}|=O(1)$ and $\Gamma$ has projection property.

The theorem is straightforward when $r=\Theta(\ell)$ , because we can directly compress a constant number of left super-nodes into one. The interesting case is when $r$ is much smaller than $\ell$ .

The proof involves a combinatorial object called disperser, which is defined as follows.

Definition 3.4 (Disperser[CW89, Zuc96a, Zuc96b]).

For positive integers $m,\ell,k,r\in\mathbb{N}$ and constant $\varepsilon\in(0,1)$ , an $(m,\ell,k,r,\varepsilon)$ -disperser is a collection $\mathcal{I}$ of $\ell$ subsets $I_{1},\ldots,I_{\ell}\subseteq[m]$ , each of size $k$ , such that the union of any $r$ different subsets from the collection has size at least $(1-\varepsilon)m$ .

Dispersers with proper parameters can be constructed using random subsets with high probability.

Lemma 3.5.

For positive integers $m,\ell,r\in\mathbb{N}$ and constant $\varepsilon\in(0,1)$ , let $k=\lceil\frac{3m}{\varepsilon r}\rceil$ and let $I_{1},\ldots,I_{\ell}$ be random $k$ -subsets of $[m]$ . If $\ln\ell\leq\frac{m}{r}$ then $\mathcal{I}=\{I_{1},\ldots,I_{\ell}\}$ is an $(m,\ell,k,r,\varepsilon)$ -disperser with probability at least $1-e^{-m}$ .

This above construction can also be derandomized easily. With this tool, we can compress the left super-nodes in a MaxCover instance according to those subsets. Each left super-node now corresponds to satisfying assignments of the AND of $k$ clauses. If there is a labeling that covers at least $r$ left super-nodes, from the definition of disperser we know that at least $(1-\varepsilon)m$ clauses in the original 3SAT instance can be simultaneously satisfied. The size of the new instance is $2^{O(k)}=2^{O(m/r)}$ , thus an algorithm for MaxCover which runs in $|\Gamma|^{o(r)}$ time would lead to an algorithm for constant gap 3SAT in $2^{o(m)}$ time, refuting Gap-ETH.

In the other direction, we would like to rule out $|\Gamma|^{o(h)}$ algorithms for approximating MaxCover, where $h$ is the number of right super-nodes. We have the following theorem:

Theorem 3.6 (Theorem 4.3 in [CCK⁺17]).

Assuming Gap-ETH, there exists constants $\delta,\rho>0$ such that for any $h\geq\rho$ and $1\geq\gamma>0$ , no algorithm can take a MaxCover instance $\Gamma$ with $h$ right super-nodes, distinguish between the following cases in $O_{h,\gamma}(|\Gamma|^{\delta h})$ time:

•

(Completeness) MaxCover $(\Gamma)=1$ .
•

(Soundness) MaxCover $(\Gamma)<\gamma$ .

This holds even when $|\Sigma_{U}|\leq(1/\gamma)^{O(1)}$ .

Note that in the statement of this theorem, $h$ can be any fixed constant, while in the statement of Gap-ETH, $h$ is the number of variables which goes to infinity. Thus, a straightforward idea is to compress the variables into $h$ groups, each of size $n/h$ .

After grouping variables, we can also compress the clauses in order to amplify the soundness parameter to $\gamma$ . Specifically, let $k=\ln(\frac{1}{\gamma})/\varepsilon$ , take $\ell=\binom{m}{k}$ left super-nodes, each corresponding to satisfying assignments of the AND of some $k$ clauses. If only $(1-\varepsilon)m$ clauses in the original 3SAT instance can be satisfied, then only $\binom{(1-\varepsilon)m}{k}$ clauses in the new instance can be satisfied, leading to a soundness parameter $\binom{(1-\varepsilon)m}{k}/\binom{m}{k}\leq e^{-\varepsilon k}=\gamma$ . Furthermore, $|\Sigma_{U}|=O(1)^{k}\leq(1/\gamma)^{O(1)}$ .

One important thing is that we need to make sure $\ell$ and $|\Sigma_{U}|$ can be bounded by $|\Sigma_{W}|=2^{n/h}$ , so that $|\Sigma_{W}|$ is the dominating term in $|\Gamma|$ . Thus, $\gamma$ cannot be arbitrarily small. Fortunately in its major applications (e.g. hardness of SetCover), this will not be the bottleneck.

Next we proceed to discuss hardness of MinLabel.

Theorem 3.7 (Theorem 4.4 in [CCK⁺17] ).

Assuming Gap-ETH, there exists constants $\delta,\rho>0$ such that, for any $h\geq\rho$ and $1\geq\gamma>0$ , no algorithm can take a MinLabel instance $\Gamma$ with $h$ right super-nodes, distinguish between the following cases in $O_{h,\gamma}(|\Gamma|^{\delta h})$ time:

•

(Completeness) MinLabel $(\Gamma)=1$ .
•

(Soundness) MinLabel $(\Gamma)>\gamma^{-1/h}$ .

This holds even when $|\Sigma_{U}|\leq(1/\gamma)^{O(1)}$ .

The instance is exactly the one in Theorem 3.6. It is easy to see that in the completeness case, MaxCover $=1$ implies MinLabel $=1$ . We only need to additionally argue that, in the soundness case, small MaxCover implies large MinLabel. Prove by contradiction, if MinLabel $\leq\gamma^{-1/h}$ , we can fix this multi-labeling, and pick a vertex uniformly at random from each right super-node to form a labeling. By proving the expected fraction of covered left super-nodes $\geq\gamma$ , we have MaxCover $\geq\gamma$ . In the following we suppose the optimal right multi-labeling $S\subseteq W$ covers left vertices $u_{1}\in U_{1},\ldots,u_{h}\in U_{h}$ .

	$\displaystyle{\sc MaxCover}\geq$	$\displaystyle\mathbb{E}\left[\frac{1}{\ell}\sum_{i=1}^{\ell}[u_{i}\text{ is covered}]\right]$
	$\displaystyle=$	$\displaystyle\frac{1}{\ell}\sum_{i=1}^{\ell}\prod_{j=1}^{h}\|\mathcal{N}(u_{i},W_{j}\cap S)\|^{-1}$
	$\displaystyle\geq$	$\displaystyle\frac{1}{\ell}\sum_{i=1}^{\ell}\left(\frac{1}{h}\sum_{j=1}^{h}\|\mathcal{N}(u_{i},W_{j}\cap S)\|\right)^{-h}$
	$\displaystyle\geq$	$\displaystyle\frac{1}{\ell}\sum_{i=1}^{\ell}\left(\frac{\|S\|}{h}\right)^{-h}$
	$\displaystyle=$	$\displaystyle\gamma$

The left alphabet size is $(1/\gamma)^{O(1)}$ , as in Theorem 3.6.

3.2 Gap Producing via Threshold Graph Composition

Threshold Graph Composition is a powerful gap-producing technique. It was first proposed by Lin in his breakthrough work [Lin18], and has been used to create gap for many parameterized problems [CL19, Lin19, BBE⁺19, KN21].

At a high level, in TGC we compose an instance which has no gap, with a threshold graph which is oblivious to the instance, to produce a gap instance of that problem. The two main challenges of TGC are the creation of a threshold graph with desired properties, and the right way to compose the input and the threshold graph, respectively.

In this subsection, we introduce a delicate threshold graph, which is constructed via error correcting codes. This graph was proposed by Karthik et al. [KN21], and had many applications such as in proving hardness of MaxCover starting from W[1] $\neq$ FPT or ETH (later in this Section), and in simplifying the proof of $k$ -SetCover inapproximability in [Lin19] (see Section 5.4).

We first formalize some definitions related to error correcting codes.

Definition 3.8 (Error Correcting Codes).

Let $\Sigma$ be a finite set, for every $\ell\in\mathbb{N}$ a subset $C:\Sigma^{r}\to\Sigma^{\ell}$ is an error correcting code with message length $r$ , block length $\ell$ and relative distance $\delta$ if for every $x,y\in\Sigma^{r}$ , $\Delta(C(x),C(y))\geq\delta$ . We denote then $\Delta(C)=\delta$ . Here $\Delta(x,y)=\frac{1}{\ell}|\{i\in[\ell]|x_{i}\neq y_{i}\}|$ .

We sometimes abuse notations and treat an error correcting code as its image, i.e., $C\subset\Sigma^{\ell}$ .

Definition 3.9 (Collision Number).

The collision number of an error correcting code $C$ is the smallest number $s$ such that there exists a set $S\subseteq C$ with $|S|=s$ , and for every $j\in[\ell]$ , there are two strings $x,y\in S$ such that $x_{j}=y_{j}$ . We denote this number as $Col(C)$ .

For any error correcting code $C:\Sigma^{r}\to\Sigma^{\ell}$ and any $k\in\mathbb{N}$ , we can take it to build a bipartite threshold graph $G=(A\dot{\cup}B,E)$ with the following properties efficiently:

•

$A$ is divided into $k$ groups, each of size $|\Sigma|^{r}$ . $B$ is divided into $\ell$ groups, each of size $|\Sigma|^{k}$ .
•

(Completeness) For any $a_{1}\in A_{1},\ldots a_{k}\in A_{k}$ and for any $j\in[\ell]$ , there is a unique vertex $b\in B_{j}$ which is a common neighbor of $\{a_{1},\ldots,a_{k}\}$ .
•

(Soundness) For every $i\in[k]$ and every distinct $a\neq a^{\prime}\in A_{i}$ , for $\Delta(C)\cdot\ell$ of the parts $j\in[\ell]$ , we have that $\mathcal{N}(a)\cap\mathcal{N}(a^{\prime})\cap\mathcal{B}_{j}=\emptyset$ .
•

(Collision Property) Let $X\subseteq A$ such that for every $j\in[\ell]$ , there exists $b\in B_{j}$ which is a common neighbor of (at least) $k+1$ vertices in $X$ . Then $|X|\geq Col(C)$ .

The graph is constructed as follows:

•

For every $i\in[k]$ , we associate $A_{i}$ with all codewords in $C$ , i.e. each vertex in $A_{i}$ is a unique codeword in the image of $C$ .
•

For every $j\in[\ell]$ , we associate $B_{j}$ with the set $\Sigma^{k}$ .
•

A vertex $a\in A_{i}$ and a vertex $b\in B_{j}$ are linked if and only if $a_{j}=b_{i}$ .

We can think of the graph as an $k\times\ell$ matrix, when picking vertices from each $A_{i},i\in[k]$ , we are filling the codewords into each row of the matrix. By reading out each column of the matrix, we can pick exactly one common neighbor of them in each $B_{j},j\in[\ell]$ , satisfying the completeness property.

The soundness property is also straightforward: if we pick two vertices $a,a^{\prime}$ from the same left group $A_{i}$ , the two codewords differ in at least $\Delta(C)\cdot\ell$ positions. For those columns we cannot “read out” any $k$ -bit string whose $i$ -th bit equals to two different characters.

As for the collision property, for a set $X\subseteq A$ , if for every $j\in[\ell]$ there exists $b\in B_{j}$ which is a common neighbor of at least $k+1$ vertices in $X$ , it’s easy to see that for any $j\in[\ell]$ , we can pick $x,y\in X$ such that $x_{j}=y_{j}$ by pigeonhole principle.

Now we describe how to compose this threshold graph with a $k$ -MaxCover instance $\Gamma$ where the parameter $k$ denotes the number of right super-nodes, to produce a Gap $k$ -MaxCover instance $\Gamma^{\prime}$ such that:

•

(Completeness) If MaxCover $(\Gamma)=1$ , then MaxCover $(\Gamma^{\prime})=1$ .
•

(Soundness) If MaxCover $(\Gamma)<1$ , then MaxCover $(\Gamma^{\prime})\leq 1-\Delta(C)$ .

Given a $k$ -MaxCover instance $\Gamma=G=(U\dot{\cup}W,E)$ with pseudo projection property, where $W=W_{1}\dot{\cup}\ldots\dot{\cup}W_{k}$ and $U=U_{1}\dot{\cup}\ldots\dot{\cup}U_{t}$ , and a threshold graph $G^{\prime}=(A\dot{\cup}B,E^{\prime})$ , where $A=A_{1}\dot{\cup}\ldots\dot{\cup}A_{t}$ and $B=B_{1}\dot{\cup}\ldots\dot{\cup}B_{\ell}$ , w.l.o.g. assume $|\Sigma|^{r}\geq\max_{i=1}^{t}|U_{i}|$ , we build the new $k$ -MaxCover instance $\Gamma^{\prime}$ as follows.

•

Arbitrarily match every vertex $u_{i}\in U_{i}$ to a vertex in $a_{i}\in A_{i}$ without repetitions. This can be done since $|\Sigma|^{r}\geq\max_{i=1}^{t}|U_{i}|$ .
•

The new right super-nodes are $W_{1}\ldots W_{k}$ , and the new left super-nodes are $B_{1}\ldots B_{\ell}$ .
•

A right vertex $w\in W_{i}$ and a left vertex $b\in B_{j}$ are linked if and only if there exists $u_{1}\in U_{1},\ldots,u_{t}\in U_{t}$ such that $w_{i}$ is linked to each $u_{1}\ldots u_{t}$ in $G$ , and $b$ is linked to the matching $a_{1}\ldots a_{t}$ in $G^{\prime}$ .

The completeness case is obvious, by picking one $w$ in each right super-node $W_{i}$ , there is a common neighbor in each left super-node $U_{i}$ . Consider their matching vertices $a_{1}\ldots a_{t}$ , there is exactly one common neighbor in each $B_{i}$ .

The soundness case needs the pseudo projection property of $G$ , i.e., for every $i\in[k]$ and $j\in t$ , edges between $W_{i}$ and $U_{j}$ either form a function, or are complete. Fix any labeling $w_{1}\in W_{1},\ldots,w_{k}\in W_{k}$ , there must be a super-node $U_{j}$ which cannot be covered. This means there must be two left vertices $w,w^{\prime}$ mapping to different vertices in $U_{j}$ . Let the two vertices be $u,u^{\prime}\in U_{j}$ , and let the matching vertices of them be $a,a^{\prime}\in A_{j}$ , by the soundness property of threshold graph, only in $1-\Delta(C)$ fraction of parts $j\in[\ell]$ is there a common neighbor of $a,a^{\prime}$ in $B_{j}$ . This means the labeling $\{w_{1},\ldots,w_{k}\}$ can only cover $(1-\Delta(C))\cdot\ell$ parts of $B$ , i.e., MaxCover $(\Gamma^{\prime})\leq 1-\Delta(C)$ .

Next we use this technique to prove strong inapproximability results of $k$ -MaxCover based on W[1] $\neq$ FPT and ETH.

Theorem 3.10 (Theorem 4.3 in [KN21] ).

Assuming W[1] $\neq$ FPT, for any computable function $f$ , there is no $f(k)\cdot\text{poly}(n)$ time algorithm which can take a MaxCover instance $\Gamma$ with $k$ right super-nodes, distinguish between the following two cases:

•

(Completeness) MaxCover $(\Gamma)=1$ .
•

(Soundness) MaxCover $(\Gamma)\leq n^{-O(\frac{1}{\sqrt{k}})}$ .

Proof.

First reduce $k$ -Clique to MaxCover with $K=\binom{k}{2}$ right super-nodes and $k$ left super-nodes in the canonical way. Note that this MaxCover instance has pseudo projection property. Then take a Reed-Solomon Code to build the threshold graph. To ensure the right alphabet size (which is $|\Sigma|^{k}$ here) $\leq n$ , we need $|\Sigma|\leq n^{\frac{1}{k}}$ , and to ensure $|\Sigma|^{r}\geq\max_{i=1}^{t}|U_{i}|=n$ , we need $r\geq\log_{|\Sigma|}n=k$ . Thus according to the properties of Reed-Solomon Code, the soundness parameter is $1-\Delta(C)=1-(1-\frac{r}{|\Sigma|})=n^{-O(\frac{1}{k})}=n^{-O(\frac{1}{\sqrt{K}})}$ . ∎

Theorem 3.11 (Theorem 4.4 in [KN21] ).

Assuming ETH, there is no $n^{o(k)}$ time algorithm which can take a MaxCover instance $\Gamma$ with $k$ right super-nodes, distinguish between the following two cases:

•

(Completeness) MaxCover $(\Gamma)=1$ .
•

(Soundness) MaxCover $(\Gamma)\leq n^{-O(\frac{1}{k^{3}})}$ .

Proof.

First reduce 3SAT to MaxCover with $k$ right super-nodes and $t=\binom{k}{1}+\binom{k}{2}+\binom{k}{3}$ left super-nodes. Each right super-node corresponds to satisfying assignments of some $m/k$ clauses, and thus has $N=2^{\Theta(n/k)}$ vertices in it. Each left super-node corresponds to assignments to variables which appear in exactly some one/two/three groups of clauses. This MaxCover instance also has pseudo projection property. Note that here it’s necessary to group the variables to change the number of left super-nodes from $n$ to $\binom{k}{1}+\binom{k}{2}+\binom{k}{3}$ , because in our construction of threshold graph, the size of each new right super-node $B$ is $|\Sigma|^{t}$ , which is too large if $t=n$ . After that, we still use Reed-Solomon Codes. Now to ensure $|\Sigma|^{t}\leq N$ we need $|\Sigma|\leq 2^{n/k^{4}}$ , and to ensure $|\Sigma|^{r}\geq N$ we need $r\geq\log_{|\Sigma|}N=\Omega(k^{3})$ . The soundness parameter is $1-\Delta(C)=1-(1-\frac{r}{|\Sigma|})=\frac{k^{3}}{2^{n/k^{4}}}=N^{-O(\frac{1}{k^{3}})}$ . ∎

4 $k$ -Clique

Clique is arguably the first natural combinatorial optimization problem. Its inapproximability in the NP regime is studied extensively [BGLR93, BS94, Has96, FGL⁺96, Gol98, FK00, Zuc07]. However, in the parameterized perspective, although $k$ -Clique is known to be the complete problem of W[1], and even cannot be solved in $n^{o(k)}$ time assuming ETH [CHKX06], there is still a lot of work to do on its parameterized inapproximability.

To approximate $k$ -Clique to a factor of $\rho$ , we only need to compute a clique of size $k/\rho$ , which can be trivially done in $n^{k/\rho}$ time. In their milestone work, Chalermsook et al. [CCK⁺17] showed this cannot be improved assuming Gap-ETH. However, results based on non-gap assumptions are not reached until very recently Lin [Lin21] showed constant approximating $k$ -Clique is W[1]-hard. He also obtained an $n^{\Omega(\sqrt[5]{\log k})}$ lower bound for constant approximating $k$ -Clique based on ETH, and this bound was recently improved to $n^{\Omega(\log k)}$ by [LRSW21].

The following table lists current state-of-the-art inapproximability results of $k$ -Clique based on different hypotheses. Here $f$ can be any computable function.

Complexity Assumption	Inapproximability Ratio	Time Lower Bound	Reference
W[1] $\neq$ FPT	Any constant	$f(k)\cdot\text{poly}(n)$	[Lin21]
PIH	Any constant	$f(k)\cdot\text{poly}(n)$	/
ETH	Any constant	$f(k)\cdot n^{\Omega(\log k)}$	[LRSW21]
ETH	$k^{o(1)}$	$f(k)\cdot\text{poly}(n)$	[LRSW21]
Gap-ETH	$\rho=o(k)$	$f(k)\cdot n^{\Omega(k/\rho)}$	[CCK⁺17]

We shall notice that the colored version and uncolored version of $k$ -Clique are equivalent, because a colored version can be interpreted as an uncolored version by leaving each group an independent set, and we can make $k$ different copies of the original graph to transform an uncolored version to a colored version.

4.1 Reduction from MaxCover with Projection Property

The Gap-ETH hardness of $k$ -Clique directly follows from Theorem 3.3. Since the instance has projection property, two left vertices agree if and only if they map to the same vertex in each right super-node, and $k$ left vertices agree if and only if they pairwise agree. Thus, we can transform a MaxCover instance in Theorem 3.3 with $k$ left super-nodes to an $k$ -Clique instance with the same value.

Therefore, as pointed out in Theorem 3.3, even deciding if there is a clique of size $\geq r$ needs $n^{\Omega(r)}$ time. Here $r$ can be any $\omega(1)$ , which means approximating $k$ -Clique to any $\rho=k/r=o(k)$ ratio cannot be done in $f(k)\cdot n^{o(r)}=f(k)\cdot n^{o(k/\rho)}$ time.

Note the projection property is crucial, without which a MaxCover instance cannot be reduced to an $k$ -Clique instance, because the agreement test of $k$ left vertices cannot be decomposed locally to agreement tests of $\binom{k}{2}$ pairs of left vertices.

One interesting thing is that the optimal inapproximability result of $k$ -Clique can be obtained from hypotheses other than (but similar to) Gap-ETH, see the following as an example.

Theorem 4.1.

Given an undirected graph with $n$ groups of $O(1)$ vertices, each forming an independent set, if there exists a constant $\varepsilon>0$ such that distinguishing between the following cases cannot be done in $2^{o(n)}$ time:

•

(Completeness) there is a clique of size $n$ .
•

(Soundness) there is no clique of size $\varepsilon n$ .

then $k$ -Clique cannot be approximated to any $\rho=o(k)$ ratio in $f(k)\cdot n^{o(k/\rho)}$ time.

This assumption is a little weaker than Gap-ETH since it can be obtained through the canonical reduction from 3SAT to Clique.

The proof is almost the same as that in Theorem 3.3: just compose the groups using a disperser. Details are omitted here.

4.2 $k$ -VectorSum

Before introducing W[1]-hardness of constant approximating $k$ -Clique, we want to mention an important W[1]-complete problem, $k$ -VectorSum, which is used as an intermediate problem in the reduction in [Lin21].

In the $k$ -VectorSum problem, we are given $k$ groups of vectors $V_{1},\ldots,V_{k}\subseteq\mathbb{F}^{d}$ together with a target vector $\vec{t}\in\mathbb{F}^{d}$ , where $\mathbb{F}$ is some finite field and $d$ is the dimension of vectors. The goal is to decide whether there exists vectors $\vec{v}_{1}\in V_{1},\ldots,\vec{v}_{k}\in V_{k}$ such that $\sum_{i=1}^{k}\vec{v}_{i}=\vec{t}$ .

It’s easy to see $k$ -VectorSum with $|\mathbb{F}|=O(1)$ and $d=O(k\log n)$ is W[1]-hard, and even does not admit $n^{o(k)}$ time algorithms assuming ETH. The idea is to use entries of vectors to check the consistency in $k$ -Clique or 3SAT.

Theorem 4.2.

Assuming W[1] $\neq$ FPT, $k$ -VectorSum with $\mathbb{F}=\mathbb{F}_{2}$ and $d=\Theta(k\log n)$ can not be solved in $f(k)\cdot\text{poly}(n)$ time.

Proof.

Set $K=\binom{k}{2}$ groups of vectors, each representing valid edges between an $i$ -th block and a $j$ -th block of vertices in $k$ -Clique. For each $i\in[k]$ , we want to make sure that, the $(k-1)$ vertices chosen from the $i$ -th block are all the same one. Thus we need to do $k\cdot(k-2)$ equality checks, each on two $(\log n)$ -bit strings. In short, we exploit a new entry to do each bitwise equality check: set the entry to be the unchecked bit in the two vectors involved, and set the entry to be zero in all other vectors. Let the target vector to be $\vec{\mathbf{0}}$ . Thus all vectors sum up to zero in this entry if and only if the two to-be-checked bits are the same. The produced $K$ -VectorSum instance has parameter $K=\Theta(k^{2})$ and dimension $d=\Theta(k^{2}\log n)$ . ∎

Theorem 4.3.

Assuming ETH, $k$ -VectorSum with $\mathbb{F}=\mathbb{F}_{2}$ and $d=\Theta(k\log n)$ can not be solved in $n^{o(k)}$ time.

Proof.

Divide the clauses into $k$ equal-sized groups. Enumerate satisfying partial assignments of each group of clauses, then we want to check the consistency of those partial assignments. Note we can assume that each variable only appears in at most 3 clauses, thus we only need to do at most 2 pair-wise equality checks (between the first and the second appearances, and between the second and the third). The equality check step is the same as that in the proof of Theorem 4.2, and is omitted here.

The produced $k$ -VectorSum instance has size $N=2^{\Theta(n/k)}$ and the dimension is $d=O(n)=O(k\log N)$ . Assuming ETH, this can not be solved in $N^{o(k)}$ time. ∎

Note that the $\mathbb{F}_{2}$ can be replaced by any finite field, as long as we slightly adjust the value of an entry: both 0 if $x=0$ ; $c$ in the first appearance and $-c$ in the second appearance for some constant $c\neq 0$ if $x=1$ .

4.3 Gap Producing via Hadamard Codes

In this subsection we briefly introduce how Lin [Lin21] rules out constant FPT-approximations of $k$ -Clique under W[1] $\neq$ FPT.

The most essential step in the reduction is to combine $k$ -VectorSum, whose W[1]-hardness was shown in Theorem 4.2, with Hadamard codes to create a gap. The technique is very similar to which people used in proving weakened PCP theorem, namely, in proving NP $\subseteq$ PCP $(\text{poly(n),1})$ [ALM⁺98].

Here follows the definition of Hadamard Code, and its two important properties.

Definition 4.4 (Walsh-Hadamard Code).

For two strings $x,y\in\{0,1\}^{n}$ , define $\langle x,y\rangle=\sum_{i=1}^{n}x_{i}y_{i}\pmod{2}$ . The Walsh-Hadamard Code is the function $f:\{0,1\}^{n}\to\{0,1\}^{2^{n}}$ that maps every string $x\in\{0,1\}^{n}$ into the string $z\in\{0,1\}^{2^{n}}$ such that $z_{y}=\langle x,y\rangle$ for every $y\in\{0,1\}^{n}$ .

1.

(Linearity Testing, [BLR93]) Let $g$ be a function mapping from $\{0,1\}^{n}$ to $\{0,1\}$ . If it can pass $(1-\delta)$ fraction of tests $g(x)+g(x^{\prime})=g(x+x^{\prime}),x,x^{\prime}\in\{0,1\}^{n}$ , then $g$ is at least $(1-\delta)$ -close to a true linear function $\tilde{g}:\{0,1\}^{n}\to\{0,1\}$ , which can also be parsed as a Hadamard codeword. By setting $0\leq\delta<\frac{1}{4}$ , this codeword is unique since the relative distance of Hadamard Code is exactly $\frac{1}{2}$ .
2.

(Locally Decodable Property) Suppose $g$ is $(1-\delta)$ -close to a Hadamard codeword $f(x)$ for some $x\in\{0,1\}^{n}$ , then for any $y\in\{0,1\}^{n}$ , we can recover $(f(x))_{y}$ probabilistically by querying only two positions of $g$ . Namely, sample $y^{\prime}\in\{0,1\}^{n}$ uniformly at random, and output $g(y+y^{\prime})+g(y^{\prime})$ . This succeeds with probability at least $1-2\delta$ by a simple union bound.

Given an $k$ -VectorSum instance $(V_{1},\ldots,V_{k},\vec{t})$ , we build a CSP instance on variable set $X=\{x_{\vec{a}_{1},\ldots,\vec{a}_{k}}:\vec{a}_{1},\ldots,\vec{a}_{k}\in\mathbb{F}^{d}\}$ . Let $\vec{v}_{1}\in V_{1},\ldots,\vec{v}_{k}\in V_{k}$ be a solution that sums up to $\vec{t}$ in the yes case, each variable $x_{\vec{a}_{1},\ldots,\vec{a}_{k}}$ is supposed to take the value $\sum_{i\in[k]}\langle\vec{a}_{i},\vec{v}_{i}\rangle$ . Here we are actually concatenating the $k$ solution vectors into one long vector $\vec{v}\in\mathbb{F}^{kd}$ , and the concatenation of variables is supposed to be the Hadamard codeword of $\vec{v}$ .

There are three types of tests we want to make:

•

(T1) $\forall\vec{a}_{1},\ldots,\vec{a}_{k},\vec{b}_{1},\ldots\vec{b}_{k}\in\mathbb{F}^{d}$ , test whether $x_{\vec{a}_{1},\ldots,\vec{a}_{k}}+x_{\vec{b}_{1},\ldots\vec{b}_{k}}=x_{\vec{a}_{1}+\vec{b}_{1},\ldots,\vec{a}_{k}+\vec{b}_{k}}$ .
•

(T2) $\forall i\in[k],\forall\vec{a}_{1},\ldots,\vec{a}_{k},\vec{a}\in\mathbb{F}^{d}$ , test whether $x_{\vec{a}_{1},\ldots,\vec{a}_{i}+\vec{a},\ldots,\vec{a}_{k}}-x_{\vec{a}_{1},\ldots,\vec{a}_{k}}=\langle\vec{a},\vec{v}\rangle$ for some $v\in V_{i}$ .
•

(T3) $\forall\vec{a}_{1},\ldots,\vec{a}_{k},\vec{a}\in\mathbb{F}^{d}$ , test whether $x_{\vec{a}_{1}+\vec{a},\ldots,\vec{a}_{k}+\vec{a}}-x_{\vec{a}_{1},\ldots,\vec{a}_{k}}=\langle\vec{a},\vec{t}\rangle$ .

By linearity testing, if an assignment to $X$ satisfies $(1-\delta)$ -fraction of constraints in (T1), then $X$ is at least $(1-\delta)$ -close to a true Hadamard codeword $f(\vec{u})$ , $\vec{u}\in\mathbb{F}^{kd}$ . The constraints (T2) and (T3) use locally decodable properties to recover $f(\vec{u})_{(\vec{0},\ldots,\vec{a},\ldots,\vec{0})},f(\vec{u})_{(\vec{a},\ldots,\vec{a})}$ , and use them to check whether $\vec{u}$ indeed indicates a satisfying solution of our $k$ -VectorSum instance.

After that, we use a slightly modified FGLSS reduction to build an $2^{\text{poly}(k)\cdot d}$ -clique instance. We build a group for each variable indicating its possible values, and a group for each (T1) test. A variable vertex and a test vertex are linked either if they are consistent, or the test is irrelevant to the variable. Two test vertices are linked in a similar way accordingly. Two variable vertices are linked either if the values specified by them pass the (T2) and (T3) tests, or there are no such tests between them. Therefore, if there is a clique whose size is at least $(1-\varepsilon)$ times the maximum, the following conditions hold:

1.

A constant fraction of (T1) tests are passed.
2.

For a constant fraction of variables, all (T2) and (T3) tests between them are passed.

This completes the reduction.

There are still some technical details. For example, in this reduction the number of vertices is $2^{\text{poly}(k)\cdot d}=n^{\text{poly}(k)}$ , which is too large. [Lin21] sampled some random matrices to handle this. Details are omitted here.

5 $k$ -SetCover

SetCover, which is equivalent to the Dominating Set problem, is one of the fundamental problems in computational complexity. A simple greedy algorithm yield a $(\ln n)$ -approximation of this problem. On the opposite side, it was shown that $(1-\varepsilon)\ln n$ -approximation for this problem is NP-hard for every $\varepsilon>0$ [DS14]. Thus, its approximability in the NP regime has been completely settled.

In the parameterized regime, $k$ -SetCover is the complete problem of W[2], and does not admit $n^{o(k)}$ algorithms assuming ETH [CHKX06], not even $n^{k-\varepsilon}$ algorithms assuming SETH [PW10]. Hardness of approximation of $k$ -SetCover in FPT time was studied by [CL19, CCK⁺17, KLM19, Lin19], and currently based on Gap-ETH, ETH, W[1] $\neq$ FPT or $k$ -SUM Hypothesis, $k$ -SetCover is hard to approximate to within a $(\log n)^{\frac{1}{\text{poly}(k)}}$ factor. In one direction, we wonder if this $(\log n)^{\frac{1}{\text{poly}(k)}}$ can be further improved to $\log n$ , or it is already tight. In the other direction, it is also worth questioning whether the total FPT inapproximability of $k$ -SetCover can be based on the weaker assumption W[2] $\neq$ FPT.

Current state-of-the-art inapproximability results of $k$ -SetCover are reached by two contrasting methods, namely, Distributed PCP Framework [KLM19] and Threshold Graph Composition [Lin19], which we will introduce in Section 5.3 and 5.4, respectively. See the following table for an overview of results.

Complexity Assumption	Inapproximability Ratio	Time Lower Bound	Reference
W[1] $\neq$ FPT	$(\log n)^{\varepsilon(k)=o(1)}$	$f(k)\cdot\text{poly}(n)$	[Lin19]
W[1] $\neq$ FPT	$(\log n)^{1/\text{poly}(k)}$	$f(k)\cdot\text{poly}(n)$	[KLM19]
ETH	$\left(\frac{\log n}{\log\log n}\right)^{1/k}$	$f(k)\cdot n^{\Omega(k)}$	[Lin19]
ETH	$(\log n)^{1/\text{poly}(k)}$	$f(k)\cdot n^{\Omega(k)}$	[KLM19]
SETH	$\left(\frac{\log n}{\log\log n}\right)^{1/k}$	$f(k)\cdot n^{k-\varepsilon}$	[Lin19]
SETH	$(\log n)^{1/\text{poly}(k)}$	$f(k)\cdot n^{k-\varepsilon}$	[KLM19]
$k$ -SUM Hypothesis	$\left(\frac{\log n}{\log\log n}\right)^{1/k}$	$f(k)\cdot n^{\lceil k/2\rceil-\varepsilon}$	[Lin19]
$k$ -SUM Hypothesis	$(\log n)^{1/\text{poly}(k)}$	$f(k)\cdot n^{\lceil k/2\rceil-\varepsilon}$	[KLM19]

As for the coloring, the colored version and uncolored version of (exact) $k$ -SetCover are also equivalent, since we can add $k$ elements in the universe to ensure there is a set from each group, to reduce a colored version to an uncolored version. In the other direction, taking $k$ copies of the sets also works, because choosing replicated sets does not contribute. In the approximating sense, since it is a minimization problem, in the soundness case when solutions of size $\leq g(k)\cdot k$ are ruled out, it always means we can’t find such many sets to cover the universe even when choosing from the same set is allowed. Thus there is no need to specify the coloring.

5.1 Hypercube Partition System

We first introduce hypercube partition system, which is a powerful tool used in the reduction from MinLabel to $k$ -SetCover [CCK⁺17, KLM19], and in creating the gap instance of $k$ -SetCover [Lin19].

Definition 5.1 (Hypercube Partition System).

The $(\kappa,\rho)$ -hypercube partition system consists of the universe $\mathcal{M}$ and a collection of subset $\{P_{x,y}\}_{x\in[\rho],y\in[\kappa]}$ where $\mathcal{M}=[\kappa]^{\rho}$ and $P_{x,y}=\{z\in\mathcal{M}:z_{x}=y\}$ .

The universe $\mathcal{M}$ consists of all functions from $[\rho]$ to $[\kappa]$ , and is of size $\kappa^{\rho}$ . Each subset $P_{x,y}$ ( $x\in[\rho],y\in[\kappa]$ ) consists of all functions mapping $x$ to $y$ . It can be observed that one can cover the universe by picking all $\kappa$ subsets from some row $x\in[\rho]$ , and this is the only way to cover the universe. In other words, even if we include $\kappa-1$ subsets from every row, it is not possible to cover the universe.

5.2 Reduction from MinLabel

Now we show how to reduce MinLabel to SetCover. This reduction preserves gap, but significantly increases the instance size.

Given a MinLabel instance $\Gamma$ with $\ell$ left super-nodes $U_{1},\ldots,U_{\ell}$ and $h$ right super-nodes $W_{1},\ldots,W_{h}$ , we build a SetCover instance as follows. Take $\ell$ different copies of $(h,|\Sigma_{U}|)$ -hypercube partition system and set the universe to be the union of $\ell$ universes. Each set in SetCover corresponds to a right vertex in MinLabel. For a set $S_{v}$ associated to a right vertex $v\in W_{j}$ and for a left vertex $u\in U_{i}$ , if there is an edge $(u,v)$ , we include $P_{u,j}$ in set $S_{v}$ .

In order to see there is a one-to-one mapping from a solution of $\Gamma$ to a solution of the new SetCover instance, note that for a left vertex $u\in U_{i}$ , if a right multi-labeling covers $u$ , by picking corresponding sets we have $P_{u,j}$ for all $j\in[h]$ . Moreover, the only way to cover the universe is to include all $P_{u,j}$ for some row indexed by $u$ , so a valid SetCover solution must contain sets associated to at least one neighbor in each right super-nodes for some specific left vertex $u$ . The same argument applies to each of the $\ell$ left super-nodes, because we have a different copy of the hypercube partition system for each of them.

One important thing about this reduction is that the instance size is blowed up to $\ell\cdot h^{|\Sigma_{U}|}$ , where $h$ is the solution size of yes instance (and also the parameter of $k$ -SetCover). Thus, in order to make it $\leq f(k)n^{O(1)}$ , the left alphabet size can not exceed $\frac{\log n}{\log k}$ . Following the hardness of MinLabel (Theorem 3.7) and letting $1/\gamma=(\log n)^{O(1)}$ , $k$ -SetCover is hard to approximate to a $(\log n)^{O(\frac{1}{k})}$ factor assuming Gap-ETH.

5.3 Gap Producing via Distributed PCP

In this section we introduce the Distributed PCP Framework. This framework was first proposed by Abbound et al. [ARW17], and later used by Karthik et al. to rule out FPT approximation algorithms for $k$ -SetCover [KLM19]. The interesting part of [KLM19] is to obtain hardness of MaxCover with specific parameters, while hardness of MinLabel and $k$ -SetCover directly follow from reductions in [CCK⁺17] (see Theorem 3.7 and Section 5.2 respectively).

At a high level, in this framework, one first rewrites the problem related to the hypothesis as a communication problem, then derives a Simultaneous Message Protocol for this problem and extracts an instance of MaxCover from the transcript of the protocol.

Due to space limitations, we only introduce their W[1] and ETH results to give an overview of their methods. The ideas in SETH and $k$ -SUM Hypothesis results are similar, except that they involve more complicated error correcting codes and protocols.

The similarity between W[1] $\neq$ FPT and ETH is that they both often involve agreement tests (in other words, equality tests). Starting from W[1] $\neq$ FPT, one may want to set $K=\binom{k}{2}$ groups, each containing $N=|E|$ elements representing valid edges. The goal is to pick an edge from each group such that the label of end points (which is of $\log N$ bits length) are consistent. W[1] $\neq$ FPT states that this problem does not admit an $f(K)N^{O(1)}$ algorithm. Similarly, starting from ETH, one may also want to divide the clauses into $K$ groups, each containing at most $N=2^{\Theta(n/K)}$ partial satisfying assignments for those clauses. The goal is also to pick an assignment from each group such that the values on each variables (which is of $n=\Theta(K\log N)$ bits length in total) are consistent. ETH states that this cannot be done in $N^{o(K)}$ time.

We can think of the agreement test problem as a communication problem: there are $K$ players, each receiving an element from the corresponding group. They want to collaborate nicely to decide whether their elements in hand “agree” or not. To achieve this, they use a specific communication protocol called Simultaneous Message Protocol, which was introduced by Yao [Yao79]:

Definition 5.2 (Simultaneous Message Protocol).

We say $\pi$ is a $(r,\ell,s)$ -efficient protocol if the following holds:

•
The protocol is one-round with public randomness. The following actions happen sequentially:
1. 1.
  
  The players receive their inputs.
2. 2.
  
  The players and the referee jointly toss $r$ random coins.
3. 3.
  
  Each player on seeing the randomness deterministically sends an $\ell$ -bit message to the referee.
4. 4.
  
  Based on the randomness and the $K\cdot\ell$ bits sent from the players, the referee outputs accept or reject.
•
The protocol has completeness 1 and soundness $s$ , i.e.,
- –
  
  If their inputs indeed agree, then the referee always accepts regardless of randomness.
- –
  
  Otherwise, the referee accepts with probability at most $s$ .

The full version of SMP protocol also admits $w$ bits of advice. In the contexts of W[1] $\neq$ FPT and ETH here, we do not need this.

Note that by repeating the protocol $z$ times, each time using fresh randomness, we can derive a $(z\cdot r,z\cdot\ell,s^{z})$ -efficient protocol from an $(r,\ell,s)$ -efficient protocol.

Next we will see how to use MaxCover to simulate an SMP protocol. Then the hardness of the starting problem ( $k$ -Clique or 3SAT) and the existence of an efficient protocol for it will lead to hardness of MaxCover.

Theorem 5.3 (Theorem 5.2 in [KLM19], slightly simplified).

An instance $\Pi$ of a ( $k$ -Clique or 3SAT) problem which admits an $(r,\ell,s)$ -efficient $K$ -player SMP protocol can be reduced to a MaxCover instance $\Gamma$ as follows:

•

The reduction runs in time $2^{r+K\cdot\ell}\cdot\text{poly}(N,K)$ .
•

$\Gamma$ has $K$ right super-nodes of size at most $N$ each.
•

$\Gamma$ has $2^{r}$ left super-nodes of size at most $2^{K\cdot\ell}$ each.
•

If $\Pi$ is a YES instance, then MaxCover $=1$ .
•

If $\Pi$ is a NO instasnce, then MaxCover $\leq s$ .

Proof.

Here is the construction:

•

Each right super-node corresponds to the group which a player receives an input from.
•

Each left super-node corresponds to a random string $\gamma\in\{0,1\}^{r}$ . The left super-node $U_{\gamma}$ contains one node for each possible accepting messages from the $K$ players, i.e., each vertex in $U_{\gamma}$ corresponds to $(m_{1},\ldots,m_{K})\in(\{0,1\}^{\ell})^{K}$ where in the protocol the referee accepts on seeing randomness $\gamma$ and messages $(m_{1},\ldots,m_{K})$ .
•

We add an edge between a right vertex $x\in W_{j}$ and a left vertex $(m_{1},\ldots,m_{K})\in U_{\gamma}$ if $m_{j}$ is equal to the message that player $j$ sends on an input $x$ and randomness $\gamma$ in the protocol.

Detailed proofs of the desired properties are omitted here since they are rather straightforward. ∎

In the last step, we need to derive an efficient SMP protocol for $k$ -Clique and 3SAT. Directly sending their inputs (which is of $\geq\log N$ bits length) does not seem to be a good idea because it would make the size of MaxCover to be $2^{r+K\cdot\ell}=2^{\Omega(K)\log N}=N^{\Omega(K)}$ , which is too large. What’s more, since the further reduction from MaxCover to MinLabel then to $k$ -SetCover will introduce a $K^{|\Sigma_{U}|}$ blow-up, we even need $|\Sigma_{U}|=2^{K\cdot\ell}\leq\frac{\log N}{\log K}$ .

Fortunately, this can be done via a simple error correcting code called good code which has constant rate and constant relative distance. To check the consistency of their inputs, we only need to check the equality of one random bit of their codes. The randomness is used to specify the index of that bit. This way we can have $r=O(\log\log N),\ell=O(1),s=O(1)$ . Within the bound that $\ell\leq\frac{1}{K}(\log\log N-\log\log K)$ , we can repeat the protocol $z=O(\frac{1}{K}\log\log N)$ times, leading to a soundness parameter $s=O(1)^{z}=O((\log N)^{\frac{1}{K}})$ . After the reduction to MinLabel, the $\varepsilon$ in gap would become $\varepsilon^{\frac{1}{K}}$ .

Along such a long way we finally reach a $(\log n)^{\frac{1}{\text{poly}(k)}}$ inapproximability ratio for $k$ -SetCover with FPT time lower bound under W[1] $\neq$ FPT and $n^{o(k)}$ time lower bound under ETH.

5.4 Gap Producing via Threshold Graph Composition

In this subsection we introduce how to obtain $\left(\frac{\log n}{\log\log n}\right)^{1/k}$ inapproximability of $k$ -SetCover via threshold graph composition technique.

In general, this technique transforms an $k$ -SetCover instance with small universe (typically $|U|=\Theta(\log n)$ ) still to an instance of $k$ -SetCover, increasing the size of universe to $O(n)$ while creating a gap. In the YES case, the number of sets needed to cover the universe is still $k$ , but in the NO case it becomes $h\gg k$ , where $h$ is determined by the threshold graph.

Given an $k$ -SetCover instance $\Gamma=(\mathcal{S},U)$ , we need a bipartite threshold graph $G=(A\dot{\cup}B,E)$ with the following properties:

1.

$A$ is not divided. $B$ is divided into $\ell$ groups: $B=B_{1}\dot{\cup}\ldots\dot{\cup}B_{\ell}$ , where $\ell$ is arbitrary.
2.

$|A|=n,|B_{i}|\leq\frac{\log n}{\log|U|},\forall i\in[\ell]$ .
3.

For any $k$ vertices $a_{1},\ldots,a_{k}\in A$ and for any $i\in[\ell]$ , there is at least one common neighbor of $\{a_{1},\ldots,a_{k}\}$ in $B_{i}$ .
4.

For any $X\subseteq A$ , if for any $i\in[\ell]$ , there is at least one vertex in $B_{i}$ , which is a common neighbor of at least $k+1$ vertices in $X$ , then $|X|\geq h$ .

We compose the original $k$ -SetCover instance $\Gamma=(\mathcal{S},U)$ with this threshold graph $G$ to produce an instance $\Gamma^{\prime}=(\mathcal{S}^{\prime},U^{\prime})$ as follows.

•

$|\mathcal{S}^{\prime}|=|\mathcal{S}|$ . For each $S\in\mathcal{S}$ we associate a new set $S^{\prime}\in\mathcal{S}^{\prime}$ . We also associate a vertex in $A$ (the left side of the threshold graph) to each set $S^{\prime}\in\mathcal{S}^{\prime}$ .
•

$\Gamma$ consists of $\ell$ hypercube partition systems, one for each $B_{i}$ (the right side of the threshold graph). The $i$ -th partition system has $|B_{i}|$ rows and $|U|$ columns, thus is of size $|U|^{|B_{i}|}$ .
•
For any $i\in[\ell],x\in B_{i},y\in U$ , subset $P_{x,y}$ is included in a set $S^{\prime}\in\mathcal{S}^{\prime}$ if and only if:
1. 1.
  
  $y\in S$ , i.e., set $S$ can cover $y$ in the original instance $\Gamma$ .
2. 2.
  
  There is an edge between the vertex associated to $S^{\prime}$ and vertex $x$ in the threshold graph $G$ .

As shown above, each row in the partition system corresponds to a vertex in $B_{i}$ , and each column corresponds to an element in $U$ . According to the properties of partition system, in order to cover the universe, we must pick all $|U|$ subsets in some specific row. This, together with our construction, means there is a vertex $x\in B_{i}$ such that sets correspond to its neighbors in $G$ can cover $U$ . Since the $\ell$ hypercube partition systems are independent, this holds for each $B_{i},i\in[\ell]$ .

In the YES case, $k$ sets are enough to cover $U$ , and there is at least one common neighbor of them in every $B_{i},i\in[\ell]$ . Thus the answer to the new instance is still $k$ .

In the NO case, at least $k+1$ sets are need to cover $U$ . Consider any solution $X\subseteq\mathcal{S}^{\prime}$ of $\Gamma^{\prime}$ , we know that in each group $B_{i}$ , there is a vertex $x\in B_{i}$ such that $|\mathcal{N}(x)\cap X|\geq k+1$ (because they form a solution in $\Gamma$ ). Therefore, by property 4 of threshold graph, $|X|\geq h$ as desired.

The last step is to construct a threshold graph the desired properties. [Lin19] used a specific combinatorial object called universal sets to construct it. However, the graph in Section 3.2 with proper parameters also suffices, and is simpler.

The required property is closely related to the collision property in the construction of Section 3.2. In fact, the gap $h$ here is just the collision number $Col(C)$ of the error correcting code. Thus, we want codes with large collision numbers.

Let the alphabet of the code be $\Sigma$ , then it’s easy to see the collision number cannot be larger than $|\Sigma|+1$ , since such many strings must collide in every position by pigeon principle. However, this upper bound can be reached by codes constructed from perfect hash family.

Definition 5.4 (Perfect Hash Family).

For every $N,\ell\in\mathbb{N}$ and $\Sigma$ , we say that $H:=\{h_{i}:[N]\to\Sigma|i\in[\ell]\}$ is a $[N,\ell]_{|\Sigma|}$ -perfect hash family if for every subset $T\subseteq[N]$ of size $|\Sigma|$ , there exists an $i\in[\ell]$ such that

\forall x,y\in T,x\neq y,h_{i}(x)\neq h_{i}(y)

Think of an $[N,\ell]_{|\Sigma|}$ -perfect hash family as an $N\times\ell$ matrix, where each column represents a hash function. Then the property is that for any $\leq|\Sigma|$ rows, there is a column with no collisions (on these rows). Regard each row as a codeword of an error correcting code from $\Sigma^{\log_{|\Sigma|}N}\to\Sigma^{\ell}$ , then the collision number of this error correcting code is $>|\Sigma|$ , thus exactly $|\Sigma|+1$ as desired.

Lemma 5.5 (Alon et al. [AYZ95] ).

For every $N,|\Sigma|\in\mathbb{N}$ there is a $[N,2^{O(|\Sigma|)}\cdot\log N]_{|\Sigma|}$ -perfect hash family that can be computed in $\tilde{O}_{|\Sigma|}(N)$ time.

In this threshold graph the size of a right super-node $B_{i}$ is $|\Sigma|^{k}$ . Assuming $|U|=O(\log n)$ , to make $|B_{i}|\leq\frac{\log n}{\log|U|}=\frac{\log n}{\log\log n}$ , we need $|\Sigma|\leq\left(\frac{\log n}{\log\log n}\right)^{1/k}$ and this is the best gap possible. By the above lemma, the perfect hash family (and the corresponding code) can be constructed efficiently.

Our last step is to show that, assuming different hypothesis, $k$ -SetCover with universe size $\leq\log n$ is still hard. The reduction from 3SAT is straightforward: divide the variables into $k$ groups of size $n/k$ each, a group of sets corresponds to $N=2^{n/k}$ possible assignments to those $n/k$ variables, and the universe is just the $m=\Theta(k\log N)$ clauses. ETH (respectively, SETH) asserts that this instance cannot be solved in $N^{o(k)}$ (respectively, $N^{k-\varepsilon}$ ) time. Thus by the above threshold graph composition technique, we reach $\left(\frac{\log n}{\log\log n}\right)^{1/k}$ -inapproximability of $k$ -SetCover based on ETH and SETH.

The reduction from $k$ -Clique to $k$ -SetCover is a bit more complicated, as stated in the following theorem. The main idea is from Karthik et al. [KLM19].

Theorem 5.6.

There is an $\text{poly}(n)$ time algorithm, which can reduce an $k$ -Clique instance to an $\binom{k}{2}$ -SetCover instance with universe size $\text{poly}(k)\log n$ .

Proof.

Sets in a group $(i,j)$ still represent edges between the $i$ -th block and the $j$ -th block (in the $k$ -Clique instance). In order to check the consistency of labels, we need $k\times\log n$ hypercube partition systems, one for each $(i,\ell),i\in[k],\ell\in[\log n]$ . The $(i,\ell)$ -th partition system is meant to check whether the $\ell$ -th bits of labels of vertices in block $i$ are all the same. In an invalid solution, one may pick an edge between the $i$ -th and $j$ -th blocks, and an edge between the $i$ -th and $j^{\prime}$ -th blocks, such that the two vertices (let them be $v_{1}$ and $v_{2}$ ) in the $i$ -th block are not the same. In such a case, $v_{1}$ and $v_{2}$ must differ in at least one bit, and thus cannot fulfill the requirements in the $(i,\ell)$ -th partition system where $\ell$ is the position of that bit.

Specifically, for all $i\in[k],\ell\in[\log n]$ , the $(i,\ell)$ -th partition system contains 2 rows and $(k-1)$ columns. The choices of rows represent the choices of the bit to be 0 or 1, and the columns test agreement of the $(k-1)$ labels (edges between the $i$ -th block and each remaining blocks). For an edge between $v\in V_{i}$ and $w\in V_{j}$ , we include $P_{v[\ell],j}$ into its set. At last, the $\binom{k}{2}$ -SetCover instance is the union of those $k\cdot\log n$ hypercube partition systems.

The instance size is $\text{poly}(n)$ since there are that many edges, while the universe size is $k\cdot\log n\cdot(k-1)^{2}=\text{poly}(k)\cdot\log n$ . ∎

The W[1]-hardness of $k$ -SetCover then follows. It is worth noting that in such FPT reductions, the parameter $k^{\prime}$ can be arbitrarily amplified as long as it is a function of $k$ . Assuming W[1] $\neq$ FPT, $k$ -SetCover is hard to approximate to within a factor of $\left(\frac{\log n}{\log\log n}\right)^{1/\binom{k}{2}}$ , then for any function $\varepsilon(k)$ which is $o(1)$ as $k$ goes to infinity, take large enough $k^{\prime}$ such that $\varepsilon(k^{\prime})<1/\binom{k}{2}$ , then for large enough $n$ we have $(\log n)^{\varepsilon(k^{\prime})}<\left(\frac{\log n}{\log\log n}\right)^{1/\binom{k}{2}}$ , which means $k^{\prime}$ -SetCover (by padding the parameter from $k$ to $k^{\prime}$ ) cannot be approximated to a factor of $(\log n)^{\varepsilon(k^{\prime})}$ in FPT time.

Instead of introducing their $k$ -SUM Hypothesis result here, we want to make some comments on this technique. Note that the maximum size of a right super-node in the threshold graph is $\frac{\log n}{\log|U|}$ . Thus when $|U|$ is not as small as $\log n$ , it may still be possible to obtain some inapproximability results. It remains a big open question that whether we can base total FPT inapproximability of $k$ -SetCover on W[2] $\neq$ FPT. If we can construct threshold graphs with a gap such that each right super-nodes consist of only $O(1)$ vertices, we can obtain W[2] hardness of $k$ -SetCover, respectively. Note that the construction in Section 3.2 does not suffice, because the size of their right super-nodes is $\Sigma^{k}$ , which is too large even if $|\Sigma|=O(1)$ .

Acknowledgements

I want to express my deep gratitude to Prof. Bingkai Lin, who brought me into the beautiful world of hardness of approximation, discussed with me regularly and guided me patiently. I would also like to thank my talented friends Yican Sun and Xiuhan Wang for their bright ideas and unreserved help. I really enjoy working with them.

References

[ALM⁺98] Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification and the hardness of approximation problems. J. ACM, 45(3):501–555, may 1998.
[ARW17] Amir Abboud, Aviad Rubinstein, and R. Ryan Williams. Distributed PCP theorems for hardness of approximation in P. In Chris Umans, editor, 58th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2017, Berkeley, CA, USA, October 15-17, 2017, pages 25–36. IEEE Computer Society, 2017.
[AYZ95] Noga Alon, Raphael Yuster, and Uri Zwick. Color-coding. J. ACM, 42(4):844–856, 1995.
[BBE⁺19] Arnab Bhattacharyya, Édouard Bonnet, László Egri, Suprovat Ghoshal, Karthik C. S., Bingkai Lin, Pasin Manurangsi, and Dániel Marx. Parameterized intractability of even set and shortest vector problem. Electron. Colloquium Comput. Complex., 26:115, 2019.
[BGKM18] Arnab Bhattacharyya, Suprovat Ghoshal, C. S. Karthik, and Pasin Manurangsi. Parameterized intractability of even set and shortest vector problem from gap-eth. Electron. Colloquium Comput. Complex., 25:57, 2018.
[BGLR93] Mihir Bellare, Shafi Goldwasser, Carsten Lund, and Alexander Russell. Efficient probabilistically checkable proofs and applications to approximations. In Proceedings of the twenty-fifth annual ACM symposium on Theory of computing, pages 294–304, 1993.
[BLR93] Manuel Blum, Michael Luby, and Ronitt Rubinfeld. Self-testing/correcting with applications to numerical problems. Journal of Computer and System Sciences, 47(3):549–595, 1993.
[BS94] Mihir Bellare and Madhu Sudan. Improved non-approximability results. In Proceedings of the twenty-sixth annual ACM symposium on Theory of computing, pages 184–193, 1994.
[CCK⁺17] Parinya Chalermsook, Marek Cygan, Guy Kortsarz, Bundit Laekhanukit, Pasin Manurangsi, Danupon Nanongkai, and Luca Trevisan. From gap-eth to fpt-inapproximability: Clique, dominating set, and more. In Chris Umans, editor, 58th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2017, Berkeley, CA, USA, October 15-17, 2017, pages 743–754. IEEE Computer Society, 2017.
[CHKX06] Jianer Chen, Xiuzhen Huang, Iyad A. Kanj, and Ge Xia. Strong computational lower bounds via parameterized complexity. J. Comput. Syst. Sci., 72(8):1346–1367, 2006.
[CL19] Yijia Chen and Bingkai Lin. The constant inapproximability of the parameterized dominating set problem. SIAM J. Comput., 48(2):513–533, 2019.
[CW89] Aviad Cohen and Avi Wigderson. Dispersers, deterministic amplification, and weak random sources. In 30th Annual Symposium on Foundations of Computer Science, pages 14–19. IEEE Computer Society, 1989.
[Din07] Irit Dinur. The PCP theorem by gap amplification. J. ACM, 54(3):12, 2007.
[Din16] Irit Dinur. Mildly exponential reduction from gap 3sat to polynomial-gap label-cover. Electron. Colloquium Comput. Complex., 23:128, 2016.
[DS14] Irit Dinur and David Steurer. Analytical approach to parallel repetition. In David B. Shmoys, editor, Symposium on Theory of Computing, STOC 2014, New York, NY, USA, May 31 - June 03, 2014, pages 624–633. ACM, 2014.
[FGL⁺96] Uriel Feige, Shafi Goldwasser, Laszlo Lovász, Shmuel Safra, and Mario Szegedy. Interactive proofs and the hardness of approximating cliques. Journal of the ACM (JACM), 43(2):268–292, 1996.
[FK00] Uriel Feige and Joe Kilian. Two-prover protocols—low error at affordable rates. SIAM Journal on Computing, 30(1):324–346, 2000.
[Gol98] Shafi Goldwasser. Introduction to special section on probabilistic proof systems. SIAM Journal on Computing, 27(3):737, 1998.
[Has96] Johan Hastad. Clique is hard to approximate within $n^{1-\epsilon}$ . In Proceedings of 37th Conference on Foundations of Computer Science, pages 627–636. IEEE, 1996.
[IP01] Russell Impagliazzo and Ramamohan Paturi. On the complexity of k-sat. Journal of Computer and System Sciences, 62:367–375, 2001.
[IPZ01] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? Journal of Computer and System Sciences, 63(4):512–530, 2001.
[KLM19] C. S. Karthik, Bundit Laekhanukit, and Pasin Manurangsi. On the parameterized complexity of approximating dominating set. J. ACM, 66(5):33:1–33:38, 2019.
[KN21] C. S. Karthik and Inbal Livni Navon. On hardness of approximation of parameterized set cover and label cover: Threshold graphs from error correcting codes. In Hung Viet Le and Valerie King, editors, 4th Symposium on Simplicity in Algorithms, SOSA 2021, Virtual Conference, January 11-12, 2021, pages 210–223. SIAM, 2021.
[Lin18] Bingkai Lin. The parameterized complexity of the k-biclique problem. J. ACM, 65(5):34:1–34:23, 2018.
[Lin19] Bingkai Lin. A simple gap-producing reduction for the parameterized set cover problem. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 81:1–81:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2019.
[Lin21] Bingkai Lin. Constant approximating k-clique is w[1]-hard. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 1749–1756. ACM, 2021.
[LRSW21] Bingkai Lin, Xuandi Ren, Yican Sun, and Xiuhan Wang. On lower bounds of approximating parameterized $k$ -clique, 2021.
[LRSZ20] Daniel Lokshtanov, M. S. Ramanujan, Saket Saurabh, and Meirav Zehavi. Parameterized complexity and approximability of directed odd cycle transversal. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 2181–2200. SIAM, 2020.
[MR17] Pasin Manurangsi and Prasad Raghavendra. A Birthday Repetition Theorem and Complexity of Approximating Dense CSPs. In Ioannis Chatzigiannakis, Piotr Indyk, Fabian Kuhn, and Anca Muscholl, editors, 44th International Colloquium on Automata, Languages, and Programming (ICALP 2017), volume 80 of Leibniz International Proceedings in Informatics (LIPIcs), pages 78:1–78:15, Dagstuhl, Germany, 2017. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
[PW10] Mihai Patrascu and Ryan Williams. On the possibility of faster SAT algorithms. In Moses Charikar, editor, Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2010, Austin, Texas, USA, January 17-19, 2010, pages 1065–1075. SIAM, 2010.
[Tov84] C. Tovey. A simplified np-complete satisfiability problem. Discret. Appl. Math., 8:85–89, 1984.
[Yao79] Andrew Chi-Chih Yao. Some complexity questions related to distributive computing (preliminary report). In Michael J. Fischer, Richard A. DeMillo, Nancy A. Lynch, Walter A. Burkhard, and Alfred V. Aho, editors, Proceedings of the 11h Annual ACM Symposium on Theory of Computing, April 30 - May 2, 1979, Atlanta, Georgia, USA, pages 209–213. ACM, 1979.
[Zuc96a] David Zuckerman. On unapproximable versions of np-complete problems. SIAM J. Comput., 25(6):1293–1304, 1996.
[Zuc96b] David Zuckerman. Simulating BPP using a general weak random source. Algorithmica, 16(4/5):367–391, 1996.
[Zuc07] David Zuckerman. Linear degree extractors and the inapproximability of max clique and chromatic number. Theory Comput., 3(1):103–128, 2007.

A Survey on Parameterized Inapproximability: kk-Clique, kk-SetCover, and More

Abstract

1 Introduction

1.1 Organization of the Survey

2 Preliminaries

2.1 FPT Approximation

2.2 Problems

2.3 Hypotheses

Hypothesis 2.1 (W[1]≠\neqFPT).

Hypothesis 2.2 (W[2]≠\neqFPT).

Hypothesis 2.3 (Exponential Time Hypothesis (ETH)[IP01, IPZ01, Tov84]).

Hypothesis 2.4 (Gap Exponential Time Hypothesis (Gap-ETH) [Din16, MR17]).

Hypothesis 2.5 (Strong Exponential Time Hypothesis (SETH) [IP01, IPZ01]).

Hypothesis 2.6 (Parameterized Inapproximability Hypothesis (PIH) [LRSZ20]).

3 MaxCover and MinLabel

Theorem 3.1.

Proof.

3.1 Hardness Results Based on Gap-ETH

Theorem 3.2.

Theorem 3.3 (Theorem 4.2 in [CCK+17]).

Definition 3.4 (Disperser[CW89, Zuc96a, Zuc96b]).

Lemma 3.5.

Theorem 3.6 (Theorem 4.3 in [CCK+17]).

Theorem 3.7 (Theorem 4.4 in [CCK+17] ).

3.2 Gap Producing via Threshold Graph Composition

Definition 3.8 (Error Correcting Codes).

Definition 3.9 (Collision Number).

Theorem 3.10 (Theorem 4.3 in [KN21] ).

Proof.

Theorem 3.11 (Theorem 4.4 in [KN21] ).

Proof.

4 kk-Clique

4.1 Reduction from MaxCover with Projection Property

Theorem 4.1.

4.2 kk-VectorSum

Theorem 4.2.

Proof.

Theorem 4.3.

Proof.

4.3 Gap Producing via Hadamard Codes

Definition 4.4 (Walsh-Hadamard Code).

5 kk-SetCover

5.1 Hypercube Partition System

Definition 5.1 (Hypercube Partition System).

5.2 Reduction from MinLabel

5.3 Gap Producing via Distributed PCP

Definition 5.2 (Simultaneous Message Protocol).

Theorem 5.3 (Theorem 5.2 in [KLM19], slightly simplified).

Proof.

5.4 Gap Producing via Threshold Graph Composition

Definition 5.4 (Perfect Hash Family).

Lemma 5.5 (Alon et al. [AYZ95] ).

Theorem 5.6.

Proof.

References

A Survey on Parameterized Inapproximability: $k$ -Clique, $k$ -SetCover, and More

Hypothesis 2.1 (W[1] $\neq$ FPT).

Hypothesis 2.2 (W[2] $\neq$ FPT).

Theorem 3.3 (Theorem 4.2 in [CCK⁺17]).

Theorem 3.6 (Theorem 4.3 in [CCK⁺17]).

Theorem 3.7 (Theorem 4.4 in [CCK⁺17] ).

4 $k$ -Clique

4.2 $k$ -VectorSum

5 $k$ -SetCover