\newaliascnt

proptheorem \aliascntresettheprop \newaliascntlemmatheorem \aliascntresetthelemma \newaliascntobservationtheorem \aliascntresettheobservation \newaliascntcorollarytheorem \aliascntresetthecorollary \newaliascntconjecturetheorem \aliascntresettheconjecture \newaliascntclaimtheorem \aliascntresettheclaim

A Tight Approximation Algorithm for the
Cluster Vertex Deletion Problem

Manuel Aprile , Matthew Drescher , Samuel Fiorini and Tony Huynh
Département de Mathématique
Université libre de Bruxelles
Brussels, Belgium [email protected], [email protected], [email protected]
School of Mathematics
Monash University
Melbourne, Australia [email protected]

Abstract.

We give the first $2$ -approximation algorithm for the cluster vertex deletion problem. This is tight, since approximating the problem within any constant factor smaller than $2$ is UGC-hard. Our algorithm combines the previous approaches, based on the local ratio technique and the management of true twins, with a novel construction of a “good” cost function on the vertices at distance at most $2$ from any vertex of the input graph.

As an additional contribution, we also study cluster vertex deletion from the polyhedral perspective, where we prove almost matching upper and lower bounds on how well linear programming relaxations can approximate the problem.

Key words and phrases:

Approximation algorithm and Cluster vertex deletion and Linear programming relaxation and Sherali-Adams hierarchy.

This project was supported by ERC Consolidator Grant 615640-ForEFront. Samuel Fiorini and Manuel Aprile are also supported by FNRS grant T008720F-35293308-BD-OCP. Tony Huynh is also supported by the Australian Research Council.

1. Introduction

A cluster graph is a graph that is a disjoint union of complete graphs. Let $G$ be any graph. A set $X\subseteq V(G)$ is called a hitting set if $G-X$ is a cluster graph. Given a graph $G$ and (vertex) cost function $c:V(G)\to\mathbb{Q}_{\geqslant 0}$ , the cluster vertex deletion problem (Cluster-VD) asks to find a hitting set $X$ whose cost $c(X):=\sum_{v\in X}c(v)$ is minimum. We denote by $\operatorname{\mathrm{OPT}}(G,c)$ the minimum cost of a hitting set.

If $G$ and $H$ are two graphs, we say that $G$ contains $H$ if some induced subgraph of $G$ is isomorphic to $H$ . Otherwise, $G$ is said to be $H$ -free. Denoting by $P_{k}$ the path on $k$ vertices, we easily see that a graph is a cluster graph if and only if it is $P_{3}$ -free. Hence, $X\subseteq V(G)$ is a hitting set if and only if $X$ contains a vertex from each induced $P_{3}$ .

Cluster-VD has applications in graph modeled data clustering in which an unknown set of samples may be contaminated. An optimal solution for Cluster-VD can recover a clustered data model, retaining as much of the original data as possible [24]. Vertex deletion problems such as Cluster-VD, where one seeks to locate vertices whose removal leaves a graph with desirable properties, often arise when measuring robustness and attack tolerance of real-life networks [1, 4, 25].

From what precedes, Cluster-VD is a hitting set problem in a $3$ -uniform hypergraph, and as such has a “textbook” $3$ -approximation algorithm (see for instance the introduction of [18]). Moreover, the problem has an approximation-preserving reduction from Vertex Cover. By adding a pendant edge to each vertex of $G$ , one checks that solving Cluster-VD on the new graph is equivalent to solving Vertex Cover on the original graph (see Proposition 4 for more details). Hence, obtaining a $(2-\varepsilon)$ -approximation algorithm for some $\varepsilon>0$ would contradict either the Unique Games Conjecture or P $\neq$ NP.

The first non-trivial approximation algorithm for Cluster-VD was a $5/2$ -approximation due to You, Wang and Cao [39]. Shortly afterward, Fiorini, Joret and Schaudt gave a $7/3$ -approximation [18], and subsequently a $9/4$ -approximation [19].

1.1. Our contribution

In this paper, we close the gap between $2$ and $9/4=2.25$ and prove the following tight result.

Theorem 1.

Cluster-VD has a $2$ -approximation algorithm.

All previous approximation algorithms for Cluster-VD are based on the local ratio technique. See the survey of Bar-Yehuda, Bendel, Freund, and Rawitz [22] for background on this standard algorithmic technique. Our algorithm is no exception, see Algorithm 1 below. However, it significantly differs from previous algorithms in its crucial step (see Step 14 below). In fact, almost all our efforts in this paper focus on that particular step of the algorithm (see Theorem 2), which searches for a special type of induced subgraph of $G$ , which we now describe.

Let $H$ be an induced subgraph of $G$ , and let $c_{H}:V(H)\to\mathbb{Q}_{\geqslant 0}$ . The weighted graph $(H,c_{H})$ is said to be $\alpha$ -good in $G$ (for some factor $\alpha\geqslant 1$ ) if $c_{H}$ is not identically $0$ and $c_{H}(X\cap V(H))\leqslant\alpha\cdot\operatorname{\mathrm{OPT}}(H,c_{H})$ holds for every (inclusionwise) minimal hitting set $X$ of $G$ . We overload terminology and say that an induced subgraph $H$ is $\alpha$ -good in $G$ if there exists a cost function $c_{H}$ such that $(H,c_{H})$ is $\alpha$ -good in $G$ . We stress that the local cost function $c_{H}$ is defined obliviously of the global cost function $c:V(G)\to\mathbb{Q}_{\geqslant 0}$ .

A pair of vertices $u,u^{\prime}$ of $G$ are called twins¹¹1We warn the reader that, in other papers, twins are usually called true twins, whereas two vertices which have the same set of neighbours are called false twins. Since we have no need of false twins in this paper, we have chosen to use twins in place of true twins. if $uu^{\prime}\in E(G)$ , and for all $v\in V(G-u-u^{\prime})$ , $uv\in E(G)$ if and only if $u^{\prime}v\in E(G)$ . We say that $G$ is twin-free if $G$ has no twins. As in [18, 19], if $G$ has a pair of twins $u,u^{\prime}$ , then Cluster-VD admits an easy reduction step (see Steps 8–12). The idea is simply to add the cost of $u^{\prime}$ to that of $u$ and delete $u^{\prime}$ . This works since $u^{\prime}$ belongs to a minimal hitting set of $G$ if and only if $u$ does (see [18] for a complete proof). Therefore, when searching for $\alpha$ -good induced subgraphs, we may assume that $G$ is twin-free, which is crucial for our proofs.

Algorithm 1

\textsc{Cluster-VD-apx}(G,c)

(G,c)

a weighted graph

X

a minimal hitting set of

G

1: if

G

is a cluster graph then

X\leftarrow\varnothing

3: else if there exists

u\in V(G)

with

c(u)=0

then

G^{\prime}\leftarrow G-u

c^{\prime}(v)\leftarrow c(v)

for

v\in V(G^{\prime})

X^{\prime}\leftarrow\textsc{Cluster-VD-apx}(G^{\prime},c^{\prime})

X\leftarrow X^{\prime}

X^{\prime}

is a hitting set of

G

;

X\leftarrow X^{\prime}\cup\{u\}

otherwise

8: else if there exist twins

u,u^{\prime}\in V(G)

then

G^{\prime}\leftarrow G-u^{\prime}

10:

c^{\prime}(u)\leftarrow c(u)+c(u^{\prime})

;

c^{\prime}(v)\leftarrow c(v)

for

v\in V(G^{\prime}-u)

11:

X^{\prime}\leftarrow\textsc{Cluster-VD-apx}(G^{\prime},c^{\prime})

12:

X\leftarrow X^{\prime}

X^{\prime}

does not contain

u

;

X\leftarrow X^{\prime}\cup\{u^{\prime}\}

otherwise

13: else

14: find a weighted induced subgraph

(H,c_{H})

that is

2

-good in

G

15:

\lambda^{*}\leftarrow\max\{\lambda\mid\forall v\in V(H):c(v)-\lambda c_{H}(v)\geqslant 0\}

16:

G^{\prime}\leftarrow G

17:

c^{\prime}(v)\leftarrow c(v)-\lambda^{*}c_{H}(v)

for

v\in V(H)

;

c^{\prime}(v)\leftarrow c(v)

for

v\in V(G)\setminus V(H)

18:

X\leftarrow\textsc{Cluster-VD-apx}(G^{\prime},c^{\prime})

19: end if

20: return

X

We will use two methods to establish $\alpha$ -goodness of induced subgraphs. We say that $(H,c_{H})$ is strongly $\alpha$ -good if $c_{H}$ is not identically $0$ and $c_{H}(V(H))\leqslant\alpha\cdot\operatorname{\mathrm{OPT}}(H,c_{H})$ . Clearly, if $(H,c_{H})$ is strongly $\alpha$ -good then $(H,c_{H})$ is $\alpha$ -good in $G$ , for every graph $G$ which contains $H$ . We say that $H$ itself is strongly $\alpha$ -good if $(H,c_{H})$ is strongly $\alpha$ -good for some cost function $c_{H}$ .

Let $N_{\leqslant i}[v]$ (resp. $N_{i}(v)$ ) be the set of vertices at distance at most (resp. equal to) $i$ from vertex $v$ . We abbreviate $N(v):=N_{1}(v)$ and $N[v]:=N_{\leqslant 1}[v]$ . If we cannot find a strongly $\alpha$ -good induced subgraph in $G$ , we will find an induced subgraph $H$ that has a special vertex $v_{0}$ such that $N[v_{0}]$ is entirely contained in $H$ , and a cost function $c_{H}:V(H)\to\mathbb{Z}_{\geqslant 0}$ such that $c_{H}(v)\geqslant 1$ for all vertices $v\in N[v_{0}]$ and $c_{H}(V(H))\leqslant\alpha\cdot\operatorname{\mathrm{OPT}}(H,c_{H})+1$ . Notice that no minimal hitting set $X$ can contain all the vertices of $N[v_{0}]$ , since if $X$ contains $N(v_{0})$ , then $v_{0}$ is an isolated clique. Hence, $c_{H}(X\cap V(H))\leqslant c_{H}(V(H))-1\leqslant\alpha\cdot\operatorname{\mathrm{OPT}}(H,c_{H})$ and so $(H,c_{H})$ is $\alpha$ -good in $G$ . We say that $(H,c_{H})$ (sometimes simply $H$ ) is centrally $\alpha$ -good (in $G$ ) with respect to $v_{0}$ . Moreover, we call $v_{0}$ the root vertex.

In order to illustrate these ideas, consider the following two examples (see Figure 1). First, let $H$ be a $C_{4}$ (that is, a $4$ -cycle) contained in $G$ and $\mathbf{1}_{H}$ denote the unit cost function on $V(H)$ . Then $(H,\mathbf{1}_{H})$ is strongly $2$ -good, since $\sum_{v\in V(H)}\mathbf{1}_{H}(v)=4=2\operatorname{\mathrm{OPT}}(H,\mathbf{1}_{H})$ . Second, let $H$ be a $P_{3}$ contained in $G$ , starting at a vertex $v_{0}$ that has degree- $1$ in $G$ . Then $(H,\mathbf{1}_{H})$ is centrally $2$ -good with respect to $v_{0}$ , but it is not strongly $2$ -good.

Figure 1.

(C_{4},\mathbf{1}_{C_{4}})

on the left is strongly

2

-good.

(P_{3},\mathbf{1}_{P_{3}})

, on the right, is centrally

2

-good in

G

with respect to the gray vertex, which has degree 1 in

G

Each time we find a $2$ -good weighted induced subgraph $H$ in $G$ , the local ratio technique allows us to recurse on an induced subgraph $G^{\prime}$ of $G$ in which at least one vertex of $H$ is deleted from $G$ . For example, the $2$ -good induced subgraphs mentioned above allow us to reduce to input graphs $G$ that are $C_{4}$ -free and have minimum degree at least $2$ .

The crux of our algorithm, Step 14, relies on the following structural result.

Theorem 2.

Let $G$ be a twin-free graph, let $v_{0}$ be any vertex of $G$ , and let $H$ be the subgraph of $G$ induced by $N_{\leqslant 2}[v_{0}]$ . There exists a cost function $c_{H}:V(H)\to\mathbb{Z}_{\geqslant 0}$ such that $(H,c_{H})$ is either strongly $2$ -good, or centrally $2$ -good in $G$ with respect to $v_{0}$ . Moreover, $c_{H}$ can be constructed in polynomial time.

We also study Cluster-VD from the polyhedral point of view. In particular we investigate how well linear programming (LP) relaxations can approximate the optimal value of Cluster-VD. As in [16, 13, 9], we use the following notion of LP relaxations which, by design, allows for extended formulations.

Fix a graph $G$ . Let $d\in\mathbb{Z}_{\geqslant 0}$ be an arbitrary dimension. A system of linear inequalities $Ax\geqslant b$ in $\mathbb{R}^{d}$ defines an LP relaxation of Cluster-VD on $G$ if the following hold: (i) For every hitting set $X\subseteq V(G)$ , we have a point $\pi^{X}\in\mathbb{R}^{d}$ satisfying $A\pi^{X}\geqslant b$ ; (ii) For every cost function $c:V(G)\to\mathbb{Q}_{\geqslant 0}$ , we have an affine function $f_{c}:\mathbb{R}^{d}\to\mathbb{R}$ ; (iii) For all hitting sets $X\subseteq V(G)$ and cost functions $c:V(G)\to\mathbb{Q}_{\geqslant 0}$ , the condition $f_{c}(\pi^{X})=c(X)$ holds.

The size of the LP relaxation $Ax\geqslant b$ is defined as the number of rows of $A$ . For every cost function $c$ , the quantity $\operatorname{\mathrm{LP}}(G,c):=\min\{f_{c}(x)\mid Ax\geqslant b\}$ gives a lower bound on $\operatorname{\mathrm{OPT}}(G,c)$ . The integrality gap of the LP relaxation $Ax\geqslant b$ is defined as $\sup\{\operatorname{\mathrm{OPT}}(G,c)/\operatorname{\mathrm{LP}}(G,c)\mid c\in\mathbb{Q}_{\geqslant 0}^{V(G)}\}$ .

Letting $\mathcal{P}_{3}(G)$ denote the collection of all vertex sets $\{u,v,w\}$ that induce a $P_{3}$ in $G$ , we define

P(G):=\{x\in[0,1]^{V(G)}\mid\forall\{u,v,w\}\in\mathcal{P}_{3}(G):x_{u}+x_{v}+x_{w}\geqslant 1\}.

We let $\operatorname{\mathsf{SA}}_{r}(G)$ denote the relaxation obtained from $P(G)$ by applying $r$ rounds of the Sherali-Adams hierarchy [33], a standard procedure to derive strengthened LP relaxations of binary linear programming problems. If a cost function $c:V(G)\to\mathbb{Q}_{\geqslant 0}$ is provided, we let

\operatorname{\mathsf{SA}}_{r}(G,c):=\min\{\sum_{v\in V(G)}c(v)x_{v}\mid x\in\operatorname{\mathsf{SA}}_{r}(G)\}

denote the optimum value of the corresponding linear programming relaxation.

It is not hard to see that the straightforward LP relaxation $P(G)$ has worst case integrality gap equal to $3$ (by worst case, we mean that we take the supremum over all graphs $G$ ). Indeed, for a random $n$ -vertex graph, $\operatorname{\mathrm{OPT}}(G,\mathbf{1}_{G})=n-O(\log^{2}n)$ with high probability. This can be easily proved via the probabilistic method. A similar proof can be found, for instance, in the introduction of [2]). On the other hand, $\operatorname{\mathrm{LP}}(G,\mathbf{1}_{G})\leqslant n/3$ , since the vector with all coordinates equal to $\frac{1}{3}$ is feasible for $P(G)$ .

On the positive side, we show how applying one round of the Sherali-Adams hierarchy gives a relaxation with integrality gap at most $5/2=2.5$ , see Theorem 3. To complement this, we prove that the worst case integrality gap of the relaxation is precisely $5/2$ , see Theorem 4. We then show that the integrality gap decreases to $2+\varepsilon$ after applying $\mathrm{poly}(1/\varepsilon)$ rounds, see Theorem 5.

On the negative side, applying known results on Vertex Cover [9], we show that no polynomial-size LP relaxation of Cluster-VD can have integrality gap at most $2-\varepsilon$ for some $\varepsilon>0$ . As is the case for similar lower bounds (see [31, 8]), this result is unconditional: it does not rely on P $\neq$ NP nor the Unique Games Conjecture.

1.2. Comparison to previous works

We now revisit all previous approximation algorithms for Cluster-VD [39, 18, 19]. The presentation given here slightly departs from [39, 18], and explains in a unified manner what is the bottleneck in each of the algorithms.

Fix $k\in\{3,4,5\}$ , and let $\alpha:=(2k-1)/(k-1)$ . Notice that $\alpha=5/2$ if $k=3$ , $\alpha=7/3$ if $k=4$ and $\alpha=9/4$ if $k=5$ . In [19, Lemma 3], it is shown that if a twin-free graph $G$ contains a $k$ -clique, then one can find an induced subgraph $H$ containing the $k$ -clique and a cost function $c_{H}$ such that $(H,c_{H})$ is strongly $\alpha$ -good.

Therefore, in order to derive an $\alpha$ -approximation for Cluster-VD, one may assume without loss of generality that the input graph $G$ is twin-free and has no $k$ -clique. Let $v_{0}$ be a maximum degree vertex in $G$ , and let $H$ denote the subgraph of $G$ induced by $N_{\leqslant 2}[v_{0}]$ . In [19], it is shown by a tedious case analysis that one can construct a cost function $c_{H}$ such that $(H,c_{H})$ is $2$ -good in $G$ , using the fact that $G$ has no $k$ -clique.

The simplest case occurs when $k=3$ . Then $N(v_{0})$ is a stable set. Letting $c_{H}(v_{0}):=d(v_{0})-1$ , $c_{H}(v):=1$ for $v\in N(v_{0})$ and $c_{H}(v):=0$ for the other vertices of $H$ , one easily sees that $(H,c_{H})$ is (centrally) $2$ -good in $G$ . For higher values of $k$ , one has to work harder.

In this paper, we show that one can always, and in polynomial time, construct a cost function $c_{H}$ on the vertices at distance at most $2$ from $v_{0}$ that makes $(H,c_{H})$ $2$ -good in $G$ , provided that $G$ is twin-free, see Theorem 2. This result was the main missing ingredient in previous approaches, and single-handedly closes the approximability status of Cluster-VD.

1.3. Other related works

Cluster-VD has also been widely studied from the perspective of fixed parameter tractability. Given a graph $G$ and parameter $k$ as input, the task is to decide if $G$ has a hitting set $X$ of size at most $k$ . A $2^{k}n^{\mathcal{O}(1)}$ -time algorithm for this problem was given by Hüffner, Komusiewicz, Moser, and Niedermeier [24]. This was subsequently improved to a $1.911^{k}n^{\mathcal{O}(1)}$ -time algorithm by Boral, Cygan, Kociumaka, and Pilipczuk [11], and a $1.811^{k}n^{\mathcal{O}(1)}$ -time algorithm by Tsur [37]. By the general framework of Fomin, Gaspers, Lokshtanov, and Saurabh [20], these parametrized algorithms can be transformed into exponential algorithms which compute the size of a minimum hitting set for $G$ exactly, the fastest of which runs in time $\mathcal{O}(1.488^{n})$ .

For polyhedral results, Hosseinian and Butenko [23] gives some facet-defining inequalities of the Cluster-VD polytope, as well as complete linear descriptions for special classes of graphs.

Another related problem is the feedback vertex set problem in tournaments (FVST). Given a tournament $T$ with costs on the vertices, the task is to find a minimum cost set of vertices $X$ such that $T-X$ does not contain a directed cycle.

For unit costs, note that Cluster-VD is equivalent to the problem of deleting as few elements as possible from a symmetric relation to obtain a transitive relation, while FVST is equivalent to the problem of deleting as few elements as possible from an antisymmetric and complete relation to obtain a transitive relation.

In a tournament, hitting all directed cycles is equivalent to hitting all directed triangles, so FVST is also a hitting set problem in a $3$ -uniform hypergraph. Moreover, FVST is also UGC-hard to approximate to a constant factor smaller than $2$ . Cai, Deng, and Zang [14] gave a $5/2$ -approximation algorithm for FVST, which was later improved to a $7/3$ -approximation algorithm by Mnich, Williams, and Végh [30]. Lokshtanov, Misra, Mukherjee, Panolan, Philip, and Saurabh [29] recently gave a randomized $2$ -approximation algorithm, but no deterministic (polynomial-time) $2$ -approximation algorithm is known. For FVST, one round of the Sherali-Adams hierarchy actually provides a $7/3$ -approximation [5].

Among other related covering and packing problems, Fomin, Le, Lokshtanov, Saurabh, Thomassé, and Zehavi [21] studied both Cluster-VD and FVST from the kernelization perspective. They proved that the unweighted versions of both problems admit subquadratic kernels: $\mathcal{O}(k^{\frac{5}{3}})$ for Cluster-VD and $\mathcal{O}(k^{\frac{3}{2}})$ for FVST.

1.4. Overview of the proof

We give a sketch of the proof of Theorem 2. Recall that $H=G[N_{\leqslant 2}[v_{0}]]$ . If the subgraph induced by $N(v_{0})$ contains a hole (that is, an induced cycle of length at least $4$ ), then $H$ is strongly $2$ -good by Lemma 2.1. If the subgraph induced by $N(v_{0})$ contains an induced $2P_{3}$ (that is, two disjoint copies of $P_{3}$ with no edges between each other), then $H$ is strongly $2$ -good by Lemma 2.1. This allows us to reduce to the case where the subgraph induced by $N(v_{0})$ is chordal and $2P_{3}$ -free.

Lemma 2.2 then gives a direct construction of a cost function $c_{H}$ which certifies that $(H,c_{H})$ is centrally $2$ -good, provided that the subgraph induced by $N[v_{0}]$ is twin-free. This is the crucial step of the proof. It serves as the base case of the induction. Here, we use a slick observation due to Lokshtanov [28]: since the subgraph induced by $N(v_{0})$ is chordal and $2P_{3}$ -free, it has a hitting set that is a clique. In a previous version, our proof of Theorem 2 was slightly more complicated.

We show inductively that we can reduce to the case where the subgraph induced by $N[v_{0}]$ is twin-free. The idea is to delete vertices from $H$ to obtain a smaller graph $H^{\prime}$ , while preserving certain properties, and then compute a suitable cost function $c_{H}$ for $H$ , given a suitable cost function $c_{H^{\prime}}$ for $H^{\prime}$ . We delete vertices at distance $2$ from $v_{0}$ . When this creates twins in $H$ , we delete one vertex from each pair of twins. At the end, we obtain a twin-free induced subgraph of $H[N[v_{0}]]$ , which corresponds to our base case.

We conclude the introduction with a brief description of the different sections of the paper. Section 2 is entirely devoted to the proof of Theorem 2. The proof of Theorem 1 is given in Section 3, together with a complexity analysis of Algorithm 1. Section 4 presents our polyhedral results. A conclusion is given in Section 5. There, we state a few open problems for future research.

Conference version. An abstract of this paper appeared in the proceedings of IPCO 2021 [6]. The current paper is an extended version with full proofs and detailed discussions. In particular, the running time analysis of our algorithm in Section 3 is new, and the polyhedral results of Section 4 appear without proof in the conference version.

2. Finding $2$ -good induced subgraphs

The goal of this section is to prove Theorem 2. Our proof is by induction on the number of vertices in $H:=G[N_{\leq 2}[v_{0}]]$ . First, we quickly show that we can assume that the subgraph induced by $N(v_{0})$ is chordal and $2P_{3}$ -free. Using this, we prove the theorem in the particular case where the subgraph induced by $N[v_{0}]$ is twin-free. Finally, we prove the theorem in the general case by showing how to deal with twins.

2.1. Restricting to chordal, $2P_{3}$ -free neighborhoods

A vertex of $G$ is apex if it is adjacent to all the other vertices of $G$ . A wheel is a graph obtained from a cycle by adding an apex vertex (called the center).

As pointed out earlier in the introduction, $4$ -cycles are strongly $2$ -good. This implies that the wheel on $5$ vertices is strongly $2$ -good (putting a zero cost on the center). We now show that all wheels on at least $5$ vertices are strongly $2$ -good. This allows our algorithm to restrict to input graphs such that the subgraph induced on each neighborhood is chordal. In a similar way, we show that we can further restrict such neighborhoods to be $2P_{3}$ -free.

Lemma \thelemma.

Let $H:=W_{k}$ be a wheel on $k\geqslant 5$ vertices and center $v_{0}$ , let $c_{H}(v_{0}):=k-5$ and $c_{H}(v):=1$ for $v\in V(H-v_{0})$ . Then $(H,c_{H})$ is strongly $2$ -good.

Proof.

Notice that $\operatorname{\mathrm{OPT}}(H,c_{H})\geqslant k-3$ since a hitting set either contains $v_{0}$ and at least $2$ more vertices, or does not contain $v_{0}$ but contains $k-3$ other vertices. Hence, $\sum_{v\in V(H)}c_{H}(v)=k-5+k-1=2(k-3)\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})$ . ∎

Lemma \thelemma.

Let $H$ be the graph obtained from $2P_{3}$ by adding an apex vertex $v_{0}$ . Let $c_{H}(v_{0}):=2$ and $c_{H}(v):=1$ for $v\in V(H-v_{0})$ . Then $(H,c_{H})$ is strongly $2$ -good.

Proof.

It is easy to check that $\operatorname{\mathrm{OPT}}(H,c_{H})\geqslant 4$ . Thus, $\sum_{v\in V(H)}c_{H}(v)=8\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})$ . ∎

2.2. When $H$ is twin-free

Throughout this section, we assume that $H$ is a twin-free graph with an apex vertex $v_{0}$ such that $H-v_{0}$ is chordal and $2P_{3}$ -free. Our goal is to construct a cost function $c_{H}$ that certifies that $H$ is centrally $2$ -good.

It turns out to be easier to define the cost function on $V(H-v_{0})=N(v_{0})$ first, and then adjust the cost of $v_{0}$ . This is the purpose of the next lemma. Below, $\omega(G,c)$ denotes the maximum weight of a clique in weighted graph $(G,c)$ .

Lemma \thelemma.

Let $H$ be a graph with an apex vertex $v_{0}$ and $H^{\prime}:=H-v_{0}$ . Let $c_{H^{\prime}}:V(H^{\prime})\rightarrow\mathbb{Z}_{\geqslant 1}$ be a cost function such that

(i)

$c_{H^{\prime}}(V(H^{\prime}))\geqslant 2\omega(H^{\prime},c_{H^{\prime}})$ and
(ii)

$\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})\geqslant\omega(H^{\prime},c_{H^{\prime}})-1$ .

Then we can extend $c_{H^{\prime}}$ to a function $c_{H}:V(H)\rightarrow\mathbb{Z}_{\geqslant 1}$ such that $c_{H}(V(H))\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})+1$ . In other words, $(H,c_{H})$ is centrally $2$ -good with respect to $v_{0}$ .

Proof.

Notice that

\operatorname{\mathrm{OPT}}(H,c_{H})=\min(c_{H}(v_{0})+\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}}),c_{H^{\prime}}(V(H^{\prime}))-\omega(H^{\prime},c_{H^{\prime}})),

since if $X$ is hitting set of $H$ that does not contain $v_{0}$ , then $H-X$ is a clique.

Let $a=\max(1,c_{H^{\prime}}(V(H^{\prime}))-2\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})-1)$ and $b=c_{H^{\prime}}(V(H^{\prime}))-2\omega(H^{\prime},c_{H^{\prime}})+1$ . Choose $c_{H}(v_{0})\in\mathbb{Z}_{\geqslant 1}$ such that $a\leqslant c_{H}(v_{0})\leqslant b$ . Note that $c_{H}(v_{0})$ exists since $a\leqslant b$ by conditions (i) and (ii).

Suppose $\operatorname{\mathrm{OPT}}(H,c_{H})=c_{H}(v_{0})+\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})$ . Since $a\leqslant c_{H}(v_{0})$ ,

c_{H}(V(H))\leqslant 2c_{H}(v_{0})+2\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})+1=2\operatorname{\mathrm{OPT}}(H,c_{H})+1.

Suppose $\operatorname{\mathrm{OPT}}(H,c_{H})=c_{H^{\prime}}(V(H^{\prime}))-\omega(H^{\prime},c_{H^{\prime}})$ . Since $c_{H}(v_{0})\leqslant b$ ,

c_{H}(V(H))\leqslant 2c_{H^{\prime}}(V(H^{\prime}))-2\omega(H^{\prime},c_{H^{\prime}})+1=2\operatorname{\mathrm{OPT}}(H,c_{H})+1.

In either case, $c_{H}(V(H))\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})+1$ , as required. ∎

We abuse notation and regard a clique $X$ of a graph as both a set of vertices and a subgraph. We call a hitting set $X$ of a graph $G$ a hitting clique if $X$ is also a clique.

Lemma \thelemma.

Every chordal, $2P_{3}$ -free graph contains a hitting clique.

Proof.

Let $G$ be a chordal, $2P_{3}$ -free graph. Since $G$ is chordal, $G$ admits a clique tree $T$ (see [10]). In $T$ , the vertices are the maximal cliques of $G$ and, for every two maximal cliques $K$ , $K^{\prime}$ , the intersection $K\cap K^{\prime}$ is contained in every clique of the $K$ – $K^{\prime}$ path in $T$ . For an edge $e:=KK^{\prime}$ of $T$ , Let $T_{1}$ and $T_{2}$ be the components of $T-e$ and $G_{1}$ and $G_{2}$ be the subgraphs of $G$ induced by the union of all the cliques in $T_{1}$ and $T_{2}$ , respectively. It is easy to see that deleting $K\cap K^{\prime}$ separates $G_{1}$ from $G_{2}$ in $G$ . Now, since $G$ is $2P_{3}$ -free, at least one of $G_{1}^{\prime}:=G_{1}-(K\cap K^{\prime})$ or $G_{2}^{\prime}:=G_{2}-(K\cap K^{\prime})$ is a cluster graph. If both $G_{1}^{\prime}$ and $G_{2}^{\prime}$ are cluster graphs, we are done since $K\cap K^{\prime}$ is the desired hitting clique. Otherwise, if $G_{i}^{\prime}$ is not a cluster graph, then we can orient $e$ towards $T_{i}$ . Applying this argument on each edge, we define an orientation of $T$ , which must have a sink $K_{0}$ . But then removing $K_{0}$ from $G$ leaves a cluster graph, and we are done. Since the clique tree of a chordal graph can be constructed in polynomial time [10], the hitting clique can be found in polynomial time. ∎

Figure 2. Here

H

is twin-free,

v_{0}

is the gray vertex and the blue vertices form a hitting clique

K_{0}

for

H-v_{0}

, which is chordal and

2P_{3}

-free. For

v\in K_{0}

, the set

S_{v}

defined as in the proof of Lemma 2.2 consists of the unique maximal independent set containing

v

. We obtain

c_{H}=(\mathbf{6},{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}{1}},{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}{1}},{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}{1}},{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}{1}},3,3,3)

, which is easily seen to be centrally 2-good with respect to

v_{0}

We are ready to prove the base case for Theorem 2. For a graph $H$ , we let $|H|$ denote the number of vertices of $H$ .

Lemma \thelemma.

Let $H$ be a twin-free graph with an apex vertex $v_{0}$ such that $H-v_{0}$ is chordal and $2P_{3}$ -free. There exists a cost function $c_{H}$ such that $(H,c_{H})$ is centrally $2$ -good with respect to $v_{0}$ . Moreover, $c_{H}$ can be found in time $\mathcal{O}(|H|^{3})$ .

Proof.

By Lemma 2.2, some maximal clique of $H-v_{0}$ , say $K_{0}$ , is a hitting set.

We claim that there is a family of stable sets $\mathcal{S}=\{S_{v}\mid v\in K_{0}\}$ of $H-v_{0}$ satisfying the following properties:

(P1)

every vertex of $H-v_{0}$ is contained in some $S_{v}$ ;
(P2)

for each $v\in K_{0}$ , $S_{v}$ contains $v$ and at least one other vertex;
(P3)

for every two distinct vertices $v,v^{\prime}\in K_{0}$ , $H[S_{v}\cup S_{v^{\prime}}]$ contains a $P_{3}$ .

Before proving the claim, we prove that it implies the lemma. Consider the cost function $c_{H^{\prime}}:=\sum_{v\in K_{0}}\chi^{S_{v}}$ on the vertices of $H^{\prime}:=H-v_{0}$ defined by giving to each vertex $u$ a cost equal to the number of stable sets $S_{v}$ that contain $u$ (see Figure 2). It suffices to show that $c_{H^{\prime}}$ satisfies the conditions of Lemma 2.2 and can therefore be extended to a cost function $c_{H}$ on $V(H)$ such that $(H,c_{H})$ is centrally $2$ -good with respect to $v_{0}$ .

First, by (P1), we have $c_{H^{\prime}}(u)\in\mathbb{Z}_{\geqslant 1}$ for all $u\in V(H^{\prime})$ . Second, condition (i) of Lemma 2.2 follows from (P2) since each stable set $S_{v}$ contributes at least two units to $c_{H^{\prime}}(V(H^{\prime}))$ and at most one unit to $\omega(H^{\prime},c_{H^{\prime}})$ . Third, (P3) implies that every hitting set of $H^{\prime}$ meets every stable set $S_{v}$ , except possibly one. Hence, $\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})\geqslant|K_{0}|-1$ . Also, every clique of $H^{\prime}$ meets every stable set $S_{v}$ in at most one vertex, implying that $\omega(H^{\prime},c_{H^{\prime}})\leqslant|K_{0}|$ , and equality holds since $c_{H^{\prime}}(K_{0})=|K_{0}|$ . Putting the last two observations together, we see that $\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})\geqslant|K_{0}|-1=\omega(H^{\prime},c_{H^{\prime}})-1$ and hence condition (ii) of Lemma 2.2 holds.

Now, we prove that our claim holds. Let $K_{1},\dots,K_{t}$ denote the clusters (maximal cliques) of cluster graph $H-v_{0}-K_{0}$ . For $i\in[t]$ , consider the submatrix $A_{i}$ of the adjacency matrix $A(H)$ with rows indexed by the vertices of $K_{0}$ and columns indexed by the vertices of $K_{i}$ .

Notice that $A_{i}$ contains neither $\begin{pmatrix}1&0\\ 0&1\end{pmatrix}$ nor $\begin{pmatrix}0&1\\ 1&0\end{pmatrix}$ as a submatrix, as this would give a $C_{4}$ contained in $H-v_{0}$ , contradicting the chordality of $H-v_{0}$ . Hence, after permuting its rows and columns if necessary, $A_{i}$ can be assumed to be staircase-shaped. That is, every row of $A_{i}$ is nonincreasing and every column nondecreasing. Notice also that $A_{i}$ does not have two equal columns, since these would correspond to two vertices of $K_{i}$ that are twins.

For each $K_{i}$ that is not complete to $v\in K_{0}$ , define $\varphi_{i}(v)$ as the vertex $u\in K_{i}$ whose corresponding column in $A_{i}$ is the first containing a $0$ in row $v$ . Now, for each $v$ , let $S_{v}$ be the set including $v$ , and $\varphi_{i}(v)$ , for each $K_{i}$ that is not complete to $v$ .

Because $K$ is maximal, no vertex $u\in K_{i}$ is complete to $K_{0}$ . Since no two columns of $A_{i}$ are identical, we must have $u=\varphi_{i}(v)$ for some $v\in K_{0}$ . This proves (P1).

Notice that $v\in S_{v}$ by construction and that $|S_{v}|\geqslant 2$ since otherwise, $v$ would be apex in $H$ and thus a twin of $v_{0}$ . Hence, (P2) holds.

Finally, consider any two distinct vertices $v,v^{\prime}\in K_{0}$ . Since $v,v^{\prime}$ are not twins, the edge $vv^{\prime}$ must be in a $P_{3}$ contained in $H-v_{0}$ . Assume, without loss of generality, that there is a vertex $u\in K_{i}$ adjacent to $v$ and not to $v^{\prime}$ for some $i\in[t]$ . Then $\{v,v^{\prime},\varphi_{i}(v^{\prime})\}$ induces a $P_{3}$ contained in $H[S_{v}\cup S_{v^{\prime}}]$ , proving (P3). This concludes the proof of the claim.

We observe that the collection $\mathcal{S}$ can be computed in $\mathcal{O}(|H|^{3})$ time. This yields the restriction $c_{H}$ to $H^{\prime}$ . Since $H^{\prime}$ is chordal, $\omega(H^{\prime},c_{H^{\prime}})$ can be computed in $\mathcal{O}(|H^{\prime}|^{2})$ time. We then just let $c_{H}(v_{0}):=c_{H^{\prime}}(V(H^{\prime}))-2\omega(H^{\prime},c_{H^{\prime}})+1=c_{H^{\prime}}(V(H^{\prime}))-2|K_{0}|+1$ . This sets $c_{H}(v_{0})$ equal to the value $b$ in the proof of Lemma 2.2. Therefore, $c_{H}$ can be constructed in $\mathcal{O}(|H|^{3})$ time. ∎

2.3. Handling twins in $G[N[v_{0}]]$

We now deal with the general case where $G[N[v_{0}]]$ contains twins. We start with an extra bit of terminology relative to twins. Let $G$ be a twin-free graph, and $v_{0}\in V(G)$ . Suppose that $u,u^{\prime}$ are twins in $G[N[v_{0}]]$ . Since $G$ is twin-free, there exists a vertex $v$ that is adjacent to exactly one of $u$ , $u^{\prime}$ in $G$ . We say that $v$ is a distinguisher for the edge $uu^{\prime}$ (or for the pair $\{u,u^{\prime}\}$ ). Notice that either $uu^{\prime}v$ or $u^{\prime}uv$ is an induced $P_{3}$ . Notice also that $v$ is at distance $2$ from $v_{0}$ .

Now, consider a graph $H$ with a special vertex $v_{0}\in V(H)$ (the root vertex) such that

(H1)

every vertex is at distance at most $2$ from $v_{0}$ , and
(H2)

every pair of vertices that are twins in $H[N[v_{0}]]$ has a distinguisher.

Let $v$ be any vertex that is at distance $2$ from $v_{0}$ . Consider the equivalence relation $\equiv$ on $N[v_{0}]$ with $u\equiv u^{\prime}$ whenever $u=u^{\prime}$ or $u,u^{\prime}$ are twins in $H-v$ . Observe that the equivalence classes of $\equiv$ are of size at most $2$ since, if $u,u^{\prime},u^{\prime\prime}$ are distinct vertices with $u\equiv u^{\prime}\equiv u^{\prime\prime}$ , then $v$ cannot distinguish every edge of the triangle on $u$ , $u^{\prime}$ and $u^{\prime\prime}$ . Hence, two of these vertices are twins in $H$ , which contradicts (H2).

From what precedes, the edges contained in $N[v_{0}]$ that do not have a distinguisher in $H-v$ form a matching $M:=\{u_{1}u^{\prime}_{1},\ldots,u_{k}u^{\prime}_{k}\}$ (possibly, $k=0$ ). Let $H^{\prime}$ denote the graph obtained from $H$ by deleting $v$ and exactly one endpoint from each edge of $M$ . Notice that the resulting subgraph is the same, up to isomorphism, no matter which endpoints are chosen.

The lemma below states how we can obtain a cost function $c_{H}$ that certifies that $H$ is centrally $2$ -good from a cost function $c_{H^{\prime}}$ that certifies that $H^{\prime}$ is centrally $2$ -good. It is inspired by [19, Lemma 3]. See Figure 3 for an example.

Lemma \thelemma.

Let $H$ be any graph satisfying (H1) and (H2) for some $v_{0}\in V(H)$ . Let $v\in N_{2}(v_{0})$ . Let $M:=\{u_{1}u^{\prime}_{1},\ldots,u_{k}u^{\prime}_{k}\}$ be the matching formed by the edges in $N[v_{0}]$ whose unique distinguisher is $v$ , where $u^{\prime}_{i}\neq v_{0}$ for all $i$ (we allow the case $k=0$ ). Let $H^{\prime}:=H-u^{\prime}_{1}-\dots-u^{\prime}_{k}-v$ . Given a cost function $c_{H^{\prime}}$ on $V(H^{\prime})$ , define a cost function $c_{H}$ on $V(H)$ by letting $c_{H}(u^{\prime}_{i}):=c_{H^{\prime}}(u_{i})$ for $i\in[k]$ , $c_{H}(v):=\sum_{i=1}^{k}c_{H^{\prime}}(u_{i})=\sum_{i=1}^{k}c_{H}(u^{\prime}_{i})$ , and $c_{H}(u):=c_{H^{\prime}}(u)$ otherwise. First, $H^{\prime}$ satisfies (H1) and (H2). Second, if $(H^{\prime},c_{H^{\prime}})$ is centrally $2$ -good, then $(H,c_{H})$ is centrally $2$ -good.

Proof.

For the first part, notice that $H^{\prime}$ satisfies (H1) by our choice of $v$ . Indeed, deleting $v$ does not change the distance of the remaining vertices from $v_{0}$ .

We now prove that $H^{\prime}$ satisfies (H2). First notice that in $H-v$ the twins in $N[v_{0}]$ are exactly $(u_{1},u^{\prime}_{1}),\ldots,(u_{k},u^{\prime}_{k})$ . Next, for each edge $e\in E(H[N[v_{0}]])$ , $e$ has at least one distinguisher different than $v$ unless $e\in M$ . Moreover, each $u_{i}^{\prime}$ is a distinguisher for $e$ if and only if $u_{i}$ is a distinguisher for $e$ . Thus, each edge of $H^{\prime}[N[v_{0}]]$ still has at least one distinguisher, which proves (H2).

For the second part, notice that $c_{H}(u)\geqslant 1$ for all $u\in N[v_{0}]$ since $c_{H^{\prime}}(u)\geqslant 1$ for all $u\in N[v_{0}]\setminus\{v,u^{\prime}_{1},\ldots,u^{\prime}_{k}\}$ . To argue that $c_{H}(V(H))\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})+1$ , one can check that any hitting set of $H$ must either contain $v$ or at least one endpoint of each edge $u_{i}u_{i}^{\prime}\in M$ . Hence $\operatorname{\mathrm{OPT}}(H,c_{H})\geqslant\sum_{i=1}^{k}c_{H^{\prime}}(u_{i})+\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})$ .

Since $(H,c_{H^{\prime}})$ is centrally $2$ -good, $c_{H^{\prime}}(V(H^{\prime}))\leqslant 2\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})+1$ . It follows that

	$\displaystyle c_{H}(V(H))$	$\displaystyle=\overbrace{c_{H}(v)+\sum_{i=1}^{k}c_{H}(u^{\prime}_{i})}^{=2\sum_{i=1}^{k}c_{H^{\prime}}(u_{i})}+\overbrace{c_{H^{\prime}}(V(H^{\prime}))}^{\leqslant 2\operatorname{\mathrm{OPT}}(H^{\prime},c_{H^{\prime}})+1}$
		$\displaystyle\leqslant 2\operatorname{\mathrm{OPT}}(H,c_{H})+1\,.$

Figure 3. Here

v_{0}

is the gray vertex and

v

is the red vertex.

H-v

violates (H2), and contains two pairs of twins, indicated by the red edges. Lemma 2.3 applies. We see that

H^{\prime}

is a

P_{3}

, for which Lemma 2.2 gives

c_{H^{\prime}}=\mathbf{1}_{H^{\prime}}

. In

(H,c_{H})

, all vertices get a unit cost except

v

, which gets a cost of

2

, since there are

2

pairs of twins in

H-v

. Thus,

c_{H}=(\mathbf{1},{\color[rgb]{1,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{1,0,0}{2}},1,1,1,1)

, where the entries corresponding to

v_{0}

and

v

are bold and red, respectively.

2.4. Proof of Theorem 2

We are ready to prove Theorem 2.

Proof.

We can decide in polynomial time (see for instance [35]) if $H[N(v_{0})]$ is chordal, and if not, output a hole of $H[N(v_{0})]$ . If the latter holds, we are done by Lemma 2.1. If the former holds, we can decide in polynomial time (see [34], and the proof of Lemma 2.2) whether $H$ contains a $2P_{3}$ . If it does, we are done by Lemma 2.1.

From now on, assume that the subgraph induced by $N(v_{0})$ is chordal and $2P_{3}$ -free. This is done without loss of generality. Notice that hypotheses (H1) and (H2) from Section 2.3 hold for $H$ . This is obvious for (H1). To see why (H2) holds, remember that $G$ is twin-free. Hence, every edge $uu^{\prime}$ contained in $N[v_{0}]$ must have a distinguisher in $G$ , which is in $N_{\leqslant 2}[v_{0}]$ . (In fact, notice that if $u$ and $u^{\prime}$ are twins in $H[N[v_{0}]]$ then the distinguisher is necessarily in $N_{2}(v_{0})$ .)

We repeatedly apply Lemma 2.3 in order to delete each vertex of $N_{2}(v_{0})$ one after the other and reduce to the case where $H$ is a twin-free graph for which $v_{0}$ is apex. We can then apply Lemma 2.2. The whole process takes polynomial time. ∎

3. Running-time Analysis

We now analyse the running-time of Algorithm 1. We assume that input graphs are given by their adjacency matrix. We need the following easy lemma, whose proof we include for completeness.

Lemma \thelemma.

Given a matrix $N\in\{0,1\}^{r\times c}$ , the set of all equivalence classes of equal rows of $N$ can be found in time $\mathcal{O}(rc)$ .

Proof.

Let $R_{0}$ and $R_{1}$ be the set of rows of $N$ whose first entry is $0$ and $1$ , respectively. We can determine $R_{0}$ and $R_{1}$ by reading the first column of $N$ , which takes time $O(r)$ . We then recurse on $N_{0}^{\prime}$ and $N_{1}^{\prime}$ , where $N_{i}^{\prime}$ is the submatrix of $N$ induced by $R_{i}$ and the last $c-1$ columns of $N$ . ∎

Before proving the next lemma we remark that, given a graph $H$ with $n$ vertices and $m$ edges, one can check whether $H$ is a cluster graph by checking that each of its components is a clique, which takes $\mathcal{O}(n^{2})$ time.

Lemma \thelemma.

Let $G$ be an $n$ -vertex, twin-free graph. In $\mathcal{O}(n^{3})$ time, we can find an induced subgraph $H$ of $G$ and a cost function $c_{H}$ on $V(H)$ such that $(H,c_{H})$ is $2$ -good.

Proof.

We fix any vertex $v_{0}\in V(G)$ , and let $H=G[N_{\leqslant 2}[v_{0}]]$ . We can check in $\mathcal{O}(n^{2})$ time whether $H[N(v_{0})]$ is chordal by using the algorithm from [35]. If $H[N(v_{0})]$ is not chordal this algorithm returns, as a certificate, a hole $C$ . By Lemma 2.1, $H[V(C)+v_{0}]$ is strongly 2-good and the corresponding function $c_{H}$ can be computed straightforwardly, hence we are done. Suppose now that $H[N(v_{0})]$ is chordal. We can construct the clique tree of $H[N(v_{0})]$ (see for instance [34]) in $\mathcal{O}(n^{2})$ time. Each edge of the tree induces a separation of $H[N(v_{0})]$ , and we can check if each side is a cluster graph in $\mathcal{O}(n^{2})$ time. If neither side is a cluster graph, then we have found a $2P_{3}$ in $H[N(v_{0})]$ . Hence, $H[V(2P_{3})+v_{0}]$ is strongly 2-good and the corresponding function $c_{H}$ can be computed straightforwardly. Since the clique tree has at most $|H|\leq n$ vertices, by orienting its edges as in the proof of Lemma 2.2 we find, in $\mathcal{O}(n^{3})$ time, a hitting clique. By applying Lemma 2.3 to get rid of twins, which can be done in $\mathcal{O}(n^{2})$ time, we obtain a subgraph satisfying the hypotheses of Lemma 2.2. Finally, by Lemma 2.2, the cost function in the statement of Lemma 2.2 can be constructed in $\mathcal{O}(n^{3})$ time. ∎

Lemma \thelemma.

Algorithm 1 runs in $\mathcal{O}(n^{4})$ -time.

Proof.

By Lemma 3, finding all twins in $G$ can be done in time $\mathcal{O}(n^{2})$ . Therefore, the most expensive recursive call of the algorithm is the construction of the $2$ -good weighted induced subgraph $(H,c_{H})$ from Lemma 3, which can be done in time $\mathcal{O}(n^{3})$ . Therefore, the running-time $T(n)$ of the algorithm satisfies $T(n)\leqslant T(n-1)+\mathcal{O}(n^{3})$ , which gives $T(n)=\mathcal{O}(n^{4})$ . ∎

of Theorem 1.

The proof is identical to [19, Proof of Theorem 1, pages 365–366], except that factor $9/4$ needs to be replaced everywhere by $2$ . One easily proves by induction that the vertex set $X$ output by the algorithm on input $(G,c)$ is a minimal hitting set with $c(X)\leqslant 2\operatorname{\mathrm{OPT}}(G,c)$ . We do not include more details here, and instead refer the reader to [19]. Theorem 2 guarantees that Algorithm 1 runs in polynomial time. ∎

4. Polyhedral results

In this section we study how well LP-relaxtions can approximate Cluster-VD, as already described in Section 1. We begin with a brief description of the Sherali-Adams hierarchy [33], which is a standard procedure to obtain strengthened LP relaxations for binary linear programs. For a more thorough introduction, we refer the reader to [27]. Throughout the section we closely follow the exposition given in [5], where the Sherali-Adams hierarchy and a very similar concept of diagonal (defined below) are used to approach the FVST problem, as mentioned in Section 1.3. In particular, our Lemma 4 is similar to another lemma in [5].

Let $P=\{x\in\mathbb{R}^{n}\mid Ax\geqslant b\}$ be a polytope contained in $[0,1]^{n}$ and $P_{I}:=\mathrm{conv}(P\cap\mathbb{Z}^{n})$ . For each $r\in\mathbb{N}$ , we define a polytope $P\supseteq\operatorname{\mathsf{SA}}_{1}(P)\supseteq\dots\supseteq\operatorname{\mathsf{SA}}_{r}(P)\supseteq P_{I}$ as follows. Let $N_{r}$ be the nonlinear system obtained from $P$ by multiplying each constraint by $\prod_{i\in I}x_{i}\prod_{j\in J}(1-x_{j})$ for all disjoint subsets $I,J$ of $[n]$ such that $1\leqslant|I|+|J|\leqslant r$ . Note that if $x_{i}\in\{0,1\}$ , then $x_{i}^{2}=x_{i}$ . Therefore, we can obtain a linear system $L_{r}$ from $N_{r}$ by setting $x_{i}^{2}:=x_{i}$ for all $i\in[n]$ and then $x_{I}:=\prod_{i\in I}x_{i}$ for all $I\subseteq[n]$ with $|I|\geqslant 2$ . We then let $\operatorname{\mathsf{SA}}_{r}(P)$ be the projection of $L_{r}$ onto the variables $x_{i}$ , $i\in[n]$ .

Let $\mathcal{P}_{3}(G)$ denote the collection of all vertex sets $\{u,v,w\}$ that induce a $P_{3}$ in $G$ and let $\operatorname{\mathsf{SA}}_{r}(G):=\operatorname{\mathsf{SA}}_{r}(P(G))$ , where

P(G):=\{x\in[0,1]^{V(G)}\mid\forall\{u,v,w\}\in\mathcal{P}_{3}(G):x_{u}+x_{v}+x_{w}\geqslant 1\}\,.

If a cost function $c:V(G)\to\mathbb{R}_{\geqslant 0}$ is provided, we let

\operatorname{\mathsf{SA}}_{r}(G,c):=\min\left\{\sum_{v\in V(G)}c(v)x_{v}\mid x\in\operatorname{\mathsf{SA}}_{r}(G)\right\}

denote the optimum value of the corresponding linear programming relaxation. For the sake of simplicity, we sometimes denote by $\operatorname{\mathsf{SA}}_{r}(G,c)$ the above linear program itself.

We say vertices $a$ and $b$ form a diagonal if there are vertices $u,v$ such that $\{u,v,a\}\in\mathcal{P}_{3}(G)$ and $\{u,v,b\}\in\mathcal{P}_{3}(G)$ . We say that a path contains a diagonal if any of its pairs of vertices are diagonals. Note that a diagonal pair in a path need not be an edge in the path.

Our first results concern $\operatorname{\mathsf{SA}}_{1}(G)$ . For later use, we list here the inequalities defining $\operatorname{\mathsf{SA}}_{1}(G)$ . For all $\{u,v,w\}\in\mathcal{P}_{3}(G)$ and $z\in V(G-u-v-w)$ , we have the inequalities

(1)	$\displaystyle x_{u}+x_{v}+x_{w}$	$\displaystyle\geqslant 1+x_{uv}+x_{vw}\,,$
(2)	$\displaystyle x_{uz}+x_{vz}+x_{wz}$	$\displaystyle\geqslant x_{z}\quad\text{and}$
(3)	$\displaystyle x_{u}+x_{v}+x_{w}+x_{z}$	$\displaystyle\geqslant 1+x_{uz}+x_{vz}+x_{wz}\,.$

In addition, there are the inequalities

(4)

1\geqslant x_{v}\geqslant x_{vu}\geqslant 0

for all distinct $u,v\in V(G)$ . The polytope $\operatorname{\mathsf{SA}}_{1}(G)$ is the set of all $(x_{v})$ such that there exists $(x_{uv})$ such that inequalities (1)–(4) are satisfied. Note that by definition, $x_{uv}$ and $x_{vu}$ are the same variable.

In order to establish the integrality gap of $\operatorname{\mathsf{SA}}_{1}(G)$ , we need two preliminary lemmas.

Lemma \thelemma.

Let $x\in\operatorname{\mathsf{SA}}_{1}(G)$ . If $G$ contains a $P_{3}$ which has a diagonal, then $x_{v}\geqslant 2/5$ for some vertex $v$ of $G$ .

Proof.

Assume by contradiction that $a$ , $b$ form a diagonal and all components of $x$ are less than 2/5. By the definition of diagonal, there exist $u,v\in V(G)$ with $\{u,v,a\},\{u,v,b\}\in\mathcal{P}_{3}(G)$ . In particular, from (1) we have $x_{a}+x_{u}+x_{v}\geqslant 1+x_{au}+x_{av}$ and from (2) $x_{ab}+x_{au}+x_{av}\geqslant x_{a}$ .

Adding these two inequalities, we obtain $x_{u}+x_{v}+x_{ab}\geqslant 1$ . We must have $x_{ab}\geqslant 1/5$ since otherwise $\max(x_{u},x_{v})\geqslant 2/5$ . Now let $c$ be the third vertex of the $P_{3}$ containing $a,b$ (notice that $c$ can be the middle vertex of the $P_{3}$ ). By (1) and (4), we have $x_{a}+x_{b}+x_{c}\geqslant 1+x_{ab}+x_{ac}\geqslant 6/5$ which means that $\max(x_{a},x_{b},x_{c})\geqslant 2/5$ , a contradiction which concludes the proof. ∎

Lemma \thelemma.

Let $G$ be a path or a cycle, and $c:V(G)\to\mathbb{R}_{\geqslant 0}$ . Then:

(i)

there is an efficient algorithm that solves Cluster-VD for $(G,c)$ ;
(ii)

the basic LP $\operatorname{\mathsf{SA}}_{0}(G)$ has integrality gap at most $2$ .

Proof.

First, let $G$ be a path. We notice that the coefficient matrix of the basic LP is totally unimodular, by the consecutive ones property [32]. Hence solving the LP yields an integral optimal solution, which corresponds to a hitting set $X$ of $G$ of cost equal to $\operatorname{\mathrm{OPT}}(G,c)=\operatorname{\mathsf{SA}}_{0}(G,c)$ . This proves (i), (ii) when $G$ is a path.

Now, let $G$ be a cycle, and let $v\in V(G)$ . Suppose that $v$ belongs to a minimum cost hitting set $X$ of $G$ . Then $X\setminus\{v\}$ is a minimum cost hitting set of $G-v$ , hence it can be found efficiently since $G-v$ is a path. By iterating over all $v\in V(G)$ and taking the hitting set of minimum cost, we efficiently solve Cluster-VD for $(G,c)$ . This concludes the proof of (i).

Finally, let $\tilde{x}$ be an optimal solution of $\operatorname{\mathsf{SA}}_{0}(G,c)$ . First, assume that there is some $v\in V(G)$ such that $\tilde{x}_{v}\geqslant 1/2$ . Since $G-v$ is a path, the optimal hitting set $X^{\prime}$ in $G-v$ has cost $c(X^{\prime})\leqslant\sum_{u\neq v}c(u)\tilde{x}_{u}$ . Hence, we see that $X:=X^{\prime}+v$ is a hitting set of $H$ with $c(X)=c(v)+c(X^{\prime})\leqslant c(v)+\sum_{u\neq v}c(u)\tilde{x}_{u}\leqslant 2\sum_{u}c(u)\tilde{x}_{u}=2\operatorname{\mathsf{SA}}_{0}(G,c)$ . On the other hand, if $\tilde{x}_{v}<1/2$ for all vertices $v\in V(G)$ , then the constraint $\tilde{x}_{u}+\tilde{x}_{v}+\tilde{x}_{w}\geqslant 1$ (where $u,w$ are the neighbors of $v$ ) implies that there can be no vertex $v$ with $\tilde{x}_{v}=0$ . So $0<\tilde{x}_{v}<1/2$ for all $v\in V(G)$ . Therefore, extreme point $\tilde{x}$ is the unique solution of $|G|$ equations of the form $x_{u}+x_{v}+x_{w}=1$ for $\{u,v,w\}\in\mathcal{P}_{3}(G)$ . Hence $\tilde{x}_{v}=1/3$ for all vertices. Thus $\operatorname{\mathsf{SA}}_{0}(G,c)=1/3\cdot c(V(G))$ . Now notice that since $G$ is a cycle we can partition its vertices into two disjoint hitting sets $X$ and $Y$ . Without loss of generality assume that $c(X)\leqslant 1/2\cdot c(V(G))$ . Then $c(X)\leqslant 3/2\cdot\operatorname{\mathsf{SA}}_{0}(G,c)$ . This concludes the proof of (ii). ∎

Theorem 3.

There is a polynomial-time algorithm that, given a graph $G$ and $c:V(G)\to\mathbb{R}_{\geqslant 0}$ , outputs a hitting set $X$ of $G$ such that $c(X)\leqslant 5/2\cdot\operatorname{\mathsf{SA}}_{1}(G,c)$ . In particular, the integrality gap of $\operatorname{\mathsf{SA}}_{1}(G)$ is at most $5/2$ .

Proof.

Let $\bar{x}$ be an optimal solution of $\operatorname{\mathsf{SA}}_{1}(G,c)$ . Let $U=\{v\in V(G):\bar{x}_{v}\geq 2/5\}$ , and $H=G\setminus U$ . Notice that the restriction of $\bar{x}$ to $V(H)$ is a feasible solution for $\operatorname{\mathsf{SA}}_{1}(H,c)$ whose components are all strictly less than $2/5$ . Hence, by Lemma 4, $H$ cannot contain a $P_{3}$ which has a diagonal. We will now show that, after possibly getting rid of twins, $H$ is a disjoint union of paths and cycles. Then, by applying Lemma 4, we will obtain a minimum cost hitting set of $H$ , which, together with $U$ , will form our desired hitting set.

First, if $H$ contains twins $u$ and $v$ , we can delete $v$ , and set $c^{\prime}(u):=c(u)+c(v)$ , $c^{\prime}(w):=c(w)$ for $w\in V(H)\setminus\{u,v\}$ to obtain a smaller weighted graph $(H^{\prime},c^{\prime})$ . We claim that $\operatorname{\mathsf{SA}}_{1}(H^{\prime},c^{\prime})\leqslant\operatorname{\mathsf{SA}}_{1}(H,c)$ . To see this, we show that one can turn any feasible solution $x$ of $\operatorname{\mathsf{SA}}_{1}(H,c)$ into a feasible solution of $\operatorname{\mathsf{SA}}_{1}(H^{\prime},c^{\prime})$ without increasing the cost. Let $x^{\prime}_{u}:=\min(x_{u},x_{v})$ and $x^{\prime}_{w}:=x_{w}$ for $w\neq u,v$ . It is easy to check that this defines a feasible solution $x^{\prime}$ to $\operatorname{\mathsf{SA}}_{1}(H^{\prime},c^{\prime})$ , whose cost is

\sum_{w\neq v}c^{\prime}(w)x^{\prime}_{w}=(c(u)+c(v))\min(x_{u},x_{v})+\sum_{w\neq u,v}c(w)x_{w}\leqslant\sum_{w}c(w)x_{w}\,.

This proves the claim. Moreover, a hitting set $X$ for $H$ can be immediately obtained from a hitting set $X^{\prime}$ of $H^{\prime}$ by adding $v$ if and only if $u\in X^{\prime}$ . Finally, notice that there is a feasible solution for $\operatorname{\mathsf{SA}}_{1}(H^{\prime},c^{\prime})$ obtained from the restriction of $\bar{x}$ whose components are all strictly less than $2/5$ . Hence, from now on we assume that $H$ is twin-free.

Now, we claim that $H$ is triangle-free and claw-free. Suppose that $H$ contains a triangle with vertices $u$ , $v$ and $w$ . Since $H$ is twin-free, every edge of the triangle has a distinguisher. Without loss of generality, $\mathcal{P}_{3}(H)$ contains $\{u,v,y\}$ , $\{u,w,y\}$ , $\{v,w,z\}$ and $\{u,v,z\}$ where $y,z$ are distinct vertices outside the triangle. It is easy to see that, for instance, edge $uw$ is a diagonal contained in a $P_{3}$ , a contradiction. A similar argument shows that $H$ cannot contain a claw. This proves the claim, and implies that $H$ has maximum degree at most 2. That is, $H$ is a disjoint union of paths and cycles. By part (i) of Lemma 4, (applied to each component of $H$ ), we obtain a minimum cost hitting set $X$ of $H$ which, thanks to (ii) of Lemma 4, satisfies $c(X)\leq 2\operatorname{\mathsf{SA}}_{0}(H,c)\leq\frac{5}{2}\operatorname{\mathsf{SA}}_{1}(H,c)$ .

Finally, consider $X\cup U$ . Clearly, it is a hitting set of $G$ . Moreover, one has

c(X\cup U)=c(X)+c(U)\leqslant\frac{5}{2}\left(\operatorname{\mathsf{SA}}_{1}(H,c)+\frac{2}{5}c(U)\right)\leqslant

\frac{5}{2}\left(\operatorname{\mathsf{SA}}_{1}(G,c)-\sum_{u\in U}c(u)\bar{x}_{u}+\frac{2}{5}c(U)\right)\leqslant\frac{5}{2}\cdot\operatorname{\mathsf{SA}}_{1}(G,c).

It follows that the integrality gap of $\operatorname{\mathsf{SA}}_{1}(G)$ is at most $5/2$ , concluding the proof. ∎

We now complement the result above by showing a lower bound on the integrality gap of $\operatorname{\mathsf{SA}}_{1}(G,c)$ .

Theorem 4.

For every $\varepsilon>0$ there is some instance $(G,c)$ of Cluster-VD such that $\operatorname{\mathrm{OPT}}(G,c)\geqslant(5/2-\varepsilon)\operatorname{\mathsf{SA}}_{1}(G,c)$ .

Proof.

We show there is a graph $G$ for which every hitting set $X$ has $c(X)\geqslant(5/2-\varepsilon)\operatorname{\mathsf{SA}}_{1}(G,c)$ for $c:=\mathbf{1}_{G}$ . Let $G$ be a graph whose girth is at least $k$ for some constant $k\geqslant 5$ and with the independence number $\alpha(G)\leqslant n/k$ where $n:=|G|$ . It can be shown via the probabilistic method that such a $G$ exists, see [2]. Set $c(v):=1$ for all $v\in V(G)$ . We have $c(X)\geqslant n(1-2/k)$ for every hitting set $X$ . To see this observe that since $G$ is triangle-free and $\alpha(G)\leqslant n/k$ , when we remove $X$ we will get at most $n/k$ components each of size at most 2. Thus there are at most $2n/k$ vertices in $G-X$ , so $|X|\geqslant n-2n/k$ . Therefore, $\operatorname{\mathrm{OPT}}(G,c)\geqslant(1-2/k)n$ .

In order to show $\operatorname{\mathsf{SA}}_{1}(G,c)\leqslant 2n/5$ , we construct the following feasible solution $x$ to $\operatorname{\mathsf{SA}}_{1}(G,c)$ . Set $x_{v}:=2/5$ for all $v\in V(G)$ and $x_{vw}:=0$ if $vw\in E(G)$ and $x_{vw}:=1/5$ if $vw\notin E(G)$ . The inequalities defining $\operatorname{\mathsf{SA}}_{1}(G)$ are all satisfied by $x$ . This is obvious for inequalities (1), (3) and (4). For inequality (2), notice that at most one of $uz$ , $vz$ , $wz$ can be an edge of $G$ , since otherwise $G$ would have a cycle of length at most $4$ . Thus (2) is satisfied too, $x\in\operatorname{\mathsf{SA}}_{1}(G)$ and $\operatorname{\mathsf{SA}}_{1}(G,c)\leqslant 2n/5$ .

This completes the proof since, by taking $k\geqslant 5/\varepsilon$ , we have $\operatorname{\mathrm{OPT}}(G,c)\geqslant n(1-2/k)\geqslant(5/2-\varepsilon)2n/5\geqslant(5/2-\varepsilon)\operatorname{\mathsf{SA}}_{1}(G,c)$ . ∎

We now show that the integrality gap decreases to $2+\varepsilon$ after applying $\mathrm{poly}(1/\varepsilon)$ rounds of Sherali-Adams. We first need the following lemma.

Lemma \thelemma.

Fix $\alpha\geqslant 1$ and $r\in\mathbb{Z}_{\geqslant 0}$ . Let $(G,c)$ be a minimum order weighted graph such that $\operatorname{\mathrm{OPT}}(G,c)>\alpha\cdot\operatorname{\mathsf{SA}}_{r}(G,c)$ . The following two assertions hold:

(i)

if $x$ is an optimal solution to $\operatorname{\mathsf{SA}}_{r}(G,c)$ , then $x_{v}<1/\alpha$ for all $v\in V(G)$ ;
(ii)

$G$ is connected and twin-free.

Proof.

(i) Suppose for a contradiction that $x_{v}\geqslant 1/\alpha$ , for some $v\in V(G)$ . Note that $x$ restricted to $V(G)\setminus\{v\}$ is a feasible solution to $\operatorname{\mathsf{SA}}_{r}(G-v,c)$ . Thus $\operatorname{\mathsf{SA}}_{r}(G-v,c)\leqslant\operatorname{\mathsf{SA}}_{r}(G,c)-c(v)x_{v}$ . By the minimality of $G$ , there is a hitting set $X^{\prime}$ of $G-v$ such that $c(X^{\prime})\leqslant\alpha\cdot\operatorname{\mathsf{SA}}_{r}(G-v,c)$ . Therefore $X:=X^{\prime}+v$ is a hitting set of $G$ with $c(X)=c(v)+c(X^{\prime})\leqslant c(v)+\alpha\cdot\operatorname{\mathsf{SA}}_{r}(G-v,c)\leqslant\alpha\cdot c(v)x_{v}+\alpha\cdot\operatorname{\mathsf{SA}}_{r}(G-v,c)\leqslant\alpha\cdot\operatorname{\mathsf{SA}}_{r}(G,c)$ , a contradiction.

(ii) Note that $G$ is connected, otherwise there exists a connected component $H$ of $G$ such that $\operatorname{\mathrm{OPT}}(H,c_{H})>\alpha\cdot\operatorname{\mathsf{SA}}_{r}(H,c_{H})$ , where $c_{H}$ is the restriction of $c$ to $V(H)$ , contradicting the minimality of $G$ . To show that $G$ is twin-free, we proceed exactly as in the proof of Theorem 3. ∎

Theorem 5.

For every fixed $\varepsilon>0$ , performing $r=\mathrm{poly}(1/\varepsilon)$ rounds of the Sherali-Adams hierarchy produces an LP relaxation of Cluster-VD whose integrality gap is at most $2+\varepsilon$ . That is, $\operatorname{\mathrm{OPT}}(G,c)\leqslant(2+\varepsilon)\operatorname{\mathsf{SA}}_{r}(G,c)$ for all weighted graphs $(G,c)$ .

Proof.

In order to simplify the notation below, let us assume that $2/\varepsilon$ is integer. For instance, we could restrict to $\varepsilon=2^{-l}$ for some $l\in\mathbb{Z}_{\geqslant 1}$ . This does not hurt the generality of the argument. We take $r:=1+(2/\varepsilon)^{4}$ . We may assume that $\varepsilon<1/2$ since otherwise we invoke Theorem 3 (taking $r=1$ suffices in this case).

Let $(G,c)$ be a counterexample to the theorem, with $|G|$ minimum. By Lemma 4.(i), for every optimal solution $x$ to $\operatorname{\mathsf{SA}}_{r}(G,c)$ , every vertex $v\in V(G)$ has $x_{v}<1/(2+\varepsilon)$ . By Lemma 4.(ii), $G$ is twin-free (and connected).

We will use the following fact several times in the proof: for all $R\subseteq V(G)$ with $|R|\leqslant r$ and every $x\in\operatorname{\mathsf{SA}}_{r}(G)$ , the restriction of $x$ to the variables in $R$ is a convex combination of hitting sets of $G[R]$ . This is easy to see since, denoting by $x_{R}$ the restriction of $x$ , we get $x_{R}\in\operatorname{\mathsf{SA}}_{r}(G[R])$ and the Sherali-Adams hierarchy is known to converge in at most “dimension-many” rounds, see for instance [17].

First, we claim that $G$ has no clique of size at least $2/\varepsilon$ . Suppose otherwise. Let $C$ be a clique of size $k:=2/\varepsilon$ and let $D$ be a minimal set such that each edge of $C$ has a distinguisher in $D$ . Let $H:=G[C\cup D]$ . Then, following the construction from Section 2.3, one can obtain a cost function $c_{H}$ such that $c_{H}(H)=2k-1$ , and $c_{H}(X)\geqslant k-1$ for any hitting set $X$ of $H$ . See [19, Lemma 3] for the full construction, whose proof also shows that $|H|\leqslant 2k-1\leqslant r$ . Since every valid inequality supported on at most $r$ vertices is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ , the inequality $\sum_{v}c_{H}(v)x_{v}\geqslant k-1$ is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . Since $c_{H}(H)=2k-1$ , this implies that for all $x\in\operatorname{\mathsf{SA}}_{r}(G)$ , there is some vertex $a\in V(H)$ with $x_{a}\geqslant(k-1)/(2k-1)$ . Since $(k-1)/(2k-1)\geqslant 1/(2+\varepsilon)$ , we get a contradiction. This proves our first claim.

Second, we claim that for every $v_{0}\in V(G)$ , the subgraph of $G$ induced by the neighborhood $N(v_{0})$ has no stable set of size at least $2/\varepsilon$ . The proof is similar to that for cliques given above, except that this time we let $H$ be the induced star $K_{1,k}$ with apex $v_{0}$ and $k=2/\varepsilon$ . The cost function $c_{H}$ given by Lemma 2.2 has $c_{H}(v_{0})=k-1$ and $c_{H}(v)=1$ for all $v\in S$ . Notice that once again $c_{H}(H)=2k-1$ . The star inequality $\sum_{v}c_{H}(v)x_{v}\geqslant k-1$ is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ , which guarantees that for every $x\in\operatorname{\mathsf{SA}}_{r}(G)$ there is some $a\in V(H)$ which has $x_{a}\geqslant(k-1)/(2k-1)\geqslant 1/(2+\varepsilon)$ . This establishes our second claim.

Third, we claim that the neighborhood of every vertex $v_{0}$ induces a chordal subgraph of $G$ . Suppose that $C$ is a hole in $G[N(v_{0})]$ . We first deal with the case $|C|\leqslant r-1=(2/\varepsilon)^{4}$ . We can repeat the same proof as above, letting $H$ be the induced wheel on $V(C)+v_{0}$ and using the cost function $c_{H}$ defined in the proof of Lemma 2.1. Consider the wheel inequality $\sum_{v}c_{H}(v)x_{v}\geqslant k-3$ , where $k:=|H|=|C|+1$ . Since the wheel has at most $r$ vertices, the wheel inequality is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . Since $c_{H}(H)=2k-6=2(k-3)$ , for every $x\in\operatorname{\mathsf{SA}}_{r}(G)$ , there is some $a\in V(H)$ which has $x_{a}\geqslant 1/2\geqslant 1/(2+\varepsilon)$ . This concludes the case where $|C|$ is “small”.

Now, assume that $|C|\geqslant r$ , and consider the wheel inequality with right-hand side scaled by $2/(2+\varepsilon)$ . Suppose this inequality is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . This still implies that some vertex $a$ of $H$ has $x_{a}\geqslant 1/(2+\varepsilon)$ , for all $x\in\operatorname{\mathsf{SA}}_{r}(G)$ , which produces the desired contradiction. It remains to prove that the scaled wheel inequality is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ .

Let $F$ denote any $r$ -vertex induced subgraph of $H$ that is a fan.²²2A fan is a graph obtained from a path by adding an apex vertex. Hence, $F$ contains $v_{0}$ as an apex vertex, plus a path on $r-1$ vertices. Letting $c_{F}(v_{0}):=r-3-\lfloor(r-1)/3\rfloor$ and $c_{F}(v):=1$ for $v\in V(F-v_{0})$ , we get the inequality $\sum_{v}c_{F}(v)x_{v}\geqslant r-3$ , which is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . By taking all possible choices for $F$ , and averaging the corresponding inequalities, we see that the inequality

		$\displaystyle\left(r-3-\left\lfloor\frac{r-1}{3}\right\rfloor\right)x_{v_{0}}+\frac{r-1}{k-1}\sum_{v\in V(H-v_{0})}x_{v}\geqslant r-3$
	$\displaystyle\iff$	$\displaystyle\frac{k-1}{r-1}\left(r-3-\left\lfloor\frac{r-1}{3}\right\rfloor\right)x_{v_{0}}+\sum_{v\in V(H-v_{0})}x_{v}\geqslant\frac{k-1}{r-1}(r-3)$

is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . It can be seen that this inequality dominates the scaled wheel inequality, in the sense that each left-hand side coefficient is not larger than the corresponding coefficient in the scaled wheel inequality, while the right-hand side is not smaller than the right-hand side of the scaled wheel inequality. Therefore, the scaled wheel inequality is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ . This concludes the proof of our third claim.

By the first, second and third claim³³3The first inequality follows since $|H|\leqslant\alpha(H)\cdot\omega(H)$ , for every perfect graph $H$ ., $|N(v_{0})|\leqslant\omega(G[N(v_{0})])\cdot\alpha(G[N(v_{0})])\leqslant 4/\varepsilon^{2}$ for all choices of $v_{0}$ . This implies in particular that $|N_{\leqslant 2}[v_{0}]|\leqslant 1+16/\varepsilon^{4}=r$ . Now let $H:=G[N_{\leqslant 2}[v_{0}]]$ . Theorem 2 applies since $G$ is twin-free, by our second claim. Let $c_{H}$ be the corresponding cost function. The inequality $\sum_{v}c_{H}(v)x_{v}\geqslant\operatorname{\mathrm{OPT}}(H,c_{H})$ is valid for $\operatorname{\mathsf{SA}}_{r}(G)$ .

Let $\lambda^{*}$ be defined as in Step 15 of Algorithm 1, and let $a\in V(G)$ denote any vertex such that $(c-\lambda^{*}c_{H})(a)=0$ . By minimality of $G$ , there exists in $(G^{\prime},c^{\prime}):=(G-a,c-\lambda^{*}c_{H})$ a minimal hitting set $X^{\prime}$ of cost $c^{\prime}(X^{\prime})\leqslant(2+\varepsilon)\operatorname{\mathsf{SA}}_{r}(G^{\prime},c^{\prime})$ . We let $X:=X^{\prime}$ in case $X^{\prime}$ is a hitting set of $G$ , and $X:=X^{\prime}+a$ otherwise. Assume that $X=X^{\prime}+a$ , the other case is easier. We have

	$\displaystyle c(X)$	$\displaystyle=c^{\prime}(X^{\prime})+\lambda^{*}c_{H}(X)$
		$\displaystyle\leqslant(2+\varepsilon)\operatorname{\mathsf{SA}}_{r}(G^{\prime},c^{\prime})+\lambda^{*}(c_{H}(H)-1)$
		$\displaystyle\leqslant(2+\varepsilon)\operatorname{\mathsf{SA}}_{r}(G^{\prime},c^{\prime})+2\lambda^{*}\operatorname{\mathrm{OPT}}(H,c_{H})$
		$\displaystyle\leqslant(2+\varepsilon)\left(\operatorname{\mathsf{SA}}_{r}(G^{\prime},c^{\prime})+\lambda^{*}\operatorname{\mathrm{OPT}}(H,c_{H})\right)\,.$

By LP duality, we have $\operatorname{\mathsf{SA}}_{r}(G,c)\geqslant\operatorname{\mathsf{SA}}_{r}(G^{\prime},c^{\prime})+\lambda^{*}\operatorname{\mathrm{OPT}}(H,c_{H})$ . This implies that $c(X)\leqslant(2+\varepsilon)\operatorname{\mathsf{SA}}_{r}(G,c)$ , contradicting the fact that $(G,c)$ is a counterexample. This concludes the proof. ∎

We now complement the result above by showing that every LP relaxation of Cluster-VD with (worst case) integrality gap at most $2-\varepsilon$ must have super-polynomial size. The result is a simple consequence of an analogous result of [9] on the integrality gap of Vertex Cover, and of the straightforward reduction from Vertex Cover to Cluster-VD.

Proposition \theprop.

For infinitely many values of $n$ , there is a graph $G$ on $n$ vertices such that every size- $n^{o(\log n/\log\log n)}$ LP relaxation of Cluster-VD on $G$ has integrality gap $2-o(1)$ .

Proof.

In [9] a similar result is proved for LP-relaxations of Vertex Cover: for infinitely many values of $n$ , there is a graph $G$ on $n$ vertices such that every size- $n^{o(\log n/\log\log n)}$ LP relaxation of Vertex Cover on $G$ has integrality gap at least $2-\varepsilon$ , where $\varepsilon=\varepsilon(n)=o(1)$ is a non-negative function.

Let $G$ be such a graph, and let $G^{+}$ be the graph obtained from $G$ by attaching a pendant edge to every vertex. It is easy to see that $U\subseteq V(G)$ is a hitting set for $G^{+}$ if and only if $U$ is a vertex cover of $G$ .

Toward a contradiction, suppose that $Ax\geqslant b$ is a size- $n^{o(\log n/\log\log n)}$ LP relaxation of Cluster-VD on $G^{+}$ with integrality gap at most $2-\delta$ , for a fixed $\delta>\varepsilon$ (where $x\in\mathbb{R}^{d}$ for some dimension $d$ depending on $G$ ). For every $c^{+}\in\mathbb{Q}^{V(G^{+})}_{\geqslant 0}$ there exists a hitting set $X$ of $G^{+}$ such that $c^{+}(X)\leqslant(2-\delta)\operatorname{\mathrm{LP}}(G^{+},c^{+})$ .

We can easily turn $Ax\geqslant b$ into an LP relaxation for Vertex Cover. For every vertex cover $U$ of $G$ , we let the corresponding point be the point $\pi^{U}\in\mathbb{R}^{d}$ for $U$ seen as a hitting set in $G^{+}$ . For every $c\in\mathbb{Q}_{\geqslant 0}^{V(G)}$ , we define $c^{+}\in\mathbb{Q}^{V(G^{+})}_{\geqslant 0}$ via $c^{+}(v):=c(v)$ for $v\in V(G)$ , and $c^{+}(v):=\sum_{u\in V(G)}c(u)$ for $v\in V(G^{+})\setminus V(G)$ . Then, we let the affine function $f_{c}$ for $c$ be the affine function $f_{c^{+}}$ for $c^{+}$ .

Since the integrality gap of $Ax\geqslant b$ , seen as an LP relaxation of Cluster-VD, is at most $2-\delta$ , for every $c\in\mathbb{Q}_{\geqslant 0}^{V(G)}$ there exists a hitting set $X$ of $G^{+}$ whose cost is at most $(2-\delta)\operatorname{\mathrm{LP}}(G^{+},c^{+})$ , where $c^{+}$ is the cost function corresponding to $c$ . If $X$ contains any vertex of $V(G^{+})\setminus V(G)$ , we can replace this vertex by its unique neighbor in $V(G)$ , without any increase in cost. In this way, we can find a vertex cover $U$ of $G$ whose cost satisfies $c(U)\leqslant c^{+}(X)\leqslant(2-\delta)\operatorname{\mathrm{LP}}(G^{+},c^{+})=(2-\delta)\operatorname{\mathrm{LP}}(G,c)$ . Hence, the integrality gap of $Ax\geqslant b$ as an LP relaxation of Vertex Cover is also at most $2-\delta<2-\varepsilon$ . As the size of $Ax\geqslant b$ is $n^{o(\log n/\log\log n)}$ , this provides the desired contradiction. ∎

We point out that the size bound in the previous result can be improved. Kothari, Meka and Raghavendra [26] have shown that for every $\varepsilon>0$ there is a constant $\delta=\delta(\varepsilon)>0$ such that no LP relaxation of size less than $2^{n^{\delta}}$ has integrality gap less than $2-\varepsilon$ for Max-CUT. Since Max-CUT acts as the source problem in [9], one gets a $2^{n^{\delta}}$ size lower bound for Vertex Cover in order to achieve integrality gap $2-\varepsilon$ . This also follows in a black-box manner from [26] and [12]. The proof of Proposition 4 shows that the same bound applies to Cluster-VD.

5. Conclusion

In this paper we provide a tight approximation algorithm for the cluster vertex deletion problem (Cluster-VD). Our main contribution is the efficient construction of a local cost function on the vertices at distance at most $2$ from any vertex $v_{0}$ such that every minimal hitting set of the input graph has local cost at most twice the local optimum. If the subgraph induced by $N(v_{0})$ (the first neighborhood of $v_{0}$ ) contains a hole, or a $2P_{3}$ , then this turns out to be straightforward. The most interesting case arises when the local subgraph $H$ is twin-free, has radius $1$ , and moreover $H[N(v_{0})]=H-v_{0}$ is chordal and $2P_{3}$ -free. Such graphs are very structured, which we crucially exploit.

Lemma 2.2 allows us to define the local cost function on the vertices distinct from $v_{0}$ and then later adjust the cost of $v_{0}$ . We point out that condition (ii) basically says that the local cost function should define a hyperplane that “almost” separates the hitting set polytope and the clique polytope of the chordal, $2P_{3}$ -free graph $H-v_{0}$ . This was a key intuition which led us to the proof of Theorem 2. If these polytopes were disjoint, this would be easy. But actually it is not the case since they have a common vertex (as we show, $H-v_{0}$ has a hitting clique).

One natural question arising from our approach of Cluster-VD in general graphs is the following: is the problem polynomial-time solvable on chordal graphs? This seems to be a non-trivial open question, also mentioned in [15], where similar vertex deletion problems are studied for chordal graphs. It could well be that Cluster-VD in general chordal graphs is hard. Now, what about chordal, $2P_{3}$ -free graphs? We propose this last question as our first open question.

Our second contribution are polyhedral results for the Cluster-VD problem, in particular with respect to the tightness of the Sherali-Adams hierarchy. Our results on Sherali-Adams fail to match the 2-approximation factor of our algorithm (by epsilon), and we suspect this is not by chance. We believe that, already for certain classes of triangle-free graphs, the LP relaxation given by a bounded number of rounds of the Sherali-Adams hierarchy has an integrality gap strictly larger than $2$ . Our intuition goes as follows. Consider the star inequality $(k-1)x_{v_{0}}+\sum_{i=1}^{k}x_{v_{i}}\geqslant k-1$ , valid when $N(v_{0})=\{v_{1},\dots,v_{k}\}$ is a stable set. Capturing all star inequalities is sufficient to achieve an integrality gap of at most 2 for all triangle-free graphs [19, Algorithm 1]. However, we suspect that Sherali-Adams will have a hard time recovering these in a constant number of rounds. The star inequality is very similar to the clique inequality $\sum_{i=1}^{k}x_{v_{i}}\geqslant k-1$ , which is valid for Vertex Cover when $\{v_{1},\dots,v_{k}\}$ is a clique. It is known that Sherali-Adams is unable to capture all clique inequalities in a constant number of rounds of the Vertex Cover relaxation (see [27, Section 6.1] for an equivalent statement on clique inequalities for the stable set polytope). Whether this intuition is accurate is our second open question.

As mentioned already in the introduction, we do not know any polynomial-size LP or SDP relaxation with integrality gap at most $2$ for Cluster-VD. In order to obtain such a relaxation, it suffices to derive each valid inequality implied by Lemmas 2.1, 2.1, 2.2 and also somehow simulate Lemma 2.3. Here, different techniques to construct extended formulations (see for instance [7, 3, 36]) could be used. A partial result in this direction is that the star inequality has a bounded-degree sum-of-squares proof. This implies that a bounded number of rounds of the Lasserre hierarchy provides an SDP relaxation for Cluster-VD with integrality gap at most $2$ , whenever the input graph is triangle-free. This should readily generalize to the wheel inequalities of Lemma 2.1 and of course to the inequality of Lemma 2.1 (since the underlying graph has bounded size). However, we do not know how for instance to derive the inequalities of Lemma 2.2. We leave this for future work as our third open question.

Our fourth open question: what is the best running time for Algorithm 1? We think that it is possible to improve on our $\mathcal{O}(n^{4})$ upper bound.

Another intriguing problem is to what extent our methods can be adapted to hitting set problems in other $3$ -uniform hypergraphs. We mention an open question due to László Végh [38]: for which classes of $3$ -uniform hypergraphs and which $\varepsilon>0$ does the hitting set problem admit a $(3-\varepsilon)$ -approximation algorithm?

As mentioned in the introduction, FVST (feedback vertex set in tournaments) is another hitting set problem in a $3$ -uniform hypergraph, which is also UGC-hard to approximate to a factor smaller than $2$ . There is a recent randomized $2$ -approximation algorithm [29], but no deterministic (polynomial-time) algorithm is known. Let us repeat here the relevant open question from [29]: does FVST admit a deterministic $2$ -approximation algorithm?

Acknowledgements

We are grateful to Daniel Lokshtanov for suggesting Lemma 2.2, which allowed us to simplify our algorithm and its proof. We also thank two anonymous referees for their helpful comments, which improved the presentation of the paper.

References

[1] R. Albert, H. Jeong, and A.-L. Barabási. Error and attack tolerance of complex networks. Nature, 406(6794):378–382, 2000.
[2] N. Alon and J. H. Spencer. The Probabilistic Method. Wiley Publishing, 4th edition, 2016.
[3] M. Aprile. Extended formulations for matroid polytopes through randomized protocols. arXiv preprint arXiv:2106.12453, 2021.
[4] M. Aprile, N. Castro, G. Ferreira, J. Piccini, F. Robledo, and P. Romero. Graph fragmentation problem: analysis and synthesis. International Transactions in Operational Research, 26(1):41–53, 2019.
[5] M. Aprile, M. Drescher, S. Fiorini, and T. Huynh. A simple 7/3-approximation algorithm for feedback vertex set in tournaments. arXiv preprint arXiv:2008.08779, 2020.
[6] M. Aprile, M. Drescher, S. Fiorini, and T. Huynh. A tight approximation algorithm for the cluster vertex deletion problem. In International Conference on Integer Programming and Combinatorial Optimization, pages 340–353. Springer, 2021.
[7] M. Aprile and Y. Faenza. Extended formulations from communication protocols in output-efficient time. Mathematical Programming, 183(1):41–59, 2020.
[8] M. Aprile, Y. Faenza, S. Fiorini, T. Huynh, and M. Macchia. Extension complexity of stable set polytopes of bipartite graphs. volume 10520, pages 75–87. Springer, Cham, 2017.
[9] A. Bazzi, S. Fiorini, S. Pokutta, and O. Svensson. No small linear program approximates vertex cover within a factor $2-\epsilon$ . Mathematics of Operations Research, 44(1):147–172, 2019.
[10] J. R. Blair and B. Peyton. An introduction to chordal graphs and clique trees. In Graph theory and sparse matrix computation, pages 1–29. Springer, 1993.
[11] A. Boral, M. Cygan, T. Kociumaka, and M. Pilipczuk. A fast branching algorithm for cluster vertex deletion. Theory Comput. Syst., 58(2):357–376, 2016.
[12] G. Braun, S. Pokutta, and A. Roy. Strong reductions for extended formulations. Math. Program., 172(1–2):591–620, 2018.
[13] G. Braun, S. Pokutta, and D. Zink. Inapproximability of combinatorial problems via small LPs and SDPs. In Proceedings of STOC 2015, pages 107–116, New York, NY, USA, 2015. ACM.
[14] M.-C. Cai, X. Deng, and W. Zang. An approximation algorithm for feedback vertex sets in tournaments. SIAM J. Comput., 30(6):1993–2007, 2001.
[15] Y. Cao, Y. Ke, Y. Otachi, and J. You. Vertex deletion problems on chordal graphs. Theoretical Computer Science, 745:75–86, 2018.
[16] S. O. Chan, J. R. Lee, P. Raghavendra, and D. Steurer. Approximate constraint satisfaction requires large lp relaxations. Journal of the ACM (JACM), 63(4):1–22, 2016.
[17] M. Conforti, G. Cornuéjols, G. Zambelli, et al. Integer programming, volume 271. Springer, 2014.
[18] S. Fiorini, G. Joret, and O. Schaudt. Improved approximation algorithms for hitting 3-vertex paths. In International Conference on Integer Programming and Combinatorial Optimization, pages 238–249. Springer, 2016.
[19] S. Fiorini, G. Joret, and O. Schaudt. Improved approximation algorithms for hitting 3-vertex paths. Math. Program., 182(1-2, Ser. A):355–367, 2020.
[20] F. V. Fomin, S. Gaspers, D. Lokshtanov, and S. Saurabh. Exact algorithms via monotone local search. J. ACM, 66(2):Art. 8, 23, 2019.
[21] F. V. Fomin, T. Le, D. Lokshtanov, S. Saurabh, S. Thomassé, and M. Zehavi. Subquadratic kernels for implicit 3-hitting set and 3-set packing problems. ACM Trans. Algorithms, 15(1):13:1–13:44, 2019.
[22] A. Freund, R. Bar-Yehuda, and K. Bendel. Local ratio: a unified framework for approximation algorithms. ACM Computing Surveys, 36:422–463, 2005.
[23] S. Hosseinian and S. Butenko. Polyhedral properties of the induced cluster subgraphs. Discrete Applied Mathematics, 297:80–96, 2021.
[24] F. Hüffner, C. Komusiewicz, H. Moser, and R. Niedermeier. Fixed-parameter algorithms for cluster vertex deletion. Theory Comput. Syst., 47(1):196–217, 2010.
[25] E. Jahanpour and X. Chen. Analysis of complex network performance and heuristic node removal strategies. Communications in Nonlinear Science and Numerical Simulation, 18(12):3458–3468, 2013.
[26] P. K. Kothari, R. Meka, and P. Raghavendra. Approximating rectangles by juntas and weakly-exponential lower bounds for LP relaxations of CSPs. In H. Hatami, P. McKenzie, and V. King, editors, Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, Montreal, QC, Canada, June 19-23, 2017, pages 590–603. ACM, 2017.
[27] M. Laurent. A comparison of the Sherali-Adams, Lovász-Schrijver, and Lasserre relaxations for 0-1 programming. Math. Oper. Res., 28(3):470–496, 2003.
[28] D. Lokshtanov. Personal communication.
[29] D. Lokshtanov, P. Misra, J. Mukherjee, F. Panolan, G. Philip, and S. Saurabh. $2$ -approximating feedback vertex set in tournaments. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1010–1018. SIAM, 2020.
[30] M. Mnich, V. V. Williams, and L. A. Végh. A 7/3-approximation for feedback vertex sets in tournaments. In P. Sankowski and C. D. Zaroliagis, editors, 24th Annual European Symposium on Algorithms, ESA 2016, August 22-24, 2016, Aarhus, Denmark, volume 57 of LIPIcs, pages 67:1–67:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2016.
[31] T. Rothvoß. The Lasserre hierarchy in approximation algorithms. Lecture Notes for the MAPSP, pages 1–25, 2013.
[32] A. Schrijver. Theory of Linear and Integer Programming. Wiley, 1998.
[33] H. D. Sherali and W. P. Adams. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Discrete Math., 3(3):411–430, 1990.
[34] R. E. Tarjan. Decomposition by clique separators. Discrete mathematics, 55(2):221–232, 1985.
[35] R. E. Tarjan and M. Yannakakis. Simple linear-time algorithms to test chordality of graphs, test acyclicity of hypergraphs, and selectively reduce acyclic hypergraphs. SIAM Journal on Computing, 13(3):566–579, 1984.
[36] H. R. Tiwary, M. Kouteckỳ, and P. Kolman. Extension complexity, mso logic, and treewidth. Discrete Mathematics & Theoretical Computer Science, 22, 2020.
[37] D. Tsur. Faster parameterized algorithm for cluster vertex deletion. CoRR, abs/1901.07609, 2019.
[38] L. A. Végh. Personal communication.
[39] J. You, J. Wang, and Y. Cao. Approximate association via dissociation. Discret. Appl. Math., 219:202–209, 2017.

A Tight Approximation Algorithm for the Cluster Vertex Deletion Problem

Abstract.

Key words and phrases:

1. Introduction

1.1. Our contribution

Theorem 1.

Theorem 2.

1.2. Comparison to previous works

1.3. Other related works

1.4. Overview of the proof

2. Finding 22-good induced subgraphs

2.1. Restricting to chordal, 2​P32P_{3}-free neighborhoods

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

2.2. When HH is twin-free

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

2.3. Handling twins in G​[N​[v0]]G[N[v_{0}]]

Lemma \thelemma.

Proof.

2.4. Proof of Theorem 2

Proof.

3. Running-time Analysis

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

of Theorem 1.

4. Polyhedral results

Lemma \thelemma.

Proof.

Lemma \thelemma.

Proof.

Theorem 3.

Proof.

Theorem 4.

Proof.

Lemma \thelemma.

Proof.

Theorem 5.

Proof.

Proposition \theprop.

Proof.

5. Conclusion

Acknowledgements

References

A Tight Approximation Algorithm for the
Cluster Vertex Deletion Problem

2. Finding $2$ -good induced subgraphs

2.1. Restricting to chordal, $2P_{3}$ -free neighborhoods

2.2. When $H$ is twin-free

2.3. Handling twins in $G[N[v_{0}]]$