proptheorem \aliascntresettheprop \newaliascntlemmatheorem \aliascntresetthelemma \newaliascntobservationtheorem \aliascntresettheobservation \newaliascntcorollarytheorem \aliascntresetthecorollary \newaliascntconjecturetheorem \aliascntresettheconjecture \newaliascntclaimtheorem \aliascntresettheclaim
A Tight Approximation Algorithm for the
Cluster Vertex Deletion Problem
Abstract.
We give the first -approximation algorithm for the cluster vertex deletion problem. This is tight, since approximating the problem within any constant factor smaller than is UGC-hard. Our algorithm combines the previous approaches, based on the local ratio technique and the management of true twins, with a novel construction of a “good” cost function on the vertices at distance at most from any vertex of the input graph.
As an additional contribution, we also study cluster vertex deletion from the polyhedral perspective, where we prove almost matching upper and lower bounds on how well linear programming relaxations can approximate the problem.
Key words and phrases:
Approximation algorithm and Cluster vertex deletion and Linear programming relaxation and Sherali-Adams hierarchy.1. Introduction
A cluster graph is a graph that is a disjoint union of complete graphs. Let be any graph. A set is called a hitting set if is a cluster graph. Given a graph and (vertex) cost function , the cluster vertex deletion problem (Cluster-VD) asks to find a hitting set whose cost is minimum. We denote by the minimum cost of a hitting set.
If and are two graphs, we say that contains if some induced subgraph of is isomorphic to . Otherwise, is said to be -free. Denoting by the path on vertices, we easily see that a graph is a cluster graph if and only if it is -free. Hence, is a hitting set if and only if contains a vertex from each induced .
Cluster-VD has applications in graph modeled data clustering in which an unknown set of samples may be contaminated. An optimal solution for Cluster-VD can recover a clustered data model, retaining as much of the original data as possible [24]. Vertex deletion problems such as Cluster-VD, where one seeks to locate vertices whose removal leaves a graph with desirable properties, often arise when measuring robustness and attack tolerance of real-life networks [1, 4, 25].
From what precedes, Cluster-VD is a hitting set problem in a -uniform hypergraph, and as such has a “textbook” -approximation algorithm (see for instance the introduction of [18]). Moreover, the problem has an approximation-preserving reduction from Vertex Cover. By adding a pendant edge to each vertex of , one checks that solving Cluster-VD on the new graph is equivalent to solving Vertex Cover on the original graph (see Proposition 4 for more details). Hence, obtaining a -approximation algorithm for some would contradict either the Unique Games Conjecture or P NP.
The first non-trivial approximation algorithm for Cluster-VD was a -approximation due to You, Wang and Cao [39]. Shortly afterward, Fiorini, Joret and Schaudt gave a -approximation [18], and subsequently a -approximation [19].
1.1. Our contribution
In this paper, we close the gap between and and prove the following tight result.
Theorem 1.
Cluster-VD has a -approximation algorithm.
All previous approximation algorithms for Cluster-VD are based on the local ratio technique. See the survey of Bar-Yehuda, Bendel, Freund, and Rawitz [22] for background on this standard algorithmic technique. Our algorithm is no exception, see Algorithm 1 below. However, it significantly differs from previous algorithms in its crucial step (see Step 14 below). In fact, almost all our efforts in this paper focus on that particular step of the algorithm (see Theorem 2), which searches for a special type of induced subgraph of , which we now describe.
Let be an induced subgraph of , and let . The weighted graph is said to be -good in (for some factor ) if is not identically and holds for every (inclusionwise) minimal hitting set of . We overload terminology and say that an induced subgraph is -good in if there exists a cost function such that is -good in . We stress that the local cost function is defined obliviously of the global cost function .
A pair of vertices of are called twins111We warn the reader that, in other papers, twins are usually called true twins, whereas two vertices which have the same set of neighbours are called false twins. Since we have no need of false twins in this paper, we have chosen to use twins in place of true twins. if , and for all , if and only if . We say that is twin-free if has no twins. As in [18, 19], if has a pair of twins , then Cluster-VD admits an easy reduction step (see Steps 8–12). The idea is simply to add the cost of to that of and delete . This works since belongs to a minimal hitting set of if and only if does (see [18] for a complete proof). Therefore, when searching for -good induced subgraphs, we may assume that is twin-free, which is crucial for our proofs.
We will use two methods to establish -goodness of induced subgraphs. We say that is strongly -good if is not identically and . Clearly, if is strongly -good then is -good in , for every graph which contains . We say that itself is strongly -good if is strongly -good for some cost function .
Let (resp. ) be the set of vertices at distance at most (resp. equal to) from vertex . We abbreviate and . If we cannot find a strongly -good induced subgraph in , we will find an induced subgraph that has a special vertex such that is entirely contained in , and a cost function such that for all vertices and . Notice that no minimal hitting set can contain all the vertices of , since if contains , then is an isolated clique. Hence, and so is -good in . We say that (sometimes simply ) is centrally -good (in ) with respect to . Moreover, we call the root vertex.
In order to illustrate these ideas, consider the following two examples (see Figure 1). First, let be a (that is, a -cycle) contained in and denote the unit cost function on . Then is strongly -good, since . Second, let be a contained in , starting at a vertex that has degree- in . Then is centrally -good with respect to , but it is not strongly -good.
Each time we find a -good weighted induced subgraph in , the local ratio technique allows us to recurse on an induced subgraph of in which at least one vertex of is deleted from . For example, the -good induced subgraphs mentioned above allow us to reduce to input graphs that are -free and have minimum degree at least .
The crux of our algorithm, Step 14, relies on the following structural result.
Theorem 2.
Let be a twin-free graph, let be any vertex of , and let be the subgraph of induced by . There exists a cost function such that is either strongly -good, or centrally -good in with respect to . Moreover, can be constructed in polynomial time.
We also study Cluster-VD from the polyhedral point of view. In particular we investigate how well linear programming (LP) relaxations can approximate the optimal value of Cluster-VD. As in [16, 13, 9], we use the following notion of LP relaxations which, by design, allows for extended formulations.
Fix a graph . Let be an arbitrary dimension. A system of linear inequalities in defines an LP relaxation of Cluster-VD on if the following hold: (i) For every hitting set , we have a point satisfying ; (ii) For every cost function , we have an affine function ; (iii) For all hitting sets and cost functions , the condition holds.
The size of the LP relaxation is defined as the number of rows of . For every cost function , the quantity gives a lower bound on . The integrality gap of the LP relaxation is defined as .
Letting denote the collection of all vertex sets that induce a in , we define
We let denote the relaxation obtained from by applying rounds of the Sherali-Adams hierarchy [33], a standard procedure to derive strengthened LP relaxations of binary linear programming problems. If a cost function is provided, we let
denote the optimum value of the corresponding linear programming relaxation.
It is not hard to see that the straightforward LP relaxation has worst case integrality gap equal to (by worst case, we mean that we take the supremum over all graphs ). Indeed, for a random -vertex graph, with high probability. This can be easily proved via the probabilistic method. A similar proof can be found, for instance, in the introduction of [2]). On the other hand, , since the vector with all coordinates equal to is feasible for .
On the positive side, we show how applying one round of the Sherali-Adams hierarchy gives a relaxation with integrality gap at most , see Theorem 3. To complement this, we prove that the worst case integrality gap of the relaxation is precisely , see Theorem 4. We then show that the integrality gap decreases to after applying rounds, see Theorem 5.
On the negative side, applying known results on Vertex Cover [9], we show that no polynomial-size LP relaxation of Cluster-VD can have integrality gap at most for some . As is the case for similar lower bounds (see [31, 8]), this result is unconditional: it does not rely on P NP nor the Unique Games Conjecture.
1.2. Comparison to previous works
We now revisit all previous approximation algorithms for Cluster-VD [39, 18, 19]. The presentation given here slightly departs from [39, 18], and explains in a unified manner what is the bottleneck in each of the algorithms.
Fix , and let . Notice that if , if and if . In [19, Lemma 3], it is shown that if a twin-free graph contains a -clique, then one can find an induced subgraph containing the -clique and a cost function such that is strongly -good.
Therefore, in order to derive an -approximation for Cluster-VD, one may assume without loss of generality that the input graph is twin-free and has no -clique. Let be a maximum degree vertex in , and let denote the subgraph of induced by . In [19], it is shown by a tedious case analysis that one can construct a cost function such that is -good in , using the fact that has no -clique.
The simplest case occurs when . Then is a stable set. Letting , for and for the other vertices of , one easily sees that is (centrally) -good in . For higher values of , one has to work harder.
In this paper, we show that one can always, and in polynomial time, construct a cost function on the vertices at distance at most from that makes -good in , provided that is twin-free, see Theorem 2. This result was the main missing ingredient in previous approaches, and single-handedly closes the approximability status of Cluster-VD.
1.3. Other related works
Cluster-VD has also been widely studied from the perspective of fixed parameter tractability. Given a graph and parameter as input, the task is to decide if has a hitting set of size at most . A -time algorithm for this problem was given by Hüffner, Komusiewicz, Moser, and Niedermeier [24]. This was subsequently improved to a -time algorithm by Boral, Cygan, Kociumaka, and Pilipczuk [11], and a -time algorithm by Tsur [37]. By the general framework of Fomin, Gaspers, Lokshtanov, and Saurabh [20], these parametrized algorithms can be transformed into exponential algorithms which compute the size of a minimum hitting set for exactly, the fastest of which runs in time .
For polyhedral results, Hosseinian and Butenko [23] gives some facet-defining inequalities of the Cluster-VD polytope, as well as complete linear descriptions for special classes of graphs.
Another related problem is the feedback vertex set problem in tournaments (FVST). Given a tournament with costs on the vertices, the task is to find a minimum cost set of vertices such that does not contain a directed cycle.
For unit costs, note that Cluster-VD is equivalent to the problem of deleting as few elements as possible from a symmetric relation to obtain a transitive relation, while FVST is equivalent to the problem of deleting as few elements as possible from an antisymmetric and complete relation to obtain a transitive relation.
In a tournament, hitting all directed cycles is equivalent to hitting all directed triangles, so FVST is also a hitting set problem in a -uniform hypergraph. Moreover, FVST is also UGC-hard to approximate to a constant factor smaller than . Cai, Deng, and Zang [14] gave a -approximation algorithm for FVST, which was later improved to a -approximation algorithm by Mnich, Williams, and Végh [30]. Lokshtanov, Misra, Mukherjee, Panolan, Philip, and Saurabh [29] recently gave a randomized -approximation algorithm, but no deterministic (polynomial-time) -approximation algorithm is known. For FVST, one round of the Sherali-Adams hierarchy actually provides a -approximation [5].
Among other related covering and packing problems, Fomin, Le, Lokshtanov, Saurabh, Thomassé, and Zehavi [21] studied both Cluster-VD and FVST from the kernelization perspective. They proved that the unweighted versions of both problems admit subquadratic kernels: for Cluster-VD and for FVST.
1.4. Overview of the proof
We give a sketch of the proof of Theorem 2. Recall that . If the subgraph induced by contains a hole (that is, an induced cycle of length at least ), then is strongly -good by Lemma 2.1. If the subgraph induced by contains an induced (that is, two disjoint copies of with no edges between each other), then is strongly -good by Lemma 2.1. This allows us to reduce to the case where the subgraph induced by is chordal and -free.
Lemma 2.2 then gives a direct construction of a cost function which certifies that is centrally -good, provided that the subgraph induced by is twin-free. This is the crucial step of the proof. It serves as the base case of the induction. Here, we use a slick observation due to Lokshtanov [28]: since the subgraph induced by is chordal and -free, it has a hitting set that is a clique. In a previous version, our proof of Theorem 2 was slightly more complicated.
We show inductively that we can reduce to the case where the subgraph induced by is twin-free. The idea is to delete vertices from to obtain a smaller graph , while preserving certain properties, and then compute a suitable cost function for , given a suitable cost function for . We delete vertices at distance from . When this creates twins in , we delete one vertex from each pair of twins. At the end, we obtain a twin-free induced subgraph of , which corresponds to our base case.
We conclude the introduction with a brief description of the different sections of the paper. Section 2 is entirely devoted to the proof of Theorem 2. The proof of Theorem 1 is given in Section 3, together with a complexity analysis of Algorithm 1. Section 4 presents our polyhedral results. A conclusion is given in Section 5. There, we state a few open problems for future research.
Conference version. An abstract of this paper appeared in the proceedings of IPCO 2021 [6]. The current paper is an extended version with full proofs and detailed discussions. In particular, the running time analysis of our algorithm in Section 3 is new, and the polyhedral results of Section 4 appear without proof in the conference version.
2. Finding -good induced subgraphs
The goal of this section is to prove Theorem 2. Our proof is by induction on the number of vertices in . First, we quickly show that we can assume that the subgraph induced by is chordal and -free. Using this, we prove the theorem in the particular case where the subgraph induced by is twin-free. Finally, we prove the theorem in the general case by showing how to deal with twins.
2.1. Restricting to chordal, -free neighborhoods
A vertex of is apex if it is adjacent to all the other vertices of . A wheel is a graph obtained from a cycle by adding an apex vertex (called the center).
As pointed out earlier in the introduction, -cycles are strongly -good. This implies that the wheel on vertices is strongly -good (putting a zero cost on the center). We now show that all wheels on at least vertices are strongly -good. This allows our algorithm to restrict to input graphs such that the subgraph induced on each neighborhood is chordal. In a similar way, we show that we can further restrict such neighborhoods to be -free.
Lemma \thelemma.
Let be a wheel on vertices and center , let and for . Then is strongly -good.
Proof.
Notice that since a hitting set either contains and at least more vertices, or does not contain but contains other vertices. Hence, . ∎
Lemma \thelemma.
Let be the graph obtained from by adding an apex vertex . Let and for . Then is strongly -good.
Proof.
It is easy to check that . Thus, . ∎
2.2. When is twin-free
Throughout this section, we assume that is a twin-free graph with an apex vertex such that is chordal and -free. Our goal is to construct a cost function that certifies that is centrally -good.
It turns out to be easier to define the cost function on first, and then adjust the cost of . This is the purpose of the next lemma. Below, denotes the maximum weight of a clique in weighted graph .
Lemma \thelemma.
Let be a graph with an apex vertex and . Let be a cost function such that
-
(i)
and
-
(ii)
.
Then we can extend to a function such that . In other words, is centrally -good with respect to .
Proof.
Notice that
since if is hitting set of that does not contain , then is a clique.
Suppose . Since ,
Suppose . Since ,
In either case, , as required. ∎
We abuse notation and regard a clique of a graph as both a set of vertices and a subgraph. We call a hitting set of a graph a hitting clique if is also a clique.
Lemma \thelemma.
Every chordal, -free graph contains a hitting clique.
Proof.
Let be a chordal, -free graph. Since is chordal, admits a clique tree (see [10]). In , the vertices are the maximal cliques of and, for every two maximal cliques , , the intersection is contained in every clique of the – path in . For an edge of , Let and be the components of and and be the subgraphs of induced by the union of all the cliques in and , respectively. It is easy to see that deleting separates from in . Now, since is -free, at least one of or is a cluster graph. If both and are cluster graphs, we are done since is the desired hitting clique. Otherwise, if is not a cluster graph, then we can orient towards . Applying this argument on each edge, we define an orientation of , which must have a sink . But then removing from leaves a cluster graph, and we are done. Since the clique tree of a chordal graph can be constructed in polynomial time [10], the hitting clique can be found in polynomial time. ∎
We are ready to prove the base case for Theorem 2. For a graph , we let denote the number of vertices of .
Lemma \thelemma.
Let be a twin-free graph with an apex vertex such that is chordal and -free. There exists a cost function such that is centrally -good with respect to . Moreover, can be found in time .
Proof.
By Lemma 2.2, some maximal clique of , say , is a hitting set.
We claim that there is a family of stable sets of satisfying the following properties:
-
(P1)
every vertex of is contained in some ;
-
(P2)
for each , contains and at least one other vertex;
-
(P3)
for every two distinct vertices , contains a .
Before proving the claim, we prove that it implies the lemma. Consider the cost function on the vertices of defined by giving to each vertex a cost equal to the number of stable sets that contain (see Figure 2). It suffices to show that satisfies the conditions of Lemma 2.2 and can therefore be extended to a cost function on such that is centrally -good with respect to .
First, by (P1), we have for all . Second, condition (i) of Lemma 2.2 follows from (P2) since each stable set contributes at least two units to and at most one unit to . Third, (P3) implies that every hitting set of meets every stable set , except possibly one. Hence, . Also, every clique of meets every stable set in at most one vertex, implying that , and equality holds since . Putting the last two observations together, we see that and hence condition (ii) of Lemma 2.2 holds.
Now, we prove that our claim holds. Let denote the clusters (maximal cliques) of cluster graph . For , consider the submatrix of the adjacency matrix with rows indexed by the vertices of and columns indexed by the vertices of .
Notice that contains neither nor as a submatrix, as this would give a contained in , contradicting the chordality of . Hence, after permuting its rows and columns if necessary, can be assumed to be staircase-shaped. That is, every row of is nonincreasing and every column nondecreasing. Notice also that does not have two equal columns, since these would correspond to two vertices of that are twins.
For each that is not complete to , define as the vertex whose corresponding column in is the first containing a in row . Now, for each , let be the set including , and , for each that is not complete to .
Because is maximal, no vertex is complete to . Since no two columns of are identical, we must have for some . This proves (P1).
Notice that by construction and that since otherwise, would be apex in and thus a twin of . Hence, (P2) holds.
Finally, consider any two distinct vertices . Since are not twins, the edge must be in a contained in . Assume, without loss of generality, that there is a vertex adjacent to and not to for some . Then induces a contained in , proving (P3). This concludes the proof of the claim.
We observe that the collection can be computed in time. This yields the restriction to . Since is chordal, can be computed in time. We then just let . This sets equal to the value in the proof of Lemma 2.2. Therefore, can be constructed in time. ∎
2.3. Handling twins in
We now deal with the general case where contains twins. We start with an extra bit of terminology relative to twins. Let be a twin-free graph, and . Suppose that are twins in . Since is twin-free, there exists a vertex that is adjacent to exactly one of , in . We say that is a distinguisher for the edge (or for the pair ). Notice that either or is an induced . Notice also that is at distance from .
Now, consider a graph with a special vertex (the root vertex) such that
-
(H1)
every vertex is at distance at most from , and
-
(H2)
every pair of vertices that are twins in has a distinguisher.
Let be any vertex that is at distance from . Consider the equivalence relation on with whenever or are twins in . Observe that the equivalence classes of are of size at most since, if are distinct vertices with , then cannot distinguish every edge of the triangle on , and . Hence, two of these vertices are twins in , which contradicts (H2).
From what precedes, the edges contained in that do not have a distinguisher in form a matching (possibly, ). Let denote the graph obtained from by deleting and exactly one endpoint from each edge of . Notice that the resulting subgraph is the same, up to isomorphism, no matter which endpoints are chosen.
The lemma below states how we can obtain a cost function that certifies that is centrally -good from a cost function that certifies that is centrally -good. It is inspired by [19, Lemma 3]. See Figure 3 for an example.
Lemma \thelemma.
Let be any graph satisfying (H1) and (H2) for some . Let . Let be the matching formed by the edges in whose unique distinguisher is , where for all (we allow the case ). Let . Given a cost function on , define a cost function on by letting for , , and otherwise. First, satisfies (H1) and (H2). Second, if is centrally -good, then is centrally -good.
Proof.
For the first part, notice that satisfies (H1) by our choice of . Indeed, deleting does not change the distance of the remaining vertices from .
We now prove that satisfies (H2). First notice that in the twins in are exactly . Next, for each edge , has at least one distinguisher different than unless . Moreover, each is a distinguisher for if and only if is a distinguisher for . Thus, each edge of still has at least one distinguisher, which proves (H2).
For the second part, notice that for all since for all . To argue that , one can check that any hitting set of must either contain or at least one endpoint of each edge . Hence .
Since is centrally -good, . It follows that
2.4. Proof of Theorem 2
We are ready to prove Theorem 2.
Proof.
We can decide in polynomial time (see for instance [35]) if is chordal, and if not, output a hole of . If the latter holds, we are done by Lemma 2.1. If the former holds, we can decide in polynomial time (see [34], and the proof of Lemma 2.2) whether contains a . If it does, we are done by Lemma 2.1.
From now on, assume that the subgraph induced by is chordal and -free. This is done without loss of generality. Notice that hypotheses (H1) and (H2) from Section 2.3 hold for . This is obvious for (H1). To see why (H2) holds, remember that is twin-free. Hence, every edge contained in must have a distinguisher in , which is in . (In fact, notice that if and are twins in then the distinguisher is necessarily in .)
3. Running-time Analysis
We now analyse the running-time of Algorithm 1. We assume that input graphs are given by their adjacency matrix. We need the following easy lemma, whose proof we include for completeness.
Lemma \thelemma.
Given a matrix , the set of all equivalence classes of equal rows of can be found in time .
Proof.
Let and be the set of rows of whose first entry is and , respectively. We can determine and by reading the first column of , which takes time . We then recurse on and , where is the submatrix of induced by and the last columns of . ∎
Before proving the next lemma we remark that, given a graph with vertices and edges, one can check whether is a cluster graph by checking that each of its components is a clique, which takes time.
Lemma \thelemma.
Let be an -vertex, twin-free graph. In time, we can find an induced subgraph of and a cost function on such that is -good.
Proof.
We fix any vertex , and let . We can check in time whether is chordal by using the algorithm from [35]. If is not chordal this algorithm returns, as a certificate, a hole . By Lemma 2.1, is strongly 2-good and the corresponding function can be computed straightforwardly, hence we are done. Suppose now that is chordal. We can construct the clique tree of (see for instance [34]) in time. Each edge of the tree induces a separation of , and we can check if each side is a cluster graph in time. If neither side is a cluster graph, then we have found a in . Hence, is strongly 2-good and the corresponding function can be computed straightforwardly. Since the clique tree has at most vertices, by orienting its edges as in the proof of Lemma 2.2 we find, in time, a hitting clique. By applying Lemma 2.3 to get rid of twins, which can be done in time, we obtain a subgraph satisfying the hypotheses of Lemma 2.2. Finally, by Lemma 2.2, the cost function in the statement of Lemma 2.2 can be constructed in time. ∎
Lemma \thelemma.
Algorithm 1 runs in -time.
Proof.
By Lemma 3, finding all twins in can be done in time . Therefore, the most expensive recursive call of the algorithm is the construction of the -good weighted induced subgraph from Lemma 3, which can be done in time . Therefore, the running-time of the algorithm satisfies , which gives . ∎
of Theorem 1.
The proof is identical to [19, Proof of Theorem 1, pages 365–366], except that factor needs to be replaced everywhere by . One easily proves by induction that the vertex set output by the algorithm on input is a minimal hitting set with . We do not include more details here, and instead refer the reader to [19]. Theorem 2 guarantees that Algorithm 1 runs in polynomial time. ∎
4. Polyhedral results
In this section we study how well LP-relaxtions can approximate Cluster-VD, as already described in Section 1. We begin with a brief description of the Sherali-Adams hierarchy [33], which is a standard procedure to obtain strengthened LP relaxations for binary linear programs. For a more thorough introduction, we refer the reader to [27]. Throughout the section we closely follow the exposition given in [5], where the Sherali-Adams hierarchy and a very similar concept of diagonal (defined below) are used to approach the FVST problem, as mentioned in Section 1.3. In particular, our Lemma 4 is similar to another lemma in [5].
Let be a polytope contained in and . For each , we define a polytope as follows. Let be the nonlinear system obtained from by multiplying each constraint by for all disjoint subsets of such that . Note that if , then . Therefore, we can obtain a linear system from by setting for all and then for all with . We then let be the projection of onto the variables , .
Let denote the collection of all vertex sets that induce a in and let , where
If a cost function is provided, we let
denote the optimum value of the corresponding linear programming relaxation. For the sake of simplicity, we sometimes denote by the above linear program itself.
We say vertices and form a diagonal if there are vertices such that and . We say that a path contains a diagonal if any of its pairs of vertices are diagonals. Note that a diagonal pair in a path need not be an edge in the path.
Our first results concern . For later use, we list here the inequalities defining . For all and , we have the inequalities
(1) | ||||
(2) | ||||
(3) |
In addition, there are the inequalities
(4) |
for all distinct . The polytope is the set of all such that there exists such that inequalities (1)–(4) are satisfied. Note that by definition, and are the same variable.
In order to establish the integrality gap of , we need two preliminary lemmas.
Lemma \thelemma.
Let . If contains a which has a diagonal, then for some vertex of .
Proof.
Assume by contradiction that , form a diagonal and all components of are less than 2/5. By the definition of diagonal, there exist with . In particular, from (1) we have and from (2) .
Adding these two inequalities, we obtain . We must have since otherwise . Now let be the third vertex of the containing (notice that can be the middle vertex of the ). By (1) and (4), we have which means that , a contradiction which concludes the proof. ∎
Lemma \thelemma.
Let be a path or a cycle, and . Then:
-
(i)
there is an efficient algorithm that solves Cluster-VD for ;
-
(ii)
the basic LP has integrality gap at most .
Proof.
First, let be a path. We notice that the coefficient matrix of the basic LP is totally unimodular, by the consecutive ones property [32]. Hence solving the LP yields an integral optimal solution, which corresponds to a hitting set of of cost equal to . This proves (i), (ii) when is a path.
Now, let be a cycle, and let . Suppose that belongs to a minimum cost hitting set of . Then is a minimum cost hitting set of , hence it can be found efficiently since is a path. By iterating over all and taking the hitting set of minimum cost, we efficiently solve Cluster-VD for . This concludes the proof of (i).
Finally, let be an optimal solution of . First, assume that there is some such that . Since is a path, the optimal hitting set in has cost . Hence, we see that is a hitting set of with . On the other hand, if for all vertices , then the constraint (where are the neighbors of ) implies that there can be no vertex with . So for all . Therefore, extreme point is the unique solution of equations of the form for . Hence for all vertices. Thus . Now notice that since is a cycle we can partition its vertices into two disjoint hitting sets and . Without loss of generality assume that . Then . This concludes the proof of (ii). ∎
Theorem 3.
There is a polynomial-time algorithm that, given a graph and , outputs a hitting set of such that . In particular, the integrality gap of is at most .
Proof.
Let be an optimal solution of . Let , and . Notice that the restriction of to is a feasible solution for whose components are all strictly less than . Hence, by Lemma 4, cannot contain a which has a diagonal. We will now show that, after possibly getting rid of twins, is a disjoint union of paths and cycles. Then, by applying Lemma 4, we will obtain a minimum cost hitting set of , which, together with , will form our desired hitting set.
First, if contains twins and , we can delete , and set , for to obtain a smaller weighted graph . We claim that . To see this, we show that one can turn any feasible solution of into a feasible solution of without increasing the cost. Let and for . It is easy to check that this defines a feasible solution to , whose cost is
This proves the claim. Moreover, a hitting set for can be immediately obtained from a hitting set of by adding if and only if . Finally, notice that there is a feasible solution for obtained from the restriction of whose components are all strictly less than . Hence, from now on we assume that is twin-free.
Now, we claim that is triangle-free and claw-free. Suppose that contains a triangle with vertices , and . Since is twin-free, every edge of the triangle has a distinguisher. Without loss of generality, contains , , and where are distinct vertices outside the triangle. It is easy to see that, for instance, edge is a diagonal contained in a , a contradiction. A similar argument shows that cannot contain a claw. This proves the claim, and implies that has maximum degree at most 2. That is, is a disjoint union of paths and cycles. By part (i) of Lemma 4, (applied to each component of ), we obtain a minimum cost hitting set of which, thanks to (ii) of Lemma 4, satisfies .
Finally, consider . Clearly, it is a hitting set of . Moreover, one has
It follows that the integrality gap of is at most , concluding the proof. ∎
We now complement the result above by showing a lower bound on the integrality gap of .
Theorem 4.
For every there is some instance of Cluster-VD such that .
Proof.
We show there is a graph for which every hitting set has for . Let be a graph whose girth is at least for some constant and with the independence number where . It can be shown via the probabilistic method that such a exists, see [2]. Set for all . We have for every hitting set . To see this observe that since is triangle-free and , when we remove we will get at most components each of size at most 2. Thus there are at most vertices in , so . Therefore, .
In order to show , we construct the following feasible solution to . Set for all and if and if . The inequalities defining are all satisfied by . This is obvious for inequalities (1), (3) and (4). For inequality (2), notice that at most one of , , can be an edge of , since otherwise would have a cycle of length at most . Thus (2) is satisfied too, and .
This completes the proof since, by taking , we have . ∎
We now show that the integrality gap decreases to after applying rounds of Sherali-Adams. We first need the following lemma.
Lemma \thelemma.
Fix and . Let be a minimum order weighted graph such that . The following two assertions hold:
-
(i)
if is an optimal solution to , then for all ;
-
(ii)
is connected and twin-free.
Proof.
(i) Suppose for a contradiction that , for some . Note that restricted to is a feasible solution to . Thus . By the minimality of , there is a hitting set of such that . Therefore is a hitting set of with , a contradiction.
(ii) Note that is connected, otherwise there exists a connected component of such that , where is the restriction of to , contradicting the minimality of . To show that is twin-free, we proceed exactly as in the proof of Theorem 3. ∎
Theorem 5.
For every fixed , performing rounds of the Sherali-Adams hierarchy produces an LP relaxation of Cluster-VD whose integrality gap is at most . That is, for all weighted graphs .
Proof.
In order to simplify the notation below, let us assume that is integer. For instance, we could restrict to for some . This does not hurt the generality of the argument. We take . We may assume that since otherwise we invoke Theorem 3 (taking suffices in this case).
Let be a counterexample to the theorem, with minimum. By Lemma 4.(i), for every optimal solution to , every vertex has . By Lemma 4.(ii), is twin-free (and connected).
We will use the following fact several times in the proof: for all with and every , the restriction of to the variables in is a convex combination of hitting sets of . This is easy to see since, denoting by the restriction of , we get and the Sherali-Adams hierarchy is known to converge in at most “dimension-many” rounds, see for instance [17].
First, we claim that has no clique of size at least . Suppose otherwise. Let be a clique of size and let be a minimal set such that each edge of has a distinguisher in . Let . Then, following the construction from Section 2.3, one can obtain a cost function such that , and for any hitting set of . See [19, Lemma 3] for the full construction, whose proof also shows that . Since every valid inequality supported on at most vertices is valid for , the inequality is valid for . Since , this implies that for all , there is some vertex with . Since , we get a contradiction. This proves our first claim.
Second, we claim that for every , the subgraph of induced by the neighborhood has no stable set of size at least . The proof is similar to that for cliques given above, except that this time we let be the induced star with apex and . The cost function given by Lemma 2.2 has and for all . Notice that once again . The star inequality is valid for , which guarantees that for every there is some which has . This establishes our second claim.
Third, we claim that the neighborhood of every vertex induces a chordal subgraph of . Suppose that is a hole in . We first deal with the case . We can repeat the same proof as above, letting be the induced wheel on and using the cost function defined in the proof of Lemma 2.1. Consider the wheel inequality , where . Since the wheel has at most vertices, the wheel inequality is valid for . Since , for every , there is some which has . This concludes the case where is “small”.
Now, assume that , and consider the wheel inequality with right-hand side scaled by . Suppose this inequality is valid for . This still implies that some vertex of has , for all , which produces the desired contradiction. It remains to prove that the scaled wheel inequality is valid for .
Let denote any -vertex induced subgraph of that is a fan.222A fan is a graph obtained from a path by adding an apex vertex. Hence, contains as an apex vertex, plus a path on vertices. Letting and for , we get the inequality , which is valid for . By taking all possible choices for , and averaging the corresponding inequalities, we see that the inequality
is valid for . It can be seen that this inequality dominates the scaled wheel inequality, in the sense that each left-hand side coefficient is not larger than the corresponding coefficient in the scaled wheel inequality, while the right-hand side is not smaller than the right-hand side of the scaled wheel inequality. Therefore, the scaled wheel inequality is valid for . This concludes the proof of our third claim.
By the first, second and third claim333The first inequality follows since , for every perfect graph ., for all choices of . This implies in particular that . Now let . Theorem 2 applies since is twin-free, by our second claim. Let be the corresponding cost function. The inequality is valid for .
Let be defined as in Step 15 of Algorithm 1, and let denote any vertex such that . By minimality of , there exists in a minimal hitting set of cost . We let in case is a hitting set of , and otherwise. Assume that , the other case is easier. We have
By LP duality, we have . This implies that , contradicting the fact that is a counterexample. This concludes the proof. ∎
We now complement the result above by showing that every LP relaxation of Cluster-VD with (worst case) integrality gap at most must have super-polynomial size. The result is a simple consequence of an analogous result of [9] on the integrality gap of Vertex Cover, and of the straightforward reduction from Vertex Cover to Cluster-VD.
Proposition \theprop.
For infinitely many values of , there is a graph on vertices such that every size- LP relaxation of Cluster-VD on has integrality gap .
Proof.
In [9] a similar result is proved for LP-relaxations of Vertex Cover: for infinitely many values of , there is a graph on vertices such that every size- LP relaxation of Vertex Cover on has integrality gap at least , where is a non-negative function.
Let be such a graph, and let be the graph obtained from by attaching a pendant edge to every vertex. It is easy to see that is a hitting set for if and only if is a vertex cover of .
Toward a contradiction, suppose that is a size- LP relaxation of Cluster-VD on with integrality gap at most , for a fixed (where for some dimension depending on ). For every there exists a hitting set of such that .
We can easily turn into an LP relaxation for Vertex Cover. For every vertex cover of , we let the corresponding point be the point for seen as a hitting set in . For every , we define via for , and for . Then, we let the affine function for be the affine function for .
Since the integrality gap of , seen as an LP relaxation of Cluster-VD, is at most , for every there exists a hitting set of whose cost is at most , where is the cost function corresponding to . If contains any vertex of , we can replace this vertex by its unique neighbor in , without any increase in cost. In this way, we can find a vertex cover of whose cost satisfies . Hence, the integrality gap of as an LP relaxation of Vertex Cover is also at most . As the size of is , this provides the desired contradiction. ∎
We point out that the size bound in the previous result can be improved. Kothari, Meka and Raghavendra [26] have shown that for every there is a constant such that no LP relaxation of size less than has integrality gap less than for Max-CUT. Since Max-CUT acts as the source problem in [9], one gets a size lower bound for Vertex Cover in order to achieve integrality gap . This also follows in a black-box manner from [26] and [12]. The proof of Proposition 4 shows that the same bound applies to Cluster-VD.
5. Conclusion
In this paper we provide a tight approximation algorithm for the cluster vertex deletion problem (Cluster-VD). Our main contribution is the efficient construction of a local cost function on the vertices at distance at most from any vertex such that every minimal hitting set of the input graph has local cost at most twice the local optimum. If the subgraph induced by (the first neighborhood of ) contains a hole, or a , then this turns out to be straightforward. The most interesting case arises when the local subgraph is twin-free, has radius , and moreover is chordal and -free. Such graphs are very structured, which we crucially exploit.
Lemma 2.2 allows us to define the local cost function on the vertices distinct from and then later adjust the cost of . We point out that condition (ii) basically says that the local cost function should define a hyperplane that “almost” separates the hitting set polytope and the clique polytope of the chordal, -free graph . This was a key intuition which led us to the proof of Theorem 2. If these polytopes were disjoint, this would be easy. But actually it is not the case since they have a common vertex (as we show, has a hitting clique).
One natural question arising from our approach of Cluster-VD in general graphs is the following: is the problem polynomial-time solvable on chordal graphs? This seems to be a non-trivial open question, also mentioned in [15], where similar vertex deletion problems are studied for chordal graphs. It could well be that Cluster-VD in general chordal graphs is hard. Now, what about chordal, -free graphs? We propose this last question as our first open question.
Our second contribution are polyhedral results for the Cluster-VD problem, in particular with respect to the tightness of the Sherali-Adams hierarchy. Our results on Sherali-Adams fail to match the 2-approximation factor of our algorithm (by epsilon), and we suspect this is not by chance. We believe that, already for certain classes of triangle-free graphs, the LP relaxation given by a bounded number of rounds of the Sherali-Adams hierarchy has an integrality gap strictly larger than . Our intuition goes as follows. Consider the star inequality , valid when is a stable set. Capturing all star inequalities is sufficient to achieve an integrality gap of at most 2 for all triangle-free graphs [19, Algorithm 1]. However, we suspect that Sherali-Adams will have a hard time recovering these in a constant number of rounds. The star inequality is very similar to the clique inequality , which is valid for Vertex Cover when is a clique. It is known that Sherali-Adams is unable to capture all clique inequalities in a constant number of rounds of the Vertex Cover relaxation (see [27, Section 6.1] for an equivalent statement on clique inequalities for the stable set polytope). Whether this intuition is accurate is our second open question.
As mentioned already in the introduction, we do not know any polynomial-size LP or SDP relaxation with integrality gap at most for Cluster-VD. In order to obtain such a relaxation, it suffices to derive each valid inequality implied by Lemmas 2.1, 2.1, 2.2 and also somehow simulate Lemma 2.3. Here, different techniques to construct extended formulations (see for instance [7, 3, 36]) could be used. A partial result in this direction is that the star inequality has a bounded-degree sum-of-squares proof. This implies that a bounded number of rounds of the Lasserre hierarchy provides an SDP relaxation for Cluster-VD with integrality gap at most , whenever the input graph is triangle-free. This should readily generalize to the wheel inequalities of Lemma 2.1 and of course to the inequality of Lemma 2.1 (since the underlying graph has bounded size). However, we do not know how for instance to derive the inequalities of Lemma 2.2. We leave this for future work as our third open question.
Our fourth open question: what is the best running time for Algorithm 1? We think that it is possible to improve on our upper bound.
Another intriguing problem is to what extent our methods can be adapted to hitting set problems in other -uniform hypergraphs. We mention an open question due to László Végh [38]: for which classes of -uniform hypergraphs and which does the hitting set problem admit a -approximation algorithm?
As mentioned in the introduction, FVST (feedback vertex set in tournaments) is another hitting set problem in a -uniform hypergraph, which is also UGC-hard to approximate to a factor smaller than . There is a recent randomized -approximation algorithm [29], but no deterministic (polynomial-time) algorithm is known. Let us repeat here the relevant open question from [29]: does FVST admit a deterministic -approximation algorithm?
Acknowledgements
We are grateful to Daniel Lokshtanov for suggesting Lemma 2.2, which allowed us to simplify our algorithm and its proof. We also thank two anonymous referees for their helpful comments, which improved the presentation of the paper.
References
- [1] R. Albert, H. Jeong, and A.-L. Barabási. Error and attack tolerance of complex networks. Nature, 406(6794):378–382, 2000.
- [2] N. Alon and J. H. Spencer. The Probabilistic Method. Wiley Publishing, 4th edition, 2016.
- [3] M. Aprile. Extended formulations for matroid polytopes through randomized protocols. arXiv preprint arXiv:2106.12453, 2021.
- [4] M. Aprile, N. Castro, G. Ferreira, J. Piccini, F. Robledo, and P. Romero. Graph fragmentation problem: analysis and synthesis. International Transactions in Operational Research, 26(1):41–53, 2019.
- [5] M. Aprile, M. Drescher, S. Fiorini, and T. Huynh. A simple 7/3-approximation algorithm for feedback vertex set in tournaments. arXiv preprint arXiv:2008.08779, 2020.
- [6] M. Aprile, M. Drescher, S. Fiorini, and T. Huynh. A tight approximation algorithm for the cluster vertex deletion problem. In International Conference on Integer Programming and Combinatorial Optimization, pages 340–353. Springer, 2021.
- [7] M. Aprile and Y. Faenza. Extended formulations from communication protocols in output-efficient time. Mathematical Programming, 183(1):41–59, 2020.
- [8] M. Aprile, Y. Faenza, S. Fiorini, T. Huynh, and M. Macchia. Extension complexity of stable set polytopes of bipartite graphs. volume 10520, pages 75–87. Springer, Cham, 2017.
- [9] A. Bazzi, S. Fiorini, S. Pokutta, and O. Svensson. No small linear program approximates vertex cover within a factor . Mathematics of Operations Research, 44(1):147–172, 2019.
- [10] J. R. Blair and B. Peyton. An introduction to chordal graphs and clique trees. In Graph theory and sparse matrix computation, pages 1–29. Springer, 1993.
- [11] A. Boral, M. Cygan, T. Kociumaka, and M. Pilipczuk. A fast branching algorithm for cluster vertex deletion. Theory Comput. Syst., 58(2):357–376, 2016.
- [12] G. Braun, S. Pokutta, and A. Roy. Strong reductions for extended formulations. Math. Program., 172(1–2):591–620, 2018.
- [13] G. Braun, S. Pokutta, and D. Zink. Inapproximability of combinatorial problems via small LPs and SDPs. In Proceedings of STOC 2015, pages 107–116, New York, NY, USA, 2015. ACM.
- [14] M.-C. Cai, X. Deng, and W. Zang. An approximation algorithm for feedback vertex sets in tournaments. SIAM J. Comput., 30(6):1993–2007, 2001.
- [15] Y. Cao, Y. Ke, Y. Otachi, and J. You. Vertex deletion problems on chordal graphs. Theoretical Computer Science, 745:75–86, 2018.
- [16] S. O. Chan, J. R. Lee, P. Raghavendra, and D. Steurer. Approximate constraint satisfaction requires large lp relaxations. Journal of the ACM (JACM), 63(4):1–22, 2016.
- [17] M. Conforti, G. Cornuéjols, G. Zambelli, et al. Integer programming, volume 271. Springer, 2014.
- [18] S. Fiorini, G. Joret, and O. Schaudt. Improved approximation algorithms for hitting 3-vertex paths. In International Conference on Integer Programming and Combinatorial Optimization, pages 238–249. Springer, 2016.
- [19] S. Fiorini, G. Joret, and O. Schaudt. Improved approximation algorithms for hitting 3-vertex paths. Math. Program., 182(1-2, Ser. A):355–367, 2020.
- [20] F. V. Fomin, S. Gaspers, D. Lokshtanov, and S. Saurabh. Exact algorithms via monotone local search. J. ACM, 66(2):Art. 8, 23, 2019.
- [21] F. V. Fomin, T. Le, D. Lokshtanov, S. Saurabh, S. Thomassé, and M. Zehavi. Subquadratic kernels for implicit 3-hitting set and 3-set packing problems. ACM Trans. Algorithms, 15(1):13:1–13:44, 2019.
- [22] A. Freund, R. Bar-Yehuda, and K. Bendel. Local ratio: a unified framework for approximation algorithms. ACM Computing Surveys, 36:422–463, 2005.
- [23] S. Hosseinian and S. Butenko. Polyhedral properties of the induced cluster subgraphs. Discrete Applied Mathematics, 297:80–96, 2021.
- [24] F. Hüffner, C. Komusiewicz, H. Moser, and R. Niedermeier. Fixed-parameter algorithms for cluster vertex deletion. Theory Comput. Syst., 47(1):196–217, 2010.
- [25] E. Jahanpour and X. Chen. Analysis of complex network performance and heuristic node removal strategies. Communications in Nonlinear Science and Numerical Simulation, 18(12):3458–3468, 2013.
- [26] P. K. Kothari, R. Meka, and P. Raghavendra. Approximating rectangles by juntas and weakly-exponential lower bounds for LP relaxations of CSPs. In H. Hatami, P. McKenzie, and V. King, editors, Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, Montreal, QC, Canada, June 19-23, 2017, pages 590–603. ACM, 2017.
- [27] M. Laurent. A comparison of the Sherali-Adams, Lovász-Schrijver, and Lasserre relaxations for 0-1 programming. Math. Oper. Res., 28(3):470–496, 2003.
- [28] D. Lokshtanov. Personal communication.
- [29] D. Lokshtanov, P. Misra, J. Mukherjee, F. Panolan, G. Philip, and S. Saurabh. -approximating feedback vertex set in tournaments. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1010–1018. SIAM, 2020.
- [30] M. Mnich, V. V. Williams, and L. A. Végh. A 7/3-approximation for feedback vertex sets in tournaments. In P. Sankowski and C. D. Zaroliagis, editors, 24th Annual European Symposium on Algorithms, ESA 2016, August 22-24, 2016, Aarhus, Denmark, volume 57 of LIPIcs, pages 67:1–67:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2016.
- [31] T. Rothvoß. The Lasserre hierarchy in approximation algorithms. Lecture Notes for the MAPSP, pages 1–25, 2013.
- [32] A. Schrijver. Theory of Linear and Integer Programming. Wiley, 1998.
- [33] H. D. Sherali and W. P. Adams. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Discrete Math., 3(3):411–430, 1990.
- [34] R. E. Tarjan. Decomposition by clique separators. Discrete mathematics, 55(2):221–232, 1985.
- [35] R. E. Tarjan and M. Yannakakis. Simple linear-time algorithms to test chordality of graphs, test acyclicity of hypergraphs, and selectively reduce acyclic hypergraphs. SIAM Journal on Computing, 13(3):566–579, 1984.
- [36] H. R. Tiwary, M. Kouteckỳ, and P. Kolman. Extension complexity, mso logic, and treewidth. Discrete Mathematics & Theoretical Computer Science, 22, 2020.
- [37] D. Tsur. Faster parameterized algorithm for cluster vertex deletion. CoRR, abs/1901.07609, 2019.
- [38] L. A. Végh. Personal communication.
- [39] J. You, J. Wang, and Y. Cao. Approximate association via dissociation. Discret. Appl. Math., 219:202–209, 2017.