Structural Iterative Rounding for Generalized $k$ -Median Problems

Anupam Gupta , Benjamin Moseley and Rudy Zhou

Abstract.

This paper considers approximation algorithms for generalized $k$ -median problems. This class of problems can be informally described as $k$ -median with a constant number of extra constraints, and includes $k$ -median with outliers, and knapsack median. Our first contribution is a pseudo-approximation algorithm for generalized $k$ -median that outputs a $6.387$ -approximate solution, with a constant number of fractional variables. The algorithm builds on the iterative rounding framework introduced by Krishnaswamy, Li, and Sandeep for $k$ -median with outliers. The main technical innovation is allowing richer constraint sets in the iterative rounding and taking advantage of the structure of the resulting extreme points.

Using our pseudo-approximation algorithm, we give improved approximation algorithms for $k$ -median with outliers and knapsack median. This involves combining our pseudo-approximation with pre- and post-processing steps to round a constant number of fractional variables at a small increase in cost. Our algorithms achieve approximation ratios $6.994+\epsilon$ and $6.387+\epsilon$ for $k$ -median with outliers and knapsack median, respectively. These improve on the best-known approximation ratio $7.081+\epsilon$ for both problems [KLS18].

Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213. Research supported in part by NSF awards CCF-1907820, CCF1955785, and CCF-2006953.

Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA 15213. Moseley and Zhou were supported in part by a Google Research Award, an Infor Research Award, a Carnegie Bosch Junior Faculty Chair and NSF grants CCF-1824303, CCF-1845146, CCF-1733873 and CMMI-1938909

1. Introduction

Clustering is a fundamental problem in combinatorial optimization, where we wish to partition a set of data points into clusters such that points within the same cluster are more similar than points across different clusters. In this paper, we focus on generalizations of the $k$ -median problem. Recall that in this problem, we are given a set $F$ of facilities, a set $C$ of clients, a metric $d$ on $F\cup C$ , and a parameter $k\in\mathbb{N}$ . The goal is to choose a set $S\subset F$ of $k$ facilities to open to minimize the sum of connection costs of each client to its closest open facility. That is, to minimize the objective $\sum_{j\in C}d(j,S)$ , where we define $d(j,S)=\min_{i\in S}d(i,j)$ .

The $k$ -median problem is well-studied from the perspective of approximation algorithms, and many new algorithmic techniques have been discovered while studying it. Examples include linear program rounding [BPR⁺17, LS16], primal-dual algorithms [JV01], local search [AGK⁺04], and large data techniques [LG18, MKC⁺15, GLZ17, GMM⁺03, IQM⁺20]. Currently, the best approximation ratio for $k$ -median is $2.675+\epsilon$ [BPR⁺17], and there is a lower bound of $1+2/e$ assuming $P\neq NP$ [JMS02].

Recently, there has been significant interest in generalizations of the $k$ -median problem [CKMN01, KKN⁺15]. One such generalization is the knapsack median problem. In knapsack median, each facility has a non-negative weight, and we are given budget $B\geq 0$ . The goal is to choose a set of open facilities of total weight at most $B$ (instead of having cardinality at most $k$ ) to minimize the same objective function. That is, the open facilities must satisfy a knapsack constraint. Another commonly-studied generalization is $k$ -median with outliers, also known as robust $k$ -median. Here we open $k$ facilities $S$ , as in basic $k$ -median, but we no longer have to serve all the clients; now, we are only required to serve at least $m$ clients $C^{\prime}\subset C$ of our choice. Formally, the objective function is now $\sum_{j\in C^{\prime}}d(j,S)$ .

Both knapsack median and $k$ -median with outliers have proven to be much more difficult than the standard $k$ -median problem. Algorithmic techniques that have been successful in approximating $k$ -median often lead to only a pseudo-approximation for these generalizations—that is, they violate the knapsack constraint or serve fewer than $m$ clients [BPR⁺18, CKMN01, FKRS19, IQM⁺20]. Obtaining “true” approximation algorithms requires new ideas beyond those of $k$ -median. Currently the best approximation ratio for both problems is $7.081+\epsilon$ due to the beautiful iterative rounding framework of Krishnaswamy, Li, and Sandeep [KLS18]. The first and only other true approximation for $k$ -median with outliers is a local search algorithm due to Ke Chen [Che08].

1.1. Generalized $k$ -median

Observe that both knapsack median and $k$ -median with outliers maintain the salient features of $k$ -median; that is, the goal is to open facilities to minimize the connection costs of served clients. These variants differ in the way we put constraints on the open facilities and served clients. In particular, in standard $k$ -median, we have a cardinality constraint on the open facilities, whereas for knapsack median the open facilities are subject to a knapsack constraint; in both cases we must serve all clients. For $k$ -median with outliers, we are constrained to open at most $k$ facilities, and serve at least $m$ clients.

In this paper, we consider a further generalization of $k$ -median that we call generalized $k$ -median (GKM). As in $k$ -median, our goal is to open facilities to minimize the connection costs of served clients. In GKM, the open facilities must satisfy $r_{1}$ given knapsack constraints, and the served clients must satisfy $r_{2}$ given coverage constraints. We define $r=r_{1}+r_{2}$ .

1.2. Our Results

The main contribution of this paper is a refined iterative rounding algorithm for GKM. Specifically, we show how to round the natural linear program (LP) relaxation of GKM to ensure all except $O(r)$ of the variables are integral, and the objective function is increased by at most a $6.387$ -factor. It is not difficult to show that the iterative rounding framework in [KLS18] can be extended to show a similar result. Indeed, a $7.081$ -approximation for GKM with at most $O(r)$ fractional facilities is implicit in their work. The improvement in this work is the smaller loss in the objective value.

Our improvement relies on analyzing the extreme points of certain set-cover-like LPs. These extreme points arise at the intermediate steps of our iterative rounding, and by leveraging their structural properties, we obtain our improved pseudo-approximation for GKM. This work reveals some of the structure of such extreme points, and it shows how this structure can lead to improvements.

Our second contribution is improved “true” approximation algorithms for two special cases of GKM: knapsack median and $k$ -median with outliers. For both problems, applying the pseudo-approximation algorithm for GKM gives a solution with $O(1)$ fractional facilities. Thus, the remaining work is to round a constant number of fractional facilities to obtain an integral solution. To achieve this goal, we apply known sparsification techniques [KLS18] to pre-process the instance, and then develop new post-processing algorithms to round the final $O(1)$ fractional facilities.

We show how to round these remaining variables for knapsack median at arbitrarily small loss, giving a $6.387+\epsilon$ -approximation, improving on the best $7.081+\epsilon$ -approximation. For $k$ -median with outliers, a more sophisticated post-processing is needed to round the $O(1)$ fractional facilities. This procedure loses more in the approximation ratio. In the end, we obtain a $6.994+\epsilon$ -approximation, modestly improving on the best known $7.081+\epsilon$ -approximation.

1.3. Overview of Techniques

To illustrate our techniques, we first introduce a natural LP relaxations for GKM. The problem admits an integer program formulation, with variables $\{x_{ij}\}_{i\in F,j\in C}$ and $\{y_{i}\}_{i\in F}$ , where $x_{ij}$ indicates that client $j$ connects to facility $i$ and $y_{i}$ indicates that facility $i$ is open. Relaxing the integrality constraints gives the linear program relaxation $LP_{1}$ .

\begin{array}[]{lll|lll}(LP_{1})\min_{x,y}&{}{}&\sum_{i\in F}\sum_{j\in C}d(i,j)\,x_{ij}&\qquad(LP_{2}):&\min_{y}&\sum_{i\in F}\sum_{j\in C:i\in F_{j}}d(i,j)\,y_{i}\\ &&\sum_{i\in F}x_{ij}\leq 1\qquad\forall j\in C&&&y(F_{j})\leq 1\qquad\forall j\in C\\ &&x_{ij}\leq y_{i}\qquad\forall i\in F,j\in C&&&\\ &&Wy\leq b&&&Wy\leq b\\ &&\sum_{j\in C}a_{j}(\sum_{i\in F}x_{ij})\geq c&&&\sum_{j\in C}a_{j}y(F_{j})\geq c\\ &&x_{ij},y_{i}\in[0,1]\qquad\forall i\in F,j\in C&&&y_{i}\in[0,1]\qquad\forall i\in F\end{array}

We focus on $LP_{1}$ for now. The linear program $LP_{1}$ is the standard $k$ -median LP with the extra side constraints. Note that $\sum_{i\in F}x_{ij}\leq 1$ may seem opposite to the intuition that we want clients to get “enough” coverage from the facilities, but that will be guaranteed by the coverage constraints below.

The constraint $Wy\leq b$ corresponds to the $r_{1}$ knapsack constraints on the facilities $y$ , where $W\in\mathbb{R}_{+}^{r_{1}\times F}$ and $b\in\mathbb{R}_{+}^{r_{1}}$ . These $r_{1}$ packing constraints can be thought of as a multidimensional knapsack constraint over the facilities, and ensure that “few” facilities are opened. Next, $\sum_{j\in C}a_{j}(\sum_{i}x_{ij})\geq c$ corresponds to the $r_{2}$ coverage constraints on the clients, where $a_{j}\in\mathbb{R}_{+}^{r_{2}}$ for all $j\in C$ and $c\in\mathbb{R}_{+}^{r_{2}}$ . These coverage constraints ensure that “enough” clients are served. E.g., having one packing constraint $\sum_{i\in F}y_{i}\leq k$ and one covering constraint $\sum_{j\in C}\sum_{i\in F}x_{ij}\geq m$ ensures that at least $m$ clients are covered by at most $k$ facilities; this is the $k$ -median with outliers problem.

Reducing the variables in the LP: We get $LP_{2}$ by eliminating the $x$ variables from $LP_{1}$ , thereby reducing the number of constraints. The idea from [KLS18] is to prescribe a set $F_{j}\subseteq F$ of permissible facilities for each client $j$ such that $x_{ij}$ is implicitly set to $y_{i}\mathbf{1}(i\in F_{j})$ . The details of this reduction and the procedure for creating $F_{j}$ are given in 2.1. Using this procedure, $LP_{2}$ is also a relaxation for GKM. Note that in $LP_{2}$ , we use the notation $y(F^{\prime})=\sum_{i\in F^{\prime}}y_{i}$ for $F^{\prime}\subset F$ .

Now consider solving $LP_{2}$ to obtain an optimal extreme point $\bar{y}$ . There must be $|F|$ linearly independent tight constraints at $\bar{y}$ , and we call these constraints the basis for $\bar{y}$ . The tight constraints of interest are the $y(F_{j})\leq 1$ constraints; in general, there are at most $\lvert C\rvert$ such tight constraints, and we have little structural understanding of the $F_{j}$ -sets.

Prior Iterative Rounding Framework: Consider the family of $F_{j}$ sets corresponding to tight constraints, so $\mathcal{F}=\{F_{j}\mid j\in C,\,\bar{y}(F_{j})=1\}$ . If $\mathcal{F}$ is a family of disjoint sets , then the tight constraints of $LP_{2}$ form a face of a partition matroid polytope intersected with at most $r$ side constraints (the knapsack and coverage constraints). Using ideas from, e.g., [KLS18, GRSZ14], we can show that $\bar{y}$ has at most $O(r)$ fractional variables.

Indeed, the goal of the iterative rounding framework in [KLS18] is to control the set family $\mathcal{F}$ to obtain an optimal extreme point where $\mathcal{F}$ is a disjoint family. To achieve this goal, they iteratively round an auxiliary LP based on $LP_{2}$ , where they have the constraint $y(F_{j})=1$ for all clients $j$ in a special set $C^{*}\subset C$ . Roughly, they regulate what clients are added to $C^{*}$ and delete constraints $y(F_{j})\leq 1$ for some clients. The idea is that a client $j$ whose constraint is deleted must be close to some client $j^{\prime}$ in $C^{*}$ . Since $y(F_{j^{\prime}})=1$ we can serve $j$ with the facility for $j^{\prime}$ , and the cost is small if $j^{\prime}$ ’s facility is close to $j$ .

To get intuition, assume each client $j$ can pay the farthest distance to a facility in $F_{j}$ , and call this the radius of $F_{j}$ . (Precisely, clients may not be able to afford this distance, but we use this assumption to highlight the ideas behind our algorithmic decisions.) For simplicity, assume all radii are powers of two. Over time, this radius shrinks if some $y$ variables in $F_{j}$ are set to zero. Consider applying the following iterative steps until none are applicable, in which case $C^{*}$ corresponds to the tight constraints: (1) delete a constraint for $j\notin C^{*}$ if the radius of $F_{j}$ is at least that of some $F_{j^{\prime}}$ for $j^{\prime}\in C^{*}$ and $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ . (2) add $j\notin C^{*}$ to $C^{*}$ if $y(F_{j})=1$ and for every $j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ it is the case that $F_{j^{\prime}}$ has a radius strictly larger than $F_{j}$ . If added then remove all $j^{\prime}$ from $C^{*}$ where $j$ ’s radius is half or less of the radius of $j^{\prime}$ and $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ .

The approximation ratio is bounded by how much a client $j$ with a deleted constraint pays to get to a facility serving a client in $C^{*}$ . After removing $j$ ’s constraint, the case to worry about is if $j$ ’s closest client $j^{\prime}\in C^{*}$ is later removed from $C^{*}$ . This happens only if $j^{\prime\prime}$ is added to $C^{*}$ , with $F_{j^{\prime\prime}}$ having half the radius of $F_{j^{\prime}}$ . Thus every time we remove $j$ ’s closest client in $C^{*}$ , we guarantee that $j$ ’s cost only increases geometrically. The approximation ratio is proportional to the total distance that $j$ must travel and can be directly related to the distance of “ball-chasing” though these $F_{j}$ sets. See Figure 1.

Refer to caption — Figure 1. Half and quarter ball chasing

New Framework via Structured Extreme Points: The target of our framework is to ensure that the radii decreases in the ball-chasing using a smaller factor, in particular one-quarter. This will give closer facilities for clients whose constraints are deleted and a better approximation ratio. See Figure 1. To achieve this ”quarter ball-chasing,” we can simply change half to one-quarter in step (2) above.

Making this change immediately decreases the approximation ratio; however, the challenge is that $\mathcal{F}$ is no longer disjoint. Indeed, it can be the case that $j,j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ if their radii differ by only a one half factor. Instead, our quarter ball-chasing algorithm maintains that $\mathcal{F}$ is not disjoint, but has a bipartite intersection graph.

The main technical challenge now is obtaining an extreme point with $O(r)$ fractional variables, which is no longer guaranteed as when $\mathcal{F}$ was disjoint. Indeed, if $\mathcal{F}$ has bipartite intersection graph, then the tight constraints form a face of the intersection of two partition matroid polytopes intersected with at most $r$ side constraints. In general, we cannot upper bound the number of fractional variables arising in the extreme points of such polytopes. However, such extreme points have a nice combinatorial structure: the intersection graph can be decomposed into $O(r)$ disjoint paths. We exploit this “chain decomposition” of extreme points arising in our iterative rounding to discover clients $j$ that can be removed from $C^{*}$ even if there is not a $j^{\prime}\in C^{*}$ where $F_{j^{\prime}}$ has one quarter of the radius of $F_{j}$ . We continue this procedure until we are left with only $O(r)$ fractional variables.

The main technical contribution of this work is showing how the problem can be reduced to structural characterization of extreme points corresponding to bipartite matching. This illustrates some of the structural properties of polytopes defined by $k$ -median-type problems. We hope that this helps lead to other structural characterizations of these polytopes and ultimately improved algorithms.

1.4. Organization

In §2, we introduce the auxiliary LP for GKM that our iterative rounding algorithm operates on. We note that this is the same LP used in the algorithm of [KLS18]. Then §3–5 give the pseudo-approximation for GKM. In particular, §3 describes the basic iterative rounding phase, where we iteratively update the auxiliary LP such that $\mathcal{F}^{*}=\{F_{j}\mid j\in C^{*}\}$ has a bipartite intersection graph. In §4, we characterize the structure of the resulting extreme points and use it to define a new iterative operation, which allows us to reduce the number of fractional variables to $O(r)$ . Finally, in §5, we combine the algorithms from §3 and §4 to obtain our pseduo-approximation algorithm for GKM.

We then obtain true approximations for knapsack median and $k$ -median with outliers: in §6, we describe our framework to turn pseudo-approximation algorithms into true approximations for both problems, and apply it to knapsack median. Then in §7, we give a more involved application of the same framework to $k$ -median with outliers.

2. Auxiliary LP for Iterative Rounding

In this section, we construct the auxiliary LP, $LP_{iter}$ , that our algorithm will use. We note that we use the same relaxation used in [KLS18]. Recall the two goals of iterative rounding, outlined in §1.3; we want to maintain a set of clients $C^{*}\subset C$ such that $\{F_{j}\mid j\in C^{*}\}$ has bipartite intersection graph, and $C^{*}$ should provide a good set of open facilities for the clients that are not in $C^{*}$ . Thus, we want to define $LP_{iter}$ to accommodate moving clients in and out of $C^{*}$ , while having the LP faithfully capture how much we think the clients outside of $C^{*}$ should pay in connection costs. For all missing proofs in this section, see §A.

2.1. Defining $F$ -balls

Our starting point is $LP_{2}$ , so we assume that we have sets $F_{j}\subset F$ for all $j\in C$ . The next proposition states that such sets can be found efficiently so that $LP_{2}$ is a relaxation of GKM.

Proposition 2.1.

There exists a polynomial time algorithm that given GKM instance $\mathcal{I}$ , duplicates facilities and outputs sets $F_{j}\subseteq F$ for $j\in C$ such that $Opt(LP_{2})\leq Opt(\mathcal{I})$ .

In §1.3, we assumed the radii of the $F_{j}$ sets were powers of two. To formalize this idea, we discretize the distances to powers of $\tau>1$ (up to some random offset.) The choice of $\tau$ is to optimize the final approximation ratio. The main ideas of the algorithm remain the same if we discretize to powers of, say $2$ , with no random offset. Our discretization procedure is the following:

Fix some $\tau>1$ and sample the random offset $\alpha\in[1,\tau)$ such that $\log_{e}\alpha$ is uniformly distributed in $[0,\log_{e}\tau)$ . Without loss of generality, we may assume that the smallest non-zero inter-point distance is $1$ . Then we define the possible discretized distances, $L(-2)=-1,L(-1)=0,\dots,L(\ell)=\alpha\tau^{\ell}$ for all $\ell\in\mathbb{N}$ .

For each $p,q\in F\cup C$ , we round $d(p,q)$ up to the next largest discretized distance. Let $d^{\prime}(p,q)$ denote the rounded distances. Observe that $d(p,q)\leq d^{\prime}(p,q)$ for all $p,q\in F\cup C$ . See §A for proof of the following proposition, which we use to bound the cost of discretization.

Proposition 2.2.

For all $p,q\in F\cup C$ , we have $\mathbb{E}[d^{\prime}(p,q)]=\frac{\tau-1}{\log_{e}\tau}d(p,q)$

Now using the discretized distances, we can define the radius level of $F_{j}$ for all $j\in C$ by:

\ell_{j}=\min\limits_{\ell\geq-1}\{\ell\mid d^{\prime}(j,i)\leq L(\ell)\quad\forall i\in F_{j}\}.

One should imagine that $F_{j}$ is a ball of radius $L(\ell_{j})$ in terms of the $d^{\prime}$ -distances. Thus, we will often refer to $F_{j}$ as the $F$ -ball of client $j$ . Further, to accommodate “shrinking” the $F_{j}$ sets, we define the inner ball of $F_{j}$ by:

B_{j}=\{i\in F_{j}\mid d^{\prime}(j,i)\leq L(\ell_{j}-1)\}.

Note that we defined $L(-2)=-1$ so that if $\ell_{j}=-1$ , then $B_{j}=\emptyset$ .

2.2. Constructing $LP_{iter}$

Our auxiliary LP will maintain three sets of clients: $C_{part},C_{full}$ , and $C^{*}$ . $C_{part}$ consists of all clients, whom we have not yet decided whether we should serve them or not. Then for all clients in $C_{full}$ and $C^{*}$ , we decide to serve them fully. The difference between the clients in $C_{full}$ and $C^{*}$ is that for the former, we remove the constraint $y(F_{j})=1$ from the LP, while for the latter we still require $y(F_{j})=1$ . Thus although we commit to serving $C_{full}$ , such clients rely on $C^{*}$ to find an open facility to connect to. Using the discretized distances, radius levels, inner balls, and these three sets of clients, we are ready to define $LP_{iter}$ :

( $LP_{iter}$ )		$\displaystyle\min\limits_{y}~{}~{}$	$\displaystyle\sum\limits_{j\in C_{part}}\sum\limits_{i\in F_{j}}d^{\prime}(i,j)y_{i}+\sum\limits_{j\in C_{full\cup C^{*}}}(\sum\limits_{i\in B_{j}}d^{\prime}(i,j)y_{i}+(1-y(B_{j}))L(\ell_{j}))$
	s.t.	$\displaystyle y(F_{j})\leq 1\quad\forall j\in C_{part}$
		$\displaystyle y(B_{j})\leq 1\quad\forall j\in C_{full}$
		$\displaystyle y(F_{j})=1\quad\forall j\in C^{*}$
		$\displaystyle Wy\leq b$
		$\displaystyle\sum\limits_{j\in C_{part}}a_{j}y(F_{j})\geq c-\sum\limits_{j\in C_{full}\cup C^{*}}a_{j}$
		$\displaystyle 0\leq y\leq 1$

Note that we use the rounded distances in the definition of $LP_{iter}$ rather than the original distances. Keeping this in mind, if $C_{part}=C$ and $C_{full},C^{*}=\emptyset$ , then $LP_{iter}$ is the same as $LP_{2}$ up to the discretized distances, so the following proposition is immediate.

Proposition 2.3.

Suppose $C_{part}=C$ and $C_{full},C^{*}=\emptyset$ . Then $\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}Opt(LP_{2})$ .

We now take some time to parse the definition of $LP_{iter}$ . Initially, all clients are in $C_{part}$ . For clients in $C_{part}$ , we are not sure yet whether we should serve them or not. Thus for these clients, we simply require $y(F_{j})\leq 1$ , so they can be served any amount, and in the objective, the contribution of a client from $C_{part}$ is exactly its connection cost (up to discretization) to $F_{j}$ .

The clients in $C_{full}$ correspond to the “deleted” constraints in §1.3. Importantly, for $j\in C_{full}$ , we do not require that $y(F_{j})=1$ ; rather, we relax this condition to $y(B_{j})\leq 1$ . Recall that we made the assumption that every client can pay the radius of its $F_{j}$ set in §1.3. To realize this idea, we require that each $j\in C_{full}$ pays its connection costs to $B_{j}$ in the objective. Then, to serve $j$ fully, $j$ must find $(1-y(B_{j}))$ units of open facility to connect to beyond $B_{j}$ . Now $j$ truly pays its radius, $L(\ell_{j})$ , for this $(1-y(B_{j}))$ units of connections in $LP_{iter}$ , so we can do “ball-chasing” to $C^{*}$ to find these facilities. In this case, we say that we re-route the client $j$ to some destination.

For clients in $C^{*}$ , we require $y(F_{j})=1$ . Note that the contribution of a $j\in C^{*}$ to the objective of $LP_{iter}$ is exactly its connection cost to $F_{j}$ . The purpose of $C^{*}$ is to provide destinations for $C_{full}$ .

Finally, because we have decided to fully serve all clients in $C_{full}$ and $C^{*}$ , regardless of how much they are actually served in their $F$ -balls, we imagine that they every $j\in C_{full}\cup C^{*}$ contributes $a_{j}$ to the coverage constraints, which is reflected in $LP_{iter}$ .

2.3. Properties of $LP_{iter}$

Throughout our algorithm, we will modify the data of $LP_{iter}$ - we will move clients between $C_{part},C_{full}$ , and $C^{*}$ and modify the $F$ -balls and radius levels. However, we still want the data of $LP_{iter}$ to satisfy some consistent properties, which we call our Basic Invariants.

Definition 2.4 (Basic Invariants).

We call the following properties our Basic Invariants:

(1)

$C_{part}\cup C_{full}\cup C^{*}$ partitions $C$ .
(2)

For all $j\in C$ , we have $d^{\prime}(j,i)\leq L_{\ell_{j}}$ for all $i\in F_{j}$ .
(3)

For all $j\in C$ , we have $B_{j}=\{i\in F_{j}\mid d^{\prime}(j,i)\leq L_{\ell_{j}-1}\}$ .
(4)

For all $j\in C$ , we have $\ell_{j}\geq-1$ .
(5)

(Distinct Neighbors) For all $j_{1},j_{2}\in C^{*}$ , if $F_{j_{1}}\cap F_{j_{2}}\neq\emptyset$ , then $\lvert\ell_{j_{1}}-\ell_{j_{2}}\rvert=1$ . In words, if the $F$ -balls of two clients in $C^{*}$ intersect, then they differ by exactly one radius level.

We want to emphasize Basic Invariant 2.4(5), which we call the Distinct Neighbors Property. It is not difficult to see that the Distinct Neighbors Property implies that $\{F_{j}\mid j\in C^{*}\}$ has bipartite intersection graph.

Definition 2.5 (Intersection Graph).

Let $\mathcal{F}=\{F_{j}\mid j\in C^{*}\}$ be a set family indexed by $C^{*}$ . The intersection graph of $\mathcal{F}$ is the undirected graph with vertex set $C^{*}$ such that two vertices $j$ and $j^{\prime}$ are connected by an edge if any only if $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ .

Proposition 2.6.

Suppose $LP_{iter}$ satisfies the Distinct Neighbors Property. Then the intersection graph of $\mathcal{F}=\{F_{j}\mid j\in C^{*}\}$ is bipartite.

The following proposition will also be useful.

Proposition 2.7.

Suppose $LP_{iter}$ satisfies the Distinct Neighbors Property. Then each facility is in at most two $F$ -balls for clients in $C^{*}$ .

We summarize the relevant properties of $LP_{iter}$ in the following lemma. The algorithm described by the lemma is exactly the steps we took in this section.

Lemma 2.8.

There exists a polynomial time algorithm that takes as input a GKM instance $\mathcal{I}$ and outputs $LP_{iter}$ such that $\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}Opt(I)$ and $LP_{iter}$ satisfies all Basic Invariants.

2.4. Notations

Throughout this paper, we will always index facilities using $i$ and clients using $j$ .

For any client $j\in C$ , we say that $j$ is supported on facility $i\in F$ if $i\in F_{j}$ . Then for any $C^{\prime}\subset C$ , we let $F(C^{\prime})\subset F$ be the set of all facilities supported on at least one client in $C^{\prime}$ .

Given a setting of the $y$ -variables of $LP_{iter}$ , we say a facility $i$ is fractional (with respect to the given $y$ -variables) if $y_{i}<1$ . Otherwise, facility $i$ is integral. Similarly, we say a client $j$ is fractional if $F_{j}$ contains only fractional facilities, and $j$ is integral otherwise. Using these definitions, for any $F^{\prime}\subset F$ , we can partition $F^{\prime}$ into $F^{\prime}_{<1}\cup F^{\prime}_{=1}$ , where $F^{\prime}_{<1}$ is the subset of fractional facilities and $F^{\prime}_{=1}$ is the subset of integral facilities. An analogous partition holds for a subset of clients $C^{\prime}\subset C$ , so we have $C^{\prime}=C^{\prime}_{<1}\cup C^{\prime}_{=1}$ .

3. Basic Iterative Rounding Phase

In this section, we describe the iterative rounding phase of our algorithm. This phase has two main goals: (a) to simplify the constraint set of $LP_{iter}$ , and (b) to decide which clients to serve and how to serve them. To make these two decisions, we repeatedly solve $LP_{iter}$ to obtain an optimal extreme point, and then use the structure of tight constraints to update $LP_{iter}$ , and reroute clients accordingly.

3.1. The Algorithm

Our algorithm repeatedly solves $LP_{iter}$ to obtain an optimal extreme point $\bar{y}$ , and then performs one of the following three possible updates, based on the tight constraints:

(1)

If some facility $i$ is set to zero in $\bar{y}$ , we delete it from the instance.
(2)

If constraint $\bar{y}(F_{j})\leq 1$ is tight for some $j\in C_{part}$ , then we decide to fully serve client $j$ by moving $j$ to either $C_{full}$ or $C^{*}$ . Initially, we add $j$ to $C_{full}$ then run Algorithm 2 to decide if $j$ should be in $C^{*}$ instead.
(3)

If constraint $\bar{y}(B_{j})\leq 1$ is tight for some $j\in C_{full}$ , we shrink $F_{j}$ by one radius level (so $j$ ’s new $F$ -ball is exactly $B_{j}$ .) Then we possibly move $j$ to $C^{*}$ by running Algorithm 2 for $j$ .

These steps are made formal in Algorithms 1 (IterativeRound) and 2 (ReRoute). IterativeRound relies on the subroutine ReRoute, which gives our criterion for moving a client to $C^{*}$ . This criterion for adding clients to $C^{*}$ is the key way in which our algorithm differs from that of [KLS18]. In [KLS18], the criterion used ensures that $\{F_{j}\mid j\in C^{*}\}$ is a family of disjoint sets. In contrast, we allow $F$ -balls for clients in $C^{*}$ to intersect, as long as they satisfy the Distinct Neighbors Property from Definition 2.4(5). Thus, our algorithm allows for rich structures in the set system $\{F_{j}\mid j\in C^{*}\}$ .

Input:

LP_{iter}

satisfying all Basic Invariants

Result: Modifies

LP_{iter}

and outputs an optimal extreme point of

LP_{iter}

2repeat

3 Solve

LP_{iter}

to obtain optimal extreme point

\bar{y}

4 if there exists a facility $i\in F$ such that $\bar{y}_{i}\geq 0$ is tight then

5 Delete

i

from

F

7 else if there exists a client $j\in C_{part}$ such that $y(F_{j})\leq 1$ is tight then

8 Move

j

from

C_{part}

C_{full}

\textsc{ReRoute}(j)

11 else if there exists a client $j\in C_{full}$ such that $\bar{y}(B_{j})\leq 1$ is tight then

12 Update

F_{j}\leftarrow B_{j}

and decrement

\ell_{j}

1

13 Update

B_{j}\leftarrow\{i\in F_{j}\mid d^{\prime}(j,i)\leq L(\ell_{j}-1)\}

\textsc{ReRoute}(j)

16 else

17 Output

\bar{y}

and Terminate.

18until termination

Algorithm 1 IterativeRound

Input: Client

j\in C_{full}

Result: Decide whether to move

j

C^{*}

or not

2if $\ell_{j}\leq\ell_{j^{\prime}}-1$ for all $j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ then

3 Move

j

from

C_{full}

C^{*}

4 For all

j^{\prime}\in C^{*}

such that

F_{j}\cap F_{j^{\prime}}\neq\emptyset

and

\ell_{j^{\prime}}\geq\ell_{j}+2

, move

j^{\prime}

from

C^{*}

C_{full}

Algorithm 2 ReRoute

The modifications made by IterativeRound do not increase $Opt(LP_{iter})$ , so upon termination of our algorithm, we have an optimal extreme point $\bar{y}$ to $LP_{iter}$ such that $LP_{iter}$ is still a relaxation of GKM and no non-negativity constraint, $C_{part}$ -constraint, or $C_{full}$ -constraint is tight for $\bar{y}$ . This is formalized in the following theorem, whose proof is similar to [KLS18], and is deferred to Appendix B.1.

Theorem 3.1.

IterativeRound is a polynomial time algorithm that maintains all Basic Invariants, weakly decreases $Opt(LP_{iter})$ , and outputs an optimal extreme point to $LP_{iter}$ such that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight.

Recall the goals from the beginning of the section: procedure IterativeRound achieves goal (a) of making $\{F_{j}\mid j\in C^{*}\}$ simpler while maintaining the Distinct Neighbors Property. Since we moved facilities between $C^{*}$ and $C_{full}$ , achieving goal (b) means deciding which facilities to open, and guaranteeing that each client has a “close-by” open facility. (Recall from §2 that $C^{*}$ is the set of clients such that their $F_{j}$ -balls are guaranteed to contain an open facility, and $C_{full}$ are the clients which are guaranteed to be served but using facilities opened in $C^{*}$ .)

Here’s the high-level idea of how we achieve goal (b). Suppose we move $j$ from $C_{full}$ to $C^{*}$ and some $j^{\prime}$ from $C^{*}$ to $C_{full}$ ; we want to find a good destination for $j^{\prime}$ . We claim $j$ ’s facility is a good destination for $j^{\prime}$ . Indeed, since $j$ is now in $C^{*}$ , we can use the constraint $y(F_{j})=1$ to bound the distance of $j^{\prime}$ to this unit of facility by $L(\ell_{j^{\prime}})+2L(\ell_{j})\leq(1+\frac{2}{\tau^{2}})L(\ell_{j^{\prime}})$ , using the facts that $\ell_{j^{\prime}}\geq\ell_{j}+2$ and $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ , which are guaranteed by ReRoute. Of course, if $j$ is removed from $C^{*}$ later, we re-route it to some client that is at least two radius levels smaller, and can send $j^{\prime}$ to that client. This corresponds to the “quarter ball-chasing” of §1.3. Indeed every further re-routing step for $j^{\prime}$ has geometrically decreasing cost, which give a cost of $O(1)L(\ell_{j^{\prime}})$ . We defer the formal analysis to 5.3, after we combine IterativeRound with another (new) iterative operation, which we present in the next section.

4. Iterative Operation for Structured Extreme Points

In this section, we achieve two goals: (a) we show that the structure of the extreme points of $LP_{iter}$ obtained from 3.1 are highly structured, and admit a chain decomposition. Then, (b) we exploit this chain decomposition to define a new iterative operation that is applicable whenever $\bar{y}$ has “many” (i.e., more than $O(r)$ ) fractional variables. We emphasize that this characterization of the extreme points is what enables the new iterative rounding algorithm.

4.1. Chain Decomposition

A chain is a sequence of clients in $C^{*}$ where the $F$ -ball of each client $j$ contains exactly two facilities—one shared with the previous ball and other with the next.

Definition 4.1 (Chain).

A chain is a sequence of clients $(j_{1},\dots,j_{p})\subseteq C^{*}$ satisfying:

•

$\lvert F_{j_{q}}\rvert=2$ for all $q\in[p]$ , and
•

$F_{j_{q}}\cap F_{j_{q+1}}\neq\emptyset$ for all $q\in[p-1]$ .

Our chain decomposition is a partition of the fractional $C^{*}$ -clients given in the next theorem, which is our main structural characterization of the extreme points of $LP_{iter}$ . (Recall that a client $j$ is fractional if all facilities in $F_{j}$ are fractional; we denote the fractional clients in $C^{*}$ by $C^{*}_{<1}$ .)

Theorem 4.2 (Chain Decomposition).

Suppose $LP_{iter}$ satisfies all Basic Invariants. Let $\bar{y}$ be an extreme point of $LP_{iter}$ such that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight. Then there exists a partition of $C^{*}_{<1}$ into at most $3r$ chains, along with a set of at most $2r$ violating clients (clients that are not in any chain.)

The proof relies on analyzing the extreme points of a set-cover-like polytope with $r$ side constraints; we defer it to §8 and proceed instead to define the new iterative operation.

4.2. Iterative Operation for Chain Decompositions

Composing 3.1 and 4.2, consider an optimal extreme point $\bar{y}$ of $LP_{iter}$ , and a chain decomposition. We show that if the number of fractional variables in $\bar{y}$ is sufficiently large, there exists a useful structure in the chain decomposition, which we call a candidate configuration.

Definition 4.3 (Candidate Configuration).

Let $\bar{y}$ be an optimal extreme point of $LP_{iter}$ . A candidate configuration is a pair of two clients $(j,j^{\prime})\subset C^{*}_{<1}$ such that:

(1)

$F_{j}\cap F_{j^{\prime}}\neq\emptyset$
(2)

$\ell_{j^{\prime}}\leq\ell_{j}-1$
(3)

Every facility in $F_{j}$ and $F_{j^{\prime}}$ is in at exactly two $F$ -balls for clients in $C^{*}$
(4)

$\lvert F_{j}\rvert=2$ and $\lvert F_{j^{\prime}}\rvert=2$

Lemma 4.4.

Suppose $LP_{iter}$ satisfies all Basic Invariants, and let $\bar{y}$ be an optimal extreme point of $LP_{iter}$ such that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight. If $\lvert F_{<1}\rvert\geq 15r$ , then there exist a candidate configuration in $C^{*}_{<1}$ .

Our new iterative operation is easy to state: Find a candidate configuration $(j,j^{\prime})$ and move $j$ from $C^{*}$ to $C_{full}$ .

Input: An optimal extreme point

\bar{y}

LP_{iter}

s.t. there exists an candidate configuration

Result: Modify

LP_{iter}

2Let

(j,j^{\prime})\subset C^{*}_{<1}

be any candidate configuration.

3 Move

j

from

C^{*}

C_{full}

Algorithm 3 ConfigReRoute

The first two properties of candidate configurations are used to re-route $j$ to $j^{\prime}$ . Observe a key difference between ReRoute and ConfigReRoute: In the former, if a client $j^{\prime}$ is moved from $C^{*}$ to $C_{full}$ , there exists a client $j\in C^{*}$ such that $F_{j^{\prime}}\cap F_{j}\neq\emptyset$ and $\ell_{j}\leq\ell_{j^{\prime}}-2$ . Thus we re-route $j^{\prime}$ to a client at least two radius levels smaller. This corresponds to “quarter ball-chasing.” On the other hand, in ConfigReRoute, we only guarantee a client of at least one radius level smaller, which corresponds to “half ball-chasing.” This raises the worry that if all re-routings are due to ConfigReRoute, any potential gains by ReRoute are not realized in the worst case. However we show that, roughly speaking, the last two properties of candidate configurations guarantee that the more expensive re-routings of ConfigReRoute happen at most half the time. The main properties of ConfigReRoute appear in the next theorem (whose proof is in Appendix B.2).

Theorem 4.5.

ConfigReRoute is a polynomial-time algorithm that maintains all Basic Invariants and weakly decreases $Opt(LP_{iter})$ .

Again, we defer the analysis of the re-routing cost of ConfigReRoute to §5.3, where we analyze the interactions between ConfigReRoute and ReRoute, and present our final pseudo-approximation algorithm next.

5. Pseudo-Approximation Algorithm for GKM

The pseudo-approximation algorithm for GKM combine the iterative rounding algorithm IterativeRound from §3 with the re-routing operation ConfigReRoute from §4 to construct a solution to $LP_{iter}$ .

Theorem 5.1 (Pseudo-Approximation Algorithm for GKM).

There exists an polynomial time randomized algorithm PseudoApproximation that takes as input an instance $\mathcal{I}$ of GKM and outputs a feasible solution to $LP_{1}$ with at most $O(r)$ fractional facilities and expected cost at most $6.387\cdot Opt(\mathcal{I})$ .

Input:

LP_{iter}

satisfying all Basic Invariants

Result: Modifies

LP_{iter}

and outputs an optimal extreme point of

LP_{iter}

1 repeat

2 Run IterativeRound to obtain an optimal extreme point

\bar{y}

LP_{iter}

3 if there exists a candidate configuration then

4 Run ConfigReRoute

5 else

6 Output

\bar{y}

and Terminate

7until Termination

Algorithm 4 PseudoApproximation

There are two main components to analyzing PseudoApproximation. First, we show that the output extreme point has $O(r)$ fractional variables. Second, we bound the re-routing cost. The first part follows directly by combining the analogous theorems for IterativeRound and ConfigReRoute. We defer its proof to Appendix B.

Theorem 5.2.

PseudoApproximation is a polynomial time algorithm that maintains all Basic Invariants, weakly decreases $Opt(LP_{iter})$ , and outputs an optimal extreme point of $LP_{iter}$ with at most $15r$ fractional variables.

5.1. Analysis of Re-Routing Cost

We now bound the re-routing cost by analyzing how $C^{*}$ evolves throughout PseudoApproximation. This is one of the main technical contributions of our paper, and it is where our richer $C^{*}$ -set and relaxed re-routing rules are used. [KLS18] prove an analogous result about the re-routing cost of their algorithm. In the language of the following theorem statement, they show that $\alpha=\frac{\tau+1}{\tau-1}$ for the case $\beta=1$ . We improve on this factor by analyzing the interactions between ReRoute and ConfigReRoute. Interestingly, analyzing each of ReRoute and ConfigReRoute separately would not yield any improvement over [KLS18] in the worst case, even with our richer set $C^{*}$ . It is only by using the properties of candidate configurations and analyzing sequences of calls to ReRoute and ConfigReRoute that we get an improvement.

Theorem 5.3 (Re-Routing Cost).

Upon termination of PseudoApproximation, let $S\subset F$ be a set of open facilities and $\beta\geq 1$ such that $d(j,S)\leq\beta L(\ell_{j})$ for all $j\in C^{*}$ . Then for all $j\in C_{full}\cup C^{*}$ , $d(j,S)\leq(2+\alpha)L(\ell_{j})$ , where $\alpha=\max(\beta,1+\frac{1+\beta}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ .

We will need the following discretized version of the triangle inequality.

Proposition 5.4.

Let $j,j^{\prime}\in C$ such that $F_{j}$ and $F_{j^{\prime}}$ intersect. Then $d(j,j^{\prime})\leq L(\ell_{j})+L(\ell_{j^{\prime}})$ .

Proof.

Let $i\in F_{j}\cap F_{j^{\prime}}$ . Then using the triangle inequality we can bound:

\displaystyle d(j,j^{\prime})\leq d(j,i)+d(i,j^{\prime})\leq d^{\prime}(j,i)+d^{\prime}(i,j^{\prime})\leq L(\ell_{j})+L(\ell_{j^{\prime}}).\qed

The next lemma analyzes the life-cycle of a client that enters $C^{*}$ at some point in PseudoApproximation. Our improvement over [KLS18] comes from this lemma.

Lemma 5.5.

Upon termination of PseudoApproximation, let $S\subset F$ be a set of open facilities and $\beta\geq 1$ such that $d(j,S)\leq\beta L(\ell_{j})$ for all $j\in C^{*}$ . Suppose client $j$ is added to $C^{*}$ at radius level $\ell$ during PseudoApproximation (it may be removed later.) Then upon termination of PseudoApproximation, we have $d(j,S)\leq\alpha L(\ell)$ , where $\alpha=\max(\beta,1+\frac{1+\beta}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ .

Proof.

Consider a client $j$ added to $C^{*}$ with radius level $\ell$ . If $j$ remains in $C^{*}$ until termination, the lemma holds for $j$ because $\alpha\geq\beta$ . Thus, consider the case where $j$ is later removed from $C^{*}$ in PseudoApproximation. Note that the only two operations that can possibly cause this removal are ReRoute and ConfigReRoute. We prove the lemma by induction on $\ell=-1,0,\dots$ . If $\ell=-1$ , then $j$ remains in $C^{*}$ until termination because it has the smallest possible radius level and both ReRoute and ConfigReRoute remove a client from $C^{*}$ only if there exists another client with strictly smaller radius level.

Similarly, if $\ell=0$ , we note that ReRoute removes a client from $C^{*}$ only if there exists another client with radius level at least two smaller, which is not possible for $j$ . Thus, if $j$ does not remain in $C^{*}$ until termination, there must exist some $j^{\prime}$ that is later added to $C^{*}$ with radius level at most $\ell-1=-1$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ . We know that $j^{\prime}$ remains in $C^{*}$ until termination since it is of the lowest radius level. Thus:

d(j,S)\leq d(j,j^{\prime})+d(j^{\prime},S)\leq L(0)+L(-1)+\beta L(-1)=L(0).

Now consider $\ell>0$ where $j$ can possibly be removed from $C^{*}$ by either ReRoute or ConfigReRoute. In the first case, $j$ is removed by ReRoute, so there exists $j^{\prime}$ that is added to $C^{*}$ such that $\ell_{j^{\prime}}\leq\ell-2$ and $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ . Applying the inductive hypothesis to $j^{\prime}$ , we can bound:

d(j,S)\leq d(j,j^{\prime})+d(j^{\prime},S)\leq L(\ell)+L(\ell-2)+\alpha L(\ell-2)\leq(1+\frac{1+\alpha}{\tau^{2}})L(\ell).

It is easy to verify by routine calculations that $1+\frac{1+\alpha}{\tau^{2}}\leq\alpha$ given that $\alpha\geq\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ .

For our final case, suppose $j$ is removed by ConfigReRoute. Then there exists $j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ and $\ell_{j^{\prime}}\leq\ell-1$ . Further, $\lvert F_{j^{\prime}}\rvert=2$ . If $j^{\prime}$ remains in $C^{*}$ until termination, then:

d(j,S)\leq d(j,j^{\prime})\leq L(\ell)+L(\ell-1)+\beta L(\ell-1)\leq(1+\frac{1+\beta}{\tau})L(\ell).

Otherwise, $j^{\prime}$ is removed by ReRoute at an even later time because some $j^{\prime\prime}$ is added to $C^{*}$ such that $\ell_{j^{\prime\prime}}\leq\ell_{j^{\prime}}-2$ and $F_{j^{\prime}}\cap F_{j^{\prime\prime}}\neq\emptyset$ . Applying the inductive hypothesis to $j^{\prime\prime}$ , we can bound:

d(j,S)\leq d(j,j^{\prime})+d(j^{\prime},j^{\prime\prime})+d(j^{\prime\prime},S)\leq(1+\frac{2}{\tau}+\frac{1+\alpha}{\tau^{3}})L(\ell).

where $\alpha\geq\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ implies $1+\frac{2}{\tau}+\frac{1+\alpha}{\tau^{3}}\leq\alpha$ .

Now, we consider the case where $j^{\prime}$ is later removed by ConfigReRoute. To analyze this case, consider when $j$ was removed by ConfigReRoute. At this time, we have $\lvert F_{j^{\prime}}\rvert=2$ by definition of Candidate Configuration. Because $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ , consider any facility $i\in F_{j}\cap F_{j^{\prime}}$ . When $j$ is removed from $C^{*}$ by ConfigReRoute, we have that $i$ is in exactly two $F$ -balls for clients in $C^{*}$ , exactly $F_{j}$ and $F_{j^{\prime}}$ . However, after removing $j$ from $C^{*}$ , $i$ is only in one $F$ -ball for clients in $C^{*}$ - namely $F_{j^{\prime}}$ .

Later, at the time $j^{\prime}$ is removed by ConfigReRoute, it must be the case that $\lvert F_{j^{\prime}}\rvert=2$ still, so $F_{j^{\prime}}$ is unchanged between the time that $j$ is removed and the time that $j^{\prime}$ is removed. Thus the facility $i$ that was previously in $F_{j}\cap F_{j^{\prime}}$ must still be present in $F_{j^{\prime}}$ . Then this facility must be in exactly two $F$ -balls for clients in $C^{*}$ , one of which is $j^{\prime}$ . It must be the case that the other $F$ -ball containing $i$ , say $F_{j^{\prime\prime}}$ , was added to $C^{*}$ between the removal of $j$ and $j^{\prime}$ .

Note that the only operation that adds clients to $C^{*}$ is ReRoute, so we consider the time between the removal of $j$ and $j^{\prime}$ when $j^{\prime\prime}$ is added to $C^{*}$ . Refer to Figure 2. At this time, we have $j^{\prime}\in C^{*}$ , and $F_{j^{\prime}}\cap F_{j^{\prime\prime}}\neq\emptyset$ because of the facility $i$ . Then it must be the case that $j^{\prime\prime}$ has strictly smaller radius level than $j^{\prime}$ , so $\ell_{j^{\prime\prime}}\leq\ell_{j^{\prime}}-1\leq\ell-2$ . To conclude the proof, we note that $F_{j}\cap F_{j^{\prime\prime}}\neq\emptyset$ due to the facility $i$ , and apply the inductive hypothesis to $j^{\prime\prime}$ :

d(j,S)\leq d(j,j^{\prime\prime})+d(j^{\prime\prime},S)\leq(1+\frac{1+\alpha}{\tau^{2}})L(\ell,)

which is at most $\alpha L(\ell)$ . ∎

Now using the above lemma, we can prove Theorem 5.3.

Proof of Theorem 5.3.

Consider any client $j$ that is in $C_{full}\cup C^{*}$ upon termination of PseudoApproximation. It must be the case that $\textsc{ReRoute}(j)$ was called at least once during PseudoApproximation. Consider the time of the last such call to $\textsc{ReRoute}(j)$ . If $j$ is added to $C^{*}$ at this time, note that its radius level from now until termination remains unchanged, so applying Lemma 5.5 gives that $d(j,S)\leq\alpha L(\ell_{j})$ , as required. Otherwise, if $j$ is not added to $C^{*}$ at this time, then there must exist some $j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ and $\ell_{j^{\prime}}\leq\ell_{j}$ . Then applying Lemma 5.5 to $j^{\prime}$ , we have:

\displaystyle d(j,S)\leq d(j,j^{\prime})+d(j^{\prime},S)\leq L(\ell_{j})+L(\ell_{j^{\prime}})+\alpha L(\ell_{j^{\prime}})\leq(2+\alpha)L(\ell_{j}).\qed

5.2. Putting it all Together: Pseudo-Approximation for GKM

In this section, we prove Theorem 5.1. In particular, we use the output of PseudoApproximation to construct a setting of the $x$ -variables with the desired properties.

Proof of Theorem 5.1.

Given as input an instance $\mathcal{I}$ of GKM, our algorithm is first to run the algorithm guaranteed by Lemma 2.8 to construct $LP_{iter}$ from $LP_{1}$ such that $\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}Opt(\mathcal{I})$ and $LP_{iter}$ satisfies all Basic Invariants. Note that we will choose $\tau>1$ later to optimize our final approximation ratio. Then we run PseudoApproximation on $LP_{iter}$ , which satisfies all Basic Invariants, so by Theorem 5.2, PseudoApproximation outputs in polynomial time $LP_{iter}$ along with an optimal solution $\bar{y}$ with $O(r)$ fractional variables.

Given $\bar{y}$ , we define a setting $\bar{x}$ for the $x$ -variables: for all $j\in C_{part}$ , connect $j$ to all facilities in $F_{j}$ by setting $\bar{x}_{ij}=\bar{y}_{i}$ for all $i\in F_{j}$ . For all $j\in C^{*}$ , we have $\bar{y}(F_{j})=1$ , so connect $j$ to all facilities in $F_{j}$ . Finally, to connect every $j\in C_{full}$ to one unit of open facilities, we use the following modification of Theorem 5.3:

Proposition 5.6.

When PseudoApproximation terminates, for all $j\in C_{full}\cup C^{*}$ , there exists one unit of open facilities with respect to $\bar{y}$ within distance $(2+\alpha)L(\ell_{j})$ of $j$ , where $\alpha=\max(1,1+\frac{2}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ .

The proof of the above proposition is analogous to that of Theorem 5.3 in the case $\beta=1$ , so we omit it. To see this, note that for all $j\in C^{*}$ , we have $\bar{y}(F_{j})=1$ . This implies that each $j\in C^{*}$ has one unit of fractional facility within distance $L(\ell_{j})$ . Following an analogous inductive argument as in Lemma 5.5 gives the desired result.

By routine calculations, it is easy to see that $\alpha=\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ for all $\tau>1$ . Now, for all $j\in C_{full}$ , we connect $j$ to all facilities in $B_{j}$ . We want to connect $j$ to one unit of open facilities, so to find the remaining $1-\bar{y}(B_{j})$ units, we connect $j$ to an arbitrary $1-\bar{y}(B_{j})$ units of open facilities within distance $(2+\alpha)L(\ell_{j})$ of $j$ , whose existence is guaranteed by Proposition 5.6. This completes the description of $\bar{x}$ .

It is easy to verify that $(\bar{x},\bar{y})$ is feasible for $LP_{1}$ , because $\bar{y}$ satisfies all knapsack constraints, and every client’s contribution to the coverage constraints in $LP_{1}$ is exactly its contribution in $LP_{iter}$ . Thus it remains to bound the cost of this solution. We claim that $LP_{1}(\bar{x},\bar{y})\leq(2+\alpha)Opt(LP_{iter})$ , because each client in $C_{part}$ and $C^{*}$ contributes the same amount to $LP_{1}$ and $LP_{iter}$ (up to discretization), and each client $j\in C_{full}$ has connection cost at most $2+\alpha$ times its contribution to $LP_{iter}$ .

In conclusion, the expect cost of the solution $(\bar{x},\bar{y})$ to $LP_{1}$ is at most:

(2+\alpha)\,\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}\left(2+\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}\right)\;Opt(\mathcal{I}).

Choosing $\tau>1$ to minimize $\frac{\tau-1}{\log_{e}\tau}(2+\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ gives $\tau=2.046$ and $\frac{\tau-1}{\log_{e}\tau}(2+\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})=6.387$ . ∎

6. From Pseudo-Approximation to Approximation

In this section, we leverage the pseudo-approximation algorithm for GKM defined in Section 5 to construct improved approximation algorithms for two special cases of GKM: knapsack median and $k$ -median with outliers.

Recall that knapsack median is an instance of GKM with a single arbitrary knapsack constraint and a single coverage constraint that states we must serve every client in $C$ . Similarly, $k$ -median with outiers is an instance of GKM with a single knapsack constraint, stating that we can open at most $k$ facilities, and a single coverage constraint, stating that we must serve at least $m$ clients. Note that both special cases have $r=2$ .

Our main results for these two problems are given by the following theorems:

Theorem 6.1 (Approximation Algorithm for Knapsack Median).

There exists a polynomial time randomized algorithm that takes as input an instance $\mathcal{I}$ of knapsack median and parameter $\epsilon\in(0,1/2)$ and in time $n^{O(1/\epsilon)}$ , outputs a feasible solution to $\mathcal{I}$ of expected cost at most $(6.387+\epsilon)Opt(\mathcal{I})$ .

Theorem 6.2 (Approximation Algorithm for $k$ -Median with Outliers).

There exists a polynomial time randomized algorithm that takes as input an instance $\mathcal{I}$ of $k$ -median with outliers and parameter $\epsilon\in(0,1/2)$ and in time $n^{O(1/\epsilon)}$ , outputs a feasible solution to $\mathcal{I}$ of expected cost at most $(6.994+\epsilon)Opt(\mathcal{I})$ .

6.1. Overview

The centerpiece for both of our approximation algorithms is the pseudo-approximation algorithm PseudoApproximation for GKM. For both of these special cases, we can obtain via PseudoApproximation a solution to $LP_{iter}$ with only $O(1)$ fractional facilities and bounded re-routing cost. Now our remaining task is to turn this solution with $O(1)$ fractional facilities into an integral one.

Unfortunately, the basic LP relaxations for knapsack median and $k$ -median with outliers have an unbounded integrality gap. To overcome this bad integrality gap, we use known sparsification tools to pre-process the given instance. Our main technical contribution in this section is a post-processing algorithm that rounds the final $O(1)$ fractional variables at a small cost increase for the special cases of knapsack median and $k$ -median with outliers.

Thus our approximation algorithms for knapsack median and $k$ -median with outliers consist of a known pre-processing algorithm [KLS18], our new pseudo-approximation algorithm, and our new post-processing algorithm.

6.2. Approximation Algorithm for Knapsack Median

To illustrate our approach, we give the pre- and post-processing algorithms for knapsack median, which is the simpler of the two variants. Our pre-processing instance modifies the data of the input instance, so the next definition is useful to specify the input instance and pre-processed instance.

Definition 6.3 (Instance of Knapsack Median).

An instance of knapsack median is of the form $\mathcal{I}=(F,C,d,w,B)$ , where $F$ is a set of facilities, $C$ is a set of clients, $d$ is a metric on $F\cup C$ , $w\in\mathbb{R}_{+}^{F}$ is the weights of the facilities, and $B\geq 0$ is the budget.

Note that for knapsack median, the two side constraints in $LP_{iter}$ are the knapsack constraint, $\sum\limits_{i\in F}w_{i}y_{i}\leq B$ , and the coverage constraint, $\sum\limits_{j\in C_{part}}y(F_{j})\geq\lvert C\rvert-\lvert C_{full}\cup C^{*}\rvert$ .

We utilize the same pre-processing as in [KLS18]. Roughly speaking, given a knapsack median instance $\mathcal{I}$ , we first handle the expensive parts of the optimal solution using enumeration. Once we pre-open the facilities and decide what clients should be assigned there for this expensive part of the instance, we are left with a sub-instance, say $\mathcal{I}^{\prime}$ . In $\mathcal{I}^{\prime}$ , our goal is to open some more facilities to serve the remaining clients.

Roughly speaking, $\mathcal{I}^{\prime}$ is the “cheap” part of the input instance. Thus, when we construct $LP_{iter}$ for this sub-instance, we initialize additional invariants which we call our Extra Invariants.

To state our Extra Invariants, we need to define the $P$ -ball centered at $p$ with radius $r$ for any $P\subset F\cup C$ , $p\in F\cup C$ , and $r\geq 0$ , which is the set:

B_{P}(p,r)=\{q\in P\mid d(p,q)\leq r\}.

Definition 6.4 (Extra Invariants for Knapsack Median).

Let $\rho,\delta\in(0,1/2)$ , $U\geq 0$ , $S_{0}\subset F$ , and $R\in\mathbb{R}_{+}^{C}$ be given. Then we call the following properties our Extra Invariants:

(1)

For all $i\in S_{0}$ , there exists a dummy client $j(i)\in C^{*}$ such that $F_{j(i)}=\{i^{\prime}\in F\mid i^{\prime}\text{ collocated with $i$}\}$ with radius level $\ell_{j(i)}=-1$ . We let $C_{0}\subset C$ be the collection of these dummy clients.
(2)

For all $i\in F$ that is not collocated with some $i^{\prime}\in S_{0}$ , we have $\sum\limits_{j\mid i\in F_{j}}d(i,j)\leq 2\rho U$
(3)

For all $j\in C$ , we have $L(\ell_{j})\leq\tau R_{j}$
(4)

For all $j\in C$ and $r\leq R_{j}$ , we have: $\lvert B_{C}(j,\delta r)\rvert r\leq\rho U.$

Extra Invariant 6.4(1) guarantees that we open the set of guessed facilities $S_{0}$ in our final solution. Then for all non-guessed facilities, so the set $F\setminus S_{0}$ , Extra Invariant 6.4(2) captures the idea that these facilities are “cheap.” Taken together, Extra Invariants 6.4(3) and 6.4(4) capture the idea that all remaining clients are “cheap.”

The next theorem describes our pre-processing algorithm for knapsack median, which is a convenient re-packaging of the pre-processing used in [KLS18]. The theorem essentially states that given $\rho,\delta$ , and $U$ , we can efficiently guess a set $C\setminus C^{\prime}$ of clients and $S_{0}$ of facilities that capture the expensive part of the input instance $\mathcal{I}$ . Then when we construct $LP_{iter}$ for the cheap sub-instance, we can obtain the Extra Invariants, and the cost of extending a solution of the sub-instance to the whole instance is bounded with respect to $U$ , which one should imagine is $Opt(\mathcal{I})$ .

Theorem 6.5 (Pre-Processing for Knapsack Median).

Let $\mathcal{I}=(F,C,d,w,B)$ be an instance of knapsack median. Then, given as input instance $\mathcal{I}$ , parameters $\rho,\delta\in(0,1/2)$ , and an upper bound $U$ on $Opt(\mathcal{I})$ , there exists an algorithm that runs in time $n^{O(1/\rho)}$ and outputs $n^{O(1/\rho)}$ -many sub-instances of the form $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,w,B)$ along with the data for $LP_{iter}$ on $\mathcal{I}^{\prime}$ , a set of facilities $S_{0}\subset F$ , and a vector $R\in\mathbb{R}_{+}^{C^{\prime}}$ such that:

(1)

$LP_{iter}$ satisfies all Basic and Extra Invariants
(2)

$\frac{\log_{e}\tau}{\tau-1}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C\setminus C^{\prime}}d(j,S_{0})\leq U$

The proof is implicit in [KLS18]. For completeness, we prove the analogous theorem for $k$ -median with outliers, 6.13, in §F.

We will show that if $LP_{iter}$ satisfies the Extra Invariants for knapsack median, then we can give a post-processing algorithm with bounded cost. It is not difficult to see that PseudoApproximation maintains the Extra Invariants as well, so we use the Extra Invariants in our post-processing.

Proposition 6.6.

PseudoApproximation maintains all Extra Invariants for knapsack median.

Now we move on to describing our post-processing algorithm. Suppose we run the pre-processing algorithm guaranteed by Theorem 6.5 to obtain $LP_{iter}$ satisfying all Basic- and Extra Invariants. Then we can run PseudoApproximation to obtain an optimal extreme point of $LP_{iter}$ with $O(1)$ fractional facilities, and $LP_{iter}$ still satisfies all Basic- and Extra Invariants.

It turns out, to round these $O(1)$ fractional facilities, it suffices to open one facility in each $F$ -ball for clients in $C^{*}$ . Then we can apply Theorem 5.3 to bound the re-routing cost. The main difficulty in this approach is that we must also round some fractional facilities down to zero to maintain the knapsack constraint.

Note that closing a facility can incur an unbounded multiplicative cost in the objective. To see this, consider a fractional facility $i$ that is almost open, so $\bar{y}_{i}\sim 1$ . Then suppose there exists $j\in C_{full}$ such that $i\in B_{j}$ and $d(j,i)\ll L(\ell_{j})$ . Then $j$ ’s contribution to the objective of $LP_{iter}$ is $\sim d(j,i)$ . However, if we close $i$ , then $j$ ’s contribution increases to $L(\ell_{j})\gg d(j,i)$ .

To bound the cost of closing facilities, we use the Extra Invariants. In particular, we use the next technical lemma, which states that if we want to close down a facility $i$ , and every client $j$ that connects to $i$ has a back-up facility to go to within distance $O(1)L(\ell_{j})$ , then closing $i$ incurs only a small increase in cost. For proof, see §C.

Lemma 6.7.

Suppose $LP_{iter}$ satisfies all Basic and Extra Invariants for knapsack median, and let $S\subset F$ and $\alpha\geq 1$ . Further, consider a facility $i\notin S\cup S_{0}$ and set of clients $C^{\prime}\subset C$ such that for all $j\in C^{\prime}$ , we have $i\in F_{j}$ and there exists some facility in $S$ within distance $\alpha L(\ell_{j})$ of $j$ . Then $\sum\limits_{j\in C^{\prime}}d(j,S)=O(\frac{\rho}{\delta})U$ .

By the next proposition, rounding a facility up to one does not incur any cost increase, because every client must be fully connected.

Proposition 6.8.

Upon termination of PseudoApproximation on a knapsack median instance, we have $C_{part}=\emptyset$ .

Proof.

We observe that the single coverage constraint in $LP_{iter}$ for a knapsack median instance is of the form:

\sum\limits_{j\in C_{part}}y(F_{j})\geq\lvert C\rvert-\lvert C_{full}\cup C^{*}\rvert=\lvert C_{part}\rvert

, where we use the fact that $C_{part}$ , $C_{full}$ , and $C^{*}$ partition $C$ due to Basic Invariant 2.4(1). Combining this with the constraint $y(F_{j})\leq 1$ for all $j\in C_{part}$ gives that $y(F_{j})=1$ for all $j\in C_{part}$ for any feasible solution to $LP_{iter}$ . By assumption, no $C_{part}$ -constraint is tight upon termination of PseudoApproximation, so the proposition follows. ∎

To summarize, the goal of our post-processing algorithm is to find an integral setting of the $O(1)$ fractional facilities in the output of PseudoApproximation such that the knapsack constraint is satisfied and there is an open facility in each $F$ -ball for clients in $C^{*}$ .

Lemma 6.9.

Upon termination of PseudoApproximation on a knapsack median instance, let $\bar{y}$ be the outputted extreme point of $LP_{iter}$ , and suppose $LP_{iter}$ satisfies all Basic- and Extra Invariants. Then there exists an integral setting of the fractional facilities such that the knapsack constraint is satisfied, there is an open facility in each $F$ -ball for clients in $C^{*}$ , and every facility in $S_{0}$ is open.

Proof.

Consider the following LP:

\mathcal{LP}=\min\limits_{y}\{\sum\limits_{i\in F}w_{i}y_{i}\mid y(F_{j})=1\quad\forall j\in C^{*},\,y_{i}=1\quad\forall i\in F_{=1},\,y\in[0,1]^{F}\}

The first constraint states that we want one open facility in each $F$ -ball for clients in $C^{*}$ , and the second states that our solution should agree on the integral facilities in $\bar{y}$ .

Because $LP_{iter}$ satisfies all Basic Invariants, the intersection graph of $\{F_{j}\mid j\in C^{*}\}$ is bipartite by Proposition 2.6. Then the feasible region of $\mathcal{LP}$ is a face of the intersection of two partition matroids (each side of the biparitition of $\{F_{j}\mid j\in C^{*}\}$ defines one parititon matroid), and thus $\mathcal{LP}$ is integral.

To conclude the proof, we observe that $\bar{y}$ is feasible for $\mathcal{LP}$ , so $Opt(\mathcal{LP})\leq\sum\limits_{i\in F}w_{i}\bar{y}_{i}\leq B$ . Thus there exists an integral setting of facilities that opens one facility in each $F$ -bal for all clients in $C^{*}$ , agrees with all of $\bar{y}$ ’s integral facilities, and has total weight at most $B$ . Finally, by Extra Invariant 6.4(1), $C_{0}\subset C^{*}$ , so we open every facility in $S_{0}$ . ∎

Thus, in light of Lemma 6.9, our post-processing algorithm is to enumerate over all integral settings of the fractional variables to find one that satisfies the knapsack constraint, opens one facility in each $F$ -ball for clients in $C^{*}$ , and opens $S_{0}$ . Combining our post-processing algorithm with PseudoApproximation gives the following theorem.

Theorem 6.10.

There exists a polynomial time algorithm that takes as input $LP_{iter}$ for knapsack median instance $\mathcal{I}$ satisfying all Basic- and Extra Invariants and outputs a feasible solution to $\mathcal{I}$ such that the solution opens all facilities in $S_{0}$ and has cost at most $(2+\alpha)Opt(LP_{iter})+O(\rho/\delta)U$ , where $\alpha=\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ .

Proof.

Our algorithm is to run PseudoApproximation on $LP_{iter}$ and then run our post-processing algorithm, which is to enumerate over all integral settings of the fractional variables, and then output the feasible solution that opens $S_{0}$ of lowest cost (if such a solution exists.)

Let $\bar{y}$ be the optimal extreme point of $LP_{iter}$ output by PseudoApproximation, which has $O(1)$ fractional variables by 5.2. Because $\bar{y}$ has $O(1)$ fractional variables, our post-processing algorithm is clearly efficient, which establishes the runtime of our overall algorithm.

Note that upon termination, $LP_{iter}$ still satisfies all Basic- and Extra Invariants. Then by Lemma 6.9, there exists an integral setting of the fractional variables that is feasible, opens $S_{0}$ , and opens a facility in each $F$ -ball for clients in $C^{*}$ . It suffices to bound the cost of this solution. Let $S\subset F$ denote the facilities opened by this integral solution, so $d(j,S)\leq L(\ell_{j})$ for all $j\in C^{*}$ . Applying Lemma 5.3 with $\beta=1$ , we obtain that $d(j,S)\leq(2+\alpha)L(\ell_{j})$ for all $j\in C_{full}\cup C^{*}$ , where $\alpha=\max(1,1+\frac{2}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ . It is easy to check that $\alpha=\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ for all $\tau>1$ .

To bound the cost of the solution $S$ relative to $Opt(LP_{iter})$ , we must bound the cost of closing the $O(1)$ -many facilities in $F_{<1}\setminus S$ . We recall that by Proposition 6.8, we have $C=C_{full}\cup C^{*}$ , so all clients must be fully connected in $LP_{iter}$ .

First we consider any client $j\in C$ that is not supported on any facility in $F_{<1}\setminus S$ . Such a client is not affected by closing $F_{<1}\setminus S$ , so if $F_{j}$ is empty, then $d(j,S)\leq(2+\alpha)L(\ell_{j})$ , which is at most $(2+\alpha)$ times $j$ ’s contribution to $LP_{iter}$ . Otherwise, $F_{j}$ contains an integral facility in $S$ to connect to, so $d(j,S)$ is at most $j$ ’s contribution to $LP_{iter}$ .

It remains to consider the clients whose $F$ -balls contain a facility in $F_{<1}\setminus S$ . Because there are only $O(1)$ -many facilities in $F_{<1}\setminus S$ , it suffices to show that for each $i\in F_{<1}\setminus S$ , the additive cost of connecting all clients supported on $i$ is at most $O(\rho/\delta)U$ . Here we apply Lemma 6.7 to the set of clients $C^{\prime}=\{j\in C\mid i\in F_{j}\}$ to obtain $\sum\limits_{j\in C^{\prime}}d(j,S)=O(\rho/\delta)U$ .

To summarize, the cost of connecting the clients not supported on $F_{<1}\setminus S$ is at most $(2+\alpha)Opt(LP_{iter})$ , and the cost of the remaining clients is $O(\rho/\delta)U$ , as required. ∎

Now our complete approximation for knapsack median follows from combining the pre-processing with the above theorem and tuning parameters.

Proof of Theorem 6.1.

Let $\epsilon^{\prime}>0$ . We will later choose $\epsilon^{\prime}$ with respect to the given $\epsilon$ to obtain the desired approximation ratio and runtime. First, we choose parameters $\rho,\delta\in(0,1/2)$ and $U\geq 0$ for our pre-processing algorithm guaranteed by Theorem 6.5. We take $\rho={\epsilon^{\prime}}^{2}$ and $\delta=\epsilon^{\prime}$ . We require that $U$ is an upper bound on $Opt(\mathcal{I})$ . Using a standard binary search idea, we can guess $Opt(\mathcal{I})$ up to a multiplicative $(1+\epsilon^{\prime})$ -factor in time $n^{O(1/\epsilon^{\prime})}$ , so we guess $U$ such that $Opt(\mathcal{I})\leq U\leq(1+\epsilon^{\prime})Opt(\mathcal{I})$ .

With these choices of parameters, we run the algorithm guaranteed by Theorem 6.5 to obtain $n^{O(1/\epsilon^{\prime})}$ many sub-instances such that one such sub-instance is of the form $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,w,B)$ , where $LP_{iter}$ for $\mathcal{I}^{\prime}$ satisfies all Basic- and Extra Invariants, and we have:

(1)

\frac{\log_{e}\tau}{\tau-1}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\epsilon^{\prime}}{1+\epsilon^{\prime}}\sum\limits_{j\in C\setminus C^{\prime}}d(j,S_{0})\leq U

Then for each sub-instance output by the pre-processing, we run the algorithm guaranteed by Theorem 6.10 to obtain a solution to each sub-instance. Finally, out of these solutions, we output the one that is feasible for the whole instance with smallest cost. This completes the description of our approximation algorithm for knapsack median. The runtime is $n^{O(1/\epsilon^{\prime})}$ , so it remains to bound the cost of the output solution and to choose the parameters $\epsilon^{\prime}$ and $\tau$ .

To bound the cost, it suffices to consider the solution output on the instance $\mathcal{I}^{\prime}$ where $LP_{iter}$ satisfies all Basic- and Extra Invariants and Equation 1. By running the algorithm guaranteed by Theorem 6.10 on this $LP_{iter}$ , we obtain a feasible solution $S\subset F$ to $\mathcal{I}^{\prime}$ such that $S_{0}\subset S$ , and the cost of connecting $C^{\prime}$ to $S$ is at most $(2+\alpha)Opt(LP_{iter})+O(\epsilon^{\prime})U$ , where $\alpha=\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}$ . To extend this solution on the sub-instance to a solution on the whole instance $\mathcal{I}$ , we must connect $C\setminus C^{\prime}$ to $S$ . Because $S_{0}\subset S$ , applying Equation 1 allows us to upper bound the expected cost of connecting $C$ to $S$ by:

(2+\alpha)\mathbb{E}[Opt(LP_{iter})]+O(\epsilon^{\prime})U+\sum\limits_{j\in C\setminus C^{\prime}}d(j,S_{0})\leq(2+\alpha)\frac{\tau-1}{\log_{e}\tau}\frac{1+\epsilon^{\prime}}{1-\epsilon^{\prime}}U+O(\epsilon^{\prime})U.

Now choosing $\tau>1$ to minimize $(2+\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})\frac{\tau-1}{\log_{e}\tau}$ gives $\tau=2.046$ and $\frac{\tau-1}{\log_{e}\tau}(2+\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})=6.387$ . Thus the expected cost of this solution is at most $6.387\frac{1+\epsilon^{\prime}}{1-\epsilon^{\prime}}U+O(\epsilon^{\prime})U$ , where $U\leq(1+\epsilon^{\prime})Opt(\mathcal{I})$ . Finally, by routine calculations, we can choose $\epsilon^{\prime}=\theta(\epsilon)$ so that expected cost is at most $(6.387+\epsilon)Opt(\mathcal{I})$ , as required. Note that the runtime of our algorithm is $n^{O(1/\epsilon^{\prime})}=n^{O(1/\epsilon)}$ . ∎

6.3. Approximation Algorithm for $k$ -Median with Outliers

Our approximation algorithm for $k$ -median with outliers follows the same general steps as our algorithm for knapsack median. We state the analogous Extra Invariants for $k$ -median with outliers and pre-processing algorithm here. The only differences between the Extra Invariants for knapsack median and $k$ -median with outliers is in the final Extra Invariant.

Definition 6.11 (Instance of $k$ -Median with Outliers).

An instance of $k$ -median with outliers is of the form $\mathcal{I}=(F,C,d,k,m)$ , where $F$ is a set of facilities, $C$ is a set of clients, $d$ is a metric on $F\cup C$ , $k$ is the number of facilities to open, and $m$ is the number of clients to serve.

Note that for $k$ -median with outliers, the two side constraints in $LP_{iter}$ are the knapsack constraint, $y(F)\leq k$ , and the coverage constraint, $\sum\limits_{j\in C_{part}}y(F_{j})\geq m-\lvert C_{full}\cup C^{*}\rvert$ .

Definition 6.12 (Extra Invariants for $k$ -Median with Outliers).

Let $\rho,\delta\in(0,1/2)$ , $U\geq 0$ , $S_{0}\subset F$ , and $R\in\mathbb{R}_{+}^{C}$ be given. Then we call the following properties our Extra Invariants:

(1)

For all $i\in S_{0}$ , there exists a dummy client $j(i)\in C^{*}$ such that $F_{j(i)}=\{i^{\prime}\in F\mid i^{\prime}\text{ colocated with $i$}\}$ with radius level $\ell_{j(i)}=-1$ . We let $C_{0}\subset C$ be the collection of these dummy clients.
(2)

For all $i\in F$ that is not collocated with some $i^{\prime}\in S_{0}$ , we have $\sum\limits_{j\mid i\in F_{j}}d(i,j)\leq\rho(1+\delta)U$
(3)

For all $j\in C$ , we have $L(\ell_{j})\leq\tau R_{j}$

(4)

For every $t>0$ and $p\in F\cup C$ , we have:

\lvert\{j\in B_{C}(p,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}\rvert\leq\frac{\rho(1+3\delta/4)}{1-\delta/4}\frac{U}{t}.

Again, the pre-processing of [KLS18] gives the next theorem. For proof, see §F.

Theorem 6.13 (Pre-Processing for $k$ -Median with Outliers).

Let $\mathcal{I}=(F,C,d,k,m)$ be an instance of $k$ -median with outliers with optimal solution $(S^{*},C^{*})$ . Then, given as input instance $\mathcal{I}$ , parameters $\rho,\delta\in(0,1/2)$ , and an upper bound $U$ on $Opt(\mathcal{I})$ , there exists an algorithm that runs in time $n^{O(1/\rho)}$ and outputs $n^{O(1/\rho)}$ -many sub-instances of the form $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,k,m^{\prime}=m-\lvert C^{*}\setminus C^{\prime}\rvert)$ along with the data for $LP_{iter}$ on $\mathcal{I}^{\prime}$ , a set of facilities $S_{0}\subset F$ , and a vector $R\in\mathbb{R}_{+}^{C^{\prime}}$ such that:

(1)

$LP_{iter}^{\prime}$ satisfies all Basic and Extra Invariants
(2)

$\frac{\log_{e}\tau}{(\tau-1)(1+\delta/2)}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})\leq U$

It is easy to check that PseudoApproximation maintains all Extra Invariants for $k$ -median with outliers as well, and we have an analogous technical lemma to bound the cost of closing facilities. For proof of the lemma, see §C.

Proposition 6.14.

PseudoApproximation maintains all Extra Invariants for $k$ -median with outliers.

Lemma 6.15.

Suppose $LP_{iter}$ satisfies all Basic and Extra Invariants for $k$ -median with outliers, and let $S\subset F$ and $\alpha\geq 1$ . Further, consider a facility $i\notin S\cup S_{0}$ and set of clients $C^{\prime}\subset C$ such that for all $j\in C^{\prime}$ , we have $i\in F_{j}$ and there exists some facility in $S$ within distance $\alpha L(\ell_{j})$ of $j$ . Then $\sum\limits_{j\in C^{\prime}}d(j,S)=O(\frac{\rho}{\delta})U$ .

Now we focus on the main difference between the two algorithms: the post-processing. In particular, the coverage constraint of $k$ -median with outliers introduces two difficulties in rounding the final $O(1)$ fractional facilities: (a) we are no longer guaranteed that $C_{part}=\emptyset$ , and (b) we must satisfy the coverage constraint.

The difficulty with (a) is that now rounding a facility up to one can also incur an unbounded multiplicative cost in the objective. To see this, consider a fractional facility $i$ that is almost closed, so $\bar{y}_{i}\sim 0$ . Consider rounding this facility up to one. Then for a client $j\in C_{part}$ that fractionally connects to $i$ in the solution $\bar{y}$ , if we fully connect $j$ to $i$ , this costs $d(j,i)\gg d(j,i)\bar{y}_{i}$ . The solution here is to use Extra Invariant 6.12(2) to bound the additive cost of opening facilities.

The more troublesome issue is (b). Note that the same approach that we used to prove that there exists a good integral setting of the $O(1)$ fractional variables in 6.9 does not work here because putting the coverage constraint in the objective of the LP could result in a solution covering the same client multiple times. Our solution to (b) is a more sophisticated post-processing algorithm that first re-routes clients in $C_{part}$ . After re-routing, we carefully pick facilities to open that do not double-cover any remaining $C_{part}$ -clients. We defer the details of our post-processing algorithm for §7. For now, we present the guarantees of our pseudo-approximation combined with post-processing:

Theorem 6.16.

There exists a polynomial time algorithm that takes as input $LP_{iter}$ for $k$ -median with outliers instance $\mathcal{I}$ satisfying all Basic- and Extra Invariants and outputs a feasible set of facilities $S\supset S_{0}$ such that the cost of connecting $m$ clients to $S$ is at most $(2+\alpha)Opt(LP_{iter})+O(\rho/\delta)U$ , where $\alpha=\max(3+2\tau^{-c},1+\frac{4+2\tau^{-c}}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ for any constant $c\in\mathbb{N}$ .

Combining the pre-processing of 6.13 with 6.16 and tuning parameters gives our final approximation algorithm for $k$ -median with outliers. The proof of 6.2 is analogous to 6.1, so we defer it to §C.

7. Post-Processing for $k$ -Median with Outliers

In this section we develop the post-processing algorithm for $k$ -median with outliers that is guaranteed by 6.16. The structure of our algorithm is recursive. First, we give a procedure to round at least one fractional facility or serve at least one client. Then we recurse on the remaining instance until we obtain an integral solution.

7.1. Computing Partial Solutions

In this section, we show how to round at least one fractional facility or serve at least one client. We interpret this algorithm as computing a partial solution to the given $k$ -median with outliers instance.

The main idea of this algorithm is to re-route clients in $C_{part}$ . In particular, we maintain a subset $\bar{C}\subset C^{*}$ such that for every client in $\bar{C}$ , we guarantee to open an integral facility in their $F$ -ball. We also maintain a subset $C_{covered}\subset C_{part}$ of $C_{part}$ -clients that we re-route; that is, we guarantee to serve them even if no open facility is in their $F$ -balls. Crucially, every client in $C_{part}\setminus C_{covered}$ is supported on at most one $F$ -ball for clients in $\bar{C}$ . Thus, we do not have to worry about double-covering those clients when we round the facilities in $F(\bar{C})$ .

The partial solution we output consists of one open facility for each client in $\bar{C}$ (along with the facilities that happen to be integral already), and we serve the clients in $C_{full}$ , $C^{*}$ , $C_{covered}$ , and the $C_{part}$ -clients supported on our open facilities. See Algorithm 5 (ComputePartial) for the formal algorithm to compute partial solutions. Note that $c\in\mathbb{N}$ is a parameter of ComputePartial.

Input:

LP_{iter}

and

\bar{y}

output by PseudoApproximation on a

k

-median with outliers instance such that

C^{*}_{<1}\not\subset C_{0}

Output: Output a partial solution

S\subset F

and modify

LP_{iter}

1 Initialize

C_{covered}=\emptyset

and

\bar{C}=\{j\in C^{*}_{<1}\mid F_{j}\cap S_{0}=\emptyset\}

2 for all clients $\bar{j}\in\bar{C}$ in increasing order of $\ell_{\bar{j}}$ do

3 For all

j\in\bar{C}

such that

j\neq\bar{j}

and

F_{j}\cap F_{\bar{j}}\neq\emptyset

, remove

j

from

\bar{C}

4 while there exists a client $j^{\prime}\in C_{part}\setminus C_{covered}$ such that $F_{j^{\prime}}$ intersects $F_{\bar{j}}$ and $F_{j}$ for some other $j\in\bar{C}$ do

5 if $\ell_{j^{\prime}}\leq\ell_{\bar{j}}-c$ then

6 Remove

j

from

\bar{C}

8 else

9 Add

j^{\prime}

C_{covered}

11For all

i\in F

, we define

w_{i}=\lvert\{j\in C_{part}\setminus C_{covered}\mid i\in F_{j}\}\rvert

12 Construct the set

\bar{S}\subset F(\bar{C})

by greedily picking the facility

i\in F_{j}

with largest

w_{i}

for each

j\in\bar{C}

13 Define the set

C_{part}(\bar{S})=\{j\in C_{part}\mid F_{j}\cap\bar{S}\neq\emptyset\}

14 Define the partial set of facilities by

S=(\bar{S}\cup F_{=1})\setminus S_{0}

and the partial set of clients by

C^{\prime}=C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0})

15 Update

LP_{iter}

by deleting

S

and

F(\bar{C})

from

F

, deleting all clients in

C^{\prime}

from

C

, decrementing

k

\lvert S\rvert

, and decrementing

m

\lvert C^{\prime}\rvert

16 Output the partial solution

S

Algorithm 5 ComputePartial

We note that to define a solution for $k$ -median with outliers, it suffices to specify the set of open facilities $S$ , because we can choose the clients to serve as the $m$ closest clients to $S$ . Thus when we output a partial solution, we only output the set of open facilities.

We summarize the performance of ComputePartial with the next theorem, which we prove in §7.4. In the next section, we use 7.1 to define our recursive post-processing algorithm.

Theorem 7.1.

Let $LP_{iter}$ and $\bar{y}$ be the input to ComputePartial. Then let $S$ be the partial solution output by ComputePartial and $LP_{iter}^{1}$ be the modified LP. Then $LP_{iter}^{1}$ satisfies all Basic- and Extra Invariants and we have:

Opt(LP_{iter}^{1})+\frac{1}{2+\alpha}\sum\limits_{j\in C^{\prime}}d(j,S\cup S_{0})\leq Opt(LP_{iter})+O(\frac{\rho}{\delta})U,

where $\alpha=\max(3+2\tau^{-c},1+\frac{4+2\tau^{-c}}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ .

We want $LP_{iter}^{1}$ to satisfy all Basic- and Extra Invariants so we can continue recursing on $LP_{iter}^{1}$ . The second property of 7.1 allows us to extend a solution computed on $LP_{iter}^{1}$ with the partial solution $S$ .

7.2. Recursive Post-Processing Algorithm

To complete our post-processing algorithm, we recursively apply ComputePartial until we have an integral solution.

The main idea is that we run PseudoApproximation to obtain an optimal extreme point with $O(1)$ fractional variables. Then using this setting of the $y$ -variables, we construct a partial solution consisting of some open facilities along with the clients that they serve. However, if there are still fractional facilities remaining, we recurse on $LP_{iter}$ (after the modifications by ComputePartial.) Our final solution consists of the union of all recursively computed partial solutions. See Algorithm 6 (OutliersPostProcess.)

Input:

LP_{iter}

for a

k

-median with outliers instance satisfying all Basic- and Extra Invariants

Output: Solution

S\subset F

1 Run PseudoApproximation to obtain extreme point

\bar{y}

LP_{iter}

2 if $\bar{y}$ is integral then

3 Output the solution

F_{=1}

5else if $C^{*}_{<1}\subset C_{0}$ then

6 By 7.2,

\bar{y}

has at most two fractional variables, say

a,b\in F

7 Without loss of generality, we may assume

\lvert\{j\in C_{part}\mid a\in F_{j}\}\rvert\geq\lvert\{j\in C_{part}\mid b\in F_{j}\}\rvert

8 Output the solution

F_{=1}\cup\{a\}

10else

11 Run ComputePartial to obtain partial solution

S^{\prime}

and update

LP_{iter}

12 Run ComputePartial on updated

LP_{iter}

to obtain partial solution

S^{\prime\prime}

13 Output solution

S^{\prime}\cup S^{\prime\prime}

Algorithm 6 OutliersPostProcess

For proof of the next lemma, see §D.

Lemma 7.2.

If $C^{*}_{<1}\subset C_{0}$ , then $\bar{y}$ has at most two fractional variables.

7.3. Analysis of OutliersPostProcess

In this section, we show that OutliersPostProcess satisfies the guarantees of 6.16. All missing proofs can be found in §D. We let $\bar{y}$ be the output of PseudoApproximation in the first line of OutliersPostProcess and $\alpha=\max(3+2\tau^{-c},1+\frac{4+2\tau^{-c}}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ . First, we handle the base cases of OutliersPostProcess.

The easiest base is when $\bar{y}$ is integral. Here, we do not need the Extra Invariants - all we need is that every client $j\in C_{full}\cup C^{*}$ has an open facility within distance $(2+\alpha)L(\ell_{j})$ .

Lemma 7.3.

If $\bar{y}$ is integral, then the output solution $F_{=1}$ is feasible, contains $S_{0}$ , and connecting $m$ clients costs at most $(2+\alpha)Opt(LP_{iter})$ .

Now, in the other base case, $\bar{y}$ is not integral, but we know that $C^{*}_{<1}\subset C_{0}$ . By 7.2, we may assume without loss of generality $\bar{y}$ has exactly two fractional facilities, say $a,b\in F_{<1}$ . Further, we may assume that the $k$ -constraint is tight, because opening more facilities can only improve the objective value. It follows that $\bar{y}_{a}+\bar{y}_{b}=1$ . For the sake of analysis, we define the sets $C_{part}(a)=\{j\in C_{part}\mid a\in F_{j}\}$ and $C_{part}(b)=\{j\in C_{part}\mid b\in F_{j}\}$ where we may assume $\lvert C_{part}(a)\rvert\geq\lvert C_{part}(b)\rvert$ .

It remains to bound the cost of the solution $F_{=1}\cup\{a\}$ . One should imagine that we obtain this solution from $\bar{y}$ by closing down the facility $b$ and opening up $a$ . First we handle the degenerate case where $a\in S_{0}$ . In this case, $a$ and $b$ must both be co-located copies of a facility in $S_{0}$ , so opening one or the other does not change the cost. Thus, we may assume without loss of generality that $a\notin S_{0}$ . Here, we need to Extra Invariants to bound the cost of opening $a$ and closing $b$

Lemma 7.4.

Suppose $a\notin S_{0}$ . Then the output solution $F_{=1}\cup\{a\}$ is feasible, contains $S_{0}$ , and connecting $m$ clients costs at most $(2+\alpha)Opt(LP_{iter})+O(\frac{\rho}{\delta})U.$

This completes the analysis of the base cases. To handle the recursive step, we apply 7.1.

Proof of Theorem 6.16.

First, we show that OutliersPostProcess terminates in polynomial time. It suffices to show that the number of recursive calls to ComputePartial is polynomial. To see this, note that for each recursive call, it must be the case that $C^{*}_{<1}\not\subset C_{0}$ . In particular, there exists some non-dummy client in $C^{*}\setminus C_{0}$ . Thus, we are guaranteed to remove at least once client from $C$ in each recursive call.

Now it remains to show that the output solution is feasible, contains $S_{0}$ , and connecting $m$ clients costs at most $(2+\alpha)Opt(LP_{iter})+O(\frac{\rho}{\delta})U$ . Let $\bar{y}$ be the extreme point computed by PseudoApproximation in the first line of OutliersPostProcess. If $\bar{y}$ is integral or $C^{*}_{<1}\subset C_{0}$ , then we are done by the above lemmas.

Then it remains to consider the case where $C^{*}_{<1}\not\subset C_{0}$ . Let $LP_{iter}$ denote the input to OutliersPostProcess and $LP_{iter}^{1}$ the updated $LP_{iter}$ at the end of ComputePartial as in the statement of Theorem 7.1. We note that Theorem 7.1 implies that $LP_{iter}^{1}$ satisfies all Basic and Extra Invariants, so $LP_{iter}^{1}$ is a valid input to OutliersPostProcess. Then we may assume inductively that the recursive call to OutliersPostProcess on $LP_{iter}^{1}$ outputs a feasible solution $S^{\prime\prime}$ to $LP_{iter}^{1}$ such that $S_{0}\subset S^{\prime\prime}$ and the cost of connecting $m^{1}$ clients from $C^{1}$ to $S^{\prime\prime}$ is at most $(2+\alpha)Opt(LP_{iter}^{1})+O(\frac{\rho}{\delta})U$ .

Further, let $S^{\prime}$ be the partial solution output by ComputePartial on $LP_{iter}$ . Now we combine the solutions $S^{\prime}$ and $S^{\prime\prime}$ to obtain the solution output by OutliersPostProcess. First, we check that $S^{\prime}\cup S^{\prime\prime}$ is is feasible. This follows, because $\lvert S^{\prime\prime}\rvert\leq k^{1}\leq k-\lvert S^{\prime}\rvert$ by definition of ComputePartial. Also, $S_{0}\subset S^{\prime\prime}$ by the inductive hypothesis.

It remains to bound the cost of connecting $m$ clients to $S^{\prime}\cup S^{\prime\prime}$ . Consider serving the $m^{1}$ closest clients in $C^{1}$ with $S^{\prime\prime}$ and $C^{\prime}$ with $S^{\prime}\cup S_{0}$ . Because $m_{1}=m-\lvert C^{\prime}\rvert$ , this is enough clients. Connecting the $m^{1}$ closest clients in $C^{1}$ to $S^{\prime\prime}$ costs at most $(2+\alpha)Opt(LP_{iter}^{1})+O(\frac{\rho}{\delta})U$ by the inductive hypothesis. Now we use the guarantee of 7.1, which we recall is:

Opt(LP_{iter}^{1})+\frac{1}{2+\alpha}\sum\limits_{j\in C^{\prime}}d(j,S^{\prime}\cup S_{0})\leq Opt(LP_{iter})+O(\frac{\rho}{\delta})U.

Thus, the total connection cost is at most:

(2+\alpha)Opt(LP_{iter}^{1})+O(\frac{\rho}{\delta})U+\sum\limits_{j\in C^{\prime}}d(j,S^{\prime}\cup S_{0})\leq(2+\alpha)Opt(LP_{iter})+O(\frac{\rho}{\delta})U.

Note that the additive $O(\frac{\rho}{\delta})U$ terms which we accrue in each recursive call are still $O(\frac{\rho}{\delta})U$ overall. This is because we keep recursing on a subset of the remaining fractional facilities – which is always $O(1)$ – and we open/close each fractional facility at most once over all recursive calls. Thus, we can bound the additive cost of each opening/closing by $O(\frac{\rho}{\delta})U$ . ∎

7.4. Proof of 7.1

For all missing proofs in this section, see §D. We let $LP_{iter}$ and $\bar{y}$ denote the input to ComputePartial and $LP_{iter}^{1}$ the updated LP that is output at the end ComputePartial. We begin with three properties of ComputePartial that will be useful throughout our analysis.

The first is immediate by definition of ComputePartial.

Proposition 7.5.

Upon termination of ComputePartial, the set family $\{F_{j}\mid j\in\bar{C}\}$ is disjoint, and every client $j\in C_{part}\setminus C_{covered}$ , $F_{j}$ intersects at most one $F$ -ball for clients in $\bar{C}$ .

Proposition 7.6.

ComputePartial initializes and maintains the invariants that $C_{covered}\subset C_{part}$ and $\bar{C}\subset\{j\in C^{*}_{<1}\mid F_{j}\cap S_{0}=\emptyset\}$

Proof.

We initialize $C_{covered}=\emptyset$ and only add clients from $C_{part}$ to $C_{covered}$ . Similarly, we initialize $\bar{C}=\{j\in C^{*}_{<1}\mid F_{j}\cap S_{0}=\emptyset\}$ and only remove clients from $\bar{C}$ . ∎

Lemma 7.7.

Every $\bar{j}\in\bar{C}$ that is reached by the For loop remains in $\bar{C}$ until termination.

Now we are ready to prove both properties of 7.1. It is not difficult to see that $LP_{iter}^{1}$ satisfies all Basic- and Extra Invariants by construction.

Lemma 7.8.

$LP_{iter}^{1}$ satisfies all Basic- and Extra Invariants.

Now it remains to show $Opt(LP_{iter}^{1})+\frac{1}{2+\alpha}\sum\limits_{j\in C^{\prime}}d(j,S\cup S_{0})\leq Opt(LP_{iter})+O(\frac{\rho}{\delta})U$ . To do this, we partition $C$ into $C^{1}$ and $C^{\prime}=C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0})$ . For each client in $C^{1}$ , we show that its contribution to the objective of $LP_{iter}^{1}$ is at most its contribution to $LP_{iter}$ . Then for each client $j\in C^{\prime}$ , either $d(j,S\cup S_{0})$ is at most $2+\alpha$ times $j$ ’s contribution to $Opt(LP_{iter})$ or we can charge $j$ ’s connection cost to an additive $O(\frac{\rho}{\delta})U$ term.

First, we focus on $C^{1}$ . For these clients, it suffices to show that $\bar{y}$ (restricted to $F^{1}$ ) is feasible for $LP_{iter}^{1}$ . This is because for all $j\in C^{1}$ , either $j\in C_{part}^{1}\subset C_{part}$ or $j\in C_{0}$ . The clients in $C_{0}$ contribute zero to the cost of $LP_{iter}^{1}$ and $LP_{iter}$ . This is because both $LP_{iter}$ and $LP_{iter}^{1}$ satisfy Extra Invariant 6.12(1), so every dummy client $j(i)\in C_{0}$ is co-located with one unit of open facility corresponding to $i\in S_{0}$ .

Thus it remains to consider the clients $j\in C_{part}^{1}$ . We recall that $C_{part}^{1}\subset C_{part}$ and $F_{j}^{1}\subset F_{j}$ for all $j\in C^{1}$ , so each $j\in C_{part}^{1}$ costs less in $LP_{iter}^{1}$ than in $LP_{iter}$ .

To complete the cost analysis of $C^{1}$ , we go back to prove feasibility. The main difficulty is showing that the coverage constraint is still satisfied. Recall that we construct $\bar{S}$ by greedily opening the facility in each $F$ -ball for clients in $\bar{C}$ that covers the most $C_{part}\setminus C_{covered}$ -clients. 7.5 ensures that this greedy choice is well-defined (because $\{F_{j}\mid j\in\bar{C}\}$ is disjoint), and that we do not double-cover any $C_{part}\setminus C_{covered}$ -clients.

Then by definition of greedy, we show that our partial solution covers more clients than the fractional facilities we delete. This proposition is the key to showing that the coverage constraint is still satisfied.

Proposition 7.9.

Upon termination of ComputePartial, we have $\lvert C_{part}(\bar{S})\rvert\geq\sum\limits_{j\in\bar{C}}\sum\limits_{i\in F_{j}}w_{i}\bar{y}_{i}$ .

Proof.

For each $j\in\bar{C}$ , let $i(j)\in\bar{S}$ be the unique open facility in $F_{j}$ . By definition of $C_{part}(\bar{S})$ , for all $j\in C_{part}(\bar{S})$ we have $F_{j}\cap\bar{S}\neq\emptyset$ . Further, by Proposition 7.5, $F_{j}$ intersects exactly one $F$ -ball among clients in $\bar{C}$ , so each $i(j)$ for $j\in\bar{C}$ covers a unique set of clients. This implies the equality:

\lvert C_{part}(\bar{S})\rvert=\sum\limits_{j\in\bar{C}}w_{i(j)}.

Combining this equality with the facts that for all $j\in\bar{C}$ , we have $\bar{y}(F_{j})=1$ and $w_{i(j)}\geq w_{i}$ for all $i\in F_{j}$ gives the desired inequality:

\lvert C_{part}(\bar{S})\rvert=\sum\limits_{j\in\bar{C}}w_{i(j)}\geq\sum\limits_{j\in\bar{C}}w_{i(j)}\bar{y}(F_{j})\geq\sum\limits_{j\in\bar{C}}\sum\limits_{i\in F_{j}}w_{i}\bar{y}_{i}.

∎

Finally, we can complete the analysis of $C^{1}$ . It is easy to check that constraints except for the coverage constraint are satisfied. To handle the coverage constraint, we use 7.9.

Lemma 7.10.

$\bar{y}$ restricted to $F^{1}$ is feasible for $LP_{iter}^{1}$ .

For the final property, we must upper bound the connection cost of $C^{\prime}=C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0})$ to $S\cup S_{0}$ . We bound the connection cost in a few steps. First, we bound the re-routing cost of $C_{full}\cup C^{*}$ . Second, we show that every $j\in C_{covered}$ has a ”back-up” facility within $O(1)L(\ell_{j})$ . This is used to bound the additive cost of connecting $C_{covered}$ . Finally, for the clients in $C_{part}(\bar{S})$ , we guarantee to open a facility in their $F$ -balls, so we also bound their additive cost.

The next lemma and corollary allow us to bound the cost of $C_{full}\cup C^{*}$ .

Lemma 7.11.

Upon termination of ComputePartial, for all $j\in C^{*}$ , we have $d(j,S\cup S_{0})\leq(3+\frac{2}{\tau^{c}})L(\ell_{j})$ .

Proof.

There are a few cases to consider. First, if $j\in C^{*}_{=1}$ , then the lemma is trivial because by definition there exists an integral facility in $F_{j}$ . Otherwise, if $j\in C^{*}_{<1}$ , but $F_{j}\cap S_{0}\neq\emptyset$ , then again the lemma is trivial.

Thus it remains to consider clients $j\in\{j\in C^{*}_{<1}\mid F_{j}\cap S_{0}=\emptyset\}$ . We note that such a client $j$ is initially in $\bar{C}$ . If $j$ remains in $\bar{C}$ until termination, then we are done, because we are guaranteed to open a facility in $F_{j}$ for all $j\in\bar{C}$ (this is exactly the set $\bar{S}$ of facilities.)

For the final case, we suppose client $j$ is removed from $\bar{C}$ in the iteration where we consider $\bar{j}\in\bar{C}$ . Then either $F_{j}\cap F_{\bar{j}}\neq\emptyset$ or there exists $j^{\prime}\in C_{part}\setminus C_{covered}$ such that $F_{j^{\prime}}$ intersects both $F_{j}$ and $F_{\bar{j}}$ and $\ell_{j^{\prime}}\leq\ell_{\bar{j}}-c$ . Note that because the For loop considers clients in increasing order of radius level, we have $\ell_{j}\geq\ell_{\bar{j}}$

In the former case, by the Distinct Neighbors Property, we have $\ell_{j}\geq\ell_{\bar{j}}+1$ . Further, by 7.7, we know that $\bar{j}$ remains in $\bar{C}$ until termination, so $d(\bar{j},S)\leq L(\ell_{\bar{j}})$ . Then we can upper bound:

d(j,S)\leq d(j,\bar{j})+d(\bar{j},S)\leq L(\ell_{j})+L(\ell_{\bar{j}})+L(\ell_{\bar{j}})\leq(1+\frac{2}{\tau})L(\ell_{j})<3L(\ell_{j})

, where in the final inequality, we use the fact that $\tau>1$ .

In the latter case, we again have $d(\bar{j},S)\leq L(\ell_{\bar{j}})$ . Then we can bound the distance from $j$ to $S$ by first going from $j$ to $j^{\prime}$ , then from $j^{\prime}$ to $\bar{j}$ , and finally from $\bar{j}$ to $S$ , where $\ell_{j^{\prime}}\leq\ell_{\bar{j}}-c$ and $\ell_{\bar{j}}\leq\ell_{j}$ :

d(j,S)\leq d(j,j^{\prime})+d(j^{\prime},\bar{j})+d(\bar{j},S)\leq L(\ell_{j})+L(\ell_{j^{\prime}})+L(\ell_{j^{\prime}})+L(\ell_{\bar{j}})+L(\ell_{\bar{j}})\leq(3+\frac{2}{\tau^{c}})L(\ell_{j}).

∎

Corollary 7.12.

Upon termination of ComputePartial, for all $j\in C_{full}\cup C^{*}$ , we have $d(j,S\cup S_{0})\leq(2+\alpha)L(\ell_{j})$ , where $\alpha=\max(3+2\tau^{-c},1+\frac{4+2\tau^{-c}}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$

Proof.

Apply 5.3 with $\beta=3+2\tau^{-c}$ . ∎

Similarly, the next lemma ensures that $C_{covered}$ can also be re-routed.

Lemma 7.13.

Upon termination of ComputePartial, for all $j\in C_{covered}$ , we have $d(j,\bar{S})\leq(1+2\tau^{c})L(\ell_{j})$ .

Proof.

Because $j\in C_{covered}$ , it must be the case that we put $j$ in $C_{covered}$ in some iteration of the For loop where we consider client $\bar{j}\in\bar{C}$ . Thus we have $\ell_{j}\geq\ell_{\bar{j}}-c+1$ and $F_{j}\cap F_{\bar{j}}\neq\emptyset$ . Also, by Lemma 7.7, $\bar{j}$ remains in $\bar{C}$ until termination, so $d(\bar{j},\bar{S})\leq L(\ell_{\bar{j}})$ . Using these facts, we can bound:

d(j,S)\leq d(j,\bar{j})+d(\bar{j},S)\leq L(\ell_{j})+L(\ell_{\bar{j}})+L(\ell_{\bar{j}})\leq(1+2\tau^{c-1})L(\ell_{j}).

∎

Using the above lemmas, we are ready to bound the connection cost of our partial solution. Note that we bound the cost of serving $C^{\prime}$ not with $S$ , which is the partial solution we output, but rather with $S\cup S_{0}$ . Thus, we are implicitly assuming that $S_{0}$ will be opened by the later recursive calls.

We recall that $C^{\prime}=C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0})$ and $S\cup S_{0}=F_{=1}\cup\bar{S}\cup S_{0}$ .

To begin, we bound the cost of connecting $C_{part}(\bar{S})$ to $\bar{S}$ . By definition, every client $j\in C_{part}(\bar{S})$ has some facility from $\bar{S}$ in its $F$ -ball. Further, $\bar{S}\subset F(\bar{C})$ by definition, and $F(\bar{C})\cap S_{0}=\emptyset$ using Proposition 7.5. Thus we can apply Extra Invariant 6.12(2) to each facility in $\bar{S}$ . Further, we know that $\lvert\bar{S}\rvert=O(1)$ , because there are only $O(1)$ fractional facilities, and every facility in $\bar{S}$ is fractional by definition. Then we can bound:

\sum\limits_{j\in C_{part}(\bar{S})}d(j,\bar{S})\leq\sum\limits_{i\in\bar{S}}\sum\limits_{j\in C_{part}(\bar{S})\mid i\in F_{j}}d(j,i)\leq O(\rho)U

, where we apply Extra Invariant 6.12(2) for each $i\in\bar{S}$ . Thus, we have shown that the connection cost of $C_{part}(\bar{S})$ is at most an additive $O(\rho)U$ .

Now we move on to the rest of $C^{\prime}$ , that is - the clients in $C_{covered}$ , $C_{full}$ and $C^{*}\setminus C_{0}$ . For the clients in $C_{covered}$ , we know by Lemma 7.13 that every client $j\in C_{covered}$ has an open facility in $\bar{S}$ at distance at most $(1+2\tau^{c})L(\ell_{j})$ . Further, by definition of $C_{covered}$ , each $j\in C_{covered}$ is supported on a fractional facility not in $S_{0}$ . To see this, note that for all $j\in C_{covered}$ , there exists $\bar{j}\in\bar{C}$ such that $F_{j}\cap F_{\bar{j}}\neq\emptyset$ , and $F_{\bar{j}}\cap S_{0}=\emptyset$ .

Then we can use Lemma 6.15 for each fractional facility $i\notin S_{0}$ to bound the cost of connecting all $C_{covered}$ -clients supported on $i$ to $\bar{S}$ . For each such fractional facility $i\notin S_{0}$ , the connection cost of these clients is at most $O(\frac{\rho}{\delta})U$ . By summing over all $O(1)$ -many fractional $i\notin S_{0}$ , we connect all of $C_{covered}$ at additive cost at most $O(\frac{\rho}{\delta})U$ .

Finally, we handle the clients in $C_{full}\cup(C^{*}\setminus C_{0})$ . For convenience, we denote this set of clients by $\hat{C}$ . Using Lemma 7.11, every $j\in\hat{C}$ has a facility in $S\cup S_{0}$ at distance at most $(2+\alpha)L(\ell_{j})$ . There are a few cases to consider. For the first case, we consider the clients $\{j\in\hat{C}\mid F_{j}\setminus(S_{0}\cup F_{=1})\neq\emptyset\}$ , that is the set of $\hat{C}$ -clients whose $F$ -balls contain some cheap, fractional facility. By an analogous argument as for $C_{covered}$ , we can apply Lemma 6.15 to each fractional $i\notin S_{0}$ to bound the cost of connecting $\{j\in\hat{C}\mid F_{j}\setminus(S_{0}\cup F_{=1})\neq\emptyset\}$ to $S\cup S_{0}$ by $O(\frac{\rho}{\delta})U$ .

Then it remains to consider the clients $j\in\hat{C}$ such that $F_{j}\subset F_{=1}\cup S_{0}$ . For such a client $j$ , if $F_{j}=\emptyset$ , then $j$ ’s contribution to the objective of $LP_{iter}$ is exactly $L(\ell_{j})$ , so connecting $j$ to $S\cup S_{0}$ costs at most $(2+\alpha)$ times $j$ ’s contribution to the objective of $LP_{iter}$ .

Similarly, if $F_{j}$ happens to contain a facility in $F_{=1}\cup S_{0}$ , then we can simply connect $j$ to the closest such facility in $F_{j}$ . Note that $F_{=1}\cup S_{0}\subset S\cup S_{0}$ , so $j$ ’s connection cost in this case it as most its contribution to the objective in $LP_{iter}$ . In conclusion, the connection cost of $\{j\in\bar{C}\mid F_{j}\subset F_{=1}\cup S_{0}\}$ is at most $(2+\alpha)Opt(LP_{iter})$ . Summing the costs of these different groups of clients completes the proof.

8. Chain Decompositions of Extreme Points

In this section, we prove a more general version of Theorem 4.2 that applies to set-cover-like polytopes with $r$ side constraints. In particular, we consider polytopes of the form:

\mathcal{P}=\{y\in\mathbb{R}^{F}\mid y(F_{j})=1\quad\forall j\in C^{*},\,0\leq y\leq 1,\,Ay\leq b\}

, where $F$ , $C^{*}$ , and the $F$ -balls are defined as in $LP_{iter}$ , and $Ay\leq b$ is an arbitrary system of $r$ linear inequalities. We note that other than the $C_{part}$ -, and $C_{full}$ -constraints, $\mathcal{P}$ generalizes the feasible region of $LP_{iter}$ by taking the system $Ay\leq b$ to be the $r_{1}$ knapsack constraints and $r_{2}$ coverage constraints.

Although we phrase $\mathcal{P}$ in terms of facilities and clients, one can interpret $\mathcal{P}$ as a set cover polytope with side constraints as saying that we must choose elements in $F$ to cover each set in the family $\mathcal{F}=\{F_{j}\mid j\in C^{*}\}$ subject to the constraints $Ay\leq b$ . The main result of this section is that if $\mathcal{F}$ has bipartite intersection graph, there exists a chain decomposition of the extreme points of $\mathcal{P}$ .

Note that $\mathcal{P}$ can also also be interpreted as the intersection of two partition matroid polytopes or as a bipartite matching polytope both with $r$ side constraints. Our chain decomposition theorem shares some parallels with the work of Grandoni, Ravi, Singh, and Zenklusen, who studied the structure of bipartite matching polytopes with $r$ side constraints [GRSZ14].

Theorem 8.1 (General Chain Decomposition).

Suppose we have a polytope:

\mathcal{P}=\{y\in\mathbb{R}^{F}\mid y(F_{j})=1\quad\forall j\in C^{*},\,0\leq y\leq 1,\,Ay\leq b\},

such that $F$ is a finite ground set of elements (facilities), $\{F_{j}\subset F\mid j\in C^{*}\}$ is a set family indexed by $C^{*}$ (clients), and $Ay\leq b$ is a system of $r$ linear inequalities. Further, let $\bar{y}$ be an extreme point for $\mathcal{P}$ such that no non-negativity constraint is tight. If $\mathcal{F}$ has bipartite intersection graph, then $C^{*}_{<1}$ admits a partition into $3r$ chains along with at most $2r$ violating clients (clients that are not in any chain.)

We will use the following geometric fact about extreme points of polyhedra:

Fact 8.2.

Let $\mathcal{P}$ be a polyhedron in $\mathbb{R}^{n}$ . Then $x\in\mathbb{R}^{n}$ is an extreme point of $\mathcal{P}$ if and only if there exist $n$ linearly independent constraints of $\mathcal{P}$ that are tight at $x$ . We call such a set of constraints a basis for $x$ .

Theorem 4.2 follows almost immediately from Theorem 8.1.

Proof of Theorem 4.2.

Let $\bar{y}$ be an extreme point of $LP_{iter}$ such that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight, and suppose $LP_{iter}$ satisfies the Distinct Neighbors Now consider the polytope:

\mathcal{P}=\{y\in\mathbb{R}^{F}\mid y(F_{j})=1\quad\forall j\in C^{*},\,0\leq y\leq 1,\,Ay\leq b\},

, where $Ay\leq b$ consists of the $r_{1}$ knapsack constraints and $r_{2}$ coverage constraints of $LP_{iter}$ . We claim that $\bar{y}$ is an extreme point of $\mathcal{P}$ . To see this, note that $\bar{y}$ is an extreme point of $LP_{iter}$ , so fix any basis for $\bar{y}$ using tight constraints of $LP_{iter}$ . By assumption, this basis uses no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint. In particular, it only uses constraints of $LP_{iter}$ that are also present in $\mathcal{P}$ , so this basis certifies that $\bar{y}$ is an extreme point of $\mathcal{P}$ .

Further, by Proposition 2.6, the set family $\{F_{j}\mid j\in C^{*}\}$ has bipartite intersection graph. Then we can apply Theorem 8.1 to $\bar{y}$ and polytope $\mathcal{P}$ , which gives the desired result. ∎

8.1. Proof of Theorem 8.1

Now we go back to prove the more general chain decomposition theorem: Theorem 8.1. For all missing proofs, see §E. Throughout this section, let

\mathcal{P}=\{y\in\mathbb{R}^{F}\mid y(F_{j})=1\quad\forall j\in C^{*},\,0\leq y\leq 1,\,Ay\leq b\}

be a polytope satisfying the properties of Theorem 8.1. In particular, the intersection graph of $\mathcal{F}=\{F_{j}\mid j\in C^{*}\}$ is bipartite. Further, let $\bar{y}$ be an extreme point of $\mathcal{P}$ such that no non-negativity constraint is tight for $\bar{y}$ .

The crux of our proof is the next lemma, which allows us to bound the complexity of the intersection graph with respect to the number of side constraints $r$ . We prove the lemma by constructing an appropriate basis for $\bar{y}$ . The next definition is useful for constructing a basis.

Definition 8.3.

For any subset $C^{\prime}\subset C^{*}$ , let $dim(C^{\prime})$ denote the maximum number of linearly independent $C^{\prime}$ -constraints, so the constraint set $\{y(F_{j})=1\mid j\in C^{\prime}\}$ .

Lemma 8.4.

Let $\bar{y}$ be an extreme point of $\mathcal{P}$ such that no non-negativity constraint is tight. Then the number of fractional facilities in $\bar{y}$ satisfies $\lvert F_{<1}\rvert\leq dim(C^{*}_{<1})+r$ (recall that $r$ is the number of constraints of $Ay\leq b$ .)

Now, to find a chain decomposition of $C^{*}_{<1}$ , first we find the violating clients. We note that every $F$ -ball contains at least two facilities. The violating clients will be those clients whose $F$ -balls contain strictly more than two facilities, so we let $V=\{j\in C^{*}_{<1}\mid\lvert F_{j}\rvert>2\}$ be the set of violating clients. It remains to bound the size of $V$ , which follows from a standard counting argument.

Proposition 8.5.

$\lvert V\rvert\leq 2r$

Now that we have decided on the violating clients, it remains to partition $C^{*}_{<1}\setminus V$ into the desired chains. Importantly, for all $j\in C^{*}_{<1}\setminus V$ , we have $\lvert F_{j}\rvert=2$ . To find our chains, we consider the intersection graph of $C^{*}_{<1}$ , so the intersection graph of the set family $\{F_{j}\mid j\in C^{*}_{<1}\}$ . We let $G$ denote this graph. Note that $G$ is a subgraph of the standard intersection graph, so it is also bipartite by assumption.

We consider deleting the vertices $V$ from $G$ , which breaks $G$ into some connected components, say $H_{1},\dots,H_{\ell}$ . Let $V_{k}$ denote the vertex set of $H_{k}$ , so we have that $V_{k}\cup\dots\cup V_{k}$ partitions $C^{*}_{<1}\setminus V$ . Further, for all $k\in[\ell]$ , every $F$ -ball for clients in $V_{k}$ contains exactly two facilities, and every facility is in at most two $F$ -balls. Translating these statements into properties of the intersection graph, we can see that every vertex of $H_{k}$ has degree at most two, and $H_{k}$ is connected, so we can conclude that each $H_{k}$ is a path or even cycle (we eliminate the odd cycle case because the intersection graph is bipartite.)

Proposition 8.6.

Each $V_{k}$ is a chain.

To complete the proof, it remains to upper bound the number of chains. To do this, we first split the inequality given by Lemma 8.4 into the contribution by each $H_{k}$ . Importantly, we observe that the $F(V_{k})$ ’s are disjoint for all $k$ because the $V_{k}$ ’s correspond to distinct connected components. Then we can write:

\sum\limits_{k\in[\ell]}\lvert F(V_{k})\rvert\leq\lvert F_{<1}\rvert\leq dim(C^{*}_{<1})+r\leq\sum\limits_{k\in[\ell]}dim(V_{k})+dim(V)+r\leq\sum\limits_{k\in[\ell]}dim(V_{k})+3r.

The way to interpret this inequality is that each chain, $V_{k}$ , has a budget of $dim(V_{k})$ fractional facilities to use in its chain, but we have an extra $3r$ facilities to pay for any facilities beyond each $V_{k}$ ’s allocated budget. We will show that each chain uses at least one extra facility from this $3r$ surplus, which allows us to upper bound $\ell$ by $3r$ .

For the path components, we use the fact that every path has one more vertex than edge.

Proposition 8.7.

If $H_{k}$ is a path, then $\lvert F(V_{k})\rvert>dim(V_{k})$ .

For the even cycle components, we show that the corresponding $C^{*}$ -constraints are not linearly independent.

Proposition 8.8.

If $H_{k}$ is an even cycle, then $\lvert F(V_{k})\rvert>dim(V_{k})$ .

Applying the above two propositions, we complete the proof by bounding $\ell$ :

\sum\limits_{k\in[\ell]}dim(V_{k})+3r\geq\sum\limits_{k\in[\ell]}\lvert F(V_{k})\rvert\geq\sum\limits_{k\in[\ell]}dim(V_{k})+\ell\Rightarrow 3r\geq\ell

References

[AGK⁺04] Vijay Arya, Naveen Garg, Rohit Khandekar, Adam Meyerson, Kamesh Munagala, and Vinayaka Pandit. Local search heuristics for k-median and facility location problems. SIAM J. Comput., 33(3):544–562, 2004.
[BPR⁺17] Jaroslaw Byrka, Thomas W. Pensyl, Bartosz Rybicki, Aravind Srinivasan, and Khoa Trinh. An improved approximation for k-median and positive correlation in budgeted optimization. ACM Trans. Algorithms, 13(2):23:1–23:31, 2017.
[BPR⁺18] Jaroslaw Byrka, Thomas W. Pensyl, Bartosz Rybicki, Joachim Spoerhase, Aravind Srinivasan, and Khoa Trinh. An improved approximation algorithm for knapsack median using sparsification. Algorithmica, 80(4):1093–1114, 2018.
[Che08] Ke Chen. A constant factor approximation algorithm for $k$ -median clustering with outliers. In SODA, pages 826–835, 2008.
[CKMN01] Moses Charikar, Samir Khuller, David M. Mount, and Giri Narasimhan. Algorithms for facility location problems with outliers. In S. Rao Kosaraju, editor, Proceedings of the Twelfth Annual Symposium on Discrete Algorithms, January 7-9, 2001, Washington, DC, USA, pages 642–651. ACM/SIAM, 2001.
[FKRS19] Zachary Friggstad, Kamyar Khodamoradi, Mohsen Rezapour, and Mohammad R. Salavatipour. Approximation schemes for clustering with outliers. ACM Trans. Algorithms, 15(2):26:1–26:26, 2019.
[GLZ17] Sudipto Guha, Yi Li, and Qin Zhang. Distributed partial clustering. In Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures, pages 143–152. ACM, 2017.
[GMM⁺03] Sudipto Guha, Adam Meyerson, Nina Mishra, Rajeev Motwani, and Liadan O’Callaghan. Clustering data streams: Theory and practice. TKDE, 15(3):515–528, 2003.
[GRSZ14] Fabrizio Grandoni, R. Ravi, Mohit Singh, and Rico Zenklusen. New approaches to multi-objective optimization. Math. Program., 146(1-2):525–554, 2014.
[IQM⁺20] Sungjin Im, Mahshid Montazer Qaem, Benjamin Moseley, Xiaorui Sun, and Rudy Zhou. Fast noise removal for k-means clustering. In The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy], pages 456–466, 2020.
[JMS02] Kamal Jain, Mohammad Mahdian, and Amin Saberi. A new greedy approach for facility location problems. In Proceedings of the Thiry-fourth Annual ACM Symposium on Theory of Computing, STOC ’02, pages 731–740, New York, NY, USA, 2002. ACM.
[JV01] Kamal Jain and Vijay V. Vazirani. Approximation algorithms for metric facility location and k-median problems using the primal-dual schema and lagrangian relaxation. J. ACM, 48(2):274–296, 2001.
[KKN⁺15] Ravishankar Krishnaswamy, Amit Kumar, Viswanath Nagarajan, Yogish Sabharwal, and Barna Saha. Facility location with matroid or knapsack constraints. Math. Oper. Res., 40(2):446–459, 2015.
[KLS18] Ravishankar Krishnaswamy, Shi Li, and Sai Sandeep. Constant approximation for k-median and k-means with outliers via iterative rounding. In Ilias Diakonikolas, David Kempe, and Monika Henzinger, editors, Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, Los Angeles, CA, USA, June 25-29, 2018, pages 646–659. ACM, 2018.
[LG18] Shi Li and Xiangyu Guo. Distributed $k$ -clustering for data with heavy noise. In Advances in Neural Information Processing Systems, pages 7838–7846, 2018.
[LS16] Shi Li and Ola Svensson. Approximating k-median via pseudo-approximation. SIAM J. Comput., 45(2):530–547, 2016.
[MKC⁺15] Gustavo Malkomes, Matt Kusner, Wenlin Chen, Kilian Weinberger, and Benjamin Moseley. Fast distributed $k$ -center clustering with outliers on massive data. In NIPS, pages 1063–1071, 2015.

Appendix A Missing Proofs from §2: Construction of $LP_{iter}$

Proof of Proposition 2.1.

Let $\mathcal{I}$ be the given instance of GKM and $(x^{*},y^{*})$ be an optimal solution to $LP_{1}$ .

Observe that if $x^{*}_{ij}\in\{0,y^{*}_{i}\}$ for all $i\in F,j\in C$ , then we can define $F_{j}=\{i\in F\mid x^{*}_{ij}>0\}$ for all $j\in C$ . It is easy to verify in this case that $y^{*}$ is feasible for $LP_{2}$ and achieves the same objective value in $LP_{2}$ as $(x^{*},y^{*})$ achieves in $LP_{1}$ , which completes the proof.

Thus our goal is to duplicate facilities in $F$ and re-allocate the $x$ - and $y$ -values appropriately until $x^{*}_{ij}\in\{0,y^{*}_{i}\}$ for all $i\in F,j\in C$ . To prevent confusion, let $F$ denote the original set of facilities, and let $F^{\prime}$ denote the modified set of facilities, where make $n=\lvert C\rvert$ copies of each facility in $F$ , so for each $i\in F$ , we have copies $i_{1},\dots,i_{n}\in F^{\prime}$ .

Now we define $x^{\prime}\in[0,1]^{F^{\prime}\times C}$ and $y^{\prime}\in[0,1]^{F^{\prime}}$ with the desired properties. For each $i\in F$ , we assume without loss of generality that $0\leq x_{i1}\leq x_{i2}\leq\dots\leq x_{in}\leq y_{i}$ . We define $x^{\prime}_{i_{1}1},\dots,x^{\prime}_{i_{n}n}$ and $y^{\prime}_{i_{1}},\dots,y^{\prime}_{i_{n}}$ recursively:

Let $y^{\prime}_{i_{1}}=x_{i1}$ and $x^{\prime}_{i_{1}j}=x_{ij}$ for all $j\in[n]$ .

Now for $k>1$ , let $y^{\prime}_{i_{k}}=x_{ik}-x_{i(k-1)}$ and $x^{\prime}_{i_{k}j}=\begin{cases}0&,j<k\\ y^{\prime}_{i_{k}}&,j\geq k\end{cases}$ for all $j\in[n]$ .

It is easy to verify that $(x^{\prime},y^{\prime})$ is feasible for $LP_{1}$ (after duplicating facilities) and $x^{\prime}_{ij}\in\{0,y^{\prime}_{i}\}$ for all $i\in F^{\prime},j\in C$ , as required. Further, it is clear that this algorithm is polynomial time. ∎

Proof of Proposition 2.2.

If $d(p,q)=0$ , then the claim is trivial. Suppose $d(p,q)\geq 1$ . We can rewrite $d(p,q)=\tau^{\ell+f}$ for some $\ell\in\mathbb{N},f\in[0,1)$ . Also, for convenience we define $\beta=\log_{\tau}\alpha$ . Because $\log_{e}\alpha$ is uniformly distributed in $[0,\log_{e}\tau)$ , it follows that $\beta$ is uniformly distributed in $[0,1)$ .

It follows, $d(p,q)$ is rounded to $\alpha\tau^{\ell}=\tau^{\ell+\beta}$ exactly when $\beta\geq f$ , and otherwise $d(p,q)$ is rounded to $\tau^{\ell+\beta+1}$ when $\beta<f$ . Thus we compute:

	$\displaystyle\mathbb{E}[d^{\prime}(p,q)]$	$\displaystyle=\int_{\beta=0}^{f}\tau^{\ell+\beta+1}~{}d\beta+\int_{\beta=f}^{1}\tau^{\ell+\beta}~{}d\beta$
		$\displaystyle=\frac{1}{\log_{e}\tau}(\tau^{\ell+\beta+1}\rvert_{\beta=0}^{f}+\tau^{\ell+\beta}\rvert_{\beta=f}^{1})$
		$\displaystyle=\frac{1}{\log_{e}\tau}(\tau^{\ell+f+1}-\tau^{\ell+1}+\tau^{\ell+1}-\tau^{\ell+f})$
		$\displaystyle=\frac{1}{\log_{e}\tau}(\tau^{\ell+f+1}-\tau^{\ell+f})$
		$\displaystyle=\frac{\tau-1}{\log_{e}\tau}d(p,q).$

∎

Proof of Proposition 2.6.

Assume for contradiction that the intersection graph of $\mathcal{F}$ is not bipartite, so there exists an odd cycle, say $j_{1}\rightarrow\dots\rightarrow j_{\ell}\rightarrow j_{1}$ such that each vertex $j_{1},\dots,j_{\ell}\in C^{*}$ . Further, along each edge $j_{k}\rightarrow j_{k+1}$ , we have $F_{j_{k}}\cap F_{j_{k+1}}\neq\emptyset$ , so $\ell_{j_{k}}$ and $\ell_{j_{k+1}}$ differ by exactly one. In particular, the radius level can either increase by one or decrease by one along each edge.

Consider traversing the cycle starting from $j_{1}$ all the way to $j_{\ell}$ and then back to $j_{1}$ , and count the number of increases and decreases along the way. The number of increases and decreases must be equal when we return to $j_{1}$ , but this cycle has an odd number of edges, so the number of increases and decreases cannot be the same. This is a contradiction. ∎

Proof of Proposition 2.7.

Assume for contradiction that there exists a facility $i$ such that $i\in F_{j_{1}}\cap F_{j_{2}}\cap F_{j_{3}}$ for distinct clients $j_{1},j_{2},j_{3}\in C^{*}$ . Then the intersection graph of $C^{*}$ contains an odd cycle $j_{1}\rightarrow j_{2}\rightarrow j_{3}\rightarrow j_{1}$ . This contradicts the fact that the intersection graph is bipartite. ∎

Proof of Lemma 2.8.

Our algorithm is to first run the algorithm guaranteed by Lemma 2.1 to obtain $LP_{2}$ and the $F$ -balls such that $Opt(LP_{2})\leq Opt(\mathcal{I})$ . Then we follow the construction in §2- that is, we randomly discretize the distances to obtain $d^{\prime}$ , define the $F$ - and $B$ - balls and radius levels, and initialize $C_{part}=C$ , $C_{full}=\emptyset$ and $C^{*}=\emptyset$ . This completes the description of $LP_{iter}$ .

By Proposition 2.3, we have $\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}Opt(LP_{2})\leq\frac{\tau-1}{\log_{e}\tau}Opt(\mathcal{I})$ , as required. Finally, it is easy to check that $LP_{iter}$ satisfies all Basic Invariants. ∎

Appendix B Missing Proofs from §3 - 5: Analysis of PseudoApproximation

In this section, we present all missing proofs from the analyses of PseudoApproximation and its sub-routines IterativeRound and ConfigReRoute.

B.1. Analysis of IterativeRound

The goal of this section is to prove Theorem 3.1. First we show that IterativeRound maintains all Basic Invariants. It is easy to see that the first three Basic Invariants are maintained by IterativeRound, so we only prove the last two.

Lemma B.1 (Basic Invariant 2.4(4)).

IterativeRound maintains the invariant that $\ell_{j}\geq-1$ for all $j\in C$ .

Proof.

Consider any $j\in C$ . Suppose the invariant holds at the beginning of IterativeRound, so initially we have $\ell_{j}\geq-1$ . Note that a necessary condition for decreasing $\ell_{j}$ is that $\bar{y}(B_{j})\leq 1$ is tight at some iteration of IterativeRound, and in this case we decrease $\ell_{j}$ by one.

Suppose $\ell_{j}=-1$ . Then $L(\ell_{j}-1)=-1$ , so $B_{j}=\emptyset$ . Thus it cannot be the case that $\bar{y}(B_{j})\leq 1$ is tight. We conclude that for all $j\in C$ , we never decrease $\ell_{j}$ beyond negative one. ∎

Lemma B.2 (Basic Invariant 2.4(5): Distinct Neighbors Property).

IterativeRound maintains the Distinct Neighbors Property.

Proof.

It suffices to show that ReRoute maintains the Distinct Neighbors Property, because in IterativeRound, the only time $C^{*}$ is modified is when ReRoute is called. Thus we consider an arbitrary call to $\textsc{ReRoute}(j)$ . To prevent confusion, let $C^{*}$ denote the status of $C^{*}$ before the call to ReRoute. If we do not move $j$ from $C_{full}$ to $C^{*}$ , then the invariant is clearly maintained, so we may assume that we move $j$ from $C_{full}$ to $C^{*}$ .

Then for all $j^{\prime}\in C^{*}$ such that $F_{j}\cap F_{j^{\prime}}\neq\emptyset$ , we have $\ell_{j^{\prime}}\geq\ell_{j}+1$ . Finally, after adding $j$ to $C^{*}$ , we remove all such $j^{\prime}$ with $\ell_{j^{\prime}}\geq\ell_{j}+2$ , so for all remaining clients in $C^{*}$ whose $F$ -balls intersect $j$ ’s, their radius level is exactly one larger than $j$ ’s. ∎

To show that IterativeRound weakly decreases $Opt(LP_{iter})$ , it suffices to show that each iteration weakly decreases $Opt(LP_{iter})$ . We show that in any iteration of IterativeRound, the $\bar{y}$ computed at the beginning of the iteration is still feasible after we update $LP_{iter}$ in that iteration. Then, we show that $\bar{y}$ achieves the same objective value before and after the updates.

Lemma B.3.

Each iteration of IterativeRound weakly decreases $Opt(LP_{iter})$ .

Proof.

Consider any iteration of IterativeRound. Let $\bar{y}$ be the optimal solution to $LP_{iter}$ computed at the beginning of the iteration. There are three possible modifications we can make to $LP_{iter}$ in this iteration: remove a facility from $F$ , move a client from $C_{part}$ to $C_{full}$ , or shrink a $F$ -ball for a client in $C_{full}$ . For each operation, we show that $\bar{y}$ is still feasible and achieves the same objective value afterwards.

For the first operation, if we remove a facility $i$ from $F$ , then it must be the case that $\bar{y}_{i}=0$ . Thus it is immediate that $\bar{y}$ (restricted to $F\setminus\{i\}$ ) is feasible after deleting $i$ , and achieves the same objective value.

Otherwise, suppose there exists $j\in C_{part}$ such that $\bar{y}(F_{j})=1$ , and we move $j$ from $C_{part}$ to $C_{full}$ and then $\textsc{ReRoute}(j)$ . Thus $j$ ends up it either $C_{full}$ or $C^{*}$ at the end of this iteration. In either case, we have $\bar{y}(B_{j})\leq\bar{y}(F_{j})=1$ , so $\bar{y}$ satisfies the corresponding constraint after updating $LP_{iter}$ . Further, because $\bar{y}(F_{j})=1$ , we have:

\sum\limits_{i\in F_{j}}d^{\prime}(j,i)\bar{y}_{i}=\sum\limits_{i\in B_{j}}d^{\prime}(j,i)\bar{y}_{i}+(1-\bar{y}(B_{j}))L(\ell_{j}),

so contribution of $j$ to the objective before is the same as its contribution after.

In the final case, suppose there exists $j\in C_{full}$ such that $\bar{y}(B_{j})=1$ , so we shrink $F_{j}$ and then $\textsc{ReRoute}(j)$ . Let $LP_{iter}^{\prime}$ index the data at the end of the iteration, so $F_{j}^{\prime}=B_{j}$ . Then we have $\bar{y}(B_{j}^{\prime})\leq\bar{y}(F_{j}^{\prime})=1$ , so $\bar{y}$ satisfies the corresponding constraint for $j$ whether $j$ is in $C_{full}$ or $C^{*}$ . To compare the contribution of $j$ to the objective in $LP_{iter}$ and $LP_{iter}^{\prime}$ , we compute:

	$\displaystyle\sum\limits_{i\in B_{j}}d^{\prime}(i,j)\bar{y}_{i}+(1-\bar{y}(B_{j}))L_{\ell_{j}}$	$\displaystyle=\sum\limits_{i\in B_{j}}d^{\prime}(i,j)\bar{y}_{i}+0$
		$\displaystyle=\sum\limits_{i\in B_{j}^{\prime}}d^{\prime}(i,j)\bar{y}_{i}+\sum\limits_{i\in F_{j}^{\prime}\setminus B_{j}^{\prime}}d^{\prime}(i,j)\bar{y}_{i}$
		$\displaystyle=\sum\limits_{i\in B_{j}^{\prime}}d^{\prime}(i,j)\bar{y}_{i}+(1-\bar{y}(B_{j}^{\prime}))L(\ell_{j}^{\prime}).$

∎

Finally, we note that if IterativeRound terminates, then it is clear that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight for $\bar{y}$ by definition of our iterative operations. Thus it suffices to show that IterativeRound terminates in polynomial time.

Lemma B.4.

IterativeRound terminates in polynomial time.

Proof.

It suffices to show that the number of iterations of IterativeRound is polynomial. In each iteration, we make one of three actions. We either delete a facility from $F$ , move a client from $C_{part}$ to $C_{full}$ or shrink a $F$ -ball by one radius level for a client in $j\in C_{full}$ .

We can delete each facility from $F$ at most once, so we make at most $\lvert F\rvert$ deletions. Each client can move from $C_{part}$ to $C_{full}$ at most once, because we never move clients back from $C_{full}$ to $C_{part}$ , so we do this operations at most $\lvert C\rvert$ times.

Finally, observe that $\ell_{j}\geq-1$ for all $j\in C$ over all iterations by Basic Invariant 2.4(4). We conclude that we can shrink each $F$ -ball only polynomially many times. ∎

B.2. Analysis of ConfigReRoute

To prove Lemma 4.4, which bounds the number of fractional facilities needed to have a candidate configuration, we first prove a bound on the number of factional clients needed. The bound on the number of facilities will follow.

Lemma B.5.

Suppose $LP_{iter}$ satisfies all Basic Invariants, and let $\bar{y}$ be an optimal extreme point of $LP_{iter}$ such that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight. If $\lvert C^{*}_{<1}\rvert\geq 14r$ , then there exist a candidate configuration in $C^{*}_{<1}$ .

Proof.

We claim that in order for $C^{*}_{<1}$ to have a candidate configuration, it suffices to have a chain of length at least four in $C^{*}_{<1}$ . To see this, let $(j_{1},j_{2},j_{3},j_{4},\dots)\subset C^{*}_{<1}$ be a chain of length at least four. Then $F_{j_{2}}\cap F_{j_{3}}\neq\emptyset$ , and by the Distinct Neighbors Property, either $\ell_{j_{3}}=\ell_{j_{2}}-1$ or $\ell_{j_{2}}=\ell_{j_{3}}-1$ .

We only consider the former case, because both cases are analogous. Thus, if $\ell_{j_{3}}=\ell_{j_{2}}-1$ , then we claim that $(j_{2},j_{3})$ forms a candidate configuration. We already have the first two properties of a candidate configuration. Now we verify the last two. Because $j_{2}$ and $j_{3}$ are part of a chain, we have $\lvert F_{j_{2}}\rvert=2$ and $\lvert F_{j_{3}}\rvert=2$ . Further, $j_{2}$ has neighbors $j_{1}$ and $j_{3}$ along the chain. By Proposition 2.7, each facility in $F_{j_{2}}$ is in at most two $F$ -balls for clients in $C^{*}$ . In particular, one of the facilities in $F_{j_{2}}$ is shared by $F_{j_{1}}$ and $F_{j_{2}}$ , and the other must be shared by $F_{j_{2}}$ and $F_{j_{3}}$ . Thus, each facility in $F_{j_{2}}$ is in exactly two $F$ -balls for clients in $C^{*}$ . An analogous argument holds for $F_{j_{3}}$ , so $(j_{2},j_{3})$ satisfies all properties of a candidate configuration, as required.

Now suppose $\lvert C^{*}_{<1}\rvert\geq 14r$ . By Theorem 4.2, $C^{*}_{<1}$ admits a chain decomposition into at most $3r$ chains and a set of at most $2r$ violating clients. Then at least $12r$ of the clients in $C^{*}_{<1}$ belong to the $3r$ chains. By averaging, there must exist a chain with size at least $\frac{12r}{3r}=4$ , as required. ∎

Lemma 4.4 is a corollary of the above lemma.

Proof of Lemma 4.4.

By the previous lemma, it suffices to show that $\lvert F_{<1}\rvert\geq 15r$ implies that $\lvert C^{*}_{<1}\rvert\geq 14r$ . Applying Lemma 8.4, we have:

\lvert F_{<1}\rvert\leq dim(C^{*}_{<1})+r\leq\lvert C^{*}_{<1}\rvert+r

, which combined with $\lvert F_{<1}\rvert\geq 15r$ gives the desired result. ∎

Proof of Theorem 4.5.

It is clear that ConfigReRoute can be implemented to run in polynomial time and maintains all Basic Invariants, because ConfigReRoute only moves clients from $C^{*}$ to $C_{full}$ . Thus it remains to show that ConfigReRoute weakly decreases $Opt(LP_{iter})$ .

Using the same strategy as in B.3, we let $LP_{iter}$ denote the LP at the beginning of ConfigReRoute and $\bar{y}$ the optimal extreme point of $LP_{iter}$ . Then we show that $\bar{y}$ is feasible after the operation and achieves the same objective value.

In this call to ConfigReRoute, we move some client $j$ from $C^{*}$ to $C_{full}$ . We have $\bar{y}(B_{j})\leq\bar{y}(F_{j})=1$ , so $\bar{y}$ is feasible after ConfigReRoute. Finally, moving $j$ from $C^{*}$ to $C_{full}$ does not affect its contribution to the objective. ∎

B.3. Analysis of PseudoApproximation

Proof of Theorem 5.2.

It is immediate that PseudoApproximation maintains all Basic Invariants by Theorems 3.1 and 4.5. Further, both of these sub-routines are polynomial time, so to show that PseudoApproximation runs in polynomial time, it suffices to show that the number of calls to IterativeRound and ConfigReRoute is polynomial.

In every iteration of PseudoApproximation, either we terminate or we are guaranteed to move a client from $C^{*}$ to $C_{full}$ in ConfigReRoute. Each client can be removed from $C^{*}$ only polynomially many times, because each time a client is removed, in order to be re-added to $C^{*}$ , it must be the case that we shrunk the $F$ -ball of that client. However, by Basic Invariant 2.4(4), we can shrink each $F$ -ball only polynomially many times.

Finally, upon termination of PseudoApproximation, there is no candidate configuration, so Lemma 4.4 implies that $\bar{y}$ has at most $15r$ fractional variables. ∎

Appendix C Missing Proofs from §6

Proof of Lemma 6.7.

We let $i^{*}\in S$ be the closest facility to $i$ in $S$ . We show that the cost of connecting $C^{\prime}$ to $i^{*}$ is at most $O(\frac{\rho}{\delta})U$ . To do so, we partition $C^{\prime}$ into two sets of clients: those that are far from $i$ relative to $d(i,i^{*})$ , and those that are close to $i$ . In particular, let $\gamma>0$ be a constant that we choose later. Then we partition $C^{\prime}$ into $C^{\prime}_{far}$ and $C^{\prime}_{close}$ , where:

C^{\prime}_{far}=\{j\in C^{\prime}\mid d(j,i)\geq\gamma d(i,i^{*})\},

and

C^{\prime}_{close}=\{j\in C^{\prime}\mid d(j,i)>\gamma d(i,i^{*})\}.

First we bound the connection cost of $C^{\prime}_{far}$ to $i^{*}$ using the fact that $i\notin S_{0}$ , so Extra Invariant 6.4(2) says that $\sum\limits_{j\mid i\in F_{j}}d(i,j)\leq 2\rho U$ . Thus we compute:

\sum\limits_{j\in C^{\prime}_{far}}d(j,i^{*})\leq(1+\frac{1}{\gamma})\sum\limits_{j\in C\mid i\in F_{j}}d(j,i)\leq(1+\frac{1}{\gamma})O(\rho)U

Now suppose $C^{\prime}_{close}\neq\emptyset$ . Fix any $j^{*}\in C^{\prime}_{close}$ . Then for all $j\in C^{\prime}_{close}$ , we have $d(j,j^{*})\leq d(j,i)+d(j^{*},i)\leq 2\gamma d(i,i^{*})$ . It follows that $C^{\prime}_{close}\subset B_{C}(j^{*},2\gamma d(i,i^{*}))$ . Our strategy is to use Extra Invariant 6.4(4):

\lvert B_{C}(j^{*},\delta r)\rvert r\leq\rho U,

for all $r\leq R_{j^{*}}$ . Thus we want $2\gamma d(i,i^{*})\leq\delta R_{j^{*}}$ . To lower bound $R_{j^{*}}$ with respect to $d(i,i^{*})$ , we use our assumption that there exists some $\bar{i}\in S$ such that $d(j^{*},\bar{i})\leq\alpha L(\ell_{j^{*}})$ , where $L(\ell_{j^{*}})\leq\tau R_{j^{*}}$ by Extra Invariant 6.4(4). Thus we have:

d(j^{*},\bar{i})\leq\alpha\tau R_{j^{*}}.

Further, using the triangle inequality and the fact that $i^{*}$ is the closest facility to $i$ in $S$ , we have:

d(j^{*},\bar{i})\geq d(i,\bar{i})-d(i,j^{*})\geq d(i,i^{*})-d(i,j^{*})\geq(1-\gamma)d(i,i^{*}).

Combining these two inequalities gives the lower bound $R_{j^{*}}\geq\frac{1-\gamma}{\alpha\tau}d(i,i^{*})$ .

Now we are ready to choose $\gamma$ . Recall that we want $2\gamma d(i,i^{*})\leq\delta R_{j^{*}}$ , so it suffices to choose $\gamma$ such that:

2\gamma d(i,i^{*})\leq\delta\frac{1-\gamma}{\alpha\tau}d(i,i^{*})

Routine calculations show that we can take $\gamma=\Theta(\delta)$ to satisfy this inequality. Now with this choice of $\gamma$ , we can bound:

	$\displaystyle\sum\limits_{j\in C^{\prime}_{close}}d(j,i^{*})$	$\displaystyle\leq(1+\gamma)\lvert C^{\prime}_{close}\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)\lvert B_{C}(j^{},\delta R_{j^{}})\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)\frac{\rho U}{R_{j^{}}}d(i,i^{})$
		$\displaystyle\leq\frac{\rho U}{R_{j^{}}}O(1)R_{j^{}}=O(\rho)U$

To conclude the proof, the connection cost of $C^{\prime}_{far}$ is at most $(1+\frac{1}{\gamma})O(\rho)U=O(\frac{\rho}{\delta})U$ and the connection cost of $C^{\prime}_{close}$ is at most $O(\rho)U$ . Summing these costs gives the desired result. ∎

Proof of Lemma 6.15.

C^{\prime}_{far}=\{j\in C^{\prime}\mid d(j,i)\geq\gamma d(i,i^{*})\}

, and

C^{\prime}_{close}=\{j\in C^{\prime}\mid d(j,i)>\gamma d(i,i^{*})\}

First we bound the connection cost of $C^{\prime}_{far}$ to $i^{*}$ using the fact that $i\notin S_{0}$ , so Extra Invariant 6.12(2) says that $i$ is cheap. Thus we compute:

\sum\limits_{j\in C^{\prime}_{far}}d(j,i^{*})\leq(1+\frac{1}{\gamma})\sum\limits_{j\in C\mid i\in F_{j}}d(j,i)\leq(1+\frac{1}{\gamma})O(\rho)U

Now suppose $C^{\prime}_{close}\neq\emptyset$ . Importantly, all of these clients are within distance $\gamma d(i,i^{*})$ of $i$ , so we have $C^{\prime}_{close}\subset B_{C}(i,\gamma d(i,i^{*}))$ . Our strategy to bound the connection cost of $C^{\prime}_{close}$ is to leverage Extra Invariant 6.12(4), so in particular we want to use the fact:

\lvert\{j\in B_{C}(i,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}\rvert\leq\frac{\rho(1+3\delta/4)}{1-\delta/4}\frac{U}{t}

for any $t>0$ . We want to choose $\gamma,t>0$ such that $\gamma d(i,i^{*})\leq\frac{\delta t}{4+3+\delta}$ and $R_{j}\geq t$ for all $j\in C^{\prime}_{close}$ . To see why this is useful, for such $\gamma$ and $t$ , we have $C^{\prime}_{close}\subset\{j\in B_{C}(i,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}$ . Then we can bound:

	$\displaystyle\sum\limits_{j\in C^{\prime}_{close}}d(j,i^{*})$	$\displaystyle\leq\sum\limits_{j\in C^{\prime}_{close}}(1+\gamma)d(i,i^{*})$
		$\displaystyle=(1+\gamma)\lvert C^{\prime}_{close}\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)\lvert\{j\in B_{C}(i,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)(\frac{\rho(1+3\delta/4)}{1-\delta/4}\frac{U}{t})d(i,i^{*})$

Now we go back and specify our choice of $\gamma$ and $t$ , which will allow us to complete the bound of the connection costs. First we lower bound $R_{j}$ in terms of $d(i,i^{*})$ for any $j\in C^{\prime}_{close}$ . We recall that by assumption there exists some $\bar{i}\in S$ such that $d(j,\bar{i})\leq\alpha L(\ell_{j})$ , where $L(\ell_{j})\leq\tau R_{j}$ by Extra Invariant 6.12(4). Thus we have:

d(j,\bar{i})\leq\alpha\tau R_{j}

Further, using the triangle inequality and the fact that $i^{*}$ is the closest facility to $i$ in $S$ , we have:

d(j,\bar{i})\geq d(i,\bar{i})-d(i,j)\geq d(i,i^{*})-d(i,j)\geq(1-\gamma)d(i,i^{*})

Combining this inequality with the upper bound on $d(j,\bar{i})$ gives that $R_{j}\geq\frac{1-\gamma}{\alpha\tau}d(i,i^{*})$ for all $j\in C^{\prime}_{close}$ . Then we define $t=\frac{1-\gamma}{\alpha\tau}d(i,i^{*})$ . This gives us $R_{j}\geq t$ for all $j\in C^{\prime}_{close}$ . Now we can choose $\gamma>0$ satisfying:

\gamma d(i,i^{*})\leq\frac{\delta t}{4+3\delta}\Rightarrow\gamma\leq\frac{\delta}{4+3\delta}\frac{1-\gamma}{\alpha\tau}

Taking $\gamma=\frac{\delta}{12\alpha\tau}=\Theta(\delta)$ suffices.

Using these choices of $\gamma$ and $t$ , we can bound:

\sum\limits_{j\in C^{\prime}_{far}}d(j,i^{*})=O(\frac{\rho}{\delta})U

, and

\sum\limits_{j\in C^{\prime}_{close}}d(j,i^{*})\leq(1+\gamma)(\frac{\rho(1+3\delta/4)}{1-\delta/4}U)(\frac{\alpha\tau}{1-\gamma})=O(\rho)U

Summing these two costs gives the desired result. ∎

Proof of Theorem 6.2.

With these choices of parameters, we run the algorithm guaranteed by Theorem 6.13 to obtain $n^{O(1/\epsilon^{\prime})}$ many sub-instances such that one such sub-instance is of the form $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,k,m^{\prime}=m-\lvert C^{*}\setminus C^{\prime}\rvert)$ , where $LP_{iter}$ for $\mathcal{I}^{\prime}$ satisfies all Basic- and Extra Invariants, and we have:

(2)

\frac{\log_{e}\tau}{(\tau-1)(1+\epsilon^{\prime}/2)}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\epsilon^{\prime}}{1+\epsilon^{\prime}}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})\leq U

Then for each sub-instance output by the pre-processing, we run the algorithm guaranteed by Theorem 6.16 to obtain a solution to each sub-instance. Finally, out of these solutions, we output the one that is feasible for the whole instance with smallest cost. This completes the description of our approximation algorithm for $k$ -median with outliers. The runtime is $n^{O(1/\epsilon^{\prime})}$ , so it remains to bound the cost of the output solution and to choose the parameters $\epsilon^{\prime}$ and $\tau$ and $c$ .

To bound the cost, it suffices to consider the solution output on the instance $\mathcal{I}^{\prime}$ where $LP_{iter}$ satisfies all Basic- and Extra Invariants and Equation 2. By running the algorithm guaranteed by Theorem 6.16 on this $LP_{iter}$ , we obtain a feasible solution $S\subset F$ to $\mathcal{I}^{\prime}$ such that $S_{0}\subset S$ , and the cost of connecting $m^{\prime}$ clients from $C^{\prime}$ to $S$ is at most $(2+\alpha)Opt(LP_{iter})+O(\epsilon^{\prime})U$ , where $\alpha=\max(3+2\tau^{-c},1+\frac{4+2\tau^{-c}}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1})$ . To extend this solution on the sub-instance to a solution on the whole instance $\mathcal{I}$ , we must connect $m-m^{\prime}=\lvert C^{*}\setminus C^{\prime}\rvert$ clients from $C\setminus C^{\prime}$ to $S$ . Because $S_{0}\subset S$ , applying Equation 1 allows us to upper bound the expected cost of connecting $m$ clients to $S$ by:

(2+\alpha)\mathbb{E}[Opt(LP_{iter})]+O(\epsilon^{\prime})U+\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})\leq(2+\alpha)\frac{\tau-1}{\log_{e}\tau}\frac{(1+\epsilon^{\prime})^{2}}{1-\epsilon^{\prime}}U+O(\epsilon^{\prime})U

Now choosing $\tau>1$ to minimize $\alpha^{\prime}=(2+\max(3,1+\frac{4}{\tau},\frac{\tau^{3}+2\tau^{2}+1}{\tau^{3}-1}))\frac{\tau-1}{\log_{e}\tau}$ (note that we ignore the $2\tau^{-c}$ terms), we obtain $\tau=1.2074$ and $\alpha^{\prime}=6.947$ . We can choose $c\geq 1$ sufficiently large with respect to $\tau$ such that $2\tau^{-c}$ is sufficiently small to guarantee $(2+\alpha)\frac{\tau-1}{\log_{e}\tau}\leq 6.947+\epsilon^{\prime}$

Thus the expected cost of this solution is at most $(6.947+\epsilon^{\prime})\frac{(1+\epsilon^{\prime})^{2}}{1-\epsilon^{\prime}}U+O(\epsilon^{\prime})U$ , where $U\leq(1+\epsilon^{\prime})Opt(\mathcal{I})$ . Finally, by routine calculations, we can choose $\epsilon^{\prime}=\theta(\epsilon)$ so that expected cost is at most $(6.947+\epsilon)Opt(\mathcal{I})$ , as required. Note that the runtime of our algorithm is $n^{O(1/\epsilon^{\prime})}=n^{O(1/\epsilon)}$ . ∎

Appendix D Missing Proofs from §7: Analysis of OutliersPostProcess

In this section we present all missing proofs from the analysis of OutliersPostProcess and its subroutine ComputePartial.

D.1. Missing Proofs from Analysis of OutliersPostProcess

Proof of Lemma 7.2.

Without loss of generality, we may assume that no facilities in $S_{0}$ are co-located with each other, so $\{F_{j}\mid j\in C_{0}\}$ is a disjoint family. This implies that $\{F_{j}\mid j\in C^{*}_{<1}\}$ is also a disjoint family. Now we construct a basis for $\bar{y}$ . For every integral facility $i\in F_{=1}$ , we add the constraint $y_{i}\leq 1$ to our basis. To complete the basis, we need to add $\lvert F_{<1}\rvert$ further linearly independent tight constraints.

We recall that upon termination of PseudoApproximation, no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight for $\bar{y}$ , so the only constraints we can choose are the $C^{*}$ -constraints, the $k$ -constraint, or the coverage constraint. We claim that we cannot add any $C^{*}_{=1}$ -constraint to our basis, because such a constraint is supported only on integral facilities, whose constraints we already added to the basis. However, we can add every $C^{*}_{<1}$ -constraint to our basis, because their supports are disjoint and they contain no integral facilities. Thus, our partial basis consists of all tight integrality constraints and all $C^{*}_{<1}$ -constraints.

Now we consider adding the $k$ -constraint to our basis. Importantly, the $k$ -constraint is linearly independent with the current partial basis only if there exists at least one fractional facility not supported on any $F$ -ball for clients in $C^{*}_{<1}$ . Further, we may assume the $k$ -constraint is tight (otherwise we cannot add it anyways), so there must be at least two fractional facilities not supported on any $F$ -ball for clients in $C^{*}_{<1}$

However, we note that each $F$ -ball for clients in $C^{*}_{<1}$ contains at least two fractional facilities. Because these $F$ -balls are disjoint, we have $\lvert F_{<1}\rvert\geq 2\lvert C^{*}_{<1}\rvert$ . If we cannot add the $k$ -constraint to our basis, then we are done. This is because the coverage constraint is the only further constraint we can add the the basis, so we can bound $\lvert F_{<1}\rvert\leq\lvert C^{*}_{<1}\rvert+1$ . This implies implies $\lvert F_{<1}\rvert\leq\frac{1}{2}\lvert F_{<1}\rvert+1\Rightarrow\lvert F_{<1}\rvert\leq 2$ using the previous inequality.

Otherwise, we add the $k$ -constraint to our basis, which implies $\lvert F_{<1}\rvert\geq 2\lvert C^{*}_{<1}\rvert+2$ because of the two fractional facilities outside $F(C^{*}_{<1})$ and $\lvert F_{<1}\rvert\leq\lvert C^{*}_{<1}\rvert+2$ because the $k$ -constraint and coverage constraint contribute are the only further constraints we can add. Again combining these two inequalities gives $\lvert F_{<1}\rvert\leq 2$ . ∎

Proof of Lemma 7.3.

Let $S=F_{=1}$ be the set of open facilities. It is immediate that $\lvert S\rvert\leq k$ . Further, $LP_{iter}$ satisfies all Extra Invariants, so $C_{0}\subset C^{*}$ . Because $\bar{y}$ is integral, it is clear that we open $S_{0}$ . Thus it remains to show that the connecting $m$ clients to $S$ has cost at most $(2+\alpha)Opt(LP_{iter})$ .

It suffices to show that connecting $C_{full}$ and $C^{*}$ to $S$ is enough clients and achieves the desired cost. Because $\bar{y}$ is integral and by definition of PseudoApproximation, we have that no $C_{part}$ -, $C_{full}$ -, or non-negativity constraint is tight for $\bar{y}$ . It follows, $F_{j}=\emptyset$ for all $j\in C_{part}$ and $B_{j}=\emptyset$ for all $j\in C_{full}$ .

Then the coverage constraint of $LP_{iter}$ implies:

\sum\limits_{j\in C_{part}}\bar{y}(F_{j})\geq m-\lvert C_{full}\cup C^{*}\rvert\ \Rightarrow\lvert C_{full}\cup C^{*}\rvert\geq m

, so this solution connects enough clients.

To bound the cost, we compare the connection cost of each client with its contribution to the objective of $LP_{iter}$ . For all $j\in C^{*}$ we have $\bar{y}(F_{j})=1$ , so $d(j,S)\leq\sum\limits_{i\in F_{j}}d^{\prime}(j,i)\bar{y}_{i}$ , which is exactly $j$ ’s contribution to $LP_{iter}$ .

For all $j\in C_{full}$ , we note that $\bar{y}(B_{j})=0$ , so $j$ ’s contribution to $LP_{iter}$ is exactly $L(\ell_{j})$ . We can apply 5.3 with $\beta=1$ and set of facilities $S$ to show that $d(j,S)\leq(2+\alpha)L(\ell_{j})$ for all $j\in C_{full}$ . To conclude, the connection cost of each client is at most $(2+\alpha)$ times its contribution to $LP_{iter}$ , a required. ∎

Proof of Lemma 7.4.

Let $S=F_{=1}\cup\{a\}$ be the output solution. First, note that $\lvert S\rvert\leq k$ because $\bar{y}_{a}+\bar{y}_{b}=1$ , and those are the only two fractional variables. Second, because $a\notin S_{0}$ , it must be the case that $b\notin S_{0}$ , because $a,b$ are the only fractional facilities, and by Extra Invariant 6.12 (1), there is one unit of open facility co-located at each $i\in S_{0}$ . Note that this implies that $S_{0}\subset F_{=1}\subset S$ .

Now there are two cases, either $a,b\in F_{j}$ for some $j\in C^{*}$ , or $a,b\notin F(C^{*})$ . Note that in either case, we close $b$ and open $a$ , so we still maintain the property that $\bar{y}(F_{j})=1$ for all $j\in C^{*}$ . Thus, can apply 5.3 with $\beta=1$ and set of facilities $S$ to show that $d(j,S)\leq(2+\alpha)L(\ell_{j})$ for all $j\in C_{full}\cup C^{*}$ .

We consider connecting the clients $C_{part}(a)\cup C_{full}\cup C^{*}$ to $S$ . First, we show that this is at least $m$ clients. We observe that $a$ and $b$ are the only fractional facilities in $\bar{y}$ , and no $C_{part}$ -constraint is tight. It follows that for all $j\in C_{part}$ , we have $F_{j}=\{a\}$ , $\{b\}$ , or $\emptyset$ , so we can rewrite the coverage constraint as:

\lvert C_{part}(a)\rvert\bar{y}_{a}+\lvert C_{part}(b)\rvert\bar{y}_{b}\geq m-\lvert C_{full}\cup C^{*}\rvert

Then because $\bar{y}_{a}+\bar{y}_{b}=1$ and $\lvert C_{part}(a)\rvert\geq\lvert C_{part}(b)\rvert$ by assumption, we conclude that $\lvert C_{part}(a)\rvert\geq m-\lvert C_{full}\cup C^{*}\rvert$ , as required.

Now it remains to show that the cost of connecting $C_{part}(a)$ to $a$ plus the cost of connecting $C_{full}\cup C^{*}$ to $S$ is at most $\alpha Opt(LP_{iter})+O(\frac{\rho}{\delta})U$ . First we handle $C_{part}(a)$ . By assumption, $a\notin S_{0}$ , so by Extra Invariant 6.12(2), we can bound:

\sum\limits_{j\in C_{part}(a)}d(j,a)\leq\sum\limits_{j\in C\mid a\in F_{j}}d(j,a)=O(\rho)U

For the clients in $C_{full}\cup C^{*}$ that are not supported on $b$ , closing $b$ does not affect their connection cost; in particular, each such client either has an integral facility in its $F$ -ball to connect to (because we open $a$ and all other facilities are integral), or its $F$ -ball is empty, and there exists an integral facility within $(2+\alpha)L(\ell_{j})$ to connect to. In both cases, each client’s connection cost is at most $(2+\alpha)$ times its contribution to the objective of $LP_{iter}$ .

The only remaining cost to bound is the clients in $C_{full}\cup C^{*}$ that are supported on $b$ . Let $C^{\prime}=\{j\in C_{full}\cup C^{*}\mid b\in F_{j}\}$ be these clients. We show that the cost of connecting all of $C^{\prime}$ to $S$ is at most $O(\frac{\rho}{\delta})U$ using Lemma 6.15. Because every client in $j\in C_{full}\cup C^{*}$ has an open facility in $S$ within distance $(2+\alpha)L(\ell_{j})$ , Lemma 6.15 is applicable to $C^{\prime}$ with set of facilities $S$ and $i=b\notin S\cup S_{0}$ .

To summarize, the connection costs of $C_{part}(a)$ and $C^{\prime}$ are at most $O(\frac{\rho}{\delta})U$ , and the connection cost of all remaining clients in $C_{full}\cup C^{*}$ that are not supported on $b$ is at most $(2+\alpha)Opt(LP_{iter})$ , so the total connection cost, which is the sum of these terms, it at most the desired bound. ∎

D.2. Missing Proofs from Analysis of ComputePartial

Proof of 7.7.

Assume for contradiction that there exists $\bar{j}$ that is reached by the For loop, but $\bar{j}$ does not remain in $\bar{C}$ until termination. Note that $\bar{j}$ cannot be removed from $\bar{C}$ in the iteration that it is considered in the For loop. Thus there must exist a later iteration for client, say $j$ in which $\bar{j}$ is removed from $\bar{C}$ . In the iteration for client $j$ , there are only two possible ways that $\bar{j}$ is removed from $\bar{C}$ . Either $F_{j}\cap F_{\bar{j}}\neq\emptyset$ or there exists a client $j^{\prime}\in C_{part}\setminus C_{covered}$ such that $F_{j^{\prime}}$ intersects both $F_{j}$ and $F_{\bar{j}}$ and $\ell_{j^{\prime}}\leq\ell_{j}-c$ .

In the former case, because we consider $\bar{j}$ before $j$ , it must be the case that we removed $j$ from $\bar{C}$ in $\bar{j}$ ’s iteration. This is a contradiction. Similarly, in the second case if such a $j^{\prime}$ exists, then in $\bar{j}$ ’s iteration, we either remove $j$ from $\bar{C}$ or add $j^{\prime}$ to $C_{covered}$ . In either case, this is a contradiction. ∎

Proof of 7.8.

By assumption, the input to ComputePartial, $LP_{iter}$ , satisfies all Basic and Extra Invariants. To obtain $LP_{iter}^{1}$ from $LP_{iter}$ , we delete some clients and facilities. Thus the only change to the $F$ - and $B$ -balls for clients in $C^{1}$ is that we possibly remove some facilities from their $F$ - and $B$ -balls; importantly, the radius levels, $\ell_{j}$ for all clients $j$ , remain the same. Thus, it is easy to see that $LP_{iter}^{1}$ satisfies all Basic Invariants.

Similarly, for all remaining clients $j$ , we have not changed $\ell_{j}$ or $R_{j}$ , so the only Extra Invariant that requires some care to verify is Extra Invariant 6.12(1). However, we recall that to obtain ${C^{*}}^{1}$ , we delete all clients in $C^{*}\setminus C_{0}$ from the instance, so ${C^{*}}^{1}=C_{0}$ . This is because $C_{0}\subset C^{*}$ by the assumption that $LP_{iter}$ satisfies all Extra Invariants. ∎

Proof of 7.10.

We note that $C^{1}=C\setminus(C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0}))$ and $F^{1}=F\setminus(F_{=1}\cup F(\bar{C})\setminus S_{0})$ . Then we have $C_{part}^{1}=C_{part}\setminus(C_{part}(\bar{S})\cup C_{covered})$ , $C_{full}^{1}=\emptyset$ , and ${C^{*}}^{1}=C_{0}$ .

It suffices to show that all $C_{part}^{1}$ -constraints, ${C^{*}}^{1}$ -constraint, the $k$ -constraint, and the coverage constraint are satisfied by $\bar{y}$ restricted to $F^{1}$ .

Consider any $j\in C_{part}^{1}$ . We observe that the $F_{j}^{1}\subset F_{j}$ for all $j\in C_{part}^{1}$ and $C_{part}^{1}\subset C_{part}$ .Then $\bar{y}(F_{j}^{1})\leq\bar{y}(F_{j})\leq 1$ for all $j\in C_{part}^{1}$ .

Now for any $j\in{C^{*}}^{1}=C_{0}$ , it suffices to show that we do not delete any copies of $i\in S_{0}$ when going from from $F$ to $F^{1}$ , but this is immediate because $S_{0}\subset F$ , and we do not delete any facility from $S_{0}$ to obtain $F^{1}$ . Thus, every ${C^{*}}^{1}$ -constraint is satisfied.

For the $k$ -constraint, we have $\bar{y}(F)\leq k$ . We want to show $\bar{y}(F^{1})\leq k-\lvert S\rvert$ . By definition $\lvert\bar{S}\rvert=\sum\limits_{j\in\bar{C}}\bar{y}(F_{j})=\bar{y}(F(\bar{C}))$ , where the final equality follows because $\{F_{j}\mid j\in\bar{C}\}$ is a disjoint collection by Proposition 7.5. Then we compute:

\bar{y}(F^{1})=\bar{y}(F)-\bar{y}(F(\bar{C}))-\lvert F_{=1}\setminus S_{0}\rvert\leq k-\lvert\bar{S}\rvert-\lvert F_{=1}\setminus S_{0}\rvert=k-\lvert S\rvert

, as required.

Finally, for the coverage constraint, we want to show:

\sum\limits_{j\in C_{part}^{1}}\bar{y}(F_{j}^{1})\geq m^{1}-\lvert C_{full}^{1}\cup{C^{*}}^{1}\rvert,

where $C_{part}^{1}=C_{part}\setminus(C_{part}(\bar{S})\cup C_{covered})$ , $m^{1}=m-\lvert C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup(C^{*}\setminus C_{0})\rvert$ , and $C_{full}^{1}=\emptyset$ and ${C^{*}}^{1}=C_{0}$ , so $\lvert C_{full}^{1}\cup{C^{*}}^{1}\rvert=\lvert C_{0}\rvert$ . Thus we can re-write the coverage constraint as:

\sum\limits_{j\in C_{part}^{1}}\bar{y}(F_{j}^{1})\geq m-\lvert C_{part}(\bar{S})\cup C_{covered}\cup C_{full}\cup{C^{*}}\rvert.

Recall that the coverage constraint of $LP_{iter}$ implies:

\sum\limits_{j\in C_{part}}\bar{y}(F_{j})\geq m-\lvert C_{full}\cup C^{*}\rvert.

By splitting this inequality into the contribution by $C_{part}^{1}$ and $C_{part}\setminus C_{part}^{1}=C_{part}(\bar{S})\cup C_{covered}$ , we obtain:

	$\displaystyle\sum\limits_{j\in C_{part}}\bar{y}(F_{j})$	$\displaystyle\geq m-\lvert C_{full}\cup C^{*}\rvert$
	$\displaystyle\sum\limits_{j\in C_{part}^{1}}\bar{y}(F_{j})+\sum\limits_{j\in C_{part}(\bar{S})\cup C_{covered}}\bar{y}(F_{j})$	$\displaystyle\geq m-\lvert C_{full}\cup C^{*}\rvert$
	$\displaystyle\sum\limits_{j\in C_{part}^{1}}\bar{y}(F_{j})+\sum\limits_{j\in C_{part}(\bar{S})}\bar{y}(F_{j})$	$\displaystyle\geq m-\lvert C_{covered}C_{full}\cup C^{*}\rvert$

, where in the final inequality we use the fact that $\bar{y}(F_{j})\leq 1$ for all $j\in C_{covered}\subset C_{part}$ . Now, we recall that $F_{j}^{1}=F_{j}\setminus F(\bar{C})$ for all $j\in C_{part}^{1}$ , because $C_{part}^{1}\subset C_{part}$ . We can re-write:

\sum\limits_{j\in C_{part}^{1}}\bar{y}(F_{j})=\sum\limits_{j\in C_{part}^{1}}(\bar{y}(F_{j}^{1})+\bar{y}(F_{j}\cap F(\bar{C})).

To show that the coverage constraint is satisfied, it suffices to show:

\sum\limits_{j\in C_{part^{1}}}\bar{y}(F_{j}\cap F(\bar{C}))+\sum\limits_{j\in C_{part}(\bar{S})}\bar{y}(F_{j})\leq\lvert C_{part}(\bar{S})\rvert.

To see this, observe that the first sum is over all clients in $C_{part}\setminus C_{covered}$ supported on some facility in $F(\bar{C})\setminus\bar{S}$ but none in $\bar{S}$ (otherwise these clients would be in $C_{part}(\bar{S})$ .) The second sum is over all clients in $C_{part}\setminus C_{covered}$ supported on some facility in $\bar{S}$ . Thus, recalling that $w_{i}=\lvert\{j\in C_{part}\setminus C_{covered}\mid i\in F_{j}\}\rvert$ , we have:

\sum\limits_{j\in C_{part^{1}}}\bar{y}(F_{j}\cap F(\bar{C}))+\sum\limits_{j\in C_{part}(\bar{S})}\bar{y}(F_{j})=\sum\limits_{j\in\bar{C}}\sum\limits_{i\in F_{j}}w_{i}\bar{y}_{i}\leq\lvert C_{part}(\bar{S})\rvert,

where in the final inequality we apply 7.9. ∎

Appendix E Missing Proofs from §8: Chain Decomposition

Proof of 8.4.

We construct a basis $\bar{y}$ . First, for each integral facility $i\in F_{=1}$ , we add the integrality constraint $\bar{y}_{i}\leq 1$ to our basis. Thus we currently have $\lvert F_{=1}\rvert$ constraints in our basis.

It remains to choose $\lvert F_{<1}\rvert$ further linearly independent constraints to add to our basis. Note that we have already added all tight integrality constraints to our basis, and no non-negativity constraint is tight. Then the only remaining constraints we can add are the $C^{*}$ -constraints and the $r$ constraints of $Ay\leq b$ .

We claim that we cannot add any $C^{*}_{=1}$ -constraints, because every $C^{*}_{=1}$ -constraint is of the form $y(F_{j})=y_{i_{j}}=1$ for the unique integral facility $i_{j}\in F_{1}$ . Note that here we used the fact that there is no facility that is set to zero. Thus every $C^{*}_{=1}$ -constraint is linearly dependent with the tight integrality constraints, which we already chose.

It follows, the only possible constraints we can choose are the $C^{*}_{<1}$ -constraints and the $r$ constraints of $Ay\leq b$ so:

\lvert F_{<1}\rvert\leq dim(C^{*}_{<1})+r.

∎

Proof of 8.5.

It suffices to upper bound the quantity $\sum\limits_{j\in C^{*}_{<1}}(\lvert F_{j}\rvert-2)$ by $2r$ , because each client in $V$ contributes at least one to this sum, and every term of this sum is non-negative (because every $F$ -ball contains at least two facilities.)

Using Proposition 2.7, we can bound $\sum\limits_{j\in C^{*}_{<1}}\lvert F_{j}\rvert\leq 2\lvert F(C^{*}_{<1})\rvert\leq 2\lvert F_{<1}\rvert$ , where in the final inequality, we use the fact that every $F$ -ball for clients in $C^{*}_{<1}$ is supported on only fractional facilities. To upper bound our desired quantity, we use this inequality combined with Lemma 8.4 to obtain:

\sum\limits_{j\in C^{*}_{<1}}(\lvert F_{j}\rvert-2)\leq 2\lvert F_{<1}\rvert-2\lvert C^{*}_{<1}\rvert\leq 2(dim(C^{*}_{<1})+r)-2\lvert C^{*}_{<1}\rvert\leq 2r.

∎

Proof of 8.6.

Consider any $V_{k}$ , which is the vertex set of $H_{k}$ . We already established that $H_{k}$ is either a path or even cycle. In both cases, we can order $V_{k}=\{j_{1},\dots,j_{p}\}$ such that $j_{1}\rightarrow j_{2}\rightarrow\dots\rightarrow j_{p}$ is a path in the intersection graph. We verify that $V_{k}$ satisfies both properties of a chain.

Because $V_{k}\subset C^{*}_{<1}\setminus V$ , we have $\lvert F_{j_{q}}\rvert=2$ for all $q\in[p]$ , and because $j_{q}\rightarrow j_{q+1}$ is an edge in the intersection graph for all $q\in[p-1]$ , we have $F_{j_{q}}\cap F_{j_{q+1}}\neq\emptyset$ for all $q\in[p-1]$ . ∎

Proof of 8.7.

Because $H_{k}$ is a path, suppose $V_{k}=\{j_{1},\dots,j_{p}\}$ such that $j_{1}\rightarrow\dots\rightarrow j_{p}$ . There are two cases to consider.

We first handle the degenerate case where there exists $q\in[p-1]$ such that $\lvert F_{j_{q}}\cap F_{j_{q+1}}\rvert=2$ . Note that each $F$ -ball for clients in $V_{k}$ has size exactly two, so we have $F_{j_{q}}=F_{j_{q+1}}$ . Then both facilities in $F_{j_{q}}$ are already in exactly two $F$ -balls, so $j_{q}$ and $j_{q+1}$ can have no other neighbors in the intersection graph. This implies that $H_{k}$ is path of length two, so $j_{1}\rightarrow j_{2}$ , such that $F_{j_{1}}=F_{j_{2}}$ . To finish this case, we note that $\lvert F(V_{k})\rvert=2$ , but $dim(V_{k})=1$ , because both constraints $y(F_{j_{1}})=1$ and $y(F_{j_{2}})=1$ are the same.

In the second case, for all $q\in[p-1]$ , we have $\lvert F_{j_{q}}\cap F_{j_{q+1}}\rvert=1$ . Using the fact that each facility is in at most two $F$ -balls (Proposition 2.7), we have that each non-leaf client on the path has two facilities in its $F$ -ball - one that it shares with the previous client on the path and one that it shares with the next. For the leaf clients, they share one facility with their single neighbor in the path, and they have one facility that is not shared with any other client in $V_{k}$ . With these observations, we can bound:

dim(V_{k})\leq\lvert V_{k}\rvert=\frac{1}{2}\sum\limits_{j\in V_{k}}\lvert F_{j}\rvert=\frac{1}{2}(2\lvert F(V_{k})\rvert-2)=\lvert F(V_{k})\rvert-1.

∎

Proof of 8.8.

Each $F$ -ball for clients along this cycle contains exactly two facilities. Using Proposition 2.7, each client along this cycle shares one of its facilities with the previous client in the cycle and one with the next client. This implies that each facility in $F(V_{k})$ is in exactly two $F$ -balls. Combining these two observations gives:

\lvert V_{k}\rvert=\frac{1}{2}\sum\limits_{j\in V_{k}}\lvert F_{j}\rvert=\frac{1}{2}(2\lvert F(V_{k})\rvert)=\lvert F(V_{k})\rvert.

Thus, to prove $\lvert F(V_{k})\rvert>dim(V_{k})$ , it suffices to show that $dim(V_{k})<\lvert V_{k}\rvert$ . We do this by showing that the constraints $\{y(F_{j})=1\mid j\in V_{k}\}$ are not linearly independent. By assumption, $H_{k}$ is bipartite with bipartition, say $L\cup R=V_{k}$ . Consider the linear combination of the constraints $\{y(F_{j})=1\mid j\in V_{k}\}$ , where every constraint indexed by $L$ has coefficient $1$ and every constraint indexed by $R$ has coefficient $-1$ . Then for every facility in $F(V_{k})$ , it is in exactly two $F$ -balls, and these two $F$ -balls must be on opposite sides of the bipartition, so each facility in $F(V_{k})$ has coefficient $0$ in this linear combination. In conclusion, we have constructed a non-trivial linear combination of the constraints $\{y(F_{j})\mid j\in V_{k}\}$ whose left hand side is the zero vector, so $dim(V_{k})\leq\lvert V_{k}\rvert$ . ∎

Appendix F Proof of Theorem 6.13: $k$ -Median with Outliers Pre-Processing

The goal of this section is to prove Theorem 6.13 using the relevant theorems from [KLS18]. Note that we follow exactly the same pre-processing steps; the only difference is that we summarize the results of their pre-processing in a single theorem.

The proof of the knapsack pre-processing, 6.5, follows analogously from the pre-processing steps in [KLS18] as well.

F.1. Preliminaries

We define the notions of extended instances and sparse extended instances for $k$ -median with outliers. These definitions are useful to capture the properties of our pre-processing.

Extended instances are used to handle the fact that in our pre-processing, we will guess some facilities to pre-open. Then $S_{0}$ is the set of guessed facilities.

Definition F.1 (Extended Instance for $k$ -Median with Outliers).

An extended instance for $k$ -median with outliers is of the form $\mathcal{I}=(F,C,d,k,m,S_{0})$ , where $F$ , $C$ , $d$ , $k$ , and $m$ are defined as in a standard $k$ -median with outliers instance (see Definition 6.11), and $S_{0}\subset F$ .

As in $k$ -median with outliers, the goal is to choose a set of at most $k$ open facilities $S\subset F$ and at least $m$ clients $C^{\prime}\subset C$ to serve to minimize the connection costs of the served clients to the open facilities, so $\sum\limits_{j\in C^{\prime}}d(j,S)$ . However, we add the additional constraint that the set of open facilities must include $S_{0}$ .

Further, sparse extended instances give our properties for what it means for the facilities and clients to be “cheap” (see the second and third properties in the next definition, respectively.)

Definition F.2 (Sparse Extended Instance for $k$ -Median with Outliers).

Let $\mathcal{I}^{\prime}=(F,C^{\prime},d,k,m^{\prime},S_{0})$ be an extended $k$ -median with outliers instance and $\rho,\delta\in(0,\frac{1}{2})$ , $U\geq 0$ be parameters. We say that $\mathcal{I}^{\prime}$ is $(\rho,\delta,U)$ -sparse with respect to solution $(S^{*},{C^{*}}^{\prime})$ if the following three properties hold:

(1)

the cost of the solution $(S^{*},{C^{*}}^{\prime})$ to $\mathcal{I}^{\prime}$ is at most $U$
(2)

for all $i\in S^{*}\setminus S_{0}$ , we have $\sum\limits_{j\in{C^{*}}^{\prime}\mid d(j,S^{*})=d(j,i)}d(j,i)\leq\rho U$
(3)

for all $p\in F\cup C^{\prime}$ , we have $\lvert B_{C^{\prime}}(p,\delta d(p,S^{*}))\rvert d(p,S^{*})\leq\rho U$

F.2. Sparsification

In this section, we pass from the input $k$ -median with outliers instance to a sparse extended sub-instance by guessing the expensive parts of the input instance. Then on this sparse extended sub-instance, we can strengthen $LP_{1}$ . The following theorems are directly from [KLS18], so we omit the proofs in this paper. The first theorem states that we can efficiently compute a sparse extended sub-instance at the cost of a small increase in approximation ratio.

Theorem F.3.

Let $\mathcal{I}=(F,C,d,m,k)$ be an instance of $k$ -median with outliers with optimal solution $(S^{*},C^{*})$ and $\rho,\delta\in(0,1/2)$ be parameters. Then there exists a $n^{O(1/\rho)}$ -time algorithm that given $\mathcal{I}$ , $\rho$ , $\delta$ , and an upper bound $U$ on the cost of the optimal solution $(S^{*},C^{*})$ ¹¹1Note that we are given $U$ , but not the solution $(S^{*},C^{*})$ , outputs $n^{O(1/\rho)}$ -many extended $k$ -median with outliers instances of the form $\mathcal{I}^{\prime}=(F,C^{\prime},d,m^{\prime},k,S_{0})$ such that $C^{\prime}\subset C$ , $m^{\prime}=\lvert C^{*}\cap C^{\prime}\rvert$ , and $S_{0}\subset S$ . Further, one such instance $\mathcal{I}^{\prime}$ is $(\rho,\delta,U)$ -sparse with respect to the solution $(S^{*},C^{*}\cap C^{\prime})$ and satisfies:

(3)

\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})+\sum\limits_{j\in C^{*}\cap C^{\prime}}d(j,S^{*})\leq U

Once we have our sparse extended sub-instance, say $\mathcal{I}^{\prime}$ , we use these sparsity properties to compute the $R$ -vector, which is needed for our Extra Invariants.

Theorem F.4.

Let $\mathcal{I}^{\prime}=(F,C^{\prime},d,m^{\prime},k,S_{0})$ be an extended $k$ -median with outliers instance and $\rho,\delta\in(0,1/2)$ and $U\geq 0$ . Suppose $\mathcal{I}^{\prime}$ is $(\rho,\delta,U)$ -sparse instance with respect to solution $(S^{*},{C^{*}}^{\prime})$ to $\mathcal{I}^{\prime}$ such that $(S^{*},{C^{*}}^{\prime})$ has cost $U^{\prime}$ on $\mathcal{I}^{\prime}$ . Then there exists a polynomial time algorithm that takes as input $\mathcal{I}^{\prime}$ , $\rho$ , $\delta$ , and $U$ and outputs $R\in\mathbb{R}_{+}^{C^{\prime}}$ satisfying:

(1)

For every $t>0$ and $p\in F\cup C^{\prime}$ , we have:

\lvert\{j\in B_{C^{\prime}}(p,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}\rvert\leq\frac{\rho(1+3\delta/4)}{1-\delta/4}\frac{U}{t}

(2)

There exists a solution to $\mathcal{I}^{\prime}$ of cost at most $(1+\delta/2)U^{\prime}$ such that if client $j$ is connected to facility $i$ , then $d(j,i)\leq R_{j}$ and for any facility $i\notin S_{0}$ , the total cost of clients connected to $i$ in this solution is at most $\rho(1+\delta/2)U$

F.3. Putting it all Together: Proving Theorem 6.13

Combining the algorithms guaranteed by these above two theorems, we show how to construct $LP_{iter}$ with the desired properties.

Suppose we are given a $k$ -median with outliers instance $\mathcal{I}=(F,C,d,m,k)$ , parameters $\rho,\delta\in(0,\frac{1}{2})$ , and an upper bound $U$ of $Opt(\mathcal{I})$ . First we run the algorithm guaranteed by Theorem F.3 to obtain $n^{O(1/\rho)}$ -many extended $k$ -median with outliers instances. Then for each instance, we run the algorithm guaranteed by Theorem F.4 to obtain a vector $R$ for each such instance.

By Theorem F.3, let $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,m^{\prime}=m-\lvert C^{*}\setminus C^{\prime}\rvert,k,S_{0})$ be the instance output by the first algorithm such that $\mathcal{I}^{\prime}$ is $(\rho,\delta,U)$ -sparse with respect to the solution $(S^{*},C^{*}\cap C^{\prime})$ and satisfies Equation 3. This sub-instance will be the one that is guaranteed by Theorem 6.13, so from here we need to compute the $R$ -vector, and construct $LP_{iter}$ with the desired properties.

Note that the cost of solution $(S^{*},C^{*}\cap C^{\prime})$ to $\mathcal{I}^{\prime}$ is exactly $U^{\prime}=\sum\limits_{j\in C^{*}\cap C^{\prime}}d(j,S^{*})$ . It follows, on this instance $\mathcal{I}^{\prime}$ , the algorithm guaranteed by Theorem F.4 outputs a vector $R\in\mathbb{R}_{+}^{C^{\prime}}$ such that for every $t>0$ and $p\in F\cup C^{\prime}$ , we have:

\lvert\{j\in B_{C^{\prime}}(p,\frac{\delta t}{4+3\delta})\mid R_{j}\geq t\}\rvert\leq\frac{\rho(1+3\delta/4)}{1-\delta/4}\frac{U}{t}

, and there exists a solution, say $(\bar{S},\bar{C})$ to $\mathcal{I}^{\prime}$ of cost at most $(1+\delta/2)U^{\prime}$ such that if $j$ is connected to facility $i$ , then $d(j,i)\leq R_{j}$ and for any facility $i\in\bar{S}\setminus S_{0}$ , the total cost of clients connected to $i$ in this solution is at most $\rho(1+\delta/2)U$ .

It remains to construct $LP_{iter}$ . To do so, first we construct a strengthened LP for the instance $\mathcal{I}^{\prime}$ such that $(\bar{S},\bar{C})$ is feasible for the strengthened LP, which we call $LP_{1}^{\prime}$ :

( $LP_{1}^{\prime}$ )		$\displaystyle\min\limits_{x,y}~{}~{}$	$\displaystyle\sum\limits_{i\in F,j\in C^{\prime}}d(i,j)x_{ij}$
	s.t.	$\displaystyle x_{ij}\leq y_{i}\quad\forall i\in F,j\in C^{\prime}$	$\displaystyle y_{i}=1\quad\forall i\in S_{0}$
		$\displaystyle\sum\limits_{i\in F}x_{ij}\leq 1\quad\forall j\in C^{\prime}$	$\displaystyle x_{ij}=0\quad\forall i\in F,j\in C^{\prime}\text{ s.t. }d(i,j)>R_{j}$
		$\displaystyle\sum\limits_{i\in F}y_{i}\leq k$	$\displaystyle\sum\limits_{j\in C^{\prime}}d(i,j)x_{ij}\leq\rho(1+\delta/2)Uy_{i}\quad\forall i\notin S_{0}$
		$\displaystyle\sum\limits_{j\in C^{\prime},i\in F}x_{ij}\geq m$	$\displaystyle x_{ij}=0\quad\forall i\notin S_{0},j\in C^{\prime}\text{ s.t. }d(i,j)>\rho(1+\delta/2)U$
		$\displaystyle 0\leq x,y\leq 1$

The left column of constraints are the same as $LP_{1}$ and the right column of constraints are extra constraints that are valid for the solution $(\bar{S},\bar{C})$ to our sub-instance $\mathcal{I}^{\prime}$ . Because these constraints are valid for the solution $(\bar{S},\bar{C})$ , the following proposition is immediate.

Proposition F.5.

$Opt(LP_{1}^{\prime})\leq(1+\delta/2)U^{\prime}$ .

From here, we want to carry out a similar construction as in §2, where we construct $LP_{iter}$ satisfying all Basic Invariants from $LP_{1}$ . We note that the main difference in our procedure here when compared to §2 is how we eliminate the $x$ -variables. To compute the $F$ -balls for $LP_{1}^{\prime}$ , we must carefully duplicate facilities to capture the constraints: $\sum\limits_{j\in C^{\prime}}d(i,j)x_{ij}\leq\rho(1+\delta/2)Uy_{i}\quad\forall i\notin S_{0}$ . We pass from $LP_{1}^{\prime}$ to $LP_{2}$ with the next lemma.

Lemma F.6.

There exists a polynomial time algorithm that takes as input $LP_{1}^{\prime}$ , duplicates facilities in $F$ , and outputs a vector $\bar{y}\in[0,1]^{F}$ and sets $F_{j}\subset B_{F}(j,R_{j})$ for all $j\in C^{\prime}$ such that:

(1)

$\bar{y}(F_{j})\leq 1$ for all $j\in C^{\prime}$
(2)

$\bar{y}(F)\leq k$
(3)

$\sum\limits_{j\in C^{\prime}}\bar{y}(F_{j})\geq m$
(4)

$\sum\limits_{j\in C}\sum\limits_{i\in F_{j}}d(i,j)\bar{y}_{i}\leq Opt(LP_{1}^{\prime})$
(5)

For all $i\in S_{0}$ , there is one unit of open facility co-located with $i$ in $\bar{y}$
(6)

For every facility $i$ not co-located with a facility in $S_{0}$ , we have $\sum\limits_{j\in C^{\prime}\mid i\in F_{j}}d(i,j)\leq 2\rho(1+\delta/2)U$

Applying the algorithm guaranteed by the above lemma to $LP_{1}^{\prime}$ , we can obtain the $F_{j}$ -sets. Using these $F$ -balls, we proceed similarly as in §2. Thus, next we randomly discretize the distances to powers of $\tau>1$ (up to a random offset) to obtain $d^{\prime}(p,q)$ for all $p,q\in F\cup C$ . Again, the possible discretized distances are $L(-2)=-1,L(-1)=0,\dots,L(\ell)=\alpha\tau^{\ell}$ for all $\ell\in\mathbb{N}$ , and $d^{\prime}$ satisfies Lemma 2.2.

Then we define the radius levels and inner balls in the exact same way, so:

\ell_{j}=\min\limits_{\ell\geq-1}\{\ell\mid d^{\prime}(j,i)\leq L(\ell)\quad\forall i\in F_{j}\}

B_{j}=\{i\in F_{j}\mid d^{\prime}(j,i)\leq L(\ell_{j}-1)\}

To complete the data of $LP_{iter}$ for $\mathcal{I}^{\prime}$ , we need to define the sets $C_{part}$ , $C_{full}$ , and $C^{*}$ . Here we must slightly modify the construction of §2 to accommodate the set of pre-opened facilities, $S_{0}$ . To satisfy Extra Invariant 6.12(1), we create a set $C_{0}$ of dummy clients such that for each $i\in S_{0}$ , there exists a dummy client $j(i)\in C_{0}$ that is co-located with $i$ such that $F_{j(i)}$ has radius level $-1$ and consists of all co-located copies of $i$ . Thus, we define $C_{part}=C^{\prime}$ , $C_{full}=\emptyset$ , and $C^{*}=C^{0}$ .

This completes the description of $LP_{iter}$ for sub-instance $\mathcal{I}^{\prime}$ . To complete our algorithm, we output each $\mathcal{I}^{\prime}$ along with $LP_{iter}$ , $S_{0}$ , and $R$ .

To summarize, our algorithm is to first run the algorithm guaranteed by Theorem F.3 to obtain $n^{O(1/\rho)}$ -many sub-instances. For each sub-instance, we compute $R$ using Theorem F.4, construct $LP_{1}^{\prime}$ , construct the $F$ -balls using Lemma F.6, and define the rest of the data of $LP_{iter}$ as in §2. The runtime of our algorithm is immediate, so it suffices to show that one of the outputs has the desired properties.

In particular, we consider the sub-instance $\mathcal{I}^{\prime}=(F,C^{\prime}\subset C,d,m^{\prime}=m-\lvert C^{*}\setminus C^{\prime}\rvert,k,S_{0})$ output by the algorithm guaranteed by Theorem F.3 such that $\mathcal{I}^{\prime}$ is $(\rho,\delta,U)$ -sparse with respect to the solution $(S^{*},C^{*}\cap C^{\prime})$ and satisfies Equation 3. For the remainder of this section, we consider the $LP_{iter}$ constructed for this specific sub-instance. To complete the proof, we verify that $LP_{iter}$ satisfies the two desired properties.

Proposition F.7.

$LP_{iter}$ satisfies all Basic and Extra Invariants.

Proof.

It is easy to verify that $LP_{iter}$ satisfies all Basic Invariants by construction. For the Extra Invariants, we handle them one-by-one.

Extra Invariant 6.12(1) holds by construction of $LP_{iter}$ . To show Extra Invariant 6.12(2), we again apply Lemma F.6, which states that the $F$ -balls have the desired property.

To show Extra Invariant 6.12(3), we note that in Lemma F.6, the $F$ -balls are constructed such that $F_{j}\subset B_{F}(j,R_{j})$ , for all $j\in C^{\prime}$ . Thus, for all $j\in C^{\prime}$ and $i\in F_{j}$ , we have $d(i,j)\leq R_{j}$ . Then by definition of the radius levels, we have $L(\ell_{j})\leq\tau R_{j}$ , as required. Finally, Extra Invariant 6.12(4) follows from the guarantee of Theorem F.4. ∎

Proposition F.8.

$\frac{\log_{e}\tau}{(\tau-1)(1+\delta/2)}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})\leq U$

Proof.

We first show that $\mathbb{E}[Opt(LP_{iter})]\leq\frac{\tau-1}{\log_{e}\tau}Opt(LP_{1}^{\prime})$ . We have $C_{part}=C^{\prime}$ , $C_{full}=\emptyset$ , and $C^{*}=C_{0}$ . Note that the dummy clients in $C^{*}$ contribute zero to the objective of $LP_{iter}$ , because they are co-located with one unit of open facility in their $F$ -balls. Thus Lemma F.6 implies that there exists a feasible solution $\bar{y}$ to $LP_{iter}$ of cost at most $Opt(LP_{1}^{\prime})$ up to the discretization of the distances. The cost of discretization is bounded by Lemma 2.2, and immediately gives the extra $\frac{\tau-1}{\log_{e}\tau}$ -factor. The factor of $\frac{\tau-1}{\log_{e}\tau}$ is due to the cost of discretization, which is bounded by Lemma 2.2.

Now we relate $Opt(LP_{1}^{\prime})$ to $U^{\prime}=\sum\limits_{j\in C^{*}\cap C^{\prime}}d(j,S^{*})$ , which satisfies Equation 3 by the guarantees of Theorem F.3. Combining Equation 3 with Proposition F.5, we can obtain our final bound:

	$\displaystyle\frac{\log_{e}\tau}{(\tau-1)(1+\delta/2)}\mathbb{E}[Opt(LP_{iter})]+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})$	$\displaystyle\leq\frac{1}{1+\delta/2}Opt(LP_{1}^{\prime})+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})$
		$\displaystyle\leq\sum\limits_{j\in C^{}\cap C^{\prime}}d(j,S^{})+\frac{1-\delta}{1+\delta}\sum\limits_{j\in C^{*}\setminus C^{\prime}}d(j,S_{0})$
		$\displaystyle\leq U$

∎

	$\displaystyle\sum\limits_{j\in C^{\prime}_{close}}d(j,i^{*})$	$\displaystyle\leq(1+\gamma)\lvert C^{\prime}_{close}\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)\lvert B_{C}(j^{},\delta R_{j^{}})\rvert d(i,i^{*})$
		$\displaystyle\leq(1+\gamma)\frac{\rho U}{R_{j^{}}}d(i,i^{})$
		$\displaystyle\leq\frac{\rho U}{R_{j^{}}}O(1)R_{j^{}}=O(\rho)U$

Structural Iterative Rounding for Generalized kk-Median Problems

Abstract.

1. Introduction

1.1. Generalized kk-median

1.2. Our Results

1.3. Overview of Techniques

1.4. Organization

2. Auxiliary LP for Iterative Rounding

2.1. Defining FF-balls

Proposition 2.1.

Proposition 2.2.

2.2. Constructing L​Pi​t​e​rLP_{iter}

Proposition 2.3.

2.3. Properties of L​Pi​t​e​rLP_{iter}

Definition 2.4 (Basic Invariants).

Definition 2.5 (Intersection Graph).

Proposition 2.6.

Proposition 2.7.

Lemma 2.8.

2.4. Notations

3. Basic Iterative Rounding Phase

3.1. The Algorithm

Theorem 3.1.

4. Iterative Operation for Structured Extreme Points

4.1. Chain Decomposition

Definition 4.1 (Chain).

Theorem 4.2 (Chain Decomposition).

4.2. Iterative Operation for Chain Decompositions

Definition 4.3 (Candidate Configuration).

Lemma 4.4.

Theorem 4.5.

5. Pseudo-Approximation Algorithm for GKM

Theorem 5.1 (Pseudo-Approximation Algorithm for GKM).

Theorem 5.2.

5.1. Analysis of Re-Routing Cost

Theorem 5.3 (Re-Routing Cost).

Proposition 5.4.

Proof.

Lemma 5.5.

Proof.

Proof of Theorem 5.3.

5.2. Putting it all Together: Pseudo-Approximation for GKM

Proof of Theorem 5.1.

Proposition 5.6.

6. From Pseudo-Approximation to Approximation

Theorem 6.1 (Approximation Algorithm for Knapsack Median).

Theorem 6.2 (Approximation Algorithm for kk-Median with Outliers).

6.1. Overview

6.2. Approximation Algorithm for Knapsack Median

Definition 6.3 (Instance of Knapsack Median).

Definition 6.4 (Extra Invariants for Knapsack Median).

Theorem 6.5 (Pre-Processing for Knapsack Median).

Proposition 6.6.

Lemma 6.7.

Proposition 6.8.

Proof.

Lemma 6.9.

Proof.

Theorem 6.10.

Proof.

Proof of Theorem 6.1.

6.3. Approximation Algorithm for kk-Median with Outliers

Definition 6.11 (Instance of kk-Median with Outliers).

Definition 6.12 (Extra Invariants for kk-Median with Outliers).

Theorem 6.13 (Pre-Processing for kk-Median with Outliers).

Proposition 6.14.

Lemma 6.15.

Theorem 6.16.

7. Post-Processing for kk-Median with Outliers

7.1. Computing Partial Solutions

Theorem 7.1.

7.2. Recursive Post-Processing Algorithm

Lemma 7.2.

7.3. Analysis of OutliersPostProcess

Lemma 7.3.

Lemma 7.4.

Proof of Theorem 6.16.

7.4. Proof of 7.1

Proposition 7.5.

Proposition 7.6.

Structural Iterative Rounding for Generalized $k$ -Median Problems

1.1. Generalized $k$ -median

2.1. Defining $F$ -balls

2.2. Constructing $LP_{iter}$

2.3. Properties of $LP_{iter}$

Theorem 6.2 (Approximation Algorithm for $k$ -Median with Outliers).

6.3. Approximation Algorithm for $k$ -Median with Outliers

Definition 6.11 (Instance of $k$ -Median with Outliers).

Definition 6.12 (Extra Invariants for $k$ -Median with Outliers).

Theorem 6.13 (Pre-Processing for $k$ -Median with Outliers).

7. Post-Processing for $k$ -Median with Outliers

Appendix A Missing Proofs from §2: Construction of $LP_{iter}$

Appendix F Proof of Theorem 6.13: $k$ -Median with Outliers Pre-Processing

Definition F.1 (Extended Instance for $k$ -Median with Outliers).

Definition F.2 (Sparse Extended Instance for $k$ -Median with Outliers).