This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Abolfazl Asudeh 22institutetext: Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA
22email: [email protected]
33institutetext: Tanya Berger-Wolf 44institutetext: Department of Computer Science and Engineering, Ohio State University, Columbus, OH 43210, USA
44email: [email protected]
55institutetext: Bhaskar DasGupta (corresponding author) 66institutetext: Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA
Tel.: +312-355-1319
Fax: +312-413-0024
66email: [email protected]
77institutetext: Anastasios Sidiropoulos 88institutetext: Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA
88email: [email protected]

Maximizing coverage while ensuring fairness: a tale of conflicting objectives

Abolfazl Asudeh    Tanya Berger-Wolf    Bhaskar DasGupta    Anastasios Sidiropoulos
(Received: date / Accepted: date)
Abstract

Ensuring fairness in computational problems has emerged as a key topic during recent years, buoyed by considerations for equitable resource distributions and social justice. It is possible to incorporate fairness in computational problems from several perspectives, such as using optimization, game-theoretic or machine learning frameworks. In this paper we address the problem of incorporation of fairness from a combinatorial optimization perspective. We formulate a combinatorial optimization framework, suitable for analysis by researchers in approximation algorithms and related areas, that incorporates fairness in maximum coverage problems as an interplay between two conflicting objectives. Fairness is imposed in coverage by using coloring constraints that minimizes the discrepancies between number of elements of different colors covered by selected sets; this is in contrast to the usual discrepancy minimization problems studied extensively in the literature where (usually two) colors are not given a priori but need to be selected to minimize the maximum color discrepancy of each individual set. Our main results are a set of randomized and deterministic approximation algorithms that attempts to simultaneously approximate both fairness and coverage in this framework.

Keywords:
fairness maximum coverage combinatorial optimization approximation algorithms
journal:

1 Introduction

In this paper we introduce and analyze a combinatorial optimization framework capturing two conflicting objectives: optimize the main objective while trying to ensure that the selected solution is as fair as possible. We illustrate the framework with the following simple graph-theoretic illustration. Consider the graph GG of 1010 nodes and 1818 edges as shown in Fig. 1 where each edge is colored from one of χ=3\chi=3 colors (red, blue or green) representing three different attributes. Suppose that we want to select exactly k=3k=3 nodes that maximizes the number of edges they “cover” subject to the “fairness” constraint that the proportion of red, blue and green edges in the selected edges are the same. An optimal solution is shown in Fig. 1 by the solid black nodes u1,u2,u3u_{1},u_{2},u_{3} covering 66 edges; Fig. 1 also shows that the solution is quite different from what it would have been (the yellow corner nodes v1,v2,v3v_{1},v_{2},v_{3} covering 1111 edges) if the fairness constraint was absent. A simple consequence of the analysis of our algorithms for a more general setting is that, assuming that there exists at least one feasible solution and assuming kk is large enough, we can find a randomized solution to this fair coverage problem for graphs where we select exactly kk nodes, cover at least 63%63\% of the optimal number of edges on an average and, for every pair of colors, with high probability the ratio of the number of edges of these two colors among the selected edges is O(1)O(1).

Refer to caption

Figure 1: A simple illustration of fairness in maximum coverage problems for graphs.

In this paper we consider this type of problem in more general settings. Of course, in the example in Fig. 1 (and in general) there is nothing special about requiring that the proportion of red, blue and green edges in the covered edges should be exactly equal as opposed to a pre-specified unequal proportion. For example, we may also require that the proportion of edges of different colors in our solution should mimic that in the entire graph, i.e., in Fig. 1 among the covered edges the proportion of red, blue and green edges should be q1,q2q_{1},q_{2} and q3q_{3} where q1=1/6q_{1}=\nicefrac{{1}}{{6}}, q2=1/3q_{2}=\nicefrac{{1}}{{3}}, and q3=1/2q_{3}=\nicefrac{{1}}{{2}}. Our algorithms will work with easy modifications for any constant values of q1q_{1}, q2q_{2} and q3q_{3}.

1.1 Different research perspectives in ensuring fairness

Theoretical investigations of ensuring fairness in computation can be pursued from many perspectives. We briefly comment on a few of them.

One line of research dealing with the goal of ensuring fairness uses the optimization framework, i.e., we model the problem as an optimization problem with precisely defined fairness constraints. This is a common framework used by researchers in combinatorial and graph-theoretic algorithms, such as research works that involve designing exact or approximation algorithms, investigating fixed-parameter tractability issues or proving inapproximability results. In this paper we use such a framework. Fairness is imposed in coverage by using coloring constraints that minimizes the discrepancies between different colors among elements covered by selected sets; this is in contrast to the usual discrepancy minimization problems studied extensively in the literature C00 where the (usually two) colors are not given a priori but need to be selected to minimize the maximum color discrepancy of each individual set.

A second line of research dealing with fairness involves machine learning frameworks. Even though it is a relatively new research area, there is already a large body of research dealing with ensuring fairness in machine learning algorithms by preprocessing the data used in the algorithms, optimization of statistical outcomes with appropriate fairness criteria and metric during the training, or by post-processing the answers of the algorithms Ze13 ; HPS16 ; ZLM18 .

A third line of research dealing with fairness involves game theoretic frameworks. For example, developments of solutions for fair ways of sharing transferable utilities in cooperative game-theoretic environments have given rise to interesting concepts such as Shapley values and Rabin’s fairness model. We refer the reader to the excellent textbook in algorithmic game theory by Nisan et al. NRTV07 for further details on these research topics.

Yet another more recent line of research dealing with fairness involve applying fairness criteria in the context of clustering of points in a metric space under kk-means objective, kk-median or other p\ell_{p}-norm objectives BCFN19 ; CKLV17 . The assumption of an underlying metric space allows the development of efficient algorithms for these frameworks.

2 Fair maximum coverage: notations, definitions and related concepts

The Fair Maximum Coverage problem with χ\chi colors is defined as follows. We are given an universe 𝒰={u1,,un}\mathcal{U}=\{u_{1},\dots,u_{n}\} of nn elements, a weight function w:𝒰w:\mathcal{U}\mapsto{\mathbb{R}} assigning a non-negative weight to every element, a color function 𝒞:𝒰{1,,χ}\mathcal{C}:\mathcal{U}\mapsto\{1,\dots,\chi\} assigning a color to every element, a collection of mm sets 𝒮1,,𝒮m𝒰\mathcal{S}_{1},\dots,\mathcal{S}_{m}\subseteq\mathcal{U}, and a positive integer kk. A collection of kk distinct subsets, say 𝒮i1,,𝒮ik\mathcal{S}_{i_{1}},\dots,\mathcal{S}_{i_{k}}, with the set of “covered” elements j=1k𝒮ij\bigcup_{j=1}^{k}\mathcal{S}_{i_{j}} containing pip_{i} elements of color ii is considered a valid solution111For a more general version of the problem we are given χ\chi “color-proportionality constants” q1,,qχ(0,1]q_{1},\dots,q_{\chi}\in(0,1] with q1++qχ=1q_{1}+\dots+q_{\chi}=1, and a valid solution must satisfy pi/pj=qi/qj\nicefrac{{p_{i}}}{{p_{j}}}=\nicefrac{{q_{i}}}{{q_{j}}} for all ii and jj. As we mentioned already, with suitable modifications our algorithms will work with similar asymptotic performance guarantee for any constant values of q1,,qχq_{1},\dots,q_{\chi}, but to simplify exposition we will assume the simple requirement of q1==qχq_{1}=\dots=q_{\chi} in the sequel. provided pi=pjp_{i}=p_{j} for all ii and jj. The objective is to maximize the sum of weights of the covered elements. More explicitly, our problem is defined as follows:

Problem name: Fair Maximum Coverage with χ\chi colors (Fmc(χ,k\chi,k))
Input: \bullet universe 𝒰={u1,,un}\mathcal{U}=\{u_{1},\dots,u_{n}\}
\bullet (element) weight function w:𝒰+{0}w:\mathcal{U}\mapsto{\mathbb{R}}^{+}\cup\{0\}
\bullet (element) color function 𝒞:𝒰{1,,χ}\mathcal{C}:\mathcal{U}\mapsto\{1,\dots,\chi\}
\bullet sets 𝒮1,,𝒮m𝒰\mathcal{S}_{1},\dots,\mathcal{S}_{m}\subseteq\mathcal{U}
\bullet integer k>0k>0
Valid solution: collection of kk distinct subsets 𝒮i1,,𝒮ik\mathcal{S}_{i_{1}},\dots,\mathcal{S}_{i_{k}} satsifying
i,j{1,,χ}:\forall i,j\in\{1,\dots,\chi\}:
              pi=def|{u|uj=1k𝒮ij and 𝒞(u)=i}|p_{i}\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\big{|}\,\{u_{\ell}\,|\,u_{\ell}\in\bigcup_{j=1}^{k}\mathcal{S}_{i_{j}}\text{ and }\mathcal{C}(u_{\ell})=i\}\,\big{|}
              ==
              pj=def|{u|uj=1k𝒮ij and 𝒞(u)=j}|p_{j}\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\big{|}\,\{u_{\ell}\,|\,u_{\ell}\in\bigcup_{j=1}^{k}\mathcal{S}_{i_{j}}\text{ and }\mathcal{C}(u_{\ell})=j\}\,\big{|}
Objective: maximize uj=1k𝒮ijw(u)\sum_{u_{\ell}\in\bigcup_{j=1}^{k}\mathcal{S}_{i_{j}}}w(u_{\ell})

We denote Fmc(χ,k\chi,k) by just Fmc when χ\chi and kk are clear from the context. In the sequel, we will distinguish between the following two versions of the problem:

  1. (i)

    unweighted Fmc in which w(u)=1w(u_{\ell})=1 for all {1,,n}\ell\in\{1,\dots,n\} and thus the objective is to maximize the number of elements covered, and

  2. (ii)

    weighted Fmc in which w(u)0w(u_{\ell})\geq 0 for all {1,,n}\ell\in\{1,\dots,n\}.

For the purpose of stating and analyzing algorithmic performances, we define the following notations and natural parameters associated with an instance of Fmc(χ,k\chi,k):

  1. \triangleright

    a{2,3,,n}a\in\{2,3,\dots,n\} denotes the maximum of the cardinalities (number of elements) of all sets.

  2. \triangleright

    f{1,2,,m}f\in\{1,2,\dots,m\} denotes the maximum of the frequencies of all elements, where the frequency of an element is the number of sets in which it belongs.

  3. \triangleright

    𝖮𝖯𝖳{\mathsf{OPT}} denotes the optimal objective value of the given instance of Fmc.

  4. \triangleright

    𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} denotes the number of covered elements in an optimal solution of the given instance of Fmc. For weighted Fmc, if there are multiple optimal solutions then 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} will the maximum number of elements covered among these optimal solutions. Note that 𝖮𝖯𝖳=𝖮𝖯𝖳#{\mathsf{OPT}}={\mathsf{OPT}}_{\#} for unweighted Fmc. The reason we need to consider 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} separately from 𝖮𝖯𝖳{\mathsf{OPT}} for weighted Fmc is because the coloring constraints are tied to 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} whereas the optimization objective is tied to 𝖮𝖯𝖳{\mathsf{OPT}}.

  5. \triangleright

    The performance ratios of many of our algorithms are expressed using the function ϱ()\varrho(\cdot):

    ϱ(x)=def(11/x)x{\varrho(x)\stackrel{{\scriptstyle\mathrm{def}}}{{=}}{\big{(}1-\nicefrac{{1}}{{x}}\big{)}}^{x}}

    Note that ϱ(x)<ϱ(y)\varrho(x)<\varrho(y) for x>y>0x>y>0 and ϱ(x)>1𝐞1\varrho(x)>1-{\mathbf{e}}^{-1} for all x>0x>0.

For 𝖭𝖯\mathsf{NP}-completeness results, if the problem is trivially in 𝖭𝖯\mathsf{NP} then we will not mention it. To analyze our algorithms in this paper, we have used several standard mathematical equalities or inequalities which are listed explicitly below for the convenience of the reader:

x[0,1]:𝐞x1x\displaystyle\forall\,x\in[0,1]\,:\,{\mathbf{e}}^{-x}\geq 1-x (1)
x:𝐞x=1x+(x2/2)𝐞ξ for some ξ[0,x]\displaystyle\forall\,x\,:\,{\mathbf{e}}^{-x}=1-x+({x^{2}}/{2}){\mathbf{e}}^{-\xi}\text{ for some }\xi\in[0,x] (2)
α1,,αq0:(1qj=1qαj)qj=1qαj\displaystyle\textstyle\forall\,\alpha_{1},\dots,\alpha_{q}\geq 0\,:\,\left(\frac{1}{q}{\sum_{j=1}^{q}\alpha_{j}}\right)^{\!q}\geq\prod_{j=1}^{q}\alpha_{j} (3)
x[0,1]y1: 1(1xy)y(1(11y)y)x\displaystyle\textstyle\forall\,x\in[0,1]\,\,\forall\,y\geq 1\,:\,1-\left(1-\frac{x}{y}\right)^{y}\geq\left(1-\left(1-\frac{1}{y}\right)^{y}\right)x (4)

2.1 Three special cases of the general version of Fmc

In this subsection we state three important special cases of the general framework of Fmc.

Fair maximum kk-node coverage or Node-fmc

This captures the scenario posed by the example in Fig. 1. We are given a connected undirected edge-weighted graph G=(V,E)G=(V,E) where w(e)0w(e)\geq 0 denotes the weight assigned to edge eEe\in E, a color function 𝒞:E{1,,χ}\mathcal{C}:E\mapsto\{1,\dots,\chi\} assigning a color to every edge, and a positive integer kk. A node vv is said to cover an edge ee if ee is incident on vv. A collection of kk nodes vi1,,vikv_{i_{1}},\dots,v_{i_{k}} covering pip_{i} edges of color ii for each ii is considered a valid solution provided pi=pjp_{i}=p_{j} for all ii and jj. The objective is to maximize the sum of weights of the covered edges. It can be easily seen that this is a special case of Fmc by using the standard translation from node cover to set cover, i.e., the edges are the set of elements, and corresponding to every node vv there is a set containing the edges incident on vv. Note that for this special case f=2f=2 and aa is equal to the maximum node-degree in the graph.

Segregated Fmc or Segr-fmc

Segregated Fmc is the special case of Fmc when all the elements in any set have the same color, i.e.,

j{1,,m}up,uq𝒮j:𝒞(up)=𝒞(uq)\forall\,j\in\{1,\dots,m\}\,\forall\,u_{p},u_{q}\in\mathcal{S}_{j}\,:\,\mathcal{C}(u_{p})=\mathcal{C}(u_{q})

Another equivalent way of describing Segr-fmc is as follows. Let CjC_{j} is the set of all elements colored jj for j{1,,χ}j\in\{1,\dotsc,\chi\} in the given instance of Segr-fmc. Let the notation 2A2^{A} denote the power set for any set AA. Then, Segr-fmc is the special case when 𝒮jj=1χ2Cj\mathcal{S}_{j}\in\bigcup_{j=1}^{\chi}2^{C_{j}} holds for all j{1,,m}j\in\{1,\dots,m\}. A simple example of an instance of Segr-fmc with n=6n=6, m=10m=10 and χ=2\chi=2 is shown below:

𝒰={u1,u2,u3,u4,u5,u6}\displaystyle\mathcal{U}=\{u_{1},u_{2},u_{3},u_{4},u_{5},u_{6}\}
𝒞(u1)=𝒞(u2)=𝒞(u3)=𝒞(u4)=1,𝒞(u5)=𝒞(u6))=2\displaystyle\mathcal{C}(u_{1})=\mathcal{C}(u_{2})=\mathcal{C}(u_{3})=\mathcal{C}(u_{4})=1,\,\,\,\mathcal{C}(u_{5})=\mathcal{C}(u_{6}))=2
𝒮1={u1,u2,u3},𝒮2={u2,u3,u4},𝒮3={u5,u6},𝒮4={u6}\displaystyle\mathcal{S}_{1}=\{u_{1},u_{2},u_{3}\},\,\,\,\mathcal{S}_{2}=\{u_{2},u_{3},u_{4}\},\,\,\,\mathcal{S}_{3}=\{u_{5},u_{6}\},\,\,\,\mathcal{S}_{4}=\{u_{6}\}
w(u1)=w(u2)=7,,w(u3)=9,,w(u4)=w(u5)=w(u6)=1\displaystyle w(u_{1})=w(u_{2})=7,,\,\,\,w(u_{3})=9,,\,\,\,w(u_{4})=w(u_{5})=w(u_{6})=1

Even though computing an exact solution of Segr-fmc is still 𝖭𝖯\mathsf{NP}-complete, it is much easier to approximate (see Section 11.1). From our application point of view as discussed in Section 3, this may for example model cases in which city neighborhoods are segregated in some manner, e.g., racially or based on income.

Δ\Delta-balanced Fmc or Δ\Delta-bal-fmc

Δ\Delta-balanced Fmc is the special case of Fmc when the number of elements of each color in a set are within an additive range of Δ\Delta, i.e.,

j{1,,m}p{1,,χ}:max{1,|𝒮j|χΔ}|u|(u𝒮j)(𝒞(u)=p)||𝒮j|χ+Δ\forall\,j\in\{1,\dots,m\}\,\forall\,p\in\{1,\dots,\chi\}\,:\,\\ \max\left\{1,\,\left\lfloor\frac{|\mathcal{S}_{j}|}{\chi}\right\rfloor-\Delta\right\}\leq\left|u_{\ell}\,|\,(u_{\ell}\in\mathcal{S}_{j})\wedge(\mathcal{C}(u_{\ell})=p)\right|\leq\left\lceil\frac{|\mathcal{S}_{j}|}{\chi}\right\rceil+\Delta

Similar to Segr-fmc, it is much easier to approximate Δ\Delta-bal-fmc for small Δ\Delta (see Section 11.2).

Geometric Fmc or Geom-fmc

In this unweighted “geometric” version of Fmc, the elements are points in [0,Δ]d[0,\Delta]^{d} for some Δ\Delta and some constant d2d\geq 2, the sets are unit radius balls in d{\mathbb{R}}^{d}, and the distributions of points of different colors are given by χ\chi Lipschitz-bounded measures. More precisely, the distribution of points of color ii is given by a probability measure μi\mu_{i} supported on [0,Δ]d[0,\Delta]^{d} with a CC-Lipschitz density function222A function f:Δf:\Delta\mapsto{\mathbb{R}} for some subset Δ\Delta of real numbers is CC-Lipschitz provided |f(x)f(y)|C|xy||f(x)-f(y)|\leq C\,|x-y| for all real numbers x,yΔx,y\in\Delta. for some C>0C>0 that is upper-bounded by 11. Given a set of kk unit balls 1,,kd\mathcal{B}_{1},\dots,\mathcal{B}_{k}\subset{\mathbb{R}}^{d}, the number of points pip_{i} of color ii covered by these balls is given by μi(i=1ki)\mu_{i}\big{(}\bigcup_{i=1}^{k}\mathcal{B}_{i}\big{)}, and the total number of points covered by these balls is given by i=1χμi(i=1ki)\sum_{i=1}^{\chi}\mu_{i}\big{(}\bigcup_{i=1}^{k}\mathcal{B}_{i}\big{)}. This variant has an almost optimal polynomial-time approximation algorithm for fixed dd and under some mild assumption on 𝖮𝖯𝖳{\mathsf{OPT}} (see Section 12).

3 Sketch of application scenarios

Fmc and its variants are core abstractions of many data-driven societal domain applications. We present three diverse categories of application and highlight the real-world fairness issues addressed by our problem formulations (leaving other applications in the cited references).

Service/Facility Allocation

One of the most common data based policy decisions is assigning services/facilities across different places, e.g., placing schools ChicagoSchoolClosure , bus stops, or police/fire stations, choosing a few hospitals for specific medical facilities or services, or deciding where to put cell-phone towers. Of course, a major objective in such assignments is to serve the maximum number of people (i.e., maximize the coverage). Unfortunately, historical discriminations, such as redlining redlining , through their long drawn-out effects of manifestations in different aspects of public policy are still hurting the minorities. As a result, blindly optimizing for maximum coverage biases the assignment against equitable distribution of services. Below are examples of two real cases that further underline the importance of fairness while maximizing coverage:

  1. \triangleright

    Bike sharing: As more and more cities adopt advanced transportation systems such as bike-sharing, concerns such as equity and fairness arise with them yan2019fairness . For instance, according to BikeShare1 the bike-sharing network at NYC neglects many low-income neighborhoods and communities of color while giving the priority to well-to-do neighborhoods. Here the location of bike stations (or bikes) determines the set of people that will have access to the service, perpetuating the unhealthy cycle of lack of transportation, movement, etc.

  2. \triangleright

    Delivery services for online shopping: Online shopping has by now gained a major share of the shopping market. Platforms such as Amazon provide services such as same-day delivery to make e-shopping even more convenient to their customers. While Amazon’s main aim is to maximize the number of customers covered by this service, by not considering fairness it demonstrably failed to provide such services for predominantly black communities AmazonDeliveryBias1 ; AmazonDeliveryBias2 .

Data Integration

Combining multiple data sources to augment the power of any individual data source is a popular method for data collection. Naturally the main objective of data integration is to collect (“cover”) a maximum number of data points. However, failing to include an adequate number of instances from minorities, known as population bias, in datasets used for training machine learning models is a major reason for model unfairness olteanu2019social ; asudeh2019assessing . For example, image recognition and motion detection services by Google google-gorilla and HP hp1 with a reasonable overall performance failed to tag/detect African Americans since their training datasets did not include enough instances from this minority group. While solely optimizing for coverage may result in biased datasets, considering fairness for integration may help remove population bias.

Targeted advertisement

Targeted advertising is popular in social media. Consider a company that wants to target its “potential customers”. To do so, the company needs to select a set of features (such as “single” or “college student”) that specify the groups of users to be targeted. Of course, the company wants to maximize coverage over its customers. However, solely optimizing for coverage may result in incidents such as racism in the Facebook advertisements AdRacism or sexism in the job advertisements adSexism . Thus, a desirable goal for the company would be to select the keywords such that it provides fair coverage over users of diverse demographic groups.

4 Review of prior related works

To the best of our knowledge, Fmc in its full generalities has not been separately investigated before. However, there are several prior lines of research that conceptually intersect with Fmc.

Maximum kk-set coverage and kk-node coverage problems

The maximum kk-set coverage and kk-node coverage problems are the same as the Fmc and Node-fmc problems, respectively, without element colors and without coloring constraints. These problems have been extensively studied in the algorithmic literature, e.g., see AS04 ; F98 ; Ho97 for kk-set coverage and GLL18a ; GLL18b ; GNW07 ; M18 ; FL01 ; AS14 ; HYZZ02 for kk-node coverage. A summary of these results are as follows:

kk-set coverage:

The best approximation algorithm for kk-set coverage is a deterministic algorithm that has an approximation ratio of max{ϱ(f),ϱ(k)}>11/𝐞\max\big{\{}\varrho(f),\,\varrho(k)\big{\}}>1-\nicefrac{{1}}{{{\mathbf{e}}}} AS04 ; Ho97 . On the inapproximability side, assuming 𝖯𝖭𝖯\mathsf{P}\!\neq\!\mathsf{NP} an asymptotically optimal inapproximability ratio of 11/𝐞+ε1-\nicefrac{{1}}{{{\mathbf{e}}}}+\varepsilon (for any ε>0\varepsilon>0) is known for any polynomial-time algorithm F98 .

kk-node coverage:

The best approximation for kk-node coverage is a randomized algorithm that has an approximation ratio of 0.75040.7504 with high probability FL01 ; HYZZ02 . On the inapproximability side, kk-node coverage is 𝖭𝖯\mathsf{NP}-complete even for bipartite graphs AS14 , and cannot be approximated within a ratio of 1ε1-\varepsilon for some (small) constant ε>0\varepsilon>0 L98 ; P94 . More recently, Manurangsi M18 provided an semidefinite programming based approximation algorithm with an approximation ratio of 0.920.92, and Austrin and Stankovic AS19 used the results in AKS11 to provide an almost matching upper bound of 0.929+ε0.929+\varepsilon (for any ε>0\varepsilon>0) on the approximation ratio of any polynomial time algorithm assuming the unique games conjecture is true. There is also a significant body of prior research on the fixed parameter tractability issues for the kk-node coverage problem: for example, kk-node coverage is unlikely to allow an FPT algorithm as it is W[1]W[1]-hard GNW07 , but Marx designed an FPT approximation scheme in M08 whose running times were subsequently improved in Gupta, Lee and Li in GLL18a ; GLL18b .

However, the coloring constraints make Fmc fundamentally different from the maximum set or node coverage problems. Below we point out some of the significant aspects of these differences. For comparison purposes, for an instance of Fmc let 𝖮𝖯𝖳coverage{\mathsf{OPT}}_{\mathrm{coverage}} denote the objective value of an optimal solution for the corresponding maximum kk-set coverage problem for this instance by ignoring element colors and coloring constraints.

Existence of a feasible solution:

For the maximum kk-set coverage problem, a feasible solution trivially exists for any kk. However, a valid solution for Fmc(χ,k\chi,k) may not exist for some or all kk even if χ=2\chi=2 and in fact our results (Lemma 1) show that even deciding if there exists a valid solution is 𝖭𝖯\mathsf{NP}-complete. The 𝖭𝖯\mathsf{NP}-completeness result holds even if f=1f=1 (i.e., the sets are mutually disjoint); note that if f=1f=1 then it is trivial to compute an optimal solution to the maximum kk-set coverage problem. That is why for algorithmic purposes we will assume the existence of at least one feasible solution333Actually, our 𝖫𝖯{\mathsf{LP}}-relaxation based algorithms require only the existence of a feasible fractional solution but we cannot say anything about the approximation ratio in the absence of a feasible integral solution. and for showing computational hardness results we will show the existence of at least one trivial feasible solution.

Number of covered elements:

The number of covered elements and the corresponding selected sets in an optimal solution in Fmc can differ vastly from that in the maximum kk-set coverage problem on the same instance. The reason for the discrepancy is because in Fmc one may need to select fewer covered elements to satisfy the coloring constraints.

Exactly kk sets vs. at most kk sets:

For the maximum kk-set coverage problem any solution trivially can use exactly kk sets and therefore there is no change to the solution space whether the problem formulation requires exactly kk sets or at most kk sets. However, the corresponding situation for Fmc is different since it may be non-trivial to convert a feasible solution containing k<kk^{\prime}<k sets to one containing exactly kk sets because of the coloring constraints.

Discrepancy minimization problems

Informally, the discrepancy minimization problem for set systems (Min-disc) is orthogonal to unweighted Fmc. Often Min-disc is studied in the context of two colors, say red and blue, and is defined as follows. Like unweighted Fmc we are given mm sets over nn elements. However, unlike Fmc element colors are not given a priori but the goal to color every element red or blue to minimize the maximum discrepancy over all sets, where the discrepancy of a set is the absolute difference of the number of red and blue elements it contains. The Beck-Fiala theorem BF81 shows that the discrepancy of any set system is at most 2f2f, Spencer showed in S85 that the discrepancy of any set system is O(nlog(2m/n))O(\sqrt{n\log(2m/n)}\,), Bansal provided a randomized polynomial time algorithm achieving Spencer’s bound in B10 , and a deterministic algorithm with similar bounds were provided in LRR17 . On the lower bound side, it is possible to construct set systems such that the discrepancy is Ω(n)\Omega(\sqrt{n}\,) C00 . For generalization of the formulation to more than two colors and corresponding results, see for example D02 ; DS99 ; S03 .

Maximization of non-decreasing submodular set functions with linear inequality constraints

Kulik et al. in KST09 provided approximation algorithms for maximizing a non-decreasing submodular set function subject to multiple linear inequality constraints over the elements. Unfortunately, because the linear constraints in Segr-fmc are equality constraints, Segr-fmc cannot be put in the framework of KST09 and the approximation algorithms in KST09 do not directly apply to Segr-fmc.

5 Summary of our contribution

5.1 Feasibility hardness results

Obviously Fmc (resp., Node-fmc) obeys all the inapproximability results for the maximum kk-set coverage (resp., kk-node coverage) problem. We show in Lemma 1 that determining feasibility of Fmc instances is 𝖭𝖯\mathsf{NP}-complete even under very restricted parameter values; the proofs cover (or can be easily modified to cover) all the spacial cases of Fmc investigated in this paper. However, our subsequent algorithmic results show that even the existence of one feasible solution gives rise to non-trivial approximation bounds for the objective and the coloring constraints.

5.2 Algorithmic results

A summary of our algorithmic results is shown in Table 1. Based on the discussion in the previous section, all of our algorithms assume that at least one feasible solution for the Fmc instance exists.

coloring constraints
approximation
parameter and other restrictions
(if any)
theorem
problem
name
algorithm
name
& type
approximation
ratio
ε\varepsilon-approximation
ε=\varepsilon=
randomized
ε\varepsilon-approximation
ε=\varepsilon=
strong
randomized
ε\varepsilon-approximation
ε=\varepsilon=
𝖮𝖯𝖳#{\mathsf{OPT}_{\#}} χ\chi other
Fmc
Alg-large-opt#
𝒜𝔩𝔤{\mathcal{R}\mathcal{A}\mathfrak{l}\mathfrak{g}}
ϱ(f)\varrho(f) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} 3.16f3.16f O(f)O(f) Ω(χnlogχ)\Omega(\chi\sqrt{n}\log\chi)\,\,\,\, N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} Theorem 9.1
Alg-medium-opt#
𝒜𝔩𝔤{\mathcal{R}\mathcal{A}\mathfrak{l}\mathfrak{g}}
ϱ(f)\varrho(f) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} 3.16f3.16f O(f2)O(f^{2})
Ω(aχlogχ)\Omega(a\chi\log\chi)
and
O(χnlogχ)O(\chi\sqrt{n\log\chi})
N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} Theorem 9.1
Alg-small-opt#
𝒜𝔩𝔤{\mathcal{R}\mathcal{A}\mathfrak{l}\mathfrak{g}}
ϱ(f)\varrho(f) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} 3.16f3.16f O(f2aχ𝖮𝖯𝖳#)O(f^{2}\sqrt{a\,\chi{\mathsf{OPT}}_{\#}}\,) O(max{1,lognlogm})O(\max\{1,\,\frac{\log n}{\log m}\}) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} Theorem 9.1
Alg-iter-round 𝒟𝒜𝔩𝔤{\mathcal{D}\mathcal{A}\mathfrak{l}\mathfrak{g}} 1/f\nicefrac{{1}}{{f}} O(f2)O(f^{2}) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} O(1)O(1)
at most
k+χ12k+\frac{\chi-1}{2} sets
Theorem 10.2
1/f\nicefrac{{1}}{{f}} O(f2+χ2f)O(f^{2}+\chi^{2}f) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}}
at most
k+χ1k+\chi-1 sets
Theorem 10.2
Node-fmc Alg-iter-round 𝒟𝒜𝔩𝔤{\mathcal{D}\mathcal{A}\mathfrak{l}\mathfrak{g}} 1/2\nicefrac{{1}}{{2}} 4+4χ4+4\chi N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} O(1)O(1)
at most
k+χ12k+\frac{\chi-1}{2} sets
Theorem 10.1
1/2\nicefrac{{1}}{{2}} 4+2χ+4χ24+2\chi+4\chi^{2} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}}
at most
k+χ1k+\chi-1 sets
Theorem 10.1
Other results: same as Fmc with f=2f=2
Segr-fmc
Alg-greed-plus
𝒟𝒜𝔩𝔤{\mathcal{D}\mathcal{A}\mathfrak{l}\mathfrak{g}}
ϱ\varrho 22 N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}}
at most
kk sets
Theorem 11.1
Δ\Delta-bal-fmc
Alg-greedy
𝒟𝒜𝔩𝔤{\mathcal{D}\mathcal{A}\mathfrak{l}\mathfrak{g}}
ϱ\varrho O(Δf)O(\Delta f) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} Proposition 2
Geom-fmc
Alg-geom
𝒜𝔩𝔤{\mathcal{R}\mathcal{A}\mathfrak{l}\mathfrak{g}}
1O(δ)1-O(\delta) N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}} 1+δ1+\delta
d=O(1)d=O(1)
𝖮𝖯𝖳χη\frac{{\mathsf{OPT}}}{\chi}\geq\eta
Theorem 12.1
𝒟𝒜𝔩𝔤{\mathcal{D}\mathcal{A}\mathfrak{l}\mathfrak{g}}: deterministic algorithm         𝒜𝔩𝔤{\mathcal{R}\mathcal{A}\mathfrak{l}\mathfrak{g}}: randomized algorithm
N/A\nicefrac{{\mathrm{N}}}{{\mathrm{A}}}: not applicable         —: unrestricted
ϱ(x)=(11/x)x>11/𝐞{\varrho(x)={\big{(}1-\nicefrac{{1}}{{x}}\big{)}}^{x}}>1-\nicefrac{{1}}{{{\mathbf{e}}}}         ϱ=max{ϱ(f),ϱ(k)}>11/𝐞\varrho=\max\{\varrho(f),\,\varrho(k)\}>1-\nicefrac{{1}}{{{\mathbf{e}}}}
δ\delta: any constant in the range (0,1]         η\eta: any constant strictly greater than 11
Table 1: A summary of our algorithmic results. The O()O(\cdot) notation is used when constants are irrelevant or not precisely calculated. Precise definitions of (deterministic) ε\varepsilon-approximation, randomized ε\varepsilon-approximation and strong randomized ε\varepsilon-approximation appear in Section 8.

5.3 Remarks on proof techniques

Distributions on level sets with negative correlations

Our randomized algorithms use the sampling result by Srinivasan S01 which allows one to sample variables satisfying an equality precisely while still ensuring that the variables are negatively correlated and therefore the tail bounds by Panconesi and Srinivasan PS97 can be applied. This allows the randomized algorithms in Theorem 9.1 to select precisely kk sets while still preserving the properties of the distribution of variables that are needed for the proof.

Strengthening 𝖫𝖯{\mathsf{LP}}-relaxation via additional inequalities

As we show in Section 9.2, a straightforward 𝖫𝖯{\mathsf{LP}}-relaxation of Fmc based on a corresponding known 𝖫𝖯{\mathsf{LP}}-relaxation of maximum kk-set coverage problems does not have an finite integrality gap and therefore unsuitable for further analysis. To get around this, we use an approach similar to what was used by prior researchers (e.g., see the works by Carnes and Shmoys CD08 and Carr et al. CFLP00 ) by introducing extra O(fn)O(fn) covering inequalities which brings down the integrality gap and allows the results in Theorem 9.1 to go through.

Moreover, we had to separately modify existing constraints of or add new constraints to the basic 𝖫𝖯{\mathsf{LP}}-relaxation for the three algorithms, namely algorithms Alg-small-opt#, Alg-medium-opt# and Alg-iter-round. Modifications for Alg-small-opt# and Alg-medium-opt# in Theorem 9.1 are done to encode the coloring constraints suitably to optimize their coloring constraint approximation bounds for the corresponding parameter ranges. The modifications for Alg-iter-round in Theorem 10.1 and Theorem 10.2 are necessary for the iterated rounding approach to go through.

Doob martingales and Azuma’s inequality

The analysis of the rounding step of our various 𝖫𝖯{\mathsf{LP}}-relaxations are further complicated by the fact that the random element-selection variables may not be pairwise independent; in fact, it is easy to construct examples in which each element-selection variable may be correlated to about afaf other element-selection variables, thereby ruling out straightforward use of Chernoff-type tail bounds. For sufficient large 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#}, we remedy this situation by using Doob martinagales and Azuma’s inequality in the analysis of Alg-large-opt# in Theorem 9.1.

Iterated rounding of 𝖫𝖯{\mathsf{LP}}-relaxation

The analysis of our deterministic algorithm Alg-iter-round uses the iterated rounding approach originally introduced by Jain in J01 and subsequently used by many researchers (the book by Lau, Ravi and Singh LRS11 provides an excellent overview of the topic). A crucial ingredient of this technique used in our proof is the rank lemma. In order to use this technique, we had to modify the 𝖫𝖯{\mathsf{LP}}-relaxation again. When χ=O(1)\chi=O(1), we can do two exhaustive enumeration steps in polynomial time, giving rise to a somewhat better approximation of the coloring constraints.

Random shifting technique

The analysis of our deterministic algorithm Alg-geom uses the random shifting technique that has been used by prior researchers such as A98 ; M99 .

6 Organization of the paper and proof structures

The rest of the paper is organized as follows.

  1. \triangleright

    In Section 7 we present our result in Lemma 1 on the computational hardness of finding a feasible solution of Fmc.

  2. \triangleright

    Based on the results in Section 7, we need to make some minimal assumptions and need to consider appropriate approximate variants of the coloring constraints. They are discussed in Section 8 for the purpose of designing (deterministic or randomized) approximation algorithms.

  3. \triangleright

    In Section 9 we design and analyze our 𝖫𝖯{\mathsf{LP}}-relaxation based randomized approximation algorithms for Fmc. In particular, in Theorem 9.1 we employ two different 𝖫𝖯{\mathsf{LP}}-relaxation of Fmc and combine three randomized rounding analysis on them to get an approximation algorithm whose approximation qualities depend on the range of relevant parameters.

    1. \triangleright

      Parts of the algorithm and analysis specific to the three algorithms Alg-large-opt#, Alg-medium-opt# and Alg-small-opt# are discussed in Section 9.4, Section 9.5 and Section 9.6, respectively.

    2. \triangleright

      Proposition 1 in Section 9.7 shows that the dependence of the coloring constraint bounds in Theorem 9.1(e)(i)–(ii) on ff cannot be completely eliminated by better analysis of our 𝖫𝖯{\mathsf{LP}}-relaxations even for χ=2\chi=2.

  4. \triangleright

    In Section 10 we provide polynomial-time deterministic approximations of Fmc via iterated rounding of a new 𝖫𝖯{\mathsf{LP}}-relaxation. Our approximation qualities depend on the parameters ff and χ\chi.

    1. \triangleright

      For better understanding, we first prove our result for the special case Node-fmc of Fmc in Theorem 10.1 (Section 10.1) and later on describe how to adopt the same approach for Fmc in Theorem 10.2 (Section 10.2).

    2. \triangleright

      The proofs for both Theorem 10.1 and Theorem 10.2 are themselves divided into two parts depending on whether χ=O(1)\chi=O(1) or not.

  5. \triangleright

    In Section 11 we provide deterministic approximation algorithms for two special cases of Fmc, namely Segr-fmc and Δ\Delta-bal-fmc.

  6. \triangleright

    Section 12 provides the deterministic approximation for Geom-fmc.

Our proofs are structured as follows. A complex proof is divided into subsections corresponding to logical sub-divisions of the proofs and the algorithms therein. Often we provide some informal intuitions behind the proofs (including some intuition about why other approaches may not work, if appropriate) before describing the actual proofs.

7 Computational hardness of finding a feasible solution of Fmc

We show that determining if a given instance of Fmc has even one feasible solution is 𝖭𝖯\mathsf{NP}-complete even in very restricted parameter settings. The relevant parameters of importance for Fmc is aa, ff and χ\chi; Lemma 1 shows that the 𝖭𝖯\mathsf{NP}-completeness result holds even for very small values of these parameters.

Lemma 1

Determining feasibility of an instance of Fmc of nn elements is 𝖭𝖯\mathsf{NP}-complete even with the following restrictions:

  1. \triangleright

    the instances correspond to the unweighted version,

  2. \triangleright

    the following combinations of maximum set-size aa, frequency ff and number of colors χ\chi are satisfied:

    1. (a)

      f{1,3}f\in\{1,3\}, all but one set contains exactly 33 elements and all χ2\chi\geq 2,

    2. (b)

      the instances correspond to Node-fmc (which implies f=2f=2), a=O(n)a=O(\sqrt{n}\,), and all χ2\chi\geq 2, or

    3. (c)

      f=1f=1, a=3a=3 and χ=n/3\chi=\nicefrac{{n}}{{3}}.

Moreover, the following assertions also hold:

  1. \triangleright

    The instances of Fmc generated in (a) and (b) actually are instances of Segr-fmc.

  2. \triangleright

    For the instances of Fmc generated in (c), 𝖮𝖯𝖳#=χ=n/3{\mathsf{OPT}}_{\#}=\chi=\nicefrac{{n}}{{3}} and, assuming P𝖭𝖯P\neq\mathsf{NP}, there is no polynomial time approximation algorithm that has either a finite approximation ratio or satisfies the coloring constraints in the ε\varepsilon-approximate sense ((cf. eq. (5))) for any finite ε\varepsilon.

A proof of Lemma 1 appears in the appendix.

Remark 1

It may be tempting to conclude that an approach similar to what is stated below using the kk-set coverage problem as a “black box” may make the claims in Lemma 1 completely obvious. We simply take any hard instance of kk-set coverage and equi-partition the universe arbitrarily into χ\chi many color classes and let this be the corresponding instance of Fmc. Using a suitable standard reductions of 𝖭𝖯\mathsf{NP}-hardness the kk-set coverage problem, if there is a feasible solution of Fmc then the kk sets trivially cover the entire universe and thus trivially satisfy the color constraints but otherwise one may (incorrectly) claim that no kk sets cover the universe and so the fairness constraints cannot be satisfied. Additionally, one may be tempted to argue that if one takes a kk-set coverage problem instance with any additional structure (e.g., bounded occurrence of universe elements) then the coloring does not affect this additional property at all and hence the property is retained in the kk-set coverage problem instance with color constraints.

However, such a generic reduction will fail because it is incorrect and because it will not capture all the special parameter restrictions imposed in Lemma 1. For example:

  1. \triangleright

    Even though the kk sets may not cover the entire universe, it is still possible that they may satisfy the color constraints. For example, consider the following instance of the kk-set coverage problem:

    𝒰={u1,u2,u3,u4,u5,u6},k=2,𝒮1={u1,u2},𝒮2={u3,u4},𝒮3={u5,u6}\displaystyle\mathcal{U}=\{u_{1},u_{2},u_{3},u_{4},u_{5},u_{6}\},\,k=2,\,\mathcal{S}_{1}=\{u_{1},u_{2}\},\,\mathcal{S}_{2}=\{u_{3},u_{4}\},\,\mathcal{S}_{3}=\{u_{5},u_{6}\}

    Suppose we select the equi-partition {u1,u3,u5},{u2,u4,u6}\{u_{1},u_{3},u_{5}\},\{u_{2},u_{4},u_{6}\}, thus setting 𝒞(u1)=𝒞(u3)=𝒞(u5)=1\mathcal{C}(u_{1})=\mathcal{C}(u_{3})=\mathcal{C}(u_{5})=1 and 𝒞(u2)=𝒞(u4)=𝒞(u6)=2\mathcal{C}(u_{2})=\mathcal{C}(u_{4})=\mathcal{C}(u_{6})=2. Then any two selected sets will satisfy the coloring constraints.

  2. \triangleright

    Consider the requirements of the Fmc instances in part (c). Since f=1f=1, every element occurs in exactly one set and thus the set systems for the kk-set coverage problem form a partition of the universe. Such an instance cannot be a hard instance of kk-set coverage since it admits a trivial polynomial time solution: sort the sets in non-decreasing order of their cardinalities and simply take the first kk sets.

8 Relaxing coloring constraints for algorithmic designs

Based on Lemma 1 we need to make the following minimal assumptions for the purpose of designing approximation algorithms with finite approximation ratios:

  1. (i)

    We assume the existence of at least one feasible solution for the given instance of Fmc.

  2. (ii)

    We assume that 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} is sufficiently large compared to χ\chi, e.g., 𝖮𝖯𝖳#cχ{\mathsf{OPT}}_{\#}\geq c\chi for some large constant c>1c>1.

Lemma 1 and the example in Fig. 1 also show that satisfying the color constraint exactly (i.e., requiring pi/pj\nicefrac{{p_{i}}}{{p_{j}}} to be exactly equal to 11 for all ii and jj) need to be relaxed for the purpose of designing efficient algorithms since non-exact solutions of Fmc may not satisfy these constraints exactly. We define an (deterministic) ε{\varepsilon}-approximate coloring of Fmc (for some ε1\varepsilon\geq 1) to be a coloring that satisfies the coloring constraints in the following manner:

Deterministic ε-approximate coloring:i,j{1,,χ}:piεpj\displaystyle\text{\bf Deterministic ${\varepsilon}$-approximate coloring}:\,\,\,\,\forall\,i,j\in\{1,\dots,\chi\}\,:\,p_{i}\leq\varepsilon p_{j} (5)

Note that (5) automatically implies that pipj/εp_{i}\geq p_{j}/\varepsilon for all ii and jj. Thus, in our terminology, a 11-approximate coloring corresponds to satisfying the coloring constraints exactly. Finally, if our algorithm is randomized, then the pjp_{j}’s could be a random values, and then we will assume that the relevant constraints will be satisfied in expectation or with high probability in an appropriate sense. More precisely, (5) will be modified as follows:444We do not provide a bound on 𝔼[pi/pj]{\mathbb{E}}[\nicefrac{{p_{i}}}{{p_{j}}}] since pi/pj=\nicefrac{{p_{i}}}{{p_{j}}}=\infty when pj=0p_{j}=0 and pjp_{j} may be zero with a strictly positive probability, and for arbitrary χ\chi selecting a set individually for each to avoid this situation in our randomized algorithms may select too many sets.

Randomized ε{\varepsilon}-approximate coloring:

i,j{1,,χ}:𝔼[pi]ε𝔼[pj]\forall\,i,j\in\{1,\dots,\chi\}\,:\,{\mathbb{E}}[p_{i}]\leq\varepsilon\,{\mathbb{E}}[p_{j}] (5)

Randomized strong ε{\varepsilon}-approximate coloring:

i,j{1,,χ}(Pr[piεpj])1o(1)\bigwedge_{i,j\in\{1,\dots,\chi\}}\Big{(}\Pr\left[p_{i}\leq\varepsilon p_{j}\right]\Big{)}\geq 1-o(1) (5)′′

Unless otherwise stated explicitly, our algorithms will select exactly kk sets.

9 𝖫𝖯{\mathsf{LP}}-relaxation based randomized approximation algorithms for Fmc

If kk is a constant then we can solve Fmc(χ,k\chi,k) exactly in polynomial (i.e., O(nk)O(n^{k})) time by exhaustive enumeration, so we assume that kk is at least a sufficiently large constant. In this section we will employ two slightly different 𝖫𝖯{\mathsf{LP}}-relaxation of Fmc and combine three randomized rounding analysis on them to get an approximation algorithm whose approximation qualities depend on the range of various relevant parameters. The combined approximation result is stated in Theorem 9.1. In the proof of this theorem no serious attempt was made to optimize most constants since we are mainly interested in the asymptotic nature of the bounds, and to simplify exposition constants have been over-estimated to get nice integers. In the statement of Theorem 9.1 and in its proof we will refer to the three algorithms corresponding to the two 𝖫𝖯{\mathsf{LP}}-relaxations as Alg-small-opt#, Alg-medium-opt# and Alg-large-opt#.

Theorem 9.1

Suppose that the instance of Fmc(χ,k\chi,k) has nn elements and mm sets. Then, we can design three randomized polynomial-time algorithms Alg-small-opt#, Alg-medium-opt# and Alg-large-opt# with the following properties:

  1. (a)

    All the three algorithms select kk sets ((with probability 1)1).

  2. (b)

    All the three algorithms are randomized ϱ(f)\varrho(f)-approximation for Fmc, i.e., the expected total weight of the selected elements for both algorithms is at least ϱ(f)>11/𝐞\varrho(f)>1-\nicefrac{{1}}{{{\mathbf{e}}}} times 𝖮𝖯𝖳{\mathsf{OPT}}.

  3. (c)

    All the three algorithms satisfy the randomized ε\varepsilon-approximate coloring constraints ((cf. Inequality (5)){}^{\prime}) for ε=O(f)\varepsilon=O(f), i.e., for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}, 𝔼[pi]𝔼[pj]2fϱ(f)<3.16f\frac{{\mathbb{E}}[p_{i}]}{{\mathbb{E}}[p_{j}]}\leq\frac{2f}{\varrho(f)}<3.16f.

  4. (d)

    The algorithms satisfy the strong randomized ε\varepsilon-approximate coloring constraints ((cf. Equation (5)′′, i.e., i,j{1,,χ}(Pr[piεpj])1o(1))\bigwedge_{i,j\in\{1,\dots,\chi\}}\big{(}\Pr\left[p_{i}\leq\varepsilon p_{j}\right]\big{)}\geq 1-o(1)) for the values of ε\varepsilon, 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} and χ\chi as shown below:

    ε\varepsilon range of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} range of χ\chi algorithm
    (i) O(f)O(f) Ω(χnlogχ)\Omega(\chi\sqrt{n}\log\chi) unrestricted Alg-large-opt#
    (ii) O(f2)O(f^{2}) Ω(aχlogχ)\Omega(a\chi\log\chi) and O(χnlogχ)O(\chi\sqrt{n\log\chi}) unrestricted Alg-medium-opt#
    (iii) O(f2aχ𝖮𝖯𝖳#)O(f^{2}\sqrt{a\,\chi{\mathsf{OPT}}_{\#}}\,) unrestricted χ=O(max{1,lognlogm})\chi=O(\max\{1,\,\frac{\log n}{\log m}\}) Alg-small-opt#
Remark 2

Note that the high-probability ε=O(f)\varepsilon=O(f) bound in Theorem 9.1(d)(i) is asymptotically the same as the “ratio of expectation” bound in Theorem 9.1(c).

Remark 3

The dependence on ff of the bounds for ε\varepsilon in Theorem 9.1(d)(i)–(ii) can be contrasted with the Beck-Fiala theorem in discrepancy minimization that shows that the discrepancy of any set system is at most 2f2f.

Remark 4

Consider the special case Node-fmc with χ=O(1)\chi=O(1): for this case f=2f=2 and aa is equal to the maximum node-degree 𝖽𝖾𝗀max\mathsf{deg}_{\max} in the graph. The bounds in Theorem 9.1(d)(i)–(ii) for this special case imply a O(1)O(1)-approximation of color constraints unless 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} is not sufficiently large compared to 𝖽𝖾𝗀max\mathsf{deg}_{\max}. To illustrate the bound for smaller 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} in Theorem 9.1(d)(iii), if 𝖮𝖯𝖳#=𝖽𝖾𝗀max(1/2)ε{\mathsf{OPT}}_{\#}=\mathsf{deg}_{\max}^{({1}/{2})-\varepsilon} for some ε>0\varepsilon>0 then the approximation bound of the coloring constraints is O(𝖽𝖾𝗀max1ε)O(\mathsf{deg}_{\max}^{1-\varepsilon}).

A proof of Theorem 9.1 is discussed in the remaining subsections of this section. The following notations will be used uniformly throughout the proof.

  1. \triangleright

    x1,,xn{0,1}x_{1},\dots,x_{n}\in\{0,1\} and y1,,ym{0,1}y_{1},\dots,y_{m}\in\{0,1\} are the usual indicator variables for the elements u1,,unu_{1},\dots,u_{n} and the sets 𝒮1,,𝒮m\mathcal{S}_{1},\dots,\mathcal{S}_{m}, respectively, Their values in an optimal solution of the 𝖫𝖯{\mathsf{LP}}-relaxation under consideration will be denoted by x1,,xnx_{1}^{\ast},\dots,x_{n}^{\ast} and y1,,ymy_{1}^{\ast},\dots,y_{m}^{\ast}, respectively.

  2. \triangleright

    CjC_{j} is the set of all elements colored jj for j{1,,χ}j\in\{1,\dotsc,\chi\} in the given instance of Fmc.

  3. \triangleright

    𝖮𝖯𝖳frac{\mathsf{OPT}}_{\mathrm{frac}} is the optimum value of the objective function of the 𝖫𝖯{\mathsf{LP}}-relaxation under consideration.

9.1 An obvious generalization of 𝖫𝖯{\mathsf{LP}}-relaxation of maximum kk-set coverage fails

maximize i=1nw(ui)xi\sum_{i=1}^{n}w(u_{i})x_{i}
subject to xjuj𝒮yx_{j}\leq\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell} for j=1,,nj=1,\dots,n
=1my=k\sum_{\ell=1}^{m}y_{\ell}=k
0xj10\leq x_{j}\leq 1 for j=1,,nj=1,\dots,n
0y10\leq y_{\ell}\leq 1 for =1,,m\ell=1,\dots,m
Figure 2: A well-known 𝖫𝖯{\mathsf{LP}}-relaxation of the element-weighted maximum kk-set coverage problem.

It is well-known that the 𝖫𝖯{\mathsf{LP}}-relaxation for the element-weighted maximum kk-set coverage problem as shown in Fig. 2 followed by a suitable deterministic or randomized rounding provides an optimal approximation algorithm for the problem (e.g., see AS04 ; MR95 ). A straightforward way to extend this 𝖫𝖯{\mathsf{LP}}-relaxation is to add the following χ(χ1)/2{\chi(\chi-1)}/{2} additional constraints, one corresponding to each pair of colors:

uCix=uCjx for i,j{1,,χ},i<j\sum_{u_{\ell}\in C_{i}}x_{\ell}=\!\!\sum_{u_{\ell}\in C_{j}}x_{\ell}\text{ for }i,j\in\{1,\dots,\chi\},i<j

Unfortunately, this may not lead to a ε\varepsilon-approximate coloring (cf. Equation (5)) for any non-trivial ε\varepsilon as the following example shows. Suppose our instance of an unweighted Fmc(2,22,2) has four sets 𝒮1={u1}\mathcal{S}_{1}=\{u_{1}\}, 𝒮2={u2,,un2}\mathcal{S}_{2}=\{u_{2},\dots,u_{n-2}\}, 𝒮3={un1}\mathcal{S}_{3}=\{u_{n-1}\} and 𝒮4={un}\mathcal{S}_{4}=\{u_{n}\} with the elements u1u_{1} and un1u_{n-1} having color 11 and all other elements having color 22. Clearly the solution to this instance consists of the sets 𝒮3\mathcal{S}_{3} and 𝒮4\mathcal{S}_{4} with 𝖮𝖯𝖳=2{\mathsf{OPT}}=2. On the other hand, the fractional solution y1=y2=x1=x2=1y_{1}^{\ast}=y_{2}^{\ast}=x_{1}^{\ast}=x_{2}^{\ast}=1 and all remaining variables being zero is also an optimal solution of the 𝖫𝖯{\mathsf{LP}}-relaxation, but any rounding approach that does not change the values of zero-valued variables in the fractional solution must necessarily result in an integral solutions in which p2/p1=n3p_{2}/p_{1}=n-3. The example is easily generalized for arbitrary kk.

9.2 Alg-large-opt#: strengthening 𝖫𝖯{\mathsf{LP}}-relaxation via additional inequalities

One problem that the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 2 faces when applied to Fmc is the following. We would like each element-indicator variable xjx_{j} to satisfy xj=min{1,uj𝒮y}x_{j}^{\ast}=\min\big{\{}1,\,\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}\big{\}} in an optimum solution of the 𝖫𝖯{\mathsf{LP}}, but this may not be true as shown in the simple example in the previous section. Past researchers have corrected this kind of situation by introducing extra valid inequalities that hold for any solution to the problem but restrict the feasible region of the 𝖫𝖯{\mathsf{LP}}. For example, Carnes and Shmoys in CD08 and Carr et al. in CFLP00 introduced a set of additional inequalities, which they called the KC (Knapsack Cover) inequalities, to strengthen the integrality gaps of certain types of capacitated covering problems. Following their ideas, we add the extra O(fn)O(fn) “covering inequalities” which are satisfied by any integral solution of the 𝖫𝖯{\mathsf{LP}}:

xjy for j=1,,n,=1,,m, and uj𝒮\displaystyle x_{j}\geq y_{\ell}\text{ for }j=1,\dots,n,\,\ell=1,\dots,m,\text{ and }u_{j}\in\mathcal{S}_{\ell}

In addition, we adjust our 𝖫𝖯{\mathsf{LP}}-relaxation in the following manner. Since 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} is an integer from {χ,2χ,,(n/χ)χ}\{\chi,2\chi,\dots,(\lfloor\nicefrac{{n}}{{\chi}}\rfloor)\chi\}, we can “guess” the correct value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} by running the algorithm for each of the n/χ\lfloor\nicefrac{{n}}{{\chi}}\rfloor possible value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#}, consider those solutions that maximized its objective function and select that one among these solutions that has the largest value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#}. Thus, we may assume that our 𝖫𝖯{\mathsf{LP}}-relaxation knows the value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} exactly, and we add the following extra equality:

i=1nxi=𝖮𝖯𝖳#\sum_{i=1}^{n}x_{i}={\mathsf{OPT}}_{\#}

The resulting 𝖫𝖯{\mathsf{LP}}-relaxation is shown in its completeness in Fig. 3 for convenience.

maximize i=1nw(ui)xi\sum_{i=1}^{n}w(u_{i})x_{i}
subject to xjuj𝒮yx_{j}\leq\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell} for j=1,,nj=1,\dots,n
=1my=k\sum_{\ell=1}^{m}y_{\ell}=k
xjyx_{j}\geq y_{\ell} for j=1,,nj=1,\dots,n, =1,,m\ell=1,\dots,m, and uj𝒮u_{j}\in\mathcal{S}_{\ell}
i=1nxi=𝖮𝖯𝖳#\sum_{i=1}^{n}x_{i}={\mathsf{OPT}}_{\#}
uCix=uCjx\sum_{u_{\ell}\in C_{i}}x_{\ell}=\sum_{u_{\ell}\in C_{j}}x_{\ell} for i,j{1,,χ}i,j\in\{1,\dots,\chi\}, i<ji<j
0xj10\leq x_{j}\leq 1 for j=1,,nj=1,\dots,n
0y10\leq y_{\ell}\leq 1 for =1,,m\ell=1,\dots,m
Figure 3: A 𝖫𝖯{\mathsf{LP}}-relaxation for Alg-large-opt# with n+mn+m variables and O(fn+χ2)O\left(fn+\chi^{2}\right) constraints.

9.3 A general technique for obtaining joint high-probability statement

Suppose that our randomized 𝖫𝖯{\mathsf{LP}}-relaxation based algorithm guarantees that Pr[pi/pj>ε]<1/(cχ2)\Pr\left[p_{i}/p_{j}>\varepsilon\right]<\nicefrac{{1}}{{(c\chi^{2})}} for some constant c3c\geq 3 independently for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}. Then

i,j{1,,χ}Pr[piεpj]=1i,j{1,,χ}Pr[pi>εpj]1i,j{1,,χ}Pr[piεpj]>1(χ2)1cχ2=11c=c\bigwedge_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}\leq\varepsilon\,p_{j}\right]=1-\bigvee_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}>\varepsilon\,p_{j}\right]\\ \geq 1-\sum_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}\geq\varepsilon\,p_{j}\right]>1-(\chi^{2})\frac{1}{c\chi^{2}}=1-\frac{1}{c}=c^{\prime}

To boost the success probability, we repeat the randomized rounding clnnc^{\prime}\ln n times, compute the quantity σ=maxi,j{1,,χ},pj0{pi/pj}\sigma=\max_{i,j\in\{1,\dots,\chi\},\,p_{j}\neq 0}\{\nicefrac{{p_{i}}}{{p_{j}}}\} in each iteration, and output the solution in that iteration that resulted in the minimum value of σ\sigma. It then follows that for the selected solution satisfies the strong randomized ε\varepsilon-approximate coloring constraints since i,j{1,,χ}Pr[piεpj]1(1/c)clnn>11/n2\bigwedge_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}\leq\varepsilon\,p_{j}\right]\geq 1-\left(\nicefrac{{1}}{{c^{\prime}}}\right)^{c^{\prime}\ln n}>1-\nicefrac{{1}}{{n^{2}}}.

9.4 Alg-large-opt#: further details and relevant analysis

For our randomized rounding approach, we recall the following result from S01 .

Fact 1

S01 Given numbers p1,,pr[0,1]p_{1},\dots,p_{r}\in[0,1] such that =i=1rpi\ell=\sum_{i=1}^{r}p_{i} is an integer, there exists a polynomial-time algorithm that generates a sequence of integers X1,,XrX_{1},\dots,X_{r} such that (a)(a) i=1rXi\sum_{i=1}^{r}X_{i} with probability 11, (b)(b) Pr[Xi=1]=pi\Pr\left[X_{i}=1\right]=p_{i} for all i{1,,r}i\in\{1,\dots,r\}, and (c)(c) for any real numbers α1,,αr[0,1]\alpha_{1},\dots,\alpha_{r}\in[0,1] the sum i=1rαiXi\sum_{i=1}^{r}\alpha_{i}X_{i} satisfies standard Chernoff bounds.

We round y1,,ymy_{1}^{\ast},\dots,y_{m}^{\ast} to y1+,,ym+y_{1}^{+},\dots,y_{m}^{+} using the algorithm mentioned in Fact 1; this ensures =1my+==1my=k\sum_{\ell=1}^{m}y_{\ell}^{+}=\sum_{\ell=1}^{m}y_{\ell}^{\ast}=k resulting in selection of exactly kk sets. This proves the claim in (b). We round x1,,xnx_{1}^{\ast},\dots,x_{n}^{\ast} to x1+,,xn+x_{1}^{+},\dots,x_{n}^{+} in the following way: for j=1,,nj=1,\dots,n, if uj𝒮u_{j}\in\mathcal{S}_{\ell} for some y+=1y_{\ell}^{+}=1 then set xj+=1x_{j}^{+}=1.

Proof of (b)

Our proof of (c) is similar to that for the maximum kk-set coverage and is included for the sake of completeness. Note that xj+=0x_{j}^{+}=0 if and only if y+=0y_{\ell}^{+}=0 for every set 𝒮\mathcal{S}_{\ell} containing uju_{j} and thus:

𝔼[xj+]=Pr[xj+=1]=1Pr[xj+=0]=1uj𝒮Pr[y+=0]=1uj𝒮(1y)1(uj𝒮(1y)fj)fj=1(1uj𝒮yfj)fj1(1uj𝒮yf)f\textstyle{\mathbb{E}}[x_{j}^{+}]=\Pr[x_{j}^{+}=1]=1-\Pr[x_{j}^{+}=0]=1-\prod_{u_{j}\in\mathcal{S}_{\ell}}\Pr[y_{\ell}^{+}=0]=1-\prod_{u_{j}\in\mathcal{S}_{\ell}}(1-y_{\ell}^{\ast})\\ \textstyle\geq 1-\Big{(}\frac{\sum_{u_{j}\in\mathcal{S}_{\ell}}\big{(}1-y_{\ell}^{\ast}\big{)}}{f_{j}}\Big{)}^{f_{j}}=1-\Big{(}1-\frac{\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}}{f_{j}}\Big{)}^{f_{j}}\geq 1-\Big{(}1-\frac{\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}}{f}\Big{)}^{f} (6)

where we have used inequality (3). If uj𝒮y1\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}\geq 1 then obviously (6) implies 𝔼[xj+]ϱ(f)ϱ(f)xj{\mathbb{E}}[x_{j}^{+}]\geq\varrho(f)\geq\varrho(f)x_{j}^{\ast}. Otherwise, xjuj𝒮y<1x_{j}^{\ast}\leq\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}<1 and then by (4) we get

𝔼[xj+]1(1uj𝒮yf)f>1(1xjf)fϱ(f)xj\displaystyle\textstyle{\mathbb{E}}[x_{j}^{+}]\geq 1-\left(1-\frac{\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}}{f}\right)^{f}>1-\left(1-\frac{x_{j}^{\ast}}{f}\right)^{f}\geq\varrho(f)x_{j}^{\ast} (7)

This implies our bound since

𝔼[i=1nw(ui)xi+]=i=1nw(ui)𝔼[xi+]i=1nw(ui)ϱ(f)xj=ϱ(f)𝖮𝖯𝖳fracϱ(f)𝖮𝖯𝖳\displaystyle\textstyle{\mathbb{E}}[\sum_{i=1}^{n}w(u_{i})x_{i}^{+}]=\sum_{i=1}^{n}w(u_{i}){\mathbb{E}}[x_{i}^{+}]\geq\sum_{i=1}^{n}w(u_{i})\varrho(f)x_{j}^{\ast}=\varrho(f){\mathsf{OPT}}_{\mathrm{frac}}\geq\varrho(f){\mathsf{OPT}}

Proof of (c)

Note that inequalities (1) and (2) imply 1x𝐞x1x+(x2/2)1-x\leq{\mathbf{e}}^{-x}\leq 1-x+(x^{2}/2) for all x[0,1]x\in[0,1]. In particular, the following implication holds:

c>1x[0,(2/c2)(c1)]: 1x1cx+(c2x2/2)𝐞cx\displaystyle\forall\,c>1\,\,\forall\,x\in[0,(2/c^{2})(c-1)]\,:\,1-x\geq 1-cx+(c^{2}x^{2}/2)\geq{\mathbf{e}}^{-cx} (8)

We estimate an upper bound on 𝔼[xj+]{\mathbb{E}}[x_{j}^{+}] in terms of xjx_{j}^{\ast} in the following manner:

Case 1: \boldsymbol{\exists\,\ell} such that uj𝒮\boldsymbol{u_{j}\in\mathcal{S}_{\ell}} and y>𝟏/𝟐\boldsymbol{y_{\ell}^{\ast}>\nicefrac{{1}}{{2}}}.

Thus, xjy>1/2x_{j}^{\ast}\geq y_{\ell}^{\ast}>\nicefrac{{1}}{{2}}, and 𝔼[xj+]12xj{\mathbb{E}}[x_{j}^{+}]\leq 1\leq 2x_{j}^{\ast}.

Case 2: y𝟏/𝟐\boldsymbol{y_{\ell}^{\ast}\leq\nicefrac{{1}}{{2}}} for every index \boldsymbol{\ell} satisfying uj𝒮\boldsymbol{u_{j}\in\mathcal{S}_{\ell}}.

Note that xj(uj𝒮y)/fx_{j}^{\ast}\geq\left(\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}\right)/f, and setting c=2c=2 in inequality (8) we get 1x𝐞2x1-x\geq{\mathbf{e}}^{-2x} for all x[0,1/2]x\in[0,\nicefrac{{1}}{{2}}]. Now, standard calculations show the following:

𝔼[xj+]=1uj𝒮(1y)1uj𝒮𝐞2y=1𝐞2uj𝒮y=1𝐞2fxj1(12fxj)=2fxj{\mathbb{E}}[x_{j}^{+}]=1-\prod_{u_{j}\in\mathcal{S}_{\ell}}(1-y_{\ell}^{\ast})\leq 1-\prod_{u_{j}\in\mathcal{S}_{\ell}}{\mathbf{e}}^{-2y_{\ell}^{\ast}}=1-{\mathbf{e}}^{-2\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}^{\ast}}\\ =1-{\mathbf{e}}^{-2fx_{j}^{\ast}}\leq 1-(1-2fx_{j}^{\ast})=2fx_{j}^{\ast}

Combining all the cases and using (7), it follows that ϱ(f)xj𝔼[xj+]min{1, 2fxj}\varrho(f)x_{j}^{\ast}\leq{\mathbb{E}}[x_{j}^{+}]\leq\min\big{\{}1,\,2fx_{j}^{\ast}\big{\}}. Recall that uCjx=𝖮𝖯𝖳#/χ>0\sum_{u_{\ell}\in C_{j}}x_{\ell}^{\ast}=\nicefrac{{{\mathsf{OPT}}_{\#}}}{{\chi}}>0 for every j{1,,χ}j\in\{1,\dots,\chi\}. Since 𝔼[pj]=𝔼[uCjx+]=uCj𝔼[x+]{\mathbb{E}}[p_{j}]={\mathbb{E}}[\sum_{u_{\ell}\in C_{j}}x_{\ell}^{+}]=\sum_{u_{\ell}\in C_{j}}{\mathbb{E}}[x_{\ell}^{+}], we get the following bounds for all j{1,,χ}j\in\{1,\dots,\chi\}:

ϱ(f)𝖮𝖯𝖳#χ=uCjϱ(f)x𝔼[pj]=uCj𝔼[x+]uCj(2f)x=2f𝖮𝖯𝖳#χ\displaystyle\textstyle\varrho(f)\frac{{\mathsf{OPT}}_{\#}}{\chi}=\sum_{u_{\ell}\in C_{j}}\varrho(f)x_{\ell}^{\ast}\leq{\mathbb{E}}[p_{j}]=\sum_{u_{\ell}\in C_{j}}{\mathbb{E}}[x_{\ell}^{+}]\leq\sum_{u_{\ell}\in C_{j}}(2f)x_{\ell}^{\ast}=2f\frac{{\mathsf{OPT}}_{\#}}{\chi} (9)

which gives the bound 𝔼[ci]𝔼[cj]2fϱ(f)\frac{{\mathbb{E}}[c_{i}]}{{\mathbb{E}}[c_{j}]}\leq\frac{2f}{\varrho(f)} for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}.


Proofs of (d)(i) via Doob martingales

Note that the random variables x1+,,xn+x_{1}^{+},\dots,x_{n}^{+} may not be pairwise independent since two distinct elements belonging to the same set are correlated, and consequently the random variables p1,,pχp_{1},\dots,p_{\chi} also may not be pairwise independent. Indeed in the worst case an element-selection variable may be correlated to (a1)f(a-1)f other element-selection variables, thereby ruling out straightforward use of Chernoff-type tail bounds.

For sufficient large 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#}, this situation can be somewhat remedied by using Doob martinagales and Azuma’s inequality by finding a suitable ordering of the element-selection variables conditional on the rounding of the set-selection variables. We assume that the reader is familiar with basic definitions and results for the theory of martingales (e.g., see (MR95, , Section 4.44.4)). Fix an arbitrary ordered sequence y1+,,ym+y_{1}^{+},\dots,y_{m}^{+} of the set-indicator variables. Call an element-indicator variable xi+x_{i}^{+} “settled” at the ttht^{\mathrm{th}} step if and only if ui𝒮j{yj}{y1+,,yt1+}\cup_{u_{i}\in\mathcal{S}_{j}}\{y_{j}\}\not\subseteq\{y_{1}^{+},\dots,y_{t-1}^{+}\} and ui𝒮j{yj}{y1+,,yt+}\cup_{u_{i}\in\mathcal{S}_{j}}\{y_{j}\}\subseteq\{y_{1}^{+},\dots,y_{t}^{+}\}. The elementary event in our underlying sample space 𝛀\boldsymbol{\Omega} are all possible 2n2^{n} assignments of 0-11 values to the variables x1+,,xn+x_{1}^{+},\dots,x_{n}^{+}. For each t{1,,m}t\in\{1,\dots,m\}, let VtV_{t} be the subset of element-selection variables whose values are settled at the ttht^{\mathrm{th}} step, let πt\pi_{t} be an arbitrary ordering of the variables in VtV_{t}, and let us relabel the element-indicator variable names so that x1+,x2+,,xn+x_{1}^{+},x_{2}^{+},\dots,x_{n}^{+} be the ordering of all element-selection variables given by the ordering π1,,πm\pi_{1},\dots,\pi_{m}. For each t{0,1,,m}t\in\{0,1,\dots,m\} and each w1,,wt{0,1}w_{1},\dots,w_{t}\in\{0,1\}, let 𝐁w1,,wt\mathbf{B}_{w_{1},\dots,w_{t}} denote the event that yj+=wjy_{j}^{+}=w_{j} for j{1,,t}j\in\{1,\dots,t\}. Let x1+,x2+,,xqt+x_{1}^{+},x_{2}^{+},\dots,x_{q_{t}}^{+} be the union of set of all qtq_{t} element-indicator variables that are settled at the ithi^{\mathrm{th}} step over all i{1,,t}i\in\{1,\dots,t\}, and suppose that the event 𝐁w1,,wt\mathbf{B}_{w_{1},\dots,w_{t}} induces the following assignment of values to the element-indicator variables: x1+=b1,,xqt+=bqtx_{1}^{+}=b_{1},\dots,x_{q_{t}}^{+}=b_{q_{t}} for some b1,,bqt{0,1}b_{1},\dots,b_{q_{t}}\in\{0,1\}. Define the block 𝐁w1,,wt𝛀\mathbf{B}^{\prime}_{w_{1},\dots,w_{t}}\subseteq\boldsymbol{\Omega} induced by the event 𝐁w1,,wt\mathbf{B}_{w_{1},\dots,w_{t}} as 𝐁w1,,wt={b1,,bqtrqt+1rn|rqt+1,,rn{0,1}}\mathbf{B}^{\prime}_{w_{1},\dots,w_{t}}=\{b_{1},\dots,b_{q_{t}}r_{q_{t}+1}\dots r_{n}\,|\,r_{q_{t}+1},\dots,r_{n}\in\{0,1\}\}. Letting 𝔽t\boldsymbol{\mathbb{F}}_{t} be the σ\sigma-field generated by the partition of 𝛀\boldsymbol{\Omega} into the blocks 𝐁w1,,wt\mathbf{B}^{\prime}_{w_{1},\dots,w_{t}} for each w1,,wt{0,1}w_{1},\dots,w_{t}\in\{0,1\}, it follows that 𝔽0,𝔽1,,𝔽m\boldsymbol{\mathbb{F}}_{0},\boldsymbol{\mathbb{F}}_{1},\dots,\boldsymbol{\mathbb{F}}_{m} form a filter for the σ\sigma-field (𝛀,2𝛀)(\boldsymbol{\Omega},2^{\boldsymbol{\Omega}}). Suppose that 𝒰\mathcal{U} contains ni>0n_{i}>0 elements of color ii, let xα1+,,xαni+x_{\alpha_{1}}^{+},\dots,x_{\alpha_{n_{i}}}^{+} be the ordered sequence of the element-selection variables for elements of color ii determined by the subsequence of these variables in the ordering x1+,,xn+x_{1}^{+},\dots,x_{n}^{+}, and suppose that xαj+x_{\alpha_{j}}^{+} was settled at the tαjtht_{\alpha_{j}}^{\mathrm{th}} step. Let Xi=j=1nixαj+X^{i}=\sum_{j=1}^{n_{i}}x_{\alpha_{j}}^{+}, and define the Doob martingale sequence X0,X1,,XniX_{0},X_{1},\dots,X_{n_{i}} where X0=𝔼[Xi]=j=1ni𝔼[xαj+]X_{0}={\mathbb{E}}[X^{i}]=\sum_{j=1}^{n_{i}}{\mathbb{E}}[x_{\alpha_{j}}^{+}], and X=𝔼[Xi|y1+,,ytα+]X_{\ell}={\mathbb{E}}[X^{i}\,|\,y_{1}^{+},\dots,y_{t_{\alpha_{\ell}}}^{+}] for all {1,,ni}\ell\in\{1,\dots,n_{i}\}. Since ni<nn_{i}<n, Xni=XiX_{n_{i}}=X^{i} and |XX1|1|X_{\ell}-X_{\ell-1}|\leq 1 for for all {1,,ni}\ell\in\{1,\dots,n_{i}\}, by Azuma’s inequality (for any Δ>𝐞\Delta>{\mathbf{e}}),

Pr[|j=1nixαj+j=1ni𝔼[xαj+]|3lnΔn]Pr[|j=1nixαj+X0|3lnΔni]=Pr[|XniX0|3lnΔni]𝐞4lnΔ=Δ4Pr[j=1ni𝔼[xαj+]3lnΔn<j=1nixαj+<j=1ni𝔼[xαj+]+3lnΔn]>1Δ4\Pr\left[\,\left|{\textstyle\sum_{j=1}^{n_{i}}}x_{\alpha_{j}}^{+}-{\textstyle\sum_{j=1}^{n_{i}}}{\mathbb{E}}[x_{\alpha_{j}}^{+}]\right|\geq 3\sqrt{\ln\Delta}\sqrt{n}\right]\leq\Pr\left[\,\left|{\textstyle\sum_{j=1}^{n_{i}}}x_{\alpha_{j}}^{+}-X_{0}\right|\geq 3\sqrt{\ln\Delta}\sqrt{n_{i}}\right]\\ =\Pr\left[|X_{n_{i}}-X_{0}|\geq 3\sqrt{\ln\Delta}\sqrt{n_{i}}\right]\leq{\mathbf{e}}^{-4\ln\Delta}=\Delta^{-4}\\ \Rightarrow\,\Pr\left[\,{\textstyle\sum_{j=1}^{n_{i}}}{\mathbb{E}}[x_{\alpha_{j}}^{+}]-3\sqrt{\ln\Delta}\sqrt{n}<{\textstyle\sum_{j=1}^{n_{i}}}x_{\alpha_{j}}^{+}<{\textstyle\sum_{j=1}^{n_{i}}}{\mathbb{E}}[x_{\alpha_{j}}^{+}]+3\sqrt{\ln\Delta}\sqrt{n}\right]>1-\Delta^{-4}

Recall from (9) that (11/𝐞)𝖮𝖯𝖳#χ<j=1ni𝔼[xαj+]2f𝖮𝖯𝖳#χ(1-\nicefrac{{1}}{{{\mathbf{e}}}})\frac{{\mathsf{OPT}}_{\#}}{\chi}<\sum_{j=1}^{n_{i}}{\mathbb{E}}[x_{\alpha_{j}}^{+}]\leq 2f\frac{{\mathsf{OPT}}_{\#}}{\chi}, and the inequality j=1ni𝔼[xαj+]6lnΔn\sum_{j=1}^{n_{i}}{\mathbb{E}}[x_{\alpha_{j}}^{+}]\geq 6\sqrt{\ln\Delta}\sqrt{n} is therefore satisfied provided 𝖮𝖯𝖳6(11/𝐞)1(nlnΔ)χ{\mathsf{OPT}}\geq 6\big{(}1-\nicefrac{{1}}{{{\mathbf{e}}}}\big{)}^{-1}(\sqrt{n\ln\Delta})\chi. Setting Δ=2χ\Delta=2\chi and remembering that pi=j=1nixαj+p_{i}={\textstyle\sum_{j=1}^{n_{i}}}x_{\alpha_{j}}^{+}, we get the following bound for any i{1,,χ}i\in\{1,\dots,\chi\}:

Pr[11/𝐞2(𝖮𝖯𝖳#χ)<pi<3f(𝖮𝖯𝖳#χ)]>1(1/16)χ4\displaystyle\Pr\left[\,\textstyle{\frac{1-\nicefrac{{1}}{{{\mathbf{e}}}}}{2}}\left(\frac{{\mathsf{OPT}}_{\#}}{\chi}\right)<p_{i}<3f\left(\frac{{\mathsf{OPT}}_{\#}}{\chi}\right)\right]>1-(1/16)\chi^{-4}

and therefore Pr[pipj611/𝐞f]>1(1/8)χ4\Pr\left[\frac{p_{i}}{p_{j}}\leq\frac{6}{1-\nicefrac{{1}}{{{\mathbf{e}}}}}f\right]>1-(1/8)\chi^{-4}. Thus, it follows that

i,j{1,,χ}Pr[pi611/𝐞pj]=1i,j{1,,χ}Pr[pi>611/𝐞pj]1i,j{1,,χ}Pr[pi611/𝐞pj]>1(χ2)18χ4132\bigwedge_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}\leq{\textstyle\frac{6}{1-\nicefrac{{1}}{{{\mathbf{e}}}}}}\,p_{j}\right]=1-\bigvee_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}>{\textstyle\frac{6}{1-\nicefrac{{1}}{{{\mathbf{e}}}}}}\,p_{j}\right]\\ \geq 1-\sum_{i,j\in\{1,\dots,\chi\}}\Pr\left[p_{i}\geq{\textstyle\frac{6}{1-\nicefrac{{1}}{{{\mathbf{e}}}}}}\,p_{j}\right]>{\textstyle 1-(\chi^{2})\frac{1}{8\chi^{4}}\geq\frac{1}{32}}

This implies our claim in (d)(i) using the technique in Section 9.3.

9.5 Alg-medium-opt#: details and proofs of relevant claims in (a)–(c) and (d)(ii)

Alg-medium-opt#: idea behind the modified 𝖫𝖯{\mathsf{LP}}-relaxation and approach

A limitation of Alg-large-opt# is that we could not use Fact 1 of Srinivasan to the fullest extent. Although Fact 1 guaranteed that the set-indicator variables are negatively correlated and hence Chernoff-type tail bounds can be applied to them due to the result by Panconesi and Srinivasan PS97 , our coloring constraints are primarily indicated by element-indicator variables which depend implicitly on the set-indicator variables. In fact, it is not difficult to see that the element-indicator variables are not negatively correlated in the sense of PS97 ; S01 555A set of binary random variables z1,,zr{0,1}z_{1},\dots,z_{r}\in\{0,1\} are called negatively correlated in PS97 ; S01 if the following holds: I{1,,r}:Pr[iI(zi=0)]iIPr[zi=0] and Pr[iI(zi=1)]iIPr[zi=1]\forall\,I\subseteq\{1,\dots,r\}:\,\Pr\left[\wedge_{i\in I}(z_{i}=0)\right]\leq\prod_{i\in I}\Pr\left[z_{i}=0\right]\text{ and }\Pr\left[\wedge_{i\in I}(z_{i}=1)\right]\leq\prod_{i\in I}\Pr\left[z_{i}=1\right] even if the set-indicator variables are negatively correlated.

Our idea is to remedy the situation by expressing the coloring constraints also by set-indicator variables and use the element-indicator variables to implicitly control the set-indicator variables in these coloring constraints. This will also necessitate using additional variables.


A modification of the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3

To begin, we quantify the number of elements of different colors in a set 𝒮i\mathcal{S}_{i} using the following notation: for j{1,,χ}j\in\{1,\dots,\chi\}, let νi,j\nu_{i,j} be the number of elements in 𝒮i\mathcal{S}_{i} of color jj. Note that 0νi,ja0\leq\nu_{i,j}\leq a. Fix an optimal integral solution of Fmc(χ,k\chi,k) covering 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} elements and a color value jj, and consider the following two quantities: 𝒜=uCjx\mathcal{A}=\sum_{u_{\ell}\in C_{j}}x_{\ell} and =i=1mνi,jyi\mathcal{B}=\sum_{i=1}^{m}\nu_{i,j}\,y_{i}. Note that 𝒜=𝖮𝖯𝖳#χ\mathcal{A}=\frac{{\mathsf{OPT}}_{\#}}{\chi}, {k,k+1,,ak}\mathcal{B}\in\{k,k+1,\dots,ak\}, and 𝒜f𝒜\mathcal{A}\leq\mathcal{B}\leq f\mathcal{A} by definition of ff. Thus, =𝔥j𝖮𝖯𝖳#\mathcal{B}={\mathfrak{h}}_{j}{\mathsf{OPT}}_{\#} is satisfied by a 𝔥j{\mathfrak{h}}_{j} that is a rational number from the set {1χ,1χ+1𝖮𝖯𝖳#,1χ+2𝖮𝖯𝖳#,,fχ1𝖮𝖯𝖳#,fχ}\big{\{}\frac{1}{\chi},\frac{1}{\chi}+\frac{1}{{\mathsf{OPT}}_{\#}},\frac{1}{\chi}+\frac{2}{{\mathsf{OPT}}_{\#}},\dots,\frac{f}{\chi}-\frac{1}{{\mathsf{OPT}}_{\#}},\frac{f}{\chi}\big{\}}. We will use the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3 with χ\chi additional variables 𝔥1,,𝔥χ{\mathfrak{h}}_{1},\dots,{\mathfrak{h}}_{\chi} and the following additional constraints

i=1mνi,jyi=𝔥j𝖮𝖯𝖳#\sum_{i=1}^{m}\nu_{i,j}\,y_{i}={\mathfrak{h}}_{j}{\mathsf{OPT}}_{\#} for j{1,,χ}j\in\{1,\dots,\chi\}
1/χ𝔥jf/χ\nicefrac{{1}}{{\chi}}\leq{\mathfrak{h}}_{j}\leq\nicefrac{{f}}{{\chi}} for j=1,,χj=1,\dots,\chi

For reader’s convenience, the new 𝖫𝖯{\mathsf{LP}}-relaxation in its entirety is shown in Fig. 4.

maximize i=1nw(ui)xi\sum_{i=1}^{n}w(u_{i})x_{i}
subject to xjuj𝒮yx_{j}\leq\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell} for j=1,,nj=1,\dots,n
=1my=k\sum_{\ell=1}^{m}y_{\ell}=k
xjyx_{j}\geq y_{\ell} for j=1,,nj=1,\dots,n, =1,,m\ell=1,\dots,m, and uj𝒮u_{j}\in\mathcal{S}_{\ell}
i=1nxi=𝖮𝖯𝖳#\sum_{i=1}^{n}x_{i}={\mathsf{OPT}}_{\#}
i=1mνi,jyi=𝔥j𝖮𝖯𝖳#\sum_{i=1}^{m}\nu_{i,j}\,y_{i}={\mathfrak{h}}_{j}{\mathsf{OPT}}_{\#} for j{1,,χ}j\in\{1,\dots,\chi\}
uCix=uCjx\sum_{u_{\ell}\in C_{i}}x_{\ell}=\sum_{u_{\ell}\in C_{j}}x_{\ell} for i,j{1,,χ}i,j\in\{1,\dots,\chi\}, i<ji<j
0xj10\leq x_{j}\leq 1 for j=1,,nj=1,\dots,n
0y10\leq y_{\ell}\leq 1 for =1,,m\ell=1,\dots,m
1/χ𝔥jf/χ\nicefrac{{1}}{{\chi}}\leq{\mathfrak{h}}_{j}\leq\nicefrac{{f}}{{\chi}} for j=1,,χj=1,\dots,\chi
Figure 4: A modified 𝖫𝖯{\mathsf{LP}}-relaxation for Alg-medium-opt# with with n+m+χn+m+\chi variables and O(fn+χ2)O\left(fn+\chi^{2}\right) constraints.

Analysis of the modified 𝖫𝖯{\mathsf{LP}}-relaxation

By our assumption on 𝔥j{\mathfrak{h}}_{j}’s, the 𝖫𝖯{\mathsf{LP}}-relaxation has a feasible solution. We use the same randomized rounding procedure (using Fact 1) as in Section 9.4 for Alg-large-opt#. The proofs for parts (b)–(d) are the same as before since all prior relevant inequalities are still included. Thus, we concentrate on the proof of (e)(ii). A crucial thing to note is the following simple observation:

Consider the sum Δ=i=1mνi,jyi\Delta=\sum_{i=1}^{m}\nu_{i,j}\,y_{i} for any assignment of values y1,,ym{0,1}y_{1},\dots,y_{m}\in\{0,1\}. Then, the number of elements covered by the sets corresponding to those variables that are set to 11 is between Δ/f\Delta/f and Δ\Delta.

Fix a color jj. Let 𝒦j=𝔥j𝖮𝖯𝖳#\mathcal{K}_{j}={\mathfrak{h}}_{j}{\mathsf{OPT}}_{\#}, αi=νi,ja\alpha_{i}=\frac{\nu_{i,j}}{a} and consider the summation j=i=1mαiyi+\mathcal{L}_{j}=\sum_{i=1}^{m}\alpha_{i}\,y_{i}^{+}. Since αi[0,1]\alpha_{i}\in[0,1] for all ii, by Fact 1 we can apply standard Chernoff bounds MR95 for j\mathcal{L}_{j}. Note that 𝔼[j]=i=1mαiyi=𝒦ja{\mathbb{E}}[\mathcal{L}_{j}]=\sum_{i=1}^{m}\alpha_{i}\,y_{i}^{\ast}=\frac{\mathcal{K}_{j}}{a}. Assuming 𝒦j16alnχ\mathcal{K}_{j}\geq 16a\ln\chi, we get the following for the tail-bounds:

Pr[i=1mνi,jyi+>5𝒦j]=Pr[j>5(𝒦j/a)]<26𝒦j/a296lnχ<χ96\displaystyle\Pr\left[{\textstyle\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+}>5\mathcal{K}_{j}}\right]=\Pr\left[\mathcal{L}_{j}>5(\nicefrac{{\mathcal{K}_{j}}}{{a}})\right]<2^{-{6\mathcal{K}_{j}}/{a}}\leq 2^{-96\ln\chi}<{\chi}^{-96}
Pr[i=1mνi,jyi+<𝒦j/2]=Pr[j<𝒦j/a2]<𝐞𝒦j8a𝐞2lnχ=χ2\displaystyle\Pr\left[{\textstyle\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+}<\nicefrac{{\mathcal{K}_{j}}}{{2}}}\right]=\Pr\left[{\textstyle\mathcal{L}_{j}<\frac{{\mathcal{K}_{j}}/{a}}{2}}\right]<{\mathbf{e}}^{-\frac{\mathcal{K}_{j}}{8a}}\leq{\mathbf{e}}^{-2\ln\chi}={\chi}^{-2}

Remember that pj=uCjx+p_{j}=\sum_{u_{\ell}\in C_{j}}x_{\ell}^{+} is the random variable denoting the number of elements of color jj selected by our randomized algorithm. Since 1fi=1mνi,jyi+cji=1mνi,jyi+\frac{1}{f}\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+}\leq c_{j}\leq\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+} we get

Pr[pj>5𝒦j]Pr[i=1mνi,jyi+>5𝒦j]<χ96,\displaystyle\Pr\left[p_{j}>5\mathcal{K}_{j}\right]\leq\Pr\left[\textstyle\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+}>5\mathcal{K}_{j}\right]<{\chi}^{-96},
Pr[pj<𝒦j2f]Pr[i=1mνi,jyi+<𝒦j2]<χ2\displaystyle\Pr\left[\textstyle p_{j}<\frac{\mathcal{K}_{j}}{2f}\right]\leq\Pr\left[\textstyle\sum_{i=1}^{m}\nu_{i,j}\,y_{i}^{+}<\frac{\mathcal{K}_{j}}{2}\right]<{\chi}^{-2}

Note that 𝖮𝖯𝖳#χ𝒦j=𝔥j𝖮𝖯𝖳#f𝖮𝖯𝖳#χ\frac{{\mathsf{OPT}}_{\#}}{\chi}\leq\mathcal{K}_{j}={\mathfrak{h}}_{j}{\mathsf{OPT}}_{\#}\leq\frac{f{\mathsf{OPT}}_{\#}}{\chi}, and therefore 1/f𝒦i𝒦jf\nicefrac{{1}}{{f}}\leq\frac{\mathcal{K}_{i}}{\mathcal{K}_{j}}\leq f for any two i,j{1,,χ}i,j\in\{1,\dots,\chi\}. Let j\mathcal{E}_{j} be the event defined as j=def𝒦j2fcj5𝒦j\mathcal{E}_{j}\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\frac{\mathcal{K}_{j}}{2f}\leq c_{j}\leq 5\mathcal{K}_{j}. Then for any two i,j{1,,χ}i,j\in\{1,\dots,\chi\} we get

Pr[pipj10f2]Pr[ij]=1Pr[i¯j¯]1Pr[i¯]Pr[j¯]1Pr[pi>5𝒦i]Pr[pi<𝒦i/(2f)]Pr[pj>5𝒦j]Pr[pj<𝒦j/(2f)]>12(χ96+χ2)\Pr\left[{\textstyle\frac{p_{i}}{p_{j}}\leq 10f^{2}}\right]\geq\Pr\left[\mathcal{E}_{i}\wedge\mathcal{E}_{j}\right]=1-\Pr\left[\,\overline{\mathcal{E}_{i}}\vee\overline{\mathcal{E}_{j}}\,\right]\geq 1-\Pr\left[\,\overline{\mathcal{E}_{i}}\,\right]-\Pr\left[\,\overline{\mathcal{E}_{j}}\,\right]\\ \geq 1-\Pr\left[p_{i}>5\mathcal{K}_{i}\right]-\Pr\left[p_{i}<\nicefrac{{\mathcal{K}_{i}}}{{(2f)}}\right]-\Pr\left[p_{j}>5\mathcal{K}_{j}\right]-\Pr\left[p_{j}<\nicefrac{{\mathcal{K}_{j}}}{{(2f)}}\right]\\ >1-2\big{(}{\chi}^{-96}+{\chi}^{-2}\big{)} (10)

The assumption of 𝒦j16alnχ\mathcal{K}_{j}\geq 16a\ln\chi, is satisfied provided 𝖮𝖯𝖳#16aχlnχ{\mathsf{OPT}}_{\#}\geq 16a\chi\ln\chi. (10) implies our claim in (d)(ii) using the technique in Section 9.3.

9.6 Alg-small-opt#: details and proofs of relevant claims in (a)–(c) and (d)(iii)

Another modification of the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3

Note that for this case χ=O(1)\chi=O(1). Fix an optimal solution for our instance of Fmc. Let 𝒜i\mathcal{A}_{i} be the collection of those sets that contain at least one element of color ii and let Zi=𝒮𝒜iyZ_{i}=\sum_{\mathcal{S}_{\ell}\in\mathcal{A}_{i}}y_{\ell} indicate the number of sets from 𝒜i\mathcal{A}_{i} selected in an integral solution of the 𝖫𝖯{\mathsf{LP}}; obviously Zi1Z_{i}\geq 1. We consider two cases for ZiZ_{i} depending on whether it is at most 5lnχ5\ln\chi or not. We cannot know a priori whether Zi5lnχZ_{i}\leq 5\ln\chi or not. However, for our analysis it suffices if we can guess just one set belonging to ZiZ_{i} correctly. We can do this by trying out all relevant possibilities exhaustively in the following manner. Let Ψ={1,,χ}\Psi=\{1,\dots,\chi\} be the set of indices of all colors. For each of the 2Ψ12^{\Psi}-1 subsets Ψ\Psi^{\prime} of Ψ\Psi, we “guess” that Zi5lnχZ_{i}\leq 5\ln\chi if and only if iΨi\in\Psi^{\prime}. Of course, we still do not know one set among these 5lnχ5\ln\chi subset for each such ii, so we will exhaustively try out one each of the at most |𝒜i|m|\mathcal{A}_{i}|\leq m sets for each ii. For every such choice of Ψ\Psi^{\prime} and every such choice of a set 𝒮iΨ𝒜i\mathcal{S}_{i_{\Psi^{\prime}}}\in\mathcal{A}_{i} for each iΨi\in\Psi^{\prime}, we perform the following steps:

  1. \triangleright

    Select the sets 𝒮iΨ\mathcal{S}_{i_{\Psi^{\prime}}} and their elements for each iΨi\in\Psi^{\prime} Set the variables corresponding to these sets and elements to 11 in the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3, i.e., set yiΨ=1y_{i_{\Psi^{\prime}}}=1 and xj=1x_{j}=1 for every iΨ𝒮iΨi\in\Psi^{\prime}\mathcal{S}_{i_{\Psi^{\prime}}} and jj\in. Remove any constraint that is already satisfied after the above step.

  2. \triangleright

    Add the following additional (at most χ\chi) constraints to the 𝖫𝖯{\mathsf{LP}}-relaxation:

    (uCi)(u𝒮j)yj>5lnχ\sum_{(u_{\ell}\in C_{i})\wedge(u_{\ell}\in\mathcal{S}_{j})}y_{j}>5\ln\chi for iΨi\notin\Psi^{\prime}

Note that the total number of iterations of the basic iterations that is needed is at most O((2m)χ)O({(2m)}^{\chi}), which is polynomial provided χ=O(max{1,lognlogm})\chi=O\big{(}\max\big{\{}1,\,\frac{\log n}{\log m}\big{\}}\big{)}.


Analysis of the modified 𝖫𝖯{\mathsf{LP}}-relaxation

We now analyze that iteration of the 𝖫𝖯{\mathsf{LP}}-relaxation that correctly guesses the value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#}, the subset ΨΨ\Psi^{\prime}\subseteq\Psi and the sets 𝒮iΨ𝒜i\mathcal{S}_{i_{\Psi^{\prime}}}\in\mathcal{A}_{i} for each iΨi\in\Psi^{\prime}. As already mentioned elsewhere, the random variables x1+,,xn+x_{1}^{+},\dots,x_{n}^{+} may not be pairwise independent since two distinct elements belonging to the same set are correlated, and consequently the random variables p1,,pχp_{1},\dots,p_{\chi} also may not be pairwise independent. For convenience, let μi=𝔼[xi+]\mu_{i}={\mathbb{E}}[x_{i}^{+}] and i\mathcal{E}_{i} denote the event ixi+=1\mathcal{E}_{i}\equiv x_{i}^{+}=1; note that (1𝐞1)xi<Pr[i]=μimin{1, 2fxi}(1-{\mathbf{e}}^{-1})x_{i}^{\ast}<\Pr\left[\mathcal{E}_{i}\right]=\mu_{i}\leq\min\{1,\,2fx_{i}^{\ast}\}. We first calculate a bound on cov(xi+,xj+)\mbox{cov}(x_{i}^{+},x_{j}^{+}) for all iji\neq j as follows. If xi+x_{i}^{+} and xj+x_{j}^{+} are independent then of course cov(xi+,xj+)=0\mbox{cov}(x_{i}^{+},x_{j}^{+})=0, otherwise

min{μi,μj}Pr[i]Pr[j]Pr[ij]Pr[i]Pr[j]=𝔼[xi+xj+]μiμj=cov(xi+,xj+)Pr[ij]μiμjmin{μi,μj}μiμj<min{μi,μj}-\min\left\{\mu_{i},\,\mu_{j}\right\}\leq-\Pr\left[\mathcal{E}_{i}\right]\Pr\left[\mathcal{E}_{j}\right]\leq\Pr\left[\mathcal{E}_{i}\wedge\mathcal{E}_{j}\right]-\Pr\left[\mathcal{E}_{i}\right]\Pr\left[\mathcal{E}_{j}\right]={\mathbb{E}}[x_{i}^{+}x_{j}^{+}]-\mu_{i}\mu_{j}\\ =\mbox{cov}(x_{i}^{+},x_{j}^{+})\leq\Pr\left[\mathcal{E}_{i}\wedge\mathcal{E}_{j}\right]-\mu_{i}\mu_{j}\leq\min\left\{\mu_{i},\,\mu_{j}\right\}-\mu_{i}\mu_{j}<\min\left\{\mu_{i},\,\mu_{j}\right\}

giving the following bounds:

min{ϱ(f)xi,ϱ(f)xj}cov(xi+,xj+)min{2fxi, 2fxj, 1}\displaystyle-\min\left\{\varrho(f)x_{i}^{\ast},\,\varrho(f)x_{j}^{\ast}\,\right\}\leq\mbox{cov}(x_{i}^{+},x_{j}^{+})\leq\min\left\{2fx_{i}^{\ast},\,2fx_{j}^{\ast},\,1\right\} (11)

For notational convenience, let 𝒟i,j={|ui,uj𝒮,ji}\mathcal{D}_{i,j}=\{\ell\,|\,u_{i},u_{j}\in\mathcal{S}_{\ell},\,j\neq i\} be the indices of those sets in which both the elements uiu_{i} and uju_{j} appear, and let 𝒟i=j=1n𝒟i,j\mathcal{D}_{i}=\cup_{j=1}^{n}\mathcal{D}_{i,j}. Note that |𝒟i,j|f|\mathcal{D}_{i,j}|\leq f, |𝒟i|(a1)f|\mathcal{D}_{i}|\leq(a-1)f, and the random variable xi+x_{i}^{+} is independent of all xj+x_{j}^{+} satisfying j𝒟ij\notin\mathcal{D}_{i}. Using this observation and (11), for any i{1,,n}i\in\{1,\dots,n\} we get

j=1ncov(xi+,xj+)j𝒟i(min{μi,μj})|𝒟i|μiafμimin{2af2xi,af}\displaystyle\sum_{j=1}^{n}\mbox{cov}(x_{i}^{+},x_{j}^{+})\leq\sum_{j\in\mathcal{D}_{i}}\!\!\big{(}\min\left\{\mu_{i},\,\mu_{j}\right\}\big{)}\leq|\mathcal{D}_{i}|\,\mu_{i}\leq af\mu_{i}\leq\min\big{\{}2af^{2}x_{i}^{\ast},\,af\big{\}} (12)
j=1ncov(xi+,xj+)j𝒟imin{μi,μj}min{|𝒟i|μi,ϱ(f)j𝒟ixj}min{(a1)fμi,ϱ(f)j𝒟ixj}>afxi\sum_{j=1}^{n}\mbox{cov}(x_{i}^{+},x_{j}^{+})\geq-\sum_{j\in\mathcal{D}_{i}}\min\left\{\mu_{i},\,\mu_{j}\right\}\geq-\min\Big{\{}|\mathcal{D}_{i}|\,\mu_{i},\,\varrho(f)\sum_{j\in\mathcal{D}_{i}}x_{j}^{\ast}\Big{\}}\\ \geq-\min\Big{\{}(a-1)f\,\mu_{i},\,\varrho(f)\sum_{j\in\mathcal{D}_{i}}x_{j}^{\ast}\Big{\}}>-afx_{i}^{\ast} (13)

The above bounds can be used to bound the total pairwise co-variance between elements in two same or different color classes as follows. Consider two color classes CiC_{i} and CjC_{j} (i=ji=j is allowed). Then,

urCiusCjcov(xr+,xs+)urCij=1ncov(xr+,xj+)urCimin{2af2xr,af}=min{2af2urCixr,af|Ci|}min{2af2𝖮𝖯𝖳#χ,afn}\sum_{u_{r}\in C_{i}}\sum_{u_{s}\in C_{j}}\mbox{cov}(x_{r}^{+},x_{s}^{+})\leq\sum_{u_{r}\in C_{i}}\sum_{j=1}^{n}\mbox{cov}(x_{r}^{+},x_{j}^{+})\leq\sum_{u_{r}\in C_{i}}\min\big{\{}2af^{2}x_{r}^{\ast},\,af\big{\}}\\ \textstyle=\min\Big{\{}2af^{2}\sum_{u_{r}\in C_{i}}x_{r}^{\ast},\,af|C_{i}|\Big{\}}\leq\min\big{\{}2af^{2}\frac{{\mathsf{OPT}}_{\#}}{\chi},\,afn\big{\}} (14)
urCiusCjcov(xr+,xs+)urCij𝒟rmin{μr,μj}>urCiafxr=af𝖮𝖯𝖳#χ\displaystyle\sum_{u_{r}\in C_{i}}\sum_{u_{s}\in C_{j}}\mbox{cov}(x_{r}^{+},x_{s}^{+})\geq-\sum_{u_{r}\in C_{i}}\sum_{j\in\mathcal{D}_{r}}\min\{\mu_{r},\,\mu_{j}\}>-\sum_{u_{r}\in C_{i}}afx_{r}^{\ast}=\textstyle-af\frac{{\mathsf{OPT}}_{\#}}{\chi} (15)

For calculations of probabilities of events of the form “pi>Δpjp_{i}>\Delta p_{j}”, we first need to bound the probability of events “pj=0p_{j}=0” for j{1,,χ}j\in\{1,\dots,\chi\}. If jΨj\in\Psi^{\prime} then Pr[pj=0]=0\Pr\left[p_{j}=0\right]=0 since at least one set containing an element of color jj is always selected. Otherwise, 𝒮𝒜jy>5lnχ\sum_{\mathcal{S}_{\ell}\in\mathcal{A}_{j}}y_{\ell}^{\ast}>5\ln\chi, and pj=0p_{j}=0 if and only if y+=0y_{\ell}^{+}=0 for every 𝒮𝒜j\mathcal{S}_{\ell}\in\mathcal{A}_{j}. This gives us the following bound for jΨj\notin\Psi^{\prime}:

Pr[pj=0]=𝒮𝒜jPr[y+=0]=𝒮𝒜j(1y)𝒮𝒜j𝐞y=𝐞𝒮𝒜jy𝐞5lnχ=χ5\Pr\left[p_{j}=0\right]=\prod_{\mathcal{S}_{\ell}\in\mathcal{A}_{j}}\Pr\left[y_{\ell}^{+}=0\right]=\prod_{\mathcal{S}_{\ell}\in\mathcal{A}_{j}}(1-y_{\ell}^{\ast})\leq\prod_{\mathcal{S}_{\ell}\in\mathcal{A}_{j}}{\mathbf{e}}^{-y_{\ell}^{\ast}}\\ ={\mathbf{e}}^{-\sum_{\mathcal{S}_{\ell}\in\mathcal{A}_{j}}y_{\ell}^{\ast}}\leq{\mathbf{e}}^{-5\ln\chi}=\chi^{-5}

Combining both cases, we have Pr[pj=0]1/χ5\Pr\left[p_{j}=0\right]\leq\nicefrac{{1}}{{\chi^{5}}} for all jj.

We now can calculate the probabilities of events of the form “pi>Δpjp_{i}>\Delta p_{j}” for Δ=Δ1+Δ21\Delta=\Delta_{1}+\Delta_{2}\geq 1, Δ1,Δ20\Delta_{1},\Delta_{2}\geq 0 and i,j{1,,χ}i,j\in\{1,\dots,\chi\} as follows:

Pr[piΔpj]Pr[pj=0]+Pr[piΔpj|pj1]χ5+Pr[piΔ1pj+Δ2|pj1]=χ5+Pr[(piΔ1pj+Δ2)(pj1)]Pr[pj1]<χ5+Pr[piΔ1pj+Δ2]1Pr[pj=0]<χ5+Pr[piΔ1pj+Δ2]1χ5\Pr\left[p_{i}\geq\Delta p_{j}\right]\leq\Pr\left[p_{j}=0\right]+\Pr\left[p_{i}\geq\Delta p_{j}\,|\,p_{j}\geq 1\right]\leq\chi^{-5}+\Pr\left[p_{i}\geq\Delta_{1}p_{j}+\Delta_{2}\,|\,p_{j}\geq 1\right]\\ =\chi^{-5}+\frac{\Pr\left[(p_{i}\geq\Delta_{1}p_{j}+\Delta_{2})\wedge(p_{j}\geq 1)\right]}{\Pr\left[p_{j}\geq 1\right]}<\chi^{-5}+\frac{\Pr\left[p_{i}\geq\Delta_{1}p_{j}+\Delta_{2}\right]}{1-\Pr\left[p_{j}=0\right]}\\ <\chi^{-5}+\frac{\Pr\left[p_{i}\geq\Delta_{1}p_{j}+\Delta_{2}\right]}{1-\chi^{-5}} (16)

For a real number ζ>0\zeta>0, let δi,j=piζpj\delta_{i,j}=p_{i}-\zeta p_{j}. We have the following bound on 𝔼[δi,j]{\mathbb{E}}[\delta_{i,j}] for all ζ3f\zeta\geq 3f:

𝔼[δi,j]=𝔼[pi]ζ𝔼[pj]2f𝖮𝖯𝖳#χζϱ(f)𝖮𝖯𝖳#χ<0\displaystyle{\mathbb{E}}[\delta_{i,j}]={\mathbb{E}}[p_{i}]-\zeta\,{\mathbb{E}}[p_{j}]\leq 2f\frac{{\mathsf{OPT}}_{\#}}{\chi}-\zeta\varrho(f)\frac{{\mathsf{OPT}}_{\#}}{\chi}<0

Therefore, using Chebyshev’s inequality we get (for all ζ3f\zeta\geq 3f and λ>1\lambda>1):

Pr[piζpj+λvar(δi,j)]=Pr[δi,jλvar(δi,j)]<Pr[|δi,j𝔼[δi,j]|>λvar(δi,j)]1/λ2\Pr\left[p_{i}\geq\zeta p_{j}+\lambda\sqrt{\mbox{var}(\delta_{i,j})}\right]=\Pr\left[\delta_{i,j}\geq\lambda\sqrt{\mbox{var}(\delta_{i,j})}\right]\\ <\Pr\left[\left|\,\delta_{i,j}-{\mathbb{E}}[\delta_{i,j}]\,\right|>\lambda\sqrt{\mbox{var}(\delta_{i,j})}\right]\leq\nicefrac{{1}}{{\lambda^{2}}} (17)

Using (17) in (16) with Δ1=ζ\Delta_{1}=\zeta, Δ2=λvar(δi,j)\Delta_{2}=\lambda\sqrt{\mbox{var}(\delta_{i,j})} and λ=10χ\lambda=10\chi we get

Pr[pi<(ζ+10χvar(δi,j))pj]=1Pr[pi(ζ+10χvar(δi,j))pj]>1χ5χ1001χ5>1χ4\textstyle\Pr\left[p_{i}<\left(\zeta+10\chi\sqrt{\mbox{var}(\delta_{i,j})}\,\right)p_{j}\right]=1-\Pr\left[p_{i}\geq\left(\zeta+10\chi\sqrt{\mbox{var}(\delta_{i,j})}\,\right)p_{j}\right]\\ \textstyle>1-\chi^{-5}-\frac{\chi^{-100}}{1-\chi^{-5}}>1-\chi^{-4} (18)

We now calculate a bound on var(δi,j)\mbox{var}(\delta_{i,j}) using (14) and (15) as follows:

var(δi,j)=var(piζpj)=var(uCix++uCj(ζx+))=uCivar(x+)+ζ2uCjvar(x+)+ur,usCi,rscov(xr+,xs+)+ur,usCj,rscov(ζxr+,ζxs+)+urCi,usCjcov(xr+,ζxs+)uCiμ+ζ2uCjμ+2af2𝖮𝖯𝖳#χ+ζ2ur,usCj,rscov(xr+,xs+)ζurCi,usCjcov(xr+,xs+)𝔼[ci]+ζ2𝔼[cj]+2af2𝖮𝖯𝖳#χ+2ζ2af2𝖮𝖯𝖳#χ+ζaf𝖮𝖯𝖳#χ2f𝖮𝖯𝖳#χ+ζ22f𝖮𝖯𝖳#χ+2af2𝖮𝖯𝖳#χ+ζ22af2𝖮𝖯𝖳#χ+ζaf𝖮𝖯𝖳#χ4ζ2af2𝖮𝖯𝖳#χvar(δi,j)2ζaf𝖮𝖯𝖳#/χ\mbox{var}(\delta_{i,j})=\mbox{var}(p_{i}-\zeta p_{j})=\mbox{var}\Big{(}\sum_{u_{\ell}\in C_{i}}\!\!x_{\ell}^{+}+\sum_{u_{\ell}\in C_{j}}\!\!(-\zeta x_{\ell}^{+})\Big{)}\\ =\sum_{u_{\ell}\in C_{i}}\!\!\mbox{var}\big{(}x_{\ell}^{+}\big{)}+\zeta^{2}\sum_{u_{\ell}\in C_{j}}\!\!\mbox{var}\big{(}x_{\ell}^{+}\big{)}+\!\!\!\!\!\!\sum_{u_{r},u_{s}\in C_{i},r\neq s}\!\!\!\!\!\!\!\!\!\mbox{cov}(x_{r}^{+},x_{s}^{+})+\!\!\!\!\!\!\sum_{u_{r},u_{s}\in C_{j},r\neq s}\!\!\!\!\!\!\!\!\!\mbox{cov}(-\zeta x_{r}^{+},-\zeta x_{s}^{+})\\ +\!\!\!\!\!\!\sum_{u_{r}\in C_{i},u_{s}\in C_{j}}\!\!\!\!\!\!\!\!\!\mbox{cov}(x_{r}^{+},-\zeta x_{s}^{+})\\ \leq\sum_{u_{\ell}\in C_{i}}\!\!\mu_{\ell}+\zeta^{2}\sum_{u_{\ell}\in C_{j}}\!\!\mu_{\ell}+2af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+\zeta^{2}\!\!\!\!\!\!\sum_{u_{r},u_{s}\in C_{j},r\neq s}\!\!\!\!\!\!\!\!\!\mbox{cov}(x_{r}^{+},x_{s}^{+})-\zeta\!\!\!\!\!\!\sum_{u_{r}\in C_{i},u_{s}\in C_{j}}\!\!\!\!\!\!\!\!\!\mbox{cov}(x_{r}^{+},x_{s}^{+})\\ \leq{\mathbb{E}}[c_{i}]+\zeta^{2}{\mathbb{E}}[c_{j}]+2af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+2\zeta^{2}af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+\zeta af{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}\\ \leq 2f{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+\zeta^{2}2f{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+2af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+\zeta^{2}2af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}+\zeta af{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}\leq 4\zeta^{2}af^{2}{\textstyle\frac{{\mathsf{OPT}}_{\#}}{\chi}}\\ \Rightarrow\,\sqrt{\mbox{var}(\delta_{i,j})}\leq 2\zeta\sqrt{a}f\sqrt{{\mathsf{OPT}}_{\#}/\chi} (19)

Setting ζ=3f\zeta=3f and using (19) in (18) we get Pr[ci<(3f+60af2𝖮𝖯𝖳#χ)cj]>1χ4\Pr\left[c_{i}<\left(3f+60\sqrt{a}f^{2}\sqrt{{\mathsf{OPT}}_{\#}\chi}\,\right)c_{j}\right]>1-\chi^{-4}. This implies our claim in (d)(iii) using the technique in Section 9.3.

9.7 Limitations of our 𝖫𝖯{\mathsf{LP}}-relaxation: “a gap of factor ff” for coloring constraints

The coloring constraint bounds in Theorem 9.1(e)(i)–(ii) depend on ff or f2f^{2} only. It is natural to ask as a possible first direction of improvement whether this dependence can be eliminated or improved by better analysis of our 𝖫𝖯{\mathsf{LP}}-relaxations. Proposition 1 shows that this may not be possible even for χ=2\chi=2 unless one uses a significantly different 𝖫𝖯{\mathsf{LP}}-relaxation for Fmc(χ,k\chi,k).

Proposition 1

There exists optimal non-integral solutions of Fmc with the following property: any rounding approach that does not change the values of zero-valued variables in the fractional solution must necessarily result in an integral solutions in which the color constraints differ by at least a factor of ff.

Proof

We will show our result for the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3; proofs for other modified versions of this 𝖫𝖯{\mathsf{LP}}-relaxation are similar. Consider α1\alpha\gg 1 disjoint collections of sets and elements of the following type: for j{1,,α}j\in\{1,\dots,\alpha\}, the jthj^{\mathrm{th}} collection consists of a set of α+1\alpha+1 elements 𝒰j={u1j,,uα+1j}\mathcal{U}^{j}=\{u_{1}^{j},\dots,u_{\alpha+1}^{j}\} with 𝒞(u1j)=1\mathcal{C}(u_{1}^{j})=1 and 𝒞(u2j)==𝒞(uα+1j)=2\mathcal{C}(u_{2}^{j})=\dots=\mathcal{C}(u_{\alpha+1}^{j})=2, and the α+1\alpha+1 sets 𝒮1j,,𝒮α+1j\mathcal{S}_{1}^{j},\dots,\mathcal{S}_{\alpha+1}^{j} where 𝒮ij=𝒰j{uij}\mathcal{S}_{i}^{j}=\mathcal{U}^{j}\setminus\{u_{i}^{j}\} for i{1,,α+1}i\in\{1,\dots,\alpha+1\} (note that each element uiju_{i}^{j} is in exactly α\alpha sets). Add to these collections the additional 2α+22\alpha+2 elements u1,u2u_{1}^{\ell},u_{2}^{\ell} with 𝒞(u1)=1\mathcal{C}(u_{1}^{\ell})=1 and 𝒞(u2)=2\mathcal{C}(u_{2}^{\ell})=2, and the α+1\alpha+1 sets 𝒮={u1,u2}\mathcal{S}^{\ell}=\{u_{1}^{\ell},u_{2}^{\ell}\} for {α+1,,2α+1}\ell\in\{\alpha+1,\dots,2\alpha+1\}. Note that for our created instance f=αf=\alpha. Consider the following two different solutions of the 𝖫𝖯{\mathsf{LP}}-relaxation:

(1)

For a non-integral solution, let y1j==yα+1j=1/αy_{1}^{j}=\dots=y_{\alpha+1}^{j}=\nicefrac{{1}}{{\alpha}}, let x1j=1x_{1}^{j}=1 and let x2j==xα+1j=1/αx_{2}^{j}=\dots=x_{\alpha+1}^{j}=\nicefrac{{1}}{{\alpha}} for j{1,,α}j\in\{1,\dots,\alpha\}, and set all other variables to zero. This results in a solution with summation of set variables being α+1\alpha+1 (i.e., α+1\alpha+1 sets are selected non-integrally), and summation of element variables being 2α+22\alpha+2 (i.e., 2α+22\alpha+2 elements are selected non-integrally). Moreover, the summation of element variables with the color value of 11 is precisely the same as summation of element variables with the color value of 22 since both are equal to α+1\alpha+1.

(2)

For an integral solution, let y=x1=x2=1y^{\ell}=x_{1}^{\ell}=x_{2}^{\ell}=1 for {α+1,,2α+1}\ell\in\{\alpha+1,\dots,2\alpha+1\}. This also results in a solution in which α+1\alpha+1 sets are selected, the number of elements covered is 2α+22\alpha+2 and the number of elements of each color is α+1\alpha+1.

The crucial things to note here is that the two above solutions are disjoint (i.e., non-zero variables in one solution are zero in the other and vice versa), and thus any rounding approach for the solution in (1) that does not change values of the zero-valued variables results in an integral solution in which the number of elements of color 22 is ff times the number of elements of color 11.

10 A tale of fewer colors: deterministic approximation for Fmc when χ\chi is “not too large”

In this section we provide polynomial-time deterministic approximations of Fmc via the iterated rounding technique for 𝖫𝖯{\mathsf{LP}}-relaxations. We assume that the reader is familiar with the basic concepts related to this approach as described, for example, in LRS11 . Our approximation qualities will depend on the parameters ff and χ\chi and the coloring constraint bounds are interesting only if χ\chi is not too large, e.g., no more than, say, poly-logarithmic in nn. For better understanding of the idea, we will first consider the special case Node-fmc of Fmc for which f=2f=2, and later on describe how to adopt the same approach for arbitrary ff. As per the proof of Theorem 9.1 (see Section 9.2) we may assume we know the value of 𝖮𝖯𝖳#{\mathsf{OPT}_{\#}} exactly. A main ingredient of the iterated rounding approach is the following “rank lemma”.

Fact 2 (Rank lemma)

(LRS11, , Lemma 2.1.4) Consider any convex polytope P=def{𝐱n|𝐀j𝐱bj for j{1,,m},𝐱𝟎}P\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{\mathbf{x}\in{\mathbb{R}}^{n}\,|\,\mathbf{A}_{j}\mathbf{x}\geq b_{j}\text{ for }j\in\{1,\dots,m\},\,\mathbf{x}\geq\mathbf{0}\} for some 𝐀1,,𝐀mn\mathbf{A}_{1},\dots,\mathbf{A}_{m}\in{\mathbb{R}}^{n} and (b1,,bm)Tm(b_{1},\dots,b_{m})^{\mathrm{T}}\in{\mathbb{R}}^{m}. Then the following property holds for every extreme-point for PP: the number of any maximal set of linearly independent tight constraints ((i.e., constraints satisfying 𝐀j𝐱=bj\mathbf{A}_{j}\mathbf{x}=b_{j} for some j)j) in this solution equals the number of non-zero variables.

10.1 Approximating Node-fmc

Theorem 10.1

We can design a deterministic polynomial-time approximation algorithm Alg-iter-round for Node-fmc with the following properties:

  1. (a)

    The algorithm selects τ\tau nodes where τ{k+χ12,if χ=O(1)k+χ1,otherwise\tau\leq\begin{cases}k+\frac{\chi-1}{2},&\mbox{if $\chi=O(1)$}\\ k+\chi-1,&\mbox{otherwise}\end{cases}

  2. (b)

    The algorithm is a 12\frac{1}{2}-approximation for Node-fmc, i.e., the total weight of the selected elements is at least 𝖮𝖯𝖳/2\nicefrac{{{\mathsf{OPT}}}}{{2}}.

  3. (c)

    The algorithm satisfies the ε\varepsilon-approximate coloring constraints ((cf. Inequality (5))) as follows:

    for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}, pipj<{4+4χ,if χ=O(1)4+2χ+4χ2,otherwise\frac{p_{i}}{p_{j}}<\begin{cases}4+4\chi,&\mbox{if $\chi=O(1)$}\\ 4+2\chi+4\chi^{2},&\mbox{otherwise}\end{cases}

We discuss the proof in the rest of this section. Let G=(V,E)G=(V,E) be the given graph, and let 𝖽𝖾𝗀(v)\mathsf{deg}(v) denote the degree of node vv. Assume that GG has no isolated nodes.

10.1.1 The case of χ=O(1)\chi=O(1)

Since the problem can be exactly solved in polynomial time by exhaustive enumeration if kk is a constant, we can assume kk is at least a sufficiently large integer, e.g., assume that k>10χk>10\chi.

Initial preprocessing

To begin, we “guess” χ+1\chi+1 nodes, say v1,,vχ+1Vv_{1},\dots,v_{\chi+1}\in V with 𝖽𝖾𝗀(v1)𝖽𝖾𝗀(v2)𝖽𝖾𝗀(vχ+1)\mathsf{deg}(v_{1})\leq\mathsf{deg}(v_{2})\leq\dots\leq\mathsf{deg}(v_{\chi+1}) such that there exists an optimal solution contains these χ+1\chi+1 nodes with the following property: “the remaining k(χ+1)k-(\chi+1) nodes in the solution have degree at most 𝖽𝖾𝗀(v1)\mathsf{deg}(v_{1})”. Since there are at most (nχ+1)=nO(1)\binom{n}{\chi+1}=n^{O(1)} choices for such χ+1\chi+1 nodes, we can try them out in an exhaustive fashion. Thus, we only need to analyze that run of our algorithm where the our guess is correct. Once these χ+1\chi+1 nodes have been selected, we will use the following sets of nodes in V^\widehat{V} and ”incidence-indexed” edges E^\widehat{E} as input to our algorithm (note that an edge e=def{u,v}e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\} may appear as two members (e,u)(e,u) and (e,v)(e,v) in E^\widehat{E} if both uu and vv are in V^\widehat{V}):

V^=V({v1,,vχ+1}{v|degree of v in G is strictly larger than 𝖽𝖾𝗀(v1) })\displaystyle\widehat{V}=V\setminus\big{(}\,\{v_{1},\dots,v_{\chi+1}\}\,\cup\,\{v\,|\,\text{degree of $v$ in $G$ is strictly larger than $\mathsf{deg}(v_{1})$ }\}\,\big{)}
E^={(e,u)|(e=def{u,v}E)(v{v1,,vχ+1})(uV^)}\displaystyle\widehat{E}=\{\,(e,u)\,|\,\big{(}e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in E\big{)}\bigwedge\big{(}v\notin\{v_{1},\dots,v_{\chi+1}\}\big{)}\bigwedge\big{(}u\in\widehat{V}\big{)}\,\}

Fix an optimal solution VoptVV_{\mathrm{opt}}\subseteq V that includes the nodes v1,,vχ+1v_{1},\dots,v_{\chi+1}. We next make the following parameter adjustments:

  1. \triangleright

    We update an estimate for pjp_{j} (the number of edges of color jj covered by the optimal solution) from its initial value of 𝖮𝖯𝖳#/χ\nicefrac{{{\mathsf{OPT}}_{\#}}}{{\chi}} in the following manner. Let μj\mu_{j} be the number of edges of color jj incident on at least one of the nodes in {v1,,vχ+1}\{v_{1},\dots,v_{\chi+1}\}. Consider the quantity q^j=uVopt{v1,,vχ+1}|{e=def{u,v}E|(v{v1,,vχ+1})(𝒞(e)=j)}|{\widehat{q}}_{j}=\sum_{u\in V_{\mathrm{opt}}\setminus\{v_{1},\dots,v_{\chi+1}\}}\big{|}\,\big{\{}e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in E\,|\,(v\notin\{v_{1},\dots,v_{\chi+1}\})\,\wedge\,(\mathcal{C}(e)=j)\big{\}}\,\big{|}. Note that i=1χ+1𝖽𝖾𝗀(vi)2𝖮𝖯𝖳#\sum_{i=1}^{\chi+1}\mathsf{deg}(v_{i})\leq 2{\mathsf{OPT}_{\#}} and q^j{\widehat{q}}_{j} is an integer in the set {𝖮𝖯𝖳#χμj,𝖮𝖯𝖳#χμj+1,,2(𝖮𝖯𝖳#χμj)}{0,1,2,,n}\Big{\{}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j},\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j}+1,\dots,2\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j}\big{)}\Big{\}}\subseteq\{0,1,2,\dots,n\} since any edge can be covered by either one or two nodes. Note that there are at most 𝖮𝖯𝖳#χμj+1𝖮𝖯𝖳#χnχ\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j}+1\leq\frac{{\mathsf{OPT}_{\#}}}{\chi}\leq\frac{n}{\chi} possible number of integers values that each q^j{\widehat{q}}_{j} may take. Since χ=O(1)\chi=O(1), we can try out all possible combinations of q^j{\widehat{q}}_{j} values over all colors in polynomial time since (n/χ)χ=nO(1)(n/\chi)^{\chi}=n^{O(1)}. Thus, we henceforth assume that we know the correct value of q^j{\widehat{q}}_{j} for each j{1,,χ}j\in\{1,\dots,\chi\}. Note that q^j0{\widehat{q}}_{j}\geq 0 since our guess is correct.

  2. \triangleright

    Update kk (the number of nodes to be selected) by subtracting χ+1\chi+1 from it, and call the new value k^\widehat{k}.

  3. \triangleright

    Update CiC_{i} (the set of edges of color ii) to be the set of edges in E^\widehat{E} that are of color ii.

Yet another 𝖫𝖯{\mathsf{LP}}-relaxation

Let |V^|=n^|\widehat{V}|=\widehat{n} and |E^|=m^|\widehat{E}|=\widehat{m}. We will start with an initial 𝖫𝖯{\mathsf{LP}}-relaxation of Node-fmc on G^\widehat{G} which will be iteratively modified by our rounding approach. Our 𝖫𝖯{\mathsf{LP}}-relaxation is the following modified version of the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 3.

  1. \triangleright

    There is a node indicator variable yvy_{v} for every node vV^v\in\widehat{V} and an edge indicator variable xe,ux_{e,u} for every edge (e,u)E^(e,u)\in\widehat{E}; thus we have n^+m^\widehat{n}+\widehat{m} variables in total.

  2. \triangleright

    Constraints of the form “xjyx_{j}\geq y_{\ell}” and “xjuj𝒮yx_{j}\leq\sum_{u_{j}\in\mathcal{S}_{\ell}}y_{\ell}” in Fig. 3 are removed now and instead replaced by at most two constraints xe,u=yux_{e,u}=y_{u} if yuV^y_{u}\in\widehat{V} and xe,v=yvx_{e,v}=y_{v} if yvV^y_{v}\in\widehat{V} . This is done so that we can apply the rank lemma in a meaningful way.

  3. \triangleright

    Note that the quantity uV^e=def{u,v}Cixe,u\sum_{u\in\widehat{V}}\sum_{e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in C_{i}}x_{e,u} for each color ii is the integer q^i{\widehat{q}}_{i} mentioned before.

  4. \triangleright

    To maximize the parameter ranges over which our algorithm can be applied, we replace the (χ2)\binom{\chi}{2} constraints in Fig. 3 of the form “uCix=uCjx\sum_{u_{\ell}\in C_{i}}x_{\ell}=\sum_{u_{\ell}\in C_{j}}x_{\ell} for i,j{1,,χ}i,j\in\{1,\dots,\chi\}, i<ji<j” by the χ\chi constraints uV^e=def{u,v}Cixe,u=qi\sum_{u\in\widehat{V}}\sum_{e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in C_{i}}x_{e,u}=q_{i}^{\prime} for i{1,,χ}i\in\{1,\dots,\chi\}.

The entire initial 𝖫𝖯{\mathsf{LP}}-relaxation \mathcal{L} for G^\widehat{G} is shown in Fig. 5 for convenience. Note that the number of constraints in lines (1)(3) of Fig. 5 is exactly m^+χ+1\widehat{m}+\chi+1.

maximize ψ=uV^(e,u)E^w(e)xe,u\psi=\sum_{u\in\widehat{V}}\sum_{(e,u)\in\widehat{E}}w(e)x_{e,u}
subject to
(1) xe,u=yux_{e,u}=y_{u} for all uV^u\in\widehat{V} and (e,u)E^(e,u)\in\widehat{E}
(2) vV1yv=k^\sum_{v\in V_{1}}y_{v}=\widehat{k}
(3) uV^e=def{u,v}Cixe,u=q^i\sum_{u\in\widehat{V}}\sum_{e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in C_{i}}x_{e,u}={\widehat{q}}_{i} for all i{1,,χ}i\in\{1,\dots,\chi\}
(4) 0xe,u10\leq x_{e,u}\leq 1 for all (e,u)E^(e,u)\in\widehat{E}
(5) 0yv10\leq y_{v}\leq 1 for all vV^v\in\widehat{V}
Figure 5: The initial 𝖫𝖯{\mathsf{LP}}-relaxation =(0)\mathcal{L}=\mathcal{L}^{(0)} for the graph G^\widehat{G} used in Theorem 10.1. The iterated rounding approach will successively modify the 𝖫𝖯{\mathsf{LP}} to create a sequence (1),(2),\mathcal{L}^{(1)},\mathcal{L}^{(2)},\dots of 𝖫𝖯{\mathsf{LP}}’s.

Details of iterated rounding

We will use the variable t{0,1,2,,n}t\in\{0,1,2,\dots,n\} to denote the iteration number of our rounding, with t=0t=0 being the situation before any rounding has been performed, and we will use a “superscript (t)(t)” for the relevant quantities to indicate their values or status after the ttht^{\mathrm{th}} iteration of the rounding, e.g., n^(0)=n^\widehat{n}^{(0)}=\widehat{n} and n^(1)\widehat{n}^{(1)} is the value of n^\widehat{n} after the first iteration of rounding. Our iterated rounding algorithm Alg-iter-round in high level details is shown in Fig. 6, where the following notation is used for brevity for a node uV^u\in\widehat{V}:

Zuvariables={yu}{xe,u|(e,u)E^}\displaystyle Z_{u}^{\mathrm{variables}}=\{y_{u}\}\,\bigcup\,\big{\{}\,x_{e,u}\,|\,(e,u)\in\widehat{E}\big{\}}

For concise analysis of our algorithm, we will use the following notations:

  1. \triangleright

    Wχ+1W^{\chi+1} is the sum of weights of all the edges incident to one or more nodes from the set of nodes {v1,,vχ+1}\{v_{1},\dots,v_{\chi+1}\}.

  2. \triangleright

    w(X)=xu,eXw(e)w(X)=\sum_{x_{u,e}\in X}w(e) for a subset of variable X{xe,u|(e,u)E^}X\subseteq\{x_{e,u}\,|\,(e,u)\in\widehat{E}\}.

  3. \triangleright

    WAlg(t)=w(X^sol(t))W_{\!\!\text{\tiny\sc Alg}}^{(t)}=w(\widehat{X}_{\mathrm{sol}}^{(t)}) is the sum of weights of the edges whose variables are in X^sol(t)\widehat{X}_{\mathrm{sol}}^{(t)} (thus, for example, WAlg(0)=0W_{\!\!\text{\tiny\sc Alg}}^{(0)}=0).

  4. \triangleright

    𝖮𝖯𝖳frac(t){\mathsf{OPT}}_{\mathrm{frac}}^{(t)} is the optimum value of the objective function of the 𝖫𝖯{\mathsf{LP}}-relaxation (t)\mathcal{L}^{(t)} during the ttht^{\mathrm{th}} iteration of rounding.

  5. \triangleright

    p^i(t){\widehat{p}}_{i}^{(t)} is the number of edges of color ii selected by Alg-iter-round up to and including the ttht^{\mathrm{th}} iteration of rounding.

  6. \triangleright

    tfinalt^{\mathrm{final}} is the value of tt in the last iteration of rounding.

( initialization )(*\text{ initialization }*)
t0t\leftarrow 0     V^sol\widehat{V}_{\mathrm{sol}}\leftarrow\emptyset;     X^sol\widehat{X}_{\mathrm{sol}}\leftarrow\emptyset;     var-countremainingn^+m^\mbox{var-count}_{\mathrm{remaining}}\leftarrow\widehat{n}+\widehat{m};     r^m^\widehat{r}\leftarrow\widehat{m};     n^n^\widehat{n}\leftarrow\widehat{n};
( iterations of rounding )(*\text{ iterations of rounding }*)
while (var-countremaining0)(\mbox{var-count}_{\mathrm{remaining}}\neq 0) do
         tt+1t\leftarrow t+1
         find an extreme-point optimal solution of objective value 𝖮𝖯𝖳frac(t1){\mathsf{OPT}}_{\mathrm{frac}}^{(t-1)} for the 𝖫𝖯{\mathsf{LP}} (t1)\mathcal{L}^{(t-1)} (cf. Fig. 5)
         begin cases
           Case 1: there exists a variable yuy_{u} in the solution such that yu=0y_{u}=0
               var-countremainingvar-countremaining|Zuvariables|\mbox{var-count}_{\mathrm{remaining}}\leftarrow\mbox{var-count}_{\mathrm{remaining}}-\big{|}\,Z_{u}^{\mathrm{variables}}\,\big{|}; n^n^1\,\,\,\widehat{n}\leftarrow\widehat{n}-1
               r^r^|{xe,u|xe,uZuvariables}|\widehat{r}\leftarrow\widehat{r}-\big{|}\,\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\,\}\,\big{|}; X^solX^sol{xe,u|xe,uZuvariables}\,\,\,\widehat{X}_{\mathrm{sol}}\leftarrow\widehat{X}_{\mathrm{sol}}\cup\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\,\}
               remove the variables in ZuvariablesZ_{u}^{\mathrm{variables}} from (t1)\mathcal{L}^{(t-1)}, and delete or update the constraints
                      and the objective function to reflect the removal of variables
           Case 2: there exists a variable yuy_{u} in the solution such that yu=1y_{u}=1
               V^solV^sol{u}\widehat{V}_{\mathrm{sol}}\leftarrow\widehat{V}_{\mathrm{sol}}\cup\{u\}; var-countremainingvar-countremaining|Zuvariables|\,\,\,\mbox{var-count}_{\mathrm{remaining}}\leftarrow\mbox{var-count}_{\mathrm{remaining}}-\big{|}\,Z_{u}^{\mathrm{variables}}\,\big{|}
               k^k^1\widehat{k}\leftarrow\widehat{k}-1; n^n^1\,\,\,\widehat{n}\leftarrow\widehat{n}-1; r^r^|{xe,u|xe,uZuvariables}|\,\,\,\widehat{r}\leftarrow\widehat{r}-\big{|}\,\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\,\}\,\big{|}
               X^solX^sol{xe,u|xe,uZuvariables}\widehat{X}_{\mathrm{sol}}\leftarrow\widehat{X}_{\mathrm{sol}}\cup\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\,\}
               remove the variables in ZuvariablesZ_{u}^{\mathrm{variables}} from (t1)\mathcal{L}^{(t-1)}, and delete or update the constraints
                      and the objective function to reflect the removal of variables
               i{1,,χ}:q^iq^i|{xe,u|xe,uZuvariables and 𝒞(e)=i}|\forall\,i\in\{1,\dots,\chi\}:\,{\widehat{q}}_{i}\leftarrow{\widehat{q}}_{i}-\big{|}\,\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\text{ and }\mathcal{C}(e)=i\,\}\,\big{|}
               subtract the value xe,uZuvariablesw(e)xe,u\sum_{x_{e,u}\in Z_{u}^{\mathrm{variables}}}w(e)x_{e,u} from the objective function ψ(t1)\psi^{(t-1)}
           Case 3: 1n^χ+11\leq\widehat{n}\leq\chi+1
               let yu1,,yun^y_{u_{1}},\dots,y_{u_{\widehat{n}^{\prime}}} be the remaining non-zero 1n^n^/21\leq\widehat{n}^{\prime}\leq\nicefrac{{\widehat{n}}}{{2}} node indicator variables
               var-countremaining0\mbox{var-count}_{\mathrm{remaining}}\leftarrow 0; k^0\,\,\,\widehat{k}\leftarrow 0; n^0\,\,\,\widehat{n}\leftarrow 0; r^0\,\,\,\widehat{r}\leftarrow 0
               i{1,,χ}:q^iq^i|{xe,u|xe,uZuvariables and 𝒞(e)=i}|\forall\,i\in\{1,\dots,\chi\}:\,{\widehat{q}}_{i}\leftarrow{\widehat{q}}_{i}-\big{|}\,\{\,x_{e,u}\,|\,x_{e,u}\in Z_{u}^{\mathrm{variables}}\text{ and }\mathcal{C}(e)=i\,\}\,\big{|}
               V^solV^sol{u1,,un^}\widehat{V}_{\mathrm{sol}}\leftarrow\widehat{V}_{\mathrm{sol}}\cup\{u_{1},\dots,u_{\widehat{n}^{\prime}}\}; X^solX^sol{xe,uj|xe,ujZujvariables,j{1,,n^}}\,\,\,\widehat{X}_{\mathrm{sol}}\leftarrow\widehat{X}_{\mathrm{sol}}\cup\{\,x_{e,u_{j}}\,|\,x_{e,u_{j}}\in Z_{u_{j}}^{\mathrm{variables}},\,j\in\{1,\dots,\widehat{n}^{\prime}\}\,\}
         end cases
end while
𝖮𝖯𝖳frac0{\mathsf{OPT}}_{\mathrm{frac}}\leftarrow 0
returnV^sol\widehat{V}_{\mathrm{sol}}
Figure 6: Pseudo-code of the iterated rounding algorithm Alg-iter-round used in Theorem 10.1. V^sol(tfinal)\widehat{V}_{\mathrm{sol}}^{(t^{\mathrm{final}})} is the set of nodes selected in our solution.
Lemma 2

Alg-iter-round terminates after at most nn iterations and selects at most k+χ12k+\frac{\chi-1}{2} nodes.

Proof

For finite termination, it suffices to show that at least one of the three cases in Alg-iter-round always applies. Consider the first iteration, say when t=αt=\alpha, when neither Case 1 nor Case 2 applies. Note that this also implies that xe,u{0,1}x_{e,u}\notin\{0,1\} for any variable xe,ux_{e,u} in 𝖫𝖯(α){\mathsf{LP}}^{(\alpha)} since otherwise the variable yuy_{u} in 𝖫𝖯(α){\mathsf{LP}}^{(\alpha)} will be either 0 or 11 via the equality constraint yu=xe,uy_{u}=x_{e,u} and one of Case 1 or Case 2 will apply. Thus the total number of non-zero variables is n^(α)+r^(α)\widehat{n}^{(\alpha)}+\widehat{r}^{(\alpha)}. Since the constraints in lines (4)(5) of Fig. 5 are not strict constraints now (i.e., not satisfied with equalities), the total number of any maximal set of strict constraints is at most the total number of constraints in lines (1)(3) of Fig. 5, i.e., at most r^(α)+χ+1\widehat{r}^{(\alpha)}+\chi+1. By the rank lemma (Fact 2) r^(α)+χ+1n^(α)+r^(α)n^(α)χ+1\widehat{r}^{(\alpha)}+\chi+1\geq\widehat{n}^{(\alpha)}+\widehat{r}^{(\alpha)}\equiv\widehat{n}^{(\alpha)}\leq\chi+1, which implies Case 3 applies and the algorithm terminates.

We now prove the bound on the number of selected sets. The value of k^\widehat{k} decreases by 11 every time a new node is selected in Case 2 and remains unchanged in Case 1 where no node is selected. In the very last iteration involving Case 3, since GG has no isolated nodes the number of node indicator variables is at least the number of edge indicator variables, implying n^n^/2χ+12\widehat{n}^{\prime}\leq\nicefrac{{\widehat{n}}}{{2}}\leq\frac{\chi+1}{2}. Since k^(tfinal1)1\widehat{k}^{(t^{\mathrm{final}}-1)}\geq 1, the total number of nodes selected is at most k+(n^1)k+χ12k+(\widehat{n}^{\prime}-1)\leq k+\frac{\chi-1}{2}.

Lemma 3

The sum of weights Γ\Gamma of the edges selected by Alg-iter-round is at least 𝖮𝖯𝖳/2\nicefrac{{{\mathsf{OPT}}}}{{2}}.

Proof

Let WAlgW(t)=WAlg(t)Wχ+1W_{\!\!\text{\footnotesize\sc Alg}\!-W}^{(t)}=W_{\!\!\text{\tiny\sc Alg}}^{(t)}-W^{\chi+1}, and 𝖮𝖯𝖳W=𝖮𝖯𝖳Wχ+1{\mathsf{OPT}}_{\!-W}={\mathsf{OPT}}-W^{\chi+1}. The proof of Lemma 2 shows that Case 3 of Alg-iter-round is executed only when t=tfinalt=t^{\mathrm{final}}. Thus, the details of Alg-iter-round in Fig. 6 imply the following sequence of assertions:

  1. (i)

    𝖮𝖯𝖳frac(0)𝖮𝖯𝖳W{\mathsf{OPT}}_{\mathrm{frac}}^{(0)}\geq{\mathsf{OPT}}_{\!-W} and 𝖮𝖯𝖳frac(t)=𝖮𝖯𝖳frac(t1)(w(X^sol(t))w(X^sol(t1))){\mathsf{OPT}}_{\mathrm{frac}}^{(t)}={\mathsf{OPT}}_{\mathrm{frac}}^{(t-1)}-\big{(}w(\widehat{X}_{\mathrm{sol}}^{(t)})-w(\widehat{X}_{\mathrm{sol}}^{(t-1)})\big{)} for t{1,,tfinal1}t\in\{1,\dots,t^{\mathrm{final}}-1\}. Since the variables xeu,jX^sol(tfinal)X^sol(tfinal1)x_{e_{u,j}}\in\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})}\setminus\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}}-1)} are at most 11, we have

    𝖮𝖯𝖳frac(tfinal)=𝖮𝖯𝖳frac(tfinal1)xeu,jX^sol(tfinal)X^sol(tfinal1)w(e)xeu,j𝖮𝖯𝖳frac(tfinal1)xeu,jX^sol(tfinal)X^sol(tfinal1)w(e)=𝖮𝖯𝖳frac(tfinal1)(w(X^sol(tfinal))w(X^sol(tfinal1))){\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}})}={\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}}-1)}-\sum_{x_{e_{u,j}}\in\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})}\setminus\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}}-1)}}w(e)x_{e_{u,j}}\geq{\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}}-1)}-\sum_{x_{e_{u,j}}\in\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})}\setminus\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}}-1)}}w(e)\\ ={\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}}-1)}-\big{(}w(\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})})-w(\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}}-1)})\big{)}

    Using the fact that 𝖮𝖯𝖳frac(tfinal)=0{\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}})}=0, we can therefore unravel the recurrence to get

    𝖮𝖯𝖳frac(tfinal)𝖮𝖯𝖳frac(0)w(X^sol(tfinal))w(X^sol(tfinal))𝖮𝖯𝖳frac(0)𝖮𝖯𝖳W\displaystyle{\mathsf{OPT}}_{\mathrm{frac}}^{(t^{\mathrm{final}})}\geq{\mathsf{OPT}}_{\mathrm{frac}}^{(0)}-w(\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})})\,\Rightarrow\,w(\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})})\geq{\mathsf{OPT}}_{\mathrm{frac}}^{(0)}\geq{\mathsf{OPT}}_{\!-W} (20)
  2. (ii)

    WAlgW(0)=0W_{\!\!\text{\footnotesize\sc Alg}\!-W}^{(0)}=0 and WAlgW(t)=WAlgW(t1)+(w(X^sol(t))w(X^sol(t1)))W_{\!\!\text{\footnotesize\sc Alg}\!-W}^{(t)}=W_{\!\!\text{\footnotesize\sc Alg}\!-W}^{(t-1)}+\big{(}w(\widehat{X}_{\mathrm{sol}}^{(t)})-w(\widehat{X}_{\mathrm{sol}}^{(t-1)})\big{)} for t{1,,tfinal}t\in\{1,\dots,t^{\mathrm{final}}\}. Using (20) we can unravel the recurrence we get

    WAlgW(tfinal)=w(X^sol(tfinal))𝖮𝖯𝖳frac(0)𝖮𝖯𝖳W\displaystyle W_{\!\!\text{\footnotesize\sc Alg}\!-W}^{(t^{\mathrm{final}})}=w(\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})})\geq{\mathsf{OPT}}_{\mathrm{frac}}^{(0)}\geq{\mathsf{OPT}}_{\!-W}

Noting that an edge e=def{u,v}e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\} can contribute the value of w(e)w(e) twice in WAlg(tfinal)W_{\!\!\text{\tiny\sc Alg}}^{(t^{\mathrm{final}})} corresponding to the two variables xe,ux_{e,u} and xe,vx_{e,v}, the total weight Γ\Gamma of selected edges in our solution is at least

ΓWχ+1+12WAlg(tfinal)Wχ+1+WAlg(tfinal)2Wχ+1+𝖮𝖯𝖳W2=𝖮𝖯𝖳2\displaystyle\Gamma\geq W^{\chi+1}+\frac{1}{2}W_{\!\!\text{\tiny\sc Alg}}^{(t^{\mathrm{final}})}\geq\frac{W^{\chi+1}+W_{\!\!\text{\tiny\sc Alg}}^{(t^{\mathrm{final}})}}{2}\geq\frac{W^{\chi+1}+{\mathsf{OPT}}_{\!-W}}{2}=\frac{{\mathsf{OPT}}}{2}\qed

Our proof of Theorem 10.1 is therefore completed once we prove the following lemma.

Lemma 4

For all i,j{1,,χ}i,j\in\{1,\dots,\chi\} p^i(tfinal)p^j(tfinal)4+4χ\frac{{\widehat{p}}_{i}^{(t^{\mathrm{final}})}}{{\widehat{p}}_{j}^{(t^{\mathrm{final}})}}\leq 4+4\chi.

Proof

When t=tfinalt=t^{\mathrm{final}} Case 3 applies and, since the variables xeu,jX^sol(tfinal)X^sol(tfinal1)x_{e_{u,j}}\in\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}})}\setminus\widehat{X}_{\mathrm{sol}}^{(t^{\mathrm{final}}-1)} are at most 11, q^i(tfinal)0{\widehat{q}}_{i}^{(t^{\mathrm{final}})}\leq 0 and consequently q^i(tfinal1)q^i(tfinal)q^i(tfinal1){\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}-{\widehat{q}}_{i}^{(t^{\mathrm{final}})}\geq{\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}. Noting that an edge e=def{u,v}e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\} can contribute twice in the various q^i(t){\widehat{q}}_{i}^{(t)}’s corresponding to the two variables xe,ux_{e,u} and xe,vx_{e,v} and remembering that q^i(0)=q^i{\widehat{q}}_{i}^{(0)}={\widehat{q}}_{i}, we get

p^i(tfinal)12(t=1tfinal(q^i(t1)q^i(t)))+μi12(t=1tfinal1(q^i(t1)q^i(t))+q^i(tfinal1))+μi=q^i(0)2+μi=q^i2+μi𝖮𝖯𝖳#2χμi2+μi=𝖮𝖯𝖳#2χ+μi2{\widehat{p}}_{i}^{(t^{\mathrm{final}})}\geq\frac{1}{2}\Big{(}\sum_{t=1}^{t^{\mathrm{final}}}\!\!\big{(}{\widehat{q}}_{i}^{(t-1)}-{\widehat{q}}_{i}^{(t)}\big{)}\Big{)}+\mu_{i}\geq\frac{1}{2}\Big{(}\sum_{t=1}^{t^{\mathrm{final}}-1}\!\!\!\big{(}{\widehat{q}}_{i}^{(t-1)}-{\widehat{q}}_{i}^{(t)}\big{)}+{\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}\Big{)}+\mu_{i}=\frac{{\widehat{q}}_{i}^{(0)}}{2}+\mu_{i}\\ =\frac{{\widehat{q}}_{i}}{2}+\mu_{i}\geq\frac{{\mathsf{OPT}}_{\#}}{2\chi}-\frac{\mu_{i}}{2}+\mu_{i}=\frac{{\mathsf{OPT}}_{\#}}{2\chi}+\frac{\mu_{i}}{2}

We can get an upper bound on p^i(tfinal){\widehat{p}}_{i}^{(t^{\mathrm{final}})} by getting an upper bound on q^i(tfinal1)q^i(tfinal){\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}-{\widehat{q}}_{i}^{(t^{\mathrm{final}})} in the following manner. Consider the n^(tfinal)<n^(tfinal)χ+1\widehat{n}^{(t^{\mathrm{final}})^{\prime}}<\widehat{n}^{(t^{\mathrm{final}})}\leq\chi+1 nodes u1,,un^(tfinal)u_{1},\dots,u_{\widehat{n}^{(t^{\mathrm{final}})^{\prime}}} in Case 3. By choice of the nodes v1,,vχ+1v_{1},\dots,v_{\chi+1} of degrees 𝖽𝖾𝗀(v1),,𝖽𝖾𝗀(vχ+1)\mathsf{deg}(v_{1}),\dots,\mathsf{deg}(v_{\chi+1}), respectively, the number of edges incident on uiu_{i} is at most 𝖽𝖾𝗀(vi)\mathsf{deg}(v_{i}) for all i{1,,n^(tfinal)}i\in\{1,\dots,\widehat{n}^{(t^{\mathrm{final}})^{\prime}}\}. Thus, we get q^i(tfinal1)q^i(tfinal)j=1χ+1𝖽𝖾𝗀(vj)2𝖮𝖯𝖳#{\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}-{\widehat{q}}_{i}^{(t^{\mathrm{final}})}\leq\sum_{j=1}^{\chi+1}\mathsf{deg}(v_{j})\leq 2{\mathsf{OPT}}_{\#}, and consequently

p^i(tfinal)t=1tfinal(q^i(t1)q^i(t))t=1tfinal1(q^i(t1)q^i(t))+2𝖮𝖯𝖳#=q^i+2𝖮𝖯𝖳#2(𝖮𝖯𝖳#χμi)+2𝖮𝖯𝖳#=(2+2χ)𝖮𝖯𝖳#χ{\widehat{p}}_{i}^{(t^{\mathrm{final}})}\leq\sum_{t=1}^{t^{\mathrm{final}}}\!\!\big{(}{\widehat{q}}_{i}^{(t-1)}-{\widehat{q}}_{i}^{(t)}\big{)}\leq\sum_{t=1}^{t^{\mathrm{final}}-1}\!\!\big{(}{\widehat{q}}_{i}^{(t-1)}-{\widehat{q}}_{i}^{(t)}\big{)}+2{\mathsf{OPT}}_{\#}={\widehat{q}}_{i}+2{\mathsf{OPT}}_{\#}\\ \leq 2\left({\textstyle\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}}\right)+2{\mathsf{OPT}}_{\#}={\textstyle(2+2\chi)\frac{{\mathsf{OPT}}_{\#}}{\chi}}

Thus, for all i,j{1,,χ}i,j\in\{1,\dots,\chi\} we have

p^i(tfinal)p^j(tfinal)(2+2χ)𝖮𝖯𝖳#χ𝖮𝖯𝖳#2χ+μj2<4+4χ\frac{{\widehat{p}}_{i}^{(t^{\mathrm{final}})}}{{\widehat{p}}_{j}^{(t^{\mathrm{final}})}}\leq\frac{{(2+2\chi)\frac{{\mathsf{OPT}}_{\#}}{\chi}}}{\frac{{\mathsf{OPT}}_{\#}}{2\chi}+\frac{\mu_{j}}{2}}<4+4\chi\qed

10.1.2 The case of arbitrary χ\chi

As stated below, there are two steps in the previous algorithm that cannot be executed in polynomial time when χ\chi is not a constant:

  1. (1)

    We cannot guess the χ+1\chi+1 nodes v1,,vχ+1v_{1},\dots,v_{\chi+1} in polynomial time. Instead, we guess only one node v1v_{1} such that there exists an optimal solution contains v1v_{1} with the following property: “the remaining k1k-1 nodes in the solution have degree at most 𝖽𝖾𝗀(v1)\mathsf{deg}(v_{1})”.

  2. (2)

    We cannot guess the exact value of q^i{\widehat{q}}_{i} by exhaustive enumeration and therefore we cannot use the χ\chi constraints “uV^e=def{u,v}Cixe,u=q^i\sum_{u\in\widehat{V}}\sum_{e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in C_{i}}x_{e,u}={\widehat{q}}_{i}” in line (3) of the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 5 anymore. However, note that it still holds that q^i{\widehat{q}}_{i} is an integer in the set {𝖮𝖯𝖳#χμi,𝖮𝖯𝖳#χμi+1,,2(𝖮𝖯𝖳#χμi)}\big{\{}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i},\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}+1,\dots,2\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}\big{)}\big{\}}. Thus, instead we use the 2χ2\chi constraints

    (3) 𝖮𝖯𝖳#χμiuV^e=def{u,v}Cixe,u=q^i2(𝖮𝖯𝖳#χμi) for all i{1,,χ}\textbf{(3) }\,\,\,\,\,\,{\textstyle\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}}\leq\sum_{u\in\widehat{V}}\sum_{e\stackrel{{\scriptstyle\mathrm{def}}}{{=}}\{u,v\}\in C_{i}}x_{e,u}={\widehat{q}}_{i}\leq{\textstyle 2\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}\big{)}}\,\,\,\,\,\,\text{ for all }i\in\{1,\dots,\chi\}

We need modifications of the bounds in the previous proof to reflect these changes as follows:

  1. \triangleright

    We make some obvious parameter value adjustments such as: V^=V{v1}\widehat{V}=V\setminus\{v_{1}\}, E^=E{{u,v1}|{u,v1}E}\widehat{E}=E\setminus\{\{u,v_{1}\}\,|\,\{u,v_{1}\}\in E\}, k^(0)=k1\widehat{k}^{(0)}=k-1, μi𝖽𝖾𝗀(v1)\mu_{i}\leq\mathsf{deg}(v_{1}) for all ii.

  2. \triangleright

    The number of constraints in lines (1)(3) of Fig. 5 is now m^+2χ+1\widehat{m}+2\chi+1.

  3. \triangleright

    The condition in Case 3 of Fig. 6 is now 1n^2χ+11\leq\widehat{n}\leq 2\chi+1.

  4. \triangleright

    In Lemma 2, we select at most k+2χ12=k+χ1\big{\lfloor}k+\frac{2\chi-1}{2}\big{\rfloor}=k+\chi-1 nodes.

  5. \triangleright

    The calculations for the upper bound for p^i(tfinal){\widehat{p}}_{i}^{(t^{\mathrm{final}})} in Lemma 4 change as follows. By choice of the node v1v_{1} of degree 𝖽𝖾𝗀(v1)\mathsf{deg}(v_{1}), the number of edges incident on uiu_{i} is at most 𝖽𝖾𝗀(v1)\mathsf{deg}(v_{1}) for all i{1,,n^(tfinal)}i\in\{1,\dots,\widehat{n}^{(t^{\mathrm{final}})^{\prime}}\}. This now gives q^i(tfinal1)q^i(tfinal)(2χ+1)𝖽𝖾𝗀(v1)(2χ+1)𝖮𝖯𝖳#{\widehat{q}}_{i}^{(t^{\mathrm{final}}-1)}-{\widehat{q}}_{i}^{(t^{\mathrm{final}})}\leq(2\chi+1)\mathsf{deg}(v_{1})\leq(2\chi+1){\mathsf{OPT}_{\#}}, and therefore p^i(tfinal)q^i+(2χ+1)𝖮𝖯𝖳#2(𝖮𝖯𝖳#χμi)+(2χ+1)𝖮𝖯𝖳#<(2+χ+2χ2)𝖮𝖯𝖳#χ{\widehat{p}}_{i}^{(t^{\mathrm{final}})}\leq{\widehat{q}}_{i}+(2\chi+1){\mathsf{OPT}_{\#}}\leq 2\left({\textstyle\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}}\right)+(2\chi+1){\mathsf{OPT}_{\#}}<(2+\chi+2\chi^{2})\frac{{\mathsf{OPT}_{\#}}}{\chi}. This gives us the following updated bound:

    p^i(tfinal)p^j(tfinal)(2+χ+2χ2)𝖮𝖯𝖳#χ𝖮𝖯𝖳#2χ+μj2<4+2χ+4χ2\frac{{\widehat{p}}_{i}^{(t^{\mathrm{final}})}}{{\widehat{p}}_{j}^{(t^{\mathrm{final}})}}\leq\frac{{(2+\chi+2\chi^{2})\frac{{\mathsf{OPT}}_{\#}}{\chi}}}{\frac{{\mathsf{OPT}}_{\#}}{2\chi}+\frac{\mu_{j}}{2}}<4+2\chi+4\chi^{2}

10.2 The general case: approximating Fmc

Theorem 10.2 (generalizing Theorem 10.1 for Fmc)

We can design a deterministic polynomial-time approximation algorithm for Fmc with the following properties:

  1. (a)

    The algorithm selects τ\tau sets where τ{k+χ12,if χ=O(1)k+χ1,otherwise\tau\leq\begin{cases}k+\frac{\chi-1}{2},&\mbox{if $\chi=O(1)$}\\ k+\chi-1,&\mbox{otherwise}\end{cases}

  2. (b)

    The algorithm is a 1/f\nicefrac{{1}}{{f}}-approximation for Node-fmc, i.e., the total weight of the selected elements is at least 𝖮𝖯𝖳/f\nicefrac{{{\mathsf{OPT}}}}{{f}}.

  3. (c)

    The algorithm satisfies the ε\varepsilon-approximate coloring constraints ((cf. Inequality (5))) as follows:

    for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}, pipj<{O(min{χ2f,χf2}),if χ=O(1)O(f2+χ2f),otherwise\frac{p_{i}}{p_{j}}<\begin{cases}O(\min\{\chi^{2}f,\,\chi f^{2}\}),&\mbox{if $\chi=O(1)$}\\ O(f^{2}+\chi^{2}f),&\mbox{otherwise}\end{cases}

The proof of Theorem 10.2 is a suitable modified version of the proof of Theorem 10.1. We point out the important alterations that are needed.

General modifications

  1. \triangleright

    Nodes and edges now correspond to sets and elements, respectively, incidence of an edge on a node corresponds to membership of an element in a set, and degree of a node correspond to number of elements in a set.

  2. \triangleright

    There is a set indicator variable yjy_{j} for every element 𝒮jV^\mathcal{S}_{j}\in\widehat{V}. For every element (ui,𝒮j)E^(u_{i},\mathcal{S}_{j})\in\widehat{E}, there is an element indicator variable xi,jx_{i,j} and a constraint xi,j=yjx_{i,j}=y_{j}.

  3. \triangleright

    Now i=1χ+1|𝒮i|min{χ,f}𝖮𝖯𝖳#\sum_{i=1}^{\chi+1}|\mathcal{S}_{i}|\leq\min\{\chi,f\}{\mathsf{OPT}_{\#}} since any element in any one of the sets from 𝒮1,,𝒮χ+1\mathcal{S}_{1},\dots,\mathcal{S}_{\chi+1} can appear in at most min{χ,f}\min\{\chi,f\} other sets in the collection of sets 𝒮1,,𝒮χ+1\mathcal{S}_{1},\dots,\mathcal{S}_{\chi+1}. Also, q^j{\widehat{q}}_{j} is an integer in the set {𝖮𝖯𝖳#χμj,𝖮𝖯𝖳#χμj+1,,f(𝖮𝖯𝖳#χμj)}{0,1,2,,n}\Big{\{}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j},\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j}+1,\dots,f\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{j}\big{)}\Big{\}}\subseteq\{0,1,2,\dots,n\} since any element can appear in at most ff sets.

  4. \triangleright

    An element uiu_{i} appearing in fiff_{i}\leq f sets, say sets 𝒮1,,𝒮fi\mathcal{S}_{1},\dots,\mathcal{S}_{f_{i}}, can now contribute the value of w(ui)w(u_{i}) at most fiff_{i}\leq f times in WAlg(tfinal)W_{\!\!\text{\tiny\sc Alg}}^{(t^{\mathrm{final}})} corresponding to the fif_{i} variables xi,1,,xi,fix_{i,1},\dots,x_{i,f_{i}}. Thus, we get a 1/f\nicefrac{{1}}{{f}}-approximation to the objective function.

Modifications related to χ=O(1)\chi=O(1) case

  1. \triangleright

    An element uiu_{i} appearing in fiff_{i}\leq f sets, say sets 𝒮1,,𝒮fi\mathcal{S}_{1},\dots,\mathcal{S}_{f_{i}}, can now contribute at most fiff_{i}\leq f times in the various q^i(t){\widehat{q}}_{i}^{(t)}’s corresponding to the fif_{i} variables xi,1,,xi,fix_{i,1},\dots,x_{i,f_{i}}. This modifies the relevant inequality for p^i(tfinal){\widehat{p}}_{i}^{(t^{\mathrm{final}})} as follows:

    p^i(tfinal)q^i(0)f+μi𝖮𝖯𝖳#fχμif+μi>𝖮𝖯𝖳#fχ\displaystyle{\widehat{p}}_{i}^{(t^{\mathrm{final}})}\geq\frac{{\widehat{q}}_{i}^{(0)}}{f}+\mu_{i}\geq\frac{{\mathsf{OPT}}_{\#}}{f\chi}-\frac{\mu_{i}}{f}+\mu_{i}>\frac{{\mathsf{OPT}}_{\#}}{f\chi}
    p^i(tfinal)q^i+i=1χ+1|𝒮i|f(𝖮𝖯𝖳#χμi)+min{χ,f}𝖮𝖯𝖳#<min{χ2+f, 2χf}𝖮𝖯𝖳#χ\displaystyle{\widehat{p}}_{i}^{(t^{\mathrm{final}})}\leq{\widehat{q}}_{i}+\sum_{i=1}^{\chi+1}|\mathcal{S}_{i}|\leq{\textstyle f\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}\big{)}}+\min\{\chi,f\}{\mathsf{OPT}_{\#}}<{\textstyle\min\{\chi^{2}+f,\,2\chi f\}\frac{{\mathsf{OPT}_{\#}}}{\chi}}
    p^i(tfinal)p^j(tfinal)min{χ2+f, 2χf}𝖮𝖯𝖳#χ𝖮𝖯𝖳#fχmin{χ2f+f2, 2χf2}=O(min{χ2f,χf2})\displaystyle\frac{{\widehat{p}}_{i}^{(t^{\mathrm{final}})}}{{\widehat{p}}_{j}^{(t^{\mathrm{final}})}}\leq\frac{{\textstyle\min\{\chi^{2}+f,\,2\chi f\}\frac{{\mathsf{OPT}_{\#}}}{\chi}}}{\frac{{\mathsf{OPT}}_{\#}}{f\chi}}\leq\min\{\chi^{2}f+f^{2},\,2\chi f^{2}\}=O(\min\{\chi^{2}f,\,\chi f^{2}\})

Modifications related to the arbitrary χ\chi case

  1. \triangleright

    The calculations for the upper bound for p^i(tfinal){\widehat{p}}_{i}^{(t^{\mathrm{final}})} in Lemma 4 change as follows.

    f(𝖮𝖯𝖳#χμi)+(2χ+1)𝖮𝖯𝖳#<(f+χ+2χ2)𝖮𝖯𝖳#χ<(f+3χ2)𝖮𝖯𝖳#χ\displaystyle{\textstyle f\big{(}\frac{{\mathsf{OPT}_{\#}}}{\chi}-\mu_{i}\big{)}}+(2\chi+1){\mathsf{OPT}_{\#}}<{\textstyle\big{(}f+\chi+2\chi^{2}\big{)}\frac{{\mathsf{OPT}_{\#}}}{\chi}}<{\textstyle\big{(}f+3\chi^{2}\big{)}\frac{{\mathsf{OPT}_{\#}}}{\chi}}

    This gives the final bound of p^i(tfinal)p^j(tfinal)f2+3χ2f=O(f2+χ2f)\frac{{\widehat{p}}_{i}^{(t^{\mathrm{final}})}}{{\widehat{p}}_{j}^{(t^{\mathrm{final}})}}\leq f^{2}+3\chi^{2}f=O(f^{2}+\chi^{2}f).

11 Approximation algorithms for two special cases of Fmc

For approximating these special cases of Fmc, which are still 𝖭𝖯\mathsf{NP}-complete, we will be specific about the various constants and will try to provide approximation algorithms with as tight a constant as we can. For this section, let ϱ=max{ϱ(f),ϱ(k)}\varrho=\max\{\varrho(f),\,\varrho(k)\}. Note that ϱ>11/𝐞\varrho>1-\nicefrac{{1}}{{{\mathbf{e}}}}.

11.1 Segr-fmc: almost optimal deterministic approximation with “at most” kk sets

Note that Lemma 1 shows that finding a feasible solution is 𝖭𝖯\mathsf{NP}-complete even for unweighted Segr-fmc with χ=2\chi=2. Further inapproximability results for Segr-fmc are stated in Remark 5.

Theorem 11.1

There exists a polynomial-time deterministic algorithm Alg-greed-plus that, given an instance of unweighted Segr-fmc (χ,k\chi,k) outputs a solution with the following properties:

(a)

The number of selected sets is at most kk.

(b)

The approximation ratio is at least ϱ>11/𝐞\varrho>1-\nicefrac{{1}}{{{\mathbf{e}}}}.

(c)

The coloring constraints are 22-approximately satisfied ((cf. (5))), i.e.,

i,j{1,,χ}:pi2pj\forall\,i,j\in\{1,\dots,\chi\}:\,p_{i}\leq 2\,p_{j}
Remark 5

Based on the (11/𝐞)(1-\nicefrac{{1}}{{{\mathbf{e}}}})-inapproximability result of Feige in F98 for the maximum kk-set coverage problem, it is not difficult to see the two constants in Theorem 11.1, namely ρ\rho and 22, cannot be improved beyond 11/𝐞+ε1-\nicefrac{{1}}{{{\mathbf{e}}}}+\varepsilon and (11/𝐞)1+ε1.58+ε{(1-\nicefrac{{1}}{{{\mathbf{e}}}})}^{-1}+\varepsilon\approx 1.58+\varepsilon, respectively, for any ε>0\varepsilon>0 and all χ2\chi\geq 2 assuming P𝖭𝖯\mbox{P}\neq\mathsf{NP}.

Remark 6

The “at most kk sets” part of the proof arises in the following steps of the algorithm. Since we cannot know krk_{r} exactly, we can only assume kr^kr\widehat{k_{r}}\leq k_{r} since it is possible that the algorithm for the maximum kk-set coverage also covers at least ρ𝖮𝖯𝖳#χ\rho\frac{{\mathsf{OPT}}_{\#}}{\chi} elements for some k<krk<k_{r}. Secondly, even if we have the guessed the correct value of krk_{r}, the algorithm for the maximum krk_{r}-set coverage may cover more than 2ρ𝖮𝖯𝖳#χ2\rho\frac{{\mathsf{OPT}}_{\#}}{\chi} elements, and thus we have to “un-select” some of the selected sets to get the desired bounds (the proof shows that sometimes we may have to un-select all but one set). The following example shows that a solution that insists on selecting exactly kk sets may need to select sets all of which are not in our solution. Consider the following instance of unweighted Fmc(1,1,\ell): 𝒰={u1,,un}\mathcal{U}=\{u_{1},\dotsc,u_{n}\}, =n/2\ell=\nicefrac{{n}}{{2}}, 𝒮1={u1,,un/2}\mathcal{S}_{1}=\{u_{1},\dotsc,u_{n/2}\}, and 𝒮j+1={u(n/2)+j}\mathcal{S}_{j+1}=\{u_{(n/2)+j}\} for j=1,,n/2j=1,\dots,\nicefrac{{n}}{{2}}. Our algorithm will select the set 𝒮1\mathcal{S}_{1} whereas any solution that selects exactly \ell sets must selects the sets 𝒮2,,𝒮(n/2)+1\mathcal{S}_{2},\dots,\mathcal{S}_{(n/2)+1}.

Proof

We reuse the notations, terminologies and bounds shown in the proof of Theorem 9.1 as needed. Let 𝒰1,,𝒰χ\mathcal{U}_{1},\dots,\mathcal{U}_{\chi} be the partition of the universe based on the color of the elements, i.e., 𝒰r={u|𝒞(u)=r}\mathcal{U}_{r}=\{u_{\ell}\,|\,\mathcal{C}(u_{\ell})=r\} for r{1,,χ}r\in\{1,\dots,\chi\}. By the definition of Segr-fmc every set contains elements from exactly one such partition and thus, after renaming the sets and elements for notational convenience, we may set assume that our collection 𝒮1,,𝒮m\mathcal{S}_{1},\dots,\mathcal{S}_{m} of mm sets is partitioned into χ\chi collection of sets, where the rthr^{\mathrm{th}} collection (for r{1,,χ}r\in\{1,\dots,\chi\}) contains the sets 𝒮1r,,𝒮mrr\mathcal{S}_{1}^{r},\dots,\mathcal{S}_{m_{r}}^{r} over the universe 𝒰r={u1,,unr}\mathcal{U}_{r}=\{u_{1},\dots,u_{n_{r}}\} of nrn_{r} elements such that r=1χmr=m\sum_{r=1}^{\chi}m_{r}=m and r=1χnr=n\sum_{r=1}^{\chi}n_{r}=n. For r{1,,χ}r\in\{1,\dots,\chi\} and any \ell let Fmc (1,)r\!\!{}_{r}(1,\ell) be the unweighted Fmc(1,1,\ell) problem defined over the universe 𝒰r\mathcal{U}_{r} and the collection of sets 𝒮1r,,𝒮mrr\mathcal{S}_{1}^{r},\dots,\mathcal{S}_{m_{r}}^{r}. The following observation holds trivially.

Unweighted Segr-fmc (χ,k\chi,k) has a valid solution covering {χ,2χ,,n/χχ}\ell\in\{\chi,2\chi,\dots,\left\lfloor\nicefrac{{n}}{{\chi}}\right\rfloor\chi\} elements if and only if (i) for each r{1,,χ}r\in\{1,\dots,\chi\}, Fmc (1,kr)r\!\!{}_{r}(1,k_{r}) has a valid solution covering /χ\nicefrac{{\ell}}{{\chi}} elements for some kr>0k_{r}>0, and (ii) r=1χkr=k\sum_{r=1}^{\chi}k_{r}=k.

The above observation suggests that we can guess the value of 𝖮𝖯𝖳#{\mathsf{OPT}}_{\#} by trying out all possible values of \ell just like the algorithms in Theorem 9.1, and for each such value of \ell we can solve χ\chi independent Fmc instances and combine them to get a solution of the original Segr-fmc instance. Although we cannot possibly solve the Fmc (1,kr)r\!\!{}_{r}(1,k_{r}) problems exactly, appropriate approximate solutions of these problems do correspond to a similar approximate solution of Segr-fmc (χ,k\chi,k) as stated in the following observation:

Suppose that for each r{1,,χ}r\in\{1,\dots,\chi\} we have a solution 𝒮i1r,,𝒮ikr^r𝒰r\mathcal{S}_{i_{1}}^{r},\dots,\mathcal{S}_{i_{\widehat{k_{r}}}}^{r}\subseteq\mathcal{U}_{r} of Fmc (1,kr)r\!{}_{r}(1,k_{r}) with the following properties (for some η11\eta_{1}\leq 1 and η21\eta_{2}\geq 1): (i) η1(/χ)|p=1kr^𝒮ipr|η2(/χ)\eta_{1}(\nicefrac{{\ell}}{{\chi}})\leq|\cup_{p=1}^{\widehat{k_{r}}}\mathcal{S}_{i_{p}}^{r}|\leq\eta_{2}(\nicefrac{{\ell}}{{\chi}}), and (ii) kr^kr\widehat{k_{r}}\leq k_{r}. Then, the collection of sets {𝒮irr|r{1,,kr^},r{1,,χ}}\big{\{}\mathcal{S}_{i_{\ell_{r}}}^{r}\,|\,\ell_{r}\in\{1,\dots,\widehat{k_{r}}\},\,r\in\{1,\dots,\chi\}\big{\}} outputs a solution of Segr-fmc (χ,k\chi,k) with the following properties: (a) the number of selected sets is at most kk, (b) the number of elements covered is at least η1\eta_{1}\ell, and (c) for any pair i,j{1,,χ}i,j\in\{1,\dots,\chi\}, pi/pjη2/η1\nicefrac{{p_{i}}}{{p_{j}}}\leq\nicefrac{{\eta_{2}}}{{\eta_{1}}}.

By the above observation, to prove our claim it suffices if we can find a solution for Fmc (1,kr)r\!{}_{r}(1,k_{r}) for any rr with =𝖮𝖯𝖳#\ell={\mathsf{OPT}}_{\#}, η1=ϱ\eta_{1}=\varrho and η2=2ϱ\eta_{2}=2\,\varrho. For convenience, we will omit the superscript rr from the set labels while dealing with Fmc (1,kr)r\!{}_{r}(1,k_{r}). Remove from consideration any sets from 𝒮1,,𝒮mr\mathcal{S}_{1},\dots,\mathcal{S}_{m_{r}} that contains more than /χ\nicefrac{{\ell}}{{\chi}} elements, and consider the standard (unweighted) maximum kk-set coverage problem, that ignores constraint (i) of the above observation, on these remaining collection of sets 𝒯\mathcal{T} over the universe 𝒰r\mathcal{U}_{r}. Since we have guessed the correct value of \ell, there is at least one valid solution and thus the following assertions hold: (I) there exists a set of krk_{r} sets that covers 𝖮𝖯𝖳#χ\frac{{\mathsf{OPT}}_{\#}}{\chi} elements, and (II) |𝒯|kr|\mathcal{T}|\geq k_{r}. Let νk\nu_{k} denote the maximum number of elements that can be covered by selecting kk sets from 𝒯\mathcal{T}. There are the following two well-known algorithm algorithms for the maximum kk-set coverage problem both of which select exactly kk sets: the greedy algorithm covers at least ϱ(k)νk\varrho(k)\nu_{k} elements (F98, , Proposition 5.1), where the pipage-rounding algorithm (based on the 𝖫𝖯{\mathsf{LP}}-relaxation in Fig. 2) covers at least ϱ(f)νk\varrho(f)\nu_{k} elements AS04 . Note that we do not know the exact value of krk_{r} and we cannot guess by enumerating every possible krk_{r} values for every r{1,,χ}r\in\{1,\dots,\chi\} in polynomial time. To overcome this obstacle, we use the following steps.

  1. \triangleright

    We run both the algorithms for maximum kk-set coverage for k=1,2,k=1,2,\dots until we find the first (smallest) index kr^kr\widehat{k_{r}}\leq k_{r} such that the better of the two algorithms cover at least max{ϱ(kr^),ϱ(f)}𝖮𝖯𝖳#χρ𝖮𝖯𝖳#χ\max\{\varrho(\widehat{k_{r}}),\,\varrho(f)\}\frac{{\mathsf{OPT}}_{\#}}{\chi}\geq\rho\frac{{\mathsf{OPT}}_{\#}}{\chi} elements.

  2. \triangleright

    Suppose that this algorithm selects the kr^\widehat{k_{r}} sets (after possible re-numbering of set indices) 𝒮1,,𝒮kr^\mathcal{S}_{1},\dots,\mathcal{S}_{\widehat{k_{r}}}, where we have ordered the sets such that for every j{2,3,,kr^}j\in\{2,3,\dots,\widehat{k_{r}}\} the number of elements covered by 𝒮j\mathcal{S}_{j} and not covered by any of the sets 𝒮1,,𝒮j1\mathcal{S}_{1},\dots,\mathcal{S}_{j-1} is at least as many as the number of elements covered by 𝒮\mathcal{S}_{\ell} and not covered by any of the sets 𝒮1,,𝒮j1\mathcal{S}_{1},\dots,\mathcal{S}_{j-1} for any >j\ell>j. Remember that maxj{1,,kr^}{|𝒮j|}𝖮𝖯𝖳#/χ\max_{j\in\{1,\dots,\widehat{k_{r}}\}}\{|\mathcal{S}_{j}|\}\leq\nicefrac{{{\mathsf{OPT}}_{\#}}}{{\chi}}. Let jj be the smallest index such |=1j1𝒮j|<ϱ𝖮𝖯𝖳#χ|\cup_{\ell=1}^{j-1}\mathcal{S}_{j}|<\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi} but |=1j𝒮j|ϱ𝖮𝖯𝖳#χ|\cup_{\ell=1}^{j}\mathcal{S}_{j}|\geq\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi}. We have the following cases.

    1. \triangleright

      If |𝒮j|ϱ𝖮𝖯𝖳#χ|\mathcal{S}_{j}|\geq\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi} then we select 𝒮j\mathcal{S}_{j} as our solution since ϱ𝖮𝖯𝖳#χ|𝒮j|𝖮𝖯𝖳#χ<2ϱ𝖮𝖯𝖳#χ\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi}\leq|\mathcal{S}_{j}|\leq\frac{{\mathsf{OPT}}_{\#}}{\chi}<2\,\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi}.

    2. \triangleright

      Otherwise |𝒮j|<ϱ𝖮𝖯𝖳#χ|\mathcal{S}_{j}|<\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi} and in this case we select the jkr^krj\leq\widehat{k_{r}}\leq k_{r} sets 𝒮1,,𝒮j\mathcal{S}_{1},\dots,\mathcal{S}_{j} in our solution since ϱ𝖮𝖯𝖳#χ|=1j𝒮j|2ϱ𝖮𝖯𝖳#χ\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi}\leq|\cup_{\ell=1}^{j}\mathcal{S}_{j}|\leq 2\,\varrho\frac{{\mathsf{OPT}}_{\#}}{\chi}.

11.2 Δ\Delta-bal-fmc: improved deterministic approximation

Proposition 2

There exists a polynomial-time deterministic algorithm Alg-greedy that, given an instance of unweighted Δ\Delta-bal-fmc (χ,k)\!\!(\chi,k) outputs a solution with the following properties:

(a)

The number of selected sets is ((exactly)) kk.

(b)

The approximation ratio is at least ϱ>11/𝐞\varrho>1-\nicefrac{{1}}{{{\mathbf{e}}}}.

(c)

The coloring constraints are O(Δf)O(\Delta f)-approximately satisfied ((cf. (5))), i.e., i,j{1,,χ}:pi/pj(2+2Δ)f\forall\,i,j\in\{1,\dots,\chi\}:\,\nicefrac{{p_{i}}}{{p_{j}}}\leq(2+2\Delta)f.

Proof

As already mentioned in the proof of Theorem 11.1 and elsewhere, there is a deterministic polynomial-time algorithm for the maximum kk-set coverage problem with an approximation ratio of ϱ\varrho. For the given instance of Δ\Delta-bal-fmc (χ,k)\!\!(\chi,k), we run this algorithms (ignoring element colors) selecting kk sets, say 𝒮1,,𝒮k\mathcal{S}_{1},\dots,\mathcal{S}_{k}. Obviously, the total weight of all the elements covered in the selected solution is at least ϱ𝖮𝖯𝖳\varrho\,{\mathsf{OPT}}. Let α+=i=1k|𝒮i|/χ+Δ\alpha^{+}=\sum_{i=1}^{k}\lceil\nicefrac{{|\mathcal{S}_{i}|}}{{\chi}}\rceil+\Delta and α=i=1kmax{1,|𝒮i|/χΔ}\alpha^{-}=\sum_{i=1}^{k}\max\big{\{}1,\,\lfloor\nicefrac{{|\mathcal{S}_{i}|}}{{\chi}}\rfloor-\Delta\big{\}}. Note that kαα+α+(2Δ+1)kk\leq\alpha^{-}\leq\alpha^{+}\leq\alpha^{-}+(2\Delta+1)k. Since each of the sets in the solution is balanced, an upper bound for the number pip_{i} of elements of color ii in the solution is given by piα+p_{i}\leq\alpha^{+}. Also note that by definition of ff we have piαfp_{i}\geq\frac{\alpha^{-}}{f}. It thus follows that for any ii and jj we have pi/pjf×α+α(2+2Δ)f\nicefrac{{p_{i}}}{{p_{j}}}\leq f\times\frac{\alpha^{+}}{\alpha^{-}}\leq(2+2\Delta)f.

12 Approximating Geom-fmc via randomized shifting

We refer the reader to textbooks such as V01 for a general overview of the randomized shifting technique (textbook V01 illustrates the technique in the context of Euclidean travelling salesperson problem).

Theorem 12.1

For any constant 0<ε<10<\varepsilon<1, we can design a randomized algorithm Alg-geom for Geom-fmc with the following properties:

  1. (a)

    Alg-geom runs in O((Δ/d)d2(Cd/ε)O(d)kd)O((\Delta/d)^{d}2^{(Cd/\varepsilon)^{O(d)}}k^{d}) time.

  2. (b)

    Alg-geom satisfies the following properties with probability 1o(1)1-o(1) ((cf. Inequality (5)–(5))′′{}^{\prime\prime}):

    1. \triangleright

      The algorithm covers at least (1O(ε))(𝖮𝖯𝖳εχ)(1-O(\varepsilon))({\mathsf{OPT}}-\varepsilon\chi) points.

    2. \triangleright

      The algorithm satisfies the (1+ε)(1+\varepsilon)-approximate coloring constraints ((cf. Inequality (5))′′{}^{\prime\prime}), i.e., for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}, pi(1+ε)pjp_{i}\leq(1+\varepsilon)p_{j}.

Remark 7

In many geometric applications, the dimension parameter dd is a fixed constant. For this case, Alg-geom runs in polynomial time, and moreover, under the mild assumption of 𝖮𝖯𝖳χη\frac{{\mathsf{OPT}}}{\chi}\geq\eta for some constant η>1\eta>1, Alg-geom covers at least (1O(ε))𝖮𝖯𝖳(1-O(\varepsilon)){\mathsf{OPT}} points, i.e., under these conditions Alg-geom behaves like a randomized polynomial-time approximation scheme.

Proof

Fix an optimal solution having kk unit balls 1,,kd\mathcal{B}_{1}^{\ast},\ldots,\mathcal{B}_{k}^{\ast}\subset{\mathbb{R}}^{d}, such that for all i,j{1,,χ}i,j\in\{1,\dots,\chi\}, μi()=μj()\mu_{i}(\mathcal{B}^{\ast})=\mu_{j}(\mathcal{B}^{\ast}), where =i=1ki\mathcal{B}^{\ast}=\bigcup_{i=1}^{k}\mathcal{B}_{i}^{\ast}. Thus, we need to show that our algorithm Alg-geom computes in (Δ/d)d2(Cd/ε)O(d)kd(\Delta/d)^{d}2^{(Cd/\varepsilon)^{O(d)}}k^{d} time a set of unit balls B1,,BkdB_{1},\ldots,B_{k}\subset\mathbb{R}^{d} such that the following assertions hold with probability 1o(1)1-o(1) (where B=i=1kBiB=\bigcup_{i=1}^{k}B_{i}):

i=1χμi(B)>(1O(ε))i=1χ(μi()ε)\displaystyle\sum_{i=1}^{\chi}\mu_{i}(B)>(1-O(\varepsilon))\sum_{i=1}^{\chi}(\mu_{i}(\mathcal{B}^{\ast})-\varepsilon)
i,j{1,,χ}:μi(B)(1+ε)μj(B)\displaystyle\forall\,i,j\in\{1,\ldots,\chi\}:\,\mu_{i}(B)\leq(1+\varepsilon)\mu_{j}(B)

Set L=8d/εL=8d/\varepsilon. Let GdG\subset\mathbb{R}^{d} be an axis-parallel grid such that every connected component of dG{\mathbb{R}}^{d}\setminus G is an open dd-dimensional hypercube isometric to (0,L)d(0,L)^{d}. In other words, GG is the union of dd infinite families of axis-parallel (d1)(d-1)-dimensional hyperplanes, spaced apart by LL in each orthonormal direction. Let α[0,L)\alpha\in[0,L) be chosen uniformly at random, and let G=G+αG^{\prime}=G+\alpha be the random translation of GG by α\alpha, i.e.,

(p1+α,,pd+α)G(p1,,pd)G(p_{1}+\alpha,\dots,p_{d}+\alpha)\in G^{\prime}\,\,\equiv\,\,(p_{1},\dots,p_{d})\in G

Let FF is the set of indices of all balls i\mathcal{B}_{i}^{\ast} that has a non-empty intersection with the randomly shifted grid GG^{\prime}, i.e.,

F={i{1,,k}:iG}F=\{i\in\{1,\dots,k\}:\mathcal{B}^{\ast}_{i}\cap G^{\prime}\neq\emptyset\}

Let ,F=iFi\mathcal{B}^{\ast,F}=\bigcup_{i\in F}\mathcal{B}_{i}^{\ast}. Any point pdp\in{\mathbb{R}}^{d} is contained in ,F\mathcal{B}^{\ast,F} only if it is contained in some unit ball intersecting GG^{\prime}. Therefore, p,Fp\in\mathcal{B}^{\ast,F} only if it is at distance at most 22 from GG^{\prime} (in other words, ,F\mathcal{B}^{\ast,F} is contained in the 22-neighborhood of GG^{\prime}). The probability that any particular point pp is at distance at most 22 from any family of parallel randomly shifted hyperplanes in GG^{\prime} is exactly 4/L4/L. By the union bound over all dimensions, Pr[p,F]4d/L\Pr[p\in\mathcal{B}^{\ast,F}]\leq 4d/L. Therefore, by the linearity of expectation,

𝔼[μi(,F)]=pi=1kiPr[p,F]μi(p)4dLμi(){\mathbb{E}}[\mu_{i}(\mathcal{B}^{\ast,F})]=\sum_{p\in\cup_{i=1}^{k}\mathcal{B}_{i}^{\ast}}\!\!\!\!\Pr\left[p\in\mathcal{B}^{\ast,F}\,\right]\,\mu_{i}(p)\leq\frac{4d}{L}\mu_{i}(\mathcal{B}^{\ast})

Consequently, by Markov’s inequality, we get

Pr[μi(,F)8dRμi()]1/2\Pr\left[\textstyle\mu_{i}(\mathcal{B}^{\ast,F})\geq\frac{8d}{R}\mu_{i}(\mathcal{B}^{\ast})\right]\leq\nicefrac{{1}}{{2}}

Set δ=2Θ(d)ε/C\delta=2^{-\Theta(d)}\varepsilon/C. Let B1,,BkB_{1}^{\ast\ast},\ldots,B_{k}^{\ast\ast} be the collection of unit balls in d{\mathbb{R}}^{d} obtained as follows. For each i{1,,k}Fi\in\{1,\dots,k\}\setminus F, obtain a unit ball by translating i\mathcal{B}_{i}^{\ast} such that its center has coordinates that are integer multiples of δ\delta, i.e., it is an element of the dilated integer lattice δd\delta\cdot\mathbb{Z}^{d}. For every iFi\in F, we obtain a unit ball by picking an arbitrary ball obtained for some j{1,,k}Fj\in\{1,\dots,k\}\setminus F as described above. Essentially, the new solution B1,,BkB^{\ast\ast}_{1},\dots,B^{\ast\ast}_{k} is missing all the balls that intersect GG^{\prime}, and rounds every other ball so that its center is contained in some integer lattice. In this construction, each ball i\mathcal{B}_{i}^{\ast}, with iFi\notin F, gets translated by at most some distance dδ\sqrt{d}\delta. Since each for all j{1,,χ}j\in\{1,\ldots,\chi\}, μj\mu_{j} is CC-Lipschitz, it follows that

|μj(i)μj(Bi)|vol(i)dδC2Θ(d)δC|\mu_{j}(\mathcal{B}^{\ast}_{i})-\mu_{j}(B^{\ast\ast}_{i})|\leq\mathrm{vol}(\mathcal{B}^{\ast}_{i})\,\sqrt{d}\,\delta\,C\leq 2^{\Theta(d)}\delta C

Letting B=i=1kBiB^{\ast\ast}=\bigcup_{i=1}^{k}B^{\ast\ast}_{i}, we get that for all i{1,,χ}i\in\{1,\dots,\chi\},

|μj()μj(B)|k2Θ(d)δC|\mu_{j}(\mathcal{B}^{\ast})-\mu_{j}(B^{\ast\ast})|\leq k2^{\Theta(d)}\delta C

Let {\cal I} be the set of connected components of [0,Δ]dG[0,\Delta]^{d}\setminus G^{\prime}. We refer to the elements of {\cal I} as cells. For each AA\in{\cal I}, we enumerate the set, 𝒮A{\cal S}_{A}, of all possible subsets of at most kk unit balls with centers in A(δd)A\cap(\delta\cdot\mathbb{Z}^{d}). There are at most (L/δ)d(L/\delta)^{d} lattice points in AA, and thus there are at most 2(L/δ)d2^{(L/\delta)^{d}} such subsets of unit balls. Since ||(Δ/L)d|{\cal I}|\leq(\lceil\Delta/L\rceil)^{d}, it follows that this enumeration takes O((Δ/L)d 2(L/δ)d)O((\Delta/L)^{d}\,2^{(L/\delta)^{d}}) time.

For each enumerated subset J𝒮AJ\in{\cal S}_{A} of unit balls, we record the vector

(|J|,εkμ1(X)kε,,εkμk(X)kε)\left(|J|,\frac{\varepsilon}{k}\left\lfloor\mu_{1}(X)\frac{k}{\varepsilon}\right\rfloor,\dots,\frac{\varepsilon}{k}\left\lfloor\mu_{k}(X)\frac{k}{\varepsilon}\right\rfloor\right)

where X=YJYX=\bigcup_{Y\in J}Y. There are at most (2O(d)k/ε)d(2^{O(d)}k/\varepsilon)^{d} such vectors for each cell in {\cal I}. Via standard dynamic programming, we can inductively compute all possible sums of vectors such that we pick at most one vector from each cell, and the total sum of the first coordinate, i.e., the number of unit balls, is at most kk. This can be done in O((Δ/L)d2(L/δ)d(2O(d)k/ε)d)O((\Delta/L)^{d}2^{(L/\delta)^{d}}(2^{O(d)}k/\varepsilon)^{d}) time. For the correct choice of vectors that corresponds to the solution BB^{\ast\ast}, the sum of the vectors we compute is correct up to an additive factor of ε\varepsilon on each coordinate. This means that we compute a solution B1,,BkB_{1},\ldots,B_{k}, with the following property:

i=1kμi(B)(1ε)i=1χμi(B)(1ε)[i=1χ(μi()2Θ(d)δCμi(,F))]( 1ε(8d/L))[i=1χ(μi()2Θ(d)δC)]=( 12ε)[i=1χ(μi()ε)]\sum_{i=1}^{k}\mu_{i}(B)\geq(1-\varepsilon)\sum_{i=1}^{\chi}\mu_{i}(B^{\ast\ast})\geq(1-\varepsilon)\Big{[}\sum_{i=1}^{\chi}\big{(}\,\mu_{i}(\mathcal{B}^{\ast})-2^{\Theta(d)}\delta C-\mu_{i}(\mathcal{B}^{\ast,F})\,\big{)}\Big{]}\\ \geq\big{(}\,1-\varepsilon-(8d/L)\,\big{)}\Big{[}\sum_{i=1}^{\chi}\big{(}\,\mu_{i}(\mathcal{B}^{\ast})-2^{\Theta(d)}\delta C\,\big{)}\Big{]}=\big{(}\,1-2\varepsilon\,\big{)}\Big{[}\sum_{i=1}^{\chi}\big{(}\,\mu_{i}(\mathcal{B}^{\ast})-\varepsilon\,\big{)}\Big{]}

with probability at least 1/2\nicefrac{{1}}{{2}}. Repeating the algorithm O(logn)O(\log n) times and returning the best solution found, results in the high-probability assertion, which concludes the proof.

13 Conclusion and open problems

In this paper we formulated a natural combinatorial optimization framework for incorporating fairness issues in coverage problems and provided a set of approximation algorithms for the general version of the problem as well as its special cases. Of course, it is possible to design other optimization frameworks depending on the particular application in hand, and we encourage researchers to do that. Below we list some future research questions related to our framework:

Eliminating the gap of factor ff in 𝖫𝖯{\mathsf{LP}}-relaxation:

As noted in Section 9.7, all of our 𝖫𝖯{\mathsf{LP}}-relaxations incur a gap of factor ff in the coloring constraints while rounding. It seems non-trivial to close the gap using additional linear inequalities while preserving the same approximation ratio. However, it may be possible to improve the gap using 𝖲𝖣𝖯{\mathsf{SDP}}-relaxations.

Primal-dual schema:

Another line of attack for the Fmc problems is via the primal-dual approach V01 . For example, can the primal-dual approach for partial coverage problem by Gandhi, Khuller and Srinivasan GKS04 be extended to Fmc? A key technical obstacle seems to center around effective interpretation of the dual of the coloring constraints. Our iterated rounding approach was able to go around this obstacle but the case when χ=ω(1)\chi=\omega(1) may be improvable.

Fixed parameter tractability:

As mentioned in Section 4 fixed-parameter tractability issues for kk-node coverage have been investigated by prior researchers such as Marx M08 and Gupta, Lee and Li in GLL18a ; GLL18b . It would be interesting to extend these results to Node-fmc.

Generalizing to non-decreasing submodular set objective functions:

The proofs and proof techniques in this paper do not generalize to the case when the objective function for our Fmc problems is a (more general) non-decreasing submodular set function. It would be interesting to devise new algorithmic techniques and proofs for this more general case. Approximation algorithms for such generalizations for the standard maximum kk-set coverage problem (see Section 4) were provided in KST09 .

References

  • (1) A. Ageev and M. Sviridenko, Pipage Rounding: A New Method of Constructing Algorithms with Proven Performance Guarantee, Journal of Combinatorial Optimization, 8(3), 307-328, 2004.
  • (2) J. Angwin and T. Parris, Facebook Lets Advertisers Exclude Users by Race, Propublica, October 28, 2016 (www.propublica.org/article/facebook-lets-advertisers-exclude-users-by-race).
  • (3) N. Apollonio and B. Simeone, The maximum vertex coverage problem on bipartite graphs, Discrete Applied Mathematics, 165, 37-48, 2014.
  • (4) S. Arora. Polynomial time approximation schemes for euclidean traveling salesman and other geometric problems, Journal of the ACM, 45(5), 753-782, 1998.
  • (5) A. Asudeh, Z. Jin and H. V. Jagadish, Assessing and remedying coverage for a given dataset, 35th35^{\mathrm{th}} annual IEEE International Conference on Data Engineering, 554-565, 2019.
  • (6) P. Austrin, S. Khot and M. Safra, Inapproximability of vertex cover and independent set in bounded degree graphs, Theory of Computing, 7(1), 27-43, 2011.
  • (7) P. Austrin and A. Stankovic, Global Cardinality Constraints Make Approximating Some Max-2-CSPs Harder, in Achlioptas D. and Végh L. A. (eds),, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, Leibniz International Proceedings in Informatics, 145, 24:1-24:17, 2019.
  • (8) N. Bansal, Constructive Algorithms for Discrepancy Minimization, 51st51^{\mathrm{st}} Annual IEEE Symposium on Foundations of Computer Science, 3-10, 2010.
  • (9) J. Beck and T. Fiala, “Integer-making” theorems, Discrete Applied Mathematics, 3(1), 1-8, 1981.
  • (10) S. Bera, D. Chakrabarty, N. Flores and M. Negahbani, Fair Algorithms for Clustering, in Advances in Neural Information Processing Systems, Wallach H., Larochelle H., Beygelzimer A., d'Alché-Buc F., Fox E. and Garnett R. (eds), 32, 4954-4965, 2019.
  • (11) T. Carnes and D. Shmoys, Primal-Dual Schema for Capacitated Covering Problems, in Lodi A., Panconesi A. and Rinaldi G. (eds), Integer Programming and Combinatorial Optimization, Lecture Notes in Computer Science, 5035, Springer, 288-302, 2008.
  • (12) R. D. Carr, L. K. Fleischer, V. J. Leung and C. A. Phillips, Strengthening Integrality Gaps for Capacitated Network Design and Covering Problems, 11th11^{\mathrm{th}} Annual ACM-SIAM Symposium on Discrete Algorithms, 106-115, 2000.
  • (13) B. Chazelle, The Discrepancy Method: Randomness and Complexity, Cambridge University Press, 2000.
  • (14) C. Chekuri, J. Vondrák and R. Zenklusen, Dependent Randomized Rounding via Exchange Properties of Combinatorial Structures, 51st51^{\mathrm{st}} Annual IEEE Symposium on Foundations of Computer Science, 575-584, 2010.
  • (15) F. Chierichetti, R. Kumar, S. Lattanzi and S. Vassilvitskii, Fair Clustering Through Fairlets, in Advances in Neural Information Processing Systems, Guyon I., Luxburg U. V., Bengio S., Wallach H., Fergus R., Vishwanathan S. and Garnett R. (eds), 30, 5029-5037, 2017.
  • (16) B. Doerr, Discrepancy in different numbers of colors, Discrete Mathematics, 250, 63-70, 2002.
  • (17) B. Doerr and A. Srivastav, Approximation of multi-color discrepancy, in Hochbaum D., Jansen K., Rolim J. D. P. and Sinclair A. (eds), Randomization, Approximation and Combinatorial Optimization, Lecture Notes in Computer Science, 1671, Springer, 39-50, 1999.
  • (18) U. Feige, A Threshold of lnn\ln n for Approximating Set Cover, Journal of the ACM, 45(4), 634-652, 1998.
  • (19) U. Feige and M. Langberg, Approximation algorithms for maximization problems arising in graph partitioning, Journal of Algorithms, 41, 174-211, 2001.
  • (20) R. Gandhi, S. Khuller and A. Srinivasan, Approximation algorithms for partial covering problems, Journal of Algorithms, 53(1), 55-84, 2004.
  • (21) M. R. Garey and D. S. Johnson, Computers and Intractability - A Guide to the Theory of NP-Completeness (W. H. Freeman & Co., San Francisco, CA, 1979).
  • (22) J. Guo, R. Niedermeier and S. Wernicke, Parameterized complexity of vertex cover variants, Theory of Computing Systems, 41(3), 501-520, 2007.
  • (23) A. Gupta, E. Lee and J. Li, An FPT algorithm beating 2 approximation for k-cut, 29th29^{\mathrm{th}} Annual ACM-SIAM Symposium on Discrete Algorithms, 2821-2837, 2018.
  • (24) A. Gupta, E. Lee and J. Li, Faster exact and approximate algorithms for k-cut, 59th59^{\mathrm{th}} Annual IEEE Symposium on Foundations of Computer Science, 113-123, 2018.
  • (25) C. Guse, Citi Bike neglects poor NYC neighborhoods and communities of color: report, New York Daily News, July 10, 2019 (https://bit.ly/3aEB4rz)
  • (26) Q. Han, Y. Ye, H. Zhang and J. Zhang, On approximation of max-vertex-cover, European Journal of Operational Research, 143, 342-355, 2002.
  • (27) M. Hardt, E. Price and N. Srebro, Equality of Opportunity in Supervised Learning, 30th30^{\mathrm{th}} International Conference on Neural Information Processing Systems, 3323-3331, 2016.
  • (28) D. S. Hochbaum, Approximating Covering and Packing Problems: Set Cover, Vertex Cover, Independent Set, and Related Problems, in Hochbaum, D. S. (ed.), Approximation Algorithms for NP-Hard Problems, 94-143, (PWS Publishing Company, 1997).
  • (29) A. Hunter, Job ads show sexism still prevalent in most industries, HRD, February 22, 2019 (bit.ly/38eom11).
  • (30) D. Ingold and S. Soper, Amazon Doesn’t Consider the Race of Its Customers. Should It?, Bloomberg, April 21, 2016.
  • (31) K. Jain, A Factor 2 Approximation Algorithm for the Generalized Steiner Network Problem, Combinatorica, 21, 39-60, 2001
  • (32) T. Jan, Redlining was banned 50 years ago. It’s still hurting minorities today, Washington Post, March 28, 2018.
  • (33) A. Kulik, H. Shachnai and T. Tamir, Maximizing Submodular Set Functions Subject to Multiple Linear Constraints, 20th20^{\mathrm{th}} Annual ACM-SIAM Symposium on Discrete Algorithms, 545-554, 2009.
  • (34) M. Langberg, Approximation algorithms for maximization problems arising in graph partitioning, M. Sc. Thesis, Weizmann Institute of Science, 1998.
  • (35) L. C. Lau, R. Ravi and M. Singh, Iterative Methods in Combinatorial Optimization, Cambridge University Press, 2011.
  • (36) J. Lee and C. Lubienski, The impact of school closures on equity of access in Chicago, Education and Urban Society, 49(1), 53-80, 2017.
  • (37) A. Levy, H. Ramadas and T. Rothvoss, Deterministic Discrepancy Minimization via the Multiplicative Weight Update Method, in Eisenbrand F., Koenemann J. (eds), Integer Programming and Combinatorial Optimization, Lecture Notes in Computer Science, 10328, 380-391, Springer, 2017.
  • (38) P. Manurangsi, A Note on Max k-Vertex Cover: Faster FPT-AS, Smaller Approximate Kernel and Improved Approximation 2nd2^{\mathrm{nd}} Symposium on Simplicity in Algorithms, 15:1-15:21, 2019.
  • (39) D. Marx, Parameterized complexity and approximation algorithms, The Computer Journal, 51(1), 60-78, 2008.
  • (40) J. Mitchell. Guillotine subdivisions approximate polygonal subdivisions: A simple polynomial-time approximation scheme for geometric tsp, k-mst, and related problems, SIAM Journal on Computing, 28(4), 1298-1309, 1999.
  • (41) R. Motwani and P. Raghavan, Randomized Algorithms (Cambridge University Press, 1995).
  • (42) M. Mulshine, A major flaw in Google’s algorithm allegedly tagged two black people’s faces with the word ’gorillas’, Business Insider, July 1, 2015.
  • (43) N. Nisan, T. Roughgarden, E. Tardos and V. V. Vazirani, Algorithmic Game Theory, Cambridge University Press, 2007.
  • (44) A. Olteanu, C. Castillo, F. Diaz and E. Kiciman, Social data: Biases, methodological pitfalls, and ethical boundaries, Frontiers in Big Data, 2, pp. 13, 2019.
  • (45) A. Panconesi and A. Srinivasan, Randomized Distributed Edge Coloring via an Extension of the Chernoff–Hoeffding Bounds, SIAM Journal of Computing, 26(2), 350-368, 1997.
  • (46) E. Petrank, The hardness of approximation: Gap location, Computational Complexity, 4, 133-157, 1994.
  • (47) M. Simon, HP looking into claim webcams can’t see black people, CNN, December 23, 2009.
  • (48) J. Spencer, Six standard deviations suffice, Transactions of American Mathematical Society, 289, 679-706, 1985.
  • (49) S. Spencer, Amazon to Bring Same-Day Delivery to Roxbury After Outcry, Bloomberg, April 26, 2016.
  • (50) A. Srinivasan, Distributions on level-sets with applications to approximation algorithms, 42nd42^{\mathrm{nd}} Annual IEEE Symposium on Foundations of Computer Science, 588-597, 2001.
  • (51) A. Srivastav, Multicolour Discrepancies, Combinatorics, Probability and Computing, 12, 365-399, 2003.
  • (52) V. Vazirani, Approximation Algorithms, Springer, 2001.
  • (53) A. Yan and B. Howe, Fairness in Practice: A Survey on Equity in Urban Mobility, Data Engineering Bulletin, 42(4), 49, 2019.
  • (54) R. Zemel, Y. Wu, K. Swersky, T. Pitassi and C. Dwork, Learning Fair Representations, 30th30^{\mathrm{th}} International Conference on Machine Learning, PMLR 28(3), 325-333, 2013.
  • (55) B. H. Zhang, B. Lemoine and M. Mitchell, Mitigating Unwanted Biases with Adversarial Learning, AAAI/ACM Conference on AI, Ethics, and Society, 335-340, 2018.

Appendix

Proof of Lemma 1

(a) We describe the proof for χ=2\chi=2; generalization to χ>2\chi>2 is obvious. The reduction is from the Exact Cover by 3-sets (X3C) problem which is defined as follows. We are given an universe 𝒰={u1,,un}\mathcal{U}^{\prime}=\{u_{1},\dots,u_{n^{\prime}}\} of nn^{\prime} elements for some nn^{\prime} that is a multiple of 33, and a collection of nn^{\prime} subsets 𝒮1,,𝒮n\mathcal{S}_{1},\dots,\mathcal{S}_{n^{\prime}} of 𝒰\mathcal{U} such that j=1n𝒮j=𝒰\bigcup_{j=1}^{n^{\prime}}\mathcal{S}_{j}=\mathcal{U}, every element of 𝒰\mathcal{U}^{\prime} occurs in exactly 33 sets and |𝒮j|=3|\mathcal{S}_{j}|=3 for j=1,,nj=1,\dots,n^{\prime}. The goal is to decide if there exists a collection of n/3\nicefrac{{n^{\prime}}}{{3}} (disjoint) sets whose union is 𝒰\mathcal{U}^{\prime}. X3C is known to be 𝖭𝖯\mathsf{NP}-complete GJ79 . Given an instance 𝒰,𝒮1,,𝒮n\langle\mathcal{U}^{\prime},\mathcal{S}_{1},\dots,\mathcal{S}_{n^{\prime}}\rangle of X3C as described, we create the following instance 𝒰,𝒮1,,𝒮n+1,k\langle\mathcal{U},\mathcal{S}_{1},\dots,\mathcal{S}_{n^{\prime}+1},k\rangle of Fmc(2,k2,k):

  1. (i)

    The universe is 𝒰={u1,,un}{un+1,,u2n}\mathcal{U}=\{u_{1},\dots,u_{n^{\prime}}\}\cup\{u_{n^{\prime}+1},\dots,u_{2n^{\prime}}\} (and thus n=2nn=2n^{\prime}),

  2. (ii)

    w(uj)=1w(u_{j})=1 for j=1,,2nj=1,\dots,2n^{\prime},

  3. (iii)

    the sets are 𝒮1,,𝒮n\mathcal{S}_{1},\dots,\mathcal{S}_{n^{\prime}} and a new set 𝒮n+1={un+1,,u2n}\mathcal{S}_{n^{\prime}+1}=\{u_{n^{\prime}+1},\dots,u_{2n^{\prime}}\},

  4. (iv)

    the coloring function is given by 𝒞(uj)={1,if 1jn2,otherwise\mathcal{C}(u_{j})=\left\{\begin{array}[]{r l}1,&\mbox{if $1\leq j\leq n^{\prime}$}\\ 2,&\mbox{otherwise}\end{array}\right., and

  5. (v)

    k=n3+1=n6+1k=\frac{n^{\prime}}{3}+1=\frac{n}{6}+1.

Clearly, every element of 𝒰\mathcal{U} occurs in no more than 33 sets and all but the set 𝒮n+1\mathcal{S}_{n^{\prime}+1} contains exactly 33 elements. The proof is completed once the following is shown:

(\ast) the given instance of X3C has a solution if and only if the transformed instance of Fmc(2,1+n/62,1+\nicefrac{{n}}{{6}}) has a solution.

A proof of (\ast) is easy: since the set 𝒮n+1\mathcal{S}_{n^{\prime}+1} must appear in any valid solution of Fmc, a solution 𝒮i1,,𝒮in/3\mathcal{S}_{i_{1}},\dots,\mathcal{S}_{i_{n^{\prime}/3}} of X3C corresponds to a solution 𝒮i1,,𝒮in/3,𝒮n+1\mathcal{S}_{i_{1}},\dots,\mathcal{S}_{i_{n^{\prime}/3}},\mathcal{S}_{n^{\prime}+1} of Fmc(2,k2,k) and vice versa.


(b) The proof is similar to that in (a) but now instead of X3C we reduce the node cover problem for cubic (i.e., 33-regular) graphs (VC3) which is defined as follows: given a cubic graph G=(V,E)G=(V,E) of nn^{\prime} nodes and 3n/23n^{\prime}/2 edges and an integer kk^{\prime}, determine if there is a set of kk^{\prime} nodes that cover all the edges. VC3VC_{3} is known to be 𝖭𝖯\mathsf{NP}-complete even if GG is planar GJ79 . For the translation to an instance of Fmc(2,k2,k), edges of GG are colored with color 11, we add a new connected component 𝒦(3n/2)+1\mathcal{K}_{(3n^{\prime}/2)+1} to GG that is a complete graph of (3n/2)+1(3n^{\prime}/2)+1 nodes with every edge having color 22, transform this to the set-theoretic version of Fmc using the standard transformation from node cover to set cover and set k=k+1k=k^{\prime}+1; note that n=3n/2+((3n/2)+12)=Θ((n)2)n=3n^{\prime}/2+\binom{(3n^{\prime}/2)+1}{2}=\Theta((n^{\prime})^{2}) and a=3n/2=O(n)a=3n^{\prime}/2=O(\sqrt{n}\,). To complete the proof, note that any feasible solution for the Fmc(2,k2,k) instance must contain exactly one node from 𝒦(3n/2)+1\mathcal{K}_{(3n^{\prime}/2)+1} covering 3n/23n^{\prime}/2 edges and therefore the solution for the edges with color 11 must correspond to a node cover in GG (and vice versa).


(c) We given a different reduction from X3C. Given an instance 𝒰,𝒮1,,𝒮n\langle\mathcal{U}^{\prime},\mathcal{S}_{1},\dots,\mathcal{S}_{n^{\prime}}\rangle of X3C as in (a), we create the following instance 𝒰,𝒯1,,𝒯n,k\langle\mathcal{U},\mathcal{T}_{1},\dots,\mathcal{T}_{n^{\prime}},k\rangle of Fmc(n,kn^{\prime},k):

  1. (i)

    For every set 𝒮i={ui1,ui2,ui3}\mathcal{S}_{i}=\big{\{}u_{i_{1}},u_{i_{2}},u_{i_{3}}\big{\}} of X3C we have three elements ui1i,ui2i,ui3iu_{i_{1}}^{i},u_{i_{2}}^{i},u_{i_{3}}^{i} and a set 𝒯i={ui1i,ui2i,ui3i}\mathcal{T}_{i}=\big{\{}u_{i_{1}}^{i},u_{i_{2}}^{i},u_{i_{3}}^{i}\big{\}} in Fmc (and thus n=3nn=3n^{\prime}, a=3a=3 and f=1f=1),

  2. (ii)

    w(uiji)=1w(u_{i_{j}}^{i})=1 and 𝒞(uiji)=ij\mathcal{C}(u_{i_{j}}^{i})=i_{j} for i{1,,n},j{1,2,3}i\in\{1,\dots,n^{\prime}\},j\in\{1,2,3\} (and thus χ=n=n/3\chi=n^{\prime}=\nicefrac{{n}}{{3}}),

  3. (iii)

    k=n/3=n/9k=\nicefrac{{n^{\prime}}}{{3}}=\nicefrac{{n}}{{9}}.

The proof is completed by showing the given instance of X3C has a solution if and only if the transformed instance of Fmc(n/3,n/9\nicefrac{{n}}{{3}},\nicefrac{{n}}{{9}}) has a solution. This can be shown as follows. We include the set 𝒯i\mathcal{T}_{i} in the solution for Fmc if and only if the set 𝒮i\mathcal{S}_{i} is in the solution for X3C. For any valid solution of X3C and every j{1,,n}j\in\{1,\dotsc,n^{\prime}\} the element uj𝒰u_{j}\in\mathcal{U}^{\prime} appears in exactly one set, say 𝒮={u1,u2,u3}\mathcal{S}_{\ell}=\big{\{}u_{\ell_{1}},u_{\ell_{2}},u_{\ell_{3}}\big{\}}, of X3C where one of the elements, say u1u_{\ell_{1}}, is uju_{j}. Then, the solution of Fmc contains exactly one element, namely the element u1u_{\ell_{1}}^{\ell}, of color 1=j\ell_{1}=j. Conversely, given a feasible solution of Fmc with at most kn/9k\leq\nicefrac{{n}}{{9}} sets, first note that if k<n/9k<\nicefrac{{n}}{{9}} then the total number of colors of various elements in the solution is 3k<n3k<n^{\prime} and thus the given solution is not valid. Thus, k=n/9k=\nicefrac{{n}}{{9}} and therefore the solution of X3C contains n/9=n/3\nicefrac{{n}}{{9}}=\nicefrac{{n^{\prime}}}{{3}} sets. Now, for every color jj the solution of Fmc contains a set, say 𝒯={u1,u2,u3}\mathcal{T}_{\ell}=\big{\{}u_{\ell_{1}}^{\ell},u_{\ell_{2}}^{\ell},u_{\ell_{3}}^{\ell}\big{\}}, containing an element of color jj, say the element u1u_{\ell_{1}}^{\ell}. Then 1=j\ell_{1}=j and the element uju_{j} appears in a set in the solution of X3C. To see that remaining claims about the reduction, there is no solution of Fmc that includes at least one element of every color and that is not a solution of X3C.