∎
22email: [email protected] 33institutetext: Tanya Berger-Wolf 44institutetext: Department of Computer Science and Engineering, Ohio State University, Columbus, OH 43210, USA
44email: [email protected] 55institutetext: ∗Bhaskar DasGupta (corresponding author) 66institutetext: Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA
Tel.: +312-355-1319
Fax: +312-413-0024
66email: [email protected] 77institutetext: Anastasios Sidiropoulos 88institutetext: Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA
88email: [email protected]
Maximizing coverage while ensuring fairness: a tale of conflicting objectives
Abstract
Ensuring fairness in computational problems has emerged as a key topic during recent years, buoyed by considerations for equitable resource distributions and social justice. It is possible to incorporate fairness in computational problems from several perspectives, such as using optimization, game-theoretic or machine learning frameworks. In this paper we address the problem of incorporation of fairness from a combinatorial optimization perspective. We formulate a combinatorial optimization framework, suitable for analysis by researchers in approximation algorithms and related areas, that incorporates fairness in maximum coverage problems as an interplay between two conflicting objectives. Fairness is imposed in coverage by using coloring constraints that minimizes the discrepancies between number of elements of different colors covered by selected sets; this is in contrast to the usual discrepancy minimization problems studied extensively in the literature where (usually two) colors are not given a priori but need to be selected to minimize the maximum color discrepancy of each individual set. Our main results are a set of randomized and deterministic approximation algorithms that attempts to simultaneously approximate both fairness and coverage in this framework.
Keywords:
fairness maximum coverage combinatorial optimization approximation algorithms1 Introduction
In this paper we introduce and analyze a combinatorial optimization framework capturing two conflicting objectives: optimize the main objective while trying to ensure that the selected solution is as fair as possible. We illustrate the framework with the following simple graph-theoretic illustration. Consider the graph of nodes and edges as shown in Fig. 1 where each edge is colored from one of colors (red, blue or green) representing three different attributes. Suppose that we want to select exactly nodes that maximizes the number of edges they “cover” subject to the “fairness” constraint that the proportion of red, blue and green edges in the selected edges are the same. An optimal solution is shown in Fig. 1 by the solid black nodes covering edges; Fig. 1 also shows that the solution is quite different from what it would have been (the yellow corner nodes covering edges) if the fairness constraint was absent. A simple consequence of the analysis of our algorithms for a more general setting is that, assuming that there exists at least one feasible solution and assuming is large enough, we can find a randomized solution to this fair coverage problem for graphs where we select exactly nodes, cover at least of the optimal number of edges on an average and, for every pair of colors, with high probability the ratio of the number of edges of these two colors among the selected edges is .
In this paper we consider this type of problem in more general settings. Of course, in the example in Fig. 1 (and in general) there is nothing special about requiring that the proportion of red, blue and green edges in the covered edges should be exactly equal as opposed to a pre-specified unequal proportion. For example, we may also require that the proportion of edges of different colors in our solution should mimic that in the entire graph, i.e., in Fig. 1 among the covered edges the proportion of red, blue and green edges should be and where , , and . Our algorithms will work with easy modifications for any constant values of , and .
1.1 Different research perspectives in ensuring fairness
Theoretical investigations of ensuring fairness in computation can be pursued from many perspectives. We briefly comment on a few of them.
One line of research dealing with the goal of ensuring fairness uses the optimization framework, i.e., we model the problem as an optimization problem with precisely defined fairness constraints. This is a common framework used by researchers in combinatorial and graph-theoretic algorithms, such as research works that involve designing exact or approximation algorithms, investigating fixed-parameter tractability issues or proving inapproximability results. In this paper we use such a framework. Fairness is imposed in coverage by using coloring constraints that minimizes the discrepancies between different colors among elements covered by selected sets; this is in contrast to the usual discrepancy minimization problems studied extensively in the literature C00 where the (usually two) colors are not given a priori but need to be selected to minimize the maximum color discrepancy of each individual set.
A second line of research dealing with fairness involves machine learning frameworks. Even though it is a relatively new research area, there is already a large body of research dealing with ensuring fairness in machine learning algorithms by preprocessing the data used in the algorithms, optimization of statistical outcomes with appropriate fairness criteria and metric during the training, or by post-processing the answers of the algorithms Ze13 ; HPS16 ; ZLM18 .
A third line of research dealing with fairness involves game theoretic frameworks. For example, developments of solutions for fair ways of sharing transferable utilities in cooperative game-theoretic environments have given rise to interesting concepts such as Shapley values and Rabin’s fairness model. We refer the reader to the excellent textbook in algorithmic game theory by Nisan et al. NRTV07 for further details on these research topics.
Yet another more recent line of research dealing with fairness involve applying fairness criteria in the context of clustering of points in a metric space under -means objective, -median or other -norm objectives BCFN19 ; CKLV17 . The assumption of an underlying metric space allows the development of efficient algorithms for these frameworks.
2 Fair maximum coverage: notations, definitions and related concepts
The Fair Maximum Coverage problem with colors is defined as follows. We are given an universe of elements, a weight function assigning a non-negative weight to every element, a color function assigning a color to every element, a collection of sets , and a positive integer . A collection of distinct subsets, say , with the set of “covered” elements containing elements of color is considered a valid solution111For a more general version of the problem we are given “color-proportionality constants” with , and a valid solution must satisfy for all and . As we mentioned already, with suitable modifications our algorithms will work with similar asymptotic performance guarantee for any constant values of , but to simplify exposition we will assume the simple requirement of in the sequel. provided for all and . The objective is to maximize the sum of weights of the covered elements. More explicitly, our problem is defined as follows:
Problem name: | Fair Maximum Coverage with colors (Fmc()) | ||||
---|---|---|---|---|---|
Input: | universe | ||||
(element) weight function | |||||
(element) color function | |||||
sets | |||||
integer | |||||
Valid solution: | collection of distinct subsets satsifying | ||||
|
|||||
Objective: | maximize |
We denote Fmc() by just Fmc when and are clear from the context. In the sequel, we will distinguish between the following two versions of the problem:
-
(i)
unweighted Fmc in which for all and thus the objective is to maximize the number of elements covered, and
-
(ii)
weighted Fmc in which for all .
For the purpose of stating and analyzing algorithmic performances, we define the following notations and natural parameters associated with an instance of Fmc():
-
denotes the maximum of the cardinalities (number of elements) of all sets.
-
denotes the maximum of the frequencies of all elements, where the frequency of an element is the number of sets in which it belongs.
-
denotes the optimal objective value of the given instance of Fmc.
-
denotes the number of covered elements in an optimal solution of the given instance of Fmc. For weighted Fmc, if there are multiple optimal solutions then will the maximum number of elements covered among these optimal solutions. Note that for unweighted Fmc. The reason we need to consider separately from for weighted Fmc is because the coloring constraints are tied to whereas the optimization objective is tied to .
-
The performance ratios of many of our algorithms are expressed using the function :
Note that for and for all .
For -completeness results, if the problem is trivially in then we will not mention it. To analyze our algorithms in this paper, we have used several standard mathematical equalities or inequalities which are listed explicitly below for the convenience of the reader:
(1) | |||
(2) | |||
(3) | |||
(4) |
2.1 Three special cases of the general version of Fmc
In this subsection we state three important special cases of the general framework of Fmc.
Fair maximum -node coverage or Node-fmc
This captures the scenario posed by the example in Fig. 1. We are given a connected undirected edge-weighted graph where denotes the weight assigned to edge , a color function assigning a color to every edge, and a positive integer . A node is said to cover an edge if is incident on . A collection of nodes covering edges of color for each is considered a valid solution provided for all and . The objective is to maximize the sum of weights of the covered edges. It can be easily seen that this is a special case of Fmc by using the standard translation from node cover to set cover, i.e., the edges are the set of elements, and corresponding to every node there is a set containing the edges incident on . Note that for this special case and is equal to the maximum node-degree in the graph.
Segregated Fmc or Segr-fmc
Segregated Fmc is the special case of Fmc when all the elements in any set have the same color, i.e.,
Another equivalent way of describing Segr-fmc is as follows. Let is the set of all elements colored for in the given instance of Segr-fmc. Let the notation denote the power set for any set . Then, Segr-fmc is the special case when holds for all . A simple example of an instance of Segr-fmc with , and is shown below:
Even though computing an exact solution of Segr-fmc is still -complete, it is much easier to approximate (see Section 11.1). From our application point of view as discussed in Section 3, this may for example model cases in which city neighborhoods are segregated in some manner, e.g., racially or based on income.
-balanced Fmc or -bal-fmc
-balanced Fmc is the special case of Fmc when the number of elements of each color in a set are within an additive range of , i.e.,
Similar to Segr-fmc, it is much easier to approximate -bal-fmc for small (see Section 11.2).
Geometric Fmc or Geom-fmc
In this unweighted “geometric” version of Fmc, the elements are points in for some and some constant , the sets are unit radius balls in , and the distributions of points of different colors are given by Lipschitz-bounded measures. More precisely, the distribution of points of color is given by a probability measure supported on with a -Lipschitz density function222A function for some subset of real numbers is -Lipschitz provided for all real numbers . for some that is upper-bounded by . Given a set of unit balls , the number of points of color covered by these balls is given by , and the total number of points covered by these balls is given by . This variant has an almost optimal polynomial-time approximation algorithm for fixed and under some mild assumption on (see Section 12).
3 Sketch of application scenarios
Fmc and its variants are core abstractions of many data-driven societal domain applications. We present three diverse categories of application and highlight the real-world fairness issues addressed by our problem formulations (leaving other applications in the cited references).
Service/Facility Allocation
One of the most common data based policy decisions is assigning services/facilities across different places, e.g., placing schools ChicagoSchoolClosure , bus stops, or police/fire stations, choosing a few hospitals for specific medical facilities or services, or deciding where to put cell-phone towers. Of course, a major objective in such assignments is to serve the maximum number of people (i.e., maximize the coverage). Unfortunately, historical discriminations, such as redlining redlining , through their long drawn-out effects of manifestations in different aspects of public policy are still hurting the minorities. As a result, blindly optimizing for maximum coverage biases the assignment against equitable distribution of services. Below are examples of two real cases that further underline the importance of fairness while maximizing coverage:
-
Bike sharing: As more and more cities adopt advanced transportation systems such as bike-sharing, concerns such as equity and fairness arise with them yan2019fairness . For instance, according to BikeShare1 the bike-sharing network at NYC neglects many low-income neighborhoods and communities of color while giving the priority to well-to-do neighborhoods. Here the location of bike stations (or bikes) determines the set of people that will have access to the service, perpetuating the unhealthy cycle of lack of transportation, movement, etc.
-
Delivery services for online shopping: Online shopping has by now gained a major share of the shopping market. Platforms such as Amazon provide services such as same-day delivery to make e-shopping even more convenient to their customers. While Amazon’s main aim is to maximize the number of customers covered by this service, by not considering fairness it demonstrably failed to provide such services for predominantly black communities AmazonDeliveryBias1 ; AmazonDeliveryBias2 .
Data Integration
Combining multiple data sources to augment the power of any individual data source is a popular method for data collection. Naturally the main objective of data integration is to collect (“cover”) a maximum number of data points. However, failing to include an adequate number of instances from minorities, known as population bias, in datasets used for training machine learning models is a major reason for model unfairness olteanu2019social ; asudeh2019assessing . For example, image recognition and motion detection services by Google google-gorilla and HP hp1 with a reasonable overall performance failed to tag/detect African Americans since their training datasets did not include enough instances from this minority group. While solely optimizing for coverage may result in biased datasets, considering fairness for integration may help remove population bias.
Targeted advertisement
Targeted advertising is popular in social media. Consider a company that wants to target its “potential customers”. To do so, the company needs to select a set of features (such as “single” or “college student”) that specify the groups of users to be targeted. Of course, the company wants to maximize coverage over its customers. However, solely optimizing for coverage may result in incidents such as racism in the Facebook advertisements AdRacism or sexism in the job advertisements adSexism . Thus, a desirable goal for the company would be to select the keywords such that it provides fair coverage over users of diverse demographic groups.
4 Review of prior related works
To the best of our knowledge, Fmc in its full generalities has not been separately investigated before. However, there are several prior lines of research that conceptually intersect with Fmc.
Maximum -set coverage and -node coverage problems
The maximum -set coverage and -node coverage problems are the same as the Fmc and Node-fmc problems, respectively, without element colors and without coloring constraints. These problems have been extensively studied in the algorithmic literature, e.g., see AS04 ; F98 ; Ho97 for -set coverage and GLL18a ; GLL18b ; GNW07 ; M18 ; FL01 ; AS14 ; HYZZ02 for -node coverage. A summary of these results are as follows:
- -set coverage:
- -node coverage:
-
The best approximation for -node coverage is a randomized algorithm that has an approximation ratio of with high probability FL01 ; HYZZ02 . On the inapproximability side, -node coverage is -complete even for bipartite graphs AS14 , and cannot be approximated within a ratio of for some (small) constant L98 ; P94 . More recently, Manurangsi M18 provided an semidefinite programming based approximation algorithm with an approximation ratio of , and Austrin and Stankovic AS19 used the results in AKS11 to provide an almost matching upper bound of (for any ) on the approximation ratio of any polynomial time algorithm assuming the unique games conjecture is true. There is also a significant body of prior research on the fixed parameter tractability issues for the -node coverage problem: for example, -node coverage is unlikely to allow an FPT algorithm as it is -hard GNW07 , but Marx designed an FPT approximation scheme in M08 whose running times were subsequently improved in Gupta, Lee and Li in GLL18a ; GLL18b .
However, the coloring constraints make Fmc fundamentally different from the maximum set or node coverage problems. Below we point out some of the significant aspects of these differences. For comparison purposes, for an instance of Fmc let denote the objective value of an optimal solution for the corresponding maximum -set coverage problem for this instance by ignoring element colors and coloring constraints.
- Existence of a feasible solution:
-
For the maximum -set coverage problem, a feasible solution trivially exists for any . However, a valid solution for Fmc() may not exist for some or all even if and in fact our results (Lemma 1) show that even deciding if there exists a valid solution is -complete. The -completeness result holds even if (i.e., the sets are mutually disjoint); note that if then it is trivial to compute an optimal solution to the maximum -set coverage problem. That is why for algorithmic purposes we will assume the existence of at least one feasible solution333Actually, our -relaxation based algorithms require only the existence of a feasible fractional solution but we cannot say anything about the approximation ratio in the absence of a feasible integral solution. and for showing computational hardness results we will show the existence of at least one trivial feasible solution.
- Number of covered elements:
-
The number of covered elements and the corresponding selected sets in an optimal solution in Fmc can differ vastly from that in the maximum -set coverage problem on the same instance. The reason for the discrepancy is because in Fmc one may need to select fewer covered elements to satisfy the coloring constraints.
- Exactly sets vs. at most sets:
-
For the maximum -set coverage problem any solution trivially can use exactly sets and therefore there is no change to the solution space whether the problem formulation requires exactly sets or at most sets. However, the corresponding situation for Fmc is different since it may be non-trivial to convert a feasible solution containing sets to one containing exactly sets because of the coloring constraints.
Discrepancy minimization problems
Informally, the discrepancy minimization problem for set systems (Min-disc) is orthogonal to unweighted Fmc. Often Min-disc is studied in the context of two colors, say red and blue, and is defined as follows. Like unweighted Fmc we are given sets over elements. However, unlike Fmc element colors are not given a priori but the goal to color every element red or blue to minimize the maximum discrepancy over all sets, where the discrepancy of a set is the absolute difference of the number of red and blue elements it contains. The Beck-Fiala theorem BF81 shows that the discrepancy of any set system is at most , Spencer showed in S85 that the discrepancy of any set system is , Bansal provided a randomized polynomial time algorithm achieving Spencer’s bound in B10 , and a deterministic algorithm with similar bounds were provided in LRR17 . On the lower bound side, it is possible to construct set systems such that the discrepancy is C00 . For generalization of the formulation to more than two colors and corresponding results, see for example D02 ; DS99 ; S03 .
Maximization of non-decreasing submodular set functions with linear inequality constraints
Kulik et al. in KST09 provided approximation algorithms for maximizing a non-decreasing submodular set function subject to multiple linear inequality constraints over the elements. Unfortunately, because the linear constraints in Segr-fmc are equality constraints, Segr-fmc cannot be put in the framework of KST09 and the approximation algorithms in KST09 do not directly apply to Segr-fmc.
5 Summary of our contribution
5.1 Feasibility hardness results
Obviously Fmc (resp., Node-fmc) obeys all the inapproximability results for the maximum -set coverage (resp., -node coverage) problem. We show in Lemma 1 that determining feasibility of Fmc instances is -complete even under very restricted parameter values; the proofs cover (or can be easily modified to cover) all the spacial cases of Fmc investigated in this paper. However, our subsequent algorithmic results show that even the existence of one feasible solution gives rise to non-trivial approximation bounds for the objective and the coloring constraints.
5.2 Algorithmic results
A summary of our algorithmic results is shown in Table 1. Based on the discussion in the previous section, all of our algorithms assume that at least one feasible solution for the Fmc instance exists.
|
|
theorem | |||||||||||||||||||||||
|
|
|
|
|
|
other | |||||||||||||||||||
Fmc |
|
— | Theorem 9.1 | ||||||||||||||||||||||
|
|
— | Theorem 9.1 | ||||||||||||||||||||||
|
— | Theorem 9.1 | |||||||||||||||||||||||
Alg-iter-round | — |
|
Theorem 10.2 | ||||||||||||||||||||||
— | — |
|
Theorem 10.2 | ||||||||||||||||||||||
Node-fmc | Alg-iter-round | — |
|
Theorem 10.1 | |||||||||||||||||||||
— | — |
|
Theorem 10.1 | ||||||||||||||||||||||
Other results: same as Fmc with | |||||||||||||||||||||||||
Segr-fmc |
|
— | — |
|
Theorem 11.1 | ||||||||||||||||||||
-bal-fmc |
|
— | — | Proposition 2 | |||||||||||||||||||||
Geom-fmc |
|
— | — |
|
Theorem 12.1 | ||||||||||||||||||||
: deterministic algorithm : randomized algorithm | |||||||||||||||||||||||||
: not applicable —: unrestricted | |||||||||||||||||||||||||
: any constant in the range (0,1] : any constant strictly greater than |
5.3 Remarks on proof techniques
Distributions on level sets with negative correlations
Our randomized algorithms use the sampling result by Srinivasan S01 which allows one to sample variables satisfying an equality precisely while still ensuring that the variables are negatively correlated and therefore the tail bounds by Panconesi and Srinivasan PS97 can be applied. This allows the randomized algorithms in Theorem 9.1 to select precisely sets while still preserving the properties of the distribution of variables that are needed for the proof.
Strengthening -relaxation via additional inequalities
As we show in Section 9.2, a straightforward -relaxation of Fmc based on a corresponding known -relaxation of maximum -set coverage problems does not have an finite integrality gap and therefore unsuitable for further analysis. To get around this, we use an approach similar to what was used by prior researchers (e.g., see the works by Carnes and Shmoys CD08 and Carr et al. CFLP00 ) by introducing extra covering inequalities which brings down the integrality gap and allows the results in Theorem 9.1 to go through.
Moreover, we had to separately modify existing constraints of or add new constraints to the basic -relaxation for the three algorithms, namely algorithms Alg-small-opt#, Alg-medium-opt# and Alg-iter-round. Modifications for Alg-small-opt# and Alg-medium-opt# in Theorem 9.1 are done to encode the coloring constraints suitably to optimize their coloring constraint approximation bounds for the corresponding parameter ranges. The modifications for Alg-iter-round in Theorem 10.1 and Theorem 10.2 are necessary for the iterated rounding approach to go through.
Doob martingales and Azuma’s inequality
The analysis of the rounding step of our various -relaxations are further complicated by the fact that the random element-selection variables may not be pairwise independent; in fact, it is easy to construct examples in which each element-selection variable may be correlated to about other element-selection variables, thereby ruling out straightforward use of Chernoff-type tail bounds. For sufficient large , we remedy this situation by using Doob martinagales and Azuma’s inequality in the analysis of Alg-large-opt# in Theorem 9.1.
Iterated rounding of -relaxation
The analysis of our deterministic algorithm Alg-iter-round uses the iterated rounding approach originally introduced by Jain in J01 and subsequently used by many researchers (the book by Lau, Ravi and Singh LRS11 provides an excellent overview of the topic). A crucial ingredient of this technique used in our proof is the rank lemma. In order to use this technique, we had to modify the -relaxation again. When , we can do two exhaustive enumeration steps in polynomial time, giving rise to a somewhat better approximation of the coloring constraints.
Random shifting technique
6 Organization of the paper and proof structures
The rest of the paper is organized as follows.
-
In Section 9 we design and analyze our -relaxation based randomized approximation algorithms for Fmc. In particular, in Theorem 9.1 we employ two different -relaxation of Fmc and combine three randomized rounding analysis on them to get an approximation algorithm whose approximation qualities depend on the range of relevant parameters.
-
In Section 10 we provide polynomial-time deterministic approximations of Fmc via iterated rounding of a new -relaxation. Our approximation qualities depend on the parameters and .
-
In Section 11 we provide deterministic approximation algorithms for two special cases of Fmc, namely Segr-fmc and -bal-fmc.
-
Section 12 provides the deterministic approximation for Geom-fmc.
Our proofs are structured as follows. A complex proof is divided into subsections corresponding to logical sub-divisions of the proofs and the algorithms therein. Often we provide some informal intuitions behind the proofs (including some intuition about why other approaches may not work, if appropriate) before describing the actual proofs.
7 Computational hardness of finding a feasible solution of Fmc
We show that determining if a given instance of Fmc has even one feasible solution is -complete even in very restricted parameter settings. The relevant parameters of importance for Fmc is , and ; Lemma 1 shows that the -completeness result holds even for very small values of these parameters.
Lemma 1
Determining feasibility of an instance of Fmc of elements is -complete even with the following restrictions:
-
the instances correspond to the unweighted version,
-
the following combinations of maximum set-size , frequency and number of colors are satisfied:
-
(a)
, all but one set contains exactly elements and all ,
-
(b)
the instances correspond to Node-fmc (which implies ), , and all , or
-
(c)
, and .
-
(a)
Moreover, the following assertions also hold:
-
The instances of Fmc generated in (a) and (b) actually are instances of Segr-fmc.
-
For the instances of Fmc generated in (c), and, assuming , there is no polynomial time approximation algorithm that has either a finite approximation ratio or satisfies the coloring constraints in the -approximate sense cf. eq. (5) for any finite .
A proof of Lemma 1 appears in the appendix.
Remark 1
It may be tempting to conclude that an approach similar to what is stated below using the -set coverage problem as a “black box” may make the claims in Lemma 1 completely obvious. We simply take any hard instance of -set coverage and equi-partition the universe arbitrarily into many color classes and let this be the corresponding instance of Fmc. Using a suitable standard reductions of -hardness the -set coverage problem, if there is a feasible solution of Fmc then the sets trivially cover the entire universe and thus trivially satisfy the color constraints but otherwise one may (incorrectly) claim that no sets cover the universe and so the fairness constraints cannot be satisfied. Additionally, one may be tempted to argue that if one takes a -set coverage problem instance with any additional structure (e.g., bounded occurrence of universe elements) then the coloring does not affect this additional property at all and hence the property is retained in the -set coverage problem instance with color constraints.
However, such a generic reduction will fail because it is incorrect and because it will not capture all the special parameter restrictions imposed in Lemma 1. For example:
-
Even though the sets may not cover the entire universe, it is still possible that they may satisfy the color constraints. For example, consider the following instance of the -set coverage problem:
Suppose we select the equi-partition , thus setting and . Then any two selected sets will satisfy the coloring constraints.
-
Consider the requirements of the Fmc instances in part (c). Since , every element occurs in exactly one set and thus the set systems for the -set coverage problem form a partition of the universe. Such an instance cannot be a hard instance of -set coverage since it admits a trivial polynomial time solution: sort the sets in non-decreasing order of their cardinalities and simply take the first sets.
8 Relaxing coloring constraints for algorithmic designs
Based on Lemma 1 we need to make the following minimal assumptions for the purpose of designing approximation algorithms with finite approximation ratios:
-
(i)
We assume the existence of at least one feasible solution for the given instance of Fmc.
-
(ii)
We assume that is sufficiently large compared to , e.g., for some large constant .
Lemma 1 and the example in Fig. 1 also show that satisfying the color constraint exactly (i.e., requiring to be exactly equal to for all and ) need to be relaxed for the purpose of designing efficient algorithms since non-exact solutions of Fmc may not satisfy these constraints exactly. We define an (deterministic) -approximate coloring of Fmc (for some ) to be a coloring that satisfies the coloring constraints in the following manner:
(5) |
Note that (5) automatically implies that for all and . Thus, in our terminology, a -approximate coloring corresponds to satisfying the coloring constraints exactly. Finally, if our algorithm is randomized, then the ’s could be a random values, and then we will assume that the relevant constraints will be satisfied in expectation or with high probability in an appropriate sense. More precisely, (5) will be modified as follows:444We do not provide a bound on since when and may be zero with a strictly positive probability, and for arbitrary selecting a set individually for each to avoid this situation in our randomized algorithms may select too many sets.
Unless otherwise stated explicitly, our algorithms will select exactly sets.
9 -relaxation based randomized approximation algorithms for Fmc
If is a constant then we can solve Fmc() exactly in polynomial (i.e., ) time by exhaustive enumeration, so we assume that is at least a sufficiently large constant. In this section we will employ two slightly different -relaxation of Fmc and combine three randomized rounding analysis on them to get an approximation algorithm whose approximation qualities depend on the range of various relevant parameters. The combined approximation result is stated in Theorem 9.1. In the proof of this theorem no serious attempt was made to optimize most constants since we are mainly interested in the asymptotic nature of the bounds, and to simplify exposition constants have been over-estimated to get nice integers. In the statement of Theorem 9.1 and in its proof we will refer to the three algorithms corresponding to the two -relaxations as Alg-small-opt#, Alg-medium-opt# and Alg-large-opt#.
Theorem 9.1
Suppose that the instance of Fmc() has elements and sets. Then, we can design three randomized polynomial-time algorithms Alg-small-opt#, Alg-medium-opt# and Alg-large-opt# with the following properties:
-
(a)
All the three algorithms select sets with probability .
-
(b)
All the three algorithms are randomized -approximation for Fmc, i.e., the expected total weight of the selected elements for both algorithms is at least times .
-
(c)
All the three algorithms satisfy the randomized -approximate coloring constraints cf. Inequality (5) for , i.e., for all , .
-
(d)
The algorithms satisfy the strong randomized -approximate coloring constraints cf. Equation (5)′′, i.e., for the values of , and as shown below:
range of range of algorithm (i) unrestricted Alg-large-opt# (ii) and unrestricted Alg-medium-opt# (iii) unrestricted Alg-small-opt#
Remark 2
Remark 3
The dependence on of the bounds for in Theorem 9.1(d)(i)–(ii) can be contrasted with the Beck-Fiala theorem in discrepancy minimization that shows that the discrepancy of any set system is at most .
Remark 4
Consider the special case Node-fmc with : for this case and is equal to the maximum node-degree in the graph. The bounds in Theorem 9.1(d)(i)–(ii) for this special case imply a -approximation of color constraints unless is not sufficiently large compared to . To illustrate the bound for smaller in Theorem 9.1(d)(iii), if for some then the approximation bound of the coloring constraints is .
A proof of Theorem 9.1 is discussed in the remaining subsections of this section. The following notations will be used uniformly throughout the proof.
-
and are the usual indicator variables for the elements and the sets , respectively, Their values in an optimal solution of the -relaxation under consideration will be denoted by and , respectively.
-
is the set of all elements colored for in the given instance of Fmc.
-
is the optimum value of the objective function of the -relaxation under consideration.
9.1 An obvious generalization of -relaxation of maximum -set coverage fails
maximize | ||
---|---|---|
subject to | for | |
for | ||
for |
It is well-known that the -relaxation for the element-weighted maximum -set coverage problem as shown in Fig. 2 followed by a suitable deterministic or randomized rounding provides an optimal approximation algorithm for the problem (e.g., see AS04 ; MR95 ). A straightforward way to extend this -relaxation is to add the following additional constraints, one corresponding to each pair of colors:
Unfortunately, this may not lead to a -approximate coloring (cf. Equation (5)) for any non-trivial as the following example shows. Suppose our instance of an unweighted Fmc() has four sets , , and with the elements and having color and all other elements having color . Clearly the solution to this instance consists of the sets and with . On the other hand, the fractional solution and all remaining variables being zero is also an optimal solution of the -relaxation, but any rounding approach that does not change the values of zero-valued variables in the fractional solution must necessarily result in an integral solutions in which . The example is easily generalized for arbitrary .
9.2 Alg-large-opt#: strengthening -relaxation via additional inequalities
One problem that the -relaxation in Fig. 2 faces when applied to Fmc is the following. We would like each element-indicator variable to satisfy in an optimum solution of the , but this may not be true as shown in the simple example in the previous section. Past researchers have corrected this kind of situation by introducing extra valid inequalities that hold for any solution to the problem but restrict the feasible region of the . For example, Carnes and Shmoys in CD08 and Carr et al. in CFLP00 introduced a set of additional inequalities, which they called the KC (Knapsack Cover) inequalities, to strengthen the integrality gaps of certain types of capacitated covering problems. Following their ideas, we add the extra “covering inequalities” which are satisfied by any integral solution of the :
In addition, we adjust our -relaxation in the following manner. Since is an integer from , we can “guess” the correct value of by running the algorithm for each of the possible value of , consider those solutions that maximized its objective function and select that one among these solutions that has the largest value of . Thus, we may assume that our -relaxation knows the value of exactly, and we add the following extra equality:
The resulting -relaxation is shown in its completeness in Fig. 3 for convenience.
maximize | ||
---|---|---|
subject to | for | |
for , , and | ||
for , | ||
for | ||
for |
9.3 A general technique for obtaining joint high-probability statement
Suppose that our randomized -relaxation based algorithm guarantees that for some constant independently for all . Then
To boost the success probability, we repeat the randomized rounding times, compute the quantity in each iteration, and output the solution in that iteration that resulted in the minimum value of . It then follows that for the selected solution satisfies the strong randomized -approximate coloring constraints since .
9.4 Alg-large-opt#: further details and relevant analysis
For our randomized rounding approach, we recall the following result from S01 .
Fact 1
S01 Given numbers such that is an integer, there exists a polynomial-time algorithm that generates a sequence of integers such that with probability , for all , and for any real numbers the sum satisfies standard Chernoff bounds.
We round to using the algorithm mentioned in Fact 1; this ensures resulting in selection of exactly sets. This proves the claim in (b). We round to in the following way: for , if for some then set .
Proof of (b)
Our proof of (c) is similar to that for the maximum -set coverage and is included for the sake of completeness. Note that if and only if for every set containing and thus:
(6) |
where we have used inequality (3). If then obviously (6) implies . Otherwise, and then by (4) we get
(7) |
This implies our bound since
Proof of (c)
Note that inequalities (1) and (2) imply for all . In particular, the following implication holds:
(8) |
We estimate an upper bound on in terms of in the following manner:
- Case 1: such that and .
-
Thus, , and .
- Case 2: for every index satisfying .
-
Note that , and setting in inequality (8) we get for all . Now, standard calculations show the following:
Combining all the cases and using (7), it follows that . Recall that for every . Since , we get the following bounds for all :
(9) |
which gives the bound for all .
Proofs of (d)(i) via Doob martingales
Note that the random variables may not be pairwise independent since two distinct elements belonging to the same set are correlated, and consequently the random variables also may not be pairwise independent. Indeed in the worst case an element-selection variable may be correlated to other element-selection variables, thereby ruling out straightforward use of Chernoff-type tail bounds.
For sufficient large , this situation can be somewhat remedied by using Doob martinagales and Azuma’s inequality by finding a suitable ordering of the element-selection variables conditional on the rounding of the set-selection variables. We assume that the reader is familiar with basic definitions and results for the theory of martingales (e.g., see (MR95, , Section )). Fix an arbitrary ordered sequence of the set-indicator variables. Call an element-indicator variable “settled” at the step if and only if and . The elementary event in our underlying sample space are all possible assignments of - values to the variables . For each , let be the subset of element-selection variables whose values are settled at the step, let be an arbitrary ordering of the variables in , and let us relabel the element-indicator variable names so that be the ordering of all element-selection variables given by the ordering . For each and each , let denote the event that for . Let be the union of set of all element-indicator variables that are settled at the step over all , and suppose that the event induces the following assignment of values to the element-indicator variables: for some . Define the block induced by the event as . Letting be the -field generated by the partition of into the blocks for each , it follows that form a filter for the -field . Suppose that contains elements of color , let be the ordered sequence of the element-selection variables for elements of color determined by the subsequence of these variables in the ordering , and suppose that was settled at the step. Let , and define the Doob martingale sequence where , and for all . Since , and for for all , by Azuma’s inequality (for any ),
Recall from (9) that , and the inequality is therefore satisfied provided . Setting and remembering that , we get the following bound for any :
and therefore . Thus, it follows that
This implies our claim in (d)(i) using the technique in Section 9.3.
9.5 Alg-medium-opt#: details and proofs of relevant claims in (a)–(c) and (d)(ii)
Alg-medium-opt#: idea behind the modified -relaxation and approach
A limitation of Alg-large-opt# is that we could not use Fact 1 of Srinivasan to the fullest extent. Although Fact 1 guaranteed that the set-indicator variables are negatively correlated and hence Chernoff-type tail bounds can be applied to them due to the result by Panconesi and Srinivasan PS97 , our coloring constraints are primarily indicated by element-indicator variables which depend implicitly on the set-indicator variables. In fact, it is not difficult to see that the element-indicator variables are not negatively correlated in the sense of PS97 ; S01 555A set of binary random variables are called negatively correlated in PS97 ; S01 if the following holds: even if the set-indicator variables are negatively correlated.
Our idea is to remedy the situation by expressing the coloring constraints also by set-indicator variables and use the element-indicator variables to implicitly control the set-indicator variables in these coloring constraints. This will also necessitate using additional variables.
A modification of the -relaxation in Fig. 3
To begin, we quantify the number of elements of different colors in a set using the following notation: for , let be the number of elements in of color . Note that . Fix an optimal integral solution of Fmc() covering elements and a color value , and consider the following two quantities: and . Note that , , and by definition of . Thus, is satisfied by a that is a rational number from the set . We will use the -relaxation in Fig. 3 with additional variables and the following additional constraints
for | |
for |
For reader’s convenience, the new -relaxation in its entirety is shown in Fig. 4.
maximize | ||
---|---|---|
subject to | for | |
for , , and | ||
for | ||
for , | ||
for | ||
for | ||
for |
Analysis of the modified -relaxation
By our assumption on ’s, the -relaxation has a feasible solution. We use the same randomized rounding procedure (using Fact 1) as in Section 9.4 for Alg-large-opt#. The proofs for parts (b)–(d) are the same as before since all prior relevant inequalities are still included. Thus, we concentrate on the proof of (e)(ii). A crucial thing to note is the following simple observation:
Consider the sum for any assignment of values . Then, the number of elements covered by the sets corresponding to those variables that are set to is between and .
Fix a color . Let , and consider the summation . Since for all , by Fact 1 we can apply standard Chernoff bounds MR95 for . Note that . Assuming , we get the following for the tail-bounds:
Remember that is the random variable denoting the number of elements of color selected by our randomized algorithm. Since we get
Note that , and therefore for any two . Let be the event defined as . Then for any two we get
(10) |
The assumption of , is satisfied provided . (10) implies our claim in (d)(ii) using the technique in Section 9.3.
9.6 Alg-small-opt#: details and proofs of relevant claims in (a)–(c) and (d)(iii)
Another modification of the -relaxation in Fig. 3
Note that for this case . Fix an optimal solution for our instance of Fmc. Let be the collection of those sets that contain at least one element of color and let indicate the number of sets from selected in an integral solution of the ; obviously . We consider two cases for depending on whether it is at most or not. We cannot know a priori whether or not. However, for our analysis it suffices if we can guess just one set belonging to correctly. We can do this by trying out all relevant possibilities exhaustively in the following manner. Let be the set of indices of all colors. For each of the subsets of , we “guess” that if and only if . Of course, we still do not know one set among these subset for each such , so we will exhaustively try out one each of the at most sets for each . For every such choice of and every such choice of a set for each , we perform the following steps:
-
Select the sets and their elements for each Set the variables corresponding to these sets and elements to in the -relaxation in Fig. 3, i.e., set and for every and . Remove any constraint that is already satisfied after the above step.
-
Add the following additional (at most ) constraints to the -relaxation:
for
Note that the total number of iterations of the basic iterations that is needed is at most , which is polynomial provided .
Analysis of the modified -relaxation
We now analyze that iteration of the -relaxation that correctly guesses the value of , the subset and the sets for each . As already mentioned elsewhere, the random variables may not be pairwise independent since two distinct elements belonging to the same set are correlated, and consequently the random variables also may not be pairwise independent. For convenience, let and denote the event ; note that . We first calculate a bound on for all as follows. If and are independent then of course , otherwise
giving the following bounds:
(11) |
For notational convenience, let be the indices of those sets in which both the elements and appear, and let . Note that , , and the random variable is independent of all satisfying . Using this observation and (11), for any we get
(12) |
(13) |
The above bounds can be used to bound the total pairwise co-variance between elements in two same or different color classes as follows. Consider two color classes and ( is allowed). Then,
(14) |
(15) |
For calculations of probabilities of events of the form “”, we first need to bound the probability of events “” for . If then since at least one set containing an element of color is always selected. Otherwise, , and if and only if for every . This gives us the following bound for :
Combining both cases, we have for all .
We now can calculate the probabilities of events of the form “” for , and as follows:
(16) |
For a real number , let . We have the following bound on for all :
Therefore, using Chebyshev’s inequality we get (for all and ):
(17) |
Using (17) in (16) with , and we get
(18) |
We now calculate a bound on using (14) and (15) as follows:
(19) |
Setting and using (19) in (18) we get . This implies our claim in (d)(iii) using the technique in Section 9.3.
9.7 Limitations of our -relaxation: “a gap of factor ” for coloring constraints
The coloring constraint bounds in Theorem 9.1(e)(i)–(ii) depend on or only. It is natural to ask as a possible first direction of improvement whether this dependence can be eliminated or improved by better analysis of our -relaxations. Proposition 1 shows that this may not be possible even for unless one uses a significantly different -relaxation for Fmc().
Proposition 1
There exists optimal non-integral solutions of Fmc with the following property: any rounding approach that does not change the values of zero-valued variables in the fractional solution must necessarily result in an integral solutions in which the color constraints differ by at least a factor of .
Proof
We will show our result for the -relaxation in Fig. 3; proofs for other modified versions of this -relaxation are similar. Consider disjoint collections of sets and elements of the following type: for , the collection consists of a set of elements with and , and the sets where for (note that each element is in exactly sets). Add to these collections the additional elements with and , and the sets for . Note that for our created instance . Consider the following two different solutions of the -relaxation:
- (1)
-
For a non-integral solution, let , let and let for , and set all other variables to zero. This results in a solution with summation of set variables being (i.e., sets are selected non-integrally), and summation of element variables being (i.e., elements are selected non-integrally). Moreover, the summation of element variables with the color value of is precisely the same as summation of element variables with the color value of since both are equal to .
- (2)
-
For an integral solution, let for . This also results in a solution in which sets are selected, the number of elements covered is and the number of elements of each color is .
The crucial things to note here is that the two above solutions are disjoint (i.e., non-zero variables in one solution are zero in the other and vice versa), and thus any rounding approach for the solution in (1) that does not change values of the zero-valued variables results in an integral solution in which the number of elements of color is times the number of elements of color .
10 A tale of fewer colors: deterministic approximation for Fmc when is “not too large”
In this section we provide polynomial-time deterministic approximations of Fmc via the iterated rounding technique for -relaxations. We assume that the reader is familiar with the basic concepts related to this approach as described, for example, in LRS11 . Our approximation qualities will depend on the parameters and and the coloring constraint bounds are interesting only if is not too large, e.g., no more than, say, poly-logarithmic in . For better understanding of the idea, we will first consider the special case Node-fmc of Fmc for which , and later on describe how to adopt the same approach for arbitrary . As per the proof of Theorem 9.1 (see Section 9.2) we may assume we know the value of exactly. A main ingredient of the iterated rounding approach is the following “rank lemma”.
Fact 2 (Rank lemma)
(LRS11, , Lemma 2.1.4) Consider any convex polytope for some and . Then the following property holds for every extreme-point for : the number of any maximal set of linearly independent tight constraints i.e., constraints satisfying for some in this solution equals the number of non-zero variables.
10.1 Approximating Node-fmc
Theorem 10.1
We can design a deterministic polynomial-time approximation algorithm Alg-iter-round for Node-fmc with the following properties:
-
(a)
The algorithm selects nodes where
-
(b)
The algorithm is a -approximation for Node-fmc, i.e., the total weight of the selected elements is at least .
-
(c)
The algorithm satisfies the -approximate coloring constraints cf. Inequality (5) as follows:
for all ,
We discuss the proof in the rest of this section. Let be the given graph, and let denote the degree of node . Assume that has no isolated nodes.
10.1.1 The case of
Since the problem can be exactly solved in polynomial time by exhaustive enumeration if is a constant, we can assume is at least a sufficiently large integer, e.g., assume that .
Initial preprocessing
To begin, we “guess” nodes, say with such that there exists an optimal solution contains these nodes with the following property: “the remaining nodes in the solution have degree at most ”. Since there are at most choices for such nodes, we can try them out in an exhaustive fashion. Thus, we only need to analyze that run of our algorithm where the our guess is correct. Once these nodes have been selected, we will use the following sets of nodes in and ”incidence-indexed” edges as input to our algorithm (note that an edge may appear as two members and in if both and are in ):
Fix an optimal solution that includes the nodes . We next make the following parameter adjustments:
-
We update an estimate for (the number of edges of color covered by the optimal solution) from its initial value of in the following manner. Let be the number of edges of color incident on at least one of the nodes in . Consider the quantity . Note that and is an integer in the set since any edge can be covered by either one or two nodes. Note that there are at most possible number of integers values that each may take. Since , we can try out all possible combinations of values over all colors in polynomial time since . Thus, we henceforth assume that we know the correct value of for each . Note that since our guess is correct.
-
Update (the number of nodes to be selected) by subtracting from it, and call the new value .
-
Update (the set of edges of color ) to be the set of edges in that are of color .
Yet another -relaxation
Let and . We will start with an initial -relaxation of Node-fmc on which will be iteratively modified by our rounding approach. Our -relaxation is the following modified version of the -relaxation in Fig. 3.
-
There is a node indicator variable for every node and an edge indicator variable for every edge ; thus we have variables in total.
-
Constraints of the form “” and “” in Fig. 3 are removed now and instead replaced by at most two constraints if and if . This is done so that we can apply the rank lemma in a meaningful way.
-
Note that the quantity for each color is the integer mentioned before.
-
To maximize the parameter ranges over which our algorithm can be applied, we replace the constraints in Fig. 3 of the form “ for , ” by the constraints for .
The entire initial -relaxation for is shown in Fig. 5 for convenience. Note that the number of constraints in lines (1)–(3) of Fig. 5 is exactly .
maximize | |||
---|---|---|---|
subject to | |||
(1) | for all and | ||
(2) | |||
(3) | for all | ||
(4) | for all | ||
(5) | for all |
Details of iterated rounding
We will use the variable to denote the iteration number of our rounding, with being the situation before any rounding has been performed, and we will use a “superscript ” for the relevant quantities to indicate their values or status after the iteration of the rounding, e.g., and is the value of after the first iteration of rounding. Our iterated rounding algorithm Alg-iter-round in high level details is shown in Fig. 6, where the following notation is used for brevity for a node :
For concise analysis of our algorithm, we will use the following notations:
-
is the sum of weights of all the edges incident to one or more nodes from the set of nodes .
-
for a subset of variable .
-
is the sum of weights of the edges whose variables are in (thus, for example, ).
-
is the optimum value of the objective function of the -relaxation during the iteration of rounding.
-
is the number of edges of color selected by Alg-iter-round up to and including the iteration of rounding.
-
is the value of in the last iteration of rounding.
; ; ; ; ; |
while do |
find an extreme-point optimal solution of objective value for the (cf. Fig. 5) |
begin cases |
Case 1: there exists a variable in the solution such that |
; |
; |
remove the variables in from , and delete or update the constraints |
and the objective function to reflect the removal of variables |
Case 2: there exists a variable in the solution such that |
; |
; ; |
remove the variables in from , and delete or update the constraints |
and the objective function to reflect the removal of variables |
subtract the value from the objective function |
Case 3: |
let be the remaining non-zero node indicator variables |
; ; ; |
; |
end cases |
end while |
return |
Lemma 2
Alg-iter-round terminates after at most iterations and selects at most nodes.
Proof
For finite termination, it suffices to show that at least one of the three cases in Alg-iter-round always applies. Consider the first iteration, say when , when neither Case 1 nor Case 2 applies. Note that this also implies that for any variable in since otherwise the variable in will be either or via the equality constraint and one of Case 1 or Case 2 will apply. Thus the total number of non-zero variables is . Since the constraints in lines (4)–(5) of Fig. 5 are not strict constraints now (i.e., not satisfied with equalities), the total number of any maximal set of strict constraints is at most the total number of constraints in lines (1)–(3) of Fig. 5, i.e., at most . By the rank lemma (Fact 2) , which implies Case 3 applies and the algorithm terminates.
We now prove the bound on the number of selected sets. The value of decreases by every time a new node is selected in Case 2 and remains unchanged in Case 1 where no node is selected. In the very last iteration involving Case 3, since has no isolated nodes the number of node indicator variables is at least the number of edge indicator variables, implying . Since , the total number of nodes selected is at most .
Lemma 3
The sum of weights of the edges selected by Alg-iter-round is at least .
Proof
Let , and . The proof of Lemma 2 shows that Case 3 of Alg-iter-round is executed only when . Thus, the details of Alg-iter-round in Fig. 6 imply the following sequence of assertions:
-
(i)
and for . Since the variables are at most , we have
Using the fact that , we can therefore unravel the recurrence to get
(20) -
(ii)
and for . Using (20) we can unravel the recurrence we get
Noting that an edge can contribute the value of twice in corresponding to the two variables and , the total weight of selected edges in our solution is at least
Our proof of Theorem 10.1 is therefore completed once we prove the following lemma.
Lemma 4
For all .
Proof
When Case 3 applies and, since the variables are at most , and consequently . Noting that an edge can contribute twice in the various ’s corresponding to the two variables and and remembering that , we get
We can get an upper bound on by getting an upper bound on in the following manner. Consider the nodes in Case 3. By choice of the nodes of degrees , respectively, the number of edges incident on is at most for all . Thus, we get , and consequently
Thus, for all we have
10.1.2 The case of arbitrary
As stated below, there are two steps in the previous algorithm that cannot be executed in polynomial time when is not a constant:
-
(1)
We cannot guess the nodes in polynomial time. Instead, we guess only one node such that there exists an optimal solution contains with the following property: “the remaining nodes in the solution have degree at most ”.
-
(2)
We cannot guess the exact value of by exhaustive enumeration and therefore we cannot use the constraints “” in line (3) of the -relaxation in Fig. 5 anymore. However, note that it still holds that is an integer in the set . Thus, instead we use the constraints
We need modifications of the bounds in the previous proof to reflect these changes as follows:
-
We make some obvious parameter value adjustments such as: , , , for all .
-
The number of constraints in lines (1)–(3) of Fig. 5 is now .
-
The condition in Case 3 of Fig. 6 is now .
-
In Lemma 2, we select at most nodes.
-
The calculations for the upper bound for in Lemma 4 change as follows. By choice of the node of degree , the number of edges incident on is at most for all . This now gives , and therefore . This gives us the following updated bound:
10.2 The general case: approximating Fmc
Theorem 10.2 (generalizing Theorem 10.1 for Fmc)
We can design a deterministic polynomial-time approximation algorithm for Fmc with the following properties:
-
(a)
The algorithm selects sets where
-
(b)
The algorithm is a -approximation for Node-fmc, i.e., the total weight of the selected elements is at least .
-
(c)
The algorithm satisfies the -approximate coloring constraints cf. Inequality (5) as follows:
for all ,
The proof of Theorem 10.2 is a suitable modified version of the proof of Theorem 10.1. We point out the important alterations that are needed.
General modifications
-
Nodes and edges now correspond to sets and elements, respectively, incidence of an edge on a node corresponds to membership of an element in a set, and degree of a node correspond to number of elements in a set.
-
There is a set indicator variable for every element . For every element , there is an element indicator variable and a constraint .
-
Now since any element in any one of the sets from can appear in at most other sets in the collection of sets . Also, is an integer in the set since any element can appear in at most sets.
-
An element appearing in sets, say sets , can now contribute the value of at most times in corresponding to the variables . Thus, we get a -approximation to the objective function.
Modifications related to case
-
An element appearing in sets, say sets , can now contribute at most times in the various ’s corresponding to the variables . This modifies the relevant inequality for as follows:
Modifications related to the arbitrary case
-
The calculations for the upper bound for in Lemma 4 change as follows.
This gives the final bound of .
11 Approximation algorithms for two special cases of Fmc
For approximating these special cases of Fmc, which are still -complete, we will be specific about the various constants and will try to provide approximation algorithms with as tight a constant as we can. For this section, let . Note that .
11.1 Segr-fmc: almost optimal deterministic approximation with “at most” sets
Note that Lemma 1 shows that finding a feasible solution is -complete even for unweighted Segr-fmc with . Further inapproximability results for Segr-fmc are stated in Remark 5.
Theorem 11.1
There exists a polynomial-time deterministic algorithm Alg-greed-plus that, given an instance of unweighted Segr-fmc () outputs a solution with the following properties:
- (a)
-
The number of selected sets is at most .
- (b)
-
The approximation ratio is at least .
- (c)
-
The coloring constraints are -approximately satisfied cf. (5), i.e.,
Remark 5
Remark 6
The “at most sets” part of the proof arises in the following steps of the algorithm. Since we cannot know exactly, we can only assume since it is possible that the algorithm for the maximum -set coverage also covers at least elements for some . Secondly, even if we have the guessed the correct value of , the algorithm for the maximum -set coverage may cover more than elements, and thus we have to “un-select” some of the selected sets to get the desired bounds (the proof shows that sometimes we may have to un-select all but one set). The following example shows that a solution that insists on selecting exactly sets may need to select sets all of which are not in our solution. Consider the following instance of unweighted Fmc(): , , , and for . Our algorithm will select the set whereas any solution that selects exactly sets must selects the sets .
Proof
We reuse the notations, terminologies and bounds shown in the proof of Theorem 9.1 as needed. Let be the partition of the universe based on the color of the elements, i.e., for . By the definition of Segr-fmc every set contains elements from exactly one such partition and thus, after renaming the sets and elements for notational convenience, we may set assume that our collection of sets is partitioned into collection of sets, where the collection (for ) contains the sets over the universe of elements such that and . For and any let Fmc be the unweighted Fmc() problem defined over the universe and the collection of sets . The following observation holds trivially.
Unweighted Segr-fmc () has a valid solution covering elements if and only if (i) for each , Fmc has a valid solution covering elements for some , and (ii) .
The above observation suggests that we can guess the value of by trying out all possible values of just like the algorithms in Theorem 9.1, and for each such value of we can solve independent Fmc instances and combine them to get a solution of the original Segr-fmc instance. Although we cannot possibly solve the Fmc problems exactly, appropriate approximate solutions of these problems do correspond to a similar approximate solution of Segr-fmc () as stated in the following observation:
Suppose that for each we have a solution of Fmc with the following properties (for some and ): (i) , and (ii) . Then, the collection of sets outputs a solution of Segr-fmc () with the following properties: (a) the number of selected sets is at most , (b) the number of elements covered is at least , and (c) for any pair , .
By the above observation, to prove our claim it suffices if we can find a solution for Fmc for any with , and . For convenience, we will omit the superscript from the set labels while dealing with Fmc . Remove from consideration any sets from that contains more than elements, and consider the standard (unweighted) maximum -set coverage problem, that ignores constraint (i) of the above observation, on these remaining collection of sets over the universe . Since we have guessed the correct value of , there is at least one valid solution and thus the following assertions hold: (I) there exists a set of sets that covers elements, and (II) . Let denote the maximum number of elements that can be covered by selecting sets from . There are the following two well-known algorithm algorithms for the maximum -set coverage problem both of which select exactly sets: the greedy algorithm covers at least elements (F98, , Proposition 5.1), where the pipage-rounding algorithm (based on the -relaxation in Fig. 2) covers at least elements AS04 . Note that we do not know the exact value of and we cannot guess by enumerating every possible values for every in polynomial time. To overcome this obstacle, we use the following steps.
-
We run both the algorithms for maximum -set coverage for until we find the first (smallest) index such that the better of the two algorithms cover at least elements.
-
Suppose that this algorithm selects the sets (after possible re-numbering of set indices) , where we have ordered the sets such that for every the number of elements covered by and not covered by any of the sets is at least as many as the number of elements covered by and not covered by any of the sets for any . Remember that . Let be the smallest index such but . We have the following cases.
-
If then we select as our solution since .
-
Otherwise and in this case we select the sets in our solution since .
-
11.2 -bal-fmc: improved deterministic approximation
Proposition 2
There exists a polynomial-time deterministic algorithm Alg-greedy that, given an instance of unweighted -bal-fmc outputs a solution with the following properties:
- (a)
-
The number of selected sets is exactly .
- (b)
-
The approximation ratio is at least .
- (c)
-
The coloring constraints are -approximately satisfied cf. (5), i.e., .
Proof
As already mentioned in the proof of Theorem 11.1 and elsewhere, there is a deterministic polynomial-time algorithm for the maximum -set coverage problem with an approximation ratio of . For the given instance of -bal-fmc , we run this algorithms (ignoring element colors) selecting sets, say . Obviously, the total weight of all the elements covered in the selected solution is at least . Let and . Note that . Since each of the sets in the solution is balanced, an upper bound for the number of elements of color in the solution is given by . Also note that by definition of we have . It thus follows that for any and we have .
12 Approximating Geom-fmc via randomized shifting
We refer the reader to textbooks such as V01 for a general overview of the randomized shifting technique (textbook V01 illustrates the technique in the context of Euclidean travelling salesperson problem).
Theorem 12.1
For any constant , we can design a randomized algorithm Alg-geom for Geom-fmc with the following properties:
-
(a)
Alg-geom runs in time.
- (b)
Remark 7
In many geometric applications, the dimension parameter is a fixed constant. For this case, Alg-geom runs in polynomial time, and moreover, under the mild assumption of for some constant , Alg-geom covers at least points, i.e., under these conditions Alg-geom behaves like a randomized polynomial-time approximation scheme.
Proof
Fix an optimal solution having unit balls , such that for all , , where . Thus, we need to show that our algorithm Alg-geom computes in time a set of unit balls such that the following assertions hold with probability (where ):
Set . Let be an axis-parallel grid such that every connected component of is an open -dimensional hypercube isometric to . In other words, is the union of infinite families of axis-parallel -dimensional hyperplanes, spaced apart by in each orthonormal direction. Let be chosen uniformly at random, and let be the random translation of by , i.e.,
Let is the set of indices of all balls that has a non-empty intersection with the randomly shifted grid , i.e.,
Let . Any point is contained in only if it is contained in some unit ball intersecting . Therefore, only if it is at distance at most from (in other words, is contained in the -neighborhood of ). The probability that any particular point is at distance at most from any family of parallel randomly shifted hyperplanes in is exactly . By the union bound over all dimensions, . Therefore, by the linearity of expectation,
Consequently, by Markov’s inequality, we get
Set . Let be the collection of unit balls in obtained as follows. For each , obtain a unit ball by translating such that its center has coordinates that are integer multiples of , i.e., it is an element of the dilated integer lattice . For every , we obtain a unit ball by picking an arbitrary ball obtained for some as described above. Essentially, the new solution is missing all the balls that intersect , and rounds every other ball so that its center is contained in some integer lattice. In this construction, each ball , with , gets translated by at most some distance . Since each for all , is -Lipschitz, it follows that
Letting , we get that for all ,
Let be the set of connected components of . We refer to the elements of as cells. For each , we enumerate the set, , of all possible subsets of at most unit balls with centers in . There are at most lattice points in , and thus there are at most such subsets of unit balls. Since , it follows that this enumeration takes time.
For each enumerated subset of unit balls, we record the vector
where . There are at most such vectors for each cell in . Via standard dynamic programming, we can inductively compute all possible sums of vectors such that we pick at most one vector from each cell, and the total sum of the first coordinate, i.e., the number of unit balls, is at most . This can be done in time. For the correct choice of vectors that corresponds to the solution , the sum of the vectors we compute is correct up to an additive factor of on each coordinate. This means that we compute a solution , with the following property:
with probability at least . Repeating the algorithm times and returning the best solution found, results in the high-probability assertion, which concludes the proof.
13 Conclusion and open problems
In this paper we formulated a natural combinatorial optimization framework for incorporating fairness issues in coverage problems and provided a set of approximation algorithms for the general version of the problem as well as its special cases. Of course, it is possible to design other optimization frameworks depending on the particular application in hand, and we encourage researchers to do that. Below we list some future research questions related to our framework:
- Eliminating the gap of factor in -relaxation:
-
As noted in Section 9.7, all of our -relaxations incur a gap of factor in the coloring constraints while rounding. It seems non-trivial to close the gap using additional linear inequalities while preserving the same approximation ratio. However, it may be possible to improve the gap using -relaxations.
- Primal-dual schema:
-
Another line of attack for the Fmc problems is via the primal-dual approach V01 . For example, can the primal-dual approach for partial coverage problem by Gandhi, Khuller and Srinivasan GKS04 be extended to Fmc? A key technical obstacle seems to center around effective interpretation of the dual of the coloring constraints. Our iterated rounding approach was able to go around this obstacle but the case when may be improvable.
- Fixed parameter tractability:
- Generalizing to non-decreasing submodular set objective functions:
-
The proofs and proof techniques in this paper do not generalize to the case when the objective function for our Fmc problems is a (more general) non-decreasing submodular set function. It would be interesting to devise new algorithmic techniques and proofs for this more general case. Approximation algorithms for such generalizations for the standard maximum -set coverage problem (see Section 4) were provided in KST09 .
References
- (1) A. Ageev and M. Sviridenko, Pipage Rounding: A New Method of Constructing Algorithms with Proven Performance Guarantee, Journal of Combinatorial Optimization, 8(3), 307-328, 2004.
- (2) J. Angwin and T. Parris, Facebook Lets Advertisers Exclude Users by Race, Propublica, October 28, 2016 (www.propublica.org/article/facebook-lets-advertisers-exclude-users-by-race).
- (3) N. Apollonio and B. Simeone, The maximum vertex coverage problem on bipartite graphs, Discrete Applied Mathematics, 165, 37-48, 2014.
- (4) S. Arora. Polynomial time approximation schemes for euclidean traveling salesman and other geometric problems, Journal of the ACM, 45(5), 753-782, 1998.
- (5) A. Asudeh, Z. Jin and H. V. Jagadish, Assessing and remedying coverage for a given dataset, annual IEEE International Conference on Data Engineering, 554-565, 2019.
- (6) P. Austrin, S. Khot and M. Safra, Inapproximability of vertex cover and independent set in bounded degree graphs, Theory of Computing, 7(1), 27-43, 2011.
- (7) P. Austrin and A. Stankovic, Global Cardinality Constraints Make Approximating Some Max-2-CSPs Harder, in Achlioptas D. and Végh L. A. (eds),, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, Leibniz International Proceedings in Informatics, 145, 24:1-24:17, 2019.
- (8) N. Bansal, Constructive Algorithms for Discrepancy Minimization, Annual IEEE Symposium on Foundations of Computer Science, 3-10, 2010.
- (9) J. Beck and T. Fiala, “Integer-making” theorems, Discrete Applied Mathematics, 3(1), 1-8, 1981.
- (10) S. Bera, D. Chakrabarty, N. Flores and M. Negahbani, Fair Algorithms for Clustering, in Advances in Neural Information Processing Systems, Wallach H., Larochelle H., Beygelzimer A., d'Alché-Buc F., Fox E. and Garnett R. (eds), 32, 4954-4965, 2019.
- (11) T. Carnes and D. Shmoys, Primal-Dual Schema for Capacitated Covering Problems, in Lodi A., Panconesi A. and Rinaldi G. (eds), Integer Programming and Combinatorial Optimization, Lecture Notes in Computer Science, 5035, Springer, 288-302, 2008.
- (12) R. D. Carr, L. K. Fleischer, V. J. Leung and C. A. Phillips, Strengthening Integrality Gaps for Capacitated Network Design and Covering Problems, Annual ACM-SIAM Symposium on Discrete Algorithms, 106-115, 2000.
- (13) B. Chazelle, The Discrepancy Method: Randomness and Complexity, Cambridge University Press, 2000.
- (14) C. Chekuri, J. Vondrák and R. Zenklusen, Dependent Randomized Rounding via Exchange Properties of Combinatorial Structures, Annual IEEE Symposium on Foundations of Computer Science, 575-584, 2010.
- (15) F. Chierichetti, R. Kumar, S. Lattanzi and S. Vassilvitskii, Fair Clustering Through Fairlets, in Advances in Neural Information Processing Systems, Guyon I., Luxburg U. V., Bengio S., Wallach H., Fergus R., Vishwanathan S. and Garnett R. (eds), 30, 5029-5037, 2017.
- (16) B. Doerr, Discrepancy in different numbers of colors, Discrete Mathematics, 250, 63-70, 2002.
- (17) B. Doerr and A. Srivastav, Approximation of multi-color discrepancy, in Hochbaum D., Jansen K., Rolim J. D. P. and Sinclair A. (eds), Randomization, Approximation and Combinatorial Optimization, Lecture Notes in Computer Science, 1671, Springer, 39-50, 1999.
- (18) U. Feige, A Threshold of for Approximating Set Cover, Journal of the ACM, 45(4), 634-652, 1998.
- (19) U. Feige and M. Langberg, Approximation algorithms for maximization problems arising in graph partitioning, Journal of Algorithms, 41, 174-211, 2001.
- (20) R. Gandhi, S. Khuller and A. Srinivasan, Approximation algorithms for partial covering problems, Journal of Algorithms, 53(1), 55-84, 2004.
- (21) M. R. Garey and D. S. Johnson, Computers and Intractability - A Guide to the Theory of NP-Completeness (W. H. Freeman & Co., San Francisco, CA, 1979).
- (22) J. Guo, R. Niedermeier and S. Wernicke, Parameterized complexity of vertex cover variants, Theory of Computing Systems, 41(3), 501-520, 2007.
- (23) A. Gupta, E. Lee and J. Li, An FPT algorithm beating 2 approximation for k-cut, Annual ACM-SIAM Symposium on Discrete Algorithms, 2821-2837, 2018.
- (24) A. Gupta, E. Lee and J. Li, Faster exact and approximate algorithms for k-cut, Annual IEEE Symposium on Foundations of Computer Science, 113-123, 2018.
- (25) C. Guse, Citi Bike neglects poor NYC neighborhoods and communities of color: report, New York Daily News, July 10, 2019 (https://bit.ly/3aEB4rz)
- (26) Q. Han, Y. Ye, H. Zhang and J. Zhang, On approximation of max-vertex-cover, European Journal of Operational Research, 143, 342-355, 2002.
- (27) M. Hardt, E. Price and N. Srebro, Equality of Opportunity in Supervised Learning, International Conference on Neural Information Processing Systems, 3323-3331, 2016.
- (28) D. S. Hochbaum, Approximating Covering and Packing Problems: Set Cover, Vertex Cover, Independent Set, and Related Problems, in Hochbaum, D. S. (ed.), Approximation Algorithms for NP-Hard Problems, 94-143, (PWS Publishing Company, 1997).
- (29) A. Hunter, Job ads show sexism still prevalent in most industries, HRD, February 22, 2019 (bit.ly/38eom11).
- (30) D. Ingold and S. Soper, Amazon Doesn’t Consider the Race of Its Customers. Should It?, Bloomberg, April 21, 2016.
- (31) K. Jain, A Factor 2 Approximation Algorithm for the Generalized Steiner Network Problem, Combinatorica, 21, 39-60, 2001
- (32) T. Jan, Redlining was banned 50 years ago. It’s still hurting minorities today, Washington Post, March 28, 2018.
- (33) A. Kulik, H. Shachnai and T. Tamir, Maximizing Submodular Set Functions Subject to Multiple Linear Constraints, Annual ACM-SIAM Symposium on Discrete Algorithms, 545-554, 2009.
- (34) M. Langberg, Approximation algorithms for maximization problems arising in graph partitioning, M. Sc. Thesis, Weizmann Institute of Science, 1998.
- (35) L. C. Lau, R. Ravi and M. Singh, Iterative Methods in Combinatorial Optimization, Cambridge University Press, 2011.
- (36) J. Lee and C. Lubienski, The impact of school closures on equity of access in Chicago, Education and Urban Society, 49(1), 53-80, 2017.
- (37) A. Levy, H. Ramadas and T. Rothvoss, Deterministic Discrepancy Minimization via the Multiplicative Weight Update Method, in Eisenbrand F., Koenemann J. (eds), Integer Programming and Combinatorial Optimization, Lecture Notes in Computer Science, 10328, 380-391, Springer, 2017.
- (38) P. Manurangsi, A Note on Max k-Vertex Cover: Faster FPT-AS, Smaller Approximate Kernel and Improved Approximation Symposium on Simplicity in Algorithms, 15:1-15:21, 2019.
- (39) D. Marx, Parameterized complexity and approximation algorithms, The Computer Journal, 51(1), 60-78, 2008.
- (40) J. Mitchell. Guillotine subdivisions approximate polygonal subdivisions: A simple polynomial-time approximation scheme for geometric tsp, k-mst, and related problems, SIAM Journal on Computing, 28(4), 1298-1309, 1999.
- (41) R. Motwani and P. Raghavan, Randomized Algorithms (Cambridge University Press, 1995).
- (42) M. Mulshine, A major flaw in Google’s algorithm allegedly tagged two black people’s faces with the word ’gorillas’, Business Insider, July 1, 2015.
- (43) N. Nisan, T. Roughgarden, E. Tardos and V. V. Vazirani, Algorithmic Game Theory, Cambridge University Press, 2007.
- (44) A. Olteanu, C. Castillo, F. Diaz and E. Kiciman, Social data: Biases, methodological pitfalls, and ethical boundaries, Frontiers in Big Data, 2, pp. 13, 2019.
- (45) A. Panconesi and A. Srinivasan, Randomized Distributed Edge Coloring via an Extension of the Chernoff–Hoeffding Bounds, SIAM Journal of Computing, 26(2), 350-368, 1997.
- (46) E. Petrank, The hardness of approximation: Gap location, Computational Complexity, 4, 133-157, 1994.
- (47) M. Simon, HP looking into claim webcams can’t see black people, CNN, December 23, 2009.
- (48) J. Spencer, Six standard deviations suffice, Transactions of American Mathematical Society, 289, 679-706, 1985.
- (49) S. Spencer, Amazon to Bring Same-Day Delivery to Roxbury After Outcry, Bloomberg, April 26, 2016.
- (50) A. Srinivasan, Distributions on level-sets with applications to approximation algorithms, Annual IEEE Symposium on Foundations of Computer Science, 588-597, 2001.
- (51) A. Srivastav, Multicolour Discrepancies, Combinatorics, Probability and Computing, 12, 365-399, 2003.
- (52) V. Vazirani, Approximation Algorithms, Springer, 2001.
- (53) A. Yan and B. Howe, Fairness in Practice: A Survey on Equity in Urban Mobility, Data Engineering Bulletin, 42(4), 49, 2019.
- (54) R. Zemel, Y. Wu, K. Swersky, T. Pitassi and C. Dwork, Learning Fair Representations, International Conference on Machine Learning, PMLR 28(3), 325-333, 2013.
- (55) B. H. Zhang, B. Lemoine and M. Mitchell, Mitigating Unwanted Biases with Adversarial Learning, AAAI/ACM Conference on AI, Ethics, and Society, 335-340, 2018.
Appendix
Proof of Lemma 1
(a) We describe the proof for ; generalization to is obvious. The reduction is from the Exact Cover by 3-sets (X3C) problem which is defined as follows. We are given an universe of elements for some that is a multiple of , and a collection of subsets of such that , every element of occurs in exactly sets and for . The goal is to decide if there exists a collection of (disjoint) sets whose union is . X3C is known to be -complete GJ79 . Given an instance of X3C as described, we create the following instance of Fmc():
-
(i)
The universe is (and thus ),
-
(ii)
for ,
-
(iii)
the sets are and a new set ,
-
(iv)
the coloring function is given by , and
-
(v)
.
Clearly, every element of occurs in no more than sets and all but the set contains exactly elements. The proof is completed once the following is shown:
() the given instance of X3C has a solution if and only if the transformed instance of Fmc() has a solution.
A proof of () is easy: since the set must appear in any valid solution of Fmc, a solution of X3C corresponds to a solution of Fmc() and vice versa.
(b) The proof is similar to that in (a) but now instead of X3C we reduce the node cover problem for cubic (i.e., -regular) graphs (VC3) which is defined as follows: given a cubic graph of nodes and edges and an integer , determine if there is a set of nodes that cover all the edges. is known to be -complete even if is planar GJ79 . For the translation to an instance of Fmc(), edges of are colored with color , we add a new connected component to that is a complete graph of nodes with every edge having color , transform this to the set-theoretic version of Fmc using the standard transformation from node cover to set cover and set ; note that and . To complete the proof, note that any feasible solution for the Fmc() instance must contain exactly one node from covering edges and therefore the solution for the edges with color must correspond to a node cover in (and vice versa).
(c) We given a different reduction from X3C. Given an instance of X3C as in (a), we create the following instance of Fmc():
-
(i)
For every set of X3C we have three elements and a set in Fmc (and thus , and ),
-
(ii)
and for (and thus ),
-
(iii)
.
The proof is completed by showing the given instance of X3C has a solution if and only if the transformed instance of Fmc() has a solution. This can be shown as follows. We include the set in the solution for Fmc if and only if the set is in the solution for X3C. For any valid solution of X3C and every the element appears in exactly one set, say , of X3C where one of the elements, say , is . Then, the solution of Fmc contains exactly one element, namely the element , of color . Conversely, given a feasible solution of Fmc with at most sets, first note that if then the total number of colors of various elements in the solution is and thus the given solution is not valid. Thus, and therefore the solution of X3C contains sets. Now, for every color the solution of Fmc contains a set, say , containing an element of color , say the element . Then and the element appears in a set in the solution of X3C. To see that remaining claims about the reduction, there is no solution of Fmc that includes at least one element of every color and that is not a solution of X3C.