
1: Indian Institute of Technology Madras, India
   Email: [email protected], [email protected], [email protected]
2: FLAME University, Pune, India
   Email: [email protected]

Generalized Capacity Planning for the Hospital-Residents Problem

(A preliminary version of this work appeared in IWOCA 2023 [15].)

Haricharan Balasundaram 1, Girija Limaye 2 (part of this work was done while the author was a Ph.D. scholar at IIT Madras), Meghana Nasre 1, Abhinav Raja 1
Abstract

The Hospital Residents setting models important problems such as school choice and the assignment of undergraduate students to degree programs, among many others. In this setting, fixed quotas associated with the programs limit the number of agents that can be assigned to them. Motivated by scenarios where all agents must be matched, we propose and study a generalized capacity planning problem that allows cost-controlled flexibility with respect to quotas.

Our setting extends the Hospital Residents setting: programs have the usual quotas as well as an associated cost, indicating the cost of matching an agent beyond the initial quota. We seek to compute a matching that matches all agents, is optimal with respect to preferences, and minimizes either a local or a global objective on the cost.

We show a sharp contrast: minimizing the local objective is polynomial-time solvable, whereas minimizing the global objective is $\mathsf{NP}$-hard. On the positive side, we present approximation algorithms for the global objective, both in the general case and in a special hard case; for the special hard case, we achieve the approximation guarantee via a linear-programming-based algorithm. We strengthen the $\mathsf{NP}$-hardness by showing a lower bound matching our algorithmic result.

Keywords:
Stable matchings, capacity augmentation, matchings under preferences

1 Introduction

The problem of computing optimal many-to-one matchings under two-sided preferences is extensively investigated in the literature [10, 18, 16, 4, 1]. This setting is commonly known as the Hospital Residents (HR) setting.

In the HR setting [10], we are given a set of agents (residents), a set of programs (hospitals), and a set of mutually acceptable agent-program pairs. Each agent and each program has a preference ordering over its mutually acceptable partners. Additionally, every program has a positive integral capacity that denotes the maximum number of agents that can be assigned to it. The goal is to compute a matching, that is, an assignment of agents to programs that is optimal with respect to preferences and capacities. This setting models several real-world problems, such as assigning students to schools [1] or to elective courses [16], medical interns to hospitals [18], and undergraduate students to university programs [4].

In this setting, stability is a well-accepted notion of optimality. An assignment between agents and programs is stable if no agent-program pair has an incentive to deviate from it [10]. It is known that all stable assignments match the same set of agents [10]. In certain applications of the HR setting, every agent must be matched: for instance, in school choice [1] every child must find a school, and when matching sailors to billets in the US Navy [21, 17], every sailor must be assigned to some billet. In the HR setting, however, the rigid upper quotas limit the number of agents that can be matched in any matching.

The problem of capacity expansion has been investigated very recently in [6, 5, 2, 7]. In the capacity expansion problem, the quotas of programs are augmented to improve the welfare of the agents. In another line of work, Gajulapalli et al. [9] study a two-round mechanism for school admissions in which the goal of the second round is to accommodate more students by suggesting quota increments to schools.

In our work, we are interested in the capacity augmentation problem of ensuring that every agent is matched in a stable matching of the resulting instance. Our setting is the HR setting except that, in addition to the (initial) quotas, every program also specifies a cost for matching additional agents to it. The capacity of a program can be augmented by paying this cost per augmented seat, so that a stable matching in the augmented instance matches every agent.

Two special cases of this setting have been studied recently by Chen and Csáji [7] and Limaye and Nasre [15]. In [7], the authors assume that each program has a unit cost per augmented seat. In [15], the authors assume that the cost per augmented seat can be an arbitrary non-negative integer but the initial capacities are zero. In both these works, the two main problems investigated are as follows: given an instance with agents, programs, preferences on both sides, capacities, and costs, augment the instance so that it admits a stable matching that matches every agent and one of the following goals is achieved.

  • the maximum augmentation cost spent at a program is minimum

  • the total augmentation cost spent across all programs is minimum

In the generalized setting considered in this paper, we allow both: arbitrary positive integral initial capacities (unlike the zero initial capacities in [15]) and arbitrary non-negative integral costs (unlike the unit costs in [7]). We are now ready to define our problems formally.

1.1 Notation and Problem Definition

We are given a bipartite graph $G=(\mathcal{A}\cup\mathcal{P},E)$, where $\mathcal{A}$ denotes the set of agents and $\mathcal{P}$ denotes the set of programs. An edge $(a,p)\in E$ indicates that $a$ and $p$ form an acceptable agent-program pair. For each vertex $v\in\mathcal{A}\cup\mathcal{P}$, we define $\mathcal{N}(v)$ to be the set of vertices adjacent to $v$ (that is, the neighborhood of $v$). Each vertex $v\in\mathcal{A}\cup\mathcal{P}$ ranks the vertices in $\mathcal{N}(v)$ in a strict order, called the preference list of $v$. If a vertex $v$ prefers $x$ over $y$, we denote this by $x\succ_{v}y$, equivalently $y\prec_{v}x$. The length of the longest preference list of an agent is denoted by $\ell_a$, and that of a program by $\ell_p$. Each program $p$ has an initial quota $q(p)$, which is a positive integer. In the literature, this instance is referred to as an HR instance (programs being the hospitals and agents being the residents). In our setting, in addition to its quota, a program $p$ has an associated non-negative integral cost $c(p)$. As long as a matching assigns at most $q(p)$ agents to $p$, no cost is incurred; for each agent matched beyond the quota of $p$, a cost of $c(p)$ is incurred.

A many-to-one matching (called simply a matching from here onwards) $M\subseteq E$ in an HR instance is an assignment of agents to programs such that each agent is matched to at most one program and each program $p$ is matched to at most $q(p)$ agents. Let $M(a)$ denote the program to which agent $a\in\mathcal{A}$ is matched; we write $M(a)=\bot$ if $a$ is unmatched in $M$. Let $M(p)$ denote the set of agents matched to program $p$. We call a program $p\in\mathcal{P}$ under-subscribed in a matching $M$ if $|M(p)|<q(p)$ and fully-subscribed if $|M(p)|=q(p)$. We assume that any agent prefers being matched (to some program) over being unmatched.

Definition 1 (Stable Matching).

A pair $(a,p)\in E\setminus M$ is a blocking pair w.r.t. the matching $M$ if $p\succ_{a}M(a)$ and $p$ is either under-subscribed in $M$ or there exists at least one agent $a^{\prime}\in M(p)$ such that $a\succ_{p}a^{\prime}$. A matching $M$ is said to be stable if there is no blocking pair w.r.t. $M$.

Given an HR instance, a stable matching is guaranteed to exist and can be computed in linear time by the celebrated Gale-Shapley algorithm [10]. A matching $M$ is $\mathcal{A}$-perfect if every agent is matched in $M$. In this work, our goal is to compute a matching that is both stable and $\mathcal{A}$-perfect. There exist simple HR instances that do not admit an $\mathcal{A}$-perfect stable matching. An HR instance may admit multiple stable matchings; however, by the Rural Hospitals Theorem [18], all stable matchings in an HR instance match the same set of agents. Thus, one can efficiently determine whether an HR instance admits an $\mathcal{A}$-perfect stable matching.
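For concreteness, the following Python sketch computes the agent-proposing Gale-Shapley matching and checks $\mathcal{A}$-perfectness. It is only an illustration of the classical algorithm under an assumed dictionary representation (preference lists in decreasing order of preference); it is not the authors' implementation.

def stable_matching(agent_pref, prog_pref, quota):
    """Agent-proposing Gale-Shapley for an HR instance.
    agent_pref[a]: programs in decreasing preference of agent a
    prog_pref[p]:  agents in decreasing preference of program p
    quota[p]:      initial quota q(p)"""
    rank = {p: {a: i for i, a in enumerate(l)} for p, l in prog_pref.items()}
    nxt = {a: 0 for a in agent_pref}          # next program a proposes to
    M = {}                                    # agent -> program
    free = [a for a in agent_pref if agent_pref[a]]
    while free:
        a = free.pop()
        if nxt[a] == len(agent_pref[a]):
            continue                          # a exhausted its list: stays unmatched
        p = agent_pref[a][nxt[a]]
        nxt[a] += 1
        M[a] = p
        assigned = [b for b, pb in M.items() if pb == p]
        if len(assigned) > quota[p]:          # reject p's least-preferred agent
            worst = max(assigned, key=lambda b: rank[p][b])
            del M[worst]
            free.append(worst)
    return M

# By the Rural Hospitals Theorem, the following A-perfectness check is
# independent of which stable matching was computed.
def is_A_perfect(M, agent_pref):
    return all(a in M for a in agent_pref)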

In this work, to achieve $\mathcal{A}$-perfectness, we consider quota augmentation for programs. Let $\mathbb{N}$ be the set of non-negative integers, and let $\tilde{q}:\mathcal{P}\rightarrow\mathbb{N}$ be a function that assigns a non-negative integer to every program $p\in\mathcal{P}$. The value $\tilde{q}(p)$ indicates the amount by which the quota of program $p$ is increased, so that the modified HR instance, in which every program $p$ has quota $q(p)+\tilde{q}(p)$, admits an $\mathcal{A}$-perfect stable matching $M$.

A trivial quota augmentation, wherein for every program $p$ we set $\tilde{q}(p)$ such that $q(p)+\tilde{q}(p)=|\mathcal{N}(p)|$, always results in an $\mathcal{A}$-perfect stable matching. To control $\tilde{q}(p)$, we use the cost function $c:\mathcal{P}\rightarrow\mathbb{N}$ over the set of programs. Given a matching $M$, the cost incurred at a program $p$ is $\tilde{q}(p)\cdot c(p)$; in other words, the initial $q(p)$ positions of a program $p$ are free.

Given an HR instance along with costs, our goal is to compute an $\mathcal{A}$-perfect stable matching that incurs the minimum cost. We consider two natural notions of minimization over the cost:

  • minimize the total cost incurred, that is, $\sum_{p\in\mathcal{P}}\tilde{q}(p)\cdot c(p)$.

  • minimize the maximum cost incurred at any program, that is, $\max_{p\in\mathcal{P}}\{\tilde{q}(p)\cdot c(p)\}$.

Based on this, we define the MinSum and the MinMax problems as follows:

MinSum problem: Given $G=(\mathcal{A}\cup\mathcal{P},E)$, preferences of agents and programs, a quota $q(p)$ for every program $p$, and a cost function $c$ (defined earlier), the MinSum problem asks for a quota augmentation function $\tilde{q}$ and an $\mathcal{A}$-perfect stable matching in the augmented instance such that the total cost of augmentation is minimized.

MinMax problem: Given $G=(\mathcal{A}\cup\mathcal{P},E)$, preferences of agents and programs, a quota $q(p)$ for every program $p$, and a cost function $c$ (defined earlier), the MinMax problem asks for a quota augmentation function $\tilde{q}$ and an $\mathcal{A}$-perfect stable matching in the augmented instance such that the maximum augmentation cost spent at any program is minimized.

Next, we define two special cases of MinSum and MinMax.

  1. Unit cost for augmentation: when every program $p$ has $c(p)=1$, we denote the problems by MinSumQ and MinMaxQ. This is equivalent to the setting investigated by Chen and Csáji [7].

  2. Zero initial quotas: when every program $p$ has $q(p)=0$, we denote the problems by MinSumC and MinMaxC. This is equivalent to the cost-controlled quotas setting investigated by Limaye and Nasre [15].

The notion of stability considered above is defined with respect to the input quotas. In a setting where initial quotas may be zero, we use the following well-studied relaxation of stability. Recall that if the matching $M$ is not stable, then there exists a blocking pair $(a,p)$ with respect to $M$. The blocking pair may arise because the program $p$ is under-subscribed, or because $M$ assigns to $p$ an agent that $p$ prefers less than $a$. If we allow the blocking pairs arising due to an under-subscribed program $p$, we obtain a relaxation of stability called envy-freeness [20].

Definition 2 (Envy-Freeness).

Given a matching $M$, an agent $a$ has justified envy (here onwards, called envy) towards another agent $a^{\prime}$ if $(a^{\prime},p)\in M$, $p\succ_{a}M(a)$ and $a\succ_{p}a^{\prime}$. The pair $(a,a^{\prime})$ is called an envy-pair w.r.t. $M$. A matching $M$ is envy-free if there is no envy-pair w.r.t. $M$.

We observe the following about envy-freeness and stability when the initial quotas of all programs are zero. Let $H$ be an instance in our setting in which the initial quotas of all programs are zero, and let $M$ be an $\mathcal{A}$-perfect matching in the augmented instance $H^{\prime}$. If $M$ is stable in $H^{\prime}$, then by definition $M$ is also envy-free in $H^{\prime}$. Conversely, suppose that $M$ is an envy-free matching in $H^{\prime}$. Let $G$ denote the HR instance whose preferences are borrowed from $H^{\prime}$ (that is, from $H$) and in which every program $p$ has quota $|M(p)|$. Then $M$ is stable in $G$. This implies that when the initial quotas of all programs are zero, envy-freeness and stability are equivalent.

Example. Consider the instance shown in Fig. 1 with three agents $a_1,a_2,a_3$ and three programs $p_1,p_2,p_3$. The tuple $(q,c)$ preceding a program indicates that the program has initial quota $q$ and cost $c$ per agent matched beyond its quota. That is, $q(p_1)=1$, $q(p_2)=1$, $q(p_3)=0$ and $c(p_1)=0$, $c(p_2)=3$, $c(p_3)=4$. Consider the two $\mathcal{A}$-perfect stable matchings $M_1=\{(a_1,p_2),(a_2,p_2),(a_3,p_2)\}$ and $M_2=\{(a_1,p_2),(a_2,p_3),(a_3,p_2)\}$. The total augmentation cost of $M_1$ and $M_2$ is $6$ and $7$ respectively, whereas the maximum augmentation cost spent at any program in $M_1$ and $M_2$ is $6$ and $4$ respectively.

It is straightforward to verify that $M_1$ is the unique optimal solution for MinSum, whereas $M_2$ is the unique optimal solution for MinMax.

$a_1: p_2 \succ p_1$
$a_2: p_3 \succ p_2$
$a_3: p_2$
$(1,0)\ p_1: a_1$
$(1,3)\ p_2: a_1 \succ a_2 \succ a_3$
$(0,4)\ p_3: a_2$
Figure 1: An illustrative example for MinSum and MinMax.
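The costs in this example can be checked with a few lines of Python (the dictionaries simply transcribe Fig. 1; $\tilde{q}(p)$ is the number of agents matched to $p$ beyond its initial quota):

q = {"p1": 1, "p2": 1, "p3": 0}
c = {"p1": 0, "p2": 3, "p3": 4}
M1 = {"a1": "p2", "a2": "p2", "a3": "p2"}
M2 = {"a1": "p2", "a2": "p3", "a3": "p2"}

def aug_costs(M):
    # q~(p) = max(0, |M(p)| - q(p)); cost at p is q~(p) * c(p)
    load = {p: 0 for p in q}
    for p in M.values():
        load[p] += 1
    per_prog = {p: max(0, load[p] - q[p]) * c[p] for p in q}
    return sum(per_prog.values()), max(per_prog.values())

print(aug_costs(M1))  # (6, 6): MinSum-optimal
print(aug_costs(M2))  # (7, 4): MinMax-optimal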

1.2 Our results

We show that the MinMax problem can be solved in polynomial time, whereas the MinSum problem is $\mathsf{NP}$-hard. Moreover, the MinSum problem is inapproximable under standard complexity-theoretic assumptions.

Theorem 1.1.

The MinMax problem can be solved in $O(m\log m)$ time, where $m=|E|$.

We say that the preferences of the elements of a particular set, say the agent set, are derived from a master list if there is a fixed ordering of the programs and the preference lists of all agents respect this ordering.

Theorem 1.2.

MinSumC cannot be approximated within a constant factor unless $\mathsf{P}=\mathsf{NP}$. The result holds even when the preferences of agents and programs are derived from master lists and there are only two distinct costs in the instance.

The above theorem immediately implies that the MinSum problem is constant-factor inapproximable. The constant-factor inapproximability of the MinSum problem is known from the result of Chen and Csáji [7]; however, their result does not imply Theorem 1.2. We further show that, under the Unique Games Conjecture (UGC) [13], MinSumC cannot be approximated within a factor of $(\ell_a-\epsilon)$ for any $\epsilon>0$.

Theorem 1.3.

MinSumC cannot be approximated within a factor of $(\ell_a-\epsilon)$ for any $\epsilon>0$ under the UGC. This holds even when the preferences of agents and programs are derived from master lists and there are only two distinct costs.

Theorem 1.3 implies that the MinSum problem is also $(\ell_a-\epsilon)$-inapproximable (for any $\epsilon>0$) under the UGC. This gives another lower bound for the MinSum problem.

We now state our algorithmic results for MinSum. We present two approximation algorithms for general instances of the MinSum problem.

Theorem 1.4.

MinSum admits a $|\mathcal{P}|$-approximation algorithm that runs in $O(m\log m)$ time.

Theorem 1.5.

MinSum admits an $\ell_p$-approximation algorithm that runs in $O(m\cdot\ell_p)$ time, where $\ell_p$ denotes the length of the longest preference list of a program.

We also present an approximation algorithm with a guarantee of $\ell_a$ when the instance has two distinct costs, thereby matching the lower bound of Theorem 1.3.

Theorem 1.6.

MinSumC admits an $\ell_a$-approximation algorithm when the instance has two distinct costs.

Our results are summarized in Table 1.

MinMax: in $\mathsf{P}$ (Theorem 1.1).
MinSum: $\mathsf{NP}$-hard and constant-factor inapproximable (follows from Theorem 1.2); $(\ell_a-\epsilon)$-inapproximable for any $\epsilon>0$ under the UGC (Theorem 1.3); $|\mathcal{P}|$-approximation (Theorem 1.4); $\ell_p$-approximation (Theorem 1.5); $\ell_a$-approximation for MinSumC with two distinct costs (Theorem 1.6).
Table 1: Summary of our results

Outline of the paper. In Section 2, we present a brief literature review. In Section 3, we present our algorithmic results for MinMax and for general instances of MinSum. In Section 4, we present an algorithm for the restricted case of the MinSumC problem with two distinct costs. Section 5 presents our hardness and inapproximability results. We conclude in Section 6.

2 Related Work and Background

Capacity planning, or capacity augmentation, has received attention in the recent past, since it is a natural and practical approach to circumvent rigid quotas in matching problems. Capacity planning with a motivation similar to ours has been studied extensively in the two-sided preference setting [9, 6, 5, 2, 7, 3]. In the two-round school choice problem studied by Gajulapalli et al. [9], the goal in round 2 is to match all agents in a particular set, derived from the matching in round 1, in an envy-free manner (called stability-preserving in their work); this can still leave certain agents unassigned. It can be shown that the MinSumC setting generalizes the matching problems in their round 2. We remark that in [9] the authors state that a variant of the MinSumC problem (Problem 3, Section 7) is $\mathsf{NP}$-hard; however, they do not investigate the problem in detail.

In very recent works [6, 5, 2], the authors consider the problem of distributing extra seats (beyond the input quotas), limited by a budget, so as to obtain the best outcome for the agents. Their setting does not involve costs, and importantly, $\mathcal{A}$-perfectness is not guaranteed. Bobbio et al. [6] show the $\mathsf{NP}$-hardness of their problem. Bobbio et al. [5] and Abe et al. [2] propose a set of approaches, including heuristics, along with empirical evaluations; in contrast, we present algorithms with theoretical guarantees. Chen and Csáji [7] investigate a variant of the capacity augmentation problem mentioned earlier and present hardness, approximation, and parameterized complexity results. They also investigate several variants of MinSumQ with objectives such as Pareto efficiency and student popularity instead of $\mathcal{A}$-perfectness, and show these variants to be hard, which implies the hardness of the corresponding objectives in the generalized MinSum setting as well. Since these variants do not require $\mathcal{A}$-perfectness, they are trivial in the cost-controlled quotas setting (as in [15]): if there is no program with cost $0$, the solution is the empty matching; if there are programs with cost $0$, a solution is obtained by matching every agent to its most-preferred such program.

Capacity augmentation with costs has also been considered in the one-sided preference setting, known as the house allocation problem. Here, every agent has a preference ordering over a subset of the programs, and programs are indifferent among their neighbours. In the one-sided preference setting, various notions of optimality, such as rank-maximality, fairness, popularity, and weak dominance, have been studied. Kavitha and Nasre [11] and Kavitha et al. [12] address the capacity augmentation problem for the notion of popularity. A popular matching is not guaranteed to exist in the one-sided preference setting; therefore, their objective was to optimally increase program quotas so as to obtain an instance that admits a popular matching. In their setting, the min-max version turns out to be $\mathsf{NP}$-hard, whereas the min-sum problem (without the $\mathcal{A}$-perfectness requirement) is polynomial-time solvable.

The cost-based quotas considered in our work also appear in the one-sided preference setting in the work of Santhini et al. [19]. There, the authors ask for stronger guarantees on the output matching than $\mathcal{A}$-perfectness, expressed in terms of the signature of a matching, which encodes requirements on the number of agents matched at each rank. They consider the problem of computing a min-cost matching with a desired signature and show that it is efficiently solvable. These results in the one-sided setting stand in contrast to the hardness and inapproximability results we show for a similar optimization problem under two-sided preferences.

Before we proceed to our results, we discuss important connections between envy-free matchings and stable matchings in an HR instance.

2.1 Envy-free Matchings to Stable Matchings

As noted earlier, envy-freeness and stability are not equivalent in the HR setting. We begin with a useful property of blocking pairs with respect to an envy-free matching.

Lemma 1.

If $M$ is an envy-free matching that is not stable, then every blocking pair involves an under-subscribed program.

Proof.

Suppose for the sake of contradiction that there exists a blocking pair $(a,p)$ such that the program $p$ is fully-subscribed, that is, $|M(p)|=q(p)$. Since $(a,p)$ is a blocking pair and $p$ is fully-subscribed, by Definition 1 there exists at least one agent $a^{\prime}\in M(p)$ such that $a\succ_{p}a^{\prime}$. This implies that $(a,a^{\prime})$ is an envy-pair, a contradiction to the fact that $M$ is an envy-free matching. ∎

Next, we show that an envy-free matching $M$ in an HR instance can be converted into a stable matching $M_s$ in the same HR instance (with the program quotas unchanged) such that the agents matched in $M$ remain matched in $M_s$. Algorithm 1 gives a simple procedure.

Algorithm 1 Convert an envy-free matching to a stable matching
1:Input: An envy-free matching $M$ in an HR instance $G$
2:Output: A stable matching $M_s$ in $G$ such that agents matched in $M$ remain matched in $M_s$
3:$M_s\leftarrow M$
4:while $\exists$ blocking pair $(a^{\prime},p)$ w.r.t. $M_s$ do
5:     let $a$ be the agent most preferred by $p$ such that $p\succ_{a}M_s(a)$
6:     $M_s\leftarrow M_s\setminus\{(a,M_s(a))\}\cup\{(a,p)\}$
7:return $M_s$
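A minimal Python sketch of Algorithm 1, under the same dictionary representation as before. By Lemma 1, every blocking pair w.r.t. an envy-free matching involves an under-subscribed program, so the sketch only scans those.

def envyfree_to_stable(M, agent_pref, prog_pref, quota):
    """Algorithm 1: resolve blocking pairs by repeatedly promoting, for a
    blocking program p, the agent that p prefers most among those that
    prefer p to their current assignment (lines 5-6)."""
    Ms = dict(M)                                   # agent -> program

    def prefers(a, p):                             # does a prefer p to Ms(a)?
        cur, prefs = Ms.get(a), agent_pref[a]
        return cur is None or prefs.index(p) < prefs.index(cur)

    while True:
        promoted = False
        for p, plist in prog_pref.items():
            if sum(1 for v in Ms.values() if v == p) >= quota[p]:
                continue                           # p fully subscribed: no block
            for a in plist:                        # decreasing preference of p
                if prefers(a, p):                  # (a, p) blocks; promote a
                    Ms[a] = p
                    promoted = True
                    break
            if promoted:
                break
        if not promoted:
            return Ms                              # no blocking pair: stable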

It is easy to see that no agent matched in $M$ becomes unmatched in $M_s$. Next, we prove that after each execution of line 6, $M_s$ remains envy-free.

Lemma 2.

The matching $M_s$ remains envy-free after every execution of line 6.

Proof.

Let $a$ be the agent and $p$ the program selected in line 5. Agent $a$ is promoted in line 6, so $a$ does not envy any new agent after this step. Since $a$ is the agent most preferred by $p$ among those forming a blocking pair with $p$, no new agent envies $a$ after the promotion either. To see this, note that every agent $a^{\prime\prime}$ with $a^{\prime\prime}\succ_{p}a$ is either already matched to $p$ or matched to some $p^{\prime}$ such that $p^{\prime}\succ_{a^{\prime\prime}}p$. Since the initial matching $M$ is envy-free and no new envy-pair is created by any execution of line 6, $M_s$ remains envy-free. ∎

Lemma 3.

Algorithm 1 terminates and outputs a stable matching $M_s$.

Proof.

The while loop runs as long as the matching $M_s$ is not stable. In each iteration, since by Lemma 2 the matching $M_s$ is envy-free and not yet stable, we are guaranteed to find a blocking pair, and by Lemma 1 this blocking pair involves an under-subscribed program. Once a blocking pair is found, the agent $a$ gets promoted. Since agent preference lists are finite, the procedure terminates within $O(m)$ iterations. When we exit the while loop, the matching admits no blocking pair, which implies that $M_s$ is stable. ∎

This establishes the following lemma.

Lemma 4.

Given an envy-free matching $M$ in an HR instance, we can obtain a stable matching $M_s$ in the same HR instance such that the agents matched in $M$ remain matched in $M_s$.

Lemma 4 implies that if $M$ is an $\mathcal{A}$-perfect envy-free matching in an HR instance, then there exists an $\mathcal{A}$-perfect stable matching $M_s$ in the same instance.

3 Algorithmic results

In this section, we present our algorithmic results for the MinMax problem (Theorem 1.1) and our approximation algorithms for general instances of the MinSum problem (Theorem 1.4 and Theorem 1.5).

3.1 Polynomial time algorithm for MinMax

In this section, we present a polynomial-time algorithm for the MinMax problem. We begin with some observations. Let $M^*$ be an optimal solution for the MinMax problem and let $t^*=\max_{p}\{c(p)\cdot\tilde{q}(p)\}$ be the cost of $M^*$. For an integer $t$, we define $G_t$ to be the HR instance in which every program $p\in\mathcal{P}$ has quota $q(p)+\left\lfloor t/c(p)\right\rfloor$ (for a program with $c(p)=0$, augmentation is free, and we may take its quota in $G_t$ to be $|\mathcal{N}(p)|$). We observe the following:

  • For any integer $t\geq t^*$, there exists an $\mathcal{A}$-perfect stable matching in $G_t$. This holds because $M^*$ is an $\mathcal{A}$-perfect stable matching in $G_{t^*}$ and, for every program $p$, the quota of $p$ in $G_t$ is at least its quota in $G_{t^*}$. Therefore, $M^*$ is an $\mathcal{A}$-perfect envy-free matching in $G_t$. Combining these facts with Lemma 4 gives the desired result.

  • For any integer $t<t^*$, there does not exist an $\mathcal{A}$-perfect stable matching in $G_t$.

In the following lemma, we upper-bound the number of distinct values that $t^*$ can take. Recall that $m=|E|$.

Lemma 5.

There are at most $m+1$ distinct values that $t^*$ can possibly take.

Proof.

For any program $p$, $0\leq\tilde{q}(p)\leq|\mathcal{N}(p)|-q(p)$. This holds because in any solution ($\mathcal{A}$-perfect stable matching), the initial quota and the quota augmentation of a program $p$ together cannot exceed $|\mathcal{N}(p)|$.

The cost $t^*$ is either $0$ or equal to $c(p)\cdot k$ for some program $p\in\mathcal{P}$ and some integer $k$ with $1\leq k\leq|\mathcal{N}(p)|-q(p)\leq|\mathcal{N}(p)|$. Thus, for a fixed program $p$ with $c(p)>0$, there are at most $|\mathcal{N}(p)|$ candidate non-zero values. Hence $t^*$ takes at most $\sum_{p\in\mathcal{P}}|\mathcal{N}(p)|=|E|=m$ distinct non-zero values; including $0$, the lemma holds. ∎

We now give a polynomial-time algorithm for the MinMax problem: construct a sorted array $\hat{c}$ containing all possible values of $t^*$ and perform a binary search on this array. For each candidate cost $t$, construct the HR instance $G_t$ and check whether $G_t$ admits an $\mathcal{A}$-perfect stable matching. If yes, search over values less than or equal to $t$; otherwise, search over values strictly greater than $t$.

By Lemma 5, the array $\hat{c}$ contains $O(m)$ values, so the binary search over these costs takes $O(\log m)$ iterations. In each iteration, a stable matching is computed and checked for $\mathcal{A}$-perfectness, which takes $O(m)$ time. Hence the algorithm runs in $O(m\log m)$ time. This proves Theorem 1.1.
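The search can be sketched as follows, reusing the illustrative stable_matching helper from Section 1.1 (again an assumption-laden sketch: every agent is assumed to have a non-empty preference list, and cost-$0$ programs are treated as having unbounded quota):

def minmax_cost(agent_pref, prog_pref, quota, cost):
    """Binary search over the O(m) candidate values of t* (Lemma 5)."""
    n = len(agent_pref)

    def a_perfect_at(t):
        # G_t: quota q(p) + floor(t / c(p)); cost-0 programs get all neighbours
        q_t = {p: (len(prog_pref[p]) if cost[p] == 0
                   else quota[p] + t // cost[p]) for p in prog_pref}
        return len(stable_matching(agent_pref, prog_pref, q_t)) == n

    # candidate values: 0 and c(p)*k for k = 1, ..., |N(p)| - q(p)
    cands = sorted({0} | {cost[p] * k
                          for p in prog_pref if cost[p] > 0
                          for k in range(1, len(prog_pref[p]) - quota[p] + 1)})
    lo, hi = 0, len(cands) - 1
    while lo < hi:
        mid = (lo + hi) // 2
        if a_perfect_at(cands[mid]):
            hi = mid          # an A-perfect stable matching exists: go lower
        else:
            lo = mid + 1
    return cands[lo]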

3.2 $|\mathcal{P}|$-approximation algorithm for MinSum

Using the algorithm for the MinMax problem presented in Section 3.1, we obtain a $|\mathcal{P}|$-approximation algorithm for the MinSum problem.

Lemma 6.

An optimal solution for the MinMax problem is a $|\mathcal{P}|$-approximation for the MinSum problem on the same instance.

Proof.

Given an instance $G$, let $M^*$ be an optimal solution for MinMax and let its cost be $t^*$. Let $y^*$ be the optimal cost of MinSum on the same instance. Clearly, $y^*\geq t^*$ (otherwise the MinSum-optimal solution would contradict the optimality of $t^*$ for MinMax).

Since $t^*$ is the maximum augmentation cost spent at any program in $M^*$, the total cost of $M^*$ is at most $|\mathcal{P}|\cdot t^*\leq|\mathcal{P}|\cdot y^*$, giving us the desired approximation. ∎

This proves Theorem 1.4.

3.3 $\ell_p$-approximation algorithm for MinSum

In this section, we present an $\ell_p$-approximation algorithm for the MinSum problem. The algorithm extends the algorithm presented in [15] for the restricted setting in which all initial quotas are zero.

Let $G$ be the instance of the MinSum problem and let $M_I$ be any stable matching in $G$; we call $M_I$ the initial stable matching. If $M_I$ is $\mathcal{A}$-perfect, we return $M_I$. Otherwise, at least one agent is unmatched in $M_I$; let $\mathcal{A}_u$ denote the set of agents unmatched in $M_I$. Since all stable matchings of an HR instance match the same set of agents [18], the set $\mathcal{A}_u$ does not depend on the choice of $M_I$. For every agent $a$, let $p_a^*$ denote the least-cost program occurring in the preference list of $a$; if there are multiple such programs, let $p_a^*$ be the most-preferred among them. From $M_I$ we construct an $\mathcal{A}$-perfect matching by matching every agent $u\in\mathcal{A}_u$ to $p_u^*$. We call the resulting intermediate matching $M_L$. We observe that $M_L$ is $\mathcal{A}$-perfect but not necessarily stable. Next, we promote agents to restore stability, as described below.

We start with the matching $M=M_L$. We consider a program $p$ and scan the agents in the reverse order of the preference list of $p$. If an agent $a$ envies an agent $a^{\prime}\in M(p)$, we promote $a$ by unmatching it and matching it to $p$ (see Fig. 2). This process is repeated for every program. The pseudo-code is given in Algorithm 2.

Figure 2: Promotion of an agent $a$ that envies an agent $a^{\prime}\in M(p)$: $a$ is unmatched from $M(a)$ and matched to $p$.
Algorithm 2 $\ell_p$-approximation for the MinSum problem
1:Input: an instance $G$ of the MinSum problem
2:Output: an $\mathcal{A}$-perfect stable matching in an augmented instance of $G$
3:let $M_I$ be a stable matching in $G$
4:if $M_I$ is $\mathcal{A}$-perfect then
5:     return $M_I$
6:let $M_L$ be the matching in which $M_L(a)=p_a^*$ for $a\in\mathcal{A}_u$ and $M_L(a)=M_I(a)$ for $a\notin\mathcal{A}_u$
7:$M\leftarrow M_L$
8:for every program $p$ do
9:     for every agent $a\in\mathcal{A}$ in reverse preference-list order of $p$ do
10:         if $\exists a^{\prime}\in M(p)$ such that $a\succ_{p}a^{\prime}$ and $p\succ_{a}M(a)$ then
11:              $M\leftarrow M\setminus\{(a,M(a))\}\cup\{(a,p)\}$
12:return $M$
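A Python sketch of Algorithm 2 under the earlier representation (again illustrative; stable_matching is the helper sketched in Section 1.1, and every agent is assumed to find all its neighbours acceptable):

def minsum_lp_approx(agent_pref, prog_pref, quota, cost):
    """Sketch of Algorithm 2, the l_p-approximation for MinSum."""
    M = stable_matching(agent_pref, prog_pref, quota)        # M_I
    rank = {p: {a: i for i, a in enumerate(l)} for p, l in prog_pref.items()}

    # Line 6: match each unmatched agent a to p_a^*, its most-preferred
    # program of minimum cost, yielding the A-perfect matching M_L.
    for a, prefs in agent_pref.items():
        if a not in M and prefs:
            M[a] = min(prefs, key=lambda p: (cost[p], prefs.index(p)))

    # Lines 8-11: promote envying agents, scanning each program's
    # preference list from least-preferred to most-preferred.
    for p, plist in prog_pref.items():
        for a in reversed(plist):
            assigned = [b for b, pb in M.items() if pb == p]
            envies = any(rank[p][a] < rank[p][b] for b in assigned)
            if envies and agent_pref[a].index(p) < agent_pref[a].index(M[a]):
                M[a] = p                                     # promote a to p
    return M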

We now prove correctness. We begin by showing that the matching $M$ computed by Algorithm 2 is an $\mathcal{A}$-perfect stable matching.

Lemma 7.

The matching $M$ computed by Algorithm 2 is an $\mathcal{A}$-perfect stable matching.

Proof.

If the algorithm returns the matching $M_I$, then it is an $\mathcal{A}$-perfect stable matching. Otherwise, we show that the matching $M$ returned at line 12 is $\mathcal{A}$-perfect and stable.

By construction (line 6), $M_L$ is $\mathcal{A}$-perfect. Thus, we start with a matching $M$ that is $\mathcal{A}$-perfect. During the execution of the loop at line 8, agents are only promoted and no agent becomes unmatched, so $M$ remains $\mathcal{A}$-perfect at termination.

Next, we show that $M$ is stable. Suppose for contradiction that there exists a blocking pair $(a,p)$ with respect to $M$. Then $p\succ_{a}M(a)$ and there exists $a^{\prime}\in M(p)$ such that $a\succ_{p}a^{\prime}$. Consider the iteration of the for loop (line 8) in which $p$ was considered. Agent $a^{\prime}$ was either already matched to $p$ before that iteration began or was assigned to $p$ during it. Since $a\succ_{p}a^{\prime}$ and agents are considered in the reverse order of the preference list of $p$, in either case $M(a^{\prime})=p$ held when $a$ was considered in line 9.

If $M(a)\succ_{a}p$ or $M(a)=p$ at that point, then, since agents are never demoted in the algorithm, $M(a)\succ_{a}p$ or $M(a)=p$ holds at the end of the algorithm. Thus we must have $M(a)\prec_{a}p$ at that point, which implies that the algorithm matched $a$ to $p$ in that iteration. Since $a$ could only be promoted during subsequent iterations, $M(a)=p$ or $M(a)\succ_{a}p$ at the end of the algorithm. This contradicts the claimed blocking pair and proves the lemma. ∎

Next, we show that $M$ is an $\ell_p$-approximation for MinSum. Let $P_{\text{empty}}$ denote the set of programs $p$ to which no agent is matched in $M_I$, and let $P_{\text{LC}}$ denote the set of programs $p$ such that $p_u^*=p$ for at least one agent $u\in\mathcal{A}_u$. Let $S^c$ denote the complement of a set $S$ with respect to $\mathcal{P}$. We define the following four sets of programs, which are clearly pairwise disjoint (see Fig. 3).

  • $P_1=P_{\text{empty}}^c\cap P_{\text{LC}}^c$

  • $P_2=P_{\text{empty}}^c\cap P_{\text{LC}}$

  • $P_3=P_{\text{empty}}\cap P_{\text{LC}}$

  • $P_4=P_{\text{empty}}\cap P_{\text{LC}}^c$

Figure 3: A schematic depicting the various types of programs. Blue edges denote edges of the initial matching $M_I$, while red edges connect an agent to its least-cost program. The graph contains the edges of the matching $M_L$.

We now prove the approximation guarantee by observing important properties of the programs in these sets.

Lemma 8.

No agent is promoted and matched to any program in $P_1\cup P_4$ during the execution of the for loop at line 8.

Proof.

Let $p\in P_1\cup P_4$. By the definition of $P_1$ and $P_4$, $p\in P_{\text{LC}}^c$, so $p\neq p_a^*$ for every agent $a\in\mathcal{A}_u$. Therefore, after the execution of line 6, $(a,p)\in M_L$ if and only if $(a,p)\in M_I$; consequently, at line 7, $(a,p)\in M$ if and only if $(a,p)\in M_I$.

For the sake of contradiction, assume that some agent is promoted and matched to $p$, and let $a$ be the first such agent. For this promotion to happen, there must exist an agent $a^{\prime}\in M(p)$ such that $a\succ_{p}a^{\prime}$ and $p\succ_{a}M(a)$ in that iteration. By the choice of $a$, we have $(a^{\prime},p)\in M_I$. Note that $M(a)=M_I(a)$ or $M(a)\succ_{a}M_I(a)$, since agents can only be promoted during the execution of the loop. By the stability of $M_I$, either $a\in\mathcal{A}_u$ and $a^{\prime}\succ_{p}a$, or $M_I(a)\succ_{a}p$. The former case contradicts $a\succ_{p}a^{\prime}$, whereas the latter contradicts $M(a)=M_I(a)$ or $M(a)\succ_{a}M_I(a)$. This proves the lemma. ∎

Lemma 9.

The matching $M$ computed by Algorithm 2 is an $\ell_p$-approximation for the MinSum problem.

Proof.

Let $c(M)$ denote the total augmentation cost of the matching $M$. During the execution of the for loop at line 8, cost is incurred only when an agent is promoted and matched to some program $p$. By Lemma 8, agents are promoted and matched only to programs in $P_2\cup P_3$.

Let $len(p)$ denote the length of the preference list of program $p$. During the execution of the for loop at line 8, at most $len(p)$ agents can be promoted and matched to a program $p\in P_2\cup P_3$. Hence $c(M)\leq\sum_{p\in P_2\cup P_3}len(p)\cdot c(p)$.

Every program in $P_2\cup P_3$ is the least-cost program of some agent in $\mathcal{A}_u$. Let $\mathsf{OPT}$ denote an optimal solution and $c(\mathsf{OPT})$ the optimal cost. Since $\mathsf{OPT}$ must match all agents in $\mathcal{A}_u$, we have $c(\mathsf{OPT})\geq\sum_{a\in\mathcal{A}_u}c(p_a^*)\geq\sum_{p\in P_2\cup P_3}c(p)$. Moreover, $len(p)\leq\ell_p$ for every $p$. Thus,

$$c(M)\leq\sum_{p\in P_2\cup P_3}len(p)\cdot c(p)\leq\ell_p\sum_{p\in P_2\cup P_3}c(p)\leq\ell_p\cdot c(\mathsf{OPT}).$$ ∎

The matchings $M_I$ and $M_L$ can be computed in $O(m)$ time using the Gale-Shapley algorithm, and the for loop at line 8 takes $O(m\cdot\ell_p)$ time. Thus, Algorithm 2 runs in $O(m\cdot\ell_p)$ time. This proves Theorem 1.5.

4 MinSumC with two distinct costs

In this section, we present a linear program (LP) for the MinSumC problem, followed by an approximation algorithm for a restricted hard case. Recall that in the MinSumC setting, an envy-free matching is itself stable; therefore, in this section, we compute an envy-free matching.

4.1 Linear Program and its dual

Fig. 4 shows the LP relaxation for the MinSumC problem. Let $H=(\mathcal{A}\cup\mathcal{P},E)$ be the underlying graph of the MinSumC instance. For each edge $(a,p)\in E$, let $x_{a,p}$ be a primal variable: $x_{a,p}$ is $1$ if $a$ is matched to $p$ and $0$ otherwise. The objective of the primal LP (Eq. 1) is to minimize the total cost of all matched edges. Eq. 2 encodes the envy-freeness constraint: if agent $a$ is matched to $p$, then every agent $a^{\prime}\succ_{p}a$ must be matched to $p$ or to a program it prefers over $p$; otherwise $a^{\prime}$ envies $a$. In the primal LP, the envy-freeness constraint is present for every triplet $(a^{\prime},p,a)$ with $a^{\prime}\succ_{p}a$; we call such a triplet a valid triplet. Eq. 3 encodes the $\mathcal{A}$-perfectness constraint.

Primal: minimize

$$\sum_{p\in\mathcal{P}} c(p)\cdot\sum_{(a,p)\in E} x_{a,p} \qquad (1)$$

subject to

$$\sum_{\substack{p^{\prime}:\ p^{\prime}=p\ \text{or}\ p^{\prime}\succ_{a^{\prime}}p}} x_{a^{\prime},p^{\prime}} \;\geq\; x_{a,p}, \qquad \forall (a^{\prime},p)\in E,\ a\prec_{p}a^{\prime} \qquad (2)$$
$$\sum_{(a,p)\in E} x_{a,p} \;=\; 1, \qquad \forall a\in\mathcal{A} \qquad (3)$$
$$x_{a,p}\;\geq\;0, \qquad \forall (a,p)\in E \qquad (4)$$

Dual: maximize

$$\sum_{a\in\mathcal{A}} y_{a} \qquad (5)$$

subject to

$$y_{a} \;+ \sum_{\substack{p^{\prime}:\ p^{\prime}=p\ \text{or}\ p^{\prime}\prec_{a}p}}\ \sum_{\substack{a^{\prime}:\ a^{\prime}\prec_{p^{\prime}}a}} z_{a,p^{\prime},a^{\prime}} \;-\sum_{\substack{a^{\prime}:\ a\prec_{p}a^{\prime}}} z_{a^{\prime},p,a} \;\leq\; c(p), \qquad \forall (a,p)\in E \qquad (6)$$
$$z_{a^{\prime},p,a}\;\geq\;0, \qquad \forall (a^{\prime},p)\in E,\ a\prec_{p}a^{\prime} \qquad (7)$$

Figure 4: The linear program and its dual for the MinSumC problem
Figure 5: Let $(a^{\prime},p,a)$ be a valid triplet and $p^{\prime}\succ_{a^{\prime}}p$. The edges shown in the figure are those whose dual constraint contains the variable $z_{a^{\prime},p,a}$ in either positive or negative form.

In the dual LP, we have two kinds of variables: the $y$ variables, one per agent, and the $z$ variables, one per valid triplet of the primal program. The dual constraint (Eq. 6) is for every edge $(a,p)\in E$. The variable $y_a$ corresponding to an agent $a$ appears in the dual constraint of every edge incident on $a$; the value $y_a$ can be interpreted as the cost paid by agent $a$ for matching $a$ to one of the programs in $\mathcal{N}(a)$. For an edge $(a,p)$ and an agent $a^{\prime}\succ_{p}a$, the dual variable $z_{a^{\prime},p,a}$ appears in negative form in exactly one constraint, namely that of the edge $(a,p)$. The same dual variable $z_{a^{\prime},p,a}$ appears in positive form in the constraint of every edge $(a^{\prime},p^{\prime})$ such that $p^{\prime}=p$ or $p^{\prime}\succ_{a^{\prime}}p$ (see Fig. 5). The value of $z_{a^{\prime},p,a}$ can be interpreted as the cost paid by agent $a$ towards matching $a^{\prime}$ to a program $p^{\prime}$ with $p^{\prime}=p$ or $p^{\prime}\succ_{a^{\prime}}p$, in order to resolve the potential envy-pair $(a^{\prime},a)$ if $a$ gets matched to $p$.
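To make the primal concrete, the following sketch builds constraints (1)-(4) with the PuLP modeling library (an assumption: PuLP and an LP solver are available; any LP library would do, and the names are illustrative):

from pulp import LpProblem, LpVariable, LpMinimize, lpSum

def minsumc_lp(agent_pref, prog_pref, cost):
    """LP relaxation (1)-(4) of MinSumC; preference lists are in
    decreasing order of preference, as in the earlier sketches."""
    prob = LpProblem("MinSumC", LpMinimize)
    x = {(a, p): LpVariable(f"x_{a}_{p}", lowBound=0)       # Eq. (4)
         for a, prefs in agent_pref.items() for p in prefs}
    # Eq. (1): total cost of matched edges
    prob += lpSum(cost[p] * x[a, p] for (a, p) in x)
    # Eq. (3): A-perfectness
    for a, prefs in agent_pref.items():
        prob += lpSum(x[a, p] for p in prefs) == 1
    # Eq. (2): envy-freeness, one constraint per valid triplet (a', p, a)
    for p, plist in prog_pref.items():
        for i, a2 in enumerate(plist):             # a2 = a'; p prefers a2 ...
            at_least = agent_pref[a2][:agent_pref[a2].index(p) + 1]
            for a in plist[i + 1:]:                # ... over a
                prob += lpSum(x[a2, p2] for p2 in at_least) >= x[a, p]
    prob.solve()
    return {e: x[e].value() for e in x}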

The following facts about the linear program and its dual will be useful.

Fact 1. Let $a$ be a fixed agent. If $y_a$ is incremented by a positive value $\Delta$, then the left-hand side (lhs) of the dual constraint of every edge $(a,p)$ increases by $\Delta$, and the dual constraint of any edge incident on an agent $a^{\prime}\neq a$ is unaffected. ∎

Fact 2. Let $(a^{\prime},p,a)$ be a fixed valid triplet. If $z_{a^{\prime},p,a}$ is incremented by a positive value $\Delta$, then the lhs of the dual constraint of every edge $(a^{\prime},p^{\prime})$ with $p^{\prime}=p$ or $p^{\prime}\succ_{a^{\prime}}p$ increases by $\Delta$, the lhs of the dual constraint of exactly one edge, namely $(a,p)$, decreases by $\Delta$, and the dual constraint of any edge incident on an agent $a^{\prime\prime}\notin\{a,a^{\prime}\}$ is unaffected. ∎

For a given dual setting, if Eq. 6 is satisfied with equality for an edge, we call the edge tight; otherwise it is slack. For an edge $(a,p)$, $slack(a,p)$ denotes its slack. When referring to a $z$ variable in which a specific agent or program does not matter, we write $\times$ in its place.

Definition 3 (Threshold agent).

Let $M$ be a matching in the instance. For every program $p$, $thresh(p)$ is the agent $a$ most preferred by $p$, if one exists, such that $p\succ_{a}M(a)$; otherwise $thresh(p)=\bot$.

The notion of a threshold agent is similar to the threshold resident defined in [14] and the barrier (vertex) defined in [9]. We remark that threshold agents depend on the matching $M$; hence, when $M$ is modified, the threshold agents of programs may change.
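Under the dictionary representation used in the earlier sketches, thresholds can be computed as follows (None plays the role of $\bot$):

def thresholds(M, agent_pref, prog_pref):
    """thresh(p): the agent most preferred by p that prefers p to its
    current assignment, or None if no such agent exists."""
    th = {}
    for p, plist in prog_pref.items():
        th[p] = None
        for a in plist:                       # decreasing preference of p
            cur = M.get(a)
            if cur is None or agent_pref[a].index(p) < agent_pref[a].index(cur):
                th[p] = a
                break
    return th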

Definition 4 (Matchable edge).

For an envy-free matching $M$ and an agent $a$ (matched or unmatched), we say that an edge $(a,p)\notin M$ is matchable if the dual constraint on $(a,p)$ is tight and $a=thresh(p)$; otherwise the edge is non-matchable.

It is straightforward to verify that, for an envy-free matching $M$, matching an agent $a$ along a matchable edge keeps the resulting matching envy-free.

4.2 An $\ell_a$-approximation algorithm for $\textsc{MinSumC}_{c_1,c_2}$

The Appendix contains a discussion of the challenges involved in designing a primal-dual algorithm for the general MinSumC problem. In this section, we show that when the MinSumC instance has only two distinct costs $c_1$ and $c_2$ with $c_1<c_2$, we can circumvent these challenges and obtain an $\ell_a$-approximation algorithm for the MinSumC problem. We recall from Theorem 1.2 that the problem remains $\mathsf{NP}$-hard even in this restricted setting.

High-level idea of the algorithm. Our LP-based algorithm begins with an initial feasible dual setting and an envy-free matching $M$, which need not be $\mathcal{A}$-perfect. As long as $M$ is not $\mathcal{A}$-perfect, we pick an unmatched agent $a$ and increase the dual variable $y_a$. We show that such an increase is always possible for an unmatched agent and that it makes all edges incident on $a$ tight. However, none of the edges incident on $a$ may be matchable (since $thresh(p)\neq a$ may hold for every $p\in\mathcal{N}(a)$). In the restricted setting of two distinct costs, we ensure that after a bounded number of updates to the $z$ variables, at least one edge incident on $a$ is matchable. Throughout, we maintain the following invariants with respect to the matching $M$:

  • $M$ is envy-free, not necessarily $\mathcal{A}$-perfect, and every matched edge is tight.

  • For every agent $a$ (matched or unmatched) and every program $p\succ_{a}M(a)$, either (i) $(a,p)$ is tight and $thresh(p)\neq a$, or (ii) $slack(a,p)=c_2-c_1$.

Recall that when the matching is modified, thresholds may change, due to which a tight non-matchable edge may become matchable. As long as such an edge exists, we match along it. This is achieved by the free-promotions routine: it checks whether there exists a matchable edge $(a,p)$; if there is no such edge, the routine terminates; otherwise, it matches $(a,p)$, re-computes the threshold agents, and repeats the search. Checking for a matchable edge and computing threshold agents takes $O(m)$ time, where $m=|E|$, so the free-promotions routine runs in $O(m^2)$ time.
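A sketch of the free-promotions routine, reusing the thresholds helper above (slack is assumed to be a dictionary mapping each edge to its current dual slack):

def free_promotions(M, slack, agent_pref, prog_pref):
    """Repeatedly match a matchable edge: a non-matching edge (a, p) that
    is tight (slack 0) and whose agent a is thresh(p)."""
    while True:
        th = thresholds(M, agent_pref, prog_pref)
        edge = next(((a, p) for p, a in th.items()
                     if a is not None and slack.get((a, p), 1) == 0), None)
        if edge is None:
            return M                 # no matchable edge remains
        a, p = edge
        M[a] = p                     # match along the matchable edge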

Description of the algorithm. Algorithm 3 gives the pseudo-code. In the Appendix, we give an illustrative example that depicts the key steps of the algorithm on a $\textsc{MinSumC}_{c_1,c_2}$ instance.

We begin with an empty matching $M$, setting all $y$ variables to $c_1$ and all $z$ variables to $0$ (line 1). Then, every agent $a$ with a cost-$c_1$ program in $\mathcal{N}(a)$ is matched to its most-preferred program of cost $c_1$ (for loop at line 2). Next, we compute the threshold agent of every program w.r.t. $M$. As long as $M$ is not $\mathcal{A}$-perfect, we pick an arbitrary unmatched agent $a$ and update the dual variables as follows.

Algorithm 3 Computing an $\ell_a$-approximation for $\textsc{MinSumC}_{c_1,c_2}$
1:let $M=\emptyset$; set all $y$ variables to $c_1$ and all $z$ variables to $0$
2:for every agent $a\in\mathcal{A}$ s.t. $\exists p\in\mathcal{N}(a)$ with $c(p)=c_1$ do
3:     let $p$ be the most-preferred program in $\mathcal{N}(a)$ with $c(p)=c_1$ and let $M=M\cup\{(a,p)\}$
4:compute $thresh(p)$ for every program $p\in\mathcal{P}$
5:while $M$ is not $\mathcal{A}$-perfect do
6:     let $a$ be an unmatched agent
7:     while $a$ is unmatched do
8:         set $y_a=y_a+c_2-c_1$
9:         if there exists a matchable edge incident on $a$ then
10:              $M=M\cup\{(a,p)\mid(a,p)$ is the most-preferred matchable edge for $a\}$
11:              perform the free-promotions routine and re-compute thresholds
12:         else
13:              $\mathcal{P}(a)=\{p\in\mathcal{N}(a)\mid p\succ_{a}M(a)$, $(a,p)$ is tight and $thresh(p)\neq a\}$
14:              while $\mathcal{P}(a)\neq\emptyset$ do
15:                  let $a^{\prime}$ be the threshold agent of some program in $\mathcal{P}(a)$
16:                  let $\mathcal{P}(a,a^{\prime})$ denote the set of programs in $\mathcal{P}(a)$ whose threshold agent is $a^{\prime}$
17:                  let $p$ be the least-preferred program of $a^{\prime}$ in $\mathcal{P}(a,a^{\prime})$
18:                  set $z_{a^{\prime},p,a}=c_2-c_1$
19:                  let $(a^{\prime},p^{\prime})$ be the most-preferred matchable edge incident on $a^{\prime}$; unmatch $a^{\prime}$ if matched and let $M=M\cup\{(a^{\prime},p^{\prime})\}$
20:                  execute the free-promotions routine, re-compute thresholds and the set $\mathcal{P}(a)$
21:return $M$
  1. For the agent $a$, we increase $y_a$ by $c_2-c_1$. We ensure that the dual setting remains feasible and that all edges incident on $a$ become tight for the dual constraint in Eq. 6. Although this step makes all edges incident on $a$ tight, they are not necessarily matchable; recall that a tight edge $(a,p)$ is matchable only if $thresh(p)=a$.

  2. If there is a program $p$ such that $(a,p)$ is matchable, then $a$ is immediately matched to the most-preferred such program $p$ (line 10), and we are done with agent $a$. Since the matching is modified, we execute the free-promotions routine.

  3. If there is no program for which $a$ is the threshold agent, we update carefully selected $z$ variables in order to either promote the threshold agent (if matched) or match the (unmatched) threshold agent, via the following steps; a sketch of the corresponding slack bookkeeping appears after this list.

    (a) We compute the set $\mathcal{P}(a)$ of programs $p\in\mathcal{N}(a)$ such that the dual constraint on the edge $(a,p)$ is tight, $thresh(p)\neq a$, and $p\succ_{a}M(a)$ (line 13). In other words, $\mathcal{P}(a)$ is the set of programs in the neighbourhood of $a$ that $a$ prefers over $M(a)$ and for which the edge $(a,p)$ is tight but not matchable.

    (b) By the definition of $\mathcal{P}(a)$, every $p_j\in\mathcal{P}(a)$ has a threshold agent $thresh(p_j)=a^{\prime}\neq a$. We pick an arbitrary agent $a^{\prime}$ that is the threshold of some program in $\mathcal{P}(a)$ (line 15). The agent $a^{\prime}$ can be the threshold agent of more than one program in $\mathcal{P}(a)$; we let $\mathcal{P}(a,a^{\prime})$ denote the set of programs in $\mathcal{P}(a)$ whose threshold is $a^{\prime}$, and let $p$ be the least-preferred program of $a^{\prime}$ in $\mathcal{P}(a,a^{\prime})$ (line 17).

    (c) Our goal is to match $a^{\prime}$ to a program $p^{\prime}$ such that $p^{\prime}=p$ or $p^{\prime}\succ_{a^{\prime}}p$. By the choice of $a$, $a^{\prime}$ and $p$, and from the primal LP, $(a^{\prime},p,a)$ is a valid triplet, and therefore the dual variable $z_{a^{\prime},p,a}$ exists (see Fig. 5). We set $z_{a^{\prime},p,a}$ to $c_2-c_1$ (line 18) and ensure that this update maintains dual feasibility.

    (d) Recall that the variable $z_{a^{\prime},p,a}$ appears in positive form in the dual constraint of every edge $(a^{\prime},p^{\prime})$ with $p^{\prime}=p$ or $p^{\prime}\succ_{a^{\prime}}p$. We ensure that the update makes all such edges $(a^{\prime},p^{\prime})$ tight and at least one of them matchable, and we match $a^{\prime}$ along the most-preferred matchable edge (line 19). Since $z_{a^{\prime},p,a}$ appears in negative form in the dual constraint of the edge $(a,p)$, this edge becomes slack after the update.

    (e) Since $M$ is modified, we execute the free-promotions routine. If a tight edge incident on $a$ becomes matchable, then $a$ is matched inside the free-promotions routine.

    (f) We remark that the set $\mathcal{P}(a)$ computed in line 13 depends on the matching $M$, specifically on $M(a)$ and the threshold agents w.r.t. $M$. In order to maintain a specific slack value on the edges, which is useful for maintaining dual feasibility and ensuring progress, we re-compute the set $\mathcal{P}(a)$ (line 20) and re-enter the loop at line 14 if $\mathcal{P}(a)\neq\emptyset$.
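The dual updates in steps 1 and 3(c) amount to slack bookkeeping on edges. A minimal sketch, with delta $=c_2-c_1$ and slack the edge-to-slack dictionary from the free-promotions sketch (the helper names are hypothetical):

def raise_y(a, slack, agent_pref, delta):
    """Step 1: increasing y_a by delta raises the lhs of the dual constraint
    of every edge incident on a (Fact 1), i.e., lowers its slack."""
    for p in agent_pref[a]:
        slack[(a, p)] -= delta           # by the invariant, these reach 0

def raise_z(a2, p, a, slack, agent_pref, delta):
    """Step 3(c): setting z_{a',p,a} = delta tightens every edge (a', p')
    with p' = p or p' preferred by a' over p, and slackens the single
    edge (a, p) (Fact 2). Here a2 plays the role of a'."""
    for p2 in agent_pref[a2][:agent_pref[a2].index(p) + 1]:
        slack[(a2, p2)] -= delta
    slack[(a, p)] += delta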

4.3 Proof of correctness

We start by observing the following properties.

(P1) At line 4, no agent is matched to any program with cost $c_2$, and for every agent $a$ (matched or unmatched), every program $p\succ_{a}M(a)$ has cost $c_2$.

(P2) A matched agent never gets demoted.

(P3) A tight edge incident on a matched agent remains tight.

(P4) All matched edges are tight at the end of the algorithm.

(P1) is a simple observation about the matching at line 4. Whenever a matched agent $a$ changes its partner from $M(a)$ to $p^{\prime}$, we have $thresh(p^{\prime})=a$. By the definition of the threshold agent, $p^{\prime}\succ_{a}M(a)$, which implies (P2). Note that the only edge that can become slack during the execution is the edge $(a,p)$ incident on an unmatched agent $a$ (line 18); this implies (P3). We observe that every edge is tight when it is matched. By (P3), a matched edge (being incident on a matched agent) always remains tight, implying (P4).

We now proceed to prove Theorem 1.6. In the following lemma, we prove that the matching $M$ computed by the algorithm is envy-free.

Lemma 10.

The matching $M$ is envy-free throughout the execution of the algorithm.

Proof.

The matching $M$ is trivially envy-free after line 1. Any two agents matched in line 3 are matched to programs of cost $c_1$, and by the choice made in line 3, they do not form an envy-pair. By (P1), every unmatched agent $a$ has only cost-$c_2$ programs in $\mathcal{N}(a)$; thus, no unmatched agent envies an agent matched in line 3. Hence, $M$ is envy-free before entering the loop at line 5.

Suppose $M$ is envy-free before a modification of $M$ inside the loop; we show that it remains envy-free afterwards. The matching $M$ is modified either at line 10, at line 19, or inside the free-promotions routine, and in all these places only a matchable edge $(a_i,p_j)$ is matched. Therefore, no agent $a^{\prime}\neq a_i$ envies $a_i$ after the modification. Before the modification, $a_i$ did not envy any agent $a^{\prime}\neq a_i$, and by (P2), $a_i$ (if matched) is not demoted; hence $a_i$ does not envy any $a^{\prime}\neq a_i$ after the modification either. Thus, $M$ remains envy-free. ∎

Next, we prove dual feasibility, termination, and $\mathcal{A}$-perfectness. We make the following observation about the innermost while loop (line 14).

Lemma 11.

Let $a$ be a fixed unmatched agent selected in line 6, and consider an iteration of the loop at line 7 in which the algorithm enters the else branch. Suppose that during an iteration of the loop at line 14, some $p_k\in\mathcal{N}(a)$ is selected as $p$ at line 17. Then, at the end of that iteration, $slack(a,p_k)=c_2-c_1$, and $p\neq p_k$ in all subsequent iterations of the loop. Therefore, at most $\ell_a$ distinct $z_{\times,p_k,a}$ variables are updated during one iteration of the loop at line 7.

Proof.

By the choice of $p_k$, the edge $(a,p_k)$ was tight before this iteration. By Fact 2, the update of $z_{\times,p_k,a}$ reduces the lhs of the dual constraint of the edge $(a,p_k)$ by $c_2-c_1$; thus, after this update, $slack(a,p_k)=c_2-c_1$. Therefore, when $\mathcal{P}(a)$ is re-computed at line 20, $p_k\notin\mathcal{P}(a)$. Observe also that no other dual update of $z_{\times,p_j,a}$ inside the loop at line 14, for $p_j\neq p_k$, affects the slack of the edge $(a,p_k)$. Thus, in a subsequent iteration of this loop, $p_k$ is never selected as $p$ again.

For every $p_k$ selected as $p$ in line 17, a distinct $z_{\times,p_k,a}$ variable is updated. Thus, at most $|\mathcal{P}(a)|$ distinct $z_{\times,p_k,a}$ variables are updated inside the loop at line 14 during one iteration of the loop at line 7. Since $\mathcal{P}(a)\subseteq\mathcal{N}(a)$, we have $|\mathcal{P}(a)|\leq\ell_a$, and the claim follows. ∎

Recall that if an edge $(\hat{a},\hat{p})$ is non-matchable, then either $(\hat{a},\hat{p})$ is slack or $thresh(\hat{p})\neq\hat{a}$. Our algorithm maintains a stronger invariant: for every agent $a$, either all non-matchable edges $(a,p)$ with $p\succ_{a}M(a)$ are slack, or all such edges are tight and satisfy $thresh(p)\neq a$. Moreover, we maintain a specific slack value on the slack edges. We categorize agents based on these two cases (see Fig. 6 and Fig. 7).

Definition 5 (type-1 and type-2 agents).

An agent $a$ is called a type-1 agent if for every program $p\succ_{a}M(a)$, $slack(a,p)=c_2-c_1$. An agent $a$ is called a type-2 agent if $a$ is matched and for every program $p\succ_{a}M(a)$, $slack(a,p)=0$ and $thresh(p)\neq a$.

Figure 6: Type-1 agent $\hat{a}$: $slack(\hat{a},p_j)=c_2-c_1$ for all $p_j\succ_{\hat{a}}p=M(\hat{a})$ if $\hat{a}$ is matched; otherwise, for all $p_j\in\mathcal{N}(\hat{a})$
Figure 7: Type-2 agent $\hat{a}$: $slack(\hat{a},p_j)=0$ for all $p_j\succ_{\hat{a}}p=M(\hat{a})$

We remark that type-1 agents may be either matched or unmatched, but type-2 agents are always matched. Recall that if $a'=a_j$ is unmatched then $M(a_j)=\bot$ and therefore every program $p_j\in\mathcal{N}(a_j)$ satisfies $p_j\succ_{a_j}M(a_j)=\bot$. We claim that a type-1 agent is selected as $a'$ at most once inside the loop at line 14.
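The two agent types can be read off directly from the slacks and thresholds maintained by the algorithm. The following is a minimal illustrative sketch of Definition 5 only; the containers (preference lists, slack and threshold maps) are our own assumptions and not part of the algorithm's description.

```python
# Illustrative sketch of Definition 5: classify an agent as type-1 or type-2.
# pref[a] lists a's programs in decreasing preference; matched[a] is M(a)
# or None (encoding "bot"); slack[(a, p)] and thresh[p] are assumed to be
# maintained by the algorithm; c1 < c2 are the two distinct costs.

def agent_type(a, pref, matched, slack, thresh, c1, c2):
    m = matched.get(a)
    # programs strictly preferred over M(a); all of pref[a] if a is unmatched
    better = pref[a] if m is None else pref[a][:pref[a].index(m)]
    if all(slack[(a, p)] == c2 - c1 for p in better):
        return "type-1"
    if m is not None and all(slack[(a, p)] == 0 and thresh[p] != a
                             for p in better):
        return "type-2"
    return None  # Lemma 13 shows this case never arises during the algorithm
```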

Lemma 12.

Let $a_j$ be a type-1 agent such that $a'=a_j$ is selected in an arbitrary iteration of the loop at line 14. Then, at the termination of the loop, $a_j$ is a type-2 agent, and in subsequent iterations of the loop, $a'\neq a_j$.

Proof.

Since $a_j$ is a type-1 agent, for every program $p_j\succ_{a_j}M(a_j)$, $slack(a_j,p_j)=c_2-c_1$. Suppose $p=p_k$ is selected in line 17. Then by Fact 2, for every $p_t$ such that $p_t=p_k$ or $p_t\succ_{a_j}p_k$, the dual update in line 18 makes all the edges $(a_j,p_t)$ tight. Also, since $thresh(p_k)=a_j$, at least one of these newly tight edges (specifically, $(a_j,p_k)$) becomes matchable. Therefore, $M(a_j)$ is modified inside the iteration (line 19), implying that $a_j$ is either matched or promoted. The choice of $M(a_j)$, that is, $p'$ in line 19, is such that for every $p_j\succ_{a_j}M(a_j)=p'$, the edge $(a_j,p_j)$ is tight and $thresh(p_j)\neq a_j$. Thus, when the iteration ends, $a_j$ is a type-2 agent.

By (P3), the tight edges incident on $a_j$ remain tight throughout the algorithm. In subsequent iterations, agent $a_j$ may further get promoted by the free-promotions routine such that for every $p_j\succ_{a_j}M(a_j)$, $slack(a_j,p_j)=0$ and $thresh(p_j)\neq a_j$. Therefore, $a_j$ remains a type-2 agent in all subsequent iterations of the loop. This implies that $a_j$ is not the threshold for any program $p_j\succ_{a_j}M(a_j)$, in particular for any program $p_j\in\mathcal{N}(a)$ for the chosen $a$. Thus, during subsequent iterations of the loop, $a'\neq a_j$. ∎

In Lemma 13, we establish that at a specific step of the algorithm, every agent is either type-1 or type-2. This property is crucial in showing dual feasibility and termination.

Lemma 13.

Before every iteration of the loop starting at line 7, every agent $\hat{a}$ is either a type-1 agent or a type-2 agent.

Proof.

We prove this by induction. Before the first iteration of the loop at line 7, suppose agent $\hat{a}$ is matched. Then (P1) and the initial dual setting together imply that for every program $p_j\succ_{\hat{a}}M(\hat{a})$, $slack(\hat{a},p_j)=c_2-c_1$. Therefore $\hat{a}$ is a matched type-1 agent. Suppose $\hat{a}$ is unmatched. Then, by (P1), every program $p_j\in\mathcal{N}(\hat{a})$ has $c(p_j)=c_2$, so the initial dual setting implies that $slack(\hat{a},p_j)=c_2-c_1$. This implies that $\hat{a}$ is an unmatched type-1 agent.

Consider an arbitrary agent $\hat{a}$. Suppose that $\hat{a}$ is either type-1 or type-2 before the $l$-th iteration of the loop. It is clear that the agent $a$ selected in line 6 is different from the agent $a'$ selected at line 15. During the $l$-th iteration, either $a=\hat{a}$ in line 6, or $a'=\hat{a}$ in line 15, or $\hat{a}$ is promoted inside the free-promotions routine. We show that in each of these cases, $\hat{a}$ is either type-1 or type-2 before the $(l+1)$-th iteration begins.

  1. (i) $a=\hat{a}$ in line 6: This implies that $\hat{a}$ is unmatched. By the induction hypothesis, $\hat{a}$ is a type-1 agent; therefore for every $p_j\in\mathcal{N}(\hat{a})$, $slack(\hat{a},p_j)=c_2-c_1$. Then, the update in line 8 makes all edges incident on $\hat{a}$ tight. We consider the following two cases – $\hat{a}$ remains unmatched during the $l$-th iteration, or $\hat{a}$ gets matched.

    • $\hat{a}$ remains unmatched during the $l$-th iteration: Then the while loop at line 14 must have been executed. During an iteration of the loop at line 14, if $p=p_j$ then the slack of the edge $(\hat{a},p_j)$ becomes $c_2-c_1$ after the dual update in line 18 (by Fact 2). We show that for every $p_j\in\mathcal{N}(\hat{a})$, there is some iteration of the loop at line 14 in which $p=p_j$ is selected, thereby implying that when the loop terminates, the slack of every edge $(\hat{a},p_j)$ is $c_2-c_1$. Once this is shown, it is clear that before the $(l+1)$-th iteration, $\hat{a}$ is a type-1 agent.

      Suppose for contradiction that for some program $p_j$, $p=p_j$ is never selected. Since the edge $(\hat{a},p_j)$ was tight before the loop execution began, it must be the case that either $p_j\prec_{\hat{a}}M(\hat{a})$ or $thresh(p_j)=\hat{a}$. The first case implies that $M(\hat{a})\neq\bot$, contradicting that $\hat{a}$ remains unmatched during the $l$-th iteration. In the second case, since $thresh(p_j)=\hat{a}$, the edge $(\hat{a},p_j)$ was matchable inside the free-promotions routine, thus $\hat{a}$ must have been matched inside the free-promotions routine, again a contradiction. Thus, for every $p_j\in\mathcal{N}(\hat{a})$, there is some iteration of the loop during which $p=p_j$. This implies that when the loop at line 14 terminates, for every $p_j\in\mathcal{N}(\hat{a})$, $slack(\hat{a},p_j)=c_2-c_1$.

    • $\hat{a}$ gets matched during the $l$-th iteration: Recall that all edges incident on $\hat{a}$ are tight after the dual update in line 8. If $\hat{a}$ is matched at line 10 then the $l$-th iteration immediately terminates. Thus, before the $(l+1)$-th iteration, for every $p_j\succ_{\hat{a}}M(\hat{a})$, $slack(\hat{a},p_j)=0$ and, by the choice made in line 10, $thresh(p_j)\neq\hat{a}$, implying that $\hat{a}$ is a type-2 agent.

      If $\hat{a}$ is matched inside the loop at line 14 then the free-promotions routine must have matched it. Consider the last iteration of the loop at line 14 during which the free-promotions routine matched or promoted $\hat{a}$, and let $M(\hat{a})=p_t$. We will show that for every $p_j\succ_{\hat{a}}p_t$, $slack(\hat{a},p_j)=c_2-c_1$, thereby implying that $\hat{a}$ is a matched type-1 agent before the $(l+1)$-th iteration begins.

      By Lemma 11, it is enough to show that for every $p_j\succ_{\hat{a}}p_t$, $p=p_j$ is chosen in some iteration of the loop at line 14. Suppose not. Then, there exists some $p_j$ such that $(\hat{a},p_j)$ is tight after the loop at line 14 terminates. By the choice of $p_t$ inside the free-promotions routine, $(\hat{a},p_j)$ was non-matchable, implying that $thresh(p_j)\neq\hat{a}$. Hence during the last iteration of the loop, when $\mathcal{P}(\hat{a})$ was re-computed in line 20, $p_j\in\mathcal{P}(\hat{a})$, that is, $\mathcal{P}(\hat{a})\neq\emptyset$. This contradicts that the loop terminated after this iteration. Therefore, for every $p_j\succ_{\hat{a}}p_t$, $p_j$ was selected in some iteration of the loop at line 14, thereby implying that before the $(l+1)$-th iteration of the loop at line 7, $\hat{a}$ is a matched type-1 agent.

  2. (ii) $a'=\hat{a}$ at line 15: Consider the first iteration of the loop at line 14 when this happens. Note that the dual update in line 8 does not affect the slack on edges incident on $\hat{a}$. Since $\hat{a}$ is a threshold for some program $p_j\succ_{\hat{a}}M(\hat{a})$, by the induction hypothesis, $\hat{a}$ is a type-1 agent. Therefore, for every $p_j\succ_{\hat{a}}M(\hat{a})$, $slack(\hat{a},p_j)=c_2-c_1$ before this iteration of the loop at line 14. By Lemma 12, $\hat{a}$ is a type-2 agent when the loop terminates. Therefore, when the $(l+1)$-th iteration of the loop at line 7 begins, $\hat{a}$ is a type-2 agent.

  3. (iii) $a\neq\hat{a}$ and $a'\neq\hat{a}$ but $\hat{a}$ is promoted inside the free-promotions routine: First note that none of the dual updates in the $l$-th iteration affects any edge incident on $\hat{a}$. Thus, if $\hat{a}$ is promoted inside the free-promotions routine, then by the induction hypothesis $\hat{a}$ must be a type-2 agent. Thus, for every $p_j\succ_{\hat{a}}M(\hat{a})$, $slack(\hat{a},p_j)=0$ and $thresh(p_j)\neq\hat{a}$, and some update in the matching must have made one of these edges matchable, that is, for some tight edge $(\hat{a},p_j)$, $thresh(p_j)=\hat{a}$. Consider the last iteration of the loop at line 14 when the free-promotions routine promoted $\hat{a}$. Then, by the choice of $M(\hat{a})$ inside the routine, for every program $p_j\succ_{\hat{a}}M(\hat{a})$, the edge $(\hat{a},p_j)$ is non-matchable. This implies that for every such $p_j$, $thresh(p_j)\neq\hat{a}$. Thus, $\hat{a}$ remains a type-2 agent when the $(l+1)$-th iteration begins.

This completes the proof of the lemma. ∎

Next, we show that the dual setting is feasible.

Lemma 14.

The dual setting is feasible throughout the algorithm.

Proof.

It is clear that the dual setting is feasible before entering the loop at line 7 for the first time. We show that if the dual setting is feasible before an arbitrary dual update (either line 8 or line 18) then it remains feasible after the update.

  • Update at line 8: Since $a$ is unmatched, by Lemma 13, $a$ is a type-1 agent and therefore the slack on every edge incident on $a$ is $c_2-c_1$. By Fact 1, this update increases the lhs of the dual constraint of every edge incident on $a$ by $c_2-c_1$, and the iteration of the loop at line 7 terminates. Therefore the dual setting remains feasible.

  • Update at line 18: We note that the update in line 18 increases the lhs of the dual constraints of a subset of edges incident on agent $a'$ (by Fact 2). Therefore, we show that for an arbitrary agent $a_j$ selected as $a'$, the dual setting on the affected edges remains feasible after the update.

    Consider the first iteration of the loop at line 14 wherein an arbitrary $a_j$ is selected as $a'$ in line 15. Since $a\neq a'=a_j$, the type of $a_j$ before the execution of the loop at line 14 began is the same as its type before entering the loop at line 7. Suppose $a_j$ is a type-2 agent. Then the fact that $a_j$ is the threshold at some program in $\mathcal{P}(a)$ contradicts that for every program $p_j\succ_{a_j}M(a_j)$, $thresh(p_j)\neq a_j$. Therefore, $a_j$ is a type-1 agent. This implies that for every $p_j\succ_{a_j}M(a_j)$, the slack of the edge $(a_j,p_j)$ is $c_2-c_1$; therefore the dual update in line 18 maintains dual feasibility. By Lemma 12, this is the only iteration of the loop at line 14 when $a'=a_j$. Therefore, when the execution of the loop at line 14 terminates (followed by immediate termination of the loop at line 7), the dual setting remains feasible.

This completes the proof of the lemma. ∎

Now, we show that the algorithm terminates in polynomial time and computes an $\mathcal{A}$-perfect matching $M$.

Lemma 15.

Algorithm 3 terminates by computing an $\mathcal{A}$-perfect matching in polynomial time.

Proof.

We first show that in every iteration of the loop in line 7, either an unmatched agent is matched or at least one agent is promoted. By Lemma 13 and Fact 1, after the dual update in line 8 all edges incident on $a$ become tight. Either $a$ gets matched in line 10 or the loop in line 14 executes at least once. Since $\mathcal{P}(a)\neq\emptyset$ every time the loop at line 14 is entered, an agent $a'$ is selected in line 15. By the choice of $a'$, Lemma 13, Fact 2 and the choice of $p$ in line 17, the dual update in line 18 ensures that at least one edge $(a',p_j)$, for $p_j\succ_{a'}M(a')$, becomes matchable, and $a'$ gets matched along that edge. By (P2), this modification does not demote $a'$ (if $a'$ was already matched). Therefore, either an unmatched agent (either $a$ in line 10 or $a'$ in line 19) is matched or at least one agent ($a'$ in line 19) is promoted during an iteration.

Thus, after $O(m)$ iterations of the loop in line 7, a fixed unmatched agent $a$ gets matched and the loop in line 7 terminates. As mentioned earlier, the free-promotions routine takes $O(m^2)$ time. Thus, the loop in line 7 terminates in $O(m^3)$ time for a fixed unmatched agent $a$, and the loop in line 5 terminates in $O(m^3|\mathcal{A}|)$ time. By the termination condition of the loop, $M$ is an $\mathcal{A}$-perfect matching. ∎

Remark on the running time. We observe that the initial setting of the dual variables takes $O(m|\mathcal{A}|)$ time because there are $O(m|\mathcal{A}|)$ valid triplets. Since the algorithm guarantees (P2), with a careful implementation of the free-promotions routine and efficient computation of the threshold agents, the running time of the algorithm can be improved.

Finally, we show that the matching $M$ computed by Algorithm 3 is an $\ell_a$-approximation.

Lemma 16.

The matching $M$ computed by Algorithm 3 is an $\ell_a$-approximation of MinSumC.

Proof.

Let ${\sf OPT}$ be an optimal matching, and let $c(M)$ and $c({\sf OPT})$ denote the costs of $M$ and ${\sf OPT}$ respectively. By LP duality, $c({\sf OPT})\geq\sum_{a\in\mathcal{A}}y_a$. By (P4), $(a,p)\in M$ implies that the edge $(a,p)$ is tight. Thus, we have

\[
c(M)=\sum_{(a,p)\in M}c(p)
=\sum_{(a,p)\in M}\Big(y_a+\sum_{\substack{p'=p\ \text{or}\\ p'\prec_a p}}\ \sum_{a'\prec_{p'}a}z_{a,p',a'}-\sum_{a\prec_p a'}z_{a',p,a}\Big)
=\sum_{a\in\mathcal{A}}y_a+\underbrace{\sum_{(a,p)\in M}\Big(\sum_{\substack{p'=p\ \text{or}\\ p'\prec_a p}}\ \sum_{a'\prec_{p'}a}z_{a,p',a'}-\sum_{a\prec_p a'}z_{a',p,a}\Big)}_{S(Z)}
\]

where the first equality is from Eq. 1, the second equality is from Eq. 6, and the third equality follows because $M$ is $\mathcal{A}$-perfect. Let $S(Z)$ denote the second summation in the above cost. Our goal is to show that $S(Z)$ is upper-bounded by $(\ell_a-1)\sum_{a\in\mathcal{A}}y_a$, thereby implying that $c(M)\leq\ell_a\cdot\sum_{a\in\mathcal{A}}y_a$.

We first note that all the $z$ variables are set to 0 initially and that they are updated only inside the loop at line 14. We charge the update in every $z$ variable to a specific unmatched agent $a$ picked at line 6 and upper-bound the total update in $z$ charged to $a$ in terms of $y_a$. Let $A'$ be the set of agents unmatched before the loop at line 5 is entered. During every iteration of the loop in line 5, an unmatched agent $a$ from $A'$ is picked and the loop in line 7 executes until $a$ is matched. Suppose that after picking $a$ in line 6, the loop in line 7 runs for $\kappa(a)$ iterations. Then $y_a$ is incremented by $c_2-c_1$ exactly $\kappa(a)$ times, and since $a$ is matched, it is not picked again at line 6. Thus, at the end of the algorithm, $y_a=c_1+\kappa(a)(c_2-c_1)$, that is, $y_a\geq\kappa(a)(c_2-c_1)$.

We first present a simpler analysis that proves an $(\ell_a+1)$-approximation. Recall that the $z$ variables are non-negative (Eq. 7). Thus, we upper-bound the total value of the $z$ variables appearing in positive form in $S(Z)$. During iterations $1$ to $\kappa(a)-1$, the algorithm must enter the else part, and in the $\kappa(a)$-th iteration, the loop may or may not enter the else part. Suppose the algorithm enters the else part. Then by Lemma 11, for a fixed $a$, each time the algorithm enters the else part, at most $\ell_a$ $z$ variables are set to $c_2-c_1$. Thus, a total update of at most $\kappa(a)\ell_a(c_2-c_1)$ in $S(Z)$ occurs during the execution of the loop in line 7 when agent $a$ is picked. We charge this cost to agent $a$; thus agent $a\in A'$ is charged at most $\ell_a y_a$. Thus,

\[
c(M)=\sum_{a\in\mathcal{A}}y_a+S(Z)
\leq\sum_{a\in\mathcal{A}\setminus A'}y_a+\sum_{a\in A'}y_a+\sum_{a\in A'}\ell_a y_a
\leq(\ell_a+1)\sum_{a\in\mathcal{A}}y_a\leq(\ell_a+1)\,c({\sf OPT})
\]

Now, we proceed to a sharper analysis that shows an $\ell_a$-approximation. Recall that if $(a',p,a)$ is a valid triplet then the variable $z_{a',p,a}$ appears in positive form in the dual constraints of possibly multiple edges incident on $a'$, and in negative form in the dual constraint of exactly one edge, namely the edge $(a,p)$. We show that there exist certain valid triplets such that the corresponding $z$ variable occurring in positive form in the dual constraint of a matched edge also appears in negative form in the dual constraint of another matched edge, thereby canceling out its contribution to $S(Z)$. Thus, it is enough to upper-bound the update in the $z$ variables that are not canceled. We prove that the total update in such $z$ variables charged to an agent $a\in A'$ can be upper-bounded by $(\ell_a-1)y_a$ instead of $\ell_a y_a$ as done earlier.

Let $a\in A'$ be an arbitrary agent. Suppose that after $a$ is selected at line 6, $a$ is matched to some program $\overline{p}$, and that $M(a)=p_k$ at the end of the algorithm. By (P2), $p_k=\overline{p}$ or $p_k\succ_a\overline{p}$. Also, during iterations $1$ to $\kappa(a)-1$, $thresh(p_k)\neq a$ and the loop in line 14 executes. It implies that in each of these iterations, there exists an agent $a_j$ such that $thresh(p_k)=a_j$ and $z_{a_j,p_k,a}$ is updated. Also, $a_j$ was matched to $p'$ such that $p'=p_k$ or $p'\succ_{a_j}p_k$. By (P2), at the end of the algorithm, $M(a_j)=p'$ or $M(a_j)\succ_{a_j}p'$. Thus, the variable $z_{a_j,p_k,a}$ appears in positive form in the dual constraint of the matched edge $(a_j,M(a_j))$. Since $(a,p_k)\in M$, the variable $z_{a_j,p_k,a}$ appears in negative form in the dual constraint of the matched edge $(a,p_k)$. Therefore, the variable $z_{a_j,p_k,a}$ cancels out in $S(Z)$. This implies that in each of the iterations $1$ to $\kappa(a)-1$, at most $\ell_a-1$ $z$ variables are set to $c_2-c_1$ that may not cancel out. We charge the update in these variables to $a$.

In the last, $\kappa(a)$-th, iteration, $a$ gets matched. If $a$ is matched at line 10 then no $z$ variable is updated during this iteration. Otherwise, $a$ is matched in one of the iterations of the loop in line 14 by the free-promotions routine. Recall that by our assumption, $a$ is matched to $\overline{p}$ in this step. By the choice of $\overline{p}$ in the free-promotions routine, the edge $(a,\overline{p})$ must have been matchable, that is, it is tight and $thresh(\overline{p})=a$. The fact that the edge $(a,\overline{p})$ was tight implies (by Fact 2) that no variable of the form $z_{\times,\overline{p},a}$ was updated so far inside the loop at line 14 during the $\kappa(a)$-th iteration. When $\mathcal{P}(a)$ is re-computed, $\overline{p}\notin\mathcal{P}(a)$ because $M(a)=\overline{p}$ at this step. Thus, in the subsequent iterations of the loop in line 14, no agent $a'$ could have selected $\overline{p}$ in line 17. This implies that no $z$ variable of the form $z_{\times,\overline{p},a}$ is updated during the rest of the execution of the loop at line 14 in the $\kappa(a)$-th iteration. Hence, during the $\kappa(a)$-th iteration, the $z$ variables that are set to $c_2-c_1$ are of the form $z_{\times,\hat{p},a}$ where $\hat{p}\neq\overline{p}$. Since $\overline{p}\in\mathcal{N}(a)$, $\hat{p}\in\mathcal{N}(a)$ and $|\mathcal{N}(a)|\leq\ell_a$, the number of such $z$ variables is at most $\ell_a-1$.

Thus, during the $\kappa(a)$ iterations for the agent $a\in A'$, a total update of at most $\kappa(a)(\ell_a-1)(c_2-c_1)$ in $S(Z)$ is charged to $a$. Recall that $y_a\geq\kappa(a)(c_2-c_1)$. Thus, agent $a\in A'$ contributes at most $(\ell_a-1)y_a$ to $S(Z)$. This gives

\[
c(M)=\sum_{a\in\mathcal{A}}y_a+S(Z)
\leq\sum_{a\in\mathcal{A}\setminus A'}y_a+\sum_{a\in A'}y_a+\sum_{a\in A'}(\ell_a-1)\,y_a
\leq\ell_a\sum_{a\in\mathcal{A}}y_a\leq\ell_a\cdot c({\sf OPT})
\]

This completes the proof of the lemma. ∎

This establishes Theorem 1.6.

5 Hardness and inapproximability

In this section, we prove hardness and inapproximability results for MinSumC. These results are extended versions of the hardness results in [15]. Recall that under the MinSumC setting, envy-freeness and stability are equivalent.

5.1 Constant factor inapproximability of MinSumC

In this section, we show that MinSumC cannot be approximated within any constant factor. This result holds even when there are only two distinct costs in the instance and the preference lists follow a master list ordering on both sides. We present a reduction from the Set cover problem.

The Set cover problem: Given a universe $\mathcal{U}$ of $n$ elements $\{e_1,e_2,\ldots,e_n\}$ and a collection $\mathcal{C}$ of $m$ sets $\{C_1,C_2,\ldots,C_m\}$ where each set $C_j\subseteq\mathcal{U}$, a set cover is a sub-collection $T\subseteq\mathcal{C}$ such that $\bigcup_{C\in T}C=\mathcal{U}$. Given an integer $k$, the goal is to decide whether there is a set cover with cardinality at most $k$.

A subset $C_j$ is said to cover an element $e_i$ if $e_i\in C_j$. A collection $T\subseteq\mathcal{C}$ is said to cover element $e_i$ if there exists $C_j\in T$ such that $C_j$ covers $e_i$.
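As a sanity check, the covering predicates can be written directly from these definitions; a minimal sketch (the set representation is our own choice):

```python
# Covering predicates, transcribed from the definitions above.
def covers(C_j: set, e_i) -> bool:
    return e_i in C_j

def is_set_cover(universe: set, T: list) -> bool:
    # T covers the universe iff the union of its sets contains every element
    return universe <= set().union(*T) if T else not universe
```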

Reduction to MinSumC: Given a set cover instance $I=(\mathcal{U},\mathcal{C},k)$, the corresponding MinSumC instance $I'$ is constructed as follows: for every element $e_i\in\mathcal{U}$, there is an element-agent $a_i$. For every subset $C_j\in\mathcal{C}$, there is a subset-program $c_j$, $n$ dummy agents $u_j^l$ and $n$ dummy programs $w_j^l$, where $1\leq l\leq n$. Therefore, the reduced instance has $n+nm$ agents and $m+mn$ programs.

Next, we define the costs of the programs and the preference lists in the reduced instance. For all subset-programs, $c(c_j)=1$, and for all dummy programs, $c(w_j^l)=0$. Note that the reduced instance has only two distinct costs. For a set $Q$, let $\langle Q\rangle$ denote the elements of $Q$ ordered in a fixed but arbitrary way. Every element-agent $a_i$ ranks all subset-programs $c_j$ such that the corresponding element $e_i$ is in the subset $C_j$. Every dummy agent $u_j^l$ corresponding to the subset $C_j$ prefers the subset-program $c_j$ over its dummy program $w_j^l$. Every subset-program $c_j$ prefers its dummy agents $u_j^l$ over the element-agents $a_i$ corresponding to the elements in the subset $C_j$. Every dummy program ranks only its dummy agent. The preference lists are shown below, where $1\leq i\leq n$ and $1\leq j\leq m$.

Preference lists of agents

$a_i : \langle\{c_j \mid e_i\in C_j\}\rangle$
$u_j^l : c_j \succ w_j^l$

Preference lists of programs

$c_j\ (0,1) : \langle\{u_j^l \mid 1\leq l\leq n\}\rangle \succ \langle\{a_i \mid e_i\in C_j\}\rangle$
$w_j^l\ (0,0) : u_j^l$

We define $k'$ as $(k+1)n$. We also assume, without loss of generality, that $k>1$.
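The construction above is mechanical, so we include a minimal sketch of it below. The data representation (tagged tuples as names, dicts of preference lists) is our own assumption; the fixed-but-arbitrary order $\langle\cdot\rangle$ is realized here by iteration order.

```python
# Sketch: build the reduced MinSumC instance from a Set Cover instance.
# universe: list of element ids; sets: dict mapping set id j -> set C_j.

def reduce_set_cover(universe, sets, k):
    n = len(universe)
    cost, agent_pref, prog_pref = {}, {}, {}
    for j, C in sets.items():
        cost[("c", j)] = 1                       # subset-program c_j
        # c_j prefers its n dummy agents over its element-agents
        prog_pref[("c", j)] = ([("u", j, l) for l in range(1, n + 1)]
                               + [("a", e) for e in universe if e in C])
        for l in range(1, n + 1):
            cost[("w", j, l)] = 0                # dummy program w_j^l
            agent_pref[("u", j, l)] = [("c", j), ("w", j, l)]
            prog_pref[("w", j, l)] = [("u", j, l)]
    for e in universe:                           # element-agent a_i
        agent_pref[("a", e)] = [("c", j) for j, C in sets.items() if e in C]
    return agent_pref, prog_pref, cost, (k + 1) * n   # last value is k'
```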

Lemma 17.

Given a set cover $T$ in $I$ such that $|T|\leq k$, we can construct an $\mathcal{A}$-perfect envy-free matching $M$ in $I'$ such that $c(M)\leq k'$.

Proof.

Let $C_{open}=\{c_j\mid C_j\in T\}$. Note that $|C_{open}|=|T|$. Construct $M$ by matching every element-agent $a_i$ to its highest-preferred program in $C_{open}$. Dummy agents corresponding to programs in $C_{open}$ are matched to their respective programs, and the remaining dummy agents are matched to their dummy programs. Thus the matching $M$ is $\mathcal{A}$-perfect.

We show that the constructed matching $M$ is envy-free. For an element-agent $a_i$, any program $p$ such that $p\succ_{a_i}M(a_i)$ has no agent matched to it (by construction). The same holds for dummy agents corresponding to programs not in $C_{open}$. The dummy agents corresponding to programs in $C_{open}$ are matched to their highest-preferred programs. Therefore, no agent envies another agent.

Next, we show that $c(M)\leq k'$. For every program $c_j$ in $C_{open}$, some element-agents and all $n$ dummy agents of $c_j$ are matched to it in $M$. The dummy agents contribute a cost of $n$ per opened program, while the element-agents together contribute a cost of $n$ over all programs. Therefore, $c(M)=|C_{open}|n+n=(|T|+1)n$. Given that $|T|\leq k$, we get $c(M)\leq(k+1)n=k'$. ∎
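To make the accounting concrete, consider a minimal (hypothetical) instance with $\mathcal{U}=\{e_1,e_2\}$ and $\mathcal{C}=\{C_1\}$ where $C_1=\{e_1,e_2\}$, so $n=2$, and take $T=\{C_1\}$. The constructed matching assigns $a_1$, $a_2$ and the dummy agents $u_1^1$, $u_1^2$ to $c_1$, each along an edge of cost $c(c_1)=1$, giving $c(M)=|C_{open}|n+n=(|T|+1)n=4$, exactly as in the formula above.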

Lemma 18.

Given an $\mathcal{A}$-perfect envy-free matching $M$ in $I'$ such that $c(M)\leq\alpha k'$ for some constant $\alpha>1$, we can construct a set cover $T$ in $I$ such that $|T|\leq 2\alpha k$.

Proof.

Given $M$, define the set of programs $C_{open}=\{c_j\mid\exists a_i\text{ such that }M(a_i)=c_j\}$ and let $T=\{C_j\mid c_j\in C_{open}\}$. We show that $T$ is a set cover. Suppose for contradiction that $T$ is not a set cover. Then there exists an element $e_i$ that is not covered by $T$. None of the potential partners of the corresponding element-agent $a_i$ is in $C_{open}$, so $a_i$ must be unmatched in $M$. This implies that $M$ is not $\mathcal{A}$-perfect, a contradiction. Thus, $T$ is a set cover.

Next, we show that the size of $T$ is at most $2\alpha k$. Note that $|T|=|C_{open}|$. Since $M$ is an $\mathcal{A}$-perfect matching, each $a_i$ is matched in $M$, thereby contributing one unit each to $c(M)$. Moreover, since $M$ is envy-free, if an agent $a_i$ is matched to program $c_j$ then the $n$ dummy agents $u_j^l$ corresponding to $c_j$ must also be matched to $c_j$ in $M$; over all programs $c_j\in C_{open}$, they together contribute a cost of $n|C_{open}|$. The dummy agents corresponding to a program $c_j\notin C_{open}$ may be matched to either $c_j$ or their dummy programs. Thus, $c(M)\geq n+n|C_{open}|=n(1+|T|)$. Given that $c(M)\leq\alpha k'$, we get $n(|T|+1)\leq\alpha(k+1)n$, and since $k>1$, this implies that $|T|\leq|T|+1\leq\alpha(k+1)\leq 2\alpha k$. ∎

Suppose, for the sake of contradiction, that there exists an $\alpha$-approximation algorithm for the MinSumC problem. For a given set cover instance, we can use the above reduction to construct a MinSumC instance. Then, using the $\alpha$-approximation algorithm together with Lemma 17 and Lemma 18, we get a $2\alpha$-approximation algorithm for the Set cover problem. However, the Set cover problem cannot be approximated within any constant factor unless $\sf P=\sf NP$ [8]. This implies that for any constant $\alpha$, MinSumC does not admit an $\alpha$-approximation algorithm.

Finally, we note that the following master list ordering over agents and over programs holds in the reduced MinSumC instance.

$u_1^1\succ u_1^2\succ\dots\succ u_1^n\succ u_2^1\succ u_2^2\succ\dots\succ u_m^{n-1}\succ u_m^n\succ a_1\succ a_2\succ\dots\succ a_n$
$c_1\succ c_2\succ\dots\succ c_m\succ w_1^1\succ w_1^2\succ\dots\succ w_1^n\succ w_2^1\succ w_2^2\succ\dots\succ w_m^{n-1}\succ w_m^n$

This establishes Theorem 1.2.

Remark: Note that in the MinSumC problem, stable matchings and envy-free matchings coincide. Thus, if we had an $\alpha$-approximation algorithm for the MinSum problem, we would have an $\alpha$-approximation algorithm for MinSumC, contradicting Theorem 1.2. Thus, the MinSum problem is also constant-factor inapproximable. As mentioned earlier, this also follows from Theorem 2 in [7].

5.2 $(\ell_a-\epsilon)$-inapproximability of MinSumC

In this section, we show that our algorithmic result for the MinSumC problem (Theorem 1.6) is tight modulo the Unique Games Conjecture [13]. Specifically, we show that for any $\epsilon>0$, MinSumC does not admit an $(\ell_a-\epsilon)$-approximation algorithm. We present a reduction from the Vertex cover problem, which is a special case of the Set cover problem defined in Section 5.1. The construction of the MinSumC instance presented below is similar to the construction presented in Section 5.1.

We first note the following: if MinSumC admits an approximation algorithm with guarantee $(\ell_a-\alpha)$ then it admits an approximation algorithm with guarantee $(\ell_a-\beta)$ for any constant $\beta<\alpha$. Therefore, it is enough to show that MinSumC does not admit an approximation algorithm with guarantee $(\ell_a-\epsilon)$ for $\epsilon\leq\frac{1}{2}$.


The Vertex cover problem: Given a graph $G=(V,E)$, a set $T\subseteq V$ is called a vertex cover if every edge $e\in E$ is incident on some vertex $v\in T$. Given an integer $k$, the goal is to decide whether there exists a vertex cover with cardinality at most $k$. An instance $(G,k)$ of the Vertex cover problem can be reduced to an instance of the Set cover problem by taking $\mathcal{U}=E$, $\mathcal{C}=\{\mathcal{N}(v)\mid v\in V\}$ and the same value of $k$.


Reduction to MinSumC: Given a vertex cover instance $I$, construct the corresponding set cover instance $\overline{I}$. Then construct the MinSumC instance $I'$ from $\overline{I}$ as presented in Section 5.1, with the following change: instead of constructing $n$ dummy agents and programs per subset $C_j\in\mathcal{C}$, construct $f(n,\epsilon)$-many such dummy agents and dummy programs corresponding to each subset $C_j$, where $f(n,\epsilon)=\frac{2n(1-\epsilon)}{\epsilon}\geq 1$ (since $\epsilon\leq\frac{1}{2}$).

Note that the reduced instance has $n+f(n,\epsilon)m$ agents and $m+f(n,\epsilon)m$ programs. The preference lists of all agents and programs are constructed exactly as in Section 5.1. We define $k'=n+kf(n,\epsilon)$.
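A minimal sketch of the parameter side of this reduction is given below, reusing reduce_set_cover from Section 5.1. Note that here $n=|\mathcal{U}|=|E|$, the universe size of the derived set cover instance. Rounding $f(n,\epsilon)$ up with ceil is our own assumption, since the formula need not be integral; the larger value can only help the argument.

```python
# Sketch: Vertex Cover -> Set Cover parameters for the MinSumC reduction;
# the instance itself is then built as in reduce_set_cover, with f dummies
# per subset instead of n.
from math import ceil

def reduce_vertex_cover(vertices, edges, k, eps):
    """edges: list of pairs (u, v); 0 < eps <= 1/2."""
    universe = list(range(len(edges)))               # one element per edge
    # C_j = N(v): the set of (indices of) edges incident on vertex v
    sets = {v: {i for i, (x, y) in enumerate(edges) if v in (x, y)}
            for v in vertices}
    n = len(universe)                                # universe size = |E|
    f = ceil(2 * n * (1 - eps) / eps)                # dummies per subset
    k_prime = n + k * f
    return universe, sets, f, k_prime
```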

Lemma 19.

Given a vertex cover $T$ in $I$ such that $|T|\leq k$, we can construct an $\mathcal{A}$-perfect envy-free matching $M$ in $I'$ such that $c(M)\leq k'$.

Proof.

Let $C_{open}=\{c_j\mid C_j\in T\}$. Note that $|C_{open}|=|T|$, as in Lemma 17. Again, $M$ is constructed by matching every element-agent $a_i$ to its highest-preferred program in $C_{open}$. Dummy agents corresponding to programs in $C_{open}$ are matched to their respective programs, and the remaining dummy agents are matched to their dummy programs. As in Lemma 17, we see that $M$ is $\mathcal{A}$-perfect and envy-free. Moreover, $c(M)\leq k'$. ∎

Lemma 20.

If $I'$ admits an $\mathcal{A}$-perfect envy-free matching $M$ with $c(M)\leq(\ell_a-\epsilon)k'$ for $0<\epsilon\leq\frac{1}{2}$, then $I$ admits a vertex cover $T$ with $|T|\leq(2-\epsilon')k$ where $\epsilon'=\frac{\epsilon}{2}$.

Proof.

Given an $\mathcal{A}$-perfect envy-free matching $M$, define the set of opened programs $C_{open}$ and construct $T$ from $C_{open}$ as done in the proof of Lemma 18. Hence $|T|=|C_{open}|$, and by the argument in Lemma 18, $T$ covers every edge of $G$, that is, $T$ is a valid vertex cover. Next, we show that $|T|\leq\left(2-\frac{\epsilon}{2}\right)k$. Since we have $f(n,\epsilon)$-many dummy agents for every program, in particular for every $c_j\in C_{open}$, the cost $c(M)\geq n+|T|\cdot f(n,\epsilon)$. Also, in the reduced instance $I'$ we have $\ell_a=2$: each element of $\overline{I}$ is contained in exactly two sets (corresponding to the two end-points of that edge in $I$), so every element-agent has exactly two programs in its preference list, and every dummy agent also has exactly two programs in its preference list. Therefore, we get

\[
n+|T|\cdot f(n,\epsilon)\ \leq\ c(M)\ \leq\ (\ell_a-\epsilon)k'=(\ell_a-\epsilon)(n+kf(n,\epsilon))
\]
\[
\implies\ |T|\ \leq\ (\ell_a-\epsilon)k+\frac{(\ell_a-\epsilon-1)n}{f(n,\epsilon)}=(2-\epsilon)k+\frac{(1-\epsilon)n}{f(n,\epsilon)}\qquad(\ell_a=2)
\]

Substituting $f(n,\epsilon)=\frac{n(1-\epsilon)}{\epsilon/2}$ and then using $k\geq 1$, we get

\[
|T|\ \leq\ (2-\epsilon)k+\frac{\epsilon}{2}\ \leq\ (2-\epsilon)k+\frac{k\epsilon}{2}=\Big(2-\frac{\epsilon}{2}\Big)k=(2-\epsilon')k
\]

where $\epsilon'\equiv\frac{\epsilon}{2}>0$. ∎

If the MinSumC problem admits an $(\ell_a-\epsilon)$-approximation algorithm for some $\epsilon>0$, then using Lemma 19 and Lemma 20 we get a $(2-\epsilon')$-approximation algorithm for the Vertex cover problem, where $\epsilon'=\frac{\epsilon}{2}>0$. However, under the Unique Games Conjecture, the Vertex cover problem cannot be approximated within a factor of $(2-\epsilon')$ for any $\epsilon'>0$ [13]. Therefore, the MinSumC problem does not admit an $(\ell_a-\epsilon)$-approximation algorithm for any $\epsilon>0$.

We further note that the reduced instance admits a master list ordering over agents and over programs, as described in Section 5.1. This establishes Theorem 1.3.

Remark: An argument similar to that in Section 5.1 shows that Theorem 1.3 implies $(\ell_a-\epsilon)$-inapproximability for the MinSum problem as well.

6 Concluding remarks

In this work we propose and investigate a generalized capacity planning problem for many-to-one matchings under two-sided preferences. Motivated by the need to match every agent, we propose a setting wherein costs control the extent to which a program can be matched. We aim to compute a stable matching that matches every agent in an optimally cost-augmented instance. We investigate two optimization problems and show a sharp contrast: the MinMax problem is efficiently solvable, whereas the MinSum problem is $\sf NP$-hard. We present approximation algorithms for MinSum with varying approximation guarantees, and an improved approximation algorithm for a special hard case. A specific open direction is to bridge the gap between the upper and lower bounds for general instances of the MinSum problem. It would also be interesting to extend the LP-based algorithm to general instances.

References

  • [1] Abdulkadiroğlu, A., Sönmez, T.: School Choice: A Mechanism Design Approach. American Economic Review 93(3), 729–747 (2003). https://doi.org/10.1257/000282803322157061
  • [2] Abe, K., Komiyama, J., Iwasaki, A.: Anytime Capacity Expansion in Medical Residency Match by Monte Carlo Tree Search. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022. pp. 3–9 (2022). https://doi.org/10.24963/ijcai.2022/1
  • [3] Afacan, M.O., Dur, U., Van der Linden, M.: Capacity design in school choice. Games and Economic Behavior 146, 277–291 (2024). https://doi.org/10.1016/j.geb.2024.05.002
  • [4] Baswana, S., Chakrabarti, P.P., Chandran, S., Kanoria, Y., Patange, U.: Centralized Admissions for Engineering Colleges in India. Interfaces 49(5), 338–354 (2019). https://doi.org/10.1287/inte.2019.1007
  • [5] Bobbio, F., Carvalho, M., Lodi, A., Rios, I., Torrico, A.: Capacity Planning in Stable Matching: An Application to School Choice. In: Proceedings of the Twenty-Fourth ACM Conference on Economics and Computation. p. 295 (2023). https://doi.org/10.1145/3580507.3597771
  • [6] Bobbio, F., Carvalho, M., Lodi, A., Torrico, A.: Capacity Variation in the Many-to-one Stable Matching (2022). https://doi.org/10.48550/ARXIV.2205.01302
  • [7] Chen, J., Csáji, G.: Optimal Capacity Modification for Many-To-One Matching Problems. In: Proceedings of the Twenty-Second International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023. pp. 2880–2882 (2023), https://dl.acm.org/doi/10.5555/3545946.3599110
  • [8] Dinur, I., Steurer, D.: Analytical approach to parallel repetition. In: Proceedings of the Forty-Sixth Annual ACM symposium on Theory of Computing. pp. 624–633 (2014). https://doi.org/10.1145/2591796.2591884
  • [9] Gajulapalli, K., Liu, J.A., Mai, T., Vazirani, V.V.: Stability-Preserving, Time-Efficient Mechanisms for School Choice in Two Rounds. In: Proceedings of the Fortieth IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, (FSTTCS 2020). pp. 21:1–21:15 (2020). https://doi.org/10.4230/LIPIcs.FSTTCS.2020.21
  • [10] Gale, D., Shapley, L.S.: College admissions and the stability of marriage. The American Mathematical Monthly 69(1), 9–15 (1962), http://www.jstor.org/stable/2312726
  • [11] Kavitha, T., Nasre, M.: Popular matchings with variable item copies. Theor. Comput. Sci. 412(12-14), 1263–1274 (2011). https://doi.org/10.1016/J.TCS.2010.12.067
  • [12] Kavitha, T., Nasre, M., Nimbhorkar, P.: Popularity at minimum cost. J. Comb. Optim. 27(3), 574–596 (2014). https://doi.org/10.1007/S10878-012-9537-0
  • [13] Khot, S., Regev, O.: Vertex cover might be hard to approximate within 2-ϵ\epsilon. Journal of Computer and System Sciences 74(3), 335–349 (2008). https://doi.org/10.1016/j.jcss.2007.06.019
  • [14] Krishnapriya, A.M., Nasre, M., Nimbhorkar, P., Rawat, A.: How Good Are Popular Matchings? In: Proceedings of the Seventeenth International Symposium on Experimental Algorithms (SEA 2018). pp. 9:1–9:14 (2018). https://doi.org/10.4230/LIPIcs.SEA.2018.9
  • [15] Limaye, G., Nasre, M.: Optimal Cost-Based Allocations Under Two-Sided Preferences. In: Proceedings of the Thirty-Fourth International Workshop on Combinatorial Algorithms (IWOCA 2023). pp. 259–270 (2023), https://link.springer.com/chapter/10.1007/978-3-031-34347-6_22
  • [16] Othman, A., Sandholm, T., Budish, E.: Finding approximate competitive equilibria: efficient and fair course allocation. In: Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010). p. 873–880 (2010), https://dl.acm.org/doi/10.5555/1838206.1838323
  • [17] Robards, P.A.: Applying two-sided matching processes to the United States Navy Enlisted assignment process. Tech. rep., Naval Postgraduate School Monterey CA (2001), https://hdl.handle.net/10945/10845
  • [18] Roth, A.E.: On the Allocation of Residents to Rural Hospitals: A General Property of Two-Sided Matching Markets. Econometrica 54(2), 425–427 (1986), http://www.jstor.org/stable/1913160
  • [19] Santhini, K.A., Sankar, G.S., Nasre, M.: Optimal matchings with one-sided preferences: Fixed and cost-based quotas. In: Proceedings of the Twenty-First International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2022. pp. 696–704 (2022), https://dl.acm.org/doi/10.5555/3535850.3535929
  • [20] Wu, Q., Roth, A.E.: The lattice of envy-free matchings. Games and Economic Behavior 109, 201–211 (2018). https://doi.org/10.1016/j.geb.2017.12.016
  • [21] Yang, W., Giampapa, J., Sycara, K.: Two-sided matching for the US Navy Detailing Process with Market Complication. Tech. rep., Robotics Institute, Carnegie-Mellon University (2003), https://www.ri.cmu.edu/publications/two-sided-matching-for-the-u-s-navy-detailing-process-with-market-complication/

Appendix 0.A Appendix

0.A.1 Challenges in designing a primal-dual algorithm for MinSumC

A standard primal-dual approach for the MinSumC problem would begin with a dual feasible solution. The algorithm then repeatedly updates the duals until the tight edges w.r.t. the dual setting yield a primal feasible solution. We illustrate the challenges in using such an approach for the general MinSumC problem. Consider the MinSumC instance in Fig. 8. Recall that the tuple $(q,c)$ preceding a program indicates that the initial quota and the cost of that program are $q$ and $c$ respectively. Since the instance in Fig. 8 is a MinSumC instance, the initial quota of each program is 0.

$a_1 : p_1\succ p_0$
$a_2 : p_1\succ p_0$
$a_3 : p_1\succ p_0$
$a_4 : p_1\succ p_2\succ p_0$
$a_5 : p_2\succ p_3$
$(0,0)\ p_0 : a_1\succ a_2\succ a_3\succ a_4$
$(0,1)\ p_1 : a_1\succ a_2\succ a_3\succ a_4$
$(0,6)\ p_2 : a_4\succ a_5$
$(0,11)\ p_3 : a_5$
Figure 8: A MinSumC instance used in illustrating the challenges

Assume that we begin with an initial dual setting where all dual variables are set to 0. The matching $M=\{(a_1,p_0),(a_2,p_0),(a_3,p_0),(a_4,p_0)\}$ obtained on the tight edges is envy-free but does not match agent $a_5$ and hence is not primal feasible. Since no edge incident on $a_5$ is tight (the slacks on $(a_5,p_2)$ and $(a_5,p_3)$ are 6 and 11 respectively), we can set $y_{a_5}$ to 6 while maintaining dual feasibility. While this update makes the edge $(a_5,p_2)$ tight, adding the edge to the matching $M$ introduces an envy-pair – namely $a_4$ envying $a_5$. This is our first difficulty: while there are tight edges incident on an unmatched agent, none of them may be matchable.

The second difficulty stems from the following: in order to match $a_5$ along the (non-matchable) tight edge $(a_5,p_2)$ we must first resolve the potential envy-pair $(a_4,a_5)$, that is, we must promote agent $a_4$. With the current dual setting, $y_{a_4}$ cannot be increased, so a natural way is to update a $z$ variable. This can indeed be achieved by setting $z_{a_4,p_2,a_5}=1$, thus making $(a_4,p_1)$ tight. However, as encountered earlier, this edge is not matchable, since matching $a_4$ to $p_1$ introduces several other envy-pairs. Note that this chain of potential envy resolutions is triggered by the unmatched agent $a_5$. Since this chain can be arbitrarily long, several $z$ updates may be required. It is not immediate whether these updates in the $z$ variables can be charged to an update in some $y$ variable, thereby achieving a reasonable approximation ratio.
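The slack values quoted above can be checked mechanically. Below is a minimal sketch, assuming the dual constraint of an edge $(a,p)$ has the form used in the analysis (lhs $=y_a+\sum z_{a,p',a''}-\sum z_{a',p,a}\leq c(p)$, with $p'=p$ or $p'\prec_a p$, $a''\prec_{p'}a$, and $a'\succ_p a$); the concrete data structures are our own and not part of the paper.

```python
# Sketch: slack(a, p) = c(p) - lhs of the dual constraint of edge (a, p),
# for (an excerpt of) the Fig. 8 instance. Representations are ours.

agent_pref = {"a4": ["p1", "p2", "p0"], "a5": ["p2", "p3"]}
prog_pref = {"p0": ["a1", "a2", "a3", "a4"], "p1": ["a1", "a2", "a3", "a4"],
             "p2": ["a4", "a5"], "p3": ["a5"]}
cost = {"p0": 0, "p1": 1, "p2": 6, "p3": 11}

def slack(a, p, y, z):
    lhs = y.get(a, 0)
    # positive-form variables z_{a,p',a''}: p' = p or p' less preferred
    # than p by a, and a'' less preferred than a by p'
    for pp in agent_pref[a][agent_pref[a].index(p):]:
        for worse in prog_pref[pp][prog_pref[pp].index(a) + 1:]:
            lhs += z.get((a, pp, worse), 0)
    # negative-form variables z_{a',p,a}: a' preferred over a by p
    for better in prog_pref[p][:prog_pref[p].index(a)]:
        lhs -= z.get((better, p, a), 0)
    return cost[p] - lhs

print(slack("a5", "p2", {}, {}))                       # 6, as in the text
print(slack("a5", "p3", {}, {}))                       # 11
print(slack("a4", "p1", {}, {("a4", "p2", "a5"): 1}))  # 0: edge turns tight
```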

However, as seen in Section 4.2, for the restricted hard case of two distinct costs, we are able to resolve these challenges.

0.A.2 Example illustrating the execution of Algorithm 3

$a_1 : p_1\succ p_2\succ p_0$
$a_2 : p_2\succ p_3\succ p_0$
$a_3 : p_1\succ p_2\succ p_3$
$(0,0)\ p_0 : a_1\succ a_2$
$(0,1)\ p_1 : a_1\succ a_3$
$(0,1)\ p_2 : a_1\succ a_2\succ a_3$
$(0,1)\ p_3 : a_2\succ a_3$
  • $M=\{(a_1,p_0),(a_2,p_0)\}$

  • (1) $a=a_3$, $y_{a_3}=1$; tight edges on $a_3$ are $\{(a_3,p_1),(a_3,p_2),(a_3,p_3)\}$; $thresh(p_1)=thresh(p_2)=a_1$ and $thresh(p_3)=a_2$

  • (3a) $\mathcal{P}(a_3)=\{p_1,p_2,p_3\}$

  • (3b,3c) let $a'=a_1$; then $p=p_2$, $z_{a_1,p_2,a_3}=1$

  • (3d) tight edges on $a_1$ are $\{(a_1,p_1),(a_1,p_2)\}$; $p'=p_1$, $M=\{(a_1,p_1),(a_2,p_0)\}$; tight edges on $a_3$ are $\{(a_3,p_1),(a_3,p_3)\}$

  • (3e) $thresh(p_1)=a_3$, $M=\{(a_1,p_1),(a_2,p_0),(a_3,p_1)\}$

  • (3f) $\mathcal{P}(a_3)=\emptyset$

Figure 9: A ${\sf MinSumC}_{c_1,c_2}$ instance. An execution of Algorithm 3 is illustrated by giving the state of the algorithm. The blue content in the brackets corresponds to the labels of the steps mentioned in the description of the algorithm.