
1 Peking University   2 Columbia University
allenyzj@stu.pku.edu.cn, {ll3530, christian.kroer}@columbia.edu

Greedy-Based Online Fair Allocation with Adversarial Input: Enabling Best-of-Many-Worlds Guarantees

Zongjun Yang 1,   Luofeng Liao 2,   Christian Kroer 2
Abstract

We study an online allocation problem with sequentially arriving items and adversarially chosen agent values, with the goal of balancing fairness and efficiency. Specifically, we study the performance of algorithms that achieve strong guarantees under other input models, such as stochastic inputs, in order to obtain guarantees that are robust across a variety of inputs. To that end, we study the PACE (Pacing According to Current Estimated utility) algorithm, an existing algorithm designed for stochastic input. We show that in the equal-budgets case, PACE is equivalent to the integral greedy algorithm. We go on to show that, under natural restrictions on the adversarial input model, both the integral greedy allocation and PACE have asymptotically bounded multiplicative envy as well as a bounded competitive ratio for Nash welfare, with the multiplicative factors either constant or with optimal order dependence on the number of agents. This completes a “best-of-many-worlds” guarantee for PACE, since past work showed that PACE achieves guarantees for stationary and stochastic-but-non-stationary input models.

Keywords:
online fair allocation · envy analysis · Nash welfare optimization

1 Introduction

We study online fair allocation, where items arrive sequentially in $T$ rounds, and we need to distribute them among a set of $n$ agents with heterogeneous preferences, with the goal of achieving both fairness and efficiency properties. In each round, we observe each agent’s value for the item and make an irrevocable allocation. Frequently in the literature on this problem, the items are assumed to be divisible, and agents’ utilities are linear and additive. We also consider linear and additive utilities, but indivisible items.

The fair allocation problem in the offline setting has been well studied. A classical objective to optimize is the Nash welfare (NW), defined as the geometric mean of the agents’ utilities. Maximizing Nash welfare provides a balance between efficiency and fairness due to the multiplicative nature of the objective. For divisible items, an offline optimal allocation can be computed by solving the Eisenberg-Gale (EG) convex program (Eisenberg and Gale, 1959). The solution enjoys both envy-freeness and proportionality, which are important measures of fairness. For indivisible items, finding an allocation that maximizes NW is an APX-hard problem even in the offline setting (Moulin, 2004; Lee, 2017), though constant-factor approximation algorithms are known (Cole and Gkatzelis, 2018; Cole et al., 2017; McGlaughlin and Garg, 2020).

In the online setting, Gao et al. (2021) provide a simple allocation algorithm called PACE (Pacing According to Current Estimated Utility), which generates asymptotically fair and efficient allocations when items are drawn in an i.i.d. manner. PACE gives each agent a per-round budget of faux currency, and simulates a first-price auction in each round. The fair allocation is achieved by having each agent shade their bid with a pacing multiplier, which is a projection of their current estimated inverse bang-per-buck to a fixed interval. Liao et al. (2022) extend these results to non-stationary inputs, where the distribution of items may change over time. They show that in this case, PACE still achieves asymptotic fairness and efficiency guarantees, up to linear error terms from the amount of non-stationarity.

Yet in many real-world scenarios, we cannot expect items to be drawn in a stochastic manner, even if from non-stationary distributions. This motivates the investigation of algorithms with competitive-ratio guarantees for adversarial settings. To handle arbitrary inputs, including extreme ones, some algorithms adopt “conservative” designs for fairness, such as allocating half of each item purely equally (Banerjee et al., 2022). Although this helps to provide worst-case guarantees, it harms average-case efficiency, which may not be acceptable in some practical applications. Moreover, this conservative allocation requires each item to be divisible, or at least for random allocation to be acceptable.

This motivates us to move in another direction: Instead of developing algorithms to fit extreme adversarial inputs, we seek to find worst-case guarantees for existing algorithms that are designed for stochastic inputs. In particular, we focus on the performance of the algorithm PACE  (Gao et al., 2021; Liao et al., 2022), and explore the question:

How does PACE perform under adversarial input?

Our first contribution is to show that, in the case where all agents have the same weight (or budget in market equilibrium terminology), PACE is equivalent to the integral greedy algorithm, assuming no projection of the pacing multiplier. Due to this equivalence, we start by studying the integral greedy allocation method. Our results for that method are of independent interest, as it is a natural allocation method.

Although both integral greedy allocation and PACE have infinite envy and $\Omega(T)$ competitive ratio when inputs are completely arbitrary, we notice that such pessimistic results only occur under extreme inputs where the largest and smallest non-zero values for an agent differ by an exponential factor. We show that, once we rule out such extreme instances by introducing mild assumptions (such as a constant ratio between the nonzero values observed by an agent), both algorithms converge with approximate envy-freeness and bounded competitive ratio as $T$ increases to infinity. The upper bounds are either constant, or of near-optimal order in $n$; see Table 1. Combined with existing results under stationary (Gao et al., 2021) and non-stationary (Liao et al., 2022) input models, this establishes a “best-of-many-worlds” guarantee for PACE: it is the first online algorithm that simultaneously guarantees asymptotic fairness and efficiency under stochastic, stochastic-but-nonstationary, and adversarial inputs. As such, we believe our results show that PACE is a natural and robust algorithm for online fair allocation in real-world settings, since it achieves strong guarantees under many different input models, and is thus likely to perform well on a variety of real-world inputs.

| Algorithm | Assumptions | Measure | Upper bound ($T\rightarrow\infty$) | Theorem |
| Integral greedy | Assumption 3.1 | multiplicative envy | $1+2\log\frac{1}{\varepsilon}$ | Theorem 3.1 |
| Integral greedy | Assumption 3.1 | competitive ratio w.r.t. NW | $\lambda\cdot(n!)^{(1+\alpha)/n},\ \forall\alpha>0$ * | Theorem 3.3 |
| Integral greedy | seed utility $\delta$ | utility ratio with seeds | $O(\log T)$ | Theorem 3.5 |
| PACE | Assumption 4.1 | multiplicative envy | $1+2\log\frac{1}{\varepsilon}$ | Theorem 4.1 |
| PACE | Assumption 4.2 | competitive ratio w.r.t. NW | $\left(1+2\log\frac{1}{\varepsilon}\right)\cdot\frac{1}{c}$ | Theorem 4.2 |

* The lower bound for online algorithms is at least $(n!)^{1/n}$ as $n\rightarrow\infty$.

Table 1: Summary of results.

1.1 Related Work

In this subsection, we review previous works that are most closely related to ours. An extensive review of other related works is provided in Appendix 0.A.

The PACE Algorithm

Our work is a direct generalization of the PACE algorithm (Pacing According to Current Estimated Utility) (Gao et al., 2021; Liao et al., 2022) to adversarial inputs. We review PACE in detail in Section 2.

Online Fair Division

For maximizing Nash welfare in the online setting, the competitive ratio with respect to the offline optimal allocation is trivially $\Omega(n)$ and $\Omega(T)$ when the input is completely adversarial (Banerjee et al., 2022). These pessimistic results under arbitrary input motivate the introduction of assumptions. Azar et al. (2016) adopted an assumption that we also make: that the minimum nonzero valuation of each agent is at least $\varepsilon$ times the largest. However, their analysis does not remove the dependence on $T$ in their upper bound of $O\left(\log\frac{nT}{\varepsilon}\right)$ on the competitive ratio, meaning that the ratio is unbounded in the online setting with a fixed number of agents and an unbounded horizon. Banerjee et al. (2022) assume extra prior knowledge of the monopolistic utility of each agent, and give an $O(\log n)$- and $O(\log T)$-competitive algorithm; their algorithm involves allocating half of each item uniformly across agents; since we are interested in algorithms with asymptotic convergence guarantees on non-adversarial inputs, such an approach cannot be used. Huang et al. (2022) assume the input to be $\lambda$-balanced or $\mu$-impartial, where $\lambda$ and $\mu$ characterize the desired properties of the input; their competitive-ratio upper bounds are logarithmic in the parameter and in $n$. However, the parameters still implicitly depend on the horizon length $T$.

The above works on online NW maximization consider greedy-style algorithms as we do. In contrast to them, our work focuses on an asymptotic bound that does not depend on $T$ as $T\rightarrow\infty$. Moreover, while their algorithms can only deal with divisible items, our paper adopts integral allocation, and shows that integral decisions suffice for the convergence of multiplicative envy and of the competitive ratio w.r.t. the optimal divisible allocation, given our assumptions.

Online Allocation with Resource Constraints

We also briefly discuss how this paper differs from existing works in online resource allocation, where a sequence of requests arrive over time, with each request consisting of a reward and cost function, and at each time step the algorithm must select a decision with the goal of maximizing the sum of rewards while satisfying long-term cost constraints on each resource. In that setting, strong best-of-many-worlds guarantees are known (Balseiro et al., 2023; Celli et al., 2022; Castiglioni et al., 2022). In online resource allocation, the optimization objective is separable across timesteps, e.g., of the form $\sum_{s=1}^{T}f_{s}(x)$ (Balseiro et al., 2023). Time-separability is crucial for the regret bounds in these works, as it enables translating dual regret to primal regret through weak duality. However, in online fair allocation with the Nash welfare objective, time-separability no longer holds, since we take the logarithm of the utility over time. Therefore, our results cannot be derived with similar techniques to those papers. Indeed, the types of competitive-ratio guarantees achieved e.g. by Balseiro et al. (2023) are impossible in the online fair allocation setting, where hard input sequences are known (Banerjee et al., 2022; Gao et al., 2021).

2 Setup

2.1 Online Fair Allocation

Consider a problem instance with $n$ agents and $T$ items. For $i\in[n]$ and $t\in[T]$, let $v_{i}^{t}\geq 0$ be agent $i$’s value for a unit of item $t$. The input of our problem is a sequence of agent valuations $\bm{v}=(v_{i}^{t})^{n\times T}$. We assume that each item $t$ is valued by at least one agent, i.e., there exists an agent $i$ such that $v_{i}^{t}>0$. Each agent $i\in[n]$ has a non-negative weight $B_{i}$, which can also be interpreted as a budget of faux currency in a Fisher market (Varian, 1974). An allocation $\bm{x}=(x_{i}^{t})^{n\times T}$ distributes the items among the agents, where $x_{i}^{t}$ is the amount of item $t$ allocated to agent $i$. We assume each item has unit supply. An allocation is feasible if $\sum_{i\in[n]}x_{i}^{t}\leq 1$ for each $t$. An allocation is integral if $x_{i}^{t}\in\{0,1\}$. For a feasible, integral allocation $\bm{x}$, let $A_{i}=\{t:x_{i}^{t}=1\}$ be the set of items allocated to agent $i$.

We assume additive, linear utility for all agents. That is to say, $U_{i}^{t}=\sum_{s=1}^{t}x_{i}^{s}v_{i}^{s}$, where $U_{i}^{t}$ is the utility agent $i$ derives from the first $t$ items. For a subset of items $A\subseteq[T]$, agent $i$’s total value for the bundle $A$ is denoted $U_{i}(A)=\sum_{s\in A}v_{i}^{s}$. Agent $i$’s monopolistic utility $V_{i}$ is defined as his total value for all items: $V_{i}=U_{i}([T])=\sum_{t=1}^{T}v_{i}^{t}$.

We focus on the online setting where items arrive sequentially, while the set of agents is fixed. An online allocation algorithm is one that makes an irrevocable choice to distribute the item in each round based only on information from past rounds. Concretely, it maps the history $\mathcal{H}^{t}=\left\{(v_{i}^{s})_{s=1}^{t},(x_{i}^{s})_{s=1}^{t-1}\right\}_{i=1}^{n}$ to a decision $(x_{1}^{t},\cdots,x_{n}^{t})$ such that $\sum_{i=1}^{n}x_{i}^{t}=1$.

In this paper, we are interested in envy and Nash welfare (NW) as measures of fairness. The multiplicative envy of agent $i$ toward agent $j$ is defined as the ratio between the utility that agent $i$ would get from the allocation $x_{j}$ of agent $j$ and the utility of his own allocation $x_{i}$, adjusted by their respective budgets. As a criterion for fairness, it measures the extent to which an agent prefers someone else’s bundle to his own:

\mathrm{Envy}_{ij}=\frac{B_{i}}{B_{j}}\cdot\frac{\sum_{t=1}^{T}x_{j}^{t}v_{i}^{t}}{\sum_{t=1}^{T}x_{i}^{t}v_{i}^{t}}.

Notice that when the allocation is integral, the above definition becomes $\frac{B_{i}}{B_{j}}\cdot{U_{i}(A_{j})}/{U_{i}(A_{i})}$.
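As a small illustration (the numbers here are ours, not from the paper): take two agents with equal budgets and three items, with values $v_{1}=(1,0.5,0.5)$ and $v_{2}=(0.6,1,1)$, and the integral allocation $A_{1}=\{1\}$, $A_{2}=\{2,3\}$. Then $\mathrm{Envy}_{12}=U_{1}(A_{2})/U_{1}(A_{1})=1/1=1$ and $\mathrm{Envy}_{21}=U_{2}(A_{1})/U_{2}(A_{2})=0.6/2=0.3$, so neither agent multiplicatively envies the other (envy at most $1$ means no envy).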

The Nash welfare of an allocation is defined as the weighted geometric mean of all agents’ utilities:

\mathrm{NW}=\prod_{i=1}^{n}\left(U_{i}^{T}\right)^{B_{i}/\sum_{j}B_{j}}.

Maximizing the geometric mean is a well-studied proxy for balancing fairness and efficiency in fair allocation problems. It is also equivalent to maximizing the objective of the Eisenberg-Gale convex program in a Fisher market. For a Nash-welfare-maximizing allocation $\bm{x}^{\star}=\{x_{i}^{\star,t}\}$, let $U_{i}^{\star}=\sum_{t=1}^{T}x_{i}^{\star,t}v_{i}^{t}$ be the utility of agent $i\in[n]$. Notice that the optimal allocation might not be unique. In this paper, we measure performance in terms of Nash welfare via the competitive ratio of our allocation, defined as the supremum, over the space of all possible inputs, of the ratio between the Nash welfare of an optimal offline NW-maximizing allocation and that of the online allocation.

While we allow the input sequence to be adversarial, we restrict our attention to a subset of adversarial input sequences, where we use $\mathcal{V}^{T}$ to denote the space of valid length-$T$ input sequences. Concrete assumptions on $\mathcal{V}^{T}$ will be specified in Section 3 and Section 4 before analyzing specific algorithms.

We consider the asymptotic worst-case envy and competitive ratio (w.r.t. Nash welfare) over all possible inputs in $\mathcal{V}^{T}$ as $T\rightarrow\infty$. Concretely, we will investigate

\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}^{T}}\max_{i,j}\mathrm{Envy}_{ij},\qquad\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}^{T}}\prod_{i=1}^{n}\left(\frac{U_{i}^{\star}}{U_{i}^{T}}\right)^{B_{i}/\sum_{j}B_{j}}.

In our analysis, we will show that with proper assumptions, both measures converge to an asymptotic upper bound that is independent of $T$. Our upper bounds are either constant or have a near-optimal order of dependence on $n$.

We emphasize that both measures adopted in this paper, multiplicative envy and competitive ratio, are defined as ratios, not differences. This is mainly because of the scale invariance of fair division, which is an important and desirable property for allocation algorithms: if an agent’s values for all items are multiplied by a constant factor, the resulting allocation stays the same. The algorithms that we are interested in, together with NW-maximizing allocations, are all scale invariant. Hence, it is more useful to consider multiplicative performance measures, which are likewise invariant to valuation scaling.

2.2 Algorithms

We introduce the two main algorithms studied in this paper: the PACE (Pacing According to Current Estimated Utility) algorithm (Gao et al., 2021; Liao et al., 2022), and the integral greedy algorithm. Moreover, we will discuss the “greedy-based” nature of PACE by showing its equivalence to the integral greedy algorithm under certain conditions.

Pseudocode for PACE is shown in Algorithm 1. In each round $t$, the agents’ values for the item are revealed. Each agent then places a bid for the item, equal to their value for the item multiplied by their current pacing multiplier $\beta_{i}^{t}$. The whole item is allocated to the highest bidder, preferring the bidder with the smallest index when a tie occurs. Each agent then observes their realized utility in this round and updates their current estimated utility. The pacing multiplier is updated to the weight $B_{i}$ divided by the estimated utility, projected onto the interval $[a,b]$.

Input: number of agents $n$, time horizon $T$, truncation parameters $a$ and $b$.
Initialization: $\bar{u}_{i}^{0}=0$ for all $i$, $\beta^{1}=\bm{1}^{n}$.
for $t=1,\cdots,T$ do
    Agent $i$ bids $\beta_{i}^{t}v_{i}^{t}$.
    The whole item $t$ is allocated to the highest bidder, breaking ties toward the smallest index:
        $i^{t}:=\min\arg\max_{i\in[n]}\beta_{i}^{t}v_{i}^{t},\quad x_{i}^{t}=\bm{1}(i=i^{t}).$
    Agent $i$ updates his estimated utility:
        $\bar{u}_{i}^{t}=\frac{1}{t}\,x_{i}^{t}v_{i}^{t}+\frac{t-1}{t}\,\bar{u}_{i}^{t-1}.$
    Agent $i$ updates the pacing multiplier:
        $\beta_{i}^{t+1}=\Pi_{[a,b]}\left(\frac{B_{i}}{\bar{u}_{i}^{t}}\right).$
Algorithm 1: PACE
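For concreteness, the following is a minimal Python sketch of Algorithm 1; the function name, the numpy-based interface, and the small guard against division by zero are our own choices, not part of the paper.

import numpy as np

def pace(values, B, a, b):
    """values: T x n array of v_i^t; B: length-n weights; [a, b]: projection interval.
    Returns the index of the winning agent in each round."""
    T, n = values.shape
    beta = np.ones(n)                  # pacing multipliers, beta^1 = 1^n
    u_bar = np.zeros(n)                # estimated (time-averaged) utilities
    winners = np.empty(T, dtype=int)
    for t in range(T):
        i_t = int(np.argmax(beta * values[t]))   # first-price winner; np.argmax
        winners[t] = i_t                         # breaks ties by smallest index
        realized = np.zeros(n)
        realized[i_t] = values[t, i_t]
        u_bar = realized / (t + 1) + u_bar * t / (t + 1)    # running average
        beta = np.clip(B / np.maximum(u_bar, 1e-12), a, b)  # project onto [a, b]
    return winners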

2.2.1 Performance Under Stochastic Input

As shown by Gao et al. (2021), PACE is an instantiation of stochastic unregularized dual averaging  (Xiao, 2009) applied to the dual of the underlying allocation program where the supplies are given by the density of each item. With i.i.d. input, PACE converges to the equilibrium of a potentially infinite-dimensional Fisher market (Gao and Kroer, 2023), which is also closely related to the game-theoretic solution concept of pacing equilibrium (Conitzer et al., 2022). The agent utilities under PACE converge to those associated with the offline NW-maximizing allocation in the mean-square sense.

Theorem 2.1.

(Theorem 4 in Gao et al. (2021)) Let $u_{i}^{\star}:=U_{i}^{\star}/T$ be agent $i$’s time-averaged utility under the Nash-welfare-maximizing allocation with supplies given by some underlying distribution. When the values in different rounds are drawn i.i.d. from this distribution, it holds that

\bm{E}\left[\left\|\bar{u}^{T}-u^{\star}\right\|^{2}\right]\leq C\cdot\frac{\log T}{T}, (1)

where $C$ is a constant independent of $T$.

Liao et al. (2022) generalize Theorem 2.1 to non-stationary inputs, which have a stochastic component yet change over time. In particular, they consider three types of inputs: independent yet adversarially corrupted input, ergodic input, and periodic input, and show that in all three cases $\bm{E}\|\bar{u}^{T}-u^{\star}\|\rightarrow 0$ is still preserved, up to errors due to non-stationarity.

In the stationary and non-stationary cases, mean-square convergence of time-averaged utility implies an asymptotic competitive ratio of $1$ w.r.t. Nash welfare. For both cases, Gao et al. (2021); Liao et al. (2022) also provide a theoretical guarantee that PACE is asymptotically envy-free (again up to a non-stationarity error in the non-stationary case). In this paper, we provide bounds on PACE’s performance under adversarial inputs with mild assumptions. Combined with the results from Gao et al. (2021); Liao et al. (2022), this is the first “best-of-many-worlds” guarantee for online fair allocation under stationary, non-stationary, and adversarial inputs.

While we take an algorithm designed for stochastic inputs and bound its performance on adversarial input, the other direction seems difficult: it is hard for some algorithms designed for adversarial inputs to achieve optimality in stochastic scenarios, due to the “conservative” routines they adopt to deal with extremely bad inputs. For instance, Banerjee et al. (2022) divide half of the resources equally, which can be undesirable with stochastic input; see Example 1.

Example 1.

Consider an online scenario with $n$ agents and $n$ types of items $\{\theta_{j}\}_{j=1}^{n}$. Agent $i$’s value for a unit of a type-$j$ item is $v_{i}(\theta_{j})=1$ if $i=j$, and $v_{i}(\theta_{j})=0.01$ otherwise. In each round, the item type is drawn i.i.d. from a uniform distribution. In this scenario,

  • The equilibrium of the underlying Fisher market allocates all type-$i$ items to agent $i$. PACE converges to this equilibrium.

  • The algorithm proposed by Banerjee et al. (2022) allocates at most a $\left(\frac{1}{2}+\frac{1}{n}\right)$ fraction of the type-$i$ items to agent $i$, which is clearly not optimal.

2.2.2 Greedy Interpretation of PACE

In the standard configuration of online fair allocation, where agents have equal weights $B_{1}=\cdots=B_{n}=1$, we now show that PACE, when the projection of the multipliers is disregarded, can be interpreted as greedily maximizing Nash welfare with integral decisions.

To show this, consider the following optimization program (2), which greedily maximizes NW up to round $t$ given the history of the previous $t-1$ rounds; its decision is integral.

\max_{x_{i}^{t}\in\{0,1\}}\ \sum_{i=1}^{n}B_{i}\log U_{i}^{t} \qquad (2)

s.t.\ \ U_{i}^{t}=\sum_{s=1}^{t-1}x_{i}^{s}v_{i}^{s}+x_{i}^{t}v_{i}^{t},\ \forall i\in[n]

\ \ \sum_{i=1}^{n}x_{i}^{t}=1

The above program allocates the item to the agent $i^{t}$ that yields the maximum increment of the objective. This is equivalent to the following decision rule:

i^{t}\in\arg\max_{i\in[n]}B_{i}\log\left(1+\frac{v_{i}^{t}}{U_{i}^{t-1}}\right).

When agent weights are equal, $i^{t}$ is the agent $i$ that maximizes ${v_{i}^{t}}/{U_{i}^{t-1}}$; this coincides exactly with the decision of PACE with no projection.
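To spell out the last step: when the projection never binds, $\beta_{i}^{t}=B_{i}/\bar{u}_{i}^{t-1}$ and $\bar{u}_{i}^{t-1}=U_{i}^{t-1}/(t-1)$, so the PACE bid satisfies

\beta_{i}^{t}v_{i}^{t}=\frac{B_{i}v_{i}^{t}}{\bar{u}_{i}^{t-1}}=(t-1)\cdot\frac{B_{i}v_{i}^{t}}{U_{i}^{t-1}}.

The factor $(t-1)$ is common to all agents, so $\arg\max_{i}\beta_{i}^{t}v_{i}^{t}=\arg\max_{i}B_{i}v_{i}^{t}/U_{i}^{t-1}$, which under equal weights is exactly the greedy rule $\arg\max_{i}v_{i}^{t}/U_{i}^{t-1}$.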

This interpretation motivates us to first consider the behavior of the greedy algorithm (without projection of the multiplier), and then to study PACE based on the insights derived for integral greedy. We focus on the weight-adapted integral greedy algorithm, or simply the integral greedy algorithm, shown in Algorithm 2.

Input: number of agents $n$, time horizon $T$.
Initialization: $U_{i}^{0}=0$ for all $i$.
for $t=1,\cdots,T$ do
    Observe agent values for item $t$, and allocate the whole item to agent $i^{t}$:
        $i^{t}:=\min\left(\arg\max_{i\in[n]}\frac{B_{i}v_{i}^{t}}{U_{i}^{t-1}}\right),\quad x_{i}^{t}=\bm{1}(i=i^{t}).$
    Agent $i$ updates his current utility:
        $U_{i}^{t}=U_{i}^{t-1}+x_{i}^{t}v_{i}^{t}.$
Algorithm 2: Integral Greedy Algorithm (weight-adapted)
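A Python sketch of Algorithm 2 in the same style as the PACE snippet above, reusing its numpy import (again, the names and the numpy handling of zero utilities are ours):

def integral_greedy(values, B):
    """values: T x n array of v_i^t; B: agent weights. Returns winner indices."""
    T, n = values.shape
    U = np.zeros(n)                        # cumulative utilities U_i^t
    winners = np.empty(T, dtype=int)
    for t in range(T):
        with np.errstate(divide='ignore', invalid='ignore'):
            # the ratio is inf for an agent with U_i = 0 and v_i^t > 0,
            # so unserved agents with nonzero value are served first
            ratios = np.where(values[t] > 0, B * values[t] / U, 0.0)
        winners[t] = int(np.argmax(ratios))    # argmax breaks ties by smallest index
        U[winners[t]] += values[t, winners[t]]
    return winners

With equal weights, each step selects $\arg\max_{i}v_{i}^{t}/U_{i}^{t-1}$, matching the unprojected PACE rule derived above.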

Algorithm 2 is equivalent to greedily maximizing the NW objective with integral decisions when agent weights are equal. However, with unequal weights this equivalence breaks down, as does the equivalence to PACE. For this reason, in the following discussion we assume equal weights $B_{1}=\cdots=B_{n}$, which is a standard setting in the Nash welfare maximization and online fair allocation literature. We will also remark on which of our results generalize to unequal weights. Note that the PACE algorithm itself extends to unequal weights, and the stationary and non-stationary results extend as well.

Under the equal-weights assumption, PACE can be regarded as a greedy-based decision rule that uses the projected utility $\hat{U}_{i}^{t}$ as its estimated utility, where $\ell$ and $r$ are the reinterpreted projection bounds:

i^{t}:=\min\arg\max_{i\in[n]}\frac{v_{i}^{t}}{\hat{U}_{i}^{t-1}},\qquad\hat{U}_{i}^{t}=\Pi_{[\ell t,rt]}\left(U_{i}^{t}\right).

3 Analysis of the Integral Greedy Algorithm

This section is devoted to showing that the integral greedy algorithm achieves convergence of multiplicative envy and of the competitive ratio, under reasonable assumptions. Missing proofs from this section are deferred to Appendix 0.B.

3.1 Assumptions on the Input

We begin by introducing the assumptions on the input space, as well as the necessity of making them. We focus on the input space $\mathcal{V}_{\varepsilon}^{T}$, which is parametrized by $\varepsilon\in(0,1]$ and defined as follows.

Assumption 3.1. For each integer $T$, $\mathcal{V}_{\varepsilon}^{T}$ is the set of inputs which satisfy the following requirements:

  • The monopolistic utility of each agent grows without bound: $V_{i}\rightarrow\infty$ as $T\rightarrow\infty$.

  • For each $i\in[n]$ and $t\in[T]$, $v_{i}^{t}\in\{0\}\cup[\varepsilon,1]$.

The first requirement helps to avoid allocating nothing to some agent, since the decisions are integral. In particular, it requires that the number of nonzero valuations of each agent is not bounded by a constant, so that it is meaningful to consider asymptotics in $T$. As for the second requirement, we note that it is equivalent to assuming a constant lower bound on the ratio between an agent’s minimum and maximum nonzero item values:

\frac{\min_{t}\{v_{i}^{t}:v_{i}^{t}>0\}}{\max_{t}\{v_{i}^{t}\}}\geq\varepsilon,\quad\forall i\in[n].

This equivalence is due to the scale-invariant property of integral greedy allocation, as well as the NW maximizing allocation.

If we do not include any extra assumptions on the input space, the worst-case envy and the competitive ratio are unbounded, which makes the analysis of the integral greedy algorithm trivial and uninteresting.

Lemma 1.

When the values are arbitrarily chosen from $[0,1]$, the following hold even when the first requirement of Assumption 3.1 is satisfied:

  1. (Banerjee et al., 2022) Any online allocation algorithm has an $\Omega(T)$ competitive ratio with respect to NW.

  2. The integral greedy algorithm has $\Omega(T)$ worst-case multiplicative envy.

Proof.

We construct a sequence of instances where $\mathrm{Envy}_{12}=\Omega(T)$. Given horizon length $T$, fix agent 1’s valuation to $1$ in all rounds. For agent 2, set $v_{2}^{t}=a^{t-T}\leq 1$ for some $a>2$. Since $\sum_{s<t}a^{s-T}<a^{t-T}$ for $a>2$, agent 2’s value in each round exceeds his cumulative utility so far, so the integral greedy algorithm allocates every item from round 2 onward to agent 2. Agent 1 thus receives only the first item and has total utility $1$, while valuing agent 2’s bundle at $T-1$. This proves the lower bound on multiplicative envy. ∎
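This construction can be checked numerically with the integral_greedy sketch from Section 2.2 (illustrative code; the parameter choices are ours):

# On the hard instance, agent 1 (index 0) wins only round 1, so
# Envy_12 = U_1(A_2)/U_1(A_1) = (T - 1)/1, which is linear in T.
T, a = 200, 3.0
V = np.ones((T, 2))
V[:, 1] = a ** (np.arange(1, T + 1) - T)   # v_2^t = a^(t - T)
winners = integral_greedy(V, np.ones(2))
envy_12 = V[winners == 1, 0].sum() / V[winners == 0, 0].sum()
print(envy_12)                             # 199.0, i.e., T - 1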

The above hard instance features exponential growth in agent 2’s valuation. Hard instances in a similar spirit for the continuous problem have previously been given by Banerjee et al. (2022). The vulnerability of allocation algorithms to such instances can be explained by their non-anticipating nature: an online algorithm does not know the future, so it fails to recognize that agent 2’s valuations up to the current round are only a negligible fraction of those over the entire horizon. More generally, it is difficult for online algorithms to distinguish agents that are hard to satisfy in the future from those that are easily satisfied.

However, we note that such adversarial instances are arguably not “natural.” For allocation problems in a real-world market, items are usually similar in nature, e.g., they are all food items or all ad slots. It is unlikely for an agent to have exponentially diverging nonzero values on such items. By requiring the ratio of the minimal and maximal nonzero values to be bounded by $\varepsilon$, Assumption 3.1 rules out extreme cases where the values of a single agent diverge drastically. In the rest of this section, we will show that once the above assumptions are introduced, both the multiplicative envy and the competitive ratio of the greedy algorithm are asymptotically independent of $T$, i.e., converge to a constant (which depends only on $n$ and $\varepsilon$).

3.2 Envy Analysis for Greedy

In this subsection we analyze the worst-case envy between agents under the integral greedy allocation. We observe that the envy between any pair of agents can be reduced to the case $n=2$ by the following lemma, which characterizes an “induction” structure of the integral greedy allocation.

Lemma 2 (Inductive structure of greedy allocation).

For any $n$-agent instance $\bm{v}$ and agent subset $I\subseteq[n]$, define a new instance $\bm{v}|_{I}$ obtained by transforming $\bm{v}$ as follows:

  • Remove all agents not in $I$ (by setting their values to $0$ on all items).

  • Remove all items not in $\bigcup_{i\in I}A_{i}$ (by setting all agents’ values to $0$ on them).

Then the resulting allocation is the same when the integral greedy algorithm is run on $\bm{v}$ and on $\bm{v}|_{I}$.

We show that the multiplicative envy of the integral greedy algorithm is upper bounded by $1$ plus a term logarithmic in $1/\varepsilon$; the bound also generalizes to unequal weights.

Theorem 3.1 (Upper Bound for Multiplicative Envy).

Even with unequal agent weights, for inputs $\bm{v}$ satisfying Assumption 3.1 and any $i,j$,

\sup_{v\in\mathcal{V}_{\varepsilon}^{T}}\mathrm{Envy}_{ij}\leq 1+2\log\frac{1}{\varepsilon}+O\left(\frac{1}{T}\right).
Proof Sketch.

Due to Lemma 2 and symmetry, it suffices to consider $\mathrm{Envy}_{21}$ in 2-agent inputs. In this proof sketch, we assume $B_{1}=B_{2}=1$ for simplicity. We transform the input by:

  1. Set agent 1’s valuation for all items in $A_{2}$ to $0$.

  2. Move all items in $A_{2}$ to the beginning of the input sequence, and all items in $A_{1}$ to the end of the input sequence (preserving relative order).

One can show that the agents’ utilities under the weight-adapted greedy algorithm are invariant to this transformation. Therefore, it suffices to consider only the transformed inputs, where agent 2 receives his entire share in the first $R$ rounds.

To find the worst-case envy for transformed inputs, the question from an adversarial point of view becomes: given $(U_{1}^{R},U_{2}^{R})=(0,U)$, how can we design a value sequence for the remaining rounds such that agent 2’s total value for the items in those rounds is maximized, while ensuring that nothing further is allocated to agent 2? This can be characterized by an optimization program:

\max_{v_{1}^{t},v_{2}^{t}}\ \frac{1}{U}\sum_{t=1}^{\infty}v_{2}^{t} \qquad (3)

s.t.\ \ v_{1}^{t},v_{2}^{t}\in\{0\}\cup[\varepsilon,1],\quad\forall t\geq 1

\ \ \frac{v_{2}^{t}}{v_{1}^{t}}\leq\frac{U}{U_{1}^{t-1}},\quad\forall t\geq 1

\ \ U_{1}^{t}\geq\sum_{s=1}^{t}v_{1}^{s},\quad\forall t\geq 1

Notice that in (3) we re-index the rounds, starting from index $1$ at round $R+1$. We call (3) the canonical optimization program for multiplicative envy maximization, parametrized by $U$. We observe that $v_{2}^{t}/v_{1}^{t}$ can be upper bounded by $q(U_{1}^{t-1})$, defined as

q(U_{1}^{t-1})=\begin{cases}\min\left\{\frac{U}{U_{1}^{t-1}},\frac{1}{\varepsilon}\right\},&0\leq U_{1}^{t-1}\leq\frac{U}{\varepsilon},\\ 0,&U_{1}^{t-1}>\frac{U}{\varepsilon},\end{cases}

where the $1/\varepsilon$ cap follows from $v_{1}^{t},v_{2}^{t}\in\{0\}\cup[\varepsilon,1]$, and the ratio drops to $0$ once $U_{1}^{t-1}>U/\varepsilon$, since then the constraint $v_{2}^{t}/v_{1}^{t}\leq U/U_{1}^{t-1}<\varepsilon$ cannot be met by any nonzero $v_{2}^{t}$.

We can then upper-bound the objective of the canonical optimization program (3):

\frac{1}{U}\sum_{t=1}^{\infty}v_{2}^{t}=\frac{1}{U}\sum_{t=1}^{\infty}v_{1}^{t}\cdot\frac{v_{2}^{t}}{v_{1}^{t}}\leq\frac{1}{U}\sum_{t=1}^{\infty}v_{1}^{t}\cdot q(U_{1}^{t-1}). \qquad (4)

Each increment $v_{1}^{t}\leq 1$ is negligible relative to $U$ as $U\rightarrow\infty$. One can show that the right-hand side of (4) converges to a definite integral as $U\rightarrow\infty$:

\frac{1}{U}\sum_{t=1}^{\infty}v_{1}^{t}\cdot q(U_{1}^{t-1})\rightarrow\frac{1}{U}\int_{0}^{U/\varepsilon}q(U_{1})\,\mathrm{d}U_{1}=1+2\log\frac{1}{\varepsilon}.
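For completeness, the integral splits at $U_{1}=\varepsilon U$, below which the cap $1/\varepsilon$ binds (there $U/U_{1}\geq 1/\varepsilon$):

\frac{1}{U}\int_{0}^{U/\varepsilon}q(U_{1})\,\mathrm{d}U_{1}=\frac{1}{U}\left(\int_{0}^{\varepsilon U}\frac{1}{\varepsilon}\,\mathrm{d}U_{1}+\int_{\varepsilon U}^{U/\varepsilon}\frac{U}{U_{1}}\,\mathrm{d}U_{1}\right)=1+\log\frac{U/\varepsilon}{\varepsilon U}=1+2\log\frac{1}{\varepsilon}.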

The $O(1/T)$ convergence rate is then obtained by computing the above upper bound more carefully. $\square$

The analysis of the canonical optimization program (3) also gives an upper bound on the number of items of agent 1 that agent 2 “envies”, concretely, the items in $A_{1}$ on which agent 2 has nonzero value. The resulting Lemma 3 will be useful in Section 4.

Lemma 3.

Let $C_{i}:=\{t:v_{i}^{t}>0\}$ be the set of items on which agent $i$ has nonzero value. When agents have equal weights, for $\bm{v}\in\mathcal{V}_{\varepsilon}^{T}$ and any $i,j\in[n]$,

\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}_{\varepsilon}^{T}}\frac{|C_{i}\cap A_{j}|}{U_{i}(A_{i})}\leq\frac{1}{\varepsilon}\left(1+\log\frac{1}{\varepsilon}\right).

Next, we complement Theorem 3.1 with a lower bound showing that the envy bound $1+2\log(1/\varepsilon)$ is tight for the integral greedy allocation.

Theorem 3.2 (Lower Bound for Multiplicative Envy).

For inputs $\bm{v}\in\mathcal{V}_{\varepsilon}^{T}$, we have for any $i,j\in[n]$,

\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}_{\varepsilon}^{T}}\frac{B_{i}}{B_{j}}\cdot\frac{U_{i}(A_{j})}{U_{i}(A_{i})}\geq 1+2\log\frac{1}{\varepsilon}.

3.3 Nash Welfare Analysis for Greedy

We give an upper bound on the asymptotic competitive ratio w.r.t. Nash welfare for the integral greedy algorithm.

Theorem 3.3 (Upper Bound for Competitive Ratio).

For the input space $\mathcal{V}_{\varepsilon}^{T}$ and any given $\alpha>0$, there exists a constant $\lambda>0$ (independent of $n$ and $T$) such that

\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}^{T}_{\varepsilon}}\left(\frac{\prod_{i=1}^{n}U_{i}(A_{i}^{\star})}{\prod_{i=1}^{n}U_{i}(A_{i})}\right)^{1/n}\leq\lambda\cdot\left(n!\right)^{\frac{1+\alpha}{n}}.

For $\varepsilon=1$, the above holds with $\lambda=1,\alpha=0$.

Proof Sketch.

Assume without loss of generality that $U_{1}(A_{1})\leq\cdots\leq U_{n}(A_{n})$. The main idea of the proof is to show that $U_{i}^{\star}/U_{i}$ is asymptotically bounded by $(n-i+1)\cdot i^{\alpha}$. Supposing this is not true, we show that $x_{i}^{\star}$ must include a large proportion of $x_{j}$ for some $j<i$, which leads to a contradiction with the optimality of $\bm{x}^{\star}$. $\square$

Although Azar et al. (2016) give an $O(\log(nT/\varepsilon))$-competitive algorithm, we show that the $(n!)^{1/n}$ factor is inevitable if one aims to remove the dependence on $T$. Hence, the integral greedy algorithm is near-optimal in terms of $n$.

Theorem 3.4 (Lower Bound for Competitive Ratio).

Even in the case $\varepsilon=1$, for any feasible, deterministic online algorithm we have

\left(\frac{\prod_{i=1}^{n}U_{i}^{\star}}{\prod_{i=1}^{n}U_{i}(A_{i})}\right)^{1/n}\geq(n!)^{1/n}.
Proof Sketch.

We sketch the construction of an adaptive adversary, who attempts to make low-utility agents hard to satisfy in the future. Divide the horizon into $n$ phases, each with length $T_{i}$, satisfying $T_{i}/T_{i-1}\rightarrow\infty$. The adversary maintains a set of “active” agents, initially containing all agents. In each round, only currently active agents have nonzero values. At the end of each phase, the active agent with the lowest utility is eliminated. This results in a competitive ratio of $(n!)^{1/n}$; see the detailed proof in Section 0.B.8. $\square$
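A Python sketch of this adversary playing against the equal-weights greedy rule; the geometric phase lengths are an illustrative stand-in for the condition $T_{i}/T_{i-1}\rightarrow\infty$, and all names are ours:

def adversary_vs_greedy(n, growth=10):
    """Phase k has length growth**k; only active agents see value 1 (so eps = 1)."""
    active = list(range(n))
    U = np.zeros(n)
    for k in range(1, n + 1):
        for _ in range(growth ** k):
            # greedy step: with unit values, argmax v/U is the active agent with
            # minimal utility, so greedy keeps active utilities nearly equal
            i = min(active, key=lambda j: U[j])
            U[i] += 1.0
        # eliminate the active agent with the lowest utility after the phase
        active.remove(min(active, key=lambda j: U[j]))
    return U

The agent eliminated at the end of phase $k$ ends with roughly a $1/(n-k+1)$ share of the items seen so far, while a hindsight allocation could have given it essentially all of phase $k$; multiplying these losses across the phases yields the $(n!)^{1/n}$ factor.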

3.4 Nash Welfare Analysis without Assumption 3.1

As an extension of our analysis of the integral greedy algorithm, we show how it can be adapted when Assumption 3.1 does not hold. Despite the $\Omega(T)$ lower bound, we show that when each agent begins with a seed utility $\delta$, the competitive ratio of the integral greedy algorithm is of order $O(\log T)$, where performance is measured via the ratio of seeded utilities.

The seeded integral greedy algorithm is identical to Algorithm 2, except that every agent is given an initial seed utility $\delta$, which is taken into account when deciding the winning agent:

i^{t}:=\min\left(\arg\max_{i\in[n]}\frac{v_{i}^{t}}{\delta+U_{i}^{t-1}}\right),\qquad x_{i}^{t}=\bm{1}(i=i^{t}). \qquad (5)

The full pseudocode for the seeded algorithm is presented in Algorithm 3.

Input: number of agents $n$, time horizon $T$, seed utility $\delta$.
Initialization: $U_{i}^{0}=0$ for all $i$.
for $t=1,\cdots,T$ do
    Observe agent values for item $t$, and allocate the whole item to agent $i^{t}$:
        $i^{t}:=\min\left(\arg\max_{i\in[n]}\frac{v_{i}^{t}}{\delta+U_{i}^{t-1}}\right),\quad x_{i}^{t}=\bm{1}(i=i^{t}).$ \qquad (6)
    Agent $i$ updates his current utility:
        $U_{i}^{t}=U_{i}^{t-1}+x_{i}^{t}v_{i}^{t}.$
Output: allocations $\{x_{i}^{t}\}_{i,t}$
Algorithm 3: Seeded Integral Greedy Algorithm (Equal Weights)
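In code, the seeded variant differs from the integral_greedy sketch above only in the denominator (again illustrative, with names of our choosing):

def seeded_integral_greedy(values, delta):
    """Algorithm 3: equal weights; every agent starts with seed utility delta > 0."""
    T, n = values.shape
    U = np.zeros(n)
    winners = np.empty(T, dtype=int)
    for t in range(T):
        winners[t] = int(np.argmax(values[t] / (delta + U)))  # ties -> smallest index
        U[winners[t]] += values[t, winners[t]]
    return winners

Since $\delta>0$, no division-by-zero guard is needed, and every agent competes from the first round as if it already held utility $\delta$.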

For the seeded algorithm, we study the criterion $R_{\delta}(\bm{v})$, which is also defined w.r.t. the seeded utilities:

R_{\delta}(v)=\sup_{\widetilde{U}}\left\{\frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}\right\},

where the supremum runs over all feasible hindsight allocations, with resulting utilities $\widetilde{U}_{1},\cdots,\widetilde{U}_{n}$. Notice that by the AM-GM inequality, $\left(\prod_{i}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}\right)^{1/n}\leq\frac{1}{n}\sum_{i}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}$, so $R_{\delta}$ also bounds the corresponding ratio of geometric means. However, due to the presence of the seed utility $\delta$, it is not directly comparable to the criterion $\left(\prod_{i=1}^{n}{U_{i}^{\star}}/{U_{i}}\right)^{1/n}$.

Theorem 3.5 (Upper Bound of RδR_{\delta} for Seeded Integral Greedy).

Run the seeded integral greedy algorithm with seed utility $\delta$. For any input $\bm{v}$ satisfying $v_{i}^{t}\in[0,1]$,

R_{\delta}(v)\leq 3+\frac{4}{\delta}+2\log\left(1+\frac{1}{\delta}\right)+2\log T. \qquad (7)

The ratio is of order $\log T$, similar to the result of Azar et al. (2016). Unlike our previous results, here the ratio is unbounded as $T\rightarrow\infty$, since we consider a broader range of inputs beyond Assumption 3.1. There is also a trade-off in choosing the seed utility $\delta$: although a larger $\delta$ yields a better bound in (7), it makes the bound less informative about how the algorithm compares to the offline optimum, since $R_{\delta}(v)\rightarrow 1$ as $\delta$ grows large.

4 Analysis of PACE

We continue by analyzing the performance of PACE. Missing proofs from this section can be found in Appendix 0.C.

4.1 Assumptions on the Input

To analyze PACE in the adversarial setting, it is necessary to adopt stronger assumptions than for the integral greedy algorithm. This is because PACE projects the average utility up to round $t$ onto $[\ell,r]$ in its decision, or equivalently, projects the total utility in round $t$ onto $[\ell t,rt]$. In the stationary and non-stationary settings of Liao et al. (2022), the expected utility of each agent grows uniformly with time; in that case the projection helps PACE achieve theoretical guarantees on its performance. In the adversarial setting, however, the input may vary drastically over time, which makes the projection more problematic: in the worst case, adversarial input might cause extreme behavior of the projection, such as never allocating any item to a certain agent.

In their most generic form, our assumptions require that each agent achieves infinite utility as $T\rightarrow\infty$, along with bounded nonzero-valuation ratios; see Assumption 4.1. Notice that unbounded utilities are necessary if we hope to derive meaningful convergence guarantees for worst-case envy and Nash welfare.

Assumption 4.1. For each integer $T>0$, $\mathcal{V}_{\varepsilon}^{T}(\ell,r)$ is the set of inputs which satisfy:

  • The utility of each agent under PACE is infinite (as $T\rightarrow\infty$) when the projection bounds are set to $[\ell,r]$.

  • For each $i\in[n]$ and $t\in[T]$, $v_{i}^{t}\in\{0\}\cup[\varepsilon,1]$.

Since Assumption 4.1 is potentially hard to verify, we next identify sufficient conditions under which PACE guarantees infinite utility for each agent.

Assumption 4.2. For each integer $T>0$ and $c\in(0,1]$, $\mathcal{V}_{\varepsilon,c}^{T}$ is the set of inputs which satisfy:

  • For each $i\in[n]$, $V_{i}\geq cT$; that is, the monopolistic utility of each agent is $\Theta(T)$.

  • For each $i\in[n]$ and $t\in[T]$, $v_{i}^{t}\in\{0\}\cup[\varepsilon,1]$.

Assumption 4.2 strengthens Assumption 3.1 by requiring the monopolistic utility of each agent to be linear in $T$. Intuitively, this matches the linear projection bounds $[\ell t,rt]$ on utility. We will show in Lemma 6 that, with appropriate initialization of the projection bounds, Assumption 4.2 with any $c>0$ leads to infinite agent utilities, and thus implies Assumption 4.1 for some $\ell$ and $r$.

Conversely, if an agent’s monopolistic utility is $o(T)$, problems may occur, as the following example shows.

Example 2.

Consider an $n$-agent instance ($n>2$) with $\sum_{t=1}^{T}\bm{1}\{v_{i}^{t}>0\}=f(T)=o(T)$ for $i=1,2$. For any $\ell\in(0,\varepsilon)$, there exists a sufficiently large $T_{1}$ such that $f(T_{1})<\frac{\ell}{\varepsilon}T_{1}$. We can construct an instance where agents 1 and 2 have value zero for the first $\left(1-\frac{\ell}{\varepsilon}\right)T_{1}$ items and value $\varepsilon$ for the remaining items. One can check that both agents’ utilities never reach $\ell t$, and thus $\hat{U}_{1}^{t}=\hat{U}_{2}^{t}=\ell t$ throughout. Under lexicographic tie-breaking, agent 1 will receive all items that both agents value; agent 2 will receive nothing at all.

We remark that in order to initialize $\ell$ and $r$, PACE requires knowing a lower bound on $\varepsilon$. In contrast, the integral greedy algorithm does not require such knowledge. A minimal requirement is to set $\ell\leq\varepsilon$:

Example 3.

Case with $\ell>\varepsilon$: consider an input where every agent has value $\varepsilon$ in each round. The projection is then activated in every round for every agent, and the algorithm allocates all items to the first agent.

4.2 Envy Analysis for PACE

Next we show that the PACE algorithm, as long as it is appropriately initialized and Assumption 4.1 holds, asymptotically achieves the multiplicative envy bound $1+2\log(1/\varepsilon)$. The proof proceeds by reducing the adversarial envy-maximization problem for PACE to the canonical program (3), showing that under our assumptions PACE leads to a weakly harder problem for the adversary than (3).

Theorem 4.1 (Upper Bound of Multiplicative Envy for PACE).

Under Assumption 4.1 with

\ell<\frac{\varepsilon^{2}}{\varepsilon+1+(n-1)\left(1+\log(1/\varepsilon)\right)},\qquad r=1,

PACE achieves

\lim_{T\rightarrow\infty}\sup_{\bm{v}\in\mathcal{V}_{\varepsilon}^{T}(\ell,r)}\mathrm{Envy}_{ij}\leq 1+2\log\frac{1}{\varepsilon},\quad\forall i,j\in[n].
Proof.

Due to symmetry, we focus on the envy of agent 1. Similar to the proof of Theorem 3.1, we first transform the input $\bm{v}$ into $\bm{v}^{\prime}$ in a way that preserves the PACE allocation. Notice that in this proof we use a different transformation, since the problem can no longer be reduced to pairwise 2-agent cases. It is also no longer obvious that the transformation preserves the PACE allocation; we will prove this later. The transformation is as follows:

  1. For each agent $j\in\{2,3,\cdots,n\}$, set agent $j$’s value to $0$ for all items not allocated to $j$.

  2. Move all items on which agent 1 has value $0$ to the end of the input sequence.

  3. Move all items in $A_{1}$ to the beginning of the input sequence.

The allocation of the items moved in Step 2 is clearly preserved under PACE and does not affect the envy of agent 1, so we may assume without loss of generality that such items do not exist.

Denote $S=|A_{1}|$. For an item $s$ in the original input $\bm{v}$, let $s^{\prime}>s$ be the round in which it appears in $\bm{v}^{\prime}$. We define $t^{*}:=\frac{\varepsilon S}{\varepsilon-\ell}\left(1+\frac{1}{\varepsilon}(n-1)\left(1+\log\frac{1}{\varepsilon}\right)\right)$. The proof is divided into three parts.

For the first part, Lemma 4 shows that in the first $t^{*}$ rounds of $\bm{v}^{\prime}$, the transformation preserves the PACE allocation.

Lemma 4.

If $s^{\prime}<t^{*}$, then for each agent $j\in\{2,\cdots,n\}$,

U_{j}^{s^{\prime}}(\bm{v}^{\prime})<\ell s^{\prime}\implies\frac{\hat{U}_{j}^{s^{\prime}}(\bm{v}^{\prime})}{\hat{U}_{1}^{s^{\prime}}(\bm{v}^{\prime})}<\varepsilon.

Moreover, all items in the first $t^{*}$ rounds of $\bm{v}^{\prime}$ receive the same allocation as in $\bm{v}$.

For the second part, we show that in the first $t^{*}$ rounds, the problem of maximizing envy under PACE can be relaxed to a canonical program of the form (3). Lemma 5 gives upper bounds on the multiplicative envy within the first $t^{*}$ rounds, as well as on the number of rounds among them in which an agent $j\in\{2,\cdots,n\}$ has nonzero value.

Lemma 5.

For any agent $j\in\{2,\cdots,n\}$, let $B_{j}$ denote the set of items that belong to $A_{j}$ in the original sequence $\bm{v}$ and are permuted to the first $t^{*}$ positions of the transformed sequence. Then it holds that

\lim_{S\rightarrow\infty}\frac{U_{1}(B_{j})}{U_{1}(A_{1})}\leq 1+2\log\frac{1}{\varepsilon},\qquad\lim_{S\rightarrow\infty}\frac{|B_{j}|}{S}\leq\frac{1}{\varepsilon}\left(1+\log\frac{1}{\varepsilon}\right).

For the final part, we show that the length of the transformed sequence cannot exceed $t^{*}$. Let $m:=\frac{S}{\varepsilon}\left(1+\log\frac{1}{\varepsilon}\right)$ be the asymptotic upper bound on $|B_{j}|$ given in Lemma 5. It is straightforward to show with some simple algebra that $S+(n-1)m\leq t^{*}$. Notice that $S+(n-1)m$ is an asymptotic upper bound on the number of non-empty items (an empty item being one valued at zero by all agents) within the first $t^{*}$ rounds of the transformed sequence. Therefore, $S+(n-1)m\leq t^{*}$ implies that the transformed sequence must end within $t^{*}$ rounds. Thus $t^{*}$ upper-bounds the length of both the original and the transformed sequence, whose allocations agree by Lemma 4. The envy upper bound of Lemma 5 is then a valid upper bound for the original sequence. $\square$

4.3 Nash Welfare Analysis for PACE

Next we establish a worst-case guarantee on the asymptotic competitive ratio w.r.t. Nash welfare for PACE under Assumption 4.2. To begin with, we show that Assumption 4.2 implies infinite agent utilities under appropriate initialization.

Lemma 6.

For any input $\bm{v}\in\mathcal{V}_{\varepsilon,c}^{T}$ (defined in Assumption 4.2), if

\ell<\frac{c\varepsilon^{2}}{1+(n-1)(1+\log 1/\varepsilon)},\qquad r=1,

then there exists a constant $d>0$ (which depends on $\ell$) such that, for each $i\in[n]$, PACE satisfies $\lim_{T\rightarrow\infty}U_{i}(A_{i})/T\geq d$. Furthermore, $\frac{d}{c}=\Omega\left(\frac{1}{n}\right)$ as $n\rightarrow\infty$.

The proof of Lemma 6 is also by reduction from the canonical program (3). We remark that Lemma 6 yields more than infinite agent utilities: it also tells us that the utilities are linear in $T$. Moreover, from $d/c=\Omega\left(n^{-1}\right)$ we know that PACE computes an asymptotically approximately proportional allocation, which helps to derive a bounded competitive ratio w.r.t. Nash welfare. The bound can be further refined using the envy results.

Theorem 4.2 (Upper Bound of Competitive Ratio for PACE).

Under Assumption 4.2, for any input $\bm{v}\in\mathcal{V}_{\varepsilon,c}^{T}$, if

\ell<\frac{c\varepsilon^{2}}{1+\varepsilon+(n-1)(1+\log 1/\varepsilon)},\qquad r=1,

PACE achieves

\lim_{T\rightarrow\infty}\sup_{v\in\mathcal{V}^{T}_{\varepsilon,c}}\left(\frac{\prod_{i=1}^{n}U_{i}^{\star}}{\prod_{i=1}^{n}U_{i}(A_{i})}\right)^{1/n}\leq\left(1+2\log\frac{1}{\varepsilon}\right)\cdot\frac{1}{c}.

Theorem 4.2 gives an upper bound that depends only on the parameters $\varepsilon$ and $c$, both of which are independent of $T$. We remark that the factor $1/c$ might not be tight.

We also remark that in both Theorem 4.1 and Theorem 4.2, $\ell$ is at most of order $1/n$, which is aligned with the stationary and non-stationary settings (projecting utilities above an $\Omega(1/n)$ lower bound would be unreasonable, since there are $n$ agents sharing unit supply). However, with adversarial input $\ell$ must decrease as $\varepsilon\rightarrow 0$, which means that PACE requires a wider projection interval when its input becomes potentially more extreme.

5 Conclusion

We proved horizon-independent bounds on envy and Nash welfare for the integral greedy algorithm and for PACE under adversarial inputs with mild assumptions. Our results complete the first best-of-many-worlds result for online fair allocation: PACE now achieves guarantees under stochastic (Gao et al., 2021), stochastic but nonstationary (Liao et al., 2022), and adversarial inputs. Moreover, our results on integral greedy are of independent interest, as they characterize the assumptions needed to achieve guarantees for that algorithm.

It remains open whether the constant in Theorem 4.2 can be improved. A more general open question is to explore more best-of-many-worlds online fair allocation algorithms, with potentially different performance measures and assumptions.

References

  • Anari et al. [2017] Nima Anari, Shayan Oveis Gharan, Amin Saberi, and Mohit Singh. Nash social welfare, matrix permanent, and stable polynomials. In 8th Innovations in Theoretical Computer Science Conference (ITCS 2017). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2017.
  • Anari et al. [2018] Nima Anari, Tung Mai, Shayan Oveis Gharan, and Vijay V Vazirani. Nash social welfare for indivisible items under separable, piecewise-linear concave utilities. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 2274–2290. SIAM, 2018.
  • Azar et al. [2016] Yossi Azar, Niv Buchbinder, and Kamal Jain. How to allocate goods in an online market? Algorithmica, 74(2):589–601, 2016.
  • Bach and Levy [2019] Francis Bach and Kfir Y Levy. A universal algorithm for variational inequalities adaptive to smoothness and noise. In Conference on learning theory, pages 164–194. PMLR, 2019.
  • Balseiro et al. [2023] Santiago R Balseiro, Haihao Lu, and Vahab Mirrokni. The best of many worlds: Dual mirror descent for online allocation problems. Operations Research, 71(1):101–119, 2023.
  • Banerjee et al. [2022] Siddhartha Banerjee, Vasilis Gkatzelis, Artur Gorokh, and Billy Jin. Online nash social welfare maximization with predictions. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1–19. SIAM, 2022.
  • Barman et al. [2018] Siddharth Barman, Sanath Kumar Krishnamurthy, and Rohit Vaish. Finding fair and efficient allocations. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 557–574, 2018.
  • Barman et al. [2020] Siddharth Barman, Umang Bhaskar, Anand Krishna, and Ranjani G Sundaram. Tight approximation algorithms for p-mean welfare under subadditive valuations. arXiv preprint arXiv:2005.07370, 2020.
  • Benade et al. [2018] Gerdus Benade, Aleksandr M Kazachkov, Ariel D Procaccia, and Christos-Alexandros Psomas. How to make envy vanish over time. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 593–610, 2018.
  • Bogomolnaia et al. [2022] Anna Bogomolnaia, Hervé Moulin, and Fedor Sandomirskiy. On the fair division of a random object. Management Science, 68(2):1174–1194, 2022.
  • Caragiannis et al. [2019] Ioannis Caragiannis, David Kurokawa, Hervé Moulin, Ariel D Procaccia, Nisarg Shah, and Junxing Wang. The unreasonable fairness of maximum nash welfare. ACM Transactions on Economics and Computation (TEAC), 7(3):1–32, 2019.
  • Castiglioni et al. [2022] Matteo Castiglioni, Andrea Celli, and Christian Kroer. Online learning under budget and ROI constraints and applications to bidding in non-truthful auctions. arXiv preprint arXiv:2302.01203, 2022.
  • Celli et al. [2022] Andrea Celli, Matteo Castiglioni, and Christian Kroer. Best of many worlds guarantees for online learning with knapsacks. arXiv preprint arXiv:2202.13710, 2022.
  • Chaudhury et al. [2021] Bhaskar R Chaudhury, Jugal Garg, and Ruta Mehta. Fair and efficient allocations under subadditive valuations. In Proceedings of the AAAI Conference on Artificial Intelligence, 2021.
  • Cole and Gkatzelis [2018] Richard Cole and Vasilis Gkatzelis. Approximating the nash social welfare with indivisible items. SIAM Journal on Computing, 47(3):1211–1236, 2018.
  • Cole et al. [2017] Richard Cole, Nikhil Devanur, Vasilis Gkatzelis, Kamal Jain, Tung Mai, Vijay V Vazirani, and Sadra Yazdanbod. Convex program duality, fisher markets, and nash social welfare. In Proceedings of the 2017 ACM Conference on Economics and Computation, pages 459–460, 2017.
  • Conitzer et al. [2022] Vincent Conitzer, Christian Kroer, Debmalya Panigrahi, Okke Schrijvers, Nicolas E Stier-Moses, Eric Sodomka, and Christopher A Wilkens. Pacing equilibrium in first price auction markets. Management Science, 68(12):8515–8535, 2022.
  • Eisenberg and Gale [1959] Edmund Eisenberg and David Gale. Consensus of subjective probabilities: The pari-mutuel method. The Annals of Mathematical Statistics, 30(1):165–168, 1959.
  • Gao and Kroer [2023] Yuan Gao and Christian Kroer. Infinite-dimensional fisher markets and tractable fair division. Operations Research, 71(2):688–707, 2023.
  • Gao et al. [2021] Yuan Gao, Alex Peysakhovich, and Christian Kroer. Online market equilibrium with application to fair division. Advances in Neural Information Processing Systems, 34:27305–27318, 2021.
  • Garg et al. [2018] Jugal Garg, Martin Hoefer, and Kurt Mehlhorn. Approximating the nash social welfare with budget-additive valuations. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 2326–2340. SIAM, 2018.
  • Garg et al. [2020] Jugal Garg, Pooja Kulkarni, and Rucha Kulkarni. Approximating nash social welfare under submodular valuations through (un) matchings. In Proceedings of the fourteenth annual ACM-SIAM symposium on discrete algorithms, pages 2673–2687. SIAM, 2020.
  • He et al. [2019] Jiafan He, Ariel D Procaccia, CA Psomas, and David Zeng. Achieving a fairer future by changing the past. IJCAI’19, 2019.
  • Huang et al. [2022] Zhiyi Huang, Minming Li, Xinkai Shu, and Tianze Wei. Online nash welfare maximization without predictions. arXiv preprint arXiv:2211.03077, 2022.
  • Kaneko and Nakamura [1979] Mamoru Kaneko and Kenjiro Nakamura. The nash social welfare function. Econometrica: Journal of the Econometric Society, pages 423–435, 1979.
  • Lee [2017] Euiwoong Lee. APX-hardness of maximizing Nash social welfare with indivisible items. Information Processing Letters, 122:17–20, 2017.
  • Li and Vondrák [2022] Wenzheng Li and Jan Vondrák. A constant-factor approximation algorithm for nash social welfare with submodular valuations. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), pages 25–36. IEEE, 2022.
  • Liao et al. [2022] Luofeng Liao, Yuan Gao, and Christian Kroer. Nonstationary dual averaging and online fair allocation. Advances in Neural Information Processing Systems, 35:37159–37172, 2022.
  • McGlaughlin and Garg [2020] Peter McGlaughlin and Jugal Garg. Improving nash social welfare approximations. Journal of Artificial Intelligence Research, 68:225–245, 2020.
  • Moulin [2004] Hervé Moulin. Fair division and collective welfare. MIT press, 2004.
  • Nash Jr [1950] John F Nash Jr. The bargaining problem. Econometrica: Journal of the econometric society, pages 155–162, 1950.
  • Varian [1974] Hal R Varian. Equity, envy, and efficiency. Journal of Economic Theory, 9(1):63–91, 1974.
  • Xiao [2009] Lin Xiao. Dual averaging method for regularized stochastic learning and online optimization. Advances in Neural Information Processing Systems, 22, 2009.
  • Zeng and Psomas [2020] David Zeng and Alexandros Psomas. Fairness-efficiency tradeoffs in dynamic fair division. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 911–912, 2020.

Appendix 0.A Additional Related Work

Offline Nash Welfare Approximation

To explore PACE's behavior under adversarial inputs, we focus on approximating the EG objective, which is equivalent to maximizing Nash welfare. Nash welfare was introduced by Nash Jr [1950], and is one of the ideal proxies for balancing fairness and efficiency in allocation problems [Kaneko and Nakamura, 1979]. Maximizing Nash welfare is well-studied in the offline setting. For divisible items, it is equivalent to the Eisenberg-Gale convex program [Eisenberg and Gale, 1959]. For indivisible items, computing a Nash-welfare-maximizing allocation is APX-hard for additive utilities [Lee, 2017]. Cole and Gkatzelis [2018] gave the first approximation algorithm, with approximation ratio 2e^{1/e}. Anari et al. [2017] applied matrix permanents and stable polynomials to get an e-approximation. Cole et al. [2017] improved the ratio to 2. The state-of-the-art algorithm, given subsequently by Barman et al. [2018], has a ratio of e^{1/e}. Utility classes beyond additive ones have also been considered. Garg et al. [2018] achieved a 2e^{1/e}+o(1) ratio under budget-additive valuations. Anari et al. [2018] considered separable, piecewise-linear and concave valuations and gave an e^{2}-approximation algorithm. For submodular utilities, Garg et al. [2020] achieved an O(n\log n)-approximation, which was recently improved to a constant ratio [Li and Vondrák, 2022]. For more general forms of submodular utilities, an O(n) ratio was reached by Barman et al. [2020] and Chaudhury et al. [2021] independently.

Online Allocation with Envy Guarantees

Besides Nash welfare maximization, our analysis of multiplicative envy is related to the line of work on achieving (possibly approximate) envy-freeness in the online setting. Bogomolnaia et al. [2022] assume stochastic input and enforce envy-freeness as a soft constraint while maximizing social welfare. He et al. [2019] additionally allow reallocating previously assigned items, and show that O(T) reallocations suffice to achieve envy-freeness up to one item. Benade et al. [2018] consider envy minimization in the stochastic, indivisible setting, and show that allocating each item to an agent chosen uniformly at random is near-optimal up to logarithmic factors. Zeng and Psomas [2020] consider the indivisible setting with a non-adaptive adversary, showing that a nontrivial approximation of envy-freeness and Pareto-optimality is hard to achieve simultaneously. In contrast to the above works, we focus on multiplicative rather than additive envy. While Caragiannis et al. [2019] show that a Nash-welfare-maximizing allocation is approximately envy-free, our analysis of multiplicative envy is independent of other fairness measures. Finally, note that the PACE algorithm, while focused on the stronger guarantee of asymptotically maximizing Nash welfare in the stochastic setting, achieves asymptotic envy-freeness as well, since it converges to the equilibrium allocation of the underlying Fisher market.

Appendix 0.B Missing Proofs in Section 3

0.B.1 Proof of Theorem 3.1

According to Lemma 2, it suffices to bound the envy of all 2-agent instances. We will show that for any input \bm{v}\in\mathcal{V}_{\varepsilon}^{T},

\frac{B_{2}U_{2}(A_{1})}{B_{1}U_{2}(A_{2})}\leq 1+2\log\frac{1}{\varepsilon}+\frac{(1+\varepsilon^{4})(1+\varepsilon^{2})}{\varepsilon^{6}}\frac{1}{T}+\frac{(1+\varepsilon^{2})^{2}}{\varepsilon^{8}}\frac{1}{T^{2}} (8)

Consider the following transformation from instance \bm{v} into \bm{v}^{\prime}:

  1. Set agent 1's valuation for all items in A_{2} to 0.

  2. Move all items in A_{2} to the beginning of the input sequence, and all items in A_{1} to the end of the input sequence (in arbitrary order).

We claim that the allocation produced by the budget-adapted greedy allocation algorithm is invariant to this transformation. Clearly, all items in A_{2} will still be allocated to agent 2, since we have set agent 1's valuation on them to zero. For an item t\in A_{1}, let t^{\prime} be the round in which it appears in \bm{v}^{\prime}. One can check

\frac{B_{1}v_{1}^{t}}{B_{2}v_{2}^{t}}\geq\frac{U_{1}^{t}(\bm{v})}{U_{2}^{t}(\bm{v})}\geq\frac{U_{1}^{t^{\prime}}(\bm{v}^{\prime})}{U_{2}^{t^{\prime}}(\bm{v}^{\prime})},

where the first inequality is due to t\in A_{1}, and the second is because U_{1}^{t^{\prime}}(\bm{v}^{\prime})=U_{1}^{t}(\bm{v}) and U_{2}^{t^{\prime}}(\bm{v}^{\prime})=\sum_{s=1}^{T}x_{2}^{s}v_{2}^{s}\geq U_{2}^{t}(\bm{v}).

Therefore, it suffices to consider only the transformed sequences, i.e., ones in which agent 2 receives every item in the first R rounds and nothing thereafter. To find the transformed sequence of maximum envy, we note that we can increase agent 2's valuation on the items occurring after round R as much as possible while ensuring that none of them are allocated to agent 2 by the algorithm; this weakly increases envy. Doing so is equivalent to the following optimization problem, whose objective upper bounds the multiplicative envy of agent 2:

\max_{v_{1}^{t},v_{2}^{t}}\quad\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{\infty}v_{2}^{t}
\text{s.t.}\quad v_{1}^{t},v_{2}^{t}\in\{0\}\cup[\varepsilon,1]
\qquad\frac{v_{1}^{t}}{v_{2}^{t}}\geq\frac{B_{2}U_{1}^{t-1}}{B_{1}V_{2}^{0}}
\qquad U_{1}^{t}=\sum_{s=R+1}^{t}v_{1}^{s}

Notice the constraint that all items after round R must be allocated to agent 1. Together with the \{0\}\cup[\varepsilon,1] range for values, this gives an upper bound on v_{2}^{t}/v_{1}^{t} in each round:

\frac{v_{2}^{t}}{v_{1}^{t}}\leq q(U_{1}^{t-1}):=\begin{cases}\min\left\{\frac{B_{1}V_{2}^{0}}{B_{2}U_{1}^{t-1}},\frac{1}{\varepsilon}\right\}&U_{1}^{t-1}\leq\frac{B_{1}}{B_{2}\varepsilon}V_{2}^{0}\\ 0&U_{1}^{t-1}>\frac{B_{1}}{B_{2}\varepsilon}V_{2}^{0}\end{cases}
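Before proceeding, we record a computation that the final chain of inequalities below uses implicitly: integrating q over its entire support yields a budget-independent constant,

\frac{B_{2}}{B_{1}V_{2}^{0}}\int_{0}^{\frac{B_{1}V_{2}^{0}}{B_{2}\varepsilon}}q(u)\mathrm{d}u=\frac{B_{2}}{B_{1}V_{2}^{0}}\left(\frac{1}{\varepsilon}\cdot\frac{\varepsilon B_{1}V_{2}^{0}}{B_{2}}+\int_{\varepsilon B_{1}V_{2}^{0}/B_{2}}^{B_{1}V_{2}^{0}/(B_{2}\varepsilon)}\frac{B_{1}V_{2}^{0}}{B_{2}u}\mathrm{d}u\right)=1+2\log\frac{1}{\varepsilon}.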

The multiplicative envy can then be further upper bounded by

\frac{B_{2}U_{2}(A_{1})}{B_{1}U_{2}(A_{2})}\leq\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{R+S}v_{1}^{t}q(U_{1}^{t-1}) (9)

where S is the number of rounds in the above optimization problem with v_{1}^{t},v_{2}^{t}>0. Since nothing should be allocated to agent 2 after round R, we have U_{1}^{R+S}\in\left(0,\frac{B_{1}V_{2}^{0}}{B_{2}\varepsilon}+1\right].

Next, we show that the right-hand side of (9) can be approximated by an integral. We have

\frac{B_{2}}{B_{1}V_{2}^{0}}\left(\sum_{t=R+1}^{R+S}v_{1}^{t}q(U_{1}^{t-1})-\int_{0}^{U_{1}^{R+S}}q(u)\mathrm{d}u\right)
=\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{R+S}\left(v_{1}^{t}q(U_{1}^{t-1})-\int_{U_{1}^{t-1}}^{U_{1}^{t}}q(u)\mathrm{d}u\right)
=\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{R+S}\int_{U_{1}^{t-1}}^{U_{1}^{t}}\left(q(U_{1}^{t-1})-q(u)\right)\mathrm{d}u
\overset{\text{(a)}}{\leq}\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{R+S}v_{1}^{t}\left(q(U_{1}^{t-1})-q(U_{1}^{t})\right)
\overset{\text{(b)}}{\leq}\frac{B_{2}}{B_{1}V_{2}^{0}}\sum_{t=R+1}^{R+S}\frac{B_{1}}{B_{2}\varepsilon^{2}V_{2}^{0}}(v_{1}^{t})^{2}
=\frac{1}{\varepsilon^{2}(V_{2}^{0})^{2}}\sum_{t=R+1}^{R+S}(v_{1}^{t})^{2}
\overset{\text{(c)}}{\leq}\frac{U_{1}^{R+S}}{\varepsilon^{2}(V_{2}^{0})^{2}}

where (a) is because q(u) is non-increasing, (b) is because the magnitude of the right-side derivative of q(u) is upper bounded by \frac{B_{1}}{B_{2}\varepsilon^{2}V_{2}^{0}}, and (c) is because v_{1}^{t}\leq 1. Combined with (9), we have

\frac{B_{2}U_{2}(A_{1})}{B_{1}U_{2}(A_{2})}\leq\frac{B_{2}}{B_{1}V_{2}^{0}}\int_{0}^{U_{1}^{R+S}}q(u)\mathrm{d}u+\frac{U_{1}^{R+S}}{\varepsilon^{2}(V_{2}^{0})^{2}} (10)

We then deal with U_{1}^{R+S} using the relation T\geq R+S, which holds because we allow no “null” items in a T-round instance. From \varepsilon S\leq U_{1}^{R+S}\leq V_{2}^{0}/\varepsilon+1\leq R/\varepsilon+1 we know V_{2}^{0}\geq\frac{\varepsilon^{3}}{1+\varepsilon^{2}}T. Hence,

\frac{B_{2}U_{2}(A_{1})}{B_{1}U_{2}(A_{2})}\leq\frac{B_{2}}{B_{1}V_{2}^{0}}\int_{0}^{V_{2}^{0}/\varepsilon+1}q(u)\mathrm{d}u+\frac{1}{\varepsilon^{2}V_{2}^{0}}\left(\frac{1}{\varepsilon}+\frac{1}{V_{2}^{0}}\right)
=\frac{B_{2}}{B_{1}V_{2}^{0}}\int_{0}^{V_{2}^{0}/\varepsilon}q(u)\mathrm{d}u+\frac{B_{2}}{B_{1}V_{2}^{0}}\int_{V_{2}^{0}/\varepsilon}^{V_{2}^{0}/\varepsilon+1}q(u)\mathrm{d}u+\frac{1}{\varepsilon^{2}V_{2}^{0}}\left(\frac{1}{\varepsilon}+\frac{1}{V_{2}^{0}}\right)
\leq 1+2\log\frac{1}{\varepsilon}+\frac{\varepsilon}{V_{2}^{0}}+\frac{1}{\varepsilon^{2}V_{2}^{0}}\left(\frac{1}{\varepsilon}+\frac{1}{V_{2}^{0}}\right)
\leq 1+2\log\frac{1}{\varepsilon}+\frac{(1+\varepsilon^{4})(1+\varepsilon^{2})}{\varepsilon^{6}}\frac{1}{T}+\frac{(1+\varepsilon^{2})^{2}}{\varepsilon^{8}}\frac{1}{T^{2}}
=1+2\log\frac{1}{\varepsilon}+O\left(\frac{1}{T}\right)

This completes the proof.

0.B.2 Proof of Lemma 3

For the same reason as in Theorem 3.1, it suffices to consider j=1,i=2 in 2-agent instances, with all inputs transformed as described in the proof of Theorem 3.1. We write U as shorthand for U_{2}(A_{2}); it is also the parameter of the canonical optimization problem.

Consider the two phases of the horizon, with lengths T_{1} and T_{2}. We upper bound each in turn:

  1. Rounds with U_{1}^{t}\leq U. Since the increment of U_{1}^{t} is at least \varepsilon whenever agent 1 receives an item, there are at most U/\varepsilon rounds in this phase:

    \lim_{U\rightarrow\infty}\frac{T_{1}}{U}\leq\frac{1}{\varepsilon}.

  2. Rounds with U<U_{1}^{t}\leq U/\varepsilon. Assume this phase begins at round s. By the analysis of the canonical optimization problem (3), agent 2's envy in this phase can be upper bounded asymptotically:

    \frac{1}{U}\sum_{t=s}^{s+T_{2}-1}v_{2}^{t}\leq\frac{1}{U}\sum_{t=s}^{\infty}v_{1}^{t}q(U_{1}^{t-1})\rightarrow\frac{1}{U}\int_{U}^{U/\varepsilon}q(U_{1})\mathrm{d}U_{1}\leq 1+\log\frac{1}{\varepsilon}.

    Since v_{2}^{t} is at least \varepsilon in each round,

    \lim_{U\rightarrow\infty}\frac{T_{2}}{U}\leq\lim_{U\rightarrow\infty}\frac{\sum_{t=s}^{s+T_{2}-1}v_{2}^{t}}{\varepsilon U}\leq\frac{1}{\varepsilon}\left(1+\log\frac{1}{\varepsilon}\right).

Combining the bounds for T_{1} and T_{2} proves Lemma 3.

0.B.3 Proof of Theorem 3.2

We prove Theorem 3.2 by constructing a hard 2-agent instance that reaches 1+2\log\frac{1}{\varepsilon} envy asymptotically. The construction follows the spirit of the hard instances described in the proof of Theorem 3.1, where agent 2 receives items in the first T_{0} rounds, but nothing afterwards.

We construct a hard instance for B_{1}=B_{2}; for unequal weights the construction is similar. For T_{0},a>0, consider an instance with 2k+2 phases. In each phase, the same item appears for many rounds. The agents' valuations on these items are described in Table 2.

Phase | Length | Valuation 1 | Valuation 2 | U_{1} (after the phase) | U_{2} (after the phase)
A1 | T_{0} | 0 | 1 | 0 | T_{0}
A2 | T_{0} | \varepsilon | 1 | \varepsilon\cdot T_{0} | T_{0}
B1 | \left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{1} | 1 | \varepsilon\cdot a^{1}\cdot T_{0} | T_{0}
B2 | \left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{2} | 1 | \varepsilon\cdot a^{2}\cdot T_{0} | T_{0}
⋮ | | | | |
Bk | \left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{k}=1 | 1 | \varepsilon\cdot a^{k}\cdot T_{0}=T_{0} | T_{0}
C1 | \frac{1}{\varepsilon}\left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{1} | \varepsilon | a^{1}\cdot T_{0} | T_{0}
C2 | \frac{1}{\varepsilon}\left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{2} | \varepsilon | a^{2}\cdot T_{0} | T_{0}
⋮ | | | | |
Ck | \frac{1}{\varepsilon}\left(1-\frac{1}{a}\right)T_{0} | \varepsilon\cdot a^{k}=1 | \varepsilon | a^{k}\cdot T_{0}=\frac{1}{\varepsilon}T_{0} | T_{0}

Table 2: A worst-case instance for envy

One can check that the boundary condition v_{1}^{t}/v_{2}^{t}=U_{1}^{t}/U_{2}^{t} holds at the end of each phase, and that all items after phase A1 are allocated to agent 1. Agent 2's monopolistic utility is

V_{2}=2\left(1+\left(1-\frac{1}{a}\right)\log_{a}\frac{1}{\varepsilon}\right)T_{0}

As T_{0}\rightarrow\infty and a\rightarrow 1, we have

\left(1-\frac{1}{a}\right)\log_{a}\frac{1}{\varepsilon}\rightarrow\log\frac{1}{\varepsilon},\qquad\frac{V_{2}}{T_{0}}\rightarrow 2\left(1+\log\frac{1}{\varepsilon}\right).

Out of this monopolistic utility, agent 2 only obtains utility T_{0}. This gives 1+2\log 1/\varepsilon multiplicative envy asymptotically.
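As a numeric sanity check (ours, not part of the original analysis), the envy of this construction has the closed form 1+2k(1-1/a) with k=\log_{a}(1/\varepsilon), which approaches 1+2\log(1/\varepsilon) as a\rightarrow 1. A minimal sketch in Python:

```python
import numpy as np

def table2_envy(eps, a):
    """Agent 2's multiplicative envy in the Table 2 instance (equal budgets).

    k satisfies eps * a**k = 1. Agent 2 keeps only phase A1 (utility T0).
    Its value on agent 1's bundle, per unit of T0, is:
      1 (phase A2) + k*(1-1/a) (B phases) + k*(1/eps)*(1-1/a)*eps (C phases).
    """
    k = np.log(1 / eps) / np.log(a)
    return 1 + 2 * k * (1 - 1 / a)   # U_2(A_1) / U_2(A_2)

eps = 0.1
print(1 + 2 * np.log(1 / eps))       # asymptotic value, ~5.605
for a in [2.0, 1.1, 1.01]:
    print(a, table2_envy(eps, a))    # approaches the asymptotic value as a -> 1
```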

0.B.4 Proof of Theorem 3.3, \varepsilon=1

Without loss of generality, we assume U_{1}(A_{1})\leq U_{2}(A_{2})\leq\cdots\leq U_{n}(A_{n}). Our main proof strategy is to show that for any input instance \bm{v}\in\mathcal{V}_{1}^{T}, we have for i\in[n],

\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}}{U_{i}(A_{i})}\leq n-i+1. (11)

For i\in[n], define S_{i}\subseteq[T] as the set of items on which at least one agent in \{1,2,\cdots,i\} has nonzero valuation,

S_{i}=\{t:\ \exists\ j\in[i],\ v_{j}^{t}=1\}.

Consider an agent k with k>i. Notice that the integral greedy algorithm guarantees that the number of items in S_{i} allocated to agent k is strictly less than U_{i}(A_{i})+1. If not, then the last such item t should have been allocated to some agent j\leq i, since U_{k}^{t-1}\geq U_{i}(A_{i})\geq U_{j}(A_{j})\geq U_{j}^{t-1} and ties are broken toward smaller indices, which is a contradiction. Therefore, we get an upper bound on the number of items that buyer k>i receives in S_{i}:

|A_{k}\cap S_{i}|\leq U_{i}(A_{i})\ \ (1\leq i\leq n-1,\ k>i).

Define the item set D_{i}=\bigcup_{k\geq i}(A_{k}\cap S_{i}). Then,

|D_{i}|\leq(n-i+1)U_{i}(A_{i})+(n-i). (12)

Intuitively, it is possible that in an offline optimal allocation, all items in D_{i} are shared entirely among the first i agents, i.e., nothing in D_{i} is allocated to any agent k>i; to see this, consider a case where |S_{i}| is negligibly small compared with |S_{i+1}|. Further, it is also possible that all items in D_{i} are given to agent i in an optimal allocation, for example, when buyers 1,2,\cdots,i-1 have zero valuation on these items. This would lead to U_{i}^{\star}=U_{i}(A_{i})+(n-i)(U_{i}(A_{i})+1).

However, we show by contradiction that U_{i}^{\star} cannot be any larger. Suppose U_{i}(A_{i}^{\star})\geq(n-i+1)(U_{i}(A_{i})+1). By inequality (12), at least one item in \bar{D}_{i}=\bigcup_{j<i}(A_{j}\cap S_{i}) is partly allocated to agent i in \bm{x}_{i}^{\star}. This means that an item that belongs to some agent j (j<i) in the integral greedy allocation is re-allocated to agent i in the offline optimal allocation, even though U_{i}^{\star}>U_{j}^{\star}. This is impossible for an optimal allocation, since allocating this item to agent j would strictly increase the geometric mean.

Therefore,

U_{i}(A_{i}^{\star})/U_{i}(A_{i})\leq(n-i+1)+\frac{1}{U_{i}(A_{i}^{\star})}.

Combining this with the fact that U_{i}(A_{i}^{\star})\rightarrow\infty as T\rightarrow\infty for all i\in[n] under the optimal offline allocation, this proves (11). \hfill\square

0.B.5 Proof of Theorem 3.3, \varepsilon\in(0,1)

Without loss of generality, we assume the integral greedy algorithm gives us U_{1}(A_{1})\leq U_{2}(A_{2})\leq\cdots\leq U_{n}(A_{n}). To prove Theorem 3.3, it suffices to show that for any \alpha>0 there exists a constant \lambda such that for all i\in[n] and any input instance in \mathcal{V}_{\varepsilon}^{T},

\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}}{U_{i}(A_{i})}\leq\lambda\cdot i^{\alpha}\cdot\left(1+(n-i)\left(1+2\log\frac{1}{\varepsilon}\right)\right). (13)

We introduce two lemmas to prove (13). Lemma 7 shows that for a set of items A\subseteq A_{i} that belongs to agent i in the greedy allocation, there is an upper bound on the increase in welfare obtainable by transferring A to other agents; it is a generalization of the envy result. The proof of Lemma 7 is similar to that of Theorem 3.1, and is deferred to Section 0.B.6.

Lemma 7.

For i\in[n], let A\subseteq A_{i} be a set of items that are allocated to agent i by the integral greedy algorithm, such that |A|\rightarrow\infty as T\rightarrow\infty. Let J\subseteq[n]\backslash\{i\} be a set of agents not including i, and U=\max_{j\in J}U_{j}(A_{j}). Then for any \bm{v}\in\mathcal{V}_{\varepsilon}^{T},

\lim_{T\rightarrow\infty}\frac{\sum_{s\in A}\max_{j\in J}v_{j}^{s}}{U}\leq\int_{0}^{\min\left\{\frac{1}{\varepsilon},\frac{U_{i}(A)}{U}\right\}}r(u)\mathrm{d}u,

where

r(u)=\begin{cases}\frac{1}{\varepsilon}&0\leq u\leq\varepsilon\\ \frac{1}{u}&\varepsilon<u\leq\frac{1}{\varepsilon}\\ 0&u>\frac{1}{\varepsilon}\end{cases}.
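For later use (e.g., in (14) below), we record the one-line observation that the full integral of r evaluates to the familiar envy constant:

\int_{0}^{1/\varepsilon}r(u)\mathrm{d}u=\int_{0}^{\varepsilon}\frac{1}{\varepsilon}\mathrm{d}u+\int_{\varepsilon}^{1/\varepsilon}\frac{1}{u}\mathrm{d}u=1+2\log\frac{1}{\varepsilon}.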

The next lemma guarantees that, asymptotically, the utility of any single agent in the offline optimal allocation can be at most \lambda n^{\alpha} times as large as the utility of the “wealthiest” agent in the integral greedy allocation, where \lambda is a positive constant that depends on neither T nor n. The proof of Lemma 8 can be found in Section 0.B.7.

Lemma 8.

For any given input \bm{v}\in\mathcal{V}_{\varepsilon}^{T} and \alpha>0, there exists a constant \lambda>0 such that for any k\in[n],

\lim_{T\rightarrow\infty}\frac{U_{k}^{\star}}{\max_{j\in[n]}U_{j}(A_{j})}\leq\lambda n^{\alpha}.

After introducing the two auxiliary lemmas, we return to the proof of Theorem 3.3.

For i\in[n], define S_{i}\subseteq[T] as the set of items on which at least one agent in \{1,2,\cdots,i\} has nonzero valuation,

S_{i}=\{t:\ \exists\ j\in[i],\ v_{j}^{t}>0\}.

For an agent k\geq i, we consider A_{k}\cap S_{i}, the set of items in S_{i} that are allocated to agent k.

In the proof sketch, we gave the intuition that the optimal allocation may give all items in A_{k}\cap S_{i} (k>i) to the first i agents, which can cause a large ratio U_{i}^{\star}/U_{i}(A_{i}). To bound this ratio, we first apply Lemma 7 to bound the maximum welfare of the first i agents on the item set A_{k}\cap S_{i}. For k>i,

\lim_{T\rightarrow\infty}\frac{\sum_{s\in S_{i}\cap A_{k}}\max_{j\leq i}v_{j}^{s}}{U_{i}}\leq 1+2\log\frac{1}{\varepsilon}. (14)

Then, consider the input \bm{v}^{\prime} defined by setting every agent k>i's value to 0 on every item in S_{i}. Because agents 1,2,\cdots,i are given weakly more items under \bm{v}^{\prime}, both the integral greedy allocation and the offline optimal allocation have asymptotically non-decreasing utilities for these agents when T is large enough (the amount of decrease is negligible in T):

\lim_{T\rightarrow\infty}\frac{U_{j}^{\star}(\bm{v})}{U_{i}(\bm{v})}\leq\lim_{T\rightarrow\infty}\frac{U_{j}^{\star}(\bm{v}^{\prime})}{U_{i}(\bm{v})},\ \forall j\in[i] (15)
\lim_{T\rightarrow\infty}\frac{U_{j}(\bm{v})}{U_{i}(\bm{v})}\leq\lim_{T\rightarrow\infty}\frac{U_{j}(\bm{v}^{\prime})}{U_{i}(\bm{v})},\ \forall j\in[i]

Consider the integral greedy allocation after transforming \bm{v} into \bm{v}^{\prime}. By equation (14), the total utility increase of the first i agents is asymptotically upper bounded by (n-i)\left(1+2\log\frac{1}{\varepsilon}\right)\cdot U_{i}(\bm{v}). The utility of the “wealthiest” agent after the transformation is then upper bounded by

\lim_{T\rightarrow\infty}\frac{\max_{j\in[i]}U_{j}(\bm{v}^{\prime})}{U_{i}(\bm{v})}\leq 1+(n-i)\left(1+2\log\frac{1}{\varepsilon}\right). (16)

If (16) did not hold, it would contradict the non-decreasing property in (15).

Combining (16) and the first inequality in (15),

\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}(\bm{v})}{U_{i}(\bm{v})}\leq\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}(\bm{v}^{\prime})}{U_{i}(\bm{v})}
=\lim_{T\rightarrow\infty}\frac{\max_{j\in[i]}U_{j}(\bm{v}^{\prime})}{U_{i}(\bm{v})}\cdot\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}(\bm{v}^{\prime})}{\max_{j\in[i]}U_{j}(\bm{v}^{\prime})}
\leq\left(1+(n-i)\left(1+2\log\frac{1}{\varepsilon}\right)\right)\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}(\bm{v}^{\prime})}{\max_{j\in[i]}U_{j}(\bm{v}^{\prime})}
\overset{\text{(a)}}{\leq}\lambda\cdot i^{\alpha}\cdot\left(1+(n-i)\left(1+2\log\frac{1}{\varepsilon}\right)\right).

In the last step (a), we applied Lemma 8 to \bm{v}^{\prime}, since \bm{v}^{\prime} can be regarded as an input sequence with i agents. This proves (13).

0.B.6 Proof of Lemma 7

The proof of Lemma 7 follows the same pattern as the proof of Theorem 3.1. First, transform the input while preserving the result of the integral greedy algorithm. Then, use an optimization program to characterize the input that maximizes the left-hand side of the inequality in Lemma 7.

Thanks to Lemma 2, we do not need to consider agents that are not in J\cup\{i\}; thus assume without loss of generality that i=n, J=\{1,2,\cdots,n-1\} and U_{1}(A_{1})\leq\cdots\leq U_{n-1}(A_{n-1})=U.

For any given input \bm{v}, we transform it into \bm{v}^{\prime} without changing the result of the integral greedy allocation:

  1. For each j\in\{1,2,\cdots,n-1\} and each item in A_{j}, set all agents' values to 0 except agent j's.

  2. Move all items in A_{n} to the end of the sequence.

Therefore, it suffices to consider sequences in which the other agents receive all of their items in the first R rounds, after which every item is allocated to agent n. The left-hand side in Lemma 7 is then upper bounded by the following program:

\max_{v_{i}^{t}}\quad\frac{1}{U}\sum_{t=R+1}^{\infty}\max_{j<n}v_{j}^{t}
\text{s.t.}\quad v_{i}^{t}\in\{0\}\cup[\varepsilon,1],\ \ \forall i\in[n]
\qquad\frac{v_{n}^{t}}{v_{j}^{t}}\geq\frac{U_{n}^{t-1}}{U_{j}(A_{j})},\ \ \forall j\in[n-1]
\qquad U_{n}^{t}=\sum_{s=R+1}^{t}v_{n}^{s}

An upper bound on (\max_{j<n}v_{j}^{t})/v_{n}^{t} is given by \min\left\{\frac{1}{\varepsilon},\frac{U}{U_{n}^{t-1}}\right\}. The rest of the analysis is identical to the analysis of the canonical optimization program (3) in Theorem 3.1.

0.B.7 Proof of Lemma 8

For simplicity of notation, in this proof we write U_{j}=U_{j}(A_{j}).

Consider modifying the integral greedy allocation into an optimal one: for each i,j\in[n], some items that belong to i in the integral greedy allocation might be partly re-allocated to j in the optimal allocation, and vice versa. To characterize this modification procedure, we define the following variables:

  • Z_{i}:=\sum_{t\in A_{i}}(1-x_{i}^{\star,t})\cdot v_{i}^{t} is agent i's value on the (fractions of) items that belong to it in the integral greedy allocation, but not in the optimal allocation.

  • Y_{ji}:=\sum_{t\in A_{j}}x_{i}^{\star,t}v_{i}^{t} is agent i's value on the (fractions of) items that belong to agent j in the integral greedy allocation, but to agent i in the optimal allocation. We set Y_{jj}=0.

From the above definitions,

U_{i}^{\star}=U_{i}-Z_{i}+\sum_{j\neq i}Y_{ji},\quad i\in[n]. (17)

Divide all variables in (17) by \max_{j\in[n]}U_{j}(A_{j}) to get u_{i}^{\star},u_{i},z_{i} and y_{ji}. Then (17) becomes

u_{i}^{\star}=u_{i}-z_{i}+\sum_{j\neq i}y_{ji},\quad i\in[n]. (18)

All of the above variables are non-negative. Applying Lemma 7, we get further constraints on y_{ji}:

y_{ji}\leq u_{i}\int_{0}^{\min\{\frac{1}{\varepsilon},\frac{z_{j}}{u_{i}}\}}r(u)\mathrm{d}u+o_{T}(1)\leq c_{1}\min\{u_{i},u_{j}\}+o_{T}(1) (19)
\sum_{i\neq j}y_{ji}\leq\max_{i\neq j}u_{i}\int_{0}^{\min\{\frac{1}{\varepsilon},\frac{z_{j}}{u_{i}}\}}r(u)\mathrm{d}u+o_{T}(1)\leq c_{2}\min\{\max_{i\neq j}u_{i},u_{j}\}+o_{T}(1) (20)

In the above conditions we write o_{T} to indicate explicitly that the asymptotics are in T; here c_{1},c_{2}\leq 1/\varepsilon are constants.

Notice that so far we have not used the condition that (u_{1}^{\star},\cdots,u_{n}^{\star}) comes from an optimal allocation. We use a necessary (but not sufficient) condition to characterize optimality:

u_{j}^{\star}<\varepsilon u_{i}^{\star}\implies y_{ji}=0 (21)

Condition (21) holds because when u_{j}^{\star}<\varepsilon u_{i}^{\star} in an optimal allocation, agent i must receive nothing from A_{j}: otherwise, re-allocating any such (fraction of an) item back to agent j would strictly increase the geometric mean, since all nonzero values lie in [\varepsilon,1].

Next, we show that u_{k}^{\star}=o(n^{\alpha}) for any \alpha>0. If u_{k}^{\star} has a constant upper bound, this is immediate. Otherwise, suppose u_{k}^{\star}\geq g(n) as T goes to infinity, where g(n)\rightarrow\infty as n\rightarrow\infty; we derive g(n)=o(n^{\alpha}) for any positive \alpha. Note that when we refer to g(n), the asymptotic notation is in n.

Consider a directed graph D with [n] as its vertex set, and the set of edges defined as

E(D)=\left\{(i,j):y_{ji}>0\right\}

Define J(\ell)\subseteq[n] as the set of all vertices j such that the distance from k to j is \ell, i.e., the shortest directed path from k to j has length \ell; J(0)=\{k\}. We denote m_{\ell}:=|J(\ell)|.

For j\in J(\ell), there exists a path j_{0}j_{1}\cdots j_{\ell} with j_{0}=k and j_{\ell}=j. By condition (21), the existence of an edge (j_{s},j_{r}) implies u_{j_{r}}^{\star}\geq\varepsilon u_{j_{s}}^{\star}. Thus u_{j}^{\star}\geq\varepsilon u_{j_{\ell-1}}^{\star}\geq\cdots\geq\varepsilon^{\ell}u_{k}^{\star}\geq\varepsilon^{\ell}g(n). This gives a lower bound on the sum of u_{j}^{\star} over j\in J(\ell):

\sum_{j\in J(\ell)}u_{j}^{\star}\geq\varepsilon^{\ell}\cdot g(n)\cdot m_{\ell}. (22)

Next, we show inductively that m_{\ell}=\Omega(g(n)\cdot m_{\ell-1}). For \ell=1, this is true because, as T\rightarrow\infty, condition (19) gives

g(n)\leq u_{k}^{\star}\leq u_{k}+\sum_{j\in J(1)}y_{jk}\leq u_{k}+c_{1}m_{1}.

For \ell\geq 1, suppose m_{\ell^{\prime}}=\Omega(g(n)\cdot m_{\ell^{\prime}-1})=\Omega(g^{\ell^{\prime}}(n)) holds for \ell^{\prime}=1,2,\cdots,\ell. Consider again the sum of u_{j}^{\star} over j\in J(\ell); as T\rightarrow\infty,

\sum_{j\in J(\ell)}u_{j}^{\star}\leq\sum_{j\in J(\ell)}u_{j}+\sum_{j\in J(\ell)}\sum_{i\neq j}y_{ij}
\overset{\text{(a)}}{=}\sum_{j\in J(\ell)}u_{j}+\sum_{j\in J(\ell)}\sum_{\ell^{\prime}=0}^{\ell+1}\sum_{i\in J(\ell^{\prime})}y_{ij}
=\sum_{j\in J(\ell)}u_{j}+\sum_{\ell^{\prime}=0}^{\ell+1}\sum_{i\in J(\ell^{\prime})}\sum_{j\in J(\ell)}y_{ij}
\overset{\text{(b)}}{\leq}\sum_{j\in J(\ell)}u_{j}+\sum_{\ell^{\prime}=0}^{\ell+1}\sum_{i\in J(\ell^{\prime})}c_{2}u_{i}
\leq(1+c_{2})m_{\ell}+c_{2}m_{\ell+1}+\sum_{\ell^{\prime}=0}^{\ell-1}c_{2}m_{\ell^{\prime}}
\overset{\text{(c)}}{=}O(m_{\ell})+c_{2}m_{\ell+1}, (23)

where (a) is because y_{ij}=0 for i\in J(\ell^{\prime}) when \ell^{\prime}\geq\ell+2, (b) is a direct application of condition (20), and (c) is by the induction hypothesis.

Combining (22) and (23), c_{2}m_{\ell+1}+O(m_{\ell})\geq\varepsilon^{\ell}g(n)m_{\ell}=\Omega(m_{\ell}g(n)). This tells us m_{\ell+1}=\Omega(g(n)\cdot m_{\ell}) and thus completes the induction.

Define \ell^{*}:=\max\{\ell:J(\ell)\text{ is non-empty}\}, i.e., \ell^{*} is the length of the longest path starting from k. Because m_{\ell^{*}}=\Omega\left(g^{\ell^{*}}(n)\right)\leq n, we have

\ell^{*}=O\left(\frac{\log n}{\log g(n)}\right). (24)

Meanwhile, applying (22) and (23) at \ell=\ell^{*} (where m_{\ell^{*}+1}=0) gives \varepsilon^{\ell^{*}}g(n)m_{\ell^{*}}=O(m_{\ell^{*}}). We then know that

g(n)=O\left(\frac{1}{\varepsilon^{\ell^{*}}}\right). (25)

Combining (24) and (25), \log g(n)\leq\ell^{*}\log\frac{1}{\varepsilon}+O(1)=O\left(\frac{\log n}{\log g(n)}\right), so (\log g(n))^{2}=O(\log n) and g(n)=O(e^{\sqrt{\log n}})=o(n^{\alpha}) for any \alpha>0.

0.B.8 Proof of Theorem 3.4

Consider dividing the horizon of length T into n phases with lengths T_{1},\cdots,T_{n}, satisfying

\sum_{j=1}^{n}T_{j}=T,\qquad\lim_{T\rightarrow\infty}\frac{T_{j-1}}{T_{j}}=0\ \ (\forall j\in\{2,\cdots,n\})

For the first phase (i.e., for t\in\{1,2,\cdots,T_{1}\}), v_{i}^{t}=1 for all i\in[n]. Because there are T_{1} items and n buyers, for any online integral algorithm there exists a buyer i_{1} whose utility after phase 1 satisfies U_{i_{1}}^{T_{1}}\leq\frac{T_{1}}{n}.

Recursively, for each phase k\geq 2, we set, for every round t in phase k,

v_{i}^{t}=\begin{cases}0&i\in\{i_{1},\cdots,i_{k-1}\}\\ 1&\text{otherwise}\end{cases},

and let i_{k}\not\in\{i_{1},\cdots,i_{k-1}\} be a buyer whose utility at the end of phase k satisfies U_{i_{k}}\leq\frac{1}{n-k+1}\sum_{j=1}^{k}T_{j}; such a buyer exists by pigeonhole, since the n-k+1 buyers outside \{i_{1},\cdots,i_{k-1}\} share a total utility of at most \sum_{j=1}^{k}T_{j}.

Notice that the offline optimal allocation can give everything in phase k to buyer i_{k}. Relabeling buyer i_{k} as agent n-k+1, we have \lim_{T\rightarrow\infty}U_{i}^{\star}/T_{n-i+1}\geq 1 and

\lim_{T\rightarrow\infty}\frac{U_{i}^{\star}}{U_{i}}\geq i,

since U_{i}\leq\frac{1}{i}\sum_{j=1}^{n-i+1}T_{j} and \sum_{j=1}^{n-i+1}T_{j}/T_{n-i+1}\rightarrow 1. Taking the product over all agents, this leads to a Nash welfare ratio of at least (n!)^{1/n}.
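To illustrate the construction, the following is a minimal sketch (our own; M is a hypothetical phase-growth parameter, and integral greedy stands in for the arbitrary online algorithm). The per-agent ratios multiply to roughly n!, so the empirical Nash welfare ratio approaches (n!)^{1/n} as M grows:

```python
import math

def phased_lower_bound(n, M=50):
    """Run integral greedy on the phased instance of Theorem 3.4.

    Phase k (k = 1..n) has length M**k, so T_{k-1}/T_k -> 0 as M grows.
    In phase k, every buyer outside {i_1, ..., i_{k-1}} values the item at 1.
    """
    lengths = [M ** k for k in range(1, n + 1)]
    U = [0.0] * n
    eliminated, picks = set(), []
    for L in lengths:
        active = [i for i in range(n) if i not in eliminated]
        for _ in range(L):
            # greedy: give the item to the lowest-utility buyer who values it,
            # breaking ties toward the smallest index
            winner = min(active, key=lambda i: (U[i], i))
            U[winner] += 1.0
        i_k = min(active, key=lambda i: U[i])  # pigeonhole buyer of this phase
        eliminated.add(i_k)
        picks.append(i_k)
    # offline benchmark: give all of phase k to buyer i_k, so U*_{i_k} = T_k
    ratios = [lengths[k] / U[i_k] for k, i_k in enumerate(picks)]
    return math.prod(ratios) ** (1 / n)

print(phased_lower_bound(4), math.factorial(4) ** (1 / 4))  # both roughly 2.21
```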

0.B.9 Proof of Theorem 3.5

Define v_{\mathrm{max}}=\max_{i,\tau}v_{i}^{\tau}, v_{i,\mathrm{max}}=\max_{\tau}v_{i}^{\tau}, and \bar{v}_{\mathrm{max}}=\frac{1}{n}\sum_{i=1}^{n}v_{i,\mathrm{max}}. In this proof, we study the general case with v_{i}^{\tau}\in[0,v_{\mathrm{max}}]. We will prove that

R_{\delta}(v)\leq 3+\frac{4}{\delta}\bar{v}_{\mathrm{max}}+2\log\left(1+\frac{v_{\mathrm{max}}}{\delta}\right)+2\log T. (26)

The proof is divided into two major steps.

0.B.9.1 Step 1. Ratio bounded by prices.

We first introduce notation for quantities that appear during the run of Algorithm 3.

Let \beta_{i}^{\tau}=\frac{1}{\delta+U_{i}^{\tau-1}}, and let p^{\tau}=\max_{i}\beta_{i}^{\tau}v_{i}^{\tau}=v_{i^{\tau}}^{\tau}/(U_{i^{\tau}}^{\tau-1}+\delta) be the price of item \tau. The allocation rule (Equation 5 of Algorithm 3) can then be written as i^{\tau}=\min\arg\max_{i}\{\beta_{i}^{\tau}v_{i}^{\tau}\}. Also let u_{i}^{\tau}=v_{i}^{\tau}1(i=i^{\tau}) be the utility of agent i from item \tau according to the allocation of the seeded integral greedy algorithm, and let U_{i}^{\tau}=\sum_{s=1,\dots,\tau}u_{i}^{s} be the cumulative utilities for \tau\geq 1, with U_{i}^{0}=0 and U_{i}=U_{i}^{T}.
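To make these quantities concrete, here is a minimal sketch (ours, under the assumption that the values arrive as a T\times n array) of the seeded integral greedy rule, including the tie-breaking convention i^{\tau}=\min\arg\max_{i}\beta_{i}^{\tau}v_{i}^{\tau}:

```python
import numpy as np

def seeded_integral_greedy(values, delta):
    """values: (T, n) array of v_i^tau; delta: seed utility.

    Returns the winner of each round and the final utilities U_i.
    """
    T, n = values.shape
    U = np.zeros(n)                        # cumulative utilities U_i^{tau-1}
    winners = []
    for tau in range(T):
        bids = values[tau] / (delta + U)   # beta_i^tau * v_i^tau
        i_tau = int(np.argmax(bids))       # argmax returns the smallest index on ties
        winners.append(i_tau)
        U[i_tau] += values[tau, i_tau]     # only the winner accrues utility
    return winners, U
```

The price p^{\tau} of item \tau is then the winning bid bids[i_tau], matching the definition above.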

We begin with the following claim.

Claim.

For any feasible hindsight allocation x^{*} and its resulting utilities \widetilde{U}_{i},

\frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}\leq 1+\frac{1}{n}\sum_{\tau=1}^{T}p^{\tau}=1+\frac{1}{n}\sum_{i=1}^{n}\sum_{\tau=1}^{T}\frac{u_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}. (27)

Notice that \frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}\leq\frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}}{U_{i}+\delta}+1. Then,

\frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}}{U_{i}+\delta}=\frac{1}{n}\sum_{i=1}^{n}\left(\frac{\sum_{\tau=1}^{T}x_{i}^{*,\tau}v_{i}^{\tau}}{U_{i}+\delta}\right)
=\frac{1}{n}\sum_{i=1}^{n}\sum_{\tau=1}^{T}\left(\frac{v_{i}^{\tau}}{U_{i}+\delta}\cdot x_{i}^{*,\tau}\right)
\leq\frac{1}{n}\sum_{i=1}^{n}\sum_{\tau=1}^{T}\left(\frac{v_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}\cdot x_{i}^{*,\tau}\right)=:(\mathrm{A}).

By the definition of p^{\tau}, it holds for every agent i that

\frac{v_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}\leq p^{\tau}.

Applying the supply feasibility constraint,

(\mathrm{A})\leq\frac{1}{n}\sum_{i=1}^{n}\sum_{\tau=1}^{T}p^{\tau}\cdot x_{i}^{*,\tau}\leq\frac{1}{n}\sum_{\tau=1}^{T}p^{\tau}.

Next, use the integrality of Algorithm 3 to rewrite the sum of prices as a sum of running ratios:

\frac{1}{n}\sum_{\tau=1}^{T}p^{\tau}=\frac{1}{n}\sum_{\tau=1}^{T}\frac{v_{i^{\tau}}^{\tau}}{U_{i^{\tau}}^{\tau-1}+\delta}\cdot x_{i^{\tau}}^{\tau}
=\frac{1}{n}\sum_{\tau=1}^{T}\left(\frac{v_{i^{\tau}}^{\tau}}{U_{i^{\tau}}^{\tau-1}+\delta}\cdot x_{i^{\tau}}^{\tau}+\sum_{i\neq i^{\tau}}\frac{v_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}\cdot x_{i}^{\tau}\right)
=\frac{1}{n}\sum_{\tau=1}^{T}\sum_{i=1}^{n}\left(\frac{v_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}\cdot x_{i}^{\tau}\right).

0.B.9.2 Step 2. Introducing a basic inequality.

Lemma 9 (Bach and Levy [2019]).

For nonnegative numbers a_{1},\ldots,a_{n}\in[0,a] and any a_{0}>0, it holds that

\sum_{i=1}^{n}\frac{a_{i}}{a_{0}+\sum_{j=1}^{i-1}a_{j}}\leq 2+\frac{4a}{a_{0}}+2\log\left(1+\sum_{i=1}^{n-1}a_{i}/a_{0}\right).
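As a quick randomized sanity check of Lemma 9 (our own; the constants a and a0 are arbitrary test values), one can verify the inequality on random inputs:

```python
import numpy as np

rng = np.random.default_rng(0)
a, a0 = 1.0, 0.1
for _ in range(1000):
    x = rng.uniform(0.0, a, size=int(rng.integers(1, 50)))
    prefix = a0 + np.concatenate(([0.0], np.cumsum(x)[:-1]))  # a0 + sum_{j<i} a_j
    lhs = float(np.sum(x / prefix))
    rhs = 2 + 4 * a / a0 + 2 * np.log(1 + np.sum(x[:-1]) / a0)
    assert lhs <= rhs + 1e-9
```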

We use the above inequality to bound the right-hand side of (27). For a fixed agent i, recall that v_{i,\mathrm{max}}=\max_{\tau}v_{i}^{\tau}; then

\sum_{\tau=1}^{T}\left(\frac{v_{i}^{\tau}}{U_{i}^{\tau-1}+\delta}\cdot x_{i}^{\tau}\right)=\sum_{\tau=1}^{T}\left(\frac{v_{i}^{\tau}x_{i}^{\tau}}{\sum_{s=1}^{\tau-1}v_{i}^{s}x_{i}^{s}+\delta}\right)
\leq 2+4\frac{v_{i,\mathrm{max}}}{\delta}+2\log\left(1+\sum_{\tau=1}^{T-1}\frac{u_{i}^{\tau}}{\delta}\right) (invoking Lemma 9 with a_{0}=\delta)
=2+4\frac{v_{i,\mathrm{max}}}{\delta}+2\log T+2\log\left(\frac{1}{T}+\frac{1}{T}\sum_{\tau=1}^{T-1}\frac{u_{i}^{\tau}}{\delta}\right)
\leq 2+4\frac{v_{i,\mathrm{max}}}{\delta}+2\log T+2\log\left(1+\frac{v_{\mathrm{max}}}{\delta}\right).

Finally, putting everything together, we obtain the desired result:

\frac{1}{n}\sum_{i=1}^{n}\frac{\widetilde{U}_{i}+\delta}{U_{i}+\delta}\leq 1+\frac{1}{n}\sum_{i=1}^{n}\left(2+4\frac{v_{i,\mathrm{max}}}{\delta}+2\log T+2\log\left(1+\frac{v_{\mathrm{max}}}{\delta}\right)\right)
=3+\frac{4}{\delta}\bar{v}_{\mathrm{max}}+2\log\left(1+\frac{v_{\mathrm{max}}}{\delta}\right)+2\log T.

Appendix 0.C Missing Proofs in Section 4

0.C.1 Proof of Lemma 4

By the definition of t^{*} and the bound on \ell, one can check:

U_{1}(A_{1})\cdot S>\frac{1}{\varepsilon}\ell t^{*}.

If U_{j}^{s^{\prime}-1}(\bm{v}^{\prime})<\ell(s^{\prime}-1), then

\frac{\hat{U}_{j}^{s^{\prime}-1}(\bm{v}^{\prime})}{\hat{U}_{1}^{s^{\prime}-1}(\bm{v}^{\prime})}\leq\frac{\ell t^{*}}{U_{1}(A_{1})\cdot S}<\varepsilon.

Now we show that the PACE allocation is preserved for the first t^{*} items in the transformed sequence. For an original item s\in A_{j} and the corresponding item s^{\prime} in \bm{v}^{\prime} (s^{\prime}\leq t^{*}), we check two cases:

  1. When U_{j}^{s^{\prime}-1}(\bm{v}^{\prime})\geq\ell(s^{\prime}-1), we have

    \hat{U}_{1}^{s^{\prime}-1}(\bm{v}^{\prime})=\max\{U_{1}(A_{1}),\ell(s^{\prime}-1)\}\geq\max\{U_{1}^{s}(\bm{v}),\ell(s-1)\}\geq\hat{U}_{1}^{s}(\bm{v}),
    \hat{U}_{j}^{s^{\prime}-1}(\bm{v}^{\prime})=\max\{U_{j}^{s^{\prime}-1}(\bm{v}^{\prime}),\ell(s^{\prime}-1)\}=U_{j}^{s^{\prime}-1}(\bm{v}^{\prime})=\max\{U_{j}^{s-1}(\bm{v}),\ell(s-1)\}=\hat{U}_{j}^{s}(\bm{v}).

    Hence \hat{U}_{j}^{s^{\prime}-1}(\bm{v}^{\prime})/\hat{U}_{1}^{s^{\prime}-1}(\bm{v}^{\prime})\geq\hat{U}_{j}^{s}(\bm{v})/\hat{U}_{1}^{s}(\bm{v})>v_{j}^{s}/v_{1}^{s}.

  2. When U_{j}^{s^{\prime}-1}(\bm{v}^{\prime})<\ell(s^{\prime}-1), by the first display above,

    \frac{\hat{U}_{j}^{s^{\prime}-1}(\bm{v}^{\prime})}{\hat{U}_{1}^{s^{\prime}-1}(\bm{v}^{\prime})}<\varepsilon\leq\frac{v_{j}^{s}}{v_{1}^{s}}

Because we have set all agents' valuations to 0 except those of agents 1 and j, in both cases the item is indeed allocated to agent j, the same as under the original input \bm{v}.

0.C.2 Proof of Lemma 5

In this proof, we let a=U_{1}(A_{1})/|A_{1}|\in[\varepsilon,1] be the per-round utility gain of agent 1 in the first S rounds.

To identify the sequence that maximizes agent 1's envy toward agent j's bundle B_{j}, we use the same technique as in the proof of Theorem 3.1. Maximizing agent 1's envy is equivalent to the following program:

\max_{v_{1}^{t_{i}},v_{j}^{t_{i}},t_{i}}\quad\frac{1}{aS}\sum_{i=1}^{k}v_{1}^{t_{i}} (28)
\text{s.t.}\quad S<t_{1}<\cdots<t_{k}\leq t^{*}
\qquad v_{1}^{t_{i}},v_{j}^{t_{i}}\in\{0\}\cup[\varepsilon,1]
\qquad\frac{v_{j}^{t_{i}}}{v_{1}^{t_{i}}}\geq\frac{\hat{U}_{j}^{t_{i-1}}}{\hat{U}_{1}^{t_{i-1}}}
\qquad\hat{U}_{1}^{t_{i}}=\max\{\ell t_{i},aS\}
\qquad\hat{U}_{j}^{t_{i}}=\max\{\ell t_{i},U_{j}^{t_{i}}\}
\qquad U_{j}^{t_{i}}=\sum_{i^{\prime}=1}^{i}v_{j}^{t_{i^{\prime}}}

Here \{t_{1},\cdots,t_{k}\} are the k positions at which the items of B_{j} are placed.

Recall the technique used in the proof of Theorem 3.1, where we introduced the function q such that q(u) is an upper bound on v_{1}^{t+1}/v_{j}^{t+1} when U_{j}^{t}=u. When the horizon is infinitely long, each choice of v_{1}^{t} makes up only a small step, and the envy is upper bounded by the integral \frac{1}{U_{1}^{S}}\int_{0}^{\infty}q(u)\mathrm{d}u in the asymptotic sense.

There is a major difference when it comes to PACE. In the integral greedy algorithm, the upper bound on v_{1}^{t+1}/v_{j}^{t+1} is clearly given by \min\{\frac{1}{\varepsilon},\frac{U_{1}^{S}}{u}\} once we know that U_{j}^{t}=u. In PACE, however, knowing only U_{j}^{t}=u is not sufficient to bound v_{1}^{t+1}/v_{2}^{t+1}; the bound may also depend on t, the position of the item:

\frac{v_{1}^{t}}{v_{2}^{t}}\leq\min\left\{\frac{1}{\varepsilon},\frac{\max\{\ell(t-1),aS\}}{\max\{\ell(t-1),U_{2}^{t-1}\}}\right\}.

Thanks to Lemma 4 and the fact that \ell t^{*}<aS, we can simplify the above bound and remove the dependence on t:

\frac{v_{1}^{t}}{v_{2}^{t}}\leq\begin{cases}\frac{1}{\varepsilon}&U_{2}^{t-1}/aS\leq\varepsilon\\ \frac{aS}{U_{2}^{t-1}}&\varepsilon<U_{2}^{t-1}/aS\leq 1/\varepsilon\\ 0&U_{2}^{t-1}/aS>1/\varepsilon\end{cases}. (29)

The upper bound (29) is identical to the constraint in the canonical optimization program of an envy maximizer parametrized by aS, defined as (3) in the proof of Theorem 3.1. Hence, the asymptotic upper bounds for both the envy and |B_{j}\cap C_{1}| in the canonical optimization program still hold.

0.C.3 Proof of Lemma 6

To prove Lemma 6, we show that for any given c, by choosing \ell\leq\bar{\ell} where \bar{\ell}<\frac{c\varepsilon}{1+(n-1)(1+\log 1/\varepsilon)}, we have for any i\in[n],

\lim_{T\rightarrow\infty}\frac{U_{i}(A_{i})}{T}\geq\bar{\ell}.

We show this by contradiction. Assume there exists i\in[n] such that U_{i}(A_{i})<\bar{\ell}T. Let C_{i}=\{t:v_{i}^{t}>0\} be the set of items on which agent i has nonzero value. To construct a contradiction, we show that U_{i}(A_{i})<\bar{\ell}T implies an upper bound on each |A_{j}\cap C_{i}|, and thus on |C_{i}| (via a union bound); this contradicts the lower bound on V_{i}.

Consider an adversarial setting where the adversary attempts to maximize |A_{j}\cap C_{i}|, subject to the assumptions on the input and the allocation rule of PACE. Suppose A_{j}\cap C_{i}=\{t_{1},\cdots,t_{m}\} where t_{1}<\cdots<t_{m}. We relax the constraints on the adversary as follows:

  1. For t\in A_{j}\cap C_{i}, the adversary is subject to the constraint

    \frac{v_{j}^{t}}{\hat{U}_{j}^{t-1}}\geq\max_{k\in[n]}\frac{v_{k}^{t}}{\hat{U}_{k}^{t-1}}.

    We relax this into

    \frac{v_{j}^{t}}{\hat{U}_{j}^{t-1}}\geq\frac{v_{i}^{t}}{\hat{U}_{i}^{t-1}}. (30)

  2. Notice that \hat{U}_{j}^{t}\geq U_{j}^{t} and \hat{U}_{i}^{t}\leq\bar{\ell}T by assumption. Then we can further relax (30) into

    \frac{v_{j}^{t}}{U_{j}^{t-1}}\geq\frac{v_{i}^{t}}{\bar{\ell}T}. (31)

  3. We keep only the minimal constraint on U_{j}^{t}:

    U_{j}^{t}\geq\sum_{s\in A_{j}\cap C_{i},s<t}v_{j}^{s}. (32)

The relaxed adversarial setting consists of constraints (31) and (32) and the value-domain requirement v_{i}^{t},v_{j}^{t}\in[0,1]; the values it concerns are v_{i}^{t},v_{j}^{t} for t\in C_{i}\cap A_{j}.

Recall the definition of the canonical optimization program for an envy maximizer in (3). The above relaxed constraints are identical to a canonical optimization program parametrized by \bar{\ell}T. By Lemma 3, the length of the value sequence in the relaxed setting is asymptotically bounded:

\lim_{T\rightarrow\infty}\frac{|C_{i}\cap A_{j}|}{T}\leq\frac{\bar{\ell}}{\varepsilon}\left(1+\log\frac{1}{\varepsilon}\right). (33)

Since (33) is an upper bound for the relaxed setting, it must also hold in the true adversarial setting. We then know that

\lim_{T\rightarrow\infty}\frac{|C_{i}|}{T}=\lim_{T\rightarrow\infty}\frac{|A_{i}\cap C_{i}|}{T}+\sum_{j\neq i}\lim_{T\rightarrow\infty}\frac{|C_{i}\cap A_{j}|}{T}
\leq\frac{\bar{\ell}}{\varepsilon}\left(1+(n-1)\left(1+\log\frac{1}{\varepsilon}\right)\right)
<c,

where the last step is by the definition of \bar{\ell}.

However, this is impossible, since |C_{i}|<cT implies V_{i}<cT. By contradiction, we must have U_{i}(A_{i})\geq\bar{\ell}T for sufficiently large T.

0.C.4 Proof of Theorem 4.2

Since \ell<\frac{c\varepsilon}{1+(n-1)\left(1+\log 1/\varepsilon\right)} and r=1, by Lemma 6,

\bm{v}\in\mathcal{V}_{\varepsilon,c}^{T}\implies\bm{v}\in\mathcal{V}_{\varepsilon}^{T}(\ell,r).

We can then apply the envy bound in Theorem 4.1 to \bm{v}\in\mathcal{V}_{\varepsilon,c}^{T} to get that for all i\neq j,

\lim_{T\rightarrow\infty}\frac{U_{i}(A_{j})}{U_{i}(A_{i})}\leq 1+2\log\frac{1}{\varepsilon}.

Summing over all j\in[n] and using V_{i}=\sum_{j=1}^{n}U_{i}(A_{j}), we have

\lim_{T\rightarrow\infty}\frac{V_{i}}{U_{i}(A_{i})}=\lim_{T\rightarrow\infty}\sum_{j=1}^{n}\frac{U_{i}(A_{j})}{U_{i}(A_{i})}\leq 1+(n-1)\left(1+2\log\frac{1}{\varepsilon}\right).

Applying the condition V_{i}\geq cT,

\lim_{T\rightarrow\infty}\frac{U_{i}(A_{i})}{T}\geq c\left(1+(n-1)\left(1+2\log\frac{1}{\varepsilon}\right)\right)^{-1}\geq\frac{c}{n}\left(1+2\log\frac{1}{\varepsilon}\right)^{-1}.

On the other hand, since each item contributes total value at most 1, we have \sum_{i=1}^{n}U_{i}^{\star}\leq T, so by the AM-GM inequality the optimal Nash welfare can be bounded as follows:

\prod_{i=1}^{n}U_{i}^{\star}\leq\left(\frac{T}{n}\right)^{n}.

Combining the above two inequalities, the geometric mean of the per-agent ratios satisfies

\lim_{T\rightarrow\infty}\left(\prod_{i=1}^{n}\frac{U_{i}^{\star}}{U_{i}(A_{i})}\right)^{1/n}\leq\frac{T}{n}\cdot\frac{n}{cT}\left(1+2\log\frac{1}{\varepsilon}\right)=\frac{1}{c}\left(1+2\log\frac{1}{\varepsilon}\right).

This proves Theorem 4.2.