Online Market Equilibrium with Application to Fair Division

Yuan Gao
Columbia University
[email protected]
&Christian Kroer
Columbia University
[email protected]
email
&Alex Peysakhovich
Facebook AI Research
[email protected]

Abstract

Computing market equilibria is a problem of both theoretical and applied interest. Much research to date focuses on the case of static Fisher markets with full information on buyers’ utility functions and item supplies. Motivated by real-world markets, we consider an online setting: individuals have linear, additive utility functions; items arrive sequentially and must be allocated and priced irrevocably. We define the notion of an online market equilibrium in such a market as time-indexed allocations and prices which guarantee buyer optimality and market clearance in hindsight. We propose a simple, scalable and interpretable allocation and pricing dynamics termed as PACE. When items are drawn i.i.d. from an unknown distribution (with a possibly continuous support), we show that PACE leads to an online market equilibrium asymptotically. In particular, PACE ensures that buyers’ time-averaged utilities converge to the equilibrium utilities w.r.t. a static market with item supplies being the unknown distribution and that buyers’ time-averaged expenditures converge to their per-period budget. Hence, many desirable properties of market equilibrium-based fair division such as no envy, Pareto optimality, and the proportional-share guarantee are also attained asymptotically in the online setting. Next, we extend the dynamics to handle quasilinear buyer utilities, which gives the first online algorithm for computing first-price pacing equilibria. Finally, numerical experiments on real and synthetic datasets show that the dynamics converges quickly under various metrics.

1 Introduction

A market is said to be in equilibrium when supply is equal to demand. Computing prices and allocations which constitute a market equilibrium (ME) has long been a topic of interest [43, 28, 38, 20, 17, 31]. Most existing work focuses on the case of static markets. However, in this paper we consider the case of online markets where items arrive sequentially. We consider the extension of market equilibrium to this setting and provide market dynamics which quickly converge to an equilibrium in the case of online Fisher markets.

In static Fisher markets there is a fixed supply of each item, individual preferences are linear, additive, and items are divisible (or equivalently, randomization is allowed so individuals can purchase not just items but lotteries over items). In general, finding market equilibria is a hard problem [14, 47, 39]. However, in static linear Fisher markets, equilibrium prices and allocations can be computed via solving the Eisenberg-Gale (EG) convex program [22, 37].

We consider an online extension of Fisher markets where buyers are constantly present but items arrive one-at-a-time. Buyers’ budgets are per-period and represent their respective ‘bidding powers’ instead of being binding constraints. We extend the definition of market equilibrium to the online setting: online equilibrium allocations and prices are time-indexed and, when averaged across time, form an equilibrium in a corresponding static Fisher market where item supplies are proportional to item arrival probabilities. Due to the stochastic nature of online Fisher markets, any online algorithm can only attain an online market equilibrium asymptotically, that is, the allocations and prices approximately satisfy the equilibrium conditions after running the algorithm for a long time.

We propose market dynamics that find these equilibria in an online fashion based on the dual averaging algorithm applied to a reformulation of the dual of the EG convex program. We refer to this mechanism as PACE (Pace According to Current Estimated utility). In PACE, each buyer is assigned a utility pacing multiplier at time $0$ . When an item arrives, the individual with the highest adjusted utility (its valuation times the multiplier) receives that item and pays a price equal to its adjusted utility. The pacing multipliers of all individuals are then adjusted according to a closed-form rule which is given by the time average of the subgradient of the dual of the EG program. Intuitively, the pacing multipliers of those that did not receive the item go up while the receiver’s typically (but not always) goes down. We show that PACE yields item allocations and prices that satisfy various equilibrium properties asymptotically, for example no-regret and envy-freeness.

One important application of market equilibrium is fair allocation using the competitive equilibrium from equal incomes (CEEI) mechanism [46, 12]. In CEEI, each individual is given an endowment of faux currency and reports her valuations for items; then, a market equilibrium is computed and the items are allocated accordingly. However, many fair division problems are online rather than static. These include the allocation of impressions to content in certain recommender systems [34], workers to shifts, donations to food banks [2], scarce compute time to requestors [25, 40, 29], or blood donations to blood banks [32]. Similarly, online advertising can also be thought of as the allocation of impressions to advertisers via a market though with a budget of real money rather than faux currency. In the static CEEI case with linear additive preferences, the resulting equilibrium outcomes (i.e. results of the EG program) have been described as “perfect justice” [3]. In the online case, PACE achieves the same fair allocations as CEEI asymptotically. See Appendix A for more related work in the areas of (static and online) equilibrium computation and fair division.

We evaluate PACE experimentally in several market datasets. Convergence to good outcomes happens quickly in experiments. Taken together our results, we conclude that PACE is an attractive algorithm for both computing online market equilibria and online fair division.

Main contributions.

We consider the problem of allocating and pricing sequentially arriving items to $n$ buyers. This setting is termed as an online Fisher market. Given a sequence of item arrivals, we define an online market equilibrium as the items’ allocations and prices that, in hindsight, ensure buyer optimality and market clearance. We propose the PACE dynamics, which can be viewed as a nontrivial instantiation of the dual averaging algorithm on a reformulation of the dual of the Eisenberg-Gale convex program. Leveraging the convergence theory of dual averaging, we show that, when item arrivals are drawn from an (unknown) underlying distribution $s$ , possibly over an infinite/continuous item space, PACE ensures the following.

•

The pacing multipliers generated by PACE converge to the static equilibrium utility prices. Here, “static” means w.r.t. to an underlying static Fisher market.
•

Buyers’ time-averaged utilities converge to the static equilibrium utilities.
•

Buyers’ time-averaged expenditures converge to their respective budgets.

These convergences are all in mean square with rates $O((\log t)/t)$ , $O((\log t)/t)$ and $O((\log t)^{2}/t)$ , respectively, where the constants in these rates involve moderate polynomials of $n$ . In this way, PACE generates allocations and prices that constitute an online market equilibrium in the limit. In particular, the allocations and prices ensure that the allocation is Pareto optimal, and buyers have no regret, no envy, and get at least their proportional share asymptotically. We also extend PACE to the case of quasilinear buyer utilities, which yields the first online algorithm for computing first-price pacing equilibria. Finally, numerical experiments suggest that PACE converges much faster than its theoretical rates in terms of pacing multipliers, utilities and expenditures.

2 Static and Online Fisher Markets

Static Fisher markets and equilibria.

We first introduce static Fisher markets and their equilibria. Following the recent work [24, §2], we consider a measurable (possibly continuous) item space. Below are the technical preliminaries for the subsequent online setting. They can be skimmed through and referred back to as needed.

From now on, we define $[k]:=\{1,\dots,k\}$ for any $k\in\mathbb{N}:=\{0,1,2,\dots\}$ and $\mathbb{R}_{+}$ ( $\mathbb{R}_{++}$ , resp.) as the set of nonnegative (positive, resp.) real numbers. Let $\mathbb{I}\{A\}\in\{0,1\}$ denote the indicator function of an event $A$ .

(a)

There are $n$ buyers (individuals), each having a budget $B_{i}>0$ .
(b)
The item space is a finite measurable space $(\Theta,\mathcal{M},\mu)$ with $0<\mu(\Theta)<\infty$ , where $\Theta$ is a ( $\mu$ -)measurable subset of $\mathbb{R}^{d}$ , $\mathcal{M}$ is a $\sigma$ -algebra and $\mu:\mathcal{M}\rightarrow\mathbb{R}_{+}$ is a (finite) measure. From now on, $L^{p}$ (and $L^{p}_{+}$ , resp.) denote the set of (nonnegative, resp.) $L^{p}$ functions on $\Theta$ for any $p\in[1,\infty]$ (including $p=\infty$ ). Below are some concrete special cases for illustration.
1. (i)
  
  Finite: $\Theta=[m]$ , $\mathcal{M}=2^{[m]}=\{A:A\subseteq[m]\}$ and $\mu(A)=\sum_{a\in A}\mu(a)$ (all $2^{m}$ subsets are measurable and the measure is given by a point mass on each item).
2. (ii)
  
  Lebesgue-measurable: $\mu$ is the Lebesgue measure on $\mathbb{R}^{d}$ , $\mathcal{M}$ is the Lebesgue $\sigma$ -algebra and $\Theta$ is a (Lebesgue-)measurable subset of $\mathbb{R}^{d}$ with positive finite measure. For example, $\Theta$ can be a compact subset of $\mathbb{R}^{d}$ with a nonempty interior.
3. (iii)
  
  Countably infinite: $\Theta=\mathbb{N}$ and $\mu(A)=\sum_{a\in A}\mu(a)$ for any $A\subseteq\mathbb{N}$ , where $\mu(\mathbb{N})<0$ . For example, $\mu(a)$ can be the probability mass of a Poisson distribution, in which case $(\mathbb{N},\mathcal{M},\mu)$ is a probability space.
(c)

The supplies of items is $s\in L^{\infty}_{+}$ , i.e., item $\theta\in\Theta$ has supply $s(\theta)$ . Since $\Theta$ is compact, it is measurable with a finite measure. For the finite case $\Theta=[m]$ , we have $s=(s_{1},\dots,s_{m})\in\mathbb{R}^{m}_{+}$ .
(d)

The valuation of each buyer $i$ on all items is $v_{i}\in L^{1}_{+}$ , i.e., buyer $i$ has valuation $v_{i}(\theta)$ on item $\theta\in\Theta$ . For the finite case $\Theta=[m]$ , we have $v_{i}=(v_{i1},\dots,v_{im})\in\mathbb{R}^{m}_{+}$ .
(e)

For buyer $i$ , an allocation of items $x_{i}\in L^{\infty}_{+}$ gives a utility of

$u_{i}(x_{i}):=\langle v_{i},x_{i}\rangle:=\int_{\Theta}v_{i}(\theta)x_{i}(\theta)d\theta,$

where the angle brackets are based on the notation of applying a bounded linear functional $x_{i}$ to a vector $v_{i}$ in the Banach space $L^{1}$ and the integral is the usual Lebesgue integral. For the finite case $\Theta=[m]$ , we have $x_{i}=(x_{i1},\dots,x_{im})\in\mathbb{R}^{m}_{+}$ and the utility is

$u_{i}(x_{i})=\langle v_{i},x_{i}\rangle=\sum_{j}v_{ij}x_{ij},$

the usual Euclidean vector inner product. We will use $x\in(L^{\infty}_{+})^{n}$ to denote the aggregate allocation of items to all buyers, i.e., the concatenation of all buyers’ allocations.
(f)

The prices of items are modeled as $p\in L^{1}_{+}$ ; in other words, the price of item $\theta\in\Theta$ is $p(\theta)$ . For the finite case $\Theta=[m]$ , we have $p=(p_{1},\dots,p_{m})\in\mathbb{R}^{m}_{+}$ .
(g)

For a measurable item subset $A\subseteq\Theta$ , let $v_{i}(A):=\int_{A}v_{i}(\theta)d\theta$ (and similarly for $p(A)$ and $s(A)$ ), the $v_{i}$ -induced measure of $A$ . For the finite case $\Theta=[m]$ , for any item subset $A\subset[m]$ , $v_{i}(A)=\sum_{j\in A}v_{ij}$ (and similarly for $p(A)$ and $s(A)$ ).
(h)

Without loss of generality, we assume a unit total budget $\|B\|_{1}=1$ , a unit total supply $s(\Theta)=1$ and normalized buyer valuations $\langle v_{i},s\rangle=1$ . In other words, all items have a total value of $1$ for every buyer.

Definition 1.

Given item prices $p\in L^{1}_{+}$ , the demand of buyer $i$ is its set of utility-maximizing allocations given the prices and budget:

D_{i}(p):=\operatorname*{arg\,max}\{\langle v_{i},x_{i}\rangle:x_{i}\in L^{\infty}_{+},\,\langle p,x_{i}\rangle\leq B_{i}\}.

The associated utility level $\hat{U}_{i}(p)$ is defined as the value of $\langle v_{i},x_{i}\rangle$ for any $x_{i}\in D_{i}(p)$ .

Definition 2.

A market equilibrium (ME) is an allocation-price pair $(x^{*},p^{*})\in(L^{\infty}_{+})^{n}\times L^{1}_{+}$ such that the following holds.

(i)

Supply feasibility: $\sum_{i}x^{*}_{i}\leq s$ .
(ii)

Buyer optimality: $x^{*}_{i}\in D_{i}(p^{*})$ for all $i$ .
(iii)

Market clearance: $\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0$ (any item with a positive price is fully allocated).

In the above definition and subsequently, all equations involving measurable functions are understood as “holding almost everywhere.” For example, $\sum_{i}x_{i}\leq s$ means the (measurable) set $\{\theta\in\Theta:\sum_{i}x_{i}(\theta)\leq s(\theta)\}$ has the same measure as $\Theta$ . Given a ME $(x^{*},p^{*})$ , we often denote the (unique) equilibrium utilities as $u^{*}_{i}=\langle v_{i},x^{*}_{i}\rangle$ . For a finite-dimensional linear Fisher market, it is well known that a ME can be computed via solving the EG convex program. Recently, [24] generalized this framework to handle the case of an infinite item space. More specifically, consider the following (possibly infinite-dimensional) convex programs.

\displaystyle\sup_{x\in(L^{\infty}_{+})^{n}}\sum_{i}B_{i}\log\langle v_{i},x_{i}\rangle\ \ {\rm s.t.}\ \sum_{i}x_{i}\leq s.

(

\mathcal{P}_{\rm EG}

)

\displaystyle\begin{split}\inf_{p\in L^{1}_{+},\,\beta\in\mathbb{R}^{n}_{+}}\left(\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}\right)\ \ {\rm s.t.}\ p\geq\beta_{i}v_{i},\ \forall\,i.\end{split}

(

\mathcal{D}_{\rm EG}

)

The following theorem summarizes the results in [24, §3] regarding the above convex programs capturing market equilibria. As shown in that work, the above convex programs satisfy strong duality and their optimal solutions (which correspond to ME) can be characterized by the KKT optimality conditions. We slightly generalize the assumptions of [24] by allowing non-uniform item supplies $s$ instead of $s(\theta)=1$ for all $\theta\in\Theta$ . For completeness, a proof, which is mainly based on the proofs of the results in [24, §3], can be found in the Appendix.

Theorem 1.

The following hold regarding ( $\mathcal{P}_{\rm EG}$ ) and ( $\mathcal{D}_{\rm EG}$ ).

•

Both suprema are attained.
•

Given $x^{*}$ feasible to ( $\mathcal{P}_{\rm EG}$ ) and $(p^{*},\beta^{*})$ feasible to ( $\mathcal{D}_{\rm EG}$ ), they are both optimal if and only if the following holds: (i) $\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0$ (market clearance), (ii) $\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0$ (buyer $i$ only receives items within its ‘winning set’ $\{p^{*}=\beta^{*}_{i}v_{i}\}$ ) (ii) and $\langle v_{i},x^{*}_{i}\rangle=u^{*}_{i}:=B_{i}/\beta^{*}_{i}$ (buyer $i$ gets its maximum possible utility from $x^{*}_{i}$ ). In this case, $(x^{*},p^{*})$ is a ME.
•

Conversely, for a ME $(x^{*},p^{*})$ , it holds that (i) $x^{*}$ is an optimal solution of ( $\mathcal{P}_{\rm EG}$ ) and (ii) $(p^{*},\beta^{*})$ , where $\beta^{*}_{i}:=B_{i}/\langle v_{i},x^{*}_{i}\rangle$ , is an optimal solution of ( $\mathcal{D}_{\rm EG}$ ).

In the above theorem, $\beta^{*}_{i}$ is known as buyer $i$ ’s utility price, i.e., price per unit utility at equilibrium. As is well known, in a ME $(x^{*},p^{*})$ , the allocations $x^{*}$ are

•

Pareto optimal,
•

envy-free (in a budget weighted sense, i.e., $\langle v_{i},x^{*}_{i}\rangle/B_{i}\geq\langle v_{i},x^{*}_{k}\rangle/B_{k}$ for all $k\neq i$ ),
•

proportional (i.e., $\langle v_{i},x^{*}_{i}\rangle\geq\langle v_{i},s\rangle/n=1/n$ ); see, e.g.,[24, Theorem 3].

Online Fisher markets and equilibria.

We now consider a simple online variant of the Fisher market setting, referred to as an online Fisher market (OFM). There are $n$ buyers, each with a valuation $v_{i}\in L^{1}_{+}$ . Assume there are discrete time steps $t=1,2,\dots$ . At each time step $t$ , an item $\theta_{t}$ arrives and each buyer $i$ sees a value $v_{i}(\theta_{t})$ . The item must be allocated irrevocably to one buyer. Each buyer $i$ has a budget $B_{i}>0$ representing her per-period expenditure rate. ¹¹1This assumption is similar to one made in the literature on budget management in auctions, where each buyers has a per-period expenditure rate and the overall budget equal to the rate times the number of time periods. If a hard budget cap across all time periods is desired, then PACE and similar mechanisms may deplete some buyers’ budgets close to the end of the horizon [8, 6, 7].

Next, we introduce the notions of demand, utility level and online market equilibrium in an OFM. All of them are defined based on sequences of arrived items and their prices; they do not require any distributional assumption on the item arrivals.

Definition 3.

Let the arrived items be $(\theta_{\tau})_{\tau\in[t]}$ . An allocation (of arrived items) is $(x^{\tau}_{i})_{(\tau,i)\in[t]\times[n]}$ , where $x^{\tau}_{i}\in[0,1]$ is the fraction of the item $\theta_{\tau}$ allocated to buyer $i$ .²²2We allow fractional allocations in the definition for more generality. As we will see, fractional allocation is not needed: PACE generates allocations and prices that satisfy the OME conditions asymptotically via assigning each arrived item to one buyer. Let the prices of the arrived items be $(p^{\tau}(\theta_{\tau}))_{\tau\in[t]}$ . The demand of each buyer $i$ (in hindsight) at time $t$ is

\displaystyle D^{t}_{i}=\operatorname*{arg\,max}_{(z^{\tau}_{i})_{\tau\in[t]}}\left\{\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}:0\leq z^{\tau}_{i}\leq 1,\,\forall\,\tau,\ \frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{\tau})z^{\tau}_{i}\leq B_{i}\right\}.

(1)

Let $\hat{U}^{t}_{i}$ be the utility level associated with this demand, i.e., the maximum value in (1). An online market equilibrium (OME) is a pair of allocations $(x^{\tau}_{i})_{(\tau,i)\in[t]\times[n]}$ and prices $p^{\tau}(\theta_{\tau})$ such that the following holds.

(i)

Total allocation does not exceed the unit amount of the item $\sum_{i}x^{\tau}_{i}\leq 1$ for all $\tau$ .
(ii)

Buyers’ realized allocations are optimal in hindsight: $(x^{\tau}_{i})_{\tau\in[t]}\in D^{t}_{i}$ for all $i$ .
(iii)

Market clearance: $\sum_{i}x^{\tau}_{i}=1$ for $\tau$ such that $p^{\tau}(\theta_{\tau})>0$ .

In words, $\hat{U}^{t}_{i}$ is the maximum possible (time-averaged) utility buyer $i$ could have attained via choosing from the arrived items $(\theta_{\tau})_{\tau\in[t]}$ in hindsight, subject to their respective posted prices $(p^{\tau}(\theta_{\tau}))_{\tau\in[t]}$ and her current total budget $tB_{i}$ , with $D^{t}_{i}$ being the set of such utility-maximizing (time-indexed) allocations subject to per-period item availability constraints. An OME is a pair of allocations and prices that make buyers optimal in hindsight and market cleared.

Given an OFM, we define the associated underlying static Fisher market as having the same $n$ buyers and an item space $\Theta$ with supply $s$ being the (unknown) distribution from which the arriving items $\theta_{t}$ are drawn. To clarify the concepts of OFM and OME, we consider some simple special cases.

•

Suppose all item arrivals $\theta_{1},\dots,\theta_{t}$ are known in advance. Then, the OFM is the same as a static $n\times t$ Fisher market with the same buyers and the $t$ items, each having a unit supply. Here, buyer $i$ ’s valuation of item $\tau$ is $v_{i\tau}=v_{i}(\theta_{\tau})$ . To compute an OME, it suffices to solve the classical (finite-dimensional) Eisenberg-Gale convex program, that is, ( $\mathcal{P}_{\rm EG}$ ) with $\Theta=[t]$ , $s=(1,\dots,1)\in\mathbb{R}^{t}_{+}$ and $x\in\mathbb{R}_{+}^{n\times t}$ . Let the static ME be $(x^{*},p^{*})\in\mathbb{R}_{+}^{n\times t}\times\mathbb{R}_{++}^{t}$ . When each item $\theta_{\tau}$ arrives, OME allocates a fraction $x^{*}_{i\tau}$ of the item to each buyer $i$ and set its price as $p^{*}_{\tau}$ .

•

Suppose the sequentially arriving items are drawn i.i.d. from a known underlying distribution $s\in L^{\infty}_{+}$ (which specifies a random variable $\theta\sim s$ such that $\mathbf{P}[\theta\in A]=s(A)$ for any measurable set $A\subseteq\Theta$ ) and all buyers’ valuations $v_{i}$ are known. Suppose we have also computed a static ME $(x^{*},p^{*})$ (Definition 2) of a market with buyer valuations $v_{i}$ , budgets $B_{i}$ and item supplies being the distribution $s$ (the underlying static market). Then, when a new item $\theta_{t}$ (which is drawn from the distribution $s$ ) arrives at time $t$ , set its price as $p^{*}(\theta_{t})$ and allocate a fraction $x^{*}_{i}(\theta_{t})/s(\theta_{t})$ of it to each buyer $i$ (assume $s(\theta_{t})>0$ , i.e., only items with positive supplies can appear). Then, the time-averaged utility of each buyer $i$ is $\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{t})x^{*}_{i}(\theta_{t})/s(\theta_{t})$ , which converges to

\mathbf{E}_{\theta\sim s}[v_{i}(\theta)x_{i}(\theta)/s(\theta)]=\int_{\Theta}v_{i}(\theta)x^{*}_{i}(\theta)d\theta=u^{*}_{i}\ {\rm a.s.}

by to the Strong Law of Large Numbers. Since the online process is carried out using static equilibrium prices and allocations, the static ME properties (Definition 2) ensure the required OME properties hold asymptotically.

The above special cases require full knowledge of either the exact future item arrivals or the underlying static market to attain an OME. Next, we propose a simple, distributed dynamics which generates allocations and prices that satisfy the OME conditions asymptotically without requiring such knowledge (in particular, without knowledge of the distribution $s$ ).

3 The PACE Dynamics

In this section, we introduce the PACE (Pacing According to Current Estimated utility) dynamics that prices and allocates sequentially arriving items via (i) maintaining a pacing multiplier for each buyer and (ii) simple, distributed updates.³³3Pacing and pacing multipliers are terminology in budget management in large-scale ad auctions [18, 19]. In §5, we will show that PACE is an instantiation of dual averaging [48], a stochastic first-order method for regularized optimization, applied to a reformulation of ( $\mathcal{D}_{\rm EG}$ ).

In the PACE dynamics, each buyer maintains a pacing multiplier $\beta^{t}_{i}$ (with an initial value $\beta^{0}_{i}=(B_{i}+1)/2$ , or any value in $[B_{i},1]$ ). At time step $t$ , the following events take place.

(a)

An item $\theta_{t}$ appears and each buyer $i$ sees a value $v_{i}(\theta_{t})$ for the item.
(b)

Each buyer $i$ bids their paced value $\beta^{t}_{i}v_{i}(\theta_{t})$ for the item.
(c)

The item is allocated to the highest bidder (the winner at $t$ ): $i_{t}=\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t})$ , with ties broken arbitrarily. For concreteness, we always choose the lowest winning index, i.e.,

$i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t}).$

Then, the price of $\theta_{t}$ is set by the first-price rule

$p^{t}(\theta_{t})=\max_{i}\beta^{t}_{i}v_{i}(\theta_{t})=\beta^{t}_{i_{t}}v_{i}(\theta_{t})$

and the winner $i_{t}$ pays this price $p^{t}(\theta_{t})$ for the item $\theta_{t}$ .
(d)

Each buyer $i$ gets a utility

$u^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}.$

In other words, the winner $i_{t}$ gets $v_{i_{t}}(\theta_{t})$ and other buyers get zero.
(e)

Each buyer $i$ updates its cumulative average utility $\bar{u}^{t}_{i}$ :

$\bar{u}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}u^{\tau}_{i}=\frac{t-1}{t}\bar{u}^{t-1}_{i}+\frac{1}{t}u^{t}_{i}.$
(f)

Each buyer $i$ updates their pacing multiplier $\beta^{t+1}_{i}$ as follows:

$\beta^{t+1}_{i}=\Pi_{[l_{i},h_{i}]}(B_{i}/\bar{u}^{t}_{i}):=\min\{\max\{l_{i},B_{i}/\bar{u}^{t}_{i}\},h_{i}\}.$

where $l_{i}=B_{i}/(1+\delta_{0})$ and $h_{i}=1+\delta_{0}$ for some fixed $\delta_{0}>0$ (e.g., $\delta_{0}=0.05$ ).

As will be seen in §5, buyer $i$ ’s equilibrium pacing multiplier (utility price) satisfies $l_{i}<\beta^{*}_{i}<h_{i}$ and her per-period utility $u^{t}_{i}$ corresponds to the $i$ th component of a stochastic subgradient of a function on $\beta$ in a reformulation of the convex program ( $\mathcal{D}_{\rm EG}$ ), on which we run dual averaging. Furthermore, the update rule for $\beta_{i}^{t+1}$ is such that, if the realized utilities $\bar{u}^{t}_{i}$ were the true static equilibrium utility for buyer $i$ , then $\beta_{i}^{t+1}$ would be the equilibrium multiplier. Note that PACE does not randomize (any randomness can only come from the market environment from which item arrivals are drawn) and assigns every item to a single buyer without splitting it.

The simplicity and distributed nature of PACE makes it desirable for large-scale practical use.

•

It can be run on arbitrary sequential item arrivals and only requires buyers’ valuations $v_{i}(\theta_{t})$ on the arrived items (rather than all valuations $v_{i}$ over the potentially large item space). No parameter tuning is needed (in particular, no stepsize tuning as in many first-order optimization methods).
•

When run as a centralized allocation mechanism, PACE only needs to maintain $O(n)$ scalars, namely, $\beta^{t}_{i}$ , $B_{i}$ and $\bar{u}^{t}_{i}$ for all $i$ . At time $t$ , it observes buyers’ valuations $v_{i}(\theta_{t})$ of the item $\theta_{t}$ , compute bids $\beta^{t}_{i}v_{i}(\theta_{t})$ , finds the winner $i_{t}$ , set the price as the maximal bid $\beta^{t}_{i_{t}}v_{i_{t}}(\theta_{t})$ and allocates the item to the winner; finally, it updates $\bar{u}^{t}$ and $\beta^{t+1}$ as in f, which takes $O(n)$ time.
•

PACE can also be run among the buyers in a decentralized manner, in which case each buyer only maintains two scalar values: the pacing multiplier $\beta^{t}_{i}$ and time-averaged utility $\bar{u}^{t}_{i}$ . When a new item arrives, each buyer only performs a few simple arithmetic operations to create a bid $\beta^{t}_{i}v_{i}(\theta_{t})$ , receives her utility (if she wins) and subsequently updates $\bar{u}^{t}_{i}$ and $\beta^{t+1}_{i}$ .

These make PACE suitable for Internet-scale online fair division and online Fisher market applications. In particular, it is very reminiscent of how Internet advertising auctions are run. There, a similar auction-based system is used, with the pacing multiplier ensuring that each advertiser smooths out their budget expenditure across the many auctions. The primary difference between this and our setting is that (i) the auction can be first-price or second-price and (ii) buyers usually have quasilinear utilities, that is, utility of the item minus the expenditure (price paid) [18, 8, 6, 7]. In §C, we extend PACE to quasilinear utilities, which provides a novel online algorithm for first-price pacing equilibrium computation [19].

4 Dual Averaging

In this section, we briefly recap the setup and general convergence results of dual averaging [48, 35], which will be used in the analysis of PACE. First, we introduce some notation for this and subsequent sections. Let $\mathbf{e}^{(i)}$ denote the $i$ ’th unit basis vector in $\mathbb{R}^{n}$ and $\mathbf{1}\in\mathbb{R}^{n}$ denote the vector of $1$ ’s. For $x,y\in\mathbb{R}^{n}$ , $[x,y]$ denotes the Cartesian product of intervals $\prod_{i=1}^{n}[x_{k},y_{k}]\subseteq\mathbb{R}^{n}$ . All norms $\|\cdot\|$ without a subscript are Euclidean $2$ -norms, unless otherwise stated.

Let $\Psi$ be a closed convex function with domain ${\rm dom}\,\Psi:=\{w\in\mathbb{R}^{n}:\Psi(w)<\infty\}$ . Let $Z\subseteq\mathbb{R}^{d}$ be an arbitrary sample space. For each $z\in Z$ , let $f_{z}$ be a convex and subdifferentiable function on ${\rm dom}\,\Psi$ . Considers the following regularized convex optimization problem [48, §1.1]:

\displaystyle\min_{w}\mathbf{E}f_{z}(w)+\Psi(w),

(2)

where the expectation is taken over a probability distribution $\mathcal{D}$ on $Z$ . A more general online optimization setting, as described in [48, §1.2] is as follows. At each time $t=1,2,3,\dots$ , we must choose an action $w^{t}$ before a new, unknown convex loss function $f_{t}$ arrives, which incurs a loss $f_{t}(w^{t})$ (a special case is i.i.d. sampled functions, i.e., $f_{t}=f_{z_{t}}$ , where $z_{t}$ are i.i.d. samples drawn from $\mathcal{D}$ ). The goal is to minimize regret when comparing our sequence of actions $w^{1},w^{2},\ldots$ to any fixed action $w$ . Here, the regret against $w$ is defined as

R_{t}(w):=\sum_{\tau=1}^{t}\left(f_{\tau}(w^{\tau})+\Psi(w^{\tau})\right)-\sum_{\tau=1}^{t}(f_{\tau}(w)+\Psi(w))

and the overall (maximal) regret up to time $t$ is $R_{t}=\max_{w}R_{t}(w)$ . We assume access to an oracle that, given any $f_{t}$ and $w\in{\rm dom}\,\Psi$ , returns a subgradient $g^{t}\in\partial f_{t}(w)$ . The dual averaging algorithm (DA) [48, Algorithm 1] is as follows. First, set $w_{1}\in{\rm dom}\,\Psi$ and $\bar{g}^{0}=0$ . Then, for each $t=1,2,\dots$ , DA performs the following steps:

(1)

Observe $f_{t}$ and compute $g^{t}\in\partial f_{t}(w^{t})$ .
(2)

Update the average subgradient (the dual average) via $\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}\bar{g}^{t}$ .
(3)

Compute the next iterate $w^{t+1}=\operatorname*{arg\,min}_{w}\{\langle\bar{g}^{t},w\rangle+\Psi(w)\}$ .

Here, we do not employ any auxiliary regularizing function, since our problem has a natural source of strong convexity (i.e., a strongly convex $\Psi$ ) through the $-B_{i}\log\beta_{i}$ terms in ( $\mathcal{D}_{\rm EG}$ ). The following convergence guarantee on DA is proved as part of the proof of Corollary 4 in [48].

Theorem 2.

Dual averaging generates iterates $w^{t}$ such that

\mathbf{E}\|w^{t}-w^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where $G^{2}$ is an upper bound on $\mathbf{E}\|g^{t}\|^{2}$ , $t=1,2,\dots$ and $\sigma$ is the strong convexity modulus of $\Psi$ .

When solving the stochastic optimization problem (2), in Theorem 2, we can set $G^{2}$ to be an upper bound on $\sup_{w\in{\rm dom}\,\Psi}\mathbf{E}\|g_{z}(w)\|^{2}$ , where $g_{z}(w)$ is a subgradient oracle mapping each $(z,w)\in Z\times{\rm dom}\,\Psi$ to a subgradient and the expectation is over $z\sim\mathcal{D}$ and possible randomness of the subgradient oracle. We will shortly see that a reformulation of $\mathcal{D}_{\rm EG}$ , when cast into the form (2), exhibits stochastic subgradients that are exactly buyers’ received utilities in each time step. Using Theorem 2, we can show that the sequence of pacing multipliers $\beta^{t}$ generated by PACE converges to the underlying (equilibrium) utility prices $\beta^{*}$ of the static Fisher market.

5 Convergence analysis of the PACE dynamics

We will now show that PACE correspond to running DA on the vector $\beta^{t}$ of pacing multipliers for the buyers. To this end, we first reformulate ( $\mathcal{D}_{\rm EG}$ ) into a (finite-dimensional) convex program in $\beta$ in the form of (2):

\displaystyle\min_{\beta}\ \left(\langle\max_{i}\beta_{i}v_{i},s\rangle-\sum_{i}B_{i}\log\beta_{i}\right)\ {\rm s.t.}\ \beta\in[B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}],

(3)

where $\delta_{0}>0$ is an arbitrarily small constant. The bounds on $\beta$ do not change the optimal solution, because $\beta^{*}_{i}\in(B_{i},1)$ for each $i$ . Detailed steps of the reformulation are given in Appendix B.

In order to run DA, we need to compute a subgradient of $f_{\theta}:\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta)$ at any $\theta\in\Theta$ . Following [24, §5], since $f_{\theta}$ is a piecewise linear function, a subgradient is

g_{\theta}(\beta):=v_{i^{*}}(\theta)\mathbf{e}^{(i^{*})}\in\partial f_{\theta}(\beta),

where $i^{*}=\min\operatorname*{arg\,max}_{i}\beta_{i}v_{i}(\theta)$ is the winner (see, e.g., [10, Theorem 3.50]).

We can now show that the PACE dynamics corresponds to running DA on (3). First, choose an arbitrary $\beta^{0}\in(B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1})$ and $\bar{g}^{0}=0$ . At each time step $t=1,2,\dots$ , given the current pacing multiplier $\beta^{t}$ , DA applied to (3) unrolls the following steps.

•

An item $\theta_{t}$ arrives, having value $v_{i}(\theta_{t})$ for each buyer $i$ . The function $f_{t}$ in DA is

$f_{\theta_{t}}:\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta_{t}).$
•

The winner is $i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t})$ and a subgradient is

$g^{t}=v_{i_{t}j_{t}}\mathbf{e}^{(i_{t})}\in\partial f_{t}(\beta^{t}).$

Its $i$ th entry is exactly the realized (single-period) utility of individual $i$ at time $t$ in PACE, that is, $g^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}=u^{t}_{i}$ .
•

Update the dual average (time-averaged utilities): for each $i$ , compute $\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}g^{t}$ , i.e.,

$\bar{g}^{t}_{i}=\frac{t-1}{t}\bar{g}^{t-1}_{i}+\frac{1}{t}v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}.$

•

Update the pacing multipliers:

\beta^{t+1}=\operatorname*{arg\,min}_{\beta\in[B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}]}\left\{\langle\bar{g}^{t},\beta\rangle-\sum_{i}B_{i}\log\beta_{i}\right\}.

The minimization problem is separable in each $i$ and exhibits a simple and explicit solution which recovers step f in PACE (where $\bar{g}^{t}_{i}=\bar{u}^{t}_{i}$ ):

\beta^{t+1}_{i}=\operatorname*{arg\,min}_{\beta_{i}\in[B/(1+\delta_{0}),1+\delta_{0}]}\left\{\bar{g}^{t}_{i}\beta_{i}-B_{i}\log\beta_{i}\right\}\ \Rightarrow\ \beta^{t+1}_{i}=\Pi_{[B_{i}/(1+\delta_{0}),1+\delta_{0}]}\left(\frac{B_{i}}{\bar{u}^{t}_{i}}\right).

As mentioned earlier, PACE does not require a stepsize parameter. This is because DA is stepsize-free given a strongly convex regularizer $\Psi$ , which is indeed the case in our reformulation (3). In addition, in the above update step for $\beta^{t+1}_{i}$ , the directions of change are as follows.

•

For a non-winner $i\neq i_{t}$ , we have $u^{t}_{i}=0$ and hence $\bar{u}^{t}_{i}\leq\bar{u}^{t-1}_{i}$ . This implies $\beta^{t+1}_{i}\geq\beta^{t}_{i}$ . In words, a non-winner’s pacing multiplier weakly increases. The increase is strict if $\bar{u}_{i}^{t-1}>0$ , i.e., buyer $i$ has already received a nonzero utility.
•

For the winner $i_{t}$ , $\bar{u}^{t}_{i_{t}}$ may become greater than $\bar{g}^{t-1}_{i_{t}}$ , in which case $\beta^{t+1}_{i_{t}}\leq\beta^{t}_{i_{t}}$ . In words, the winner’s pacing multiplier may go up or down.

In order to analyze PACE, we assume $v_{i}(\Theta)=1$ , $v_{i}\in L^{\infty}_{+}$ (normalized and a.e.-bounded valuations)⁴⁴4The a.e.-boundedness assumption is needed in subsequent convergence analysis. Since $\Theta$ has a finite measure, it holds that $L^{\infty}_{+}\subseteq L^{1}_{+}$ . For a finite item space $\Theta=[m]$ , both are equal to $\mathbb{R}^{m}_{+}$ . and that there is an underlying item distribution $s\in L^{\infty}_{+}$ from which the item arrivals $\theta_{t}$ , $t=1,2,\dots$ are drawn i.i.d.⁵⁵5The distributional assumption on item arrivals (i.e., they are drawn i.i.d. from an unknown distribution $s$ ) is needed to establish asymptotic equilibrium properties of PACE. See Appendix B for an example that any algorithm can yield arbitrarily suboptimal allocations without such a distributional assumption. Define the underlying static Fisher market as one having the same $n$ buyers (each with valuation $v_{i}$ and budget $B_{i}$ ) and item supplies $s$ . Denote the equilibrium utilities and utility prices w.r.t. the underlying static market as $u^{*}$ and $\beta^{*}$ , respectively. We further assume that the valuations are $v_{i}\in L^{\infty}_{+}$ (i.e., a.e.-bounded on the item space). This is not restrictive: since an individual item $\theta$ has value $v_{i}(\theta)$ for each buyer $i$ , it should be a finite value.

Convergence of pacing multipliers.

After aligning PACE with DA, the convergence of the pacing multipliers $\beta^{t}$ follows directly from Theorem 2.

Theorem 3.

PACE generates pacing multipliers $\beta^{t}$ , $t=1,2,\dots$ such that

\mathbf{E}\|\beta^{t}-\beta^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where $G^{2}=\max_{i}\mathbf{E}_{\theta\sim s}[v_{i}(\theta)^{2}]\leq\max_{i}\|v_{i}\|_{\infty}^{2}$ , $\sigma=\frac{\min_{i}B_{i}}{(1+\delta_{0})^{2}}$ .

In other words, we have mean-square convergence of $\beta^{t}$ to $\beta^{*}$ at a $O((\log t)/t)$ rate. Since $\|B\|_{1}=1$ , we have $\min_{i}B_{i}\leq 1/n$ . Hence, $\sigma=O(1/n)$ and the constant in the bound is $\Omega(n^{2})$ . Whether such dependence on $n$ can be improved via new analysis remains an interesting research question.

Convergence of utilities.

We next show that the time-averaged utility $\bar{u}^{t}$ (which is equal to the dual average $\bar{g}^{t}$ ) converges to the equilibrium utility vector $u^{*}$ of the underlying Fisher market. A key step in the proof is to bound the probability of a projection in updating $\beta^{t+1}_{i}$ , that is, $\mathbf{P}[B_{i}/\bar{u}^{t}_{i}\notin[l_{i},u_{i}]]$ .

Theorem 4.

For each $i$ , let $\epsilon_{i}:=\min\{h_{i}-\beta^{*}_{i},\beta^{*}_{i}-l_{i}\}>0$ be the minimum distance to the endpoints of the pacing-multiplier interval and $\|v\|_{\infty}:=\max_{i}\|v_{i}\|_{\infty}$ . It holds that

\mathbf{E}(\bar{u}^{t}_{i}-u^{*}_{i})^{2}\leq\left(\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Hence, letting $C=\frac{1}{(\min_{i}B_{i})^{2}}\left((\|v\|_{\infty}/\delta_{0})^{2}+(1+\delta_{0})^{2}\right)$ , we have

\mathbf{E}\|\bar{u}^{t}-u^{*}\|^{2}\leq C\cdot\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}.

Note that $C=\Omega(n^{2})$ . Hence, in this and the next theorems, the constant in the bound is $\Omega(n^{4})$ , which arises from $C$ and $\sigma=O(1/n)$ .

Convergence of expenditures.

The expenditure of buyer $i$ at time step $t$ is

b^{t}_{i}=\beta^{t}_{i}v_{i}(\theta_{t})\mathbb{I}\{i=i_{\tau}\}.

In other words, only the winner $i_{t}$ spends a nonzero amount, which is its bid. Let $\bar{b}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}b^{\tau}_{i}$ be buyer $i$ ’s average expenditure. Utilizing the above convergence results, we show mean-squared convergence of $\bar{b}^{t}$ to $B$ at an $O((\log t)^{2}/t)$ rate.

Theorem 5.

For each $i$ , it holds that

\mathbf{E}(\bar{b}^{t}_{i}-B_{i})\leq 2\left[(\beta^{*}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+2\|v_{i}\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right].

For $t\geq 3$ and the constant $C$ defined in Theorem 4, we have

\mathbf{E}\|\bar{b}^{t}-B\|^{2}\leq\frac{2G^{2}}{t\sigma^{2}}\left(6(C+\|v\|_{\infty}^{2})+(C+6\|v\|_{\infty}^{2})\log t+\frac{\|v\|_{\infty}^{2}}{2}(\log t)^{2}\right).

PACE attains OME asymptotically.

Next, we show that PACE attains OME asymptotically, i.e., it generates allocations and prices that make buyers no-regret and envy-free in the limit (these notions will be clarified shortly). Let $x^{t}_{i}:=\mathbb{I}\{i=i_{t}\}$ denote whether buyer $i_{t}$ is the winner (i.e., whether she is allocated the item $\theta_{t}$ at time $t$ ) Utilizing Theorems 4 and 5, we can show that buyer $i$ ’s regret, that is, the difference between the maximum possible utility in hindsight $\hat{U}^{t}_{i}$ (Definition 3) and the realized utility $\bar{u}^{t}_{i}$ , vanishes as $t$ grows. The same holds for each buyer’s envy. In other words, at a large $t$ , in hindsight, no buyer prefers another buyer’s set of allocated items (up to a vanishing error).⁶⁶6In a static market, given an allocation $x\in\mathbb{R}^{n\times m}_{+}$ , the (maximum, budget-weighted) envy of buyer $i$ toward others’ bundles is $\rho_{i}(x)=\max_{k}\langle v_{i},x_{k}\rangle/B_{k}-\langle v_{i},x_{i}\rangle/B_{i}$ (see, e.g., [46, 12]). It is well-known that $\rho_{i}(x^{*})=0$ for all $i$ at equilibrium, a consequence of buyer optimality (Definition 2).

Theorem 6.

Denote

\xi^{t}_{i}=|\bar{u}^{t}_{i}-u^{*}_{i}|,\ \Delta^{t}_{i}=|\bar{b}^{t}_{i}-B_{i}|,\ \gamma_{t}=\frac{\|v\|_{\infty}}{t}\sum_{\tau=1}^{t}\|\beta^{\tau}-\beta^{*}\|_{\infty}.

Let $r^{t}_{i}:=\max\{\hat{U}^{t}_{i}-\bar{u}^{t}_{i},0\}$ be the regret of buyer $i$ at time $t$ . Then, it holds that

r^{t}_{i}\leq\xi^{t}_{i}+\gamma_{t}/B_{i},\ \mathbf{E}(r^{t}_{i})^{2}=O\left((\log t)^{2}/t\right).

Furthermore, let the envy of buyer $i$ (w.r.t. all other buyers) at time $t$ be

\rho^{t}_{i}=\max_{k}\bar{u}^{t}_{ik}/B_{k}-\bar{u}^{t}_{i}/B_{i},

where $\bar{u}^{t}_{ik}=\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})x^{\tau}_{k}$ is buyer $i$ ’s time-averaged utility given her own valuations and of buyer $k$ ’s allocations. Denote

\eta^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{t})-\beta^{\tau}_{i}v_{i}(\theta_{t}))x^{t}_{i}.

It holds that

\displaystyle\rho^{t}_{i}\leq\frac{1}{B_{i}}\left(\xi^{t}_{i}+\max_{k\neq i}\frac{\Delta^{t}_{k}+\eta^{t}_{k}}{B_{k}}\right)\ \ \text{and}\ \ \mathbf{E}(\eta^{t}_{i})^{2}\leq\frac{\|v\|_{\infty}^{2}G^{2}}{t\sigma^{2}}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right).

Hence, $\mathbf{E}(\rho^{t}_{i})^{2}=O\left((\log t)^{2}/t\right)$ .

In light of Definition 3, Theorem 6 shows that $(x^{\tau}_{i})_{(i,\tau)\in[n]\times[t]}$ is approximately optimal for buyer $i$ . Since PACE also clears the market, we conclude that it attains OME asymptotically. Recall that theorem 4 ensures that buyers’ $\bar{u}^{t}_{i}$ converge to their static equilibrium utilities $u^{*}_{i}$ . Since the latter satisfy Pareto optimality and proportional share guarantee ( $u^{*}_{i}\geq B_{i}$ for all $i$ ), so are the time-averaged realized utilities in the limit. Together with Theorem 6, we conclude that PACE achieves the said fairness and efficiency guarantees, namely, Pareto optimality, envy-freeness and proportional-share guarantee, asymptotically.

6 Experiments

We evaluate the PACE dynamics in several real and synthetic datasets, namely, MovieLens, Household Items and an infinite-dimensional market instance with item space $\Theta=[0,1]$ and $v_{i}$ being linear functions on $[0,1]$ . For the first two datasets, see [31] for more information and exploratory data analysis. For all datasets, we consider the CEEI (fair division) setting where $B_{i}=1/n$ for all $i$ . For each dataset (with number of buyers $n=1500,2876,100$ , respectively), we run PACE for $T=10n$ time steps (iterations). More details on the experiments and additional plots displaying convergence of expenditures can be found in Appendix D. Figure 1 displays the mean values of the average and maximum relative errors of the pacing multipliers and time-averaged cumulative utilities over $10$ repeated experiments with different seeds (relative errors of cumulative spending w.r.t. total budgets are plotted separately in Appendix D). The standard errors are also displayed as vertical bars but are very small and nearly invisible. Vertical dotted lines indicate $t=10n$ The figures do not show the initial iterates $t=1,\dots,5n$ .

We see that PACE converges very quickly numerically: within $10$ epochs ( $10n$ time steps) average deviations in most quantities falls within $5\%$ of the equilibrium quantity, with the worst case not far behind. An important point is that budget spend takes much longer to converge than utility. This demonstrates an important practical difference for using PACE in an allocation scenario where budgets are ‘real money’ (e.g. Internet ad impressions) as compared to a CEEI-like setting, where budgets are faux currency only used for fair division.

7 Conclusion

We introduced the concept of an online Fisher market and proposed the PACE dynamics. We showed that when items arrive sequentially and stochastically, PACE converges to equilibrium outcomes of the underlying market model. Furthermore, we showed that, as a consequence of this, PACE can be used in online fair division problems to generate an online allocation that, asymptotically, achieves the compelling fairness properties of CEEI.

Many questions remain for future research. We mostly focused on the case where budgets are faux currency and there are many open questions for adapting PACE to a real-money budget-management setting as well as more complicated nonlinear utility models. Another imperative question, especially for practitioners, is whether PACE guarantees some level of incentive-compatibility.

Refer to caption — Figure 1: In all of our markets, iterates of the PACE dynamics quickly converges to their static equilibrium values both in the average case and the worst-off-buyer case. The horizontal line shows the fraction of $u^{*}$ achieved by the proportional share solution. The PACE utilities quickly outperform the proportional share utilities. Vertical lines indicate when $t$ is a multiple of $10n$ .

References

Aleksandrov and Walsh [2020] M. Aleksandrov and T. Walsh. Online fair division: A survey. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 13557–13562, 2020.
Aleksandrov et al. [2015] M. Aleksandrov, H. Aziz, S. Gaspers, and T. Walsh. Online fair division: Analysing a food bank problem. arXiv preprint arXiv:1502.07571, 2015.
Arnsperger [1994] C. Arnsperger. Envy-freeness and distributive justice. Journal of Economic Surveys, 8(2):155–186, 1994.
Azar et al. [2016] Y. Azar, N. Buchbinder, and K. Jain. How to allocate goods in an online market? Algorithmica, 74(2):589–601, 2016.
Aziz et al. [2015] H. Aziz, S. Gaspers, S. Mackenzie, and T. Walsh. Fair assignment of indivisible objects under ordinal preferences. Artificial Intelligence, 227:71–92, 2015.
Balseiro et al. [2017] S. Balseiro, A. Kim, M. Mahdian, and V. Mirrokni. Budget management strategies in repeated auctions. In Proceedings of the 26th International Conference on World Wide Web, pages 15–23, 2017.
Balseiro and Gur [2019] S. R. Balseiro and Y. Gur. Learning in repeated auctions with budgets: Regret minimization and equilibrium. Management Science, 65(9):3952–3968, 2019.
Balseiro et al. [2015] S. R. Balseiro, O. Besbes, and G. Y. Weintraub. Repeated auctions with budgets in ad exchanges: Approximations and design. Management Science, 61(4):864–884, 2015.
Bateni et al. [2018] M. H. Bateni, Y. Chen, D. Ciocan, and V. Mirrokni. Fair resource allocation in a volatile marketplace. Available at SSRN 2789380, 2018.
Beck [2017] A. Beck. First-order methods in optimization, volume 25. SIAM, 2017.
Birnbaum et al. [2011] B. Birnbaum, N. R. Devanur, and L. Xiao. Distributed algorithms via gradient descent for fisher markets. In Proceedings of the 12th ACM conference on Electronic commerce, pages 127–136. ACM, 2011.
Budish [2011] E. Budish. The combinatorial assignment problem: Approximate competitive equilibrium from equal incomes. Journal of Political Economy, 119(6):1061–1103, 2011.
Caragiannis et al. [2016] I. Caragiannis, D. Kurokawa, H. Moulin, A. D. Procaccia, N. Shah, and J. Wang. The unreasonable fairness of maximum Nash welfare. In Proceedings of the 2016 ACM Conference on Economics and Computation, pages 305–322. ACM, 2016.
Chen and Teng [2009] X. Chen and S.-H. Teng. Spending is not easier than trading: on the computational equivalence of fisher and arrow-debreu equilibria. In International Symposium on Algorithms and Computation, pages 647–656. Springer, 2009.
Cheung et al. [2019] Y. K. Cheung, M. Hoefer, and P. Nakhe. Tracing equilibrium in dynamic markets via distributed adaptation. In Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pages 1225–1233, 2019.
Cole and Gkatzelis [2018] R. Cole and V. Gkatzelis. Approximating the Nash social welfare with indivisible items. SIAM Journal on Computing, 47(3):1211–1236, 2018.
Cole et al. [2017] R. Cole, N. R. Devanur, V. Gkatzelis, K. Jain, T. Mai, V. V. Vazirani, and S. Yazdanbod. Convex program duality, fisher markets, and Nash social welfare. In 18th ACM Conference on Economics and Computation, EC 2017. Association for Computing Machinery, Inc, 2017.
Conitzer et al. [2018] V. Conitzer, C. Kroer, E. Sodomka, and N. E. Stier-Moses. Multiplicative pacing equilibria in auction markets. In International Conference on Web and Internet Economics, 2018.
Conitzer et al. [2019] V. Conitzer, C. Kroer, D. Panigrahi, O. Schrijvers, E. Sodomka, N. E. Stier-Moses, and C. Wilkens. Pacing equilibrium in first-price auction markets. In Proceedings of the 2019 ACM Conference on Economics and Computation. ACM, 2019.
Daskalakis et al. [2009] C. Daskalakis, P. W. Goldberg, and C. H. Papadimitriou. The complexity of computing a Nash equilibrium. SIAM Journal on Computing, 39(1):195–259, 2009.
Dvoretzky et al. [1951] A. Dvoretzky, A. Wald, J. Wolfowitz, et al. Relations among certain ranges of vector measures. Pacific Journal of Mathematics, 1(1):59–74, 1951.
Eisenberg and Gale [1959] E. Eisenberg and D. Gale. Consensus of subjective probabilities: The pari-mutuel method. The Annals of Mathematical Statistics, 30(1):165–168, 1959.
Gao and Kroer [2020a] Y. Gao and C. Kroer. First-order methods for large-scale market equilibrium computation. In Neural Information Processing Systems 2020, NeurIPS 2020, 2020a.
Gao and Kroer [2020b] Y. Gao and C. Kroer. Infinite-dimensional fisher markets and tractable fair division. arXiv preprint arXiv:2010.03025, 2020b.
Ghodsi et al. [2011] A. Ghodsi, M. Zaharia, B. Hindman, A. Konwinski, S. Shenker, and I. Stoica. Dominant resource fairness: Fair allocation of multiple resource types. In Nsdi, volume 11, pages 24–24, 2011.
Goldberg et al. [2001] K. Goldberg, T. Roeder, D. Gupta, and C. Perkins. Eigentaste: A constant time collaborative filtering algorithm. information retrieval, 4(2):133–151, 2001.
Harper and Konstan [2015] F. M. Harper and J. A. Konstan. The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis), 5(4):1–19, 2015.
Kantorovich [1975] L. Kantorovich. Mathematics in economics: achievements, difficulties, perspectives. Technical report, Nobel Prize Committee, 1975.
Kash et al. [2014] I. Kash, A. D. Procaccia, and N. Shah. No agent left behind: Dynamic fair division of multiple resources. Journal of Artificial Intelligence Research, 51:579–603, 2014.
Kroer and Peysakhovich [2019] C. Kroer and A. Peysakhovich. Scalable fair division for ’at most one’ preferences. arXiv preprint arXiv:1909.10925, 2019.
Kroer et al. [2019] C. Kroer, A. Peysakhovich, E. Sodomka, and N. E. Stier-Moses. Computing large market equilibria using abstractions. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 745–746, 2019.
McElfresh et al. [2020] D. C. McElfresh, C. Kroer, S. Pupyrev, E. Sodomka, K. A. Sankararaman, Z. Chauvin, N. Dexter, and J. P. Dickerson. Matching algorithms for blood donation. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 463–464, 2020.
Murray et al. [2020a] R. Murray, C. Kroer, A. Peysakhovich, and P. Shah. Robust market equilibria with uncertain preferences. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 2192–2199, 2020a.
Murray et al. [2020b] R. Murray, C. Kroer, A. Peysakhovich, and P. Shah. https://research.fb.com/blog/2020/09/robust-market-equilibria-how-to-model-uncertain-buyer-preferences/, Sep 2020b.
Nesterov [2009] Y. Nesterov. Primal-dual subgradient methods for convex problems. Mathematical programming, 120(1):221–259, 2009.
Nesterov and Shikhman [2018] Y. Nesterov and V. Shikhman. Computation of fisher–gale equilibrium by auction. Journal of the Operations Research Society of China, 6(3):349–389, 2018.
Nisan et al. [2007] N. Nisan, T. Roughgarden, E. Tardos, and V. V. Vazirani. Algorithmic game theory. Cambridge University Press, 2007.
Othman et al. [2010] A. Othman, T. Sandholm, and E. Budish. Finding approximate competitive equilibria: efficient and fair course allocation. In AAMAS, volume 10, pages 873–880, 2010.
Othman et al. [2016] A. Othman, C. Papadimitriou, and A. Rubinstein. The complexity of fairness through equilibrium. ACM Transactions on Economics and Computation (TEAC), 4(4):1–19, 2016.
Parkes et al. [2015] D. C. Parkes, A. D. Procaccia, and N. Shah. Beyond dominant resource fairness: Extensions, limitations, and indivisibilities. ACM Transactions on Economics and Computation (TEAC), 3(1):1–22, 2015.
Peysakhovich and Kroer [2019] A. Peysakhovich and C. Kroer. Fair division without disparate impact. Mechanism Design for Social Good, 2019.
Plaut and Roughgarden [2020] B. Plaut and T. Roughgarden. Almost envy-freeness with general valuations. SIAM Journal on Discrete Mathematics, 34(2):1039–1068, 2020.
Scarf et al. [1967] H. Scarf et al. On the computation of equilibrium prices. Number 232. Cowles Foundation for Research in Economics at Yale University New Haven, CT, 1967.
Shmyrev [2009] V. I. Shmyrev. An algorithm for finding equilibrium in the linear exchange model with fixed budgets. Journal of Applied and Industrial Mathematics, 3(4):505, 2009.
Sinclair et al. [2021] S. R. Sinclair, S. Banerjee, and C. L. Yu. Sequential fair allocation: Achieving the optimal envy-efficiency tradeoff curve. arXiv preprint arXiv:2105.05308, 2021.
Varian et al. [1974] H. R. Varian et al. Equity, envy, and efficiency. Journal of Economic Theory, 9(1):63–91, 1974.
Vazirani and Yannakakis [2011] V. V. Vazirani and M. Yannakakis. Market equilibrium under separable, piecewise-linear, concave utilities. Journal of the ACM (JACM), 58(3):1–25, 2011.
Xiao [2010] L. Xiao. Dual averaging methods for regularized stochastic learning and online optimization. Journal of Machine Learning Research, 11:2543–2596, 2010.
Zhang [2011] L. Zhang. Proportional response dynamics in the fisher market. Theoretical Computer Science, 412(24):2691–2698, 2011.

Appendix A Related Work

The problem of market equilibrium computation has been of interest in economics for a long time (see, e.g., Nisan et al. [37]). There is a large literature focusing on computation of equilibrium in the specific case of (finite-dimensional) Fisher markets through various convex optimization formulations [22, 44, 17, 31] and gradient-based methods [11, 36, 23]. Other works extend these results to settings such as quasilinear utilities, capped utilities, indivisible items, or imperfectly specified utility functions [17, 16, 13, 33, 30, 41]. One of the most well-known algorithm for computing static market equilibria under Constant Elasticity of Substitution (CES) utilities is the Proportional Response (PR) dynamics Zhang [49], Birnbaum et al. [11]. Recently, [24] extends the classical Fisher market model to a measurable (possibly continuous) item space and shows that infinite-dimensional EG-type convex programs capture ME under this setting. Our work extends these ideas to a Fisher market-like scenario where items arrive sequentially.

The Fisher market literature above focuses on divisible items or randomized allocations of indivisible items. There is also a large literature on fair allocation of indivisible items (e.g. Aziz et al. [5], Caragiannis et al. [13], Plaut and Roughgarden [42]) including approximate ME-based methods [12, 39]. We note that all allocations in our setting are discrete and the relationship to Fisher markets happens in the time-average sense.

Perhaps most similar to our setting is that of [4], who study how to allocate allocate items in an online fashion in order to obtain a market-equilibrium-like allocation. However, they consider competitive ratios, and give a primal-dual algorithm that suffers at most a logarithmic loss compared to the best hindsight optimal solution, even for worst-case arrivals. In addition to the lack of asymptotic convergence, they also only show guarantees on various (arithmetic, geometric, harmonic) averages of the utilities. In contrast to this, our work considers stochastic arrivals, and gives an adaptive algorithm for asymptotically achieving all the desireable market equilibrium properties (e.g. no envy, Pareto optimality, equilibrium utilities). Another important difference is that our approach is easily implemented as a distributed dynamics that requires only a first-price auction allocation mechanism with indivisible allocations, in $O(n)$ time per item arrival ( $n$ being the number of buyers). At each time step, the algorithm only uses buyers’ valuations of the current arriving item. This makes our approach suitable for implementation in large-scale systems with a huge, possibly infinite item space.

Other methods for online fair division have been studied by various authors. In this literature, there are various notions of “online:” either buyers, items, or both can arrive online. Here we survey only related work where items arrive online. [2] studies a simple mechanism where agents can declare if they like an item, and then a coin is flipped to determine which of the agents that liked the item will get it. [29] studies online allocation for Leontief utilities (where each agent wants a bundle with items of fixed proportions) and shows how to achieve various properties for this setting. [15] studies an evolving market environment and shows that the PR dynamics generates iterates that are close to the (changing) equilibrium. Similar to the classical PR dynamics, in each time step, all buyers’ valuations of all items are known to the algorithm. In contrast, our work allows an unknown underlying market from which items are sampled: in each time step, PACE only uses buyers’ valuations of the current arriving item. [9] studies an online fair allocation problem and proposes a stochastic approximation scheme, which relies on frequently resolving the EG convex program, that ensures a constant approximation ratio (in terms of a proportional fairness measure) relative to the offline fair allocation. Very recently, [45] studies the problem of online fair division in which there is a fixed (finite) number of divisible items and sequentially arriving agents. The authors show that envy-freeness and (Pareto) efficiency cannot be minimized simultaneously; instead, there exists a boundary such that any algorithm can only possibly achieve the envy-efficiency combinations on one side of it, while they also propose such an algorithm. See also [1] for a survey of further works in this area.

The idea of pacing has been studied in the context of budget management in second-price auctions. The work most related to our work is [7], which studies an online version of that setting. [7] and our work are very different in terms of problem settings, results, and analysis. Here, we point out some key differences. [7] first show that, in the typical setting of a single bidder interacting with second-price auctions where values and prices are drawn from a stochastic environment, an adaptive pacing strategy achieves $O(1/\sqrt{T})$ regret and ergodic convergence of the bidder’s pacing multipliers. Then, under additional assumptions on monotonicity of the bidders’ expected expenditures, the authors establish game-theoretic equilibrium properties when all bidders use the same strategy, i.e., under the “simultaneous learning” setting. In contrast, we focus on market equilibrium properties such as fairness and efficiency. Furthermore, in terms of technical assumptions and convergence results, our PACE dynamics is stepsize-free, both in theory and numerically, while the adaptive pacing algorithm in [7] requires careful stepsizing rules; in our setting, we have last-iterate convergence of pacing multipliers (Theorem 7), whereas [7] only establishes ergodic convergence without a rate. These differences are, fundamentally, due to the use of different first-order optimization methods, and the fact that we leverage strong convexity of the EG dual. The analysis in [7] builds upon the convergence properties of mirror descent and stochastic approximation. Hence, it requires pre-determined vanishing stepsizes (their Assumption 1). In the simultaneous learning setting, the stepsizes of all bidders furthermore need to be carefully selected in joint fashion (their Assumption 3). In contrast, PACE is completely stepsize free, thanks to structure of the reformulated dual EG convex program (3), on which dual averaging can be applied directly without any stepsize parameter or an auxiliary regularization term. This makes it vastly easier to apply PACE in practice.

Appendix B Proofs, Derivations and Examples

Proof of Theorem 1

In [24, §3], it is assumed that item supplies are uniform, i.e., $s(\theta)=1$ for all $\theta\in\Theta$ . As is well-known in the finite case ( $\Theta=[m]$ ), this assumption is w.l.o.g. when studying a static Fisher market. Here, we show that all results in [24, §3] can be easily generalized to the case of non-uniform supplies $s$ (Theorem 1). For any market instance $M$ with buyer valuations $v_{i}\in L^{1}_{+}$ , budgets $B_{i}$ , $i\in[n]$ and item supplies $s\in L^{\infty}_{+}$ (all normalized as described in h in §2), consider another market instance $\tilde{M}$ with supplies being the constant function taking $1$ on $\Theta$ (denoted as $\mathbf{1}$ ), valuations

\tilde{v}_{i}(\theta)=v_{i}(\theta)s(\theta)

and the same budgets. First, note that $\tilde{v}_{i}\in L^{1}_{+}$ since $s\in L^{\infty}_{+}$ and $\Theta$ has a finite measure ( $\|s\|_{\infty}=\inf\{M:|s|\leq M\,\textnormal{a.e.}\}$ ):

\int_{\Theta}\tilde{v}_{i}(\theta)d\theta\leq\|s\|_{\infty}\int_{\Theta}v_{i}(\theta)d\theta=\|s\|_{\infty}v_{i}(\Theta)=\|s\|_{\infty}.

Denote the set of feasible allocations of $M$ as $F$ , that is, the set of $(x_{i})$ such that $x_{i}\in L^{\infty}_{+}$ for all $i$ and $\sum_{i}x_{i}\leq s$ . Similarly, denote the set of feasible allocations of $\tilde{M}$ as $\tilde{F}$ . For any $x\in F$ , consider

\displaystyle\tilde{x}_{i}(\theta)=\begin{cases}x_{i}(\theta)/s(\theta)&{\rm if}\ \theta>0,\\ 0&{\rm o.w.}\end{cases}

(4)

Since $0\leq x_{i}\leq s$ (a.e.), we have $0\leq\tilde{x}_{i}\leq\mathbf{1}$ (which means $\tilde{x}_{i}\in L^{\infty}_{+}$ ). Since $\sum_{i}x_{i}\leq s$ , we have

\sum_{i}\tilde{x}_{i}\leq\mathbf{1}.

Therefore, the set of utilities attainable by allocations in $F$ is the same as the set of utilities attainable by $\tilde{F}$ . In other words,

U=\left\{u\in\mathbb{R}^{n}_{+}:x\in F,\ \langle v_{i},x_{i}\rangle=u_{i},\,i\in[n]\right\}=\tilde{U}=\left\{\tilde{u}\in\mathbb{R}^{n}_{+}:\tilde{x}\in\tilde{F},\ \langle\tilde{v}_{i},\tilde{x}_{i}\rangle=\tilde{u}_{i}\right\}.

By [24, Lemma 1] and its proof there (here, we only need the compactness, not the existence of a pure allocation for any feasible utility vector; hence, invoking [21, Theorem 1] suffices), $\tilde{U}$ is convex and compact and so is $U$ . Hence, the suprema of ( $\mathcal{P}_{\rm EG}$ ) is attained. Completely analogous to the proof of [24, Theorem 2], we can show that both suprema of ( $\mathcal{P}_{\rm EG}$ ) is attained. Furthermore, completely analogous to the proofs of Lemma 3 and Theorem 2 there, we can show that strong duality holds for ( $\mathcal{P}_{\rm EG}$ ) and ( $\mathcal{D}_{\rm EG}$ ). More specifically, for any $x$ feasible to ( $\mathcal{P}_{\rm EG}$ ) and $(p,\beta)$ feasible to ( $\mathcal{D}_{\rm EG}$ ), and any $0\leq u_{i}\leq\langle v_{i},x_{i}\rangle$ (i.e., adding auxiliary variable $u_{i}$ to ( $\mathcal{P}_{\rm EG}$ )), it holds that

	$\displaystyle\sum_{i}B_{i}\log u_{i}$	$\displaystyle\leq\sum_{i}B_{i}\log u_{i}-\sum_{i}\beta_{i}(u_{i}-\langle v_{i},x_{i}\rangle)-\left\langle p,\sum_{i}x_{i}-s\right\rangle$
		$\displaystyle=\sum_{i}(B_{i}\log u_{i}-\beta_{i}u_{i})-\sum_{i}\langle p-\beta_{i}v_{i},x_{i}\rangle+\langle p,s\rangle$
		$\displaystyle\leq\sum_{i}(B_{i}\log\frac{B_{i}}{\beta_{i}}-\beta_{i}\cdot\frac{B_{i}}{\beta_{i}})-\sum_{i}\langle p-\beta_{i}v_{i},x_{i}\rangle+\langle p,s\rangle$
		$\displaystyle\leq\sum_{i}(B_{i}\log B_{i}-B_{i})+\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}$
		$\displaystyle=\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}-C,$

where the constant $C=\sum_{i}B_{i}(1-\log B_{i})$ . In the above derivation, the first inequality uses $\beta_{i}\geq 0$ , $u_{i}\leq\langle v_{i},x_{i}\rangle$ , $\sum_{i}x_{i}\leq s$ ; the second inequality uses the fact that $u_{i}=B_{i}/\beta_{i}$ maximizes the function

u_{i}\mapsto B_{i}\log u_{i}-\beta_{i}u_{i}

for any $\beta_{i}>0$ (i.e., substituting $u_{i}=B_{i}/\beta_{i}$ into the first line); the third inequality uses feasibility w.r.t. ( $\mathcal{D}_{\rm EG}$ ), i.e., $p\geq\beta_{i}v_{i}$ for all $i$ . Hence, when all inequalities are tight at a pair of solutions $x^{*}$ and $(p^{*},\beta^{*})$ feasible to ( $\mathcal{P}_{\rm EG}$ ) and ( $\mathcal{D}_{\rm EG}$ ), respectively (i.e., both optima are attained), the following KKT conditions must hold:

•

$\langle p^{*},s-\sum_{i}x_{i}^{*}\rangle=0$ (via the first inequality above being tight).
•

$u^{*}=B_{i}/\beta^{*}_{i}$ for all $i$ (via the second above).
•

$\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0$ for for all $i$ (via the third above).

As the proof of [24, Theorem 2] shows, these conditions (together with feasibility w.r.t. the two convex programs) are necessary and sufficient for $(x^{*},p^{*})$ being a ME.

An example: the distributional assumption on item arrivals

In the definitions of OFM and OME in §2, we do not impose any distributional assumption on the sequentially arriving items $\theta_{t}$ . The PACE dynamics does not require any distributional assumption either. It is in the analysis of PACE in §5 that we assume that $\theta_{t}$ are drawn i.i.d. from an (unknown) underlying distribution $s\in L^{\infty}_{+}$ (where $s(\Theta)=1$ since it is a distribution). We define an underlying static Fisher market with the same buyers and item supplies $s$ . Then, §5 essentially shows that PACE guarantees that various (time-averaged) quantities converge to their static equilibrium quantities in the underlying market.

To justify the necessity of such a distributional assumption on item arrivals, consider the following example in which items arrivals are chosen by an adaptive “adversary” whose goal is to make the buyers’ time-averaged utilities of any online algorithm deviate from the static equilibrium utilities (defined by the “hindsight market,” i.e., the $n\times T$ market with items $\theta_{1},\dots,\theta_{T}$ ) as much as possible. For simplicity (and w.l.o.g.), budgets, valuations and supplies are not normalized in this example.

•

There are $n$ buyers with equal budgets $B_{i}=1$ and an item space of $\Theta=[n+1]$ .
•

Let the valuation matrix be as follows, for some large $M>n$ :

$v=\begin{bmatrix}1&M&M&M&0\\ \vdots&\vdots&\vdots&0&M\\ \vdots&M&0&\vdots&\vdots\\ 1&0&M&M&M\end{bmatrix}.$

In other words, for item $1$ , all buyers have valuation $1$ . For each item $j=2,\dots,n+1$ , buyer $(n+2-j)$ has valuation $0$ and other buyers have valuation $M$ .
•

The item supplies are $1$ for all $j\in[n+1]$ .
•

The number of time periods $T$ is large.

Given any fixed $j_{0}=2,\dots,n+1$ , the $n\times T$ static market with $T/2$ items of type $1$ and $T/2$ items of type $j_{0}$ exhibits the following ME:

•

Buyer $i_{0}:=n+2-j^{*}$ receives all $T/2$ items of type $1$ with a utility of $u^{*}_{i_{0}}=T/2$ and $\beta^{*}_{i_{0}}=2/T$ .
•

Each buyer $i\in[n]\setminus\{i_{0}\}$ receives $T/(2(n-1))$ items of type $2$ (i.e., all items of type $2$ are evenly distributed among them) with a utility $u^{*}_{i}=\frac{MT}{2(n-1)}$ and $\beta^{*}_{i}=\frac{2(n-1)}{MT}$ .
•
The price of item $1$ (with $T/2$ copies) is $p^{*}_{1}=\max\left\{2/T,\frac{2(n-1)}{MT}\right\}=\frac{2}{T}$ . The price of item $j_{0}$ (with $T/2$ copies) is $p^{*}_{j_{0}}=\max\left\{\beta^{*}_{i_{0}}\cdot 0,\frac{2(n-1)}{MT}\cdot M\right\}=2(n-1)/T$ . To verify these are equilibrium prices, note the following:
- –
  
  Buyer $i_{0}$ has $v_{i_{0}j_{0}}=0<v_{i_{0}1}=1$ . Hence, given $p^{*}$ , will strictly prefer type $1$ items over type $j_{0}$ . Her equilibrium allocation also consists of only type- $1$ items that cost exactly her budget: $\frac{T}{2}\cdot\frac{2}{T}=1$ .
- –
  
  Buyer $i\neq i_{0}$ prefers type $j_{0}$ over type $1$ since the former has a higher value-per-unit-price:
  
  $\frac{M}{p^{*}_{j_{0}}}=\frac{MT}{2(n-1)}>\frac{T}{2}=\frac{1}{p^{*}_{1}}.$
  
  Her equilibrium allocation also consists of only type- $j_{0}$ items that cost exactly her budget: $\frac{T}{2(n-1)}\cdot\frac{2(n-1)}{T}=1$ .

Given any online algorithm, consider the following adaptive adversary.

•

In the first $T/2$ time steps, every item arrival is of type $1$ , that is, $\theta_{t}=1$ , $t=1,\dots,T/2$ . The algorithm must irrevocably allocate them to the buyers.
•

Since every buyer has the same valuation $1$ on type $1$ , there exists a buyer $i_{0}$ that receives $u_{i_{0}}\leq T/(2n)$ up to $T/2$ .
•

Then, the adversary picks $j_{0}=n+2-i_{0}$ and every item arrival in the remaining $T/2$ periods i s of type $j_{0}$ (which has value zero for $i_{0}$ ).

In this way, buyer $i_{0}$ only receives a total utility of $u_{i_{0}}$ across all $T$ . As shown above, in the static (hindsight) $n\times T$ market, she should have received an equilibrium utility of $u^{*}_{i_{0}}=T/2$ (i.e., being allocated all type- $1$ items). Hence, the realized utility of buyer $i_{0}$ is only $1/n$ of her static equilibrium utility.

Reformulation of ( $\mathcal{D}_{\rm EG}$ ) into (3)

The reformulation is mainly based on [24, §5], except that we now allow non-uniform supplies $s$ instead of $s(\theta)=1$ for all $\theta\in\Theta$ . Assuming uniform supplies is w.l.o.g. in the static Fisher market (via rescaling all $v_{i}$ ) but is not so in OFM, since $s$ in an FOM represents the arbitrary, unknown underlying item distribution from which item arrivals are drawn.

In ( $\mathcal{D}_{\rm EG}$ ), fixing a $\beta>0$ , setting

p=\max_{i}\beta_{i}v_{i}\in L^{1}(\Theta)_{+},

i.e., the smallest $L^{1}$ function greater than or equal to $\beta_{i}v_{i}$ for all $i$ , clearly minimizes the objective subject to the constraints. Hence, we can eliminate $p$ in this way and write ( $\mathcal{D}_{\rm EG}$ ) as a finite-dimensional convex program in $\beta$ . Here, $\beta_{i}v_{i},s\rangle$ is convex in $\beta$ since $\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta)$ is convex for any $\theta\in\Theta$ . More specifically, for any $\beta,\gamma\in\mathbb{R}^{n}_{+}$ , $\lambda\in[0,1]$ , $\theta\in\Theta$ , we have

\max_{i}\,(\lambda\beta_{i}+(1-\lambda)\gamma_{i})v_{i}(\theta)\leq\lambda\max_{i}\beta_{i}v_{i}(\theta)+(1-\lambda)\max_{i}\gamma_{i}v_{i}(\theta).

Hence,

\langle\max_{i}\,(\lambda\beta+(1-\lambda)\gamma)v_{i},s\rangle\leq\lambda\langle\max_{i}\beta_{i}v_{i},s\rangle+(1-\lambda)\langle\max_{i}\gamma_{i}v_{i},s\rangle.

Due to the strong convexity assumption in Theorem 2, we would need the function

\beta\mapsto\sum_{i}B_{i}\log\beta_{i}

to be strongly convex on its domain. However, it is only strictly but not strongly convex on $\mathbb{R}_{++}^{n}$ . To resolve this, we use the following lemma. It is similar to [24, Lemma 4] except we allow non-uniform supplies.

Lemma 1.

Assume the normalizations in h in §2. Then, the equilibrium utilities satisfy $B_{i}\leq u^{*}_{i}\leq 1$ and hence $B_{i}\leq\beta^{*}_{i}=B_{i}/u^{*}_{i}\leq 1$ .

Proof.

Since any buyer can get at most the entire set of items (given by the supply $s$ ),

u^{*}_{i}\leq\langle v_{i},s\rangle=1,

where the last inequality is due to the normalization $\langle v_{i},s\rangle=1$ in h in §2. In any ME $(x^{*},p^{*})$ , Theorem 1 implies

\langle p^{*},x^{*}_{i}\rangle=\beta^{*}_{i}\langle v_{i},x^{*}_{i}\rangle=\beta^{*}_{i}u^{*}_{i}=B_{i},

that is, each buyer $i$ spends her entire budget. Hence, by the normalization $\|B\|_{1}=1$ and market clearance $\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0$ , we have

\langle p^{*},s\rangle=\sum_{i}\langle p^{*},x^{*}_{i}\rangle=\sum_{i}B_{i}=\|B\|_{1}=1\ \Rightarrow\ \langle p^{*},B_{i}s\rangle=B_{i}.

In other words, given item price $p^{*}$ , each buyer $i$ can afford the proportional allocation $x^{\circ}_{i}:=B_{i}s$ . Hence, the buyer optimality property of ME implies that buyer $i$ ’s equilibrium utility is at least the proportional share:

\displaystyle u^{*}_{i}\geq\langle v_{i},x^{\circ}_{i}\rangle=B_{i}\langle v_{i},s\rangle=B_{i}.

Since $B_{i}\leq u^{*}_{i}\leq 1$ and $\beta^{*}_{i}=B_{i}/u^{*}_{i}$ at equilibrium (Theorem 1), we have

B_{i}\leq\beta^{*}_{i}\leq 1.

∎

By Lemma 1, adding the constraints

B_{i}/(1+\delta_{0})\leq\beta_{i}\leq 1+\delta_{0},\ \forall\,i

to the convex program does not affect its optimal solution $\beta^{*}$ . Here, $\delta_{0}>0$ is to ensure $\beta^{*}_{i}\in(l_{i},h_{i})$ (the open interval), which facilitates the convergence analysis of cumulative utilities. To simplify the constants, one can take $\delta_{0}=1$ . Numerical experiments suggest that its value does not affect the speeds of convergence of quantities of interest.

Combining the above yields the reformulation (3). To align (3) with (2), for each $\theta\in\Theta$ (corresponding to $Z$ in §4) and $\beta\in\mathbb{R}^{n}_{+}$ (corresponding to $w$ in §4), let

f_{\theta}(\beta):=\max_{i}\beta_{i}v_{ij}.

Then,

f(\beta):=\mathbf{E}f_{\theta}(\beta)=\langle\max_{i}\beta_{i}v_{i},s\rangle,

where the expectation is over $\theta\sim s$ , i.e., a random variable with distribution $s$ (corresponding to $z\sim\mathcal{D}$ in §4).

Proof of Theorem 3

It follows immediately from Theorem 2, as long as the function

\Psi(\beta)=-\sum_{i}B_{i}\log\beta_{i}

is strongly convex modulo $\sigma$ and $\mathbf{E}\|v_{i_{t}}(\theta_{t})\mathbf{e}^{(i_{t})}\|^{2}\leq G^{2}$ . We now show them. Note that $\Psi$ is twice differentiable and has a diagonal Hessian

\nabla^{2}\Psi(\beta)=\begin{bmatrix}\frac{B_{1}}{\beta_{1}^{2}}&&\\ &\ddots&\\ &&\frac{B_{n}}{\beta_{n}^{2}}\end{bmatrix}

at any $\beta>0$ . Clearly, its smallest eigenvalue can be bounded as

\lambda_{\min}(\nabla^{2}\Psi(\beta))\geq\min_{i}\frac{B_{i}}{\beta_{i}}.

Denote $\kappa=1/(\min_{i}B_{i})$ . For any $\beta$ feasible to (3), by the constraints $B_{i}/(1+\delta_{0})\leq\beta_{i}\leq 1+\delta_{0}$ , we have

\lambda_{\min}(\nabla^{2}\Psi(\beta))\geq\min_{i}\min_{\beta_{i}\in[B_{i}/(1+\delta_{0}),1+\delta_{0}]}\frac{B_{i}}{\beta_{i}^{2}}=\min_{i}\frac{B_{i}}{(1+\delta_{0})^{2}}=\frac{1}{\kappa(1+\delta_{0})^{2}}.

Therefore, $\Psi$ is strongly convex on $[B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}]$ with modulus $\sigma=\frac{1}{\kappa(1+\delta_{0})^{2}}$ . Finally, we have

\mathbf{E}\|v_{i_{t}}\mathbf{e}^{(i_{t})}\|^{2}\leq\max_{i}\mathbf{E}_{\theta\sim s}[v_{i}(\theta)^{2}]=G^{2}\leq\max_{i}\|v_{i}\|_{\infty}^{2}.

Proof of Theorem 4

Intuitively, our proof uses the fact that if $\beta^{t}_{i}$ and $\beta_{i}^{*}$ are near each other, then $\frac{B_{i}}{\beta^{t}_{i}}$ will be near $\frac{B_{i}}{\beta_{i}^{*}}=u_{i}^{*}$ as well. Recall that $g^{t}_{i}=u^{t}_{i}$ (i.e., the subgradient of $\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta_{t})$ that we choose corresponds to the utility buyer $i$ receives at time $t$ ) and hence $\bar{g}^{t}_{i}=\bar{u}^{t}_{i}$ . Since

\beta^{t+1}=\Pi_{[l_{i},h_{i}]}\left(\frac{B_{i}}{\bar{g}^{t}_{i}}\right),

we know that if no projection occurs (i.e., if $\frac{B_{i}}{\bar{g}^{t}_{i}}\in[l_{i},h_{i}]$ ) at iteration $t$ , then

\frac{B_{i}}{\beta^{t+1}_{i}}=\bar{g}_{i}^{t}.

Thus, we split our proof into two cases: the case where projection occurs (i.e., $\frac{B_{i}}{\bar{g}^{t}_{i}}\notin[l_{i},h_{i}]$ ), and the case where projection does not occur. As we will see, the probability of a projection at time step $t$ converges to $0$ as $t$ grows.

For each $i$ , consider the event that no projection occurs:

A^{t}_{i}:=\{l_{i}\leq B_{i}/\bar{g}^{t}_{i}\leq h_{i}\}.

Conditioning on the complementary event $(A^{t}_{i})^{c}=\{\bar{g}^{t}_{i}\notin[l_{i},h_{i}]\}$ , it holds that

|\beta^{t+1}_{i}-\beta^{*}_{i}|>\epsilon_{i}\ \Rightarrow\ \mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}\geq\mathbf{P}[(A^{t}_{i})^{c}]\epsilon_{i}^{2}\ \Rightarrow\ \mathbf{P}[(A^{t}_{i})^{c}]\leq\frac{1}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Conditioning on $A^{t}_{i}$ , we have $B_{i}/\bar{g}^{t}_{i}=\beta^{t+1}_{i}$ . Furthermore, since

0\leq\bar{g}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\leq\|v_{i}\|_{\infty}

and $\|v_{i}\|_{\infty}\geq 1\geq u^{*}_{i}$ , we have the following upper bound on the difference between the time average of realized utilities and the equilibrium utility of buyer $i$ :

|\bar{g}^{t}_{i}-u^{*}_{i}|\leq\max\{u^{*}_{i},\|v_{i}\|_{\infty}\}=\|v_{i}\|_{\infty}.

Now, splitting the expectation by the two complementary events $A^{t}_{i}$ and $(A^{t}_{i})^{c}$ , we can apply the above bounds to get

	$\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}$	$\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{}_{i}\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{}_{i}}-1\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{}_{i}}{\beta^{t+1}_{i}}\right)^{2}$
		$\displaystyle\leq\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{}_{i})^{2}+\left(\frac{(1+\delta_{0})u^{}_{i}}{B_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.$

Since $B_{i}\leq\beta^{*}_{i}\leq 1$ , we have ( $\kappa:=1/(\min_{i}B_{i})$ )

\epsilon_{i}\geq B_{i}\delta_{0}/(1+\delta_{0})>\delta_{0}/\kappa>0.

Summing up across all $i$ , using Theorem 3 and the above bound, we get

	$\displaystyle\mathbf{E}\\|\bar{g}^{t}-u^{*}\\|^{2}$	$\displaystyle\leq\sum_{i}\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\\|v\\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\sum_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\\|v\\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}$
		$\displaystyle=C\cdot\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}.$

Proof of Theorem 5

First, note that $\bar{b}^{t}_{i}$ can be decomposed as follows.

	$\displaystyle\bar{b}^{t}_{i}$	$\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}$
		$\displaystyle=\beta^{}_{i}\cdot\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}$
		$\displaystyle=\beta^{}_{i}\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}.$

Next, we bound the second term as follows, using convexity of $(\cdot)^{2}$ and $\|v_{ij_{\tau}}\|\leq\|v_{i}\|_{\infty}$ :

\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\right)^{2}\leq\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\|v_{i}\|_{\infty}^{2}.

Then, we bound the square difference between expenditure and budget as follows, using $(x+y)^{2}\leq 2(x^{2}+y^{2})$ for any $x,y\in\mathbb{R}$ :

\displaystyle(\bar{b}^{t}_{i}-B_{i})^{2}

\displaystyle\leq 2\left[(\beta^{*}_{i}\bar{g}^{t}_{i}-B_{i})^{2}+\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\right)^{2}\right].

Combining the above two inequalities, taking expectation on both sides and using $\beta^{*}_{i}=B_{i}/u^{*}_{i}$ , we have

\displaystyle\mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2}\leq 2\left[(\beta^{*}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\|v_{i}\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right].

(5)

When $t\geq 3$ , we have $\frac{\log(t+1)}{t+1}<\frac{\log t}{t}$ (since $(\frac{\log t}{t})^{\prime}=\frac{1-\log t}{t^{2}}<0$ for all $t\geq 3$ ). By the proof of [48, Corollary 4],

\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}\frac{(6+\log\tau)G^{2}}{\tau\sigma^{2}}\leq\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}.

(6)

Finally, summing up (5) across all $i$ , using $\beta^{*}_{i}\leq 1$ , Theorems 3 and 4, and (6), we have

	$\displaystyle\mathbf{E}\\|\bar{b}^{t}-B\\|^{2}$	$\displaystyle\leq 2\left[\mathbf{E}\\|\bar{g}^{t}-u^{}\\|^{2}+\\|v\\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\\|\beta^{\tau}-\beta^{}\\|^{2}\right]$
		$\displaystyle\leq 2\left[C\cdot\frac{(6+\log t)G^{2}}{t\sigma^{2}}+\\|v\\|_{\infty}^{2}\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}\right]$
		$\displaystyle=\frac{2G^{2}}{t\sigma^{2}}\left(6(C+\\|v\\|_{\infty}^{2})+(C+6\\|v\\|_{\infty}^{2})\log t+\frac{\\|v\\|_{\infty}^{2}}{2}(\log t)^{2}\right).$

Proof of Theorem 6

For any $\theta\in\Theta$ , since $\|\cdot\|_{\infty}$ is $1$ -Lipschitz continuous w.r.t. itself, we have

$\displaystyle\left\|p^{*}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right\|$	$\displaystyle\leq\left\|\max_{i}\beta^{*}_{i}v_{i}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right\|$
	$\displaystyle\leq\max_{i}\|\beta^{*}_{i}v_{i}(\theta)-\beta^{t}_{i}v_{i}(\theta)\|$
	$\displaystyle\leq\\|v\\|_{\infty}\\|\beta^{t}-\beta^{*}\\|_{\infty}.$	(7)

Analysis of regret $r^{t}_{i}$ .

Let $(z^{\tau}_{i})_{\tau\in[t]}\in[0,1]^{t}$ be any feasible allocation on the arrived items $\theta_{\tau}$ , $\tau\in[t]$ such that

\frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{t})z^{\tau}_{i}\leq B_{i}.

Using $p^{\tau}(\theta_{\tau})=\max_{i}\beta^{\tau}_{i}v_{i}(\theta_{\tau})$ , we have

	$\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})z^{\tau}_{i}$	$\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{\tau})z^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{\tau})-p^{\tau}(\theta_{\tau}))z^{\tau}_{i}$
		$\displaystyle\leq B_{i}+\frac{1}{t}\\|v\\|_{\infty}\sum_{\tau=1}^{t}\\|\beta^{\tau}-\beta^{}\\|_{\infty}\quad\text{[by \eqref{eq:bound-\|p(j)-max-beta(i,t)*v(i,j)\|-any-j} and $0\leq z^{\tau}_{ij}\leq 1$] }.$		(8)

Denote

\gamma_{t}=\frac{1}{t}\|v\|_{\infty}\sum_{\tau=1}^{t}\|\beta^{\tau}-\beta^{*}\|_{\infty}.

In a static ME, by Theorem 1 and the constraints in ( $\mathcal{D}_{\rm EG}$ ), we have $p^{*}\geq\beta^{*}_{i}v_{i}$ . Hence,

\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})z^{\tau}_{i}\geq\beta^{*}_{i}\left(\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}\right).

(9)

By (8), (9), $u^{*}_{i}=B_{i}/\beta^{*}_{i}$ (Theorem 1) and the definition of $\xi^{t}_{i}$ , we have

\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}\leq\frac{1}{\beta^{*}_{i}}(B_{i}+\gamma_{t})=u^{*}_{i}\left(1+\frac{\gamma_{t}}{B_{i}}\right)\leq u^{*}_{i}+\frac{\gamma_{i}}{B_{i}}\leq\bar{u}^{t}_{i}+\xi^{t}_{i}+\frac{\gamma_{t}}{B_{i}}.

Hence, the utility level $\hat{U}^{t}_{i}$ (Definition 3) satisfies

\displaystyle\hat{U}^{t}_{i}\leq\bar{u}^{t}_{i}+\xi^{t}_{i}+\frac{\gamma_{t}}{B_{i}}.

(10)

Note that $\mathbf{E}(\gamma_{t}^{2})$ can be bounded as follows:

\displaystyle\mathbf{E}(\gamma_{t}^{2})\leq\|v\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\|\beta^{\tau}-\beta^{*}\|^{2}\leq\frac{\|v\|_{\infty}^{2}}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}=O\left(\frac{(\log t)^{2}}{t}\right).

(11)

where the second inequality is due to Theorem 3 and (6). Combining (10), (11) and $\mathbf{E}(\xi^{t}_{i})^{2}=O((\log t)/t)$ (Theorem 4), we have

\mathbf{E}(r^{t}_{i})^{2}\leq 2\left(\mathbf{E}(\xi^{t}_{i})^{2}+\frac{1}{B_{i}^{2}}\mathbf{E}(\gamma_{t}^{2})\right)=O\left(\frac{(\log t)^{2}}{t}\right).

Analysis of envy $\rho^{t}_{i}$ .

Let $p^{*}=\max_{i}\beta^{*}_{i}v_{i}$ (a.e.) be the equilibrium prices. Similar to 8, for any $i$ , we have

	$\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})x^{\tau}_{i}$	$\displaystyle=\bar{b}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{\tau})-\beta^{\tau}_{i}v_{i}(\theta_{\tau}))x^{\tau}_{i}$
		$\displaystyle\leq B_{i}+\Delta^{t}_{i}+\eta^{t}_{i}.$		(12)

Using the above (replacing $i$ with $k$ ) and $p^{*}\geq\beta^{*}_{i}v_{i}$ , we have

\displaystyle\beta^{*}_{i}\bar{u}^{t}_{ik}=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{*}_{i}v_{i}(\theta_{\tau})x^{\tau}_{k}\leq\frac{1}{\tau}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})x^{\tau}_{k}\leq B_{k}+\Delta^{t}_{k}+\eta^{t}_{k}.

Hence, using $u^{*}_{i}=B_{i}/\beta^{*}_{i}\leq 1$ (Theorem 1 and Lemma 1),

	$\displaystyle\frac{\bar{u}^{t}_{ik}}{B_{k}}$	$\displaystyle\leq\frac{1}{B_{k}}\cdot\frac{1}{\beta^{*}_{i}}(B_{k}+\Delta^{t}_{k}+\eta^{t}_{k})$
		$\displaystyle\leq\frac{u^{*}_{i}}{B_{i}}\left(1+\frac{\Delta^{t}_{i}+\eta^{t}_{i}}{B_{k}}\right)$
		$\displaystyle\leq\frac{\bar{u}^{t}_{i}}{B_{i}}+\frac{\xi^{t}_{i}}{B_{i}}+\frac{\Delta^{t}_{i}+\eta^{t}_{i}}{B_{k}B_{i}}\quad\text{[by definition of $\xi^{t}_{i}$]}.$

Using the above inequality, we can bound the envy as follows:

\displaystyle\rho^{t}_{i}\leq\frac{\xi^{t}_{i}}{B_{i}}+\frac{1}{B_{i}}\max_{k}\frac{\Delta^{t}_{k}+\eta^{t}_{k}}{B_{k}}\leq\kappa\xi^{t}_{i}+\kappa^{2}\max_{k}(\Delta^{t}_{k}+\eta^{t}_{k}).

(13)

Next, we show the convergence of $\eta^{t}_{i}$ . By (7), we have

\displaystyle|\eta^{t}_{i}|\leq\sum_{\ell}|\eta^{t}_{\ell}|\leq\frac{1}{t}\sum_{\tau=1}^{t}|p^{*}_{j_{\tau}}-\beta^{\tau}_{i_{\tau}}v_{i_{\tau}j_{\tau}}|\leq\frac{1}{t}\sum_{\tau=1}^{t}\|v\|_{\infty}\|\beta^{\tau}-\beta^{*}\|_{\infty}=\gamma_{t}.

(14)

Hence, same as (11),

\displaystyle\mathbf{E}(\eta^{t}_{i})^{2}\leq\|v\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\|\beta^{\tau}-\beta^{*}\|^{2}\leq\|v\|_{\infty}^{2}\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}.

(15)

By Theorems 3 and 5, we know that $\mathbf{E}(\xi^{t}_{i})^{2}=O\left((\log t)/t\right)$ and $\mathbf{E}(\Delta^{t}_{i})^{2}=O\left((\log t)^{2}/t\right)$ . Together with (15) and (13), we have

\mathbf{E}(\rho^{t}_{i})^{2}\leq\kappa\mathbf{E}(\xi^{t}_{i})^{2}+\kappa^{2}\sum_{\ell}(\mathbf{E}(\Delta^{t}_{\ell})^{2}+\mathbf{E}(\eta^{t}_{\ell})^{2})=O\left(\frac{(\log t)^{2}}{t}\right).

Appendix C Extension to quasilinear utilities

We show that PACE can be easily extended to the case of a quasilinear (QL) market (i.e., where buyers have QL utilities). We show that most of the convergence results in §5 still holds. The static quasilinear market setup is the same as the linear case in §2 (which allows a possibly infinite item space $\Theta$ ), except the following:

•

For given item prices $p\in L^{1}_{+}$ , each buyer $i$ has a quasilinear utility function, i.e.,

$u_{i}(x_{i})=\langle v_{i},x_{i}\rangle-\langle p,x_{i}\rangle.$
•

Without loss of generality, assume $\|B\|_{1}=1$ and all buyers’ valuations are nontrivial, i.e., $\langle v_{i},s\rangle>0$ for all $i$ . Due to the structure of QL utilities, we cannot normalize the valuations $v_{i}$ and budgets $B_{i}$ separately without loss of generality. Instead, they can only be scaled at the same time by the same constant.

Same as before, each buyer $i$ has a budget $B_{i}>0$ and can only choose among budget-feasible allocations, that is, $x_{i}$ such that $\langle p,x_{i}\rangle\leq B_{i}$ . In this case, an allocation-price pair $(x^{*},p^{*})$ is a quasilinear market equilibrium (QLME) if the following holds (see [24, §6] and [17, §4]):

•

Buyers are optimal: $x^{*}_{i}\in D_{i}(p^{*}):=\operatorname*{arg\,max}\{\langle v_{i}-p^{*},x_{i}\rangle:x_{i}\in L^{\infty}_{+},\,\langle p^{*},x_{i}\rangle\leq B_{i}\}$ .
•

The market clears: $\sum_{i}x_{i}\leq s$ and $\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0$ .

As shown in [24, §6], the following pair of (possibly infinite-dimensional) convex programs capture QLME:⁷⁷7There, the authors assume $s=\mathbf{1}$ , which is w.l.o.g. for static Fisher markets. Similar to the case of Theorem 1 for linear utilities, all results can be easily extended to the case of $s\in L^{\infty}_{+}$ .

\displaystyle\begin{split}\sup\,&\sum_{i}(B_{i}\log u_{i}-\delta_{i})\\ {\rm s.t.}&u_{i}\leq\langle v_{i},x_{i}\rangle+\delta_{i},\,\forall\,i\in[n],\\ &\sum_{i}x_{i}\leq s,\,\\ &u_{i}\geq 0,\ \delta_{i}\geq 0,\ x_{i}\in L_{1}(\Theta)_{+},\ \forall\,i\in[n].\end{split}

(

\mathcal{P}_{\rm QLEG}

)

\displaystyle\begin{split}\inf\,&\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}\\ {\rm s.t.}&p\geq\beta_{i}v_{i},\ \beta_{i}\leq 1,\ \forall\,i\in[n],\\ &p\in L_{1}(\Theta)_{+},\ \beta\in\mathbb{R}^{d}_{+}.\end{split}

(

\mathcal{D}_{\rm QLEG}

)

In the sequel, we use $(x^{*},u^{*},\delta^{*})$ to denote an optimal solution of ( $\mathcal{P}_{\rm QLEG}$ ) (in which $u^{*}$ and $\delta^{*}$ are unique) and $(p^{*},\beta^{*})$ to denote the optimal solution of ( $\mathcal{D}_{\rm QLEG}$ ). As shown in [24, §6], the following KKT conditions of ( $\mathcal{P}_{\rm QLEG}$ ) and ( $\mathcal{D}_{\rm QLEG}$ ) are necessary and sufficient for $(x^{*},p^{*})$ being a QLME.

•

$\delta^{*}_{i}(1-\beta^{*}_{i})=0$ for all $i$ (complementary slackness).
•

$u^{*}_{i}=B_{i}/\beta^{*}_{i}$ for all $i$ .
•

$p^{*}=\max_{i}\beta^{*}v_{i}$ (a.e.) for all $j$ .
•

$\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0$ for all $i$ .

Let $(x^{*},p^{*})$ denote a QLME. The equilibrium utility of buyer $i$ (i.e., the amount of utility buyer $i$ receives at a QLME) is

u^{\rm QLME}_{i}:=\langle v_{i}-p^{*},x^{*}_{i}\rangle=(1-\beta^{*}_{i})\langle v_{i},x^{*}_{i}\rangle=(1-\beta^{*}_{i})(u^{*}_{i}-\delta^{*}_{i}),

which is unique and does not depend on the choice of the equilibrium allocation $x^{*}$ . In general, $u^{\rm QLME}_{i}$ is not the same as $u^{*}_{i}$ in the optimal solution of ( $\mathcal{P}_{\rm QLEG}$ ). In comparison, the term $\langle v_{i},x^{*}_{i}\rangle$ can be viewed as the equilibrium gross utility before subtracting the price $\langle p^{*},x^{*}_{i}\rangle$ of the allocation $x^{*}_{i}$ . The above equilibrium quantities satisfy the following Gao and Kroer [24, §6].

•

If $\beta^{*}_{i}=1$ , then $\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0$ implies its gross utility and expenditure are equal, which give an equilibrium utility of zero:

\langle v_{i},x^{*}_{i}\rangle=\beta^{*}_{i}\langle v_{i},x^{*}_{i}\rangle=\langle p^{*},x^{*}_{i}\rangle=u^{*}_{i}-\delta^{*}_{i}\ \Rightarrow\ u^{\rm QLME}_{i}=\langle v_{i}-p^{*},x^{*}_{i}\rangle=0.

•

If $\beta^{*}_{i}<1$ , then $\delta^{*}_{i}=0$ by complementary slackness (the first KKT condition above). Hence, the gross utility is $\langle v_{i},x^{*}_{i}\rangle=u^{*}_{i}$ and

u^{\rm QLME}=\langle v_{i}-p^{*},x^{*}_{i}\rangle=(1-\beta^{*}_{i})\langle v_{i},x^{*}_{i}\rangle=(1-\beta^{*}_{i})u^{*}_{i}.

Similar to the proof of [23, Lemma 5], we can show that

u^{*}_{i}\leq\langle v_{i},s\rangle+B_{i}.

Hence,

\beta^{*}=\frac{B_{i}}{u^{*}_{i}}\geq\frac{B_{i}}{\langle v_{i},s\rangle+B_{i}}>\beta^{\min}_{i}:=\frac{B_{i}}{\langle v_{i},s\rangle+2B_{i}}>0.

The choice of $\beta^{\min}_{i}$ is to ensure that $\beta^{*}_{i}-\beta^{\min}_{i}>0$ , which simplifies the analysis of the dynamics. Substituting $p=\max_{i}\beta_{i}v_{i}$ and using the bounds $\beta^{\min}_{i}\leq\beta^{*}_{i}\leq 1$ , we can solve the following convex program for the equilibrium utility prices $\beta^{*}$ , where $\beta^{\min}:=(\beta^{\min}_{1},\dots,\beta^{\min}_{n})$ :

\displaystyle\min_{\beta\in[\beta^{\min},\mathbf{1}]}\ \langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}.

(16)

Applying dual averaging to the convex program (16), we arrive at the following PACE dynamics for an online QL market (i.e., an OFM with buyers having QL utilities). At time $t$ , the following steps take place.

•

An item $\theta_{t}\in\Theta$ arrives, which determines a winner $i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t})$ .
•

The stochastic subgradient is $g^{t}=v_{i_{t}}(\theta_{t})\mathbf{e}^{(i_{t})}$ , or $g^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}$ for each $i$ .
•

Each buyer $i$ pays a price (expenditure)

$b^{t}_{i}=\beta^{t}_{i}v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}$

and receives a (net) utility of

$\displaystyle u^{t}_{i}=g^{t}_{i}-b^{t}_{i}=(1-\beta^{t}_{i})v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\},$ (17)

which is is the value of the item minus the price paid. Here, only the winning buyer $i_{t}$ may get a potentially nonzero utility $u^{t}_{i_{t}}$ ; other buyers $i\neq i_{t}$ gets $0$ (and pays zero).
•

Update the dual average: for each $i$ , $\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}g^{t}$ , which ensures $\bar{g}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}$ for all $i$ (same as in the linear case).

•

Compute the next pacing multiplier (similar to the linear case, except the lower bound for $\beta^{t}_{i}$ being $\beta^{\min}_{i}$ instead of $B_{i}$ ):

\beta^{t+1}_{i}=\operatorname*{arg\,min}_{\beta_{i}\in[\beta^{\min}_{i},1]}\left\{\bar{g}^{t}_{i}\beta_{i}-B_{i}\log\beta_{i}\right\}\ \Rightarrow\ \beta^{t+1}_{i}=\Pi_{[\beta^{\min}_{i},1]}\left(\frac{B_{i}}{\bar{g}^{t}_{i}}\right).

Same as in the linear case, we do not need any distributional assumption on the item arrivals to run PACE. In subsequent convergence analysis, however, we assume that the items $\theta_{t}$ are drawn i.i.d. from a distribution $s$ (i.e., $s\in L^{\infty}_{+}$ and $s(\Theta)=1$ ). We also assume that $v_{i}\in L^{\infty}_{+}$ for all $i$ . Let $(x^{*},p^{*})$ denote a QLME of the underlying static QL market with supplies $s$ .

Convergence of QL pacing multipliers.

Analogous to Theorem 3, in the QL case, we can show that the pacing multipliers $\beta^{t}$ converge to the equilibrium utility prices $\beta^{*}$ in mean square. It is a direct consequence of the general convergence result of dual averaging (Theorem 2).

Theorem 7.

For $t=1,2,\dots$ , it holds that

\mathbf{E}\|\beta^{t}-\beta^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where $G^{2}$ , $\kappa$ are the same as in Theorem 3 and

\sigma=\min_{i}\min_{\beta_{i}\in[\beta^{\min}_{i},1]}\frac{B_{i}}{\beta_{i}^{2}}=\min_{i}B_{i}=\frac{1}{\kappa}.

Convergence of QL utilities and expenditures.

Next, we show mean-square convergence of time-averaged utilities $\bar{u}^{t}=\frac{1}{t}\sum_{\tau=1}^{t}u^{t}_{i}$ and expenditures $\bar{b}^{t}=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}$ . In the QL case, the dual average $g^{t}_{i}$ can also be viewed as the per-period gross utility before subtracting the price $\beta^{t}_{i}v_{i}(\theta_{t})$ . In this way, $\bar{g}^{t}_{i}$ is the time-averaged gross utility of buyer $i$ .

Theorem 8.

For $t=1,2,\dots$ and each $i$ , the following holds.

•

If $\beta^{*}_{i}=1$ , then $u^{\rm QLME}_{i}=0$ and $B_{i}=u^{*}_{i}$ . In this case,

\displaystyle\mathbf{E}\left(u^{t}_{i}-u^{\rm QLME}_{i}\right)^{2}=\mathbf{E}|u^{t}_{i}|^{2}\leq\|v_{i}\|_{\infty}^{2}\mathbf{E}(1-\beta^{t}_{i})^{2}=O\left(\frac{\log t}{t}\right).

(18)

•

If $\beta^{*}_{i}<1$ , then $u^{\rm QLME}_{i}>0$ , $\delta^{*}_{i}=0$ and $u^{\rm QLME}=(1-\beta^{*}_{i})u^{*}_{i}$ . In this case, the gross utility $\bar{g}^{t}_{i}$ , realized (net) utility $\bar{u}^{t}_{i}$ and expenditures $\bar{b}^{t}_{i}$ converge as follows:

	$\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}\leq C_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{}_{i})^{2}=O\left(\frac{\log t}{t}\right),$
	$\displaystyle\mathbf{E}\left(\bar{u}^{t}_{i}-u^{\rm QLME}_{i}\right)^{2}\leq R^{t}_{i},\ \ \mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2}\leq R^{t}_{i},$

where

	$\displaystyle C_{i}$	$\displaystyle=\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\frac{\left(\frac{\\|v_{i}\\|_{1}}{m}+2B_{i}\right)^{4}}{B_{i}^{2}},\ \ \epsilon_{i}=\min\{1-\beta^{}_{i},\beta^{}_{i}-\beta^{\min}_{i}\},$
	$\displaystyle R^{t}_{i}$	$\displaystyle=2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}+\frac{\\|v_{i}\\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{}_{i})^{2}\right],\ \ \mathbf{E}R^{t}_{i}=O\left(\frac{(\log t)^{2}}{t}\right).$

Hence, the mean-square error $\mathbf{E}\left\|\bar{u}^{t}-u^{\rm QLME}\right\|^{2}$ is either $O((\log t)^{2}/t)$ (when some $\beta^{*}_{i}<1$ ) or $O((\log t)/t)$ (when all $\beta^{*}_{i}=1$ ).

Proof.

The $\beta^{*}_{i}=1$ case is clear. We prove the $\beta^{*}_{i}<1$ case.

Convergence of $\bar{g}^{t}_{i}$ .

Denote the event $A^{t}_{i}=\{B_{i}\leq\bar{g}^{t}_{i}\leq B_{i}/\beta^{\min}_{i}\}$ . Then, $(A^{t}_{i})^{c}$ means $B_{i}/\bar{g}^{t}_{i}\notin[\beta^{\min}_{i},1]$ and hence $|\beta^{t+1}-\beta^{*}_{i}|>\epsilon_{i}$ . Similar to the linear case, we deduce

\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}\geq\mathbf{P}[(A^{t}_{i})^{c}]\epsilon_{i}^{2}\Rightarrow\ \mathbf{P}[(A^{t}_{i})^{c}]\leq\frac{1}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Furthermore, since $0\leq\bar{g}^{t}_{i}\leq\|v_{i}\|_{\infty}$ (same as in the linear case) and $u^{*}_{i}\leq\langle v_{i},s\rangle+B_{i}$ , we have

	$\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}$	$\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{}_{i}\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{}_{i}}-1\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{}_{i}}{\beta^{t+1}_{i}}\right)^{2}$
		$\displaystyle\leq\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{}_{i})^{2}+\left(\frac{u^{}_{i}}{\beta^{\min}_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{\left(\langle v_{i},s\rangle+B_{i}\right)\left(\langle v_{i},s\rangle+2B_{i}\right)}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle=C_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}=O\left(\frac{\log t}{t}\right).$

Convergence of expenditures $\bar{b}^{t}_{i}$ .

Similar to the linear case, note that $\bar{b}^{t}_{i}$ can be decomposed as follows (where $x^{\tau}_{i}:=\mathbb{I}\{i=i_{\tau}\}$ denotes whether buyer $i$ wins at time step $\tau$ ):

\displaystyle\bar{b}^{t}_{i}:=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{i}(\theta_{\tau})x^{\tau}_{i}=\beta^{*}_{i}\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}.

Hence, using $\beta^{*}_{i}=B_{i}/u^{*}_{i}\leq 1$ , $(x+y)^{2}\leq 2(x^{2}+y^{2})$ , convexity of $(\cdot)^{2}$ and $v_{i}(\theta_{t})x^{\tau}_{i}\leq\|v_{i}\|_{\infty}$ , we have

$\displaystyle\mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2}$	$\displaystyle\leq 2\left[\mathbf{E}(\beta^{}_{i}\bar{g}^{t}_{i}-B_{i})^{2}+\mathbf{E}\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}\right)^{2}\right]$
	$\displaystyle\leq 2\left[(\beta^{}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}+\frac{\\|v_{i}\\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right]$
	$\displaystyle\leq 2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}+\frac{\\|v_{i}\\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{}_{i})^{2}\right]=R^{t}_{i}.$	(19)

The order of $\mathbf{E}R^{t}_{i}$ is given by those of $\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}$ and $\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})$ , which are $O((\log t)/t)$ and $O((\log t)^{2}/t)$ , respectively.

Convergence of utilities $\bar{u}^{t}_{i}$ .

For a buyer $i$ with $\beta^{*}_{i}<1$ , let

\epsilon_{i}=\min\{1-\beta^{*}_{i},\beta^{*}_{i}-\beta^{\min}_{i}\}>0.

Express $\bar{u}^{t}_{i}$ as follows:

	$\displaystyle\bar{u}^{t}_{i}$	$\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}(1-\beta^{\tau}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}$
		$\displaystyle=(1-\beta^{}_{i})\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}.$

Since $u^{\rm QLME}_{i}=(1-\beta^{*}_{i})u^{*}_{i}$ , similar to (19), we have

	$\displaystyle\mathbf{E}\left(\bar{u}^{t}_{i}-u^{\rm QLME}_{i}\right)^{2}$	$\displaystyle\leq 2\left[(1-\beta^{}_{i})\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}+\mathbf{E}\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}\right)^{2}\right]$
		$\displaystyle\leq 2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{}_{i})^{2}+\frac{\\|v_{i}\\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{}_{i})^{2}\right]=R^{t}_{i}.$

Finally, if all $\beta^{*}_{i}=1$ , (18) implies that $\mathbf{E}\|\bar{u}^{t}-\bar{u}^{\rm QLME}\|^{2}=O((\log t)/t)$ . It some $\beta^{*}_{i}<1$ , since $\mathbf{E}R^{t}_{i}=O((\log t)^{2}/t)$ , so is $\mathbf{E}\|\bar{u}^{t}-\bar{u}^{\rm QLME}\|^{2}$ . ∎

Appendix D More details on the experiments

In each experiment, we will have some underlying valuations, items will be drawn one-at-a-time, uniformly at random, from the set of possible items, on which we run the PACE dynamics. We have several outcome measures of interest for asking how close we are to the static equilibrium quantities at each point. First, we look at convergence of realized utilities. In each case we consider the realized utilities up to time $t$ and look at the deviation from equilibrium utility normalized by the equilibrium utility level. We look at both the average and the worst-case deviations. Formally these are calculated as $\|(\bar{u}^{t}-u^{*})/u^{*}\|_{1}/n$ for the average deviation and $\|(\bar{u}^{t}-u^{*})/u^{*}\|_{\infty}$ for the maximum (over buyers) deviation. We also measure deviations of the pacing multiplier $\beta^{t}$ from $\beta^{*}$ and deviations of time-averaged cumulative expenditure $\bar{b}^{t}$ from buyers’ budgets $B=(B_{1},\dots,B_{n})$ using analogous normalizations. In the plots, we add horizontal lines for the same error measures for the proportional shares of the static underlying Fisher market (each buyer receiving $B_{i}$ of each item), a ‘baseline’ solution.

We consider 3 different market datasets. The first two datasets are recommender systems which we turn into markets. The final is taken from a survey experiment. We point the reader to [31] for a more in-depth discussion and exploratory data analysis of these $3$ datasets. The first dataset uses MovieLens [27]. MovieLens is a dataset of individual ratings of movies, [31] turn it into a market by using matrix completion to fill in missing user-movie ratings, they then take the top 1500 most active users and 1500 most rated movies and set the valuations $v_{ij}$ as the predicted ratings from the matrix completion. We also use the Jester Jokes dataset [26]. Here, we have $7200$ individuals that have rated $100$ jokes. We treat the jokes as the item to be allocated.Finally, we use the Household Items dataset introduced in [31]. Here we have $2876$ survey takes entering a willingness to pay for $50$ household items (vacuum cleaners, toasters, gas grills, etc.). For each dataset, we first rescale (w.l.o.g.) buyer valuations as described in §5.

We also consider an experiment on a simple infinite-dimensional market instance (which we refer to as “Inf-Dim”) of $n=100$ buyers and item space $\Theta=[0,1]$ , similar to the examples in [24, §4.2]. Let each buyer valuation $v_{i}$ be normalized linear functions on $[0,1]$ , that is, $v_{i}(\theta)=c_{i}(\theta)+d_{i}$ such that $v_{i}(\Theta):=\int_{\Theta}v_{i}d\mu=\int_{0}^{1}v_{i}(\theta)d\theta=1\ \Leftrightarrow\ \frac{c_{i}}{2}+d_{i}=1$ . We randomly generate $(c_{i},d_{i})$ , $i=1,\dots,n$ and run the dynamics for $T=100n$ time steps.

For the finite dimensional datasets we compute equilibrium utilities $u^{*}$ and utility prices $\beta^{*}$ by solving the corresponding static instances using standard methods. For the infinite dimensional synthetic instance, we use the approach based on convex conic reformulation [24, §4] to compute $\beta^{*}$ .

Figure 1 in §6 contains the plots for the MovieLens, Household Items and Inf-Dim datasets. Figure 2 contains the plots for the Jokes dataset.

Since items arrive one at a time, $t=100$ time steps in a market with $n=10$ buyers is very different from the same number of time steps in a market with $n=1000$ buyers. To deal with this, we run PACE for $T=100n$ time steps, referring to each $n$ time steps as an epoch.

We record the average and maximum values of relative errors of the pacing multipliers $\beta^{t}$ , time-averaged cumulative utilities $\bar{u}^{t}$ and time-averaged expenditures $\bar{b}^{t}$ .

Convergence of expenditures to total budget.

For each $i$ , the quantity

\left|\frac{\bar{b}^{t}_{i}-B_{i}}{B_{i}}\right|=\left|\frac{\sum_{\tau=1}^{t}b^{t}_{i}-tB_{i}}{tB_{i}}\right|

can be viewed as the relative deviation of current cumulative expenditure at time $t$ from the total budget $tB_{i}$ available up to $t$ . Hence, the residuals $\left\|(\bar{b}^{t}_{i}-B)/B\right\|/n$ and $\left\|(\bar{b}^{t}_{i}-B)/B\right\|_{\infty}$ are the average and maximum such deviations across all buyers. For each dataset (MovieLens, Household, Jokes and Inf-Dim), we plot the various quartiles of these residuals across all seeds, as shown in Figure 3.

	$\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}$	$\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{}_{i}\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{}_{i}}-1\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{}_{i}}{\beta^{t+1}_{i}}\right)^{2}$
		$\displaystyle\leq\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{}_{i})^{2}+\left(\frac{(1+\delta_{0})u^{}_{i}}{B_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.$

	$\displaystyle\mathbf{E}\\|\bar{g}^{t}-u^{*}\\|^{2}$	$\displaystyle\leq\sum_{i}\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\\|v\\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\sum_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\\|v\\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}$
		$\displaystyle=C\cdot\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}.$

	$\displaystyle\mathbf{E}\\|\bar{b}^{t}-B\\|^{2}$	$\displaystyle\leq 2\left[\mathbf{E}\\|\bar{g}^{t}-u^{}\\|^{2}+\\|v\\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\\|\beta^{\tau}-\beta^{}\\|^{2}\right]$
		$\displaystyle\leq 2\left[C\cdot\frac{(6+\log t)G^{2}}{t\sigma^{2}}+\\|v\\|_{\infty}^{2}\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}\right]$
		$\displaystyle=\frac{2G^{2}}{t\sigma^{2}}\left(6(C+\\|v\\|_{\infty}^{2})+(C+6\\|v\\|_{\infty}^{2})\log t+\frac{\\|v\\|_{\infty}^{2}}{2}(\log t)^{2}\right).$

$\displaystyle\left\|p^{*}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right\|$	$\displaystyle\leq\left\|\max_{i}\beta^{*}_{i}v_{i}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right\|$
	$\displaystyle\leq\max_{i}\|\beta^{*}_{i}v_{i}(\theta)-\beta^{t}_{i}v_{i}(\theta)\|$
	$\displaystyle\leq\\|v\\|_{\infty}\\|\beta^{t}-\beta^{*}\\|_{\infty}.$	(7)

	$\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}$	$\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{}_{i}\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{}_{i}}-1\right)^{2}\right]$
		$\displaystyle\leq\\|v_{i}\\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{}_{i}}{\beta^{t+1}_{i}}\right)^{2}$
		$\displaystyle\leq\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{}_{i})^{2}+\left(\frac{u^{}_{i}}{\beta^{\min}_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle\leq\left(\frac{\\|v_{i}\\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{\left(\langle v_{i},s\rangle+B_{i}\right)\left(\langle v_{i},s\rangle+2B_{i}\right)}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}$
		$\displaystyle=C_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}=O\left(\frac{\log t}{t}\right).$

Online Market Equilibrium with Application to Fair Division

Abstract

1 Introduction

Main contributions.

2 Static and Online Fisher Markets

Static Fisher markets and equilibria.

Definition 1.

Definition 2.

Theorem 1.

Online Fisher markets and equilibria.

Definition 3.

3 The PACE Dynamics

4 Dual Averaging

Theorem 2.

5 Convergence analysis of the PACE dynamics

Convergence of pacing multipliers.

Theorem 3.

Convergence of utilities.

Theorem 4.

Convergence of expenditures.

Theorem 5.

PACE attains OME asymptotically.

Theorem 6.

6 Experiments

7 Conclusion

References

Appendix A Related Work

Appendix B Proofs, Derivations and Examples

Proof of Theorem 1

An example: the distributional assumption on item arrivals

Reformulation of (𝒟EG\mathcal{D}_{\rm EG}) into (3)

Lemma 1.

Proof.

Proof of Theorem 3

Proof of Theorem 4

Proof of Theorem 5

Proof of Theorem 6

Analysis of regret ritr^{t}_{i}.

Analysis of envy ρit\rho^{t}_{i}.

Appendix C Extension to quasilinear utilities

Convergence of QL pacing multipliers.

Theorem 7.

Convergence of QL utilities and expenditures.

Theorem 8.

Proof.

Convergence of g¯it\bar{g}^{t}_{i}.

Convergence of expenditures b¯it\bar{b}^{t}_{i}.

Convergence of utilities u¯it\bar{u}^{t}_{i}.

Appendix D More details on the experiments

Convergence of expenditures to total budget.

Reformulation of ( $\mathcal{D}_{\rm EG}$ ) into (3)

Analysis of regret $r^{t}_{i}$ .

Analysis of envy $\rho^{t}_{i}$ .

Convergence of $\bar{g}^{t}_{i}$ .

Convergence of expenditures $\bar{b}^{t}_{i}$ .

Convergence of utilities $\bar{u}^{t}_{i}$ .