This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Online Market Equilibrium with Application to Fair Division

Yuan Gao
Columbia University
[email protected]
&Christian Kroer
Columbia University
[email protected]
email
&Alex Peysakhovich
Facebook AI Research
[email protected]
Abstract

Computing market equilibria is a problem of both theoretical and applied interest. Much research to date focuses on the case of static Fisher markets with full information on buyers’ utility functions and item supplies. Motivated by real-world markets, we consider an online setting: individuals have linear, additive utility functions; items arrive sequentially and must be allocated and priced irrevocably. We define the notion of an online market equilibrium in such a market as time-indexed allocations and prices which guarantee buyer optimality and market clearance in hindsight. We propose a simple, scalable and interpretable allocation and pricing dynamics termed as PACE. When items are drawn i.i.d. from an unknown distribution (with a possibly continuous support), we show that PACE leads to an online market equilibrium asymptotically. In particular, PACE ensures that buyers’ time-averaged utilities converge to the equilibrium utilities w.r.t. a static market with item supplies being the unknown distribution and that buyers’ time-averaged expenditures converge to their per-period budget. Hence, many desirable properties of market equilibrium-based fair division such as no envy, Pareto optimality, and the proportional-share guarantee are also attained asymptotically in the online setting. Next, we extend the dynamics to handle quasilinear buyer utilities, which gives the first online algorithm for computing first-price pacing equilibria. Finally, numerical experiments on real and synthetic datasets show that the dynamics converges quickly under various metrics.

1 Introduction

A market is said to be in equilibrium when supply is equal to demand. Computing prices and allocations which constitute a market equilibrium (ME) has long been a topic of interest [43, 28, 38, 20, 17, 31]. Most existing work focuses on the case of static markets. However, in this paper we consider the case of online markets where items arrive sequentially. We consider the extension of market equilibrium to this setting and provide market dynamics which quickly converge to an equilibrium in the case of online Fisher markets.

In static Fisher markets there is a fixed supply of each item, individual preferences are linear, additive, and items are divisible (or equivalently, randomization is allowed so individuals can purchase not just items but lotteries over items). In general, finding market equilibria is a hard problem [14, 47, 39]. However, in static linear Fisher markets, equilibrium prices and allocations can be computed via solving the Eisenberg-Gale (EG) convex program [22, 37].

We consider an online extension of Fisher markets where buyers are constantly present but items arrive one-at-a-time. Buyers’ budgets are per-period and represent their respective ‘bidding powers’ instead of being binding constraints. We extend the definition of market equilibrium to the online setting: online equilibrium allocations and prices are time-indexed and, when averaged across time, form an equilibrium in a corresponding static Fisher market where item supplies are proportional to item arrival probabilities. Due to the stochastic nature of online Fisher markets, any online algorithm can only attain an online market equilibrium asymptotically, that is, the allocations and prices approximately satisfy the equilibrium conditions after running the algorithm for a long time.

We propose market dynamics that find these equilibria in an online fashion based on the dual averaging algorithm applied to a reformulation of the dual of the EG convex program. We refer to this mechanism as PACE (Pace According to Current Estimated utility). In PACE, each buyer is assigned a utility pacing multiplier at time 0. When an item arrives, the individual with the highest adjusted utility (its valuation times the multiplier) receives that item and pays a price equal to its adjusted utility. The pacing multipliers of all individuals are then adjusted according to a closed-form rule which is given by the time average of the subgradient of the dual of the EG program. Intuitively, the pacing multipliers of those that did not receive the item go up while the receiver’s typically (but not always) goes down. We show that PACE yields item allocations and prices that satisfy various equilibrium properties asymptotically, for example no-regret and envy-freeness.

One important application of market equilibrium is fair allocation using the competitive equilibrium from equal incomes (CEEI) mechanism [46, 12]. In CEEI, each individual is given an endowment of faux currency and reports her valuations for items; then, a market equilibrium is computed and the items are allocated accordingly. However, many fair division problems are online rather than static. These include the allocation of impressions to content in certain recommender systems [34], workers to shifts, donations to food banks [2], scarce compute time to requestors [25, 40, 29], or blood donations to blood banks [32]. Similarly, online advertising can also be thought of as the allocation of impressions to advertisers via a market though with a budget of real money rather than faux currency. In the static CEEI case with linear additive preferences, the resulting equilibrium outcomes (i.e. results of the EG program) have been described as “perfect justice” [3]. In the online case, PACE achieves the same fair allocations as CEEI asymptotically. See Appendix A for more related work in the areas of (static and online) equilibrium computation and fair division.

We evaluate PACE experimentally in several market datasets. Convergence to good outcomes happens quickly in experiments. Taken together our results, we conclude that PACE is an attractive algorithm for both computing online market equilibria and online fair division.

Main contributions.

We consider the problem of allocating and pricing sequentially arriving items to nn buyers. This setting is termed as an online Fisher market. Given a sequence of item arrivals, we define an online market equilibrium as the items’ allocations and prices that, in hindsight, ensure buyer optimality and market clearance. We propose the PACE dynamics, which can be viewed as a nontrivial instantiation of the dual averaging algorithm on a reformulation of the dual of the Eisenberg-Gale convex program. Leveraging the convergence theory of dual averaging, we show that, when item arrivals are drawn from an (unknown) underlying distribution ss, possibly over an infinite/continuous item space, PACE ensures the following.

  • The pacing multipliers generated by PACE converge to the static equilibrium utility prices. Here, “static” means w.r.t. to an underlying static Fisher market.

  • Buyers’ time-averaged utilities converge to the static equilibrium utilities.

  • Buyers’ time-averaged expenditures converge to their respective budgets.

These convergences are all in mean square with rates O((logt)/t)O((\log t)/t), O((logt)/t)O((\log t)/t) and O((logt)2/t)O((\log t)^{2}/t), respectively, where the constants in these rates involve moderate polynomials of nn. In this way, PACE generates allocations and prices that constitute an online market equilibrium in the limit. In particular, the allocations and prices ensure that the allocation is Pareto optimal, and buyers have no regret, no envy, and get at least their proportional share asymptotically. We also extend PACE to the case of quasilinear buyer utilities, which yields the first online algorithm for computing first-price pacing equilibria. Finally, numerical experiments suggest that PACE converges much faster than its theoretical rates in terms of pacing multipliers, utilities and expenditures.

2 Static and Online Fisher Markets

Static Fisher markets and equilibria.

We first introduce static Fisher markets and their equilibria. Following the recent work [24, §2], we consider a measurable (possibly continuous) item space. Below are the technical preliminaries for the subsequent online setting. They can be skimmed through and referred back to as needed.

From now on, we define [k]:={1,,k}[k]:=\{1,\dots,k\} for any k:={0,1,2,}k\in\mathbb{N}:=\{0,1,2,\dots\} and +\mathbb{R}_{+} (++\mathbb{R}_{++}, resp.) as the set of nonnegative (positive, resp.) real numbers. Let 𝕀{A}{0,1}\mathbb{I}\{A\}\in\{0,1\} denote the indicator function of an event AA.

  1. (a)

    There are nn buyers (individuals), each having a budget Bi>0B_{i}>0.

  2. (b)

    The item space is a finite measurable space (Θ,,μ)(\Theta,\mathcal{M},\mu) with 0<μ(Θ)<0<\mu(\Theta)<\infty, where Θ\Theta is a (μ\mu-)measurable subset of d\mathbb{R}^{d}, \mathcal{M} is a σ\sigma-algebra and μ:+\mu:\mathcal{M}\rightarrow\mathbb{R}_{+} is a (finite) measure. From now on, LpL^{p} (and L+pL^{p}_{+}, resp.) denote the set of (nonnegative, resp.) LpL^{p} functions on Θ\Theta for any p[1,]p\in[1,\infty] (including p=p=\infty). Below are some concrete special cases for illustration.

    1. (i)

      Finite: Θ=[m]\Theta=[m], =2[m]={A:A[m]}\mathcal{M}=2^{[m]}=\{A:A\subseteq[m]\} and μ(A)=aAμ(a)\mu(A)=\sum_{a\in A}\mu(a) (all 2m2^{m} subsets are measurable and the measure is given by a point mass on each item).

    2. (ii)

      Lebesgue-measurable: μ\mu is the Lebesgue measure on d\mathbb{R}^{d}, \mathcal{M} is the Lebesgue σ\sigma-algebra and Θ\Theta is a (Lebesgue-)measurable subset of d\mathbb{R}^{d} with positive finite measure. For example, Θ\Theta can be a compact subset of d\mathbb{R}^{d} with a nonempty interior.

    3. (iii)

      Countably infinite: Θ=\Theta=\mathbb{N} and μ(A)=aAμ(a)\mu(A)=\sum_{a\in A}\mu(a) for any AA\subseteq\mathbb{N}, where μ()<0\mu(\mathbb{N})<0. For example, μ(a)\mu(a) can be the probability mass of a Poisson distribution, in which case (,,μ)(\mathbb{N},\mathcal{M},\mu) is a probability space.

  3. (c)

    The supplies of items is sL+s\in L^{\infty}_{+}, i.e., item θΘ\theta\in\Theta has supply s(θ)s(\theta). Since Θ\Theta is compact, it is measurable with a finite measure. For the finite case Θ=[m]\Theta=[m], we have s=(s1,,sm)+ms=(s_{1},\dots,s_{m})\in\mathbb{R}^{m}_{+}.

  4. (d)

    The valuation of each buyer ii on all items is viL+1v_{i}\in L^{1}_{+}, i.e., buyer ii has valuation vi(θ)v_{i}(\theta) on item θΘ\theta\in\Theta. For the finite case Θ=[m]\Theta=[m], we have vi=(vi1,,vim)+mv_{i}=(v_{i1},\dots,v_{im})\in\mathbb{R}^{m}_{+}.

  5. (e)

    For buyer ii, an allocation of items xiL+x_{i}\in L^{\infty}_{+} gives a utility of

    ui(xi):=vi,xi:=Θvi(θ)xi(θ)𝑑θ,u_{i}(x_{i}):=\langle v_{i},x_{i}\rangle:=\int_{\Theta}v_{i}(\theta)x_{i}(\theta)d\theta,

    where the angle brackets are based on the notation of applying a bounded linear functional xix_{i} to a vector viv_{i} in the Banach space L1L^{1} and the integral is the usual Lebesgue integral. For the finite case Θ=[m]\Theta=[m], we have xi=(xi1,,xim)+mx_{i}=(x_{i1},\dots,x_{im})\in\mathbb{R}^{m}_{+} and the utility is

    ui(xi)=vi,xi=jvijxij,u_{i}(x_{i})=\langle v_{i},x_{i}\rangle=\sum_{j}v_{ij}x_{ij},

    the usual Euclidean vector inner product. We will use x(L+)nx\in(L^{\infty}_{+})^{n} to denote the aggregate allocation of items to all buyers, i.e., the concatenation of all buyers’ allocations.

  6. (f)

    The prices of items are modeled as pL+1p\in L^{1}_{+}; in other words, the price of item θΘ\theta\in\Theta is p(θ)p(\theta). For the finite case Θ=[m]\Theta=[m], we have p=(p1,,pm)+mp=(p_{1},\dots,p_{m})\in\mathbb{R}^{m}_{+}.

  7. (g)

    For a measurable item subset AΘA\subseteq\Theta, let vi(A):=Avi(θ)𝑑θv_{i}(A):=\int_{A}v_{i}(\theta)d\theta (and similarly for p(A)p(A) and s(A)s(A)), the viv_{i}-induced measure of AA. For the finite case Θ=[m]\Theta=[m], for any item subset A[m]A\subset[m], vi(A)=jAvijv_{i}(A)=\sum_{j\in A}v_{ij} (and similarly for p(A)p(A) and s(A)s(A)).

  8. (h)

    Without loss of generality, we assume a unit total budget B1=1\|B\|_{1}=1, a unit total supply s(Θ)=1s(\Theta)=1 and normalized buyer valuations vi,s=1\langle v_{i},s\rangle=1. In other words, all items have a total value of 11 for every buyer.

Definition 1.

Given item prices pL+1p\in L^{1}_{+}, the demand of buyer ii is its set of utility-maximizing allocations given the prices and budget:

Di(p):=argmax{vi,xi:xiL+,p,xiBi}.D_{i}(p):=\operatorname*{arg\,max}\{\langle v_{i},x_{i}\rangle:x_{i}\in L^{\infty}_{+},\,\langle p,x_{i}\rangle\leq B_{i}\}.

The associated utility level U^i(p)\hat{U}_{i}(p) is defined as the value of vi,xi\langle v_{i},x_{i}\rangle for any xiDi(p)x_{i}\in D_{i}(p).

Definition 2.

A market equilibrium (ME) is an allocation-price pair (x,p)(L+)n×L+1(x^{*},p^{*})\in(L^{\infty}_{+})^{n}\times L^{1}_{+} such that the following holds.

  1. (i)

    Supply feasibility: ixis\sum_{i}x^{*}_{i}\leq s.

  2. (ii)

    Buyer optimality: xiDi(p)x^{*}_{i}\in D_{i}(p^{*}) for all ii.

  3. (iii)

    Market clearance: p,sixi=0\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0 (any item with a positive price is fully allocated).

In the above definition and subsequently, all equations involving measurable functions are understood as “holding almost everywhere.” For example, ixis\sum_{i}x_{i}\leq s means the (measurable) set {θΘ:ixi(θ)s(θ)}\{\theta\in\Theta:\sum_{i}x_{i}(\theta)\leq s(\theta)\} has the same measure as Θ\Theta. Given a ME (x,p)(x^{*},p^{*}), we often denote the (unique) equilibrium utilities as ui=vi,xiu^{*}_{i}=\langle v_{i},x^{*}_{i}\rangle. For a finite-dimensional linear Fisher market, it is well known that a ME can be computed via solving the EG convex program. Recently, [24] generalized this framework to handle the case of an infinite item space. More specifically, consider the following (possibly infinite-dimensional) convex programs.

supx(L+)niBilogvi,xis.t.ixis.\displaystyle\sup_{x\in(L^{\infty}_{+})^{n}}\sum_{i}B_{i}\log\langle v_{i},x_{i}\rangle\ \ {\rm s.t.}\ \sum_{i}x_{i}\leq s. (𝒫EG\mathcal{P}_{\rm EG})
infpL+1,β+n(p,siBilogβi)s.t.pβivi,i.\displaystyle\begin{split}\inf_{p\in L^{1}_{+},\,\beta\in\mathbb{R}^{n}_{+}}\left(\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}\right)\ \ {\rm s.t.}\ p\geq\beta_{i}v_{i},\ \forall\,i.\end{split} (𝒟EG\mathcal{D}_{\rm EG})

The following theorem summarizes the results in [24, §3] regarding the above convex programs capturing market equilibria. As shown in that work, the above convex programs satisfy strong duality and their optimal solutions (which correspond to ME) can be characterized by the KKT optimality conditions. We slightly generalize the assumptions of [24] by allowing non-uniform item supplies ss instead of s(θ)=1s(\theta)=1 for all θΘ\theta\in\Theta. For completeness, a proof, which is mainly based on the proofs of the results in [24, §3], can be found in the Appendix.

Theorem 1.

The following hold regarding (𝒫𝐸𝐺\mathcal{P}_{\rm EG}) and (𝒟𝐸𝐺\mathcal{D}_{\rm EG}).

  • Both suprema are attained.

  • Given xx^{*} feasible to (𝒫𝐸𝐺\mathcal{P}_{\rm EG}) and (p,β)(p^{*},\beta^{*}) feasible to (𝒟𝐸𝐺\mathcal{D}_{\rm EG}), they are both optimal if and only if the following holds: (i) p,sixi=0\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0 (market clearance), (ii) pβivi,xi=0\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0 (buyer ii only receives items within its ‘winning set’ {p=βivi}\{p^{*}=\beta^{*}_{i}v_{i}\}) (ii) and vi,xi=ui:=Bi/βi\langle v_{i},x^{*}_{i}\rangle=u^{*}_{i}:=B_{i}/\beta^{*}_{i} (buyer ii gets its maximum possible utility from xix^{*}_{i}). In this case, (x,p)(x^{*},p^{*}) is a ME.

  • Conversely, for a ME (x,p)(x^{*},p^{*}), it holds that (i) xx^{*} is an optimal solution of (𝒫𝐸𝐺\mathcal{P}_{\rm EG}) and (ii) (p,β)(p^{*},\beta^{*}), where βi:=Bi/vi,xi\beta^{*}_{i}:=B_{i}/\langle v_{i},x^{*}_{i}\rangle, is an optimal solution of (𝒟𝐸𝐺\mathcal{D}_{\rm EG}).

In the above theorem, βi\beta^{*}_{i} is known as buyer ii’s utility price, i.e., price per unit utility at equilibrium. As is well known, in a ME (x,p)(x^{*},p^{*}), the allocations xx^{*} are

  • Pareto optimal,

  • envy-free (in a budget weighted sense, i.e., vi,xi/Bivi,xk/Bk\langle v_{i},x^{*}_{i}\rangle/B_{i}\geq\langle v_{i},x^{*}_{k}\rangle/B_{k} for all kik\neq i),

  • proportional (i.e., vi,xivi,s/n=1/n\langle v_{i},x^{*}_{i}\rangle\geq\langle v_{i},s\rangle/n=1/n); see, e.g.,[24, Theorem 3].

Online Fisher markets and equilibria.

We now consider a simple online variant of the Fisher market setting, referred to as an online Fisher market (OFM). There are nn buyers, each with a valuation viL+1v_{i}\in L^{1}_{+}. Assume there are discrete time steps t=1,2,t=1,2,\dots. At each time step tt, an item θt\theta_{t} arrives and each buyer ii sees a value vi(θt)v_{i}(\theta_{t}). The item must be allocated irrevocably to one buyer. Each buyer ii has a budget Bi>0B_{i}>0 representing her per-period expenditure rate. 111This assumption is similar to one made in the literature on budget management in auctions, where each buyers has a per-period expenditure rate and the overall budget equal to the rate times the number of time periods. If a hard budget cap across all time periods is desired, then PACE and similar mechanisms may deplete some buyers’ budgets close to the end of the horizon [8, 6, 7].

Next, we introduce the notions of demand, utility level and online market equilibrium in an OFM. All of them are defined based on sequences of arrived items and their prices; they do not require any distributional assumption on the item arrivals.

Definition 3.

Let the arrived items be (θτ)τ[t](\theta_{\tau})_{\tau\in[t]}. An allocation (of arrived items) is (xiτ)(τ,i)[t]×[n](x^{\tau}_{i})_{(\tau,i)\in[t]\times[n]}, where xiτ[0,1]x^{\tau}_{i}\in[0,1] is the fraction of the item θτ\theta_{\tau} allocated to buyer ii.222We allow fractional allocations in the definition for more generality. As we will see, fractional allocation is not needed: PACE generates allocations and prices that satisfy the OME conditions asymptotically via assigning each arrived item to one buyer. Let the prices of the arrived items be (pτ(θτ))τ[t](p^{\tau}(\theta_{\tau}))_{\tau\in[t]}. The demand of each buyer ii (in hindsight) at time tt is

Dit=argmax(ziτ)τ[t]{1tτ=1tvi(θτ)ziτ:0ziτ1,τ,1tτ=1tpτ(θτ)ziτBi}.\displaystyle D^{t}_{i}=\operatorname*{arg\,max}_{(z^{\tau}_{i})_{\tau\in[t]}}\left\{\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}:0\leq z^{\tau}_{i}\leq 1,\,\forall\,\tau,\ \frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{\tau})z^{\tau}_{i}\leq B_{i}\right\}. (1)

Let U^it\hat{U}^{t}_{i} be the utility level associated with this demand, i.e., the maximum value in (1). An online market equilibrium (OME) is a pair of allocations (xiτ)(τ,i)[t]×[n](x^{\tau}_{i})_{(\tau,i)\in[t]\times[n]} and prices pτ(θτ)p^{\tau}(\theta_{\tau}) such that the following holds.

  1. (i)

    Total allocation does not exceed the unit amount of the item ixiτ1\sum_{i}x^{\tau}_{i}\leq 1 for all τ\tau.

  2. (ii)

    Buyers’ realized allocations are optimal in hindsight: (xiτ)τ[t]Dit(x^{\tau}_{i})_{\tau\in[t]}\in D^{t}_{i} for all ii.

  3. (iii)

    Market clearance: ixiτ=1\sum_{i}x^{\tau}_{i}=1 for τ\tau such that pτ(θτ)>0p^{\tau}(\theta_{\tau})>0.

In words, U^it\hat{U}^{t}_{i} is the maximum possible (time-averaged) utility buyer ii could have attained via choosing from the arrived items (θτ)τ[t](\theta_{\tau})_{\tau\in[t]} in hindsight, subject to their respective posted prices (pτ(θτ))τ[t](p^{\tau}(\theta_{\tau}))_{\tau\in[t]} and her current total budget tBitB_{i}, with DitD^{t}_{i} being the set of such utility-maximizing (time-indexed) allocations subject to per-period item availability constraints. An OME is a pair of allocations and prices that make buyers optimal in hindsight and market cleared.

Given an OFM, we define the associated underlying static Fisher market as having the same nn buyers and an item space Θ\Theta with supply ss being the (unknown) distribution from which the arriving items θt\theta_{t} are drawn. To clarify the concepts of OFM and OME, we consider some simple special cases.

  • Suppose all item arrivals θ1,,θt\theta_{1},\dots,\theta_{t} are known in advance. Then, the OFM is the same as a static n×tn\times t Fisher market with the same buyers and the tt items, each having a unit supply. Here, buyer ii’s valuation of item τ\tau is viτ=vi(θτ)v_{i\tau}=v_{i}(\theta_{\tau}). To compute an OME, it suffices to solve the classical (finite-dimensional) Eisenberg-Gale convex program, that is, (𝒫EG\mathcal{P}_{\rm EG}) with Θ=[t]\Theta=[t], s=(1,,1)+ts=(1,\dots,1)\in\mathbb{R}^{t}_{+} and x+n×tx\in\mathbb{R}_{+}^{n\times t}. Let the static ME be (x,p)+n×t×++t(x^{*},p^{*})\in\mathbb{R}_{+}^{n\times t}\times\mathbb{R}_{++}^{t}. When each item θτ\theta_{\tau} arrives, OME allocates a fraction xiτx^{*}_{i\tau} of the item to each buyer ii and set its price as pτp^{*}_{\tau}.

  • Suppose the sequentially arriving items are drawn i.i.d. from a known underlying distribution sL+s\in L^{\infty}_{+} (which specifies a random variable θs\theta\sim s such that 𝐏[θA]=s(A)\mathbf{P}[\theta\in A]=s(A) for any measurable set AΘA\subseteq\Theta) and all buyers’ valuations viv_{i} are known. Suppose we have also computed a static ME (x,p)(x^{*},p^{*}) (Definition  2) of a market with buyer valuations viv_{i}, budgets BiB_{i} and item supplies being the distribution ss (the underlying static market). Then, when a new item θt\theta_{t} (which is drawn from the distribution ss) arrives at time tt, set its price as p(θt)p^{*}(\theta_{t}) and allocate a fraction xi(θt)/s(θt)x^{*}_{i}(\theta_{t})/s(\theta_{t}) of it to each buyer ii (assume s(θt)>0s(\theta_{t})>0, i.e., only items with positive supplies can appear). Then, the time-averaged utility of each buyer ii is 1tτ=1tvi(θt)xi(θt)/s(θt)\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{t})x^{*}_{i}(\theta_{t})/s(\theta_{t}), which converges to

    𝐄θs[vi(θ)xi(θ)/s(θ)]=Θvi(θ)xi(θ)𝑑θ=uia.s.\mathbf{E}_{\theta\sim s}[v_{i}(\theta)x_{i}(\theta)/s(\theta)]=\int_{\Theta}v_{i}(\theta)x^{*}_{i}(\theta)d\theta=u^{*}_{i}\ {\rm a.s.}

    by to the Strong Law of Large Numbers. Since the online process is carried out using static equilibrium prices and allocations, the static ME properties (Definition 2) ensure the required OME properties hold asymptotically.

The above special cases require full knowledge of either the exact future item arrivals or the underlying static market to attain an OME. Next, we propose a simple, distributed dynamics which generates allocations and prices that satisfy the OME conditions asymptotically without requiring such knowledge (in particular, without knowledge of the distribution ss).

3 The PACE Dynamics

In this section, we introduce the PACE (Pacing According to Current Estimated utility) dynamics that prices and allocates sequentially arriving items via (i) maintaining a pacing multiplier for each buyer and (ii) simple, distributed updates.333Pacing and pacing multipliers are terminology in budget management in large-scale ad auctions [18, 19]. In §5, we will show that PACE is an instantiation of dual averaging [48], a stochastic first-order method for regularized optimization, applied to a reformulation of (𝒟EG\mathcal{D}_{\rm EG}).

In the PACE dynamics, each buyer maintains a pacing multiplier βit\beta^{t}_{i} (with an initial value βi0=(Bi+1)/2\beta^{0}_{i}=(B_{i}+1)/2, or any value in [Bi,1][B_{i},1]). At time step tt, the following events take place.

  1. (a)

    An item θt\theta_{t} appears and each buyer ii sees a value vi(θt)v_{i}(\theta_{t}) for the item.

  2. (b)

    Each buyer ii bids their paced value βitvi(θt)\beta^{t}_{i}v_{i}(\theta_{t}) for the item.

  3. (c)

    The item is allocated to the highest bidder (the winner at tt): it=argmaxiβitvi(θt)i_{t}=\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t}), with ties broken arbitrarily. For concreteness, we always choose the lowest winning index, i.e.,

    it=minargmaxiβitvi(θt).i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t}).

    Then, the price of θt\theta_{t} is set by the first-price rule

    pt(θt)=maxiβitvi(θt)=βittvi(θt)p^{t}(\theta_{t})=\max_{i}\beta^{t}_{i}v_{i}(\theta_{t})=\beta^{t}_{i_{t}}v_{i}(\theta_{t})

    and the winner iti_{t} pays this price pt(θt)p^{t}(\theta_{t}) for the item θt\theta_{t}.

  4. (d)

    Each buyer ii gets a utility

    uit=vi(θt)𝕀{i=it}.u^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}.

    In other words, the winner iti_{t} gets vit(θt)v_{i_{t}}(\theta_{t}) and other buyers get zero.

  5. (e)

    Each buyer ii updates its cumulative average utility u¯it\bar{u}^{t}_{i}:

    u¯it=1tτ=1tuiτ=t1tu¯it1+1tuit.\bar{u}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}u^{\tau}_{i}=\frac{t-1}{t}\bar{u}^{t-1}_{i}+\frac{1}{t}u^{t}_{i}.
  6. (f)

    Each buyer ii updates their pacing multiplier βit+1\beta^{t+1}_{i} as follows:

    βit+1=Π[li,hi](Bi/u¯it):=min{max{li,Bi/u¯it},hi}.\beta^{t+1}_{i}=\Pi_{[l_{i},h_{i}]}(B_{i}/\bar{u}^{t}_{i}):=\min\{\max\{l_{i},B_{i}/\bar{u}^{t}_{i}\},h_{i}\}.

    where li=Bi/(1+δ0)l_{i}=B_{i}/(1+\delta_{0}) and hi=1+δ0h_{i}=1+\delta_{0} for some fixed δ0>0\delta_{0}>0 (e.g., δ0=0.05\delta_{0}=0.05).

As will be seen in §5, buyer ii’s equilibrium pacing multiplier (utility price) satisfies li<βi<hil_{i}<\beta^{*}_{i}<h_{i} and her per-period utility uitu^{t}_{i} corresponds to the iith component of a stochastic subgradient of a function on β\beta in a reformulation of the convex program (𝒟EG\mathcal{D}_{\rm EG}), on which we run dual averaging. Furthermore, the update rule for βit+1\beta_{i}^{t+1} is such that, if the realized utilities u¯it\bar{u}^{t}_{i} were the true static equilibrium utility for buyer ii, then βit+1\beta_{i}^{t+1} would be the equilibrium multiplier. Note that PACE does not randomize (any randomness can only come from the market environment from which item arrivals are drawn) and assigns every item to a single buyer without splitting it.

The simplicity and distributed nature of PACE makes it desirable for large-scale practical use.

  • It can be run on arbitrary sequential item arrivals and only requires buyers’ valuations vi(θt)v_{i}(\theta_{t}) on the arrived items (rather than all valuations viv_{i} over the potentially large item space). No parameter tuning is needed (in particular, no stepsize tuning as in many first-order optimization methods).

  • When run as a centralized allocation mechanism, PACE only needs to maintain O(n)O(n) scalars, namely, βit\beta^{t}_{i}, BiB_{i} and u¯it\bar{u}^{t}_{i} for all ii. At time tt, it observes buyers’ valuations vi(θt)v_{i}(\theta_{t}) of the item θt\theta_{t}, compute bids βitvi(θt)\beta^{t}_{i}v_{i}(\theta_{t}), finds the winner iti_{t}, set the price as the maximal bid βittvit(θt)\beta^{t}_{i_{t}}v_{i_{t}}(\theta_{t}) and allocates the item to the winner; finally, it updates u¯t\bar{u}^{t} and βt+1\beta^{t+1} as in f, which takes O(n)O(n) time.

  • PACE can also be run among the buyers in a decentralized manner, in which case each buyer only maintains two scalar values: the pacing multiplier βit\beta^{t}_{i} and time-averaged utility u¯it\bar{u}^{t}_{i}. When a new item arrives, each buyer only performs a few simple arithmetic operations to create a bid βitvi(θt)\beta^{t}_{i}v_{i}(\theta_{t}), receives her utility (if she wins) and subsequently updates u¯it\bar{u}^{t}_{i} and βit+1\beta^{t+1}_{i}.

These make PACE suitable for Internet-scale online fair division and online Fisher market applications. In particular, it is very reminiscent of how Internet advertising auctions are run. There, a similar auction-based system is used, with the pacing multiplier ensuring that each advertiser smooths out their budget expenditure across the many auctions. The primary difference between this and our setting is that (i) the auction can be first-price or second-price and (ii) buyers usually have quasilinear utilities, that is, utility of the item minus the expenditure (price paid) [18, 8, 6, 7]. In §C, we extend PACE to quasilinear utilities, which provides a novel online algorithm for first-price pacing equilibrium computation [19].

4 Dual Averaging

In this section, we briefly recap the setup and general convergence results of dual averaging [48, 35], which will be used in the analysis of PACE. First, we introduce some notation for this and subsequent sections. Let 𝐞(i)\mathbf{e}^{(i)} denote the ii’th unit basis vector in n\mathbb{R}^{n} and 𝟏n\mathbf{1}\in\mathbb{R}^{n} denote the vector of 11’s. For x,ynx,y\in\mathbb{R}^{n}, [x,y][x,y] denotes the Cartesian product of intervals i=1n[xk,yk]n\prod_{i=1}^{n}[x_{k},y_{k}]\subseteq\mathbb{R}^{n}. All norms \|\cdot\| without a subscript are Euclidean 22-norms, unless otherwise stated.

Let Ψ\Psi be a closed convex function with domain domΨ:={wn:Ψ(w)<}{\rm dom}\,\Psi:=\{w\in\mathbb{R}^{n}:\Psi(w)<\infty\}. Let ZdZ\subseteq\mathbb{R}^{d} be an arbitrary sample space. For each zZz\in Z, let fzf_{z} be a convex and subdifferentiable function on domΨ{\rm dom}\,\Psi. Considers the following regularized convex optimization problem [48, §1.1]:

minw𝐄fz(w)+Ψ(w),\displaystyle\min_{w}\mathbf{E}f_{z}(w)+\Psi(w), (2)

where the expectation is taken over a probability distribution 𝒟\mathcal{D} on ZZ. A more general online optimization setting, as described in [48, §1.2] is as follows. At each time t=1,2,3,t=1,2,3,\dots, we must choose an action wtw^{t} before a new, unknown convex loss function ftf_{t} arrives, which incurs a loss ft(wt)f_{t}(w^{t}) (a special case is i.i.d. sampled functions, i.e., ft=fztf_{t}=f_{z_{t}}, where ztz_{t} are i.i.d. samples drawn from 𝒟\mathcal{D}). The goal is to minimize regret when comparing our sequence of actions w1,w2,w^{1},w^{2},\ldots to any fixed action ww. Here, the regret against ww is defined as

Rt(w):=τ=1t(fτ(wτ)+Ψ(wτ))τ=1t(fτ(w)+Ψ(w))R_{t}(w):=\sum_{\tau=1}^{t}\left(f_{\tau}(w^{\tau})+\Psi(w^{\tau})\right)-\sum_{\tau=1}^{t}(f_{\tau}(w)+\Psi(w))

and the overall (maximal) regret up to time tt is Rt=maxwRt(w)R_{t}=\max_{w}R_{t}(w). We assume access to an oracle that, given any ftf_{t} and wdomΨw\in{\rm dom}\,\Psi, returns a subgradient gtft(w)g^{t}\in\partial f_{t}(w). The dual averaging algorithm (DA) [48, Algorithm 1] is as follows. First, set w1domΨw_{1}\in{\rm dom}\,\Psi and g¯0=0\bar{g}^{0}=0. Then, for each t=1,2,t=1,2,\dots, DA performs the following steps:

  1. (1)

    Observe ftf_{t} and compute gtft(wt)g^{t}\in\partial f_{t}(w^{t}).

  2. (2)

    Update the average subgradient (the dual average) via g¯t=t1tg¯t1+1tg¯t\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}\bar{g}^{t}.

  3. (3)

    Compute the next iterate wt+1=argminw{g¯t,w+Ψ(w)}w^{t+1}=\operatorname*{arg\,min}_{w}\{\langle\bar{g}^{t},w\rangle+\Psi(w)\}.

Here, we do not employ any auxiliary regularizing function, since our problem has a natural source of strong convexity (i.e., a strongly convex Ψ\Psi) through the Bilogβi-B_{i}\log\beta_{i} terms in (𝒟EG\mathcal{D}_{\rm EG}). The following convergence guarantee on DA is proved as part of the proof of Corollary 4 in [48].

Theorem 2.

Dual averaging generates iterates wtw^{t} such that

𝐄wtw2(6+logt)G2tσ2,\mathbf{E}\|w^{t}-w^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where G2G^{2} is an upper bound on 𝐄gt2\mathbf{E}\|g^{t}\|^{2}, t=1,2,t=1,2,\dots and σ\sigma is the strong convexity modulus of Ψ\Psi.

When solving the stochastic optimization problem (2), in Theorem 2, we can set G2G^{2} to be an upper bound on supwdomΨ𝐄gz(w)2\sup_{w\in{\rm dom}\,\Psi}\mathbf{E}\|g_{z}(w)\|^{2}, where gz(w)g_{z}(w) is a subgradient oracle mapping each (z,w)Z×domΨ(z,w)\in Z\times{\rm dom}\,\Psi to a subgradient and the expectation is over z𝒟z\sim\mathcal{D} and possible randomness of the subgradient oracle. We will shortly see that a reformulation of 𝒟EG\mathcal{D}_{\rm EG}, when cast into the form (2), exhibits stochastic subgradients that are exactly buyers’ received utilities in each time step. Using Theorem 2, we can show that the sequence of pacing multipliers βt\beta^{t} generated by PACE converges to the underlying (equilibrium) utility prices β\beta^{*} of the static Fisher market.

5 Convergence analysis of the PACE dynamics

We will now show that PACE correspond to running DA on the vector βt\beta^{t} of pacing multipliers for the buyers. To this end, we first reformulate (𝒟EG\mathcal{D}_{\rm EG}) into a (finite-dimensional) convex program in β\beta in the form of (2):

minβ(maxiβivi,siBilogβi)s.t.β[B/(1+δ0),(1+δ0)𝟏],\displaystyle\min_{\beta}\ \left(\langle\max_{i}\beta_{i}v_{i},s\rangle-\sum_{i}B_{i}\log\beta_{i}\right)\ {\rm s.t.}\ \beta\in[B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}], (3)

where δ0>0\delta_{0}>0 is an arbitrarily small constant. The bounds on β\beta do not change the optimal solution, because βi(Bi,1)\beta^{*}_{i}\in(B_{i},1) for each ii. Detailed steps of the reformulation are given in Appendix B.

In order to run DA, we need to compute a subgradient of fθ:βmaxiβivi(θ)f_{\theta}:\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta) at any θΘ\theta\in\Theta. Following [24, §5], since fθf_{\theta} is a piecewise linear function, a subgradient is

gθ(β):=vi(θ)𝐞(i)fθ(β),g_{\theta}(\beta):=v_{i^{*}}(\theta)\mathbf{e}^{(i^{*})}\in\partial f_{\theta}(\beta),

where i=minargmaxiβivi(θ)i^{*}=\min\operatorname*{arg\,max}_{i}\beta_{i}v_{i}(\theta) is the winner (see, e.g., [10, Theorem 3.50]).

We can now show that the PACE dynamics corresponds to running DA on (3). First, choose an arbitrary β0(B/(1+δ0),(1+δ0)𝟏)\beta^{0}\in(B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}) and g¯0=0\bar{g}^{0}=0. At each time step t=1,2,t=1,2,\dots, given the current pacing multiplier βt\beta^{t}, DA applied to (3) unrolls the following steps.

  • An item θt\theta_{t} arrives, having value vi(θt)v_{i}(\theta_{t}) for each buyer ii. The function ftf_{t} in DA is

    fθt:βmaxiβivi(θt).f_{\theta_{t}}:\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta_{t}).
  • The winner is it=minargmaxiβitvi(θt)i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t}) and a subgradient is

    gt=vitjt𝐞(it)ft(βt).g^{t}=v_{i_{t}j_{t}}\mathbf{e}^{(i_{t})}\in\partial f_{t}(\beta^{t}).

    Its iith entry is exactly the realized (single-period) utility of individual ii at time tt in PACE, that is, git=vi(θt)𝕀{i=it}=uitg^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}=u^{t}_{i}.

  • Update the dual average (time-averaged utilities): for each ii, compute g¯t=t1tg¯t1+1tgt\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}g^{t}, i.e.,

    g¯it=t1tg¯it1+1tvi(θt)𝕀{i=it}.\bar{g}^{t}_{i}=\frac{t-1}{t}\bar{g}^{t-1}_{i}+\frac{1}{t}v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}.
  • Update the pacing multipliers:

    βt+1=argminβ[B/(1+δ0),(1+δ0)𝟏]{g¯t,βiBilogβi}.\beta^{t+1}=\operatorname*{arg\,min}_{\beta\in[B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}]}\left\{\langle\bar{g}^{t},\beta\rangle-\sum_{i}B_{i}\log\beta_{i}\right\}.

    The minimization problem is separable in each ii and exhibits a simple and explicit solution which recovers step f in PACE (where g¯it=u¯it\bar{g}^{t}_{i}=\bar{u}^{t}_{i}):

    βit+1=argminβi[B/(1+δ0),1+δ0]{g¯itβiBilogβi}βit+1=Π[Bi/(1+δ0),1+δ0](Biu¯it).\beta^{t+1}_{i}=\operatorname*{arg\,min}_{\beta_{i}\in[B/(1+\delta_{0}),1+\delta_{0}]}\left\{\bar{g}^{t}_{i}\beta_{i}-B_{i}\log\beta_{i}\right\}\ \Rightarrow\ \beta^{t+1}_{i}=\Pi_{[B_{i}/(1+\delta_{0}),1+\delta_{0}]}\left(\frac{B_{i}}{\bar{u}^{t}_{i}}\right).

As mentioned earlier, PACE does not require a stepsize parameter. This is because DA is stepsize-free given a strongly convex regularizer Ψ\Psi, which is indeed the case in our reformulation (3). In addition, in the above update step for βit+1\beta^{t+1}_{i}, the directions of change are as follows.

  • For a non-winner iiti\neq i_{t}, we have uit=0u^{t}_{i}=0 and hence u¯itu¯it1\bar{u}^{t}_{i}\leq\bar{u}^{t-1}_{i}. This implies βit+1βit\beta^{t+1}_{i}\geq\beta^{t}_{i}. In words, a non-winner’s pacing multiplier weakly increases. The increase is strict if u¯it1>0\bar{u}_{i}^{t-1}>0, i.e., buyer ii has already received a nonzero utility.

  • For the winner iti_{t}, u¯itt\bar{u}^{t}_{i_{t}} may become greater than g¯itt1\bar{g}^{t-1}_{i_{t}}, in which case βitt+1βitt\beta^{t+1}_{i_{t}}\leq\beta^{t}_{i_{t}}. In words, the winner’s pacing multiplier may go up or down.

In order to analyze PACE, we assume vi(Θ)=1v_{i}(\Theta)=1, viL+v_{i}\in L^{\infty}_{+} (normalized and a.e.-bounded valuations)444The a.e.-boundedness assumption is needed in subsequent convergence analysis. Since Θ\Theta has a finite measure, it holds that L+L+1L^{\infty}_{+}\subseteq L^{1}_{+}. For a finite item space Θ=[m]\Theta=[m], both are equal to +m\mathbb{R}^{m}_{+}. and that there is an underlying item distribution sL+s\in L^{\infty}_{+} from which the item arrivals θt\theta_{t}, t=1,2,t=1,2,\dots are drawn i.i.d.555The distributional assumption on item arrivals (i.e., they are drawn i.i.d. from an unknown distribution ss) is needed to establish asymptotic equilibrium properties of PACE. See Appendix B for an example that any algorithm can yield arbitrarily suboptimal allocations without such a distributional assumption. Define the underlying static Fisher market as one having the same nn buyers (each with valuation viv_{i} and budget BiB_{i}) and item supplies ss. Denote the equilibrium utilities and utility prices w.r.t. the underlying static market as uu^{*} and β\beta^{*}, respectively. We further assume that the valuations are viL+v_{i}\in L^{\infty}_{+} (i.e., a.e.-bounded on the item space). This is not restrictive: since an individual item θ\theta has value vi(θ)v_{i}(\theta) for each buyer ii, it should be a finite value.

Convergence of pacing multipliers.

After aligning PACE with DA, the convergence of the pacing multipliers βt\beta^{t} follows directly from Theorem 2.

Theorem 3.

PACE generates pacing multipliers βt\beta^{t}, t=1,2,t=1,2,\dots such that

𝐄βtβ2(6+logt)G2tσ2,\mathbf{E}\|\beta^{t}-\beta^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where G2=maxi𝐄θs[vi(θ)2]maxivi2G^{2}=\max_{i}\mathbf{E}_{\theta\sim s}[v_{i}(\theta)^{2}]\leq\max_{i}\|v_{i}\|_{\infty}^{2}, σ=miniBi(1+δ0)2\sigma=\frac{\min_{i}B_{i}}{(1+\delta_{0})^{2}}.

In other words, we have mean-square convergence of βt\beta^{t} to β\beta^{*} at a O((logt)/t)O((\log t)/t) rate. Since B1=1\|B\|_{1}=1, we have miniBi1/n\min_{i}B_{i}\leq 1/n. Hence, σ=O(1/n)\sigma=O(1/n) and the constant in the bound is Ω(n2)\Omega(n^{2}). Whether such dependence on nn can be improved via new analysis remains an interesting research question.

Convergence of utilities.

We next show that the time-averaged utility u¯t\bar{u}^{t} (which is equal to the dual average g¯t\bar{g}^{t}) converges to the equilibrium utility vector uu^{*} of the underlying Fisher market. A key step in the proof is to bound the probability of a projection in updating βit+1\beta^{t+1}_{i}, that is, 𝐏[Bi/u¯it[li,ui]]\mathbf{P}[B_{i}/\bar{u}^{t}_{i}\notin[l_{i},u_{i}]].

Theorem 4.

For each ii, let ϵi:=min{hiβi,βili}>0\epsilon_{i}:=\min\{h_{i}-\beta^{*}_{i},\beta^{*}_{i}-l_{i}\}>0 be the minimum distance to the endpoints of the pacing-multiplier interval and v:=maxivi\|v\|_{\infty}:=\max_{i}\|v_{i}\|_{\infty}. It holds that

𝐄(u¯itui)2(vi2ϵi2+(1+δ0Bi)2)𝐄(βit+1βi)2.\mathbf{E}(\bar{u}^{t}_{i}-u^{*}_{i})^{2}\leq\left(\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Hence, letting C=1(miniBi)2((v/δ0)2+(1+δ0)2)C=\frac{1}{(\min_{i}B_{i})^{2}}\left((\|v\|_{\infty}/\delta_{0})^{2}+(1+\delta_{0})^{2}\right), we have

𝐄u¯tu2C(6+log(t+1))G2(t+1)σ2.\mathbf{E}\|\bar{u}^{t}-u^{*}\|^{2}\leq C\cdot\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}.

Note that C=Ω(n2)C=\Omega(n^{2}). Hence, in this and the next theorems, the constant in the bound is Ω(n4)\Omega(n^{4}), which arises from CC and σ=O(1/n)\sigma=O(1/n).

Convergence of expenditures.

The expenditure of buyer ii at time step tt is

bit=βitvi(θt)𝕀{i=iτ}.b^{t}_{i}=\beta^{t}_{i}v_{i}(\theta_{t})\mathbb{I}\{i=i_{\tau}\}.

In other words, only the winner iti_{t} spends a nonzero amount, which is its bid. Let b¯it=1tτ=1tbiτ\bar{b}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}b^{\tau}_{i} be buyer ii’s average expenditure. Utilizing the above convergence results, we show mean-squared convergence of b¯t\bar{b}^{t} to BB at an O((logt)2/t)O((\log t)^{2}/t) rate.

Theorem 5.

For each ii, it holds that

𝐄(b¯itBi)2[(βi)2𝐄(g¯itui)2+2vi21tτ=1t𝐄(βiτβi)2].\mathbf{E}(\bar{b}^{t}_{i}-B_{i})\leq 2\left[(\beta^{*}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+2\|v_{i}\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right].

For t3t\geq 3 and the constant CC defined in Theorem 4, we have

𝐄b¯tB22G2tσ2(6(C+v2)+(C+6v2)logt+v22(logt)2).\mathbf{E}\|\bar{b}^{t}-B\|^{2}\leq\frac{2G^{2}}{t\sigma^{2}}\left(6(C+\|v\|_{\infty}^{2})+(C+6\|v\|_{\infty}^{2})\log t+\frac{\|v\|_{\infty}^{2}}{2}(\log t)^{2}\right).

PACE attains OME asymptotically.

Next, we show that PACE attains OME asymptotically, i.e., it generates allocations and prices that make buyers no-regret and envy-free in the limit (these notions will be clarified shortly). Let xit:=𝕀{i=it}x^{t}_{i}:=\mathbb{I}\{i=i_{t}\} denote whether buyer iti_{t} is the winner (i.e., whether she is allocated the item θt\theta_{t} at time tt) Utilizing Theorems 4 and 5, we can show that buyer ii’s regret, that is, the difference between the maximum possible utility in hindsight U^it\hat{U}^{t}_{i} (Definition 3) and the realized utility u¯it\bar{u}^{t}_{i}, vanishes as tt grows. The same holds for each buyer’s envy. In other words, at a large tt, in hindsight, no buyer prefers another buyer’s set of allocated items (up to a vanishing error).666In a static market, given an allocation x+n×mx\in\mathbb{R}^{n\times m}_{+}, the (maximum, budget-weighted) envy of buyer ii toward others’ bundles is ρi(x)=maxkvi,xk/Bkvi,xi/Bi\rho_{i}(x)=\max_{k}\langle v_{i},x_{k}\rangle/B_{k}-\langle v_{i},x_{i}\rangle/B_{i} (see, e.g., [46, 12]). It is well-known that ρi(x)=0\rho_{i}(x^{*})=0 for all ii at equilibrium, a consequence of buyer optimality (Definition 2).

Theorem 6.

Denote

ξit=|u¯itui|,Δit=|b¯itBi|,γt=vtτ=1tβτβ.\xi^{t}_{i}=|\bar{u}^{t}_{i}-u^{*}_{i}|,\ \Delta^{t}_{i}=|\bar{b}^{t}_{i}-B_{i}|,\ \gamma_{t}=\frac{\|v\|_{\infty}}{t}\sum_{\tau=1}^{t}\|\beta^{\tau}-\beta^{*}\|_{\infty}.

Let rit:=max{U^itu¯it,0}r^{t}_{i}:=\max\{\hat{U}^{t}_{i}-\bar{u}^{t}_{i},0\} be the regret of buyer ii at time tt. Then, it holds that

ritξit+γt/Bi,𝐄(rit)2=O((logt)2/t).r^{t}_{i}\leq\xi^{t}_{i}+\gamma_{t}/B_{i},\ \mathbf{E}(r^{t}_{i})^{2}=O\left((\log t)^{2}/t\right).

Furthermore, let the envy of buyer ii (w.r.t. all other buyers) at time tt be

ρit=maxku¯ikt/Bku¯it/Bi,\rho^{t}_{i}=\max_{k}\bar{u}^{t}_{ik}/B_{k}-\bar{u}^{t}_{i}/B_{i},

where u¯ikt=1tτ=1tvi(θτ)xkτ\bar{u}^{t}_{ik}=\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})x^{\tau}_{k} is buyer ii’s time-averaged utility given her own valuations and of buyer kk’s allocations. Denote

ηit=1tτ=1t(p(θt)βiτvi(θt))xit.\eta^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{t})-\beta^{\tau}_{i}v_{i}(\theta_{t}))x^{t}_{i}.

It holds that

ρit1Bi(ξit+maxkiΔkt+ηktBk)and𝐄(ηit)2v2G2tσ2(6(1+logt)+(logt)22).\displaystyle\rho^{t}_{i}\leq\frac{1}{B_{i}}\left(\xi^{t}_{i}+\max_{k\neq i}\frac{\Delta^{t}_{k}+\eta^{t}_{k}}{B_{k}}\right)\ \ \text{and}\ \ \mathbf{E}(\eta^{t}_{i})^{2}\leq\frac{\|v\|_{\infty}^{2}G^{2}}{t\sigma^{2}}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right).

Hence, 𝐄(ρit)2=O((logt)2/t)\mathbf{E}(\rho^{t}_{i})^{2}=O\left((\log t)^{2}/t\right).

In light of Definition 3, Theorem 6 shows that (xiτ)(i,τ)[n]×[t](x^{\tau}_{i})_{(i,\tau)\in[n]\times[t]} is approximately optimal for buyer ii. Since PACE also clears the market, we conclude that it attains OME asymptotically. Recall that theorem 4 ensures that buyers’ u¯it\bar{u}^{t}_{i} converge to their static equilibrium utilities uiu^{*}_{i}. Since the latter satisfy Pareto optimality and proportional share guarantee (uiBiu^{*}_{i}\geq B_{i} for all ii), so are the time-averaged realized utilities in the limit. Together with Theorem 6, we conclude that PACE achieves the said fairness and efficiency guarantees, namely, Pareto optimality, envy-freeness and proportional-share guarantee, asymptotically.

6 Experiments

We evaluate the PACE dynamics in several real and synthetic datasets, namely, MovieLens, Household Items and an infinite-dimensional market instance with item space Θ=[0,1]\Theta=[0,1] and viv_{i} being linear functions on [0,1][0,1]. For the first two datasets, see [31] for more information and exploratory data analysis. For all datasets, we consider the CEEI (fair division) setting where Bi=1/nB_{i}=1/n for all ii. For each dataset (with number of buyers n=1500,2876,100n=1500,2876,100, respectively), we run PACE for T=10nT=10n time steps (iterations). More details on the experiments and additional plots displaying convergence of expenditures can be found in Appendix D. Figure 1 displays the mean values of the average and maximum relative errors of the pacing multipliers and time-averaged cumulative utilities over 1010 repeated experiments with different seeds (relative errors of cumulative spending w.r.t. total budgets are plotted separately in Appendix D). The standard errors are also displayed as vertical bars but are very small and nearly invisible. Vertical dotted lines indicate t=10nt=10n The figures do not show the initial iterates t=1,,5nt=1,\dots,5n.

We see that PACE converges very quickly numerically: within 1010 epochs (10n10n time steps) average deviations in most quantities falls within 5%5\% of the equilibrium quantity, with the worst case not far behind. An important point is that budget spend takes much longer to converge than utility. This demonstrates an important practical difference for using PACE in an allocation scenario where budgets are ‘real money’ (e.g. Internet ad impressions) as compared to a CEEI-like setting, where budgets are faux currency only used for fair division.

7 Conclusion

We introduced the concept of an online Fisher market and proposed the PACE dynamics. We showed that when items arrive sequentially and stochastically, PACE converges to equilibrium outcomes of the underlying market model. Furthermore, we showed that, as a consequence of this, PACE can be used in online fair division problems to generate an online allocation that, asymptotically, achieves the compelling fairness properties of CEEI.

Many questions remain for future research. We mostly focused on the case where budgets are faux currency and there are many open questions for adapting PACE to a real-money budget-management setting as well as more complicated nonlinear utility models. Another imperative question, especially for practitioners, is whether PACE guarantees some level of incentive-compatibility.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 1: In all of our markets, iterates of the PACE dynamics quickly converges to their static equilibrium values both in the average case and the worst-off-buyer case. The horizontal line shows the fraction of uu^{*} achieved by the proportional share solution. The PACE utilities quickly outperform the proportional share utilities. Vertical lines indicate when tt is a multiple of 10n10n.

References

  • Aleksandrov and Walsh [2020] M. Aleksandrov and T. Walsh. Online fair division: A survey. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 13557–13562, 2020.
  • Aleksandrov et al. [2015] M. Aleksandrov, H. Aziz, S. Gaspers, and T. Walsh. Online fair division: Analysing a food bank problem. arXiv preprint arXiv:1502.07571, 2015.
  • Arnsperger [1994] C. Arnsperger. Envy-freeness and distributive justice. Journal of Economic Surveys, 8(2):155–186, 1994.
  • Azar et al. [2016] Y. Azar, N. Buchbinder, and K. Jain. How to allocate goods in an online market? Algorithmica, 74(2):589–601, 2016.
  • Aziz et al. [2015] H. Aziz, S. Gaspers, S. Mackenzie, and T. Walsh. Fair assignment of indivisible objects under ordinal preferences. Artificial Intelligence, 227:71–92, 2015.
  • Balseiro et al. [2017] S. Balseiro, A. Kim, M. Mahdian, and V. Mirrokni. Budget management strategies in repeated auctions. In Proceedings of the 26th International Conference on World Wide Web, pages 15–23, 2017.
  • Balseiro and Gur [2019] S. R. Balseiro and Y. Gur. Learning in repeated auctions with budgets: Regret minimization and equilibrium. Management Science, 65(9):3952–3968, 2019.
  • Balseiro et al. [2015] S. R. Balseiro, O. Besbes, and G. Y. Weintraub. Repeated auctions with budgets in ad exchanges: Approximations and design. Management Science, 61(4):864–884, 2015.
  • Bateni et al. [2018] M. H. Bateni, Y. Chen, D. Ciocan, and V. Mirrokni. Fair resource allocation in a volatile marketplace. Available at SSRN 2789380, 2018.
  • Beck [2017] A. Beck. First-order methods in optimization, volume 25. SIAM, 2017.
  • Birnbaum et al. [2011] B. Birnbaum, N. R. Devanur, and L. Xiao. Distributed algorithms via gradient descent for fisher markets. In Proceedings of the 12th ACM conference on Electronic commerce, pages 127–136. ACM, 2011.
  • Budish [2011] E. Budish. The combinatorial assignment problem: Approximate competitive equilibrium from equal incomes. Journal of Political Economy, 119(6):1061–1103, 2011.
  • Caragiannis et al. [2016] I. Caragiannis, D. Kurokawa, H. Moulin, A. D. Procaccia, N. Shah, and J. Wang. The unreasonable fairness of maximum Nash welfare. In Proceedings of the 2016 ACM Conference on Economics and Computation, pages 305–322. ACM, 2016.
  • Chen and Teng [2009] X. Chen and S.-H. Teng. Spending is not easier than trading: on the computational equivalence of fisher and arrow-debreu equilibria. In International Symposium on Algorithms and Computation, pages 647–656. Springer, 2009.
  • Cheung et al. [2019] Y. K. Cheung, M. Hoefer, and P. Nakhe. Tracing equilibrium in dynamic markets via distributed adaptation. In Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pages 1225–1233, 2019.
  • Cole and Gkatzelis [2018] R. Cole and V. Gkatzelis. Approximating the Nash social welfare with indivisible items. SIAM Journal on Computing, 47(3):1211–1236, 2018.
  • Cole et al. [2017] R. Cole, N. R. Devanur, V. Gkatzelis, K. Jain, T. Mai, V. V. Vazirani, and S. Yazdanbod. Convex program duality, fisher markets, and Nash social welfare. In 18th ACM Conference on Economics and Computation, EC 2017. Association for Computing Machinery, Inc, 2017.
  • Conitzer et al. [2018] V. Conitzer, C. Kroer, E. Sodomka, and N. E. Stier-Moses. Multiplicative pacing equilibria in auction markets. In International Conference on Web and Internet Economics, 2018.
  • Conitzer et al. [2019] V. Conitzer, C. Kroer, D. Panigrahi, O. Schrijvers, E. Sodomka, N. E. Stier-Moses, and C. Wilkens. Pacing equilibrium in first-price auction markets. In Proceedings of the 2019 ACM Conference on Economics and Computation. ACM, 2019.
  • Daskalakis et al. [2009] C. Daskalakis, P. W. Goldberg, and C. H. Papadimitriou. The complexity of computing a Nash equilibrium. SIAM Journal on Computing, 39(1):195–259, 2009.
  • Dvoretzky et al. [1951] A. Dvoretzky, A. Wald, J. Wolfowitz, et al. Relations among certain ranges of vector measures. Pacific Journal of Mathematics, 1(1):59–74, 1951.
  • Eisenberg and Gale [1959] E. Eisenberg and D. Gale. Consensus of subjective probabilities: The pari-mutuel method. The Annals of Mathematical Statistics, 30(1):165–168, 1959.
  • Gao and Kroer [2020a] Y. Gao and C. Kroer. First-order methods for large-scale market equilibrium computation. In Neural Information Processing Systems 2020, NeurIPS 2020, 2020a.
  • Gao and Kroer [2020b] Y. Gao and C. Kroer. Infinite-dimensional fisher markets and tractable fair division. arXiv preprint arXiv:2010.03025, 2020b.
  • Ghodsi et al. [2011] A. Ghodsi, M. Zaharia, B. Hindman, A. Konwinski, S. Shenker, and I. Stoica. Dominant resource fairness: Fair allocation of multiple resource types. In Nsdi, volume 11, pages 24–24, 2011.
  • Goldberg et al. [2001] K. Goldberg, T. Roeder, D. Gupta, and C. Perkins. Eigentaste: A constant time collaborative filtering algorithm. information retrieval, 4(2):133–151, 2001.
  • Harper and Konstan [2015] F. M. Harper and J. A. Konstan. The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis), 5(4):1–19, 2015.
  • Kantorovich [1975] L. Kantorovich. Mathematics in economics: achievements, difficulties, perspectives. Technical report, Nobel Prize Committee, 1975.
  • Kash et al. [2014] I. Kash, A. D. Procaccia, and N. Shah. No agent left behind: Dynamic fair division of multiple resources. Journal of Artificial Intelligence Research, 51:579–603, 2014.
  • Kroer and Peysakhovich [2019] C. Kroer and A. Peysakhovich. Scalable fair division for ’at most one’ preferences. arXiv preprint arXiv:1909.10925, 2019.
  • Kroer et al. [2019] C. Kroer, A. Peysakhovich, E. Sodomka, and N. E. Stier-Moses. Computing large market equilibria using abstractions. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 745–746, 2019.
  • McElfresh et al. [2020] D. C. McElfresh, C. Kroer, S. Pupyrev, E. Sodomka, K. A. Sankararaman, Z. Chauvin, N. Dexter, and J. P. Dickerson. Matching algorithms for blood donation. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 463–464, 2020.
  • Murray et al. [2020a] R. Murray, C. Kroer, A. Peysakhovich, and P. Shah. Robust market equilibria with uncertain preferences. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 2192–2199, 2020a.
  • Murray et al. [2020b] R. Murray, C. Kroer, A. Peysakhovich, and P. Shah. https://research.fb.com/blog/2020/09/robust-market-equilibria-how-to-model-uncertain-buyer-preferences/, Sep 2020b.
  • Nesterov [2009] Y. Nesterov. Primal-dual subgradient methods for convex problems. Mathematical programming, 120(1):221–259, 2009.
  • Nesterov and Shikhman [2018] Y. Nesterov and V. Shikhman. Computation of fisher–gale equilibrium by auction. Journal of the Operations Research Society of China, 6(3):349–389, 2018.
  • Nisan et al. [2007] N. Nisan, T. Roughgarden, E. Tardos, and V. V. Vazirani. Algorithmic game theory. Cambridge University Press, 2007.
  • Othman et al. [2010] A. Othman, T. Sandholm, and E. Budish. Finding approximate competitive equilibria: efficient and fair course allocation. In AAMAS, volume 10, pages 873–880, 2010.
  • Othman et al. [2016] A. Othman, C. Papadimitriou, and A. Rubinstein. The complexity of fairness through equilibrium. ACM Transactions on Economics and Computation (TEAC), 4(4):1–19, 2016.
  • Parkes et al. [2015] D. C. Parkes, A. D. Procaccia, and N. Shah. Beyond dominant resource fairness: Extensions, limitations, and indivisibilities. ACM Transactions on Economics and Computation (TEAC), 3(1):1–22, 2015.
  • Peysakhovich and Kroer [2019] A. Peysakhovich and C. Kroer. Fair division without disparate impact. Mechanism Design for Social Good, 2019.
  • Plaut and Roughgarden [2020] B. Plaut and T. Roughgarden. Almost envy-freeness with general valuations. SIAM Journal on Discrete Mathematics, 34(2):1039–1068, 2020.
  • Scarf et al. [1967] H. Scarf et al. On the computation of equilibrium prices. Number 232. Cowles Foundation for Research in Economics at Yale University New Haven, CT, 1967.
  • Shmyrev [2009] V. I. Shmyrev. An algorithm for finding equilibrium in the linear exchange model with fixed budgets. Journal of Applied and Industrial Mathematics, 3(4):505, 2009.
  • Sinclair et al. [2021] S. R. Sinclair, S. Banerjee, and C. L. Yu. Sequential fair allocation: Achieving the optimal envy-efficiency tradeoff curve. arXiv preprint arXiv:2105.05308, 2021.
  • Varian et al. [1974] H. R. Varian et al. Equity, envy, and efficiency. Journal of Economic Theory, 9(1):63–91, 1974.
  • Vazirani and Yannakakis [2011] V. V. Vazirani and M. Yannakakis. Market equilibrium under separable, piecewise-linear, concave utilities. Journal of the ACM (JACM), 58(3):1–25, 2011.
  • Xiao [2010] L. Xiao. Dual averaging methods for regularized stochastic learning and online optimization. Journal of Machine Learning Research, 11:2543–2596, 2010.
  • Zhang [2011] L. Zhang. Proportional response dynamics in the fisher market. Theoretical Computer Science, 412(24):2691–2698, 2011.

Appendix A Related Work

The problem of market equilibrium computation has been of interest in economics for a long time (see, e.g., Nisan et al. [37]). There is a large literature focusing on computation of equilibrium in the specific case of (finite-dimensional) Fisher markets through various convex optimization formulations [22, 44, 17, 31] and gradient-based methods [11, 36, 23]. Other works extend these results to settings such as quasilinear utilities, capped utilities, indivisible items, or imperfectly specified utility functions [17, 16, 13, 33, 30, 41]. One of the most well-known algorithm for computing static market equilibria under Constant Elasticity of Substitution (CES) utilities is the Proportional Response (PR) dynamics Zhang [49], Birnbaum et al. [11]. Recently, [24] extends the classical Fisher market model to a measurable (possibly continuous) item space and shows that infinite-dimensional EG-type convex programs capture ME under this setting. Our work extends these ideas to a Fisher market-like scenario where items arrive sequentially.

The Fisher market literature above focuses on divisible items or randomized allocations of indivisible items. There is also a large literature on fair allocation of indivisible items (e.g. Aziz et al. [5], Caragiannis et al. [13], Plaut and Roughgarden [42]) including approximate ME-based methods [12, 39]. We note that all allocations in our setting are discrete and the relationship to Fisher markets happens in the time-average sense.

Perhaps most similar to our setting is that of [4], who study how to allocate allocate items in an online fashion in order to obtain a market-equilibrium-like allocation. However, they consider competitive ratios, and give a primal-dual algorithm that suffers at most a logarithmic loss compared to the best hindsight optimal solution, even for worst-case arrivals. In addition to the lack of asymptotic convergence, they also only show guarantees on various (arithmetic, geometric, harmonic) averages of the utilities. In contrast to this, our work considers stochastic arrivals, and gives an adaptive algorithm for asymptotically achieving all the desireable market equilibrium properties (e.g. no envy, Pareto optimality, equilibrium utilities). Another important difference is that our approach is easily implemented as a distributed dynamics that requires only a first-price auction allocation mechanism with indivisible allocations, in O(n)O(n) time per item arrival (nn being the number of buyers). At each time step, the algorithm only uses buyers’ valuations of the current arriving item. This makes our approach suitable for implementation in large-scale systems with a huge, possibly infinite item space.

Other methods for online fair division have been studied by various authors. In this literature, there are various notions of “online:” either buyers, items, or both can arrive online. Here we survey only related work where items arrive online. [2] studies a simple mechanism where agents can declare if they like an item, and then a coin is flipped to determine which of the agents that liked the item will get it. [29] studies online allocation for Leontief utilities (where each agent wants a bundle with items of fixed proportions) and shows how to achieve various properties for this setting. [15] studies an evolving market environment and shows that the PR dynamics generates iterates that are close to the (changing) equilibrium. Similar to the classical PR dynamics, in each time step, all buyers’ valuations of all items are known to the algorithm. In contrast, our work allows an unknown underlying market from which items are sampled: in each time step, PACE only uses buyers’ valuations of the current arriving item. [9] studies an online fair allocation problem and proposes a stochastic approximation scheme, which relies on frequently resolving the EG convex program, that ensures a constant approximation ratio (in terms of a proportional fairness measure) relative to the offline fair allocation. Very recently, [45] studies the problem of online fair division in which there is a fixed (finite) number of divisible items and sequentially arriving agents. The authors show that envy-freeness and (Pareto) efficiency cannot be minimized simultaneously; instead, there exists a boundary such that any algorithm can only possibly achieve the envy-efficiency combinations on one side of it, while they also propose such an algorithm. See also [1] for a survey of further works in this area.

The idea of pacing has been studied in the context of budget management in second-price auctions. The work most related to our work is [7], which studies an online version of that setting. [7] and our work are very different in terms of problem settings, results, and analysis. Here, we point out some key differences. [7] first show that, in the typical setting of a single bidder interacting with second-price auctions where values and prices are drawn from a stochastic environment, an adaptive pacing strategy achieves O(1/T)O(1/\sqrt{T}) regret and ergodic convergence of the bidder’s pacing multipliers. Then, under additional assumptions on monotonicity of the bidders’ expected expenditures, the authors establish game-theoretic equilibrium properties when all bidders use the same strategy, i.e., under the “simultaneous learning” setting. In contrast, we focus on market equilibrium properties such as fairness and efficiency. Furthermore, in terms of technical assumptions and convergence results, our PACE dynamics is stepsize-free, both in theory and numerically, while the adaptive pacing algorithm in [7] requires careful stepsizing rules; in our setting, we have last-iterate convergence of pacing multipliers (Theorem 7), whereas [7] only establishes ergodic convergence without a rate. These differences are, fundamentally, due to the use of different first-order optimization methods, and the fact that we leverage strong convexity of the EG dual. The analysis in [7] builds upon the convergence properties of mirror descent and stochastic approximation. Hence, it requires pre-determined vanishing stepsizes (their Assumption 1). In the simultaneous learning setting, the stepsizes of all bidders furthermore need to be carefully selected in joint fashion (their Assumption 3). In contrast, PACE is completely stepsize free, thanks to structure of the reformulated dual EG convex program (3), on which dual averaging can be applied directly without any stepsize parameter or an auxiliary regularization term. This makes it vastly easier to apply PACE in practice.

Appendix B Proofs, Derivations and Examples

Proof of Theorem 1

In [24, §3], it is assumed that item supplies are uniform, i.e., s(θ)=1s(\theta)=1 for all θΘ\theta\in\Theta. As is well-known in the finite case (Θ=[m]\Theta=[m]), this assumption is w.l.o.g. when studying a static Fisher market. Here, we show that all results in [24, §3] can be easily generalized to the case of non-uniform supplies ss (Theorem 1). For any market instance MM with buyer valuations viL+1v_{i}\in L^{1}_{+}, budgets BiB_{i}, i[n]i\in[n] and item supplies sL+s\in L^{\infty}_{+} (all normalized as described in h in §2), consider another market instance M~\tilde{M} with supplies being the constant function taking 11 on Θ\Theta (denoted as 𝟏\mathbf{1}), valuations

v~i(θ)=vi(θ)s(θ)\tilde{v}_{i}(\theta)=v_{i}(\theta)s(\theta)

and the same budgets. First, note that v~iL+1\tilde{v}_{i}\in L^{1}_{+} since sL+s\in L^{\infty}_{+} and Θ\Theta has a finite measure (s=inf{M:|s|Ma.e.}\|s\|_{\infty}=\inf\{M:|s|\leq M\,\textnormal{a.e.}\}):

Θv~i(θ)𝑑θsΘvi(θ)𝑑θ=svi(Θ)=s.\int_{\Theta}\tilde{v}_{i}(\theta)d\theta\leq\|s\|_{\infty}\int_{\Theta}v_{i}(\theta)d\theta=\|s\|_{\infty}v_{i}(\Theta)=\|s\|_{\infty}.

Denote the set of feasible allocations of MM as FF, that is, the set of (xi)(x_{i}) such that xiL+x_{i}\in L^{\infty}_{+} for all ii and ixis\sum_{i}x_{i}\leq s. Similarly, denote the set of feasible allocations of M~\tilde{M} as F~\tilde{F}. For any xFx\in F, consider

x~i(θ)={xi(θ)/s(θ)ifθ>0,0o.w.\displaystyle\tilde{x}_{i}(\theta)=\begin{cases}x_{i}(\theta)/s(\theta)&{\rm if}\ \theta>0,\\ 0&{\rm o.w.}\end{cases} (4)

Since 0xis0\leq x_{i}\leq s (a.e.), we have 0x~i𝟏0\leq\tilde{x}_{i}\leq\mathbf{1} (which means x~iL+\tilde{x}_{i}\in L^{\infty}_{+}). Since ixis\sum_{i}x_{i}\leq s, we have

ix~i𝟏.\sum_{i}\tilde{x}_{i}\leq\mathbf{1}.

Therefore, the set of utilities attainable by allocations in FF is the same as the set of utilities attainable by F~\tilde{F}. In other words,

U={u+n:xF,vi,xi=ui,i[n]}=U~={u~+n:x~F~,v~i,x~i=u~i}.U=\left\{u\in\mathbb{R}^{n}_{+}:x\in F,\ \langle v_{i},x_{i}\rangle=u_{i},\,i\in[n]\right\}=\tilde{U}=\left\{\tilde{u}\in\mathbb{R}^{n}_{+}:\tilde{x}\in\tilde{F},\ \langle\tilde{v}_{i},\tilde{x}_{i}\rangle=\tilde{u}_{i}\right\}.

By [24, Lemma 1] and its proof there (here, we only need the compactness, not the existence of a pure allocation for any feasible utility vector; hence, invoking [21, Theorem 1] suffices), U~\tilde{U} is convex and compact and so is UU. Hence, the suprema of (𝒫EG\mathcal{P}_{\rm EG}) is attained. Completely analogous to the proof of [24, Theorem 2], we can show that both suprema of (𝒫EG\mathcal{P}_{\rm EG}) is attained. Furthermore, completely analogous to the proofs of Lemma 3 and Theorem 2 there, we can show that strong duality holds for (𝒫EG\mathcal{P}_{\rm EG}) and (𝒟EG\mathcal{D}_{\rm EG}). More specifically, for any xx feasible to (𝒫EG\mathcal{P}_{\rm EG}) and (p,β)(p,\beta) feasible to (𝒟EG\mathcal{D}_{\rm EG}), and any 0uivi,xi0\leq u_{i}\leq\langle v_{i},x_{i}\rangle (i.e., adding auxiliary variable uiu_{i} to (𝒫EG\mathcal{P}_{\rm EG})), it holds that

iBilogui\displaystyle\sum_{i}B_{i}\log u_{i} iBiloguiiβi(uivi,xi)p,ixis\displaystyle\leq\sum_{i}B_{i}\log u_{i}-\sum_{i}\beta_{i}(u_{i}-\langle v_{i},x_{i}\rangle)-\left\langle p,\sum_{i}x_{i}-s\right\rangle
=i(Biloguiβiui)ipβivi,xi+p,s\displaystyle=\sum_{i}(B_{i}\log u_{i}-\beta_{i}u_{i})-\sum_{i}\langle p-\beta_{i}v_{i},x_{i}\rangle+\langle p,s\rangle
i(BilogBiβiβiBiβi)ipβivi,xi+p,s\displaystyle\leq\sum_{i}(B_{i}\log\frac{B_{i}}{\beta_{i}}-\beta_{i}\cdot\frac{B_{i}}{\beta_{i}})-\sum_{i}\langle p-\beta_{i}v_{i},x_{i}\rangle+\langle p,s\rangle
i(BilogBiBi)+p,siBilogβi\displaystyle\leq\sum_{i}(B_{i}\log B_{i}-B_{i})+\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}
=p,siBilogβiC,\displaystyle=\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}-C,

where the constant C=iBi(1logBi)C=\sum_{i}B_{i}(1-\log B_{i}). In the above derivation, the first inequality uses βi0\beta_{i}\geq 0, uivi,xiu_{i}\leq\langle v_{i},x_{i}\rangle, ixis\sum_{i}x_{i}\leq s; the second inequality uses the fact that ui=Bi/βiu_{i}=B_{i}/\beta_{i} maximizes the function

uiBiloguiβiuiu_{i}\mapsto B_{i}\log u_{i}-\beta_{i}u_{i}

for any βi>0\beta_{i}>0 (i.e., substituting ui=Bi/βiu_{i}=B_{i}/\beta_{i} into the first line); the third inequality uses feasibility w.r.t. (𝒟EG\mathcal{D}_{\rm EG}), i.e., pβivip\geq\beta_{i}v_{i} for all ii. Hence, when all inequalities are tight at a pair of solutions xx^{*} and (p,β)(p^{*},\beta^{*}) feasible to (𝒫EG\mathcal{P}_{\rm EG}) and (𝒟EG\mathcal{D}_{\rm EG}), respectively (i.e., both optima are attained), the following KKT conditions must hold:

  • p,sixi=0\langle p^{*},s-\sum_{i}x_{i}^{*}\rangle=0 (via the first inequality above being tight).

  • u=Bi/βiu^{*}=B_{i}/\beta^{*}_{i} for all ii (via the second above).

  • pβivi,xi=0\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0 for for all ii (via the third above).

As the proof of [24, Theorem 2] shows, these conditions (together with feasibility w.r.t. the two convex programs) are necessary and sufficient for (x,p)(x^{*},p^{*}) being a ME.

An example: the distributional assumption on item arrivals

In the definitions of OFM and OME in §2, we do not impose any distributional assumption on the sequentially arriving items θt\theta_{t}. The PACE dynamics does not require any distributional assumption either. It is in the analysis of PACE in §5 that we assume that θt\theta_{t} are drawn i.i.d. from an (unknown) underlying distribution sL+s\in L^{\infty}_{+} (where s(Θ)=1s(\Theta)=1 since it is a distribution). We define an underlying static Fisher market with the same buyers and item supplies ss. Then, §5 essentially shows that PACE guarantees that various (time-averaged) quantities converge to their static equilibrium quantities in the underlying market.

To justify the necessity of such a distributional assumption on item arrivals, consider the following example in which items arrivals are chosen by an adaptive “adversary” whose goal is to make the buyers’ time-averaged utilities of any online algorithm deviate from the static equilibrium utilities (defined by the “hindsight market,” i.e., the n×Tn\times T market with items θ1,,θT\theta_{1},\dots,\theta_{T}) as much as possible. For simplicity (and w.l.o.g.), budgets, valuations and supplies are not normalized in this example.

  • There are nn buyers with equal budgets Bi=1B_{i}=1 and an item space of Θ=[n+1]\Theta=[n+1].

  • Let the valuation matrix be as follows, for some large M>nM>n:

    v=[1MMM00MM010MMM].v=\begin{bmatrix}1&M&M&M&0\\ \vdots&\vdots&\vdots&0&M\\ \vdots&M&0&\vdots&\vdots\\ 1&0&M&M&M\end{bmatrix}.

    In other words, for item 11, all buyers have valuation 11. For each item j=2,,n+1j=2,\dots,n+1, buyer (n+2j)(n+2-j) has valuation 0 and other buyers have valuation MM.

  • The item supplies are 11 for all j[n+1]j\in[n+1].

  • The number of time periods TT is large.

Given any fixed j0=2,,n+1j_{0}=2,\dots,n+1, the n×Tn\times T static market with T/2T/2 items of type 11 and T/2T/2 items of type j0j_{0} exhibits the following ME:

  • Buyer i0:=n+2ji_{0}:=n+2-j^{*} receives all T/2T/2 items of type 11 with a utility of ui0=T/2u^{*}_{i_{0}}=T/2 and βi0=2/T\beta^{*}_{i_{0}}=2/T.

  • Each buyer i[n]{i0}i\in[n]\setminus\{i_{0}\} receives T/(2(n1))T/(2(n-1)) items of type 22 (i.e., all items of type 22 are evenly distributed among them) with a utility ui=MT2(n1)u^{*}_{i}=\frac{MT}{2(n-1)} and βi=2(n1)MT\beta^{*}_{i}=\frac{2(n-1)}{MT}.

  • The price of item 11 (with T/2T/2 copies) is p1=max{2/T,2(n1)MT}=2Tp^{*}_{1}=\max\left\{2/T,\frac{2(n-1)}{MT}\right\}=\frac{2}{T}. The price of item j0j_{0} (with T/2T/2 copies) is pj0=max{βi00,2(n1)MTM}=2(n1)/Tp^{*}_{j_{0}}=\max\left\{\beta^{*}_{i_{0}}\cdot 0,\frac{2(n-1)}{MT}\cdot M\right\}=2(n-1)/T. To verify these are equilibrium prices, note the following:

    • Buyer i0i_{0} has vi0j0=0<vi01=1v_{i_{0}j_{0}}=0<v_{i_{0}1}=1. Hence, given pp^{*}, will strictly prefer type 11 items over type j0j_{0}. Her equilibrium allocation also consists of only type-11 items that cost exactly her budget: T22T=1\frac{T}{2}\cdot\frac{2}{T}=1.

    • Buyer ii0i\neq i_{0} prefers type j0j_{0} over type 11 since the former has a higher value-per-unit-price:

      Mpj0=MT2(n1)>T2=1p1.\frac{M}{p^{*}_{j_{0}}}=\frac{MT}{2(n-1)}>\frac{T}{2}=\frac{1}{p^{*}_{1}}.

      Her equilibrium allocation also consists of only type-j0j_{0} items that cost exactly her budget: T2(n1)2(n1)T=1\frac{T}{2(n-1)}\cdot\frac{2(n-1)}{T}=1.

Given any online algorithm, consider the following adaptive adversary.

  • In the first T/2T/2 time steps, every item arrival is of type 11, that is, θt=1\theta_{t}=1, t=1,,T/2t=1,\dots,T/2. The algorithm must irrevocably allocate them to the buyers.

  • Since every buyer has the same valuation 11 on type 11, there exists a buyer i0i_{0} that receives ui0T/(2n)u_{i_{0}}\leq T/(2n) up to T/2T/2.

  • Then, the adversary picks j0=n+2i0j_{0}=n+2-i_{0} and every item arrival in the remaining T/2T/2 periods i s of type j0j_{0} (which has value zero for i0i_{0}).

In this way, buyer i0i_{0} only receives a total utility of ui0u_{i_{0}} across all TT. As shown above, in the static (hindsight) n×Tn\times T market, she should have received an equilibrium utility of ui0=T/2u^{*}_{i_{0}}=T/2 (i.e., being allocated all type-11 items). Hence, the realized utility of buyer i0i_{0} is only 1/n1/n of her static equilibrium utility.

Reformulation of (𝒟EG\mathcal{D}_{\rm EG}) into (3)

The reformulation is mainly based on [24, §5], except that we now allow non-uniform supplies ss instead of s(θ)=1s(\theta)=1 for all θΘ\theta\in\Theta. Assuming uniform supplies is w.l.o.g. in the static Fisher market (via rescaling all viv_{i}) but is not so in OFM, since ss in an FOM represents the arbitrary, unknown underlying item distribution from which item arrivals are drawn.

In (𝒟EG\mathcal{D}_{\rm EG}), fixing a β>0\beta>0, setting

p=maxiβiviL1(Θ)+,p=\max_{i}\beta_{i}v_{i}\in L^{1}(\Theta)_{+},

i.e., the smallest L1L^{1} function greater than or equal to βivi\beta_{i}v_{i} for all ii, clearly minimizes the objective subject to the constraints. Hence, we can eliminate pp in this way and write (𝒟EG\mathcal{D}_{\rm EG}) as a finite-dimensional convex program in β\beta. Here, βivi,s\beta_{i}v_{i},s\rangle is convex in β\beta since βmaxiβivi(θ)\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta) is convex for any θΘ\theta\in\Theta. More specifically, for any β,γ+n\beta,\gamma\in\mathbb{R}^{n}_{+}, λ[0,1]\lambda\in[0,1], θΘ\theta\in\Theta, we have

maxi(λβi+(1λ)γi)vi(θ)λmaxiβivi(θ)+(1λ)maxiγivi(θ).\max_{i}\,(\lambda\beta_{i}+(1-\lambda)\gamma_{i})v_{i}(\theta)\leq\lambda\max_{i}\beta_{i}v_{i}(\theta)+(1-\lambda)\max_{i}\gamma_{i}v_{i}(\theta).

Hence,

maxi(λβ+(1λ)γ)vi,sλmaxiβivi,s+(1λ)maxiγivi,s.\langle\max_{i}\,(\lambda\beta+(1-\lambda)\gamma)v_{i},s\rangle\leq\lambda\langle\max_{i}\beta_{i}v_{i},s\rangle+(1-\lambda)\langle\max_{i}\gamma_{i}v_{i},s\rangle.

Due to the strong convexity assumption in Theorem 2, we would need the function

βiBilogβi\beta\mapsto\sum_{i}B_{i}\log\beta_{i}

to be strongly convex on its domain. However, it is only strictly but not strongly convex on ++n\mathbb{R}_{++}^{n}. To resolve this, we use the following lemma. It is similar to [24, Lemma 4] except we allow non-uniform supplies.

Lemma 1.

Assume the normalizations in h in §2. Then, the equilibrium utilities satisfy Biui1B_{i}\leq u^{*}_{i}\leq 1 and hence Biβi=Bi/ui1B_{i}\leq\beta^{*}_{i}=B_{i}/u^{*}_{i}\leq 1.

Proof.

Since any buyer can get at most the entire set of items (given by the supply ss),

uivi,s=1,u^{*}_{i}\leq\langle v_{i},s\rangle=1,

where the last inequality is due to the normalization vi,s=1\langle v_{i},s\rangle=1 in h in §2. In any ME (x,p)(x^{*},p^{*}), Theorem 1 implies

p,xi=βivi,xi=βiui=Bi,\langle p^{*},x^{*}_{i}\rangle=\beta^{*}_{i}\langle v_{i},x^{*}_{i}\rangle=\beta^{*}_{i}u^{*}_{i}=B_{i},

that is, each buyer ii spends her entire budget. Hence, by the normalization B1=1\|B\|_{1}=1 and market clearance p,sixi=0\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0, we have

p,s=ip,xi=iBi=B1=1p,Bis=Bi.\langle p^{*},s\rangle=\sum_{i}\langle p^{*},x^{*}_{i}\rangle=\sum_{i}B_{i}=\|B\|_{1}=1\ \Rightarrow\ \langle p^{*},B_{i}s\rangle=B_{i}.

In other words, given item price pp^{*}, each buyer ii can afford the proportional allocation xi:=Bisx^{\circ}_{i}:=B_{i}s. Hence, the buyer optimality property of ME implies that buyer ii’s equilibrium utility is at least the proportional share:

uivi,xi=Bivi,s=Bi.\displaystyle u^{*}_{i}\geq\langle v_{i},x^{\circ}_{i}\rangle=B_{i}\langle v_{i},s\rangle=B_{i}.

Since Biui1B_{i}\leq u^{*}_{i}\leq 1 and βi=Bi/ui\beta^{*}_{i}=B_{i}/u^{*}_{i} at equilibrium (Theorem 1), we have

Biβi1.B_{i}\leq\beta^{*}_{i}\leq 1.

By Lemma 1, adding the constraints

Bi/(1+δ0)βi1+δ0,iB_{i}/(1+\delta_{0})\leq\beta_{i}\leq 1+\delta_{0},\ \forall\,i

to the convex program does not affect its optimal solution β\beta^{*}. Here, δ0>0\delta_{0}>0 is to ensure βi(li,hi)\beta^{*}_{i}\in(l_{i},h_{i}) (the open interval), which facilitates the convergence analysis of cumulative utilities. To simplify the constants, one can take δ0=1\delta_{0}=1. Numerical experiments suggest that its value does not affect the speeds of convergence of quantities of interest.

Combining the above yields the reformulation (3). To align (3) with (2), for each θΘ\theta\in\Theta (corresponding to ZZ in §4) and β+n\beta\in\mathbb{R}^{n}_{+} (corresponding to ww in §4), let

fθ(β):=maxiβivij.f_{\theta}(\beta):=\max_{i}\beta_{i}v_{ij}.

Then,

f(β):=𝐄fθ(β)=maxiβivi,s,f(\beta):=\mathbf{E}f_{\theta}(\beta)=\langle\max_{i}\beta_{i}v_{i},s\rangle,

where the expectation is over θs\theta\sim s, i.e., a random variable with distribution ss (corresponding to z𝒟z\sim\mathcal{D} in §4).

Proof of Theorem 3

It follows immediately from Theorem 2, as long as the function

Ψ(β)=iBilogβi\Psi(\beta)=-\sum_{i}B_{i}\log\beta_{i}

is strongly convex modulo σ\sigma and 𝐄vit(θt)𝐞(it)2G2\mathbf{E}\|v_{i_{t}}(\theta_{t})\mathbf{e}^{(i_{t})}\|^{2}\leq G^{2}. We now show them. Note that Ψ\Psi is twice differentiable and has a diagonal Hessian

2Ψ(β)=[B1β12Bnβn2]\nabla^{2}\Psi(\beta)=\begin{bmatrix}\frac{B_{1}}{\beta_{1}^{2}}&&\\ &\ddots&\\ &&\frac{B_{n}}{\beta_{n}^{2}}\end{bmatrix}

at any β>0\beta>0. Clearly, its smallest eigenvalue can be bounded as

λmin(2Ψ(β))miniBiβi.\lambda_{\min}(\nabla^{2}\Psi(\beta))\geq\min_{i}\frac{B_{i}}{\beta_{i}}.

Denote κ=1/(miniBi)\kappa=1/(\min_{i}B_{i}). For any β\beta feasible to (3), by the constraints Bi/(1+δ0)βi1+δ0B_{i}/(1+\delta_{0})\leq\beta_{i}\leq 1+\delta_{0}, we have

λmin(2Ψ(β))miniminβi[Bi/(1+δ0),1+δ0]Biβi2=miniBi(1+δ0)2=1κ(1+δ0)2.\lambda_{\min}(\nabla^{2}\Psi(\beta))\geq\min_{i}\min_{\beta_{i}\in[B_{i}/(1+\delta_{0}),1+\delta_{0}]}\frac{B_{i}}{\beta_{i}^{2}}=\min_{i}\frac{B_{i}}{(1+\delta_{0})^{2}}=\frac{1}{\kappa(1+\delta_{0})^{2}}.

Therefore, Ψ\Psi is strongly convex on [B/(1+δ0),(1+δ0)𝟏][B/(1+\delta_{0}),(1+\delta_{0})\mathbf{1}] with modulus σ=1κ(1+δ0)2\sigma=\frac{1}{\kappa(1+\delta_{0})^{2}}. Finally, we have

𝐄vit𝐞(it)2maxi𝐄θs[vi(θ)2]=G2maxivi2.\mathbf{E}\|v_{i_{t}}\mathbf{e}^{(i_{t})}\|^{2}\leq\max_{i}\mathbf{E}_{\theta\sim s}[v_{i}(\theta)^{2}]=G^{2}\leq\max_{i}\|v_{i}\|_{\infty}^{2}.

Proof of Theorem 4

Intuitively, our proof uses the fact that if βit\beta^{t}_{i} and βi\beta_{i}^{*} are near each other, then Biβit\frac{B_{i}}{\beta^{t}_{i}} will be near Biβi=ui\frac{B_{i}}{\beta_{i}^{*}}=u_{i}^{*} as well. Recall that git=uitg^{t}_{i}=u^{t}_{i} (i.e., the subgradient of βmaxiβivi(θt)\beta\mapsto\max_{i}\beta_{i}v_{i}(\theta_{t}) that we choose corresponds to the utility buyer ii receives at time tt) and hence g¯it=u¯it\bar{g}^{t}_{i}=\bar{u}^{t}_{i}. Since

βt+1=Π[li,hi](Big¯it),\beta^{t+1}=\Pi_{[l_{i},h_{i}]}\left(\frac{B_{i}}{\bar{g}^{t}_{i}}\right),

we know that if no projection occurs (i.e., if Big¯it[li,hi]\frac{B_{i}}{\bar{g}^{t}_{i}}\in[l_{i},h_{i}]) at iteration tt, then

Biβit+1=g¯it.\frac{B_{i}}{\beta^{t+1}_{i}}=\bar{g}_{i}^{t}.

Thus, we split our proof into two cases: the case where projection occurs (i.e., Big¯it[li,hi]\frac{B_{i}}{\bar{g}^{t}_{i}}\notin[l_{i},h_{i}]), and the case where projection does not occur. As we will see, the probability of a projection at time step tt converges to 0 as tt grows.

For each ii, consider the event that no projection occurs:

Ait:={liBi/g¯ithi}.A^{t}_{i}:=\{l_{i}\leq B_{i}/\bar{g}^{t}_{i}\leq h_{i}\}.

Conditioning on the complementary event (Ait)c={g¯it[li,hi]}(A^{t}_{i})^{c}=\{\bar{g}^{t}_{i}\notin[l_{i},h_{i}]\}, it holds that

|βit+1βi|>ϵi𝐄(βit+1βi)2𝐏[(Ait)c]ϵi2𝐏[(Ait)c]1ϵi2𝐄(βit+1βi)2.|\beta^{t+1}_{i}-\beta^{*}_{i}|>\epsilon_{i}\ \Rightarrow\ \mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}\geq\mathbf{P}[(A^{t}_{i})^{c}]\epsilon_{i}^{2}\ \Rightarrow\ \mathbf{P}[(A^{t}_{i})^{c}]\leq\frac{1}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Conditioning on AitA^{t}_{i}, we have Bi/g¯it=βit+1B_{i}/\bar{g}^{t}_{i}=\beta^{t+1}_{i}. Furthermore, since

0g¯it=1tτ=1tvijτ𝕀{i=iτ}vi0\leq\bar{g}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\leq\|v_{i}\|_{\infty}

and vi1ui\|v_{i}\|_{\infty}\geq 1\geq u^{*}_{i}, we have the following upper bound on the difference between the time average of realized utilities and the equilibrium utility of buyer ii:

|g¯itui|max{ui,vi}=vi.|\bar{g}^{t}_{i}-u^{*}_{i}|\leq\max\{u^{*}_{i},\|v_{i}\|_{\infty}\}=\|v_{i}\|_{\infty}.

Now, splitting the expectation by the two complementary events AitA^{t}_{i} and (Ait)c(A^{t}_{i})^{c}, we can apply the above bounds to get

𝐄(g¯itui)2\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2} =𝐄[𝕀(Ait)c(g¯itui)2]+𝐄[𝕀Ait(Biβit+1ui)2]\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{*}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{*}_{i}\right)^{2}\right]
vi2𝐄[𝕀(Ait)c]+(ui)2𝐄[𝕀Ait(Biβit+1ui1)2]\displaystyle\leq\|v_{i}\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{*}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{*}_{i}}-1\right)^{2}\right]
vi2𝐏[(Ait)c]+(ui)2𝐄(βit+1βiβit+1)2\displaystyle\leq\|v_{i}\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{*}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{*}_{i}}{\beta^{t+1}_{i}}\right)^{2}
vi2ϵi2𝐄(βit+1βi)2+((1+δ0)uiBi)2𝐄(βit+1βi)2\displaystyle\leq\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}+\left(\frac{(1+\delta_{0})u^{*}_{i}}{B_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}
(vi2ϵi2+(1+δ0Bi)2)𝐄(βit+1βi)2.\displaystyle\leq\left(\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Since Biβi1B_{i}\leq\beta^{*}_{i}\leq 1, we have (κ:=1/(miniBi)\kappa:=1/(\min_{i}B_{i}))

ϵiBiδ0/(1+δ0)>δ0/κ>0.\epsilon_{i}\geq B_{i}\delta_{0}/(1+\delta_{0})>\delta_{0}/\kappa>0.

Summing up across all ii, using Theorem 3 and the above bound, we get

𝐄g¯tu2\displaystyle\mathbf{E}\|\bar{g}^{t}-u^{*}\|^{2} i(vi2ϵi2+(1+δ0Bi)2)𝐄(βit+1βi)2\displaystyle\leq\sum_{i}\left(\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{1+\delta_{0}}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}
(v2(κδ0)2+((1+δ0)κ)2)i𝐄(βit+1βi)2\displaystyle\leq\left(\|v\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\sum_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}
(v2(κδ0)2+((1+δ0)κ)2)(6+log(t+1))G2(t+1)σ2\displaystyle\leq\left(\|v\|_{\infty}^{2}\left(\frac{\kappa}{\delta_{0}}\right)^{2}+((1+\delta_{0})\kappa)^{2}\right)\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}
=C(6+log(t+1))G2(t+1)σ2.\displaystyle=C\cdot\frac{(6+\log(t+1))G^{2}}{(t+1)\sigma^{2}}.

Proof of Theorem 5

First, note that b¯it\bar{b}^{t}_{i} can be decomposed as follows.

b¯it\displaystyle\bar{b}^{t}_{i} =1tτ=1tβiτvijτ𝕀{i=iτ}\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}
=βi1tτ=1tvijτ𝕀{i=iτ}+1tτ=1t(βiτβi)vijτ𝕀{i=iτ}\displaystyle=\beta^{*}_{i}\cdot\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}
=βig¯it+1tτ=1t(βiτβi)vijτ𝕀{i=iτ}.\displaystyle=\beta^{*}_{i}\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}.

Next, we bound the second term as follows, using convexity of ()2(\cdot)^{2} and vijτvi\|v_{ij_{\tau}}\|\leq\|v_{i}\|_{\infty}:

(1tτ=1t(βiτβi)vijτ𝕀{i=iτ})21tτ=1t(βiτβi)2vi2.\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\right)^{2}\leq\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\|v_{i}\|_{\infty}^{2}.

Then, we bound the square difference between expenditure and budget as follows, using (x+y)22(x2+y2)(x+y)^{2}\leq 2(x^{2}+y^{2}) for any x,yx,y\in\mathbb{R}:

(b¯itBi)2\displaystyle(\bar{b}^{t}_{i}-B_{i})^{2} 2[(βig¯itBi)2+(1tτ=1t(βiτβi)vijτ𝕀{i=iτ})2].\displaystyle\leq 2\left[(\beta^{*}_{i}\bar{g}^{t}_{i}-B_{i})^{2}+\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}\right)^{2}\right].

Combining the above two inequalities, taking expectation on both sides and using βi=Bi/ui\beta^{*}_{i}=B_{i}/u^{*}_{i}, we have

𝐄(b¯itBi)22[(βi)2𝐄(g¯itui)2+vi21tτ=1t𝐄(βiτβi)2].\displaystyle\mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2}\leq 2\left[(\beta^{*}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\|v_{i}\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right]. (5)

When t3t\geq 3, we have log(t+1)t+1<logtt\frac{\log(t+1)}{t+1}<\frac{\log t}{t} (since (logtt)=1logtt2<0(\frac{\log t}{t})^{\prime}=\frac{1-\log t}{t^{2}}<0 for all t3t\geq 3). By the proof of [48, Corollary 4],

1tτ=1t(6+logτ)G2τσ21t(6(1+logt)+(logt)22)G2σ2.\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}\frac{(6+\log\tau)G^{2}}{\tau\sigma^{2}}\leq\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}. (6)

Finally, summing up (5) across all ii, using βi1\beta^{*}_{i}\leq 1, Theorems 3 and 4, and (6), we have

𝐄b¯tB2\displaystyle\mathbf{E}\|\bar{b}^{t}-B\|^{2} 2[𝐄g¯tu2+v21tτ=1t𝐄βτβ2]\displaystyle\leq 2\left[\mathbf{E}\|\bar{g}^{t}-u^{*}\|^{2}+\|v\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\|\beta^{\tau}-\beta^{*}\|^{2}\right]
2[C(6+logt)G2tσ2+v21t(6(1+logt)+(logt)22)G2σ2]\displaystyle\leq 2\left[C\cdot\frac{(6+\log t)G^{2}}{t\sigma^{2}}+\|v\|_{\infty}^{2}\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}\right]
=2G2tσ2(6(C+v2)+(C+6v2)logt+v22(logt)2).\displaystyle=\frac{2G^{2}}{t\sigma^{2}}\left(6(C+\|v\|_{\infty}^{2})+(C+6\|v\|_{\infty}^{2})\log t+\frac{\|v\|_{\infty}^{2}}{2}(\log t)^{2}\right).

Proof of Theorem 6

For any θΘ\theta\in\Theta, since \|\cdot\|_{\infty} is 11-Lipschitz continuous w.r.t. itself, we have

|p(θ)maxiβitvi(θ)|\displaystyle\left|p^{*}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right| |maxiβivi(θ)maxiβitvi(θ)|\displaystyle\leq\left|\max_{i}\beta^{*}_{i}v_{i}(\theta)-\max_{i}\beta^{t}_{i}v_{i}(\theta)\right|
maxi|βivi(θ)βitvi(θ)|\displaystyle\leq\max_{i}|\beta^{*}_{i}v_{i}(\theta)-\beta^{t}_{i}v_{i}(\theta)|
vβtβ.\displaystyle\leq\|v\|_{\infty}\|\beta^{t}-\beta^{*}\|_{\infty}. (7)

Analysis of regret ritr^{t}_{i}.

Let (ziτ)τ[t][0,1]t(z^{\tau}_{i})_{\tau\in[t]}\in[0,1]^{t} be any feasible allocation on the arrived items θτ\theta_{\tau}, τ[t]\tau\in[t] such that

1tτ=1tpτ(θt)ziτBi.\frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{t})z^{\tau}_{i}\leq B_{i}.

Using pτ(θτ)=maxiβiτvi(θτ)p^{\tau}(\theta_{\tau})=\max_{i}\beta^{\tau}_{i}v_{i}(\theta_{\tau}), we have

1tτ=1tp(θτ)ziτ\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})z^{\tau}_{i} =1tτ=1tpτ(θτ)zit+1tτ=1t(p(θτ)pτ(θτ))ziτ\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}p^{\tau}(\theta_{\tau})z^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{\tau})-p^{\tau}(\theta_{\tau}))z^{\tau}_{i}
Bi+1tvτ=1tβτβ[by (7) and 0zijτ1.\displaystyle\leq B_{i}+\frac{1}{t}\|v\|_{\infty}\sum_{\tau=1}^{t}\|\beta^{\tau}-\beta^{*}\|_{\infty}\quad\text{[by \eqref{eq:bound-|p*(j)-max-beta(i,t)*v(i,j)|-any-j} and $0\leq z^{\tau}_{ij}\leq 1$] }. (8)

Denote

γt=1tvτ=1tβτβ.\gamma_{t}=\frac{1}{t}\|v\|_{\infty}\sum_{\tau=1}^{t}\|\beta^{\tau}-\beta^{*}\|_{\infty}.

In a static ME, by Theorem 1 and the constraints in (𝒟EG\mathcal{D}_{\rm EG}), we have pβivip^{*}\geq\beta^{*}_{i}v_{i}. Hence,

1tτ=1tp(θτ)ziτβi(1tτ=1tvi(θτ)ziτ).\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})z^{\tau}_{i}\geq\beta^{*}_{i}\left(\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}\right). (9)

By (8), (9), ui=Bi/βiu^{*}_{i}=B_{i}/\beta^{*}_{i} (Theorem 1) and the definition of ξit\xi^{t}_{i}, we have

1tτ=1tvi(θτ)ziτ1βi(Bi+γt)=ui(1+γtBi)ui+γiBiu¯it+ξit+γtBi.\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}v_{i}(\theta_{\tau})z^{\tau}_{i}\leq\frac{1}{\beta^{*}_{i}}(B_{i}+\gamma_{t})=u^{*}_{i}\left(1+\frac{\gamma_{t}}{B_{i}}\right)\leq u^{*}_{i}+\frac{\gamma_{i}}{B_{i}}\leq\bar{u}^{t}_{i}+\xi^{t}_{i}+\frac{\gamma_{t}}{B_{i}}.

Hence, the utility level U^it\hat{U}^{t}_{i} (Definition 3) satisfies

U^itu¯it+ξit+γtBi.\displaystyle\hat{U}^{t}_{i}\leq\bar{u}^{t}_{i}+\xi^{t}_{i}+\frac{\gamma_{t}}{B_{i}}. (10)

Note that 𝐄(γt2)\mathbf{E}(\gamma_{t}^{2}) can be bounded as follows:

𝐄(γt2)v21tτ=1t𝐄βτβ2v2t(6(1+logt)+(logt)22)G2σ2=O((logt)2t).\displaystyle\mathbf{E}(\gamma_{t}^{2})\leq\|v\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\|\beta^{\tau}-\beta^{*}\|^{2}\leq\frac{\|v\|_{\infty}^{2}}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}=O\left(\frac{(\log t)^{2}}{t}\right). (11)

where the second inequality is due to Theorem 3 and (6). Combining (10), (11) and 𝐄(ξit)2=O((logt)/t)\mathbf{E}(\xi^{t}_{i})^{2}=O((\log t)/t) (Theorem 4), we have

𝐄(rit)22(𝐄(ξit)2+1Bi2𝐄(γt2))=O((logt)2t).\mathbf{E}(r^{t}_{i})^{2}\leq 2\left(\mathbf{E}(\xi^{t}_{i})^{2}+\frac{1}{B_{i}^{2}}\mathbf{E}(\gamma_{t}^{2})\right)=O\left(\frac{(\log t)^{2}}{t}\right).

Analysis of envy ρit\rho^{t}_{i}.

Let p=maxiβivip^{*}=\max_{i}\beta^{*}_{i}v_{i} (a.e.) be the equilibrium prices. Similar to 8, for any ii, we have

1tτ=1tp(θτ)xiτ\displaystyle\frac{1}{t}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})x^{\tau}_{i} =b¯it+1tτ=1t(p(θτ)βiτvi(θτ))xiτ\displaystyle=\bar{b}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(p^{*}(\theta_{\tau})-\beta^{\tau}_{i}v_{i}(\theta_{\tau}))x^{\tau}_{i}
Bi+Δit+ηit.\displaystyle\leq B_{i}+\Delta^{t}_{i}+\eta^{t}_{i}. (12)

Using the above (replacing ii with kk) and pβivip^{*}\geq\beta^{*}_{i}v_{i}, we have

βiu¯ikt=1tτ=1tβivi(θτ)xkτ1ττ=1tp(θτ)xkτBk+Δkt+ηkt.\displaystyle\beta^{*}_{i}\bar{u}^{t}_{ik}=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{*}_{i}v_{i}(\theta_{\tau})x^{\tau}_{k}\leq\frac{1}{\tau}\sum_{\tau=1}^{t}p^{*}(\theta_{\tau})x^{\tau}_{k}\leq B_{k}+\Delta^{t}_{k}+\eta^{t}_{k}.

Hence, using ui=Bi/βi1u^{*}_{i}=B_{i}/\beta^{*}_{i}\leq 1 (Theorem 1 and Lemma 1),

u¯iktBk\displaystyle\frac{\bar{u}^{t}_{ik}}{B_{k}} 1Bk1βi(Bk+Δkt+ηkt)\displaystyle\leq\frac{1}{B_{k}}\cdot\frac{1}{\beta^{*}_{i}}(B_{k}+\Delta^{t}_{k}+\eta^{t}_{k})
uiBi(1+Δit+ηitBk)\displaystyle\leq\frac{u^{*}_{i}}{B_{i}}\left(1+\frac{\Delta^{t}_{i}+\eta^{t}_{i}}{B_{k}}\right)
u¯itBi+ξitBi+Δit+ηitBkBi[by definition of ξit].\displaystyle\leq\frac{\bar{u}^{t}_{i}}{B_{i}}+\frac{\xi^{t}_{i}}{B_{i}}+\frac{\Delta^{t}_{i}+\eta^{t}_{i}}{B_{k}B_{i}}\quad\text{[by definition of $\xi^{t}_{i}$]}.

Using the above inequality, we can bound the envy as follows:

ρitξitBi+1BimaxkΔkt+ηktBkκξit+κ2maxk(Δkt+ηkt).\displaystyle\rho^{t}_{i}\leq\frac{\xi^{t}_{i}}{B_{i}}+\frac{1}{B_{i}}\max_{k}\frac{\Delta^{t}_{k}+\eta^{t}_{k}}{B_{k}}\leq\kappa\xi^{t}_{i}+\kappa^{2}\max_{k}(\Delta^{t}_{k}+\eta^{t}_{k}). (13)

Next, we show the convergence of ηit\eta^{t}_{i}. By (7), we have

|ηit||ηt|1tτ=1t|pjτβiττviτjτ|1tτ=1tvβτβ=γt.\displaystyle|\eta^{t}_{i}|\leq\sum_{\ell}|\eta^{t}_{\ell}|\leq\frac{1}{t}\sum_{\tau=1}^{t}|p^{*}_{j_{\tau}}-\beta^{\tau}_{i_{\tau}}v_{i_{\tau}j_{\tau}}|\leq\frac{1}{t}\sum_{\tau=1}^{t}\|v\|_{\infty}\|\beta^{\tau}-\beta^{*}\|_{\infty}=\gamma_{t}. (14)

Hence, same as (11),

𝐄(ηit)2v21tτ=1t𝐄βτβ2v21t(6(1+logt)+(logt)22)G2σ2.\displaystyle\mathbf{E}(\eta^{t}_{i})^{2}\leq\|v\|_{\infty}^{2}\frac{1}{t}\sum_{\tau=1}^{t}\mathbf{E}\|\beta^{\tau}-\beta^{*}\|^{2}\leq\|v\|_{\infty}^{2}\frac{1}{t}\left(6(1+\log t)+\frac{(\log t)^{2}}{2}\right)\frac{G^{2}}{\sigma^{2}}. (15)

By Theorems 3 and 5, we know that 𝐄(ξit)2=O((logt)/t)\mathbf{E}(\xi^{t}_{i})^{2}=O\left((\log t)/t\right) and 𝐄(Δit)2=O((logt)2/t)\mathbf{E}(\Delta^{t}_{i})^{2}=O\left((\log t)^{2}/t\right). Together with (15) and (13), we have

𝐄(ρit)2κ𝐄(ξit)2+κ2(𝐄(Δt)2+𝐄(ηt)2)=O((logt)2t).\mathbf{E}(\rho^{t}_{i})^{2}\leq\kappa\mathbf{E}(\xi^{t}_{i})^{2}+\kappa^{2}\sum_{\ell}(\mathbf{E}(\Delta^{t}_{\ell})^{2}+\mathbf{E}(\eta^{t}_{\ell})^{2})=O\left(\frac{(\log t)^{2}}{t}\right).

Appendix C Extension to quasilinear utilities

We show that PACE can be easily extended to the case of a quasilinear (QL) market (i.e., where buyers have QL utilities). We show that most of the convergence results in §5 still holds. The static quasilinear market setup is the same as the linear case in §2 (which allows a possibly infinite item space Θ\Theta), except the following:

  • For given item prices pL+1p\in L^{1}_{+}, each buyer ii has a quasilinear utility function, i.e.,

    ui(xi)=vi,xip,xi.u_{i}(x_{i})=\langle v_{i},x_{i}\rangle-\langle p,x_{i}\rangle.
  • Without loss of generality, assume B1=1\|B\|_{1}=1 and all buyers’ valuations are nontrivial, i.e., vi,s>0\langle v_{i},s\rangle>0 for all ii. Due to the structure of QL utilities, we cannot normalize the valuations viv_{i} and budgets BiB_{i} separately without loss of generality. Instead, they can only be scaled at the same time by the same constant.

Same as before, each buyer ii has a budget Bi>0B_{i}>0 and can only choose among budget-feasible allocations, that is, xix_{i} such that p,xiBi\langle p,x_{i}\rangle\leq B_{i}. In this case, an allocation-price pair (x,p)(x^{*},p^{*}) is a quasilinear market equilibrium (QLME) if the following holds (see [24, §6] and [17, §4]):

  • Buyers are optimal: xiDi(p):=argmax{vip,xi:xiL+,p,xiBi}x^{*}_{i}\in D_{i}(p^{*}):=\operatorname*{arg\,max}\{\langle v_{i}-p^{*},x_{i}\rangle:x_{i}\in L^{\infty}_{+},\,\langle p^{*},x_{i}\rangle\leq B_{i}\}.

  • The market clears: ixis\sum_{i}x_{i}\leq s and p,sixi=0\langle p^{*},s-\sum_{i}x^{*}_{i}\rangle=0.

As shown in [24, §6], the following pair of (possibly infinite-dimensional) convex programs capture QLME:777There, the authors assume s=𝟏s=\mathbf{1}, which is w.l.o.g. for static Fisher markets. Similar to the case of Theorem 1 for linear utilities, all results can be easily extended to the case of sL+s\in L^{\infty}_{+}.

supi(Biloguiδi)s.t.uivi,xi+δi,i[n],ixis,ui0,δi0,xiL1(Θ)+,i[n].\displaystyle\begin{split}\sup\,&\sum_{i}(B_{i}\log u_{i}-\delta_{i})\\ {\rm s.t.}&u_{i}\leq\langle v_{i},x_{i}\rangle+\delta_{i},\,\forall\,i\in[n],\\ &\sum_{i}x_{i}\leq s,\,\\ &u_{i}\geq 0,\ \delta_{i}\geq 0,\ x_{i}\in L_{1}(\Theta)_{+},\ \forall\,i\in[n].\end{split} (𝒫QLEG\mathcal{P}_{\rm QLEG})
infp,siBilogβis.t.pβivi,βi1,i[n],pL1(Θ)+,β+d.\displaystyle\begin{split}\inf\,&\langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}\\ {\rm s.t.}&p\geq\beta_{i}v_{i},\ \beta_{i}\leq 1,\ \forall\,i\in[n],\\ &p\in L_{1}(\Theta)_{+},\ \beta\in\mathbb{R}^{d}_{+}.\end{split} (𝒟QLEG\mathcal{D}_{\rm QLEG})

In the sequel, we use (x,u,δ)(x^{*},u^{*},\delta^{*}) to denote an optimal solution of (𝒫QLEG\mathcal{P}_{\rm QLEG}) (in which uu^{*} and δ\delta^{*} are unique) and (p,β)(p^{*},\beta^{*}) to denote the optimal solution of (𝒟QLEG\mathcal{D}_{\rm QLEG}). As shown in [24, §6], the following KKT conditions of (𝒫QLEG\mathcal{P}_{\rm QLEG}) and (𝒟QLEG\mathcal{D}_{\rm QLEG}) are necessary and sufficient for (x,p)(x^{*},p^{*}) being a QLME.

  • δi(1βi)=0\delta^{*}_{i}(1-\beta^{*}_{i})=0 for all ii (complementary slackness).

  • ui=Bi/βiu^{*}_{i}=B_{i}/\beta^{*}_{i} for all ii.

  • p=maxiβvip^{*}=\max_{i}\beta^{*}v_{i} (a.e.) for all jj.

  • pβivi,xi=0\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0 for all ii.

Let (x,p)(x^{*},p^{*}) denote a QLME. The equilibrium utility of buyer ii (i.e., the amount of utility buyer ii receives at a QLME) is

uiQLME:=vip,xi=(1βi)vi,xi=(1βi)(uiδi),u^{\rm QLME}_{i}:=\langle v_{i}-p^{*},x^{*}_{i}\rangle=(1-\beta^{*}_{i})\langle v_{i},x^{*}_{i}\rangle=(1-\beta^{*}_{i})(u^{*}_{i}-\delta^{*}_{i}),

which is unique and does not depend on the choice of the equilibrium allocation xx^{*}. In general, uiQLMEu^{\rm QLME}_{i} is not the same as uiu^{*}_{i} in the optimal solution of (𝒫QLEG\mathcal{P}_{\rm QLEG}). In comparison, the term vi,xi\langle v_{i},x^{*}_{i}\rangle can be viewed as the equilibrium gross utility before subtracting the price p,xi\langle p^{*},x^{*}_{i}\rangle of the allocation xix^{*}_{i}. The above equilibrium quantities satisfy the following Gao and Kroer [24, §6].

  • If βi=1\beta^{*}_{i}=1, then pβivi,xi=0\langle p^{*}-\beta^{*}_{i}v_{i},x^{*}_{i}\rangle=0 implies its gross utility and expenditure are equal, which give an equilibrium utility of zero:

    vi,xi=βivi,xi=p,xi=uiδiuiQLME=vip,xi=0.\langle v_{i},x^{*}_{i}\rangle=\beta^{*}_{i}\langle v_{i},x^{*}_{i}\rangle=\langle p^{*},x^{*}_{i}\rangle=u^{*}_{i}-\delta^{*}_{i}\ \Rightarrow\ u^{\rm QLME}_{i}=\langle v_{i}-p^{*},x^{*}_{i}\rangle=0.
  • If βi<1\beta^{*}_{i}<1, then δi=0\delta^{*}_{i}=0 by complementary slackness (the first KKT condition above). Hence, the gross utility is vi,xi=ui\langle v_{i},x^{*}_{i}\rangle=u^{*}_{i} and

    uQLME=vip,xi=(1βi)vi,xi=(1βi)ui.u^{\rm QLME}=\langle v_{i}-p^{*},x^{*}_{i}\rangle=(1-\beta^{*}_{i})\langle v_{i},x^{*}_{i}\rangle=(1-\beta^{*}_{i})u^{*}_{i}.

Similar to the proof of [23, Lemma 5], we can show that

uivi,s+Bi.u^{*}_{i}\leq\langle v_{i},s\rangle+B_{i}.

Hence,

β=BiuiBivi,s+Bi>βimin:=Bivi,s+2Bi>0.\beta^{*}=\frac{B_{i}}{u^{*}_{i}}\geq\frac{B_{i}}{\langle v_{i},s\rangle+B_{i}}>\beta^{\min}_{i}:=\frac{B_{i}}{\langle v_{i},s\rangle+2B_{i}}>0.

The choice of βimin\beta^{\min}_{i} is to ensure that βiβimin>0\beta^{*}_{i}-\beta^{\min}_{i}>0, which simplifies the analysis of the dynamics. Substituting p=maxiβivip=\max_{i}\beta_{i}v_{i} and using the bounds βiminβi1\beta^{\min}_{i}\leq\beta^{*}_{i}\leq 1, we can solve the following convex program for the equilibrium utility prices β\beta^{*}, where βmin:=(β1min,,βnmin)\beta^{\min}:=(\beta^{\min}_{1},\dots,\beta^{\min}_{n}):

minβ[βmin,𝟏]p,siBilogβi.\displaystyle\min_{\beta\in[\beta^{\min},\mathbf{1}]}\ \langle p,s\rangle-\sum_{i}B_{i}\log\beta_{i}. (16)

Applying dual averaging to the convex program (16), we arrive at the following PACE dynamics for an online QL market (i.e., an OFM with buyers having QL utilities). At time tt, the following steps take place.

  • An item θtΘ\theta_{t}\in\Theta arrives, which determines a winner it=minargmaxiβitvi(θt)i_{t}=\min\operatorname*{arg\,max}_{i}\beta^{t}_{i}v_{i}(\theta_{t}).

  • The stochastic subgradient is gt=vit(θt)𝐞(it)g^{t}=v_{i_{t}}(\theta_{t})\mathbf{e}^{(i_{t})}, or git=vi(θt)𝕀{i=it}g^{t}_{i}=v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\} for each ii.

  • Each buyer ii pays a price (expenditure)

    bit=βitvi(θt)𝕀{i=it}b^{t}_{i}=\beta^{t}_{i}v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}

    and receives a (net) utility of

    uit=gitbit=(1βit)vi(θt)𝕀{i=it},\displaystyle u^{t}_{i}=g^{t}_{i}-b^{t}_{i}=(1-\beta^{t}_{i})v_{i}(\theta_{t})\mathbb{I}\{i=i_{t}\}, (17)

    which is is the value of the item minus the price paid. Here, only the winning buyer iti_{t} may get a potentially nonzero utility uittu^{t}_{i_{t}}; other buyers iiti\neq i_{t} gets 0 (and pays zero).

  • Update the dual average: for each ii, g¯t=t1tg¯t1+1tgt\bar{g}^{t}=\frac{t-1}{t}\bar{g}^{t-1}+\frac{1}{t}g^{t}, which ensures g¯it=1tτ=1tvijτ𝕀{i=iτ}\bar{g}^{t}_{i}=\frac{1}{t}\sum_{\tau=1}^{t}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\} for all ii (same as in the linear case).

  • Compute the next pacing multiplier (similar to the linear case, except the lower bound for βit\beta^{t}_{i} being βimin\beta^{\min}_{i} instead of BiB_{i}):

    βit+1=argminβi[βimin,1]{g¯itβiBilogβi}βit+1=Π[βimin,1](Big¯it).\beta^{t+1}_{i}=\operatorname*{arg\,min}_{\beta_{i}\in[\beta^{\min}_{i},1]}\left\{\bar{g}^{t}_{i}\beta_{i}-B_{i}\log\beta_{i}\right\}\ \Rightarrow\ \beta^{t+1}_{i}=\Pi_{[\beta^{\min}_{i},1]}\left(\frac{B_{i}}{\bar{g}^{t}_{i}}\right).

Same as in the linear case, we do not need any distributional assumption on the item arrivals to run PACE. In subsequent convergence analysis, however, we assume that the items θt\theta_{t} are drawn i.i.d. from a distribution ss (i.e., sL+s\in L^{\infty}_{+} and s(Θ)=1s(\Theta)=1). We also assume that viL+v_{i}\in L^{\infty}_{+} for all ii. Let (x,p)(x^{*},p^{*}) denote a QLME of the underlying static QL market with supplies ss.

Convergence of QL pacing multipliers.

Analogous to Theorem 3, in the QL case, we can show that the pacing multipliers βt\beta^{t} converge to the equilibrium utility prices β\beta^{*} in mean square. It is a direct consequence of the general convergence result of dual averaging (Theorem 2).

Theorem 7.

For t=1,2,t=1,2,\dots, it holds that

𝐄βtβ2(6+logt)G2tσ2,\mathbf{E}\|\beta^{t}-\beta^{*}\|^{2}\leq\frac{(6+\log t)G^{2}}{t\sigma^{2}},

where G2G^{2}, κ\kappa are the same as in Theorem 3 and

σ=miniminβi[βimin,1]Biβi2=miniBi=1κ.\sigma=\min_{i}\min_{\beta_{i}\in[\beta^{\min}_{i},1]}\frac{B_{i}}{\beta_{i}^{2}}=\min_{i}B_{i}=\frac{1}{\kappa}.

Convergence of QL utilities and expenditures.

Next, we show mean-square convergence of time-averaged utilities u¯t=1tτ=1tuit\bar{u}^{t}=\frac{1}{t}\sum_{\tau=1}^{t}u^{t}_{i} and expenditures b¯t=1tτ=1tβiτvijτ𝕀{i=iτ}\bar{b}^{t}=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{ij_{\tau}}\mathbb{I}\{i=i_{\tau}\}. In the QL case, the dual average gitg^{t}_{i} can also be viewed as the per-period gross utility before subtracting the price βitvi(θt)\beta^{t}_{i}v_{i}(\theta_{t}). In this way, g¯it\bar{g}^{t}_{i} is the time-averaged gross utility of buyer ii.

Theorem 8.

For t=1,2,t=1,2,\dots and each ii, the following holds.

  • If βi=1\beta^{*}_{i}=1, then uiQLME=0u^{\rm QLME}_{i}=0 and Bi=uiB_{i}=u^{*}_{i}. In this case,

    𝐄(uituiQLME)2=𝐄|uit|2vi2𝐄(1βit)2=O(logtt).\displaystyle\mathbf{E}\left(u^{t}_{i}-u^{\rm QLME}_{i}\right)^{2}=\mathbf{E}|u^{t}_{i}|^{2}\leq\|v_{i}\|_{\infty}^{2}\mathbf{E}(1-\beta^{t}_{i})^{2}=O\left(\frac{\log t}{t}\right). (18)
  • If βi<1\beta^{*}_{i}<1, then uiQLME>0u^{\rm QLME}_{i}>0, δi=0\delta^{*}_{i}=0 and uQLME=(1βi)uiu^{\rm QLME}=(1-\beta^{*}_{i})u^{*}_{i}. In this case, the gross utility g¯it\bar{g}^{t}_{i}, realized (net) utility u¯it\bar{u}^{t}_{i} and expenditures b¯it\bar{b}^{t}_{i} converge as follows:

    𝐄(g¯itui)2Ci𝐄(βit+1βi)2=O(logtt),\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}\leq C_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}=O\left(\frac{\log t}{t}\right),
    𝐄(u¯ituiQLME)2Rit,𝐄(b¯itBi)2Rit,\displaystyle\mathbf{E}\left(\bar{u}^{t}_{i}-u^{\rm QLME}_{i}\right)^{2}\leq R^{t}_{i},\ \ \mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2}\leq R^{t}_{i},

    where

    Ci\displaystyle C_{i} =vi2ϵi2+(vi1m+2Bi)4Bi2,ϵi=min{1βi,βiβimin},\displaystyle=\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\frac{\left(\frac{\|v_{i}\|_{1}}{m}+2B_{i}\right)^{4}}{B_{i}^{2}},\ \ \epsilon_{i}=\min\{1-\beta^{*}_{i},\beta^{*}_{i}-\beta^{\min}_{i}\},
    Rit\displaystyle R^{t}_{i} =2[𝐄(g¯itui)2+vi2tτ=1t𝐄(βiτβi)2],𝐄Rit=O((logt)2t).\displaystyle=2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\frac{\|v_{i}\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right],\ \ \mathbf{E}R^{t}_{i}=O\left(\frac{(\log t)^{2}}{t}\right).

Hence, the mean-square error 𝐄u¯tuQLME2\mathbf{E}\left\|\bar{u}^{t}-u^{\rm QLME}\right\|^{2} is either O((logt)2/t)O((\log t)^{2}/t) (when some βi<1\beta^{*}_{i}<1) or O((logt)/t)O((\log t)/t) (when all βi=1\beta^{*}_{i}=1).

Proof.

The βi=1\beta^{*}_{i}=1 case is clear. We prove the βi<1\beta^{*}_{i}<1 case.

Convergence of g¯it\bar{g}^{t}_{i}.

Denote the event Ait={Big¯itBi/βimin}A^{t}_{i}=\{B_{i}\leq\bar{g}^{t}_{i}\leq B_{i}/\beta^{\min}_{i}\}. Then, (Ait)c(A^{t}_{i})^{c} means Bi/g¯it[βimin,1]B_{i}/\bar{g}^{t}_{i}\notin[\beta^{\min}_{i},1] and hence |βt+1βi|>ϵi|\beta^{t+1}-\beta^{*}_{i}|>\epsilon_{i}. Similar to the linear case, we deduce

𝐄(βit+1βi)2𝐏[(Ait)c]ϵi2𝐏[(Ait)c]1ϵi2𝐄(βit+1βi)2.\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}\geq\mathbf{P}[(A^{t}_{i})^{c}]\epsilon_{i}^{2}\Rightarrow\ \mathbf{P}[(A^{t}_{i})^{c}]\leq\frac{1}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}.

Furthermore, since 0g¯itvi0\leq\bar{g}^{t}_{i}\leq\|v_{i}\|_{\infty} (same as in the linear case) and uivi,s+Biu^{*}_{i}\leq\langle v_{i},s\rangle+B_{i}, we have

𝐄(g¯itui)2\displaystyle\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2} =𝐄[𝕀(Ait)c(g¯itui)2]+𝐄[𝕀Ait(Biβit+1ui)2]\displaystyle=\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}\cdot(\bar{g}^{t}_{i}-u^{*}_{i})^{2}]+\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}}-u^{*}_{i}\right)^{2}\right]
vi2𝐄[𝕀(Ait)c]+(ui)2𝐄[𝕀Ait(Biβit+1ui1)2]\displaystyle\leq\|v_{i}\|_{\infty}^{2}\mathbf{E}[\mathbb{I}_{(A^{t}_{i})^{c}}]+(u^{*}_{i})^{2}\mathbf{E}\left[\mathbb{I}_{A^{t}_{i}}\cdot\left(\frac{B_{i}}{\beta^{t+1}_{i}u^{*}_{i}}-1\right)^{2}\right]
vi2𝐏[(Ait)c]+(ui)2𝐄(βit+1βiβit+1)2\displaystyle\leq\|v_{i}\|_{\infty}^{2}\mathbf{P}[(A^{t}_{i})^{c}]+(u^{*}_{i})^{2}\cdot\mathbf{E}\left(\frac{\beta^{t+1}_{i}-\beta^{*}_{i}}{\beta^{t+1}_{i}}\right)^{2}
vi2ϵi2𝐄(βit+1βi)2+(uiβimin)2𝐄(βit+1βi)2\displaystyle\leq\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}+\left(\frac{u^{*}_{i}}{\beta^{\min}_{i}}\right)^{2}\cdot\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}
(vi2ϵi2+((vi,s+Bi)(vi,s+2Bi)Bi)2)𝐄(βit+1βi)2\displaystyle\leq\left(\frac{\|v_{i}\|_{\infty}^{2}}{\epsilon_{i}^{2}}+\left(\frac{\left(\langle v_{i},s\rangle+B_{i}\right)\left(\langle v_{i},s\rangle+2B_{i}\right)}{B_{i}}\right)^{2}\right)\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}
=Ci𝐄(βit+1βi)2=O(logtt).\displaystyle=C_{i}\mathbf{E}(\beta^{t+1}_{i}-\beta^{*}_{i})^{2}=O\left(\frac{\log t}{t}\right).

Convergence of expenditures b¯it\bar{b}^{t}_{i}.

Similar to the linear case, note that b¯it\bar{b}^{t}_{i} can be decomposed as follows (where xiτ:=𝕀{i=iτ}x^{\tau}_{i}:=\mathbb{I}\{i=i_{\tau}\} denotes whether buyer ii wins at time step τ\tau):

b¯it:=1tτ=1tβiτvi(θτ)xiτ=βig¯it+1tτ=1t(βiτβi)vi(θτ)xiτ.\displaystyle\bar{b}^{t}_{i}:=\frac{1}{t}\sum_{\tau=1}^{t}\beta^{\tau}_{i}v_{i}(\theta_{\tau})x^{\tau}_{i}=\beta^{*}_{i}\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}.

Hence, using βi=Bi/ui1\beta^{*}_{i}=B_{i}/u^{*}_{i}\leq 1, (x+y)22(x2+y2)(x+y)^{2}\leq 2(x^{2}+y^{2}), convexity of ()2(\cdot)^{2} and vi(θt)xiτviv_{i}(\theta_{t})x^{\tau}_{i}\leq\|v_{i}\|_{\infty}, we have

𝐄(b¯itBi)2\displaystyle\mathbf{E}(\bar{b}^{t}_{i}-B_{i})^{2} 2[𝐄(βig¯itBi)2+𝐄(1tτ=1t(βiτβi)vi(θτ)xiτ)2]\displaystyle\leq 2\left[\mathbf{E}(\beta^{*}_{i}\bar{g}^{t}_{i}-B_{i})^{2}+\mathbf{E}\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}\right)^{2}\right]
2[(βi)2𝐄(g¯itui)2+vi2tτ=1t𝐄(βiτβi)2]\displaystyle\leq 2\left[(\beta^{*}_{i})^{2}\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\frac{\|v_{i}\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right]
2[𝐄(g¯itui)2+vi2tτ=1t𝐄(βiτβi)2]=Rit.\displaystyle\leq 2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\frac{\|v_{i}\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right]=R^{t}_{i}. (19)

The order of 𝐄Rit\mathbf{E}R^{t}_{i} is given by those of 𝐄(g¯itui)2\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2} and τ=1t𝐄(βiτβi)\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i}), which are O((logt)/t)O((\log t)/t) and O((logt)2/t)O((\log t)^{2}/t), respectively.

Convergence of utilities u¯it\bar{u}^{t}_{i}.

For a buyer ii with βi<1\beta^{*}_{i}<1, let

ϵi=min{1βi,βiβimin}>0.\epsilon_{i}=\min\{1-\beta^{*}_{i},\beta^{*}_{i}-\beta^{\min}_{i}\}>0.

Express u¯it\bar{u}^{t}_{i} as follows:

u¯it\displaystyle\bar{u}^{t}_{i} =1tτ=1t(1βiτ)vi(θτ)xiτ\displaystyle=\frac{1}{t}\sum_{\tau=1}^{t}(1-\beta^{\tau}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}
=(1βi)g¯it+1tτ=1t(βiτβi)vi(θτ)xiτ.\displaystyle=(1-\beta^{*}_{i})\bar{g}^{t}_{i}+\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}.

Since uiQLME=(1βi)uiu^{\rm QLME}_{i}=(1-\beta^{*}_{i})u^{*}_{i}, similar to (19), we have

𝐄(u¯ituiQLME)2\displaystyle\mathbf{E}\left(\bar{u}^{t}_{i}-u^{\rm QLME}_{i}\right)^{2} 2[(1βi)𝐄(g¯itui)2+𝐄(1tτ=1t(βiτβi)vi(θτ)xiτ)2]\displaystyle\leq 2\left[(1-\beta^{*}_{i})\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\mathbf{E}\left(\frac{1}{t}\sum_{\tau=1}^{t}(\beta^{\tau}_{i}-\beta^{*}_{i})v_{i}(\theta_{\tau})x^{\tau}_{i}\right)^{2}\right]
2[𝐄(g¯itui)2+vi2tτ=1t𝐄(βiτβi)2]=Rit.\displaystyle\leq 2\left[\mathbf{E}(\bar{g}^{t}_{i}-u^{*}_{i})^{2}+\frac{\|v_{i}\|_{\infty}^{2}}{t}\sum_{\tau=1}^{t}\mathbf{E}(\beta^{\tau}_{i}-\beta^{*}_{i})^{2}\right]=R^{t}_{i}.

Finally, if all βi=1\beta^{*}_{i}=1, (18) implies that 𝐄u¯tu¯QLME2=O((logt)/t)\mathbf{E}\|\bar{u}^{t}-\bar{u}^{\rm QLME}\|^{2}=O((\log t)/t). It some βi<1\beta^{*}_{i}<1, since 𝐄Rit=O((logt)2/t)\mathbf{E}R^{t}_{i}=O((\log t)^{2}/t), so is 𝐄u¯tu¯QLME2\mathbf{E}\|\bar{u}^{t}-\bar{u}^{\rm QLME}\|^{2}. ∎

Appendix D More details on the experiments

In each experiment, we will have some underlying valuations, items will be drawn one-at-a-time, uniformly at random, from the set of possible items, on which we run the PACE dynamics. We have several outcome measures of interest for asking how close we are to the static equilibrium quantities at each point. First, we look at convergence of realized utilities. In each case we consider the realized utilities up to time tt and look at the deviation from equilibrium utility normalized by the equilibrium utility level. We look at both the average and the worst-case deviations. Formally these are calculated as (u¯tu)/u1/n\|(\bar{u}^{t}-u^{*})/u^{*}\|_{1}/n for the average deviation and (u¯tu)/u\|(\bar{u}^{t}-u^{*})/u^{*}\|_{\infty} for the maximum (over buyers) deviation. We also measure deviations of the pacing multiplier βt\beta^{t} from β\beta^{*} and deviations of time-averaged cumulative expenditure b¯t\bar{b}^{t} from buyers’ budgets B=(B1,,Bn)B=(B_{1},\dots,B_{n}) using analogous normalizations. In the plots, we add horizontal lines for the same error measures for the proportional shares of the static underlying Fisher market (each buyer receiving BiB_{i} of each item), a ‘baseline’ solution.

We consider 3 different market datasets. The first two datasets are recommender systems which we turn into markets. The final is taken from a survey experiment. We point the reader to [31] for a more in-depth discussion and exploratory data analysis of these 33 datasets. The first dataset uses MovieLens [27]. MovieLens is a dataset of individual ratings of movies, [31] turn it into a market by using matrix completion to fill in missing user-movie ratings, they then take the top 1500 most active users and 1500 most rated movies and set the valuations vijv_{ij} as the predicted ratings from the matrix completion. We also use the Jester Jokes dataset [26]. Here, we have 72007200 individuals that have rated 100100 jokes. We treat the jokes as the item to be allocated.Finally, we use the Household Items dataset introduced in [31]. Here we have 28762876 survey takes entering a willingness to pay for 5050 household items (vacuum cleaners, toasters, gas grills, etc.). For each dataset, we first rescale (w.l.o.g.) buyer valuations as described in §5.

We also consider an experiment on a simple infinite-dimensional market instance (which we refer to as “Inf-Dim”) of n=100n=100 buyers and item space Θ=[0,1]\Theta=[0,1], similar to the examples in [24, §4.2]. Let each buyer valuation viv_{i} be normalized linear functions on [0,1][0,1], that is, vi(θ)=ci(θ)+div_{i}(\theta)=c_{i}(\theta)+d_{i} such that vi(Θ):=Θvi𝑑μ=01vi(θ)𝑑θ=1ci2+di=1v_{i}(\Theta):=\int_{\Theta}v_{i}d\mu=\int_{0}^{1}v_{i}(\theta)d\theta=1\ \Leftrightarrow\ \frac{c_{i}}{2}+d_{i}=1. We randomly generate (ci,di)(c_{i},d_{i}), i=1,,ni=1,\dots,n and run the dynamics for T=100nT=100n time steps.

For the finite dimensional datasets we compute equilibrium utilities uu^{*} and utility prices β\beta^{*} by solving the corresponding static instances using standard methods. For the infinite dimensional synthetic instance, we use the approach based on convex conic reformulation [24, §4] to compute β\beta^{*}.

Figure 1 in §6 contains the plots for the MovieLens, Household Items and Inf-Dim datasets. Figure 2 contains the plots for the Jokes dataset.

Since items arrive one at a time, t=100t=100 time steps in a market with n=10n=10 buyers is very different from the same number of time steps in a market with n=1000n=1000 buyers. To deal with this, we run PACE for T=100nT=100n time steps, referring to each nn time steps as an epoch.

We record the average and maximum values of relative errors of the pacing multipliers βt\beta^{t}, time-averaged cumulative utilities u¯t\bar{u}^{t} and time-averaged expenditures b¯t\bar{b}^{t}.

Convergence of expenditures to total budget.

For each ii, the quantity

|b¯itBiBi|=|τ=1tbittBitBi|\left|\frac{\bar{b}^{t}_{i}-B_{i}}{B_{i}}\right|=\left|\frac{\sum_{\tau=1}^{t}b^{t}_{i}-tB_{i}}{tB_{i}}\right|

can be viewed as the relative deviation of current cumulative expenditure at time tt from the total budget tBitB_{i} available up to tt. Hence, the residuals (b¯itB)/B/n\left\|(\bar{b}^{t}_{i}-B)/B\right\|/n and (b¯itB)/B\left\|(\bar{b}^{t}_{i}-B)/B\right\|_{\infty} are the average and maximum such deviations across all buyers. For each dataset (MovieLens, Household, Jokes and Inf-Dim), we plot the various quartiles of these residuals across all seeds, as shown in Figure 3.

Refer to caption
Refer to caption
Figure 2: Results for the same experiments as in Figure 1 (convergence of pacing multipliers, utilities and expenditures) on the Jokes dataset.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 3: The PACE cumulative expenditure τ=1tbit\sum_{\tau=1}^{t}b^{t}_{i} of each buyer are close to the total amount of budget tBitB_{i}, as the quartile plots show. Vertical lines indicate when tt is a multiple of 10n10n.