This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Asynchronous Proportional Response Dynamics: Convergence in Markets with Adversarial Scheduling

Yoav Kolumbus
Cornell University
[email protected]
&Menahem Levy
The Hebrew University of Jerusalem
[email protected]
&Noam Nisan
The Hebrew University of Jerusalem
[email protected]
Abstract

We study Proportional Response Dynamics (PRD) in linear Fisher markets, where participants act asynchronously. We model this scenario as a sequential process in which at each step, an adversary selects a subset of the players to update their bids, subject to liveness constraints. We show that if every bidder individually applies the PRD update rule whenever they are included in the group of bidders selected by the adversary, then, in the generic case, the entire dynamic converges to a competitive equilibrium of the market. Our proof technique reveals additional properties of linear Fisher markets, such as the uniqueness of the market equilibrium for generic parameters and the convergence of associated no swap regret dynamics and best response dynamics under certain conditions.

1 Introduction

A central notion in the study of markets is the equilibrium: a state of affairs where no single party wishes to unilaterally deviate from it. The main benefit of focusing on the notion of equilibria lies in what it ignores: how the market can reach an equilibrium (if at all). This latter question is obviously of much interest as well, especially when considering computational aspects,111As we know that finding an equilibrium may be computationally intractable in general. and a significant amount of research has been dedicated to the study of “market dynamics” and their possible convergence to an equilibrium. Almost all works that study market dynamics consider synchronous dynamics.

Synchronous Dynamics: At each time step tt, all participants simultaneously update their behavior based on the state at time t1t-1.

Such synchronization is clearly difficult to achieve in real markets, and so one might naturally wonder to what extent full synchrony is needed or whether convergence of market dynamics occurs even asynchronously. There are various possible levels of generality of asynchrony to consider. The simplest model considers a sequential scenario in which, at every time step tt, an adversary chooses a single participant, and only this participant updates their behavior based on the state at time t1t-1. The adversary is constrained to adhere to some liveness condition, such as scheduling every participant infinitely often or at least once every TT steps. In the most general model [59], the adversary may also introduce message delays, leading players to respond to dated information. In this paper, we focus on an intermediate level of permissible asynchrony where updates may occur in an arbitrary asynchronous manner, but message delays are always shorter than the granularity of activation. In our proofs, the adversary’s goal is to select the schedule of strategy updates by the players in a way that prevents the dynamic from converging to a market equilibrium.

Activation Asynchrony:222In [59] this was termed “simultaneous.” Every time step tt, an arbitrary subset of participants is chosen by the adversary and all of these participants update their behavior based on the state at time t1t-1. The adversary must adhere to the liveness condition where for every participant some set that includes them must be chosen at least once every TT consecutive steps.

The market dynamics that we study in this paper are linear Fisher markets [9] with proportional response dynamics (PRD), a model that has received much previous attention [5, 10, 19, 74] and for which synchronous convergence to equilibrium is known. While there are a few asynchronous convergence results known for other dynamics, specifically for tatonnement dynamics [18, 23], there are no such results known for proportional response dynamics, and achieving such results has been mentioned as an open problem in [19, 74].

Fisher Markets with Linear Utilities: There are nn players and mm goods. Each player ii has a budget BiB_{i} and each good jj has, w.l.o.g., a total quantity of one. Buyer ii’s utility from getting an allocation xi=(xi1,,xim)x_{i}=(x_{i1},...,x_{im}) is given by ui(xi)=jaijxiju_{i}(x_{i})=\sum_{j}a_{ij}x_{ij}, where the parameters aij0a_{ij}\geq 0 are part of the definition of the market. A market equilibrium is an allocation X=(xij)X=(x_{ij}) (where 0xij10\leq x_{ij}\leq 1) and a pricing p=(pj)p=(p_{j}) with the following properties. (1) Market clearing: for every good jj it holds that ixij=1\sum_{i}x_{ij}=1; (2) budget feasibility: for every player ii it holds that jxijpjBi\sum_{j}x_{ij}p_{j}\leq B_{i}; and (3) utility maximization: for every player ii and every alternative allocation y=(y1,,ym)y=(y_{1},...,y_{m}) with jyjpjBi\sum_{j}y_{j}p_{j}\leq B_{i} we have that ui(xi)ui(y)u_{i}(x_{i})\geq u_{i}(y).

Proportional Response Dynamics: At each time step tt, each player ii will make a bid bijt0b^{t}_{ij}\geq 0 for every good jj, where jbijt=Bi\sum_{j}b^{t}_{ij}=B_{i}. In the first step, the bid is arbitrary. Once bids for time tt are announced, we calculate pjt=ibijtp^{t}_{j}=\sum_{i}b^{t}_{ij} and allocate the goods proportionally: xijt=bijt/pjtx^{t}_{ij}=b^{t}_{ij}/p^{t}_{j}, providing each player ii with utility uit=jaijxijtu^{t}_{i}=\sum_{j}a_{ij}x^{t}_{ij}. At this point, player ii updates its bids for the next step by bidding on each good proportionally to the utility obtained from that good: bijt+1=Biaijxijt/uitb^{t+1}_{ij}=B_{i}\cdot a_{ij}x^{t}_{ij}/u^{t}_{i}.

From the perspective of the player, proportional response updates can be thought of as a simple parameter-free online learning heuristic, with some similarity to regret-matching [43] in its proportional updates, but one that considers the utilities directly, rather than the more sophisticated regret vector loss.

It is not difficult to see that a fixed point of this proportional response dynamic is indeed an equilibrium of the Fisher market. Significantly, it was shown in [5] that this dynamic does converge, in the synchronous model, to an equilibrium. As mentioned, the question of asynchronous convergence was left open. We provide the first analysis of proportional response dynamics in the asynchronous setting, and provide a positive answer to this open question in our “intermdiate” level of asynchrony.

Theorem 1.

For linear Fisher markets, proportional response dynamics with adversarial activation asynchrony, where each player is activated at least once every TT steps, approach the set of market equilibrium bid profiles. The prices in the dynamics converge to the unique equilibrium prices.

The dynamics approaching the set of equilibria means that the distance between the bids at time tt and the set of market equilibrium bid profiles converges to zero as tt\rightarrow\infty. Additionally, in Section 7, we show that for generic parameters (i.e., except for measure zero of possible (aij)(a_{ij})’s), the market equilibrium bid profile is unique, and thus Theorem 1 implies convergence of the bids to a point in the strategy space. We do not know whether the genericity condition is required for asynchronous convergence of the bids to a point, and we leave this as a minor open problem. We did not analyze the rate of convergence to equilibrium; we leave such analysis as a second open problem. Our main open problem, however, is the generalization to full asynchrony with message delays.

Open Problem: Does such convergence occur also in the full asynchronous model where the adversary may introduce arbitrary message delays?

Our techniques rely on considering an associated game obtained by using “modified” utility functions for each of the players: ui~(b)=jbijln(aij)+jpj(1ln(pj))\tilde{u_{i}}(b)=\sum_{j}{b_{ij}\ln(a_{ij})}+\sum_{j}{p_{j}(1-\ln(p_{j}))}. We show that a competitive market equilibrium (with the original utility functions) corresponds to a Nash equilibrium in the associated game.333It is worthwhile to emphasize, though, that a competitive market equilibrium is not a Nash equilibrium in the original market, since the players are price takers rather than fully rational. See Section 3. These modified utility functions are an adaptation to an individual utility of a function Φ(b)=ijbijln(aij)+jpj(1ln(pj))\Phi(b)=\sum_{ij}{b_{ij}\ln(a_{ij})}+\sum_{j}{p_{j}(1-\ln(p_{j}))} that was proposed in [65] as an objective for a convex program for equilibrium computation.444Notice that Φ\Phi is not the sum of the ui~\tilde{u_{i}}’s, as the second term appears only once. This function was first linked with proportional response dynamics in [5] where it was proven that synchronous proportional response dynamics act as mirror descent on this function. We show that Φ\Phi serves as a potential in our associated game.

Theorem 2.

The following three sets of bid profiles are identical: (1) the set of pure strategy Nash equilibria of the associated game, (2) the set of market equilibria of the Fisher market, and (3) the maximizing set of the potential function Φ\Phi.

The technical core of our proof is to show that not only does a synchronized proportional response step by all the players increase the potential function Φ\Phi but, in fact, every proportional response step by any subset of the players increases this potential function.

The point of view of market equilibria as Nash equilibria of the associated game offers several other advantages, e.g., suggesting several other dynamics that are related to proportional response that can be of interest. For example, we show that letting players best-respond in the associated game corresponds to the limit of a sequence of proportional response steps by a single player, but can be implemented as a single step of such a best-response, which can be computed efficiently by the players and may converge faster to the market equilibrium. Another possibility is using some (internal) regret-minimization dynamics (for the associated game), which would also converge to equilibrium in the generic case since, applying [56], it is the unique correlated Equilibrium as well.

The structure of the rest of the paper is as follows. In Section 2, we provide further formal details and notations that will be useful for our analysis. In Section 3, we present the associated game and its relation to the competitive equilibria of the market. In Section 4, we study best response dynamics in the associated game and their relation to PRD. In Section 5, we present a key lemma regarding the potential function of the associated game under bid updates by subsets of the players. Then, in Section 6 we complete our proof of convergence for asynchronous PRD. In Section 7, we show the uniqueness of the market equilibrium for generic markets. In Section 8, we provide simulation results that compare the convergence of proportional response dynamics with best response dynamics in the associated game in terms of the actual economic parameters in the market, namely, social welfare and the convergence of the bid profiles. Finally, in Section 9, we conclude and discuss the limitations of our technique and open questions. All proofs in this paper are deferred to the appendix.

1.1 Further Related Work

Proportional response dynamics (PRD) were originally studied in the context of bandwidth allocation in file-sharing systems, where it was proven to converge to equilibrium, albeit only for a restrictive setting [71]. Since then, PRD has been studied in a variety of other contexts, including Fisher markets, linear exchange economies, and production markets. See [10] for further references. In Fisher markets, synchronous PRD has been shown to converge to market equilibrium for Constant Elasticity of Substitution (CES) utilities in the substitutes regime [74]. For the linear Fisher market setting, synchronous PRD was explained in [5] as mirror descent on a convex program, previously discovered while developing an algorithm to compute the market equilibrium [65], and later proven to be equivalent to the famous Eisenberg-Gale program [22]. By advancing the approach of citebirnbaum2011distributed, synchronous PRD with mild modifications was proven to converge to a market equilibrium for CES utilities in the complements regime as well [19]. In linear exchange economies, synchronous PRD has been shown to converge to equilibrium in the space of utilities and allocations while prices do not converge and may cycle, whereas for a damped version of PRD, also the prices converge [11]. In production markets, synchronous PRD has been shown to increase both growth and inequalities in the market [13]. PRD has also been proven to converge with quasi-linear utilities [38], and to remain close to market equilibrium for markets with parameters that vary over time [20]. All the above works consider simultaneous updates by all the players, and the question of analyzing asynchronous dynamics and whether they converge was raised by several authors as an open problem [19, 74].

Asynchronous dynamics in markets have been the subject of study in several recent works. However, these works examine different models and dynamics than ours, and to our knowledge, our work presents the first analysis of asynchronous proportional response bidding dynamics. In [23], it is shown that tatonnement dynamics under the activation asynchrony model converge to equilibrium, with results for settings both with and without carryover effects between time units. A later work, [18], showed that tatonnement price dynamics converge to a market equilibrium under a model of sequential activation where in every step a single agent is activated, and where additionally, the information available to the activated seller about the current demand may be inaccurate within some interval that depends on the range of past demands. A different approach taken in [29] assumes that each seller has a set of rules affected by the other players’ actions and governing its price updates; it is shown that the dynamics in which sellers update the prices based on such rules converge to a unique equilibrium of prices in the activation asynchrony model.

Online learning with delayed feedback has been studied for a single agent in [61], which established regret bounds that are sub-linear in the total delay. A different approach, presented in [52], extends online gradient descent by estimating gradients when the reward of an action is not yet available due to delays. When considering multiple agents, [75] extended the results of [61]. They demonstrated that in continuous games with particular stability property, if agents experience equal delays in each round, online mirror descent converges to the set of Nash equilibria and a modification of the algorithm converges even when delays are not equal.

Classic results regarding the computation of competitive equilibria in markets mostly consider centralized computation and vary from combinatorial approaches using flow networks [26, 27, 28, 45], interior point [72], and ellipsoid [21, 44] methods, and many more [3, 25, 26, 30, 39, 40, 63]. Eisenberg and Gale devised a convex program which captures competitive equilibria of the Fisher model as its solution [31]. Notable also is the tatonnement model of price convergence in markets dated back to Walras [70] and studied extensively from Arrow [1] and in later works.

More broadly, in the game theoretic literature, our study is related to a long line of work on learning in games, starting from seminal works in the 1950s [6, 14, 42, 62], and continuing to be an active field of theoretical research [24, 35, 49, 60, 66], also covering a wide range of classic economic settings including competition in markets [7, 54], bilateral trade [16, 32], and auctions [2, 34, 48], as well as applications such as blockchain fee markets [50, 51, 57] and strategic queuing systems [36, 37]. For a broad introduction to the field of learning in games, see [17, 43]. The vast majority of this literature studies repeated games under the synchronous dynamics model. Notable examples of analyses of games with asynchronous dynamics are [46], which study best response dynamics with sequential activation, and [59], which explore best response dynamics in a full asynchrony setting which includes also information delays, and show that in a class of games called max-solvable, convergence of best response dynamics is guaranteed. Our analysis of best response dynamics in Section 4 takes a different route, and does not conclude whether the associated game that we study is max-solvable or not; such an analysis seems to require new ideas.

Our work is also related to a large literature on asynchronous distributed algorithms. We refer to a survey on this literature [41]. The liveness constraint that we consider in the dynamics555Intuitively, if one allows some of the parameters in the dynamic not to update, these parameters become irrelevant, as they will remain frozen, and thus one cannot hope to see any convergence of the entire system. is related to those, e.g., in [4, 67]. Recent works that are conceptually more closely related are [15, 69, 73], which propose asynchronous distributed algorithms for computing Nash equilibria in network games. Notably, [15] propose an algorithm that converges to an equilibrium in a large class of games in asynchronous settings with information delays. Their approach, however, does not capture proportional response dynamics and does not apply to our case of linear Fisher markets.

2 Model and Preliminaries

The Fisher market: We consider the classic Fisher model of a networked market in which there is a set of buyers \mathcal{B} and a set of divisible goods 𝒢\mathcal{G}. We denote the number of buyers and number of goods as n=||n=|\mathcal{B}|, m=|𝒢|m=|\mathcal{G}|, respectively, and index buyers with ii and goods with jj. Buyers are assigned budgets Bi+B_{i}\in\mathbb{R}^{+} and have some value666 For the ease of exposition, our proofs use w.l.o.g. aij>0a_{ij}>0. This is since in all cases where aij=0a_{ij}=0 might have any implication on the proof, such as ln(aij)\ln(a_{ij}), these expressions are multiplied by zero in our dynamics. aij0a_{ij}\geq 0 for each good jj. Buyers’ valuations are normalized such that jaij=1\sum_{j}a_{ij}=1. It is convenient to write the budgets as a vector B=(Bi)B=(B_{i}) and the valuations as a matrix An×m=(aij)A_{n\times m}=(a_{ij}), such that A,BA,B are the parameters defining the market. We denote the allocation of goods to buyers as a matrix X=(xij)X=(x_{ij}) where xij0x_{ij}\geq 0 is the (fractional) amount of good jj that buyer ii obtained. We assume w.l.o.g. (by proper normalization) that there is a unit quantity of each good. The price of good jj (which depends on the players’ actions in the market, as explained below) is denoted by pj0p_{j}\geq 0 and prices are listed as a vector p=(pj)p=(p_{j}). Buyers have a linear utility function ui(xi)=jaijxiju_{i}(x_{i})=\sum_{j}a_{ij}x_{ij} with the budget constraint jxijpjBi\sum_{j}x_{ij}p_{j}\leq B_{i}. We assume w.l.o.g. that the economy is normalized, i.e., iBi=jpj=1\sum_{i}B_{i}=\sum_{j}p_{j}=1.

Market equilibrium: The competitive equilibrium (or “market equilibrium”) is defined in terms of allocations and prices as follows.

Definition 1.

(Market Equilibrium): A pair of allocations and prices (X,p)(X^{*},p^{*}) is said to be market equilibrium if the following properties hold:

  1. 1.

    Market clearing:j,ixij=1\text{Market clearing:}\quad\quad\ \forall j,\ \sum_{i}x_{ij}^{*}=1,

  2. 2.

    Budget feasibility:i,jxijpjBi\text{Budget feasibility:}\quad\ \ \ \forall i,\ \sum_{j}x_{ij}^{*}p_{j}^{*}\leq B_{i},

  3. 3.

    Utility maximization:i,xiargmaxxiui(xi)\text{Utility maximization:}\ \ \ \forall i,\ x_{i}^{*}\in\arg\max_{x_{i}}u_{i}(x_{i}).

In words, under equilibrium prices all the goods are allocated, all budgets are used, and no player has an incentive to change their bids given that the prices remain fixed. Notice that this notion of equilibrium is different from a Nash equilibrium of the game where the buyers select their bids strategically, since in the former case, players do not consider the direct effect of possible deviation in their bids on the prices. We discuss this further in Section 3. For linear Fisher markets, it is well established that competitive equilibrium utilities uu^{*} and prices pp^{*} are unique, equilibrium allocations are known to form a convex set, and the following conditions are satisfied.

i,jaijpjuiBiandxij>0aijpj=uiBi.\forall i,j\quad\frac{a_{ij}}{p^{*}_{j}}\leq\frac{u_{i}^{*}}{B_{i}}\quad\quad\text{and}\quad x_{ij}>0\implies\frac{a_{ij}}{p^{*}_{j}}=\frac{u_{i}^{*}}{B_{i}}.

This is a detailed characterization of the equilibrium allocation: every buyer gets a bundle of goods in which all goods maximize the value per unit of money. The quantity aij/pja_{ij}/p^{*}_{j} is informally known as “bang-per-buck” (ch. 5 & 6 in [58]), the marginal profit from adding a small investment in good jj.

Market equilibrium bids are also known to maximize the Nash social welfare function (see [31]) NSW(X)=iui(xi)Bi\text{NSW}(X)=\prod_{i\in\mathcal{B}}u_{i}(x_{i})^{B_{i}} and to be Pareto efficient, i.e., no buyer can improve their utility without making anyone else worse off (as stated in the first welfare theorem).

The trading post mechanism and the market game (Shapley-Shubik): First described in [64] and studied under different names [33, 47], the trading post mechanism is an allocation and pricing mechanism which attempts to capture how a price is modified by demand. Buyers place bids on goods, where buyer ii places bid bijb_{ij} on good jj. Then, the mechanism computes the good’s price as the total amount spent on that good and allocates the good proportionally to the bids, i.e., for bids bb:

pj=i=1nbijxij={bijpjbij>00otherwise.p_{j}=\sum_{i=1}^{n}b_{ij}\quad\quad x_{ij}=\begin{cases}\frac{b_{ij}}{p_{j}}&\quad b_{ij}>0\\ \\ 0&\quad\text{otherwise.}\\ \end{cases}

Note that the trading post mechanism guarantees market clearing for every bid profile bb in which all goods have at least one buyer who is interested in buying. The feasible bid set of a buyer under the budget constraint is Si={bim|jbij0jbij=Bi}S_{i}=\{b_{i}\in\mathbb{R}^{m}|\forall j\ b_{ij}\geq 0\quad\sum_{j}b_{ij}=B_{i}\}, i.e., a scaled simplex. Denote S=iSiS=\prod_{i\in\mathcal{B}}S_{i} and Si=k{i}SkS_{-i}=\prod_{k\in\mathcal{B}\setminus\{i\}}S_{k}. Considering the buyers as strategic, one can define the market game as G={,(Si)i,(ui)i}G=\{\mathcal{B},(S_{i})_{i\in\mathcal{B}},(u_{i})_{i\in\mathcal{B}}\} where the utility functions can be written explicitly as ui(b)=ui(xi(b))=j=1maijbijpju_{i}(b)=u_{i}(x_{i}(b))=\sum_{j=1}^{m}\frac{a_{ij}b_{ij}}{p_{j}}. We sometimes use the notation ui(bi,bi)u_{i}(b_{i},b_{-i}), where bib_{i} is the bid vector of player ii and bib_{-i} denotes the bids of the other players.

Potential function and Nash equilibrium: For completeness, we add the following definitions. Potential function: A function Φ\Phi is an exact potential function[53] if i,biSi\forall i\in\mathcal{B},\forall b_{-i}\in S_{-i} and bi,biSi\forall b_{i},b_{i}^{\prime}\in S_{i} we have that Φ(bi,bi)Φ(bi,bi)=ui(bi,bi)ui(bi,bi)\Phi(b_{i}^{\prime},b_{-i})-\Phi(b_{i},b_{-i})=u_{i}(b_{i}^{\prime},b_{-i})-u_{i}(b_{i},b_{-i}), with uiu_{i} being ii’s utility function in the game. Best response: bib_{i}^{*} is a best response to bib_{-i} if biSiui(bi,bi)ui(bi,bi)\forall b_{i}\in S_{i}\quad u_{i}(b^{*}_{i},b_{-i})\geq u_{i}(b_{i},b_{-i}). That is, no other response of ii can yield a higher utility. Nash equilibrium: bb^{*} is Nash equilibrium if i\forall i bib^{*}_{i} is a best response to bib^{*}_{-i} (no player is incentivized to change their strategy).

Proportional response dynamics: As explained in the introduction, the proportional response dynamic is specified by an initial bid profile b0b^{0}, with bij0>0b_{ij}^{0}>0 whenever aij>0a_{ij}>0, and the following update rule for every player that is activated by the adversary: bijt+1=aijxijtui(xit)Bib^{t+1}_{ij}=\frac{a_{ij}x^{t}_{ij}}{u_{i}(x^{t}_{i})}B_{i}. See Section 5 for further details on activation of subsets of the players.

3 The Associated Game

As mentioned above, the Fisher market can be naturally thought of as a game in which every one of the nn players aims to optimize their individual utility ui(bi,bi)u_{i}(b_{i},b_{-i}) (see Section 2 for the formal definition). However, it is known that the set of Nash equilibria of this game does not coincide with the set of market equilibria [33, 64], and so a solution to this game (if indeed the players reach a Nash equilibrium) is economically inefficient [12].

A natural question that arises is whether there is some other objective for an individual player that when maximized by all the players, yields the market equilibrium. We answer positively to this question and show that there is a family of utility functions such that in the “associated games” with these utilities for the players, the set of Nash equilibria is identical to the set of market equilibria of the original game (for further details, see also Appendix A).

However, the fact that a Nash equilibrium of an associated game is a market equilibrium still does not guarantee that the players’ dynamics will indeed reach this equilibrium. A key element in our proof technique is that we identify, among this family of associated games, a single game, defined by the “associated utility” ui~(b)=jbijln(aij)+jpj(1ln(pj))\tilde{u_{i}}(b)=\sum_{j}{b_{ij}\ln(a_{ij})}+\sum_{j}{p_{j}(1-\ln(p_{j}))}, which admits an exact potential. We then use a relation which we show between this game and the proportional response update rule to prove the convergence of our dynamics (Theorem 1).

Definition 2.

(The Associated Game): Let GG be a market game. Define the associated utility of a player ii as ui~(b)=jbijln(aij)+jpj(1ln(pj))\tilde{u_{i}}(b)=\sum_{j}{b_{ij}\ln(a_{ij})}+\sum_{j}{p_{j}(1-\ln(p_{j}))}. The associated game G~\tilde{G} is the game with the associated utilities for the players and the same parameters as in GG.

Theorem 3.

For every Fisher market, the associated game G~\tilde{G} admits an exact potential function that is given by777Since we discuss the players’ associated utilities, we consider maximization of this potential. Of course, if the reader feels more comfortable with minimizing the potential, one can think of the negative function. Φ(b)=ijbijln(aij)+jpj(1ln(pj))\Phi(b)=\sum_{ij}{b_{ij}\ln(a_{ij})}+\sum_{j}{p_{j}(1-\ln(p_{j}))}.

G~\tilde{G} is constructed such that the function Φ\Phi is its potential. Note that although having similar structure, u~i\tilde{u}_{i} and Φ\Phi differ via summation on ii only in the first term (Φ\Phi is not the sum of the players’ utilities).

Once having the potential function defined, the proof is straightforward: the derivatives of the utilities ui~\tilde{u_{i}} and the potential Φ\Phi with respect to bib_{i} are equal for all ii. Theorem 2, formally restated below, connects between the associated game, the market equilibria and the potential.

Theorem.

(Restatement of Theorem 2). The following three sets of bid profiles are equal. (1) The set of pure-strategy Nash equilibria of the associated game: NE(G~)={b|bSui~(b)ui~(b)}NE(\tilde{G})=\{b^{*}|\ \forall b\in S\quad\tilde{u_{i}}(b^{*})\geq\tilde{u_{i}}(b)\}; (2) the set of market equilibrium bid profiles of the Fisher market: {b|(x(b),p(b)) satisfy Def. 1}\{b^{*}|\ (x(b^{*}),p(b^{*}))\text{ satisfy Def. \ref{def:ME}}\}; and (3) the maximizing set of the potential from Theorem 3: argmaxbSΦ(b)\arg\max_{b\in S}\Phi(b).

The proof uses a different associated game GG^{\prime} that has simpler structure than G~\tilde{G}, but does not have an exact potential, and shows that: (i) Nash equilibria of GG^{\prime} are the market equilibria; (ii) all the best responses of players ii to bid profiles bib_{-i} in GG^{\prime} are the same as those in G~\tilde{G}; and (iii) every equilibrium of G~\tilde{G} maximizes the potential Φ\Phi (immediate by the definition of potential).

4 Best Response Dynamics

In this section we explore another property of the associated game: we show that if instead of using the proportional response update rule, each player myopically plays their best response to the last bid profile with respect to their associated utility, then the entire asynchronous sequence of bids converges to a market equilibrium, as stated in the following theorem. We then show that there is a close relation between best response and proportional response dynamics.

Theorem 4.

For generic linear Fisher markets in a sequential asynchrony model where in every step a single player is activated, best response dynamics converge to the Market Equilibrium. For non-generic markets the prices are guaranteed to converge to the equilibrium prices.

The idea of the proof is to show that the best-response functions are single valued (i,bi:u~i(,bi)\forall i,b_{-i}:\tilde{u}_{i}(\cdot,b_{-i}) has a unique maximizer) and continuous (using the structure of best-response bids). Together with the existence of the potential function Φ\Phi it holds that the analysis of [46] applies for these dynamics with the associated utilities888Note that while the best-reply function with respect to the standard utility function is formally undefined at zero, the associated utility and its best-reply function are well defined and continuous at zero. and thus convergence is guaranteed.

One of the appealing points about proportional response dynamics is their simplicity — in each update, a player observes the obtained utilities and can easily compute the next set of bids. We show that also the best response of a player can be computed efficiently by reducing the calculation to a search over a small part of the subsets of all goods which can be solved by a simple iterative process.

Proposition 1.

For every player ii and any fixed bid profile bib_{-i} for the other players, the best response of ii is unique and can be computed in polynomial time.

Roughly, best responses are characterized uniquely by a one-dimensional variable cc^{*}. For every subset of goods ss we define a variable csc_{s} and prove that cc^{*} is the maximum amongst all csc_{s}. So finding cc^{*} is equivalent to searching a specific subset with maximal csc_{s}. The optimal subset of goods admits a certain property that allows to narrow down the search domain from all subsets to only mm subsets.

The relation between the best response and proportional response updates can intuitively be thought of as follows. While in PRD players split their budget between all the goods according the utility that each good yields, and so gradually shift more budget to the more profitable subset of goods, best response bids of player ii with respect to u~i\tilde{u}_{i} can be understood as spending the entire budget on a subset of goods which, after bidding so (considering the effect of bids on prices), will jointly have the maximum bang-per-buck (in our notation aij/pja_{ij}/p_{j}) amongst all subsets of goods, given the bids bitb_{-i}^{t} of the other players. Those bids can be regarded as “water-filling” bids as they level the bang-per-buck amongst all goods purchased by player ii (for a further discussion see the appendix).

It turns out that there is a clear formal connection between the best response of a player in the associated game and the proportional response update rule in the true game: the best response bids are the limit point of an infinite sequence of proportional response updates by the same player when the bids of the others are held fixed, as expressed in the following proposition.

Proposition 2.

Fix any player ii and fix any bid profile bib_{-i} for the other players. Let bi=argmaxbiSiu~i(bi,bi)b_{i}^{*}=argmax_{b_{i}\in S_{i}}\tilde{u}_{i}(b_{i},b_{-i}) and let (bit)t=1(b_{i}^{t})_{t=1}^{\infty} be a sequence of consecutive proportional response steps applied by player ii, where bib_{-i} is held fixed at all times tt. Then limtbit=bi\lim_{t\to\infty}b_{i}^{t}=b_{i}^{*}.

5 Simultaneous Play by Subsets of Agents

In this section, we shift our focus back to proportional response dynamics under the activation asynchrony model in which the adversary can choose in every step any subset of players to update their bids. Towards proving that proportional response dynamics converges to a market equilibrium in this setting, we utilize the associated game and potential function presented in Section 3 to show that any activated subset of players performing a PRD step will increase the potential. Formally, let vv\subseteq\mathcal{B} be a subset of players activated by the adversary and let fv(b)f_{v}(b) be a function that applies proportional response to members of vv and acts as the identity function for all the other players. The update for time t+1t+1 when the adversary activates a subset of the players vtv^{t}\subseteq\mathcal{B} is therefore:

bijt+1=(fvt(bt))ij={aijxijtuitBiifivtbijtotherwise.b_{ij}^{t+1}=(f_{v^{t}}(b^{t}))_{ij}=\begin{cases}\frac{a_{ij}x_{ij}^{t}}{u_{i}^{t}}B_{i}&\quad\text{if}\ i\in v^{t}\\ b_{ij}^{t}&\quad\text{otherwise}.\\ \end{cases}
Lemma 1.

For all vv\subseteq\mathcal{B} and for all bSb\in S it holds that Φ(fv(b))>Φ(b)\Phi(f_{v}(b))>\Phi(b), unless fv(b)=bf_{v}(b)=b.

The proof shows that for any subset of players vtv^{t}, a PRD step bt+1b^{t+1} is the solution to some maximization problem of a function gt(b)g^{t}(b) different from Φ\Phi, such that Φ(bt+1)>gt(bt+1)gt(bt)=Φ(bt)\Phi(b^{t+1})>g^{t}(b^{t+1})\geq g^{t}(b^{t})=\Phi(b^{t}).

Notable to mention is the sequential case where all subsets are singletons, i.e., for all tt, vt={it}v^{t}=\{i^{t}\} for some iti^{t}\in\mathcal{B}. In that case, the above result yields that the best-response bids can be expressed as the solution to an optimization problem over the bids bb on a function that is monotone in the KL divergence between the prices induced by bb and the current prices, whereas PRD is the solution to an optimization problem on a similar function, but one that depends on the KL divergence between the bids bb and the current bids. Thus, sequential PRD can be regarded as a relaxation of best response; on the one hand, it is somewhat simpler to compute a step, and on the other hand, it takes more steps to reach the best response (see Proposition 2 and the simulations in Section 8).

6 Convergence of Asynchronous Proportional Response Dynamics

With the results from the previous sections under our belts (namely, the associated game, Theorems 2, 3 about its potential and equilibria, and Lemma 1 about updates by several players simultaneously), we are now ready to complete the proof of Theorem 1 on the convergence of asynchronous proportional response dynamics. We explain here the idea of the proof. The full proof is in the appendix.

Proof idea of Theorem 1: Our starting point is that we now know that Proportional Response Dynamics (PRD) steps by subsets of players increase the potential. Therefore, the bids should somehow converge to reach the maximum potential, which is obtained only at the set of market equilibria. Technically, since the sequence of bids btb^{t} is bounded, it must have condensation points. The proof then proceeds by way of contradiction. If the sequence does not converge to the set of equilibrium bid profiles, ME={bb is a market equilibrium bid profile}ME=\{b^{*}\mid b^{*}\text{ is a market equilibrium bid profile}\}, then there is some subsequence that converges to a bid profile bb^{**} outside of this set, which by Theorem 2, must achieve a lower potential than any bMEb^{*}\in ME (since it is not a market equilibrium, and recall that only market equilibria maximize the potential function).

From this point, the main idea is to show that if the dynamic preserves a “livness” property where the maximum time interval between consecutive updates of a player is bounded by some constant TT, then the dynamic must reach a point where the bids are sufficiently close to bb^{**} such that there must be some future update by some subset of the players under which the potential increases to more than Φ(b)\Phi(b^{**}), contradicting the existence of condensation points other than market equilibria (note that the sequence of potential values Φ(bt)\Phi(b^{t}) is increasing in tt). To show this, the proof requires several additional arguments on the continuity of compositions of PRD update functions that arise under adversarial scheduling, and the impact of such compositions on the potential function. The full proof is in the appendix.

7 Generic Markets

Here we show that in the generic case, linear Fisher markets have a unique equilibrium bid profile. While it is well known that in linear Fisher markets equilibrium prices and utilities are unique, and the equilibrium bids and allocations form convex sets (see section 2), we show that multiplicity of equilibrium bid profiles can result only from a special degeneracy in the market parameters that has measure zero in the parameter space. In other words, if the market parameters are not carefully tailored to satisfy a particular equality (formally described below), or, equivalently, if the parameters are slightly perturbed, the market will have a unique equilibrium. Similar property was known for linear exchange markets [8] and we present a simple and concise proof for the Fisher model.

Definition 3.

A Fisher market is called generic if the non-zero valuations of the buyers (aij)(a_{ij}) do not admit any multiplicative equality. That is, for any distinct and non empty K,K×𝒢K,K^{\prime}\subseteq\mathcal{B}\times\mathcal{G} it holds that (i,j)Kaij(i,j)Kaij\prod_{(i,j)\in K}a_{ij}\neq\prod_{(i^{\prime},j^{\prime})\in K^{\prime}}a_{i^{\prime}j^{\prime}}.

Theorem 5.

Every generic linear fisher market has a unique market equilibrium bid profile bb^{*}.

Before discussing the proof of Theorem 5, we have the following corollaries.

Corollary 1.

For generic linear Fisher markets, proportional response dynamics with adversarial activation asynchrony, where each player is activated at least once every TT steps, converge to the unique market equilibrium.

The main theorem, Theorem 1, suggests that proportional response dynamics with adversarial activation asynchrony converges to the set of market equilibria. Since the previous theorem guarantees that a generic market has a unique market equilibrium bid profile, it is clear the dynamics converges to that bid profile.

Corollary 2.

In generic linear Fisher markets, no-swap regret dynamics in the associated game converge to the market equilibrium.

This follows from [56], who showed that in games with convex strategy sets and a continuously differentiable potential function Φ\Phi (as in our case), the set of correlated equilibria consists of mixtures of elements in argmaxbΦ\arg\max_{b}\Phi. Theorem 2 yields that argmaxbΦ=b\arg\max_{b}\Phi=b^{*}, and so there is a unique correlated equilibrium, which is the market equilibrium, and we have that no-swap regret guarantees convergence to it.

To prove Theorem 5, we use a representation of the bids in the market as a bipartite graph of players and goods Γ(b)={V,E}\Gamma(b)=\{V,E\} with V=𝒢V=\mathcal{B}\cup\mathcal{G} and E={(i,j)|bij>0}E=\{(i,j)|\ b_{ij}>0\}. The proof shows that if a market has more than one equilibrium bid profile, then there has to be an equilibrium bb with Γ(b)\Gamma(b) containing a cycle, whereas the following lemma forbids this for generic markets.

Lemma 2.

If bb^{*} are equilibrium bids in a generic linear Fisher market, then Γ(b)\Gamma(b^{*}) has no cycles.

A key observation for proving this lemma is that at a market equilibrium, for a particular buyer ii, the quantity aij/pja_{ij}/p^{*}_{j} is constant amongst goods purchased, and so it is possible to trace a cycle and have all the pjp^{*}_{j} cancel out and obtain an equation contradicting the genericity condition.

An observation that arises from Lemma 2 is that when the number of buyers in the market is of the same order of magnitude as the number of goods or larger, then, in equilibrium, most buyers will only buy a small number of goods. Since there are no cycles in Γ(b)\Gamma(b^{*}) and there are n+mn+m vertices, there are at most n+m1n+m-1 edges. Thus, with nn buyers, the average degree of a buyer is 1+m1n1+\frac{m-1}{n}.

8 Simulations

Next, we look at simulations of the dynamics that we study and compare the convergence of proportional response dynamics to best response dynamics in the associated game, as discussed in Section 4. The metrics we focus on here for every dynamic are the Nash social welfare [55], which, as mentioned in Section 2, is maximized at the market equilibrium, and the Euclidean distance between the bids at time tt and the equilibrium bids. Additionally, we look at the progression over time of the value of the potential Φ(bt)\Phi(b^{t}) (for the definition, see Section 3).

Figure 1 presents simulations of an ensemble of markets, each with ten buyers and ten goods. The parameters in each market (defined in the matrices AA and BB) are uniformly sampled, ensuring that the genericity condition (defined in Section 2) holds with probability one. These parameters are also normalized, as explained in Section 2. For each market, the parameters remain fixed throughout the dynamics. The initial condition in all simulation runs is the uniform distribution of bids over items, and the schedule is sequential, such that a single player updates their bids in each time step.

Figure 1(a) (main plot) show our metrics for PRD averaged over a sample of 300300 such simulations. The insets show the plots of a random sample of 5050 individual simulations (without averaging) over a longer time period. Figure 1(b) show similar plots for best response dynamics.

As could be expected based on our analysis in Section 4, best response dynamics converge faster than PRD, as can be seen in the different time scales on the horizontal axes. A closer look at the individual bid dynamics depicted in the insets reveals a qualitative difference between the two types of dynamics: in PRD, the bids in each dynamic smoothly approach the equilibrium profile, whereas best response bid dynamics are more irregular. Additionally, the collection of curves for the individual simulations shows that under uniformly distributed market parameters, both dynamics exhibit variance in convergence times, with a skewed distribution. In most markets, the dynamics converge quickly, but there is a distribution tail of slower-converging dynamics.

Refer to caption
(a) Proportional Response
Refer to caption
(b) Best Response
Refer to caption
Figure 1: Proportional response and best response dynamics. The main figures show the progression of the average metrics over time and the insets show a collection of individual dynamics over a longer time period.

9 Conclusion

We have shown that proportional response bid dynamics converge to a market equilibrium in a setting where the schedule of bid updates can be chosen adversarially, allowing for sequential or simultaneous updates for any subset of players. We proposed a novel approach to address this problem by identifying a family of associated games related to proportional response dynamics, showing their relation to the competitive equilibria of the market, and leveraging these relations to prove convergence of the dynamics. En route, we showed that other types of dynamics, such as myopic best response and no swap regret, also converge in the associated game. Additionally, we note that our result on the uniqueness of market equilibria in the generic case (e.g., if the market parameters have some element of randomness) may also be of interest for future research in the Fisher market setting.

One main open question that we did not analyze is whether proportional response dynamics converge under the full asynchrony model, which includes information delays. The analysis of this model raises several complications, as it creates further coupling between past and current bid profiles. We conjecture that if information delays are bounded, then convergence also occurs in this model. However, it is not clear whether our approach could be extended to argue that proportional response updates by subsets of players with respect to delayed information increase the potential in our associated game, or whether proving convergence in this setting will require new methods. One limitation of our analysis is that we provide a guarantee that under any bid update by any subset of players chosen by an adversary, the potential function of the associated game increases, but our technique does not specify by how much the potential increases in every step, and therefore, we do not provide speed of convergence results. Such analysis seems to require new techniques, and we see this as an interesting open problem for future work.

Acknowledgments

This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (grant agreement no. 740282) and by a grant from the Israeli Science Foundation (ISF number 505/23). During part of this project, Yoav Kolumbus has been affiliated with the Hebrew University of Jerusalem.

References

  • [1] Kenneth J Arrow, Henry D Block, and Leonid Hurwicz. On the stability of the competitive equilibrium, ii. Econometrica: Journal of the Econometric Society, pages 82–109, 1959.
  • [2] Santiago R Balseiro and Yonatan Gur. Learning in repeated auctions with budgets: Regret minimization and equilibrium. Management Science, 65(9):3952–3968, 2019.
  • [3] Xiaohui Bei, Jugal Garg, and Martin Hoefer. Ascending-price algorithms for unknown markets. ACM Transactions on Algorithms (TALG), 15(3):1–33, 2019.
  • [4] Dimitri P Bertsekas. Distributed asynchronous computation of fixed points. Mathematical Programming, 27(1):107–120, 1983.
  • [5] Benjamin Birnbaum, Nikhil R Devanur, and Lin Xiao. Distributed algorithms via gradient descent for fisher markets. In Proceedings of the 12th ACM conference on Electronic commerce, pages 127–136, 2011.
  • [6] David Blackwell et al. Controlled random walks. In Proceedings of the international congress of mathematicians, volume 3, pages 336–338, 1954.
  • [7] Avrim Blum, MohammadTaghi Hajiaghayi, Katrina Ligett, and Aaron Roth. Regret minimization and the price of total anarchy. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 373–382, 2008.
  • [8] Jean-Marc Bonnisseau, Michael Florig, and Alejandro Jofré. Continuity and uniqueness of equilibria for linear exchange economies. Journal of Optimization Theory and Applications, 109:237–263, 2001.
  • [9] William C Brainard and Herbert E Scarf. How to compute equilibrium prices in 1891. American Journal of Economics and Sociology, 64(1):57–83, 2005.
  • [10] Simina Brânzei. Exchange markets: proportional response dynamics and beyond. ACM SIGecom Exchanges, 19(2):37–45, 2021.
  • [11] Simina Brânzei, Nikhil Devanur, and Yuval Rabani. Proportional dynamics in exchange economies. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 180–201, 2021.
  • [12] Simina Brânzei, Vasilis Gkatzelis, and Ruta Mehta. Nash social welfare approximation for strategic agents. Operations Research, 70(1):402–415, 2022.
  • [13] Simina Branzei, Ruta Mehta, and Noam Nisan. Universal growth in production economies. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
  • [14] George W Brown. Iterative solution of games by fictitious play. Activity analysis of production and allocation, 13(1):374–376, 1951.
  • [15] Carlo Cenedese, Giuseppe Belgioioso, Sergio Grammatico, and Ming Cao. An asynchronous distributed and scalable generalized nash equilibrium seeking algorithm for strongly monotone games. European Journal of Control, 58:143–151, 2021.
  • [16] Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, and Stefano Leonardi. Bilateral trade: A regret minimization perspective. Mathematics of Operations Research, 2023.
  • [17] Nicolo Cesa-Bianchi and Gábor Lugosi. Prediction, learning, and games. Cambridge university press, 2006.
  • [18] Yun Kuen Cheung and Richard Cole. Amortized Analysis of Asynchronous Price Dynamics. In 26th Annual European Symposium on Algorithms (ESA 2018), volume 112 of Leibniz International Proceedings in Informatics (LIPIcs), pages 18:1–18:15, 2018.
  • [19] Yun Kuen Cheung, Richard Cole, and Yixin Tao. Dynamics of distributed updating in fisher markets. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 351–368, 2018.
  • [20] Yun Kuen Cheung, Martin Hoefer, and Paresh Nakhe. Tracing equilibrium in dynamic markets via distributed adaptation. arXiv preprint arXiv:1804.08017, 2018.
  • [21] Bruno Codenotti, Sriram V Pemmaraju, and Kasturi R Varadarajan. On the polynomial time computation of equilibria for certain exchange economies. In SODA, volume 5, pages 72–81, 2005.
  • [22] Richard Cole, Nikhil Devanur, Vasilis Gkatzelis, Kamal Jain, Tung Mai, Vijay V Vazirani, and Sadra Yazdanbod. Convex program duality, fisher markets, and nash social welfare. In Proceedings of the 2017 ACM Conference on Economics and Computation, pages 459–460, 2017.
  • [23] Richard Cole and Lisa Fleischer. Fast-converging tatonnement algorithms for one-time and ongoing market problems. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 315–324, 2008.
  • [24] Constantinos Daskalakis, Maxwell Fishelson, and Noah Golowich. Near-optimal no-regret learning in general games. Advances in Neural Information Processing Systems, 34, 2021.
  • [25] Xiaotie Deng, Christos Papadimitriou, and Shmuel Safra. On the complexity of price equilibria. Journal of Computer and System Sciences, 67(2):311–324, 2003.
  • [26] Nikhil R Devanur, Christos H Papadimitriou, Amin Saberi, and Vijay V Vazirani. Market equilibrium via a primal–dual algorithm for a convex program. Journal of the ACM (JACM), 55(5):1–18, 2008.
  • [27] Nikhil R Devanur and Vijay V Vazirani. An improved approximation scheme for computing arrow-debreu prices for the linear case. In FST TCS 2003: Foundations of Software Technology and Theoretical Computer Science: 23rd Conference, Mumbai, India, December 15-17, 2003. Proceedings 23, pages 149–155. Springer, 2003.
  • [28] Ran Duan and Kurt Mehlhorn. A combinatorial polynomial algorithm for the linear arrow–debreu market. Information and Computation, 243:112–132, 2015.
  • [29] Krishnamurthy Dvijotham, Yuval Rabani, and Leonard J Schulman. Convergence of incentive-driven dynamics in fisher markets. Games and Economic Behavior, 134:361–375, 2022.
  • [30] Federico Echenique and Adam Wierman. Finding a walrasian equilibrium is easy for a fixed number of agents. 2011.
  • [31] Edmund Eisenberg and David Gale. Consensus of subjective probabilities: The pari-mutuel method. The Annals of Mathematical Statistics, 30(1):165–168, 1959.
  • [32] Kfir Eliaz and Alexander Frug. Bilateral trade with strategic gradual learning. Games and Economic Behavior, 107:380–395, 2018.
  • [33] Michal Feldman, Kevin Lai, and Li Zhang. The proportional-share allocation market for computational resources. IEEE Transactions on Parallel and Distributed Systems, 20(8):1075–1088, 2008.
  • [34] Zhe Feng, Chara Podimata, and Vasilis Syrgkanis. Learning to bid without knowing your value. In Proceedings of the 2018 ACM Conference on Economics and Computation, EC ’18, page 505–522, New York, NY, USA, 2018. Association for Computing Machinery.
  • [35] Dylan J Foster, Noah Golowich, and Sham M Kakade. Hardness of independent learning and sparse equilibrium computation in markov games. arXiv preprint arXiv:2303.12287, 2023.
  • [36] Jason Gaitonde and Éva Tardos. Stability and learning in strategic queuing systems. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 319–347, 2020.
  • [37] Jason Gaitonde and Éva Tardos. Virtues of patience in strategic queuing systems. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 520–540, 2021.
  • [38] Yuan Gao and Christian Kroer. First-order methods for large-scale market equilibrium computation. Advances in Neural Information Processing Systems, 33:21738–21750, 2020.
  • [39] Jugal Garg, Ruta Mehta, Milind Sohoni, and Vijay V Vazirani. A complementary pivot (practical) algorithm for market equilibrium under separable, piecewise-linear concave utilities.
  • [40] Rahul Garg and Sanjiv Kapoor. Auction algorithms for market equilibrium. In Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 511–518, 2004.
  • [41] Felix C Gärtner. Fundamentals of fault-tolerant distributed computing in asynchronous environments. ACM Computing Surveys (CSUR), 31(1):1–26, 1999.
  • [42] James Hannan. Approximation to bayes risk in repeated play. In Contributions to the Theory of Games (AM-39), Volume III, pages 97–139. Princeton University Press, 1957.
  • [43] Sergiu Hart and Andreu Mas-Colell. Simple adaptive strategies: from regret-matching to uncoupled dynamics, volume 4. World Scientific, 2013.
  • [44] Kamal Jain. A polynomial time algorithm for computing an arrow–debreu market equilibrium for linear utilities. SIAM Journal on Computing, 37(1):303–318, 2007.
  • [45] Kamal Jain, Mohammad Mahdian, and Amin Saberi. Approximating market equilibria. In 6th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, 2003 and 7th International Workshop on Randomization and Approximation Techniques in Computer Science, 2003. Springer, 2003.
  • [46] Martin Kaae Jensen. Stability of pure strategy nash equilibrium in best-reply potential games. University of Birmingham, Tech. Rep, 2009.
  • [47] Frank Kelly. Charging and rate control for elastic traffic. European transactions on Telecommunications, 8(1):33–37, 1997.
  • [48] Yoav Kolumbus and Noam Nisan. Auctions between regret-minimizing agents. In Proceedings of the ACM Web Conference 2022, pages 100–111, 2022.
  • [49] Yoav Kolumbus and Noam Nisan. How and why to manipulate your own agent: On the incentives of users of learning agents. Advances in Neural Information Processing Systems, 35:28080–28094, 2022.
  • [50] Stefanos Leonardos, Barnabé Monnot, Daniël Reijsbergen, Efstratios Skoulakis, and Georgios Piliouras. Dynamical analysis of the eip-1559 ethereum fee market. In Proceedings of the 3rd ACM Conference on Advances in Financial Technologies, pages 114–126, 2021.
  • [51] Stefanos Leonardos, Daniël Reijsbergen, Daniël Reijsbergen, Barnabé Monnot, and Georgios Piliouras. Optimality despite chaos in fee markets. arXiv preprint arXiv:2212.07175, 2022.
  • [52] Panayotis Mertikopoulos, Zhengyuan Zhou, et al. Gradient-free online learning in games with delayed rewards. In International Conference on Machine Learning. International Machine Learning Society (IMLS), 2020.
  • [53] Dov Monderer and Lloyd S Shapley. Potential games. Games and economic behavior, 14(1):124–143, 1996.
  • [54] Uri Nadav and Georgios Piliouras. No regret learning in oligopolies: Cournot vs. bertrand. In Algorithmic Game Theory: Third International Symposium, SAGT 2010, Athens, Greece, October 18-20, 2010. Proceedings 3, pages 300–311. Springer, 2010.
  • [55] John F Nash Jr. The bargaining problem. Econometrica: Journal of the econometric society, pages 155–162, 1950.
  • [56] Abraham Neyman. Correlated equilibrium and potential games. International Journal of Game Theory, 26(2):223–227, 1997.
  • [57] Noam Nisan. Serial monopoly on blockchains draft–comments welcome. 2023.
  • [58] Noam Nisan, Tim Roughgarden, Eva Tardos, and Vijay V Vazirani. Algorithmic game theory. Cambridge university press, 2007.
  • [59] Noam Nisan, Michael Schapira, and Aviv Zohar. Asynchronous best-reply dynamics. In Internet and Network Economics: 4th International Workshop, WINE 2008, Shanghai, China, December 17-20, 2008. Proceedings 4, pages 531–538. Springer, 2008.
  • [60] Christos Papadimitriou and Georgios Piliouras. From nash equilibria to chain recurrent sets: An algorithmic solution concept for game theory. Entropy, 20(10):782, 2018.
  • [61] Kent Quanrud and Daniel Khashabi. Online learning with adversarial delays. Advances in neural information processing systems, 28, 2015.
  • [62] Julia Robinson. An iterative method of solving a game. Annals of mathematics, pages 296–301, 1951.
  • [63] Herbert E Scarf. The computation of equilibrium prices: an exposition. Handbook of mathematical economics, 2:1007–1061, 1982.
  • [64] Lloyd Shapley and Martin Shubik. Trade using one commodity as a means of payment. Journal of political economy, 85(5):937–968, 1977.
  • [65] Vadim I Shmyrev. An algorithm for finding equilibrium in the linear exchange model with fixed budgets. Journal of Applied and Industrial Mathematics, 3(4):505, 2009.
  • [66] Vasilis Syrgkanis, Alekh Agarwal, Haipeng Luo, and Robert E Schapire. Fast convergence of regularized learning in games. Advances in Neural Information Processing Systems, 28:2989–2997, 2015.
  • [67] John Tsitsiklis, Dimitri Bertsekas, and Michael Athans. Distributed asynchronous deterministic and stochastic gradient optimization algorithms. IEEE transactions on automatic control, 31(9):803–812, 1986.
  • [68] Mark Voorneveld. Best-response potential games. Economics letters, 66(3):289–295, 2000.
  • [69] Mohamed Wahbi and Kenneth N Brown. A distributed asynchronous solver for nash equilibria in hypergraphical games. In Frontiers in Artificial Intelligence and Applications, pages 1291–1299, 2016.
  • [70] Léon Walras. Éléments d’économie politique pure: ou, Théorie de la richesse sociale. F. Rouge, 1900.
  • [71] Fang Wu and Li Zhang. Proportional response dynamics leads to market equilibrium. In Proceedings of the thirty-ninth annual ACM symposium on Theory of computing, pages 354–363, 2007.
  • [72] Yinyu Ye. A path to the arrow–debreu competitive market equilibrium. Mathematical Programming, 111(1-2):315–348, 2008.
  • [73] Peng Yi and Lacra Pavel. Asynchronous distributed algorithms for seeking generalized nash equilibria under full and partial-decision information. IEEE transactions on cybernetics, 50(6):2514–2526, 2019.
  • [74] Li Zhang. Proportional response dynamics in the fisher market. Theoretical Computer Science, 412(24):2691–2698, 2011.
  • [75] Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W Glynn, and Claire Tomlin. Countering feedback delays in multi-agent learning. Advances in Neural Information Processing Systems, 30, 2017.

Appendices

In the following sections we provide the proofs for the results presented in the main text as well as further technical details and discussion.

In the following, we use bif\nabla_{b_{i}}f to denote the gradient of a function ff with respect to the bids bib_{i} of player ii only. We use bijf\partial_{b_{ij}}f to denote the partial derivative with respect to the bid of player ii on good jj, and bij2f\partial_{b_{ij}}^{2}f to denote the second derivative. We denote by θi=kibi\theta_{i}=\sum_{k\neq i}b_{i} as the ‘pre-prices,’ which represent the prices excluding the bids bib_{i} (and so, for every player ii and every bid profile, p=θi+bip=\theta_{i}+b_{i}). In some of the proofs, we use the abbreviated notation (f)+=max(f,0)(f)^{+}=\max(f,0). All other notations are as defined in the main text.

Appendix A The Associated Game

Proof.

(Theorem 3): A sufficient condition for Φ\Phi being an exact potential[53] is

ibiΦ(bi,bi)=biu~i(bi,bi).\forall i\quad\nabla_{b_{i}}\Phi(b_{i},b_{-i})=\nabla_{b_{i}}\tilde{u}_{i}(b_{i},b_{i}).

And indeed, in our case we have:

bijΦ(bi,bi)=ln(aij)ln(pj)=ln(aijpj),\ \partial b_{ij}\Phi(b_{i},b_{-i})=\ln(a_{ij})-\ln(p_{j})=\ln(\frac{a_{ij}}{p_{j}}),
biju~i(bi,bi)=ln(aij)ln(pj)=ln(aijpj).\ \partial b_{ij}\tilde{u}_{i}(b_{i},b_{i})=\ln(a_{ij})-\ln(p_{j})=\ln(\frac{a_{ij}}{p_{j}}).

In order to prove Theorem 2, we first define a different associated game denoted GG^{\prime} that differs from G~\tilde{G} only in having a different associated utility function ui=jaijln(pj)u_{i}^{\prime}=\sum_{j}a_{ij}\ln(p_{j}).

In fact, G~\tilde{G} and GG^{\prime} a part of a family of associated games of the market game GG, which have the property that they all share the same best responses to bid profiles (and therefore, also the same Nash equilibria) and for all these games, the function Φ\Phi is a best-response potential (see [68] for the definition of best-response potential games). Among this family of games, we are particularly interested in the games G~\tilde{G} and GG since the former admits Φ\Phi as an exact potential, and the latter has a particularly simple derivative for its utility uiu_{i}^{\prime}, which has a clear economic interpretation: bijui(b)=aij/pj\partial_{b_{ij}}u_{i}^{\prime}(b)=a_{ij}/p_{j}. This is simply the bang-per-buck of player ii from good jj (see the model section in the main text).

Next, we present several technical lemmas that will assist us in proving Theorem 2 and which will also be useful in our proofs later on.

Lemma 3.

For any player ii and fixed bib_{-i}, both u~i(bi,bi)\tilde{u}_{i}(b_{i},b_{-i}) and ui(bi,bi)u_{i}^{\prime}(b_{i},b_{-i}) are strictly concave in bib_{i}.

Proof.

We will show the proof for u~i\tilde{u}_{i}. We compute the Hessian and show that it is negative definite. The diagonal elements are

bij2u~i(bi,bi)=1pj,\partial^{2}_{b_{ij}}\tilde{u}_{i}(b_{i},b_{i})=-\frac{1}{p_{j}},

and all of the off-diagonal elements are

bikbiju~i(bi,bi)=0.\partial_{b_{ik}}\partial_{b_{ij}}\tilde{u}_{i}(b_{i},b_{i})=0.

Therefore, the Hessian is a diagonal matrix with all of its elements being negative, and thus, u~i\tilde{u}_{i} is strictly concave. The same argument works for uiu_{i}^{\prime} as well. ∎

Lemma 4.

Fix a player ii and any bid profile biSib_{-i}\in S_{-i} of the other players, then the following two facts hold.

  1. 1.

    bi=argmaxbiSiui(bi,bi)b_{i}^{{}^{\prime}*}=\arg\max_{b_{i}\in S_{i}}u_{i}^{\prime}(b_{i},b_{-i}) if and only if it holds that biSijaijpjbijjaijpjbij\forall b_{i}^{\prime}\in S_{i}\ \sum_{j}\frac{a_{ij}}{p_{j}^{{}^{\prime}*}}b_{ij}^{{}^{\prime}*}\geq\sum_{j}\frac{a_{ij}}{p_{j}^{{}^{\prime}*}}b_{ij}^{\prime}, where pj=θij+bijp_{j}^{{}^{\prime}*}=\theta_{ij}+b^{{}^{\prime}*}_{ij}.

  2. 2.

    b~i=argmaxbiSiu~i(bi,bi)\tilde{b}_{i}^{*}=\arg\max_{b_{i}\in S_{i}}\tilde{u}_{i}(b_{i},b_{-i}) if and only if it holds that bi~Sijln(aijp~j)b~ijjln(aijp~j)b~ij\forall\tilde{b_{i}}\in S_{i}\ \sum_{j}\ln(\frac{a_{ij}}{\tilde{p}_{j}^{*}})\tilde{b}_{ij}^{*}\geq\sum_{j}\ln(\frac{a_{ij}}{\tilde{p}_{j}^{*}})\tilde{b}_{ij}, where p~j=θij+b~ij\tilde{p}_{j}^{*}=\theta_{ij}+\tilde{b}^{*}_{ij}.

Proof.

We will show the proof for (1), and the proof for (2) is similar. Let bib_{i}^{*} be a best response to bib_{-i} and let bib^{\prime}_{i} be some other strategy. Consider the restriction of ui(bi)u_{i}^{\prime}(b_{i}) to the line segment [bi,bi][b_{i}^{\prime},b_{i}^{*}] as follows; define f(ξ)=ui(bi(ξ))f(\xi)=u_{i}^{\prime}(b_{i}(\xi)) for bi(ξ)=bi+ξ(bibi)b_{i}(\xi)=b_{i}^{\prime}+\xi(b_{i}^{*}-b_{i}^{\prime}) where ξ[0,1]\xi\in[0,1]. As uiu_{i}^{\prime} is strictly concave and bib_{i}^{*} is the unique maximizer of uiu_{i}^{\prime}, it holds that ff is strictly concave and monotone increasing in ξ\xi. Therefore the derivative of ff must satisfy at the maximum point ξ=1\xi=1 that ddξf(1)0\frac{d}{d\xi}f(1)\geq 0. This is explicitly given by

ddξf(ξ)=biui(bi(ξ))(bibi).\frac{d}{d\xi}f(\xi)=\nabla_{b_{i}}u_{i}^{\prime}(b_{i}(\xi))(b_{i}^{*}-b_{i}^{\prime}).

Therefore, when substituting ξ=1\xi=1 in the derivative we get bi(1)=bib_{i}(1)=b_{i}^{*}, and

0ddξf(1)=biui(bi)(bibi)=jaijpjbijjaijpjbij,\begin{split}0\leq\frac{d}{d\xi}f(1)&=\nabla_{b_{i}}u_{i}^{\prime}(b^{*}_{i})(b_{i}^{*}-b_{i}^{\prime})\\ &=\sum_{j}\frac{a_{ij}}{p^{*}_{j}}b^{*}_{ij}-\sum_{j}\frac{a_{ij}}{p^{*}_{j}}b^{\prime}_{ij},\end{split}

which implies jaijpjbijaijpjbij\sum_{j}\frac{a_{ij}}{p^{*}_{j}}b^{\prime}_{ij}\leq\frac{a_{ij}}{p^{*}_{j}}b^{*}_{ij}, as required.

To complete the second direction of the proof of (1), consider bib^{*}_{i} for which the expression stated in right hand side of (1) is true for all bib^{\prime}_{i}. Then, fix any bib^{\prime}_{i} and again consider the restriction of uiu_{i}^{\prime} to [bi,bi][b^{\prime}_{i},b^{*}_{i}]. By direct calculation, as before but in the inverse direction, it holds that f(1)0f(1)\geq 0, and as uiu_{i}^{\prime} and f(ξ)f(\xi) are strictly concave, it thus must be that ddξf(ξ)\frac{d}{d\xi}f(\xi) is monotone decreasing in ξ\xi. Thus, for all ξ\xi we have ddξf(ξ)ddξf(1)0\frac{d}{d\xi}f(\xi)\geq\frac{d}{d\xi}f(1)\geq 0. This must mean that ξ=1\xi=1 is the maximizer of f(ξ)f(\xi) since for all ξ\xi, ddξf(ξ)0\frac{d}{d\xi}f(\xi)\geq 0 implies that f(ξ)f(\xi) is monotone increasing and therefore ui(bi)ui(bi)u_{i}^{\prime}(b^{*}_{i})\geq u_{i}^{\prime}(b^{\prime}_{i}). Finally, note that this holds for any bib^{\prime}_{i} and hence bib^{*}_{i} must be a global maximum of uiu_{i}^{\prime}.

Lemma 5.

Let (cj)j[m]m(c_{j})_{j\in[m]}\in\mathbb{R}^{m}, if there exists xΔmx^{*}\in\Delta^{m} (the mm-dimensional simplex) such that xΔm\forall x\in\Delta^{m} it holds that jcjxjjcjxj:=α\sum_{j}c_{j}x_{j}\leq\sum_{j}c_{j}x^{*}_{j}:=\alpha then:

  1. 1.

    for all jj we have that cjαc_{j}\leq\alpha, and

  2. 2.

    if xj>0x^{*}_{j}>0 then cj=αc_{j}=\alpha.

Proof.

(1) Assume for the sake of contradiction that there exists kk with ck>αc_{k}>\alpha then x=ekx=e_{k} (the “one-hot” vector with 1 at the kk’th coordinate and 0 in all other coordinates) yields jcjxj=ck>α=jcjxj\sum_{j}c_{j}x_{j}=c_{k}>\alpha=\sum_{j}c_{j}x^{*}_{j}, a contradiction.

Now, to prove (2), assume for the sake of contradiction that there exists kk with xk>0x^{*}_{k}>0 and ck<αckxk<αxkc_{k}<\alpha\implies c_{k}x^{*}_{k}<\alpha x^{*}_{k}. From (1) we have that cjαcjxjαxjc_{j}\leq\alpha\implies c_{j}x^{*}_{j}\leq\alpha x^{*}_{j}, summing the strict inequality with the weak ones over all jj yields jcjxj<jαxj=α\sum_{j}c_{j}x^{*}_{j}<\sum_{j}\alpha x^{*}_{j}=\alpha, a contradiction. ∎

Lemma 6.

Fix a player ii and any bid profile biSib_{-i}\in S_{-i} of the other players, then the following properties of best-response bids hold in the modified games G~\tilde{G} and GG^{\prime}.

  1. 1.

    The support set of bib^{*}_{i}, defined as si={j|bij>0}s^{*}_{i}=\{j|b^{*}_{ij}>0\}, is equal to the set {j|aij>cθij}\{j|a_{ij}>c^{*}\theta_{ij}\}, and for every jsij\in s^{*}_{i} we have that aijpj=c\frac{a_{ij}}{p^{*}_{j}}=c^{*}.

  2. 2.

    Best-response bids with respect to the utilities uiu_{i}^{\prime} and u~i\tilde{u}_{i} are equal and unique. That is, in the definition from Lemma 4 we have bi=b~ib^{\prime*}_{i}=\tilde{b}_{i}^{*} (denoted simply as bib^{*}_{i}).

  3. 3.

    Best-response bids are given by bij=(aijcθij)+b^{*}_{ij}=(\frac{a_{ij}}{c^{*}}-\theta_{ij})^{+} for a unique constant c(0,m/Bi)c^{*}\in(0,\nicefrac{{m}}{{B_{i}}}).

Proof.

By Lemma 3, u~i\tilde{u}_{i} and uiu_{i}^{\prime} are strictly concave in bib_{i} for any fixed bib_{-i}, and so each admits a unique maximizer. To see that they are equal, we use Lemma 4 and introduce constants c,dc,d to obtain

bijaijpjbijBijaijpjbijBi=c,and bijln(aijp~j)bijBijln(aijp~j)b~ijBi=d,\begin{split}\forall b_{i}\quad&\sum_{j}\frac{a_{ij}}{p^{\prime*}_{j}}\frac{b_{ij}}{B_{i}}\leq\sum_{j}\frac{a_{ij}}{p^{\prime*}_{j}}\frac{b^{\prime*}_{ij}}{B_{i}}=c,\ \text{and }\\ \forall b_{i}\quad&\sum_{j}\ln(\frac{a_{ij}}{\tilde{p}^{*}_{j}})\frac{b_{ij}}{B_{i}}\leq\sum_{j}\ln(\frac{a_{ij}}{\tilde{p}^{*}_{j}})\frac{\tilde{b}^{*}_{ij}}{B_{i}}=d,\end{split}

where pj=θij+bijp^{\prime*}_{j}=\theta_{ij}+b^{\prime*}_{ij} and p~j=θij+b~ij\tilde{p}^{*}_{j}=\theta_{ij}+\tilde{b}^{*}_{ij}.

For the ease of exposition, we assume that θij>0\theta_{ij}>0 for all jj. All the results stated below remain valid also when θij=0\theta_{ij}=0 for some jj.

Proof of (1): Applying Lemma 5 to each of those inequalities (once with x=1Bibix^{*}=\frac{1}{B_{i}}b^{\prime*}_{i} and twice with x=1Bib~ix^{*}=\frac{1}{B_{i}}\tilde{b}^{*}_{i}) and denoting the support sets of bij,b~ijb^{\prime*}_{ij},\tilde{b}^{*}_{ij} as s,s~s^{\prime*},\tilde{s}^{*}, respectively, we obtain the following. (1) js\forall j\in s^{\prime*} we have aijpj=c\frac{a_{ij}}{p^{\prime*}_{j}}=c and js\forall j\notin s^{\prime*} we have aijpjc\frac{a_{ij}}{p^{\prime*}_{j}}\leq c. Therefore, js\forall j\in s^{\prime*} bids are positive and c=aijpj=aijθij+bij<aijθijc=\frac{a_{ij}}{p^{\prime*}_{j}}=\frac{a_{ij}}{\theta_{ij}+b^{\prime*}_{ij}}<\frac{a_{ij}}{\theta_{ij}} while js\forall j\notin s^{\prime*} the bids are zero, and aijθijaijθij+0=aijpjc\frac{a_{ij}}{\theta_{ij}}\leq\frac{a_{ij}}{\theta_{ij}+0}=\frac{a_{ij}}{p^{\prime*}_{j}}\leq c, hence s={j|c<aijθij}s^{\prime*}=\{j|c<\frac{a_{ij}}{\theta_{ij}}\}. To prove the second inequality, we use the same argument but with d=ln(aijp~j)d=\ln(\frac{a_{ij}}{\tilde{p}^{*}_{j}}), and so we have that s~={j|ed<aijθij}\tilde{s}^{*}=\{j|e^{d}<\frac{a_{ij}}{\theta_{ij}}\}.

Proof of (2): We will show that c=edc=e^{d} and thus obtain that the vectors bi,b~ib^{\prime*}_{i},\tilde{b}_{i}^{*} are identical. Assume by way of contradiction that c<edc<e^{d}, then js~c<ed<aijθijjsj\in\tilde{s}^{*}\implies c<e^{d}<\frac{a_{ij}}{\theta_{ij}}\implies j\in s^{\prime*}, i.e., s~s\tilde{s}^{*}\subseteq s^{\prime*}. For all js~j\in\tilde{s}^{*} it holds that aijpj=c<ed=aijp~jp~j<pjb~ij<bij\frac{a_{ij}}{p^{\prime*}_{j}}=c<e^{d}=\frac{a_{ij}}{\tilde{p}^{*}_{j}}\implies\tilde{p}^{*}_{j}<p^{\prime*}_{j}\implies\tilde{b}^{*}_{ij}<b^{\prime*}_{ij}. Now we sum those inequalities over s~\tilde{s}^{*} and extend to the support ss^{\prime*}. By using the subset relation we proved, we obtain a contradiction:

Bi=js~b~ij<js~bijjsbij=Bi.B_{i}=\sum_{j\in\tilde{s}^{*}}\tilde{b}^{*}_{ij}<\sum_{j\in\tilde{s}^{*}}b^{\prime*}_{ij}\leq\sum_{j\in s^{\prime*}}b^{\prime*}_{ij}=B_{i}.

The case where ed<ce^{d}<c follows similar arguments with inverse roles of s,s~s^{\prime*},\tilde{s}^{*}. Thus, c=edc=e^{d} and s=s~s^{\prime*}=\tilde{s}^{*}, which implies aijpj=aijp~j\frac{a_{ij}}{p^{\prime*}_{j}}=\frac{a_{ij}}{\tilde{p}^{*}_{j}}, meaning that the prices are equal as well for all goods purchased. Therefore bi=b~ib^{\prime*}_{i}=\tilde{b}_{i}^{*}.

Proof of (3): Finally, observe that for jsc=aijpj=aijθij+bijbij=aijcθijj\in s^{\prime*}\quad c=\frac{a_{ij}}{p^{\prime*}_{j}}=\frac{a_{ij}}{\theta_{ij}+b^{\prime*}_{ij}}\implies b^{*}_{ij}=\frac{a_{ij}}{c^{*}}-\theta_{ij}, while otherwise bij=0b^{*}_{ij}=0, and aijcθij0\frac{a_{ij}}{c^{*}}-\theta_{ij}\leq 0. For the bounds on cc, notice that by definition it is equal to uiBi\frac{u^{*}_{i}}{B_{i}} and that ui(0,m)u^{*}_{i}\in(0,m), as ii can receive as little as almost zero (by the definition of the allocation mechanism, if ii places a bid on a good it will receive a fraction of this good, no matter how tiny) and receive at most (almost) all the goods.

The intuition of the above Lemma 6 is that it shows a property of the structure of best-response bids. If we consider all the goods sorted by the parameter aijθij\frac{a_{ij}}{\theta_{ij}}, then the best-response bids are characterized by some value cc^{*} which partitions the goods into two parts: goods that can offer the player a bang-per-buck of value cc^{*} and those that cannot. The former set of goods is exactly the support ss^{*}. When a player increases its bid on some good jj, the bang-per-buck offered by that good decreases, so clearly, any good with caijθijc^{*}\leq\frac{a_{ij}}{\theta_{ij}} cannot be considered in any optimal bundle. Consider the situation where the player has started spending money on goods with aijθij>c\frac{a_{ij}}{\theta_{ij}}>c^{*}, and that for some goods jj and kk we have that aijpj=aikθik\frac{a_{ij}}{p_{j}}=\frac{a_{ik}}{\theta_{ik}}, then if the player increases the bid on jj without increasing the bid on kk, this means that the bids are not optimal since the player could have received higher bang-per-buck by bidding on kk. The optimal option is a ‘water-filling‘ one: to split the remaining budget and use it to place bids on both jj and kk, yielding equal bang-per-buck for both (as Lemma 6 shows).

With the above lemmas, we are now ready to prove Theorem 2.

Proof.

(Theorem 2): We start by making the following claim.

Claim: The set of Market equilibria is equal to the set of Nash equilibria in the game GG^{\prime}.

Proof.

By definition, bb^{*} is a Nash equilibrium of GG^{\prime} if and only if for every ii it holds that bi=argmaxbiSiui(bi,bi)b_{i}^{*}=\arg\max_{b_{i}\in S_{i}}u_{i}^{\prime}(b_{i},b^{*}_{-i}), where by Lemma 3, for any fixed bib_{-i}, the bid profile bib_{i}^{*} is unique. By Lemma 4, we have that for xij=bijpjx^{*}_{ij}=b^{*}_{ij}p_{j}^{*} and any other xij=bijpj,ui(xi)ui(xi)x^{\prime}_{ij}=b^{\prime}_{ij}p^{*}_{j}\text{,}\quad u_{i}^{\prime}(x^{*}_{i})\geq u_{i}^{\prime}(x^{\prime}_{i}), if and only if (X,p)(X^{*},p^{*}) is a market equilibrium (market clearing and budget feasibility hold trivially). That is, the set of Nash equilibria of the game GG^{\prime} corresponds to the set of market equilibria (i.e., every bid profile bb^{*} which is a market equilibrium must be a Nash equilibrium of GG^{\prime}, and vice versa). ∎

Then, by Lemma 6, best responses by every player ii to any bid profile bib_{-i} of the other players with respect to u~i\tilde{u}_{i} and with respect to uiu_{i}^{\prime} are the same. Therefore, every Nash equilibrium in one game must be a Nash equilibrium in the other. Thus, we have that Nash equilibria of the game G~\tilde{G} are market equilibria, and vice versa – every market equilibrium must be Nash equilibrium of G~\tilde{G}. Finally, at a Nash equilibrium, no player can unilaterally improve their utility, so no improvement is possible to the potential, and in the converse, if the potential is not maximized, then there exists some player with an action that improves the potential, and so by definition their utility function as well, thus contradicting the definition of a Nash equilibrium. Therefore, we have that every bid profile that maximizes the potential is a Nash equilibrium of G~\tilde{G} and a market equilibrium (and vice versa). ∎

Appendix B Best Response Dynamics

We start with the following characterizations of best-response bids in the games G~\tilde{G} and GG^{\prime}.

Lemma 7.

Fix θi\theta_{i} and let bib^{*}_{i} be ii’s best response to θi\theta_{i} with support ss^{*}. Define cs=jsaijBi+jsθijc_{s}=\frac{\sum_{j\in s}a_{ij}}{B_{i}+\sum_{j\in s}\theta_{ij}} for every subset s[m]s\subseteq[m]. Let cc^{*} be as described in Lemma 6. Then, it holds that c=cscsc^{*}=c_{s^{*}}\geq c_{s} for all s[m]s\subseteq[m]. Furthermore, if sss^{*}\not\subset s then cs>csc_{s^{*}}>c_{s}.

Proof.

Let bib^{*}_{i} be a best response to θi\theta_{i} with support ss^{*}. By Lemma 6 we have that bij=(aijcθij)+b^{*}_{ij}=(\frac{a_{ij}}{c^{*}}-\theta_{ij})^{+}. By summing over ss^{*} we obtain that Bi=jsaijcθijB_{i}=\sum_{j_{\in}s^{*}}\frac{a_{ij}}{c^{*}}-\theta_{ij}. Rearranging yields c=jsaijBi+jsθijc^{*}=\frac{\sum_{j\in s^{*}a_{ij}}}{B_{i}+\sum_{j\in s^{*}}\theta_{ij}} which is csc_{s^{*}} by definition. Now we prove that c=maxs[m]csc^{*}=\max_{s\subseteq[m]}c_{s}. A key observation to the proof is that, by Lemma 6, if jsj\in s^{*} then cθij<aijc^{*}\theta_{ij}<a_{ij} and otherwise cθijaijc^{*}\theta_{ij}\geq a_{ij}.

For a set ss^{\prime} distinct from ss^{*} we have two cases:

Case (1): sss^{*}\not\subset s^{\prime}. Consider a bid profile bib^{\prime}_{i} that for every good jj in sss^{\prime}\cap s^{*} (if the intersection is not empty) places a bid higher by ϵ>0\epsilon>0 than bijb^{*}_{ij} and distributes the rest of ii’s budget uniformly between all other goods in ss:

bij={bij+ϵif jss,Bijss(bij+ϵ)|ss|otherwise.b^{\prime}_{ij}=\begin{cases}b^{*}_{ij}+\epsilon&\quad\text{if }j\in s^{\prime}\cap s^{*},\\ \\ \frac{B_{i}-\sum_{j\in s^{\prime}\cap s^{*}}(b^{*}_{ij}+\epsilon)}{|s^{\prime}\setminus s^{*}|}&\quad\text{otherwise}.\\ \end{cases}

For ϵ\epsilon small enough, we have jsbij=Bi\sum_{j\in s^{\prime}}b^{\prime}_{ij}=B_{i} and the support of bib^{\prime}_{i} is indeed ss^{\prime}. For every jssj\in s^{*}\cap s^{\prime} we have bij>bijb^{\prime}_{ij}>b^{*}_{ij} and by adding θij\theta_{ij} to both sides we obtain pj>pjp^{\prime}_{j}>p^{*}_{j}; multiplying both sides by cc^{*} yields (i) cpj>cpj=aijc^{*}p^{\prime}_{j}>c^{*}p^{*}_{j}=a_{ij}, where the equality is by Lemma 6, while for every jssj\in s^{\prime}\setminus s^{*} it holds that cθijaijc^{*}\theta_{ij}\geq a_{ij} by which adding cbijc^{*}b^{\prime}_{ij} to the left hand side only increases it and implies (ii) cpj>aijc^{*}p^{\prime}_{j}>a_{ij}. Summing over inequalities (i) and (ii) for all jj appropriately, we obtain cjspj>jsaijc^{*}\sum_{j\in s^{\prime}}p^{\prime}_{j}>\sum_{j\in s^{\prime}}a_{ij}, observe that jspj=js(bij+θij)=Bi+jsθij\sum_{j\in s^{\prime}}p^{\prime}_{j}=\sum_{j\in s^{\prime}}(b^{\prime}_{ij}+\theta_{ij})=B_{i}+\sum_{j\in s^{\prime}}\theta_{ij}, and thus by division, we obtain the result: c>jsaijBi+jsθij=csc^{*}>\frac{\sum_{j\in s^{\prime}}a_{ij}}{B_{i}+\sum_{j\in s^{\prime}}\theta_{ij}}=c_{s^{\prime}}.

Case (2): sss^{*}\subset s^{\prime}. In this case, the idea used above can not be applied since adding ϵ\epsilon to every bid bijb^{*}_{ij} would create bids bijb^{\prime}_{ij} that exceed the budget BiB_{i}. As stated above, the equality c=jsaijBi+jsθijc^{*}=\frac{\sum_{j\in s^{*}}a_{ij}}{B_{i}+\sum_{j\in s^{*}}\theta_{ij}} holds where the sums are taken over all members of ss^{*}, by rearranging we get cBi+cjsθij=jsaijc^{*}B_{i}+c^{*}\sum_{j\in s^{*}}\theta_{ij}=\sum_{j\in s^{*}}a_{ij}. For all jssj\in s^{\prime}\setminus s^{*} it holds that cθijaijc^{*}\theta_{ij}\geq a_{ij} and by summing those inequalities for all jj and adding the equality above we obtain: cBi+cjsθijjsaijc^{*}B_{i}+c^{*}\sum_{j\in s^{\prime}}\theta_{ij}\geq\sum_{j\in s^{\prime}}a_{ij} Rearranging yields the result: cjsaijBi+jsθij=csc^{*}\geq\frac{\sum_{j\in s^{\prime}}a_{ij}}{B_{i}+\sum_{j\in s^{\prime}}\theta_{ij}}=c_{s^{\prime}}.

And so cc^{*} is obtained as the maximum over all csc_{s}, as required. ∎

Lemma 8.

The function BRi:SiSiBR_{i}:S_{-i}\to S_{i} which maps bib_{-i} to its best response bib_{i}^{*} is continuous.

Proof.

By Lemma 6, best-response bids are given by bij=max{aijcθij,0}b^{*}_{ij}=\max\{\frac{a_{ij}}{c^{*}}-\theta_{ij},0\}, with support sis^{*}_{i}. We wish to show that bib^{*}_{i} is a continuous in bib_{-i}. We do so by showing that bijb^{*}_{ij} is obtained by a composition of continuous functions. As θi\theta_{i} is a sum of elements from bib_{-i}, it suffices to prove continuity in the variable θi\theta_{i}. The expression for bijb^{*}_{ij} is the maximum between zero and a continuous function of θij\theta_{ij}, which is continuous in θi\theta_{i}, and so we are left to prove that aijcθij\frac{a_{ij}}{c^{*}}-\theta_{ij} is continuous in θi\theta_{i}. More specifically, it suffices to show that cc^{*} as defined in Lemma 6 is continuous in θi\theta_{i}.

By Lemma 7, cc^{*} is obtained as the maximum over all csc_{s} functions, where each is a continuous function itself in θi\theta_{i}, and thus cc^{*} is continuous in θi\theta_{i}. ∎

To prove Theorem 1 on the convergence of best-response dynamics we use the following known result (for further details see [46]).

Theorem (Jensen 2009 [46]): Let GG be a best-reply potential game with single-valued, continuous best-reply functions and compact strategy sets. Then any admissible sequential best-reply path converges to the set of pure strategy Nash equilibria.

Proof.

(Theorem 4): G~\tilde{G} is a potential game, which is a stricter notion than being a best-reply potential game (i.e., every potential game is also a best-reply potential game). By Lemma 6, best replies are unique, and so the function BRiBR_{i} is single valued. Furthermore, Lemma 8 shows that it is also a continuous function. By definition, for every ii the strategy set SiS_{i} is compact, and so their product SS is compact as well. Note that while the best-reply function with respect to the standard utility function is formally undefined when the bids of all other players on some good jj are zero, what we need is for the best-reply function to the associated utility to be continuous, and this is indeed the case. Admissibility of the dynamics is also guaranteed by the liveness constraint on adversarial scheduling of the dynamics, and thus by the theorem cited above, best-reply dynamics converges to the set of Nash equilibria of G~\tilde{G}. Since every element in this set is market equilibrium (by Theorem 2) and equilibrium prices are unique (see the model section in the main text), we have that any dynamic of the prices are guaranteed to converge to equilibrium prices. Furthermore for a generic market there is a unique market equilibrium (by Theorem 5) and convergence to the set in fact means convergence to the point bb^{*}, the market-equilibrium bid profile. ∎

Proof.

(Proposition 1): Fix a player ii, fix any bid profile bib_{-i} of the other players and let bib^{*}_{i} be ii’s best response to bib_{-i}, by Lemma 6, bij=(aijcθij)+b^{*}_{ij}=(\frac{a_{ij}}{c^{*}}-\theta_{ij})^{+} for cc^{*} being a unique constant. We present a simple algorithm which computes cc^{*} and has a run-time of 𝒪(mlog(m))\mathcal{O}(m\log(m)).

Algorithm 1 Compute cc^{*}
ai,Bi,θia_{i},B_{i},\theta_{i}
Sort the values ai,θia_{i},\theta_{i} according to aijθij\frac{a_{ij}}{\theta_{ij}} in a descending order. If there are goods with θij=0\theta_{ij}=0, sort them separately according to aija_{ij} and place them as a prefix (lower indices) before the other sorted goods. Equal values are sorted in a lexicographical order.
Set: a0,θ0,cs0,c0a\leftarrow 0,\quad\theta\leftarrow 0,\quad c_{s}\leftarrow 0,\quad c^{*}\leftarrow 0
for j=1,,mj=1,\dots,m do
     aa+aij,θθ+θija\leftarrow a+a_{ij},\ \theta\leftarrow\theta+\theta_{ij}
     csaθ+Bic_{s}\leftarrow\frac{a}{\theta+B_{i}}
     cmax{c,cs}c^{*}\leftarrow\max\{c^{*},c_{s}\}
end for
return cc^{*}

To see that this process indeed reaches cc^{*}, assume w.l.o.g. that the goods are sorted by aijθij\frac{a_{ij}}{\theta_{ij}} in a descending order. For ease of exposition, assume θij>0\theta_{ij}>0 for all jj; the case with θij=0\theta_{ij}=0 for some goods is similar. By Lemma 6 we have s={j|aij>cθij}s^{*}=\{j|a_{ij}>c^{*}\theta_{ij}\}. And so, if k<jk<j and jsj\in s^{*} then ksk\in s^{*}, since in this case aikθik>aijθij>c\frac{a_{ik}}{\theta_{ik}}>\frac{a_{ij}}{\theta_{ij}}>c^{*}. Therefore, ss^{*} must be one of the following sets: [1],[2],[3],,[m][1],[2],[3],\dots,[m]. By Lemma 7 we have c=maxs[m]csc^{*}=\max_{s\subseteq[m]}c_{s}. For any set mentioned, the algorithm computes cs=jsaijBi+jsθijc_{s}=\frac{\sum_{j\in s}a_{ij}}{B_{i}+\sum_{j\in s}\theta_{ij}} and finds the maximal among all such csc_{s}, and therefore it finds cc^{*}.

As for the running time of the algorithm, it is dominated by the running time of the sorting operation which is 𝒪(mlog(m))\mathcal{O}(m\log(m)). ∎

After proving that the best response to a bid profile can be computed efficiently, we can prove now that proportional response, applied by a single player while all the other players’ bids are held fix, converges in the limit to that best response.

Proof.

(Proposition 2): Fix a player ii and fix any bid profile bib_{-i} of the other players, let bib^{*}_{i} be the best response of ii to bib_{-i} with support ss^{*} and let (bit)t=1(b^{t}_{i})_{t=1}^{\infty} be a sequence of consecutive proportional responses made by ii. That is, bit+1=fi(bit)b^{t+1}_{i}=f_{i}(b^{t}_{i}). We start the proof with several claims proving that any sub-sequence of (bit)t=1(b^{t}_{i})_{t=1}^{\infty} cannot converge to any fixed point of fif_{i} other than bib^{*}_{i}. After establishing this, we prove that the sequence indeed converges to bib^{*}_{i}.

Claim 1: Every fixed point of Proportional Response Dynamic has equal ‘bang-per-buck‘ for all goods with a positive bid. That is, if bib^{**}_{i} is a fixed point of fif_{i} then aijpj=uiBi\frac{a_{ij}}{p^{**}_{j}}=\frac{u^{**}_{i}}{B_{i}} for every good jj with bij>0b^{**}_{ij}>0, where uiu^{**}_{i} is the utility achieved for ii with the bids bib^{**}_{i}.

Proof: By substituting bib^{**}_{i} into the PRD update rule, we have

bi=fi(bi)jbij=aij/pjui/Bibijeither bij=0 or aij/pj=ui/Bi.\begin{split}&b^{**}_{i}=f_{i}(b^{**}_{i})\\ \iff&\forall j\quad b^{**}_{ij}=\frac{\nicefrac{{a_{ij}}}{{p^{**}_{j}}}}{\nicefrac{{u^{**}_{i}}}{{B_{i}}}}b^{**}_{ij}\\ \iff&\text{either }b^{**}_{ij}=0\ \text{ or }\ \nicefrac{{a_{ij}}}{{p^{**}_{j}}}=\nicefrac{{u^{**}_{i}}}{{B_{i}}}.\end{split}

Claim 2: The following properties of bib^{*}_{i} hold.

  1. 1.

    Except bib^{*}_{i}, there are no other fixed points of fif_{i} with a support that contains the support of bib^{*}_{i}. Formally, there are no fixed points bibib^{**}_{i}\neq b^{*}_{i} of fif_{i} with support ss^{**} such that sss^{*}\subset s^{**}.

  2. 2.

    The bids bib^{*}_{i} achieve a higher utility in the original game GG, denoted uiu^{*}_{i}, than any other fixed point of Proportional Response Dynamics. Formally, let bib^{**}_{i} be any fixed point other than bib^{*}_{i}, with utility uiu^{**}_{i} in the original game GG, then ui>uiu^{*}_{i}>u^{**}_{i}.

Proof: Let bib_{i} be any fixed point of fif_{i}. By the previous claim it holds that aijpj=uiBi\frac{a_{ij}}{p_{j}}=\frac{u_{i}}{B_{i}} whenever bij>0b_{ij}>0. Multiplying by pjp_{j} yields aij=uiBipja_{ij}=\frac{u_{i}}{B_{i}}p_{j}. Summing over jj with bij>0b_{ij}>0 and rearranging yields uiBi=jsaijjspj=cs\frac{u_{i}}{B_{i}}=\frac{\sum_{j\in s}a_{ij}}{\sum_{j\in s}p_{j}}=c_{s} as defined in Lemma 7 with support ss. By that lemma, we have that cscsc_{s^{*}}\geq c_{s} for any set ss distinct from ss^{*}. Thus, we have that ui/Bi=cscs=ui/Bi\nicefrac{{u^{*}_{i}}}{{B_{i}}}=c_{s^{*}}\geq c_{s^{**}}=\nicefrac{{u^{**}_{i}}}{{B_{i}}} for bib^{**}_{i} being a fixed point of fif_{i} other than bib^{*}_{i} with support ss^{**} and utility value uiu^{**}_{i}.

Assume for the sake of contradiction that sss^{*}\subset s^{**}. If jsj\in s^{*} then jsj\in s^{**}. By Claim 1 for every such jj the following inequality holds,

aijpj=uiBiuiBi=aijpj,\frac{a_{ij}}{p^{*}_{j}}=\frac{u^{*}_{i}}{B_{i}}\geq\frac{u^{**}_{i}}{B_{i}}=\frac{a_{ij}}{p^{**}_{j}},

implying that pjpjp^{**}_{j}\geq p^{*}_{j}. Subtracting θij\theta_{ij} from both sides yields bijbijb^{**}_{ij}\geq b^{*}_{ij}. Summing over jsj\in s^{*} yields a contradiction:

Bi=jsbijjsbij<jsbij=Bi,B_{i}=\sum_{j\in s^{*}}b^{*}_{ij}\leq\sum_{j\in s^{*}}b^{**}_{ij}<\sum_{j\in s^{**}}b^{**}_{ij}=B_{i},

where the first inequality is as explained above, and the last by the strict set containment sss^{*}\subset s^{**}.

Finally, as there are no fixed points with support ss^{**} containing ss^{*}, by Lemma 7, the inequality stated above is strict, that is cs>csc_{s^{*}}>c_{s^{**}} and so ui>uiu^{*}_{i}>u^{**}_{i}.

Claim 3: If bibib^{**}_{i}\neq b^{*}_{i} is a fixed point of fif_{i} then bib^{**}_{i} is not a limit point of any sub-sequence of (bit)t=0(b^{t}_{i})_{t=0}^{\infty}.

Proof: The proof considers two cases: (1) When uiu_{i} is continuous at bb^{**} (2) when continuity doesn’t hold. Let (btk)k=1(b^{t_{k}})_{k=1}^{\infty} be a converging subsequence of (bit)t=0(b^{t}_{i})_{t=0}^{\infty}.

Case (1): The utility function uiu_{i} is continuous at bib^{**}_{i} when for every good jj it holds that θij>0\theta_{ij}>0 or bij>0b^{**}_{ij}>0. i.e. that there is no good jj with both θij=0\theta_{ij}=0 and bij=0b^{**}_{ij}=0. This is implied directly from the allocation rule xij=bijθij+bijx_{ij}=\frac{b_{ij}}{\theta_{ij}+b_{ij}} (see the formal definition in Section 2) and the fact that ui=jaijxiju_{i}=\sum_{j}a_{ij}x_{ij}. Examine the support of bib^{**}_{i}, by Claim 2 there are no fixed points with support set ss^{**} containing ss^{*}. Therefore sss^{*}\not\subset s^{**} implying that there exists a good jssj\in s^{*}\setminus s^{**}. That is, by definition of the supports, there exists jj with bij>0b^{*}_{ij}>0 and bij=0b^{**}_{ij}=0. Consider such jj and assume for the sake of contradiction that bib^{**}_{i} is indeed a limit point. Then, by definition, for every δ>0\delta^{**}>0 exists a TT s.t. if t>Tt>T then bitkbij<δ\|b^{t_{k}}_{i}-b^{**}_{ij}\|<\delta^{**}. Specifically it means that |bijtkbij|<δ|b^{t_{k}}_{ij}-b^{**}_{ij}|<\delta^{**} whenever t>Tt>T.

By Claim 2, ui>uiu^{*}_{i}>u^{**}_{i}. Then, by continuity there exists a δ\delta^{\prime} s.t. if bibi<δ\|b^{**}_{i}-b_{i}\|<\delta^{\prime} then |ui(bi)ui|<uiui|u_{i}(b_{i})-u^{**}_{i}|<u^{*}_{i}-u^{**}_{i}. Take δ<min{δ,bij}\delta^{**}<\min\{\delta^{\prime},b^{*}_{ij}\} and, by the assumption of convergence, there is a TT s.t. for t>Tt>T and we have that bitkbi<δ\|b^{t_{k}}_{i}-b^{**}_{i}\|<\delta^{**}. This implies (I) |bijtk0|<δ<bij|b^{t_{k}}_{ij}-0|<\delta^{**}<b^{*}_{ij} as bij=0b^{**}_{ij}=0 and (II) |uitkui|<uiuiuitk<ui|u^{t_{k}}_{i}-u^{**}_{i}|<u^{*}_{i}-u^{**}_{i}\implies u^{t_{k}}_{i}<u^{*}_{i}. From these two, we can conclude that

aijpjtk=aijθij+bijtk>aijθij+bij=uiBi>uitkBi.\frac{a_{ij}}{p^{t_{k}}_{j}}=\frac{a_{ij}}{\theta_{ij}+b^{t_{k}}_{ij}}>\frac{a_{ij}}{\theta_{ij}+b^{*}_{ij}}=\frac{u^{*}_{i}}{B_{i}}>\frac{u^{t_{k}}_{i}}{B_{i}}.

Finally, observe that by rearranging the PRD update rule we get bijt+1=aij/pjtkuitk/Bibijtkb^{t+1}_{ij}=\frac{\nicefrac{{a_{ij}}}{{p^{t_{k}}_{j}}}}{\nicefrac{{u^{t_{k}}_{i}}}{{B_{i}}}}b^{t_{k}}_{ij}, implying that bijtk+1>bijtkb^{t_{k}+1}_{ij}>b^{t_{k}}_{ij} since aij/pjtkuitk/Bi>1\frac{\nicefrac{{a_{ij}}}{{p^{t_{k}}_{j}}}}{\nicefrac{{u^{t_{k}}_{i}}}{{B_{i}}}}>1 for t>Tt>T and bij0>0b^{0}_{ij}>0. This means that for all tk>Tt_{k}>T we have bijtk>bijT+1b^{t_{k}}_{ij}>b^{T+1}_{ij}. That is, bijtkb^{t_{k}}_{ij} cannot converge to zero and thus the subsequence cannot converge to bib^{**}_{i}, a contradiction.

Case (2): When there exists a good jj with θij=0\theta_{ij}=0 and bij=0b^{**}_{ij}=0 we have that uiu_{i} is not continuous at bib^{**}_{i} and the previous idea doesn’t work. Instead we will contradict the PRD update rule. Assume of the sake of contradiction that bib^{**}_{i} is a limit point of a subsequence of PRD updates. Therefore for every ϵ\epsilon exists a TT s.t. if tk>Tt_{k}>T then |bijtkbij|<ϵ|b^{t_{k}}_{ij}-b^{**}_{ij}|<\epsilon. Note that bij=0b^{**}_{ij}=0 in this case and set ϵ<aijmBi\epsilon<\frac{a_{ij}}{m}B_{i} and so, for tk>Tt_{k}>T it holds that aijbijtk>mBi\frac{a_{ij}}{b^{t_{k}}_{ij}}>\frac{m}{B_{i}}. Also note that pjtk=θijtk+bijtk=bijtkp^{t_{k}}_{j}=\theta^{t_{k}}_{ij}+b^{t_{k}}_{ij}=b^{t_{k}}_{ij} and that the maximal utility a buyer may have is mm (when it is allocated every good entirely). Then overall we have that aijpjtk>mBi>uitkBi\frac{a_{ij}}{p^{t_{k}}_{j}}>\frac{m}{B_{i}}>\frac{u^{t_{k}}_{i}}{B_{i}}. The PRD update rule is bijtk+1=aij/pjtkuitk/Bibitkb^{t_{k}+1}_{ij}=\frac{\nicefrac{{a_{i}j}}{{p^{t_{k}}_{j}}}}{\nicefrac{{u^{t_{k}}_{i}}}{{B_{i}}}}b^{t_{k}}_{i}. But since the ratio aij/pijtkuitk/Bi\frac{\nicefrac{{a_{i}j}}{{p^{t_{k}}_{ij}}}}{\nicefrac{{u^{t_{k}}_{i}}}{{B_{i}}}} is greater than 1 it must be that bijtk+1>bijtkb^{t_{k}+1}_{ij}>b^{t_{k}}_{ij}. And so every subsequent element of the subsequence is bounded below by bijT+1>0b^{T+1}_{ij}>0 and as before, we reach a contradiction as the subsequence cannot converge to bib^{**}_{i}. ∎

Finally we can prove the convergence of the sequence (bit)t=1(b^{t}_{i})_{t=1}^{\infty}. As the action space SiS_{i} is compact, there exists a converging subsequence bitkb^{t_{k}}_{i} with the limit bib^{**}_{i}. If bi=bib^{**}_{i}=b^{*}_{i} for any such subsequence, then we are done. Otherwise, assume bibib^{**}_{i}\neq b^{*}_{i}. By the previous claim any fixed point of fif_{i} other than bib^{*}_{i} is not a limit point of any subsequence, thus bib^{**}_{i} is not a fixed point of fif_{i}. By Lemma 1, any subset of players performing proportional response, strictly increase the potential function unless performed at a fixed point. When discussing a proportional response of a single player, with all others remaining fixed, this implies, by the definition of potential function, that u~i\tilde{u}_{i} is increased at each such step. Let ϵ<u~i(fi(b))u~i(b)\epsilon<\tilde{u}_{i}(f_{i}(b^{**}))-\tilde{u}_{i}(b^{**}), this quantity is positive since bib^{**}_{i} is not a fixed point. The function u~ifi\tilde{u}_{i}\circ f_{i} is a continuous function and bitkb^{t_{k}}_{i} converges to bib^{**}_{i} therefore there exists a TT such that for all tk>Tt_{k}>T we have that |u~i(fi(b))u~i(fi(btk))|<ϵ|\tilde{u}_{i}(f_{i}(b^{**}))-\tilde{u}_{i}(f_{i}(b^{t_{k}}))|<\epsilon. Substituting ϵ\epsilon yields u~i(fi(b))u~i(fi(btk))<u~i(fi(b))u~i(b)\tilde{u}_{i}(f_{i}(b^{**}))-\tilde{u}_{i}(f_{i}(b^{t_{k}}))<\tilde{u}_{i}(f_{i}(b^{**}))-\tilde{u}_{i}(b^{**}) which implies u~i(b)<u~i(fi(btk))=u~i(btk+1)u~i(btk+1)\tilde{u}_{i}(b^{**})<\tilde{u}_{i}(f_{i}(b^{t_{k}}))=\tilde{u}_{i}(b^{t_{k}+1})\leq\tilde{u}_{i}(b^{t_{k+1}}). That is, the sequence ui(bitk)u_{i}(b^{t_{k}}_{i}) is bounded away from u~i(b)\tilde{u}_{i}(b^{**}) and since u~i\tilde{u}_{i} is a continuous function, this implies that bitkb^{t_{k}}_{i} is bounded away from bib^{**}_{i} — a contradiction to convergence. ∎

Appendix C Simultaneous Play by Subsets of Agents

In order to prove Lemma 1, we first need some further definitions and technical lemmas. We use the notation D(xy)D(x\|y) to denote the KL divergence between the vectors xx and yy, i.e., D(xy)=jxjln(xjyj)D(x\|y)=\sum_{j}x_{j}\ln(\frac{x_{j}}{y_{j}}). For a subset of the players vv\subseteq\mathcal{B}, the subscript vv on vectors denotes the restriction of the vector to the coordinates of the players in vv, that is, for a vector bb we use the notation bv=(bij)iv,j[m]b_{v}=(b_{ij})_{i\in v,j\in[m]} to express the restriction to the subset. Φ(bv;bv)\ell_{\Phi}(b_{v};b^{\prime}_{v}) denotes the linear approximation of Φ\Phi; that is, Φ(bv;bv)=Φ(bv)+bvΦ(bv)(bvbv)\ell_{\Phi}(b_{v};b^{\prime}_{v})=\Phi(b^{\prime}_{v})+\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v}).

The idea described in the next lemma to present the potential function as a linear approximation term and a divergence term was first described in [5] for a different scenario when all agents act together in a synchronized manner using mirror descent; we extend this idea to our asynchronous setting which requires using different methods and as well as embedding it in a game.

Lemma 9.

Fix a subset of the players vv\subset\mathcal{B} and a bid profile bvb_{-v} of the other players. Then, for all bv,bvSvb_{v},b^{\prime}_{v}\in S_{v} we have that Φ(bv)=Φ(bv;bv)D(pp)\Phi(b_{v})=\ell_{\Phi}(b_{v};b^{\prime}_{v})-D(p\|p^{\prime}), where p=ivbij+ivbijp=\sum_{i\notin v}b_{ij}+\sum_{i\in v}b_{ij} and p=ivbij+ivbijp^{\prime}=\sum_{i\notin v}b_{ij}+\sum_{i\in v}b^{\prime}_{ij}.

Proof.

Calculating the difference Φ(bv)Φ(bv;bv)\Phi(b_{v})-\ell_{\Phi}(b_{v};b^{\prime}_{v}) yields

Φ(bv)Φ(bv;bv)=Φ(bv)Φ(bv)bvΦ(bv)(bvbv).\begin{split}\Phi(b_{v})-\ell_{\Phi}(b_{v};b^{\prime}_{v})&=\Phi(b_{v})-\Phi(b^{\prime}_{v})-\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v}).\end{split}

We rearrange the term Φ(bv)Φ(bv)\Phi(b_{v})-\Phi(b^{\prime}_{v}) as follows.

Φ(bv)Φ(bv)=iv,j[m](bijbij)ln(aij)j(pjln(pj)pjln(pj))j(pjpj)=iv,j[m](bijbij)ln(aij)j(pjln(pj)pjln(pj)),\begin{split}\Phi(b_{v})-\Phi(b^{\prime}_{v})&=\sum_{i\in v,j\in[m]}(b_{ij}-b^{\prime}_{ij})\ln(a_{ij})-\sum_{j}(p_{j}\ln(p_{j})-p^{\prime}_{j}\ln(p^{\prime}_{j}))-\sum_{j}(p_{j}-p^{\prime}_{j})\\ &=\sum_{i\in v,j\in[m]}(b_{ij}-b^{\prime}_{ij})\ln(a_{ij})-\sum_{j}(p_{j}\ln(p_{j})-p^{\prime}_{j}\ln(p^{\prime}_{j})),\end{split}

where the last equality is since jpj=1\sum_{j}p_{j}=1 for any set of prices because the economy is normalized (see the model section in the main text).

The term bvΦ(bv)(bvbv)\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v}) is expanded as follows.

bvΦ(bv)(bvbv)=iv,j[m]ln(aijpj)(bijbij)=iv,j[m]ln(aij)(bijbij)iv,j[m]ln(pj)(bijbij).\begin{split}\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v})&=\sum_{i\in v,j\in[m]}\ln(\frac{a_{ij}}{p^{\prime}_{j}})(b_{ij}-b^{\prime}_{ij})\\ &=\sum_{i\in v,j\in[m]}\ln(a_{ij})(b_{ij}-b^{\prime}_{ij})-\sum_{i\in v,j\in[m]}\ln(p^{\prime}_{j})(b_{ij}-b^{\prime}_{ij}).\end{split}

Subtracting the latter from the former cancels out the term iv,j[m]ln(aij)(bijbij)\sum_{i\in v,j\in[m]}\ln(a_{ij})(b_{ij}-b^{\prime}_{ij}), and we are left with the following.

Φ(bv)Φ(bv;bv)=Φ(bv)Φ(bv)bvΦ(bv)(bvbv)=iv,j[m]ln(pj)(bijbij)j(pjln(pj)pjln(pj))=jpjln(pj)(pjivbij+ivbij)ln(pj)=jpjln(pj)(θvj+ivbij)ln(pj)=jpjln(pj)pjln(pj)=D(pp).\begin{split}\Phi(b_{v})-\ell_{\Phi}(b_{v};b^{\prime}_{v})&=\Phi(b_{v})-\Phi(b^{\prime}_{v})-\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v})\\ &=\sum_{i\in v,j\in[m]}\ln(p^{\prime}_{j})(b_{ij}-b^{\prime}_{ij})-\sum_{j}(p_{j}\ln(p_{j})-p^{\prime}_{j}\ln(p^{\prime}_{j}))\\ &=-\sum_{j}p_{j}\ln(p_{j})-(p^{\prime}_{j}-\sum_{i\in v}b^{\prime}_{ij}+\sum_{i\in v}b_{ij})\ln(p^{\prime}_{j})\\ &=-\sum_{j}p_{j}\ln(p_{j})-(\theta_{vj}+\sum_{i\in v}b_{ij})\ln(p^{\prime}_{j})\\ &=-\sum_{j}p_{j}\ln(p_{j})-p_{j}\ln(p^{\prime}_{j})\\ &=-D(p\|p^{\prime}).\end{split}

Lemma 10.

For any subset of the players vv\subset\mathcal{B} and any bid profile bvb_{-v} of the other players and for every bv,bvSvb_{v},b^{\prime}_{v}\in S_{v} it holds that D(pp)D(bvbv)D(p\|p^{\prime})\leq D(b_{v}\|b^{\prime}_{v}), with equality only when bv=bvb_{v}=b^{\prime}_{v}.

Proof.

We begin by proving a simpler case where v={i}v=\{i\} for some player ii and use it to prove the more general statement. Fix ii and bib_{-i}, which implies fixing some θi\theta_{i}. KL divergence is convex in both arguments with equality only if the arguments are equal; formally, for λ(0,1)\lambda\in(0,1) it holds that D(λθi+(1λ)biλθi+(1λ)bi)λD(θiθi)+(1λ)D(bibi)D(\lambda\theta_{i}+(1-\lambda)b_{i}\|\lambda\theta_{i}+(1-\lambda)b^{\prime}_{i})\leq\lambda D(\theta_{i}\|\theta_{i})+(1-\lambda)D(b_{i}\|b^{\prime}_{i}), which is equivalent to D(λθi+(1λ)biλθi+(1λ)bi)(1λ)D(bibi)D(\lambda\theta_{i}+(1-\lambda)b_{i}\|\lambda\theta_{i}+(1-\lambda)b^{\prime}_{i})\leq(1-\lambda)D(b_{i}\|b^{\prime}_{i}), with equality only if bi=bib_{i}=b^{\prime}_{i} (since D(θiθi)=0D(\theta_{i}\|\theta_{i})=0). Substituting λ=12\lambda=\frac{1}{2} and noting that pj=θij+bijp_{j}=\theta_{ij}+b_{ij} (and the same for pjp^{\prime}_{j} and bijb^{\prime}_{ij}), we obtain the following relation.

D(12p12p)=D(12θi+12bi12θi+12bi)12D(bibi).\begin{split}D(\frac{1}{2}p\|\frac{1}{2}p^{\prime})&=D(\frac{1}{2}\theta_{i}+\frac{1}{2}b_{i}\|\frac{1}{2}\theta_{i}+\frac{1}{2}b^{\prime}_{i})\\ &\leq\frac{1}{2}D(b_{i}\|b^{\prime}_{i}).\end{split}

On the other hand, the expression D(12p12p)D(\frac{1}{2}p\|\frac{1}{2}p^{\prime}) can be evaluated as follows.

D(12p12p)=j12pjln(1/2pj1/2pj)=12jpjln(pjpj)=12D(pp).\begin{split}D(\frac{1}{2}p\|\frac{1}{2}p^{\prime})&=\sum_{j}\frac{1}{2}p_{j}\ln(\frac{\nicefrac{{1}}{{2}}p_{j}}{\nicefrac{{1}}{{2}}p^{\prime}_{j}})\\ &=\frac{1}{2}\sum_{j}p_{j}\ln(\frac{p_{j}}{p^{\prime}_{j}})\\ &=\frac{1}{2}D(p\|p^{\prime}).\end{split}

And therefore, we have D(pp)D(bibi)D(p\|p^{\prime})\leq D(b_{i}\|b^{\prime}_{i}), with equality only if bi=bib_{i}=b^{\prime}_{i}.

Now we can prove the general case, as stated fix vv and bvb_{-v} and let bv,bvSvb_{v},b^{\prime}_{v}\in S_{v}. We know that for all ivi\in v it is true that D(pp)D(bibi)D(p\|p^{\prime})\leq D(b_{i}\|b^{\prime}_{i}), summing those inequalities for all ivi\in v yields |v|D(pp)ivD(bibi)|v|D(p\|p^{\prime})\leq\sum_{i\in v}D(b_{i}\|b^{\prime}_{i}), on the one hand clearly D(pp)|v|D(pp)D(p\|p^{\prime})\leq|v|D(p\|p^{\prime}) and on the other hand ivD(bibi)=ivjbijln(bijbij)=D(bvbv)\sum_{i\in v}D(b_{i}\|b^{\prime}_{i})=\sum_{i\in v}\sum_{j}b_{ij}\ln(\frac{b_{ij}}{b^{\prime}_{ij}})=D(b_{v}\|b^{\prime}_{v}) and the result is obtained. ∎

Lemma 11.

Let vv\subseteq\mathcal{B}, let fv:SSf_{v}:S\to S be a proportional response update function for members of vv and identity for the others, and let bSb^{\prime}\in S be some bid profile. Then, (fv(b))v=argmaxbvSv{Φ(bv;bv)D(bvbv)}(f_{v}(b^{\prime}))_{v}=\arg\max_{b_{v}\in S_{v}}\{\ell_{\Phi}(b_{v};b^{\prime}_{v})-D(b_{v}\|b^{\prime}_{v})\}.

Proof.

By adding and removing constants that do not change the maximizer of the expression on the right hand side, we obtain that the maximizer is exactly the proportional response update rule:

argmaxbvSv{Φ(bv;bv)D(bvbv)}=argmaxbvSv{Φ(bv)+bvΦ(bv)(bvbv)D(bvbv)}=argmaxbvSv{bvΦ(bv)bvD(bvbv)}=argmaxbvSv{bvΦ(bv)bvD(bvbv)ivBiln(uiBi)}.\begin{split}\arg\max_{b_{v}\in S_{v}}\{\ell_{\Phi}(b_{v};b^{\prime}_{v})-D(b_{v}\|b^{\prime}_{v})\}&=\arg\max_{b_{v}\in S_{v}}\{\Phi(b^{\prime}_{v})+\nabla_{b_{v}}\Phi(b^{\prime}_{v})(b_{v}-b^{\prime}_{v})-D(b_{v}\|b^{\prime}_{v})\}\\ &=\arg\max_{b_{v}\in S_{v}}\{\nabla_{b_{v}}\Phi(b^{\prime}_{v})b_{v}-D(b_{v}\|b^{\prime}_{v})\}\\ &=\arg\max_{b_{v}\in S_{v}}\{\nabla_{b_{v}}\Phi(b^{\prime}_{v})b_{v}-D(b_{v}\|b^{\prime}_{v})-\sum_{i\in v}B_{i}\ln(\frac{u^{\prime}_{i}}{B_{i}})\}.\end{split}

Rearranging the last expression by elements yields the following result,

bvΦ(bv)bvD(bvbv)ivBiln(uiBi)==iv,j[m]bijln(aijpj)iv,j[m]bijln(bijbij)iv,j[m]bijln(uiBi)=iv,j[m]bijln(aijpjbijbijBiui)=iv,j[m]bijln(aijxijuiBibij)=iv,j[m]bijln(bijaijxijuiBi),\begin{split}&\nabla_{b_{v}}\Phi(b^{\prime}_{v})b_{v}-D(b_{v}\|b^{\prime}_{v})-\sum_{i\in v}B_{i}\ln(\frac{u^{\prime}_{i}}{B_{i}})=\\ &=\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{a_{ij}}{p^{\prime}_{j}})-\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{b_{i}j}{b^{\prime}_{ij}})-\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{u^{\prime}_{i}}{B_{i}})\\ &=\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{a_{ij}}{p^{\prime}_{j}}\frac{b^{\prime}_{ij}}{b_{ij}}\frac{B_{i}}{u^{\prime}_{i}})\\ &=\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{\frac{a_{ij}x^{\prime}_{ij}}{u^{\prime}_{i}}B_{i}}{b_{ij}})\\ &=-\sum_{i\in v,j\in[m]}b_{ij}\ln(\frac{b_{ij}}{\frac{a_{ij}x^{\prime}_{ij}}{u^{\prime}_{i}}B_{i}}),\end{split}

which is exactly D(bv(fv(b))v)-D(b_{v}\|(f_{v}(b^{\prime}))_{v}), since (fv(b))ij=aijxijuiBi(f_{v}(b^{\prime}))_{ij}=\frac{a_{ij}x^{\prime}_{ij}}{u^{\prime}_{i}}B_{i} for ivi\in v by definition. That is, our maximization problem is equivalent to argmin{D(bv(fv(b))v)}\arg\min\{D(b_{v}\|(f_{v}(b^{\prime}))_{v})\}. Finally, note that KL divergence is minimized when both of its arguments are identical, and (fv(bv))vSv(f_{v}(b^{\prime}_{v}))_{v}\in S_{v}, the domain of the minimization. ∎

Proof.

(Lemma 1): Let vv\subseteq\mathcal{B} be a subset of players and let bSb\in S be some bid profile. By combining the lemmas proved in this section have that

Φ(fv(b))Φ(fv(b);b)D(fv(b)b)Φ(b;b)D(bb)=Φ(b),\Phi(f_{v}(b))\geq\ell_{\Phi}(f_{v}(b);b)-D(f_{v}(b)\|b)\geq\ell_{\Phi}(b;b)-D(b\|b)=\Phi(b),

where the first inequality is by Lemmas 9 and 10 with the inequality being strict whenever fv(b)bf_{v}(b)\neq b, and the second inequality is by Lemma 11, as fv(b)f_{v}(b) was shown to be the maximizer of this expression over all bSb\in S. ∎

An interesting case to note here is when v=iv={i}. In this case, the lemmas above show that if the players’ bids are btb^{t} and ii is being activated by the adversary, then the best response bids of ii to bitb^{t}_{-i} are the solutions to the optimization problem argmaxbiSi{u~i(bi;bit)D(ppt)}\arg\max_{b_{i}\in S_{i}}\{\ell_{\tilde{u}_{i}}(b_{i};b^{t}_{i})-D(p\|p^{t})\}. On the other hand, the proportional response to bitb^{t}_{-i} is the solution to the optimization problem argmaxbiSi{u~i(bi;bit)D(bibit)}\arg\max_{b_{i}\in S_{i}}\{\ell_{\tilde{u}_{i}}(b_{i};b^{t}_{i})-D(b_{i}\|b^{t}_{i})\}. This can be seen as a relaxation of the former, as proportional response does not increase u~i\tilde{u}_{i} (or equivalently the potential) as much as best response does. However, proportional response is somewhat easier to compute.

Appendix D Convergence of Asynchronous Proportional Response Dynamics

Proof.

(Theorem 1): Denote the distance between a point xx and a set SS as d(x,S)=infxSxxd(x,S)=\inf_{x^{*}\in S}\|x-x^{*}\|. By Theorem 2 we have that the set of potential maximizing bid profiles is identical to the set of market equilibria. Denote this set by MEME and the maximum value of the potential by Φ\Phi^{*}. More specifically, every bMEb^{*}\in ME achieves Φ(b)=Φ\Phi(b^{*})=\Phi^{*}, and Φ\Phi^{*} is achieved only by elements in MEME.

We start with the following lemma.

Lemma 12.

For every ϵ>0\epsilon>0 there exists a δ>0\delta>0 such that Φ(b)>Φδ\Phi(b)>\Phi^{*}-\delta implies d(b,ME)<ϵd(b,ME)<\epsilon.

Proof.

Assume otherwise that for some ϵ0\epsilon_{0} there exist a sequence (bt)(b_{t}) such that Φ(bt)Φ\Phi(b_{t})\rightarrow\Phi^{*} but infbMEbtbϵ0\inf_{b^{*}\in ME}\|b_{t}-b^{*}\|\geq\epsilon_{0} for all tt. Note that for all bMEb^{*}\in ME we have that Φ(b)=Φ\Phi(b^{*})=\Phi^{*} and btbinfbMEbtbϵ0\|b_{t}-b^{*}\|\geq\inf_{b^{\prime}\in ME}\|b_{t}-b^{\prime}\|\geq\epsilon_{0} for all tt. Take a condensation point bb^{**} of this sequence and a subsequence (tj)(t_{j}) that converges to bb^{**}. Thus by our assumption we have Φ(b)=limΦ(btj)=Φ\Phi(b^{**})=\lim\Phi(b_{t_{j}})=\Phi^{*}. For any bMEb^{*}\in ME we have bb=limbtjbϵ0>0\|b^{**}-b^{*}\|=\lim\|b_{t_{j}}-b^{*}\|\geq\epsilon_{0}>0. The former equality must imply bMEb^{**}\in ME, but the latter implies bMEb^{**}\notin ME. ∎

Next, for a subset of players AA\subset\mathcal{B} let fA:[0,1]n[0,1]nf_{A}:[0,1]^{n}\rightarrow[0,1]^{n} be the continuous function where iAi\in A do a proportional response update and the other players play the identity function. (i.e., do not change their bids, see Section 5 in the main text).

By Lemma 1 from the main text we have that (i) For all AA we have that fA(b)=bf_{A}(b)=b if and only if for all iAi\in A it holds that fi(b)=bf_{i}(b)=b; and (ii) Φ(fA(b))>Φ(b)\Phi(f_{A}(b))>\Phi(b) unless fA(b)=bf_{A}(b)=b.

Definition.

The stable set of bb^{**} is defined to be S(b)={i|fi(b)=b}S(b^{**})=\{i|f_{i}(b^{**})=b^{**}\}.

A corollary (i) and (ii) above is that if AS(b)A\subseteq S(b^{**}) then fA(b)=bf_{A}(b^{**})=b^{**}, but if AS(b)A\setminus S(b^{**})\neq\emptyset then Φ(fA(b))>Φ(b)\Phi(f_{A}(b^{**}))>\Phi(b^{**}).

Lemma 13.

: Let Φ(b)<Φ\Phi(b^{**})<\Phi^{*}. Then there exists δ>0\delta>0 such that for every bbδ\|b-b^{**}\|\leq\delta and every AS(b)A\setminus S(b^{**})\neq\emptyset we have that Φ(fA(b))>Φ(b)\Phi(f_{A}(b))>\Phi(b^{**}).

Proof.

Fix a set AA such that AS(b)A\setminus S(b^{**})\neq\emptyset and let α=Φ(fA(b))Φ(b)>0\alpha=\Phi(f_{A}(b^{**}))-\Phi(b^{**})>0. Since Φ(fA())\Phi(f_{A}(\cdot)) is continuous, there exists δ\delta so that |bb|δ|b-b^{**}|\leq\delta implies Φ(fA(b))Φ(fA(b))<α\Phi(f_{A}(b^{**}))-\Phi(f_{A}(b))<\alpha and thus Φ(fA(b))>Φ(b)\Phi(f_{A}(b))>\Phi(b^{**}). Now take the minimum δ\delta for all finitely many AA. ∎

Lemma 14.

Let Φ(b)<Φ\Phi(b^{**})<\Phi^{*} and let FF be a finite family of continuous functions such that for every fFf\in F we have that f(b)=bf(b^{**})=b^{**}. Then there exists ϵ>0\epsilon>0 such that for every bb such that bbϵ\|b-b^{**}\|\leq\epsilon and every fFf\in F and every AS(b)A\setminus S(b^{**})\neq\emptyset we have that Φ(fA(f(b)))>Φ(b)\Phi(f_{A}(f(b)))>\Phi(b^{**}).

Proof.

Fix fFf\in F and let δ\delta be as promised by the previous lemma, i.e. for every zbδ\|z-b^{**}\|\leq\delta and every AS(b)A\setminus S(b^{**})\neq\emptyset we have that Φ(fA(z))>Φ(b)\Phi(f_{A}(z))>\Phi(b^{**}). Since f(b)=bf(b^{**})=b^{**} and ff is continuous there exists ϵ>0\epsilon>0 so that bbϵ\|b-b^{**}\|\leq\epsilon implies f(b)f(b)=f(b)bδ\|f(b)-f(b^{**})\|=\|f(b)-b^{**}\|\leq\delta and thus Φ(fA(f(b)))>Φ(b)\Phi(f_{A}(f(b)))>\Phi(b^{**}). Now take the minimum ϵ\epsilon over the finitely many fFf\in F. ∎

Definition.

a sequence of sets AtA_{t}\subseteq\mathcal{B} is called TT-live if for every ii and for every tt there exists some ttt+Tt\leq t^{*}\leq t+T such that iSti\in S_{t^{*}}.

Lemma 15.

Fix a sequence b=(bt)b=(b_{t}) where bt+1=fAt(bt)b_{t+1}=f_{A_{t}}(b_{t}) such that the sequence AtA_{t} is TT-live. There are no condensation points of (bt)(b_{t}) outside of MEME.

Proof.

Assume that exists a condensation point bMEb^{*}*\notin ME and a subsequence that converges to it, then Φ(b)<Φ(b)\Phi(b^{**})<\Phi(b^{*}). Notice that Φ(bt)\Phi(b_{t}) is a non-decreasing sequence and so Φ(bt)Φ(b)\Phi(b_{t})\leq\Phi(b^{**}) for all tt. Let FF be a set of functions achieved by composition of at most TT functions from {fA|AS(b)}\{f_{A}|A\subset S(b^{**})\}. So for every fFf\in F we have that f(b)=bf(b^{**})=b^{**}, while for every BS(b)B\setminus S(b^{**})\neq\emptyset we have that Φ(fB(b))>Φ(b)\Phi(f_{B}(b^{**}))>\Phi(b^{**}). Let ϵ\epsilon be as promised by the previous lemma, i.e., for every bbϵ\|b-b^{**}\|\leq\epsilon and every fFf\in F and every BB such that BS(b)B\setminus S(b^{**})\neq\emptyset we have that Φ(fB(f(b)))>Φ(b)\Phi(f_{B}(f(b)))>\Phi(b^{**}). Since the subsequence converges to bb^{**} there exists tjt_{j} in the subsequence so that btjbϵ\|b_{t_{j}}-b^{**}\|\leq\epsilon. Now let t>tjt>t_{j} be the first time that AtS(b)A_{t}\setminus S(b^{**})\neq\emptyset. Now bt+1=fAt(f(btj))b_{t+1}=f_{A_{t}}(f(b_{t_{j}})), where ff is the composition of all fAf_{A} for the times tjt_{j} to tt. We can now apply the previous lemma to get that Φ(bt+1)=Φ(fAt(f(btj))>Φ(b)\Phi(b_{t+1})=\Phi(f_{A_{t}}(f(b_{t_{j}}))>\Phi(b^{**}), a contradiction. ∎

Lemma 16.

Fix a sequence b=(bt)b=(b_{t}) where bt+1=fAt(bt)b_{t+1}=f_{A_{t}}(b_{t}) such that the sequence AtA_{t} is TT-live. Then, it holds that limtd(bt,ME)=0\lim_{t\rightarrow\infty}d(b_{t},ME)=0.

Proof.

By lemma 1 we have that Φ(bt+1)Φ(bt)\Phi(b^{t+1})\geq\Phi(b^{t}) making the sequence Φ(bt)\Phi(b^{t}) monotone and bounded from above (Φ()\Phi(\cdot) is a bounded function). Hence it converges to some limit LL. Either L=ΦL=\Phi^{*} or L<ΦL<\Phi^{*}. In the former case, the result is immediate by lemma 12 and btb^{t} approaches the set MEME. We show that the latter yields a contradiction. If L<ΦL<\Phi^{*}, this implies that btb^{t} is bounded away from MEME, i.e. there exists ϵ0>0\epsilon_{0}>0 such that for all tt d(bt,ME)ϵ0d(b^{t},ME)\geq\epsilon_{0}. To see why this is true for all tt and not just in the limit, we observe that since the sequence Φ(bt)\Phi(b^{t}) is monotone, if we have d(bT,ME)=0d(b^{T},ME)=0 at some time TT, then we have Φ(bt)=Φ\Phi(b^{t})=\Phi^{*} for all t>Tt>T, which we currently assume is not the case. Therefore, every subsequence of btb^{t} is bounded away from MEME, implying that every condensation point of btb^{t} is not in MEME. The sequence btb^{t} is bounded and therefore has a converging subsequence with a condensation point not in MEME, which is a contradiction to the previous lemma. ∎

Regarding convergence of prices, as stated in section 2, equilibrium prices for each Fisher market are unique and attained by any bid profile bMEb^{*}\in ME, thus, since the prices are a continuous function of the bids, the convergence of bids to this set implies the convergence of prices ptpp^{t}\to p^{*}.

The last lemma concludes our proof of the Theorem 1.

Appendix E Generic Markets

Proof.

(Theorem 5): Assume by way of contradiction that a generic linear Fisher market has two distinct market equilibrium bid profiles bbb^{*}\neq b^{**}. For any market equilibrium bb it must hold that: (1) jibij=pj\forall j\quad\sum_{i}b_{ij}=p^{*}_{j} since equilibrium prices are unique, and (2) ijbij=Bi\forall i\quad\sum_{j}b_{ij}=B_{i} by budget feasibility. As bbb^{*}\neq b^{**}, there exists a pair (i,j)(i,j) with bijbijb^{*}_{ij}\neq b^{**}_{ij}, meaning that buyer ii has a different bid on good jj between bb^{*} and bb^{**}, and so by (1) it must be that exists a buyer kk whose bid on good jj was also changed so that the price pjp^{*}_{j} remains fixed; formally, bkjbkjb^{*}_{kj}\neq b^{**}_{kj}. In such case, by (2) there must be a good \ell for which buyer kk has a different bid as well, since it’s budget BkB_{k} is fixed and fully utilized; formally bkbkb^{*}_{k\ell}\neq b^{**}_{k\ell}. As the graph Γ={𝒢,E}\Gamma=\{\mathcal{B}\cup\mathcal{G},E\} with E={{i,j}|bijbij}E=\{\{i,j\}|b^{\prime}_{ij}\neq b^{*}_{ij}\} is finite, following the process described above while satisfying the constraints (1) and (2) must lead to a cycle in the graph Γ\Gamma.

Finally, we will show that there exists a market equilibrium with a cycle in its corresponding graph. Define b=λb+(1λ)bb^{\prime}=\lambda b^{*}+(1-\lambda)b^{**} for some λ(0,1)\lambda\in(0,1) and note that bb^{\prime} is also market equilibrium as the set of market equilibria is a convex set (see the model section in the main text). Let Γ(b)={𝒢,E(b)}\Gamma(b^{\prime})=\{\mathcal{B}\cup\mathcal{G},E(b^{\prime})\} with E(b)={{i,j}|bij>0}E(b^{\prime})=\{\{i,j\}|b^{\prime}_{ij}>0\} be the corresponding graph of bb^{\prime}. Observe that EE(b)E\subseteq E(b^{\prime}) since if bijbijb^{*}_{ij}\neq b^{**}_{ij} then it must be that bij>0b^{*}_{ij}>0 or bij>0b^{**}_{ij}>0 and in any such case bij>0b^{\prime}_{ij}>0. Thus, the graph Γ(b)\Gamma(b^{\prime}) contains a cycle, contradicting Lemma 2 from the main text ∎

Proof.

(Lemma 2): Assume for the sake of contradiction that exists a cycle CC in Γ(b)\Gamma(b^{*}), w.l.o.g. name the vertices of buyers and goods participating in the cycle in an ascending order; that is, C=b1g1b2g2bk1gkb1C=b_{1}g_{1}b_{2}g_{2}\dots b_{k-1}g_{k}b_{1}, where bib_{i} and gig_{i} represent buyers and goods ii, respectively. Recall that for any market equilibrium if xij>0x^{*}_{ij}>0 then aijpj=ci\frac{a_{ij}}{p^{*}_{j}}=c_{i} for some constant cic_{i} (see the model section in the main text). Applying this to the cycle CC yields the following equations. (1) By considering edges from buyers to goods bigib_{i}\to g_{i} we obtain for i[k1]ai,i=cipii\in[k-1]\quad a_{i,i}=c_{i}p^{*}_{i}, and (2) by considering edges from goods to buyers gibi+1g_{i}\to b_{i+1} we obtain for i[k1]ai+1,i=ci+1pii\in[k-1]\quad a_{i+1,i}=c_{i+1}p_{i}^{*} and the edge closing the cycle yields a1,k=c1pka_{1,k}=c_{1}p_{k}^{*}. Finally, by considering the product of ratios between valuations of buyers participating in the cycle we have the following condition.

a21a11a32a22a43a33ai+1,iai,iak,k1ak1,k1a1,kak,k=c2p1c1p1c3p2c2p2c4p3c3p3ci+1picipickpk1ck1pk1c1pkckpk=c2c1c3c2c4c3ci+1cickck1c1ck=1,\begin{split}\frac{a_{21}}{a_{11}}\frac{a_{32}}{a_{22}}\frac{a_{43}}{a_{33}}\dots\frac{a_{i+1,i}}{a_{i,i}}\dots\frac{a_{k,k-1}}{a_{k-1,k-1}}\frac{a_{1,k}}{a_{k,k}}&=\frac{c_{2}p^{*}_{1}}{c_{1}p^{*}_{1}}\frac{c_{3}p^{*}_{2}}{c_{2}p^{*}_{2}}\frac{c_{4}p^{*}_{3}}{c_{3}p^{*}_{3}}\dots\frac{c_{i+1}p^{*}_{i}}{c_{i}p^{*}_{i}}\dots\frac{c_{k}p^{*}_{k-1}}{c_{k-1}p^{*}_{k-1}}\frac{c_{1}p^{*}_{k}}{c_{k}p^{*}_{k}}\\ &=\frac{c_{2}}{c_{1}}\frac{c_{3}}{c_{2}}\frac{c_{4}}{c_{3}}\dots\frac{c_{i+1}}{c_{i}}\dots\frac{c_{k}}{c_{k-1}}\frac{c_{1}}{c_{k}}\\ &=1,\end{split}

which contradicts the genericity condition. ∎