Asynchronous Proportional Response Dynamics: Convergence in Markets with Adversarial Scheduling
Abstract
We study Proportional Response Dynamics (PRD) in linear Fisher markets, where participants act asynchronously. We model this scenario as a sequential process in which at each step, an adversary selects a subset of the players to update their bids, subject to liveness constraints. We show that if every bidder individually applies the PRD update rule whenever they are included in the group of bidders selected by the adversary, then, in the generic case, the entire dynamic converges to a competitive equilibrium of the market. Our proof technique reveals additional properties of linear Fisher markets, such as the uniqueness of the market equilibrium for generic parameters and the convergence of associated no swap regret dynamics and best response dynamics under certain conditions.
1 Introduction
A central notion in the study of markets is the equilibrium: a state of affairs where no single party wishes to unilaterally deviate from it. The main benefit of focusing on the notion of equilibria lies in what it ignores: how the market can reach an equilibrium (if at all). This latter question is obviously of much interest as well, especially when considering computational aspects,111As we know that finding an equilibrium may be computationally intractable in general. and a significant amount of research has been dedicated to the study of “market dynamics” and their possible convergence to an equilibrium. Almost all works that study market dynamics consider synchronous dynamics.
Synchronous Dynamics: At each time step , all participants simultaneously update their behavior based on the state at time .
Such synchronization is clearly difficult to achieve in real markets, and so one might naturally wonder to what extent full synchrony is needed or whether convergence of market dynamics occurs even asynchronously. There are various possible levels of generality of asynchrony to consider. The simplest model considers a sequential scenario in which, at every time step , an adversary chooses a single participant, and only this participant updates their behavior based on the state at time . The adversary is constrained to adhere to some liveness condition, such as scheduling every participant infinitely often or at least once every steps. In the most general model [59], the adversary may also introduce message delays, leading players to respond to dated information. In this paper, we focus on an intermediate level of permissible asynchrony where updates may occur in an arbitrary asynchronous manner, but message delays are always shorter than the granularity of activation. In our proofs, the adversary’s goal is to select the schedule of strategy updates by the players in a way that prevents the dynamic from converging to a market equilibrium.
Activation Asynchrony:222In [59] this was termed “simultaneous.” Every time step , an arbitrary subset of participants is chosen by the adversary and all of these participants update their behavior based on the state at time . The adversary must adhere to the liveness condition where for every participant some set that includes them must be chosen at least once every consecutive steps.
The market dynamics that we study in this paper are linear Fisher markets [9] with proportional response dynamics (PRD), a model that has received much previous attention [5, 10, 19, 74] and for which synchronous convergence to equilibrium is known. While there are a few asynchronous convergence results known for other dynamics, specifically for tatonnement dynamics [18, 23], there are no such results known for proportional response dynamics, and achieving such results has been mentioned as an open problem in [19, 74].
Fisher Markets with Linear Utilities: There are players and goods. Each player has a budget and each good has, w.l.o.g., a total quantity of one. Buyer ’s utility from getting an allocation is given by , where the parameters are part of the definition of the market. A market equilibrium is an allocation (where ) and a pricing with the following properties. (1) Market clearing: for every good it holds that ; (2) budget feasibility: for every player it holds that ; and (3) utility maximization: for every player and every alternative allocation with we have that .
Proportional Response Dynamics: At each time step , each player will make a bid for every good , where . In the first step, the bid is arbitrary. Once bids for time are announced, we calculate and allocate the goods proportionally: , providing each player with utility . At this point, player updates its bids for the next step by bidding on each good proportionally to the utility obtained from that good: .
From the perspective of the player, proportional response updates can be thought of as a simple parameter-free online learning heuristic, with some similarity to regret-matching [43] in its proportional updates, but one that considers the utilities directly, rather than the more sophisticated regret vector loss.
It is not difficult to see that a fixed point of this proportional response dynamic is indeed an equilibrium of the Fisher market. Significantly, it was shown in [5] that this dynamic does converge, in the synchronous model, to an equilibrium. As mentioned, the question of asynchronous convergence was left open. We provide the first analysis of proportional response dynamics in the asynchronous setting, and provide a positive answer to this open question in our “intermdiate” level of asynchrony.
Theorem 1.
For linear Fisher markets, proportional response dynamics with adversarial activation asynchrony, where each player is activated at least once every steps, approach the set of market equilibrium bid profiles. The prices in the dynamics converge to the unique equilibrium prices.
The dynamics approaching the set of equilibria means that the distance between the bids at time and the set of market equilibrium bid profiles converges to zero as . Additionally, in Section 7, we show that for generic parameters (i.e., except for measure zero of possible ’s), the market equilibrium bid profile is unique, and thus Theorem 1 implies convergence of the bids to a point in the strategy space. We do not know whether the genericity condition is required for asynchronous convergence of the bids to a point, and we leave this as a minor open problem. We did not analyze the rate of convergence to equilibrium; we leave such analysis as a second open problem. Our main open problem, however, is the generalization to full asynchrony with message delays.
Open Problem: Does such convergence occur also in the full asynchronous model where the adversary may introduce arbitrary message delays?
Our techniques rely on considering an associated game obtained by using “modified” utility functions for each of the players: . We show that a competitive market equilibrium (with the original utility functions) corresponds to a Nash equilibrium in the associated game.333It is worthwhile to emphasize, though, that a competitive market equilibrium is not a Nash equilibrium in the original market, since the players are price takers rather than fully rational. See Section 3. These modified utility functions are an adaptation to an individual utility of a function that was proposed in [65] as an objective for a convex program for equilibrium computation.444Notice that is not the sum of the ’s, as the second term appears only once. This function was first linked with proportional response dynamics in [5] where it was proven that synchronous proportional response dynamics act as mirror descent on this function. We show that serves as a potential in our associated game.
Theorem 2.
The following three sets of bid profiles are identical: (1) the set of pure strategy Nash equilibria of the associated game, (2) the set of market equilibria of the Fisher market, and (3) the maximizing set of the potential function .
The technical core of our proof is to show that not only does a synchronized proportional response step by all the players increase the potential function but, in fact, every proportional response step by any subset of the players increases this potential function.
The point of view of market equilibria as Nash equilibria of the associated game offers several other advantages, e.g., suggesting several other dynamics that are related to proportional response that can be of interest. For example, we show that letting players best-respond in the associated game corresponds to the limit of a sequence of proportional response steps by a single player, but can be implemented as a single step of such a best-response, which can be computed efficiently by the players and may converge faster to the market equilibrium. Another possibility is using some (internal) regret-minimization dynamics (for the associated game), which would also converge to equilibrium in the generic case since, applying [56], it is the unique correlated Equilibrium as well.
The structure of the rest of the paper is as follows. In Section 2, we provide further formal details and notations that will be useful for our analysis. In Section 3, we present the associated game and its relation to the competitive equilibria of the market. In Section 4, we study best response dynamics in the associated game and their relation to PRD. In Section 5, we present a key lemma regarding the potential function of the associated game under bid updates by subsets of the players. Then, in Section 6 we complete our proof of convergence for asynchronous PRD. In Section 7, we show the uniqueness of the market equilibrium for generic markets. In Section 8, we provide simulation results that compare the convergence of proportional response dynamics with best response dynamics in the associated game in terms of the actual economic parameters in the market, namely, social welfare and the convergence of the bid profiles. Finally, in Section 9, we conclude and discuss the limitations of our technique and open questions. All proofs in this paper are deferred to the appendix.
1.1 Further Related Work
Proportional response dynamics (PRD) were originally studied in the context of bandwidth allocation in file-sharing systems, where it was proven to converge to equilibrium, albeit only for a restrictive setting [71]. Since then, PRD has been studied in a variety of other contexts, including Fisher markets, linear exchange economies, and production markets. See [10] for further references. In Fisher markets, synchronous PRD has been shown to converge to market equilibrium for Constant Elasticity of Substitution (CES) utilities in the substitutes regime [74]. For the linear Fisher market setting, synchronous PRD was explained in [5] as mirror descent on a convex program, previously discovered while developing an algorithm to compute the market equilibrium [65], and later proven to be equivalent to the famous Eisenberg-Gale program [22]. By advancing the approach of citebirnbaum2011distributed, synchronous PRD with mild modifications was proven to converge to a market equilibrium for CES utilities in the complements regime as well [19]. In linear exchange economies, synchronous PRD has been shown to converge to equilibrium in the space of utilities and allocations while prices do not converge and may cycle, whereas for a damped version of PRD, also the prices converge [11]. In production markets, synchronous PRD has been shown to increase both growth and inequalities in the market [13]. PRD has also been proven to converge with quasi-linear utilities [38], and to remain close to market equilibrium for markets with parameters that vary over time [20]. All the above works consider simultaneous updates by all the players, and the question of analyzing asynchronous dynamics and whether they converge was raised by several authors as an open problem [19, 74].
Asynchronous dynamics in markets have been the subject of study in several recent works. However, these works examine different models and dynamics than ours, and to our knowledge, our work presents the first analysis of asynchronous proportional response bidding dynamics. In [23], it is shown that tatonnement dynamics under the activation asynchrony model converge to equilibrium, with results for settings both with and without carryover effects between time units. A later work, [18], showed that tatonnement price dynamics converge to a market equilibrium under a model of sequential activation where in every step a single agent is activated, and where additionally, the information available to the activated seller about the current demand may be inaccurate within some interval that depends on the range of past demands. A different approach taken in [29] assumes that each seller has a set of rules affected by the other players’ actions and governing its price updates; it is shown that the dynamics in which sellers update the prices based on such rules converge to a unique equilibrium of prices in the activation asynchrony model.
Online learning with delayed feedback has been studied for a single agent in [61], which established regret bounds that are sub-linear in the total delay. A different approach, presented in [52], extends online gradient descent by estimating gradients when the reward of an action is not yet available due to delays. When considering multiple agents, [75] extended the results of [61]. They demonstrated that in continuous games with particular stability property, if agents experience equal delays in each round, online mirror descent converges to the set of Nash equilibria and a modification of the algorithm converges even when delays are not equal.
Classic results regarding the computation of competitive equilibria in markets mostly consider centralized computation and vary from combinatorial approaches using flow networks [26, 27, 28, 45], interior point [72], and ellipsoid [21, 44] methods, and many more [3, 25, 26, 30, 39, 40, 63]. Eisenberg and Gale devised a convex program which captures competitive equilibria of the Fisher model as its solution [31]. Notable also is the tatonnement model of price convergence in markets dated back to Walras [70] and studied extensively from Arrow [1] and in later works.
More broadly, in the game theoretic literature, our study is related to a long line of work on learning in games, starting from seminal works in the 1950s [6, 14, 42, 62], and continuing to be an active field of theoretical research [24, 35, 49, 60, 66], also covering a wide range of classic economic settings including competition in markets [7, 54], bilateral trade [16, 32], and auctions [2, 34, 48], as well as applications such as blockchain fee markets [50, 51, 57] and strategic queuing systems [36, 37]. For a broad introduction to the field of learning in games, see [17, 43]. The vast majority of this literature studies repeated games under the synchronous dynamics model. Notable examples of analyses of games with asynchronous dynamics are [46], which study best response dynamics with sequential activation, and [59], which explore best response dynamics in a full asynchrony setting which includes also information delays, and show that in a class of games called max-solvable, convergence of best response dynamics is guaranteed. Our analysis of best response dynamics in Section 4 takes a different route, and does not conclude whether the associated game that we study is max-solvable or not; such an analysis seems to require new ideas.
Our work is also related to a large literature on asynchronous distributed algorithms. We refer to a survey on this literature [41]. The liveness constraint that we consider in the dynamics555Intuitively, if one allows some of the parameters in the dynamic not to update, these parameters become irrelevant, as they will remain frozen, and thus one cannot hope to see any convergence of the entire system. is related to those, e.g., in [4, 67]. Recent works that are conceptually more closely related are [15, 69, 73], which propose asynchronous distributed algorithms for computing Nash equilibria in network games. Notably, [15] propose an algorithm that converges to an equilibrium in a large class of games in asynchronous settings with information delays. Their approach, however, does not capture proportional response dynamics and does not apply to our case of linear Fisher markets.
2 Model and Preliminaries
The Fisher market: We consider the classic Fisher model of a networked market in which there is a set of buyers and a set of divisible goods . We denote the number of buyers and number of goods as , , respectively, and index buyers with and goods with . Buyers are assigned budgets and have some value666 For the ease of exposition, our proofs use w.l.o.g. . This is since in all cases where might have any implication on the proof, such as , these expressions are multiplied by zero in our dynamics. for each good . Buyers’ valuations are normalized such that . It is convenient to write the budgets as a vector and the valuations as a matrix , such that are the parameters defining the market. We denote the allocation of goods to buyers as a matrix where is the (fractional) amount of good that buyer obtained. We assume w.l.o.g. (by proper normalization) that there is a unit quantity of each good. The price of good (which depends on the players’ actions in the market, as explained below) is denoted by and prices are listed as a vector . Buyers have a linear utility function with the budget constraint . We assume w.l.o.g. that the economy is normalized, i.e., .
Market equilibrium: The competitive equilibrium (or “market equilibrium”) is defined in terms of allocations and prices as follows.
Definition 1.
(Market Equilibrium): A pair of allocations and prices is said to be market equilibrium if the following properties hold:
-
1.
,
-
2.
,
-
3.
.
In words, under equilibrium prices all the goods are allocated, all budgets are used, and no player has an incentive to change their bids given that the prices remain fixed. Notice that this notion of equilibrium is different from a Nash equilibrium of the game where the buyers select their bids strategically, since in the former case, players do not consider the direct effect of possible deviation in their bids on the prices. We discuss this further in Section 3. For linear Fisher markets, it is well established that competitive equilibrium utilities and prices are unique, equilibrium allocations are known to form a convex set, and the following conditions are satisfied.
This is a detailed characterization of the equilibrium allocation: every buyer gets a bundle of goods in which all goods maximize the value per unit of money. The quantity is informally known as “bang-per-buck” (ch. 5 & 6 in [58]), the marginal profit from adding a small investment in good .
Market equilibrium bids are also known to maximize the Nash social welfare function (see [31]) and to be Pareto efficient, i.e., no buyer can improve their utility without making anyone else worse off (as stated in the first welfare theorem).
The trading post mechanism and the market game (Shapley-Shubik): First described in [64] and studied under different names [33, 47], the trading post mechanism is an allocation and pricing mechanism which attempts to capture how a price is modified by demand. Buyers place bids on goods, where buyer places bid on good . Then, the mechanism computes the good’s price as the total amount spent on that good and allocates the good proportionally to the bids, i.e., for bids :
Note that the trading post mechanism guarantees market clearing for every bid profile in which all goods have at least one buyer who is interested in buying. The feasible bid set of a buyer under the budget constraint is , i.e., a scaled simplex. Denote and . Considering the buyers as strategic, one can define the market game as where the utility functions can be written explicitly as . We sometimes use the notation , where is the bid vector of player and denotes the bids of the other players.
Potential function and Nash equilibrium: For completeness, we add the following definitions. Potential function: A function is an exact potential function[53] if and we have that , with being ’s utility function in the game. Best response: is a best response to if . That is, no other response of can yield a higher utility. Nash equilibrium: is Nash equilibrium if is a best response to (no player is incentivized to change their strategy).
Proportional response dynamics: As explained in the introduction, the proportional response dynamic is specified by an initial bid profile , with whenever , and the following update rule for every player that is activated by the adversary: . See Section 5 for further details on activation of subsets of the players.
3 The Associated Game
As mentioned above, the Fisher market can be naturally thought of as a game in which every one of the players aims to optimize their individual utility (see Section 2 for the formal definition). However, it is known that the set of Nash equilibria of this game does not coincide with the set of market equilibria [33, 64], and so a solution to this game (if indeed the players reach a Nash equilibrium) is economically inefficient [12].
A natural question that arises is whether there is some other objective for an individual player that when maximized by all the players, yields the market equilibrium. We answer positively to this question and show that there is a family of utility functions such that in the “associated games” with these utilities for the players, the set of Nash equilibria is identical to the set of market equilibria of the original game (for further details, see also Appendix A).
However, the fact that a Nash equilibrium of an associated game is a market equilibrium still does not guarantee that the players’ dynamics will indeed reach this equilibrium. A key element in our proof technique is that we identify, among this family of associated games, a single game, defined by the “associated utility” , which admits an exact potential. We then use a relation which we show between this game and the proportional response update rule to prove the convergence of our dynamics (Theorem 1).
Definition 2.
(The Associated Game): Let be a market game. Define the associated utility of a player as . The associated game is the game with the associated utilities for the players and the same parameters as in .
Theorem 3.
For every Fisher market, the associated game admits an exact potential function that is given by777Since we discuss the players’ associated utilities, we consider maximization of this potential. Of course, if the reader feels more comfortable with minimizing the potential, one can think of the negative function. .
is constructed such that the function is its potential. Note that although having similar structure, and differ via summation on only in the first term ( is not the sum of the players’ utilities).
Once having the potential function defined, the proof is straightforward: the derivatives of the utilities and the potential with respect to are equal for all . Theorem 2, formally restated below, connects between the associated game, the market equilibria and the potential.
Theorem.
The proof uses a different associated game that has simpler structure than , but does not have an exact potential, and shows that: (i) Nash equilibria of are the market equilibria; (ii) all the best responses of players to bid profiles in are the same as those in ; and (iii) every equilibrium of maximizes the potential (immediate by the definition of potential).
4 Best Response Dynamics
In this section we explore another property of the associated game: we show that if instead of using the proportional response update rule, each player myopically plays their best response to the last bid profile with respect to their associated utility, then the entire asynchronous sequence of bids converges to a market equilibrium, as stated in the following theorem. We then show that there is a close relation between best response and proportional response dynamics.
Theorem 4.
For generic linear Fisher markets in a sequential asynchrony model where in every step a single player is activated, best response dynamics converge to the Market Equilibrium. For non-generic markets the prices are guaranteed to converge to the equilibrium prices.
The idea of the proof is to show that the best-response functions are single valued ( has a unique maximizer) and continuous (using the structure of best-response bids). Together with the existence of the potential function it holds that the analysis of [46] applies for these dynamics with the associated utilities888Note that while the best-reply function with respect to the standard utility function is formally undefined at zero, the associated utility and its best-reply function are well defined and continuous at zero. and thus convergence is guaranteed.
One of the appealing points about proportional response dynamics is their simplicity — in each update, a player observes the obtained utilities and can easily compute the next set of bids. We show that also the best response of a player can be computed efficiently by reducing the calculation to a search over a small part of the subsets of all goods which can be solved by a simple iterative process.
Proposition 1.
For every player and any fixed bid profile for the other players, the best response of is unique and can be computed in polynomial time.
Roughly, best responses are characterized uniquely by a one-dimensional variable . For every subset of goods we define a variable and prove that is the maximum amongst all . So finding is equivalent to searching a specific subset with maximal . The optimal subset of goods admits a certain property that allows to narrow down the search domain from all subsets to only subsets.
The relation between the best response and proportional response updates can intuitively be thought of as follows. While in PRD players split their budget between all the goods according the utility that each good yields, and so gradually shift more budget to the more profitable subset of goods, best response bids of player with respect to can be understood as spending the entire budget on a subset of goods which, after bidding so (considering the effect of bids on prices), will jointly have the maximum bang-per-buck (in our notation ) amongst all subsets of goods, given the bids of the other players. Those bids can be regarded as “water-filling” bids as they level the bang-per-buck amongst all goods purchased by player (for a further discussion see the appendix).
It turns out that there is a clear formal connection between the best response of a player in the associated game and the proportional response update rule in the true game: the best response bids are the limit point of an infinite sequence of proportional response updates by the same player when the bids of the others are held fixed, as expressed in the following proposition.
Proposition 2.
Fix any player and fix any bid profile for the other players. Let and let be a sequence of consecutive proportional response steps applied by player , where is held fixed at all times . Then .
5 Simultaneous Play by Subsets of Agents
In this section, we shift our focus back to proportional response dynamics under the activation asynchrony model in which the adversary can choose in every step any subset of players to update their bids. Towards proving that proportional response dynamics converges to a market equilibrium in this setting, we utilize the associated game and potential function presented in Section 3 to show that any activated subset of players performing a PRD step will increase the potential. Formally, let be a subset of players activated by the adversary and let be a function that applies proportional response to members of and acts as the identity function for all the other players. The update for time when the adversary activates a subset of the players is therefore:
Lemma 1.
For all and for all it holds that , unless .
The proof shows that for any subset of players , a PRD step is the solution to some maximization problem of a function different from , such that .
Notable to mention is the sequential case where all subsets are singletons, i.e., for all , for some . In that case, the above result yields that the best-response bids can be expressed as the solution to an optimization problem over the bids on a function that is monotone in the KL divergence between the prices induced by and the current prices, whereas PRD is the solution to an optimization problem on a similar function, but one that depends on the KL divergence between the bids and the current bids. Thus, sequential PRD can be regarded as a relaxation of best response; on the one hand, it is somewhat simpler to compute a step, and on the other hand, it takes more steps to reach the best response (see Proposition 2 and the simulations in Section 8).
6 Convergence of Asynchronous Proportional Response Dynamics
With the results from the previous sections under our belts (namely, the associated game, Theorems 2, 3 about its potential and equilibria, and Lemma 1 about updates by several players simultaneously), we are now ready to complete the proof of Theorem 1 on the convergence of asynchronous proportional response dynamics. We explain here the idea of the proof. The full proof is in the appendix.
Proof idea of Theorem 1: Our starting point is that we now know that Proportional Response Dynamics (PRD) steps by subsets of players increase the potential. Therefore, the bids should somehow converge to reach the maximum potential, which is obtained only at the set of market equilibria. Technically, since the sequence of bids is bounded, it must have condensation points. The proof then proceeds by way of contradiction. If the sequence does not converge to the set of equilibrium bid profiles, , then there is some subsequence that converges to a bid profile outside of this set, which by Theorem 2, must achieve a lower potential than any (since it is not a market equilibrium, and recall that only market equilibria maximize the potential function).
From this point, the main idea is to show that if the dynamic preserves a “livness” property where the maximum time interval between consecutive updates of a player is bounded by some constant , then the dynamic must reach a point where the bids are sufficiently close to such that there must be some future update by some subset of the players under which the potential increases to more than , contradicting the existence of condensation points other than market equilibria (note that the sequence of potential values is increasing in ). To show this, the proof requires several additional arguments on the continuity of compositions of PRD update functions that arise under adversarial scheduling, and the impact of such compositions on the potential function. The full proof is in the appendix.
7 Generic Markets
Here we show that in the generic case, linear Fisher markets have a unique equilibrium bid profile. While it is well known that in linear Fisher markets equilibrium prices and utilities are unique, and the equilibrium bids and allocations form convex sets (see section 2), we show that multiplicity of equilibrium bid profiles can result only from a special degeneracy in the market parameters that has measure zero in the parameter space. In other words, if the market parameters are not carefully tailored to satisfy a particular equality (formally described below), or, equivalently, if the parameters are slightly perturbed, the market will have a unique equilibrium. Similar property was known for linear exchange markets [8] and we present a simple and concise proof for the Fisher model.
Definition 3.
A Fisher market is called generic if the non-zero valuations of the buyers do not admit any multiplicative equality. That is, for any distinct and non empty it holds that .
Theorem 5.
Every generic linear fisher market has a unique market equilibrium bid profile .
Before discussing the proof of Theorem 5, we have the following corollaries.
Corollary 1.
For generic linear Fisher markets, proportional response dynamics with adversarial activation asynchrony, where each player is activated at least once every steps, converge to the unique market equilibrium.
The main theorem, Theorem 1, suggests that proportional response dynamics with adversarial activation asynchrony converges to the set of market equilibria. Since the previous theorem guarantees that a generic market has a unique market equilibrium bid profile, it is clear the dynamics converges to that bid profile.
Corollary 2.
In generic linear Fisher markets, no-swap regret dynamics in the associated game converge to the market equilibrium.
This follows from [56], who showed that in games with convex strategy sets and a continuously differentiable potential function (as in our case), the set of correlated equilibria consists of mixtures of elements in . Theorem 2 yields that , and so there is a unique correlated equilibrium, which is the market equilibrium, and we have that no-swap regret guarantees convergence to it.
To prove Theorem 5, we use a representation of the bids in the market as a bipartite graph of players and goods with and . The proof shows that if a market has more than one equilibrium bid profile, then there has to be an equilibrium with containing a cycle, whereas the following lemma forbids this for generic markets.
Lemma 2.
If are equilibrium bids in a generic linear Fisher market, then has no cycles.
A key observation for proving this lemma is that at a market equilibrium, for a particular buyer , the quantity is constant amongst goods purchased, and so it is possible to trace a cycle and have all the cancel out and obtain an equation contradicting the genericity condition.
An observation that arises from Lemma 2 is that when the number of buyers in the market is of the same order of magnitude as the number of goods or larger, then, in equilibrium, most buyers will only buy a small number of goods. Since there are no cycles in and there are vertices, there are at most edges. Thus, with buyers, the average degree of a buyer is .
8 Simulations
Next, we look at simulations of the dynamics that we study and compare the convergence of proportional response dynamics to best response dynamics in the associated game, as discussed in Section 4. The metrics we focus on here for every dynamic are the Nash social welfare [55], which, as mentioned in Section 2, is maximized at the market equilibrium, and the Euclidean distance between the bids at time and the equilibrium bids. Additionally, we look at the progression over time of the value of the potential (for the definition, see Section 3).
Figure 1 presents simulations of an ensemble of markets, each with ten buyers and ten goods. The parameters in each market (defined in the matrices and ) are uniformly sampled, ensuring that the genericity condition (defined in Section 2) holds with probability one. These parameters are also normalized, as explained in Section 2. For each market, the parameters remain fixed throughout the dynamics. The initial condition in all simulation runs is the uniform distribution of bids over items, and the schedule is sequential, such that a single player updates their bids in each time step.
Figure 1(a) (main plot) show our metrics for PRD averaged over a sample of such simulations. The insets show the plots of a random sample of individual simulations (without averaging) over a longer time period. Figure 1(b) show similar plots for best response dynamics.
As could be expected based on our analysis in Section 4, best response dynamics converge faster than PRD, as can be seen in the different time scales on the horizontal axes. A closer look at the individual bid dynamics depicted in the insets reveals a qualitative difference between the two types of dynamics: in PRD, the bids in each dynamic smoothly approach the equilibrium profile, whereas best response bid dynamics are more irregular. Additionally, the collection of curves for the individual simulations shows that under uniformly distributed market parameters, both dynamics exhibit variance in convergence times, with a skewed distribution. In most markets, the dynamics converge quickly, but there is a distribution tail of slower-converging dynamics.



9 Conclusion
We have shown that proportional response bid dynamics converge to a market equilibrium in a setting where the schedule of bid updates can be chosen adversarially, allowing for sequential or simultaneous updates for any subset of players. We proposed a novel approach to address this problem by identifying a family of associated games related to proportional response dynamics, showing their relation to the competitive equilibria of the market, and leveraging these relations to prove convergence of the dynamics. En route, we showed that other types of dynamics, such as myopic best response and no swap regret, also converge in the associated game. Additionally, we note that our result on the uniqueness of market equilibria in the generic case (e.g., if the market parameters have some element of randomness) may also be of interest for future research in the Fisher market setting.
One main open question that we did not analyze is whether proportional response dynamics converge under the full asynchrony model, which includes information delays. The analysis of this model raises several complications, as it creates further coupling between past and current bid profiles. We conjecture that if information delays are bounded, then convergence also occurs in this model. However, it is not clear whether our approach could be extended to argue that proportional response updates by subsets of players with respect to delayed information increase the potential in our associated game, or whether proving convergence in this setting will require new methods. One limitation of our analysis is that we provide a guarantee that under any bid update by any subset of players chosen by an adversary, the potential function of the associated game increases, but our technique does not specify by how much the potential increases in every step, and therefore, we do not provide speed of convergence results. Such analysis seems to require new techniques, and we see this as an interesting open problem for future work.
Acknowledgments
This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (grant agreement no. 740282) and by a grant from the Israeli Science Foundation (ISF number 505/23). During part of this project, Yoav Kolumbus has been affiliated with the Hebrew University of Jerusalem.
References
- [1] Kenneth J Arrow, Henry D Block, and Leonid Hurwicz. On the stability of the competitive equilibrium, ii. Econometrica: Journal of the Econometric Society, pages 82–109, 1959.
- [2] Santiago R Balseiro and Yonatan Gur. Learning in repeated auctions with budgets: Regret minimization and equilibrium. Management Science, 65(9):3952–3968, 2019.
- [3] Xiaohui Bei, Jugal Garg, and Martin Hoefer. Ascending-price algorithms for unknown markets. ACM Transactions on Algorithms (TALG), 15(3):1–33, 2019.
- [4] Dimitri P Bertsekas. Distributed asynchronous computation of fixed points. Mathematical Programming, 27(1):107–120, 1983.
- [5] Benjamin Birnbaum, Nikhil R Devanur, and Lin Xiao. Distributed algorithms via gradient descent for fisher markets. In Proceedings of the 12th ACM conference on Electronic commerce, pages 127–136, 2011.
- [6] David Blackwell et al. Controlled random walks. In Proceedings of the international congress of mathematicians, volume 3, pages 336–338, 1954.
- [7] Avrim Blum, MohammadTaghi Hajiaghayi, Katrina Ligett, and Aaron Roth. Regret minimization and the price of total anarchy. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 373–382, 2008.
- [8] Jean-Marc Bonnisseau, Michael Florig, and Alejandro Jofré. Continuity and uniqueness of equilibria for linear exchange economies. Journal of Optimization Theory and Applications, 109:237–263, 2001.
- [9] William C Brainard and Herbert E Scarf. How to compute equilibrium prices in 1891. American Journal of Economics and Sociology, 64(1):57–83, 2005.
- [10] Simina Brânzei. Exchange markets: proportional response dynamics and beyond. ACM SIGecom Exchanges, 19(2):37–45, 2021.
- [11] Simina Brânzei, Nikhil Devanur, and Yuval Rabani. Proportional dynamics in exchange economies. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 180–201, 2021.
- [12] Simina Brânzei, Vasilis Gkatzelis, and Ruta Mehta. Nash social welfare approximation for strategic agents. Operations Research, 70(1):402–415, 2022.
- [13] Simina Branzei, Ruta Mehta, and Noam Nisan. Universal growth in production economies. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
- [14] George W Brown. Iterative solution of games by fictitious play. Activity analysis of production and allocation, 13(1):374–376, 1951.
- [15] Carlo Cenedese, Giuseppe Belgioioso, Sergio Grammatico, and Ming Cao. An asynchronous distributed and scalable generalized nash equilibrium seeking algorithm for strongly monotone games. European Journal of Control, 58:143–151, 2021.
- [16] Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, and Stefano Leonardi. Bilateral trade: A regret minimization perspective. Mathematics of Operations Research, 2023.
- [17] Nicolo Cesa-Bianchi and Gábor Lugosi. Prediction, learning, and games. Cambridge university press, 2006.
- [18] Yun Kuen Cheung and Richard Cole. Amortized Analysis of Asynchronous Price Dynamics. In 26th Annual European Symposium on Algorithms (ESA 2018), volume 112 of Leibniz International Proceedings in Informatics (LIPIcs), pages 18:1–18:15, 2018.
- [19] Yun Kuen Cheung, Richard Cole, and Yixin Tao. Dynamics of distributed updating in fisher markets. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 351–368, 2018.
- [20] Yun Kuen Cheung, Martin Hoefer, and Paresh Nakhe. Tracing equilibrium in dynamic markets via distributed adaptation. arXiv preprint arXiv:1804.08017, 2018.
- [21] Bruno Codenotti, Sriram V Pemmaraju, and Kasturi R Varadarajan. On the polynomial time computation of equilibria for certain exchange economies. In SODA, volume 5, pages 72–81, 2005.
- [22] Richard Cole, Nikhil Devanur, Vasilis Gkatzelis, Kamal Jain, Tung Mai, Vijay V Vazirani, and Sadra Yazdanbod. Convex program duality, fisher markets, and nash social welfare. In Proceedings of the 2017 ACM Conference on Economics and Computation, pages 459–460, 2017.
- [23] Richard Cole and Lisa Fleischer. Fast-converging tatonnement algorithms for one-time and ongoing market problems. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 315–324, 2008.
- [24] Constantinos Daskalakis, Maxwell Fishelson, and Noah Golowich. Near-optimal no-regret learning in general games. Advances in Neural Information Processing Systems, 34, 2021.
- [25] Xiaotie Deng, Christos Papadimitriou, and Shmuel Safra. On the complexity of price equilibria. Journal of Computer and System Sciences, 67(2):311–324, 2003.
- [26] Nikhil R Devanur, Christos H Papadimitriou, Amin Saberi, and Vijay V Vazirani. Market equilibrium via a primal–dual algorithm for a convex program. Journal of the ACM (JACM), 55(5):1–18, 2008.
- [27] Nikhil R Devanur and Vijay V Vazirani. An improved approximation scheme for computing arrow-debreu prices for the linear case. In FST TCS 2003: Foundations of Software Technology and Theoretical Computer Science: 23rd Conference, Mumbai, India, December 15-17, 2003. Proceedings 23, pages 149–155. Springer, 2003.
- [28] Ran Duan and Kurt Mehlhorn. A combinatorial polynomial algorithm for the linear arrow–debreu market. Information and Computation, 243:112–132, 2015.
- [29] Krishnamurthy Dvijotham, Yuval Rabani, and Leonard J Schulman. Convergence of incentive-driven dynamics in fisher markets. Games and Economic Behavior, 134:361–375, 2022.
- [30] Federico Echenique and Adam Wierman. Finding a walrasian equilibrium is easy for a fixed number of agents. 2011.
- [31] Edmund Eisenberg and David Gale. Consensus of subjective probabilities: The pari-mutuel method. The Annals of Mathematical Statistics, 30(1):165–168, 1959.
- [32] Kfir Eliaz and Alexander Frug. Bilateral trade with strategic gradual learning. Games and Economic Behavior, 107:380–395, 2018.
- [33] Michal Feldman, Kevin Lai, and Li Zhang. The proportional-share allocation market for computational resources. IEEE Transactions on Parallel and Distributed Systems, 20(8):1075–1088, 2008.
- [34] Zhe Feng, Chara Podimata, and Vasilis Syrgkanis. Learning to bid without knowing your value. In Proceedings of the 2018 ACM Conference on Economics and Computation, EC ’18, page 505–522, New York, NY, USA, 2018. Association for Computing Machinery.
- [35] Dylan J Foster, Noah Golowich, and Sham M Kakade. Hardness of independent learning and sparse equilibrium computation in markov games. arXiv preprint arXiv:2303.12287, 2023.
- [36] Jason Gaitonde and Éva Tardos. Stability and learning in strategic queuing systems. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 319–347, 2020.
- [37] Jason Gaitonde and Éva Tardos. Virtues of patience in strategic queuing systems. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 520–540, 2021.
- [38] Yuan Gao and Christian Kroer. First-order methods for large-scale market equilibrium computation. Advances in Neural Information Processing Systems, 33:21738–21750, 2020.
- [39] Jugal Garg, Ruta Mehta, Milind Sohoni, and Vijay V Vazirani. A complementary pivot (practical) algorithm for market equilibrium under separable, piecewise-linear concave utilities.
- [40] Rahul Garg and Sanjiv Kapoor. Auction algorithms for market equilibrium. In Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 511–518, 2004.
- [41] Felix C Gärtner. Fundamentals of fault-tolerant distributed computing in asynchronous environments. ACM Computing Surveys (CSUR), 31(1):1–26, 1999.
- [42] James Hannan. Approximation to bayes risk in repeated play. In Contributions to the Theory of Games (AM-39), Volume III, pages 97–139. Princeton University Press, 1957.
- [43] Sergiu Hart and Andreu Mas-Colell. Simple adaptive strategies: from regret-matching to uncoupled dynamics, volume 4. World Scientific, 2013.
- [44] Kamal Jain. A polynomial time algorithm for computing an arrow–debreu market equilibrium for linear utilities. SIAM Journal on Computing, 37(1):303–318, 2007.
- [45] Kamal Jain, Mohammad Mahdian, and Amin Saberi. Approximating market equilibria. In 6th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, 2003 and 7th International Workshop on Randomization and Approximation Techniques in Computer Science, 2003. Springer, 2003.
- [46] Martin Kaae Jensen. Stability of pure strategy nash equilibrium in best-reply potential games. University of Birmingham, Tech. Rep, 2009.
- [47] Frank Kelly. Charging and rate control for elastic traffic. European transactions on Telecommunications, 8(1):33–37, 1997.
- [48] Yoav Kolumbus and Noam Nisan. Auctions between regret-minimizing agents. In Proceedings of the ACM Web Conference 2022, pages 100–111, 2022.
- [49] Yoav Kolumbus and Noam Nisan. How and why to manipulate your own agent: On the incentives of users of learning agents. Advances in Neural Information Processing Systems, 35:28080–28094, 2022.
- [50] Stefanos Leonardos, Barnabé Monnot, Daniël Reijsbergen, Efstratios Skoulakis, and Georgios Piliouras. Dynamical analysis of the eip-1559 ethereum fee market. In Proceedings of the 3rd ACM Conference on Advances in Financial Technologies, pages 114–126, 2021.
- [51] Stefanos Leonardos, Daniël Reijsbergen, Daniël Reijsbergen, Barnabé Monnot, and Georgios Piliouras. Optimality despite chaos in fee markets. arXiv preprint arXiv:2212.07175, 2022.
- [52] Panayotis Mertikopoulos, Zhengyuan Zhou, et al. Gradient-free online learning in games with delayed rewards. In International Conference on Machine Learning. International Machine Learning Society (IMLS), 2020.
- [53] Dov Monderer and Lloyd S Shapley. Potential games. Games and economic behavior, 14(1):124–143, 1996.
- [54] Uri Nadav and Georgios Piliouras. No regret learning in oligopolies: Cournot vs. bertrand. In Algorithmic Game Theory: Third International Symposium, SAGT 2010, Athens, Greece, October 18-20, 2010. Proceedings 3, pages 300–311. Springer, 2010.
- [55] John F Nash Jr. The bargaining problem. Econometrica: Journal of the econometric society, pages 155–162, 1950.
- [56] Abraham Neyman. Correlated equilibrium and potential games. International Journal of Game Theory, 26(2):223–227, 1997.
- [57] Noam Nisan. Serial monopoly on blockchains draft–comments welcome. 2023.
- [58] Noam Nisan, Tim Roughgarden, Eva Tardos, and Vijay V Vazirani. Algorithmic game theory. Cambridge university press, 2007.
- [59] Noam Nisan, Michael Schapira, and Aviv Zohar. Asynchronous best-reply dynamics. In Internet and Network Economics: 4th International Workshop, WINE 2008, Shanghai, China, December 17-20, 2008. Proceedings 4, pages 531–538. Springer, 2008.
- [60] Christos Papadimitriou and Georgios Piliouras. From nash equilibria to chain recurrent sets: An algorithmic solution concept for game theory. Entropy, 20(10):782, 2018.
- [61] Kent Quanrud and Daniel Khashabi. Online learning with adversarial delays. Advances in neural information processing systems, 28, 2015.
- [62] Julia Robinson. An iterative method of solving a game. Annals of mathematics, pages 296–301, 1951.
- [63] Herbert E Scarf. The computation of equilibrium prices: an exposition. Handbook of mathematical economics, 2:1007–1061, 1982.
- [64] Lloyd Shapley and Martin Shubik. Trade using one commodity as a means of payment. Journal of political economy, 85(5):937–968, 1977.
- [65] Vadim I Shmyrev. An algorithm for finding equilibrium in the linear exchange model with fixed budgets. Journal of Applied and Industrial Mathematics, 3(4):505, 2009.
- [66] Vasilis Syrgkanis, Alekh Agarwal, Haipeng Luo, and Robert E Schapire. Fast convergence of regularized learning in games. Advances in Neural Information Processing Systems, 28:2989–2997, 2015.
- [67] John Tsitsiklis, Dimitri Bertsekas, and Michael Athans. Distributed asynchronous deterministic and stochastic gradient optimization algorithms. IEEE transactions on automatic control, 31(9):803–812, 1986.
- [68] Mark Voorneveld. Best-response potential games. Economics letters, 66(3):289–295, 2000.
- [69] Mohamed Wahbi and Kenneth N Brown. A distributed asynchronous solver for nash equilibria in hypergraphical games. In Frontiers in Artificial Intelligence and Applications, pages 1291–1299, 2016.
- [70] Léon Walras. Éléments d’économie politique pure: ou, Théorie de la richesse sociale. F. Rouge, 1900.
- [71] Fang Wu and Li Zhang. Proportional response dynamics leads to market equilibrium. In Proceedings of the thirty-ninth annual ACM symposium on Theory of computing, pages 354–363, 2007.
- [72] Yinyu Ye. A path to the arrow–debreu competitive market equilibrium. Mathematical Programming, 111(1-2):315–348, 2008.
- [73] Peng Yi and Lacra Pavel. Asynchronous distributed algorithms for seeking generalized nash equilibria under full and partial-decision information. IEEE transactions on cybernetics, 50(6):2514–2526, 2019.
- [74] Li Zhang. Proportional response dynamics in the fisher market. Theoretical Computer Science, 412(24):2691–2698, 2011.
- [75] Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W Glynn, and Claire Tomlin. Countering feedback delays in multi-agent learning. Advances in Neural Information Processing Systems, 30, 2017.
Appendices
In the following sections we provide the proofs for the results presented in the main text as well as further technical details and discussion.
In the following, we use to denote the gradient of a function with respect to the bids of player only. We use to denote the partial derivative with respect to the bid of player on good , and to denote the second derivative. We denote by as the ‘pre-prices,’ which represent the prices excluding the bids (and so, for every player and every bid profile, ). In some of the proofs, we use the abbreviated notation . All other notations are as defined in the main text.
Appendix A The Associated Game
Proof.
In order to prove Theorem 2, we first define a different associated game denoted that differs from only in having a different associated utility function .
In fact, and a part of a family of associated games of the market game , which have the property that they all share the same best responses to bid profiles (and therefore, also the same Nash equilibria) and for all these games, the function is a best-response potential (see [68] for the definition of best-response potential games). Among this family of games, we are particularly interested in the games and since the former admits as an exact potential, and the latter has a particularly simple derivative for its utility , which has a clear economic interpretation: . This is simply the bang-per-buck of player from good (see the model section in the main text).
Next, we present several technical lemmas that will assist us in proving Theorem 2 and which will also be useful in our proofs later on.
Lemma 3.
For any player and fixed , both and are strictly concave in .
Proof.
We will show the proof for . We compute the Hessian and show that it is negative definite. The diagonal elements are
and all of the off-diagonal elements are
Therefore, the Hessian is a diagonal matrix with all of its elements being negative, and thus, is strictly concave. The same argument works for as well. ∎
Lemma 4.
Fix a player and any bid profile of the other players, then the following two facts hold.
-
1.
if and only if it holds that , where .
-
2.
if and only if it holds that , where .
Proof.
We will show the proof for (1), and the proof for (2) is similar. Let be a best response to and let be some other strategy. Consider the restriction of to the line segment as follows; define for where . As is strictly concave and is the unique maximizer of , it holds that is strictly concave and monotone increasing in . Therefore the derivative of must satisfy at the maximum point that . This is explicitly given by
Therefore, when substituting in the derivative we get , and
which implies , as required.
To complete the second direction of the proof of (1), consider for which the expression stated in right hand side of (1) is true for all . Then, fix any and again consider the restriction of to . By direct calculation, as before but in the inverse direction, it holds that , and as and are strictly concave, it thus must be that is monotone decreasing in . Thus, for all we have . This must mean that is the maximizer of since for all , implies that is monotone increasing and therefore . Finally, note that this holds for any and hence must be a global maximum of .
∎
Lemma 5.
Let , if there exists (the -dimensional simplex) such that it holds that then:
-
1.
for all we have that , and
-
2.
if then .
Proof.
(1) Assume for the sake of contradiction that there exists with then (the “one-hot” vector with 1 at the ’th coordinate and 0 in all other coordinates) yields , a contradiction.
Now, to prove (2), assume for the sake of contradiction that there exists with and . From (1) we have that , summing the strict inequality with the weak ones over all yields , a contradiction. ∎
Lemma 6.
Fix a player and any bid profile of the other players, then the following properties of best-response bids hold in the modified games and .
-
1.
The support set of , defined as , is equal to the set , and for every we have that .
-
2.
Best-response bids with respect to the utilities and are equal and unique. That is, in the definition from Lemma 4 we have (denoted simply as ).
-
3.
Best-response bids are given by for a unique constant .
Proof.
By Lemma 3, and are strictly concave in for any fixed , and so each admits a unique maximizer. To see that they are equal, we use Lemma 4 and introduce constants to obtain
where and .
For the ease of exposition, we assume that for all . All the results stated below remain valid also when for some .
Proof of (1): Applying Lemma 5 to each of those inequalities (once with and twice with ) and denoting the support sets of as , respectively, we obtain the following. (1) we have and we have . Therefore, bids are positive and while the bids are zero, and , hence . To prove the second inequality, we use the same argument but with , and so we have that .
Proof of (2): We will show that and thus obtain that the vectors are identical. Assume by way of contradiction that , then , i.e., . For all it holds that . Now we sum those inequalities over and extend to the support . By using the subset relation we proved, we obtain a contradiction:
The case where follows similar arguments with inverse roles of . Thus, and , which implies , meaning that the prices are equal as well for all goods purchased. Therefore .
Proof of (3): Finally, observe that for , while otherwise , and . For the bounds on , notice that by definition it is equal to and that , as can receive as little as almost zero (by the definition of the allocation mechanism, if places a bid on a good it will receive a fraction of this good, no matter how tiny) and receive at most (almost) all the goods.
∎
The intuition of the above Lemma 6 is that it shows a property of the structure of best-response bids. If we consider all the goods sorted by the parameter , then the best-response bids are characterized by some value which partitions the goods into two parts: goods that can offer the player a bang-per-buck of value and those that cannot. The former set of goods is exactly the support . When a player increases its bid on some good , the bang-per-buck offered by that good decreases, so clearly, any good with cannot be considered in any optimal bundle. Consider the situation where the player has started spending money on goods with , and that for some goods and we have that , then if the player increases the bid on without increasing the bid on , this means that the bids are not optimal since the player could have received higher bang-per-buck by bidding on . The optimal option is a ‘water-filling‘ one: to split the remaining budget and use it to place bids on both and , yielding equal bang-per-buck for both (as Lemma 6 shows).
With the above lemmas, we are now ready to prove Theorem 2.
Proof.
(Theorem 2): We start by making the following claim.
Claim: The set of Market equilibria is equal to the set of Nash equilibria in the game .
Proof.
By definition, is a Nash equilibrium of if and only if for every it holds that , where by Lemma 3, for any fixed , the bid profile is unique. By Lemma 4, we have that for and any other , if and only if is a market equilibrium (market clearing and budget feasibility hold trivially). That is, the set of Nash equilibria of the game corresponds to the set of market equilibria (i.e., every bid profile which is a market equilibrium must be a Nash equilibrium of , and vice versa). ∎
Then, by Lemma 6, best responses by every player to any bid profile of the other players with respect to and with respect to are the same. Therefore, every Nash equilibrium in one game must be a Nash equilibrium in the other. Thus, we have that Nash equilibria of the game are market equilibria, and vice versa – every market equilibrium must be Nash equilibrium of . Finally, at a Nash equilibrium, no player can unilaterally improve their utility, so no improvement is possible to the potential, and in the converse, if the potential is not maximized, then there exists some player with an action that improves the potential, and so by definition their utility function as well, thus contradicting the definition of a Nash equilibrium. Therefore, we have that every bid profile that maximizes the potential is a Nash equilibrium of and a market equilibrium (and vice versa). ∎
Appendix B Best Response Dynamics
We start with the following characterizations of best-response bids in the games and .
Lemma 7.
Fix and let be ’s best response to with support . Define for every subset . Let be as described in Lemma 6. Then, it holds that for all . Furthermore, if then .
Proof.
Let be a best response to with support . By Lemma 6 we have that . By summing over we obtain that . Rearranging yields which is by definition. Now we prove that . A key observation to the proof is that, by Lemma 6, if then and otherwise .
For a set distinct from we have two cases:
Case (1): . Consider a bid profile that for every good in (if the intersection is not empty) places a bid higher by than and distributes the rest of ’s budget uniformly between all other goods in :
For small enough, we have and the support of is indeed . For every we have and by adding to both sides we obtain ; multiplying both sides by yields (i) , where the equality is by Lemma 6, while for every it holds that by which adding to the left hand side only increases it and implies (ii) . Summing over inequalities (i) and (ii) for all appropriately, we obtain , observe that , and thus by division, we obtain the result: .
Case (2): . In this case, the idea used above can not be applied since adding to every bid would create bids that exceed the budget . As stated above, the equality holds where the sums are taken over all members of , by rearranging we get . For all it holds that and by summing those inequalities for all and adding the equality above we obtain: Rearranging yields the result: .
And so is obtained as the maximum over all , as required. ∎
Lemma 8.
The function which maps to its best response is continuous.
Proof.
By Lemma 6, best-response bids are given by , with support . We wish to show that is a continuous in . We do so by showing that is obtained by a composition of continuous functions. As is a sum of elements from , it suffices to prove continuity in the variable . The expression for is the maximum between zero and a continuous function of , which is continuous in , and so we are left to prove that is continuous in . More specifically, it suffices to show that as defined in Lemma 6 is continuous in .
By Lemma 7, is obtained as the maximum over all functions, where each is a continuous function itself in , and thus is continuous in . ∎
To prove Theorem 1 on the convergence of best-response dynamics we use the following known result (for further details see [46]).
Theorem (Jensen 2009 [46]): Let be a best-reply potential game with single-valued, continuous best-reply functions and compact strategy sets. Then any admissible sequential best-reply path converges to the set of pure strategy Nash equilibria.
Proof.
(Theorem 4): is a potential game, which is a stricter notion than being a best-reply potential game (i.e., every potential game is also a best-reply potential game). By Lemma 6, best replies are unique, and so the function is single valued. Furthermore, Lemma 8 shows that it is also a continuous function. By definition, for every the strategy set is compact, and so their product is compact as well. Note that while the best-reply function with respect to the standard utility function is formally undefined when the bids of all other players on some good are zero, what we need is for the best-reply function to the associated utility to be continuous, and this is indeed the case. Admissibility of the dynamics is also guaranteed by the liveness constraint on adversarial scheduling of the dynamics, and thus by the theorem cited above, best-reply dynamics converges to the set of Nash equilibria of . Since every element in this set is market equilibrium (by Theorem 2) and equilibrium prices are unique (see the model section in the main text), we have that any dynamic of the prices are guaranteed to converge to equilibrium prices. Furthermore for a generic market there is a unique market equilibrium (by Theorem 5) and convergence to the set in fact means convergence to the point , the market-equilibrium bid profile. ∎
Proof.
(Proposition 1): Fix a player , fix any bid profile of the other players and let be ’s best response to , by Lemma 6, for being a unique constant. We present a simple algorithm which computes and has a run-time of .
To see that this process indeed reaches , assume w.l.o.g. that the goods are sorted by in a descending order. For ease of exposition, assume for all ; the case with for some goods is similar. By Lemma 6 we have . And so, if and then , since in this case . Therefore, must be one of the following sets: . By Lemma 7 we have . For any set mentioned, the algorithm computes and finds the maximal among all such , and therefore it finds .
As for the running time of the algorithm, it is dominated by the running time of the sorting operation which is . ∎
After proving that the best response to a bid profile can be computed efficiently, we can prove now that proportional response, applied by a single player while all the other players’ bids are held fix, converges in the limit to that best response.
Proof.
(Proposition 2): Fix a player and fix any bid profile of the other players, let be the best response of to with support and let be a sequence of consecutive proportional responses made by . That is, . We start the proof with several claims proving that any sub-sequence of cannot converge to any fixed point of other than . After establishing this, we prove that the sequence indeed converges to .
Claim 1: Every fixed point of Proportional Response Dynamic has equal ‘bang-per-buck‘ for all goods with a positive bid. That is, if is a fixed point of then for every good with , where is the utility achieved for with the bids .
Proof: By substituting into the PRD update rule, we have
∎
Claim 2: The following properties of hold.
-
1.
Except , there are no other fixed points of with a support that contains the support of . Formally, there are no fixed points of with support such that .
-
2.
The bids achieve a higher utility in the original game , denoted , than any other fixed point of Proportional Response Dynamics. Formally, let be any fixed point other than , with utility in the original game , then .
Proof: Let be any fixed point of . By the previous claim it holds that whenever . Multiplying by yields . Summing over with and rearranging yields as defined in Lemma 7 with support . By that lemma, we have that for any set distinct from . Thus, we have that for being a fixed point of other than with support and utility value .
Assume for the sake of contradiction that . If then . By Claim 1 for every such the following inequality holds,
implying that . Subtracting from both sides yields . Summing over yields a contradiction:
where the first inequality is as explained above, and the last by the strict set containment .
Finally, as there are no fixed points with support containing , by Lemma 7, the inequality stated above is strict, that is and so .
∎
Claim 3: If is a fixed point of then is not a limit point of any sub-sequence of .
Proof: The proof considers two cases: (1) When is continuous at (2) when continuity doesn’t hold. Let be a converging subsequence of .
Case (1): The utility function is continuous at when for every good it holds that or . i.e. that there is no good with both and . This is implied directly from the allocation rule (see the formal definition in Section 2) and the fact that . Examine the support of , by Claim 2 there are no fixed points with support set containing . Therefore implying that there exists a good . That is, by definition of the supports, there exists with and . Consider such and assume for the sake of contradiction that is indeed a limit point. Then, by definition, for every exists a s.t. if then . Specifically it means that whenever .
By Claim 2, . Then, by continuity there exists a s.t. if then . Take and, by the assumption of convergence, there is a s.t. for and we have that . This implies (I) as and (II) . From these two, we can conclude that
Finally, observe that by rearranging the PRD update rule we get , implying that since for and . This means that for all we have . That is, cannot converge to zero and thus the subsequence cannot converge to , a contradiction.
Case (2): When there exists a good with and we have that is not continuous at and the previous idea doesn’t work. Instead we will contradict the PRD update rule. Assume of the sake of contradiction that is a limit point of a subsequence of PRD updates. Therefore for every exists a s.t. if then . Note that in this case and set and so, for it holds that . Also note that and that the maximal utility a buyer may have is (when it is allocated every good entirely). Then overall we have that . The PRD update rule is . But since the ratio is greater than 1 it must be that . And so every subsequent element of the subsequence is bounded below by and as before, we reach a contradiction as the subsequence cannot converge to . ∎
Finally we can prove the convergence of the sequence . As the action space is compact, there exists a converging subsequence with the limit . If for any such subsequence, then we are done. Otherwise, assume . By the previous claim any fixed point of other than is not a limit point of any subsequence, thus is not a fixed point of . By Lemma 1, any subset of players performing proportional response, strictly increase the potential function unless performed at a fixed point. When discussing a proportional response of a single player, with all others remaining fixed, this implies, by the definition of potential function, that is increased at each such step. Let , this quantity is positive since is not a fixed point. The function is a continuous function and converges to therefore there exists a such that for all we have that . Substituting yields which implies . That is, the sequence is bounded away from and since is a continuous function, this implies that is bounded away from — a contradiction to convergence. ∎
Appendix C Simultaneous Play by Subsets of Agents
In order to prove Lemma 1, we first need some further definitions and technical lemmas. We use the notation to denote the KL divergence between the vectors and , i.e., . For a subset of the players , the subscript on vectors denotes the restriction of the vector to the coordinates of the players in , that is, for a vector we use the notation to express the restriction to the subset. denotes the linear approximation of ; that is, .
The idea described in the next lemma to present the potential function as a linear approximation term and a divergence term was first described in [5] for a different scenario when all agents act together in a synchronized manner using mirror descent; we extend this idea to our asynchronous setting which requires using different methods and as well as embedding it in a game.
Lemma 9.
Fix a subset of the players and a bid profile of the other players. Then, for all we have that , where and .
Proof.
Calculating the difference yields
We rearrange the term as follows.
where the last equality is since for any set of prices because the economy is normalized (see the model section in the main text).
The term is expanded as follows.
Subtracting the latter from the former cancels out the term , and we are left with the following.
∎
Lemma 10.
For any subset of the players and any bid profile of the other players and for every it holds that , with equality only when .
Proof.
We begin by proving a simpler case where for some player and use it to prove the more general statement. Fix and , which implies fixing some . KL divergence is convex in both arguments with equality only if the arguments are equal; formally, for it holds that , which is equivalent to , with equality only if (since ). Substituting and noting that (and the same for and ), we obtain the following relation.
On the other hand, the expression can be evaluated as follows.
And therefore, we have , with equality only if .
Now we can prove the general case, as stated fix and and let . We know that for all it is true that , summing those inequalities for all yields , on the one hand clearly and on the other hand and the result is obtained. ∎
Lemma 11.
Let , let be a proportional response update function for members of and identity for the others, and let be some bid profile. Then, .
Proof.
By adding and removing constants that do not change the maximizer of the expression on the right hand side, we obtain that the maximizer is exactly the proportional response update rule:
Rearranging the last expression by elements yields the following result,
which is exactly , since for by definition. That is, our maximization problem is equivalent to . Finally, note that KL divergence is minimized when both of its arguments are identical, and , the domain of the minimization. ∎
Proof.
(Lemma 1): Let be a subset of players and let be some bid profile. By combining the lemmas proved in this section have that
where the first inequality is by Lemmas 9 and 10 with the inequality being strict whenever , and the second inequality is by Lemma 11, as was shown to be the maximizer of this expression over all . ∎
An interesting case to note here is when . In this case, the lemmas above show that if the players’ bids are and is being activated by the adversary, then the best response bids of to are the solutions to the optimization problem . On the other hand, the proportional response to is the solution to the optimization problem . This can be seen as a relaxation of the former, as proportional response does not increase (or equivalently the potential) as much as best response does. However, proportional response is somewhat easier to compute.
Appendix D Convergence of Asynchronous Proportional Response Dynamics
Proof.
(Theorem 1): Denote the distance between a point and a set as . By Theorem 2 we have that the set of potential maximizing bid profiles is identical to the set of market equilibria. Denote this set by and the maximum value of the potential by . More specifically, every achieves , and is achieved only by elements in .
We start with the following lemma.
Lemma 12.
For every there exists a such that implies .
Proof.
Assume otherwise that for some there exist a sequence such that but for all . Note that for all we have that and for all . Take a condensation point of this sequence and a subsequence that converges to . Thus by our assumption we have . For any we have . The former equality must imply , but the latter implies . ∎
Next, for a subset of players let be the continuous function where do a proportional response update and the other players play the identity function. (i.e., do not change their bids, see Section 5 in the main text).
By Lemma 1 from the main text we have that (i) For all we have that if and only if for all it holds that ; and (ii) unless .
Definition.
The stable set of is defined to be .
A corollary (i) and (ii) above is that if then , but if then .
Lemma 13.
: Let . Then there exists such that for every and every we have that .
Proof.
Fix a set such that and let . Since is continuous, there exists so that implies and thus . Now take the minimum for all finitely many . ∎
Lemma 14.
Let and let be a finite family of continuous functions such that for every we have that . Then there exists such that for every such that and every and every we have that .
Proof.
Fix and let be as promised by the previous lemma, i.e. for every and every we have that . Since and is continuous there exists so that implies and thus . Now take the minimum over the finitely many . ∎
Definition.
a sequence of sets is called -live if for every and for every there exists some such that .
Lemma 15.
Fix a sequence where such that the sequence is -live. There are no condensation points of outside of .
Proof.
Assume that exists a condensation point and a subsequence that converges to it, then . Notice that is a non-decreasing sequence and so for all . Let be a set of functions achieved by composition of at most functions from . So for every we have that , while for every we have that . Let be as promised by the previous lemma, i.e., for every and every and every such that we have that . Since the subsequence converges to there exists in the subsequence so that . Now let be the first time that . Now , where is the composition of all for the times to . We can now apply the previous lemma to get that , a contradiction. ∎
Lemma 16.
Fix a sequence where such that the sequence is -live. Then, it holds that .
Proof.
By lemma 1 we have that making the sequence monotone and bounded from above ( is a bounded function). Hence it converges to some limit . Either or . In the former case, the result is immediate by lemma 12 and approaches the set . We show that the latter yields a contradiction. If , this implies that is bounded away from , i.e. there exists such that for all . To see why this is true for all and not just in the limit, we observe that since the sequence is monotone, if we have at some time , then we have for all , which we currently assume is not the case. Therefore, every subsequence of is bounded away from , implying that every condensation point of is not in . The sequence is bounded and therefore has a converging subsequence with a condensation point not in , which is a contradiction to the previous lemma. ∎
Regarding convergence of prices, as stated in section 2, equilibrium prices for each Fisher market are unique and attained by any bid profile , thus, since the prices are a continuous function of the bids, the convergence of bids to this set implies the convergence of prices .
The last lemma concludes our proof of the Theorem 1.
∎
Appendix E Generic Markets
Proof.
(Theorem 5): Assume by way of contradiction that a generic linear Fisher market has two distinct market equilibrium bid profiles . For any market equilibrium it must hold that: (1) since equilibrium prices are unique, and (2) by budget feasibility. As , there exists a pair with , meaning that buyer has a different bid on good between and , and so by (1) it must be that exists a buyer whose bid on good was also changed so that the price remains fixed; formally, . In such case, by (2) there must be a good for which buyer has a different bid as well, since it’s budget is fixed and fully utilized; formally . As the graph with is finite, following the process described above while satisfying the constraints (1) and (2) must lead to a cycle in the graph .
Finally, we will show that there exists a market equilibrium with a cycle in its corresponding graph. Define for some and note that is also market equilibrium as the set of market equilibria is a convex set (see the model section in the main text). Let with be the corresponding graph of . Observe that since if then it must be that or and in any such case . Thus, the graph contains a cycle, contradicting Lemma 2 from the main text ∎
Proof.
(Lemma 2): Assume for the sake of contradiction that exists a cycle in , w.l.o.g. name the vertices of buyers and goods participating in the cycle in an ascending order; that is, , where and represent buyers and goods , respectively. Recall that for any market equilibrium if then for some constant (see the model section in the main text). Applying this to the cycle yields the following equations. (1) By considering edges from buyers to goods we obtain for , and (2) by considering edges from goods to buyers we obtain for and the edge closing the cycle yields . Finally, by considering the product of ratios between valuations of buyers participating in the cycle we have the following condition.
which contradicts the genericity condition. ∎