Asymptotically stable matchings and evolutionary dynamics of preference revelation games in marriage problems
Abstract
The literature on centralized matching markets often assumes that a true preference of each player is known to herself and fixed, but empirical evidence casts doubt on its plausibility. To circumvent the problem, we consider evolutionary dynamics of preference revelation games in marriage problems. We formulate the asymptotic stability of a matching, indicating the dynamical robustness against sufficiently small changes in players’ preference reporting strategies, and show that asymptotically stable matchings are stable when they exist. The simulation results of replicator dynamics are presented to demonstrate the asymptotic stability. We contribute a practical insight for market designers that a stable matching may be realized by introducing a learning period in which participants find appropriate reporting strategies through trial and error. We also open doors to a novel area of research by demonstrating ways to employ evolutionary game theory in studies on centralized markets.
keywords:
asymptotic stability , marriage problem , matching theory , evolutionary game theoryJEL:
C73 , C78 , D471 Introduction
The marriage problem is a simple and basic model of two-sided matching markets which involves one-to-one matchings. Suppose that there are two sets of players and , often referred to as men and women respectively. Each player has a preference relation defined over the other set and remaining single. A preference profile is a vector which consists of the preference relations of all players. A matching is said to be stable, if no player is worse off than to remain unmatched, and if no pair of a man and a woman prefer each other to their current partners. Gale and Shapley [1] formulated the marriage problem and showed that the set of stable matchings is always nonempty. In the proof, they presented the so-called deferred acceptance (DA) algorithm, which always yields a stable matching. They also showed the existence of the -optimal stable matching: i.e. a stable matching that is preferred to all the other stable matchings by all men. Similarly, there exists the -optimal stable matching.
In practice, a matching has to be determined based on the reported preferences, because a matchmaker can never know the true preferences of players. A matching algorithm induces the preference revelation game, a noncooperative game where players submit their preferences strategically. We refer to a preference profile submitted by the players as a report profile, in order to distinguish it from the true preference profile. A number of studies have investigated incentives of players to report their true preferences [2, 3, 4, 5, 6] and characterized the set of matchings resulting from the strategic reporting behavior [7, 8, 9, 10]. These studies considered centralized matching markets where participants submit their preferences to a clearinghouse based on which an algorithm generates the matching. Another line of research concerns decentralized markets and has analyzed dynamic processes in which matchings are updated in a pairwise manner [11, 12, 13, 14].
The literature on centralized markets often assumes that a true preference of each player is known to herself and fixed throughout the matching process. However several empirical works on school choice situations cast doubt on its plausibility. Dwenger et al. [15] concluded that participants in the university admissions centralized market in Germany chose to randomize their decisions. The preliminary results by Narita [16] and Grenet et al. [17] both suggest that preferences of players changed in time and learning was the prominent driver of those changes. The current paper explores the possibility of circumventing this problem by adopting an evolutionary uuapproach. The standard setting of game theory is that the game is played exactly once by fully rational players. The contrasting setting of evolutionary game theory [18, 19] is that players are not necessarily rational and their behavior changes in time through such processes as trial-and-error and social learning. Evolutionary dynamics of a game refers to such time evolution of the strategy profile. Since players learn their own and others’ preferences by experience, they need not be aware of their preferences when the process begins, unlike much of the literature on centralized markets.
In this paper, we theoretically investigate what kind of matchings will emerge through evolutionary dynamics of preference revelation games in marriage problems. We consider a class of continuous-time deterministic evolutionary dynamics which contains the famous replicator dynamics. We define the asymptotic stability of a matching, an analogue to the asymptotic stability of a fixed point in dynamical systems. When a matching is asymptotically stable, it is likely to be obtained at the equilibria of evolutionary dynamics. Furthermore, it turns out that an asymptotically stable matching may not exist, but when it does, it is also stable. The simulation results of the replicator dynamics are presented to demonstrate the dynamical robustness of an asymptotically stable matching against perturbation in the strategy profile. The asymptotic stability of a matching indicates the matching will be restored after another matching is realized due to sufficiently small changes in players’ reporting strategies. When a fixed point of a dynamical system is asymptotically stable, it intuitively means that sufficiently small perturbation results in a movement that goes back to the fixed point111Let be a fixed point of a system . is Lyapunov stable if, for each , there exists a such that if then for all . is attracting if there exists such that if then . is asymptotically stable if it is both Lyapunov stable and attracting [20]. . Our asymptotic stability of a matching is similar to the asymptotic stability in dynamical systems, but differs in that a focal state and perturbation are not in the same space. In matching problems, a focal state, a matching, is a point in the matching space, which is the set of all possible matchings. By contrast, perturbation corresponds to a change in the reporting strategy of a player, and players’ strategies are represented by a point in the strategy space.
This research contributes to matching theory in practical and theoretical manner. Firstly, we formulated the asymptotic stability of a matching, which characterize the matchings that emerge through evolutionary dynamics. An asymptotically stable matching is likely to be realized when players experiment or make mistakes occasionary. Furthermore we found that the asymptotic stability of a matching implies its stability. Thus the current paper suggests a novel practical strategy for market designers to implement stable matchings, which is to create a learning phase in which participants can try submitting various preferences and are informed of the would-be matchings many times over. Players would find reporting strategies that appropriately represent their true preferences during this learning phase. Accordingly an asymptotically stable and hence stable matching is realized. We emphasize that our argument does not neccesitate players’ initial awareness of their true preferences, unlike the literature. Secondly, we demonstrate how to employ evolutionary game theory in studying centralized matching markets, thereby opening doors to a novel area of research. Many directions can be taken: analytical research with machinery of evolutionary game theory, numerical analysis of evolutionary dynamics, and empirical work informed by such theoretical studies are all feasible.
The rest of the paper is organized as follows. Section 2 is the preliminaries, where formulation of marriage problems, preference revelation games, and some concepts from evolutionary game theory are presented. In the following section, we formulate the asymptotic stability of a matching. In section 4, we consider two instances of marriage problems and present simulation results of the replicator dynamics of preference revelation games. Section 5 is the discussion, and the concluding remark is in the last section.
2 Preliminaries
2.1 Marriage problem and the preference revelation game
An instance of the marriage problem is denoted by a tuple , where and are the sets of players on each side, and is the preference profile. A one-to-one matching is a mapping . Single players are considered to be paired with themselves. Thus for each , for each , and if then . A matching is said to be individually rational if no player prefers being single to the current partner. We say that a pair blocks a matching when prefers to and prefers to . is said to be stable if it is individually rational and has no blocking pair.
A matching algorithm induces the preference revelation game. A preference revelation game is characterized by a triplet , where is the set of players, is the pure strategy space, and is the tuple of payoff functions. Firstly, in standard cases, but when all men are assumed to play their dominant strategies and report truthfully [3, 2]. As for the pure strategy space, , the set of pure strategies for player , consists of ’s all possible preferences. is the pure strategy space of the game. Denoting the set of all possible preferences of male and female players by and respectively, for all and for all . Furthermore, a simplex denotes the mixed strategy space for player , and is the mixed strategy space of the game. Lastly, the payoff function, which maps a strategy profile to a real-valued payoff, is a composition of two mappings. The first mapping, a matching algorithm, describes the correspondence between a strategy profile, i.e. a report profile, and the resulting matching. The second mapping, which we call a utility function, maps each matching to a payoff value. Formally, let denote the matching space, which is the set of all possible matchings. When a matching algorithm in use always yields the same matching from a given report profile, the algorithm is a mapping from the pure strategy space to the matching space . In this study, we assume that players’ preferences are strict and a -optimal stable algorithm is used, so that the above requirement is met. A utility function of player , , maps the resulting matching to the player’s payoff. In the current paper, we write to describe a utility function where the payoff of when matched with -th preferable player is . Assuming , is considered as ’s cardinal preference. Player ’s payoff function of the game is the composition of the matching algorithm and the utility function : . in the triplet is the tuple of payoff functions: .
Our formulation of preference revelation game has two characteristics. First, players are to report their entire preference lists only once, so that evolutionary game theory on normal form games is applicable. If players are to reveal their preferences one by one, the game becomes an extensive-form game. The analysis of such situations seems possible with evolutionary game theory on extensive-form games, but it is beyond the scope of this paper. Second, we consider cardinal preferences, turning preference relations into numerical payoffs with a utility function . This enables us to formulate and solve systems of differential equations which represent evolutionary selection dynamics. When ordinal preferences are concerned, the payoff is determined solely by the rank of the partner in the preference list. With cardinal preferences, in contrast, payoffs may vary among players who are matched to their -th preferable partners. As the first attempt to study evolutionary selection dynamics of the preference revelation game, we would like to have as similar conditions as possible to existing studies, many of which adopted ordinal preferences. Therefore, we use identical for all players 222Assigning different for each player results in different selection pressures among players. Although such modificatoins lead to changes in specific trajectories in the strategy space, we expect that qualitative features of dynamics such as stabilities of fixed points would not be affected. , which makes payoff comparison among players pointless.
2.2 Evolutionary game dynamics
Replicator dynamics (RD) is one of the most significant evolutionary dynamics. A replicator corresponds to a pure strategy, and the payoff it gains is considered as its fitness. RD models the Darwinian natural selection, where strategies with higher payoffs prosper while ones with lower payoffs decline. This model was introduced by Taylor and Jonker [21] in the game theoretic context, and later named Replicator Dynamics by Schuster and Sigmund [22]. Researchers have actively studied RD thereafter, and it has also played a great role in analyzing human and social behavior [23]. Let be the number of players in the preference revelation game. Suppose there are large and finite populations of replicators, each of which corresponds to a player in the game. For instance, all replicators in the population corresponding to have the same true preferences, and the set of pure strategies available to them is . Choosing one replicator from each population randomly, we have replicators, each of which plays preference revelation game as the corresponding player. Replicators earning payoffs higher than the average increase their share within their populations, while ones with payoffs lower than the average decline. The dynamics of natural selection can be modeled by the repetition of this process.
Suppose that each replicator in a population of player is programmed to use a fixed pure strategy . Let denote the population share of replicators which always use pure strategy and whose roles are player . Then a vector indicates the state of the player population. It is formally equivalent to a mixed strategy of player : i.e. . Let be the average payoff of the replicators of player with pure strategy , which is equivalent to the expected payoff of . is the average payoff of all player replicators, or the expected payoff of mixed strategy . Replicator dynamics of a game with player is formulated as follows:
(1) |
where and
(2) |
The summation in is taken for , the strategies of players other than . In words, sums up to one for each player, is the average payoff for pure strategy of player , and is the average payoff for the player population. RD is a system of differential equations.
The idea of biological reproduction is the basis for RD. However, the selection of replicators, namely pure strategies, may well be driven by other mechanisms such as imitations, especially in the social and economic context. We consider some classes of continuous-time selection dynamics following Weibull [18], which are generalizations of RD.
To describe the selection process, the growth-rate function is considered. Let be the growth rate of the population share corresponding to player and pure strategy under population state , and
(3) |
The growth-rate function is a vector , whose each component is a growth-rate function for player , . Note that the growth-rate function of RD is . Selection dynamics are classified based on the properties of the growth-rate function. A regular growth-rate function is a Lipschitz continuous333 A function is Lipschitz continuous if there exists a constant such that for all . function with open domain containing , such that for all and [18, Definition 5.4]. denotes the set of regular growth-rate functions, and the selection dynamics induced by a regular growth-rate function will be called a regular selection dynamics on . A regular growth-rate function is payoff positive, if for all and , where denotes the sign of [18, Definition 5.6]. denotes the set of payoff-positive growth-rate functions, and the induced selection dynamics will be called a payoff-positive selection dynamics. Note that RD of equation (1) is payoff positive.
Under the payoff-positive selection dynamics, a subset of the mixed strategy space spanned by a set of pure strategy profiles is asymptotically stable if and only if is closed under weakly better replies [18, Theorem 5.3]. If a pure strategy earns at least as high payoff against a (mixed) strategy profile as , is a weakly better reply to . Let be
(4) |
denotes the payoff of for player position against , and is the average payoff for under . is closed under weakly better replies (cuwbr) if for all [18, Definition 5.9]. The property of cuwbr is a generalized concept of a strict Nash equilibrium444 A strategy profile is a strict Nash equilibrium if is the only best response to itself. to a set.
3 Asymptotic stability of a matching
In this section, we formulate asymptotic stability of a matching. When a matching is asymptotically stable, it will be restored after another matching is realized due to perturbation, which is a change in the reporting strategy of a player. Asymptotic stability of a matching differs from the asymptotic stability in dynamical systems in that a focal state and perturbation are not in the same space. Whereas a focal state is on the matching space, perturbation occurs in the strategy space. An evolutionary dynamics such as RD of equation (1) describes the time evolution of the strategy distributions in the mixed strategy space. A matching algorithm maps a point in the mixed strategy space to a set of matchings in the matching space. We utilize this relationship between the strategy space and the matching space, and define asymptotic stability of a matching through the asymptotic stability of a corresponding area in the mixed strategy space, where the conventional definition of the asymptotic stability in dynamical systems can be used.
First, we define a partial preimage of a matching , a subset of the pure strategy space that corresponds to . An obvious requirement for a partial preimage of is that any report profile in the set yields by the algorithm in use. In addition, we require the set to be a cartesian product of the subset of the each player’s pure strategy space. This is because players determine their strategy independently in preference revelation game. Formally, a partial preimage of a matching is defined as follows.
Definition 1.
Let be the matching algorithm in use. A subset of the pure strategy space is called a partial preimage of matching , when it satisfies the following:
a. | (5) | |||
b. | (6) |
is ”partial” in that it is included in the preimage of under : i.e. . There may exist multiple partial preimages for a matching. In particular, any subset of is a trivial partial preimage of . In later sections, we refer to a nontrivial one that is not included in any other partial preimage of as a non-included partial preimage of .
Now, we formulate asymptotic stability of a matching using its partial preimage . Let denote the subspace of the mixed strategy space spanned by :
(7) |
Note that any mixed report profile in yields only . Suppose that a mixed strategy profile is initially in , and then sufficiently small perturbation drives the point out of . While is not in , the algorithm may now yield matchings other than . If is asymptotically stable, moves back into after some time, yielding only . In short, will be restored after sufficiently small perturbation, if is asymptotically stable. As described in the previous section, under payoff-positive selection dynamics, is asymptotically stable if and only if is closed under weakly better replies (cuwbr). In this way, we formulate asymptotic stability of a matching.
Definition 2.
A matching is asymptotically stable under payoff-positive selection dynamics if it has a partial preimage that is closed under weakly better replies.
An asymptotically stable matching does not always exist, but if it does, it is also stable.
Theorem 1.
When -optimal or -optimal stable algorithms are used, a matching that is asymptotically stable under payoff-positive selection dynamics is stable.
Proof.
We concentrate on the case where -optimal stable algorithms are used. The same argument holds for -optimal stable algorithms. Players are indifferent as to which of -optimal stable algorithms is in use, because all of them yield the same matching from a given report profile. Therefore we assume that the DA algorithm with men proposing [1] is used without loss of generality.
Contrary to the theorem, suppose that an unstable matching is asymptotically stable. has a partial preimage that is cuwbr. The instability of implies that it has a blocking pair or it is not individually rational. When it has a blocking pair , the fact that and are not paired with each other under indicates that did not propose to . Thus all the reports of in rank above . Let be such a preference, where is ranked above , and be a preference of where is ranked above . Note that is not a element of .
-
i.
If all the reports of in rank above , can be better off by reporting . This contradicts the closure under weakly better replies (cuwbr) of .
-
ii.
If all the reports of in rank above , is not worse off by reporting . This contradicts the cuwbr of .
-
iii.
If reports of in include both types of preferences described above, is at least not worse off by reporting . This contradicts the cuwbr of .
Therefore an asymptotically stable matching cannot have a blocking pair. When is not individually rational, some player is paired with someone unacceptable: i.e. prefers itself to . This indicates that all the reports of in rank above , and thus can be better off by reporting a preference where is preferred to , which contradicts the cuwbr of . Therefore an asymptotically stable matching must be individually rational, and this completes the proof. ∎
4 Simulations of replicator dynamics
label | preference |
---|---|
label | preference |
---|---|
label | matching |
---|---|
![]() |
![]() |
So far, we have discussed game dynamics of preference revelation games theoretically. In this section, we take two instances marriage problems, and present numerical simulations of replicator dynamics (RD) of preference revelation games. We pay particular attention to matchings that will be realized after report profiles evolve for some time according to RD. We consider marriage problems with three players on each side: i.e. and . It is assumed that male players do not use weakly dominated strategies, which means they always report truthfully. Thus in this section, the set of strategic players is . All the possible reports are shown in tables 1, where denotes a preference of a woman who prefers to , and to . We will refer to a pure strategy by its label shown in the tables. The pure strategy space of the game is . It is assumed in this section that all players are acceptable. RD of equation (1) were numerically solved with the initial condition where all pure strategies evenly exist: i.e. for all . The DA algorithm with men proposing was used in obtaining a matching from a report profile, and matchings were mapped to payoffs by the utility function . For example, when a replicator is paired with the most preferable man, its payoff is 25. We say the state is at the stationary state at time , if the maximum changes in during the last few calculation steps were sufficiently small555Let the state at time , which corresponds to the -th calculation point, be . Let the state at calculation steps before be . We say is at the stationary at time , if for . . Furthermore, we say that a pure strategy survived when its population share at the stationary state was more than 0.1%.
The marriage problems we investigated are and , where
(8) |
The preference of is shown first in a preference profile, that of second and so on, the last being the preference of . In the first problem with , the -optimal and -optimal stable matchings are
(9) |
Under , is paired with , paired with , and paired with . The -optimal and -optimal stable matchings do not coincide, but no woman has the incentive to falsify assuming others are truthful666Theorem 1 of Gale and Sotomeyer [4] asserts that there is at least one woman who will be better off by falsifying, if there is more than one stable matching. In the problem with , no woman has the incentive to misreport even though there are multiple stable matchings, because we assumed that all players were acceptable. . In the second problem777 was taken from the proof of Theorem 3 in Roth [2]. with , the -optimal and -optimal stable matchings are
(10) |
Under -optimal stable algorithms, at least one woman has the incentive to falsify so that instead of is obtained. For instance, is realized if falsely reports .
Before presenting the simulation results, we check if asymptotically stable matchings exist in the two marriage problems888 In practice, we first simulated replicator dynamics and then studied asymptotically stable matchings referring to the simulation results. . In the first problem with , the following set is a non-included partial preimage of under -optimal stable algorithms:
(11) |
turns out to be cuwbr, and thus is asymptotically stable, as shown in B.1. In the proof, we essentially checked if of equation (4) satisfied the definition of cuwbr with payoff tables, which was made workable due to the assumption of . In the second problem with , one finds two non-included partial preimages of under -optimal stable algorithms:
(12) | ||||
(13) |
The following relations hold for and , where satisfies :
(14) | ||||
(15) | ||||
Equations (14) and (15) indicate that neither nor is cuwbr. Indeed, one can show that there is no partial preimage of that is cuwbr, implying that is not asymptotically stable. Details on the proofs are in B.2 and B.3. In addition, there is no partial preimage of that is cuwbr, since ’s reporting earns a higher payoff when others are reporting truthfully. Therefore, no asymptotically stable matching exists in the second problem.
In figures 1, the time evolution of and the resulting matchings are presented. In the first problem (figure 1), truthful reportings of all players have survived. All the possible report profiles at the stationary state result in the -optimal stable matching . One can see that although the values of some changed after perturbation (e.g. at ), the set of surviving reports were always , and the resulting matching remained regardless of the perturbations. This demonstrates the asymptotic stability of and thus . In the second problem (figure 1), truthful reporting went extinct in the population. All the possible report profiles at the stationary state result in the -optimal stable matching , not . However, while the set of surviving reports was at , it changed to as the result of the perturbation at . In short, even though seemed robust against the perturbations, it is not asymptotically stable in our definition. We will explore this situation in the next section.
To conclude this section, we briefly discuss the simulation results. Firstly, the selection favored only one matching in both cases, even though several strategies coexisted in many player populations at the stationary state. This is not due to the choice of the preference profiles. Assuming that three players are present on each side and all players are acceptable, there are possible preference profiles. We fixed the preference of without loss of generality, and conducted the simulation of RD for all possible preference profiles in case of . Except the 18 cases (0.2%) where kept fluctuating and thus we could not determine the final states, only one matching was possible at the final state in all instances. These observations imply that it is natural to consider the asymptotic stability of a matching. Secondly, the simulation results were consistent with previous research on the incentives of players. In the first problem, where no woman has the incentive to misreport, the truthful reporting strategies of women survived. In the second problem, at least one woman has the incentive to falsify, and only the false reporting survived in the population accordingly. Still, truthfully reported the most preferable men, which is consistent with the result of Roth [2] that no woman has the incentive to falsify her most preferred man.
5 Discussion
![]() |
![]() |
The simulation result of the second problem in figure 1 suggests that the -optimal stable matching was restored after small perturbations were given, even though is not asymptotically stable. What equations (14) and (15) tell us about two non-included partial preimages of are that is not cuwbr because of is a weakly better reply to some , and that is not cuwbr because of is a weakly better reply to some . Suppose that the current population state is , and some replicators in population start to use . This would move the population state to , where
(16) |
If we start from and some replicators in population come to use , the population state would also get . In addition, one can show that evolves into another state , where only is obtained, as long as selection dynamics are payoff-positive999Details are in C. . Therefore will in fact be restored after small perturbations are given.
In the second problem with , is not asymptotically stable, even though it will be restored after small perturbations. The essentials behind this confusing situation are two non-included partial preimages of , one of which contains a weakly better reply to the other and vice versa. All partial preimages are not cuwbr and thus is not asymptotically stable by our definition, but still, a mixed strategy profile moves only within partial preimages of the same matching. In order to characterize the robustness of , we propose a weaker property of a matching, quasi-asymptotic stability:
Definition 3.
A matching is quasi-asymptotically stable if it has a set of partial preimages
(17) |
When , it is equivalent to the asymptotic stability of Definition 2. is quasi-asymptotically stable with . We presume the quasi-asymptotic stability portrays some kind of dynamical stability of a matching similar to the asymptotic stability, but we have not yet been able to neither prove it nor find a counterexample.
In the current paper, preference relations are described by numerical values. Obviously, the payoff values must satisfy , but still there are countless candidates for the utility function. To examine whether the choice of the utility function affects the dynamics, we conducted simulations of RD in the two marriage problems with two other utility functions, and . The utility function used in the preceding section was . , and are convex, linear and concave, respectively. In the first problem with , the dynamics of did not seem to be affected by the choice of the utility function, but their time scales differed. The dynamics with the linear utility function is shown in figure 2. In the second problem with , transient dynamics slightly varied and the share of in the population at the last state was also affected by the choice of the utility function. Figure 2 shows the dynamics with the concave utility function. Nevertheless, strategies that survived remained the same regardless of the utility function. Our analysis have mainly focused on the last states of dynamics and the matchings which were obtained there. Thus, in view of the observations above, we expect our simulation results to be independent on the choice of the utility function.
6 Conclusion
Although the literature on centralized matching markets often assumes that a true preference of each player is known to herself and fixed throughout the matching process, several empirical results [15, 16, 17] cast doubt on the assumption. To circumvent the problem, evolutionary dynamics of preference revelation games are considered. We formulated the asymptotic stability of a matching, which indicates the dynamical robustness of the matching against sufficiently small changes in players’ reporting strategies. We showed that the asymptotic stability of a matching implies its stability, thereby contributing a practical suggestion on how to obtain stable matchings in centralized markets: by having a learning phase during which players find reporting strategies that suit their true preferences through trial and error, an asymptotically stable and hence stable matching would be obtained. Note that our insight does not assume players’ initial awareness of their true preferences, unlike much of the literature.
We believe the current paper lays a foundation for dynamical approach to centralized matching markets with evolutionary dynamics of preference revelation games, and there is plenty of room for future research. One issue that should be tackled is the difficulty in finding asymptotically stable matchings. We checked the asymptotic stability according to its definition, looking at payoff matrices, but this procedure quickly becomes unfeasible as the number of players increase. Our approach with continuous-time deterministic selection dynamics is applicable to wider range of problems, provided the payoff function is deterministic and thus payoffs are never determined randomly. We assumed preferences were strict and -optimal stable algorithms were used, but players’ preferences need not be strict. When preferences are not strict, deterministic payoff functions can be constructed by employing matching algorithms that have deterministic tie-breaking rules. Our approach can also be adopted to many-to-one or many-to-many matching problems. In these cases, payoff functions are likely to be much complicated and straightforward analysis may get cumbersome. Directions for future research include analyses of other types of game dynamics, such as ones with mutations or on graphs, and search for connections between the asymptotic stability of a matching and stochastic stability analyzed in research on decentralized markets [12, 13, 14]. Another interesting topic would be a partial preimage of a matching. We introduced a partial preimage to formulate the asymptotic stability of a matching based on the concept, but a partial preimage itself may hold information that characterizes the matching. For instance, the cardinality of a non-included partial preimage is likely to reflect the likelihood that a matching is realized when all players randomly submit their reports.
References
- [1] D. Gale, L. S. Shapley, College Admissions and the Stability of Marriage, The American Mathematical Monthly 69 (1) (1962) 9–15. doi:10.2307/2312726.
- [2] A. E. Roth, The Economics of Matching: Stability and Incentives, Mathematics of Operations Research 7 (4) (1982) 617–628. doi:10.1287/moor.7.4.617.
- [3] L. E. Dubins, D. A. Freedman, Machiavelli and the Gale-Shapley Algorithm, The American Mathematical Monthly 88 (7) (1981) 485–494. doi:10.2307/2321753.
- [4] D. Gale, M. Sotomayor, Ms. Machiavelli and the Stable Matching Problem, The American Mathematical Monthly 92 (4) (1985) 261–268. doi:10.2307/2323645.
- [5] N. Immorlica, M. Mahdian, Marriage, honesty, and stability, in: Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’05, Society for Industrial and Applied Mathematics, USA, 2005, pp. 53–62.
- [6] F. Kojima, P. A. Pathak, Incentives and Stability in Large Two-Sided Matching Markets, American Economic Review 99 (3) (2009) 608–627. doi:10.1257/aer.99.3.608.
- [7] A. E. Roth, Misrepresentation and stability in the marriage problem, Journal of Economic Theory 34 (2) (1984) 383–387. doi:10.1016/0022-0531(84)90152-2.
- [8] J. Ma, Stable Matchings and Rematching-Proof Equilibria in a Two-Sided Matching Market, Journal of Economic Theory 66 (2) (1995) 352–369. doi:10.1006/jeth.1995.1045.
- [9] J. Alcalde, Implementation of Stable Solutions to Marriage Problems, Journal of Economic Theory 69 (1) (1996) 240–254. doi:10.1006/jeth.1996.0050.
- [10] T. Sönmez, Games of Manipulation in Marriage Problems, Games and Economic Behavior 20 (2) (1997) 169–176. doi:10.1006/game.1997.0559.
- [11] A. E. Roth, J. H. V. Vate, Random Paths to Stability in Two-Sided Matching, Econometrica 58 (6) (1990) 1475. doi:10.2307/2938326.
- [12] M. O. Jackson, A. Watts, The Evolution of Social and Economic Networks, Journal of Economic Theory 106 (2) (2002) 265–295. doi:10.1006/jeth.2001.2903.
- [13] J. Newton, R. Sawa, A one-shot deviation principle for stability in matching problems, Journal of Economic Theory 157 (2015) 1–27. doi:10.1016/j.jet.2014.11.015.
- [14] E. Bilancini, L. Boncinelli, J. Newton, Evolution and Rawlsian social choice in matching, Games and Economic Behavior 123 (2020) 68–80. doi:10.1016/j.geb.2020.06.004.
- [15] N. Dwenger, D. Kübler, G. Weizsäcker, Flipping a coin: Evidence from university applications, Journal of Public Economics 167 (2018) 240–250. doi:10.1016/j.jpubeco.2018.09.014.
- [16] Y. Narita, Match or Mismatch? Learning and Inertia in School Choice, SSRN Scholarly Paper 3198417, Social Science Research Network, Rochester, NY (Jun. 2018). doi:10.2139/ssrn.3198417.
- [17] J. Grenet, Y. He, D. Kübler, Decentralizing Centralized Matching Markets: Implications from Early Offers in University Admissions (Jul. 2021). arXiv:2107.01532, doi:10.48550/arXiv.2107.01532.
- [18] J. W. Weibull, Evolutionary Game Theory, MIT Press, 1997.
- [19] J. Newton, Evolutionary Game Theory: A Renaissance, Games 9 (2) (2018) 31. doi:10.3390/g9020031.
- [20] S. Strogatz, Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering, 2nd Edition, Westview Press, Boulder, Colo., 2015.
- [21] P. D. Taylor, L. B. Jonker, Evolutionary stable strategies and game dynamics, Mathematical Biosciences 40 (1) (1978) 145–156. doi:10.1016/0025-5564(78)90077-9.
- [22] P. Schuster, K. Sigmund, Replicator dynamics, Journal of Theoretical Biology 100 (3) (1983) 533–538. doi:10.1016/0022-5193(83)90445-9.
- [23] R. Cressman, Y. Tao, The replicator equation and other game dynamics, Proceedings of the National Academy of Sciences 111 (Supplement_3) (2014) 10810–10817. doi:10.1073/pnas.1400823111.
Appendix A Payoff matrices
Appendix B Proofs for section 4
B.1 Closure under weakly better replies of
We show that of equation (11) is cuwbr.
Proof.
First, recall that is cuwbr if for all , where
(18) |
We consider the following sets:
(19) | ||||
(20) |
One can see that , , and , which indicate
(21) |
Therefore it suffices to examine if holds for .
Table 2 tells us that the reports of and do not change the outcome when reports or . We can therefore assume that and report without loss of generality, and what we need to show is that is cuwbr.
Let be . Each element in denotes the probability distribution describing the mixed strategy of each player, and is the probability that reports . When all players use , payoffs are and . From table 2(a), following inequalities hold:
(22) |
Thus holds, completing the proof. ∎
B.2 Nonclosure under weakly better replies of and
First, we show that of equation (12) is not cuwbr.
Proof.
As in the preceding subsections, it suffices to examine if holds for . Table 3 tells us that the report of does not affect the outcome. We therefore assume that submits without loss of generality, and what we need to show is that is not cuwbr.
Next, we show that of equation (13) is not cuwbr either.
Proof.
In the same way as above, we examine if holds for . Table 3 tells us that the report of does not affect the outcome. We therefore assume that submits without loss of generality, and what we need to show is that is not cuwbr.
B.3 Nonexistence of a partial preimage of closed under weakly better replies
We have already shown that and are not cuwbr. We prove that its other partial preimages are not cuwbr either to show that there is no partial preimage of that is cuwbr.
Proof.
In the possible report profiles for women, eighteen result in , all of which are in . Thus their subsets are the only candidates for a partial preimage that is cuwbr. Let us consider the partial preimage . Since the report of does not change the outcome, we may assume that reports without loss of generality. Then the partial preimage to examine is , which is a pure strategy profile. Since the cuwbr of is equivalent to that is a strict Nash equilibrium, and there is no strategy profile that is a strict Nash equilibrium in the current setting as we mentioned in section 4, is not cuwbr. The same argument holds for all other partial preimages of , and thus there is no partial preimage of that is cuwbr. ∎
Appendix C Proof for section 5
We show that a trajectory starting from ends up at some under payoff-positive selection dynamics.
Proof.
First, observe that is not a partial preimage of , since its subset
(25) |
is a partial preimage of . Because , trajectories starting in go into either or . Now, we look at dynamics. From the payoff matrices in Table 3, one finds that the report of does not change the outcome. Therefore we can assume always reports without loss of generality. When the state is in , for and for . Using , dynamics of become two-dimensional dynamical systems:
(26) |
Recalling the definition of the payoff-positivity and looking at the payoff matrix in Table 3(a), we find
(27) | ||||
(28) |
Hence, and until either or gets to be one. As when , and when , we have established that trajectories starting in go into either or under payoff-positive selection dynamics. ∎