The Optimal Way to Play the Most Difficult
Repeated Coordination Games

Antti Kuusisto University of Helsinki and Tampere University
Finland [email protected] Université Paris-Saclay, CNRS, ENS Paris-Saclay
France Raine Rönnholm Université Paris-Saclay, CNRS, ENS Paris-Saclay
France [email protected]

Abstract

This paper investigates repeated win-lose coordination games ( $\mathsf{WLC}$ -games). We analyse which protocols are optimal for these games, covering both the worst case and average case scenarios, i,e., optimizing the guaranteed and expected coordination times. We begin by analysing Choice Matching Games ( $\mathsf{CM}$ -games) which are a simple yet fundamental type of $\mathsf{WLC}$ -games, where the goal of the players is to pick the same choice from a finite set of initially indistinguishable choices. We give a fully complete classification of optimal expected and guaranteed coordination times in two-player $\mathsf{CM}$ -games and show that the corresponding optimal protocols are unique in every case—except in the $\mathsf{CM}$ -game with four choices, which we analyse separately.

Our results on $\mathsf{CM}$ -games are essential for proving a more general result on the difficulty of all $\mathsf{WLC}$ -games: we provide a complete analysis of least upper bounds for optimal expected coordination times in all two-player $\mathsf{WLC}$ -games as a function of game size. We also show that $\mathsf{CM}$ -games can be seen as the most difficult games among all two-player $\mathsf{WLC}$ -games, as they turn out to have the greatest optimal expected coordination times.

1 Introduction

Pure win-lose coordination games ( $\mathsf{WLC}$ -games) are simple yet fundamental games where all players receive the same payoffs: 1 (win) or 0 (lose). This paper studies repeated $\mathsf{WLC}$ -games, where the players make simultaneous choices in discrete rounds until (if ever) succeeding to coordinate on a winning profile. Choice matching games ( $\mathsf{CM}$ -games) are the simplest class of such games. The choice matching game $\mathsf{CM}_{\_}m^{n}$ has $n$ players with the goal to choose the same choice among $m$ different indistinguishable choices, with no communication during play. The players can use the history of the game (i.e., the players’ choices in previous rounds) for their benefit as the game proceeds. For simplicity, we denote the two-player game $\mathsf{CM}_{\_}m^{2}$ by $\mathsf{CM}_{\_}m$ .

A paradigmatic real-life scenario with a choice matching game relates to a phenomenon that has humorously been called “pavement tango” or “droitwich” in [2]. Here two people try to pass each other but may end up blocking each other by repeatedly moving sideways into the same direction. For another example of a choice matching game, consider $\mathsf{CM}_{\_}3$ , the coordination-based variant of the rock-paper-scissors game, pictured on the right. {vwcol}[widths=0.85,0.15, sep=2mm, justify=flush, rule=0pt, lines=7] Here the two players (i.e., columns) coordinate if they succeed choosing an edge from one of the three rows. The players first choose randomly; suppose they select the nodes in dotted circles. Next the players have three options: (1) to repeat their first choice, (2) to try to coordinate with the first choice of the other player, or (3) to select the choice which has not been selected yet. Supposing that the players behave symmetrically, (3) is the only way to guarantee coordination in the second round.

A general $n$ -player $\mathsf{WLC}$ -game is a generalization of $\mathsf{CM}_{\_}m^{n}$ where the players do not necessarily have to choose from the same row to coordinate, and it may not even suffice to choose from the same row. In classical matrix form representation, two-player choice matching games have ones on the diagonal and zeroes elsewhere, while general two-player $\mathsf{WLC}$ -games have general distributions of ones and zeroes; see Definition 2.1 for the full formal details.

In repeated $\mathsf{WLC}$ -games, it is natural to try to coordinate as quickly as possible. There are two main scenarios to be investigated: guaranteeing coordination (with certainty) in as few rounds as possible and minimizing the expected number of rounds for coordination. The former concerns the number of rounds it takes to coordinate in the worst case and is measured in terms of guaranteed coordination times ( $\mathsf{GCT}$ s). The latter relates to the average case analysis measured in terms of expected coordination times ( $\mathsf{ECT}$ s).

Our contributions. We provide a comprehensive study of upper bounds for coordination in all two-player repeated $\mathsf{WLC}$ -games, including a classification of related optimal strategies (called protocols in this work). $\mathsf{WLC}$ -games are represented in a novel way as relational structures, which is a key to many of the techniques used in the paper. $\mathsf{CM}$ -games are central to our work, being a fundamental class of games and also the most difficult games for coordination—in a sense made precise below.

Two protocols play a central role in our study. We introduce the so-called loop avoidance protocol $\mathsf{LA}$ (cf. Def. 4.1) that essentially instructs players to play so that the generated history of choices always reduces the symmetries (e.g., automorphisms) of the game structure. We also use the so-called wait-or-move protocol $\mathsf{WM}$ (cf. Def. 4.5), essentially telling players to randomly alternate between two choices that both coordinate with at least one of the opponent’s two choices. We show that $\mathsf{WM}$ leads to coordination in all two-player $\mathsf{WLC}$ -games very fast, the $\mathsf{ECT}$ being at most $3-2p$ , where $p$ is the probability of coordinating in the first round with random choices.

We then provide a complete analysis of the optimal $\mathsf{ECT}$ s and $\mathsf{GCT}$ s in all choice matching games $\mathsf{CM}_{\_}m$ . We also identify the protocols giving the optimal $\mathsf{ECT}$ s and $\mathsf{GCT}$ s and show their uniqueness, where possible. The table in Figure 1 on the next page summarizes these results.

	Optimal expected	Unique optimal	Optimal guaranteed	Unique optimal
$m$	coordination	protocol for	coordination	protocol for
	time in $\mathsf{CM}_{\_}m$	expected time	time in $\mathsf{CM}_{\_}m$	guaranteed time
$1$	1	(any)	1	(any)
$2$	$2$	$\mathsf{WM}$	$\infty$	—
$3$	$1+\frac{2}{3}$	$\mathsf{LA}$	$2$	$\mathsf{LA}$
$4$	$2+\frac{1}{2}$	—	$\infty$	—
$5$	$2+\frac{1}{3}$	$\mathsf{LA}$	$3$	$\mathsf{LA}$
\hdashline $6$	$2+\frac{2}{3}$	$\mathsf{WM}$	$\infty$	—
$7$	$2+\frac{5}{7}$	$\mathsf{WM}$	$4$	$\mathsf{LA}$
⋮	⋮	⋮	⋮	⋮
$2k$	$3-\frac{1}{k}$	$\mathsf{WM}$	$\infty$	—
$2k+1$	$3-\frac{2}{2k+1}$	$\mathsf{WM}$	$k$	$\mathsf{LA}$

Figure 1: A complete analysis of two-player choice matching games.

Our analysis is fully complete, as we prove that there exists a continuum of optimal protocols for $\mathsf{CM}_{\_}4$ and establish that for all even $m$ , no protocol guarantees a win in $\mathsf{CM}_{\_}m$ .

Concerning the more general class of all $\mathsf{WLC}$ -games, we provide the following complete characterization of upper bounds for the optimal $\mathsf{ECT}$ s in all two-player $\mathsf{WLC}$ -games as a function of game size (a game in a classical matrix form is of size $m$ when the maximum of the number of rows and columns is $m$ ):

Theorem. For any $m$ , the greatest optimal $\mathsf{ECT}$ among two-player $\mathsf{WLC}$ -games of size $m$ is as follows:

Game size	$m\in\mathbb{Z}_{\_}+\setminus\{3,5\}$	$m=5$	$m=3$
Greatest optimal $\mathsf{ECT}$	$3-\frac{2}{m}$	$2+\frac{1}{3}$	$\frac{1+\sqrt{4+\sqrt{17}}}{2}$ $(\approx 1.925)$

Also, concerning two-player $\mathsf{CM}$ -games, we establish that $\mathsf{CM}_{\_}m$ has the strictly greatest optimal $\mathsf{ECT}$ out of all two-player $\mathsf{WLC}$ -games of size $m\not=3$ , making $\mathsf{CM}$ -games the most difficult $\mathsf{WLC}$ -games to coordinating in. We give a separate full analysis of the case $m=3$ .

Two comments are in order. Firstly, in real-life scenarios it can be very inefficient to determine the absolutely optimal protocols taking into account the full game structure. Indeed, this generally requires an analysis of the game structure and thus identifying, e.g., automorphic choices. Computing automorphisms of (even bipartite graphs) is well known to be hard. However, our analysis gives an instant way of finding a fast protocol for any two-player $\mathsf{WLC}$ -game $G$ . The $\mathsf{ECT}$ given by this protocol in $G$ is better or equal to the greatest optimal $\mathsf{ECT}$ of the games of the same size (cf. the theorem above). Secondly, our main analysis concerns only two-player games. However, the arguments for this case are already quite involved, so the $n$ -player case is expected to be highly complex and is thus left for the future. Furthermore, the two-player case is an especially important special case covering, e.g., learning of communication protocols in distributed systems.

Related work. Coordination games (see, e.g., [5], [4]) are a key topic in game theory, with the early foundations laid, inter alia, in the works of Schelling [18] and Lewis [16]. Repeated games are—likewise—a key topic, see for example [13], [3], [17]. For seminal work on repeated coordination games, see for example the articles [7], [6], [15]. Repeated coordination games have a wide range of applications, from learning and social choice theory to symmetry breaking in distributed systems.

However, $\mathsf{WLC}$ -games are a simple class of games that have not been extensively studied in the literature. In particular, choice matching games clearly constitute a highly fundamental class of games, and it is thus surprising that the analysis of the current paper has not been previously carried out. Thus the related analysis is well justified; it closes a gap in the literature. Indeed, while $\mathsf{CM}$ -games are simple, they precisely capture the highly fundamental coordination problem of picking the same choice from a set of choices.

Our study differs from the classical game-theoretic work on repeated games where the focus is on accumulated payoffs instead of the discrete outcomes of $\mathsf{WLC}$ -games. Indeed, repeated $\mathsf{WLC}$ -games are based on reachability objectives. Especially our worst case (but also the average case) analysis has only superficial overlap with typical work on repeated games.

However, similar work exists, the most notable example being the seminal article [7] that studies a generalization of $\mathsf{WLC}$ -games in a framework that has some similarities with our setting. Also the papers [9], [11], [8], [10] have a similar focus, as they also concentrate on coordination games with discrete win-lose outcomes. However, the papers [9], [11], [8], [10] do not investigate optimality of protocols in repeated games, unlike the current paper. The seminal work [7] introduces (what is equivalent to) the two-player $\mathsf{CM}$ -games in their final section on general examples. They also essentially identify the optimal ways of playing $\mathsf{CM}_{\_}2$ and $\mathsf{CM}_{\_}3$ (discussed also in this article) although in a technically somewhat different setting with accumulated payoffs. Furthermore, they observe that a protocol essentially equivalent to $\mathsf{WM}$ is the best way to play $\mathsf{CM}_{\_}6$ , an observation we also make in our setting. However, optimality of $\mathsf{WM}$ in $\mathsf{CM}_{\_}6$ is not proved in [7]. This would require an extensive analysis proving that the players cannot make beneficial use of asymmetric histories created by non-coordinating choices. Indeed, the main technical difficulty in our corresponding setting is to show uniqueness of the optimal protocol, which we do in each case where a unique protocol exist.

Nonetheless, despite the differences, the framework of [7] bears some conceptual similarities to ours, e.g., the authors also identify structural protocols (cf. Definition 3.4 below) as the natural notion of strategy for studying their framework. Furthermore, they make extensive use of focal points [18] in analysing how asymmetric histories can be used for coordination.

Relating to uniqueness of protocols, [12] argues that individual rationality considerations not are not sufficient for “learning how to coordinate” in the setting of [7]. We agree with [12] that some conventions are needed if many protocols lead to the optimal result. However—in our framework—as we can prove uniqueness of the optimal protocols for $\mathsf{CM}_{\_}m$ (for $m\neq 4$ ), then arguably rational players should adopt precisely these protocols in $\mathsf{CM}$ -games.

The paper [14] is the preprint of the current submission with full proofs and further examples.

Techniques used. The core of our work relies on an original approach to games based on relational structures, as opposed to using the traditional matrix form representation. This approach enables us to use graph-theoretic ideas in our arguments. Both in the worst-case and in average-case analysis, the main technical work relies heavily on analysis of symmetries—especially the way the groups of automorphisms of games evolve when playing coordination games. The most involved result of the worst-case analysis, Theorem 5.2, is proved by reducing the cardinality of the automorphism group of the $\mathsf{WLC}$ -game studied in a maximally fast fashion. In the average-case analysis, Theorems 6.2, 6.3, 6.4 are proved via a combination of analysis of extrema; keeping track of groups of automorphisms; graph-theoretic methods; and focal points [18] for breaking symmetry. The most demanding part here is to show uniqueness of the protocols involved. Also in the average-case analysis, Theorem 7.2 relies on earlier theorems and an extensive and exhaustive analysis of certain bipartite graphs.

In principle one could of course reduce our arguments into the setting with games presented in matrix form. However, then it would become harder to apply natural graph-theoretic notions like degrees of nodes; special features such as cycles; general symmetries (automorphisms) et cetera. Such notions are key elements in our analysis.

2 Preliminaries

We define (pure) win-lose coordination games as relational structures as follows.

Definition 2.1.

An $n$ -player win-lose coordination game ( $\mathsf{WLC}$ -game) is a relational structure $G=(A,C_{\_}1,\dots,C_{\_}n,W_{\_}G)$ where $A$ is a finite domain of choices, each $C_{\_}i$ is a non-empty unary relation (representing the choices of player $i$ ) such that $C_{\_}1\cup\cdots\cup C_{\_}n=A$ , and $W_{\_}G\subseteq C_{\_}1\times\cdots\times C_{\_}n$ is an $n$ -ary winning relation. For technical convenience, we assume the players have pairwise disjoint choice sets, i.e., $C_{\_}i\cap C_{\_}j=\emptyset\text{ for all }i\text{ and }j\not=i$ . A tuple $\sigma\in C_{\_}1\times\cdots\times C_{\_}n$ is a choice profile for $G$ and the choice profiles in $W_{\_}G$ are winning (choice) profiles. We assume there are no surely losing choices, i.e., choices $c\in A$ that do not belong to any winning choice profile, as rational players would never select such choices.

We will use the visual representation of $\mathsf{WLC}$ games as hypergraphs; two-player games become just bipartite graphs under this scheme. The choices of each player are displayed as columns of nodes, starting from the choices of player 1 on the left and ending with the column of choices of player $n$ . The winning relation consists of lines that represent the winning choice profiles. Thus winning choice profiles are also called edges. See Example A.1 in [14] for an illustration.

Consider a $\mathsf{WLC}$ -game $G=(A,C_{\_}1,\dots,C_{\_}n,W_{\_}G)$ with $n$ players and $m$ winning choice profiles that do not intersect, i.e., none of the $m$ winning choice profiles share a choice $c\in A$ . Such games form a simple yet fundamental class of games, where the goal of the players is simply to pick the same “choice”, i.e., to simultaneously pick one of the $m$ winning profiles. These games are called choice matching games. We let $\mathsf{CM}^{n}_{\_}m$ denote the choice matching game with $n$ players and $m$ choices for each player. In this article, we extensively make use of the two-player choice matching games, $\mathsf{CM}^{2}_{\_}m$ . For these games, we will omit the superscript “ $2$ ” and simply denote them by $\mathsf{CM}_{\_}m$ . (Recall the example $\mathsf{CM}_{\_}3$ pictured in the introduction.)

Interestingly, out of all $n$ -player $\mathsf{WLC}$ -games where each of the $n$ players has $m$ choices, the game $\mathsf{CM}^{n}_{\_}m$ has the least probability of coordination when each player plays randomly. In this sense these games can be seen the most difficult for coordination. A fully compelling reason for the maximal difficulty of choice matching games is given later on by Theorem 7.3.

3 Repeated $\mathsf{WLC}$ -Games

A repeated play of a $\mathsf{WLC}$ -game $G$ consists of consecutive (one-step) plays of $G$ . The repeated play is continued until the players successfully coordinate, i.e., select their choices from a winning choice profile. This may lead to infinite plays. We assume that each player can remember the full history of the repeated play and use this information when planning choices. The history of the play after $k$ rounds is encoded in a sequence $\mathcal{H}_{\_}k$ defined as follows.

Definition 3.1.

Let $G$ be an $n$ -player $\mathsf{WLC}$ -game. A pair $(G,\mathcal{H}_{\_}k)$ is called a stage $k$ (or $k$ th stage) in a repeated play of $G$ , where the history $\mathcal{H}_{\_}k$ is a $k$ -sequence of choice profiles in $G$ . More precisely, $\mathcal{H}_{\_}k=\big{(}H_{\_}i\big{)}_{\_}{i\in\{1,\dots,k\}}$ where each $H_{\_}i$ is an $n$ -ary relation $H_{\_}i=\{(c_{\_}1,\dots,c_{\_}n)\}$ with a single tuple $(c_{\_}1,\dots,c_{\_}n)\in C_{\_}1\times\cdots\times C_{\_}n$ . In the case $k=0$ , we define $\mathcal{H}_{\_}0=\emptyset$ . The stage $(G,\mathcal{H}_{\_}0)$ is the initial stage (or the $0$ th stage). Like $G$ , also $(G,\mathcal{H}_{\_}k)$ is a relational structure.

A stage $k$ contains a history specifying precisely $k$ choice profiles chosen in a repeated play. A winning profile of $(G,\mathcal{H}_{\_}k)$ is called a touched edge if it contains some choice $c$ picked in some round $1,\dots,k$ leading to $(G,\mathcal{H}_{\_}k)$ . As we assume that the players only need to coordinate once, we consider repeated plays only up to the first stage where some winning choice profile is selected. If coordination occurs in the $k$ th round, the $k$ th stage is called the final stage of the repeated play. (But a play can take infinitely long without coordination.)

{vwcol}

[widths=0.85,0.15, sep=5mm, justify=flush, rule=0pt,lines=5] On the right is a drawing of the stage $2$ in a repeated play of $\mathsf{CM}_{\_}2$ , the “coordination game variant” of the matching pennies game (or the “pavement tango” from the introduction). Here the players have failed to coordinate in round 1 (having picked the choices with dotted circles) and then failed again by both swapping their choices in round 2 (solid circles).

A protocol gives a mixed strategy for all stages in all $\mathsf{WLC}$ -games and for all player roles $i$ :

Definition 3.2.

A protocol $\pi$ is a function outputting a probability distribution $f:C_{\_}i\rightarrow[0,1]$ (so $\sum_{\_}{c\in C_{\_}i}f(c)=1$ ) with the input of a player $i$ and a stage $(G,\mathcal{H}_{\_}k)$ of a repeated $\mathsf{WLC}$ -game.

Since a protocol can depend on the full history of the current stage, it gives a mixed, memory-based strategy for any repeated $\mathsf{WLC}$ -game. Thus protocols can informally be regarded as global “behaviour styles" of agents over the class of all repeated $\mathsf{WLC}$ -games. It is important to note that all players can see (and remember) the previous choices selected by all the other players—and also the order in which the choices have been made.

In the scenario that we study, it is obvious to require that the protocols should act independently of the names of choices and the names (or ordering) of player roles $i$ .¹¹1Note that if this assumption is not made, coordination can trivially be guaranteed in a single round in any $\mathsf{WLC}$ -game by using a protocol which chooses some winning choice profile with probability $1$ . In [7], this requirement follows from the “assumption of no common language” (for describing the game), and in [11], such protocols are called structural. We shall adopt the terminology of [11]. To extend this concept for repeated games, we first need to define the notion of a renaming. The intuitive idea of renamings is to extend isomorphisms between game graphs—including the history—to additionally enable permuting the players $1,\dots,n$ (see Example A.2 in [14] for an illustration of the definition).

Definition 3.3.

A renaming between stages $(G,\mathcal{H}_{\_}k)$ and $(G^{\prime},\mathcal{H}_{\_}k^{\prime})$ of $n$ -player $\mathsf{WLC}$ -games $G$ and $G^{\prime}$ is a pair $(\beta,h)$ where $\beta$ is a permutation of $\{1,\dots,n\}$ and $h$ a bijection from the domain of $G$ to that of $G^{\prime}$ such that

•

$c\in C_{\_}{\beta(i)}\Leftrightarrow h(c)\in C_{\_}i^{\prime}$ for all $i\leq n$ and $c$ in the domain of $G$ ,
•

$(c_{\_}1,\dots,c_{\_}n)\in W_{\_}G\Leftrightarrow(h(c_{\_}{\beta(1)}),\dots,h(c_{\_}{\beta(n)}))\in W_{\_}{G^{\prime}}$ ,
•

$(c_{\_}1,\dots,c_{\_}n)\in H_{\_}i\Leftrightarrow(h(c_{\_}{\beta(1)}),\dots,h(c_{\_}{\beta(n)}))\in H_{\_}i^{\prime}$ for all $i\leq k$ .

If $(G,\mathcal{H}_{\_}k)$ and $(G^{\prime},\mathcal{H}_{\_}k^{\prime})$ have the same domain $A$ , we say that $(\beta,h)$ is a renaming of $(G,\mathcal{H}_{\_}k)$ . Choices $c\in C_{\_}i$ and $d\in C_{\_}j$ are structurally equivalent, denoted by $c\sim d$ , if there is a renaming $(\beta,h)$ of $(G,\mathcal{H}_{\_}k)$ s.t. $\beta(i)=j$ and $h(c)=d$ . It is easy to see that $\sim$ is an equivalence relation on $A$ . We denote the equivalence class of a choice $c$ by $[c]$ .

Definition 3.4.

A protocol $\pi$ is structural if it is indifferent with respect to renamings, i.e., if $(G,\mathcal{H}_{\_}k)$ and $(G^{\prime},\mathcal{H}_{\_}k^{\prime})$ are stages with a renaming $(\beta,\pi)$ between them, then for any $i$ and any $c\in C_{\_}i$ , we have $f(c)=f^{\prime}(\pi(c)),$ where $f=\pi((G,\mathcal{H}_{\_}k),i)$ and $f^{\prime}=\pi((G^{\prime},\mathcal{H}_{\_}k^{\prime}),\beta(i))$ .

Note that a structural protocol may depend on the full history which records even the order in which the choices have been played. Hereafter we assume all protocols to be structural.

Definition 3.5.

Let $G$ be a $\mathsf{WLC}$ -game and let $S$ and $S^{\prime}$ be stages of $G$ . Let $\sim$ (respectively, $\sim^{\prime}$ ) be the structural equivalence relation over $S$ (respectively, $S^{\prime}$ ). We say that $S$ and $S^{\prime}$ are automorphism-equivalent if $\sim\,=\,\sim^{\prime}$ . The stages $S$ and $S^{\prime}$ are structurally similar if one can be obtained from the other by a chain of renamings and automorphism-equvalences.

A choice $c$ in a stage $S$ is a focal point if it is not structurally equivalent to any other choice in that same stage $S$ , with the possible exception of choices $c^{\prime}$ that all belong to some single edge with $c$ . A focal point breaks symmetry and can be used for winning a repeated coordination game. This requires that the players have some (possibly prenegotiated) way to agree on which focal point to use. See the following example for an illustration.

Example 3.6.

Consider the first two rounds of the game $\mathsf{CM}_{\_}5$ , pictured below, where the players fail to coordinate by first selecting the pair $(a_{\_}1,b_{\_}2)$ and then fail again by selecting the pair $(b_{\_}1,c_{\_}2)$ .

The structural equivalence classes become modified in this scenario as follows:

•

Initially all choices are structurally equivalent.
•

After the first round, the equivalence classes are $\{a_{\_}1,b_{\_}2\}$ , $\{b_{\_}1,a_{\_}2\}$ and $\{c_{\_}1,d_{\_}1,e_{\_}1,c_{\_}2,d_{\_}2,e_{\_}2\}$ .
•

After the second round, the equivalence classes are $\{a_{\_}1\}$ , $\{a_{\_}2\}$ , $\{b_{\_}1\}$ , $\{b_{\_}2\}$ , $\{c_{\_}1\}$ , $\{c_{\_}2\}$ and $\{d_{\_}1,e_{\_}1,d_{\_}2,e_{\_}2\}$ .

There are no focal points in the initial stage $S_{\_}0$ and the same is true for the next stage $S_{\_}1$ . However, in the stage $S_{\_}2$ , all the choices $a_{\_}1,b_{\_}1,c_{\_}1,a_{\_}2,b_{\_}2,c_{\_}2$ become focal points, and the players can thus immediately guarantee coordination in the third round by selecting any winning pair of focal points, i.e., any of the pairs $(a_{\_}1,a_{\_}2)$ , $(b_{\_}1,b_{\_}2)$ , $(c_{\_}1,c_{\_}2)$ . (We note that, from the point of view of the general study of rational choice, it may not be obvious which of these pairs should be selected, so a convention may be needed to fix which protocol to use.) For another type of example on focal points, see Example A.3 in [14].

In repeated coordination games, it is natural to try to coordinate as quickly as possible. There are two principal scenarios related to optimizing coordination times: the average case and the worst case. The former concerns the expected number rounds for coordination and the latter the maximum number in which coordination can be guaranteed with certainty.

Definition 3.7.

Let $(G,\mathcal{H}_{\_}k)$ be a stage and let $\pi$ be a protocol. The one-shot coordination probability ( $\mathsf{OSCP}$ ) from $(G,\mathcal{H}_{\_}k)$ with $\pi$ is the probability of coordinating in a single round from $(G,\mathcal{H}_{\_}k)$ when each player follows $\pi$ . The expected coordination time ( $\mathsf{ECT}$ ) from $(G,\mathcal{H}_{\_}k)$ with $\pi$ is the expected value for the number of rounds until coordination from $(G,\mathcal{H}_{\_}k)$ when all players follow $\pi$ . The guaranteed coordination time ( $\mathsf{GCT}$ ) from $(G,\mathcal{H}_{\_}k)$ with $\pi$ is the number $n$ such that the players are guaranteed to coordinate from $(G,\mathcal{H}_{\_}k)$ in $n$ rounds, but not in $n-1$ rounds, when all players follow $\pi$ , if such a number exists. Otherwise this value is $\infty$ .

The $\mathsf{OSCP}$ , $\mathsf{ECT}$ and $\mathsf{GCT}$ from the initial stage $(G,\emptyset)$ with $\pi$ are referred to as the $\mathsf{OSCP}$ , $\mathsf{ECT}$ and $\mathsf{GCT}$ in $G$ with $\pi$ . We say that $\pi$ is $\mathsf{ECT}$ -optimal for $G$ if $\pi$ gives the minimum $\mathsf{ECT}$ in $G$ , i.e., the $\mathsf{ECT}$ given by any protocol $\pi^{\prime}$ is at least as large as the one given by $\pi$ . $\mathsf{GCT}$ -optimality of $\pi$ for $G$ is defined analogously.

It is possible that there are several different protocols giving the optimal $\mathsf{ECT}$ (or $\mathsf{GCT}$ ) for a given $\mathsf{WLC}$ -game. If two protocols $\pi_{\_}1$ and $\pi_{\_}2$ are both optimal, it may be that the optimal value is nevertheless not obtained when some of the players follow $\pi_{\_}1$ and the others $\pi_{\_}2$ . This leads to a meta-coordination problem about choosing the same optimal protocol to follow. However, such a problem will be avoided if there exists a unique optimal protocol.

Definition 3.8.

Let $\pi$ be a protocol and $G$ a $\mathsf{WLC}$ -game. We say that $\pi$ is uniquely $\mathsf{ECT}$ -optimal for $G$ if $\pi$ is $\mathsf{ECT}$ -optimal for $G$ and the following holds for all other protocols $\pi^{\prime}$ that are $\mathsf{ECT}$ -optimal for $G$ : for any stage $S$ in $G$ that is reachable with $\pi$ , we have $\pi^{\prime}(S)=\pi(S)$ . Unique $\mathsf{GCT}$ -optimality of $\pi$ for $G$ is defined analogously.²²2Note that if two different protocols are uniquely $\mathsf{ECT}$ -optimal for G (and similarly for unique $\mathsf{GCT}$ -optimality), then their behaviour on $G$ can differ only on stages that are not reachable in the first place by the protcols. Also, their behaviour can of course differ on games other than $G$ .

The next lemma states that two structurally similar stages are essentially the same stage with respect to different $\mathsf{ECT}$ s and $\mathsf{GCT}$ s. The proof is straightforward.

Lemma 3.9.

Assume stages $S$ and $S^{\prime}$ of $G$ are structurally similar. Now, for any protocol $\pi$ , there exists a protocol $\pi^{\prime}$ which gives the same $\mathsf{ECT}$ and $\mathsf{GCT}$ from $S^{\prime}$ as $\pi$ gives from $S$ .

4 Protocols for Repeated $\mathsf{WLC}$ -Games

In this section we introduce two special protocols, the loop avoidance protocol $\mathsf{LA}$ and the wait-or-move protocol $\mathsf{WM}$ . Informally, $\mathsf{LA}$ asserts that in every round, every player $i$ should avoid—if possible—all choices $c$ that could possibly make the resulting stage automorphism-equivalent (cf. Def. 3.5) to the current stage, i.e., the stage just before selecting $c$ .

Definition 4.1.

The loop avoidance protocol ( $\mathsf{LA}$ ) asserts that in every round, every player $i$ should avoid—if possible—all choices $c$ for which the following condition holds: if the player $i$ selects $c$ , then there exist choices for the other players so that the resulting stage is automorphism-equivalent to the current stage. If this condition holds for all choices of the player $i$ , then $i$ makes a random choice. Moreover, uniform probability is used among all the possible choices of $i$ .

It is easy to see that $\mathsf{LA}$ avoids, when possible, all such stages that are structurally similar to any earlier stage in the repeated play. As structurally similar stages are essentially identical (cf. Lemma 3.9), repetition of such stages can be seen as a “loop” in the repeated play. When trying to guarantee coordination as quickly as possible, such loops should be avoided. In addition to this heuristic justification, Theorems 5.1 and 5.2 give a fully compelling justification for $\mathsf{LA}$ when considering guaranteed coordination in two-player $\mathsf{CM}$ -games. For now, we present the following results (for the proofs, see the correspondingly numbered Propositions 4.2 and 4.3 in [14]); see also Example 4.4 below for an illustration of the use of $\mathsf{LA}$ .

Proposition 4.2.

$\mathsf{LA}$ is uniquely $\mathsf{ECT}$ -optimal and uniquely $\mathsf{GCT}$ -optimal in $\mathsf{CM}_{\_}3$ .

Proposition 4.3.

$\mathsf{LA}$ guarantees coordination in games $\mathsf{CM}_{\_}m$ in $\lceil m/2\rceil$ rounds when $m$ is odd, but $\mathsf{LA}$ does not guarantee coordination in $\mathsf{CM}_{\_}m$ for any even $m$ .

Example 4.4.

We illustrate the use of the $\mathsf{LA}$ protocol in the game $\mathsf{CM}_{\_}5$ , pictured below. Suppose that coordination fails in the first round. By symmetry, we may assume that the players selected $a_{\_}1$ and $b_{\_}2$ . Now, in the resulting stage $S_{\_}1$ , the structural equivalence classes are $\{a_{\_}1,b_{\_}2\}$ , $\{b_{\_}1,a_{\_}2\}$ and $\{c_{\_}1,d_{\_}1,e_{\_}1,c_{\_}2,d_{\_}2,e_{\_}2\}$ .

If the pair $(b_{\_}1,a_{\_}2)$ is selected in the next round, then the structural equivalence classes do not change and thus the resulting next stage is automorphism-equivalent to $S_{\_}1$ . Hence, by following $\mathsf{LA}$ , player 1 should avoid selecting $b_{\_}1$ and player 2 should avoid selecting $a_{\_}2$ . For the same reason, the players should also avoid selecting the choices $a_{\_}1$ and $b_{\_}2$ .

Hence, by following $\mathsf{LA}$ in $S_{\_}1$ , the players will select among the set $\{c_{\_}1,d_{\_}1,e_{\_}1,c_{\_}2,d_{\_}2,e_{\_}2\}$ with the uniform probability distribution. Supposing that they fail again in coordination, we may assume by symmetry that they selected the pair $(c_{\_}1,d_{\_}2)$ . The equivalence classes in the resulting stage $S_{\_}2$ are $\{a_{\_}1,b_{\_}2\}$ , $\{b_{\_}1,a_{\_}2\}$ , $\{c_{\_}1,d_{\_}2\}$ , $\{d_{\_}1,c_{\_}2\}$ and $\{e_{\_}1,e_{\_}2\}$ . Now, selecting any of the pairs $(a_{\_}1,b_{\_}2)$ , $(b_{\_}1,a_{\_}2)$ , $(c_{\_}1,d_{\_}2)$ and $(d_{\_}1,c_{\_}2)$ leads to a next stage which is automorphism-equivalent to $S_{\_}2$ . Thus, by following $\mathsf{LA}$ in $S_{\_}2$ , the players will select the pair $(e_{\_}1,e_{\_}2)$ . This leads to guaranteed coordination in the third round.

We next present the wait-or-move protocol $\mathsf{WM}$ , which naturally appears in numerous real-life two-player coordination scenarios. Informally, both players alternate (with equal probability) between two choices: the players own initial choice and another choice that coordinates with the initial choice of the other player.

Definition 4.5.

The wait-or-move protocol ( $\mathsf{WM}$ ) for repeated two-player $\mathsf{WLC}$ -games goes as follows: first randomly select any choice $c$ , and thereafter choose, with equal probability, $c$ or a choice $c^{\prime}$ that coordinates with the initial choice of the other player (thereby never picking other choices than $c$ and $c^{\prime}$ ).

Definition A.5 in [14] specifies $\mathsf{WM}$ in yet further detail. The following theorem shows that $\mathsf{WM}$ is very fast in relation to $\mathsf{ECT}$ s. This holds for all two-player $\mathsf{WLC}$ -games, not only choice matching games $\mathsf{CM}_{\_}m$ . (The claim is validated by the proof of Proposition 4.5 in [14].)

Theorem 4.6.

Let $G$ be a $\mathsf{WLC}$ -game with one-shot coordination probability $p$ when both players make their first choice randomly. Then the expected coordination time by $\mathsf{WM}$ is at most $3-2p$ . Thus the $\mathsf{ECT}$ with $\mathsf{WM}$ is strictly less than $3$ in every two-player $\mathsf{WLC}$ -game.

It follows from the proof of Theorem 4.6 that the $\mathsf{ECT}$ with $\mathsf{WM}$ is exactly $3-\frac{2}{m}$ in all choice matching games $\mathsf{CM}_{\_}m$ . Thus the last claim of the theorem cannot be improved, as the $\mathsf{ECT}$ s of the games $\mathsf{CM}_{\_}m$ grow asymptotically closer to the strict upper bound $3$ when $m$ is increased. In the particular case of $\mathsf{CM}_{\_}2$ , the $\mathsf{ECT}$ with $\mathsf{WM}$ is $3-\frac{2}{2}=2$ . Thus the following clearly holds.

Lemma 4.7.

When $S=(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ is a non-final stage with exactly two touched edges, then the $\mathsf{ECT}$ from $S$ with $\mathsf{WM}$ is 2. Moreover, in any $\mathsf{WLC}$ -game $G$ , if $S^{\prime}=(G,\mathcal{H}_{\_}k)$ is a non-final stage that is reachable by using $\mathsf{WM}$ , then the $\mathsf{ECT}$ from $S^{\prime}$ with $\mathsf{WM}$ is at most 2.

$\mathsf{WM}$ eventually leads to coordination with asymptotic probability $1$ in all two-player $\mathsf{WLC}$ -games. But it does not guarantee (with certainty) coordination in any number of rounds in $\mathsf{WLC}$ -games where the winning relation is not the total relation. In a typical real-life scenario, eternal non-coordination is of course impossible by $\mathsf{WM}$ , but it is conceivable, e.g., that two computing units using the very same pseudorandom number generator will never coordinate due to being synchronized to swap their choices in precisely the same rounds.

It is easy to show that $\mathsf{WM}$ is the unique protocol which gives the optimal $\mathsf{ECT}$ (namely, 2 rounds) in the “droitwich-scenario” of the game $\mathsf{CM}_{\_}2$ (see the proof of Proposition 4.8 in [14]).

Proposition 4.8.

$\mathsf{WM}$ is uniquely $\mathsf{ECT}$ -optimal in $\mathsf{CM}_{\_}2$ .

Next we compare the pros and cons of $\mathsf{LA}$ and $\mathsf{WM}$ in two-player $\mathsf{CM}$ -games. Recall that $\mathsf{WM}$ does not guarantee coordination in $\mathsf{CM}_{\_}m$ (when $m\not=1$ ), while $\mathsf{LA}$ does guarantee coordination in $\mathsf{CM}_{\_}m$ if and only if $m$ is odd. Concerning expected coordination times, it is easy to prove that $\mathsf{WM}$ gives a smaller $\mathsf{ECT}$ than $\mathsf{LA}$ in $\mathsf{CM}_{\_}m$ for all even $m$ (except for the case $m=2$ , where $\mathsf{WM}$ and $\mathsf{LA}$ behave identically). Thus we now restrict attention to the games $\mathsf{CM}_{\_}m$ with odd $m$ . Then, the probability of coordinating in the $\ell$ -th round of $\mathsf{CM}_{\_}m$ using $\mathsf{LA}$ , with $\ell\leq\lceil m/2\rceil$ , can relatively easily be seen to be calculable by the formula $P_{\_}{\ell,m}$ defined below (where the product is $1$ when $\ell=1$ ). And using the formula for $P_{\_}{\ell,m}$ , we also get a formula for the expected coordination time $E_{\_}m$ in $\mathsf{CM}_{\_}m$ with $\mathsf{LA}$ :

P_{\_}{\ell,m}=\frac{1}{m-2\ell+2}\prod\limits_{\_}{k=0}^{k\,=\,{\ell-2}}\frac{m-2k-1}{m-2k},\qquad E_{\_}m=\!\!\!\sum\limits_{\_}{\ell=1}^{\ell=\lceil m/2\rceil}\!\!\!\ell\cdot P_{\_}{\ell,m}.

Using this and Theorem 4.6, we can compare the $\mathsf{ECT}$ s in $\mathsf{CM}_{\_}m$ with $\mathsf{LA}$ and $\mathsf{WM}$ for odd $m$ (see the following table).

$m$	$\mathsf{ECT}$ in $\mathsf{CM}_{\_}m$ with $\mathsf{WM}$	$\mathsf{ECT}$ in $\mathsf{CM}_{\_}m$ with $\mathsf{LA}$
$1$	1	$1$
$3$	$2+\frac{1}{3}$	$1+\frac{2}{3}$
$5$	$2+\frac{3}{5}$	$2+\frac{1}{3}$
$7$	$2+\frac{5}{7}$	$3$
$9$	$2+\frac{7}{9}$	$3+\frac{2}{3}$

Especially the case $m=7$ is interesting, as the $\mathsf{ECT}$ with $\mathsf{LA}$ is exactly 3 which is precisely the strict upper bound for the $\mathsf{ECT}$ s with $\mathsf{WM}$ for the class of all two-player choice matching games $\mathsf{CM}_{\_}m$ . Furthermore, $m=7$ is the case where $\mathsf{WM}$ becomes faster than $\mathsf{LA}$ in relation to $\mathsf{ECT}$ s. Thus $\mathsf{WM}$ clearly stays faster than $\mathsf{LA}$ for all $m\geq 7$ , including even values of $m$ .

5 Optimizing Guaranteed Coordination Times

In this section we investigate when coordination can be guaranteed in two-player $\mathsf{CM}$ -games and which protocols give the optimal $\mathsf{GCT}$ for them. The following result is a direct consequence of the symmetries of $\mathsf{CM}_{\_}m$ for even $m$ (see the proof of Proposition 5.1 in [14]):

Theorem 5.1.

For all even $m\geq 2$ , there is no protocol guaranteeing coordination in $\mathsf{CM}_{\_}m$ .

We then consider choice matching games $\mathsf{CM}_{\_}m$ with an odd $m$ . Proposition 4.3 showed that the $\mathsf{GCT}$ with $\mathsf{LA}$ in these games is $\lceil m/2\rceil$ . The next theorem shows that this is the optimal $\mathsf{GCT}$ for $\mathsf{CM}_{\_}m$ , and moreover, $\mathsf{LA}$ is the unique protocol giving this $\mathsf{GCT}$ . The proof of Theorem 5.2 is an interesting exercise in the optimally fast elimination of automorphisms.

Theorem 5.2.

For any odd $m\geq 1$ , $\mathsf{LA}$ is uniquely $\mathsf{GCT}$ -optimal for $\mathsf{CM}_{\_}m$ .

Proof.

Let $m$ be odd. Recall that, by Proposition 4.3, the $\mathsf{GCT}$ in $\mathsf{CM}_{\_}m$ with $\mathsf{LA}$ is $\lceil m/2\rceil$ rounds. We assume, for contradiction, that there is some protocol $\pi\neq\mathsf{LA}$ that guarantees coordination in $\mathsf{CM}_{\_}m$ in at most $\lceil m/2\rceil$ of rounds, possibly less. As $\pi\neq\mathsf{LA}$ , there exists some play of $\mathsf{CM}_{\_}m$ where both players follow $\pi$ , and in some round, at least one of the players chooses a node on a touched edge. (Recall from the proof of Proposition 4.3 that $\mathsf{LA}$ never chooses from a touched edge in $\mathsf{CM}$ -game with an odd number of edges.) Now, let $S_{\_}\ell=(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}\ell)$ be the first stage of that play when this happens—so if $(c,c^{\prime})$ is the most recently recorded pair of choices in $S_{\_}\ell$ , then at least one of $c$ and $c^{\prime}$ is part of an edge that has already been touched in some earlier round. And furthermore, in all stages $S_{\_}{\ell^{\prime}}$ with $\ell^{\prime}<\ell$ , the most recently chosen pair does not contain a choice belonging to an edge that was touched in some yet earlier round $\ell^{\prime\prime}<\ell^{\prime}$ .

In the stage $S_{\_}{\ell-1}$ it therefore holds that for every choice profile $(c_{\_}i,d_{\_}i)$ , chosen in some round $i\leq(\ell-1)$ , the nodes $c_{\_}i$ and $d_{\_}i$ are structurally equivalent. Of course also the nodes of $S_{\_}{\ell-1}$ on so far untouched edges are structurally equivalent to each other. Furthermore, the number of already touched edges in $S_{\_}{\ell-1}$ is the even number $m^{\prime}=2(\ell-1)$ .

We will now show that $\pi$ does not guarantee a win in $\lceil m/2\rceil-(\ell-1)$ rounds when starting from the stage $S_{\_}{\ell-1}$ . This completes the proof, contradicting the assumption that $\pi$ guarantees a win in $\mathsf{CM}_{\_}m$ in at most $\lceil m/2\rceil$ rounds.

Now, recall the stage $S_{\_}\ell$ from above where $(c,c^{\prime})$ contained a choice from an already touched edge. By symmetry, we may assume that $c$ is such a choice. Starting from the stage $S_{\_}{\ell-1}$ , consider a newly defined stage $S_{\_}\ell^{\prime}$ where the first player again makes the choice $c$ but the other player this time makes a structurally equivalent choice $c^{*}\sim c$ . This is possible as $\pi$ is a structural protocol. Now note that the choice profile $(c,c^{*})$ is not winning since $c$ and $c^{*}$ are structurally equivalent choices from already touched edges, and thus either $(c,c^{*})$ is a choice profile that has already been chosen in some earlier round $j<\ell$ , or the nodes $d,d^{*}$ adjacent in $\mathsf{CM}_{\_}m$ to $c^{*},c$ (respectively) form a choice profile $(d,d^{*})$ chosen in some earlier round $j<\ell$ .

Therefore, in the freshly defined stage $S_{\_}\ell^{\prime}$ , the players have in every stage (including the stage $S_{\_}\ell^{\prime}$ itself) selected a choice profile that consists of two structurally equivalent choices. Both choices in the most recently selected choice profile in $S_{\_}\ell^{\prime}$ have been picked from edges that have become touched even earlier. It now suffices to show that it can still take $\lceil m/2\rceil-(\ell-1)$ rounds to finish the game. To see that this is the case, we shall next consider a play from the stage $S_{\_}\ell^{\prime}$ onwards where in each remaining round, the choice profile $(e,e^{*})$ picked by the players consists of structurally equivalent choices; such a play exists since $\pi$ is structural.

Due to picking only structurally equivalent choices in the remaining play, when choosing a profile from the already touched part, the players will clearly never coordinate. And when choosing from the untouched part, immediate coordination is guaranteed if and only if there is only one untouched edge left. Therefore the players coordinate exactly when they ultimately select from the last untouched edge. As the stage $S_{\_}{\ell}^{\prime}$ has precisely $m-2(\ell-1)$ untouched edges, winning in this play takes at least $\big{\lceil}\ \frac{m-2(\ell-1)}{2}\,\big{\rceil}\ =\ \lceil m/2\rceil-(\ell-1)$ rounds to win from $S_{\_}\ell^{\prime}$ . ∎

6 Optimizing Expected Coordination Times

In this section we investigate which protocols give the best $\mathsf{ECT}$ s for two-player choice matching games. We also investigate when the best $\mathsf{ECT}$ is obtained by a unique protocol. We already know by Propositions 4.8 and 4.2 that the optimal $\mathsf{ECT}$ s for $\mathsf{CM}_{\_}2$ and $\mathsf{CM}_{\_}3$ are uniquely given by $\mathsf{WM}$ and $\mathsf{LA}$ , respectively. Thus it remains to consider the games $\mathsf{CM}_{\_}m$ with $m\geq 4$ . We first cover the case $m\geq 6$ and show that then $\mathsf{WM}$ is the unique protocol giving the best $\mathsf{ECT}$ . The remaining special cases $m=4$ and $m=5$ will then be examined. The following auxiliary lemma (proven in [14]) will be used in the proofs.

Lemma 6.1.

The $\mathsf{ECT}$ from $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ with no focal point is at least $\frac{3}{2}$ with any protocol.

We are now ready to cover the case for choice matching games $\mathsf{CM}_{\_}m$ with $m\geq 6$ .

Theorem 6.2.

$\mathsf{WM}$ is uniquely $\mathsf{ECT}$ -optimal for each $\mathsf{CM}_{\_}m$ with $m\geq 6$ .

Proof.

We first present a formula for estimating the best $\mathsf{ECT}$ s in stages of $\mathsf{CM}_{\_}m$ (with any $m\geq 1$ ). Let $S:=(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ be a non-final stage with exactly two touched edges. Thus there are $n:=m-2$ untouched edges. Suppose the players use a protocol $\pi$ behaving as follows in round $k+1$ . Both players pick a choice from some touched edge with probability $p$ and from an untouched edge with probability $(1-p)$ . A uniform distribution is used on choices in both classes: probability $\frac{p}{2}$ for both choices on touched edges (which makes sense by Lemma B.2 of [14]) and probability $\frac{1-p}{n}$ for each choice on untouched edges (which is necessary with a structural protocol). If one player selects a choice $c$ from a touched edge and the other one a choice $c^{\prime}$ from an untouched edge, the players win in the next round by choosing the edge with $c^{\prime}$ . Note that $c^{\prime}$ is a focal point, so the winning edge can be chosen by a structural protocol with probability 1. (Also other focal points arise which could alternatively be used; cf. Example 3.6):

Suppose then that $E_{\_}1$ is the $\mathsf{ECT}$ with $\pi$ from a stage $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}{k+1})$ where both players have chosen a touched edge in round $k+1$ but failed to coordinate. Two different such stages $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}{k+1})$ exist, but they are automorphism-equivalent, so $\pi$ can give the same $\mathsf{ECT}$ from both of them by Lemma 3.9. (Indeed, if $\pi$ gave two different $\mathsf{ECT}$ s, it would make sense to adjust it to give the smaller one.) Similarly, suppose $E_{\_}2$ is the $\mathsf{ECT}$ with $\pi$ from a stage $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}{k+1}^{\prime})$ where both players have chosen an untouched edge in round $k+1$ but failed to coordinate. Note that all possible such stages $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}{k+1}^{\prime})$ are renamings of each other, so $\pi$ gives the same $\mathsf{ECT}$ from each one. We next establish that the expected coordination time from $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ with $\pi$ is now given by the following formula (called formula (E) below):

\displaystyle p^{2}\Bigl{(}\frac{1}{2}+\frac{1}{2}\bigl{(}1+E_{\_}1\bigr{)}\Bigr{)}\

\displaystyle+\ 2p(1-p)\cdot 2\ +\ (1-p)^{2}\,\Bigl{(}\dfrac{1}{n}+\dfrac{n-1}{n}\Bigl{(}1+E_{\_}2\Bigr{)}\Bigr{)}

(E)

Indeed, both players choose a touched edge in round $k+1$ with probability $p^{2}$ . In that case the $\mathsf{ECT}$ from $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}{k})$ is $\frac{1}{2}+\frac{1}{2}(1+E_{\_}1)$ , the first occurrence of $\frac{1}{2}$ corresponding to direct coordination and the remaining term covering the case where coordination fails at first. Both players choose an untouched edge in round $k+1$ with probability $(1-p)^{2}$ , and then the $\mathsf{ECT}$ from $(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ is $\frac{1}{n}+\frac{n-1}{n}(1+E_{\_}2).$ The remaining term $2p(1-p)\cdot 2$ is the contribution of the case where one player chooses a touched edge and the other player an untouched one. The probability for this is $2p(1-p)$ , and the remaining factor $2$ indicates that coordination immediately happens in the subsequent round $k+2$ using the focal point created in round $k+1$ .

We then present an informal argument sketch for the case $\mathsf{CM}_{\_}m$ with $m\geq 6$ . We may assume that $E_{\_}1\leq 2$ and $E_{\_}2\geq\frac{3}{2}$ by Lemmas 4.7 and 6.1. Figure 2 below illustrates the graph of (E) with $E_{\_}1=2$ , $E_{\_}2=\frac{3}{2}$ , $n=4$ , so then (E) has a unique minimum at $p=1$ when $p\in[0,1]$ . This suggests that—under these parameter values—the players should always choose a touched edge in stages with exactly two touched edges. Clearly, lowering $E_{\_}1$ , raising $E_{\_}2$ or raising $n$ should make it even more beneficial to choose a touched edge. As we indeed can assume that $E_{\_}1\leq 2$ and $E_{\_}2\geq\frac{3}{2}$ in $\mathsf{CM}_{\_}m$ for $m\geq 6$ , this informally justifies that $\mathsf{WM}$ is uniquely $\mathsf{ECT}$ -optimal in $\mathsf{CM}_{\_}m$ .

Next we formalize the argument above. Let $S:=(\mathsf{CM}_{\_}m,\mathcal{H}_{\_}k)$ , $m\geq 6$ , be a non-final stage with precisely two touched edges and $S^{\prime}$ a stage extending $S$ by one round where the players both choose an untouched edge but fail to coordinate. Let $r_{\_}1$ (respectively, $r_{\_}2$ ) be the infimum of all possible $\mathsf{ECT}$ s from $S$ (respectively, $S^{\prime}$ ) with different protocols. Note that by Lemma 3.9, $r_{\_}1$ and $r_{\_}2$ are independent of which particular representative stages we choose, as long as the stages satisfy the given constraints. Let $\epsilon>0$ and fix some numbers $E_{\_}1$ and $E_{\_}2$ such that $|E_{\_}1-r_{\_}1|<\epsilon$ and $|E_{\_}2-r_{\_}2|<\epsilon$ . We assume $E_{\_}1\leq 2$ and $E_{\_}2\geq\frac{3}{2}$ by Lemmas 4.7 and 6.1. It is easy to show that with such $E_{\_}1$ and $E_{\_}2$ , the minimum value of the formula (E) with $p\in[0,1]$ is obtained at $p=1$ (for any $n=m-2\geq 4$ ).

Thus, after the necessarily random choice in round one, the above reasoning shows the players should choose a touched edge with probability $p=1$ in each round. Indeed, assume that the earliest occasion that a protocol $\pi_{\_}k$ assigns $p\not=1$ occurs in round $k$ . Then, as shown above, the $\mathsf{ECT}$ of $\pi_{\_}k$ can be strictly improved by letting $p=1$ in that round. By Lemma B.2 of [14], a uniform probability over the touched choices should be used. ∎

Figure 2: Graph of (E) with (i)

n=4

E_{\_}1=2

E_{\_}2=\frac{3}{2}

; (ii)

n=3

E_{\_}1=\frac{3}{2}

E_{\_}2=1

; (iii)

n=2

E_{\_}1=E_{\_}2=2

We then cover the case for $\mathsf{CM}_{\_}5$ . The argument is similar to the case for $\mathsf{CM}_{\_}m$ with $m\geq 6$ , but this time leads to the use of $\mathsf{LA}$ instead of $\mathsf{WM}$ .

Theorem 6.3.

For $\mathsf{CM}_{\_}5$ , $\mathsf{LA}$ is uniquely $\mathsf{ECT}$ -optimal.

Proof.

Recall the formula (E) from the proof of Theorem 6.2. Let $S:=(\mathsf{CM}_{\_}5,\mathcal{H}_{\_}k)$ be a non-final stage with precisely two touched edges and $S^{\prime}$ a stage extending $S$ by one round where the players both choose an untouched edge but fail to coordinate. The $\mathsf{ECT}$ -optimal protocol from $S^{\prime}$ chooses the unique winning pair of focal points in round $k+2$ , so we now have $E_{\_}2=1$ . Let $r_{\_}1$ be the infimum of all possible $\mathsf{ECT}$ s from $S$ with different protocols. Let $\epsilon>0$ and fix some real number $E_{\_}1$ such that $|E_{\_}1-r_{\_}1|<\epsilon$ , assuming $E_{\_}1\geq\frac{3}{2}$ (cf. Lemma 6.1). It is straightforward to show that with these values, and with $n=3$ , the minimum of (E) when $p\in[0,1]$ is obtained at $p=0$ . (See also Figure 2 for the graph of (E) when $E_{\_}1=\frac{3}{2}$ for an illustration. Even then the figure suggests to choose an untouched edge.)

Thus, after the necessarily random choice in round one, the above reasoning shows that the players should choose an untouched edge with probability $1$ in the second round, thereby following $\mathsf{LA}$ . Coordination is guaranteed (latest) in the third round. ∎

In the last case $m=4$ , $\mathsf{WM}$ is $\mathsf{ECT}$ -optimal, but not uniquely, as there exist infinitely many other $\mathsf{ECT}$ -optimal protocols. The reason for this is that—as shown in Figure 2—the graph of (E) becomes the constant line with the value $2$ in the special case where $E_{\_}1=E_{\_}2=2$ , and then any $p\in[0,1]$ gives the optimal value for (E). A complete proof is given in [14].

Theorem 6.4.

$\mathsf{WM}$ is $\mathsf{ECT}$ -optimal for $\mathsf{CM}_{\_}4$ , but there are continuum many other protocols that are also $\mathsf{ECT}$ -optimal.

We have thus given a complete analysis of optimal $\mathsf{ECT}$ s and $\mathsf{GCT}$ s in two-player $\mathsf{CM}$ -games, summarized in Figure 1. Appendix D of [14] contains reflections on the results. We note that the cases for $\mathsf{CM}_{\_}m$ with small $m$ are exceptionally important from the point of view of applications, as such cases tend to occur more frequently in real-life scenarios.

7 The Most Difficult Two-Player $\mathsf{WLC}$ -Games

In this section we give a complete characterization of the upper bounds of optimal $\mathsf{ECT}$ s in $\mathsf{WLC}$ -games as a function of game size. For any $m\geq 1$ , an $m$ -choice game refers to any two-player $\mathsf{WLC}$ -game $G=(A,C_{\_}1,C_{\_}2,W_{\_}G)$ where $m=\max\{|C_{\_}1|,|C_{\_}2|\}$ . Note that with the classical matrix representation of an $m$ -choice game, the parameter $m$ corresponds to the largest dimension of the matrix. In this section we will also show that $\mathsf{CM}_{\_}m$ can be seen as the uniquely most difficult $m$ -choice game for all $m\neq 3$ , see Theorem 7.3.

Our first theorem shows that the wait-or-move protocol is reasonably “safe” to use in any $m$ -choice game with $m\not\in\{3,5\}$ as it always guarantees an $\mathsf{ECT}$ which is at most equal to the upper bound of optimal $\mathsf{ECT}$ s of all $m$ -choice games for the particular $m$ .

Theorem 7.1.

Let $m\not\in\{1,3,5\}$ and consider an $m$ -choice game $G=(A,C_{\_}1,C_{\_}2,W_{\_}G)\not=\mathsf{CM}_{\_}m$ . Then the $\mathsf{ECT}$ in $G$ with $\mathsf{WM}$ is strictly smaller than the optimal $\mathsf{ECT}$ in $\mathsf{CM}_{\_}m$ .

Proof.

By Theorems 6.2, 6.4 and Proposition 4.8, the optimal $\mathsf{ECT}$ in $\mathsf{CM}_{\_}m$ is given by $\mathsf{WM}$ . We saw in Section 4 that the $\mathsf{ECT}$ with $\mathsf{WM}$ is $3-\frac{2}{m}$ in $\mathsf{CM}_{\_}m$ and at most $3-2p$ in $G$ , where $p$ is the one-shot coordination probability when choosing randomly in $G$ . Since $G$ is an $m$ -choice game, $|W_{\_}G|\geq m$ . If $|W_{\_}G|>m$ , then $p>\frac{m}{m^{2}}=\frac{1}{m}$ . And if $|W_{\_}G|=m$ , we have $p=\frac{m}{mn}=\frac{1}{n}>\frac{1}{m}$ where $n:=\min\{|C_{\_}1|,|C_{\_}2|\}<m$ since $G\neq\mathsf{CM}_{\_}m$ . In both cases, we have $3-2p<3-\frac{2}{m}$ . ∎

The greatest optimal $\mathsf{ECT}$ among a class $\mathcal{G}$ of $\mathsf{WLC}$ -games is the value $r$ such that (1) $r$ is the optimal $\mathsf{ECT}$ for some $G\in\mathcal{G}$ ; and (2) for every $G\in\mathcal{G}$ , there is a protocol which gives it an $\mathsf{ECT}\leq r$ . By Theorem 7.1, the greatest optimal $\mathsf{ECT}$ among $m$ -choice games is given by $\mathsf{WM}$ in $\mathsf{CM}_{\_}m$ for $m\not\in\{1,3,5\}$ . The case $m=1$ is trivial, but for the special cases $m=3$ and $m=5$ , we need to perform a systematic graph-theoretic analysis of all such $m$ -choice games and their $\mathsf{ECT}$ s to identify the greatest optimal $\mathsf{ECT}$ among the class. This analysis is done in Appendix C of [14], but we sketch below the main ideas starting from the case $m=3$ .

Since the optimal $\mathsf{ECT}$ for all $3$ -choice games whose graph has a node with degree $3$ is trivially $1$ round, we may leave them out of the analysis. All the remaining $3$ -choice games (up to structural equivalence) are pictured in Figure 3. Except for $\mathsf{CM}_{\_}3$ and the last two games on the right, all of these games have a focal point and thus their optimal $\mathsf{ECT}$ is 1. The optimal $\mathsf{ECT}$ for the second game from the right is $1+\frac{1}{2}$ which is obtained by the protocol making a uniformly random choice every round. The optimal $\mathsf{ECT}$ of the rightmost game is

$\frac{1}{2}\left(1+\sqrt{4+\sqrt{17}}\right)$

which is indeed the greatest optimal $\mathsf{ECT}$ among all $3$ -choice games. (Moreover, this $\mathsf{ECT}$ is obtained by a protocol using highly nontrivial probability distributions for selecting choices.) See Appendix C.1 of [14] for the full details in each case.

Figure 3: Game graphs of all (nontrivial)

3

-choice games. The game on the right has the greatest optimal

\mathsf{ECT}

A similar—yet much more complex—analysis can be done for the case $m=5$ to show that $\mathsf{CM}_{\_}5$ has the greatest optimal $\mathsf{ECT}$ ( $2+\frac{1}{3}$ rounds) among all $5$ -choice games. (See Appendix C.2 of [14] for the full details of the analysis sketched here.) We first observe that if $|W_{\_}G|>8$ , then the $\mathsf{ECT}$ obtained by following $\mathsf{WM}$ in $G$ is $2+\frac{7}{25}<2+\frac{1}{3}$ . Next we analyze games $G$ with $|W_{\_}G|\leq 8$ based on the degrees of choices. If at least one of the players has a choice of degree at least $4$ , we can prove that either there exists a focal point in $G$ or it has the special form where two choices have degree $4$ and all the others have degree $1$ . In this special case we can obtain the $\mathsf{ECT}$ of $2$ rounds by giving the probability $\frac{1}{2}$ for the choices with degree $4$ in the first round and following $\mathsf{WM}$ thereafter.

Suppose next that the degree of all choices in $G$ is at most $3$ . If at least one of the players has a choice with degree $3$ , we can use graph-theoretic reasoning to show that there is a way to immediately guarantee coordination—unless both players have exactly one choice of degree $3$ , denoted by $c$ and $c^{\prime}$ . In this special case, if there is an edge between $c$ and $c^{\prime}$ , they are focal points. Else, $G$ has to be one of the following six games (up to structural equivalence).

Among these games, the leftmost one does not have a focal point unlike all the others, but for that game we can formulate a protocol which gives the $\mathsf{ECT}$ of $2$ rounds. Finally, supposing that the degrees of all choices in $G$ are at most $2$ , the graph of $G$ must consist of disjoint paths and cycles. Recalling the assumption $|W_{\_}G|\leq 8$ , there is a total of 28 such games (listed systematically in Appendix C.2 of [14]). Among those, only the following four do not have a focal point.

However, for the last three games above, we can formulate a protocol which gives them an $\mathsf{ECT}$ of $2$ rounds. Hence $\mathsf{CM}_{\_}5$ indeed has the greatest optimal $\mathsf{ECT}$ among all $5$ -choice games.

We argue that these graph-theoretic analyses are interesting for their own sake and demonstrate the usefulness of representing two-player $\mathsf{WLC}$ -games as bipartite graphs. By Theorem 7.1 and the analyses for the 3 and 5-choice cases, we obtain the following results.

Theorem 7.2.

For any $m$ , the greatest optimal $\mathsf{ECT}$ among $m$ -choice games is given below:

Game size	$m\in\mathbb{Z}_{\_}+\setminus\{3,5\}$	$m=5$	$m=3$
Greatest optimal $\mathsf{ECT}$	$3-\frac{2}{m}$	$2+\frac{1}{3}$	$\frac{1+\sqrt{4+\sqrt{17}}}{2}$ $(\approx 1.925)$

Theorem 7.3.

For $m\not=3$ , the greatest optimal $\mathsf{ECT}$ among m-choice games is uniquely realized by $\mathsf{CM}_{\_}m$ .

Hence choice matching games can indeed be seen as the most difficult two-player $\mathsf{WLC}$ -games—excluding the interesting and important special case of 3-choice games.

8 Conclusion

In this paper we gave a complete analysis for two-player $\mathsf{CM}$ -games with respect to both $\mathsf{GCT}$ s and $\mathsf{ECT}$ s, including uniqueness proofs for the related protocols. We also found optimal upper bounds for optimal $\mathsf{ECT}$ s for all two-player $\mathsf{WLC}$ -games when determined according to game size only. Moreover, our arguments demonstrate the usefulness of representing $\mathsf{WLC}$ -games as hypergraphs. The current paper concentrated on the two-player case as this already turned out a challenging question. A complete characterization of the $n$ -player case remains. This is expected to be a highly difficult task that is likely to require sophisticated arguments.

Acknowledgements

We thank Valentin Goranko, Lauri Hella and Kerkko Luosto for discussions on coordination games. Antti Kuusisto was supported by the Academy of Finland project Theory of computational logics, grants 324435 and 328987. Raine Rönnholm was supported by Jenny and Antti Wihuri Foundation.

References

[1]
[2] Douglas Adams & John Lloyd (1984): The Meaning of Liff. Crown Pub.
[3] Robert J. Aumann & Michael B. Maschler (1995): Repeated Games with Incomplete Information. MIT Press, 10.1007/978-1-4614-1800-9162.
[4] Gary Biglaiser (1994): Coordination in Games: A Survey. In James W. Friedman, editor: Problems of Coordination in Economic Activity, 35, Springer, Dordrecht, pp. 49–65, 10.1007/978-94-011-1398-43.
[5] Russell Cooper (1999): Coordination Games. Cambridge University Press, 10.1017/CBO9780511609428.
[6] Vincent P. Crawford (1995): Adaptive Dynamics in Coordination Games. Econometrica 63(1), pp. 103–43, 10.2307/2951699.
[7] Vincent P. Crawford & Hans Haller (1990): Learning how to cooperate: optimal play in repeated coordination games. Econometrica 58(3), pp. 571–595, 10.2307/2938191.
[8] Valentin Goranko, Antti Kuusisto & Raine Rönnholm (2017): Rational Coordination in Games with Enriched Representations. In Francesco Belardinelli & Estefania Argente, editors: Multi-Agent Systems and Agreement Technologies EUMAS 2017, LNCS 10767, Springer, pp. 323–338, 10.1007/978-3-030-01713-223.
[9] Valentin Goranko, Antti Kuusisto & Raine Rönnholm (2017): Rational coordination with no communication or conventions. In: Proceedings of LORI VI, LNCS 10455, Springer, pp. 33–48, 10.1007/978-3-662-55665-83.
[10] Valentin Goranko, Antti Kuusisto & Raine Rönnholm (2020): Gradual guaranteed coordination in repeated win-lose coordination games. In: Proceedings of ECAI 2020, Frontiers in Artificial Intelligence and Applications 325, pp. 115–122, 10.3233/FAIA200083.
[11] Valentin Goranko, Antti Kuusisto & Raine Rönnholm (2020): Rational coordination with no communication or conventions. J. Log. Comput. 30(6), pp. 1183–1211, 10.1093/logcom/exaa032.
[12] Sanjeev Goyal & Maarten Janssen (1996): Can we rationally learn to coordinate? Theory and Decision 40, pp. 29–49, 10.1007/BF00133159.
[13] Shmuel Zamir Jean-François Mertens, Sylvain Sorin (2015): Repeated Games. Econometric Society Monographs, Cambridge University Press, 10.1017/CBO9781139343275.
[14] Antti Kuusisto & Raine Rönnholm (2020): Optimal protocols for the most difficult repeated coordination games. CoRR abs/2004.07381v1.
[15] Roger Lagunoff & Akihiko Matsui (1997): Asynchronous Choice in Repeated Coordination Games. Econometrica 65, pp. 1467–1477, 10.2307/2171745.
[16] D. Lewis (1969): Convention, A Philosophical Study. Harvard University Press.
[17] George J. Mailath & Larry Samuelson (2006): Repeated Games and Reputations: Long-Run Relationships. Oxford University Press, 10.1093/acprof:oso/9780195300796.001.0001.
[18] Thomas Schelling (1960): The Strategy of Conflict. Harvard University Press, 10.1177/002200275800200301.

The Optimal Way to Play the Most Difficult Repeated Coordination Games

Abstract

1 Introduction

2 Preliminaries

Definition 2.1.

3 Repeated 𝖶𝖫𝖢\mathsf{WLC}-Games

Definition 3.1.

Definition 3.2.

Definition 3.3.

Definition 3.4.

Definition 3.5.

Example 3.6.

Definition 3.7.

Definition 3.8.

Lemma 3.9.

4 Protocols for Repeated 𝖶𝖫𝖢\mathsf{WLC}-Games

Definition 4.1.

Proposition 4.2.

Proposition 4.3.

Example 4.4.

Definition 4.5.

Theorem 4.6.

Lemma 4.7.

Proposition 4.8.

5 Optimizing Guaranteed Coordination Times

Theorem 5.1.

Theorem 5.2.

Proof.

6 Optimizing Expected Coordination Times

Lemma 6.1.

Theorem 6.2.

Proof.

Theorem 6.3.

Proof.

Theorem 6.4.

7 The Most Difficult Two-Player 𝖶𝖫𝖢\mathsf{WLC}-Games

Theorem 7.1.

Proof.

Theorem 7.2.

Theorem 7.3.

8 Conclusion

Acknowledgements

References

The Optimal Way to Play the Most Difficult
Repeated Coordination Games

3 Repeated $\mathsf{WLC}$ -Games

4 Protocols for Repeated $\mathsf{WLC}$ -Games

7 The Most Difficult Two-Player $\mathsf{WLC}$ -Games