A Time-Inconsistent Dynkin Game: from Intra-personal to Inter-personal Equilibria
Abstract
This paper studies a nonzero-sum Dynkin game in discrete time under non-exponential discounting. For both players, there are two levels of game-theoretic reasoning intertwined. First, each player looks for an intra-personal equilibrium among her current and future selves, so as to resolve time inconsistency triggered by non-exponential discounting. Next, given the other player’s chosen stopping policy, each player selects a best response among her intra-personal equilibria. A resulting inter-personal equilibrium is then a Nash equilibrium between the two players, each of whom employs her best intra-personal equilibrium with respect to the other player’s stopping policy. Under appropriate conditions, we show that an inter-personal equilibrium exists, based on concrete iterative procedures along with Zorn’s lemma. To illustrate our theoretic results, we investigate a two-player real options valuation problem: two firms negotiate a deal of cooperation to initiate a project jointly. By deriving inter-personal equilibria explicitly, we find that coercive power in negotiation depends crucially on the impatience levels of the two firms.
MSC (2020): 60J20, 91A05, 91A07, 03E75.
Keywords: Dynkin games, time inconsistency, non-exponential discounting, intra-personal equilibrium, inter-personal equilibrium, alternating fixed-point iterations.
1 Introduction
In dynamic optimization, time inconsistency is the self-conflicting situation where the same agent at different times (i.e. the current and future selves) cannot agree on a “dynamically optimal strategy” that is good for the entire planning horizon. A long-standing approach to resolving time inconsistency is Strotz’ consistent planning [41]: An agent should take her future selves’ disobedience into account, so as to find a strategy that none of her future selves will have an incentive to deviate from. Essentially, such a strategy is an intra-personal equilibrium—an equilibrium established internally within the agent, among her current and future selves.
The investigation of intra-personal equilibria, particularly their mathematical definitions and characterizations, has been the main focus of the literature on time inconsistency. This includes the classical framework in discrete time that relies on a straightforward backward sequential optimization detailed in [36], as well as the more recent development in continuous time that employs the spike variation technique introduced in Ekeland and Lazrak [15]. The latter has led to vibrant research on time-inconsistent stochastic control, including [17, 16, 7, 6, 45], among many others. Lately, marked progress has been made for time-inconsistent optimal stopping, along two different paths. One is to extend the spike variation technique from stochastic control to optimal stopping, as carried out in [14, 8, 9]. The other path is the iterative approach developed in [20, 21, 23], which circumvents spike variations via a fixed-point perspective. Let us also mention the recent work [4] which builds a connection between different concepts of equilibria in these two paths.
A natural question follows all the developments: How does the intra-personal reconciliation within one single agent integrate into the interaction among multiple (non-cooperative) agents? Intuitively, there should be two levels of game-theoretic reasoning—the inner level where each agent looks for time-consistent strategies her future selves will actually follow, and the outer level where each agent chooses her best strategy (among time-consistent ones) in response to other agents’ strategies. A resulting inter-personal equilibrium should then be a Nash equilibrium among all the agents, each of whom is restricted to choose time-consistent strategies (i.e. her intra-personal equilibria). To the best of our knowledge, such inter-personal equilibria built from intra-personal ones have not been properly formulated and studied in the literature. The crucial question is whether and how different agents’ respective intra-personal equilibria can ultimately forge an inter-personal equilibrium among all agents. This paper will shed new light on this through a time-inconsistent Dynkin game.
A Dynkin game involves two players interacting through their stopping strategies. The zero-sum version of the game, introduced in Dynkin [13], has been substantially studied along various directions, including [13, 34] (discrete time), [5, 28, 31] (continuous time), [44, 37, 43] (randomized strategies), [10] (non-Markovian settings), and [2] (model uncertainty), among others. Many of the studies not only show that a Nash equilibrium between the two players exists, but provide concrete constructions. By contrast, the nonzero-sum version of the game has received relatively less attention; see the early investigations [32, 35, 33] and more recent ones [19, 27, 11], among others. Remarkably, all the developments above assume that the two players optimize their expected payoff/cost under exponential discounting (including the case of no discounting), which readily ensures time consistency.
In this paper, we consider a nonzero-sum Dynkin game where the state process is a discrete-time strong Markov process taking values in a Polish space . Each player chooses to stop at the first entrance time of some Borel subset of , which will be called a stopping policy. The Dynkin game is in general time-inconsistent, as we allow the two players to take general discount functions that satisfy only a log sub-additive condition, i.e. (2.4) below. This condition captures decreasing impatience, a widely observed feature of empirical discounting, and readily covers numerous non-exponential discount functions in behavioral economics; see the discussion below (2.4).
As time inconsistency arises under non-exponential discounting, each player, when given the other’s chosen stopping policy , needs to find accordingly an intra-personal equilibrium among her current and future selves. Following the fixed-point approach in Huang and Nguyen-Huu [20], we define each player’s intra-personal equilibrium as a fixed point of an operator, which encodes the aforementioned inner level of game-theoretic reasoning (Definition 2.1). To achieve an inter-personal equilibrium, a minimal requirement is that each player should attain her inner-level equilibrium simultaneously—that is, the following situation should materialize: is Player 1’s intra-personal equilibrium given Player 2’s stopping policy , and is Player 2’s intra-personal equilibrium given Player 1’s stopping policy . In this case, we say is a soft inter-personal equilibrium (Definition 2.2). To further refine this “soft” definition, we note that each player, when following the Nash equilibrium idea, should not be satisfied with an arbitrary intra-personal equilibrium, but aim at the best one under an appropriate optimality criterion. Reminiscent of the “optimal equilibrium” concept proposed in Huang and Zhou [24, 25], we say that an intra-personal equilibrium is optimal if it generates larger values than any other intra-personal equilibrium, for not only the current but all future selves (Definition 2.3). This immediately brings about a stronger notion for an inter-personal equilibrium: is said to be a sharp inter-personal equilibrium if each player attains her best inner-level equilibrium simultaneously, i.e. is Player 1’s optimal intra-personal equilibrium given Player 2’s stopping policy , and is Player 2’s optimal intra-personal equilibrium given Player 1’s stopping policy (Definition 2.4).
The focus of this paper is to establish the existence of inter-personal equilibria, soft and sharp, through concrete iterative procedures. First, we develop for each player an individual iterative procedure, i.e. (3.4) below, that directly leads to her optimal intra-personal equilibrium (Theorem 3.1). This procedure can be viewed as an improvement to those in [20, 21], which lead to intra-personal equilibria but not necessarily the optimal ones. Next, we devise an alternating iterative procedure, i.e. (4.2) below, in which the two players take turns to perform the individual iterative procedure repetitively. In each iteration, one player, given the other’s stopping policy determined in the previous iteration, performs the individual iterative procedure and then updates her policy to the optimal intra-personal equilibrium obtained; see Section 2.1 for details. Under appropriate conditions, this alternating iterative procedure converges and the limit, denoted by , is guaranteed a soft inter-personal equilibrium (Theorem 4.1). While it is tempting to believe that is in fact sharp, in view of its structure revealed in Theorem 4.1, this is generally not the case: We demonstrate explicitly that is sharp in Example 4.1, but only soft in the slightly modified Example 4.2. In other words, the general existence of sharp inter-personal equilibria is still in question. Assuming additionally that the state process has transition densities, we are able to upgrade the construction of and apply Zorn’s lemma appropriately, which yields the desired result that a sharp inter-personal equilibrium must exist (Theorem 4.2).
It is worth noting that Theorems 4.1 and 4.2 hinge on a supermartingale condition, i.e. (3.13) below. As shown in Section 4.3, when the supermartingale condition fails, there may exist no inter-personal equilibrium, either soft or sharp; see Proposition 4.1 particularly. Let us point out that similar supermartingale conditions were also imposed in some studies on classical (time-consistent) nonzero-sum Dynkin games (e.g. [32, 35]) to facilitate the existence of a Nash equilibrium.
As an application, we study the negotiation between two firms (or countries) in Section 5. Suppose that each firm intends to coerce the other into unfavorable terms so as to obtain a larger payoff. A firm either waits until the other gives in and takes the larger payoff (i.e. its coercion works), or gives in to the other and accepts the unfavorable terms (i.e. its coercion fails). By computing explicitly the sharp inter-personal equilibrium between the two firms (Propositions 5.1 and 5.2, Corollary 5.1), we find that whether coercion in negotiation works depends on the impatience levels of the two firms: If a firm is less impatient than the other, its coercion always works; on the other hand, if a firm is significantly more impatient than the other, its coercion must fail. See particularly the discussion below Corollary 5.1 for details.
The rest of the paper is organized as follows. Section 2 introduces the model setup, formulates intra- and inter-personal equilibria, and collects preliminary results. Section 3 develops an individual iterative procedure that directly yields a player’s optimal intra-personal equilibrium. Under a supermartingale condition, the monotonicity of this procedure is also established. Section 4 devises an alternating iterative procedure, from which we prove the existence of soft and sharp inter-personal equilibria. Examples are presented to demonstrate the alternating iterative procedure and the necessity of the supermartingale condition. Finally, Section 5 applies our analysis to the negotiation between two firms, relating coercive power to impatience level.
2 The Model and Preliminaries
Let and consider a time-homogeneous strong Markov process taking values in a Polish space . We denote by the Borel -algebra of . On the path space , the set of all functions mapping to , let be the filtration generated by and be the set of all -stopping times. In addition, we consider . For any , we denote by the process with initial value , by the probability measure on generated by (i.e. the law of ), and by the expectation under .
Consider a nonzero-sum Dynkin game where the two players maximize their respective expected payoffs, determined jointly by their stopping strategies. Specifically, for , given the stopping time chosen by the other player, Player at the current state selects a stopping time to maximize her expected discounted payoff
(2.1) |
where
(2.2) |
Here, is Player ’s discount function, assumed to be strictly decreasing with , and are Player ’s payoff functions, assumed to be Borel measurable. Note that we allow to take the value . For any , we simply define . To ensure that in (2.1) is well-defined, we will impose throughout the paper
(2.3) |
As mentioned in Introduction, the vast literature on Dynkin games mostly assumes exponential discounting, i.e. for some . Empirical studies (e.g. [42, 29]), on the other hand, have found that individuals do not normally discount exponentially. In this paper, the only standing assumption on , , is
(2.4) |
This particularly captures decreasing impatience, a widely observed feature of empirical discounting. Numerous non-exponential discount functions in behavioral economics, such as hyperbolic, generalized hyperbolic, and pseudo-exponential discount functions, readily satisfy (2.4); see the discussion below [20, Assumption 3.12] for details.
In a one-player stopping problem, it is well-understood that non-exponential discounting induces time inconsistency: An optimal stopping strategy derived at the current state may no longer be optimal at a subsequent state . In other words, the current and future selves cannot agree on a “dynamically optimal stopping strategy” that is good for the entire planning horizon; see e.g. [20, Section 2.2] for an explicit demonstration. Strotz’ consistent planning [41] is a long-standing approach to resolving time inconsistency: Knowing that her future selves may overturn her current plan, an agent selects the best present action taking the future disobedience as a constraint; the resulting strategy is a (subgame perfect) Nash equilibrium from which no future self has an incentive to deviate.
In our Dynkin game, thanks to the time-homogeneous Markovian setup, we assume that each player decides to stop or to continue depending on her current state . That is, each player stops at the first entrance time of some , defined by
For convenience, we will often call a stopping policy. This corresponds to a “pure strategy” in economic terms.111See Remark 4.3 for discussions on the use of pure and randomized strategies. For , given the other player’s stopping policy , Player is faced with time inconsistency among her current and future selves (as explained above), and needs to find an equilibrium stopping policy at the intra-personal level. Following [25, Section 2.1] (or [20, Section 3.1]), Strotz’ consistent planning boils down to the current self’s game-theoretic reasoning: “Given that my future selves will follow the policy , what is the best policy today in response to that?” The best policy is determined by comparing the payoff of immediate stopping and the payoff of continuation , where
is the first hitting time to . This leads to the following stopping policy
(2.5) |
We can consider as an improving operator for Player : Given the other player’s stopping policy , improves the present policy of Player to .
For each player, we define an equilibrium at the intra-personal level (i.e. among the player’s current and future selves) in the same spirit as [25, Definition 2.2] and [20, Definition 3.7].
Definition 2.1.
For , is called Player ’s intra-personal equilibrium w.r.t. (with respect to) if . We denote by the set of all Player ’s intra-personal equilibria w.r.t. .
Remark 2.1.
The above fixed-point definition of an intra-personal equilibrium was introduced in [20] and followed by [21, 25, 23], among others. Note that there is a slightly different formulation in [24]: If we follow [24], particularly (2.5) therein, in (2.5) needs to be modified as
(2.6) |
Observe from (2.5) and (2.6) that the equilibrium condition “” in Definition 2.1 is slightly weaker than “” as in [24, Definition 2.3]. This paper uses the slightly weaker definition because it conforms more closely to the Nash equilibrium idea—one deviates to a new policy only when it is strictly better than the current one; see the explanations at the beginning of [20, p.7]. Moreover, the weaker definition facilitates the search for intra-personal equilibria, as it allows for the explicit construction in Proposition 3.1 below.
Based on Definition 2.1, we introduce the first kind of equilibria at the inter-personal level (i.e. between the two players)—the soft inter-personal equilibria.
Definition 2.2.
We say is a soft inter-personal equilibrium (for the Dynkin game) if and (i.e. and ). We denote by the set of all soft inter-personal equilibria.
Essentially, means that each player simultaneously attains an equilibrium at the intra-personal level, given the other’s stopping policy: is Player ’s intra-personal equilibrium w.r.t. Player 2’s policy , and is Player ’s intra-personal equilibrium w.r.t. Player 1’s policy .
As emphasized in [24, 25], Strotz’ consistent planning is a two-phase procedure: An agent first determines the strategies that she will actually follow over time (Phase I), and then chooses the best one among them (Phase II). In our Dynkin game, Phase I amounts to each player finding her intra-personal equilibria (w.r.t. the other player’s stopping policy); Phase II is then the search for an optimal intra-personal equilibrium, defined as below.
Definition 2.3.
For and , the value function associated with is defined by
We say is Player ’s optimal intra-personal equilibrium w.r.t. if for any ,
We denote by the set of all Player ’s optimal intra-personal equilibria w.r.t. .
Remark 2.2.
Thanks to , defined above coincides with . Indeed, by (due to ) and (2.5), for and for . That is, for all .
Definition 2.3 follows the “optimal equilibrium” notion introduced in [24]. It is a rather strong optimality criterion, as it requires a (subgame perfect Nash) equilibrium to dominate any other equilibrium on the entire state space—a rare occurrence in game theory. Nonetheless, for the one-player optimal stopping problem under non-exponential discounting, as long as the discount function satisfies (2.4), the existence of an optimal equilibrium has been established first in discrete time [24] and then in continuous time, including [25, 22] (diffusion models) and [4] (continuous-time Markov chain models).
Based on Definition 2.3, we introduce the second kind of equilibria at the inter-personal level (i.e. between the two players)—the sharp inter-personal equilibria.
Definition 2.4.
We say is a sharp inter-personal equilibrium (for the Dynkin game) if and . We denote by the set of all sharp inter-personal equilibria.
A sharp inter-personal equilibrium, compared with a soft one in Definition 2.2, conforms to the Nash equilibrium concept more closely. Given the other player’s policy, what a player aims at should not be an arbitrary agreement among her current and future selves (as is stipulated in Definition 2.2), but the agreement that is best-rewarding—the one that generates the largest possible value for every incarnation of herself in time. In other words, our time-inconsistent Dynkin game involves two levels of game-theoretic reasoning. Player 1 wants to find the best response to Player 2’s policy at the inter-personal level, while maintaining an agreement among her current and future selves at the intra-personal level; Player 2 does the same in response to Player 1’s policy. In the end, each player chooses an optimal intra-personal equilibrium w.r.t. the other player’s policy, leading to a sharp inter-personal equilibrium for the Dynkin game.
It is worth noting that our definition of a sharp inter-personal equilibrium covers, as a special case, the standard Nash equilibrium in a time-consistent Dynkin game.
Remark 2.3.
In the time-consistent case of exponential discounting, a Nash equilibrium for the Dynkin game is defined as a tuple of stopping times such that is Player 1’s optimal stopping time given that Player 2 employs , while , at the same time, is Player 2’s optimal stopping time given that Player 1 employs . In a time-homogeneous setting, let (resp. ) denote the stopping region associated with (resp. ). In view of [20, Proposition 3.11], is readily Player 1’s intra-personal equilibrium w.r.t. . Moreover, by the argument in [22, Remark 2.12], is in fact Player 1’s optimal intra-personal equilibria w.r.t. —namely, . By the same token, we have . It then follows that is a sharp inter-personal equilibrium.
That is to say, in the classical time-consistent case, a Nash equilibrium in a time-homogeneous model is automatically a sharp inter-personal equilibrium, once we re-state using their respective stopping regions.
2.1 Problem Formulation
This paper aims to establish the existence of soft and sharp inter-personal equilibria, using concrete iterative procedures. Although the existence and construction of each player’s intra-personal equilibria is well-understood (based on the one-player results [20, 24]), it is unclear whether the two players’ respective intra-personal equilibria can be coordinated properly to form an inter-personal equilibrium, either soft or sharp.
We will tackle this in two steps. First, we look into the one-player problem more closely, developing for each player an individual iterative procedure that directly brings about her optimal intra-personal equilibrium (Theorem 3.1). Next, we devise an alternating iterative procedure in which the two players take turns to perform the individual iterative procedure:
-
1.
With respect to Player 1’s initial policy , Player 2 performs the individual iterative procedure to get an optimal intra-personal equilibrium .
-
2.
With respect to Player 2’s policy , Player 1 performs the individual iterative procedure to get an optimal intra-personal equilibrium .
-
3.
With respect to Player 1’s policy , Player 2 performs the individual iterative procedure to get an optimal intra-personal equilibrium .
- ⋮
The hope is that this alternating iterative procedure will ultimately converge, with the limit being a soft, or even sharp, inter-personal equilibrium. This will be investigated in detail in Section 4, with affirmative results established in Theorems 4.1 and 4.2.
2.2 Preliminaries
We collect two technical results that will be useful throughout the paper. The first one concerns the convergence of first entrance and hitting times.
Lemma 2.1.
Let be a monotone sequence in . For any , there exists such that for all , where
(2.7) |
The same result holds with replaced by .
Proof.
Fix . If is nondecreasing, set . Without loss of generality, assume . As , there exists such that for all . Hence, for all . If there exists such that , then , a contradiction. We thus conclude for all . On the other hand, if is nonincreasing, set . Without loss of generality, assume . Then, there exists such that for all . Hence, for all and thus . This implies . Since by definition, we conclude for all . The same arguments as above hold with replaced by . ∎
The next result states that any stopping policy containing an intra-personal equilibrium must be dominated by . This kind of result was first established for one-player stopping problems in [25, Lemma 3.1], and is now extended to a Dynkin game setting.
Lemma 2.2.
Fix and assume . Then, for any , with and for some ,
Proof.
Consider and . For any , by (2.1) and (2.2),
(2.8) |
By the assumption ,
(2.9) |
On the other hand,
(2.10) |
In view of (2.2), (2.4), and the nonnegativity of , and ,
(2.11) |
where the equality follows from the strong Markov property of . Thanks to the above inequality and the fact that , (2.10) implies
(2.12) |
On the set , we deduce from that . Since , (2.5) implies . Then, (2.12) yields . On strength of this and (2.9), we conclude from (2.8) that . ∎
There is an intriguing message here—a smaller intra-personal equilibrium is more rewarding. Indeed, for any with , Lemma 2.2 asserts for all . This “ranking by size” will play a crucial role in Theorem 3.1 below, where an optimal intra-personal equilibrium is derived. Economically, this ranking simply reflects a player’s decreasing impatience, which is captured by (2.4); see the detailed discussion below [25, Corollary 3.2].
3 The One-Player Analysis
In this section, we first focus on developing an iterative procedure for each player that directly leads to her intra-personal equilibrium. Next, under a supermartingale condition, we will establish the monotonicity of this iterative procedure.
For each , we introduce, for any fixed , the operator defined by
(3.1) |
where
(3.2) |
We first note that converges desirably along nondecreasing sequences of stopping policies.
Lemma 3.1.
Fix . Let be a nondecreasing sequence in . Then, for any , for all , with .
Proof.
Fix . By the definition of in (3.2), . To show the converse inequality, let with be a -optimizer of , for each . Then,
(3.3) |
For each , Lemma 2.1 asserts the existence of such that for all . In particular, for all . This, together with (2.3), indicates that we may apply the dominated convergence theorem to show that the expectation in (3.3) converges to . Hence, we conclude from (3.3) that . ∎
For any fixed , we perform an iterative procedure by starting with the empty set and applying the operator repetitively. This will give an intra-personal equilibrium for Player .
Proposition 3.1.
Fix and assume . For any , let be a nondecreasing sequence in defined by
(3.4) |
where is defined as in (3.1). Then,
(3.5) |
Moreover, for all ; in particular, .
Proof.
Fix . If , by (3.1) we have , which implies . If , since is by definition nondecreasing, there must exist such that . Thanks again to (3.1), this yields , which implies . Hence, we conclude .
It remains to show the converse inclusion. Fix . We claim that , i.e.
(3.6) |
Assume to the contrary that (3.6) fails, so that
(3.7) |
Consider
(3.8) |
As , we can take such that
(3.9) |
Observe that we must have , and thus -a.s. Indeed, if , then , which contradicts (3.9). Moreover, since implies for all , we deduce from (3.1) that for all . By Lemma 3.1, this implies
(3.10) |
Let with be a -optimizer of . Consider the sets
(3.11) |
By (2.4) and the nonnegativity of , , and , we can argue as in (2.11) to get
Thanks to this and the fact that , (3.11) yields
where the second inequality follows from on and the definition of in (3.7), and the third inequality is due to on , the definition of in (3.8), and by definition. The above inequality is clearly a contradiction, and we thus conclude that (3.6) holds. That is, we have shown that , or simply . This brings the final conclusion that , i.e. .
Remark 3.1.
We will go one step further to claim that in (3.5) is in fact an optimal intra-personal equilibrium for Player . To this end, we need the following auxiliary result: Being included in an intra-personal equilibrium is an invariant relation under the operator .
Lemma 3.2.
Fix , , and . For any with , .
Proof.
Now, we are ready to present the main result of this section.
Theorem 3.1.
Fix and assume . For any , defined in (3.5) belongs to .
Proof.
3.1 Monotonicity with respect to
So far, we have fixed (the other player’s stopping policy) and constructed a corresponding optimal intra-personal equilibrium in (3.5). By viewing as a variable, we will show that the map is monotone under appropriate conditions.
Lemma 3.3.
Fix . Assume and that
(3.13) |
Then, for any with ,
(3.14) |
Hence, for all and . Moreover, we have
(3.15) |
Proof.
Given and , consider and Observe that
where the first inequality is due to and the last inequality follows from being a supermartingale. By the above inequality and
thanks to , we conclude .
Next, fix with . For any , if , then by definition. Hence, we assume in the following. With , (3.1) gives
(3.16) |
where the last inequality follows from (3.14) and (3.2). Note that we must have . Indeed, if , then -a.s. and thus
which contradicts the first inequality in (3.16). With and thus -a.s.,
where the last inequality follows from (3.16). This, together with , yields . We therefore conclude . ∎
Proof.
The monotonicity of , , will play a crucial role in Theorem 4.1 below, contributing to the convergence of an alternating iterative procedure performed jointly by Players 1 and 2.
4 The Existence of Inter-Personal Equilibria
In this section, we will design an alternating iterative procedure, to be performed jointly by the two players. As shown in Theorem 4.1 below, this procedure converges to a soft inter-personal equilibrium that is almost sharp. By a probabilistic modification of this iterative procedure and an appropriate use of Zorn’s lemma, we establish the existence of sharp inter-personal equilibria in Theorem 4.2 below. Explicit examples will be presented to illustrate this iterative procedure and the necessity of a supermartingale condition.
First, we observe that Theorem 3.1 already provides a sufficient condition for the existence of a sharp inter-personal equilibrium.
Lemma 4.1.
For each , assume . If satisfies
(4.1) |
then .
Proof.
By Theorem 3.1, and , so that . ∎
4.1 Construction of Soft Inter-Personal Equilibira
In order to achieve (4.1), we let the two players take turns to perform the individual iterative procedure (3.4). As the next result shows, such alternating iterations do converge, and the limit is guaranteed a soft inter-personal equilibrium.
Theorem 4.1.
For each , assume and (3.13). Let be a sequence in defined by and
(4.2) |
Then, is nondecreasing and is nonincreasing. By taking and , we have with
(4.3) |
Proof.
As , applying Corollary 3.1 for Player 2 implies . With , applying Corollary 3.1 for Player 1 implies . Again, by , applying Corollary 3.1 for Player 2 gives . Repeating this procedure for Players 1 and 2 recursively, we have nondecreasing and is nonincreasing.
Next, let us show that . Fix . There exists such that for all . By the fact that (thanks to Proposition 3.1), for all . As , (2.3) allows us to use the dominated convergence theorem, so that we may conclude from Lemma 2.1 that
(4.4) |
which implies . Hence, . On the other hand, for any , for all . Thanks again to the fact that , indicates for all . As , we can argue as in (4.4) to get , which implies . Hence, , or . We thus conclude , i.e. . Similar arguments as above yield , i.e. . This readily shows that .
The next example illustrates the alternating iterative procedure (4.2) explicitly.
Example 4.1.
Let contain countably many states, i.e. , and assume
(4.5) |
Take such that
(4.6) |
Additionally, take and consider the following payoff functions for the two players
(4.7) | |||
(4.8) |
while (resp. ) is allowed to be any function such that (resp. ) on .
For small enough, we claim that the alternating iterative procedure (4.2) gives rise to
(4.9) | ||||||
First, starting with , we deduce from (4.8) that for any ,
(4.10) |
This implies and , so that . Next, thanks to (4.5) and (4.7), for any ,
(4.11) |
where the inequality “” holds as is small enough. This implies and , so that . Now, thanks to (4.5) and (4.8), for any such that ,
(4.12) |
where we use (4.6) in the last two lines. This implies and , so that . By similar arguments as above, we can derive and in (4.9) for all . By Theorem 4.1, is a soft inter-personal equilibrium and satisfies (4.3). Observe that for any ,
It follows that , so that . That is, we have a stronger version of (4.3) where the inclusion therein is equality. Hence, by Lemma 4.1, is in fact a sharp inter-personal equilibrium.
In view of (4.3) and Lemma 4.1, the soft inter-personal equilibrium constructed in Theorem 4.1 is nearly a sharp one. It is natural to ask whether the inclusion in (4.3) is actually equality (as in Example 4.1), so that is sharp in general. The next example shows that this is generally not the case: the inclusion in (4.3) can be strict and may fail to be sharp.
Example 4.2.
Let us extend the state space in Example 4.1 by including two additional states, i.e. . The transition probabilities are specified as in (4.5), as well as
(4.13) |
Take such that (4.6) holds. Assume additionally that satisfies . Take and define and , , as in (4.7)-(4.8) on , along with
(4.14) |
while and are allowed to take arbitrary nonnegative values on as long as . Also, (resp. ) is allowed to be any function such that (resp. ) on .
For small enough, we claim that the alternating iterative procedure (4.2) gives rise to
(4.15) | ||||||
First, note that the relations (4.10), (4.11), and (4.12) remain true in our current setting. Now, starting with , we deduce from (4.13), (4.8), and (4.14) that for any ,
These two inequalities, along with (4.10), imply and , so that . Next, since , for any , for . This, together with (4.11), implies that as is small enough, and , so that . Thanks to (4.13), (4.5), (4.8), and (4.14), for any such that ,
where the last inequality holds as is small enough, thanks to (4.14). The above two inequalities, along with (4.12), imply and , so that . By similar arguments as above, we can derive and in (4.15) for all . Hence, .
Now, it can be easily checked that for all . Moreover, due to (4.13),
In view of (3.1), we conclude . By (3.5), this in turn implies That is, the inclusion in (4.3) is strict, so that we can no longer conclude from Lemma 4.1 that is a sharp inter-personal equilibrium. In fact, is not sharp. Recall from Theorem 3.1 that . Then, it can be checked directly that but . Specifically, is strictly dominated by at the state , as (4.14) indicates
As , is not a sharp inter-personal equilibrium.
4.2 General Existence of Sharp Inter-Personal Equilibria
In view of Example 4.2, the soft inter-personal equilibrium constructed in Theorem 4.1 may not be a sharp one. That is to say, the general existence of a sharp inter-personal equilibrium is still in question. To resolve this, we impose appropriate regularity of .
Assumption 4.1.
has transition densities with respect to a measure on . That is, for each , is a Borel measurable function such that
Remark 4.1.
When is at most countable, Assumption 4.1 is trivially satisfied. When is uncountable, the literature is focused on the case for some . In this case, many discrete-time Markov processes fulfill Assumption 4.1 (with being the Lebesgue measure). This includes, particularly, defined by the formula , , where is a Borel measurable function and is a random variable independent of such that admits a probability density function for all . This formula is commonly used in practical simulation of Markov processes; see e.g. [38, Section 3] and [1, Section 5].
The next result, as a direct consequence of [26, Lemma 6.5], will also play a crucial role.
Lemma 4.2.
Let be a measure on . For any , there exists a maximal Borel minorant of under , defined as a set with such that for any with , .
Now, we are ready to present the general existence of a sharp inter-personal equilibrium.
Theorem 4.2.
Proof.
Consider the set
By Theorem 4.1, . Now, define a partial order on as follows: for any ,
(4.16) |
Step 1: Showing that every totally ordered subset of has an upper bound in .
Let be a subset of that is totally ordered. Set and .
Recall the measure in Assumption 4.1. By Lemma 4.2, there exists a maximal Borel minorant of under , which will be denoted by . For any with , since , we deduce from Assumption 4.1 that for all and . It follows that
(4.17) |
On the other hand, observe that
(4.18) |
Indeed, for any with , by Corollary 3.1 and the definition of , for all , which implies . Similarly, for any with , by Corollary 3.1 and the definition of , for all , which implies .
Now, define
Note that the first inclusion follows from , , and (4.18). As implies , we deduce from , , and (4.18) that the second inclusion above holds. With (thanks to ) and , (4.17) gives for all . This readily implies
(4.19) |
We claim that . By the definition of in (3.5), it suffices to show that for all . First, as , the last assertion of Proposition 3.1 implies . With , we obtain
(4.20) |
Now, for , (3.15) implies . For any , due to (by Proposition 3.1) and (4.20), we must have . Observe that
(4.21) |
where the first and third equalities follow from (4.19) and the inequality stems from the definition of in (3.1). This shows that . Hence, we obtain and thus conclude . Suppose that for some . By (3.15) again, . Fix . By using Proposition 3.1 and (4.19) as above, we get . If , then trivially, by the definition of in (3.4). If , then by the definition of ,
By (4.19) and the above inequality, we may argue similarly as in (4.21) to get
where the second equality is due to . It follows that . Hence, we obtain and thus conclude . By induction, we have established for all , as desired.
By Corollary 3.1 and ,
Now, by defining and for all , we can follow the same argument in the proof of Theorem 4.1 to show that is nondecreasing, is nonincreasing, and with and . By construction, and for all . Hence, is an upper bound for .
Step 2: Applying Zorn’s lemma.
As every totally ordered subset of is shown to have an upper bound in , Zorn’s lemma implies that there exists a maximal element in , denoted by . We claim that . Set , , and define
Thanks to and (as ), we may apply Corollary 3.1 recursively to show that is nondecreasing and is nonincreasing. Then, by the same argument in the proof of Theorem 4.1, we obtain with and . By construction, and . But since is a maximal element of (under the partial order (4.16)), we must have and . This in particular implies and , so that
By Lemma 4.1, this readily implies . ∎
Remark 4.2.
In view of (2.1)-(2.2), the condition (in Theorems 4.1 and 4.2) encourages each player to wait/continue until the other player stops, so as to obtain a larger reward. Consequently, each player faces the tradeoff between the potential (generous) gain from outlasting the other player and the cost of waiting that enlarges with time (due to discounting and possible loss of opportunity). That is, our Dynkin game exemplifies the “war of attrition” in game theory. The negotiation example in Section 5 below well demonstrates this “war”: Each firm intends to wait until the other firm gives in so as to seal the best deal, while subject to the impact of discounting and the varying cost of project initiation.
Remark 4.3.
In a classical (time-consistent) nonzero-sum Dynkin game, the condition ensures that a Nash equilibrium, as a tuple of pure stopping times , exists; see e.g., [19]. Without the condition , a Nash equilibrium need not exist, as shown in [27]. One needs to consider randomized strategies to possibly establish the existence of a Nash equilibrium, as a tuple of randomized strategies. Still, in some cases, only an -Nash equilibrium is known to exist; see e.g., [39, 18, 27].
In this paper, as we assume (cf. Theorems 4.1 and 4.2), our focus on pure strategies is consistent with the literature. If we drop the condition , many arguments will no longer hold and we expect the use of randomized strategies indispensable. Randomized strategies for time-inconsistent stopping problems have recently been proposed and analyzed by [3] in discrete time and by [9] in continuous time. It is of interest as future research to modify their definitions and allow for randomized strategies in our Dynkin game.
4.3 Discussion on the Supermartingale Condition
It is worth noting that while the supermartingale condition (3.13) is required in Theorems 4.1 and 4.2, it does not play a role in Theorem 3.1. Because the one-player iterative procedure (3.4) is by construction monotone, it converges without the need of any other condition. It is much more complicated for the two-player alternating iterative procedure (4.2) to converge. The monotonicity of (3.4) only ensures that each iteration (performed by one of the two players) converges to a stopping policy, but says nothing about whether the two resulting sequences of policies (one sequence for each player) will actually converge. It is the supermartingale condition (3.13) that brings about the monotonicity for these two sequences of policies (on strength of Corollary 3.1), leading to an inter-personal equilibrium between the two players.
When (3.13) fails, the monotonicity in Corollary 3.1 no longer holds in general, and there may exist no inter-personal equilibrium, soft or sharp. To demonstrate this, consider a three-state model
(4.22) |
Given , define the payoff functions by
(4.23) | ||||||||||
and
(4.24) |
Proposition 4.1.
Proof.
Let for all . Then,
where the inequality holds as is large enough. This readily shows that (3.13) is violated.
For any and , we deduce from the definitions of , , , , that for each ,
(4.25) |
Moreover,
(4.26) |
By (4.25) and (4.26), it can be checked that
(4.27) |
This readily shows that there exists no such that and , i.e. there exists no soft inter-personal equilibrium.
In the following, we will show the derivation of in detail, while all other identities in (4.27) can be proved in a similar manner. It follows directly from the definitions of and that and . Hence, with large enough,
which implies . Also, as , we can similarly conclude that with large enough,
which implies . Note that
(4.28) |
By taking , , and in (4.28), we immediately see that , , and . We therefore conclude , , and . Now, observe that and are both of the form with (recall (4.26)). Thus, with large enough,
(4.29) | ||||
(4.30) |
This, together with (4.28), implies . Since (resp. ) is also of the form , the inequality in (4.30) indicates that with large enough, (resp. ). Hence, we conclude and . In view of the above derivations, is the only intra-personal equilibrium for Player 1 w.r.t. Player 2’s policy , i.e. . ∎
Remark 4.4.
Remark 4.5.
Another way to interpret (4.31) is that the alternating iterative procedure (4.2) will never converge, failing to provide any soft inter-personal equilibrium. Specifically, (4.31) indicates that the alternating iterations will always lead to loops, as listed below where Player 1’s stopping policies are underlined and Player 2’s stopping policies are double underlined.
1. .
2. .
3. .
4. .
5. .
6. .
7. .
8. .
Remark 4.6.
As is not specified in Proposition 4.1, the result admits an interesting implication for classical (time-consistent) nonzero-sum Dynkin games. To see this, let be an exponential discount function so that there is no time inconsistency. Proposition 4.1 shows that even when the state process is time-homogeneous and payoff functions are as simple as (4.3)-(4.24), the Dynkin game has no “time-homogeneous” Nash equilibrium—a Nash equilibrium as a tuple of two stopping regions , one for each player. Indeed, if such a Nash equilibrium existed, it would be a sharp inter-personal equilibrium under Definition 2.4. If a Nash equilibrium in fact exists, it must be of a more complicated form (which is also suggested by the constructions in [19, 27]).
5 Application: Negotiation with Diverse Impatience
In this section, we apply our theoretic results in Section 4 to a two-player real options valuation problem. The vast literature on real options, see e.g. [30, 12, 40] among many others, focuses on a single firm’s corporate decision making, particularly the optimal timing of a project’s initiation. In contrast to this, we will study two firms’ joint decision making on their cooperation to initiate a project together, embedding real options valuation in a nonzero-sum Dynkin game.
Consider two firms who would like to cooperate to initiate a new project, such as entering a new market or developing a new product. Each firm has a proprietary skill/technology, so that only when they cooperate can the project be successfully carried out. Once the project is initiated, it will generate a fixed total revenue . The cost of initiation , on the other hand, evolves stochastically and is modeled by a discrete-time binomial structure as follows: There exist and such that takes values in
(5.1) |
and satisfies
Assume additionally that is a submartingale, which corresponds to the condition . That is, the cost has a tendency to increase over time, which incentivizes the two firms to strike a deal of cooperation sooner than later.
In negotiating such a deal, each firm, leveraging on its proprietary skill/technology, insists on taking a fixed (risk-free) larger share
of the total revenue , while demanding the other firm to take the smaller share
of revenue and additionally incur the stochastic (risky) cost . Each firm either waits until the other gives in and takes the larger payoff , or gives in to the other and takes the smaller payoff , where denotes the firm’s (random) time to give in. This can be formulated in our Dynkin game framework as
If the two firms happen to give in at the same time, they realize that both of them cannot endure any delay of a deal, and will quickly agree on a deal that is more mutually beneficial. This corresponds to the requirement , . In addition, we model the time preferences of the firms using the hyperbolic discount function, i.e. for ,
where is a constant that represents the level of impatience of Firm .
To facilitate the investigation of inter-personal equilibria between the two firms, we introduce a random walk defined on some probability space such that
Consider
and define, for each ,
(5.2) |
where denotes the expectation under conditioned on . Note that , , can be computed explicitly. For instance,
(5.3) |
Lemma 5.1.
For , , where
(5.4) |
Proof.
Observe from (2.1) and (2.2) that
Hence, the one-player stopping analysis in [24, Section 5] applies to our current setting. The same arguments therein (particularly [24, Proposition 5.5]) show that , with given as in (5.4), is Player ’s unique optimal intra-personal equilibrium w.r.t. , i.e. . As belongs to by Theorem 3.1, it must coincide with . ∎
With the aid of Lemma 5.1, we will show that the alternating iterative procedure (4.2) always leads to a sharp inter-personal equilibrium. Let us divide our investigation into two cases, depending on the impatience levels of the two firms: (Proposition 5.1) and (Proposition 5.2).
Proposition 5.1.
Suppose . Then, the alternating iterative procedure (4.2) terminates after one iteration, and gives a sharp inter-personal equilibrium. That is,
Proof.
Note from (5.3) that implies . This in turn yields , in view of (5.4). Following (4.2), we have and , where the last equality is due to Lemma 5.1. Now, in view of Corollary 3.1, Lemma 5.1, and Proposition 3.1,
(5.5) |
As , the above two relations entail . This, together with , shows that , thanks to Lemma 4.1. ∎
Remark 5.1.
For the case , by starting with and switching the roles of and in (4.2), we may repeat the same arguments in the above proof to show that and satisfy , which then yields .
Proposition 5.2.
Proof.
Note from (5.3) and (5.4) that implies . If (which happens when is sufficiently small), the same arguments in the proof of Proposition 5.1 apply, and we end up with Case 1. In the rest of the proof, we assume . Following (4.2), and , where the last equality is due to Lemma 5.1. If , (4.2) terminates with and . Hence, by Lemma 4.1. Alternatively, if but for some , following the argument in Remark 5.1 gives .
It remains to deal with the situation where and for all . By using Corollary 3.1, Lemma 5.1, and Proposition 3.1 as in (5.5), we get
(5.6) |
As and , the above two inequalities imply for some nonempty . With being a submartingale, the same argument in the second half of the proof of [24, Lemma 5.1] can be repeated here, showing that the set has to be connected, i.e. for some with . Now, by the same argument in [24, Lemma 5.3], needs to be large or equal to , otherwise would not belong to (which would contradict Proposition 3.1). We then conclude and thus . Next, since and , we must have for some nonempty . Again, by the same argument in the second half of the proof of [24, Lemma 5.1], is connected. Also, we must have , otherwise (which would contradict Proposition 3.1). Thus, for some with . As is nondecreasing, is nonincreasing, and (by Proposition 3.1), there exist nonincreasing sequences and of positive reals with such that
(5.7) |
Now, if there exists such that or , we get and , so that by Lemma 4.1. If there exists no such , the iterative procedure (4.2) continues indefinitely with and , leading to , which belongs to by Remark 5.1. ∎
Finally, let us explore the more extreme situation where one firm is highly impatient while the other is highly patient.
Corollary 5.1.
If is sufficiently large and is sufficiently small, the alternating iterative procedure (4.2) yields .
Proof.
Take . With fixed, in view of (5.3) and (5.1), there exists small enough such that for , we have and the interval
(5.8) |
Define and take . Note that , where the second inequality is due to . With , we deduce from (5.3), (5.4), and (5.8) that and the interval
(5.9) |
Following (4.2), and , where the last equality is due to Lemma 5.1. By (5.9), must belong to . Recall from (5.6) that
(5.10) |
For any , observe that
(5.11) |
where the second inequality follows from , the third inequality is due to and (recall (5.2)), and the last inequality stems from , , and the definition of in (5.4). This shows that . Then, in view of (3.4)-(3.5), . In particular, as it must contain . If for some , Case 2 of Proposition 5.2 immediately gives , as desired. Hence, we assume for all in the rest of the proof.
As argued below (5.6), for some with . We claim that . First, observe from (5.4) that . Consider the function
As for all , we have for all . Recalling , we obtain for all . By direct calculation, this is equivalent to
(5.12) |
Assume to the contrary that . Then,
where the inequality follows from and (5.12). This indicates , a contradiction to (by Proposition 3.1). With established, we conclude . For any , the same calculation as in (5.11) (with replaced by ) yields . By the same arguments below (5.11), this yields ; particularly, we have . As by construction, we deduce from , (5.10), and , that . By following the same arguments as above, we can show recursively that and for all . By Case 3 of Proposition 5.2, . ∎
Propositions 5.1 and 5.2, along with Corollary 5.1, admit interesting economic interpretations. Intuitively, a firm can demonstrate a strong determination not to give in, so as to coerce the other firm into giving in in the negotiation. Whether this strategy will work depends on the impatience levels of the two firms. For the case where Firm 1 is less impatient than Firm 2 (i.e. ), Propositions 5.1 shows that when Firm 1 insists that it will never give in (i.e. ), Firm 2 indeed gives in by taking the stopping policy , and is already a sharp inter-personal equilibrium. For the case where Firm 1 is more impatient than Firm 2 (i.e. ), Propositions 5.2 shows that the negotiation becomes more complicated and Firm 1’s coercion does not necessarily work. In particular, if Firm 1 is sufficiently impatient and Firm 2 is sufficiently patient, Corollary 5.1 shows that Firm 1’s coercion must fail, and fail in a drastic way—the coercer becomes coerced. While Firm 1 started with , trying to coerce Firm 2 into giving in, the alternating game-theoretic reasoning (4.2) eventually lead to the sharp inter-personal equilibrium . That is, it is Firm 1 who ultimately gives in by taking the stopping policy , while Firm 2 in the end decides not to stop at all.
References
- [1] N. Akakpo and C. Lacour, Inhomogeneous and anisotropic conditional density estimation from dependent data, Electron. J. Stat., 5 (2011), pp. 1618–1653.
- [2] E. Bayraktar and S. Yao, On the robust Dynkin game, Ann. Appl. Probab., 27 (2017), pp. 1702–1755.
- [3] E. Bayraktar, J. Zhang, and Z. Zhou, Time consistent stopping for the mean-standard deviation problem—the discrete time case, SIAM J. Financial Math., 10 (2019), pp. 667–697.
- [4] , Equilibrium concepts for time-inconsistent stopping problems in continuous time, Mathematical Finance, 31 (2021), pp. 508–530.
- [5] J.-M. Bismut, Sur un problème de Dynkin, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 39 (1977), pp. 31–53.
- [6] T. Björk, M. Khapko, and A. Murgoci, On time-inconsistent stochastic control in continuous time, Finance and Stochastics, 21 (2017), pp. 331–360.
- [7] T. Björk, A. Murgoci, and X. Y. Zhou, Mean-variance portfolio optimization with state-dependent risk aversion, Math. Finance, 24 (2014), pp. 1–24.
- [8] S. Christensen and K. Lindensjö, On finding equilibrium stopping times for time-inconsistent Markovian problems, SIAM J. Control Optim., 56 (2018), pp. 4228–4255.
- [9] , On time-inconsistent stopping problems and mixed strategy stopping times, Stochastic Process. Appl., 130 (2020), pp. 2886–2917.
- [10] J. Cvitanić and I. Karatzas, Backward stochastic differential equations with reflection and Dynkin games, Ann. Probab., 24 (1996), pp. 2024–2056.
- [11] T. De Angelis, G. Ferrari, and J. Moriarty, Nash equilibria of threshold type for two-player nonzero-sum games of stopping, Ann. Appl. Probab., 28 (2018), pp. 112–147.
- [12] A. Dixit and R. Pindyck, Investment under uncertainty, Princeton University Press, Princeton, NJ, (1994).
- [13] E. B. Dynkin, Game variant of a problem on optimal stopping, Soviet Math. Dokl., 10 (1969), pp. 270–274.
- [14] S. Ebert, W. Wei, and X. Y. Zhou, Weighted discounting—on group diversity, time-inconsistency, and consequences for investment, Journal of Economic Theory, 189 (2020), p. 105089.
- [15] I. Ekeland and A. Lazrak, Being serious about non-commitment: subgame perfect equilibrium in continuous time, tech. rep., University of British Columbia, 2006. Available at http://arxiv.org/abs/math/0604264.
- [16] I. Ekeland, O. Mbodji, and T. A. Pirvu, Time-consistent portfolio management, SIAM J. Financial Math., 3 (2012), pp. 1–32.
- [17] I. Ekeland and T. A. Pirvu, Investment and consumption without commitment, Math. Financ. Econ., 2 (2008), pp. 57–86.
- [18] E. Z. Ferenstein, On randomized stopping games, in Advances in dynamic games, vol. 7 of Ann. Internat. Soc. Dynam. Games, Birkhäuser Boston, Boston, MA, 2005, pp. 223–233.
- [19] S. Hamadène and J. Zhang, The continuous time nonzero-sum Dynkin game problem and application in game options, SIAM J. Control Optim., 48 (2009/10), pp. 3659–3669.
- [20] Y.-J. Huang and A. Nguyen-Huu, Time-consistent stopping under decreasing impatience, Finance and Stochastics, 22 (2018), pp. 69–95.
- [21] Y.-J. Huang, A. Nguyen-Huu, and X. Y. Zhou, General stopping behaviors of naïve and noncommitted sophisticated agents, with application to probability distortion, Mathematical Finance, 30 (2020), pp. 310–340.
- [22] Y.-J. Huang and Z. Wang, Optimal equilibria for multi-dimensional time-inconsistent stopping problems, SIAM J. Control Optim., 59 (2021), pp. 1705–1729.
- [23] Y.-J. Huang and X. Yu, Optimal stopping under model ambiguity: a time-consistent equilibrium approach, Mathematical Finance, 31 (2021), pp. 979–1012.
- [24] Y.-J. Huang and Z. Zhou, The optimal equilibrium for time-inconsistent stopping problems—the discrete-time case, SIAM J. Control Optim., 57 (2019), pp. 590–609.
- [25] , Optimal equilibria for time-inconsistent stopping problems in continuous time, Mathematical Finance, 30 (2020), pp. 1103–1134.
- [26] M. R. Kosorok, Introduction to empirical processes and semiparametric inference, Springer Series in Statistics, Springer, New York, 2008.
- [27] R. Laraki and E. Solan, Equilibrium in two-player non-zero-sum Dynkin games in continuous time, Stochastics, 85 (2013), pp. 997–1014.
- [28] J.-P. Lepeltier and M. A. Maingueneau, Le jeu de Dynkin en théorie générale sans l’hypothèse de Mokobodski, Stochastics, 13 (1984), pp. 25–44.
- [29] G. Loewenstein and R. Thaler, Anomalies: Intertemporal choice, Journal of Economic Perspectives, 3 (1989), pp. 181–193.
- [30] R. McDonald and D. Siegel, The value of waiting to invest, The Quarterly Journal of Economics, 101 (1986), pp. 707–727.
- [31] H. Morimoto, Dynkin games and martingale methods, Stochastics, 13 (1984), pp. 213–228.
- [32] , Non-zero-sum discrete parameter stochastic games with stopping times, Probab. Theory Relat. Fields, 72 (1986), pp. 155–160.
- [33] H. Nagai, Non-zero-sum stopping games of symmetric Markov processes, Probab. Theory Related Fields, 75 (1987), pp. 487–497.
- [34] J. Neveu, Discrete-parameter martingales, North-Holland Publishing Co., Amsterdam-Oxford; American Elsevier Publishing Co., Inc., New York, revised ed., 1975.
- [35] Y. Ohtsubo, A nonzero-sum extension of Dynkin’s stopping problem, Math. Oper. Res., 12 (1987), pp. 277–296.
- [36] R. A. Pollak, Consistent planning, The Review of Economic Studies, (1968), pp. 201–208.
- [37] D. Rosenberg, E. Solan, and N. Vieille, Stopping games with randomized strategies, Probab. Theory Related Fields, 119 (2001), pp. 433–451.
- [38] M. Sart, Estimation of the transition density of a Markov chain, Ann. Inst. Henri Poincaré Probab. Stat., 50 (2014), pp. 1028–1068.
- [39] E. Shmaya and E. Solan, Two-player nonzero-sum stopping games in discrete time, Ann. Probab., 32 (2004), pp. 2733–2764.
- [40] J. E. Smith and R. F. Nau, Valuing risky projects: option pricing theory and decision analysis, Management Science, 41(5) (1995), pp. 795–816.
- [41] R. H. Strotz, Myopia and inconsistency in dynamic utility maximization, The Review of Economic Studies, 23 (1955), pp. 165–180.
- [42] R. Thaler, Some empirical evidence on dynamic inconsistency, Econ. Lett., 8 (1981), pp. 201–207.
- [43] N. Touzi and N. Vieille, Continuous-time Dynkin games with mixed strategies, SIAM J. Control Optim., 41 (2002), pp. 1073–1088.
- [44] M. Yasuda, On a randomized strategy in Neveu’s stopping problem, Stochastic Process. Appl., 21 (1985), pp. 159–166.
- [45] J. Yong, Time-inconsistent optimal control problems and the equilibrium HJB equation, Mathematical Control and Related Fields, 3 (2012), pp. 271–329.