This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Time-Inconsistent Dynkin Game: from Intra-personal to Inter-personal Equilibria

Yu-Jui Huang University of Colorado, Department of Applied Mathematics, Boulder, CO 80309-0526, USA, email: [email protected]. Partially supported by National Science Foundation (DMS-1715439) and the University of Colorado (11003573).    Zhou Zhou University of Sydney, School of Mathematics and Statistics, NSW 2006, Australia, email: [email protected].
Abstract

This paper studies a nonzero-sum Dynkin game in discrete time under non-exponential discounting. For both players, there are two levels of game-theoretic reasoning intertwined. First, each player looks for an intra-personal equilibrium among her current and future selves, so as to resolve time inconsistency triggered by non-exponential discounting. Next, given the other player’s chosen stopping policy, each player selects a best response among her intra-personal equilibria. A resulting inter-personal equilibrium is then a Nash equilibrium between the two players, each of whom employs her best intra-personal equilibrium with respect to the other player’s stopping policy. Under appropriate conditions, we show that an inter-personal equilibrium exists, based on concrete iterative procedures along with Zorn’s lemma. To illustrate our theoretic results, we investigate a two-player real options valuation problem: two firms negotiate a deal of cooperation to initiate a project jointly. By deriving inter-personal equilibria explicitly, we find that coercive power in negotiation depends crucially on the impatience levels of the two firms.

MSC (2020): 60J20, 91A05, 91A07, 03E75.

Keywords: Dynkin games, time inconsistency, non-exponential discounting, intra-personal equilibrium, inter-personal equilibrium, alternating fixed-point iterations.

1 Introduction

In dynamic optimization, time inconsistency is the self-conflicting situation where the same agent at different times (i.e. the current and future selves) cannot agree on a “dynamically optimal strategy” that is good for the entire planning horizon. A long-standing approach to resolving time inconsistency is Strotz’ consistent planning [41]: An agent should take her future selves’ disobedience into account, so as to find a strategy that none of her future selves will have an incentive to deviate from. Essentially, such a strategy is an intra-personal equilibrium—an equilibrium established internally within the agent, among her current and future selves.

The investigation of intra-personal equilibria, particularly their mathematical definitions and characterizations, has been the main focus of the literature on time inconsistency. This includes the classical framework in discrete time that relies on a straightforward backward sequential optimization detailed in [36], as well as the more recent development in continuous time that employs the spike variation technique introduced in Ekeland and Lazrak [15]. The latter has led to vibrant research on time-inconsistent stochastic control, including [17, 16, 7, 6, 45], among many others. Lately, marked progress has been made for time-inconsistent optimal stopping, along two different paths. One is to extend the spike variation technique from stochastic control to optimal stopping, as carried out in [14, 8, 9]. The other path is the iterative approach developed in [20, 21, 23], which circumvents spike variations via a fixed-point perspective. Let us also mention the recent work [4] which builds a connection between different concepts of equilibria in these two paths.

A natural question follows all the developments: How does the intra-personal reconciliation within one single agent integrate into the interaction among multiple (non-cooperative) agents? Intuitively, there should be two levels of game-theoretic reasoning—the inner level where each agent looks for time-consistent strategies her future selves will actually follow, and the outer level where each agent chooses her best strategy (among time-consistent ones) in response to other agents’ strategies. A resulting inter-personal equilibrium should then be a Nash equilibrium among all the agents, each of whom is restricted to choose time-consistent strategies (i.e. her intra-personal equilibria). To the best of our knowledge, such inter-personal equilibria built from intra-personal ones have not been properly formulated and studied in the literature. The crucial question is whether and how different agents’ respective intra-personal equilibria can ultimately forge an inter-personal equilibrium among all agents. This paper will shed new light on this through a time-inconsistent Dynkin game.

A Dynkin game involves two players interacting through their stopping strategies. The zero-sum version of the game, introduced in Dynkin [13], has been substantially studied along various directions, including [13, 34] (discrete time), [5, 28, 31] (continuous time), [44, 37, 43] (randomized strategies), [10] (non-Markovian settings), and [2] (model uncertainty), among others. Many of the studies not only show that a Nash equilibrium between the two players exists, but provide concrete constructions. By contrast, the nonzero-sum version of the game has received relatively less attention; see the early investigations [32, 35, 33] and more recent ones [19, 27, 11], among others. Remarkably, all the developments above assume that the two players optimize their expected payoff/cost under exponential discounting (including the case of no discounting), which readily ensures time consistency.

In this paper, we consider a nonzero-sum Dynkin game where the state process XX is a discrete-time strong Markov process taking values in a Polish space 𝕏\mathbb{X}. Each player chooses to stop at the first entrance time of some Borel subset SS of 𝕏\mathbb{X}, which will be called a stopping policy. The Dynkin game is in general time-inconsistent, as we allow the two players to take general discount functions that satisfy only a log sub-additive condition, i.e. (2.4) below. This condition captures decreasing impatience, a widely observed feature of empirical discounting, and readily covers numerous non-exponential discount functions in behavioral economics; see the discussion below (2.4).

As time inconsistency arises under non-exponential discounting, each player, when given the other’s chosen stopping policy TT, needs to find accordingly an intra-personal equilibrium SS among her current and future selves. Following the fixed-point approach in Huang and Nguyen-Huu [20], we define each player’s intra-personal equilibrium as a fixed point of an operator, which encodes the aforementioned inner level of game-theoretic reasoning (Definition 2.1). To achieve an inter-personal equilibrium, a minimal requirement is that each player should attain her inner-level equilibrium simultaneously—that is, the following situation should materialize: SS is Player 1’s intra-personal equilibrium given Player 2’s stopping policy TT, and TT is Player 2’s intra-personal equilibrium given Player 1’s stopping policy SS. In this case, we say (S,T)(S,T) is a soft inter-personal equilibrium (Definition 2.2). To further refine this “soft” definition, we note that each player, when following the Nash equilibrium idea, should not be satisfied with an arbitrary intra-personal equilibrium, but aim at the best one under an appropriate optimality criterion. Reminiscent of the “optimal equilibrium” concept proposed in Huang and Zhou [24, 25], we say that an intra-personal equilibrium is optimal if it generates larger values than any other intra-personal equilibrium, for not only the current but all future selves (Definition 2.3). This immediately brings about a stronger notion for an inter-personal equilibrium: (S,T)(S,T) is said to be a sharp inter-personal equilibrium if each player attains her best inner-level equilibrium simultaneously, i.e. SS is Player 1’s optimal intra-personal equilibrium given Player 2’s stopping policy TT, and TT is Player 2’s optimal intra-personal equilibrium given Player 1’s stopping policy SS (Definition 2.4).

The focus of this paper is to establish the existence of inter-personal equilibria, soft and sharp, through concrete iterative procedures. First, we develop for each player an individual iterative procedure, i.e. (3.4) below, that directly leads to her optimal intra-personal equilibrium (Theorem 3.1). This procedure can be viewed as an improvement to those in [20, 21], which lead to intra-personal equilibria but not necessarily the optimal ones. Next, we devise an alternating iterative procedure, i.e. (4.2) below, in which the two players take turns to perform the individual iterative procedure repetitively. In each iteration, one player, given the other’s stopping policy determined in the previous iteration, performs the individual iterative procedure and then updates her policy to the optimal intra-personal equilibrium obtained; see Section 2.1 for details. Under appropriate conditions, this alternating iterative procedure converges and the limit, denoted by (S,T)(S_{\infty},T_{\infty}), is guaranteed a soft inter-personal equilibrium (Theorem 4.1). While it is tempting to believe that (S,T)(S_{\infty},T_{\infty}) is in fact sharp, in view of its structure revealed in Theorem 4.1, this is generally not the case: We demonstrate explicitly that (S,T)(S_{\infty},T_{\infty}) is sharp in Example 4.1, but only soft in the slightly modified Example 4.2. In other words, the general existence of sharp inter-personal equilibria is still in question. Assuming additionally that the state process XX has transition densities, we are able to upgrade the construction of (S,T)(S_{\infty},T_{\infty}) and apply Zorn’s lemma appropriately, which yields the desired result that a sharp inter-personal equilibrium must exist (Theorem 4.2).

It is worth noting that Theorems 4.1 and 4.2 hinge on a supermartingale condition, i.e. (3.13) below. As shown in Section 4.3, when the supermartingale condition fails, there may exist no inter-personal equilibrium, either soft or sharp; see Proposition 4.1 particularly. Let us point out that similar supermartingale conditions were also imposed in some studies on classical (time-consistent) nonzero-sum Dynkin games (e.g. [32, 35]) to facilitate the existence of a Nash equilibrium.

As an application, we study the negotiation between two firms (or countries) in Section 5. Suppose that each firm intends to coerce the other into unfavorable terms so as to obtain a larger payoff. A firm either waits until the other gives in and takes the larger payoff (i.e. its coercion works), or gives in to the other and accepts the unfavorable terms (i.e. its coercion fails). By computing explicitly the sharp inter-personal equilibrium between the two firms (Propositions 5.1 and 5.2, Corollary 5.1), we find that whether coercion in negotiation works depends on the impatience levels of the two firms: If a firm is less impatient than the other, its coercion always works; on the other hand, if a firm is significantly more impatient than the other, its coercion must fail. See particularly the discussion below Corollary 5.1 for details.

The rest of the paper is organized as follows. Section 2 introduces the model setup, formulates intra- and inter-personal equilibria, and collects preliminary results. Section 3 develops an individual iterative procedure that directly yields a player’s optimal intra-personal equilibrium. Under a supermartingale condition, the monotonicity of this procedure is also established. Section 4 devises an alternating iterative procedure, from which we prove the existence of soft and sharp inter-personal equilibria. Examples are presented to demonstrate the alternating iterative procedure and the necessity of the supermartingale condition. Finally, Section 5 applies our analysis to the negotiation between two firms, relating coercive power to impatience level.

2 The Model and Preliminaries

Let +:={0,1,2,}\mathbb{Z}_{+}:=\{0,1,2,...\} and consider a time-homogeneous strong Markov process X=(Xt)t+X=(X_{t})_{t\in\mathbb{Z}_{+}} taking values in a Polish space 𝕏\mathbb{X}. We denote by \mathcal{B} the Borel σ\sigma-algebra of 𝕏\mathbb{X}. On the path space Ω\Omega, the set of all functions mapping +\mathbb{Z}_{+} to 𝕏\mathbb{X}, let (t)t+(\mathcal{F}_{t})_{t\in\mathbb{Z}_{+}} be the filtration generated by XX and 𝒯\mathcal{T} be the set of all (t)t+(\mathcal{F}_{t})_{t\in\mathbb{Z}_{+}}-stopping times. In addition, we consider :=t+t\mathcal{F}_{\infty}:=\bigcup_{t\in\mathbb{Z}_{+}}\mathcal{F}_{t}. For any x𝕏x\in\mathbb{X}, we denote by XxX^{x} the process XX with initial value X0=xX_{0}=x, by x\mathbb{P}_{x} the probability measure on (Ω,)(\Omega,\mathcal{F}_{\infty}) generated by XxX^{x} (i.e. the law of (Xtx)t+(X^{x}_{t})_{t\in\mathbb{Z}_{+}}), and by 𝔼x\mathbb{E}_{x} the expectation under x\mathbb{P}_{x}.

Consider a nonzero-sum Dynkin game where the two players maximize their respective expected payoffs, determined jointly by their stopping strategies. Specifically, for i{1,2}i\in\{1,2\}, given the stopping time σ𝒯\sigma\in\mathcal{T} chosen by the other player, Player ii at the current state x𝕏x\in\mathbb{X} selects a stopping time τ𝒯\tau\in\mathcal{T} to maximize her expected discounted payoff

Ji(x,τ,σ):=𝔼x[Fi(τ,σ)],J_{i}(x,\tau,\sigma):=\mathbb{E}_{x}[F_{i}(\tau,\sigma)], (2.1)

where

Fi(τ,σ):=δi(τ)fi(Xτ)1{τ<σ}+δi(σ)gi(Xσ)1{τ>σ}+δi(τ)hi(Xτ)1{τ=σ},τ,σ𝒯.F_{i}(\tau,\sigma):=\delta_{i}(\tau)f_{i}(X_{\tau})1_{\{\tau<\sigma\}}+\delta_{i}(\sigma)g_{i}(X_{\sigma})1_{\{\tau>\sigma\}}+\delta_{i}(\tau)h_{i}(X_{\tau})1_{\{\tau=\sigma\}},\quad\forall\tau,\sigma\in\mathcal{T}. (2.2)

Here, δi:+[0,1]\delta_{i}:\mathbb{Z}_{+}\to[0,1] is Player ii’s discount function, assumed to be strictly decreasing with δ(0)=1\delta(0)=1, and fi,gi,hi:𝕏+f_{i},g_{i},h_{i}:\mathbb{X}\to\mathbb{R}_{+} are Player ii’s payoff functions, assumed to be Borel measurable. Note that we allow τ,σ𝒯\tau,\sigma\in\mathcal{T} to take the value ++\infty. For any ω{τ=σ=+}\omega\in\{\tau=\sigma=+\infty\}, we simply define Fi(τ,σ)(ω):=lim suptδi(t)hi(Xt(ω))F_{i}(\tau,\sigma)(\omega):=\limsup_{t\to\infty}\delta_{i}(t)h_{i}(X_{t}(\omega)). To ensure that Ji(x,τ,σ)J_{i}(x,\tau,\sigma) in (2.1) is well-defined, we will impose throughout the paper

𝔼x[suptδi(t)(fi(Xt)+gi(Xt)+hi(Xt))]<x𝕏.\mathbb{E}_{x}\bigg{[}\sup_{t\in\mathbb{N}}\delta_{i}(t)\big{(}f_{i}(X_{t})+g_{i}(X_{t})+h_{i}(X_{t})\big{)}\bigg{]}<\infty\quad\forall x\in\mathbb{X}. (2.3)

As mentioned in Introduction, the vast literature on Dynkin games mostly assumes exponential discounting, i.e. δi(t)=eβit\delta_{i}(t)=e^{-\beta_{i}t} for some βi>0\beta_{i}>0. Empirical studies (e.g. [42, 29]), on the other hand, have found that individuals do not normally discount exponentially. In this paper, the only standing assumption on δi\delta_{i}, i{1,2}i\in\{1,2\}, is

δi(s)δi(t)δi(s+t),s,t+.\delta_{i}(s)\delta_{i}(t)\leq\delta_{i}(s+t),\quad\forall s,t\in\mathbb{Z}_{+}. (2.4)

This particularly captures decreasing impatience, a widely observed feature of empirical discounting. Numerous non-exponential discount functions in behavioral economics, such as hyperbolic, generalized hyperbolic, and pseudo-exponential discount functions, readily satisfy (2.4); see the discussion below [20, Assumption 3.12] for details.

In a one-player stopping problem, it is well-understood that non-exponential discounting induces time inconsistency: An optimal stopping strategy derived at the current state x𝕏x\in\mathbb{X} may no longer be optimal at a subsequent state yxy\neq x. In other words, the current and future selves cannot agree on a “dynamically optimal stopping strategy” that is good for the entire planning horizon; see e.g. [20, Section 2.2] for an explicit demonstration. Strotz’ consistent planning [41] is a long-standing approach to resolving time inconsistency: Knowing that her future selves may overturn her current plan, an agent selects the best present action taking the future disobedience as a constraint; the resulting strategy is a (subgame perfect) Nash equilibrium from which no future self has an incentive to deviate.

In our Dynkin game, thanks to the time-homogeneous Markovian setup, we assume that each player decides to stop or to continue depending on her current state x𝕏x\in\mathbb{X}. That is, each player stops at the first entrance time of some SS\in\mathcal{B}, defined by

ρS:=inf{t0:XtS}.\rho_{S}:=\inf\{t\geq 0:\ X_{t}\in S\}.

For convenience, we will often call SS\in\mathcal{B} a stopping policy. This corresponds to a “pure strategy” in economic terms.111See Remark 4.3 for discussions on the use of pure and randomized strategies. For i{1,2}i\in\{1,2\}, given the other player’s stopping policy TT\in\mathcal{B}, Player ii is faced with time inconsistency among her current and future selves (as explained above), and needs to find an equilibrium stopping policy at the intra-personal level. Following [25, Section 2.1] (or [20, Section 3.1]), Strotz’ consistent planning boils down to the current self’s game-theoretic reasoning: “Given that my future selves will follow the policy SS\in\mathcal{B}, what is the best policy today in response to that?” The best policy is determined by comparing the payoff of immediate stopping Ji(x,0,ρT)J_{i}(x,0,\rho_{T}) and the payoff of continuation Ji(x,ρS+,ρT)J_{i}(x,\rho^{+}_{S},\rho_{T}), where

ρS+:=inf{t>0:XtS}\rho^{+}_{S}:=\inf\{t>0:\ X_{t}\in S\}

is the first hitting time to SS. This leads to the following stopping policy

ΘiT(S):=\displaystyle\Theta_{i}^{T}(S):= {xS:Ji(x,0,ρT)Ji(x,ρS+,ρT)}{xS:Ji(x,0,ρT)>Ji(x,ρS+,ρT)}.\displaystyle\{x\in S:J_{i}(x,0,\rho_{T})\geq J_{i}(x,\rho^{+}_{S},\rho_{T})\}\cup\{x\notin S:J_{i}(x,0,\rho_{T})>J_{i}(x,\rho^{+}_{S},\rho_{T})\}\in\mathcal{B}. (2.5)

We can consider ΘiT:\Theta^{T}_{i}:\mathcal{B}\to\mathcal{B} as an improving operator for Player ii: Given the other player’s stopping policy TT\in\mathcal{B}, ΘiT\Theta^{T}_{i} improves the present policy SS\in\mathcal{B} of Player ii to ΘT(S)\Theta^{T}(S)\in\mathcal{B}.

For each player, we define an equilibrium at the intra-personal level (i.e. among the player’s current and future selves) in the same spirit as [25, Definition 2.2] and [20, Definition 3.7].

Definition 2.1.

For i{1,2}i\in\{1,2\}, SS\in\mathcal{B} is called Player ii’s intra-personal equilibrium w.r.t. (with respect to) TT\in\mathcal{B} if ΘiT(S)=S\Theta_{i}^{T}(S)=S. We denote by iT\mathcal{E}_{i}^{T} the set of all Player ii’s intra-personal equilibria w.r.t. TT\in\mathcal{B}.

Remark 2.1.

The above fixed-point definition of an intra-personal equilibrium was introduced in [20] and followed by [21, 25, 23], among others. Note that there is a slightly different formulation in [24]: If we follow [24], particularly (2.5) therein, ΘiT(S)\Theta^{T}_{i}(S) in (2.5) needs to be modified as

Θ¯iT(S):={x𝕏:Ji(x,0,ρT)Ji(x,ρS+,ρT)}.\displaystyle\bar{\Theta}_{i}^{T}(S):=\{x\in\mathbb{X}:J_{i}(x,0,\rho_{T})\geq J_{i}(x,\rho^{+}_{S},\rho_{T})\}. (2.6)

Observe from (2.5) and (2.6) that the equilibrium condition “ΘiT(S)=S\Theta_{i}^{T}(S)=S” in Definition 2.1 is slightly weaker than “Θ¯iT(S)=S\bar{\Theta}_{i}^{T}(S)=S” as in [24, Definition 2.3]. This paper uses the slightly weaker definition because it conforms more closely to the Nash equilibrium idea—one deviates to a new policy only when it is strictly better than the current one; see the explanations at the beginning of [20, p.7]. Moreover, the weaker definition facilitates the search for intra-personal equilibria, as it allows for the explicit construction in Proposition 3.1 below.

Based on Definition 2.1, we introduce the first kind of equilibria at the inter-personal level (i.e. between the two players)—the soft inter-personal equilibria.

Definition 2.2.

We say (S,T)×(S,T)\in\mathcal{B}\times\mathcal{B} is a soft inter-personal equilibrium (for the Dynkin game) if S1TS\in\mathcal{E}_{1}^{T} and T2ST\in\mathcal{E}_{2}^{S} (i.e. Θ1T(S)=S\Theta_{1}^{T}(S)=S and Θ2S(T)=T\Theta_{2}^{S}(T)=T). We denote by \mathcal{E} the set of all soft inter-personal equilibria.

Essentially, (S,T)(S,T)\in\mathcal{E} means that each player simultaneously attains an equilibrium at the intra-personal level, given the other’s stopping policy: SS is Player 11’s intra-personal equilibrium w.r.t. Player 2’s policy TT, and TT is Player 22’s intra-personal equilibrium w.r.t. Player 1’s policy SS.

As emphasized in [24, 25], Strotz’ consistent planning is a two-phase procedure: An agent first determines the strategies that she will actually follow over time (Phase I), and then chooses the best one among them (Phase II). In our Dynkin game, Phase I amounts to each player finding her intra-personal equilibria (w.r.t. the other player’s stopping policy); Phase II is then the search for an optimal intra-personal equilibrium, defined as below.

Definition 2.3.

For i{1,2}i\in\{1,2\} and TT\in\mathcal{B}, the value function associated with SiTS\in\mathcal{E}_{i}^{T} is defined by

UiT(x,S):=Ji(x,0,ρT)Ji(x,ρS+,ρT),x𝕏.U_{i}^{T}(x,S):=J_{i}(x,0,\rho_{T})\vee J_{i}(x,\rho^{+}_{S},\rho_{T}),\quad x\in\mathbb{X}.

We say SiTS\in\mathcal{E}_{i}^{T} is Player ii’s optimal intra-personal equilibrium w.r.t. TT\in\mathcal{B} if for any RiTR\in\mathcal{E}_{i}^{T},

UiT(x,S)UiT(x,R)for allx𝕏.U_{i}^{T}(x,S)\geq U_{i}^{T}(x,R)\quad\hbox{for all}\ x\in\mathbb{X}.

We denote by ^iT\widehat{\mathcal{E}}_{i}^{T} the set of all Player ii’s optimal intra-personal equilibria w.r.t. TT.

Remark 2.2.

Thanks to SiTS\in\mathcal{E}_{i}^{T}, UiT(x,S)U_{i}^{T}(x,S) defined above coincides with Ji(x,ρS,ρT)J_{i}(x,\rho_{S},\rho_{T}). Indeed, by ΘiT(S)=S\Theta_{i}^{T}(S)=S (due to SiTS\in\mathcal{E}_{i}^{T}) and (2.5), Ji(x,ρS,ρT)=Ji(x,0,ρT)Ji(x,ρS+,ρT)J_{i}(x,\rho_{S},\rho_{T})=J_{i}(x,0,\rho_{T})\geq J_{i}(x,\rho^{+}_{S},\rho_{T}) for xSx\in S and Ji(x,ρS,ρT)=Ji(x,ρS+,ρT)Ji(x,0,ρT)J_{i}(x,\rho_{S},\rho_{T})=J_{i}(x,\rho^{+}_{S},\rho_{T})\geq J_{i}(x,0,\rho_{T}) for xSx\notin S. That is, Ji(x,ρS,ρT)=Ji(x,0,ρT)Ji(x,ρS+,ρT)=UiT(x,S)J_{i}(x,\rho_{S},\rho_{T})=J_{i}(x,0,\rho_{T})\vee J_{i}(x,\rho^{+}_{S},\rho_{T})=U_{i}^{T}(x,S) for all x𝕏x\in\mathbb{X}.

Definition 2.3 follows the “optimal equilibrium” notion introduced in [24]. It is a rather strong optimality criterion, as it requires a (subgame perfect Nash) equilibrium to dominate any other equilibrium on the entire state space—a rare occurrence in game theory. Nonetheless, for the one-player optimal stopping problem under non-exponential discounting, as long as the discount function satisfies (2.4), the existence of an optimal equilibrium has been established first in discrete time [24] and then in continuous time, including [25, 22] (diffusion models) and [4] (continuous-time Markov chain models).

Based on Definition 2.3, we introduce the second kind of equilibria at the inter-personal level (i.e. between the two players)—the sharp inter-personal equilibria.

Definition 2.4.

We say (S,T)×(S,T)\in\mathcal{B}\times\mathcal{B} is a sharp inter-personal equilibrium (for the Dynkin game) if S^1TS\in\widehat{\mathcal{E}}_{1}^{T} and T^2ST\in\widehat{\mathcal{E}}_{2}^{S}. We denote by ^\widehat{\mathcal{E}} the set of all sharp inter-personal equilibria.

A sharp inter-personal equilibrium, compared with a soft one in Definition 2.2, conforms to the Nash equilibrium concept more closely. Given the other player’s policy, what a player aims at should not be an arbitrary agreement among her current and future selves (as is stipulated in Definition 2.2), but the agreement that is best-rewarding—the one that generates the largest possible value for every incarnation of herself in time. In other words, our time-inconsistent Dynkin game involves two levels of game-theoretic reasoning. Player 1 wants to find the best response to Player 2’s policy at the inter-personal level, while maintaining an agreement among her current and future selves at the intra-personal level; Player 2 does the same in response to Player 1’s policy. In the end, each player chooses an optimal intra-personal equilibrium w.r.t. the other player’s policy, leading to a sharp inter-personal equilibrium for the Dynkin game.

It is worth noting that our definition of a sharp inter-personal equilibrium covers, as a special case, the standard Nash equilibrium in a time-consistent Dynkin game.

Remark 2.3.

In the time-consistent case of exponential discounting, a Nash equilibrium for the Dynkin game is defined as a tuple of stopping times (τ^,σ^)(\hat{\tau},\hat{\sigma}) such that τ^\hat{\tau} is Player 1’s optimal stopping time given that Player 2 employs σ^\hat{\sigma}, while σ^\hat{\sigma}, at the same time, is Player 2’s optimal stopping time given that Player 1 employs τ^\hat{\tau}. In a time-homogeneous setting, let S^\hat{S} (resp. T^\hat{T}) denote the stopping region associated with τ^\hat{\tau} (resp. σ^\hat{\sigma}). In view of [20, Proposition 3.11], S^\hat{S} is readily Player 1’s intra-personal equilibrium w.r.t. T^\hat{T}. Moreover, by the argument in [22, Remark 2.12], S^\hat{S} is in fact Player 1’s optimal intra-personal equilibria w.r.t. T^\hat{T}—namely, S^^1T^\hat{S}\in\widehat{\mathcal{E}}^{\hat{T}}_{1}. By the same token, we have T^^2S^\hat{T}\in\widehat{\mathcal{E}}^{\hat{S}}_{2}. It then follows that (S^,T^)(\hat{S},\hat{T}) is a sharp inter-personal equilibrium.

That is to say, in the classical time-consistent case, a Nash equilibrium (τ^,σ^)(\hat{\tau},\hat{\sigma}) in a time-homogeneous model is automatically a sharp inter-personal equilibrium, once we re-state (τ^,σ^)(\hat{\tau},\hat{\sigma}) using their respective stopping regions.

2.1 Problem Formulation

This paper aims to establish the existence of soft and sharp inter-personal equilibria, using concrete iterative procedures. Although the existence and construction of each player’s intra-personal equilibria is well-understood (based on the one-player results [20, 24]), it is unclear whether the two players’ respective intra-personal equilibria can be coordinated properly to form an inter-personal equilibrium, either soft or sharp.

We will tackle this in two steps. First, we look into the one-player problem more closely, developing for each player an individual iterative procedure that directly brings about her optimal intra-personal equilibrium (Theorem 3.1). Next, we devise an alternating iterative procedure in which the two players take turns to perform the individual iterative procedure:

  • 1.

    With respect to Player 1’s initial policy S0S_{0}\in\mathcal{B}, Player 2 performs the individual iterative procedure to get an optimal intra-personal equilibrium T0T_{0}\in\mathcal{B}.

  • 2.

    With respect to Player 2’s policy T0T_{0}\in\mathcal{B}, Player 1 performs the individual iterative procedure to get an optimal intra-personal equilibrium S1S_{1}\in\mathcal{B}.

  • 3.

    With respect to Player 1’s policy S1S_{1}\in\mathcal{B}, Player 2 performs the individual iterative procedure to get an optimal intra-personal equilibrium T1T_{1}\in\mathcal{B}.

The hope is that this alternating iterative procedure will ultimately converge, with the limit (S,T)(S_{\infty},T_{\infty}) being a soft, or even sharp, inter-personal equilibrium. This will be investigated in detail in Section 4, with affirmative results established in Theorems 4.1 and 4.2.

2.2 Preliminaries

We collect two technical results that will be useful throughout the paper. The first one concerns the convergence of first entrance and hitting times.

Lemma 2.1.

Let (Sn)n(S_{n})_{n\in\mathbb{N}} be a monotone sequence in \mathcal{B}. For any ωΩ\omega\in\Omega, there exists NN\in\mathbb{N} such that ρSn(ω)=ρS(ω)\rho_{S_{n}}(\omega)=\rho_{S_{\infty}}(\omega) for all nNn\geq N, where

S:={nSn,if(Sn)is nondecreasing,nSn,if(Sn)is nonincreasing.S_{\infty}:=\begin{cases}\bigcup_{n\in\mathbb{N}}S_{n},\quad\hbox{if}\ (S_{n})\ \hbox{is nondecreasing},\\ \bigcap_{n\in\mathbb{N}}S_{n},\quad\hbox{if}\ (S_{n})\ \hbox{is nonincreasing}.\end{cases} (2.7)

The same result holds with ρ\rho replaced by ρ+\rho^{+}.

Proof.

Fix ωΩ\omega\in\Omega. If (Sn)(S_{n}) is nondecreasing, set t:=ρS(ω)t:=\rho_{S_{\infty}}(\omega). Without loss of generality, assume t<t<\infty. As Xt(ω)S=nSnX_{t}(\omega)\in S_{\infty}=\bigcup_{n\in\mathbb{N}}S_{n}, there exists NN\in\mathbb{N} such that Xt(ω)SnX_{t}(\omega)\in S_{n} for all nNn\geq N. Hence, ρSn(ω)t\rho_{S_{n}}(\omega)\leq t for all nNn\geq N. If there exists nNn^{*}\geq N such that ρSn(ω)<t\rho_{S_{n^{*}}}(\omega)<t, then ρS(ω)ρSn(ω)<t\rho_{S_{\infty}}(\omega)\leq\rho_{S_{n^{*}}}(\omega)<t, a contradiction. We thus conclude ρSn(ω)=t=ρS(ω)\rho_{S_{n}}(\omega)=t=\rho_{S_{\infty}}(\omega) for all nNn\geq N. On the other hand, if (Sn)(S_{n}) is nonincreasing, set t:=limnρSn(ω)t:=\lim_{n\to\infty}\rho_{S_{n}}(\omega). Without loss of generality, assume t<t<\infty. Then, there exists NN\in\mathbb{N} such that ρSn(ω)=t\rho_{S_{n}}(\omega)=t for all nNn\geq N. Hence, Xt(ω)SnX_{t}(\omega)\in S_{n} for all nNn\geq N and thus Xt(ω)S=nSnX_{t}(\omega)\in S_{\infty}=\bigcap_{n\in\mathbb{N}}S_{n}. This implies ρS(ω)t\rho_{S_{\infty}}(\omega)\leq t. Since ρS(ω)t\rho_{S_{\infty}}(\omega)\geq t by definition, we conclude ρS(ω)=t=ρSn(ω)\rho_{S_{\infty}}(\omega)=t=\rho_{S_{n}}(\omega) for all nNn\geq N. The same arguments as above hold with ρ\rho replaced by ρ+\rho^{+}. ∎

The next result states that any stopping policy containing an intra-personal equilibrium RR must be dominated by RR. This kind of result was first established for one-player stopping problems in [25, Lemma 3.1], and is now extended to a Dynkin game setting.

Lemma 2.2.

Fix i{1,2}i\in\{1,2\} and assume higih_{i}\leq g_{i}. Then, for any RR, SS\in\mathcal{B} with RSR\subseteq S and RiTR\in\mathcal{E}_{i}^{T} for some TT\in\mathcal{B},

Ji(x,ρR+,ρT)Ji(x,ρS+,ρT)for allx𝕏.J_{i}(x,\rho^{+}_{R},\rho_{T})\geq J_{i}(x,\rho^{+}_{S},\rho_{T})\quad\hbox{for all}\ x\in\mathbb{X}.
Proof.

Consider A:={ωΩ:ρS+=ρT<ρR+}A:=\{\omega\in\Omega:\rho^{+}_{S}=\rho_{T}<\rho^{+}_{R}\} and B:={ωΩ:ρS+<ρTρR+}B:=\{\omega\in\Omega:\rho^{+}_{S}<\rho_{T}\wedge\rho^{+}_{R}\}. For any x𝕏x\in\mathbb{X}, by (2.1) and (2.2),

Ji(x,ρR+,ρT)Ji(x,ρS+,ρT)=𝔼x[(1A+1B)(Fi(ρR+,ρT)Fi(ρS+,ρT)].\displaystyle J_{i}(x,\rho^{+}_{R},\rho_{T})-J_{i}(x,\rho^{+}_{S},\rho_{T})=\mathbb{E}_{x}\left[(1_{A}+1_{B})(F_{i}(\rho^{+}_{R},\rho_{T})-F_{i}(\rho^{+}_{S},\rho_{T})\right]. (2.8)

By the assumption gihig_{i}\geq h_{i},

𝔼x[1A(Fi(ρR+,ρT)Fi(ρS+,ρT))]=𝔼x[1A(δi(ρT)gi(XρT)δi(ρT)hi(XρT))]0.\mathbb{E}_{x}\left[1_{A}(F_{i}(\rho^{+}_{R},\rho_{T})-F_{i}(\rho^{+}_{S},\rho_{T}))\right]=\mathbb{E}_{x}[1_{A}(\delta_{i}(\rho_{T})g_{i}(X_{\rho_{T}})-\delta_{i}(\rho_{T})h_{i}(X_{\rho_{T}}))]\geq 0. (2.9)

On the other hand,

𝔼x[1B(Fi(ρR+,ρT)Fi(ρS+,ρT))]\displaystyle\mathbb{E}_{x}\left[1_{B}(F_{i}(\rho^{+}_{R},\rho_{T})-F_{i}(\rho^{+}_{S},\rho_{T}))\right] =𝔼x[1B(𝔼x[Fi(ρR+,ρT)|ρS+]Fi(ρS+,ρT))].\displaystyle=\mathbb{E}_{x}\left[1_{B}\left(\mathbb{E}_{x}\left[F_{i}(\rho^{+}_{R},\rho_{T})\Big{|}\mathcal{F}_{\rho^{+}_{S}}\right]-F_{i}(\rho^{+}_{S},\rho_{T})\right)\right]. (2.10)

In view of (2.2), (2.4), and the nonnegativity of fi,gif_{i},g_{i}, and hih_{i},

1B\displaystyle 1_{B}\ 𝔼x[Fi(ρR+,ρT)|ρS+]\displaystyle\mathbb{E}_{x}\left[F_{i}(\rho^{+}_{R},\rho_{T})\ \middle|\ \mathcal{F}_{\rho^{+}_{S}}\right]
1Bδi(ρS+)𝔼x[δi(ρR+ρS+)fi(XρR+)1{ρR+<ρT}+δi(ρT+ρS+)gi(XρT)1{ρR+>ρT}\displaystyle\geq 1_{B}\delta_{i}(\rho^{+}_{S})\mathbb{E}_{x}\Big{[}\delta_{i}(\rho^{+}_{R}-\rho^{+}_{S})f_{i}(X_{\rho^{+}_{R}})1_{\{\rho^{+}_{R}<\rho_{T}\}}+\delta_{i}(\rho^{+}_{T}-\rho^{+}_{S})g_{i}(X_{\rho_{T}})1_{\{\rho^{+}_{R}>\rho_{T}\}}
+δi(ρR+ρS+)hi(XρR+)1{ρR+=ρT}|ρS+]\displaystyle\hskip 216.81pt+\delta_{i}(\rho^{+}_{R}-\rho^{+}_{S})h_{i}(X_{\rho^{+}_{R}})1_{\{\rho^{+}_{R}=\rho_{T}\}}\Big{|}\ \mathcal{F}_{\rho^{+}_{S}}\Big{]}
=1Bδi(ρS+)Ji(XρS+x,ρR+,ρT),\displaystyle=1_{B}\delta_{i}(\rho^{+}_{S})J_{i}\Big{(}{X^{x}_{\rho^{+}_{S}}},\rho^{+}_{R},\rho_{T}\Big{)}, (2.11)

where the equality follows from the strong Markov property of XX. Thanks to the above inequality and the fact that 1BFi(ρS+,ρT)=1Bδi(ρS+)fi(XρS+x)=1Bδi(ρS+)Ji(XρS+x,0,ρT)1_{B}F_{i}(\rho_{S}^{+},\rho_{T})=1_{B}\delta_{i}(\rho^{+}_{S})f_{i}(X^{x}_{\rho^{+}_{S}})=1_{B}\delta_{i}(\rho^{+}_{S})J_{i}(X^{x}_{\rho^{+}_{S}},0,\rho_{T}), (2.10) implies

𝔼x[1B(Fi(ρR+,ρT)Fi(ρS+,ρT))]\displaystyle\mathbb{E}_{x}\left[1_{B}(F_{i}(\rho^{+}_{R},\rho_{T})-F_{i}(\rho^{+}_{S},\rho_{T}))\right] 𝔼x[1Bδi(ρS+)(Ji(XρS+,ρR+,ρT)Ji(XρS+,0,ρT))].\displaystyle\geq\mathbb{E}_{x}\left[1_{B}\delta_{i}(\rho^{+}_{S})\left(J_{i}(X_{\rho^{+}_{S}},\rho^{+}_{R},\rho_{T})-J_{i}(X_{\rho^{+}_{S}},0,\rho_{T})\right)\right]. (2.12)

On the set BB, we deduce from ρS+<ρR+\rho^{+}_{S}<\rho^{+}_{R} that XρS+xRX^{x}_{\rho^{+}_{S}}\notin R. Since RiTR\in\mathcal{E}_{i}^{T}, (2.5) implies Ji(XρS+,0,ρT)Ji(XρS+,ρR+,ρT)J_{i}(X_{\rho^{+}_{S}},0,\rho_{T})\leq J_{i}(X_{\rho^{+}_{S}},\rho^{+}_{R},\rho_{T}). Then, (2.12) yields 𝔼x[1B(Fi(ρR+,ρT)Fi(ρS+,ρT))]0\mathbb{E}_{x}\left[1_{B}(F_{i}(\rho^{+}_{R},\rho_{T})-F_{i}(\rho^{+}_{S},\rho_{T}))\right]\geq 0. On strength of this and (2.9), we conclude from (2.8) that Ji(x,ρR+,ρT)Ji(x,ρS+,ρT)0J_{i}(x,\rho^{+}_{R},\rho_{T})-J_{i}(x,\rho^{+}_{S},\rho_{T})\geq 0. ∎

There is an intriguing message here—a smaller intra-personal equilibrium is more rewarding. Indeed, for any R,SiTR,S\in\mathcal{E}^{T}_{i} with RSR\subseteq S, Lemma 2.2 asserts Ji(x,ρR+,ρT)Ji(x,ρS+,ρT)J_{i}(x,\rho^{+}_{R},\rho_{T})\geq J_{i}(x,\rho^{+}_{S},\rho_{T}) for all x𝕏x\in\mathbb{X}. This “ranking by size” will play a crucial role in Theorem 3.1 below, where an optimal intra-personal equilibrium is derived. Economically, this ranking simply reflects a player’s decreasing impatience, which is captured by (2.4); see the detailed discussion below [25, Corollary 3.2].

3 The One-Player Analysis

In this section, we first focus on developing an iterative procedure for each player that directly leads to her intra-personal equilibrium. Next, under a supermartingale condition, we will establish the monotonicity of this iterative procedure.

For each i{1,2}i\in\{1,2\}, we introduce, for any fixed TT\in\mathcal{B}, the operator ΦiT:\Phi_{i}^{T}:\mathcal{B}\to\mathcal{B} defined by

ΦiT(S):=S{xS:Ji(x,0,ρT)>ViT(x,S)},\Phi_{i}^{T}(S):=S\cup\{x\notin S:J_{i}(x,0,\rho_{T})>V_{i}^{T}(x,S)\}, (3.1)

where

ViT(x,S):=sup1τρS+𝔼x[Fi(τ,ρT)]for anyx𝕏andS.V_{i}^{T}(x,S):=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{x}[F_{i}(\tau,\rho_{T})]\quad\hbox{for any}\ x\in\mathbb{X}\ \hbox{and}\ S\in\mathcal{B}. (3.2)

We first note that ViT(x,)V_{i}^{T}(x,\cdot) converges desirably along nondecreasing sequences of stopping policies.

Lemma 3.1.

Fix i{1,2}i\in\{1,2\}. Let (Sn)n(S_{n})_{n\in\mathbb{N}} be a nondecreasing sequence in \mathcal{B}. Then, for any TT\in\mathcal{B}, ViT(x,Sn)ViT(x,S)V_{i}^{T}(x,S_{n})\downarrow V_{i}^{T}(x,S_{\infty}) for all x𝕏x\in\mathbb{X}, with S:=nSnS_{\infty}:=\bigcup_{n\in\mathbb{N}}S_{n}.

Proof.

Fix x𝕏x\in\mathbb{X}. By the definition of ViTV_{i}^{T} in (3.2), limnViT(x,Sn)ViT(x,S)\lim_{n\to\infty}V_{i}^{T}(x,S_{n})\geq V_{i}^{T}(x,S_{\infty}). To show the converse inequality, let τn𝒯\tau_{n}\in\mathcal{T} with 1τnρSn+1\leq\tau_{n}\leq\rho^{+}_{S_{n}} be a 1n\frac{1}{n}-optimizer of ViT(x,Sn)V_{i}^{T}(x,S_{n}), for each nn\in\mathbb{N}. Then,

ViT(x,Sn)ViT(x,S)\displaystyle V_{i}^{T}(x,S_{n})-V_{i}^{T}(x,S_{\infty}) 𝔼x[Fi(τn,ρT)]+1/n𝔼x[Fi(τnρS+,ρT)]\displaystyle\leq\mathbb{E}_{x}[F_{i}(\tau_{n},\rho_{T})]+{1}/{n}-\mathbb{E}_{x}[F_{i}(\tau_{n}\wedge\rho^{+}_{S_{\infty}},\rho_{T})]
=𝔼x[Fi(τn,ρT)Fi(τnρS+,ρT)]+1/n.\displaystyle=\mathbb{E}_{x}\left[F_{i}(\tau_{n},\rho_{T})-F_{i}(\tau_{n}\wedge\rho^{+}_{S_{\infty}},\rho_{T})\right]+1/n. (3.3)

For each ωΩ\omega\in\Omega, Lemma 2.1 asserts the existence of NN\in\mathbb{N} such that τn(ω)ρSn+(ω)=ρS+(ω)\tau_{n}(\omega)\leq\rho^{+}_{S_{n}}(\omega)=\rho^{+}_{S_{\infty}}(\omega) for all nNn\geq N. In particular, τn(ω)=(τnρS+)(ω)\tau_{n}(\omega)=(\tau_{n}\wedge\rho^{+}_{S_{\infty}})(\omega) for all nNn\geq N. This, together with (2.3), indicates that we may apply the dominated convergence theorem to show that the expectation in (3.3) converges to 0. Hence, we conclude from (3.3) that limnVT(x,Sn)VT(x,S)0\lim_{n\to\infty}V^{T}(x,S_{n})-V^{T}(x,S_{\infty})\leq 0. ∎

For any fixed TT\in\mathcal{B}, we perform an iterative procedure by starting with the empty set and applying the operator ΦiT\Phi_{i}^{T} repetitively. This will give an intra-personal equilibrium for Player ii.

Proposition 3.1.

Fix i{1,2}i\in\{1,2\} and assume higih_{i}\leq g_{i}. For any TT\in\mathcal{B}, let (Sin(T))n\left(S_{i}^{n}(T)\right)_{n\in\mathbb{N}} be a nondecreasing sequence in \mathcal{B} defined by

Si1(T):=ΦiT()andSin(T):=ΦiT(Sn1(T))forn2,S_{i}^{1}(T):=\Phi_{i}^{T}(\emptyset)\quad\hbox{and}\quad S_{i}^{n}(T):=\Phi_{i}^{T}\left(S^{n-1}(T)\right)\ \ \hbox{for}\ n\geq 2, (3.4)

where ΦiT:\Phi_{i}^{T}:\mathcal{B}\to\mathcal{B} is defined as in (3.1). Then,

Γi(T):=nSin(T)iT.\Gamma_{i}(T):=\bigcup_{n\in\mathbb{N}}S_{i}^{n}(T)\in\mathcal{E}_{i}^{T}. (3.5)

Moreover, TSin(T)=T\cap S_{i}^{n}(T)=\emptyset for all nn\in\mathbb{N}; in particular, TΓi(T)=T\cap\Gamma_{i}(T)=\emptyset.

Proof.

Fix xΓi(T)x\in\Gamma_{i}(T). If xSi1(T)=ΦiT()x\in S_{i}^{1}(T)=\Phi_{i}^{T}(\emptyset), by (3.1) we have Ji(x,0,ρT)>ViT(x,)ViT(x,Γi(T))Ji(x,ρΓi(T)+,ρT)J_{i}(x,0,\rho_{T})>V_{i}^{T}(x,\emptyset)\geq V_{i}^{T}(x,\Gamma_{i}(T))\geq J_{i}(x,\rho^{+}_{\Gamma_{i}(T)},\rho_{T}), which implies xΘiT(Γi(T))x\in\Theta_{i}^{T}(\Gamma_{i}(T)). If xSi1(T)x\notin S_{i}^{1}(T), since (Sin(T))n(S_{i}^{n}(T))_{n\in\mathbb{N}} is by definition nondecreasing, there must exist nn\in\mathbb{N} such that xSin+1(T)Sin(T)x\in S_{i}^{n+1}(T)\setminus S_{i}^{n}(T). Thanks again to (3.1), this yields Ji(x,0,ρT)>ViT(x,Sin(T))ViT(x,Γi(T))Ji(x,ρΓi(T)+,ρT)J_{i}(x,0,\rho_{T})>V_{i}^{T}(x,S_{i}^{n}(T))\geq V_{i}^{T}(x,\Gamma_{i}(T))\geq J_{i}(x,\rho^{+}_{\Gamma_{i}(T)},\rho_{T}), which implies xΘiT(Γi(T))x\in\Theta_{i}^{T}(\Gamma_{i}(T)). Hence, we conclude Γi(T)ΘiT(Γi(T))\Gamma_{i}(T)\subseteq\Theta_{i}^{T}(\Gamma_{i}(T)).

It remains to show the converse inclusion. Fix xΓi(T)x\notin\Gamma_{i}(T). We claim that xΘiT(Γi(T))x\notin\Theta_{i}^{T}(\Gamma_{i}(T)), i.e.

Ji(x,0,ρT)Ji(x,ρΓi(T)+,ρT).J_{i}(x,0,\rho_{T})\leq J_{i}(x,\rho^{+}_{\Gamma_{i}(T)},\rho_{T}). (3.6)

Assume to the contrary that (3.6) fails, so that

Δ:={yΓi(T):Ji(y,0,ρT)>Ji(y,ρΓi(T)+,ρT)}.\Delta:=\{y\notin\Gamma_{i}(T):J_{i}(y,0,\rho_{T})>J_{i}(y,\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\}\neq\emptyset. (3.7)

Consider

α:=supyΔ{Ji(y,0,ρT)Ji(y,ρΓi(T)+,ρT)}>0.\alpha:=\sup_{y\in\Delta}\left\{J_{i}(y,0,\rho_{T})-J_{i}(y,\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\right\}>0. (3.8)

As δi(1)<δi(0)=1\delta_{i}(1)<\delta_{i}(0)=1, we can take xΔx^{*}\in\Delta such that

Ji(x,0,ρT)Ji(x,ρΓi(T)+,ρT)>1+δi(1)2α.J_{i}(x^{*},0,\rho_{T})-J_{i}(x^{*},\rho^{+}_{\Gamma_{i}(T)},\rho_{T})>\frac{1+\delta_{i}(1)}{2}\alpha. (3.9)

Observe that we must have xTx^{*}\notin T, and thus ρT>0\rho_{T}>0 x\mathbb{P}_{x^{*}}-a.s. Indeed, if xTx^{*}\in T, then Ji(x,0,ρT)Ji(x,ρΓi(T)+,ρT)=hi(x)gi(x)0J_{i}(x^{*},0,\rho_{T})-J_{i}(x^{*},\rho^{+}_{\Gamma_{i}(T)},\rho_{T})=h_{i}(x)-g_{i}(x)\leq 0, which contradicts (3.9). Moreover, since xΓi(T)x^{*}\notin\Gamma_{i}(T) implies xSin(T)x^{*}\notin S_{i}^{n}(T) for all nn\in\mathbb{N}, we deduce from (3.1) that Ji(x,0,ρT)ViT(x,Sin(T))J_{i}(x^{*},0,\rho_{T})\leq V_{i}^{T}(x^{*},S_{i}^{n}(T)) for all nn\in\mathbb{N}. By Lemma 3.1, this implies

Ji(x,0,ρT)ViT(x,Γi(T)).J_{i}(x^{*},0,\rho_{T})\leq V_{i}^{T}(x^{*},\Gamma_{i}(T)). (3.10)

Let ρ𝒯\rho^{*}\in\mathcal{T} with 1ρρΓi(T)+1\leq\rho^{*}\leq\rho^{+}_{\Gamma_{i}(T)} be a 1δi(1)2α\frac{1-\delta_{i}(1)}{2}\alpha-optimizer of ViT(x,Γi(T))V_{i}^{T}(x^{*},\Gamma_{i}(T)). Consider the sets

A\displaystyle A :={ωΩ:ρ(ω)<ρΓi(T)+(ω),ρ(ω)ρT(ω),Xρ(ω)(Γi(T)Δ)},\displaystyle:=\{\omega\in\Omega:\rho^{*}(\omega)<\rho^{+}_{\Gamma_{i}(T)}(\omega),\ \rho^{*}(\omega)\leq\rho_{T}(\omega),\ X_{\rho^{*}}(\omega)\notin(\Gamma_{i}(T)\cup\Delta)\},
B\displaystyle B :={ωΩ:ρ(ω)<ρΓi(T)+(ω),ρ(ω)ρT(ω),Xρ(ω)ΔΓi(T)}.\displaystyle:=\{\omega\in\Omega:\rho^{*}(\omega)<\rho^{+}_{\Gamma_{i}(T)}(\omega),\ \rho^{*}(\omega)\leq\rho_{T}(\omega),\ X_{\rho^{*}}(\omega)\in\Delta\setminus\Gamma_{i}(T)\}.

By (3.9) and (3.10),

1+δi(1)2α\displaystyle\frac{1+\delta_{i}(1)}{2}\alpha <Ji(x,0,ρT)Ji(x,ρΓi(T)+,ρT)\displaystyle<J_{i}(x^{*},0,\rho_{T})-J_{i}(x^{*},\rho^{+}_{\Gamma_{i}(T)},\rho_{T})
𝔼x[Fi(ρ,ρT)Fi(ρΓi(T)+,ρT)]+1δi(1)2α\displaystyle\leq\mathbb{E}_{x^{*}}\left[F_{i}(\rho^{*},\rho_{T})-F_{i}(\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\right]+\frac{1-\delta_{i}(1)}{2}\alpha
=𝔼x[(1A+1B)(Fi(ρ,ρT)Fi(ρΓi(T)+,ρT))]+1δi(1)2α.\displaystyle=\mathbb{E}_{x^{*}}\left[(1_{A}+1_{B})(F_{i}(\rho^{*},\rho_{T})-F_{i}(\rho^{+}_{\Gamma_{i}(T)},\rho_{T}))\right]+\frac{1-\delta_{i}(1)}{2}\alpha. (3.11)

By (2.4) and the nonnegativity of fif_{i}, gig_{i}, and hih_{i}, we can argue as in (2.11) to get

𝔼x[(1A+1B)Fi(ρΓi(T)+,ρT)]\displaystyle\mathbb{E}_{x^{*}}\left[(1_{A}+1_{B})F_{i}(\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\right] =𝔼x[(1A+1B)𝔼x[Fi(ρΓi(T)+,ρT)|ρ]]\displaystyle=\mathbb{E}_{x^{*}}\left[(1_{A}+1_{B})\mathbb{E}_{x^{*}}\left[F_{i}(\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\ \middle|\ \mathcal{F}_{\rho^{*}}\right]\right]
𝔼x[(1A+1B)δi(ρ)Ji(Xρ,ρΓi(T)+,ρT)].\displaystyle\geq\mathbb{E}_{x^{*}}\left[(1_{A}+1_{B})\delta_{i}(\rho^{*})J_{i}(X_{\rho^{*}},\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\right].

Thanks to this and the fact that (1A+1B)Fi(ρ,ρT)=(1A+1B)δi(ρ)Ji(Xρ,0,ρT)(1_{A}+1_{B})F_{i}(\rho^{*},\rho_{T})=(1_{A}+1_{B})\delta_{i}(\rho^{*})J_{i}(X_{\rho^{*}},0,\rho_{T}), (3.11) yields

1+δi(1)2α\displaystyle\frac{1+\delta_{i}(1)}{2}\alpha <𝔼x[(1A+1B)δi(ρ)(Ji(Xρ,0,ρT)Ji(Xρ,ρΓi(T)+,ρT))]+1δi(1)2α\displaystyle<\mathbb{E}_{x^{*}}\left[(1_{A}+1_{B})\delta_{i}(\rho^{*})(J_{i}(X_{\rho^{*}},0,\rho_{T})-J_{i}(X_{\rho^{*}},\rho^{+}_{\Gamma_{i}(T)},\rho_{T}))\right]+\frac{1-\delta_{i}(1)}{2}\alpha
𝔼x[1Bδi(ρ)(Ji(Xρ,0,ρT)Ji(Xρ,ρΓi(T)+,ρT))]+1δi(1)2α\displaystyle\leq\mathbb{E}_{x^{*}}\left[1_{B}\delta_{i}(\rho^{*})(J_{i}(X_{\rho^{*}},0,\rho_{T})-J_{i}(X_{\rho^{*}},\rho^{+}_{\Gamma_{i}(T)},\rho_{T}))\right]+\frac{1-\delta_{i}(1)}{2}\alpha
δi(1)α+1δi(1)2α=1+δi(1)2α.\displaystyle\leq\delta_{i}(1)\alpha+\frac{1-\delta_{i}(1)}{2}\alpha=\frac{1+\delta_{i}(1)}{2}\alpha.

where the second inequality follows from XρΔX_{\rho^{*}}\notin\Delta on AA and the definition of Δ\Delta in (3.7), and the third inequality is due to XρΔX_{\rho^{*}}\in\Delta on BB, the definition of α\alpha in (3.8), and ρ1\rho^{*}\geq 1 by definition. The above inequality is clearly a contradiction, and we thus conclude that (3.6) holds. That is, we have shown that (Γi(T))c(Θ(Γi(T)))c(\Gamma_{i}(T))^{c}\subseteq(\Theta(\Gamma_{i}(T)))^{c}, or simply Γi(T)Θ(Γi(T))\Gamma_{i}(T)\supseteq\Theta(\Gamma_{i}(T)). This brings the final conclusion that Γi(T)=ΘiT(Γi(T))\Gamma_{i}(T)=\Theta_{i}^{T}(\Gamma_{i}(T)), i.e. Γi(T)iT\Gamma_{i}(T)\in\mathcal{E}_{i}^{T}.

Finally, for any xTx\in T, since ρT=0\rho_{T}=0 x\mathbb{P}_{x}-a.s.,

Ji(x,0,ρT)=hi(x)gi(x)=sup1τρR+𝔼x[Fi(τ,ρT)]=ViT(x,R)R.J_{i}(x,0,\rho_{T})=h_{i}(x)\leq g_{i}(x)=\sup_{1\leq\tau\leq\rho^{+}_{R}}\mathbb{E}_{x}[F_{i}(\tau,\rho_{T})]=V_{i}^{T}(x,R)\quad\forall R\in\mathcal{B}. (3.12)

In view of (3.4), taking R=R=\emptyset in (3.12) shows that xSi1(T)x\notin S_{i}^{1}(T). By the fact xSi1(T)x\notin S_{i}^{1}(T) and taking R=Si1(T)R=S_{i}^{1}(T) in (3.12), we in turn obtain xSi2(T)x\notin S_{i}^{2}(T). Applying (3.12) recursively in the same way then gives xSin(T)x\notin S_{i}^{n}(T) for all nn\in\mathbb{N}. Hence, we conclude TSin(T)T\cap S_{i}^{n}(T) for all nn\in\mathbb{N}. ∎

Remark 3.1.

The idea of the iterative procedure (3.4) was initially inspired by [8, Section 3], while its specific construction is partially borrowed from [4, Theorem 2.2].

We will go one step further to claim that Γi(T)\Gamma_{i}(T) in (3.5) is in fact an optimal intra-personal equilibrium for Player ii. To this end, we need the following auxiliary result: Being included in an intra-personal equilibrium is an invariant relation under the operator ΦiT\Phi^{T}_{i}.

Lemma 3.2.

Fix i{1,2}i\in\{1,2\}, TT\in\mathcal{B}, and RiTR\in\mathcal{E}_{i}^{T}. For any SS\in\mathcal{B} with SRS\subseteq R, ΦiT(S)R\Phi^{T}_{i}(S)\subseteq R.

Proof.

By contradiction, suppose that there exists xΦiT(S)Rx\in\Phi_{i}^{T}(S)\setminus R for some SS\in\mathcal{B} with SRS\subseteq R. With xΦiT(S)x\in\Phi_{i}^{T}(S) but xSx\notin S, (3.1) gives Ji(x,0,ρT)>ViT(x,S)ViT(x,R)Ji(x,ρR+,ρT),J_{i}(x,0,\rho_{T})>V_{i}^{T}(x,S)\geq V_{i}^{T}(x,R)\geq J_{i}(x,\rho^{+}_{R},\rho_{T}), where the second and third inequalities follow directly from the definition of ViTV_{i}^{T} in (3.2). As xRx\notin R, this shows that xΘiT(R)x\in\Theta_{i}^{T}(R), from which we conclude ΘiT(R)R\Theta_{i}^{T}(R)\neq R. This contradicts RiTR\in\mathcal{E}_{i}^{T}. ∎

Now, we are ready to present the main result of this section.

Theorem 3.1.

Fix i{1,2}i\in\{1,2\} and assume higih_{i}\leq g_{i}. For any TT\in\mathcal{B}, Γi(T)\Gamma_{i}(T) defined in (3.5) belongs to ^iT\widehat{\mathcal{E}}_{i}^{T}.

Proof.

By Proposition 3.1, Γi(T)iT\Gamma_{i}(T)\in\mathcal{E}_{i}^{T}. Recall (Sin(T))n(S_{i}^{n}(T))_{n\in\mathbb{N}} defined in (3.4). For any RiTR\in\mathcal{E}_{i}^{T}, Lemma 3.2 directly implies Si1(T)=ΦiT()RS_{i}^{1}(T)=\Phi_{i}^{T}(\emptyset)\subseteq R. By applying Lemma 3.2 recursively, we get Sin(T)=ΦiT(Sin1(T))RS_{i}^{n}(T)=\Phi_{i}^{T}(S_{i}^{n-1}(T))\subseteq R for all n2n\geq 2. Hence, Γi(T)=nSin(T)R\Gamma_{i}(T)=\bigcup_{n\in\mathbb{N}}S_{i}^{n}(T)\subseteq R. This, together with Lemma 2.2, gives Ji(x,ρΓi(T)+,ρT)Ji(x,ρR+,ρT)J_{i}(x,\rho^{+}_{\Gamma_{i}(T)},\rho_{T})\geq J_{i}(x,\rho^{+}_{R},\rho_{T}), and thus UiT(x,Γi(T))UiT(x,R)U_{i}^{T}(x,\Gamma_{i}(T))\geq U_{i}^{T}(x,R), for all x𝕏x\in\mathbb{X}. As RiTR\in\mathcal{E}_{i}^{T} is arbitrarily chosen, we conclude Γi(T)^iT\Gamma_{i}(T)\in\widehat{\mathcal{E}}_{i}^{T}. ∎

3.1 Monotonicity with respect to TT\in\mathcal{B}

So far, we have fixed TT\in\mathcal{B} (the other player’s stopping policy) and constructed a corresponding optimal intra-personal equilibrium Γi(T)\Gamma_{i}(T) in (3.5). By viewing TT\in\mathcal{B} as a variable, we will show that the map TΓi(T)T\mapsto\Gamma_{i}(T) is monotone under appropriate conditions.

Lemma 3.3.

Fix i{1,2}i\in\{1,2\}. Assume fihigif_{i}\leq h_{i}\leq g_{i} and that

(δi(t)gi(Xtx))t0is a supermartingale for all x𝕏.(\delta_{i}(t)g_{i}(X_{t}^{x}))_{t\geq 0}\ \ \hbox{is a supermartingale for all $x\in\mathbb{X}$}. (3.13)

Then, for any T,RT,R\in\mathcal{B} with TRT\subseteq R,

Ji(x,τ,ρT)Ji(x,τ,ρR)x𝕏andτ𝒯.J_{i}(x,\tau,\rho_{T})\leq J_{i}(x,\tau,\rho_{R})\quad\forall x\in\mathbb{X}\ \hbox{and}\ \tau\in\mathcal{T}. (3.14)

Hence, ViT(x,S)ViR(x,S)V_{i}^{T}(x,S)\leq V_{i}^{R}(x,S) for all x𝕏x\in\mathbb{X} and SS\in\mathcal{B}. Moreover, we have

ΦiT(S)ΦiR(S)S,SwithSS.\Phi_{i}^{T}(S)\supseteq\Phi_{i}^{R}(S^{\prime})\quad\forall S,S^{\prime}\in\mathcal{B}\ \hbox{with}\ S\supseteq S^{\prime}. (3.15)
Proof.

Given x𝕏x\in\mathbb{X} and τ𝒯\tau\in\mathcal{T}, consider A:={ωΩ:τ=ρR<ρT}A:=\{\omega\in\Omega:\tau=\rho_{R}<\rho_{T}\} and B:={ωΩ:ρR<τρT}.B:=\{\omega\in\Omega:\rho_{R}<\tau\wedge\rho_{T}\}. Observe that

𝔼x[1B(Fi(τ,ρT)Fi(τ,ρR))]\displaystyle\mathbb{E}_{x}\left[1_{B}(F_{i}(\tau,\rho_{T})-F_{i}(\tau,\rho_{R}))\right] =𝔼x[1B(Fi(τ,ρT)δi(ρR)gi(XρR))]\displaystyle=\mathbb{E}_{x}\left[1_{B}(F_{i}(\tau,\rho_{T})-\delta_{i}(\rho_{R})g_{i}(X_{\rho_{R}}))\right]
𝔼x[1B(δi(τρT)gi(XτρT)δi(ρR)gi(XρR))]\displaystyle\leq\mathbb{E}_{x}\left[1_{B}(\delta_{i}(\tau\wedge\rho_{T})g_{i}(X_{\tau\wedge\rho_{T}})-\delta_{i}(\rho_{R})g_{i}(X_{\rho_{R}}))\right]
=𝔼x[1B(𝔼[δi(τρT)gi(XτρT)|ρR]δi(ρR)gi(XρR))]0,\displaystyle=\mathbb{E}_{x}\left[1_{B}\left(\mathbb{E}\left[\delta_{i}(\tau\wedge\rho_{T})g_{i}(X_{\tau\wedge\rho_{T}})\Big{|}\mathcal{F}_{\rho_{R}}\right]-\delta_{i}(\rho_{R})g_{i}(X_{\rho_{R}})\right)\right]\leq 0,

where the first inequality is due to fihigif_{i}\leq h_{i}\leq g_{i} and the last inequality follows from (δi(t)gi(Xt))t0(\delta_{i}(t)g_{i}(X_{t}))_{t\geq 0} being a supermartingale. By the above inequality and

𝔼x[1A(Fi(τ,ρT)Fi(τ,ρR))]=𝔼x[1A(δi(τ)fi(Xτ)δi(τ)hi(Xτ))]0,\mathbb{E}_{x}\left[1_{A}(F_{i}(\tau,\rho_{T})-F_{i}(\tau,\rho_{R}))\right]=\mathbb{E}_{x}\left[1_{A}(\delta_{i}(\tau)f_{i}(X_{\tau})-\delta_{i}(\tau)h_{i}(X_{\tau}))\right]\leq 0,

thanks to fihif_{i}\leq h_{i}, we conclude Ji(x,τ,ρT)Ji(x,τ,ρR)=𝔼x[(1A+1B)(Fi(τ,ρT)Fi(τ,ρR))]0J_{i}(x,\tau,\rho_{T})-J_{i}(x,\tau,\rho_{R})=\mathbb{E}_{x}\left[(1_{A}+1_{B})(F_{i}(\tau,\rho_{T})-F_{i}(\tau,\rho_{R}))\right]\leq 0.

Next, fix S,SS,S^{\prime}\in\mathcal{B} with SSS\supseteq S^{\prime}. For any xΦiR(S)x\in\Phi_{i}^{R}(S^{\prime}), if xSx\in S, then xΦiT(S)x\in\Phi_{i}^{T}(S) by definition. Hence, we assume xSx\notin S in the following. With xΦR(S)Sx\in\Phi^{R}(S^{\prime})\setminus S^{\prime}, (3.1) gives

Ji(x,0,ρR)>ViR(x,S)ViR(x,S)ViT(x,S),J_{i}(x,0,\rho_{R})>V_{i}^{R}(x,S^{\prime})\geq V_{i}^{R}(x,S)\geq V_{i}^{T}(x,S), (3.16)

where the last inequality follows from (3.14) and (3.2). Note that we must have xRx\notin R. Indeed, if xRx\in R, then ρR=0\rho_{R}=0 x\mathbb{P}_{x}-a.s. and thus

Ji(x,0,ρR)=hi(x)gi(x)=sup1τρS+𝔼x[Fi(τ,ρR)]=ViR(x,S),J_{i}(x,0,\rho_{R})=h_{i}(x)\leq g_{i}(x)=\sup_{1\leq\tau\leq\rho^{+}_{S^{\prime}}}\mathbb{E}_{x}[F_{i}(\tau,\rho_{R})]=V_{i}^{R}(x,S^{\prime}),

which contradicts the first inequality in (3.16). With xRx\notin R and thus ρTρR>0\rho_{T}\geq\rho_{R}>0 x\mathbb{P}_{x}-a.s.,

Ji(x,0,ρT)=fi(x)=Ji(x,0,ρR)>ViT(x,S),J_{i}(x,0,\rho_{T})=f_{i}(x)=J_{i}(x,0,\rho_{R})>V_{i}^{T}(x,S),

where the last inequality follows from (3.16). This, together with xSx\notin S, yields xΦiT(S)x\in\Phi_{i}^{T}(S). We therefore conclude ΦiR(S)ΦiT(S)\Phi_{i}^{R}(S^{\prime})\subseteq\Phi_{i}^{T}(S). ∎

Corollary 3.1.

Fix i{1,2}i\in\{1,2\}. Assume fihigif_{i}\leq h_{i}\leq g_{i} and (3.13). Then, for any T,RT,R\in\mathcal{B} with TRT\subseteq R, we have Γi(T)Γi(R)\Gamma_{i}(T)\supseteq\Gamma_{i}(R), with Γi()\Gamma_{i}(\cdot) defined as in (3.5).

Proof.

In view of (3.5), Γi(T)=nSin(T)\Gamma_{i}(T)=\bigcup_{n\in\mathbb{N}}S_{i}^{n}(T) and Γi(R)=nSin(R)\Gamma_{i}(R)=\bigcup_{n\in\mathbb{N}}S_{i}^{n}(R), with (Sin(T))n(S_{i}^{n}(T))_{n\in\mathbb{N}} and (Sin(R))n(S_{i}^{n}(R))_{n\in\mathbb{N}} defined as in (3.4). Hence, it suffices to show Sin(T)Sin(R)S_{i}^{n}(T)\supseteq S_{i}^{n}(R) for all nn\in\mathbb{N}. By (3.15), Si1(T)=ΦiT()ΦiR()=Si1(R)S^{1}_{i}(T)=\Phi_{i}^{T}(\emptyset)\supseteq\Phi_{i}^{R}(\emptyset)=S^{1}_{i}(R). Using this and (3.15) again, we get Si2(T)=ΦiT(Si1(T))ΦiR(Si1(R))=Si2(R)S^{2}_{i}(T)=\Phi_{i}^{T}(S_{i}^{1}(T))\supseteq\Phi_{i}^{R}(S^{1}_{i}(R))=S_{i}^{2}(R). Applying (3.15) recursively in the same way then yields Sin(T)Sin(R)S^{n}_{i}(T)\supseteq S^{n}_{i}(R) for all nn\in\mathbb{N}. ∎

The monotonicity of TΓi(T)T\mapsto\Gamma_{i}(T), i{1,2}i\in\{1,2\}, will play a crucial role in Theorem 4.1 below, contributing to the convergence of an alternating iterative procedure performed jointly by Players 1 and 2.

4 The Existence of Inter-Personal Equilibria

In this section, we will design an alternating iterative procedure, to be performed jointly by the two players. As shown in Theorem 4.1 below, this procedure converges to a soft inter-personal equilibrium that is almost sharp. By a probabilistic modification of this iterative procedure and an appropriate use of Zorn’s lemma, we establish the existence of sharp inter-personal equilibria in Theorem 4.2 below. Explicit examples will be presented to illustrate this iterative procedure and the necessity of a supermartingale condition.

First, we observe that Theorem 3.1 already provides a sufficient condition for the existence of a sharp inter-personal equilibrium.

Lemma 4.1.

For each i{1,2}i\in\{1,2\}, assume higih_{i}\leq g_{i}. If (S,T)×(S,T)\in\mathcal{B}\times\mathcal{B} satisfies

Γ1(T)=SandΓ2(S)=T,\Gamma_{1}(T)=S\quad\hbox{and}\quad\Gamma_{2}(S)=T, (4.1)

then (S,T)^(S,T)\in\widehat{\mathcal{E}}.

Proof.

By Theorem 3.1, S=Γ1(T)^1TS=\Gamma_{1}(T)\in\widehat{\mathcal{E}}_{1}^{T} and T=Γ2(S)^2ST=\Gamma_{2}(S)\in\widehat{\mathcal{E}}_{2}^{S}, so that (S,T)^(S,T)\in\widehat{\mathcal{E}}. ∎

4.1 Construction of Soft Inter-Personal Equilibira

In order to achieve (4.1), we let the two players take turns to perform the individual iterative procedure (3.4). As the next result shows, such alternating iterations do converge, and the limit is guaranteed a soft inter-personal equilibrium.

Theorem 4.1.

For each i{1,2}i\in\{1,2\}, assume fihigif_{i}\leq h_{i}\leq g_{i} and (3.13). Let (Sn,Tn)(S_{n},T_{n}) be a sequence in ×\mathcal{B}\times\mathcal{B} defined by S0:=S_{0}:=\emptyset and

Tn:=Γ2(Sn)andSn+1:=Γ1(Tn),n{0}.T_{n}:=\Gamma_{2}(S_{n})\quad\text{and}\quad S_{n+1}:=\Gamma_{1}(T_{n}),\quad\forall n\in\mathbb{N}\cup\{0\}. (4.2)

Then, (Sn)(S_{n}) is nondecreasing and (Tn)(T_{n}) is nonincreasing. By taking S:=nSnS_{\infty}:=\bigcup_{n}S_{n} and T:=nTnT_{\infty}:=\bigcap_{n}T_{n}, we have (S,T)(S_{\infty},T_{\infty})\in\mathcal{E} with

Γ1(T)=SandΓ2(S)T.\Gamma_{1}(T_{\infty})=S_{\infty}\quad\hbox{and}\quad\Gamma_{2}(S_{\infty})\subseteq T_{\infty}. (4.3)
Proof.

As S0=S1S_{0}=\emptyset\subseteq S_{1}, applying Corollary 3.1 for Player 2 implies T0=Γ2(S0)Γ2(S1)=T1T_{0}=\Gamma_{2}(S_{0})\supseteq\Gamma_{2}(S_{1})=T_{1}. With T0T1T_{0}\supseteq T_{1}, applying Corollary 3.1 for Player 1 implies S1=Γ1(T0)Γ1(T1)=S2S_{1}=\Gamma_{1}(T_{0})\subseteq\Gamma_{1}(T_{1})=S_{2}. Again, by S1S2S_{1}\subseteq S_{2}, applying Corollary 3.1 for Player 2 gives T1=Γ2(S1)Γ2(S2)=T2T_{1}=\Gamma_{2}(S_{1})\supseteq\Gamma_{2}(S_{2})=T_{2}. Repeating this procedure for Players 1 and 2 recursively, we have (Sn)(S_{n}) nondecreasing and (Tn)(T_{n}) is nonincreasing.

Next, let us show that Θ1T(S)=S\Theta_{1}^{T_{\infty}}(S_{\infty})=S_{\infty}. Fix xS=nSnx\in S_{\infty}=\bigcup_{n}S_{n}. There exists NN\in\mathbb{N} such that xSn+1=Γ1(Tn)x\in S_{n+1}=\Gamma_{1}(T_{n}) for all n>Nn>N. By the fact that Γ1(Tn)1Tn\Gamma_{1}(T_{n})\in\mathcal{E}_{1}^{T_{n}} (thanks to Proposition 3.1), J1(x,0,ρTn)J1(x,ρSn+1+,ρTn)J_{1}(x,0,\rho_{T_{n}})\geq J_{1}(x,\rho^{+}_{S_{n+1}},\rho_{T_{n}}) for all nNn\geq N. As nn\to\infty, (2.3) allows us to use the dominated convergence theorem, so that we may conclude from Lemma 2.1 that

J1(x,0,ρT)J1(x,ρS+,ρT),J_{1}(x,0,\rho_{T_{\infty}})\geq J_{1}(x,\rho^{+}_{S_{\infty}},\rho_{T_{\infty}}), (4.4)

which implies xΘ1T(S)x\in\Theta_{1}^{T_{\infty}}(S_{\infty}). Hence, SΘ1T(S)S_{\infty}\subseteq\Theta_{1}^{T_{\infty}}(S_{\infty}). On the other hand, for any xS=nSnx\notin S_{\infty}=\bigcup_{n}S_{n}, xSn+1=Γ1(Tn)x\notin S_{n+1}=\Gamma_{1}(T_{n}) for all nn\in\mathbb{N}. Thanks again to the fact that Γ1(Tn)1Tn\Gamma_{1}(T_{n})\in\mathcal{E}_{1}^{T_{n}}, xΓ1(Tn)x\notin\Gamma_{1}(T_{n}) indicates J1(x,0,ρTn)J1(x,ρSn+1+,ρTn)J_{1}(x,0,\rho_{T_{n}})\leq J_{1}(x,\rho^{+}_{S_{n+1}},\rho_{T_{n}}) for all nn\in\mathbb{N}. As nn\to\infty, we can argue as in (4.4) to get J1(x,0,ρT)J1(x,ρS+,T)J_{1}(x,0,\rho_{T_{\infty}})\leq J_{1}(x,\rho^{+}_{S_{\infty}},T_{\infty}), which implies xΘ1T(S)x\notin\Theta_{1}^{T_{\infty}}(S_{\infty}). Hence, (S)c(Θ1T(S))c(S_{\infty})^{c}\subseteq\big{(}\Theta_{1}^{T_{\infty}}(S_{\infty})\big{)}^{c}, or Θ1T(S)S\Theta_{1}^{T_{\infty}}(S_{\infty})\subseteq S_{\infty}. We thus conclude Θ1T(S)=S\Theta_{1}^{T_{\infty}}(S_{\infty})=S_{\infty}, i.e. S1TS_{\infty}\in\mathcal{E}_{1}^{T_{\infty}}. Similar arguments as above yield Θ2S(T)=T\Theta_{2}^{S_{\infty}}(T_{\infty})=T_{\infty}, i.e. T2ST_{\infty}\in\mathcal{E}_{2}^{S_{\infty}}. This readily shows that (S,T)(S_{\infty},T_{\infty})\in\mathcal{E}.

As SnSS_{n}\subseteq S_{\infty} by construction for all nn\in\mathbb{N}, Corollary 3.1 implies Γ2(S)Γ2(Sn)=Tn\Gamma_{2}(S_{\infty})\subseteq\Gamma_{2}(S_{n})=T_{n} for all nn\in\mathbb{N}, which in turn gives Γ2(S)nTn=T\Gamma_{2}(S_{\infty})\subseteq\bigcap_{n}T_{n}=T_{\infty}. Similarly, as TnTT_{n}\supseteq T_{\infty} by construction for all nn\in\mathbb{N}, Corollary 3.1 implies Γ1(T)Γ1(Tn)=Sn+1\Gamma_{1}(T_{\infty})\supseteq\Gamma_{1}(T_{n})=S_{n+1} for all nn\in\mathbb{N}, which in turn gives Γ1(T)nSn+1=S\Gamma_{1}(T_{\infty})\supseteq\bigcup_{n\in\mathbb{N}}S_{n+1}=S_{\infty}. Finally, recall S1TS_{\infty}\in\mathcal{E}_{1}^{T_{\infty}}. This, together with Lemma 3.2, implies Γ1(T)S\Gamma_{1}(T_{\infty})\subseteq S_{\infty}. We then conclude Γ1(T)=S\Gamma_{1}(T_{\infty})=S_{\infty}. ∎

The next example illustrates the alternating iterative procedure (4.2) explicitly.

Example 4.1.

Let 𝕏\mathbb{X} contain countably many states, i.e. 𝕏={x0,x1,x2,}\mathbb{X}=\{x_{0},x_{1},x_{2},\dotso\}, and assume

xn+1(X1=xn)=1,forn=0,1,2,x0(X1=x0)=1εandx0(X1=x1)=ε,for someε[0,1).\begin{split}&\mathbb{P}_{x_{n+1}}(X_{1}=x_{n})=1,\quad\hbox{for}\ n=0,1,2\dotso,\\ \mathbb{P}_{x_{0}}(X_{1}=x_{0})&=1-\varepsilon\quad\text{and}\quad\mathbb{P}_{x_{0}}(X_{1}=x_{1})=\varepsilon,\quad\hbox{for some}\ \varepsilon\in[0,1).\end{split} (4.5)

Take M>1M>1 such that

δ2(2)<1/M<δ2(1).\delta_{2}(2)<1/M<\delta_{2}(1). (4.6)

Additionally, take L>1L>1 and consider the following payoff functions for the two players

f1(xn)=1andg1(xn)=Lforn=0,1,2,,\displaystyle f_{1}(x_{n})=1\quad\text{and}\quad g_{1}(x_{n})=L\quad\hbox{for}\ n=0,1,2,\dotso, (4.7)
f2(x0)=0,f2(xn)=1forn=1,2,,andg2(xn)=Mforn=0,1,2,\displaystyle f_{2}(x_{0})=0,\quad f_{2}(x_{n})=1\ \hbox{for}\ n=1,2,\dotso,\quad\text{and}\quad g_{2}(x_{n})=M\ \hbox{for}\ n=0,1,2\dotso, (4.8)

while h1h_{1} (resp. h2h_{2}) is allowed to be any function such that f1h1g1f_{1}\leq h_{1}\leq g_{1} (resp. f2h2g2f_{2}\leq h_{2}\leq g_{2}) on 𝕏\mathbb{X}.

For ε[0,1)\varepsilon\in[0,1) small enough, we claim that the alternating iterative procedure (4.2) gives rise to

S0\displaystyle S_{0} =,\displaystyle=\emptyset, T0\displaystyle T_{0} ={x1,x2,},\displaystyle=\{x_{1},x_{2},\dotso\},
S1\displaystyle S_{1} ={x0},\displaystyle=\{x_{0}\}, T1\displaystyle T_{1} ={x2,x3,},\displaystyle=\{x_{2},x_{3},\dotso\},
S2\displaystyle S_{2} ={x0,x1},\displaystyle=\{x_{0},x_{1}\}, T2\displaystyle T_{2} ={x3,x4,},\displaystyle=\{x_{3},x_{4},\dotso\}, (4.9)
\displaystyle\vdots \displaystyle\vdots
Sn\displaystyle S_{n} ={x0,x1,,xn1},\displaystyle=\{x_{0},x_{1},\dotso,x_{n-1}\}, Tn\displaystyle T_{n} ={xn+1,xn+2,}.\displaystyle=\{x_{n+1},x_{n+2},\dotso\}.

First, starting with S0=S_{0}=\emptyset, we deduce from (4.8) that for any S=2𝕏S\in\mathcal{B}=2^{\mathbb{X}},

V2S0(xn,S)=sup1τρS+𝔼xn[F2(τ,ρS0)]{0=f2(xn)=J2(xn,0,ρS0),forn=0,<1=f2(xn)=J2(xn,0,ρS0),forn=1,2,V^{S_{0}}_{2}(x_{n},S)=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{x_{n}}[F_{2}(\tau,\rho_{S_{0}})]\begin{cases}\geq 0=f_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{0}}),\ &\hbox{for}\ n=0,\\ <1=f_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{0}}),\ &\hbox{for}\ n=1,2,\dotso\end{cases} (4.10)

This implies Φ2S0()={x1,x2,}\Phi^{S_{0}}_{2}(\emptyset)=\{x_{1},x_{2},...\} and Φ2S0({x1,x2,})={x1,x2,}\Phi^{S_{0}}_{2}(\{x_{1},x_{2},...\})=\{x_{1},x_{2},...\}, so that T0:=Γ2(S0)={x1,x2,}T_{0}:=\Gamma_{2}(S_{0})=\{x_{1},x_{2},...\}. Next, thanks to (4.5) and (4.7), for any S2𝕏S\in 2^{\mathbb{X}},

V1T0(xn,S)\displaystyle V^{T_{0}}_{1}(x_{n},S) =sup1τρS+𝔼xn[F1(τ,ρT0)]\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{x_{n}}[F_{1}(\tau,\rho_{T_{0}})]
{(1ε)δ1(1)+εL(1+(1ε)+(1ε)2+)=(1ε)δ1(1)+εL1+ε<1=f1(xn)=J1(xn,0,ρT0),forn=0,=g1(xn)h1(xn)=J1(xn,0,ρT0),forn=1,2,\displaystyle\begin{cases}\leq(1-\varepsilon)\delta_{1}(1)+\varepsilon L\left(1+(1-\varepsilon)+(1-\varepsilon)^{2}+\dotso\right)\\ \hskip 72.26999pt=(1-\varepsilon)\delta_{1}(1)+\frac{\varepsilon L}{1+\varepsilon}<1=f_{1}(x_{n})=J_{1}(x_{n},0,\rho_{T_{0}}),\ &\hbox{for}\ n=0,\\ =g_{1}(x_{n})\geq h_{1}(x_{n})=J_{1}(x_{n},0,\rho_{T_{0}}),\ &\hbox{for}\ n=1,2,\dotso\end{cases} (4.11)

where the inequality “(1ε)δ1(1)+εL1+ε<1(1-\varepsilon)\delta_{1}(1)+\frac{\varepsilon L}{1+\varepsilon}<1” holds as ε[0,1)\varepsilon\in[0,1) is small enough. This implies Φ1T0()={x0}\Phi^{T_{0}}_{1}(\emptyset)=\{x_{0}\} and Φ1T0({x0})={x0}\Phi^{T_{0}}_{1}(\{x_{0}\})=\{x_{0}\}, so that S1:=Γ1(T0)={x0}S_{1}:=\Gamma_{1}(T_{0})=\{x_{0}\}. Now, thanks to (4.5) and (4.8), for any S2𝕏S\in 2^{\mathbb{X}} such that x0Sx_{0}\notin S,

V2S1(xn,S)\displaystyle V^{S_{1}}_{2}(x_{n},S) =sup1τρS+𝔼xn[F2(τ,ρS1)]\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{x_{n}}[F_{2}(\tau,\rho_{S_{1}})]
{=g2(xn)h2(xn)=J2(xn,0,ρS1),forn=0,=δ2(1)g2(x0)=δ2(1)M1=f2(xn)=J2(xn,0,ρS1),forn=1,max{δ2(1),δ2(2)M}<1=f2(xn)=J2(xn,0,ρS1),forn=2,3,\displaystyle\begin{cases}=g_{2}(x_{n})\geq h_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{1}}),\ &\hbox{for}\ n=0,\\ =\delta_{2}(1)g_{2}(x_{0})=\delta_{2}(1)M\geq 1=f_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{1}}),\ &\hbox{for}\ n=1,\\ \leq\max\{\delta_{2}(1),\delta_{2}(2)M\}<1=f_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{1}}),\ &\hbox{for}\ n=2,3,\dotso\end{cases} (4.12)

where we use (4.6) in the last two lines. This implies Φ2S1()={x2,x3,}\Phi^{S_{1}}_{2}(\emptyset)=\{x_{2},x_{3},...\} and Φ2S1({x2,x3,})={x2,x3,}\Phi^{S_{1}}_{2}(\{x_{2},x_{3},...\})=\{x_{2},x_{3},...\}, so that T1:=Γ2(S1)={x2,x3,}T_{1}:=\Gamma_{2}(S_{1})=\{x_{2},x_{3},...\}. By similar arguments as above, we can derive SnS_{n} and TnT_{n} in (4.9) for all n2n\geq 2. By Theorem 4.1, (S,T):=(nSn,nTn)=(𝕏,)(S_{\infty},T_{\infty}):=(\bigcup_{n\in\mathbb{N}}S_{n},\bigcap_{n\in\mathbb{N}}T_{n})=(\mathbb{X},\emptyset) is a soft inter-personal equilibrium and satisfies (4.3). Observe that for any n=0,1,2,n=0,1,2,\dotso,

V2𝕏(xn,)=sup1τρ+𝔼xn[F2(τ,ρ𝕏)]=g2(xn)h2(xn)=J2(xn,0,ρ𝕏).V^{\mathbb{X}}_{2}(x_{n},\emptyset)=\sup_{1\leq\tau\leq\rho^{+}_{\emptyset}}\mathbb{E}_{x_{n}}[F_{2}(\tau,\rho_{\mathbb{X}})]=g_{2}(x_{n})\geq h_{2}(x_{n})=J_{2}(x_{n},0,\rho_{\mathbb{X}}).

It follows that Φ2𝕏()=\Phi^{\mathbb{X}}_{2}(\emptyset)=\emptyset, so that Γ2(S)=Γ2(𝕏)==T\Gamma_{2}(S_{\infty})=\Gamma_{2}(\mathbb{X})=\emptyset=T_{\infty}. That is, we have a stronger version of (4.3) where the inclusion therein is equality. Hence, by Lemma 4.1, (S,T)=(𝕏,)(S_{\infty},T_{\infty})=(\mathbb{X},\emptyset) is in fact a sharp inter-personal equilibrium.

In view of (4.3) and Lemma 4.1, the soft inter-personal equilibrium (S,T)(S_{\infty},T_{\infty}) constructed in Theorem 4.1 is nearly a sharp one. It is natural to ask whether the inclusion in (4.3) is actually equality (as in Example 4.1), so that (S,T)(S_{\infty},T_{\infty}) is sharp in general. The next example shows that this is generally not the case: the inclusion in (4.3) can be strict and (S,T)(S_{\infty},T_{\infty}) may fail to be sharp.

Example 4.2.

Let us extend the state space in Example 4.1 by including two additional states, i.e. 𝕏={x0,x1,x2,}{y,z}\mathbb{X}=\{x_{0},x_{1},x_{2},\dotso\}\cup\{y,z\}. The transition probabilities are specified as in (4.5), as well as

y(X1=xn)=pn>0withn=0pn=1andz(X1=y)=1.\mathbb{P}_{y}(X_{1}=x_{n})=p_{n}>0\ \ \text{with}\ \ \sum_{n=0}^{\infty}p_{n}=1\quad\text{and}\quad\mathbb{P}_{z}(X_{1}=y)=1. (4.13)

Take M>1M>1 such that (4.6) holds. Assume additionally that δ2:[0,)[0,1]\delta_{2}:[0,\infty)\to[0,1] satisfies δ2(1)2<δ2(2)\delta_{2}(1)^{2}<\delta_{2}(2). Take L>1L>1 and define fif_{i} and gig_{i}, i{1,2}i\in\{1,2\}, as in (4.7)-(4.8) on {x0,x1,x2,}\{x_{0},x_{1},x_{2},\dotso\}, along with

f2(y)=Mδ2(1),f2(z)(Mδ2(1)2δ2(2),Mδ2(2)),g2(y)=g2(z)=M,f_{2}(y)=M\delta_{2}(1),\quad f_{2}(z)\in\left(M\delta_{2}(1)^{2}\vee\delta_{2}(2),M\delta_{2}(2)\right),\quad g_{2}(y)=g_{2}(z)=M, (4.14)

while f1f_{1} and g1g_{1} are allowed to take arbitrary nonnegative values on {y,z}\{y,z\} as long as f1g1f_{1}\leq g_{1}. Also, h1h_{1} (resp. h2h_{2}) is allowed to be any function such that f1h1g1f_{1}\leq h_{1}\leq g_{1} (resp. f2h2g2f_{2}\leq h_{2}\leq g_{2}) on 𝕏\mathbb{X}.

For ε[0,1)\varepsilon\in[0,1) small enough, we claim that the alternating iterative procedure (4.2) gives rise to

S0\displaystyle S_{0} =,\displaystyle=\emptyset, T0\displaystyle T_{0} ={x1,x2,}{y,z},\displaystyle=\{x_{1},x_{2},\dotso\}\cup\{y,z\},
S1\displaystyle S_{1} ={x0},\displaystyle=\{x_{0}\}, T1\displaystyle T_{1} ={x2,x3,}{y,z},\displaystyle=\{x_{2},x_{3},\dotso\}\cup\{y,z\},
S2\displaystyle S_{2} ={x0,x1},\displaystyle=\{x_{0},x_{1}\}, T2\displaystyle T_{2} ={x3,x4,}{y,z},\displaystyle=\{x_{3},x_{4},\dotso\}\cup\{y,z\}, (4.15)
\displaystyle\vdots \displaystyle\vdots
Sn\displaystyle S_{n} ={x0,x1,,xn1},\displaystyle=\{x_{0},x_{1},\dotso,x_{n-1}\}, Tn\displaystyle T_{n} ={xn+1,xn+2,}{y,z}.\displaystyle=\{x_{n+1},x_{n+2},\dotso\}\cup\{y,z\}.

First, note that the relations (4.10), (4.11), and (4.12) remain true in our current setting. Now, starting with S0=S_{0}=\emptyset, we deduce from (4.13), (4.8), and (4.14) that for any S=2𝕏S\in\mathcal{B}=2^{\mathbb{X}},

V2S0(y,S)\displaystyle V^{S_{0}}_{2}(y,S) =sup1τρS+𝔼y[F2(τ,ρS0)]<δ2(1)<f2(y)=J2(y,0,ρS0),\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{y}[F_{2}(\tau,\rho_{S_{0}})]<\delta_{2}(1)<f_{2}(y)=J_{2}(y,0,\rho_{S_{0}}),
V2S0(z,S)\displaystyle V^{S_{0}}_{2}(z,S) =sup1τρS+𝔼z[F2(τ,ρS0)]max{δ2(1)f2(y),δ2(2)}\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{z}[F_{2}(\tau,\rho_{S_{0}})]\leq\max\{\delta_{2}(1)f_{2}(y),\delta_{2}(2)\}
=max{Mδ2(1)2,δ2(2)}<f2(z)=J2(z,0,ρS0).\displaystyle\hskip 112.0187pt=\max\{M\delta_{2}(1)^{2},\delta_{2}(2)\}<f_{2}(z)=J_{2}(z,0,\rho_{S_{0}}).

These two inequalities, along with (4.10), imply Φ2S0()={x1,x2,}{y,z}\Phi^{S_{0}}_{2}(\emptyset)=\{x_{1},x_{2},...\}\cup\{y,z\} and Φ2S0({x1,x2,}{y,z})={x1,x2,}{y,z}\Phi^{S_{0}}_{2}(\{x_{1},x_{2},...\}\cup\{y,z\})=\{x_{1},x_{2},...\}\cup\{y,z\}, so that T0:=Γ2(S0)={x1,x2,}{y,z}T_{0}:=\Gamma_{2}(S_{0})=\{x_{1},x_{2},...\}\cup\{y,z\}. Next, since {y,z}T0\{y,z\}\subset T_{0}, for any S2𝕏S\in 2^{\mathbb{X}}, V1T0(x,S)=sup1τρS+𝔼x[F1(τ,ρT0)]=g1(x)h1(x)=J1(x,0,ρT0)V^{T_{0}}_{1}(x,S)=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{x}[F_{1}(\tau,\rho_{T_{0}})]=g_{1}(x)\geq h_{1}(x)=J_{1}(x,0,\rho_{T_{0}}) for x{y,z}x\in\{y,z\}. This, together with (4.11), implies that as ε[0,1)\varepsilon\in[0,1) is small enough, Φ1T0()={x0}\Phi^{T_{0}}_{1}(\emptyset)=\{x_{0}\} and Φ1T0({x0})={x0}\Phi^{T_{0}}_{1}(\{x_{0}\})=\{x_{0}\}, so that S1:=Γ1(T0)={x0}S_{1}:=\Gamma_{1}(T_{0})=\{x_{0}\}. Thanks to (4.13), (4.5), (4.8), and (4.14), for any S2𝕏S\in 2^{\mathbb{X}} such that x0Sx_{0}\notin S,

V2S1(y,S)\displaystyle V^{S_{1}}_{2}(y,S) =sup1τρS+𝔼y[F2(τ,ρS1)]<δ2(1)M=f2(y)=J2(y,0,ρS1),\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{y}[F_{2}(\tau,\rho_{S_{1}})]<\delta_{2}(1)M=f_{2}(y)=J_{2}(y,0,\rho_{S_{1}}),
V2S1(z,S)\displaystyle V^{S_{1}}_{2}(z,S) =sup1τρS+𝔼z[F2(τ,ρS1)]max{δ2(1)f2(y),δ2(2)((1O(ε))+O(ε)M)}\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{S}}\mathbb{E}_{z}[F_{2}(\tau,\rho_{S_{1}})]\leq\max\left\{\delta_{2}(1)f_{2}(y),\delta_{2}(2)\big{(}(1-O(\varepsilon))+O(\varepsilon)M\big{)}\right\}
<f2(z)=J2(z,0,ρS1),\displaystyle\hskip 112.0187pt<f_{2}(z)=J_{2}(z,0,\rho_{S_{1}}),

where the last inequality holds as ε[0,1)\varepsilon\in[0,1) is small enough, thanks to (4.14). The above two inequalities, along with (4.12), imply Φ2S1()={x2,x3,}{y,z}\Phi^{S_{1}}_{2}(\emptyset)=\{x_{2},x_{3},...\}\cup\{y,z\} and Φ2S1({x2,x3,})={x2,x3,}{y,z}\Phi^{S_{1}}_{2}(\{x_{2},x_{3},...\})=\{x_{2},x_{3},...\}\cup\{y,z\}, so that T1:=Γ2(S1)={x2,x3,}{y,z}T_{1}:=\Gamma_{2}(S_{1})=\{x_{2},x_{3},...\}\cup\{y,z\}. By similar arguments as above, we can derive SnS_{n} and TnT_{n} in (4.15) for all n2n\geq 2. Hence, (S,T):=(nSn,nTn)=({x0,x1,x2,},{y,z})(S_{\infty},T_{\infty}):=(\bigcup_{n\in\mathbb{N}}S_{n},\bigcap_{n\in\mathbb{N}}T_{n})=(\{x_{0},x_{1},x_{2},\dotso\},\{y,z\}).

Now, it can be easily checked that V2S(xn,)=g2(xn)h2(xn)=J2(xn,0,ρS)V^{S_{\infty}}_{2}(x_{n},\emptyset)=g_{2}(x_{n})\geq h_{2}(x_{n})=J_{2}(x_{n},0,\rho_{S_{\infty}}) for all n=0,1,2,n=0,1,2,\dotso. Moreover, due to (4.13),

V2S(y,)\displaystyle V^{S_{\infty}}_{2}(y,\emptyset) =sup1τρ+𝔼y[F2(τ,ρS)]=δ2(1)M=f2(y)=J2(y,0,ρS),\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{\emptyset}}\mathbb{E}_{y}[F_{2}(\tau,\rho_{S_{\infty}})]=\delta_{2}(1)M=f_{2}(y)=J_{2}(y,0,\rho_{S_{\infty}}),
V2S(z,)\displaystyle V^{S_{\infty}}_{2}(z,\emptyset) =sup1τρ+𝔼z[F2(τ,ρS)]=δ2(2)M>f2(z)=J2(z,0,ρS).\displaystyle=\sup_{1\leq\tau\leq\rho^{+}_{\emptyset}}\mathbb{E}_{z}[F_{2}(\tau,\rho_{S_{\infty}})]=\delta_{2}(2)M>f_{2}(z)=J_{2}(z,0,\rho_{S_{\infty}}).

In view of (3.1), we conclude Φ2S()=\Phi_{2}^{S_{\infty}}(\emptyset)=\emptyset. By (3.5), this in turn implies Γ2(S)=T.\Gamma_{2}(S_{\infty})=\emptyset\subsetneq T_{\infty}. That is, the inclusion in (4.3) is strict, so that we can no longer conclude from Lemma 4.1 that (S,T)(S_{\infty},T_{\infty}) is a sharp inter-personal equilibrium. In fact, (S,T)(S_{\infty},T_{\infty}) is not sharp. Recall from Theorem 3.1 that =Γ2(S)^2S\emptyset=\Gamma_{2}(S_{\infty})\in\widehat{\mathcal{E}}^{S_{\infty}}_{2}. Then, it can be checked directly that T={y,z}2ST_{\infty}=\{y,z\}\in\mathcal{E}^{S_{\infty}}_{2} but T^2ST_{\infty}\notin\widehat{\mathcal{E}}^{S_{\infty}}_{2}. Specifically, TT_{\infty} is strictly dominated by \emptyset at the state zz, as (4.14) indicates

U2S(z,T)\displaystyle U^{S_{\infty}}_{2}(z,T_{\infty}) =J2(z,0,ρS)J2(z,ρT+,ρS)=f2(z)Mδ2(1)2\displaystyle=J_{2}(z,0,\rho_{S_{\infty}})\vee J_{2}(z,\rho^{+}_{T_{\infty}},\rho_{S_{\infty}})=f_{2}(z)\vee M\delta_{2}(1)^{2}
=f2(z)<Mδ2(2)=J2(z,ρ+,ρS)U2S(z,).\displaystyle=f_{2}(z)<M\delta_{2}(2)=J_{2}(z,\rho^{+}_{\emptyset},\rho_{S_{\infty}})\leq U^{S_{\infty}}_{2}(z,\emptyset).

As T^2ST_{\infty}\notin\widehat{\mathcal{E}}^{S_{\infty}}_{2}, (S,T)=({x0,x1,x2,},{y,z})(S_{\infty},T_{\infty})=(\{x_{0},x_{1},x_{2},\dotso\},\{y,z\}) is not a sharp inter-personal equilibrium.

4.2 General Existence of Sharp Inter-Personal Equilibria

In view of Example 4.2, the soft inter-personal equilibrium constructed in Theorem 4.1 may not be a sharp one. That is to say, the general existence of a sharp inter-personal equilibrium is still in question. To resolve this, we impose appropriate regularity of XX.

Assumption 4.1.

XX has transition densities (pt)t1(p_{t})_{t\geq 1} with respect to a measure μ\mu on (𝕏,)(\mathbb{X},\mathcal{B}). That is, for each t=1,2,t=1,2,..., pt:𝕏×𝕏+p_{t}:\mathbb{X}\times\mathbb{X}\to\mathbb{R}_{+} is a Borel measurable function such that

x(XtA)=Apt(x,y)μ(dy),x𝕏andA.\mathbb{P}_{x}(X_{t}\in A)=\int_{A}p_{t}(x,y)\mu(dy),\quad\forall x\in\mathbb{X}\ \hbox{and}\ A\in\mathcal{B}.
Remark 4.1.

When 𝕏\mathbb{X} is at most countable, Assumption 4.1 is trivially satisfied. When 𝕏\mathbb{X} is uncountable, the literature is focused on the case 𝕏=d\mathbb{X}=\mathbb{R}^{d} for some d1d\geq 1. In this case, many discrete-time Markov processes XX fulfill Assumption 4.1 (with μ\mu being the Lebesgue measure). This includes, particularly, XX defined by the formula Xt+1:=G(Xt,Zt)X_{t+1}:=G(X_{t},Z_{t}), t+t\in\mathbb{Z}_{+}, where GG is a Borel measurable function and ZtZ_{t} is a random variable independent of XX such that G(x,Zt)G(x,Z_{t}) admits a probability density function for all x𝕏x\in\mathbb{X}. This formula is commonly used in practical simulation of Markov processes; see e.g. [38, Section 3] and [1, Section 5].

The next result, as a direct consequence of [26, Lemma 6.5], will also play a crucial role.

Lemma 4.2.

Let μ\mu be a measure on (𝕏,)(\mathbb{X},\mathcal{B}). For any A𝕏A\subseteq\mathbb{X}, there exists a maximal Borel minorant of AA under μ\mu, defined as a set AμA^{\mu}\in\mathcal{B} with AμAA^{\mu}\subseteq A such that for any AA^{\prime}\in\mathcal{B} with AAA^{\prime}\subseteq A, μ(AAμ)=0\mu(A^{\prime}\setminus A^{\mu})=0.

Now, we are ready to present the general existence of a sharp inter-personal equilibrium.

Theorem 4.2.

Suppose Assumption 4.1 holds. For each i{1,2}i\in\{1,2\}, assume fihigif_{i}\leq h_{i}\leq g_{i} and (3.13). Then, there exists a sharp inter-personal equilibrium.

Proof.

Consider the set

A:={(S,T):Γ1(T)SandΓ2(S)T}.A:=\{(S,T)\in\mathcal{E}:\Gamma_{1}(T)\supseteq S\ \hbox{and}\ \Gamma_{2}(S)\subseteq T\}.

By Theorem 4.1, AA\neq\emptyset. Now, define a partial order on AA as follows: for any (S,T),(S,T)A(S,T),(S^{\prime},T^{\prime})\in A,

(S,T)(S,T)ifSSandTT.(S,T)\succeq(S^{\prime},T^{\prime})\quad\text{if}\ S\supseteq S^{\prime}\ \text{and}\ T\subseteq T^{\prime}. (4.16)

Step 1: Showing that every totally ordered subset of AA has an upper bound in AA.
Let (Sα,Tα)αI(S_{\alpha},T_{\alpha})_{\alpha\in I} be a subset of AA that is totally ordered. Set S0:=αISαS_{0}:=\bigcup_{\alpha\in I}S_{\alpha} and T0:=αITαT_{0}:=\bigcap_{\alpha\in I}T_{\alpha}. Recall the measure μ\mu in Assumption 4.1. By Lemma 4.2, there exists a maximal Borel minorant of T0T_{0} under μ\mu, which will be denoted by T0μT_{0}^{\mu}. For any TT\in\mathcal{B} with TT0T\subseteq T_{0}, since μ(TT0μ)=0\mu(T\setminus T_{0}^{\mu})=0, we deduce from Assumption 4.1 that x(XtTT0μ)=TT0μpt(x,y)μ(dy)=0\mathbb{P}_{x}(X_{t}\in T\setminus T_{0}^{\mu})=\int_{T\setminus T_{0}^{\mu}}p_{t}(x,y)\mu(dy)=0 for all x𝕏x\in\mathbb{X} and tt\in\mathbb{N}. It follows that

x(XtTT0μfor some t)=0x𝕏,whenever T and TT0.\mathbb{P}_{x}(X_{t}\in T\setminus T_{0}^{\mu}\ \hbox{for some $t\in\mathbb{N}$})=0\quad\forall x\in\mathbb{X},\quad\hbox{whenever $T\in\mathcal{B}$ and $T\subseteq T_{0}$}. (4.17)

On the other hand, observe that

Γ1(T)S0for any T with TT0,Γ2(S)T0for any S with SS0.\Gamma_{1}(T)\supseteq S_{0}\ \hbox{for any $T\in\mathcal{B}$ with $T\subseteq T_{0}$},\quad\Gamma_{2}(S)\subseteq T_{0}\ \hbox{for any $S\in\mathcal{B}$ with $S\supseteq S_{0}$}. (4.18)

Indeed, for any TT\in\mathcal{B} with TT0=αITαT\subseteq T_{0}=\bigcap_{\alpha\in I}T_{\alpha}, by Corollary 3.1 and the definition of AA, Γ1(T)Γ1(Tα)Sα\Gamma_{1}(T)\supseteq\Gamma_{1}(T_{\alpha})\supseteq S_{\alpha} for all αI\alpha\in I, which implies Γ1(T)S0\Gamma_{1}(T)\supseteq S_{0}. Similarly, for any SS\in\mathcal{B} with SS0=αISαS\supseteq S_{0}=\bigcup_{\alpha\in I}S_{\alpha}, by Corollary 3.1 and the definition of AA, Γ2(S)Γ2(Sα)Tα\Gamma_{2}(S)\subseteq\Gamma_{2}(S_{\alpha})\subseteq T_{\alpha} for all αI\alpha\in I, which implies Γ2(S)T0\Gamma_{2}(S)\subseteq T_{0}.

Now, define

S1:=Γ1(T0μ)S0andT1:=Γ2(S1)T0.S_{1}:=\Gamma_{1}(T_{0}^{\mu})\supseteq S_{0}\quad\hbox{and}\quad T_{1}:=\Gamma_{2}(S_{1})\subseteq T_{0}.

Note that the first inclusion follows from T0μT_{0}^{\mu}\in\mathcal{B}, T0μT0T_{0}^{\mu}\subseteq T_{0}, and (4.18). As T0μT_{0}^{\mu}\in\mathcal{B} implies S1:=Γ1(T0μ)S_{1}:=\Gamma_{1}(T_{0}^{\mu})\in\mathcal{B}, we deduce from S1S_{1}\in\mathcal{B}, S1S0S_{1}\supseteq S_{0}, and (4.18) that the second inclusion above holds. With T1T_{1}\in\mathcal{B} (thanks to S1S_{1}\in\mathcal{B}) and T1T0T_{1}\subseteq T_{0}, (4.17) gives x(XtT1T0μfor some t)=0\mathbb{P}_{x}(X_{t}\in T_{1}\setminus T_{0}^{\mu}\ \hbox{for some $t\in\mathbb{N}$})=0 for all xdx\in\mathbb{R}^{d}. This readily implies

ρT1T0μ=ρT0μx-a.s.,forxT1T0μ.\rho_{T_{1}\cup T_{0}^{\mu}}=\rho_{T_{0}^{\mu}}\quad\mathbb{P}_{x}\hbox{-a.s.},\quad\hbox{for}\ x\notin T_{1}\cup T^{\mu}_{0}. (4.19)

We claim that Γ1(T1T0μ)=Γ1(T0μ)\Gamma_{1}(T_{1}\cup T_{0}^{\mu})=\Gamma_{1}(T_{0}^{\mu}). By the definition of Γ1\Gamma_{1} in (3.5), it suffices to show that S1n(T1T0μ)=S1n(T0μ)S^{n}_{1}(T_{1}\cup T^{\mu}_{0})=S^{n}_{1}(T^{\mu}_{0}) for all nn\in\mathbb{N}. First, as T1=Γ2(S1)T_{1}=\Gamma_{2}(S_{1}), the last assertion of Proposition 3.1 implies T1S1=T_{1}\cap S_{1}=\emptyset. With S1=Γ1(T0μ)=nS1n(T0μ)S_{1}=\Gamma_{1}(T^{\mu}_{0})=\bigcup_{n\in\mathbb{N}}S_{1}^{n}(T^{\mu}_{0}), we obtain

T1S1n(T0μ)=,n.T_{1}\cap S_{1}^{n}(T^{\mu}_{0})=\emptyset,\quad\forall n\in\mathbb{N}. (4.20)

Now, for n=1n=1, (3.15) implies S11(T1T0μ)=Φ1T1T0μ()Φ1T0μ()=S11(T0μ)S_{1}^{1}(T_{1}\cup T^{\mu}_{0})=\Phi_{1}^{T_{1}\cup T^{\mu}_{0}}(\emptyset)\subseteq\Phi_{1}^{T^{\mu}_{0}}(\emptyset)=S_{1}^{1}(T^{\mu}_{0}). For any xS11(T0μ)x\in S_{1}^{1}(T^{\mu}_{0}), due to T0μS11(T0μ)=T_{0}^{\mu}\cap S_{1}^{1}(T^{\mu}_{0})=\emptyset (by Proposition 3.1) and (4.20), we must have xT1T0μx\notin T_{1}\cup T^{\mu}_{0}. Observe that

J1(x,0,ρT1T0μ)=J1(x,0,ρT0μ)>V1T0μ(x,)=V1T1T0μ(x,),J_{1}(x,0,\rho_{T_{1}\cup T^{\mu}_{0}})=J_{1}(x,0,\rho_{T^{\mu}_{0}})>V^{T^{\mu}_{0}}_{1}(x,\emptyset)=V^{T_{1}\cup T^{\mu}_{0}}_{1}(x,\emptyset), (4.21)

where the first and third equalities follow from (4.19) and the inequality stems from the definition of S11(T0μ)=Φ1T0μ()S_{1}^{1}(T^{\mu}_{0})=\Phi_{1}^{T^{\mu}_{0}}(\emptyset) in (3.1). This shows that xΦ1T1T0μ()=S11(T1T0μ)x\in\Phi_{1}^{T_{1}\cup T^{\mu}_{0}}(\emptyset)=S_{1}^{1}(T_{1}\cup T^{\mu}_{0}). Hence, we obtain S11(T0μ)S11(T1T0μ)S_{1}^{1}(T^{\mu}_{0})\subseteq S_{1}^{1}(T_{1}\cup T^{\mu}_{0}) and thus conclude S11(T1T0μ)=S11(T0μ)S_{1}^{1}(T_{1}\cup T^{\mu}_{0})=S_{1}^{1}(T^{\mu}_{0}). Suppose that S1k(T1T0μ)=S1k(T0μ)S_{1}^{k}(T_{1}\cup T^{\mu}_{0})=S_{1}^{k}(T^{\mu}_{0}) for some k1k\geq 1. By (3.15) again, S1k+1(T1T0μ)S1k+1(T0μ)S^{k+1}_{1}(T_{1}\cup T^{\mu}_{0})\subseteq S^{k+1}_{1}(T^{\mu}_{0}). Fix xS1k+1(T0μ)x\in S_{1}^{k+1}(T^{\mu}_{0}). By using Proposition 3.1 and (4.19) as above, we get xT1T0μx\notin T_{1}\cup T^{\mu}_{0}. If xS1k(T1T0μ)x\in S_{1}^{k}(T_{1}\cup T^{\mu}_{0}), then xS1k+1(T1T0μ)x\in S_{1}^{k+1}(T_{1}\cup T^{\mu}_{0}) trivially, by the definition of S1k+1(T1T0μ)S_{1}^{k+1}(T_{1}\cup T^{\mu}_{0}) in (3.4). If xS1k(T1T0μ)=S1k(T0μ)x\notin S_{1}^{k}(T_{1}\cup T^{\mu}_{0})=S_{1}^{k}(T^{\mu}_{0}), then by the definition of S1k+1(T0μ)S_{1}^{k+1}(T^{\mu}_{0}),

J1(x,0,ρT0μ)>V1T0μ(x,S1k(T0μ)).J_{1}(x,0,\rho_{T^{\mu}_{0}})>V^{T^{\mu}_{0}}_{1}(x,S_{1}^{k}(T^{\mu}_{0})).

By (4.19) and the above inequality, we may argue similarly as in (4.21) to get

J1(x,0,ρT1T0μ)=J1(x,0,ρT0μ)>V1T0μ(x,S1k(T0μ))\displaystyle J_{1}(x,0,\rho_{T_{1}\cup T^{\mu}_{0}})=J_{1}(x,0,\rho_{T^{\mu}_{0}})>V^{T^{\mu}_{0}}_{1}(x,S_{1}^{k}(T^{\mu}_{0})) =V1T0μ(x,S1k(T1T0μ))\displaystyle=V^{T^{\mu}_{0}}_{1}(x,S_{1}^{k}(T_{1}\cup T^{\mu}_{0}))
=V1T1T0μ(x,S1k(T1T0μ)),\displaystyle=V^{T_{1}\cup T^{\mu}_{0}}_{1}(x,S_{1}^{k}(T_{1}\cup T^{\mu}_{0})),

where the second equality is due to Sk(T0μ)=Sk(T1T0μ)S^{k}(T^{\mu}_{0})=S^{k}(T_{1}\cup T^{\mu}_{0}). It follows that xSk+1(T1T0μ)x\in S^{k+1}(T_{1}\cup T_{0}^{\mu}). Hence, we obtain Sk+1(T0μ)Sk+1(T1T0μ)S^{k+1}(T^{\mu}_{0})\subseteq S^{k+1}(T_{1}\cup T^{\mu}_{0}) and thus conclude Sk+1(T1T0μ)=Sk+1(T0μ)S^{k+1}(T_{1}\cup T^{\mu}_{0})=S^{k+1}(T^{\mu}_{0}). By induction, we have established Sn(T0μ)=Sn(T1T0μ)S^{n}(T^{\mu}_{0})=S^{n}(T_{1}\cup T^{\mu}_{0}) for all nn\in\mathbb{N}, as desired.

By Corollary 3.1 and Γ1(T1T0μ)=Γ1(T0μ)\Gamma_{1}(T_{1}\cup T_{0}^{\mu})=\Gamma_{1}(T_{0}^{\mu}),

S2:=Γ1(T1)Γ1(T1T0μ)=Γ1(T0μ)=S1andT2:=Γ2(S2)Γ2(S1)=T1.\displaystyle S_{2}:=\Gamma_{1}(T_{1})\supseteq\Gamma_{1}(T_{1}\cup T_{0}^{\mu})=\Gamma_{1}(T_{0}^{\mu})=S_{1}\quad\hbox{and}\quad T_{2}:=\Gamma_{2}(S_{2})\subseteq\Gamma_{2}(S_{1})=T_{1}.

Now, by defining Sn+1:=Γ1(Tn)S_{n+1}:=\Gamma_{1}(T_{n}) and Tn+1:=Γ2(Sn+1)T_{n+1}:=\Gamma_{2}(S_{n+1}) for all n3n\geq 3, we can follow the same argument in the proof of Theorem 4.1 to show that (Sn)(S_{n}) is nondecreasing, (Tn)(T_{n}) is nonincreasing, and (S,T)A(S_{\infty},T_{\infty})\in A with S:=nSnS_{\infty}:=\bigcup_{n}S_{n} and T:=nTnT_{\infty}:=\bigcap_{n}T_{n}. By construction, SS0SαS_{\infty}\supseteq S_{0}\supseteq S_{\alpha} and TT0TαT_{\infty}\subseteq T_{0}\subseteq T_{\alpha} for all αI\alpha\in I. Hence, (S,T)A(S_{\infty},T_{\infty})\in A is an upper bound for (Sα,Tα)αI(S_{\alpha},T_{\alpha})_{\alpha\in I}.
Step 2: Applying Zorn’s lemma.
As every totally ordered subset of AA is shown to have an upper bound in AA, Zorn’s lemma implies that there exists a maximal element in AA, denoted by (S¯,T¯)A(\bar{S},\bar{T})\in A. We claim that (S¯,T¯)^(\bar{S},\bar{T})\in\widehat{\mathcal{E}}. Set S0:=S¯S_{0}:=\bar{S}, T0:=T¯T_{0}:=\bar{T}, and define

Sn+1:=Γ1(Tn)andTn+1:=Γ2(Sn+1)n0.S_{n+1}:=\Gamma_{1}(T_{n})\quad\hbox{and}\quad T_{n+1}:=\Gamma_{2}(S_{n+1})\quad\forall n\geq 0.

Thanks to Γ1(T0)S0\Gamma_{1}(T_{0})\supseteq S_{0} and Γ2(S0)T0\Gamma_{2}(S_{0})\subseteq T_{0} (as (S0,T0)=(S¯,T¯)A(S_{0},T_{0})=(\bar{S},\bar{T})\in A), we may apply Corollary 3.1 recursively to show that (Sn)(S_{n}) is nondecreasing and (Tn)(T_{n}) is nonincreasing. Then, by the same argument in the proof of Theorem 4.1, we obtain (S,T)A(S_{\infty},T_{\infty})\in A with S:=nSnS_{\infty}:=\bigcup_{n}S_{n} and T:=nTnT_{\infty}:=\bigcap_{n}T_{n}. By construction, SS0=S¯S_{\infty}\supseteq S_{0}=\bar{S} and TT0=T¯T_{\infty}\subseteq T_{0}=\bar{T}. But since (S¯,T¯)(\bar{S},\bar{T}) is a maximal element of AA (under the partial order (4.16)), we must have S=S0S_{\infty}=S_{0} and T=T0T_{\infty}=T_{0}. This in particular implies S1=S0S_{1}=S_{0} and T1=T0T_{1}=T_{0}, so that

Γ1(T¯)=Γ1(T0)=S1=S0=S¯andΓ2(S¯)=Γ2(S0)=Γ2(S1)=T1=T0=T¯.\Gamma_{1}(\bar{T})=\Gamma_{1}(T_{0})=S_{1}=S_{0}=\bar{S}\quad\hbox{and}\quad\Gamma_{2}(\bar{S})=\Gamma_{2}(S_{0})=\Gamma_{2}(S_{1})=T_{1}=T_{0}=\bar{T}.

By Lemma 4.1, this readily implies (S¯,T¯)^(\bar{S},\bar{T})\in\widehat{\mathcal{E}}. ∎

Remark 4.2.

In view of (2.1)-(2.2), the condition fihigif_{i}\leq h_{i}\leq g_{i} (in Theorems 4.1 and 4.2) encourages each player to wait/continue until the other player stops, so as to obtain a larger reward. Consequently, each player faces the tradeoff between the potential (generous) gain from outlasting the other player and the cost of waiting that enlarges with time (due to discounting and possible loss of opportunity). That is, our Dynkin game exemplifies the “war of attrition” in game theory. The negotiation example in Section 5 below well demonstrates this “war”: Each firm intends to wait until the other firm gives in so as to seal the best deal, while subject to the impact of discounting and the varying cost of project initiation.

Remark 4.3.

In a classical (time-consistent) nonzero-sum Dynkin game, the condition fihigif_{i}\leq h_{i}\leq g_{i} ensures that a Nash equilibrium, as a tuple of pure stopping times (τ,σ)(\tau^{*},\sigma^{*}), exists; see e.g., [19]. Without the condition fihigif_{i}\leq h_{i}\leq g_{i}, a Nash equilibrium (τ,σ)(\tau^{*},\sigma^{*}) need not exist, as shown in [27]. One needs to consider randomized strategies to possibly establish the existence of a Nash equilibrium, as a tuple of randomized strategies. Still, in some cases, only an ε\varepsilon-Nash equilibrium is known to exist; see e.g., [39, 18, 27].

In this paper, as we assume fihigif_{i}\leq h_{i}\leq g_{i} (cf. Theorems 4.1 and 4.2), our focus on pure strategies is consistent with the literature. If we drop the condition fihigif_{i}\leq h_{i}\leq g_{i}, many arguments will no longer hold and we expect the use of randomized strategies indispensable. Randomized strategies for time-inconsistent stopping problems have recently been proposed and analyzed by [3] in discrete time and by [9] in continuous time. It is of interest as future research to modify their definitions and allow for randomized strategies in our Dynkin game.

4.3 Discussion on the Supermartingale Condition

It is worth noting that while the supermartingale condition (3.13) is required in Theorems 4.1 and 4.2, it does not play a role in Theorem 3.1. Because the one-player iterative procedure (3.4) is by construction monotone, it converges without the need of any other condition. It is much more complicated for the two-player alternating iterative procedure (4.2) to converge. The monotonicity of (3.4) only ensures that each iteration (performed by one of the two players) converges to a stopping policy, but says nothing about whether the two resulting sequences of policies (one sequence for each player) will actually converge. It is the supermartingale condition (3.13) that brings about the monotonicity for these two sequences of policies (on strength of Corollary 3.1), leading to an inter-personal equilibrium between the two players.

When (3.13) fails, the monotonicity in Corollary 3.1 no longer holds in general, and there may exist no inter-personal equilibrium, soft or sharp. To demonstrate this, consider a three-state model

𝕏={a,b,c}withx(X1=y){=0,for(x,y)=(a,c),>0,otherwise.\mathbb{X}=\{a,b,c\}\quad\hbox{with}\quad\mathbb{P}_{x}(X_{1}=y)\begin{cases}=0,\quad\hbox{for}\ (x,y)=(a,c),\\ >0,\quad\hbox{otherwise}.\end{cases} (4.22)

Given M>0M>0, define the payoff functions by

f1(a)\displaystyle f_{1}(a) =1,\displaystyle=1, g1(a)\displaystyle g_{1}(a) =M2,\displaystyle=M^{2}, f2(a)\displaystyle f_{2}(a) =M,\displaystyle=M, g2(a)\displaystyle g_{2}(a) =M+1,\displaystyle=M+1,
f1(b)\displaystyle f_{1}(b) =M,\displaystyle=M, g1(b)\displaystyle g_{1}(b) =M+1,\displaystyle=M+1, f2(b)\displaystyle f_{2}(b) =1,\displaystyle=1, g2(b)\displaystyle g_{2}(b) =2,\displaystyle=2, (4.23)
f1(c)\displaystyle f_{1}(c) =1,\displaystyle=1, g1(c)\displaystyle g_{1}(c) =2,\displaystyle=2, f2(c)\displaystyle f_{2}(c) =M2,\displaystyle=M^{2}, g2(c)\displaystyle g_{2}(c) =M2+1,\displaystyle=M^{2}+1,

and

hi(x)=12(fi(x)+gi(x)),x𝕏andi{1,2}.h_{i}(x)=\frac{1}{2}(f_{i}(x)+g_{i}(x)),\quad\forall x\in\mathbb{X}\ \hbox{and}\ i\in\{1,2\}. (4.24)
Proposition 4.1.

Under (4.22), (4.3), and (4.24), as M>0M>0 is large enough, (3.13) is violated and there exists no soft inter-personal equilibrium.

Proof.

Let pxy:=x(X1=y)p_{xy}:=\mathbb{P}_{x}(X_{1}=y) for all x,y𝕏x,y\in\mathbb{X}. Then,

𝔼c[δ1(1)g1(X1)]=δ1(1)(pcaM2+pcb(M+1)+pcc2)>2=g1(c),\mathbb{E}_{c}[\delta_{1}(1)g_{1}(X_{1})]=\delta_{1}(1)\left(p_{ca}M^{2}+p_{cb}(M+1)+p_{cc}\cdot 2\right)>2=g_{1}(c),

where the inequality holds as M>0M>0 is large enough. This readily shows that (3.13) is violated.

For any S,T=2𝕏S,T\in\mathcal{B}=2^{\mathbb{X}} and x𝕏x\in\mathbb{X}, we deduce from the definitions of fif_{i}, hih_{i}, gig_{i}, i{1,2}i\in\{1,2\}, that for each i{1,2}i\in\{1,2\},

Ji(x,ρS+,ρT)=k0+k1M+k2M2,for somek0[0,2]andk1,k2[0,1].J_{i}(x,\rho^{+}_{S},\rho_{T})=k_{0}+k_{1}M+k_{2}M^{2},\quad\hbox{for some}\ k_{0}\in[0,2]\ \hbox{and}\ k_{1},k_{2}\in[0,1]. (4.25)

Moreover,

ρT>0k0<2,k1<1,andk2<1.\rho_{T}>0\iff k_{0}<2,\ k_{1}<1,\ \hbox{and}\ k_{2}<1. (4.26)

By (4.25) and (4.26), it can be checked that

2𝕏=2{a,c}=2{c}={}but1={{b}},2{a,b}=2{a}=2={{c}}but1{c}={{b}},2{b,c}={{a}}but1{a}=,2{b}={{a,c}}but1{a,c}=.\begin{split}&\mathcal{E}_{2}^{\mathbb{X}}=\mathcal{E}_{2}^{\{a,c\}}=\mathcal{E}_{2}^{\{c\}}=\{\emptyset\}\ \ \hbox{but}\ \ \mathcal{E}_{1}^{\emptyset}=\{\{b\}\},\\ &\mathcal{E}_{2}^{\{a,b\}}=\mathcal{E}_{2}^{\{a\}}=\mathcal{E}_{2}^{\emptyset}=\{\{c\}\}\ \ \hbox{but}\ \ \mathcal{E}_{1}^{\{c\}}=\{\{b\}\},\\ &\mathcal{E}_{2}^{\{b,c\}}=\{\{a\}\}\ \ \hbox{but}\ \ \mathcal{E}_{1}^{\{a\}}=\emptyset,\\ &\mathcal{E}_{2}^{\{b\}}=\{\{a,c\}\}\ \ \hbox{but}\ \ \mathcal{E}_{1}^{\{a,c\}}=\emptyset.\end{split} (4.27)

This readily shows that there exists no (S,T)2𝕏×2𝕏(S,T)\in 2^{\mathbb{X}}\times 2^{\mathbb{X}} such that S1TS\in\mathcal{E}_{1}^{T} and T2ST\in\mathcal{E}_{2}^{S}, i.e. there exists no soft inter-personal equilibrium.

In the following, we will show the derivation of 1{c}={{b}}\mathcal{E}_{1}^{\{c\}}=\{\{b\}\} in detail, while all other identities in (4.27) can be proved in a similar manner. It follows directly from the definitions of f1f_{1} and g1g_{1} that J1(b,0,ρ{c})=f1(b)=MJ_{1}(b,0,\rho_{\{c\}})=f_{1}(b)=M and J1(b,ρ+,ρ{c})=k0<2J_{1}(b,\rho^{+}_{\emptyset},\rho_{\{c\}})=k_{0}<2. Hence, with M>0M>0 large enough,

b{x𝕏:J1(x,0,ρ{c})>J1(x,ρ+,ρ{c})}=Θ1{c}(),b\in\left\{x\in\mathbb{X}:J_{1}(x,0,\rho_{\{c\}})>J_{1}(x,\rho^{+}_{\emptyset},\rho_{\{c\}})\right\}=\Theta_{1}^{\{c\}}(\emptyset),

which implies Θ1{c}()\Theta_{1}^{\{c\}}(\emptyset)\neq\emptyset. Also, as J1(b,ρ{a}+,ρ{c})=k0<2J_{1}(b,\rho^{+}_{\{a\}},\rho_{\{c\}})=k_{0}<2, we can similarly conclude that with M>0M>0 large enough,

b{x{b,c}:J1(x,0,ρ{c})>J1(x,ρ{a}+,ρ{c})}Θ1{c}({a}),b\in\left\{x\in\{b,c\}:J_{1}(x,0,\rho_{\{c\}})>J_{1}(x,\rho^{+}_{\{a\}},\rho_{\{c\}})\right\}\subseteq\Theta_{1}^{\{c\}}(\{a\}),

which implies Θ1{c}({a}){a}\Theta_{1}^{\{c\}}(\{a\})\neq\{a\}. Note that

J1(c,0,ρ{c})=h1(c)<g1(c)=J1(c,ρS+,ρ{c}),S2𝕏.J_{1}(c,0,\rho_{\{c\}})=h_{1}(c)<g_{1}(c)=J_{1}(c,\rho^{+}_{S},\rho_{\{c\}}),\quad\forall S\in 2^{\mathbb{X}}. (4.28)

By taking S={c}S=\{c\}, S={b,c}S=\{b,c\}, and S=𝕏S=\mathbb{X} in (4.28), we immediately see that cΘ1{c}({c})c\notin\Theta^{\{c\}}_{1}(\{c\}), cΘ1{c}({b,c})c\notin\Theta^{\{c\}}_{1}(\{b,c\}), and cΘ1{c}(𝕏)c\notin\Theta^{\{c\}}_{1}(\mathbb{X}). We therefore conclude Θ1{c}({c}){c}\Theta^{\{c\}}_{1}(\{c\})\neq\{c\}, Θ1{c}({b,c}){b,c}\Theta^{\{c\}}_{1}(\{b,c\})\neq\{b,c\}, and Θ1{c}(𝕏)𝕏\Theta^{\{c\}}_{1}(\mathbb{X})\neq\mathbb{X}. Now, observe that J1(b,ρ{b}+,ρ{c})J_{1}(b,\rho^{+}_{\{b\}},\rho_{\{c\}}) and J1(a,ρ{b}+,ρ{c})J_{1}(a,\rho^{+}_{\{b\}},\rho_{\{c\}}) are both of the form k0+k1Mk_{0}+k_{1}M with k1<1k_{1}<1 (recall (4.26)). Thus, with M>0M>0 large enough,

J1(b,0,ρ{c})\displaystyle J_{1}(b,0,\rho_{\{c\}}) =f1(b)=M>k0+k1M=J1(b,ρ{b}+,ρ{c}),\displaystyle=f_{1}(b)=M>k_{0}+k_{1}M=J_{1}(b,\rho^{+}_{\{b\}},\rho_{\{c\}}), (4.29)
J1(a,0,ρ{c})\displaystyle J_{1}(a,0,\rho_{\{c\}}) =f1(a)=1<k0+k1M=J1(a,ρ{b}+,ρ{c}).\displaystyle=f_{1}(a)=1<k_{0}+k_{1}M=J_{1}(a,\rho^{+}_{\{b\}},\rho_{\{c\}}). (4.30)

This, together with (4.28), implies Θ1{c}({b})={b}\Theta^{\{c\}}_{1}(\{b\})=\{b\}. Since J1(a,ρ{a,b}+,ρ{c})J_{1}(a,\rho^{+}_{\{a,b\}},\rho_{\{c\}}) (resp. J1(a,ρ{a,c}+,ρ{c})J_{1}(a,\rho^{+}_{\{a,c\}},\rho_{\{c\}})) is also of the form k0+k1Mk_{0}+k_{1}M, the inequality in (4.30) indicates that with M>0M>0 large enough, aΘ1{c}({a,b})a\notin\Theta^{\{c\}}_{1}(\{a,b\}) (resp. aΘ1{c}({a,c})a\notin\Theta^{\{c\}}_{1}(\{a,c\})). Hence, we conclude Θ1{c}({a,b}){a,b}\Theta^{\{c\}}_{1}(\{a,b\})\neq\{a,b\} and Θ1{c}({a,c}){a,c}\Theta^{\{c\}}_{1}(\{a,c\})\neq\{a,c\}. In view of the above derivations, {b}\{b\} is the only intra-personal equilibrium for Player 1 w.r.t. Player 2’s policy {c}\{c\}, i.e. 1{c}={{b}}\mathcal{E}_{1}^{\{c\}}=\{\{b\}\}. ∎

Remark 4.4.

In (4.27), iT\mathcal{E}_{i}^{T} is a singleton for all i{1,2}i\in\{1,2\} and T2𝕏T\in 2^{\mathbb{X}}. The single element in iT\mathcal{E}_{i}^{T} is trivially the optimal intra-personal equilibrium, which can be recovered by Γi(T)\Gamma_{i}(T) in (3.5) thanks to Theorem 3.1. In other words, (4.27) implies

Γ2(𝕏)=Γ2({a,c})=Γ2({c})=,Γ2({a,b})=Γ2({a})=Γ2()={c},Γ2({b,c})={a},Γ2({b})={a,c},Γ1()=Γ1({c})={b},Γ1({a})=Γ1({a,c})=.\begin{split}&\Gamma_{2}(\mathbb{X})=\Gamma_{2}(\{a,c\})=\Gamma_{2}(\{c\})=\emptyset,\quad\Gamma_{2}(\{a,b\})=\Gamma_{2}(\{a\})=\Gamma_{2}(\emptyset)=\{c\},\\ &\Gamma_{2}(\{b,c\})=\{a\},\quad\Gamma_{2}(\{b\})=\{a,c\},\\ &\Gamma_{1}(\emptyset)=\Gamma_{1}(\{c\})=\{b\},\quad\Gamma_{1}(\{a\})=\Gamma_{1}(\{a,c\})=\emptyset.\end{split} (4.31)

This clearly shows that the monotonicity of TΓi(T)T\mapsto\Gamma_{i}(T) fails: Despite the inclusion {c}{b,c}𝕏\{c\}\subseteq\{b,c\}\subseteq\mathbb{X}, we have Γ2({c})=Γ2(𝕏)={a}=Γ2({b,c})\Gamma_{2}(\{c\})=\Gamma_{2}(\mathbb{X})=\emptyset\subseteq\{a\}=\Gamma_{2}(\{b,c\}).

Remark 4.5.

Another way to interpret (4.31) is that the alternating iterative procedure (4.2) will never converge, failing to provide any soft inter-personal equilibrium. Specifically, (4.31) indicates that the alternating iterations will always lead to loops, as listed below where Player 1’s stopping policies are underlined and Player 2’s stopping policies are double underlined.

1. {c}{b}{a,c}\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to....

2. {a}{c}{b}{a,c}{c}\uline{\{a\}}\to\uuline{\{c\}}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to....

3. {b}{a,c}{c}{b}\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to....

4. {c}{b}{a,c}{c}{b}\uline{\{c\}}\to\uuline{\emptyset}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to....

5. {a,b}{c}{b}{a,c}{c}\uline{\{a,b\}}\to\uuline{\{c\}}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to....

6. {a,c}{b}{a,c}{c}{b}\uline{\{a,c\}}\to\uuline{\emptyset}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to....

7. {b,c}{a}{c}{b}{a,c}\uline{\{b,c\}}\to\uuline{\{a\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to....

8. {a,b,c}{b}{a,c}{c}{b}\uline{\{a,b,c\}}\to\uuline{\emptyset}\to\uline{\{b\}}\to\uuline{\{a,c\}}\to\uline{\emptyset}\to\uuline{\{c\}}\to\uline{\{b\}}\to....

Remark 4.6.

As δ\delta is not specified in Proposition 4.1, the result admits an interesting implication for classical (time-consistent) nonzero-sum Dynkin games. To see this, let δ\delta be an exponential discount function so that there is no time inconsistency. Proposition 4.1 shows that even when the state process XX is time-homogeneous and payoff functions are as simple as (4.3)-(4.24), the Dynkin game has no “time-homogeneous” Nash equilibrium—a Nash equilibrium as a tuple of two stopping regions (S,T)(S,T), one for each player. Indeed, if such a Nash equilibrium existed, it would be a sharp inter-personal equilibrium under Definition 2.4. If a Nash equilibrium in fact exists, it must be of a more complicated form (which is also suggested by the constructions in [19, 27]).

5 Application: Negotiation with Diverse Impatience

In this section, we apply our theoretic results in Section 4 to a two-player real options valuation problem. The vast literature on real options, see e.g. [30, 12, 40] among many others, focuses on a single firm’s corporate decision making, particularly the optimal timing of a project’s initiation. In contrast to this, we will study two firms’ joint decision making on their cooperation to initiate a project together, embedding real options valuation in a nonzero-sum Dynkin game.

Consider two firms who would like to cooperate to initiate a new project, such as entering a new market or developing a new product. Each firm has a proprietary skill/technology, so that only when they cooperate can the project be successfully carried out. Once the project is initiated, it will generate a fixed total revenue R>0R>0. The cost of initiation XX, on the other hand, evolves stochastically and is modeled by a discrete-time binomial structure as follows: There exist u>1u>1 and p(0,1)p\in(0,1) such that XX takes values in

𝕏={ui:i=0,±1,±2,}\mathbb{X}=\{u^{i}:i=0,\pm 1,\pm 2,\dotso\} (5.1)

and satisfies

x(X1/x=u)=pandx(X1/x=1/u)=1p,x𝕏.\mathbb{P}_{x}(X_{1}/x=u)=p\quad\hbox{and}\quad\mathbb{P}_{x}(X_{1}/x=1/u)=1-p,\quad\forall x\in\mathbb{X}.

Assume additionally that XX is a submartingale, which corresponds to the condition p1u+1p\geq\frac{1}{u+1}. That is, the cost XX has a tendency to increase over time, which incentivizes the two firms to strike a deal of cooperation sooner than later.

In negotiating such a deal, each firm, leveraging on its proprietary skill/technology, insists on taking a fixed (risk-free) larger share

N(R/2,R)N\in(R/2,R)

of the total revenue R>0R>0, while demanding the other firm to take the smaller share

K:=RN(0,R/2)K:=R-N\in(0,R/2)

of revenue and additionally incur the stochastic (risky) cost XX. Each firm either waits until the other gives in and takes the larger payoff NN, or gives in to the other and takes the smaller payoff (KXτ)+(K-X_{\tau})^{+}, where τ\tau denotes the firm’s (random) time to give in. This can be formulated in our Dynkin game framework as

f1(x)=f2(x)=(Kx)+andg1(x)=g2(x)=N,x𝕏.f_{1}(x)=f_{2}(x)=(K-x)^{+}\quad\hbox{and}\quad g_{1}(x)=g_{2}(x)=N,\quad\forall x\in\mathbb{X}.

If the two firms happen to give in at the same time, they realize that both of them cannot endure any delay of a deal, and will quickly agree on a deal that is more mutually beneficial. This corresponds to the requirement fihigif_{i}\leq h_{i}\leq g_{i}, i{1,2}i\in\{1,2\}. In addition, we model the time preferences of the firms using the hyperbolic discount function, i.e. for i{1,2}i\in\{1,2\},

δi(t)=11+βit,\delta_{i}(t)=\frac{1}{1+\beta_{i}t},

where βi>0\beta_{i}>0 is a constant that represents the level of impatience of Firm ii.

To facilitate the investigation of inter-personal equilibria between the two firms, we introduce a random walk YY defined on some probability space (Ω¯,¯,P)(\bar{\Omega},\bar{\mathcal{F}},P) such that

P(Yt+1Yt=1)=pandP(Yt+1Yt=1)=1p,t+.P(Y_{t+1}-Y_{t}=1)=p\quad\hbox{and}\quad P(Y_{t+1}-Y_{t}=-1)=1-p,\quad\forall t\in\mathbb{Z}_{+}.

Consider

ξ:=inf{t0:Yt=0}\xi:=\inf\{t\geq 0:Y_{t}=0\}

and define, for each i{1,2}i\in\{1,2\},

αin:=En[11+βiξ]n,\alpha^{n}_{i}:=E^{n}\left[\frac{1}{1+\beta_{i}\xi}\right]\ \ \forall n\in\mathbb{N}, (5.2)

where EnE^{n} denotes the expectation under PP conditioned on Y0=nY_{0}=n. Note that αn\alpha_{n}, nn\in\mathbb{N}, can be computed explicitly. For instance,

αi1=k=1(2k1k)pk1(1p)k2k111+βi(2k1).\alpha^{1}_{i}=\sum_{k=1}^{\infty}\frac{\binom{2k-1}{k}p^{k-1}(1-p)^{k}}{2k-1}\cdot\frac{1}{1+\beta_{i}(2k-1)}. (5.3)
Lemma 5.1.

For i{1,2}i\in\{1,2\}, Γi()=(0,yi]𝕏\Gamma_{i}(\emptyset)=(0,y_{i}^{*}]\cap\mathbb{X}, where

yi:=min{[1αi1uαi1K,)𝕏}.y_{i}^{*}:=\min\bigg{\{}\bigg{[}\frac{1-\alpha_{i}^{1}}{u-\alpha_{i}^{1}}K,\infty\bigg{)}\cap\mathbb{X}\bigg{\}}. (5.4)
Proof.

Observe from (2.1) and (2.2) that

Ji(x,τ,ρ)=𝔼x[Fi(τ,ρ)]=𝔼x[δi(τ)fi(Xτ)]=𝔼x[(KXτ)+1+βiτ],x𝕏.J_{i}(x,\tau,\rho_{\emptyset})=\mathbb{E}_{x}[F_{i}(\tau,\rho_{\emptyset})]=\mathbb{E}_{x}[\delta_{i}(\tau)f_{i}(X_{\tau})]=\mathbb{E}_{x}\bigg{[}\frac{(K-X_{\tau})^{+}}{1+\beta_{i}\tau}\bigg{]},\quad\forall x\in\mathbb{X}.

Hence, the one-player stopping analysis in [24, Section 5] applies to our current setting. The same arguments therein (particularly [24, Proposition 5.5]) show that (0,yi]𝕏(0,y_{i}^{*}]\cap\mathbb{X}, with yiy_{i}^{*} given as in (5.4), is Player ii’s unique optimal intra-personal equilibrium w.r.t. \emptyset, i.e. ^i={(0,yi]𝕏}\widehat{\mathcal{E}}^{\emptyset}_{i}=\{(0,y_{i}^{*}]\cap\mathbb{X}\}. As Γi()\Gamma_{i}(\emptyset) belongs to ^i\widehat{\mathcal{E}}^{\emptyset}_{i} by Theorem 3.1, it must coincide with (0,yi]𝕏(0,y_{i}^{*}]\cap\mathbb{X}. ∎

With the aid of Lemma 5.1, we will show that the alternating iterative procedure (4.2) always leads to a sharp inter-personal equilibrium. Let us divide our investigation into two cases, depending on the impatience levels of the two firms: β1β2\beta_{1}\leq\beta_{2} (Proposition 5.1) and β1>β2\beta_{1}>\beta_{2} (Proposition 5.2).

Proposition 5.1.

Suppose β1β2\beta_{1}\leq\beta_{2}. Then, the alternating iterative procedure (4.2) terminates after one iteration, and gives a sharp inter-personal equilibrium. That is,

(S,T)=(S0,T0)=(,(0,y2]𝕏)^.(S_{\infty},T_{\infty})=(S_{0},T_{0})=(\emptyset,(0,y_{2}^{*}]\cap\mathbb{X})\in\widehat{\mathcal{E}}.
Proof.

Note from (5.3) that β1β2\beta_{1}\leq\beta_{2} implies α11α21\alpha_{1}^{1}\geq\alpha_{2}^{1}. This in turn yields y1y2y_{1}^{*}\leq y_{2}^{*}, in view of (5.4). Following (4.2), we have S0:=S_{0}:=\emptyset and T0:=Γ2(S0)=Γ2()=(0,y2]𝕏T_{0}:=\Gamma_{2}(S_{0})=\Gamma_{2}(\emptyset)=(0,y_{2}^{*}]\cap\mathbb{X}, where the last equality is due to Lemma 5.1. Now, in view of Corollary 3.1, Lemma 5.1, and Proposition 3.1,

Γ1(T0)Γ1()=(0,y1]𝕏,Γ1(T0)((0,y2]𝕏)=Γ1(T0)T0=.\begin{split}&\Gamma_{1}(T_{0})\subseteq\Gamma_{1}(\emptyset)=(0,y_{1}^{*}]\cap\mathbb{X},\\ &\Gamma_{1}(T_{0})\cap\big{(}(0,y_{2}^{*}]\cap\mathbb{X}\big{)}=\Gamma_{1}(T_{0})\cap T_{0}=\emptyset.\end{split} (5.5)

As y1y2y_{1}^{*}\leq y_{2}^{*}, the above two relations entail Γ1(T0)==S0\Gamma_{1}(T_{0})=\emptyset=S_{0}. This, together with Γ2(S0)=T0\Gamma_{2}(S_{0})=T_{0}, shows that (S0,T0)^(S_{0},T_{0})\in\widehat{\mathcal{E}}, thanks to Lemma 4.1. ∎

Remark 5.1.

For the case β1β2\beta_{1}\geq\beta_{2}, by starting with T0=T_{0}=\emptyset and switching the roles of SnS_{n} and TnT_{n} in (4.2), we may repeat the same arguments in the above proof to show that T0T_{0} and S0:=Γ1(T0)=(0,y1]𝕏S_{0}:=\Gamma_{1}(T_{0})=(0,y^{*}_{1}]\cap\mathbb{X} satisfy Γ2(S0)==T0\Gamma_{2}(S_{0})=\emptyset=T_{0}, which then yields (S0,T0)=((0,y1]𝕏,)^(S_{0},T_{0})=((0,y^{*}_{1}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}.

Proposition 5.2.

Suppose β1>β2\beta_{1}>\beta_{2}. Then, the alternating iterative procedure (4.2) falls into the following three cases.

  • Case 1: S1=S_{1}=\emptyset. Then (S,T)=(,(0,y2]𝕏)^(S_{\infty},T_{\infty})=(\emptyset,(0,y_{2}^{*}]\cap\mathbb{X})\in\widehat{\mathcal{E}}.

  • Case 2: S1S_{1}\neq\emptyset and Tn=T_{n}=\emptyset for some n>1n>1. Then (S,T)=((0,y1]𝕏,)^(S_{\infty},T_{\infty})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}.

  • Case 3: S1S_{1}\neq\emptyset and TnT_{n}\neq\emptyset for all n>1n>1.

    • If (4.2) terminates in finite steps, then there exist y2<cy1y^{*}_{2}<c\leq y^{*}_{1} and 0<dy20<d\leq y^{*}_{2} such that (S,T)=([c,y1]𝕏,(0,d]𝕏)^.(S_{\infty},T_{\infty})=([c,y_{1}^{*}]\cap\mathbb{X},(0,d]\cap\mathbb{X})\in\widehat{\mathcal{E}}.

    • If (4.2) continues indefinitely, (S,T)=((0,y1]𝕏,)^(S_{\infty},T_{\infty})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}.

Proof.

Note from (5.3) and (5.4) that β1>β2\beta_{1}>\beta_{2} implies y1y2y_{1}^{*}\geq y_{2}^{*}. If y1=y2y_{1}^{*}=y_{2}^{*} (which happens when β1β2>0\beta_{1}-\beta_{2}>0 is sufficiently small), the same arguments in the proof of Proposition 5.1 apply, and we end up with Case 1. In the rest of the proof, we assume y1>y2y_{1}^{*}>y_{2}^{*}. Following (4.2), S0:=S_{0}:=\emptyset and T0:=Γ2(S0)=Γ2()=(0,y2]𝕏T_{0}:=\Gamma_{2}(S_{0})=\Gamma_{2}(\emptyset)=(0,y_{2}^{*}]\cap\mathbb{X}, where the last equality is due to Lemma 5.1. If S1:=Γ1(T0)==S0S_{1}:=\Gamma_{1}(T_{0})=\emptyset=S_{0}, (4.2) terminates with Γ1(T0)=S0\Gamma_{1}(T_{0})=S_{0} and Γ2(S0)=T0\Gamma_{2}(S_{0})=T_{0}. Hence, (S,T)=(S0,T0)=(,(0,y2]𝕏)^(S_{\infty},T_{\infty})=(S_{0},T_{0})=(\emptyset,(0,y_{2}^{*}]\cap\mathbb{X})\in\widehat{\mathcal{E}} by Lemma 4.1. Alternatively, if S1S_{1}\neq\emptyset but Tn=T_{n}=\emptyset for some n>1n>1, following the argument in Remark 5.1 gives (Sn+1,Tn+1)=((0,y1]𝕏,)^(S_{n+1},T_{n+1})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}.

It remains to deal with the situation where S1S_{1}\neq\emptyset and TnT_{n}\neq\emptyset for all n1n\geq 1. By using Corollary 3.1, Lemma 5.1, and Proposition 3.1 as in (5.5), we get

S1:=Γ1(T0)Γ1()=(0,y1]𝕏,S1((0,y2]𝕏)=Γ1(T0)T0=.\begin{split}&S_{1}:=\Gamma_{1}(T_{0})\subseteq\Gamma_{1}(\emptyset)=(0,y_{1}^{*}]\cap\mathbb{X},\\ &S_{1}\cap\big{(}(0,y^{*}_{2}]\cap\mathbb{X}\big{)}=\Gamma_{1}(T_{0})\cap T_{0}=\emptyset.\end{split} (5.6)

As S1S_{1}\neq\emptyset and y2<y1y_{2}^{*}<y_{1}^{*}, the above two inequalities imply S1=A𝕏S_{1}=A\cap\mathbb{X} for some nonempty A(y2,y1]A\subseteq(y_{2}^{*},y_{1}^{*}]. With XX being a submartingale, the same argument in the second half of the proof of [24, Lemma 5.1] can be repeated here, showing that the set AA has to be connected, i.e. A=[c1,c1]A=[c_{1},c^{\prime}_{1}] for some c1,c1𝕏c_{1},c_{1}^{\prime}\in\mathbb{X} with y2<c1c1y1y_{2}^{*}<c_{1}\leq c^{\prime}_{1}\leq y_{1}^{*}. Now, by the same argument in [24, Lemma 5.3], c1𝕏c_{1}^{\prime}\in\mathbb{X} needs to be large or equal to 1α11uα11\frac{1-\alpha_{1}^{1}}{u-\alpha_{1}^{1}}, otherwise S1=[c1,c1]𝕏S_{1}=[c_{1},c^{\prime}_{1}]\cap\mathbb{X} would not belong to 1T0\mathcal{E}_{1}^{T_{0}} (which would contradict Proposition 3.1). We then conclude c1=y1c^{\prime}_{1}=y_{1}^{*} and thus S1=[c1,y1]𝕏S_{1}=[c_{1},y^{*}_{1}]\cap\mathbb{X}. Next, since T1:=Γ2(S1)T_{1}:=\Gamma_{2}(S_{1})\neq\emptyset and T1T0=(0,y2]𝕏T_{1}\subseteq T_{0}=(0,y_{2}^{*}]\cap\mathbb{X}, we must have T1=B𝕏T_{1}=B\cap\mathbb{X} for some nonempty B(0,y2]B\subseteq(0,y_{2}^{*}]. Again, by the same argument in the second half of the proof of [24, Lemma 5.1], BB is connected. Also, we must have infB=0\inf B=0, otherwise T1=B𝕏2S1T_{1}=B\cap\mathbb{X}\notin\mathcal{E}_{2}^{S_{1}} (which would contradict Proposition 3.1). Thus, T1=(0,d1]𝕏T_{1}=(0,d_{1}]\cap\mathbb{X} for some d1𝕏d_{1}\in\mathbb{X} with 0<d1y20<d_{1}\leq y_{2}^{*}. As (Sn)(S_{n}) is nondecreasing, (Tn)(T_{n}) is nonincreasing, and TnSn=Γ2(Sn)Sn=T_{n}\cap S_{n}=\Gamma_{2}(S_{n})\cap S_{n}=\emptyset (by Proposition 3.1), there exist nonincreasing sequences (cn)(c_{n}) and (dn)(d_{n}) of positive reals with dn<cnd_{n}<c_{n} such that

Sn=[cn,y1]𝕏andTn=(0,dn]𝕏,n.S_{n}=[c_{n},y^{*}_{1}]\cap\mathbb{X}\quad\hbox{and}\quad T_{n}=(0,d_{n}]\cap\mathbb{X},\quad\forall n\in\mathbb{N}. (5.7)

Now, if there exists nn^{*}\in\mathbb{N} such that cn=cn+1c_{n^{*}}=c_{n^{*}+1} or dn1=dnd_{n^{*}-1}=d_{n^{*}}, we get Γ2(Sn)=Tn\Gamma_{2}(S_{n^{*}})=T_{n^{*}} and Γ1(Tn)=Sn\Gamma_{1}(T_{n^{*}})=S_{n^{*}}, so that (S,T)=(Sn,Tn)^(S_{\infty},T_{\infty})=(S_{n^{*}},T_{n^{*}})\in\widehat{\mathcal{E}} by Lemma 4.1. If there exists no such nn^{*}\in\mathbb{N}, the iterative procedure (4.2) continues indefinitely with cn0c_{n}\downarrow 0 and dn0d_{n}\downarrow 0, leading to (S,T)=((0,y1]𝕏,)(S_{\infty},T_{\infty})=((0,y^{*}_{1}]\cap\mathbb{X},\emptyset), which belongs to ^\widehat{\mathcal{E}} by Remark 5.1. ∎

Finally, let us explore the more extreme situation where one firm is highly impatient while the other is highly patient.

Corollary 5.1.

If β1>0\beta_{1}>0 is sufficiently large and β2>0\beta_{2}>0 is sufficiently small, the alternating iterative procedure (4.2) yields (S,T)=((0,y1]𝕏,)^(S_{\infty},T_{\infty})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}.

Proof.

Take β1>β¯:=uu1(NK1)+1u1>0\beta_{1}>\overline{\beta}:=\frac{u}{u-1}\left(\frac{N}{K}-1\right)+\frac{1}{u-1}>0. With β1\beta_{1} fixed, in view of (5.3) and (5.1), there exists β>0\beta^{*}>0 small enough such that for β2<β\beta_{2}<\beta^{*}, we have 1α21uα21<1α11uα11\frac{1-\alpha_{2}^{1}}{u-\alpha_{2}^{1}}<\frac{1-\alpha_{1}^{1}}{u-\alpha_{1}^{1}} and the interval

(1α21uα21K,1α11uα11K)contains at least two points in 𝕏.\bigg{(}\frac{1-\alpha_{2}^{1}}{u-\alpha_{2}^{1}}K,\frac{1-\alpha_{1}^{1}}{u-\alpha_{1}^{1}}K\bigg{)}\ \hbox{contains at least two points in $\mathbb{X}$}. (5.8)

Define β¯:=p(NK1)>0\underline{\beta}:=p\left(\frac{N}{K}-1\right)>0 and take β2<min{β,β¯}\beta_{2}<\min\{\beta^{*},\underline{\beta}\}. Note that β1>β¯>β¯>β2\beta_{1}>\overline{\beta}>\underline{\beta}>\beta_{2}, where the second inequality is due to p<1p<1. With β1>β2\beta_{1}>\beta_{2}, we deduce from (5.3), (5.4), and (5.8) that y1>y2y_{1}^{*}>y_{2}^{*} and the interval

(y2,y1)contains at least one point in 𝕏.(y_{2}^{*},y_{1}^{*})\ \hbox{contains at least one point in $\mathbb{X}$}. (5.9)

Following (4.2), S0:=S_{0}:=\emptyset and T0:=Γ2(S0)=Γ2()=(0,y2]𝕏T_{0}:=\Gamma_{2}(S_{0})=\Gamma_{2}(\emptyset)=(0,y_{2}^{*}]\cap\mathbb{X}, where the last equality is due to Lemma 5.1. By (5.9), uy2𝕏uy_{2}^{*}\in\mathbb{X} must belong to (y2,y1)(y_{2}^{*},y_{1}^{*}). Recall from (5.6) that

S1[uy2,y1]𝕏.S_{1}\subseteq[uy_{2}^{*},y_{1}^{*}]\cap\mathbb{X}. (5.10)

For any x[uy2,y1)𝕏x\in[uy_{2}^{*},y_{1}^{*})\cap\mathbb{X}, observe that

V1T0(x,)=sup1τρ+𝔼x[F1(τ,T0)]N1+β1\displaystyle V_{1}^{T_{0}}(x,\emptyset)=\sup_{1\leq\tau\leq\rho^{+}_{\emptyset}}\mathbb{E}_{x}[F_{1}(\tau,T_{0})]\leq\frac{N}{1+\beta_{1}} <K1uK\displaystyle<K-\frac{1}{u}K
<K1α11uα11KKx=J1(x,0,ρT0),\displaystyle<K-\frac{1-\alpha_{1}^{1}}{u-\alpha_{1}^{1}}K\leq K-x=J_{1}(x,0,\rho_{T_{0}}), (5.11)

where the second inequality follows from β1>β¯\beta_{1}>\overline{\beta}, the third inequality is due to u>1u>1 and 0<α11<10<\alpha_{1}^{1}<1 (recall (5.2)), and the last inequality stems from x𝕏x\in\mathbb{X}, x<y1x<y^{*}_{1}, and the definition of y1y^{*}_{1} in (5.4). This shows that [uy2,y1)𝕏Φ1T0()[uy_{2}^{*},y_{1}^{*})\cap\mathbb{X}\subseteq\Phi^{T_{0}}_{1}(\emptyset). Then, in view of (3.4)-(3.5), S1:=Γ1(T0)Φ1T0()[uy2,y1)𝕏S_{1}:=\Gamma_{1}(T_{0})\supseteq\Phi^{T_{0}}_{1}(\emptyset)\supseteq[uy_{2}^{*},y_{1}^{*})\cap\mathbb{X}. In particular, S1S_{1}\neq\emptyset as it must contain uy2uy^{*}_{2}. If Tn=T_{n}=\emptyset for some n>1n>1, Case 2 of Proposition 5.2 immediately gives (S,T)=((0,y1]𝕏,)^(S_{\infty},T_{\infty})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}, as desired. Hence, we assume TnT_{n}\neq\emptyset for all nn\in\mathbb{N} in the rest of the proof.

As argued below (5.6), T1:=Γ2(S1)=(0,d1]𝕏T_{1}:=\Gamma_{2}(S_{1})=(0,d_{1}]\cap\mathbb{X} for some d1𝕏d_{1}\in\mathbb{X} with 0<d1y20<d_{1}\leq y^{*}_{2}. We claim that d1<y2d_{1}<y_{2}^{*}. First, observe from (5.4) that y2<u1α21uα21K<Ky_{2}^{*}<u\cdot\frac{1-\alpha_{2}^{1}}{u-\alpha_{2}^{1}}K<K. Consider the function

η(x):=p(NK)+x(11pu)Kxfor x[0,K).\eta(x):=\frac{p(N-K)+x(1-\frac{1-p}{u})}{K-x}\quad\hbox{for $x\in[0,K)$.}

As η(x)>0\eta^{\prime}(x)>0 for all x[0,K)x\in[0,K), we have β¯=p(NK1)=η(0)<η(x)\underline{\beta}=p\left(\frac{N}{K}-1\right)=\eta(0)<\eta(x) for all x(0,K)x\in(0,K). Recalling β2<β¯\beta_{2}<\underline{\beta}, we obtain β2<η(x)\beta_{2}<\eta(x) for all x(0,K)x\in(0,K). By direct calculation, this is equivalent to

(1p)Kx/u1+β2+pN1+β2>Kx,x(0,K).(1-p)\frac{K-x/u}{1+\beta_{2}}+p\frac{N}{1+\beta_{2}}>K-x,\quad\forall x\in(0,K). (5.12)

Assume to the contrary that d1=y2d_{1}=y_{2}^{*}. Then,

J2(d1,ρT1+,ρS1)=(1p)Kd1/u1+β2+pN1+β2>Kd1=J2(d1,0,ρS1),J_{2}(d_{1},\rho^{+}_{T_{1}},\rho_{S_{1}})=(1-p)\frac{K-d_{1}/u}{1+\beta_{2}}+p\frac{N}{1+\beta_{2}}>K-d_{1}=J_{2}(d_{1},0,\rho_{S_{1}}),

where the inequality follows from d1=y2<Kd_{1}=y_{2}^{*}<K and (5.12). This indicates T1=(0,d1]𝕏2S0T_{1}=(0,d_{1}]\cap\mathbb{X}\notin\mathcal{E}_{2}^{S_{0}}, a contradiction to T1=Γ2(S1)2S0T_{1}=\Gamma_{2}(S_{1})\in\mathcal{E}_{2}^{S_{0}} (by Proposition 3.1). With d1<y2d_{1}<y_{2}^{*} established, we conclude T1T0T_{1}\subsetneq T_{0}. For any x[ud1,y1)𝕏x\in[ud_{1},y^{*}_{1})\cap\mathbb{X}, the same calculation as in (5.11) (with T0T_{0} replaced by T1T_{1}) yields V1T1(x,)<J1(x,0,ρT1)V_{1}^{T_{1}}(x,\emptyset)<J_{1}(x,0,\rho_{T_{1}}). By the same arguments below (5.11), this yields S2:=Γ1(T1)Φ1T1()[ud1,y1)𝕏S_{2}:=\Gamma_{1}(T_{1})\supseteq\Phi^{T_{1}}_{1}(\emptyset)\supseteq[ud_{1},y_{1}^{*})\cap\mathbb{X}; particularly, we have ud1S2ud_{1}\in S_{2}. As S2S1S_{2}\supseteq S_{1} by construction, we deduce from ud1S2ud_{1}\in S_{2}, (5.10), and d1<y2d_{1}<y_{2}^{*}, that S2S1S_{2}\supsetneq S_{1}. By following the same arguments as above, we can show recursively that TnTn1T_{n}\subsetneq T_{n-1} and Sn+1SnS_{n+1}\supsetneq S_{n} for all n>1n>1. By Case 3 of Proposition 5.2, (S,T)=((0,y1]𝕏,)^(S_{\infty},T_{\infty})=((0,y_{1}^{*}]\cap\mathbb{X},\emptyset)\in\widehat{\mathcal{E}}. ∎

Propositions 5.1 and 5.2, along with Corollary 5.1, admit interesting economic interpretations. Intuitively, a firm can demonstrate a strong determination not to give in, so as to coerce the other firm into giving in in the negotiation. Whether this strategy will work depends on the impatience levels of the two firms. For the case where Firm 1 is less impatient than Firm 2 (i.e. β1β2\beta_{1}\leq\beta_{2}), Propositions 5.1 shows that when Firm 1 insists that it will never give in (i.e. S0=S_{0}=\emptyset), Firm 2 indeed gives in by taking the stopping policy T0=(0,y2]𝕏T_{0}=(0,y_{2}^{*}]\cap\mathbb{X}, and (S0,T0)=(,(0,y2]𝕏)(S_{0},T_{0})=(\emptyset,(0,y_{2}^{*}]\cap\mathbb{X}) is already a sharp inter-personal equilibrium. For the case where Firm 1 is more impatient than Firm 2 (i.e. β1>β2\beta_{1}>\beta_{2}), Propositions 5.2 shows that the negotiation becomes more complicated and Firm 1’s coercion does not necessarily work. In particular, if Firm 1 is sufficiently impatient and Firm 2 is sufficiently patient, Corollary 5.1 shows that Firm 1’s coercion must fail, and fail in a drastic way—the coercer becomes coerced. While Firm 1 started with S0=S_{0}=\emptyset, trying to coerce Firm 2 into giving in, the alternating game-theoretic reasoning (4.2) eventually lead to the sharp inter-personal equilibrium ((0,y1]𝕏,)((0,y_{1}^{*}]\cap\mathbb{X},\emptyset). That is, it is Firm 1 who ultimately gives in by taking the stopping policy (0,y1]𝕏(0,y_{1}^{*}]\cap\mathbb{X}, while Firm 2 in the end decides not to stop at all.

References

  • [1] N. Akakpo and C. Lacour, Inhomogeneous and anisotropic conditional density estimation from dependent data, Electron. J. Stat., 5 (2011), pp. 1618–1653.
  • [2] E. Bayraktar and S. Yao, On the robust Dynkin game, Ann. Appl. Probab., 27 (2017), pp. 1702–1755.
  • [3] E. Bayraktar, J. Zhang, and Z. Zhou, Time consistent stopping for the mean-standard deviation problem—the discrete time case, SIAM J. Financial Math., 10 (2019), pp. 667–697.
  • [4]  , Equilibrium concepts for time-inconsistent stopping problems in continuous time, Mathematical Finance, 31 (2021), pp. 508–530.
  • [5] J.-M. Bismut, Sur un problème de Dynkin, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 39 (1977), pp. 31–53.
  • [6] T. Björk, M. Khapko, and A. Murgoci, On time-inconsistent stochastic control in continuous time, Finance and Stochastics, 21 (2017), pp. 331–360.
  • [7] T. Björk, A. Murgoci, and X. Y. Zhou, Mean-variance portfolio optimization with state-dependent risk aversion, Math. Finance, 24 (2014), pp. 1–24.
  • [8] S. Christensen and K. Lindensjö, On finding equilibrium stopping times for time-inconsistent Markovian problems, SIAM J. Control Optim., 56 (2018), pp. 4228–4255.
  • [9]  , On time-inconsistent stopping problems and mixed strategy stopping times, Stochastic Process. Appl., 130 (2020), pp. 2886–2917.
  • [10] J. Cvitanić and I. Karatzas, Backward stochastic differential equations with reflection and Dynkin games, Ann. Probab., 24 (1996), pp. 2024–2056.
  • [11] T. De Angelis, G. Ferrari, and J. Moriarty, Nash equilibria of threshold type for two-player nonzero-sum games of stopping, Ann. Appl. Probab., 28 (2018), pp. 112–147.
  • [12] A. Dixit and R. Pindyck, Investment under uncertainty, Princeton University Press, Princeton, NJ, (1994).
  • [13] E. B. Dynkin, Game variant of a problem on optimal stopping, Soviet Math. Dokl., 10 (1969), pp. 270–274.
  • [14] S. Ebert, W. Wei, and X. Y. Zhou, Weighted discounting—on group diversity, time-inconsistency, and consequences for investment, Journal of Economic Theory, 189 (2020), p. 105089.
  • [15] I. Ekeland and A. Lazrak, Being serious about non-commitment: subgame perfect equilibrium in continuous time, tech. rep., University of British Columbia, 2006. Available at http://arxiv.org/abs/math/0604264.
  • [16] I. Ekeland, O. Mbodji, and T. A. Pirvu, Time-consistent portfolio management, SIAM J. Financial Math., 3 (2012), pp. 1–32.
  • [17] I. Ekeland and T. A. Pirvu, Investment and consumption without commitment, Math. Financ. Econ., 2 (2008), pp. 57–86.
  • [18] E. Z. Ferenstein, On randomized stopping games, in Advances in dynamic games, vol. 7 of Ann. Internat. Soc. Dynam. Games, Birkhäuser Boston, Boston, MA, 2005, pp. 223–233.
  • [19] S. Hamadène and J. Zhang, The continuous time nonzero-sum Dynkin game problem and application in game options, SIAM J. Control Optim., 48 (2009/10), pp. 3659–3669.
  • [20] Y.-J. Huang and A. Nguyen-Huu, Time-consistent stopping under decreasing impatience, Finance and Stochastics, 22 (2018), pp. 69–95.
  • [21] Y.-J. Huang, A. Nguyen-Huu, and X. Y. Zhou, General stopping behaviors of naïve and noncommitted sophisticated agents, with application to probability distortion, Mathematical Finance, 30 (2020), pp. 310–340.
  • [22] Y.-J. Huang and Z. Wang, Optimal equilibria for multi-dimensional time-inconsistent stopping problems, SIAM J. Control Optim., 59 (2021), pp. 1705–1729.
  • [23] Y.-J. Huang and X. Yu, Optimal stopping under model ambiguity: a time-consistent equilibrium approach, Mathematical Finance, 31 (2021), pp. 979–1012.
  • [24] Y.-J. Huang and Z. Zhou, The optimal equilibrium for time-inconsistent stopping problems—the discrete-time case, SIAM J. Control Optim., 57 (2019), pp. 590–609.
  • [25]  , Optimal equilibria for time-inconsistent stopping problems in continuous time, Mathematical Finance, 30 (2020), pp. 1103–1134.
  • [26] M. R. Kosorok, Introduction to empirical processes and semiparametric inference, Springer Series in Statistics, Springer, New York, 2008.
  • [27] R. Laraki and E. Solan, Equilibrium in two-player non-zero-sum Dynkin games in continuous time, Stochastics, 85 (2013), pp. 997–1014.
  • [28] J.-P. Lepeltier and M. A. Maingueneau, Le jeu de Dynkin en théorie générale sans l’hypothèse de Mokobodski, Stochastics, 13 (1984), pp. 25–44.
  • [29] G. Loewenstein and R. Thaler, Anomalies: Intertemporal choice, Journal of Economic Perspectives, 3 (1989), pp. 181–193.
  • [30] R. McDonald and D. Siegel, The value of waiting to invest, The Quarterly Journal of Economics, 101 (1986), pp. 707–727.
  • [31] H. Morimoto, Dynkin games and martingale methods, Stochastics, 13 (1984), pp. 213–228.
  • [32]  , Non-zero-sum discrete parameter stochastic games with stopping times, Probab. Theory Relat. Fields, 72 (1986), pp. 155–160.
  • [33] H. Nagai, Non-zero-sum stopping games of symmetric Markov processes, Probab. Theory Related Fields, 75 (1987), pp. 487–497.
  • [34] J. Neveu, Discrete-parameter martingales, North-Holland Publishing Co., Amsterdam-Oxford; American Elsevier Publishing Co., Inc., New York, revised ed., 1975.
  • [35] Y. Ohtsubo, A nonzero-sum extension of Dynkin’s stopping problem, Math. Oper. Res., 12 (1987), pp. 277–296.
  • [36] R. A. Pollak, Consistent planning, The Review of Economic Studies, (1968), pp. 201–208.
  • [37] D. Rosenberg, E. Solan, and N. Vieille, Stopping games with randomized strategies, Probab. Theory Related Fields, 119 (2001), pp. 433–451.
  • [38] M. Sart, Estimation of the transition density of a Markov chain, Ann. Inst. Henri Poincaré Probab. Stat., 50 (2014), pp. 1028–1068.
  • [39] E. Shmaya and E. Solan, Two-player nonzero-sum stopping games in discrete time, Ann. Probab., 32 (2004), pp. 2733–2764.
  • [40] J. E. Smith and R. F. Nau, Valuing risky projects: option pricing theory and decision analysis, Management Science, 41(5) (1995), pp. 795–816.
  • [41] R. H. Strotz, Myopia and inconsistency in dynamic utility maximization, The Review of Economic Studies, 23 (1955), pp. 165–180.
  • [42] R. Thaler, Some empirical evidence on dynamic inconsistency, Econ. Lett., 8 (1981), pp. 201–207.
  • [43] N. Touzi and N. Vieille, Continuous-time Dynkin games with mixed strategies, SIAM J. Control Optim., 41 (2002), pp. 1073–1088.
  • [44] M. Yasuda, On a randomized strategy in Neveu’s stopping problem, Stochastic Process. Appl., 21 (1985), pp. 159–166.
  • [45] J. Yong, Time-inconsistent optimal control problems and the equilibrium HJB equation, Mathematical Control and Related Fields, 3 (2012), pp. 271–329.