This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Limit Cycles Analysis and Control of Evolutionary Game Dynamics with Environmental Feedback

Lulu Gong    Weijia Yao    Jian Gao    Ming Cao ENTEG, Faculty of Science and Engineering, University of Groningen, 9747 AG, Groningen, The Netherlands, (e-mail: [email protected], [email protected], and [email protected]). School of Science, Beijing University of Posts and Telecommunications, Beijing 100876, China, (e-mail: [email protected]).
Abstract

Recently, an evolutionary game dynamics model taking into account the environmental feedback has been proposed to describe the co-evolution of strategic actions of a population of individuals and the state of the surrounding environment; correspondingly a range of interesting dynamic behaviors have been reported. In this paper, we provide new theoretical insight into such behaviors and discuss control options. Instead of the standard replicator dynamics, we use a more realistic and comprehensive model of replicator-mutator dynamics, to describe the strategic evolution of the population. After integrating the environment feedback, we study the effect of mutations on the resulting closed-loop system dynamics. We prove the conditions for two types of bifurcations, Hopf bifurcation and Heteroclinic bifurcation, both of which result in stable limit cycles. These limit cycles have not been identified in existing works, and we further prove that such limit cycles are in fact persistent in a large parameter space and are almost globally stable. In the end, an intuitive control policy based on incentives is applied, and the effectiveness of this control policy is examined by analysis and simulations.

keywords:
Replicator-mutator dynamics, game-environment feedback, limit cycles, Hopf bifurcation, Heteroclinic bifurcation, local and global stability, incentive-based control.
keywords:
Replicator-mutator dynamics, game-environment feedback, limit cycles, incentive-based control.
thanks: The work of M. Cao is supported in part by the European Research Council (ERC-CoG-771687). L. Gong, W. Yao, and J. Gao are funded by China Scholarship Council (CSC).

, , ,

1 Introduction

Evolutionary game theory studies the evolution of strategic decision-making processes of individuals in one or multiple populations. A range of mathematical models have been extensively investigated, among which the replicator dynamics model is the most well-known [10]. Powerful theories and tools originating from evolutionary game theory facilitate the study of complex system behaviors of biological, ecological, social, and engineering fields [19], [12], [14], [22], [30]. In the classic game setting, the payoffs in each two-player game are usually predetermined in the form of constant payoff matrices. However, in many applications, especially those involving common resources in the environment that are consumed by groups of individuals, it is recognized that the payoffs can change over time or be affected directly by external environmental factors. Thus game-playing individuals’ decisions can influence the surrounding environment, and the environment, especially its richness in resources, also acts back on the payoff distributions, which forms the so-called bi-directional game-environment feedback [35]. Such a feedback mechanism is believed to be able to explain rich real-world complex population dynamics, including human decision-making, social culture evolution, plant nutrient acquisition, and natural resource harvesting. In particular, evolutionary dynamics with game-environment feedback have attracted great attention in recent years, due to its high relevance in biological and sociological systems [24], [14], [32].

This line of research is originated in [35], where the replicator dynamics of a two-player two-strategy game are coupled with the logistic environment resource dynamics. Utilizing an elaborate environment-dependent payoff matrix, the joint game-environment dynamics exhibit interesting system behaviors. In most cases, the strategic states of the population and the environment converge to an equilibrium point on the boundary of the phase space with the zero value of environment state, which reflects the tragedy of the commons. The system dynamics may also show cyclic oscillations which correspond to closed periodic orbits. Particularly when the system dynamics converge to a heteroclinic cycle on the boundary, the environment state cycles between low and high values, which is referred to as the oscillating tragedy of commons. Since then, [35] has been followed by several extensions and variations [7], [32], [4], [11], [18], [15]. The game-environment feedback mechanism has been studied more broadly in different applications, while its theoretical settings have been adapted in different research directions, e.g. implementing different types of environment resource models and considering the structures of the populations. New meaningful dynamical behaviors have been identified, and these results are in turn helpful to understand real-world applications.

The majority of the existing studies on the game-environment framework chooses replicator dynamics to model the strategic dynamics of the game-environment system. While the replicator dynamics have been proved to be a powerful model in analyzing a variety of classical games, they do not take into account the mutation or exploration in strategies which exists ubiquitously in biological and sociological systems. Genetic mutations typically occur with small but non-negligible probabilities, and random exploration of available strategies is common in behavioral experiments [33]. Mutations and explorations can be captured by allowing individuals to spontaneously switch from one strategy to another in small probability, which results in the so-called replicator-mutator dynamics. It has also played a prominent role in evolutionary game theory and has been applied frequently in different fields [19], [12], [23], [22].

We note that although rich system behaviors, such as convergence to equilibria or heteroclinic cycles on the boundary and existence of neutral periodic orbits, have been identified in related works, the results on the limit cycle dynamics have been rarely reported in the game-environment feedback setting. Only in [32], it is shown that the time-scale difference between game and environment dynamics can result in limit cycles. When mutations are taken into account, it is of great interest to study if the coupled game and environment dynamics can exhibit limit cycles.

We also emphasize that research on how to design reasonable and effective control policies to achieve expected system behaviors in evolutionary games has attracted increasing attention [17], [37], [25], [27]. Specifically, using game-environment feedback, [21] considers optimal control problems where control takes the form of the incentives and opinions respectively. The adopted control policies can maximize accumulated resource over time. However, since the controlled system still exhibits heteroclinic oscillations, repeated collapses of the resource are inevitable. [34] considers adjusting the law of feedback from population states to the environment based on a slightly different co-evolutionary model of public goods games. An involved nonlinear control law is proved to be able to steer the system to evolve towards the designed behaviors, but the biological implication of such control policies still remains open to debate.

Motivated by all the unsolved issues just mentioned, in this work we study the simultaneous co-evolution of game and environment using an integrated model consisting of replicator-mutator dynamics and logistic resource dynamics. Although similar linear environment-dependent payoff matrices have been studied before, we focus on investigating the significant impact of mutations on the overall system dynamics using bifurcation theory as the main tool. In particular, we show that two different types of bifurcations, which lead to limit cycles, can take place. In sharp comparison to the marginal oscillating behaviors reported before, such as neutral periodic orbits and heteroclinic cycles, we clarify that the identified limit cycle conditions correspond to a large space of the mutation parameter and such limit cycles are almost globally stable. In this sense, the limit cycle behavior is robust and persistent, which agrees with biological observations. Furthermore, when mutations are not too rare, the corresponding limit cycle will not be close to the boundary, and consequently the (oscillating) tragedy of commons can be averted.

Enabled by the new insight into co-evolutionary dynamics of the game and environment, we consider further a simple but effective incentive-based control policy, where the individuals receive some external incentives in the game interactions if they choose the cooperation strategy. It is shown that the controlled system dynamics can converge to certain equilibria with a high level of the environmental resource. Thus, the effectiveness of the proposed control policy is guaranteed. Compared with the proposed control policies in [[21]] and [[34]], our proposed approach is not only more effective, but also easier to implement than influencing the opinions of the individuals or realizing more complicated feedback between strategists and the environment.

Our preliminary conference paper [5] considers Hopf bifurcation and limit cycles of the system dynamics in a special case where the enhancement effect and the degradation effect is balanced, but in this paper we consider general bifurcation conditions and provide a comprehensive account of the results. Moreover, the latter part about incentive-based control of this paper is not in the conference version.

The remainder of this paper is organized as follows. Section 22 provides the framework of the co-evolutionary dynamics, and introduces the integrated model with game-environment feedback. The analysis of system dynamics is presented in Section 33. The incentive-based control is applied and its performance is provided in Section 44. Finally, concluding remarks and discussion points are given in Section 55.

2 Problem Formulation

Co-evolutionary game and environment dynamics occur because evolutionary game dynamics are sometimes coupled with the dynamics of the surrounding environment (see Fig. 1 for an illustrative drawing). The coupled game and environment dynamics form a closed-loop system, which consists of bi-directional actions between the environment and the population’s game play: an individual’s fitness depends on not only the population state, but also the state of the environment, and the state of the environment is influenced by the distribution of the choices of strategies in the population. In the following subsections, we will introduce each component of this system in turn, and then integrate them to give a complete description of our model.

Refer to caption
Figure 1: The framework of evolutionary game dynamics with the environmental feedback

2.1 Environment-dependent payoffs

We consider games played in a changing environment, which is characterized by the richness r[0,1]r\in[0,1] of a resource of interest; such a resource has direct influence on the payoffs that players receive. Correspondingly, the payoffs become dynamic and depend on rr. We consider a two-player game with two strategies, Cooperation and Defection (C, D). Following [35], we assume that the environment-dependent payoff matrix is a convex combination of the payoff matrices from the classic Prisoner’s Dilemma, i.e.,

A(r)\displaystyle A(r) =(1r)[R1S1T1P1]+r[R2S2T2P2]\displaystyle=(1-r)\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}+r\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix} (1)
=[R1+(R2R1)rS1+(S2S1)rT1+(T2T1)rP1+(P2P1)r],\displaystyle=\begin{bmatrix}R_{1}+(R_{2}-R_{1})r&S_{1}+(S_{2}-S_{1})r\\ T_{1}+(T_{2}-T_{1})r&P_{1}+(P_{2}-P_{1})r\end{bmatrix},

where R1>T1R_{1}>T_{1}, S1>P1S_{1}>P_{1}, R2<T2R_{2}<T_{2} and S2<P2S_{2}<P_{2}. The entries of A(r)A(r) represent the payoffs to the players at the given environment state rr. For example, a C player receives the payoff of R1+(R2R1)rR_{1}+(R_{2}-R_{1})r and S1+(S2S1)rS_{1}+(S_{2}-S_{1})r when the opponent uses strategies C and D, respectively. Because of the inequalities below (1), the two matrices [R2S2T2P2]\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix} and [R1S1T1P1]\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix} correspond to the prisoner’s dilemma and its inverse game, respectively. Note that the payoff matrix A(r)A(r) is the linear interpolation between the above two matrices. Thus, when rr approaches 0 and 11, the mutual cooperation and defection tend to be the Nash Equilibrium of this dynamic game respectively. The biological interpretation is that: the individuals tend to cooperate facing the depleted environmental resource, and become more selfish and choose to defect while the resource becomes abundant.

2.2 Replicator-Mutator equations

The two-player game governed by the payoff matrix A(r)A(r) in (1) are played in a well-mixed and infinite population. We denote the proportion of individuals choosing C by the variable x[0,1]x\in[0,1]. Since there are only two strategies, the population state can be represented by the vector 𝐱=[x1x]\mathbf{x}=[x~{}~{}1-x]^{\top}. Assume the individuals with different strategies can mutate into each other with an identical probability μ[0,1]\mu\in[0,1]: a fraction of cooperators, μx\mu x, spontaneously change to choose strategy D; a fraction of defectors, μ(1x)\mu(1-x), change to choose strategy C at the same time. We focus on the dynamics of xx, which can be described by the following replicator-mutator equation

x˙=x[(A(r)𝐱)1𝐱A(r)𝐱]μx+μ(1x),\dot{x}=x[(A(r)\mathbf{x})_{1}-\mathbf{x}^{\top}A(r)\mathbf{x}]-\mu x+\mu(1-x), (2)

where (A(r)𝐱)1(A(r)\mathbf{x})_{1} is the first entry of A(r)𝐱A(r)\mathbf{x} and represents the fitness of choosing strategy C, and 𝐱A(r)𝐱\mathbf{x}^{\top}A(r)\mathbf{x} is the average fitness at the population state 𝐱\mathbf{x}. The term μx-\mu x represents the mutation from strategy C towards D, while μ(1x)\mu(1-x) captures the mutation in the opposite direction, from D towards C.

2.3 Closed-loop system with game-environment feedback

To model the evolution of the environmental change, we use the standard logistic model

r˙=r(1r)[θx(1x)],\dot{r}=r(1-r)[\theta x-(1-x)], (3)

where the parameter θ>0\theta>0 represents the ratio between the enhancement effect due to cooperation and degradation effect due to defection. The influence of the population on the environment state is captured by the function [θx(1x)][\theta x-(1-x)], and the dynamics are restricted in the unit interval by r(1r)r(1-r). When θ=1\theta=1, the enhancement effect and the degradation effect are balanced. And if θ>1\theta>1 or 0<θ<10<\theta<1, the enhancement effect is respectively stronger or weaker than the opposite.

Substituting (1) into (2), and then combining with (3), we obtain the closed-loop planar system describing the evolutionary game dynamics under the game-environment feedback:

{x˙=x(1x)[xr(c+da+b)+x(ab)r(d+b)+b]+μ(12x)r˙=r(1r)[θx(1x)],\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[xr(-c+d-a+b)+x(a-b)\\ &-r(d+b)+b]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)[\theta x-(1-x)],\end{cases} (4)

where to simplify notations, we have used a=R1T1a=R_{1}-T_{1}, b=S1P1b=S_{1}-P_{1}, c=T2R2c=T_{2}-R_{2}, and d=P2S2d=P_{2}-S_{2}, all of which are positive because of the inequalities below (1). Note that system (4) reduces to the model considered in [35] when μ=0\mu=0. Therefore, system (4) generalizes the previous model.

In view of the payoff matrix (1) and system equations (4), these four parameters for re-parameterization have intuitive interpretations with respect to strategy changes when the resource is either abundant or depleted: the parameter aa quantifies the incentive to stick to strategy C given that all individuals are following strategy C and the system is currently in a resource-poor state; bb quantifies the incentive to switch to strategy C when the resource is depleted and all individuals enforce strategy D; in contrast, cc quantifies the incentive to switch to strategy D given that all individuals enforce strategy C and the system is in a state of sufficient resource; dd quantifies the incentive to stick to strategy D when the resource is abundant and all individuals are following strategy D.

3 System Dynamics and Bifurcations

The state space of system (4) is the unit square =[0,1]2\mathcal{I}=[0,1]^{2}. The interior region of this square is denoted by int()=(0,1)2\text{int}(\mathcal{I})=(0,1)^{2}. Four sides of the unit square form the boundary \partial\mathcal{I}. For convenience of exposition, we denote the four sides by:

t={(x,r):x[0,1],r=1},\displaystyle\mathcal{B}_{t}=\{(x,r):x\in[0,1],r=1\}, (5)
b={(x,r):x[0,1],r=0},\displaystyle\mathcal{B}_{b}=\{(x,r):x\in[0,1],r=0\},
l={(x,r):x=0,r[0,1]},\displaystyle\mathcal{B}_{l}=\{(x,r):x=0,r\in[0,1]\},
r={(x,r):x=1,r[0,1]}.\displaystyle\mathcal{B}_{r}=\{(x,r):x=1,r\in[0,1]\}.

Then, one has :=tblr\partial\mathcal{I}:=\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup\mathcal{B}_{l}\cup\mathcal{B}_{r}. It is easy to check that \mathcal{I} is invariant under the dynamics (4) for all μ[0,1]\mu\in[0,1] by Nagumo’s theorem [1, Theorem 4.7].

System (4) can have equilibria in the interior int()\text{int}(\mathcal{I}); we call them interior equilibria. System (4) also has equilibria on the boundary \partial\mathcal{I} (boundary equilibria), which will be discussed in detail in Section 3.4. It is easy to check that the interior equilibria, if they exist, must lie on the line {(x,r)int():x=1/(θ+1)}\{(x,r)\in\text{int}(\mathcal{I}):x=1/(\theta+1)\}. Substituting such xx into the xx-dynamics in (4), we have

0=\displaystyle 0= x(1x)[xr(c+da+b)\displaystyle x(1-x)[xr(-c+d-a+b)
+x(ab)r(d+b)+b]+μ(12x),\displaystyle+x(a-b)-r(d+b)+b]+\mu(1-2x),

which leads to

r=θa+θ2b+μ(θ3+θ2θ1)θ(a+c+θb+θd).r=\frac{\theta a+\theta^{2}b+\mu(\theta^{3}+\theta^{2}-\theta-1)}{\theta(a+c+\theta b+\theta d)}.

Thus if the following condition holds

(θa+θ2b)<μ(θ3+θ2θ1)<θc+θ2d,-(\theta a+\theta^{2}b)<\mu(\theta^{3}+\theta^{2}-\theta-1)<\theta c+\theta^{2}d, (6)

system (4) has a unique interior equilibrium

(x,r)=(1θ+1,θa+θ2b+μ(θ3+θ2θ1)θ(a+c+θb+θd)).(x^{*},r^{*})=\left(\frac{1}{\theta+1},\frac{\theta a+\theta^{2}b+\mu(\theta^{3}+\theta^{2}-\theta-1)}{\theta(a+c+\theta b+\theta d)}\right). (7)

Otherwise, no interior equilibrium exists. The interior equilibrium supports the coexistence of the two strategies under a non-depleted resource state. In view of (6), one can see that for a given θ\theta, the existence of the interior equilibrium is determined by not only the payoff parameters but also the mutation rate.

Note that when θ=1\theta=1, (x,r)(x^{*},r^{*}) is independent of the parameter μ\mu, and (6) holds trivially for all μ[0,1]\mu\in[0,1]. This specific case has been studied in our previous conference paper [5]. In this paper we do not require θ=1\theta=1 and study the generic θ\theta that takes any positive value satisfying (6).

Towards the goal of establishing conditions for the existence of limit cycles, we first present some preliminary results on the non-existence of closed orbits or limit cycles.

3.1 Non-existence of closed orbits

First, we define the 𝐂1\mathbf{C}^{1} function φ(x,r):int()\varphi(x,r):\text{int}(\mathcal{I})\rightarrow\mathbb{R} by

φ(x,r)=xα(1x)βrγ(1r)δ\varphi(x,r)=x^{\alpha}(1-x)^{\beta}r^{\gamma}(1-r)^{\delta} (8)

with the parameters α\alpha, β\beta, γ\gamma, and δ\delta to be determined later. Note that φ(x,r)\varphi(x,r) is strictly positive in int()\text{int}(\mathcal{I}) for any values of these parameters. Denote the right hand side of (4) by f(x,r)=[f1(x,r),f2(x,r)]f(x,r)=[f_{1}(x,r),f_{2}(x,r)]^{\top}, and then the divergence of the modified vector field φf(x,r)\varphi f(x,r) on int()\text{int}(\mathcal{I}) is given by

divφf(x,r)=(φf1)x(x,r)+(φf2)r(x,r)\displaystyle\text{div}~{}\varphi f(x,r)=\frac{\partial(\varphi f_{1})}{\partial x}(x,r)+\frac{\partial(\varphi f_{2})}{\partial r}(x,r) (9)
=φ(x,r)((α+β+3)(c+da+b)x2r\displaystyle=\varphi(x,r)(-(\alpha+\beta+3)(-c+d-a+b)x^{2}r
(α+β+3)(ab)x2\displaystyle~{}~{}~{}-(\alpha+\beta+3)(a-b)x^{2}
+[(c+2da+2b)α+(d+b)β\displaystyle~{}~{}~{}+[(-c+2d-a+2b)\alpha+(d+b)\beta
(θ+1)(γ+δ+2)+2(c+2da+2b)]xr\displaystyle~{}~{}~{}-(\theta+1)(\gamma+\delta+2)+2(-c+2d-a+2b)]xr
+[(a2b)αbβ+(θ+1)(1+γ)+2a4b]x\displaystyle~{}~{}~{}+[(a-2b)\alpha-b\beta+(\theta+1)(1+\gamma)+2a-4b]x
+[(d+b)α+γ+δ+2bd]r\displaystyle~{}~{}~{}+[-(d+b)\alpha+\gamma+\delta+2-b-d]r
+μ(12x)(ααxβx)x(1x)\displaystyle~{}~{}~{}+\frac{\mu(1-2x)(\alpha-\alpha x-\beta x)}{x(1-x)}
+(1+α)b(1+γ)2μ).\displaystyle~{}~{}~{}+(1+\alpha)b-(1+\gamma)-2\mu).

Let all the coefficients of the powers of xx, rr in (9) (except for the term containing μ\mu) be zero, and we obtain the following set of equations

(α+β+3)(c+da+b)\displaystyle(\alpha+\beta+3)(-c+d-a+b) =0\displaystyle=0 (10)
(α+β+3)(ab)\displaystyle(\alpha+\beta+3)(a-b) =0\displaystyle=0
(c+2da+2b)α+(d+b)β\displaystyle(-c+2d-a+2b)\alpha+(d+b)\beta
(θ+1)(γ+δ+2)+2(c+2da+2b)\displaystyle-(\theta+1)(\gamma+\delta+2)+2(-c+2d-a+2b) =0\displaystyle=0
(a2b)αbβ+(θ+1)(γ+1)+2a4b\displaystyle(a-2b)\alpha-b\beta+(\theta+1)(\gamma+1)+2a-4b =0\displaystyle=0
(d+b)α+γ+δ+2bd\displaystyle-(d+b)\alpha+\gamma+\delta+2-b-d =0.\displaystyle=0.

We first show that the equation set (10) always has solutions.

Lemma 1.

There always exist some α\alpha, β\beta, γ\gamma, and δ\delta as solutions to (10).

{pf}

See Appendix A.

In fact, if one of c+da+b-c+d-a+b and aba-b is not zero, (10) will have exactly one solution

α=ζ+c+aζ,\displaystyle\alpha=-\frac{\zeta+c+a}{\zeta}, (11)
β=2ζ+c+aζ,\displaystyle\beta=\frac{-2\zeta+c+a}{\zeta},
γ=(a+θ+1)ζ+a2+acabbc(θ+1)ζ,\displaystyle\gamma=\frac{-(a+\theta+1)\zeta+a^{2}+ac-ab-bc}{(\theta+1)\zeta},
δ=(a+c)(θb+θd+d+a)+(θ+1a)ζ(θ+1)ζ\displaystyle\delta=-\frac{(a+c)(\theta b+\theta d+d+a)+(\theta+1-a)\zeta}{(\theta+1)\zeta}

with ζ=a+c+θb+θd>0\zeta=a+c+\theta b+\theta d>0. Otherwise, (10) will have infinitely many solutions including (11). So we fix the parameters α\alpha, β\beta, γ\gamma, and δ\delta at the values given in (11) such that the divergence (9) can be greatly simplified.

Then we show the non-existence of closed orbits for system (4).

Lemma 2.

For system (4), the following statements hold:

  1. 1.

    when adbc>0ad-bc>0, there exists some μ0>0\mu_{0}>0 such that there are no closed orbits in \mathcal{I} for μ[μ0,1]\mu\in[\mu_{0},1];

  2. 2.

    when adbc<0ad-bc<0, there are no closed orbits in \mathcal{I} for μ[0,1]\mu\in[0,1];

  3. 3.

    when adbc=0ad-bc=0, there are no closed orbits in \mathcal{I} for μ(0,1]\mu\in(0,1].

{pf}

The proof is direct by applying Bendixson-Dulac criterion [36, Theorem 4.1.2]. Since the boundary \partial\mathcal{I} contains equilibria (as shown in Section 3.4), it is not possible to have periodic orbits on it. And the sides are either invariant or repelling, so they cannot intersect with closed orbits. Now let us consider the interior int()\text{int}(\mathcal{I}) which is simply connected.

In view of the fact that α\alpha, β\beta, γ\gamma, and δ\delta are given as in (11), one can calculate the divergence of φf(x,r)\varphi f(x,r)

divφf(x,r)\displaystyle\text{div}~{}\varphi f(x,r) (12)
=φ(x,r)(μ(12x)(3x+α)x(1x)+(1+α)b\displaystyle=\varphi(x,r)\bigg{(}\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)}+(1+\alpha)b
(1+γ)2μ)\displaystyle\quad\quad\quad\quad\quad-(1+\gamma)-2\mu\bigg{)}
=φ(x,r)(μ(12x)(3x+α)x(1x)2μθ(bcad)(θ+1)ζ).\displaystyle=\varphi(x,r)\left(\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)}-2\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta}\right).

From (11) one knows α(2,1)\alpha\in(-2,-1), and thus the term g(x):=μ(12x)(3x+α)x(1x)g(x):=\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)} takes its maximum value at x¯(0,1)\bar{x}\in(0,1) given by

x¯={αα(3+α)3+2α,1.5<α<1,0.5,α=1.5,α+α(3+α)3+2α,2<α<1.5.\bar{x}=\begin{cases}&\frac{\alpha-\sqrt{-\alpha(3+\alpha)}}{3+2\alpha},~{}-1.5<\alpha<-1,\\ &0.5,~{}\quad\quad\quad\quad\quad\quad~{}~{}~{}\alpha=-1.5,\\ &\frac{\alpha+\sqrt{-\alpha(3+\alpha)}}{3+2\alpha},~{}-2<\alpha<-1.5.\end{cases} (13)

Rearranging (12) yields

divφf(x,r)φ(x,r)\displaystyle\frac{\text{div}~{}\varphi f(x,r)}{\varphi(x,r)} μ(12x¯)(3x¯+α)x¯(1x¯)2μθ(bcad)(θ+1)ζ\displaystyle\leq\frac{\mu(1-2\bar{x})(3\bar{x}+\alpha)}{\bar{x}(1-\bar{x})}-2\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta} (14)
=(12α)x¯+α4x¯2x¯(1x¯)μθ(bcad)(θ+1)ζ.\displaystyle=\frac{(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}}{\bar{x}(1-\bar{x})}\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta}.

It is easy to check that (12α)x¯+α4x¯2x¯(1x¯)\frac{(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}}{\bar{x}(1-\bar{x})} is negative for any value of α(2,1)\alpha\in(-2,-1). Since φ(x,r)\varphi(x,r) is positive in int()\text{int}(\mathcal{I}) by definition, if adbc>0ad-bc>0, from (14) one has divφf(x,r)0\text{div}~{}\varphi f(x,r)\leq 0 when

μμ0:=θ(bcad)x¯(1x¯)(θ+1)ζ[(12α)x¯+α4x¯2].\mu\geq\mu_{0}:=\frac{\theta(bc-ad)\bar{x}(1-\bar{x})}{(\theta+1)\zeta[(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}]}. (15)

And because the divergence is not identically zero in int()\text{int}(\mathcal{I}), from the Bendixson-Dulac criterion, one knows that no closed orbits can exist in int()\text{int}(\mathcal{I}) for μμ0\mu\geq\mu_{0}.

For the second case, it is easy to prove since the right hand side of (14) will always be negative for any μ0\mu\geq 0. For the third case, the right hand side of (14) is always negative when μ>0\mu>0. In Lemma 2, it is noted that the conditions for the non-existence of periodic orbits depend on the relationship between adad and bcbc. The reason is as follows: when adbc<0ad-bc<0, the divergence divφf(x,r)\text{div}~{}\varphi f(x,r) will remain positive for any μ[0,1]\mu\in[0,1] in view of (14), which excludes the possibility of periodic orbits in the phase space; when adbc>0ad-bc>0 or adbc=0ad-bc=0, there exists some μ\mu such that divφf(x,r)\text{div}~{}\varphi f(x,r) equals 0. As a consequence, it is possible for the system to have periodic orbits in these two cases. Another point worth noting is that the only difference between the second and third cases of Lemma 2 is that system (4) may have periodic solutions for μ=0\mu=0 when adbc=0ad-bc=0. This possibility has been shown in [35], [4]. Despite the possibility of closed orbits under certain conditions, for μ=0\mu=0 it can be proven that limit cycles (isolated closed orbits) are impossible to exist in (4) for all the parameter space as summarized below in Lemma 3.

Lemma 3.

System (4) has no limit cycles in \mathcal{I} when μ=0\mu=0.

The full proof is presented in [5]. See Appendix B for a sketch of the proof.

3.2 Hopf bifurcation at the interior equilibrium

In this section we take μ\mu as the bifurcation parameter to study the effect of different mutation rates on the system dynamics. We investigate the possible bifurcations of the dynamics (4) as μ\mu varies in [0,1][0,1]. We analyze the stability of the interior equilibrium and prove that it involves a Hopf bifurcation which leads to periodic orbits.

We first invoke the Hopf bifurcation theorem [6] which will be used to prove the existence of stable limit cycles.

Theorem 4 (Hopf bifurcation theorem).

Suppose that the system y˙=F(y,ρ),y2,ρ\dot{y}=F(y,\rho),y\in\mathbb{R}^{2},\rho\in\mathbb{R} has an equilibrium (y0,ρ0)(y_{0},\rho_{0}) at which the following properties are satisfied:

  1. 1.

    the Jacobian DyF|(y0,ρ0)D_{y}F|_{(y_{0},\rho_{0})} has a simple pair of pure imaginary eigenvalues λ(ρ0)\lambda(\rho_{0}) and λ¯(ρ0)\bar{\lambda}(\rho_{0});

  2. 2.

    d(λ(ρ))dρ|ρ00\frac{d\Re(\lambda(\rho))}{d\rho}|_{\rho_{0}}\neq 0.

Then the dynamics undergo a Hopf bifurcation at (y0,ρ0)(y_{0},\rho_{0}), which results in a family of periodic solutions in a sufficiently small neighborhood of (y0,ρ0)(y_{0},\rho_{0}).

Then we make an assumption about some parameters of system (4).

Assumption 5.

We assume that 0<adbcΔθ2.0<ad-bc\leq\frac{\Delta}{\theta^{2}}.

The following theorem demonstrates that the interior equilibrium (x,r)(x^{*},r^{*}) of system (4) undergoes a Hopf bifurcation.

Theorem 6.

Under Assumption 5, the interior equilibrium (x,r)(x^{*},r^{*}) of system (4) with the bifurcation parameter μ\mu undergoes a supercritical Hopf bifurcation at μ=μ1:=θ2(adbc)Δ\mu=\mu_{1}:=\frac{\theta^{2}(ad-bc)}{\Delta}, which leads to the existence of stable limit cycles for μ<μ1\mu<\mu_{1} in the vicinity of μ1\mu_{1}.

{pf}

To prove the existence of Hopf bifurcation, we need to show that the two conditions of Theorem 4 are satisfied in system (4). The Jacobian of the vector field of (4) at (x,r)(x^{*},r^{*}) is given by

J=[θ2(adbc)μΔθ(θ+1)ζθζ(θ+1)3(θ+1)(cθ+dθ2μθ^)(aθ+bθ2+μθ^)θ2ζ20],J^{*}=\begin{bmatrix}\frac{\theta^{2}(ad-bc)-\mu\Delta}{\theta(\theta+1)\zeta}&\frac{-\theta\zeta}{(\theta+1)^{3}}\\ \frac{(\theta+1)(c\theta+d\theta^{2}-\mu\hat{\theta})(a\theta+b\theta^{2}+\mu\hat{\theta})}{\theta^{2}\zeta^{2}}&0\end{bmatrix}, (16)

where

θ^=(θ3+θ2θ1)\hat{\theta}=(\theta^{3}+\theta^{2}-\theta-1)

and

Δ=(a+c)(1+θ2+2θ3)+(b+d)(2θ+θ2+θ4).\Delta=(a+c)(1+\theta^{2}+2\theta^{3})+(b+d)(2\theta+\theta^{2}+\theta^{4}).

Note that the off-diagonal entries

J12=θζ(θ+1)3<0J_{12}=\frac{-\theta\zeta}{(\theta+1)^{3}}<0

and

J21=(θ+1)(cθ+dθ2μ1θ^)(aθ+bθ2+μ1θ^)θ2ζ2>0J_{21}=\frac{(\theta+1)(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta^{2}\zeta^{2}}>0

because of (6). One can calculate the eigenvalues of JJ^{*}

λ(μ)\displaystyle\lambda(\mu) =J11±J112+4J12J212\displaystyle=\frac{J_{11}\pm\sqrt{J_{11}^{2}+4J_{12}J_{21}}}{2} (17)
=(λ(μ))±(λ(μ))i.\displaystyle=\Re(\lambda(\mu))\pm\Im(\lambda(\mu))i.

It is easy to check that when μ=μ1:=θ2(adbc)Δ\mu=\mu_{1}:=\frac{\theta^{2}(ad-bc)}{\Delta} the eigenvalues are purely imaginary, i.e.,

(λ(μ1))=0,\displaystyle\Re(\lambda(\mu_{1}))=0,
(λ(μ1))=(cθ+dθ2μθ^)(aθ+bθ2+μθ^)θ(θ+1)2ζ,\displaystyle\Im(\lambda(\mu_{1}))=\sqrt{\frac{(c\theta+d\theta^{2}-\mu\hat{\theta})(a\theta+b\theta^{2}+\mu\hat{\theta})}{\theta(\theta+1)^{2}\zeta}},

which implies that the first condition listed in Theorem 4 is satisfied. In addition, the second condition is also satisfied because

d(λ(μ))dμ|μ1=Δθ(θ+1)ζ<0.\left.\frac{d\Re(\lambda(\mu))}{d\mu}\right|_{\mu_{1}}=\frac{-\Delta}{\theta(\theta+1)\zeta}<0.

Then one can conclude that the dynamics (4) undergo a Hopf bifurcation at μ=μ1\mu=\mu_{1}. Since μ\mu varies in the interval (0,1](0,1], to guarantee the occurrence of Hopf bifurcation, the critical parameter value μ1\mu_{1} should also be constrained in this interval, i.e., 0<μ110<\mu_{1}\leq 1, which leads to

0<adbcΔθ2,0<ad-bc\leq\frac{\Delta}{\theta^{2}},

which is exactly Assumption 5. The Hopf bifurcation theorem implies that a family of periodic orbits bifurcate from (x,r)(x^{*},r^{*}) for some μ\mu in the vicinity of μ1\mu_{1}. Whether the bifurcated periodic orbits are stable or not is determined by the so-called first Lyapunov coefficient 1\ell_{1} at (x,r)(x^{*},r^{*}) when μ=μ1\mu=\mu_{1}. If 1<0\ell_{1}<0, then these periodic orbits are stable limit cycles; if 1>0\ell_{1}>0, the periodic orbits are repelling. Following the computation procedure in Appendix C, we obtain 1(μ1)\ell_{1}(\mu_{1}) as below

1(μ1)\displaystyle\ell_{1}(\mu_{1}) =\displaystyle= (18)
θ(bcad)ζ((b+d)(θ3+2θ)+(a+c)(2θ3+1))2ω03(θ+1)4Δ,\displaystyle\frac{\theta(bc-ad)\zeta((b+d)(\theta^{3}+2\theta)+(a+c)(2\theta^{3}+1))}{2\omega_{0}^{3}(\theta+1)^{4}\Delta},

which is negative because of (6) and Assumption 5. Thus, the bifurcated periodic orbits are stable limit cycles.

Furthermore, in view of (17), for μ<μ1\mu<\mu_{1}, (x,r)(x^{*},r^{*}) is unstable since the eigenvalues have positive real parts. On the other hand, the interior equilibrium is asymptotically stable for μ>μ1\mu>\mu_{1} with the negative eigenvalues. Hence, the stable limit cycles exist for μ<μ1\mu<\mu_{1} in the vicinity of μ1\mu_{1}. Since the bifurcation is associated with stable limit cycles, it is a supercritical Hopf bifurcation. The relationship between adad and bcbc has played an important role in Theorem 6 in the sense that Hopf bifurcation can only happen when adbc>0ad-bc>0. In view of the Jacobian (16), when adbc0ad-bc\leq 0, J11J_{11} will remain non-positive no matter what value μ\mu takes. Thus, as μ\mu varies in [0,1][0,1], the stability of the interior equilibrium does not change, which excludes the existence of a Hopf bifurcation. On the other hand, when adbc>0ad-bc>0, to ensure there is a Hopf bifurcation for μ[0,1]\mu\in[0,1], Assumption 5 has to be satisfied as proved.

From the Hopf bifurcation theory, we know that the limit cycles generated from Hopf bifurcation have the amplitude of O(μ1μ)O(\sqrt{\mu_{1}-\mu}). Since Hopf bifurcation is a local bifurcation, the obtained results only hold for μ\mu in the vicinity of μ1\mu_{1}. Next, we will show that system (4) exhibits another type of bifurcation.

3.3 Heteroclinic bifurcation

In the case μ=0\mu=0, it has been shown in Lemma 3 that the system (4) cannot exhibit limit cycles for any parameters. We show in this section that limit cycles may occur for μ>0\mu>0 in the vicinity of 0.

When μ=0\mu=0, system (4) has four equilibria on the boundary \partial\mathcal{I}. Denote the four equilibria respectively by E1=(0,0)E_{1}=(0,0), E2=(1,0)E_{2}=(1,0), E3=(1,1)E_{3}=(1,1), and E4=(0,1)E_{4}=(0,1). Then, one can calculate the Jacobians at these points, which are

J1=[b001],\displaystyle J_{1}=\begin{bmatrix}b&0\\ 0&-1\end{bmatrix}, J2=[a00θ],\displaystyle J_{2}=\begin{bmatrix}-a&0\\ 0&\theta\end{bmatrix}, (19)
J3=[c00θ],\displaystyle J_{3}=\begin{bmatrix}c&0\\ 0&-\theta\end{bmatrix}, J4=[d001].\displaystyle J_{4}=\begin{bmatrix}-d&0\\ 0&1\end{bmatrix}.

It is easy to see that the eigenvalues of these equilibria are indeed the diagonal entries of the associated Jacobian matrices, and thus all of these equilibria are saddle points. Next we note that the trace of each Jacobian, which is denoted by TiT_{i}, is

T1=b1,T2=θa,T3=cθ,T4=1d.T_{1}=b-1,~{}T_{2}=\theta-a,~{}T_{3}=c-\theta,~{}T_{4}=1-d.

A heteroclinic cycle on the boundary denoted by Λ\Lambda, which connects the four equilibria and has the orientation E1E2E3E4E1E_{1}\rightarrow E_{2}\rightarrow E_{3}\rightarrow E_{4}\rightarrow E_{1}, can be identified. Its stability is determined by the payoff parameters.

Lemma 7.

Consider system (4) at μ=0\mu=0. There exists a heteroclinic cycle Λ\Lambda, which is stable (resp. unstable) when the condition ad>bcad>bc (resp. ad<bcad<bc) is satisfied.

We refer to the reference [35] for the complete proof for the case of the stable heteroclinic cycle. And the opposite case can be proved using the analogous method according to [9, Corollary 2].

When μ>0\mu>0, the four corners are no longer equilibria, and the heteroclinic cycle Λ\Lambda is “broken”. Now we are going to show that the system (4) undergoes a heteroclinic bifurcation at μ=0\mu=0 when certain condition holds, and a limit cycle with the same stability of the corresponding heteroclinic cycle may be generated from Λ\Lambda for μ>0\mu>0 sufficiently close to 0.

Definition 8.

The internal σ\sigma-neighborhood of Λ\Lambda is the set of all points in \mathcal{I} that are at a distance less than σ>0\sigma>0 from Λ\Lambda.

Theorem 9.

Consider system (4). If a>θa>\theta, b<1b<1, c<θc<\theta, and d>1d>1, then there exist σ,ϵ>0\sigma,\epsilon>0, such that for 0<μ<ϵ0<\mu<\epsilon, there is at most one limit cycle Γ\Gamma in the internal σ\sigma-neighborhood of Λ\Lambda, and Γ\Gamma (if it exists) is stable.

{pf}

The proof is straightforward by applying the theorem of the heteroclinic bifurcation [16, Theorem 2.3.2, 2.3.3]. When a>θa>\theta, b<1b<1, c<θc<\theta, and d>1d>1 hold, which implies ad>bcad>bc. Then according to Lemma 7, there is a heteroclinic cycle when μ=0\mu=0 and it is stable.

In addition, all the traces TiT_{i} of the Jacobians of the equilibria are negative, while their product i=14Ti\prod_{i=1}^{4}T_{i} is positive. Then according to [16, Theorem 2.3.3], there exist sufficiently small σ,ϵ>0\sigma,\epsilon>0 such that for 0<μ<ϵ0<\mu<\epsilon, system (4) has at most one limit cycle in the internal σ\sigma-neighbourhood of Λ\Lambda, and this limit cycle (if it exists) is stable due to the stability of Λ\Lambda. When ad<bcad<bc, according to the second statement of Lemma 7, the heteroclinic cycle is unstable. It is impossible for limit cycles to be generated from this unstable heteroclinic cycle in view of Lemma 2, which claims that there are no closed orbits in \mathcal{I} for any μ0\mu\geq 0 when adbc<0ad-bc<0.

Compared with the result of Hopf bifurcation, the conclusion of Heteroclinic bifurcation is relatively weaker since Theorem 9 only states that there may exist a unique limit cycle in some neighborhood of Λ\Lambda. To study the exact conditions for the existence of the unique limit cycle is beyond this work.

Another difference between the limit cycles generated from Hopf bifurcation and Heteroclinic bifurcation is that the limit cycle arising from Hopf bifurcation is small in size in the phase space, as demonstrated in the simulation results in Fig. 2, while the other can be large. In these simulations the payoff matrices are given by

[R1S1T1P1]=[3.510.50.8],[R2S2T2P2]=[20.22.51.2].\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}=\begin{bmatrix}3.5&1\\ 0.5&0.8\end{bmatrix},~{}~{}\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}=\begin{bmatrix}2&0.2\\ 2.5&1.2\end{bmatrix}. (20)

The parameter θ\theta is chosen to be 11, and thus the interior equilibrium is fixed and independent of μ\mu. In view of Theorem 6, the Hopf bifurcation point is μ1=0.1543\mu_{1}=0.1543.

Refer to caption
(a) Large limit cycle
Refer to caption
(b) Small limit cycle
Figure 2: Phase portraits of the dynamics in (4) with payoff matrices as shown in (20). The red stars denote the unique interior equilibrium (0.5,0.6809)(0.5,0.6809). The blue and green cycles show the approximated limit cycles when (a) μ=0.005\mu=0.005; (b) μ=0.1540\mu=0.1540.

3.4 Global dynamics

As stated before, the limit cycle analysis for Hopf and Heteroclinic bifurcations are limited to the vicinity of the bifurcation points since the used methods rely mainly on linearization. To study if the limit cycle persists for μ(0,μ1)\mu\in(0,\mu_{1}), it is necessary to analyze the dynamics of (4) for all such μ\mu.

3.4.1 Boundary equilibria and their stability

Recall that the condition (6), which involves μ\mu, guarantees that there is a unique interior equilibrium. For some given payoff parameters and fixed values of θ\theta, there may be some μ\mu such that there is no interior equilibrium. In this case, the system dynamics are relatively trivial since the trajectories will converge to some limit sets on the boundary according to the Poincaré-Bendixson theorem. In the following part, we concentrate on the situation where an interior equilibrium always exists for all μ[0,1]\mu\in[0,1]. Then, we have the following assumption.

Assumption 10.

We assume hereafter that (θa+θ2b)<(θ1)(θ+1)2<θc+θ2d-(\theta a+\theta^{2}b)<(\theta-1)(\theta+1)^{2}<\theta c+\theta^{2}d.

Assumption 10 ensures that there is a unique interior equilibrium point for all μ[0,1]\mu\in[0,1] in system (4). It is noted that when θ=1\theta=1, Assumption 10 is satisfied trivially.

It is easy to check that the boundary :tblr\partial\mathcal{I}:\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup\mathcal{B}_{l}\cup\mathcal{B}_{r} itself and each of its sides are invariant under (4) in the case μ=0\mu=0. When 0<μ10<\mu\leq 1, the sides t\mathcal{B}_{t} and b\mathcal{B}_{b} remain invariant, while l\mathcal{B}_{l} and r\mathcal{B}_{r} do not. As μ\mu varies, all the possible boundary equilibria lie on the sides t\mathcal{B}_{t} and b\mathcal{B}_{b}, but the positions of these equilibria change. Let us try to determine the locations of the equilibria on t\mathcal{B}_{t} and b\mathcal{B}_{b}.

Lemma 11.

There is only one equilibrium on each side b\mathcal{B}_{b} and t\mathcal{B}_{t} in system (4). Moreover, the equilibria are located in the sets {(x,r):x(max{1/2,1/(θ+1)},1),r=0}\{(x,r):x\in(\max\{1/2,1/(\theta+1)\},1),~{}r=0\} and {(x,r):x(0,min{1/2,1/(θ+1)}),r=1}\{(x,r):x\in(0,\min\{1/2,1/(\theta+1)\}),~{}r=1\} respectively.

{pf}

See Appendix D.

One can examine the stability of these boundary equilibria by evaluating the corresponding Jacobian matrices.

Lemma 12.

All the boundary equilibria of system (4) are unstable for μ(0,1]\mu\in(0,1].

{pf}

We only show the case on the side b\mathcal{B}_{b}. For the case of t\mathcal{B}_{t}, one can obtain the same result analogously. Denote the equilibrium on the side b\mathcal{B}_{b} by (xb,0)(x_{b}^{*},0). We obtain its Jacobian matrix as below

Jb=[3xb2(ba)2μ+b+2xb(a2b)0(θ+1)xb1],J_{b}^{*}=\begin{bmatrix}\begin{aligned} &3x_{b}^{*^{2}}(b-a)-2\mu+b\\ &+2x_{b}^{*}(a-2b)\end{aligned}&\star\\ 0&(\theta+1)x_{b}^{*}-1\end{bmatrix}, (21)

where the symbol \star stands for xb3(c+da+b)xb(d+b)+xb2(c+2da+2b)-x_{b}^{*^{3}}(-c+d-a+b)-x_{b}^{*}(d+b)+x_{b}^{*^{2}}(-c+2d-a+2b). As the matrix is upper triangular, the entry (θ+1)xb1(\theta+1)x_{b}^{*}-1 is one of its eigenvalues. According to Lemma 11, one has xb(max{1/2,1/(θ+1)},1)x_{b}^{*}\in(\max\{1/2,1/(\theta+1)\},1). Then the eigenvalue (θ+1)xb1(\theta+1)x_{b}^{*}-1 will be positive for any θ>0\theta>0, which implies that the equilibrium is unstable.

By checking the sign of x˙\dot{x} on the left side l\mathcal{B}_{l} and the right side r\mathcal{B}_{r}, one observes that it is always positive and negative respectively. It can be easily verified that the vectors on these two sides point inwards to int()\mathrm{int}(\mathcal{I}). The sides b\mathcal{B}_{b} and t\mathcal{B}_{t} are positively invariant under system (4), and thus the dynamics on each side of b\mathcal{B}_{b} and t\mathcal{B}_{t} are easy to analyze. For example, on the side b\mathcal{B}_{b}, one has r˙=0\dot{r}=0, x˙>0\dot{x}>0 at x=0x=0 and r˙=0\dot{r}=0, x˙<0\dot{x}<0 at x=1x=1. As there is only one equilibrium (xb,0)(x_{b}^{*},0) on the side b\mathcal{B}_{b}, one can easily verify that x˙>0\dot{x}>0 for x[0,xb)x\in[0,x_{b}^{*}) and x˙<0\dot{x}<0 for x(xb,1]x\in(x_{b}^{*},1]. Thus, trajectories starting from b(xb,0)\mathcal{B}_{b}\setminus(x_{b}^{*},0) will converge to the unique equilibrium (xb,0)(x_{b}^{*},0). The similar result can be obtained for the dynamics on t\mathcal{B}_{t}.

In addition, we can show that the boundary \partial\mathcal{I} is repelling for μ(0,1]\mu\in(0,1].

Lemma 13.

The boundary \partial\mathcal{I} of system (4) is repelling for μ(0,1]\mu\in(0,1].

{pf}

See Appendix E.

With Lemma 13 in hand, we are ready to present some global results of the system dynamics in \mathcal{I}.

Theorem 14.

Under Assumption 5, for μ(μ0,1]\mu\in(\mu_{0},1], the interior equilibrium (x,r)(x^{*},r^{*}) of system (4) is asymptotically stable, and all trajectories starting from (tb)\mathcal{I}\setminus(\mathcal{B}_{t}\cup\mathcal{B}_{b}) will converge to (x,r)(x^{*},r^{*}).

{pf}

Recall the two critical parameter values μ0\mu_{0} and μ1\mu_{1}, which appear respectively in Lemma 2 and Theorem 6. One can check that μ0μ1\mu_{0}\geq\mu_{1} when Assumption 5 holds. Again according to Lemma 2, it is impossible for system (4) to have closed orbits in \mathcal{I} when μ>μ0\mu>\mu_{0}. In addition, the interior equilibrium is locally asymptotically stable in view of (16). Due to the fact that the boundary \partial\mathcal{I} is repelling as shown in Lemma 13, any trajectories starting from int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\} cannot converge to \partial\mathcal{I}. Then according to the Poincaré-Bendixson theorem, because of the non-existence of closed orbits or other equilibria in int()\text{int}(\mathcal{I}), the trajectories with any initial points in int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\} will converge to (x,r)(x^{*},r^{*}). Similarly, trajectories starting from the sides l\mathcal{B}_{l} and r\mathcal{B}_{r} also converge to (x,r)(x^{*},r^{*}). So the interior equilibrium is almost globally stable in system (4), because almost all trajectories in \mathcal{I} (except for those starting from tb\mathcal{B}_{t}\cup\mathcal{B}_{b}) converge to the stable equilibrium (x,r)(x^{*},r^{*}). We apply similar terminology to the stability of the limit cycle when it is stable and almost all trajectories converge to it.

3.4.2 Uniqueness of limit cycles in the case θ=1\theta=1

Assumption 15.

We assume that θ=1\theta=1 henceforth.

Under Assumption 15, the enhancement effect and the degradation effect are balanced. In this case, the system can be transformed to a generalized Liénard system. However, the existence and uniqueness of limit cycles for μ(0,μ1]\mu\in(0,\mu_{1}] in the case θ1\theta\neq 1 is still open.

Lemma 16.

Under Assumption 5, when a+c=b+da+c=b+d, the system dynamics in (4) admit at most one limit cycle in int()\text{int}(\mathcal{I}) for μ(0,μ1)\mu\in(0,\mu_{1}).

{pf}

Under Assumption 15, when a+c=b+da+c=b+d, the interior equilibrium, denoted by q=(1/2,(a+b)/(2(b+d)))q^{*}=(1/2,(a+b)/(2(b+d))), is fixed. And the system dynamics (4) are reduced to be

{x˙=x(1x)[x(dc)r(d+b)+b]+μ(12x)r˙=r(1r)(2x1).\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[x(d-c)-r(d+b)+b]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)(2x-1).\end{cases} (22)

We apply the transformation of the coordinates on (22), xx,r1rx\rightarrow x^{\prime},~{}r\rightarrow 1-r^{\prime}, and obtain the following system

{x˙=x(1x)[x(dc)+r(d+b)d]+μ(12x)r˙=r(1r)(2x1).\begin{cases}&\begin{aligned} \dot{x}^{\prime}=&x^{\prime}(1-x^{\prime})[x^{\prime}(d-c)+r^{\prime}(d+b)-d]\\ &+\mu(1-2x^{\prime})\end{aligned}\\ &\dot{r}^{\prime}=-r^{\prime}(1-r^{\prime})(2x^{\prime}-1).\end{cases}

The interior equilibrium qq^{*} of (22) now becomes q=(1/2,r)q^{\prime*}=(1/2,r^{\prime*}) with r=(a+b)/(2(b+d))r^{\prime*}=(a+b)/(2(b+d)). By using the change of variables xx~+1/2x^{\prime}\rightarrow\tilde{x}+1/2 and rr~+rr^{\prime}\rightarrow\tilde{r}+r^{\prime*}, we can shift the equilibrium qq^{\prime*} to the origin of a new system as below

{x~˙=α(x~)[(d+b)r~+8μ1x~α(x~)2μx~α(x~)]r~˙=2β(r~)x~,\begin{cases}&\begin{aligned} \dot{\tilde{x}}=&\alpha(\tilde{x})\left[(d+b)\tilde{r}+\frac{8\mu_{1}\tilde{x}\alpha(\tilde{x})-2\mu\tilde{x}}{\alpha(\tilde{x})}\right]\end{aligned}\\ &\dot{\tilde{r}}=-2\beta(\tilde{r})\tilde{x},\end{cases} (23)

which is defined in the region ~={(x~,r~):x~(1/2,1/2),r~(r~,1r~)}\tilde{\mathcal{I}}=\{(\tilde{x},\tilde{r}):\tilde{x}\in(-1/2,1/2),\tilde{r}\in(-\tilde{r}^{*},1-\tilde{r}^{*})\} with α(x~)=(1/4x~2)\alpha(\tilde{x})=(1/4-\tilde{x}^{2}) and β(r~)=(r~+r)(1r~r)\beta(\tilde{r})=(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*}). The system is now in the form of the generalized Liénard system [28].

One can observe that the functions α(x~)\alpha(\tilde{x}) and β(r~)\beta(\tilde{r}) are always positive in the domain. Multiplying the vector field of (23) by the positive function 1/(α(x~)β(r~))1/(\alpha(\tilde{x})\beta(\tilde{r})), we obtain

{x~˙=φ(r~)F(x~,r~)r~˙=g(x~),\begin{cases}&\dot{\tilde{x}}=\varphi(\tilde{r})-F(\tilde{x},\tilde{r})\\ &\dot{\tilde{r}}=-g(\tilde{x}),\end{cases} (24)

where φ(r~)=(b+d)r~β(r~)\varphi(\tilde{r})=\frac{(b+d)\tilde{r}}{\beta(\tilde{r})}, F(x~,r~)=2μx~8μ1x~α(x~)α(x~)β(r~)F(\tilde{x},\tilde{r})=\frac{2\mu\tilde{x}-8\mu_{1}\tilde{x}\alpha(\tilde{x})}{\alpha(\tilde{x})\beta(\tilde{r})}, and g(x~)=2x~α(x~)g(\tilde{x})=\frac{2\tilde{x}}{\alpha(\tilde{x})}. The point-wise direction of the vector field of (23) is maintained in (24). Hence the existence and uniqueness of limit cycles for the two systems (23) and (24) are equivalent.

Under Assumption 5, if 0<μ<μ10<\mu<\mu_{1} holds, it can be validated that there always exists a ν=μ1μ4μ1\nu=\sqrt{\frac{\mu_{1}-\mu}{4\mu_{1}}} such that the following conditions are satisfied:

  1. 1.

    G(ν)=G(ν)G(-\nu)=G(\nu) where G(x~):=0x~g(s)𝑑sG(\tilde{x}):=\int_{0}^{\tilde{x}}g(s)ds;

  2. 2.

    x~(ν,0)\forall\tilde{x}\in(-\nu,0), the function r~F(x~,r~)φ(r~)\tilde{r}\mapsto\frac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})} is strictly decreasing both on 0<r~<1r0<\tilde{r}<1-r^{\prime*} and r<r~<0-r^{\prime*}<\tilde{r}<0; x~(0,ν)\forall\tilde{x}\in(0,\nu), the function r~F(x~,r~)φ(r~)\tilde{r}\mapsto\frac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})} is strictly increasing both on 0<r~<1r0<\tilde{r}<1-r^{\prime*} and r<r~<0-r^{\prime*}<\tilde{r}<0;

  3. 3.

    x~(ν,ν)\forall\tilde{x}\in(-\nu,\nu), r~(r,1r)\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*}), one has g(x~)F(x~,r~)0g(\tilde{x})F(\tilde{x},\tilde{r})\leq 0;

  4. 4.

    x~(ν,ν)\forall\tilde{x}\notin(-\nu,\nu), r~(r,1r)\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*}), one has F(x~,r~)0F(\tilde{x},\tilde{r})\geq 0; r~(r,1r)\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*}), the function x~F(x~,r~)\tilde{x}\mapsto F(\tilde{x},\tilde{r}) is increasing both on 1/2<x~<ν-1/2<\tilde{x}<-\nu and ν<x~<1/2\nu<\tilde{x}<1/2.

The detailed expressions of the functions involved in the above conditions can be found in Appendix F. It is not difficult to check these conditions, so the process is omitted here.

Then according to [28, Theorem 1, Corrollary 1], system (24) has at most one limit cycle in ~\tilde{\mathcal{I}}. Therefore, the original system (22) admits at most one limit cycle in int()\text{int}(\mathcal{I}).

It is noted that when a+c=b+da+c=b+d, the coefficient of the cross term xrxr is 0, and thus this cross term disappears in the system equations of (22). So the nonlinearity of this system is reduced. In this case, when system (22) is transformed into the Liénard form, the properties (1)(4)(1)-(4) listed in the proof of Lemma 16 allow us to use the Liénard system theory to prove that there is at most one limit cycle in int()\text{int}(\mathcal{I}) for μ(0,μ1)\mu\in(0,\mu_{1}). Unfortunately, those properties are not satisfied when a+cb+da+c\neq b+d, and thus the existence and the number of limit cycles in this case remain unclear.

Now we are in the position to present the last result of this section about the uniqueness of limit cycle and stability of the interior equilibrium.

Theorem 17.

Under Assumption 5, when a+c=b+da+c=b+d,

  1. 1.

    for μ(0,μ1\mu\in(0,\mu_{1}), system (4) has exactly one limit cycle that is almost globally stable;

  2. 2.

    for μ[μ1,1]\mu\in[\mu_{1},1], the interior equilibrium (x,r)(x^{*},r^{*}) is almost globally stable.

{pf}

Under Assumption 5, when a+c=b+da+c=b+d, according to Lemma 16, the system (4) has at most one limit cycle for 0<μ<μ10<\mu<\mu_{1}. As shown in Theorem 6, the eigenvalues of the Jacobian at (x,r)(x^{*},r^{*}) have positive real parts, implying that it is repulsive. Then all trajectories starting in the small neighborhood of this equilibrium (excluding the equilibrium) will diverge, and the existence of infinite many periodic orbits in the neighborhood is excluded. From Lemma 13, one also knows that the boundary \partial\mathcal{I} is repelling. According to the Poincaré-Bendixson theorem, for any initial states in int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}, its ω\omega-limit set must be a closed orbit. If there are no limit cycles in int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}, then every trajectory starting from int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\} will form a periodic orbit, and an infinite number of periodic orbits will fill int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}. This contradicts the repulsiveness of the interior equilibrium and the boundary [28].

Suppose in int(){(x,r)}\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}, there is some connected region which consists of an infinite number of neutral periodic orbits and is enclosed by two periodic orbits denoted by Γ1\Gamma_{1} and Γ2\Gamma_{2}. Then Γ1\Gamma_{1} and Γ2\Gamma_{2} must be limit cycles, which contradicts Lemma 16 regarding the maximal number of limit cycles. Therefore, one can conclude that there is exactly one stable limit cycle in \mathcal{I}, and it is attractive for (tb(x,r))\mathcal{I}\setminus(\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup(x^{*},r^{*})).

Note that when a+c=b+da+c=b+d and θ=1\theta=1, we have μ0=μ1\mu_{0}=\mu_{1}. Thus, the second claim is easy to verify according to Theorems 6 and 14. Fig. 3 shows the plot of the unique limit cycle of system (4) when μ\mu changes from 0 to 0.40.4. The payoff matrices used in the simulations are shown below, and they satisfy the condition a+c=b+da+c=b+d.

[R1S1T1P1]=[3.520.51],[R2S2T2P2]=[20.233.2].\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}=\begin{bmatrix}3.5&2\\ 0.5&1\end{bmatrix},~{}~{}\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}=\begin{bmatrix}2&0.2\\ 3&3.2\end{bmatrix}. (25)

The Hopf bifurcation point is computed to be μ1=0.1633\mu_{1}=0.1633. It can be observed that the limit cycle “collides” with the heteroclinic cycle lying on the boundary and the interior equilibrium as μ\mu approaches 0 and 0.16330.1633 respectively. In addition, it disappears after μ\mu passes 0.16330.1633.

Refer to caption
Figure 3: The plot of a unique limit cycle of dynamics (4) with the given payoff matrices in (25). The xx-axis is the mutation rate μ\mu, the blue and red curves are stable and unstable equilibria respectively, and the magenta curves show the shape of stable limit cycles.

4 Incentive-based Control

In real biological and social systems, the mutation or exploration rate is usually small. For example, an extremely high mutation rate for a biological entity may reach 1/4001/400 per site as reported in [3]. As known from Section 3, although the Hopf bifurcation point μ1\mu_{1} depends on the payoff parameters, limit cycles exist in system (4) for μ<μ1\mu<\mu_{1}. It means that in many situations stable limit cycles persist in the co-evolutionary dynamics of the game and environment. On the one hand, the shapes and positions of limit cycles in nonlinear systems are generally difficult to identify analytically; on the other hand, such limit cycle oscillations, although different than the (oscillating) tragedy of the commons, are not desirable from a control point of view. In addition, considering the public resource management, the issues of how to best govern common-pool resources have long been a common concern in the field of economics [20]. Thus, in this section we intend to study how to design suitable control policies such that the following two objectives are attained:

  1. (a).

    alleviate or even eliminate the oscillation;

  2. (b).

    increase the environmental resource stock.

One needs to choose what the control input should be. Although multiple possible approaches have been proposed in the evolutionary games field, perhaps the most plausible control input is to offer suitable incentives to be added to the payoff of the desired strategies [26]. Now we consider that in each game interaction, the cooperator receives certain constant incentive u0u\geq 0 from an external regulating authority.

Recall that the entries on the first row of A(r)A(r) are the payoffs to a C player. The external incentive will be added to the first row of (1). Thus, the control input uu is incorporated into the previously defined payoff matrix (1) as follows

Au(r)=(1r)[R1S1T1P1]+r[R2S2T2P2]+[uu00]\displaystyle A_{u}(r)=(1-r)\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}+r\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}+\begin{bmatrix}u&u\\ 0&0\end{bmatrix} (26)
=[r(R2R1)+R1+ur(S2S1)+S1+ur(T2T1)+T1r(P2P1)+P1].\displaystyle=\begin{bmatrix}&r(R_{2}-R_{1})+R_{1}+u&r(S_{2}-S_{1})+S_{1}+u\\ &r(T_{2}-T_{1})+T_{1}&r(P_{2}-P_{1})+P_{1}\end{bmatrix}.

Substituting (26) into equations (4) results in the following system with the control input uu

{x˙=x(1x)[xr(c+da+b)+x(ab)r(d+b)+b+u]+μ(12x)r˙=r(1r)(2x1).\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[xr(-c+d-a+b)+x(a-b)\\ &-r(d+b)+b+u]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)(2x-1).\end{cases} (27)

To emphasize the effect of the control input uu on the system behaviors, we have assumed that the enhancement effect and the degradation effect are balanced by fixing the parameter θ\theta to be 11 (under Assumption 15). In the following part we will study whether such a constant incentive can achieve the desired control objectives.

4.1 Stable equilibrium

System (27) has a unique interior equilibrium which is given by (xc,rc)=(1/2,(a+b+2u)/(a+c+b+d))(x_{c}^{*},r_{c}^{*})=(1/2,(a+b+2u)/(a+c+b+d)) when 0u<(c+d)/20\leq u<(c+d)/2. It is noted that the interior equilibrium is shifted up vertically, i.e., the xx-coordinate of this equilibrium is the same as the equilibrium of the system without control, while the value of the rr-coordinate increases. Then we evaluate the Jacobian at (xc,rc)(x_{c}^{*},r_{c}^{*}), which is given by

Jc=[adbc+(b+dac)u2(a+b+c+d)2μ(a+b+c+d)82(a+b+2u)(c+d2u)(a+b+c+d)20].J_{c}^{*}=\begin{bmatrix}\frac{ad-bc+(b+d-a-c)u}{2(a+b+c+d)}-2\mu&\frac{-(a+b+c+d)}{8}\\ \frac{2(a+b+2u)(c+d-2u)}{(a+b+c+d)^{2}}&0\end{bmatrix}. (28)

For this matrix, it is easy to identify that the real parts of its eigenvalues are of the same sign, which is the same as the first entry.

When u(c+d)/2u\geq(c+d)/2, no equilibria exist in the interior of \mathcal{I}, and all the equilibria are located on the sides t\mathcal{B}_{t} and b\mathcal{B}_{b}. For system (27), if u0u\geq 0, it is easy to identify that the xx-coordinate of the equilibrium on b\mathcal{B}_{b} is in the range x(1/2,1)x\in(1/2,1) by using a similar approach in Lemma 11. Then the same argument of Lemma 12 is applicable, so one can ensure that this equilibrium is unstable for int()\text{int}(\mathcal{I}). Similarly, on the side t\mathcal{B}_{t}, when 0<u<(c+d)/20<u<(c+d)/2, the equilibria are located in the range x(0,1/2)x\in(0,1/2), and thus they are unstable. However, when u>(c+d)/2u>(c+d)/2, the situation will be different, and we have the following statement for the equilibria.

Lemma 18.

Consider system (27). When u>(c+d)/2u>(c+d)/2, on the side t\mathcal{B}_{t}, there exists one equilibrium, denoted by (xt,1)(x_{t}^{*},1), satisfying xt(1/2,1)x_{t}^{*}\in(1/2,1). And the other possible equilibria are all located in the set {(x,r):x(0,1/2),r=1}\{(x,r):x\in(0,1/2),~{}r=1\}.

{pf}

See Appendix G. It is easy to check the stability of these equilibria by evaluating the Jacobian.

Lemma 19.

When u>(c+d)/2u>(c+d)/2, the boundary equilibrium (xt,1)(x_{t}^{*},1) is locally asymptotically stable under system (27), while the other equilibria (if they exist) are unstable.

{pf}

One can check the local stability of (xt,1)(x_{t}^{*},1) by examining the Jacobian which is given by

Jt=[3(cd)xt2+2(2dcu)xtd+u2μ012xt],J_{t}^{*}=\begin{bmatrix}\begin{aligned} &3(c-d){x_{t}^{*}}^{2}+2(2d-c-u)x_{t}^{*}\\ &-d+u-2\mu\end{aligned}&\bullet\\ 0&1-2x_{t}^{*}\end{bmatrix}, (29)

where the symbol \bullet stands for xt3(c+da+b)xt(d+b)+xt2(c+2da+2b)-x_{t}^{*^{3}}(-c+d-a+b)-x_{t}^{*}(d+b)+x_{t}^{*^{2}}(-c+2d-a+2b). Note that the eigenvalues of JtJ_{t}^{*} are actually the two diagonal entries. According to Lemma 18, obviously the eigenvalue λ1=12xt\lambda_{1}=1-2x_{t}^{*} is negative.

For the other eigenvalue λ2=3(cd)xt2+2(2dcu)xtd+u2μ\lambda_{2}=3(c-d){x_{t}^{*}}^{2}+2(2d-c-u)x_{t}^{*}-d+u-2\mu, one can prove it is negative in all the three cases regarding the relationship of cc and dd. To illustrate, we only show the case of c>dc>d here, since the other cases are similar. Suppose c=d+ϵc=d+\epsilon with ϵ>0\epsilon>0. From Appendix G, we have 1/2<xt<(ud)/ϵ1/2<x_{t}^{*}<(u-d)/\epsilon, then one can check that

λ2\displaystyle\lambda_{2} =(3xt22xt)ϵ+(ud)(12xt)2μ\displaystyle=(3{x_{t}^{*}}^{2}-2x_{t}^{*})\epsilon+(u-d)(1-2x_{t}^{*})-2\mu
<(3xt22xt)ϵ+ϵxt(12xt)2μ\displaystyle<(3{x_{t}^{*}}^{2}-2x_{t}^{*})\epsilon+\epsilon x_{t}^{*}(1-2x_{t}^{*})-2\mu
=(xt2xt)ϵ2μ\displaystyle=({x_{t}^{*}}^{2}-x_{t}^{*})\epsilon-2\mu
<2μ.\displaystyle<-2\mu.

Thus, the eigenvalue λ2\lambda_{2} is negative. The fact of two negative eigenvalues ensures that the equilibrium (xt,1)(x_{t}^{*},1) is locally asymptotically stable. The instability of other possible equilibria is easy to check since there is always a positive eigenvalue in view of (29).

Then, we define a Dulac function which is similar to that in Section 3.1

φc(x,r)=xα(1x)βrγ0.5u(1r)δ+0.5u,\varphi_{c}(x,r)=x^{\alpha}(1-x)^{\beta}r^{\gamma-0.5u}(1-r)^{\delta+0.5u}, (30)

where the parameters α\alpha, β\beta, γ\gamma, and δ\delta are the same as in (11) with θ=1\theta=1. Denote the vector field of (27) by fc(x,r)f_{c}(x,r). After the multiplication of φc(x,r)\varphi_{c}(x,r), we compute the divergence of the modified vector field φcfc(x,r)\varphi_{c}f_{c}(x,r) in int()\text{int}(\mathcal{I}), which yields

divφcfc(x,r)\displaystyle\text{div}~{}\varphi_{c}f_{c}(x,r) =φc(x,r)[g(x)μ2μ+(1.5+α)u\displaystyle=\varphi_{c}(x,r)[g(x)\mu-2\mu+(1.5+\alpha)u (31)
(bcad)2(a+b+c+d)],\displaystyle~{}~{}~{}-\frac{(bc-ad)}{2(a+b+c+d)}],

with g(x)g(x) being the same as in (12), which takes the maximum value at x¯\bar{x} as shown in (13).

Next we discuss in two cases and show that there always exists some uu such that system (27) has a stable equilibrium for the whole interior.

Theorem 20.

For system (27), when a+c>b+da+c>b+d, there exists some u1:=adbc(4g(x¯))μ(a+b+c+d)a+cbdu_{1}:=\frac{ad-bc-(4-g(\bar{x}))\mu(a+b+c+d)}{a+c-b-d} such that

  1. 1.

    if u1<(c+d)/2u_{1}<(c+d)/2, the interior equilibrium (xc,rc)(x_{c}^{*},r_{c}^{*}) is almost globally stable under u(u1,(c+d)/2)u\in(u_{1},(c+d)/2);

  2. 2.

    if u1>(c+d)/2u_{1}>(c+d)/2, the boundary equilibrium (xt,1)(x_{t}^{*},1) is almost globally stable under u>(c+d)/2u>(c+d)/2.

{pf}

When a+c>b+da+c>b+d, we have α=(1+a+ca+b+c+d)(2,1.5)\alpha=-(1+\frac{a+c}{a+b+c+d})\in(-2,-1.5), so the coefficient of the term with uu in (31), i.e., 1.5+α1.5+\alpha, is negative. Thus, if

u>u1:=adbc(4g(x¯))μ(a+b+c+d)a+cbd,u>u_{1}:=\frac{ad-bc-(4-g(\bar{x}))\mu(a+b+c+d)}{a+c-b-d}, (32)

the right hand side of (31) is always negative, which implies that there are no closed orbits in \mathcal{I}.

By checking the Jacobian (28), one can see that the interior equilibrium (xc,rc)(x_{c}^{*},r_{c}^{*}) is stable when

u>u2:=adbc4μ(a+b+c+d)a+cbd.u>u_{2}:=\frac{ad-bc-4\mu(a+b+c+d)}{a+c-b-d}.

As g(x¯)0g(\bar{x})\geq 0, we have u1u2u_{1}\geq u_{2}. Then if u1<(c+d)/2u_{1}<(c+d)/2, one can choose some u(u1,(c+d)/2)u\in(u_{1},(c+d)/2) such that the interior equilibrium (xc,rc)(x_{c}^{*},r_{c}^{*}) exists and is asymptotically stable. Due to the non-existence of closed orbits in \mathcal{I} and other stable equilibria, and also because of the repulsiveness of \partial\mathcal{I}, one can conclude that almost all trajectories will converge to (xc,rc)(x_{c}^{*},r_{c}^{*}) asymptotically.

On the other hand, if u1(c+d)/2u_{1}\geq(c+d)/2, one can choose some u>(c+d)/2u>(c+d)/2 which leads to the occurrence of the equilibrium (xt,1)(x_{t}^{*},1). In view of (29), the equilibrium (xt,1)(x_{t}^{*},1) is locally asymptotically stable. The other equilibria on t\mathcal{B}_{t} and b\mathcal{B}_{b} are all unstable, and they are located on the sets whose sufficient close neighborhoods in int()\text{int}(\mathcal{I}) are repelling. Hence, it is impossible for trajectories starting from int()\text{int}(\mathcal{I}) to converge to these points. As a consequence, (xt,1)(x_{t}^{*},1) is the only ω\omega-limit set for all initial states in int()\text{int}(\mathcal{I}) from the Poincaré-Bendixson theorem. Similarly, this result extends to the sides l\mathcal{B}_{l} and r\mathcal{B}_{r} by following the preceding analysis.

Theorem 21.

For system (27), when a+cb+da+c\leq b+d, the boundary equilibrium (xt,1)(x_{t}^{*},1) is almost globally stable under u>(c+d)/2u>(c+d)/2.

{pf}

When a+cb+da+c\leq b+d, the first entry of (28) will always be positive for any uu if adbc2(a+b+c+d)2μ>0\frac{ad-bc}{2(a+b+c+d)}-2\mu>0, in which case the equilibrium will always be unstable (saddle). Thus, it is impossible to stabilize the interior equilibrium with any positive uu if it is unstable in the uncontrolled system. However, one can choose some u>(c+d)/2u>(c+d)/2 such that there is an equilibrium (xt,1)(x_{t}^{*},1) on t\mathcal{B}_{t} which can be proved to be almost globally stable by using the same argument in the second case of Theorem 20.

In Theorems 20 and 21, the relationship between a+ca+c and b+db+d determines whether the interior equilibrium can be stabilized or not under the positive control input uu. In view of (28), one can observe that the stability of the interior equilibrium can be changed from unstable to stable when a+c>b+da+c>b+d under a suitable uu, while the stability is not qualitatively affected by uu when a+cb+da+c\leq b+d.

When there exists an equilibrium that is globally stable for \mathcal{I}, obviously the limit cycle oscillation is eliminated. No matter whether this stable equilibrium is in the interior or on the top side. The second designed goal is also achieved, since the rr-coordinates of the equilibria are higher. One can find some illustrations in Fig. 4.

Refer to caption
(a) stable int. eq.
Refer to caption
(b) stable eq. on t\mathcal{B}_{t}
Refer to caption
(c) unstable int. eq.
Refer to caption
(d) stable eq. on t\mathcal{B}_{t}
Figure 4: Phase portraits of dynamics (27) when a+cb+da+c\neq b+d. (a) and (b) respectively show the trajectories converge to a stable equilibrium in the interior and on the boundary with different control inputs when a+c>b+da+c>b+d with a=4a=4, b=1b=1, c=3c=3, d=3d=3, and μ=0.05\mu=0.05. The control inputs in (a)(a) and (b)(b) are 2.82.8 and 3.53.5 respectively. (c) and (d) are for the case a+c<b+da+c<b+d where a=2a=2, b=3b=3, c=1c=1, d=4d=4, and μ=0.15\mu=0.15. In (c) the interior equilibrium cannot be stabilized with small control input with u=1.8u=1.8 , but in (d) one equilibrium on the boundary is stable under a large control input with u=2.6u=2.6.

4.2 Reduced amplitude of the limit cycle

For the case of a+c=b+da+c=b+d, although it is impossible to stabilize the interior equilibrium with a suitable uu, one can observe that the amplitude of the oscillation on rr-coordinate decreases under some small uu.

As a+c=b+da+c=b+d, it is noted that the control input does not affect the stability of equilibrium (xc,rc)(x_{c}^{*},r_{c}^{*}) according to the Jacobian (28). However, one can transform system (27) into a form of Liénard system as (24) in the same manner. Since the listed conditions are satisfied straightforwardly, according to Lemma (16), for μ<(0,μ1)\mu<(0,\mu_{1}), system (27) has a unique limit cycle when u[0,(c+d)/2)u\in[0,(c+d)/2). When there is some intermediate control input u(0,(c+d)/2)u\in(0,(c+d)/2), the corresponding interior equilibrium is shifted towards the top side t\mathcal{B}_{t}. To some extent, the vector field between the interior equilibrium and the top side is “compressed”. The position of the limit cycle is shifted up, and the amplitude of the cyclic oscillation on rr-coordinate is reduced at the same time. In this sense, the oscillation is alleviated. Some numerical results can be found in Fig. 5.

Refer to caption
(a) no control input
Refer to caption
(b) intermediate control
Refer to caption
(c) stable eq. on t\mathcal{B}_{t}
Refer to caption
(d) decreasing amplitude
Figure 5: Reduced amplitudes of oscillation of variable rr when a+c=b+da+c=b+d with a=3a=3, b=1b=1, c=1c=1, d=3d=3, and μ=0.05\mu=0.05. The first three plots show the phase portraits with different values of uu: (a) u=0u=0; (b) u=1.6u=1.6; (c) u=2.1u=2.1. (d) shows that the amplitude of rr of the limit cycle (approximated by the numerical integration) decreases as uu increases.

5 Conclusions and Future Work

We have investigated the co-evolutionary dynamics of the game and environment when strategies’ mutations are taken into account. We used the replicator-mutator model to depict the strategic evolution. By using bifurcation theory we showed that the system at the interior equilibrium undergoes a Hopf bifurcation which generates stable limit cycles; on the other hand, we showed there is a heteroclinic bifurcation which may also generate a stable limit cycle. Our analysis highlights the important role that mutations play in the integrated system dynamics. For the co-evolutionary system, the stable limit cycle corresponds to a sustained oscillation of population’s decisions and richness of the environmental resource. Compared with the neutral periodic oscillation or heteroclinic cycle oscillation that have been exhibited in previous studies using the replicator model, such limit cycle oscillations can better explain the phenomena that appear in real biological or social systems. Moreover, we proved that under certain parameter conditions, the system always admits a unique stable limit cycle which is attractive for almost the entire domain.

Such a robust limit cycle oscillation may provide implications and theoretical support for relevant studies in biology and sociology. For example, in the microbial context, some cooperating bacteria can produce costly public good, while the defecting bacteria only profit. Thus, the richness of the public good is enhanced by the cooperating bacteria and degraded by the defectors. In the process of bacterial reproduction, mutations are common. The amount of mutant offsprings can have important impact on the dynamic outcomes of the biological systems. For such microbial systems, [2] shows that the dynamics can converge to an interior equilibrium, while [35] shows that persistent heteroclinic oscillations can exist. However, the results obtained in this work theoretically reveal that a moderate limit cycle oscillation can exist in addition to the convergence to a steady state or a heteroclinic cycle. Thus, this work complements the previous studies.

We also considered the incentive-based control applied into the co-evolutionary dynamics, and showed the effectiveness of such control on maintaining the environmental resource. Namely, compared with the uncontrolled system, the incentive will increase the level of environmental resource in most cases. In particular, one can apply some intermediate incentive to let the system have an interior equilibrium that is stable for the whole interior of domain. If the interior equilibrium cannot be stabilized, some large incentive could be put into use to make the system dynamics converge to a boundary equilibrium with full environmental resource.

The obtained results in the control part of this work can provide valuable insights into many practical issues in nature, such as the common-pool resource management, environmental and sustainable development, and ecological diversity conservation. For example, the global climate change is one of the biggest challenges of our times. The strategic decisions of individuals, corporations, and governments have long-term environmental consequence that will, in turn, alter the strategic landscape those parties face [32]. To mitigate the global warming, all concerned parties are called upon to reduce greenhouse gas emissions. In such a socio-ecological system, the reduction of greenhouse gas emissions can be taken as the environmental factor of interest. Policy-makers can design and implement economic incentives aiming at adapting individual decisions to collectively agreed goals [31]. Hence, the incentive-based policy proposed in this work shows the power of incentive-based control policy and leads to some directions in mitigating the global warming issue.

In this paper, we have used a more general and realistic model, replicator-mutator dynamics, to describe the strategic evolution, but the environment dynamics is still described by the simplified logistic model. Thus, it is of great interest to study the game dynamics under different types of feedback of the practical environmental resource, such as renewing or decaying resource. The replicator or replicator-mutator equations describe the evolutionary game dynamics in an infinite well-mixed population. Due to this shortcoming, they are not suitable to study game dynamics in finite or structured populations. Therefore, another direction of future studies is to develop more appropriate individual-based models to study the game dynamics under the environmental feedback. Last but not least, in addition to the incentive-based control, there could exist other kinds of control policies that can be more effective in various specific situations.

References

  • [1] F. Blanchini and S. Miani. Set-theoretic methods in control. Springer, New York, 2008.
  • [2] S.P. Brown and F. Taddei. The durability of public goods changes the dynamics and nature of social dilemmas. PLoS One, 2(7):e593, 2007.
  • [3] S. Gago, S.F. Elena, R. Flores, and R. Sanjuán. Extremely high mutation rate of a hammerhead viroid. Science, 323(5919):1308–1308, 2009.
  • [4] L. Gong, J. Gao, and M. Cao. Evolutionary game dynamics for two interacting populations in a co-evolving environment. Proceedings of the 57th IEEE Conference on Decision and Control, pages 3535–3540, 2018.
  • [5] L. Gong, W. Yao, J. Gao, and M. Cao. Limit cycles in replicator-mutator dynamics with game-environment feedback. IFAC-PapersOnLine, 2020.
  • [6] J. Guckenheimer and P. Holmes. Nonlinear oscillations, dynamical systems and bifurcations of vector fields. Springer, New York, 2000.
  • [7] C. Hauert, C. Saade, and A. McAvoy. Asymmetric evolutionary games with environmental feedback. Journal of Theoretical Biology, 462:347–360, 2019.
  • [8] M.W. Hirsch, S. Smale, and R.L. Devaney. Differential Equations, Dynamical Systems & An Introduction to Chaos. Elsevier, Amsterdam, 2004.
  • [9] J. Hofbauer. Heteroclinic cycles in ecological differential equations. In Equadiff 8, pages 105–116. Mathematical Institute, Slovak Academy of Sciences, 1994.
  • [10] J. Hofbauer and K. Sigmund. Evolutionary Games and Population Dynamics. Cambridge University Press, Cambridge, 1998.
  • [11] Y. Kawano, L. Gong, B.D.O. Anderson, and M. Cao. Evolutionary dynamics of two communities under environmental feedback. IEEE Control Systems Letters, 3(2):254–259, 2019.
  • [12] N.L. Komarova. Replicator-mutator equation, universality property and population dynamics of learning. Journal of Theoretical Biology, 230:227–239, 2004.
  • [13] Y.A. Kuznetsov. Elements of Applied Bifurcation Theory. Springer, New York, 2004.
  • [14] J. Lee, Y. Iwasa, U. Dieckmann, and K. Sigmund. Social evolution leads to persistent corruption. Proceedings of the National Academy of Sciences of USA, 116(27):13276–13281, 2019.
  • [15] Y. Lin and J.S. Weitz. Spatial interactions and oscillatory tragedies of the commons. Phys. Rev. Lett., 122:148102, 2019.
  • [16] D. Luo, X. Wang, D. Zhu, and M. Han. Bifurcation Theory and Methods of Dynamical Systems. World Scientific, 1997.
  • [17] T. Morimoto, T. Kanazawa, and T. Ushio. Subsidy-based control of heterogeneous multiagent systems modeled by replicator dynamics. IEEE Transactions on Automatic Control, 61(10):3158–3163, 2016.
  • [18] D. Muratore and J. S. Weitz. Infect while the iron is scare: Nutrient explicit phage-bateria games. Theoretical Ecology, 14:467–487, 2021.
  • [19] M.A. Nowak, N.L. Komarova, and P. Niyogi. Evolution of universal grammar. Science, 291:114–118, 2001.
  • [20] E. Ostrom, Cambridge University Press, R. Calvert, and T. Eggertsson. Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press, Cambridge, 2015.
  • [21] K. Paarporn, C. Eksin, J. S. Weitz, and Y. Wardi. Optimal control policies for evolutionary dynamics with environmental feedback. In 2018 IEEE Conference on Decision and Control (CDC), pages 1905–1910, 2018.
  • [22] D. Pais, C.H. Caicedo-Nunez, and N.E. Leonard. Hopf bifurcation and limit cycles in evolutionary network dynamics. SIAM Journal on Applied Dynamical Systems, 11(4):1754–1884, 2013.
  • [23] D. Pais and N. E. Leonard. Limit cycles in replicator-mutator network dynamics. In 2011 50th IEEE Conference on Decision and Control and European Control Conference, pages 3922–3927, 2011.
  • [24] D.G. Rand, D. Tomlin, A. Bear, E.A. Ludvig, and J.D. Cohen. Cyclical population dynamics of automatic versus controlled processing: An evolutionary pendulum. Psychological review, 124(5):626, 2017.
  • [25] J. Riehl and M. Cao. Towards optimal control of evolutionary games on networks. IEEE Transactions on Automatic Control, 62(1):458–462, 2017.
  • [26] J. Riehl, P. Ramazi, and M. Cao. A survey on the analysis and control of evolutionary matrix games. Annual Reviews in Control, 45:87 – 106, 2018.
  • [27] J. Riehl, P. Ramazi, and M. Cao. Incentive-based control of asynchronous best-response dynamics on binary decision networks. IEEE Transactions on Control of Network Systems, 6(2):727–736, 2019.
  • [28] M. Sabatini and G. Villari. Limit cycle uniqueness for a class of planar dynamical systems. Applied Mathematics Letters, 19:1180–1184, 2006.
  • [29] M. F. Singer. Liouvillian first integrals of differential equations. Transactions of the American Mathematical Society, 333:673–688, 1992.
  • [30] L. Stella and D. Bauso. Bio-inspired evolutionary game dynamics in symmetric and asymmetric models. IEEE Control Systems Letters, 2(3):405–410, 2018.
  • [31] R. Sullivan. Corporate responses to climate change: Achieving emissions reductions through regulation, self-regulation and economic incentives. Routledge, 2017.
  • [32] A.R. Tilman, J. Plotkin, and E. Akcay. Evolutionary games with environmental feedbacks. Nature Communications, 11:915, 2020.
  • [33] A. Traulsen, C. Hauert, H. De Silva, M.A. Nowak, and K. Sigmund. Exploration dynamics in evolutionary games. Proceedings of the National Academy of Sciences, 106(3):709–712, 2009.
  • [34] X. Wang, Z. Zheng, and F. Fu. Steering eco-evolutionary game dynamics with manifold control. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2233):20190643, 2020.
  • [35] J.S. Weitz, C. Eksin, K. Paarporn, S.P. Brown, and W.C. Ratcliff. An oscillating tragedy of the commons in replicator dynamics with game-environment feedback. Proceedings of the National Academy of Sciences of USA, 113(47):E7518–E7525, 2016.
  • [36] S. Wiggins. Introduction to Applied Nonlinear Dynamical Systems and Chaos. Springer, New York, 2000.
  • [37] B. Zhu, X. Xia, and Z. Wu. Evolutionary game theoretic demand-side management and control for a class of networked smart grid. Automatica, 70:94–100, 2016.

Appendix A Proof of Lemma 1

{pf}

The equations (10) can be taken as a set of linear equations with respect to unknown variables α\alpha, β\beta, γ\gamma, and δ\delta. It suffices to show that (10) always has solutions. This can be done by checking the ranks of the coefficient matrix CC and augmented matrix [C|D][C|D], which are given by

[(c+da+b)(c+da+b)00(ab)(ab)00(c+2da+2b)(d+b)(θ+1)(θ+1)(a2b)b(θ+1)0(d+b)011],\begin{bmatrix}&\begin{aligned} &(-c+d\\ &~{}-a+b)\end{aligned}&\begin{aligned} &(-c+d\\ &-a+b)\end{aligned}&0&0\\ &(a-b)&(a-b)&0&0\\ &\begin{aligned} &(-c+2d\\ &-a+2b)\end{aligned}&(d+b)&-(\theta+1)&-(\theta+1)\\ &(a-2b)&-b&(\theta+1)&0\\ &-(d+b)&0&1&1\\ \end{bmatrix}, (33)
[(c+da+b)(c+da+b)003(cd+ab)(ab)(ab)003(ba)(c+2da+2b)(d+b)(θ+1)(θ+1)ϖ(a2b)b(θ+1)0(4b2aθ1)(d+b)011b+d2],\begin{bmatrix}&\begin{aligned} &(-c+d\\ &~{}-a+b)\end{aligned}&\begin{aligned} &(-c+d\\ &-a+b)\end{aligned}&0&0&\begin{aligned} &3(c-d\\ &+a-b)\end{aligned}\\ &(a-b)&(a-b)&0&0&3(b-a)\\ &\begin{aligned} &(-c+2d\\ &-a+2b)\end{aligned}&(d+b)&-(\theta+1)&-(\theta+1)&\varpi\\ &(a-2b)&-b&(\theta+1)&0&\begin{aligned} &(4b-2a\\ &-\theta-1)\end{aligned}\\ &-(d+b)&0&1&1&b+d-2\\ \end{bmatrix}, (34)

where ϖ=2(c2d+a2b+θ+1)\varpi=2(c-2d+a-2b+\theta+1).

One can easily check that rankCrank[C|D]4\text{rank}~{}C\equiv\text{rank}~{}[C|D]\leq 4, which implies that (10) has at least one set of solutions. Actually when c+da+b=0-c+d-a+b=0 and ab=0a-b=0, (10) will have infinite many solutions; otherwise, it will have exactly one solution.

Appendix B Proof of Lemma 3

{pf}

For the boundary of \mathcal{I}, the same argument presented in the proof of Lemma 2 is also applicable to exclude the existence of periodic orbits that lie on or intersect with the boundary. Thus we only need to focus on the case of the interior int()\text{int}(\mathcal{I}).

When μ=0\mu=0, the divergence of φf(x,r)\varphi f(x,r) is

(φf1)x(x,r)+(φf2)r(x,r)=φ(x,r)[(1+α)b1γ].\frac{\partial(\varphi f_{1})}{\partial x}(x,r)+\frac{\partial(\varphi f_{2})}{\partial r}(x,r)=\varphi(x,r)[(1+\alpha)b-1-\gamma]. (35)

If there exist periodic orbits in int()\text{int}(\mathcal{I}), the Bendixson-Dulac criterion would imply that either φ(x,r)[(1+α)b1γ]\varphi(x,r)[(1+\alpha)b-1-\gamma] is identically zero or it changes sign. Because φ(x,r)\varphi(x,r) is strictly positive, we have (1+α)b1γ=0(1+\alpha)b-1-\gamma=0. Then (35) turns out to be

(φf1)x(x,r)=(φf2)r(x,r).\frac{\partial(\varphi f_{1})}{\partial x}(x,r)=-\frac{\partial(\varphi f_{2})}{\partial r}(x,r). (36)

Thus system (4) has a first integral [[29]] in int()\text{int}(\mathcal{I})

V(x,r)=φf2𝑑xφf1dr,V(x,r)=\int\varphi f_{2}dx-\varphi f_{1}dr, (37)

whose derivative over time tt satisfies

dVdt=Vxx˙+Vrr˙=φf1f2φf1f20,\frac{dV}{dt}=\frac{\partial V}{\partial x}\dot{x}+\frac{\partial V}{\partial r}\dot{r}=\varphi f_{1}f_{2}-\varphi f_{1}f_{2}\equiv 0,

and hence V(x,r)V(x,r) remains constant along the solutions of (4).

By definition, a limit cycle is an isolated periodic orbit, which is the α\alpha-limit set or ω\omega-limit set for some initial points that do not lie on the orbit [[8]]. Suppose there is a limit cycle Γ0\Gamma_{0} in int()\text{int}(\mathcal{I}), and an arbitrary trajectory starts from the initial point (x0,r0)(x_{0},r_{0}) and spirals toward Γ0\Gamma_{0} as t+t\rightarrow+\infty or tt\rightarrow-\infty. Then according to [8, Corollary 10.1], (x0,r0)(x_{0},r_{0}) must have a neighborhood Ω\Omega such that Γ0\Gamma_{0} is the ω\omega-limit set for all points in it. Due to the fact that V(x,r)V(x,r) is the first integral for system (4), V(x,r)V(x,r) must be constant in Ω\Omega because of the continuity. However, the partial derivatives of V(x,r)V(x,r)

Vx(x,r)=2φ(x,r)f2(x,r)\displaystyle\frac{\partial V}{\partial x}(x,r)=2\varphi(x,r)f_{2}(x,r)
Vr(x,r)=2φ(x,r)f1(x,r).\displaystyle\frac{\partial V}{\partial r}(x,r)=-2\varphi(x,r)f_{1}(x,r).

only equal 0 at (x,r)(x^{*},r^{*}), which leads to a contradiction. Therefore, system (4) cannot have limit cycles in \mathcal{I} when μ=0\mu=0.

Appendix C Computation of first Lyapunov coefficient

The following is the process of calculating 1\ell_{1} as presented in [13]. Consider a planar autonomous system y˙=f(y,μ)\dot{y}=f(y,\mu) where y2y\in\mathbb{R}^{2} and μ\mu\in\mathbb{R}. Let A0A_{0} be the Jacobian matrix evaluated at the equilibrium y0y_{0} and the Hopf bifurcation point μh\mu_{h}. Suppose A0A_{0} has two purely imaginary complex conjugate eigenvalues, given by ±iω0\pm i\omega_{0}. The algorithm for computing 1\ell_{1} can be summarized as follows:

  1. 1.

    compute the complex vectors q,p2q,p\in\mathbb{C}^{2} such that

    A0q=iω0q,A0Tp=iω0p,p,q=1A_{0}q=i\omega_{0}q,~{}~{}A_{0}^{T}p=-i\omega_{0}p,~{}\langle p,q\rangle=1

    where p,q=p¯Tq\langle p,q\rangle=\bar{p}^{T}q is the inner product;

  2. 2.

    compose the expression with a complex variable zz

    H(z,z¯)=p,f(y0+zq+z¯q¯,μh),H(z,\bar{z})=\langle p,f(y_{0}+zq+\bar{z}\bar{q},\mu_{h})\rangle,

    and find the coefficients g20g_{20}, g11g_{11}, and g21g_{21} in the formal Taylor series

    H(z,z¯)=iω0z+j+k21j!k!gjkzjz¯k;H(z,\bar{z})=i\omega_{0}z+\sum_{j+k\geq 2}\frac{1}{j!k!}g_{jk}z^{j}\bar{z}^{k};
  3. 3.

    the first Lyapunov coefficient 1(y0,μh)\ell_{1}(y_{0},\mu_{h}) is given by

    1(y0,μh)=12ω02(ig20g11+ω0g21).\ell_{1}(y_{0},\mu_{h})=\frac{1}{2\omega_{0}^{2}}\Re(ig_{20}g_{11}+\omega_{0}g_{21}).

Consider system (4). When μ=μ1\mu=\mu_{1}, we have

A0=[0θζ(θ+1)3(θ+1)(cθ+dθ2μ1θ^)(aθ+bθ2+μ1θ^)θ2ζ20],A_{0}=\begin{bmatrix}0&\frac{-\theta\zeta}{(\theta+1)^{3}}\\ \frac{(\theta+1)(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta^{2}\zeta^{2}}&0\end{bmatrix}, (38)
ω0=(cθ+dθ2μ1θ^)(aθ+bθ2+μ1θ^)θ(θ+1)2ζ.\omega_{0}=\sqrt{\frac{(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta(\theta+1)^{2}\zeta}}. (39)

Then following the above process, we obtain

1(μ1)\displaystyle\ell_{1}(\mu_{1}) =\displaystyle= (40)
θ(bcad)ζ((b+d)(θ3+2θ)+(a+c)(2θ3+1))2ω03(θ+1)4Δ.\displaystyle\frac{\theta(bc-ad)\zeta((b+d)(\theta^{3}+2\theta)+(a+c)(2\theta^{3}+1))}{2\omega_{0}^{3}(\theta+1)^{4}\Delta}.

Appendix D Proof of Lemma 11

{pf}

One only needs to check the case of b\mathcal{B}_{b}, since the other case is analogous. When r(0)=0r(0)=0 initially, we have r˙=0\dot{r}=0, and thus one can focus on the following one-dimensional dynamics about xx:

x˙=x(1x)[ax+b(1x)]+μ(12x),x[0,1].\dot{x}=x(1-x)[ax+b(1-x)]+\mu(1-2x),~{}~{}x\in[0,1]. (41)

Thus, the equilibrium’s xx-coordinates are the solutions of the following equation

x(1x)[ax+b(1x)]=μ(2x1),x[0,1].x(1-x)[ax+b(1-x)]=\mu(2x-1),~{}~{}x\in[0,1]. (42)

The left hand side (LHS) of (42) is non-negative in the interval x[0,1]x\in[0,1] and is exactly equal to 0 at x=0,1x=0,1, while the right hand side (RHS) of (42), μ(2x1)\mu(2x-1), is negative for x[0,1/2)x\in[0,1/2) and positive for x(1/2,1]x\in(1/2,1] when μ(0,1]\mu\in(0,1]. Thus, the LHS and RHS of (42) will be equal at some x(1/2,1)x\in(1/2,1).

Now, we will show that the LHS and RHS of (42) will equal for exactly one value of xx in the range at (1/2,1)(1/2,1). Denote the LHS of (42) by ϕ(x)\phi(x). Its first derivative is ϕ(x)=3(ab)x2+2(a2b)x+b\phi^{\prime}(x)=-3(a-b)x^{2}+2(a-2b)x+b, and the second derivative is ϕ′′(x)=2(a2b)6(ab)x\phi^{\prime\prime}(x)=2(a-2b)-6(a-b)x. We now discuss in the following cases:

  1. 1).

    When ab=0a-b=0, one has ϕ′′(x)=2b<0\phi^{\prime\prime}(x)=-2b<0 as b>0b>0, which implies ϕ(x)\phi(x) is concave. In this case, ϕ(x)\phi(x) equals the RHS of (42) exactly once when x(1/2,1)x\in(1/2,1).

  2. 2).

    When ab0a-b\neq 0, the solution to ϕ′′(x)=0\phi^{\prime\prime}(x)=0 is x=a2b3(ab)x=\frac{a-2b}{3(a-b)}. Then,

    1. i.

      when a2b3(ab)0\frac{a-2b}{3(a-b)}\leq 0 or a2b3(ab)1\frac{a-2b}{3(a-b)}\geq 1, one has b<a2bb<a\leq 2b or a<b2aa<b\leq 2a, respectively. In both cases, ϕ′′(x)\phi^{\prime\prime}(x) is non-positive for x[0,1]x\in[0,1]. Thus, ϕ(x)\phi(x) is concave in [0,1][0,1], which results in the same conclusion as in case 1).

    2. ii.

      when 0<a2b3(ab)<10<\frac{a-2b}{3(a-b)}<1, one has a>2ba>2b or b>2ab>2a. If a>2ba>2b, we have a2b3(ab)<13\frac{a-2b}{3(a-b)}<\frac{1}{3}, which results in ϕ′′(x)<0\phi^{\prime\prime}(x)<0 for x(1/3,1]x\in(1/3,1]. Namely, ϕ(x)\phi(x) is concave in x(1/3,1]x\in(1/3,1]. The same conclusion follows. In the other case, if b>2ab>2a, we have a2b3(ab)>12\frac{a-2b}{3(a-b)}>\frac{1}{2}. Then, one can check that ϕ(x)\phi^{\prime}(x) is negative for x(1/2,1]x\in(1/2,1]. The strictly decreasing function ϕ(x)\phi(x) will only be equal to the RHS of (42) for one time when x(1/2,1]x\in(1/2,1].

When θ1\theta\geq 1, the latter statement of the lemma holds trivially since 1/(θ+1)1/21/(\theta+1)\leq 1/2. For the case θ<1\theta<1, one can substitute x=1/(θ+1)x=1/(\theta+1) into the RHS of (41), which yields μ(θ1)(θ+1)2+θ(a+θb)(θ+1)3\frac{\mu(\theta-1)(\theta+1)^{2}+\theta(a+\theta b)}{(\theta+1)^{3}}. This value is positive for any μ(0,1]\mu\in(0,1] because of (6). Since the RHS of (41) is negative at x=1x=1, then due to continuity, the RHS of (41) equals 0 when x(1/(θ+1),1)x\in(1/(\theta+1),1).

Appendix E Proof of Lemma 13

{pf}

The proof is inspired by [10, Theorem 12.2.1]. As stated before, the vectors on the left side l\mathcal{B}_{l} and the right side r\mathcal{B}_{r} point inwards to int()\mathrm{int}(\mathcal{I}). This implies that these two sides are repelling. Therefore, the subsequent discussion is restricted to the top and bottom sides t\mathcal{B}_{t} and b\mathcal{B}_{b}, which are positively invariant.

We define a partial distance function F:[0,1]F:[0,1]\to\mathbb{R} as below:

F(r)=r(1r),F(r)=r(1-r), (43)

which indicates the distance of the system states (x,r)(x,r) to the top and bottom sides t\mathcal{B}_{t} and b\mathcal{B}_{b}. It is positive in int()\text{int}(\mathcal{I}) and only equals zero on these two sides. In addition, as rr increases from 0 to 11, the function value F(r)F(r) monotonically increases from 0 to the maximal value at r=1/2r=1/2 and then monotonically decreases to 0 at r=1r=1. The derivative of FF with respect to time tt is

F˙(r)=r˙(12r)=F(r)Φ(x,r),\displaystyle\dot{F}(r)=\dot{r}(1-2r)=F(r)\Phi(x,r), (44)

where Φ(x,r):=(12r)[(θ+1)x1]\Phi(x,r):=(1-2r)[(\theta+1)x-1]. We want to show that there exists a δ^>0\hat{\delta}>0 such that lim inftF(r(t))>δ^\liminf_{t\to\infty}F(r(t))>\hat{\delta}, which implies that t\mathcal{B}_{t} and b\mathcal{B}_{b} are repelling. We only discuss the case of b\mathcal{B}_{b} here, as the analysis for t\mathcal{B}_{t} is similar and thus omitted. Denote the unique interior equilibrium by q=(x,r)q^{*}=(x^{*},r^{*}), as given before. Choose 0<r<min{r,1/2}0<r^{\prime}<\min\{r^{*},1/2\}, and let m=F(r)=r(1r)m=F(r^{\prime})=r^{\prime}(1-r^{\prime}). We define a lower region by Il(m):={(x,r):0<F(r)m,0<rr}I_{l}(m):=\{(x,r)\in\mathcal{I}:0<F(r)\leq m,0<r\leq r^{\prime}\}. Note that Il(m)b=I_{l}(m)\cap\mathcal{B}_{b}=\emptyset, and the condition 0F(r)m0\leq F(r)\leq m is redundant in this definition, but it simply indicates that the set is related to the partial distance function evaluated at r=rr=r^{\prime}. Denote the solution of the dynamics (4) by ξ(t):=(x(t),r(t))\xi(t):=(x(t),r(t)), given some initial condition ξ(0)=(x(0),r(0))\xi(0)=(x(0),r(0))\in\mathcal{I}.

First we show that if ξ(0)Il(m)\xi(0)\in I_{l}(m), then there exists a time instant T>0T>0, such that ξ(T)Il(m)\xi(T)\notin I_{l}(m). Define a line LcL_{c} in the lower region Il(m)I_{l}(m) by Lc:={(x,r)Il(m):x=1/(θ+1)}L_{c}:=\{(x,r)\in I_{l}(m):x=1/(\theta+1)\}. Note that on this line, r˙=0\dot{r}=0 and x˙ϵ>0\dot{x}\geq\epsilon>0 for some positive constant ϵ\epsilon, and thus the vectors point horizontally towards the positive xx-direction. The line LcL_{c} divides the lower region Il(m)I_{l}(m) into the left region JL:={(x,r)Il(m):0x<1/(θ+1)}J_{L}:=\{(x,r)\in I_{l}(m):0\leq x<1/(\theta+1)\} and the right region JR:={(x,r)Il(m):1/(θ+1)<x1}J_{R}:=\{(x,r)\in I_{l}(m):1/(\theta+1)<x\leq 1\}. Note that F˙=F(r)Φ(x,r)>0\dot{F}=F(r)\Phi(x,r)>0 in JRJ_{R} while F˙<0\dot{F}<0 in JLJ_{L}. Therefore, if ξ(0)JR\xi(0)\in J_{R}, the trajectory of r(t)r(t) will increase, and there exists a T>0T>0 such that ξ(T)Il(m)\xi(T)\notin I_{l}(m) (due to the directions of vectors on the line LcL_{c}, the right side r\mathcal{B}_{r} and F˙(r)ϵ>0\dot{F}(r)\geq\epsilon^{\prime}>0 for some positive constant ϵ\epsilon^{\prime} in a sufficiently large compact set Ω\Omega satisfying {(x,r)JR:r=r}ΩJR\{(x,r)\in J_{R}:r=r^{\prime}\}\subset\Omega\subset J_{R} and ξ(0)Ω\xi(0)\in\Omega). If ξ(0)JL\xi(0)\in J_{L}, then the trajectory ξ(t)\xi(t) will not always stay in JLJ_{L} due to the Poincaré-Bendixson theorem, since there are no equilibria in JLJ_{L}. As F˙(r)<0\dot{F}(r)<0 in JLJ_{L}, the trajectory of r(t)r(t) will decrease, and x(t)x(t) will cross the center line LcL_{c}, and enter the right region JRJ_{R}. Due to the uniqueness of solutions of (4), similar to the previous analysis of the right region, there exits a T>0T>0 such that ξ(T)Il(m)\xi(T)\notin I_{l}(m).

Define Il¯(m):=Il(m)b\overline{I_{l}}(m):=I_{l}(m)\cup\mathcal{B}_{b}. Next we show that there exists n(0,m)n\in(0,m), such that if ξ(0)Il¯(m)\xi(0)\notin\overline{I_{l}}(m), then ξ(t)Il(n):={(x,r):0<F(r)n,0<rr}\xi(t)\notin I_{l}(n):=\{(x,r)\in\mathcal{I}:0<F(r)\leq n,0<r\leq r^{\prime}\} for all t0t\geq 0. For zIl¯(m)z\in\overline{I_{l}}(m), let Tm(z):=inf{T0:ξ(0)=z,ξ(T)Il(m)}T_{m}(z):=\inf\{T\geq 0:\xi(0)=z,\xi(T)\notin I_{l}(m)\}. Note that when zbz\in\mathcal{B}_{b}, then Tm(z)=0T_{m}(z)=0 trivially since Il(m)b=I_{l}(m)\cap\mathcal{B}_{b}=\emptyset. As we have shown previously, any trajectory starting in the lower region Il(m)I_{l}(m) will leave this region at some finite time instant. Therefore, we can define T¯:=supzTl¯(m)Tm(z)<\overline{T}:=\sup_{z\in\overline{T_{l}}(m)}T_{m}(z)<\infty. Let t0t_{0} be the first time instant when the trajectory ξ(t)\xi(t) reaches Il¯(m)\overline{I_{l}}(m) (i.e., t0=min{T>0:ξ(T)Il¯(m)}t_{0}=\min\{T>0:\xi(T)\in\overline{I_{l}}(m)\}), and let ξ(t0)=(x(t0),r(t0))\xi(t_{0})=(x(t_{0}),r(t_{0})). It is obvious that r(t0)=rr(t_{0})=r^{\prime} and F(r(t0))=mF(r(t_{0}))=m, namely, the trajectory reaches the boundary of the lower region Il(m)I_{l}(m) at time t0t_{0}. Let Φmin=min(x,r)Φ(x,r)\Phi_{\rm min}=\min_{(x,r)\in\mathcal{I}}\Phi(x,r) (the minimum is attainable on the compact set \mathcal{I}). Then for t(t0,t0+T¯)t\in(t_{0},t_{0}+\overline{T}), we have

1tt0t0tΦ(x,r)𝑑tΦmin.\frac{1}{t-t_{0}}\int_{t_{0}}^{t}\Phi(x,r)dt\geq\Phi_{\rm min}. (45)

Since Φ(x,r)=F˙(r)/F(r)\Phi(x,r)=\dot{F}(r)/F(r), the equation (45) can be further derived:

1tt0t0tddt(lnF(r))𝑑t=1tt0lnF(r(t))F(r(t0))Φmin,\frac{1}{t-t_{0}}\int_{t_{0}}^{t}\frac{d}{dt}\big{(}\ln F(r)\big{)}dt=\frac{1}{t-t_{0}}\ln\frac{F(r(t))}{F(r(t_{0}))}\geq\Phi_{\rm min},

which implies

F(r(t))F(r(t0))eΦmin(tt0)>meΦminT¯.F(r(t))\geq F(r(t_{0}))e^{\Phi_{\rm min}(t-t_{0})}>me^{\Phi_{\rm min}\overline{T}}.

Let n=meΦminT¯n=me^{\Phi_{\rm min}\overline{T}}. Therefore we have shown that the trajectory will not reach Il(n)I_{l}(n) for t(t0,t0+T¯)t\in(t_{0},t_{0}+\overline{T}). Since T¯\overline{T} is the supremum of the duration when the solution could stay in the lower region Il(m)I_{l}(m), then there exists T′′(t0,t0+T¯]T^{\prime\prime}\in(t_{0},t_{0}+\overline{T}] such that ξ(T′′)Il(m)\xi(T^{\prime\prime})\notin I_{l}(m). Therefore, the trajectory ξ(t)\xi(t) will leave the lower region Il(m)I_{l}(m) without reaching Il(n)Il(m)I_{l}(n)\subset I_{l}(m). Repeating this argument, we conclude that the trajectory ξ(t)\xi(t) will never reach Il(n)I_{l}(n). Thus we have shown that there exists 0<δ^<n0<\hat{\delta}<n such that lim inftF(r(t))>δ^\liminf_{t\to\infty}F(r(t))>\hat{\delta}, and thus the lower side b\mathcal{B}_{b} is repelling.

Appendix F Functions Expressions in Proof of Lemma 16

  1. (1)

    One does not need to obtain the explicit expression of G(x~)G(\tilde{x}), since the integrand, g(s)=2sα(s)=2s1/4s2g(s)=\frac{2s}{\alpha(s)}=\frac{2s}{1/4-s^{2}} is an odd function. Obviously, we have G(ν)=0νg(s)𝑑s=ν0g(s)𝑑s=G(ν)G(\nu)=\int_{0}^{\nu}g(s)ds=-\int^{0}_{-\nu}g(s)ds=G(-\nu);

  2. (2)

    F(x~,r~)φ(r~)=2μx~8μ1x~(1/4x~2)(1/4x~2)(b+d)r~\dfrac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})}=\dfrac{2\mu\tilde{x}-8\mu_{1}\tilde{x}(1/4-\tilde{x}^{2})}{(1/4-\tilde{x}^{2})(b+d)\tilde{r}};

  3. (3)

    g(x~)F(x~,r~)=4μx~216μ1x~2(1/4x~2)(1/4x~2)2(r~+r)(1r~r)g(\tilde{x})F(\tilde{x},\tilde{r})=\dfrac{4\mu\tilde{x}^{2}-16\mu_{1}\tilde{x}^{2}(1/4-\tilde{x}^{2})}{{(1/4-\tilde{x}^{2})}^{2}(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*})};

  4. (4)

    F(x~,r~)=2μx~8μ1x~(1/4x~2))(1/4x~2)(r~+r)(1r~r)F(\tilde{x},\tilde{r})=\dfrac{2\mu\tilde{x}-8\mu_{1}\tilde{x}(1/4-\tilde{x}^{2}))}{(1/4-\tilde{x}^{2})(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*})}.

Appendix G Proof of Lemma 18

{pf}

In view of the system equations (27), the xx-coordinates of the equilibria on t\mathcal{B}_{t} are obtained by solving the following polynomial equation

x(1x)[x(dc)d+u]=μ(2x1),x[0,1].x(1-x)[x(d-c)-d+u]=\mu(2x-1),~{}~{}x\in[0,1]. (46)

The RHS of (46), μ(2x1)\mu(2x-1), is a linear function of xx which equals 0 at x=1/2x=1/2. It is negative for x(0,1/2)x\in(0,1/2) and positive for x(1/2,1)x\in(1/2,1) when μ(0,1]\mu\in(0,1]. The LHS of (46) equals 0 at the two end points x=0,1x=0,1.

When u>(c+d)/2u>(c+d)/2 and c=dc=d, the LHS will be always positive for x(0,1)x\in(0,1), such that the root of (46) is in the range (1/2,1)(1/2,1). The value of LHS will change the sign from positive to negative (resp. negative to positive) as xx increases from 0 to 11 when c>dc>d (resp. c<dc<d), and it equals 0 also at x=(du)/(dc)x=(d-u)/(d-c). One has (du)/(dc)<1/2(d-u)/(d-c)<1/2 and (du)/(dc)>1/2(d-u)/(d-c)>1/2 respectively for c<dc<d and c>dc>d. For both cases, there will be some x(1/2,1)x\in(1/2,1) such that the RHS and LHS are equal as shown geometrically in Fig. (6). Therefore, one can conclude that there is always one equilibrium with xx-coordinate being 1/2<xt<11/2<x_{t}^{*}<1.

Refer to caption
Figure 6: The plot of the LHS and RHS of (46) as functions of xx. In all cases, the curves of LHS and RHS have exactly one intersection (red dot) in the range x(1/2,1)x\in(1/2,1).

Next, only in the case c<dc<d, the curves of the LHS and RHS can have one or two intersections in the range 0<x<1/20<x<1/2. Thus, the other possible equilibria on t\mathcal{B}_{t} can only be located in x(0,1/2)x\in(0,1/2).