Limit Cycles Analysis and Control of Evolutionary Game Dynamics with Environmental Feedback

Lulu Gong Weijia Yao Jian Gao Ming Cao ENTEG, Faculty of Science and Engineering, University of Groningen, 9747 AG, Groningen, The Netherlands, (e-mail: [email protected], [email protected], and [email protected]). School of Science, Beijing University of Posts and Telecommunications, Beijing 100876, China, (e-mail: [email protected]).

Abstract

Recently, an evolutionary game dynamics model taking into account the environmental feedback has been proposed to describe the co-evolution of strategic actions of a population of individuals and the state of the surrounding environment; correspondingly a range of interesting dynamic behaviors have been reported. In this paper, we provide new theoretical insight into such behaviors and discuss control options. Instead of the standard replicator dynamics, we use a more realistic and comprehensive model of replicator-mutator dynamics, to describe the strategic evolution of the population. After integrating the environment feedback, we study the effect of mutations on the resulting closed-loop system dynamics. We prove the conditions for two types of bifurcations, Hopf bifurcation and Heteroclinic bifurcation, both of which result in stable limit cycles. These limit cycles have not been identified in existing works, and we further prove that such limit cycles are in fact persistent in a large parameter space and are almost globally stable. In the end, an intuitive control policy based on incentives is applied, and the effectiveness of this control policy is examined by analysis and simulations.

keywords:

Replicator-mutator dynamics, game-environment feedback, limit cycles, Hopf bifurcation, Heteroclinic bifurcation, local and global stability, incentive-based control.

keywords:

Replicator-mutator dynamics, game-environment feedback, limit cycles, incentive-based control.

^†^†thanks: The work of M. Cao is supported in part by the European Research Council (ERC-CoG-771687). L. Gong, W. Yao, and J. Gao are funded by China Scholarship Council (CSC).

, , ,

1 Introduction

Evolutionary game theory studies the evolution of strategic decision-making processes of individuals in one or multiple populations. A range of mathematical models have been extensively investigated, among which the replicator dynamics model is the most well-known [10]. Powerful theories and tools originating from evolutionary game theory facilitate the study of complex system behaviors of biological, ecological, social, and engineering fields [19], [12], [14], [22], [30]. In the classic game setting, the payoffs in each two-player game are usually predetermined in the form of constant payoff matrices. However, in many applications, especially those involving common resources in the environment that are consumed by groups of individuals, it is recognized that the payoffs can change over time or be affected directly by external environmental factors. Thus game-playing individuals’ decisions can influence the surrounding environment, and the environment, especially its richness in resources, also acts back on the payoff distributions, which forms the so-called bi-directional game-environment feedback [35]. Such a feedback mechanism is believed to be able to explain rich real-world complex population dynamics, including human decision-making, social culture evolution, plant nutrient acquisition, and natural resource harvesting. In particular, evolutionary dynamics with game-environment feedback have attracted great attention in recent years, due to its high relevance in biological and sociological systems [24], [14], [32].

This line of research is originated in [35], where the replicator dynamics of a two-player two-strategy game are coupled with the logistic environment resource dynamics. Utilizing an elaborate environment-dependent payoff matrix, the joint game-environment dynamics exhibit interesting system behaviors. In most cases, the strategic states of the population and the environment converge to an equilibrium point on the boundary of the phase space with the zero value of environment state, which reflects the tragedy of the commons. The system dynamics may also show cyclic oscillations which correspond to closed periodic orbits. Particularly when the system dynamics converge to a heteroclinic cycle on the boundary, the environment state cycles between low and high values, which is referred to as the oscillating tragedy of commons. Since then, [35] has been followed by several extensions and variations [7], [32], [4], [11], [18], [15]. The game-environment feedback mechanism has been studied more broadly in different applications, while its theoretical settings have been adapted in different research directions, e.g. implementing different types of environment resource models and considering the structures of the populations. New meaningful dynamical behaviors have been identified, and these results are in turn helpful to understand real-world applications.

The majority of the existing studies on the game-environment framework chooses replicator dynamics to model the strategic dynamics of the game-environment system. While the replicator dynamics have been proved to be a powerful model in analyzing a variety of classical games, they do not take into account the mutation or exploration in strategies which exists ubiquitously in biological and sociological systems. Genetic mutations typically occur with small but non-negligible probabilities, and random exploration of available strategies is common in behavioral experiments [33]. Mutations and explorations can be captured by allowing individuals to spontaneously switch from one strategy to another in small probability, which results in the so-called replicator-mutator dynamics. It has also played a prominent role in evolutionary game theory and has been applied frequently in different fields [19], [12], [23], [22].

We note that although rich system behaviors, such as convergence to equilibria or heteroclinic cycles on the boundary and existence of neutral periodic orbits, have been identified in related works, the results on the limit cycle dynamics have been rarely reported in the game-environment feedback setting. Only in [32], it is shown that the time-scale difference between game and environment dynamics can result in limit cycles. When mutations are taken into account, it is of great interest to study if the coupled game and environment dynamics can exhibit limit cycles.

We also emphasize that research on how to design reasonable and effective control policies to achieve expected system behaviors in evolutionary games has attracted increasing attention [17], [37], [25], [27]. Specifically, using game-environment feedback, [21] considers optimal control problems where control takes the form of the incentives and opinions respectively. The adopted control policies can maximize accumulated resource over time. However, since the controlled system still exhibits heteroclinic oscillations, repeated collapses of the resource are inevitable. [34] considers adjusting the law of feedback from population states to the environment based on a slightly different co-evolutionary model of public goods games. An involved nonlinear control law is proved to be able to steer the system to evolve towards the designed behaviors, but the biological implication of such control policies still remains open to debate.

Motivated by all the unsolved issues just mentioned, in this work we study the simultaneous co-evolution of game and environment using an integrated model consisting of replicator-mutator dynamics and logistic resource dynamics. Although similar linear environment-dependent payoff matrices have been studied before, we focus on investigating the significant impact of mutations on the overall system dynamics using bifurcation theory as the main tool. In particular, we show that two different types of bifurcations, which lead to limit cycles, can take place. In sharp comparison to the marginal oscillating behaviors reported before, such as neutral periodic orbits and heteroclinic cycles, we clarify that the identified limit cycle conditions correspond to a large space of the mutation parameter and such limit cycles are almost globally stable. In this sense, the limit cycle behavior is robust and persistent, which agrees with biological observations. Furthermore, when mutations are not too rare, the corresponding limit cycle will not be close to the boundary, and consequently the (oscillating) tragedy of commons can be averted.

Enabled by the new insight into co-evolutionary dynamics of the game and environment, we consider further a simple but effective incentive-based control policy, where the individuals receive some external incentives in the game interactions if they choose the cooperation strategy. It is shown that the controlled system dynamics can converge to certain equilibria with a high level of the environmental resource. Thus, the effectiveness of the proposed control policy is guaranteed. Compared with the proposed control policies in [[21]] and [[34]], our proposed approach is not only more effective, but also easier to implement than influencing the opinions of the individuals or realizing more complicated feedback between strategists and the environment.

Our preliminary conference paper [5] considers Hopf bifurcation and limit cycles of the system dynamics in a special case where the enhancement effect and the degradation effect is balanced, but in this paper we consider general bifurcation conditions and provide a comprehensive account of the results. Moreover, the latter part about incentive-based control of this paper is not in the conference version.

The remainder of this paper is organized as follows. Section $2$ provides the framework of the co-evolutionary dynamics, and introduces the integrated model with game-environment feedback. The analysis of system dynamics is presented in Section $3$ . The incentive-based control is applied and its performance is provided in Section $4$ . Finally, concluding remarks and discussion points are given in Section $5$ .

2 Problem Formulation

Co-evolutionary game and environment dynamics occur because evolutionary game dynamics are sometimes coupled with the dynamics of the surrounding environment (see Fig. 1 for an illustrative drawing). The coupled game and environment dynamics form a closed-loop system, which consists of bi-directional actions between the environment and the population’s game play: an individual’s fitness depends on not only the population state, but also the state of the environment, and the state of the environment is influenced by the distribution of the choices of strategies in the population. In the following subsections, we will introduce each component of this system in turn, and then integrate them to give a complete description of our model.

Refer to caption — Figure 1: The framework of evolutionary game dynamics with the environmental feedback

2.1 Environment-dependent payoffs

We consider games played in a changing environment, which is characterized by the richness $r\in[0,1]$ of a resource of interest; such a resource has direct influence on the payoffs that players receive. Correspondingly, the payoffs become dynamic and depend on $r$ . We consider a two-player game with two strategies, Cooperation and Defection (C, D). Following [35], we assume that the environment-dependent payoff matrix is a convex combination of the payoff matrices from the classic Prisoner’s Dilemma, i.e.,

	$\displaystyle A(r)$	$\displaystyle=(1-r)\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}+r\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}$		(1)
		$\displaystyle=\begin{bmatrix}R_{1}+(R_{2}-R_{1})r&S_{1}+(S_{2}-S_{1})r\\ T_{1}+(T_{2}-T_{1})r&P_{1}+(P_{2}-P_{1})r\end{bmatrix},$		(1)

where $R_{1}>T_{1}$ , $S_{1}>P_{1}$ , $R_{2}<T_{2}$ and $S_{2}<P_{2}$ . The entries of $A(r)$ represent the payoffs to the players at the given environment state $r$ . For example, a C player receives the payoff of $R_{1}+(R_{2}-R_{1})r$ and $S_{1}+(S_{2}-S_{1})r$ when the opponent uses strategies C and D, respectively. Because of the inequalities below (1), the two matrices $\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}$ and $\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}$ correspond to the prisoner’s dilemma and its inverse game, respectively. Note that the payoff matrix $A(r)$ is the linear interpolation between the above two matrices. Thus, when $r$ approaches $0$ and $1$ , the mutual cooperation and defection tend to be the Nash Equilibrium of this dynamic game respectively. The biological interpretation is that: the individuals tend to cooperate facing the depleted environmental resource, and become more selfish and choose to defect while the resource becomes abundant.

2.2 Replicator-Mutator equations

The two-player game governed by the payoff matrix $A(r)$ in (1) are played in a well-mixed and infinite population. We denote the proportion of individuals choosing C by the variable $x\in[0,1]$ . Since there are only two strategies, the population state can be represented by the vector $\mathbf{x}=[x~{}~{}1-x]^{\top}$ . Assume the individuals with different strategies can mutate into each other with an identical probability $\mu\in[0,1]$ : a fraction of cooperators, $\mu x$ , spontaneously change to choose strategy D; a fraction of defectors, $\mu(1-x)$ , change to choose strategy C at the same time. We focus on the dynamics of $x$ , which can be described by the following replicator-mutator equation

\dot{x}=x[(A(r)\mathbf{x})_{1}-\mathbf{x}^{\top}A(r)\mathbf{x}]-\mu x+\mu(1-x),

(2)

where $(A(r)\mathbf{x})_{1}$ is the first entry of $A(r)\mathbf{x}$ and represents the fitness of choosing strategy C, and $\mathbf{x}^{\top}A(r)\mathbf{x}$ is the average fitness at the population state $\mathbf{x}$ . The term $-\mu x$ represents the mutation from strategy C towards D, while $\mu(1-x)$ captures the mutation in the opposite direction, from D towards C.

2.3 Closed-loop system with game-environment feedback

To model the evolution of the environmental change, we use the standard logistic model

\dot{r}=r(1-r)[\theta x-(1-x)],

(3)

where the parameter $\theta>0$ represents the ratio between the enhancement effect due to cooperation and degradation effect due to defection. The influence of the population on the environment state is captured by the function $[\theta x-(1-x)]$ , and the dynamics are restricted in the unit interval by $r(1-r)$ . When $\theta=1$ , the enhancement effect and the degradation effect are balanced. And if $\theta>1$ or $0<\theta<1$ , the enhancement effect is respectively stronger or weaker than the opposite.

Substituting (1) into (2), and then combining with (3), we obtain the closed-loop planar system describing the evolutionary game dynamics under the game-environment feedback:

\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[xr(-c+d-a+b)+x(a-b)\\ &-r(d+b)+b]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)[\theta x-(1-x)],\end{cases}

(4)

where to simplify notations, we have used $a=R_{1}-T_{1}$ , $b=S_{1}-P_{1}$ , $c=T_{2}-R_{2}$ , and $d=P_{2}-S_{2}$ , all of which are positive because of the inequalities below (1). Note that system (4) reduces to the model considered in [35] when $\mu=0$ . Therefore, system (4) generalizes the previous model.

In view of the payoff matrix (1) and system equations (4), these four parameters for re-parameterization have intuitive interpretations with respect to strategy changes when the resource is either abundant or depleted: the parameter $a$ quantifies the incentive to stick to strategy C given that all individuals are following strategy C and the system is currently in a resource-poor state; $b$ quantifies the incentive to switch to strategy C when the resource is depleted and all individuals enforce strategy D; in contrast, $c$ quantifies the incentive to switch to strategy D given that all individuals enforce strategy C and the system is in a state of sufficient resource; $d$ quantifies the incentive to stick to strategy D when the resource is abundant and all individuals are following strategy D.

3 System Dynamics and Bifurcations

The state space of system (4) is the unit square $\mathcal{I}=[0,1]^{2}$ . The interior region of this square is denoted by $\text{int}(\mathcal{I})=(0,1)^{2}$ . Four sides of the unit square form the boundary $\partial\mathcal{I}$ . For convenience of exposition, we denote the four sides by:

		$\displaystyle\mathcal{B}_{t}=\{(x,r):x\in[0,1],r=1\},$		(5)
		$\displaystyle\mathcal{B}_{b}=\{(x,r):x\in[0,1],r=0\},$
		$\displaystyle\mathcal{B}_{l}=\{(x,r):x=0,r\in[0,1]\},$
		$\displaystyle\mathcal{B}_{r}=\{(x,r):x=1,r\in[0,1]\}.$

Then, one has $\partial\mathcal{I}:=\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup\mathcal{B}_{l}\cup\mathcal{B}_{r}$ . It is easy to check that $\mathcal{I}$ is invariant under the dynamics (4) for all $\mu\in[0,1]$ by Nagumo’s theorem [1, Theorem 4.7].

System (4) can have equilibria in the interior $\text{int}(\mathcal{I})$ ; we call them interior equilibria. System (4) also has equilibria on the boundary $\partial\mathcal{I}$ (boundary equilibria), which will be discussed in detail in Section 3.4. It is easy to check that the interior equilibria, if they exist, must lie on the line $\{(x,r)\in\text{int}(\mathcal{I}):x=1/(\theta+1)\}$ . Substituting such $x$ into the $x$ -dynamics in (4), we have

	$\displaystyle 0=$	$\displaystyle x(1-x)[xr(-c+d-a+b)$
		$\displaystyle+x(a-b)-r(d+b)+b]+\mu(1-2x),$

which leads to

r=\frac{\theta a+\theta^{2}b+\mu(\theta^{3}+\theta^{2}-\theta-1)}{\theta(a+c+\theta b+\theta d)}.

Thus if the following condition holds

-(\theta a+\theta^{2}b)<\mu(\theta^{3}+\theta^{2}-\theta-1)<\theta c+\theta^{2}d,

(6)

system (4) has a unique interior equilibrium

(x^{*},r^{*})=\left(\frac{1}{\theta+1},\frac{\theta a+\theta^{2}b+\mu(\theta^{3}+\theta^{2}-\theta-1)}{\theta(a+c+\theta b+\theta d)}\right).

(7)

Otherwise, no interior equilibrium exists. The interior equilibrium supports the coexistence of the two strategies under a non-depleted resource state. In view of (6), one can see that for a given $\theta$ , the existence of the interior equilibrium is determined by not only the payoff parameters but also the mutation rate.

Note that when $\theta=1$ , $(x^{*},r^{*})$ is independent of the parameter $\mu$ , and (6) holds trivially for all $\mu\in[0,1]$ . This specific case has been studied in our previous conference paper [5]. In this paper we do not require $\theta=1$ and study the generic $\theta$ that takes any positive value satisfying (6).

Towards the goal of establishing conditions for the existence of limit cycles, we first present some preliminary results on the non-existence of closed orbits or limit cycles.

3.1 Non-existence of closed orbits

First, we define the $\mathbf{C}^{1}$ function $\varphi(x,r):\text{int}(\mathcal{I})\rightarrow\mathbb{R}$ by

\varphi(x,r)=x^{\alpha}(1-x)^{\beta}r^{\gamma}(1-r)^{\delta}

(8)

with the parameters $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ to be determined later. Note that $\varphi(x,r)$ is strictly positive in $\text{int}(\mathcal{I})$ for any values of these parameters. Denote the right hand side of (4) by $f(x,r)=[f_{1}(x,r),f_{2}(x,r)]^{\top}$ , and then the divergence of the modified vector field $\varphi f(x,r)$ on $\text{int}(\mathcal{I})$ is given by

		$\displaystyle\text{div}~{}\varphi f(x,r)=\frac{\partial(\varphi f_{1})}{\partial x}(x,r)+\frac{\partial(\varphi f_{2})}{\partial r}(x,r)$		(9)
		$\displaystyle=\varphi(x,r)(-(\alpha+\beta+3)(-c+d-a+b)x^{2}r$
		$\displaystyle~{}~{}~{}-(\alpha+\beta+3)(a-b)x^{2}$
		$\displaystyle~{}~{}~{}+[(-c+2d-a+2b)\alpha+(d+b)\beta$
		$\displaystyle~{}~{}~{}-(\theta+1)(\gamma+\delta+2)+2(-c+2d-a+2b)]xr$
		$\displaystyle~{}~{}~{}+[(a-2b)\alpha-b\beta+(\theta+1)(1+\gamma)+2a-4b]x$
		$\displaystyle~{}~{}~{}+[-(d+b)\alpha+\gamma+\delta+2-b-d]r$
		$\displaystyle~{}~{}~{}+\frac{\mu(1-2x)(\alpha-\alpha x-\beta x)}{x(1-x)}$
		$\displaystyle~{}~{}~{}+(1+\alpha)b-(1+\gamma)-2\mu).$

Let all the coefficients of the powers of $x$ , $r$ in (9) (except for the term containing $\mu$ ) be zero, and we obtain the following set of equations

$\displaystyle(\alpha+\beta+3)(-c+d-a+b)$	$\displaystyle=0$	(10)
$\displaystyle(\alpha+\beta+3)(a-b)$	$\displaystyle=0$
$\displaystyle(-c+2d-a+2b)\alpha+(d+b)\beta$
$\displaystyle-(\theta+1)(\gamma+\delta+2)+2(-c+2d-a+2b)$	$\displaystyle=0$
$\displaystyle(a-2b)\alpha-b\beta+(\theta+1)(\gamma+1)+2a-4b$	$\displaystyle=0$
$\displaystyle-(d+b)\alpha+\gamma+\delta+2-b-d$	$\displaystyle=0.$

We first show that the equation set (10) always has solutions.

Lemma 1.

There always exist some $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ as solutions to (10).

{pf}

See Appendix A. ∎

In fact, if one of $-c+d-a+b$ and $a-b$ is not zero, (10) will have exactly one solution

		$\displaystyle\alpha=-\frac{\zeta+c+a}{\zeta},$		(11)
		$\displaystyle\beta=\frac{-2\zeta+c+a}{\zeta},$
		$\displaystyle\gamma=\frac{-(a+\theta+1)\zeta+a^{2}+ac-ab-bc}{(\theta+1)\zeta},$
		$\displaystyle\delta=-\frac{(a+c)(\theta b+\theta d+d+a)+(\theta+1-a)\zeta}{(\theta+1)\zeta}$

with $\zeta=a+c+\theta b+\theta d>0$ . Otherwise, (10) will have infinitely many solutions including (11). So we fix the parameters $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ at the values given in (11) such that the divergence (9) can be greatly simplified.

Then we show the non-existence of closed orbits for system (4).

Lemma 2.

For system (4), the following statements hold:

1.

when $ad-bc>0$ , there exists some $\mu_{0}>0$ such that there are no closed orbits in $\mathcal{I}$ for $\mu\in[\mu_{0},1]$ ;
2.

when $ad-bc<0$ , there are no closed orbits in $\mathcal{I}$ for $\mu\in[0,1]$ ;
3.

when $ad-bc=0$ , there are no closed orbits in $\mathcal{I}$ for $\mu\in(0,1]$ .

{pf}

The proof is direct by applying Bendixson-Dulac criterion [36, Theorem 4.1.2]. Since the boundary $\partial\mathcal{I}$ contains equilibria (as shown in Section 3.4), it is not possible to have periodic orbits on it. And the sides are either invariant or repelling, so they cannot intersect with closed orbits. Now let us consider the interior $\text{int}(\mathcal{I})$ which is simply connected.

In view of the fact that $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ are given as in (11), one can calculate the divergence of $\varphi f(x,r)$

		$\displaystyle\text{div}~{}\varphi f(x,r)$		(12)
		$\displaystyle=\varphi(x,r)\bigg{(}\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)}+(1+\alpha)b$
		$\displaystyle\quad\quad\quad\quad\quad-(1+\gamma)-2\mu\bigg{)}$
		$\displaystyle=\varphi(x,r)\left(\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)}-2\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta}\right).$

From (11) one knows $\alpha\in(-2,-1)$ , and thus the term $g(x):=\frac{\mu(1-2x)(3x+\alpha)}{x(1-x)}$ takes its maximum value at $\bar{x}\in(0,1)$ given by

\bar{x}=\begin{cases}&\frac{\alpha-\sqrt{-\alpha(3+\alpha)}}{3+2\alpha},~{}-1.5<\alpha<-1,\\ &0.5,~{}\quad\quad\quad\quad\quad\quad~{}~{}~{}\alpha=-1.5,\\ &\frac{\alpha+\sqrt{-\alpha(3+\alpha)}}{3+2\alpha},~{}-2<\alpha<-1.5.\end{cases}

(13)

Rearranging (12) yields

	$\displaystyle\frac{\text{div}~{}\varphi f(x,r)}{\varphi(x,r)}$	$\displaystyle\leq\frac{\mu(1-2\bar{x})(3\bar{x}+\alpha)}{\bar{x}(1-\bar{x})}-2\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta}$		(14)
		$\displaystyle=\frac{(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}}{\bar{x}(1-\bar{x})}\mu-\frac{\theta(bc-ad)}{(\theta+1)\zeta}.$		(14)

It is easy to check that $\frac{(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}}{\bar{x}(1-\bar{x})}$ is negative for any value of $\alpha\in(-2,-1)$ . Since $\varphi(x,r)$ is positive in $\text{int}(\mathcal{I})$ by definition, if $ad-bc>0$ , from (14) one has $\text{div}~{}\varphi f(x,r)\leq 0$ when

\mu\geq\mu_{0}:=\frac{\theta(bc-ad)\bar{x}(1-\bar{x})}{(\theta+1)\zeta[(1-2\alpha)\bar{x}+\alpha-4\bar{x}^{2}]}.

(15)

And because the divergence is not identically zero in $\text{int}(\mathcal{I})$ , from the Bendixson-Dulac criterion, one knows that no closed orbits can exist in $\text{int}(\mathcal{I})$ for $\mu\geq\mu_{0}$ .

For the second case, it is easy to prove since the right hand side of (14) will always be negative for any $\mu\geq 0$ . For the third case, the right hand side of (14) is always negative when $\mu>0$ . ∎ In Lemma 2, it is noted that the conditions for the non-existence of periodic orbits depend on the relationship between $ad$ and $bc$ . The reason is as follows: when $ad-bc<0$ , the divergence $\text{div}~{}\varphi f(x,r)$ will remain positive for any $\mu\in[0,1]$ in view of (14), which excludes the possibility of periodic orbits in the phase space; when $ad-bc>0$ or $ad-bc=0$ , there exists some $\mu$ such that $\text{div}~{}\varphi f(x,r)$ equals $0$ . As a consequence, it is possible for the system to have periodic orbits in these two cases. Another point worth noting is that the only difference between the second and third cases of Lemma 2 is that system (4) may have periodic solutions for $\mu=0$ when $ad-bc=0$ . This possibility has been shown in [35], [4]. Despite the possibility of closed orbits under certain conditions, for $\mu=0$ it can be proven that limit cycles (isolated closed orbits) are impossible to exist in (4) for all the parameter space as summarized below in Lemma 3.

Lemma 3.

System (4) has no limit cycles in $\mathcal{I}$ when $\mu=0$ .

The full proof is presented in [5]. See Appendix B for a sketch of the proof.

3.2 Hopf bifurcation at the interior equilibrium

In this section we take $\mu$ as the bifurcation parameter to study the effect of different mutation rates on the system dynamics. We investigate the possible bifurcations of the dynamics (4) as $\mu$ varies in $[0,1]$ . We analyze the stability of the interior equilibrium and prove that it involves a Hopf bifurcation which leads to periodic orbits.

We first invoke the Hopf bifurcation theorem [6] which will be used to prove the existence of stable limit cycles.

Theorem 4 (Hopf bifurcation theorem).

Suppose that the system $\dot{y}=F(y,\rho),y\in\mathbb{R}^{2},\rho\in\mathbb{R}$ has an equilibrium $(y_{0},\rho_{0})$ at which the following properties are satisfied:

1.

the Jacobian $D_{y}F|_{(y_{0},\rho_{0})}$ has a simple pair of pure imaginary eigenvalues $\lambda(\rho_{0})$ and $\bar{\lambda}(\rho_{0})$ ;
2.

$\frac{d\Re(\lambda(\rho))}{d\rho}|_{\rho_{0}}\neq 0$ .

Then the dynamics undergo a Hopf bifurcation at $(y_{0},\rho_{0})$ , which results in a family of periodic solutions in a sufficiently small neighborhood of $(y_{0},\rho_{0})$ .

Then we make an assumption about some parameters of system (4).

Assumption 5.

We assume that $0<ad-bc\leq\frac{\Delta}{\theta^{2}}.$

The following theorem demonstrates that the interior equilibrium $(x^{*},r^{*})$ of system (4) undergoes a Hopf bifurcation.

Theorem 6.

Under Assumption 5, the interior equilibrium $(x^{*},r^{*})$ of system (4) with the bifurcation parameter $\mu$ undergoes a supercritical Hopf bifurcation at $\mu=\mu_{1}:=\frac{\theta^{2}(ad-bc)}{\Delta}$ , which leads to the existence of stable limit cycles for $\mu<\mu_{1}$ in the vicinity of $\mu_{1}$ .

{pf}

To prove the existence of Hopf bifurcation, we need to show that the two conditions of Theorem 4 are satisfied in system (4). The Jacobian of the vector field of (4) at $(x^{*},r^{*})$ is given by

J^{*}=\begin{bmatrix}\frac{\theta^{2}(ad-bc)-\mu\Delta}{\theta(\theta+1)\zeta}&\frac{-\theta\zeta}{(\theta+1)^{3}}\\ \frac{(\theta+1)(c\theta+d\theta^{2}-\mu\hat{\theta})(a\theta+b\theta^{2}+\mu\hat{\theta})}{\theta^{2}\zeta^{2}}&0\end{bmatrix},

(16)

where

\hat{\theta}=(\theta^{3}+\theta^{2}-\theta-1)

and

\Delta=(a+c)(1+\theta^{2}+2\theta^{3})+(b+d)(2\theta+\theta^{2}+\theta^{4}).

Note that the off-diagonal entries

J_{12}=\frac{-\theta\zeta}{(\theta+1)^{3}}<0

and

J_{21}=\frac{(\theta+1)(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta^{2}\zeta^{2}}>0

because of (6). One can calculate the eigenvalues of $J^{*}$

	$\displaystyle\lambda(\mu)$	$\displaystyle=\frac{J_{11}\pm\sqrt{J_{11}^{2}+4J_{12}J_{21}}}{2}$		(17)
		$\displaystyle=\Re(\lambda(\mu))\pm\Im(\lambda(\mu))i.$		(17)

It is easy to check that when $\mu=\mu_{1}:=\frac{\theta^{2}(ad-bc)}{\Delta}$ the eigenvalues are purely imaginary, i.e.,

		$\displaystyle\Re(\lambda(\mu_{1}))=0,$
		$\displaystyle\Im(\lambda(\mu_{1}))=\sqrt{\frac{(c\theta+d\theta^{2}-\mu\hat{\theta})(a\theta+b\theta^{2}+\mu\hat{\theta})}{\theta(\theta+1)^{2}\zeta}},$

which implies that the first condition listed in Theorem 4 is satisfied. In addition, the second condition is also satisfied because

\left.\frac{d\Re(\lambda(\mu))}{d\mu}\right|_{\mu_{1}}=\frac{-\Delta}{\theta(\theta+1)\zeta}<0.

Then one can conclude that the dynamics (4) undergo a Hopf bifurcation at $\mu=\mu_{1}$ . Since $\mu$ varies in the interval $(0,1]$ , to guarantee the occurrence of Hopf bifurcation, the critical parameter value $\mu_{1}$ should also be constrained in this interval, i.e., $0<\mu_{1}\leq 1$ , which leads to

0<ad-bc\leq\frac{\Delta}{\theta^{2}},

which is exactly Assumption 5. The Hopf bifurcation theorem implies that a family of periodic orbits bifurcate from $(x^{*},r^{*})$ for some $\mu$ in the vicinity of $\mu_{1}$ . Whether the bifurcated periodic orbits are stable or not is determined by the so-called first Lyapunov coefficient $\ell_{1}$ at $(x^{*},r^{*})$ when $\mu=\mu_{1}$ . If $\ell_{1}<0$ , then these periodic orbits are stable limit cycles; if $\ell_{1}>0$ , the periodic orbits are repelling. Following the computation procedure in Appendix C, we obtain $\ell_{1}(\mu_{1})$ as below

	$\displaystyle\ell_{1}(\mu_{1})$	$\displaystyle=$		(18)
		$\displaystyle\frac{\theta(bc-ad)\zeta((b+d)(\theta^{3}+2\theta)+(a+c)(2\theta^{3}+1))}{2\omega_{0}^{3}(\theta+1)^{4}\Delta},$		(18)

which is negative because of (6) and Assumption 5. Thus, the bifurcated periodic orbits are stable limit cycles.

Furthermore, in view of (17), for $\mu<\mu_{1}$ , $(x^{*},r^{*})$ is unstable since the eigenvalues have positive real parts. On the other hand, the interior equilibrium is asymptotically stable for $\mu>\mu_{1}$ with the negative eigenvalues. Hence, the stable limit cycles exist for $\mu<\mu_{1}$ in the vicinity of $\mu_{1}$ . Since the bifurcation is associated with stable limit cycles, it is a supercritical Hopf bifurcation. ∎ The relationship between $ad$ and $bc$ has played an important role in Theorem 6 in the sense that Hopf bifurcation can only happen when $ad-bc>0$ . In view of the Jacobian (16), when $ad-bc\leq 0$ , $J_{11}$ will remain non-positive no matter what value $\mu$ takes. Thus, as $\mu$ varies in $[0,1]$ , the stability of the interior equilibrium does not change, which excludes the existence of a Hopf bifurcation. On the other hand, when $ad-bc>0$ , to ensure there is a Hopf bifurcation for $\mu\in[0,1]$ , Assumption 5 has to be satisfied as proved.

From the Hopf bifurcation theory, we know that the limit cycles generated from Hopf bifurcation have the amplitude of $O(\sqrt{\mu_{1}-\mu})$ . Since Hopf bifurcation is a local bifurcation, the obtained results only hold for $\mu$ in the vicinity of $\mu_{1}$ . Next, we will show that system (4) exhibits another type of bifurcation.

3.3 Heteroclinic bifurcation

In the case $\mu=0$ , it has been shown in Lemma 3 that the system (4) cannot exhibit limit cycles for any parameters. We show in this section that limit cycles may occur for $\mu>0$ in the vicinity of $0$ .

When $\mu=0$ , system (4) has four equilibria on the boundary $\partial\mathcal{I}$ . Denote the four equilibria respectively by $E_{1}=(0,0)$ , $E_{2}=(1,0)$ , $E_{3}=(1,1)$ , and $E_{4}=(0,1)$ . Then, one can calculate the Jacobians at these points, which are

		$\displaystyle J_{1}=\begin{bmatrix}b&0\\ 0&-1\end{bmatrix},$	$\displaystyle J_{2}=\begin{bmatrix}-a&0\\ 0&\theta\end{bmatrix},$		(19)
		$\displaystyle J_{3}=\begin{bmatrix}c&0\\ 0&-\theta\end{bmatrix},$	$\displaystyle J_{4}=\begin{bmatrix}-d&0\\ 0&1\end{bmatrix}.$		(19)

It is easy to see that the eigenvalues of these equilibria are indeed the diagonal entries of the associated Jacobian matrices, and thus all of these equilibria are saddle points. Next we note that the trace of each Jacobian, which is denoted by $T_{i}$ , is

T_{1}=b-1,~{}T_{2}=\theta-a,~{}T_{3}=c-\theta,~{}T_{4}=1-d.

A heteroclinic cycle on the boundary denoted by $\Lambda$ , which connects the four equilibria and has the orientation $E_{1}\rightarrow E_{2}\rightarrow E_{3}\rightarrow E_{4}\rightarrow E_{1}$ , can be identified. Its stability is determined by the payoff parameters.

Lemma 7.

Consider system (4) at $\mu=0$ . There exists a heteroclinic cycle $\Lambda$ , which is stable (resp. unstable) when the condition $ad>bc$ (resp. $ad<bc$ ) is satisfied.

We refer to the reference [35] for the complete proof for the case of the stable heteroclinic cycle. And the opposite case can be proved using the analogous method according to [9, Corollary 2].

When $\mu>0$ , the four corners are no longer equilibria, and the heteroclinic cycle $\Lambda$ is “broken”. Now we are going to show that the system (4) undergoes a heteroclinic bifurcation at $\mu=0$ when certain condition holds, and a limit cycle with the same stability of the corresponding heteroclinic cycle may be generated from $\Lambda$ for $\mu>0$ sufficiently close to $0$ .

Definition 8.

The internal $\sigma$ -neighborhood of $\Lambda$ is the set of all points in $\mathcal{I}$ that are at a distance less than $\sigma>0$ from $\Lambda$ .

Theorem 9.

Consider system (4). If $a>\theta$ , $b<1$ , $c<\theta$ , and $d>1$ , then there exist $\sigma,\epsilon>0$ , such that for $0<\mu<\epsilon$ , there is at most one limit cycle $\Gamma$ in the internal $\sigma$ -neighborhood of $\Lambda$ , and $\Gamma$ (if it exists) is stable.

{pf}

The proof is straightforward by applying the theorem of the heteroclinic bifurcation [16, Theorem 2.3.2, 2.3.3]. When $a>\theta$ , $b<1$ , $c<\theta$ , and $d>1$ hold, which implies $ad>bc$ . Then according to Lemma 7, there is a heteroclinic cycle when $\mu=0$ and it is stable.

In addition, all the traces $T_{i}$ of the Jacobians of the equilibria are negative, while their product $\prod_{i=1}^{4}T_{i}$ is positive. Then according to [16, Theorem 2.3.3], there exist sufficiently small $\sigma,\epsilon>0$ such that for $0<\mu<\epsilon$ , system (4) has at most one limit cycle in the internal $\sigma$ -neighbourhood of $\Lambda$ , and this limit cycle (if it exists) is stable due to the stability of $\Lambda$ . ∎ When $ad<bc$ , according to the second statement of Lemma 7, the heteroclinic cycle is unstable. It is impossible for limit cycles to be generated from this unstable heteroclinic cycle in view of Lemma 2, which claims that there are no closed orbits in $\mathcal{I}$ for any $\mu\geq 0$ when $ad-bc<0$ .

Compared with the result of Hopf bifurcation, the conclusion of Heteroclinic bifurcation is relatively weaker since Theorem 9 only states that there may exist a unique limit cycle in some neighborhood of $\Lambda$ . To study the exact conditions for the existence of the unique limit cycle is beyond this work.

Another difference between the limit cycles generated from Hopf bifurcation and Heteroclinic bifurcation is that the limit cycle arising from Hopf bifurcation is small in size in the phase space, as demonstrated in the simulation results in Fig. 2, while the other can be large. In these simulations the payoff matrices are given by

\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}=\begin{bmatrix}3.5&1\\ 0.5&0.8\end{bmatrix},~{}~{}\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}=\begin{bmatrix}2&0.2\\ 2.5&1.2\end{bmatrix}.

(20)

The parameter $\theta$ is chosen to be $1$ , and thus the interior equilibrium is fixed and independent of $\mu$ . In view of Theorem 6, the Hopf bifurcation point is $\mu_{1}=0.1543$ .

3.4 Global dynamics

As stated before, the limit cycle analysis for Hopf and Heteroclinic bifurcations are limited to the vicinity of the bifurcation points since the used methods rely mainly on linearization. To study if the limit cycle persists for $\mu\in(0,\mu_{1})$ , it is necessary to analyze the dynamics of (4) for all such $\mu$ .

3.4.1 Boundary equilibria and their stability

Recall that the condition (6), which involves $\mu$ , guarantees that there is a unique interior equilibrium. For some given payoff parameters and fixed values of $\theta$ , there may be some $\mu$ such that there is no interior equilibrium. In this case, the system dynamics are relatively trivial since the trajectories will converge to some limit sets on the boundary according to the Poincaré-Bendixson theorem. In the following part, we concentrate on the situation where an interior equilibrium always exists for all $\mu\in[0,1]$ . Then, we have the following assumption.

Assumption 10.

We assume hereafter that $-(\theta a+\theta^{2}b)<(\theta-1)(\theta+1)^{2}<\theta c+\theta^{2}d$ .

Assumption 10 ensures that there is a unique interior equilibrium point for all $\mu\in[0,1]$ in system (4). It is noted that when $\theta=1$ , Assumption 10 is satisfied trivially.

It is easy to check that the boundary $\partial\mathcal{I}:\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup\mathcal{B}_{l}\cup\mathcal{B}_{r}$ itself and each of its sides are invariant under (4) in the case $\mu=0$ . When $0<\mu\leq 1$ , the sides $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ remain invariant, while $\mathcal{B}_{l}$ and $\mathcal{B}_{r}$ do not. As $\mu$ varies, all the possible boundary equilibria lie on the sides $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ , but the positions of these equilibria change. Let us try to determine the locations of the equilibria on $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ .

Lemma 11.

There is only one equilibrium on each side $\mathcal{B}_{b}$ and $\mathcal{B}_{t}$ in system (4). Moreover, the equilibria are located in the sets $\{(x,r):x\in(\max\{1/2,1/(\theta+1)\},1),~{}r=0\}$ and $\{(x,r):x\in(0,\min\{1/2,1/(\theta+1)\}),~{}r=1\}$ respectively.

{pf}

See Appendix D. ∎

One can examine the stability of these boundary equilibria by evaluating the corresponding Jacobian matrices.

Lemma 12.

All the boundary equilibria of system (4) are unstable for $\mu\in(0,1]$ .

{pf}

We only show the case on the side $\mathcal{B}_{b}$ . For the case of $\mathcal{B}_{t}$ , one can obtain the same result analogously. Denote the equilibrium on the side $\mathcal{B}_{b}$ by $(x_{b}^{*},0)$ . We obtain its Jacobian matrix as below

J_{b}^{*}=\begin{bmatrix}\begin{aligned} &3x_{b}^{*^{2}}(b-a)-2\mu+b\\ &+2x_{b}^{*}(a-2b)\end{aligned}&\star\\ 0&(\theta+1)x_{b}^{*}-1\end{bmatrix},

(21)

where the symbol $\star$ stands for $-x_{b}^{*^{3}}(-c+d-a+b)-x_{b}^{*}(d+b)+x_{b}^{*^{2}}(-c+2d-a+2b)$ . As the matrix is upper triangular, the entry $(\theta+1)x_{b}^{*}-1$ is one of its eigenvalues. According to Lemma 11, one has $x_{b}^{*}\in(\max\{1/2,1/(\theta+1)\},1)$ . Then the eigenvalue $(\theta+1)x_{b}^{*}-1$ will be positive for any $\theta>0$ , which implies that the equilibrium is unstable. ∎

By checking the sign of $\dot{x}$ on the left side $\mathcal{B}_{l}$ and the right side $\mathcal{B}_{r}$ , one observes that it is always positive and negative respectively. It can be easily verified that the vectors on these two sides point inwards to $\mathrm{int}(\mathcal{I})$ . The sides $\mathcal{B}_{b}$ and $\mathcal{B}_{t}$ are positively invariant under system (4), and thus the dynamics on each side of $\mathcal{B}_{b}$ and $\mathcal{B}_{t}$ are easy to analyze. For example, on the side $\mathcal{B}_{b}$ , one has $\dot{r}=0$ , $\dot{x}>0$ at $x=0$ and $\dot{r}=0$ , $\dot{x}<0$ at $x=1$ . As there is only one equilibrium $(x_{b}^{*},0)$ on the side $\mathcal{B}_{b}$ , one can easily verify that $\dot{x}>0$ for $x\in[0,x_{b}^{*})$ and $\dot{x}<0$ for $x\in(x_{b}^{*},1]$ . Thus, trajectories starting from $\mathcal{B}_{b}\setminus(x_{b}^{*},0)$ will converge to the unique equilibrium $(x_{b}^{*},0)$ . The similar result can be obtained for the dynamics on $\mathcal{B}_{t}$ .

In addition, we can show that the boundary $\partial\mathcal{I}$ is repelling for $\mu\in(0,1]$ .

Lemma 13.

The boundary $\partial\mathcal{I}$ of system (4) is repelling for $\mu\in(0,1]$ .

{pf}

See Appendix E. ∎

With Lemma 13 in hand, we are ready to present some global results of the system dynamics in $\mathcal{I}$ .

Theorem 14.

Under Assumption 5, for $\mu\in(\mu_{0},1]$ , the interior equilibrium $(x^{*},r^{*})$ of system (4) is asymptotically stable, and all trajectories starting from $\mathcal{I}\setminus(\mathcal{B}_{t}\cup\mathcal{B}_{b})$ will converge to $(x^{*},r^{*})$ .

{pf}

Recall the two critical parameter values $\mu_{0}$ and $\mu_{1}$ , which appear respectively in Lemma 2 and Theorem 6. One can check that $\mu_{0}\geq\mu_{1}$ when Assumption 5 holds. Again according to Lemma 2, it is impossible for system (4) to have closed orbits in $\mathcal{I}$ when $\mu>\mu_{0}$ . In addition, the interior equilibrium is locally asymptotically stable in view of (16). Due to the fact that the boundary $\partial\mathcal{I}$ is repelling as shown in Lemma 13, any trajectories starting from $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ cannot converge to $\partial\mathcal{I}$ . Then according to the Poincaré-Bendixson theorem, because of the non-existence of closed orbits or other equilibria in $\text{int}(\mathcal{I})$ , the trajectories with any initial points in $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ will converge to $(x^{*},r^{*})$ . Similarly, trajectories starting from the sides $\mathcal{B}_{l}$ and $\mathcal{B}_{r}$ also converge to $(x^{*},r^{*})$ . ∎ So the interior equilibrium is almost globally stable in system (4), because almost all trajectories in $\mathcal{I}$ (except for those starting from $\mathcal{B}_{t}\cup\mathcal{B}_{b}$ ) converge to the stable equilibrium $(x^{*},r^{*})$ . We apply similar terminology to the stability of the limit cycle when it is stable and almost all trajectories converge to it.

3.4.2 Uniqueness of limit cycles in the case $\theta=1$

Assumption 15.

We assume that $\theta=1$ henceforth.

Under Assumption 15, the enhancement effect and the degradation effect are balanced. In this case, the system can be transformed to a generalized Liénard system. However, the existence and uniqueness of limit cycles for $\mu\in(0,\mu_{1}]$ in the case $\theta\neq 1$ is still open.

Lemma 16.

Under Assumption 5, when $a+c=b+d$ , the system dynamics in (4) admit at most one limit cycle in $\text{int}(\mathcal{I})$ for $\mu\in(0,\mu_{1})$ .

{pf}

Under Assumption 15, when $a+c=b+d$ , the interior equilibrium, denoted by $q^{*}=(1/2,(a+b)/(2(b+d)))$ , is fixed. And the system dynamics (4) are reduced to be

\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[x(d-c)-r(d+b)+b]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)(2x-1).\end{cases}

(22)

We apply the transformation of the coordinates on (22), $x\rightarrow x^{\prime},~{}r\rightarrow 1-r^{\prime}$ , and obtain the following system

\begin{cases}&\begin{aligned} \dot{x}^{\prime}=&x^{\prime}(1-x^{\prime})[x^{\prime}(d-c)+r^{\prime}(d+b)-d]\\ &+\mu(1-2x^{\prime})\end{aligned}\\ &\dot{r}^{\prime}=-r^{\prime}(1-r^{\prime})(2x^{\prime}-1).\end{cases}

The interior equilibrium $q^{*}$ of (22) now becomes $q^{\prime*}=(1/2,r^{\prime*})$ with $r^{\prime*}=(a+b)/(2(b+d))$ . By using the change of variables $x^{\prime}\rightarrow\tilde{x}+1/2$ and $r^{\prime}\rightarrow\tilde{r}+r^{\prime*}$ , we can shift the equilibrium $q^{\prime*}$ to the origin of a new system as below

\begin{cases}&\begin{aligned} \dot{\tilde{x}}=&\alpha(\tilde{x})\left[(d+b)\tilde{r}+\frac{8\mu_{1}\tilde{x}\alpha(\tilde{x})-2\mu\tilde{x}}{\alpha(\tilde{x})}\right]\end{aligned}\\ &\dot{\tilde{r}}=-2\beta(\tilde{r})\tilde{x},\end{cases}

(23)

which is defined in the region $\tilde{\mathcal{I}}=\{(\tilde{x},\tilde{r}):\tilde{x}\in(-1/2,1/2),\tilde{r}\in(-\tilde{r}^{*},1-\tilde{r}^{*})\}$ with $\alpha(\tilde{x})=(1/4-\tilde{x}^{2})$ and $\beta(\tilde{r})=(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*})$ . The system is now in the form of the generalized Liénard system [28].

One can observe that the functions $\alpha(\tilde{x})$ and $\beta(\tilde{r})$ are always positive in the domain. Multiplying the vector field of (23) by the positive function $1/(\alpha(\tilde{x})\beta(\tilde{r}))$ , we obtain

\begin{cases}&\dot{\tilde{x}}=\varphi(\tilde{r})-F(\tilde{x},\tilde{r})\\ &\dot{\tilde{r}}=-g(\tilde{x}),\end{cases}

(24)

where $\varphi(\tilde{r})=\frac{(b+d)\tilde{r}}{\beta(\tilde{r})}$ , $F(\tilde{x},\tilde{r})=\frac{2\mu\tilde{x}-8\mu_{1}\tilde{x}\alpha(\tilde{x})}{\alpha(\tilde{x})\beta(\tilde{r})}$ , and $g(\tilde{x})=\frac{2\tilde{x}}{\alpha(\tilde{x})}$ . The point-wise direction of the vector field of (23) is maintained in (24). Hence the existence and uniqueness of limit cycles for the two systems (23) and (24) are equivalent.

Under Assumption 5, if $0<\mu<\mu_{1}$ holds, it can be validated that there always exists a $\nu=\sqrt{\frac{\mu_{1}-\mu}{4\mu_{1}}}$ such that the following conditions are satisfied:

1.

$G(-\nu)=G(\nu)$ where $G(\tilde{x}):=\int_{0}^{\tilde{x}}g(s)ds$ ;
2.

$\forall\tilde{x}\in(-\nu,0)$ , the function $\tilde{r}\mapsto\frac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})}$ is strictly decreasing both on $0<\tilde{r}<1-r^{\prime*}$ and $-r^{\prime*}<\tilde{r}<0$ ; $\forall\tilde{x}\in(0,\nu)$ , the function $\tilde{r}\mapsto\frac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})}$ is strictly increasing both on $0<\tilde{r}<1-r^{\prime*}$ and $-r^{\prime*}<\tilde{r}<0$ ;
3.

$\forall\tilde{x}\in(-\nu,\nu)$ , $\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*})$ , one has $g(\tilde{x})F(\tilde{x},\tilde{r})\leq 0$ ;
4.

$\forall\tilde{x}\notin(-\nu,\nu)$ , $\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*})$ , one has $F(\tilde{x},\tilde{r})\geq 0$ ; $\forall\tilde{r}\in(-r^{\prime*},1-r^{\prime*})$ , the function $\tilde{x}\mapsto F(\tilde{x},\tilde{r})$ is increasing both on $-1/2<\tilde{x}<-\nu$ and $\nu<\tilde{x}<1/2$ .

The detailed expressions of the functions involved in the above conditions can be found in Appendix F. It is not difficult to check these conditions, so the process is omitted here.

Then according to [28, Theorem 1, Corrollary 1], system (24) has at most one limit cycle in $\tilde{\mathcal{I}}$ . Therefore, the original system (22) admits at most one limit cycle in $\text{int}(\mathcal{I})$ . ∎

It is noted that when $a+c=b+d$ , the coefficient of the cross term $xr$ is $0$ , and thus this cross term disappears in the system equations of (22). So the nonlinearity of this system is reduced. In this case, when system (22) is transformed into the Liénard form, the properties $(1)-(4)$ listed in the proof of Lemma 16 allow us to use the Liénard system theory to prove that there is at most one limit cycle in $\text{int}(\mathcal{I})$ for $\mu\in(0,\mu_{1})$ . Unfortunately, those properties are not satisfied when $a+c\neq b+d$ , and thus the existence and the number of limit cycles in this case remain unclear.

Now we are in the position to present the last result of this section about the uniqueness of limit cycle and stability of the interior equilibrium.

Theorem 17.

Under Assumption 5, when $a+c=b+d$ ,

1.

for $\mu\in(0,\mu_{1}$ ), system (4) has exactly one limit cycle that is almost globally stable;
2.

for $\mu\in[\mu_{1},1]$ , the interior equilibrium $(x^{*},r^{*})$ is almost globally stable.

{pf}

Under Assumption 5, when $a+c=b+d$ , according to Lemma 16, the system (4) has at most one limit cycle for $0<\mu<\mu_{1}$ . As shown in Theorem 6, the eigenvalues of the Jacobian at $(x^{*},r^{*})$ have positive real parts, implying that it is repulsive. Then all trajectories starting in the small neighborhood of this equilibrium (excluding the equilibrium) will diverge, and the existence of infinite many periodic orbits in the neighborhood is excluded. From Lemma 13, one also knows that the boundary $\partial\mathcal{I}$ is repelling. According to the Poincaré-Bendixson theorem, for any initial states in $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ , its $\omega$ -limit set must be a closed orbit. If there are no limit cycles in $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ , then every trajectory starting from $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ will form a periodic orbit, and an infinite number of periodic orbits will fill $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ . This contradicts the repulsiveness of the interior equilibrium and the boundary [28].

Suppose in $\text{int}(\mathcal{I})\setminus\{(x^{*},r^{*})\}$ , there is some connected region which consists of an infinite number of neutral periodic orbits and is enclosed by two periodic orbits denoted by $\Gamma_{1}$ and $\Gamma_{2}$ . Then $\Gamma_{1}$ and $\Gamma_{2}$ must be limit cycles, which contradicts Lemma 16 regarding the maximal number of limit cycles. Therefore, one can conclude that there is exactly one stable limit cycle in $\mathcal{I}$ , and it is attractive for $\mathcal{I}\setminus(\mathcal{B}_{t}\cup\mathcal{B}_{b}\cup(x^{*},r^{*}))$ .

Note that when $a+c=b+d$ and $\theta=1$ , we have $\mu_{0}=\mu_{1}$ . Thus, the second claim is easy to verify according to Theorems 6 and 14. ∎ Fig. 3 shows the plot of the unique limit cycle of system (4) when $\mu$ changes from $0$ to $0.4$ . The payoff matrices used in the simulations are shown below, and they satisfy the condition $a+c=b+d$ .

\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}=\begin{bmatrix}3.5&2\\ 0.5&1\end{bmatrix},~{}~{}\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}=\begin{bmatrix}2&0.2\\ 3&3.2\end{bmatrix}.

(25)

The Hopf bifurcation point is computed to be $\mu_{1}=0.1633$ . It can be observed that the limit cycle “collides” with the heteroclinic cycle lying on the boundary and the interior equilibrium as $\mu$ approaches $0$ and $0.1633$ respectively. In addition, it disappears after $\mu$ passes $0.1633$ .

4 Incentive-based Control

In real biological and social systems, the mutation or exploration rate is usually small. For example, an extremely high mutation rate for a biological entity may reach $1/400$ per site as reported in [3]. As known from Section 3, although the Hopf bifurcation point $\mu_{1}$ depends on the payoff parameters, limit cycles exist in system (4) for $\mu<\mu_{1}$ . It means that in many situations stable limit cycles persist in the co-evolutionary dynamics of the game and environment. On the one hand, the shapes and positions of limit cycles in nonlinear systems are generally difficult to identify analytically; on the other hand, such limit cycle oscillations, although different than the (oscillating) tragedy of the commons, are not desirable from a control point of view. In addition, considering the public resource management, the issues of how to best govern common-pool resources have long been a common concern in the field of economics [20]. Thus, in this section we intend to study how to design suitable control policies such that the following two objectives are attained:

(a).

alleviate or even eliminate the oscillation;
(b).

increase the environmental resource stock.

One needs to choose what the control input should be. Although multiple possible approaches have been proposed in the evolutionary games field, perhaps the most plausible control input is to offer suitable incentives to be added to the payoff of the desired strategies [26]. Now we consider that in each game interaction, the cooperator receives certain constant incentive $u\geq 0$ from an external regulating authority.

Recall that the entries on the first row of $A(r)$ are the payoffs to a C player. The external incentive will be added to the first row of (1). Thus, the control input $u$ is incorporated into the previously defined payoff matrix (1) as follows

		$\displaystyle A_{u}(r)=(1-r)\begin{bmatrix}R_{1}&S_{1}\\ T_{1}&P_{1}\end{bmatrix}+r\begin{bmatrix}R_{2}&S_{2}\\ T_{2}&P_{2}\end{bmatrix}+\begin{bmatrix}u&u\\ 0&0\end{bmatrix}$		(26)
		$\displaystyle=\begin{bmatrix}&r(R_{2}-R_{1})+R_{1}+u&r(S_{2}-S_{1})+S_{1}+u\\ &r(T_{2}-T_{1})+T_{1}&r(P_{2}-P_{1})+P_{1}\end{bmatrix}.$		(26)

Substituting (26) into equations (4) results in the following system with the control input $u$

\begin{cases}&\begin{aligned} \dot{x}=&x(1-x)[xr(-c+d-a+b)+x(a-b)\\ &-r(d+b)+b+u]+\mu(1-2x)\end{aligned}\\ &\dot{r}=r(1-r)(2x-1).\end{cases}

(27)

To emphasize the effect of the control input $u$ on the system behaviors, we have assumed that the enhancement effect and the degradation effect are balanced by fixing the parameter $\theta$ to be $1$ (under Assumption 15). In the following part we will study whether such a constant incentive can achieve the desired control objectives.

4.1 Stable equilibrium

System (27) has a unique interior equilibrium which is given by $(x_{c}^{*},r_{c}^{*})=(1/2,(a+b+2u)/(a+c+b+d))$ when $0\leq u<(c+d)/2$ . It is noted that the interior equilibrium is shifted up vertically, i.e., the $x$ -coordinate of this equilibrium is the same as the equilibrium of the system without control, while the value of the $r$ -coordinate increases. Then we evaluate the Jacobian at $(x_{c}^{*},r_{c}^{*})$ , which is given by

J_{c}^{*}=\begin{bmatrix}\frac{ad-bc+(b+d-a-c)u}{2(a+b+c+d)}-2\mu&\frac{-(a+b+c+d)}{8}\\ \frac{2(a+b+2u)(c+d-2u)}{(a+b+c+d)^{2}}&0\end{bmatrix}.

(28)

For this matrix, it is easy to identify that the real parts of its eigenvalues are of the same sign, which is the same as the first entry.

When $u\geq(c+d)/2$ , no equilibria exist in the interior of $\mathcal{I}$ , and all the equilibria are located on the sides $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ . For system (27), if $u\geq 0$ , it is easy to identify that the $x$ -coordinate of the equilibrium on $\mathcal{B}_{b}$ is in the range $x\in(1/2,1)$ by using a similar approach in Lemma 11. Then the same argument of Lemma 12 is applicable, so one can ensure that this equilibrium is unstable for $\text{int}(\mathcal{I})$ . Similarly, on the side $\mathcal{B}_{t}$ , when $0<u<(c+d)/2$ , the equilibria are located in the range $x\in(0,1/2)$ , and thus they are unstable. However, when $u>(c+d)/2$ , the situation will be different, and we have the following statement for the equilibria.

Lemma 18.

Consider system (27). When $u>(c+d)/2$ , on the side $\mathcal{B}_{t}$ , there exists one equilibrium, denoted by $(x_{t}^{*},1)$ , satisfying $x_{t}^{*}\in(1/2,1)$ . And the other possible equilibria are all located in the set $\{(x,r):x\in(0,1/2),~{}r=1\}$ .

{pf}

See Appendix G. ∎ It is easy to check the stability of these equilibria by evaluating the Jacobian.

Lemma 19.

When $u>(c+d)/2$ , the boundary equilibrium $(x_{t}^{*},1)$ is locally asymptotically stable under system (27), while the other equilibria (if they exist) are unstable.

{pf}

One can check the local stability of $(x_{t}^{*},1)$ by examining the Jacobian which is given by

J_{t}^{*}=\begin{bmatrix}\begin{aligned} &3(c-d){x_{t}^{*}}^{2}+2(2d-c-u)x_{t}^{*}\\ &-d+u-2\mu\end{aligned}&\bullet\\ 0&1-2x_{t}^{*}\end{bmatrix},

(29)

where the symbol $\bullet$ stands for $-x_{t}^{*^{3}}(-c+d-a+b)-x_{t}^{*}(d+b)+x_{t}^{*^{2}}(-c+2d-a+2b)$ . Note that the eigenvalues of $J_{t}^{*}$ are actually the two diagonal entries. According to Lemma 18, obviously the eigenvalue $\lambda_{1}=1-2x_{t}^{*}$ is negative.

For the other eigenvalue $\lambda_{2}=3(c-d){x_{t}^{*}}^{2}+2(2d-c-u)x_{t}^{*}-d+u-2\mu$ , one can prove it is negative in all the three cases regarding the relationship of $c$ and $d$ . To illustrate, we only show the case of $c>d$ here, since the other cases are similar. Suppose $c=d+\epsilon$ with $\epsilon>0$ . From Appendix G, we have $1/2<x_{t}^{*}<(u-d)/\epsilon$ , then one can check that

	$\displaystyle\lambda_{2}$	$\displaystyle=(3{x_{t}^{}}^{2}-2x_{t}^{})\epsilon+(u-d)(1-2x_{t}^{*})-2\mu$
		$\displaystyle<(3{x_{t}^{}}^{2}-2x_{t}^{})\epsilon+\epsilon x_{t}^{}(1-2x_{t}^{})-2\mu$
		$\displaystyle=({x_{t}^{}}^{2}-x_{t}^{})\epsilon-2\mu$
		$\displaystyle<-2\mu.$

Thus, the eigenvalue $\lambda_{2}$ is negative. The fact of two negative eigenvalues ensures that the equilibrium $(x_{t}^{*},1)$ is locally asymptotically stable. The instability of other possible equilibria is easy to check since there is always a positive eigenvalue in view of (29). ∎

Then, we define a Dulac function which is similar to that in Section 3.1

\varphi_{c}(x,r)=x^{\alpha}(1-x)^{\beta}r^{\gamma-0.5u}(1-r)^{\delta+0.5u},

(30)

where the parameters $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ are the same as in (11) with $\theta=1$ . Denote the vector field of (27) by $f_{c}(x,r)$ . After the multiplication of $\varphi_{c}(x,r)$ , we compute the divergence of the modified vector field $\varphi_{c}f_{c}(x,r)$ in $\text{int}(\mathcal{I})$ , which yields

	$\displaystyle\text{div}~{}\varphi_{c}f_{c}(x,r)$	$\displaystyle=\varphi_{c}(x,r)[g(x)\mu-2\mu+(1.5+\alpha)u$		(31)
		$\displaystyle~{}~{}~{}-\frac{(bc-ad)}{2(a+b+c+d)}],$		(31)

with $g(x)$ being the same as in (12), which takes the maximum value at $\bar{x}$ as shown in (13).

Next we discuss in two cases and show that there always exists some $u$ such that system (27) has a stable equilibrium for the whole interior.

Theorem 20.

For system (27), when $a+c>b+d$ , there exists some $u_{1}:=\frac{ad-bc-(4-g(\bar{x}))\mu(a+b+c+d)}{a+c-b-d}$ such that

1.

if $u_{1}<(c+d)/2$ , the interior equilibrium $(x_{c}^{*},r_{c}^{*})$ is almost globally stable under $u\in(u_{1},(c+d)/2)$ ;
2.

if $u_{1}>(c+d)/2$ , the boundary equilibrium $(x_{t}^{*},1)$ is almost globally stable under $u>(c+d)/2$ .

{pf}

When $a+c>b+d$ , we have $\alpha=-(1+\frac{a+c}{a+b+c+d})\in(-2,-1.5)$ , so the coefficient of the term with $u$ in (31), i.e., $1.5+\alpha$ , is negative. Thus, if

u>u_{1}:=\frac{ad-bc-(4-g(\bar{x}))\mu(a+b+c+d)}{a+c-b-d},

(32)

the right hand side of (31) is always negative, which implies that there are no closed orbits in $\mathcal{I}$ .

By checking the Jacobian (28), one can see that the interior equilibrium $(x_{c}^{*},r_{c}^{*})$ is stable when

u>u_{2}:=\frac{ad-bc-4\mu(a+b+c+d)}{a+c-b-d}.

As $g(\bar{x})\geq 0$ , we have $u_{1}\geq u_{2}$ . Then if $u_{1}<(c+d)/2$ , one can choose some $u\in(u_{1},(c+d)/2)$ such that the interior equilibrium $(x_{c}^{*},r_{c}^{*})$ exists and is asymptotically stable. Due to the non-existence of closed orbits in $\mathcal{I}$ and other stable equilibria, and also because of the repulsiveness of $\partial\mathcal{I}$ , one can conclude that almost all trajectories will converge to $(x_{c}^{*},r_{c}^{*})$ asymptotically.

On the other hand, if $u_{1}\geq(c+d)/2$ , one can choose some $u>(c+d)/2$ which leads to the occurrence of the equilibrium $(x_{t}^{*},1)$ . In view of (29), the equilibrium $(x_{t}^{*},1)$ is locally asymptotically stable. The other equilibria on $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ are all unstable, and they are located on the sets whose sufficient close neighborhoods in $\text{int}(\mathcal{I})$ are repelling. Hence, it is impossible for trajectories starting from $\text{int}(\mathcal{I})$ to converge to these points. As a consequence, $(x_{t}^{*},1)$ is the only $\omega$ -limit set for all initial states in $\text{int}(\mathcal{I})$ from the Poincaré-Bendixson theorem. Similarly, this result extends to the sides $\mathcal{B}_{l}$ and $\mathcal{B}_{r}$ by following the preceding analysis. ∎

Theorem 21.

For system (27), when $a+c\leq b+d$ , the boundary equilibrium $(x_{t}^{*},1)$ is almost globally stable under $u>(c+d)/2$ .

{pf}

When $a+c\leq b+d$ , the first entry of (28) will always be positive for any $u$ if $\frac{ad-bc}{2(a+b+c+d)}-2\mu>0$ , in which case the equilibrium will always be unstable (saddle). Thus, it is impossible to stabilize the interior equilibrium with any positive $u$ if it is unstable in the uncontrolled system. However, one can choose some $u>(c+d)/2$ such that there is an equilibrium $(x_{t}^{*},1)$ on $\mathcal{B}_{t}$ which can be proved to be almost globally stable by using the same argument in the second case of Theorem 20. ∎

In Theorems 20 and 21, the relationship between $a+c$ and $b+d$ determines whether the interior equilibrium can be stabilized or not under the positive control input $u$ . In view of (28), one can observe that the stability of the interior equilibrium can be changed from unstable to stable when $a+c>b+d$ under a suitable $u$ , while the stability is not qualitatively affected by $u$ when $a+c\leq b+d$ .

When there exists an equilibrium that is globally stable for $\mathcal{I}$ , obviously the limit cycle oscillation is eliminated. No matter whether this stable equilibrium is in the interior or on the top side. The second designed goal is also achieved, since the $r$ -coordinates of the equilibria are higher. One can find some illustrations in Fig. 4.

4.2 Reduced amplitude of the limit cycle

For the case of $a+c=b+d$ , although it is impossible to stabilize the interior equilibrium with a suitable $u$ , one can observe that the amplitude of the oscillation on $r$ -coordinate decreases under some small $u$ .

As $a+c=b+d$ , it is noted that the control input does not affect the stability of equilibrium $(x_{c}^{*},r_{c}^{*})$ according to the Jacobian (28). However, one can transform system (27) into a form of Liénard system as (24) in the same manner. Since the listed conditions are satisfied straightforwardly, according to Lemma (16), for $\mu<(0,\mu_{1})$ , system (27) has a unique limit cycle when $u\in[0,(c+d)/2)$ . When there is some intermediate control input $u\in(0,(c+d)/2)$ , the corresponding interior equilibrium is shifted towards the top side $\mathcal{B}_{t}$ . To some extent, the vector field between the interior equilibrium and the top side is “compressed”. The position of the limit cycle is shifted up, and the amplitude of the cyclic oscillation on $r$ -coordinate is reduced at the same time. In this sense, the oscillation is alleviated. Some numerical results can be found in Fig. 5.

5 Conclusions and Future Work

We have investigated the co-evolutionary dynamics of the game and environment when strategies’ mutations are taken into account. We used the replicator-mutator model to depict the strategic evolution. By using bifurcation theory we showed that the system at the interior equilibrium undergoes a Hopf bifurcation which generates stable limit cycles; on the other hand, we showed there is a heteroclinic bifurcation which may also generate a stable limit cycle. Our analysis highlights the important role that mutations play in the integrated system dynamics. For the co-evolutionary system, the stable limit cycle corresponds to a sustained oscillation of population’s decisions and richness of the environmental resource. Compared with the neutral periodic oscillation or heteroclinic cycle oscillation that have been exhibited in previous studies using the replicator model, such limit cycle oscillations can better explain the phenomena that appear in real biological or social systems. Moreover, we proved that under certain parameter conditions, the system always admits a unique stable limit cycle which is attractive for almost the entire domain.

Such a robust limit cycle oscillation may provide implications and theoretical support for relevant studies in biology and sociology. For example, in the microbial context, some cooperating bacteria can produce costly public good, while the defecting bacteria only profit. Thus, the richness of the public good is enhanced by the cooperating bacteria and degraded by the defectors. In the process of bacterial reproduction, mutations are common. The amount of mutant offsprings can have important impact on the dynamic outcomes of the biological systems. For such microbial systems, [2] shows that the dynamics can converge to an interior equilibrium, while [35] shows that persistent heteroclinic oscillations can exist. However, the results obtained in this work theoretically reveal that a moderate limit cycle oscillation can exist in addition to the convergence to a steady state or a heteroclinic cycle. Thus, this work complements the previous studies.

We also considered the incentive-based control applied into the co-evolutionary dynamics, and showed the effectiveness of such control on maintaining the environmental resource. Namely, compared with the uncontrolled system, the incentive will increase the level of environmental resource in most cases. In particular, one can apply some intermediate incentive to let the system have an interior equilibrium that is stable for the whole interior of domain. If the interior equilibrium cannot be stabilized, some large incentive could be put into use to make the system dynamics converge to a boundary equilibrium with full environmental resource.

The obtained results in the control part of this work can provide valuable insights into many practical issues in nature, such as the common-pool resource management, environmental and sustainable development, and ecological diversity conservation. For example, the global climate change is one of the biggest challenges of our times. The strategic decisions of individuals, corporations, and governments have long-term environmental consequence that will, in turn, alter the strategic landscape those parties face [32]. To mitigate the global warming, all concerned parties are called upon to reduce greenhouse gas emissions. In such a socio-ecological system, the reduction of greenhouse gas emissions can be taken as the environmental factor of interest. Policy-makers can design and implement economic incentives aiming at adapting individual decisions to collectively agreed goals [31]. Hence, the incentive-based policy proposed in this work shows the power of incentive-based control policy and leads to some directions in mitigating the global warming issue.

In this paper, we have used a more general and realistic model, replicator-mutator dynamics, to describe the strategic evolution, but the environment dynamics is still described by the simplified logistic model. Thus, it is of great interest to study the game dynamics under different types of feedback of the practical environmental resource, such as renewing or decaying resource. The replicator or replicator-mutator equations describe the evolutionary game dynamics in an infinite well-mixed population. Due to this shortcoming, they are not suitable to study game dynamics in finite or structured populations. Therefore, another direction of future studies is to develop more appropriate individual-based models to study the game dynamics under the environmental feedback. Last but not least, in addition to the incentive-based control, there could exist other kinds of control policies that can be more effective in various specific situations.

References

[1] F. Blanchini and S. Miani. Set-theoretic methods in control. Springer, New York, 2008.
[2] S.P. Brown and F. Taddei. The durability of public goods changes the dynamics and nature of social dilemmas. PLoS One, 2(7):e593, 2007.
[3] S. Gago, S.F. Elena, R. Flores, and R. Sanjuán. Extremely high mutation rate of a hammerhead viroid. Science, 323(5919):1308–1308, 2009.
[4] L. Gong, J. Gao, and M. Cao. Evolutionary game dynamics for two interacting populations in a co-evolving environment. Proceedings of the 57th IEEE Conference on Decision and Control, pages 3535–3540, 2018.
[5] L. Gong, W. Yao, J. Gao, and M. Cao. Limit cycles in replicator-mutator dynamics with game-environment feedback. IFAC-PapersOnLine, 2020.
[6] J. Guckenheimer and P. Holmes. Nonlinear oscillations, dynamical systems and bifurcations of vector fields. Springer, New York, 2000.
[7] C. Hauert, C. Saade, and A. McAvoy. Asymmetric evolutionary games with environmental feedback. Journal of Theoretical Biology, 462:347–360, 2019.
[8] M.W. Hirsch, S. Smale, and R.L. Devaney. Differential Equations, Dynamical Systems & An Introduction to Chaos. Elsevier, Amsterdam, 2004.
[9] J. Hofbauer. Heteroclinic cycles in ecological differential equations. In Equadiff 8, pages 105–116. Mathematical Institute, Slovak Academy of Sciences, 1994.
[10] J. Hofbauer and K. Sigmund. Evolutionary Games and Population Dynamics. Cambridge University Press, Cambridge, 1998.
[11] Y. Kawano, L. Gong, B.D.O. Anderson, and M. Cao. Evolutionary dynamics of two communities under environmental feedback. IEEE Control Systems Letters, 3(2):254–259, 2019.
[12] N.L. Komarova. Replicator-mutator equation, universality property and population dynamics of learning. Journal of Theoretical Biology, 230:227–239, 2004.
[13] Y.A. Kuznetsov. Elements of Applied Bifurcation Theory. Springer, New York, 2004.
[14] J. Lee, Y. Iwasa, U. Dieckmann, and K. Sigmund. Social evolution leads to persistent corruption. Proceedings of the National Academy of Sciences of USA, 116(27):13276–13281, 2019.
[15] Y. Lin and J.S. Weitz. Spatial interactions and oscillatory tragedies of the commons. Phys. Rev. Lett., 122:148102, 2019.
[16] D. Luo, X. Wang, D. Zhu, and M. Han. Bifurcation Theory and Methods of Dynamical Systems. World Scientific, 1997.
[17] T. Morimoto, T. Kanazawa, and T. Ushio. Subsidy-based control of heterogeneous multiagent systems modeled by replicator dynamics. IEEE Transactions on Automatic Control, 61(10):3158–3163, 2016.
[18] D. Muratore and J. S. Weitz. Infect while the iron is scare: Nutrient explicit phage-bateria games. Theoretical Ecology, 14:467–487, 2021.
[19] M.A. Nowak, N.L. Komarova, and P. Niyogi. Evolution of universal grammar. Science, 291:114–118, 2001.
[20] E. Ostrom, Cambridge University Press, R. Calvert, and T. Eggertsson. Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press, Cambridge, 2015.
[21] K. Paarporn, C. Eksin, J. S. Weitz, and Y. Wardi. Optimal control policies for evolutionary dynamics with environmental feedback. In 2018 IEEE Conference on Decision and Control (CDC), pages 1905–1910, 2018.
[22] D. Pais, C.H. Caicedo-Nunez, and N.E. Leonard. Hopf bifurcation and limit cycles in evolutionary network dynamics. SIAM Journal on Applied Dynamical Systems, 11(4):1754–1884, 2013.
[23] D. Pais and N. E. Leonard. Limit cycles in replicator-mutator network dynamics. In 2011 50th IEEE Conference on Decision and Control and European Control Conference, pages 3922–3927, 2011.
[24] D.G. Rand, D. Tomlin, A. Bear, E.A. Ludvig, and J.D. Cohen. Cyclical population dynamics of automatic versus controlled processing: An evolutionary pendulum. Psychological review, 124(5):626, 2017.
[25] J. Riehl and M. Cao. Towards optimal control of evolutionary games on networks. IEEE Transactions on Automatic Control, 62(1):458–462, 2017.
[26] J. Riehl, P. Ramazi, and M. Cao. A survey on the analysis and control of evolutionary matrix games. Annual Reviews in Control, 45:87 – 106, 2018.
[27] J. Riehl, P. Ramazi, and M. Cao. Incentive-based control of asynchronous best-response dynamics on binary decision networks. IEEE Transactions on Control of Network Systems, 6(2):727–736, 2019.
[28] M. Sabatini and G. Villari. Limit cycle uniqueness for a class of planar dynamical systems. Applied Mathematics Letters, 19:1180–1184, 2006.
[29] M. F. Singer. Liouvillian first integrals of differential equations. Transactions of the American Mathematical Society, 333:673–688, 1992.
[30] L. Stella and D. Bauso. Bio-inspired evolutionary game dynamics in symmetric and asymmetric models. IEEE Control Systems Letters, 2(3):405–410, 2018.
[31] R. Sullivan. Corporate responses to climate change: Achieving emissions reductions through regulation, self-regulation and economic incentives. Routledge, 2017.
[32] A.R. Tilman, J. Plotkin, and E. Akcay. Evolutionary games with environmental feedbacks. Nature Communications, 11:915, 2020.
[33] A. Traulsen, C. Hauert, H. De Silva, M.A. Nowak, and K. Sigmund. Exploration dynamics in evolutionary games. Proceedings of the National Academy of Sciences, 106(3):709–712, 2009.
[34] X. Wang, Z. Zheng, and F. Fu. Steering eco-evolutionary game dynamics with manifold control. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2233):20190643, 2020.
[35] J.S. Weitz, C. Eksin, K. Paarporn, S.P. Brown, and W.C. Ratcliff. An oscillating tragedy of the commons in replicator dynamics with game-environment feedback. Proceedings of the National Academy of Sciences of USA, 113(47):E7518–E7525, 2016.
[36] S. Wiggins. Introduction to Applied Nonlinear Dynamical Systems and Chaos. Springer, New York, 2000.
[37] B. Zhu, X. Xia, and Z. Wu. Evolutionary game theoretic demand-side management and control for a class of networked smart grid. Automatica, 70:94–100, 2016.

Appendix A Proof of Lemma 1

{pf}

The equations (10) can be taken as a set of linear equations with respect to unknown variables $\alpha$ , $\beta$ , $\gamma$ , and $\delta$ . It suffices to show that (10) always has solutions. This can be done by checking the ranks of the coefficient matrix $C$ and augmented matrix $[C|D]$ , which are given by

\begin{bmatrix}&\begin{aligned} &(-c+d\\ &~{}-a+b)\end{aligned}&\begin{aligned} &(-c+d\\ &-a+b)\end{aligned}&0&0\\ &(a-b)&(a-b)&0&0\\ &\begin{aligned} &(-c+2d\\ &-a+2b)\end{aligned}&(d+b)&-(\theta+1)&-(\theta+1)\\ &(a-2b)&-b&(\theta+1)&0\\ &-(d+b)&0&1&1\\ \end{bmatrix},

(33)

\begin{bmatrix}&\begin{aligned} &(-c+d\\ &~{}-a+b)\end{aligned}&\begin{aligned} &(-c+d\\ &-a+b)\end{aligned}&0&0&\begin{aligned} &3(c-d\\ &+a-b)\end{aligned}\\ &(a-b)&(a-b)&0&0&3(b-a)\\ &\begin{aligned} &(-c+2d\\ &-a+2b)\end{aligned}&(d+b)&-(\theta+1)&-(\theta+1)&\varpi\\ &(a-2b)&-b&(\theta+1)&0&\begin{aligned} &(4b-2a\\ &-\theta-1)\end{aligned}\\ &-(d+b)&0&1&1&b+d-2\\ \end{bmatrix},

(34)

where $\varpi=2(c-2d+a-2b+\theta+1)$ .

One can easily check that $\text{rank}~{}C\equiv\text{rank}~{}[C|D]\leq 4$ , which implies that (10) has at least one set of solutions. Actually when $-c+d-a+b=0$ and $a-b=0$ , (10) will have infinite many solutions; otherwise, it will have exactly one solution. ∎

Appendix B Proof of Lemma 3

{pf}

For the boundary of $\mathcal{I}$ , the same argument presented in the proof of Lemma 2 is also applicable to exclude the existence of periodic orbits that lie on or intersect with the boundary. Thus we only need to focus on the case of the interior $\text{int}(\mathcal{I})$ .

When $\mu=0$ , the divergence of $\varphi f(x,r)$ is

\frac{\partial(\varphi f_{1})}{\partial x}(x,r)+\frac{\partial(\varphi f_{2})}{\partial r}(x,r)=\varphi(x,r)[(1+\alpha)b-1-\gamma].

(35)

If there exist periodic orbits in $\text{int}(\mathcal{I})$ , the Bendixson-Dulac criterion would imply that either $\varphi(x,r)[(1+\alpha)b-1-\gamma]$ is identically zero or it changes sign. Because $\varphi(x,r)$ is strictly positive, we have $(1+\alpha)b-1-\gamma=0$ . Then (35) turns out to be

\frac{\partial(\varphi f_{1})}{\partial x}(x,r)=-\frac{\partial(\varphi f_{2})}{\partial r}(x,r).

(36)

Thus system (4) has a first integral [[29]] in $\text{int}(\mathcal{I})$

V(x,r)=\int\varphi f_{2}dx-\varphi f_{1}dr,

(37)

whose derivative over time $t$ satisfies

\frac{dV}{dt}=\frac{\partial V}{\partial x}\dot{x}+\frac{\partial V}{\partial r}\dot{r}=\varphi f_{1}f_{2}-\varphi f_{1}f_{2}\equiv 0,

and hence $V(x,r)$ remains constant along the solutions of (4).

By definition, a limit cycle is an isolated periodic orbit, which is the $\alpha$ -limit set or $\omega$ -limit set for some initial points that do not lie on the orbit [[8]]. Suppose there is a limit cycle $\Gamma_{0}$ in $\text{int}(\mathcal{I})$ , and an arbitrary trajectory starts from the initial point $(x_{0},r_{0})$ and spirals toward $\Gamma_{0}$ as $t\rightarrow+\infty$ or $t\rightarrow-\infty$ . Then according to [8, Corollary 10.1], $(x_{0},r_{0})$ must have a neighborhood $\Omega$ such that $\Gamma_{0}$ is the $\omega$ -limit set for all points in it. Due to the fact that $V(x,r)$ is the first integral for system (4), $V(x,r)$ must be constant in $\Omega$ because of the continuity. However, the partial derivatives of $V(x,r)$

		$\displaystyle\frac{\partial V}{\partial x}(x,r)=2\varphi(x,r)f_{2}(x,r)$
		$\displaystyle\frac{\partial V}{\partial r}(x,r)=-2\varphi(x,r)f_{1}(x,r).$

only equal $0$ at $(x^{*},r^{*})$ , which leads to a contradiction. Therefore, system (4) cannot have limit cycles in $\mathcal{I}$ when $\mu=0$ . ∎

Appendix C Computation of first Lyapunov coefficient

The following is the process of calculating $\ell_{1}$ as presented in [13]. Consider a planar autonomous system $\dot{y}=f(y,\mu)$ where $y\in\mathbb{R}^{2}$ and $\mu\in\mathbb{R}$ . Let $A_{0}$ be the Jacobian matrix evaluated at the equilibrium $y_{0}$ and the Hopf bifurcation point $\mu_{h}$ . Suppose $A_{0}$ has two purely imaginary complex conjugate eigenvalues, given by $\pm i\omega_{0}$ . The algorithm for computing $\ell_{1}$ can be summarized as follows:

1.

compute the complex vectors $q,p\in\mathbb{C}^{2}$ such that

$A_{0}q=i\omega_{0}q,~{}~{}A_{0}^{T}p=-i\omega_{0}p,~{}\langle p,q\rangle=1$

where $\langle p,q\rangle=\bar{p}^{T}q$ is the inner product;
2.

compose the expression with a complex variable $z$

$H(z,\bar{z})=\langle p,f(y_{0}+zq+\bar{z}\bar{q},\mu_{h})\rangle,$

and find the coefficients $g_{20}$ , $g_{11}$ , and $g_{21}$ in the formal Taylor series

$H(z,\bar{z})=i\omega_{0}z+\sum_{j+k\geq 2}\frac{1}{j!k!}g_{jk}z^{j}\bar{z}^{k};$
3.

the first Lyapunov coefficient $\ell_{1}(y_{0},\mu_{h})$ is given by

$\ell_{1}(y_{0},\mu_{h})=\frac{1}{2\omega_{0}^{2}}\Re(ig_{20}g_{11}+\omega_{0}g_{21}).$

Consider system (4). When $\mu=\mu_{1}$ , we have

A_{0}=\begin{bmatrix}0&\frac{-\theta\zeta}{(\theta+1)^{3}}\\ \frac{(\theta+1)(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta^{2}\zeta^{2}}&0\end{bmatrix},

(38)

\omega_{0}=\sqrt{\frac{(c\theta+d\theta^{2}-\mu_{1}\hat{\theta})(a\theta+b\theta^{2}+\mu_{1}\hat{\theta})}{\theta(\theta+1)^{2}\zeta}}.

(39)

Then following the above process, we obtain

	$\displaystyle\ell_{1}(\mu_{1})$	$\displaystyle=$		(40)
		$\displaystyle\frac{\theta(bc-ad)\zeta((b+d)(\theta^{3}+2\theta)+(a+c)(2\theta^{3}+1))}{2\omega_{0}^{3}(\theta+1)^{4}\Delta}.$		(40)

Appendix D Proof of Lemma 11

{pf}

One only needs to check the case of $\mathcal{B}_{b}$ , since the other case is analogous. When $r(0)=0$ initially, we have $\dot{r}=0$ , and thus one can focus on the following one-dimensional dynamics about $x$ :

\dot{x}=x(1-x)[ax+b(1-x)]+\mu(1-2x),~{}~{}x\in[0,1].

(41)

Thus, the equilibrium’s $x$ -coordinates are the solutions of the following equation

x(1-x)[ax+b(1-x)]=\mu(2x-1),~{}~{}x\in[0,1].

(42)

The left hand side (LHS) of (42) is non-negative in the interval $x\in[0,1]$ and is exactly equal to $0$ at $x=0,1$ , while the right hand side (RHS) of (42), $\mu(2x-1)$ , is negative for $x\in[0,1/2)$ and positive for $x\in(1/2,1]$ when $\mu\in(0,1]$ . Thus, the LHS and RHS of (42) will be equal at some $x\in(1/2,1)$ .

Now, we will show that the LHS and RHS of (42) will equal for exactly one value of $x$ in the range at $(1/2,1)$ . Denote the LHS of (42) by $\phi(x)$ . Its first derivative is $\phi^{\prime}(x)=-3(a-b)x^{2}+2(a-2b)x+b$ , and the second derivative is $\phi^{\prime\prime}(x)=2(a-2b)-6(a-b)x$ . We now discuss in the following cases:

1).

When $a-b=0$ , one has $\phi^{\prime\prime}(x)=-2b<0$ as $b>0$ , which implies $\phi(x)$ is concave. In this case, $\phi(x)$ equals the RHS of (42) exactly once when $x\in(1/2,1)$ .
2).
When $a-b\neq 0$ , the solution to $\phi^{\prime\prime}(x)=0$ is $x=\frac{a-2b}{3(a-b)}$ . Then,
1. i.
  
  when $\frac{a-2b}{3(a-b)}\leq 0$ or $\frac{a-2b}{3(a-b)}\geq 1$ , one has $b<a\leq 2b$ or $a<b\leq 2a$ , respectively. In both cases, $\phi^{\prime\prime}(x)$ is non-positive for $x\in[0,1]$ . Thus, $\phi(x)$ is concave in $[0,1]$ , which results in the same conclusion as in case 1).
2. ii.
  
  when $0<\frac{a-2b}{3(a-b)}<1$ , one has $a>2b$ or $b>2a$ . If $a>2b$ , we have $\frac{a-2b}{3(a-b)}<\frac{1}{3}$ , which results in $\phi^{\prime\prime}(x)<0$ for $x\in(1/3,1]$ . Namely, $\phi(x)$ is concave in $x\in(1/3,1]$ . The same conclusion follows. In the other case, if $b>2a$ , we have $\frac{a-2b}{3(a-b)}>\frac{1}{2}$ . Then, one can check that $\phi^{\prime}(x)$ is negative for $x\in(1/2,1]$ . The strictly decreasing function $\phi(x)$ will only be equal to the RHS of (42) for one time when $x\in(1/2,1]$ .

When $\theta\geq 1$ , the latter statement of the lemma holds trivially since $1/(\theta+1)\leq 1/2$ . For the case $\theta<1$ , one can substitute $x=1/(\theta+1)$ into the RHS of (41), which yields $\frac{\mu(\theta-1)(\theta+1)^{2}+\theta(a+\theta b)}{(\theta+1)^{3}}$ . This value is positive for any $\mu\in(0,1]$ because of (6). Since the RHS of (41) is negative at $x=1$ , then due to continuity, the RHS of (41) equals $0$ when $x\in(1/(\theta+1),1)$ . ∎

Appendix E Proof of Lemma 13

{pf}

The proof is inspired by [10, Theorem 12.2.1]. As stated before, the vectors on the left side $\mathcal{B}_{l}$ and the right side $\mathcal{B}_{r}$ point inwards to $\mathrm{int}(\mathcal{I})$ . This implies that these two sides are repelling. Therefore, the subsequent discussion is restricted to the top and bottom sides $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ , which are positively invariant.

We define a partial distance function $F:[0,1]\to\mathbb{R}$ as below:

F(r)=r(1-r),

(43)

which indicates the distance of the system states $(x,r)$ to the top and bottom sides $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ . It is positive in $\text{int}(\mathcal{I})$ and only equals zero on these two sides. In addition, as $r$ increases from $0$ to $1$ , the function value $F(r)$ monotonically increases from $0$ to the maximal value at $r=1/2$ and then monotonically decreases to $0$ at $r=1$ . The derivative of $F$ with respect to time $t$ is

\displaystyle\dot{F}(r)=\dot{r}(1-2r)=F(r)\Phi(x,r),

(44)

where $\Phi(x,r):=(1-2r)[(\theta+1)x-1]$ . We want to show that there exists a $\hat{\delta}>0$ such that $\liminf_{t\to\infty}F(r(t))>\hat{\delta}$ , which implies that $\mathcal{B}_{t}$ and $\mathcal{B}_{b}$ are repelling. We only discuss the case of $\mathcal{B}_{b}$ here, as the analysis for $\mathcal{B}_{t}$ is similar and thus omitted. Denote the unique interior equilibrium by $q^{*}=(x^{*},r^{*})$ , as given before. Choose $0<r^{\prime}<\min\{r^{*},1/2\}$ , and let $m=F(r^{\prime})=r^{\prime}(1-r^{\prime})$ . We define a lower region by $I_{l}(m):=\{(x,r)\in\mathcal{I}:0<F(r)\leq m,0<r\leq r^{\prime}\}$ . Note that $I_{l}(m)\cap\mathcal{B}_{b}=\emptyset$ , and the condition $0\leq F(r)\leq m$ is redundant in this definition, but it simply indicates that the set is related to the partial distance function evaluated at $r=r^{\prime}$ . Denote the solution of the dynamics (4) by $\xi(t):=(x(t),r(t))$ , given some initial condition $\xi(0)=(x(0),r(0))\in\mathcal{I}$ .

First we show that if $\xi(0)\in I_{l}(m)$ , then there exists a time instant $T>0$ , such that $\xi(T)\notin I_{l}(m)$ . Define a line $L_{c}$ in the lower region $I_{l}(m)$ by $L_{c}:=\{(x,r)\in I_{l}(m):x=1/(\theta+1)\}$ . Note that on this line, $\dot{r}=0$ and $\dot{x}\geq\epsilon>0$ for some positive constant $\epsilon$ , and thus the vectors point horizontally towards the positive $x$ -direction. The line $L_{c}$ divides the lower region $I_{l}(m)$ into the left region $J_{L}:=\{(x,r)\in I_{l}(m):0\leq x<1/(\theta+1)\}$ and the right region $J_{R}:=\{(x,r)\in I_{l}(m):1/(\theta+1)<x\leq 1\}$ . Note that $\dot{F}=F(r)\Phi(x,r)>0$ in $J_{R}$ while $\dot{F}<0$ in $J_{L}$ . Therefore, if $\xi(0)\in J_{R}$ , the trajectory of $r(t)$ will increase, and there exists a $T>0$ such that $\xi(T)\notin I_{l}(m)$ (due to the directions of vectors on the line $L_{c}$ , the right side $\mathcal{B}_{r}$ and $\dot{F}(r)\geq\epsilon^{\prime}>0$ for some positive constant $\epsilon^{\prime}$ in a sufficiently large compact set $\Omega$ satisfying $\{(x,r)\in J_{R}:r=r^{\prime}\}\subset\Omega\subset J_{R}$ and $\xi(0)\in\Omega$ ). If $\xi(0)\in J_{L}$ , then the trajectory $\xi(t)$ will not always stay in $J_{L}$ due to the Poincaré-Bendixson theorem, since there are no equilibria in $J_{L}$ . As $\dot{F}(r)<0$ in $J_{L}$ , the trajectory of $r(t)$ will decrease, and $x(t)$ will cross the center line $L_{c}$ , and enter the right region $J_{R}$ . Due to the uniqueness of solutions of (4), similar to the previous analysis of the right region, there exits a $T>0$ such that $\xi(T)\notin I_{l}(m)$ .

Define $\overline{I_{l}}(m):=I_{l}(m)\cup\mathcal{B}_{b}$ . Next we show that there exists $n\in(0,m)$ , such that if $\xi(0)\notin\overline{I_{l}}(m)$ , then $\xi(t)\notin I_{l}(n):=\{(x,r)\in\mathcal{I}:0<F(r)\leq n,0<r\leq r^{\prime}\}$ for all $t\geq 0$ . For $z\in\overline{I_{l}}(m)$ , let $T_{m}(z):=\inf\{T\geq 0:\xi(0)=z,\xi(T)\notin I_{l}(m)\}$ . Note that when $z\in\mathcal{B}_{b}$ , then $T_{m}(z)=0$ trivially since $I_{l}(m)\cap\mathcal{B}_{b}=\emptyset$ . As we have shown previously, any trajectory starting in the lower region $I_{l}(m)$ will leave this region at some finite time instant. Therefore, we can define $\overline{T}:=\sup_{z\in\overline{T_{l}}(m)}T_{m}(z)<\infty$ . Let $t_{0}$ be the first time instant when the trajectory $\xi(t)$ reaches $\overline{I_{l}}(m)$ (i.e., $t_{0}=\min\{T>0:\xi(T)\in\overline{I_{l}}(m)\}$ ), and let $\xi(t_{0})=(x(t_{0}),r(t_{0}))$ . It is obvious that $r(t_{0})=r^{\prime}$ and $F(r(t_{0}))=m$ , namely, the trajectory reaches the boundary of the lower region $I_{l}(m)$ at time $t_{0}$ . Let $\Phi_{\rm min}=\min_{(x,r)\in\mathcal{I}}\Phi(x,r)$ (the minimum is attainable on the compact set $\mathcal{I}$ ). Then for $t\in(t_{0},t_{0}+\overline{T})$ , we have

\frac{1}{t-t_{0}}\int_{t_{0}}^{t}\Phi(x,r)dt\geq\Phi_{\rm min}.

(45)

Since $\Phi(x,r)=\dot{F}(r)/F(r)$ , the equation (45) can be further derived:

\frac{1}{t-t_{0}}\int_{t_{0}}^{t}\frac{d}{dt}\big{(}\ln F(r)\big{)}dt=\frac{1}{t-t_{0}}\ln\frac{F(r(t))}{F(r(t_{0}))}\geq\Phi_{\rm min},

which implies

F(r(t))\geq F(r(t_{0}))e^{\Phi_{\rm min}(t-t_{0})}>me^{\Phi_{\rm min}\overline{T}}.

Let $n=me^{\Phi_{\rm min}\overline{T}}$ . Therefore we have shown that the trajectory will not reach $I_{l}(n)$ for $t\in(t_{0},t_{0}+\overline{T})$ . Since $\overline{T}$ is the supremum of the duration when the solution could stay in the lower region $I_{l}(m)$ , then there exists $T^{\prime\prime}\in(t_{0},t_{0}+\overline{T}]$ such that $\xi(T^{\prime\prime})\notin I_{l}(m)$ . Therefore, the trajectory $\xi(t)$ will leave the lower region $I_{l}(m)$ without reaching $I_{l}(n)\subset I_{l}(m)$ . Repeating this argument, we conclude that the trajectory $\xi(t)$ will never reach $I_{l}(n)$ . Thus we have shown that there exists $0<\hat{\delta}<n$ such that $\liminf_{t\to\infty}F(r(t))>\hat{\delta}$ , and thus the lower side $\mathcal{B}_{b}$ is repelling. ∎

Appendix F Functions Expressions in Proof of Lemma 16

(1)

One does not need to obtain the explicit expression of $G(\tilde{x})$ , since the integrand, $g(s)=\frac{2s}{\alpha(s)}=\frac{2s}{1/4-s^{2}}$ is an odd function. Obviously, we have $G(\nu)=\int_{0}^{\nu}g(s)ds=-\int^{0}_{-\nu}g(s)ds=G(-\nu)$ ;
(2)

$\dfrac{F(\tilde{x},\tilde{r})}{\varphi(\tilde{r})}=\dfrac{2\mu\tilde{x}-8\mu_{1}\tilde{x}(1/4-\tilde{x}^{2})}{(1/4-\tilde{x}^{2})(b+d)\tilde{r}}$ ;
(3)

$g(\tilde{x})F(\tilde{x},\tilde{r})=\dfrac{4\mu\tilde{x}^{2}-16\mu_{1}\tilde{x}^{2}(1/4-\tilde{x}^{2})}{{(1/4-\tilde{x}^{2})}^{2}(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*})}$ ;
(4)

$F(\tilde{x},\tilde{r})=\dfrac{2\mu\tilde{x}-8\mu_{1}\tilde{x}(1/4-\tilde{x}^{2}))}{(1/4-\tilde{x}^{2})(\tilde{r}+r^{\prime*})(1-\tilde{r}-r^{\prime*})}$ .

Appendix G Proof of Lemma 18

{pf}

In view of the system equations (27), the $x$ -coordinates of the equilibria on $\mathcal{B}_{t}$ are obtained by solving the following polynomial equation

x(1-x)[x(d-c)-d+u]=\mu(2x-1),~{}~{}x\in[0,1].

(46)

The RHS of (46), $\mu(2x-1)$ , is a linear function of $x$ which equals $0$ at $x=1/2$ . It is negative for $x\in(0,1/2)$ and positive for $x\in(1/2,1)$ when $\mu\in(0,1]$ . The LHS of (46) equals $0$ at the two end points $x=0,1$ .

When $u>(c+d)/2$ and $c=d$ , the LHS will be always positive for $x\in(0,1)$ , such that the root of (46) is in the range $(1/2,1)$ . The value of LHS will change the sign from positive to negative (resp. negative to positive) as $x$ increases from $0$ to $1$ when $c>d$ (resp. $c<d$ ), and it equals $0$ also at $x=(d-u)/(d-c)$ . One has $(d-u)/(d-c)<1/2$ and $(d-u)/(d-c)>1/2$ respectively for $c<d$ and $c>d$ . For both cases, there will be some $x\in(1/2,1)$ such that the RHS and LHS are equal as shown geometrically in Fig. (6). Therefore, one can conclude that there is always one equilibrium with $x$ -coordinate being $1/2<x_{t}^{*}<1$ .

Next, only in the case $c<d$ , the curves of the LHS and RHS can have one or two intersections in the range $0<x<1/2$ . Thus, the other possible equilibria on $\mathcal{B}_{t}$ can only be located in $x\in(0,1/2)$ . ∎

	$\displaystyle\lambda_{2}$	$\displaystyle=(3{x_{t}^{}}^{2}-2x_{t}^{})\epsilon+(u-d)(1-2x_{t}^{*})-2\mu$
		$\displaystyle<(3{x_{t}^{}}^{2}-2x_{t}^{})\epsilon+\epsilon x_{t}^{}(1-2x_{t}^{})-2\mu$
		$\displaystyle=({x_{t}^{}}^{2}-x_{t}^{})\epsilon-2\mu$
		$\displaystyle<-2\mu.$

Limit Cycles Analysis and Control of Evolutionary Game Dynamics with Environmental Feedback

Abstract

keywords:

keywords:

1 Introduction

2 Problem Formulation

2.1 Environment-dependent payoffs

2.2 Replicator-Mutator equations

2.3 Closed-loop system with game-environment feedback

3 System Dynamics and Bifurcations

3.1 Non-existence of closed orbits

Lemma 1.

Lemma 2.

Lemma 3.

3.2 Hopf bifurcation at the interior equilibrium

Theorem 4 (Hopf bifurcation theorem).

Assumption 5.

Theorem 6.

3.3 Heteroclinic bifurcation

Lemma 7.

Definition 8.

Theorem 9.

3.4 Global dynamics

3.4.1 Boundary equilibria and their stability

Assumption 10.

Lemma 11.

Lemma 12.

Lemma 13.

Theorem 14.

3.4.2 Uniqueness of limit cycles in the case θ=1\theta=1

Assumption 15.

Lemma 16.

Theorem 17.

4 Incentive-based Control

4.1 Stable equilibrium

Lemma 18.

Lemma 19.

Theorem 20.

Theorem 21.

4.2 Reduced amplitude of the limit cycle

5 Conclusions and Future Work

References

Appendix A Proof of Lemma 1

Appendix B Proof of Lemma 3

Appendix C Computation of first Lyapunov coefficient

Appendix D Proof of Lemma 11

Appendix E Proof of Lemma 13

Appendix F Functions Expressions in Proof of Lemma 16

Appendix G Proof of Lemma 18

3.4.2 Uniqueness of limit cycles in the case $\theta=1$