Passivity-based Analysis and Design for Population Dynamics with Conformity Biases

Shunya Yamashita (Student Member, IEEE), Kodai Irifune, Takeshi Hatanaka (Senior Member, IEEE), Yasuaki Wasa (Member, IEEE), Kenji Hirata (Member, IEEE), and Kenko Uchida (Member, IEEE)

This work was supported in part by JST-Mirai Program under Grant 18077648, and in part by JSPS KAKENHI under Grant 21J13956. S. Yamashita, K. Irifune and T. Hatanaka are with the School of Engineering, Tokyo Institute of Technology, Tokyo 152-8550, Japan. Y. Wasa and K. Uchida are with the School of Advanced Science and Engineering, Waseda University, Tokyo 169-8555, Japan. K. Hirata is with the Faculty of Engineering, University of Toyama, Toyama 930-8555, Japan.

Abstract

This paper addresses mechanisms for boundedly rational decision makers in discrete choice problems. First, we introduce two mathematical models of population dynamics with conformity biases. We next analyze the models in terms of $\delta$-passivity, and show that the conformity biases work to break the passivity of decision makers. Based on the passivity perspective, we propose mechanisms that induce decision makers toward a desired population state. Furthermore, we analyze the convergence properties of the designed mechanisms, and present parameter conditions that guarantee stable inducement.

Index Terms

Population dynamics, Bounded rationality, Conformity bias, Behavior modification, Passivity

1 Introduction

Human behavior is a critically important factor in the design and analysis of large-scale social systems such as transportation networks [1] and energy management systems [2, 3, 4]. As seen in these publications, a typical approach to such systems involving humans is to model social decisions and dynamics under the assumption of human rationality, against the background of expected utility theory [5]. Meanwhile, behavioral economics has pointed out that human rationality may be bounded due to informational and/or cognitive constraints, as typified by prospect theory [6] and dual-process theory [7]. It is highly uncertain whether a system designed under the assumption of human rationality will work when that rationality is bounded. Motivated by this issue, boundedly rational human/social models have begun to be investigated in the field of systems and control [8, 9]. Among the various types of bounded rationality, this paper focuses on the so-called conformity bias [10], the tendency to follow the majority, which has been observed in parking location choice [11] and evacuation decisions [12].

One class of decision making problems is the discrete choice problem, in which decision makers choose a strategy from a finite number of options [13]. Indeed, several significant issues in society are categorized as discrete choice, for example, route selection in transportation networks [14], choice of energy [15], and water distribution [16]. When dealing with systems that include large populations, decision models should be considered at the population level [17]. Discrete choice behavior of large populations is well studied in evolutionary game theory, in which various types of population dynamics have been presented, e.g., logit dynamics [18, 19], Smith dynamics [20] and pairwise comparison dynamics [21]. Some recent publications have also pointed out relations between population dynamics and passivity [22, 23, 24]. The studies in [22, 23] analyze passivity for generally formulated population dynamics, and [24] focuses particularly on the decision model in a water distribution system. However, the models treated in the above literature do not explicitly consider the influence of biases. Although several publications [25, 26] deal with bounded rationality in population dynamics, they address the models without focusing on passivity.

In social systems, it is sometimes desirable to induce humans toward a desired social state. From the viewpoint of inducing populations, a behavior modification mechanism may be a key concept for achieving a desired behavior. Such mechanisms include incentives [27], which are an economic approach, and nudges [28], which are an informational one. These kinds of mechanisms have recently attracted the attention of the control community [29, 30], and more specifically, mechanisms for population dynamics have also been studied [31, 32]. However, publications that explicitly consider bounded rationality are rare. The authors of [25] propose a nudging mechanism for biased population dynamics, and analyze its stability by using singular perturbation.

In this paper, we address mechanisms for discrete choice problems under bounded rationality. First, we introduce a widely used population dynamics model, called the logit dynamics, and its $\delta$-passivity [23]. Next, we extend the decision making model to one with conformity biases. We present two types of bias models, addressed in [33] and [25], respectively. We then analyze the biased logit dynamics in terms of $\delta$-passivity, and reveal the impact of conformity biases. Inspired by passivity-based control methods, we propose mechanisms that achieve a desired social behavior for each of the two biased models. Furthermore, we analyze their convergence, and present parameter conditions that stabilize the proposed mechanisms.

We summarize the contributions of this paper as follows:

• the impact of conformity biases on population dynamics is shown in terms of passivity, and

• passivity-based mechanisms are presented, which stably achieve a desired social state.

Parts of this paper overlap with the previous work in [34]. The additional bias model and the rigorous analysis of the mechanisms are the new contributions of this paper.

2 Model Description

2.1 Preliminaries: $\delta$ -Passivity

This subsection introduces $\delta$-passivity [23], a concept similar to passivity [35]. Consider a system $\Sigma$ represented by the state space model

\displaystyle\dot{x}=f(x,u),

where $x\in\mathbb{R}^{n}$ is the state, $u\in\mathbb{R}^{n}$ is the input, and $f:\mathbb{R}^{n}\times\mathbb{R}^{n}\to\mathbb{R}^{n}$ is a function. Then, $\Sigma$ is called $\delta$ -passive from $\dot{u}$ to $\dot{x}$ if there exists a positive semi-definite function $S:\mathbb{R}^{n}\times\mathbb{R}^{n}\to[0,\infty)$ and a scalar $\rho\geq 0$ such that

\displaystyle S(u(t),x(t))-S(u(0),x(0))\leq\int_{0}^{t}\dot{x}^{\top}(\hat{t})\dot{u}(\hat{t})-\rho||\dot{u}(\hat{t})||^{2}d\hat{t}

for every input $u$, every initial state $x(0)$ and all $t\geq 0$. The positive semi-definite function $S$ is called a storage function. In addition, $\Sigma$ is called $\delta$-input-strictly-passive if the above inequality is satisfied with some positive scalar $\rho>0$. As is widely known, if $S$ is differentiable, we can replace the above inequality with $\dot{S}(u(t),x(t))\leq\dot{x}^{\top}(t)\dot{u}(t)-\rho\|\dot{u}(t)\|^{2}$.

In the same way as passivity shortage [36], we define $\delta$ -output-passivity-shortage. The system $\Sigma$ is called $\delta$ -output-passivity-short from $\dot{u}$ to $\dot{x}$ if there exists a positive semi-definite function $S:\mathbb{R}^{n}\times\mathbb{R}^{n}\to[0,\infty)$ , and a scalar $\gamma\geq 0$ such that

\displaystyle S(u(t),x(t))-S(u(0),x(0))\leq\int_{0}^{t}\dot{x}^{\top}(\hat{t})\dot{u}(\hat{t})+\gamma||\dot{x}(\hat{t})||^{2}d\hat{t}

for every input $u$, every initial state $x(0)$ and all $t\geq 0$. We call $\gamma$ the impact coefficient. If $S$ is differentiable, the above inequality is equivalent to $\dot{S}(u(t),x(t))\leq\dot{x}^{\top}(t)\dot{u}(t)+\gamma\|\dot{x}(t)\|^{2}$.

A feedback system composed of a strictly passive component and a passivity-short one can be analyzed in terms of Lyapunov stability [35, 37]. Consider a $\delta$-input-strictly-passive system whose input is $u_{1}$ and state is $x_{1}$, which satisfies $\dot{S}_{1}\leq\dot{x}_{1}^{\top}\dot{u}_{1}-\rho\|\dot{u}_{1}\|^{2}$ for a differentiable storage function $S_{1}$ and $\rho>0$. Consider also a $\delta$-output-passivity-short system whose input is $u_{2}$ and state is $x_{2}$, which satisfies $\dot{S}_{2}\leq\dot{x}_{2}^{\top}\dot{u}_{2}+\gamma\|\dot{x}_{2}\|^{2}$ for a differentiable storage function $S_{2}$ and $\gamma>0$. Under the feedback interconnection $u_{1}=x_{2}$ and $u_{2}=-x_{1}$, the cross terms $\dot{x}_{1}^{\top}\dot{x}_{2}$ cancel and $\|\dot{u}_{1}\|=\|\dot{x}_{2}\|$ holds, which provides

\displaystyle\dot{S}_{1}+\dot{S}_{2}\leq(\gamma-\rho)\|\dot{x}_{2}\|^{2}.

If $\gamma\leq\rho$ holds and $S_{1}+S_{2}$ is radially unbounded, the above inequality suggests stability of the feedback system in the sense of Lyapunov [35]. In other words, $\delta$-input-strictly-passive systems can stabilize $\delta$-output-passivity-short ones by negative feedback.

2.2 Dynamic Decision Making Model under Rationality

This subsection introduces the logit dynamics [19, 23], which is a widely used dynamical discrete choice model.

Consider the situation in which decision makers choose a strategy from $n$ available strategies. We denote the strategy set by $\mathcal{A}:=\{1,2,\ldots,n\}$. Suppose that the population of decision makers can be represented as a continuum. We now define $\mathcal{S}:=\{x\in[0,1]^{n}\mid\textbf{1}_{n}^{\top}x=1\}$ as the set of strategy choice distributions, where $\textbf{1}_{n}$ denotes the $n$-dimensional vector whose elements are all $1$. We denote the relative interior of the set $\mathcal{S}$ by $\textrm{int}(\mathcal{S})$. The population state $\pi\in\mathcal{S}$ represents the distribution of strategy choices. Specifically, the $k$-th element of $\pi$, denoted by $\pi_{k}\in[0,1]$, represents the fraction of decision makers selecting strategy $k\in\mathcal{A}$.

Let us define a cost vector $\tau\in\mathbb{R}^{n}$ whose $k$-th element $\tau_{k}\in\mathbb{R}$ corresponds to the cost of choosing strategy $k\in\mathcal{A}$. Then, the logit dynamics [19, 23] is represented as

\displaystyle\dot{\pi}=\eta\left(Q(\tau)-\pi\right),\ \pi(0)\in\textrm{int}(\mathcal{S}),

(1)

where $\eta$ is a positive constant which indicates the update rate. The function $Q:\mathbb{R}^{n}\to\textrm{int}(\mathcal{S})$ gives the steady state of (1) for a constant $\tau$ and is defined by

\displaystyle Q_{k}(\tau)=\frac{\exp(-\beta\tau_{k})}{\sum\limits_{l\in\mathcal{A}}\exp(-\beta\tau_{l})},

(2)

where $Q_{k}:\mathbb{R}^{n}\to(0,1)$ denotes the $k$-th element of $Q$, and $\beta>0$ is a constant. The system (1) is known to guarantee $\pi(t)\in\textrm{int}(\mathcal{S})$ for any time $t\geq 0$. Noting that $Q_{k}(\tau)>Q_{i}(\tau)$ holds for any pair $k\neq i$ satisfying $\tau_{k}<\tau_{i}$, the logit dynamics (1) can be interpreted as an approximate model of best response, i.e., rational decision making [18].
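The logit choice map (2) and the dynamics (1) can be sketched numerically as follows. This is a minimal illustration, assuming a constant cost vector and a forward-Euler discretization; the function names, the step size `dt`, and the example costs are choices made here and do not come from the paper.

```python
import math

def logit_choice(tau, beta=1.0):
    """Logit choice map Q(tau) of (2); shifting by min(tau) avoids overflow."""
    shift = min(tau)
    weights = [math.exp(-beta * (t - shift)) for t in tau]
    total = sum(weights)
    return [w / total for w in weights]

def simulate_logit(pi0, tau, eta=1.0, beta=1.0, dt=0.01, steps=2000):
    """Forward-Euler integration of (1) for a constant cost vector tau."""
    q = logit_choice(tau, beta)
    pi = list(pi0)
    for _ in range(steps):
        # pi_dot = eta * (Q(tau) - pi); each step is a convex combination,
        # so the iterate stays inside the simplex when dt * eta < 1
        pi = [p + dt * eta * (qk - p) for p, qk in zip(pi, q)]
    return pi

tau = [1.0, 2.0, 3.0]          # illustrative costs (assumed values)
q = logit_choice(tau)
pi_final = simulate_logit([0.2, 0.3, 0.5], tau)
```

With these assumed values, the cheapest strategy receives the largest share in `q`, and `pi_final` approaches `q`, in line with the interpretation of $Q(\tau)$ as the steady state of (1).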

The logit dynamics (1) is also known to satisfy $\delta$ -passivity as below.

Lemma 1 ([23])

The logit dynamics (1) is $\delta$ -passive from $-\dot{\tau}$ to $\dot{\pi}$ for the storage function

\displaystyle S(\tau,\pi):=\eta\left(\pi^{\top}\tau+\frac{1}{\beta}\sum_{l\in\mathcal{A}}\pi_{l}\log{\pi_{l}}\right)-\eta\min_{\omega\in\mathcal{S}}\left(\omega^{\top}\tau+\frac{1}{\beta}\sum_{l\in\mathcal{A}}\omega_{l}\log{\omega_{l}}\right),

(3)

i.e., $\dot{S}\leq-\dot{\tau}^{\top}\dot{\pi}$ holds.

Note that the storage function $S(\tau,\pi)$ has a kind of radial unboundedness. See Appendix 6 for the details.
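The storage function (3) can be evaluated numerically with a few lines of code. The sketch below relies on the standard fact that the entropy-regularized minimum in (3) is attained at $\omega=Q(\tau)$, which implies $S(\tau,\pi)=\frac{\eta}{\beta}\sum_{l}\pi_{l}\log(\pi_{l}/Q_{l}(\tau))$; this closed form is an observation added here and is not stated in the text above. All function names and numerical values are illustrative assumptions.

```python
import math

def logit_choice(tau, beta):
    """Logit choice map Q(tau) of (2)."""
    shift = min(tau)
    weights = [math.exp(-beta * (t - shift)) for t in tau]
    total = sum(weights)
    return [w / total for w in weights]

def perturbed_cost(omega, tau, beta):
    """omega^T tau + (1/beta) * sum_l omega_l log omega_l for omega in int(S)."""
    linear = sum(o * t for o, t in zip(omega, tau))
    entropy = sum(o * math.log(o) for o in omega)
    return linear + entropy / beta

def storage(tau, pi, eta, beta):
    """S(tau, pi) of (3), using that the inner minimizer is omega = Q(tau)."""
    q = logit_choice(tau, beta)
    return eta * (perturbed_cost(pi, tau, beta) - perturbed_cost(q, tau, beta))

def kl_divergence(p, q):
    """Kullback-Leibler divergence between two interior distributions."""
    return sum(a * math.log(a / b) for a, b in zip(p, q))

eta, beta = 2.0, 1.5                       # assumed parameter values
tau = [1.0, 2.0, 3.0]
pi = [0.2, 0.3, 0.5]
q = logit_choice(tau, beta)
s_value = storage(tau, pi, eta, beta)
s_at_equilibrium = storage(tau, q, eta, beta)
s_closed_form = (eta / beta) * kl_divergence(pi, q)
```

Under these assumptions, `s_value` agrees with `s_closed_form`, is positive for $\pi\neq Q(\tau)$, and vanishes at $\pi=Q(\tau)$, which matches the positive semi-definiteness required of a storage function.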

2.3 Bias Models

The logit model (1) corresponds to an approximation of rational strategy choice. Meanwhile, behavioral economics has pointed out that the human decision process is sometimes biased depending on the population state. This subsection introduces two models of conformity bias. In particular, we represent these bias models as ones that affect the cost $\tau$. Hereafter, we refer to $\tau$ as the biased cost. In contrast, the actual cost is denoted by $T\in\mathbb{R}^{n}$, which is basically supposed to be non-negative.

First, we introduce a bias model that generalizes the interactions model [33]. In this model, the biased cost $\tau$ is given by

\displaystyle\tau=T+B(\pi),

(4)

where $B:\mathcal{S}\to\mathbb{R}^{n}$ is the bias function, whose $k$-th element is $b_{k}(\pi_{k})$. The function $b_{k}:[0,1]\to\mathbb{R}$ is assumed to satisfy the following:

Assumption 1

The function $b_{k}\ (k\in\mathcal{A})$ satisfies the following conditions:

(i)

$b_{k}$ is continuous in $[0,1]$ and continuously differentiable in $(0,1)$ .
(ii)

$b_{k}$ is decreasing in $\pi_{k}$ , and uniformly bounded by $[b^{L},b^{H}]$ with some $b^{H}\geq b^{L}$ .
(iii)

The first derivative $\nabla b_{k}$ is uniformly bounded by $[-c^{H},-c^{L}]$ with some $c^{H}\geq c^{L}>0$ .

Item (ii) in Assumption 1 implies that if $k\in\mathcal{A}$ is a majority strategy, the decision makers get the impression of a lower cost than the actual cost $T_{k}$. Hence, under $B(\pi)$, a strategy chosen by many people intensifies its own popularity, which corresponds to conformity bias. The first derivative $\nabla b_{k}$ expresses the dependency of $b_{k}$ on $\pi_{k}$, i.e., it corresponds to the strength of the bias. In particular, $c^{H}$, which is the maximum of $|\nabla b_{k}(\pi_{k})|$, can be interpreted as the maximal bias-strength. In the sequel, we refer to the decision making model composed of (1) and (4) as Model 1. The block diagram of Model 1 is illustrated in Fig. 1.

Figure 1: Block diagram of Model 1. The system colored by khaki is the logit model (1).
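To make the additive bias (4) concrete, the following sketch uses the linear bias $b_{k}(\pi_{k})=-c\,\pi_{k}$ with a constant $c>0$. This particular form is an assumption chosen here for illustration; it satisfies Assumption 1 with $b^{L}=-c$, $b^{H}=0$ and $c^{L}=c^{H}=c$, but the paper does not prescribe it.

```python
def bias_linear(pi, c=1.0):
    """Hypothetical bias B(pi) with b_k(pi_k) = -c * pi_k (c^L = c^H = c)."""
    return [-c * p for p in pi]

def biased_cost_model1(T, pi, c=1.0):
    """Biased cost tau = T + B(pi) of (4)."""
    return [t + b for t, b in zip(T, bias_linear(pi, c))]

T = [1.0, 1.0, 1.0]            # equal actual costs (assumed values)
pi = [0.6, 0.3, 0.1]           # strategy 1 is the majority
tau = biased_cost_model1(T, pi, c=1.0)
```

Although the actual costs are identical in this example, the biased cost of the majority strategy is the smallest, so the logit choice (2) would favor it further.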

We also address another model of conformity bias, presented in [25, 34]. For a given $T$, the biased cost $\tau$ in this model obeys

\displaystyle\tau=W(\pi)T,

(5)

where $W(\pi)=\textrm{diag}(w_{1}(\pi_{1}),w_{2}(\pi_{2}),\ldots,w_{n}(\pi_{n}))$ is the bias matrix. The $k$ -th diagonal element $w_{k}:[0,1]\to\mathbb{R}$ ( $k\in\mathcal{A}$ ) is the bias function for strategy $k$ . Throughout this paper, we set the following assumption for the bias function $w_{k}$ .

Assumption 2

The function $w_{k}\ (k\in\mathcal{A})$ satisfies the following conditions:

(i)

$w_{k}$ is continuous in $[0,1]$ and continuously differentiable in $(0,1)$ .
(ii)

$w_{k}$ is decreasing in $\pi_{k}$ , and uniformly bounded by $[w^{L},w^{H}]$ with some $w^{H}\geq w^{L}>0$ .
(iii)

The first derivative $\nabla w_{k}$ is uniformly bounded by $[-v^{H},-v^{L}]$ with some $v^{H}\geq v^{L}>0$ .

In the subsequent discussion, we refer to the model composed of (1) and (5) as Model 2. Under Assumption 2, the biased cost for strategy $k$ decreases as $\pi_{k}$ grows. In other words, the majority strategy tends to be chosen more often. Since $|\nabla w_{k}|$ expresses the dependency of $w_{k}$ on $\pi_{k}$, it represents the strength of the bias, and its maximum value $v^{H}$ represents the maximal bias-strength of Model 2. The block diagram of Model 2 is illustrated in Fig. 2.
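The multiplicative bias (5) can be illustrated in the same way. The sketch below assumes the linear weight $w_{k}(\pi_{k})=w_{0}-v\,\pi_{k}$ with constants $w_{0}>v>0$, which satisfies Assumption 2 with $w^{L}=w_{0}-v$, $w^{H}=w_{0}$ and $v^{L}=v^{H}=v$. The weight and the numerical values are hypothetical and serve only as an example.

```python
def weight_linear(p, w0=1.0, v=0.2):
    """Hypothetical weight w_k(pi_k) = w0 - v * pi_k with w0 > v > 0."""
    return w0 - v * p

def biased_cost_model2(T, pi, w0=1.0, v=0.2):
    """Biased cost tau = W(pi) T of (5), with W(pi) diagonal."""
    return [weight_linear(p, w0, v) * t for p, t in zip(pi, T)]

pi = [0.6, 0.3, 0.1]                           # strategy 1 is the majority
tau = biased_cost_model2([2.0, 2.0, 2.0], pi)  # equal positive actual costs
tau_zero = biased_cost_model2([0.0, 0.0, 0.0], pi)
```

In contrast to the additive bias of Model 1, the distortion here scales with the actual cost: for equal positive costs the majority strategy looks cheapest, whereas for zero actual cost the bias has no effect at all.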

3 Passivity Analysis of Decision Making Models

In this section, we analyze the biased decision making models, introduced in the last section, in terms of $\delta$ -passivity.

First, we consider Model 1, which is composed of (1) and (4). From (4), the signal $\dot{\tau}$ is given by

\displaystyle\dot{\tau}=\dot{T}+B^{\prime}(\pi)\dot{\pi},

(6)

where $B^{\prime}(\pi):=\textrm{diag}(\nabla b_{1}(\pi_{1}),\nabla b_{2}(\pi_{2}),\dots,\nabla b_{n}(\pi_{n}))$ is negative definite because of Assumption 1. From Lemma 1 and (6), we have the following lemma.

Lemma 2

Under Assumption 1, the system (1) with (4) is $\delta$ -output-passivity-short from $-\dot{T}$ to $\dot{\pi}$ .

Proof 3.1.

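Consider the function $S(\tau,\pi)$ defined in (3) with $\tau=T+B(\pi)$ substituted. From Lemma 1 and (6), together with $-\dot{\pi}^{\top}B^{\prime}(\pi)\dot{\pi}\leq c^{H}\|\dot{\pi}\|^{2}$ from item (iii) of Assumption 1, we have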

\displaystyle\begin{aligned}
\dot{S}&\leq-\dot{\tau}^{\top}\dot{\pi}\\
&=-(\dot{T}+B^{\prime}(\pi)\dot{\pi})^{\top}\dot{\pi}\\
&=-\dot{T}^{\top}\dot{\pi}-\dot{\pi}^{\top}B^{\prime}(\pi)\dot{\pi}\\
&\leq-\dot{T}^{\top}\dot{\pi}+c^{H}\|\dot{\pi}\|^{2}.
\end{aligned}

(7)

This completes the proof.

Focusing on $\delta$-passivity, the block diagram of Model 1 can be represented as in Fig. 3. Lemma 2 suggests that a positive feedback through $-B^{\prime}(\pi)$ appears outside the non-biased logit model, as shown in Fig. 3. We can confirm from (7) that the impact coefficient grows with the bias strength $c^{H}$. In other words, the conformity bias modeled in (4) violates the passivity of decision making.

We next analyze Model 2. Here, we define the diagonal matrix $W^{\prime}(\pi):=\textrm{diag}(\nabla w_{1}(\pi_{1}),\nabla w_{2}(\pi_{2}),\dots,\nabla w_{n}(\pi_{n}))$ which is negative definite due to Assumption 2. Then, the bias model (5) yields

	$\displaystyle\dot{\tau}$	$\displaystyle=W(\pi)\dot{T}+\textrm{diag}(W^{\prime}(\pi)\dot{\pi})T$
		$\displaystyle=W(\pi)\dot{T}-\Phi(T,\pi)\dot{\pi},$		(8)

where $\Phi(T,\pi):=-W^{\prime}(\pi)\textrm{diag}(T)$ is a diagonal matrix. Substituting (8) into the inequality in Lemma 1, we can prove the following lemma, which is a particular case of [34, Lemma 5].

Lemma 3.2 ([34]).

Suppose that there exists a constant $T_{\max}>0$ such that $T_{k}(t)\leq T_{\max}$ holds for all $k\in\mathcal{A}$ and for all $t\geq 0$ . Under Assumption 2, the system (1) with (5) is $\delta$ -output-passivity-short from $-\dot{T}$ to $y:=W(\pi)\dot{\pi}$ .

Proof 3.3.

Consider the function $S(\tau,\pi)$ with $\tau=W(\pi)T$ substituted. From Lemma 1 and (8), we obtain

\displaystyle\begin{aligned}
\dot{S}&\leq-\dot{\tau}^{\top}\dot{\pi}\\
&=-\left(W(\pi)\dot{T}-\Phi(T,\pi)\dot{\pi}\right)^{\top}\dot{\pi}\\
&=-\dot{T}^{\top}W(\pi)\dot{\pi}+\dot{\pi}^{\top}\Phi(T,\pi)\dot{\pi}\\
&=-\dot{T}^{\top}y+y^{\top}(W(\pi))^{-1}\Phi(T,\pi)(W(\pi))^{-1}y\\
&\leq-\dot{T}^{\top}y+\frac{v^{H}T_{\max}}{(w^{L})^{2}}\|y\|^{2}.
\end{aligned}

(9)

This completes the proof.

The block diagram of Model 2 can be illustrated as in Fig. 4 by taking the input and the output as $-\dot{T}$ and $y$, respectively. Lemma 3.2 reveals that the matrix $\Phi(T,\pi)$ appears as a positive feedback around the original logit model (1), which is $\delta$-passive. The impact coefficient in (9) increases with $v^{H}$, which is the strength of the bias $W(\pi)$. This result suggests that the conformity bias represented in (5) destabilizes the decision making.

Both Lemma 2 and Lemma 3.2 indicate that the positive feedback paths caused by the conformity biases violate the $\delta$-passivity of the original rational model (1). Therefore, we conclude that the conformity bias works to destabilize the dynamic decision making.
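The destabilizing effect can be observed in a small simulation of Model 1 with two strategies and constant, equal actual costs. The sketch below is an illustration under assumptions made here: the linear bias $b_{k}(\pi_{k})=-c\,\pi_{k}$, forward-Euler integration, and the listed parameter values are not taken from the paper.

```python
import math

def logit_choice(tau, beta):
    """Logit choice map Q(tau) of (2)."""
    shift = min(tau)
    weights = [math.exp(-beta * (t - shift)) for t in tau]
    total = sum(weights)
    return [w / total for w in weights]

def simulate_model1(pi0, T, c, eta=1.0, beta=1.0, dt=0.01, steps=5000):
    """Forward-Euler run of (1) with (4), hypothetical bias b_k = -c * pi_k."""
    pi = list(pi0)
    for _ in range(steps):
        tau = [t - c * p for t, p in zip(T, pi)]     # biased cost (4)
        q = logit_choice(tau, beta)
        pi = [p + dt * eta * (qk - p) for p, qk in zip(pi, q)]
    return pi

T = [1.0, 1.0]                                  # equal actual costs
weak = simulate_model1([0.6, 0.4], T, c=1.0)    # weak conformity bias
strong = simulate_model1([0.6, 0.4], T, c=4.0)  # strong conformity bias
```

Under these assumed parameters, the weakly biased population returns to the even split that the equal costs suggest, while the strongly biased one drifts away from it and locks in on the initial majority. This is consistent with the interpretation of the bias as a positive feedback whose gain grows with $c^{H}$.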

4 Passivity-based Design of Mechanisms

In this section, we propose behavior modification mechanisms that lead decision makers to a desired social state. In particular, we focus on the output-passivity-shortage of the decision making models shown in the previous section, and design mechanisms based on the passivity paradigm. In the sequel, we assume that the population state $\pi$ is observable, and that the desired population state, denoted by $\pi^{*}\in\textrm{int}(\mathcal{S})$, is given as a constant vector. In this paper, a mechanism refers to a system that updates the actual cost $T$ by using $\pi$ and $\pi^{*}$. (In the case of incentive design, the cost $T(t)$ is imposed on decision makers as an economic input, whereas in the case of a nudge, it is announced to them as an informational input.) The goal of this section is to design mechanisms that ensure the inducement $\lim_{t\to\infty}\pi(t)=\pi^{*}$ for Model 1 and Model 2, respectively.

4.1 Mechanism for Model 1

In this subsection, we consider a mechanism for Model 1. The structure of the mechanism is illustrated in Fig. 5. The gray part in Fig. 5 represents Model 1, and the block $\Sigma_{1}$ is the mechanism. Notice that the system enclosed by the red line is $\delta$-output-passivity-short, as proved in Lemma 2. In terms of passivity theory, the positive energy generated by an output-passivity-short system can be canceled out by negative feedback of an input-strictly-passive system. Hence, we can expect to implement a stable mechanism by designing $\Sigma_{1}$ so that the system enclosed by the blue line is input-strictly-passive.

Based on the above concept, we propose the following mechanism, inspired by the proportional-integral (PI) controller:


$\displaystyle\dot{\mu}$	$\displaystyle=\rho\left(\pi-\pi^{*}\right),$	(10a)
$\displaystyle T$	$\displaystyle=\mu+\kappa(\pi-\pi^{*}),$	(10b)

where $\rho>0$ and $\kappa>0$ are design parameters. The mechanism (10) satisfies the following lemma.

Lemma 4.4.

The system (10) is $\delta$ -input-strictly-passive from $\dot{\pi}$ to $\dot{T}$ for the storage function $H(\pi):=\frac{\rho}{2}\|\pi-\pi^{*}\|^{2}$ .

Proof 4.5.

From (10b), $\dot{T}=\dot{\mu}+\kappa\dot{\pi}$ holds. Noticing this result and (10a), we have

\displaystyle\begin{aligned}
\dot{H}&=\rho(\pi-\pi^{*})^{\top}\dot{\pi}=\dot{\mu}^{\top}\dot{\pi}\\
&=\dot{T}^{\top}\dot{\pi}-\kappa\|\dot{\pi}\|^{2}.
\end{aligned}

(11)

This result shows the $\delta$ -input-strict-passivity of (10).

The result in Lemma 4.4 suggests that the system from $\dot{\pi}$ to $\dot{T}$ , enclosed by the blue line in Fig. 5, is $\delta$ -input-strictly-passive if we apply (10) to $\Sigma_{1}$ . Thus, it is expected that the mechanism (10) cancels out the positive energy in (7) and induces the population state $\pi$ to the desired state $\pi^{*}$ .

As a preparation for convergence analysis, we introduce the following lemma about the storage function $S$ under the mechanism (10).

Lemma 4.6.

Consider the system (10). Suppose that a signal $\pi(t)\in\textrm{int}(\mathcal{S})$ yields $\|T(t)\|\to\infty$ as $t\to\infty$. For this specific signal $\pi(t)$, $S(T+B(\pi),\pi)\to\infty$ holds for any $\pi^{*}\in\textrm{int}(\mathcal{S})$.

Proof 4.7.

See Appendix 7.1.

We are now ready to prove the following theorem. In the proof, we use the notation $\mathcal{L}_{\infty}$ as the set of all signals $u:[0,\infty)\to\mathbb{R}^{n}$ satisfying $\sup_{t\in[0,\infty)}\|u(t)\|<\infty$ .

Theorem 4.8.

Consider the feedback system composed of (1), (4) and (10). When $\kappa>c^{H}$ holds, the population state $\pi$ achieves $\lim_{t\to\infty}\pi(t)=\pi^{*}$ .

Proof 4.9.

We define the function $V_{1}:=S(T+B(\pi),\pi)+H(\pi)$ . From Lemma 2 and Lemma 4.4, we obtain

\displaystyle\begin{aligned}
\dot{V}_{1}&\leq-\dot{T}^{\top}\dot{\pi}+c^{H}\|\dot{\pi}\|^{2}+\dot{T}^{\top}\dot{\pi}-\kappa\|\dot{\pi}\|^{2}\\
&=-(\kappa-c^{H})\|\dot{\pi}\|^{2}\leq 0
\end{aligned}

(12)

under $\kappa>c^{H}$. Denote the initial values of $S$ and $H$ by $S_{0}$ and $H_{0}$, respectively. Then, $V_{1}\leq S_{0}+H_{0}<\infty$ holds for any time $t\geq 0$. Noting $\pi\in\textrm{int}(\mathcal{S})$, this implies $T\in\mathcal{L}_{\infty}$ due to Lemma 4.6. Thus, we can apply LaSalle’s invariance principle [38], and hence the solution of (1), (4) and (10) for any initial conditions $T(0)\in\mathbb{R}^{n}$ and $\pi(0)\in\textrm{int}(\mathcal{S})$ converges to the largest invariant set satisfying $\dot{V}_{1}=0$.

Consider the state trajectories such that $\dot{V}_{1}\equiv 0$ holds. From (12), $\dot{V}_{1}\equiv 0$ implies $\dot{\pi}\equiv 0$ . Hence, $\pi$ should be constant. Focusing on (10), $\dot{T}\equiv\dot{\mu}$ holds and $\dot{\mu}=\rho(\pi-\pi^{*})$ should be constant. If $\pi\neq\pi^{*}$ , $\dot{T}\neq 0$ identically holds and thus $T$ should diverge. However, the divergence contradicts $T\in\mathcal{L}_{\infty}$ . Accordingly, $T$ is constant and $\pi\equiv\pi^{*}$ is satisfied.

As a result, by invoking LaSalle’s invariance principle, we can prove that $\pi$ asymptotically converges to $\pi^{*}$.

Theorem 4.8 shows that the mechanism (10) can induce the decision makers of Model 1 to the desired state if the parameter $\kappa$ is designed to be large enough. The right-hand side of the convergence condition $\kappa>c^{H}$ is given by the bias-strength. Hence, for strongly biased decision makers, $\kappa$ needs to be set large.
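The closed loop of Theorem 4.8 can be explored with a short simulation. The following sketch combines (1), (4) and (10) under assumptions made here for illustration: the linear bias $b_{k}(\pi_{k})=-c\,\pi_{k}$ (so that $c^{H}=c$), forward-Euler integration, $\mu(0)=0$, and the listed gains. None of these choices come from the paper.

```python
import math

def logit_choice(tau, beta):
    """Logit choice map Q(tau) of (2)."""
    shift = min(tau)
    weights = [math.exp(-beta * (t - shift)) for t in tau]
    total = sum(weights)
    return [w / total for w in weights]

def simulate_closed_loop_model1(pi0, pi_star, c, kappa, rho,
                                eta=1.0, beta=1.0, dt=0.01, steps=20000):
    """Forward-Euler run of (1), (4), (10) with hypothetical bias b_k = -c * pi_k."""
    pi = list(pi0)
    mu = [0.0] * len(pi0)
    for _ in range(steps):
        err = [p - ps for p, ps in zip(pi, pi_star)]
        T = [m + kappa * e for m, e in zip(mu, err)]            # (10b)
        tau = [t - c * p for t, p in zip(T, pi)]                # (4)
        q = logit_choice(tau, beta)
        pi = [p + dt * eta * (qk - p) for p, qk in zip(pi, q)]  # (1)
        mu = [m + dt * rho * e for m, e in zip(mu, err)]        # (10a)
    return pi, mu

pi_star = [0.5, 0.3, 0.2]                       # desired state (assumed)
pi_final, mu_final = simulate_closed_loop_model1(
    [1 / 3, 1 / 3, 1 / 3], pi_star, c=1.0, kappa=2.0, rho=1.0)
gap = max(abs(p - ps) for p, ps in zip(pi_final, pi_star))
```

Here $\kappa=2$ exceeds the assumed bias strength $c^{H}=1$, so the condition of Theorem 4.8 holds and `gap` becomes small. Lowering `kappa` below `c` in this sketch is a simple way to probe what happens when the condition is violated.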

4.2 Mechanism for Model 2

Let us now design a mechanism for Model 2. Its structure is illustrated in Fig. 6: the gray part represents the decision makers, and $\Sigma_{2}$ is the mechanism. Similarly to Section 4.1, we design $\Sigma_{2}$ to guarantee passivity from $y$ to $\dot{T}$.

Applying the mechanism (10) to $\Sigma_{2}$, the following $\delta$-passivity property is satisfied, as proved in the previous work [34].

Fact 1 ([34]).

Under Assumption 2, the system (10) is $\delta$ -input-strictly-passive from $y$ to $\dot{T}$ .

If (10) is used for $\Sigma_{2}$, the whole system in Fig. 6 is indeed a feedback interconnection of a $\delta$-output-passivity-short system and a $\delta$-input-strictly-passive system. However, the analysis in [34] leaves an important issue regarding signal boundedness. In [34], the boundedness of the signal $T$ is assumed rather than theoretically guaranteed. In particular, certifying the upper bound of the signal $T$, denoted by $T_{\max}$, is essential for canceling out the positive energy in (9). Therefore, we should redesign the mechanism for Model 2 so as to guarantee the existence of $T_{\max}$ while keeping passivity.

To solve the above issue, we propose the following mechanism.


$\displaystyle\dot{\mu}$	$\displaystyle=\min\left\{\rho\left(\pi-\pi^{*}\right),-\alpha\mu\right\},\ \mu(0)\leq 0,$	(13a)
$\displaystyle T$	$\displaystyle=\bar{T}\textbf{1}_{n}+\mu+\kappa(\pi-\pi^{*}),$	(13b)

where $\rho>0$ and $\kappa>0$ are design parameters, $\alpha>0$ is a parameter satisfying $\alpha>\frac{1}{2\kappa}$, and $\bar{T}>0$ is a constant. Note that (13a) ensures $\mu\leq 0$ for all time, since $\dot{\mu}_{k}\leq-\alpha\mu_{k}$ always holds. Because $\mu_{k}\leq 0$ and $\pi_{k}-\pi_{k}^{*}<1$, the system (13) guarantees the existence of $T_{\max}<\bar{T}+\kappa$ such that $T_{k}\leq T_{\max}\ \forall k\in\mathcal{A}$ holds for all time. As for the time derivative of $T$, $\dot{T}=\dot{\mu}+\kappa\dot{\pi}$ holds from (13b).
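The boundedness property of (13) can be checked numerically in isolation from the decision model. The sketch below drives a forward-Euler discretization of (13a) with a hypothetical oscillating test signal $\pi(t)$ inside $\textrm{int}(\mathcal{S})$; the signal, the step size and the gains are assumptions made here, and the discretization needs $\alpha\,dt<1$ to preserve $\mu\leq 0$.

```python
import math

def mechanism13_step(mu, pi, pi_star, rho, alpha, dt):
    """One forward-Euler step of (13a); keeps mu <= 0 when alpha * dt < 1."""
    return [m + dt * min(rho * (p - ps), -alpha * m)
            for m, p, ps in zip(mu, pi, pi_star)]

def mechanism13_output(mu, pi, pi_star, T_bar, kappa):
    """Cost T of (13b)."""
    return [T_bar + m + kappa * (p - ps) for m, p, ps in zip(mu, pi, pi_star)]

pi_star = [0.5, 0.3, 0.2]
rho, alpha, kappa, T_bar, dt = 1.0, 1.0, 2.0, 1.0, 0.01   # assumed values
mu = [0.0, 0.0, 0.0]                                      # mu(0) <= 0
mu_max = 0.0
T_max_seen = -math.inf
for i in range(5000):
    s = 0.2 * math.sin(0.05 * i)                          # hypothetical test signal
    pi = [0.4 + s, 0.35 - 0.5 * s, 0.25 - 0.5 * s]        # stays in int(S)
    T = mechanism13_output(mu, pi, pi_star, T_bar, kappa)
    T_max_seen = max(T_max_seen, max(T))
    mu = mechanism13_step(mu, pi, pi_star, rho, alpha, dt)
    mu_max = max(mu_max, max(mu))
```

In this run, `mu_max` never becomes positive and `T_max_seen` stays below $\bar{T}+\kappa$, in agreement with the bound on $T_{\max}$ stated above.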

For the passivity analysis of (13), we consider the primitive function of $w_{k}$ . Under Assumption 2, $w_{k}$ is continuous. Hence, we can define the following function:

\displaystyle F_{k}(\pi_{k}):=\left\{\begin{array}{cl}\displaystyle{\int_{0}^{\pi_{k}}\left(\int_{0}^{\phi}w_{k}(\theta)d\theta\right)d\phi}&\textrm{if}\ \pi_{k}\in[0,1],\\ \infty&\textrm{otherwise}.\end{array}\right.

(16)

For $\pi_{k}\in(0,1)$ , $\nabla^{2}F_{k}(\pi_{k})=w_{k}(\pi_{k})\geq w^{L}$ clearly holds and this reveals that $F_{k}$ is convex. We denote $F^{\star}_{k}:\mathbb{R}\to\mathbb{R}$ as the convex conjugate of $F_{k}$ [39]. Due to the convexity of $F_{k}$ ,

\displaystyle\pi_{k}=\nabla F^{\star}_{k}(\nabla F_{k}(\pi_{k})),\ \pi_{k}\in(0,1)

(17)

is satisfied [39]. By using these results, we can prove the following lemma.
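As a concrete instance of (16) and (17), consider the linear weight $w_{k}(\theta)=w_{0}-v\theta$ with constants $w_{0}>v>0$. This particular form is an assumption made here for illustration and is not prescribed by the paper; it satisfies Assumption 2 with $w^{L}=w_{0}-v$, $w^{H}=w_{0}$ and $v^{L}=v^{H}=v$. For $\pi_{k}\in(0,1)$, direct integration of (16) gives

\displaystyle\nabla F_{k}(\pi_{k})=w_{0}\pi_{k}-\frac{v}{2}\pi_{k}^{2},\quad F_{k}(\pi_{k})=\frac{w_{0}}{2}\pi_{k}^{2}-\frac{v}{6}\pi_{k}^{3}.

Since $\nabla^{2}F_{k}(\pi_{k})=w_{0}-v\pi_{k}>0$, the map $\nabla F_{k}$ is strictly increasing on $(0,1)$, and (17) states that $\nabla F^{\star}_{k}$ inverts it on its range. Solving $\zeta=\nabla F_{k}(\pi_{k})$ for the root in $(0,1)$ yields

\displaystyle\nabla F^{\star}_{k}(\zeta)=\frac{w_{0}-\sqrt{w_{0}^{2}-2v\zeta}}{v},\quad\zeta\in\left(0,\,w_{0}-\frac{v}{2}\right).

Substituting $\zeta=\nabla F_{k}(\pi_{k})$ gives $w_{0}^{2}-2v\zeta=(w_{0}-v\pi_{k})^{2}$, so that $\nabla F^{\star}_{k}(\nabla F_{k}(\pi_{k}))=\pi_{k}$, as required by (17).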

Lemma 4.10.

Define $\zeta_{k}:=\nabla F_{k}(\min\{\pi_{k},\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k}\})$ and $\zeta_{k}^{*}:=\nabla F_{k}(\pi_{k}^{*})$ for each $k\in\mathcal{A}$ . Under Assumption 2, the system (13) is $\delta$ -input-strictly-passive from $y$ to $\dot{T}$ for the storage function

\displaystyle\begin{aligned}
U(\mu,\pi)&:=\sum_{k\in\mathcal{A}}U_{k}(\mu_{k},\pi_{k}),\\
U_{k}(\mu_{k},\pi_{k})&:=\rho\left(F^{\star}_{k}(\zeta_{k})-F^{\star}_{k}(\zeta_{k}^{*})-(\zeta_{k}-\zeta_{k}^{*})\nabla F^{\star}_{k}(\zeta_{k}^{*})\right).
\end{aligned}

Proof 4.11.

Due to the convexity of $F^{\star}_{k}$ ,

\displaystyle F^{\star}_{k}(\zeta_{k})-F^{\star}_{k}(\zeta_{k}^{*})\geq(\zeta_{k}-\zeta_{k}^{*})\nabla F^{\star}_{k}(\zeta_{k}^{*})

holds and hence the function $U_{k}$ becomes positive semi-definite. We first consider the time when mode switch does not happen in (13a) for $k$ -th element. Then, the time derivative of $U_{k}$ along (13) is given by

\displaystyle\dot{U}_{k}=\rho\left(\nabla F^{\star}_{k}(\zeta_{k})-\nabla F^{\star}_{k}(\zeta_{k}^{*})\right)\dot{\zeta}_{k}.

(18)

If $\pi_{k}-\pi_{k}^{*}\leq-\frac{\alpha}{\rho}\mu_{k}$ , then $\zeta_{k}=\nabla F_{k}(\pi_{k})$ holds and hence

	$\displaystyle\dot{U}_{k}$	$\displaystyle=\rho(\pi_{k}-\pi_{k}^{*})\nabla^{2}F_{k}(\pi_{k})\dot{\pi}_{k}$
		$\displaystyle=\rho(\pi_{k}-\pi_{k}^{*})y_{k}.$		(19)

Under $\pi_{k}-\pi_{k}^{*}\leq-\frac{\alpha}{\rho}\mu_{k}$ , $\dot{T}_{k}=\rho\left(\pi_{k}-\pi_{k}^{*}\right)+\kappa\dot{\pi}_{k}$ holds. Thus, we have

$\displaystyle\dot{U}_{k}$	$\displaystyle=\dot{T}_{k}y_{k}-\kappa\dot{\pi}_{k}y_{k}$
	$\displaystyle=\dot{T}_{k}y_{k}-\kappa w_{k}(\pi_{k})\dot{\pi}_{k}^{2}$
	$\displaystyle\leq\dot{T}_{k}y_{k}-\frac{\kappa}{2}w_{k}(\pi_{k})\dot{\pi}_{k}^{2}.$	(20)

If $\pi_{k}-\pi_{k}^{*}>-\frac{\alpha}{\rho}\mu_{k}$ , we obtain $\zeta_{k}=\nabla F_{k}(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k})$ and $\dot{\mu}_{k}=-\alpha\mu_{k}$ . Then,

\displaystyle\begin{aligned}
\dot{U}_{k}&=\rho\left(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k}-\pi_{k}^{*}\right)\nabla^{2}F_{k}\left(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k}\right)(-\alpha\dot{\mu}_{k})\\
&=\alpha^{2}w_{k}\left(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k}\right)\mu_{k}\dot{\mu}_{k}\\
&=-\alpha w_{k}\left(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k}\right)\dot{\mu}_{k}^{2}
\end{aligned}

(21)

holds. Since $w_{k}$ is a decreasing function, we obtain $w_{k}(\pi_{k}^{*}-\frac{\alpha}{\rho}\mu_{k})\geq w_{k}(\pi_{k})$. From this inequality and $\dot{T}=\dot{\mu}+\kappa\dot{\pi}$, we have

$\displaystyle\dot{U}_{k}$	$\displaystyle\leq-\alpha w_{k}(\pi_{k})\dot{\mu}_{k}^{2}$
	$\displaystyle=-\frac{1}{2\kappa}w_{k}(\pi_{k})\dot{\mu}_{k}^{2}-\left(\alpha-\frac{1}{2\kappa}\right)w_{k}(\pi_{k})\dot{\mu}_{k}^{2}$
	$\displaystyle\leq-\frac{1}{2\kappa}w_{k}(\pi_{k})\left(\dot{T}_{k}-\kappa\dot{\pi}_{k}\right)^{2}$
	$\displaystyle=\dot{T}_{k}w_{k}(\pi_{k})\dot{\pi}_{k}-\frac{1}{2\kappa}w_{k}(\pi_{k})\dot{T}_{k}^{2}-\frac{\kappa}{2}w_{k}(\pi_{k})\dot{\pi}_{k}^{2}$
	$\displaystyle\leq\dot{T}_{k}y_{k}-\frac{\kappa}{2}w_{k}(\pi_{k})\dot{\pi}_{k}^{2}.$	(22)

Next, we consider the time when a mode switch occurs in (13a) for the $k$-th element. Since $U_{k}(\mu_{k},\pi_{k})$ is not differentiable at $(\mu_{k},\pi_{k})$ satisfying $\pi_{k}-\pi_{k}^{*}=-\frac{\alpha}{\rho}\mu_{k}$, we use the upper Dini derivative, denoted by $D^{+}U_{k}$. From the results in (20) and (22), we can confirm that

\displaystyle D^{+}U_{k}\leq\dot{T}_{k}y_{k}-\frac{\kappa}{2}w_{k}(\pi_{k})\dot{\pi}_{k}^{2}

holds for all time $t\geq 0$ . Therefore, we obtain

\displaystyle\begin{aligned}
D^{+}U&\leq\dot{T}^{\top}y-\frac{\kappa}{2}\dot{\pi}^{\top}W(\pi)\dot{\pi}\\
&=\dot{T}^{\top}y-\frac{\kappa}{2}y^{\top}\left(W(\pi)\right)^{-1}y\\
&\leq\dot{T}^{\top}y-\frac{\kappa}{2w^{H}}\|y\|^{2}.
\end{aligned}

(23)

Integrating (23) in time completes the proof.

Lemma 4.10 reveals that the proposed mechanism (13) guarantees both $\delta$-input-strict-passivity and the existence of $T_{\max}$. In other words, the issue remaining in the previous method [34] is resolved by (13). Thus, we can expect to achieve a passivity-based mechanism for Model 2.

Here we introduce the following lemma, which will be used in convergence analysis.

Lemma 4.12.

Consider the system (13). Suppose that a signal $\pi(t)\in\textrm{int}(\mathcal{S})$ yields $\|T(t)\|\to\infty$ as $t\to\infty$. For this specific signal $\pi(t)$, $S(W(\pi)T,\pi)\to\infty$ holds for any $\pi^{*}\in\textrm{int}(\mathcal{S})$.

Proof 4.13.

See Appendix 7.2.

From Lemma 3.2, Lemma 4.10 and Lemma 4.12, we are now ready to analyze the convergence of the proposed mechanism (13) for Model 2. We should remark that LaSalle’s invariance principle [38] cannot be applied due to the non-smoothness of the storage function $U(\mu,\pi)$. Instead, we discuss the convergence by using the invariance principle for non-smooth Lyapunov functions [40]. Then, the following theorem can be proven.

Theorem 4.14.

Consider the feedback system composed of (1), (5) and (13) under Assumption 2. When $\kappa>\frac{2v^{H}T_{\max}}{w^{L}}$ holds, the population state $\pi$ achieves $\lim_{t\to\infty}\pi(t)=\pi^{*}$ .

Proof 4.15.

Define the function $V_{2}:=S(W(\pi)T,\pi)+U(\mu,\pi)$ . From Lemma 3.2 and Lemma 4.10, the upper Dini derivative of $V_{2}$ satisfies

	$\displaystyle D^{+}V_{2}$	$\displaystyle\leq-\dot{T}^{\top}y+\dot{\pi}^{\top}\Phi(T,\pi)\dot{\pi}+\dot{T}^{\top}y-\frac{\kappa}{2}\dot{\pi}^{\top}W(\pi)\dot{\pi}$
		$\displaystyle=\dot{\pi}^{\top}\left(\Phi(T,\pi)-\frac{\kappa}{2}W(\pi)\right)\dot{\pi}.$		(24)

Under $\kappa>\frac{2v^{H}T_{\max}}{w^{L}}$ , $\Phi(T,\pi)-\frac{\kappa}{2}W(\pi)\prec 0$ is satisfied and hence $D^{+}V_{2}\leq 0$ holds. Denote the initial values of $S$ and $U$ by $S_{0}$ and $U_{0}$ , respectively. Then, $V_{2}\leq S_{0}+U_{0}<\infty$ holds for all time. Together with Lemma 4.12 and $\pi\in\textrm{int}(\mathcal{S})$ , this implies $T\in\mathcal{L}_{\infty}$ and $\mu\in\mathcal{L}_{\infty}$ . Accordingly, the invariance principle for non-smooth functions [40] is applicable, and trajectories generated by (1), (5) and (13) for any initial conditions $T(0)\in\mathbb{R}^{n}$ and $\pi(0)\in\textrm{int}(\mathcal{S})$ converge to the largest invariant set satisfying $D^{+}{V}_{2}=0$ .

Let us now consider the case $D^{+}V_{2}\equiv 0$ . From (24), $D^{+}V_{2}\equiv 0$ yields $\dot{\pi}\equiv 0$ and hence $\pi$ must be constant. If $\pi_{k}<\pi^{*}_{k}$ for some $k\in\mathcal{A}$ , then the $k$ -th element of (13a) follows $\dot{\mu}_{k}\equiv\rho(\pi_{k}-\pi_{k}^{*})<0$ , which contradicts the boundedness of $\mu$ . Thus, $\pi\geq\pi^{*}$ is identically satisfied. If $\pi_{k}>\pi^{*}_{k}$ holds for some $k\in\mathcal{A}$ , $\sum_{l\in\mathcal{A}}(\pi_{l}-\pi_{l}^{*})\geq\pi_{k}-\pi_{k}^{*}>0$ must hold. However, this contradicts the fact $\textbf{1}_{n}^{\top}(\pi-\pi^{*})\equiv 0$ given by $\pi\in\textrm{int}(\mathcal{S})$ and $\pi^{*}\in\textrm{int}(\mathcal{S})$ . Therefore, $\pi\equiv\pi^{*}$ is satisfied when $D^{+}V_{2}\equiv 0$ holds.

In summary, the invariance principle for non-smooth functions [40] proves that $\pi$ asymptotically converges to $\pi^{*}$ .

Thanks to the upper bound $T_{k}(t)\leq T_{\max}\ \forall k\in\mathcal{A},\forall t\geq 0$ ensured by (13), the stability and convergence of the feedback system in Fig. 6 are guaranteed under the gain condition $\kappa>\frac{2v^{H}T_{\max}}{w^{L}}$ . Similar to Theorem 4.8, the result in Theorem 4.14 suggests that a large gain $\kappa$ is required for strongly biased decision makers. This quantitative inequality is obtained as a result of the passivity-based analysis.
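As a small illustration of how the gain condition of Theorem 4.14 might be checked in practice, the following Python sketch computes the threshold $\frac{2v^{H}T_{\max}}{w^{L}}$ and tests a candidate $\kappa$ against it. The function names and all numerical values are hypothetical and serve only as an example; $v^{H}$ , $w^{L}$ and $T_{\max}$ are assumed to be known positive constants as defined earlier in the paper.

```python
def gain_threshold(v_high: float, t_max: float, w_low: float) -> float:
    """Return 2 * v^H * T_max / w^L, the lower bound on kappa in Theorem 4.14.

    The argument names are illustrative; they stand for v^H, T_max and w^L.
    """
    if w_low <= 0.0:
        raise ValueError("w_low (w^L) must be positive")
    return 2.0 * v_high * t_max / w_low


def satisfies_gain_condition(kappa: float, v_high: float,
                             t_max: float, w_low: float) -> bool:
    """Check the strict inequality kappa > 2 * v^H * T_max / w^L."""
    return kappa > gain_threshold(v_high, t_max, w_low)


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only for illustration.
    v_high, t_max, w_low = 0.5, 2.0, 0.25
    print("threshold:", gain_threshold(v_high, t_max, w_low))
    print("kappa = 8.5 admissible:",
          satisfies_gain_condition(8.5, v_high, t_max, w_low))
```

Because the inequality is strict, a gain exactly equal to the threshold is not covered by the theorem, and the check above rejects it.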

Remark 4.16.

The authors in [32] and [25] proposed a nudging mechanism similar to (10), assuming that the signal $Q(\tau)$ is observable. This assumption differs from the one adopted in this paper. However, $Q(\tau)$ is not easy to obtain from the decision makers since it is an implicit variable. Although $Q(\tau)$ might be estimated by using $\pi$ and $\dot{\pi}$ , doing so raises the additional difficulty of identifying $\eta$ . Thus, we assume that the population state $\pi$ is observed, which is a common assumption in the field of evolutionary games [23].

Remark 4.17.

In [25], the authors addressed the biased population dynamics, and analyzed the convergence of a nudging mechanism based on singular perturbation. The convergence condition in [25] implicitly relies on the update rate $\eta$ of the decision makers (1). In contrast, the result in Theorem 4.14 offers two advantages over [25]. First, it gives an explicit quantitative condition for stability, which is a benefit of passivity-based analysis. Second, the parameter $\kappa$ of the proposed mechanism can be designed independently of the update rate $\eta$ .

5 Conclusion

In this paper, we addressed the design of mechanisms for decision makers with conformity biases. We first introduced two types of bias models addressed in [33] and [25]. Next, we analyzed the population dynamics with the bias models in terms of $\delta$ -passivity. We then clarified that conformity biases appear as positive feedback terms, and that they break the passivity of dynamic decision making. We furthermore presented passivity-based mechanisms for biased population dynamics, and showed convergence conditions for the proposed mechanisms. Accordingly, we confirmed that high-gain feedback is required for decision makers with strong biases.

Appendices

6 Radial Unboundedness of Storage Function

The storage function $S(\tau,\pi)$ , defined in (3), satisfies the following lemma.

Lemma 6.18.

Define $\tau^{H}:=\max\{\tau_{1},\tau_{2},\dots,\tau_{n}\}$ and $\tau^{L}:=\min\{\tau_{1},\tau_{2},\dots,\tau_{n}\}$ . If $\pi\in\textrm{int}(\mathcal{S})$ , then

\displaystyle\tau^{H}-\tau^{L}\to\infty\ \Rightarrow\ S(\tau,\pi)\to\infty

holds.

Proof 6.19.

Let us first focus on the second term in the right hand side of (3), which is a convex optimization problem in $\omega$ . Remarking that $\sum_{l\in\mathcal{A}}\omega_{l}\log{\omega_{l}}$ works as a barrier function for the inequality constraints, $\omega^{*}\in\mathbb{R}^{n}$ is an optimal solution if and only if there exists $\lambda^{*}\in\mathbb{R}$ satisfying the following conditions:


	$\displaystyle\tau_{k}+\frac{1}{\beta}\left(1+\log\omega_{k}^{*}\right)+\lambda^{*}=0\ \ \forall k\in\mathcal{A},$		(25a)
	$\displaystyle{\bf 1}_{n}^{\top}\omega^{*}=1.$		(25b)

From (25a), we obtain $\log\omega_{k}^{*}=-\beta\tau_{k}-\beta\lambda^{*}-1$ and hence

\displaystyle\omega_{k}^{*}=\textrm{exp}\left(-\beta\tau_{k}-\beta\lambda^{*}-1\right).

(26)

Applying (25b) to (26), we have

		$\displaystyle\textrm{exp}\left(-\beta\lambda^{*}-1\right)\sum_{l\in\mathcal{A}}\textrm{exp}\left(-\beta\tau_{l}\right)=1,$
	$\displaystyle\therefore\ $	$\displaystyle\lambda^{*}=\frac{1}{\beta}\left(-1+\log\left(\sum_{l\in\mathcal{A}}\textrm{exp}\left(-\beta\tau_{l}\right)\right)\right).$		(27)

Hence, we can calculate the second term in the right hand side of (3) as

	$\displaystyle\min_{\omega\in\mathcal{S}}\left(\omega^{\top}\tau+\frac{1}{\beta}\sum_{l\in\mathcal{A}}\omega_{l}\log{\omega_{l}}\right)$
	$\displaystyle\ \ \ \ =(\omega^{*})^{\top}\tau+\frac{1}{\beta}\sum_{l\in\mathcal{A}}\omega^{*}_{l}\log{\omega^{*}_{l}}$
	$\displaystyle\ \ \ \ =\sum_{l\in\mathcal{A}}\omega^{*}_{l}\left(\tau_{l}+\frac{1}{\beta}\left(-\beta\tau_{l}-\beta\lambda^{*}-1\right)\right)$
	$\displaystyle\ \ \ \ =\sum_{l\in\mathcal{A}}\omega^{*}_{l}\left(-\lambda^{*}-\frac{1}{\beta}\right)=-\frac{1}{\beta}\log\left(\sum_{l\in\mathcal{A}}\textrm{exp}\left(-\beta\tau_{l}\right)\right).$

Thus, the storage function $S(\tau,\pi)$ is given as

\displaystyle S(\tau,\pi)=\eta\pi^{\top}\tau+\frac{\eta}{\beta}\sum_{l\in\mathcal{A}}\pi_{l}\log{\pi_{l}}+\frac{\eta}{\beta}\log\left(\sum_{l\in\mathcal{A}}\exp(-\beta\tau_{l})\right).

Let us next focus on the terms depending on $\tau$ . Due to the property of the Log-Sum-Exp function [39],

\displaystyle\frac{1}{\beta}\log\left(\sum_{l\in\mathcal{A}}\exp(-\beta\tau_{l})\right)\geq-\tau^{L}

is satisfied. Noticing that $\textbf{1}_{n}^{\top}\pi=1$ holds and there exists $\varepsilon>0$ such that $\pi_{k}>\varepsilon\ \forall k\in\mathcal{A}$ , we obtain

$\displaystyle\pi^{\top}\tau+\frac{1}{\beta}\log\left(\sum_{l\in\mathcal{A}}\exp(-\beta\tau_{l})\right)$	$\displaystyle\geq\pi^{\top}\tau-\tau^{L}$
	$\displaystyle=\sum_{l\in\mathcal{A}}\pi_{l}(\tau_{l}-\tau^{L})$
	$\displaystyle\geq\varepsilon(\tau^{H}-\tau^{L}).$	(28)

We now consider the case that $(\tau^{H}-\tau^{L})\to\infty$ happens. Then, $\eta\pi^{\top}\tau+\frac{\eta}{\beta}\log\left(\sum_{l\in\mathcal{A}}\exp(-\beta\tau_{l})\right)$ goes to infinity due to (28). Therefore, since $\sum_{l\in\mathcal{A}}\pi_{l}\log{\pi_{l}}$ is finite, $S(\tau,\pi)\to\infty$ holds. This completes the proof.
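The two computational steps of this proof, the closed-form minimum obtained from (25) to (27) and the lower bound (28), can be checked numerically. The following Python sketch is only a sanity check under illustrative assumptions: the function names, the sample values of $\tau$ , $\pi$ and $\beta$ , and the use of $\min_{k}\pi_{k}$ in place of $\varepsilon$ are choices made for this example and are not part of the original analysis.

```python
import math
import random


def entropy_regularized_cost(omega, tau, beta):
    """omega^T tau + (1/beta) * sum_l omega_l * log(omega_l), for omega_l > 0."""
    linear = sum(w * t for w, t in zip(omega, tau))
    entropy = sum(w * math.log(w) for w in omega)
    return linear + entropy / beta


def logit_minimizer(tau, beta):
    """omega*_k proportional to exp(-beta * tau_k), as implied by (26) and (27)."""
    t_min = min(tau)  # shift the exponents for numerical stability
    weights = [math.exp(-beta * (t - t_min)) for t in tau]
    total = sum(weights)
    return [w / total for w in weights]


def closed_form_minimum(tau, beta):
    """-(1/beta) * log(sum_l exp(-beta * tau_l)), evaluated in a stable way."""
    t_min = min(tau)
    shifted = sum(math.exp(-beta * (t - t_min)) for t in tau)
    return t_min - math.log(shifted) / beta


def tau_dependent_part(pi, tau, beta):
    """pi^T tau + (1/beta) * log(sum_l exp(-beta * tau_l)), the left side of (28)."""
    return sum(p * t for p, t in zip(pi, tau)) - closed_form_minimum(tau, beta)


if __name__ == "__main__":
    # Hypothetical sample data, chosen only for illustration.
    tau, beta = [1.0, -0.5, 3.0], 2.0
    pi = [0.2, 0.5, 0.3]
    omega_star = logit_minimizer(tau, beta)
    print("cost at omega*:", entropy_regularized_cost(omega_star, tau, beta))
    print("closed form   :", closed_form_minimum(tau, beta))
    print("lhs of (28)   :", tau_dependent_part(pi, tau, beta))
    print("rhs of (28)   :", min(pi) * (max(tau) - min(tau)))
```

In this sketch the cost at `omega_star` should coincide with the closed form up to rounding, and any other point of the simplex should give a cost that is no smaller.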

7 Proof of Lemmas

7.1 Proof of Lemma 4.6

Due to $\textbf{1}_{n}^{\top}\pi=1$ and $\textbf{1}_{n}^{\top}\pi^{*}=1$ , the signal $\dot{\mu}$ of (10a) satisfies $\textbf{1}_{n}^{\top}\dot{\mu}\equiv 0$ . Hence, the system (10) satisfies $\textbf{1}_{n}^{\top}T(t)\equiv\textbf{1}_{n}^{\top}\mu(t)\equiv\textbf{1}_{n}^{\top}\mu(0)$ . Define

	$\displaystyle T^{H}(t)$	$\displaystyle:=\max\{T_{1}(t),T_{2}(t),\dots,T_{n}(t)\},$
	$\displaystyle T^{L}(t)$	$\displaystyle:=\min\{T_{1}(t),T_{2}(t),\dots,T_{n}(t)\}.$

When $\|T(t)\|\to\infty$ happens, $T^{H}(t)\to\infty$ and $T^{L}(t)\to-\infty$ hold due to the constraint $\textbf{1}_{n}^{\top}T(t)\equiv\textbf{1}_{n}^{\top}\mu(0)$ . Then, $(T^{H}(t)-T^{L}(t)+b^{L}-b^{H})\to\infty$ is also satisfied. Let us consider $\tau^{H}$ and $\tau^{L}$ defined in Lemma 6.18 under (4). Due to $\tau^{H}\geq T^{H}+b^{L}$ and $\tau^{L}\leq T^{L}+b^{H}$ , we have $\tau^{H}-\tau^{L}\geq T^{H}-T^{L}+b^{L}-b^{H}$ . Thus, $(\tau^{H}-\tau^{L})\to\infty$ holds when $\|T(t)\|\to\infty$ happens. From the above discussion and Lemma 6.18, when the system (10) achieves $\|T(t)\|\to\infty$ under $\pi(t)\in\textrm{int}(\mathcal{S})$ , $S(T+B(\pi),\pi)\to\infty$ holds. This completes the proof of Lemma 4.6.
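The claim that $\|T(t)\|\to\infty$ forces both $T^{H}(t)\to\infty$ and $T^{L}(t)\to-\infty$ can be made explicit as follows. In this sketch, $c:=\textbf{1}_{n}^{\top}\mu(0)$ is shorthand introduced only here, and $n\geq 2$ is assumed.

```latex
% Sketch only. c := 1_n^T mu(0) is notation for this example; n >= 2 is assumed.
% Since the entries of T(t) sum to c, the maximum and minimum straddle the mean,
% and the average of the remaining n-1 entries bounds the other extreme:
\begin{align*}
T^{L}(t) \leq \frac{c}{n} \leq T^{H}(t), \qquad
T^{L}(t) \leq \frac{c - T^{H}(t)}{n-1}, \qquad
T^{H}(t) \geq \frac{c - T^{L}(t)}{n-1}.
\end{align*}
% Hence, if T^{H}(t) \geq M then -T^{L}(t) \geq (M - c)/(n-1),
% and if -T^{L}(t) \geq M then T^{H}(t) \geq (M + c)/(n-1).
```

Since $\|T(t)\|\to\infty$ means that $\max\{T^{H}(t),-T^{L}(t)\}\to\infty$ , the last two inequalities show that the other extreme diverges as well.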

7.2 Proof of Lemma 4.12

Before we prove Lemma 4.12, we show the following lemma.

Lemma 7.20.

Consider the signal $T(t)$ generated by (13). For any constant $\pi^{*}\in\textrm{int}(\mathcal{S})$ and for any signal $\pi(t)\in\textrm{int}(\mathcal{S})$ , the signal $T^{H}(t):=\max\{T_{1}(t),T_{2}(t),\dots,T_{n}(t)\}$ is bounded.

Proof 7.21.

Suppose, for contradiction, that the system (13a) satisfies $\|\mu_{k}\|\to\infty$ for all $k\in\mathcal{A}$ . Since (13a) guarantees $\mu_{k}\leq 0$ , $\mu_{k}\to-\infty$ must hold for all $k\in\mathcal{A}$ . Then, there exists a time $\hat{t}>0$ such that $-\frac{\alpha}{\rho}\mu_{k}>\pi_{k}-\pi_{k}^{*}\ \forall k\in\mathcal{A}$ holds for any time $t\geq\hat{t}$ . In other words, (13a) follows $\dot{\mu}=\rho(\pi-\pi^{*})$ for any $t\geq\hat{t}$ . This yields $\textbf{1}_{n}^{\top}\dot{\mu}=0$ and hence $\sum_{l\in\mathcal{A}}\mu_{l}(t)$ must be constant for any time $t\geq\hat{t}$ . This contradicts the assumption that $\mu_{k}\to-\infty\ \forall k\in\mathcal{A}$ .

Therefore, the system (13a) guarantees at least $-\infty<\max\{\mu_{1}(t),\mu_{2}(t),\dots,\mu_{n}(t)\}$ . As a result, $T^{H}(t)$ is bounded since $\max\{\mu_{1}(t),\mu_{2}(t),\dots,\mu_{n}(t)\}+\bar{T}-\kappa<T^{H}(t)<\bar{T}+\kappa$ holds under (13b).

We now prove Lemma 4.12. Let us introduce the signals $T^{H}(t)$ and $T^{L}(t)$ , defined in Appendix 7.1. If (13) achieves $\|T(t)\|\to\infty$ , then $T^{L}(t)\to-\infty$ must hold. This yields $(T^{H}(t)-T^{L}(t))\to\infty$ from Lemma 7.20. Then, $(w^{H}T^{H}(t)-w^{L}T^{L}(t))\to\infty$ is also satisfied. Let us consider $\tau^{H}$ and $\tau^{L}$ defined in Lemma 6.18 under (5). Due to $\tau^{H}-\tau^{L}\geq w^{H}T^{H}-w^{L}T^{L}$ , $(\tau^{H}-\tau^{L})\to\infty$ holds when $\|T(t)\|\to\infty$ happens. Accordingly, Lemma 6.18 proves Lemma 4.12.

References

[1] G. Como, K. Savla, D. Acemoglu, M.A. Dahleh and E. Frazzoli, “Stability analysis of transportation networks with multiscale driver decisions,” SIAM Journal on Control and Optimization, vol. 51, no. 1, pp. 230–252, 2013.
[2] C.J. Day, B.F. Hobbs and J. Pang, “Oligopolistic competition in power networks: A conjectured supply function approach,” IEEE Transactions on Power Systems, vol. 17, no. 3, pp. 597–607, 2002.
[3] E. Bompard, Y.C. Ma, R. Napoli, G. Gross and T. Guler, “Comparative analysis of game theory models for assessing the performances of network constrained electricity markets,” IET Generation, Transmission & Distribution, vol. 4, no. 3, pp. 386–399, 2010.
[4] N. Li, L. Chen and M. Dahleh, “Demand response using linear supply function bidding,” IEEE Transactions on Smart Grid, vol. 6, no. 4, pp. 1827–1838, 2015.
[5] R. Sugden, “Rational choice: A survey of contributions from economics and philosophy,” The Economic Journal, vol. 101, no. 407, pp. 751–85, 1991.
[6] D. Kahneman and A. Tversky, “Prospect theory: An analysis of decision under risk,” Econometrica, vol. 47, no. 2, pp. 263–291, 1979.
[7] I. Brocas and J.D. Carrillo, “Dual-process theories of decision-making: A selective survey,” Journal of Economic Psychology, vol. 41, pp. 45–54, 2014.
[8] Y. Guan, A.M. Annaswamy and H.E. Tseng, “Cumulative prospect theory based dynamic pricing for shared mobility on demand services,” Proc. IEEE 58th Conference on Decision and Control, pp. 2239–2244, 2019.
[9] D.M. Mason, L. Stella and D. Bauso, “Evolutionary game dynamics for crowd behavior in emergency evacuations,” Proc. IEEE 59th Conference on Decision and Control, pp. 1672–1677, 2020.
[10] T. Kameda and D. Nakanishi, “Cost-benefit analysis of social/cultural learning in a nonstationary uncertain environment: An evolutionary simulation and an experiment with human subjects,” Evolution and Human Behavior, vol. 23, no. 5, pp. 373–393, 2002.
[11] D. Fukuda and S. Morichi, “Incorporating aggregate behavior in an individual’s discrete choice: An application to analyzing illegal bicycle parking behavior,” Transportation Research Part A, vol. 41, no. 4, pp. 313–325, 2007.
[12] J. Urata and E. Hato, “Modeling the cooperation network formation process for evacuation systems design in disaster areas with a focus on Japanese megadisasters,” Leadership and Management in Engineering, vol. 12, no. 4, pp. 231–246, 2012.
[13] K. Train, Discrete Choice Methods with Simulation, SUNY-Oswego, Department of Economics, 2003.
[14] K. Srinivasan and H. Mahmassani, “Modeling inertia and compliance mechanisms in route choice behavior under real-time information,” Transportation Research Record: Journal of the Transportation Research Board, no. 1725, pp. 45–53, 2000.
[15] J. Sagebiel, “Preference heterogeneity in energy discrete choice experiments: A review on methods for model selection,” Renewable and Sustainable Energy Reviews, vol. 69, pp. 804–811, 2017.
[16] E. Ramírez-Llanos and N. Quijano, “A population dynamics approach for the water distribution problem,” International Journal of Control, vol. 83, pp. 1947–1964, 2010.
[17] W. Mei, N.E. Friedkin, K. Lewis and F. Bullo, “Dynamic models of appraisal networks explaining collective learning,” IEEE Transactions on Automatic Control, vol. 63, no. 9, pp. 2898–2912, 2018.
[18] J. Hofbauer and E. Hopkins, “Learning in perturbed asymmetric games,” Games and Economic Behavior, vol. 52, no. 1, pp. 133–152, 2005.
[19] J. Hofbauer and W.H. Sandholm, “Evolution in games with randomly disturbed payoffs,” Journal of Economic Theory, vol. 132, no. 1, pp. 47–69, 2007.
[20] M.J. Smith, “The stability of a dynamic model of traffic assignment: An application of a method of Lyapunov,” Transportation Science, vol. 18, no. 3, pp. 245–252, 1984.
[21] W.H. Sandholm, “Pairwise comparison dynamics and evolutionary foundations for Nash equilibrium,” Games, vol. 1, no. 1, pp. 3–17, 2010.
[22] M.J. Fox and J.S. Shamma, “Population games, stable games, and passivity,” Games, vol. 4, no. 4, pp. 561–583, 2013.
[23] S. Park, J.S. Shamma and N.C. Martins, “Passivity and evolutionary game dynamics,” Proc. IEEE 57th Conference on Decision and Control, pp. 3553–3560, 2018.
[24] A. Pashaie, L. Pavel and C.J. Damaren, “A population game approach for dynamic resource allocation problems,” International Journal of Control, vol. 90, no. 9, pp. 1957–1972, 2017.
[25] Y. Cheng and C. Langbort, “On informational nudging for boundedly rational decision makers,” Proc. IEEE 57th Conference on Decision and Control, pp. 4591–4796, 2018.
[26] W. Zhao, H. Yang, X. Deng and C. Zhong, “Stability of equilibria for population games with uncertain parameters under bounded rationality,” Journal of Inequalities and Applications, vol. 2021, no. 15, 2021.
[27] J.J. Laffont and D. Martimort, The Theory of Incentives: The Principal-Agent Model, Princeton University Press, 2002.
[28] R.H. Thaler and C.R. Sunstein, Nudge: Improving Decisions about Health, Wealth, and Happiness, Yale University Press, 2008.
[29] L.J. Ratliff, R. Dong, H. Ohlsson and S.S. Sastry, “Incentive design and utility learning via energy disaggregation,” IFAC Proceedings Volumes, vol. 47, no. 3, pp. 3158–3163, 2014.
[30] M. Shakarami, A. Cherukuri and N. Monshizadeh, “Nudging the aggregative behavior of noncooperative agents,” Proc. IEEE 59th Conference on Decision and Control, pp. 2579–2584, 2020.
[31] J.S. Weitz, C. Eksin, K. Paarporn, S.P. Brown and W.C. Ratcliff, “An oscillating tragedy of the commons in replicator dynamics with game-environment feedback,” Proc. the National Academy of Sciences, vol. 113, no. 47, E7518–E7525, 2016.
[32] Y. Cheng and C. Langbort, “A model of informational nudging in transportation networks,” Proc. IEEE 55th Conference on Decision and Control, pp. 7598–7604, 2016.
[33] W.A. Brock and S.N. Durlauf, “Interactions-based models,” Handbook of Econometrics, vol. 5, pp. 3297–3380, 2001.
[34] S. Yamashita, T. Hatanaka, Y. Wasa, N. Hayashi, K. Hirata and K. Uchida, “Passivity-based analysis and nudging design for dynamics social model with bounded rationality,” IFAC-PapersOnLine, vol. 53, no. 5, pp. 338–343, 2020.
[35] T. Hatanaka, N. Chopra, M. Fujita and M.W. Spong, Passivity-Based Control and Estimation in Networked Robotics, Springer-Verlag, 2015.
[36] Z. Qu and M.A. Simaan, “Modularized design for cooperative control and plug-and-play operation of networked heterogeneous systems,” Automatica, vol. 50, no. 9, pp. 2405–2414, 2014.
[37] R. Sepulchre, M. Jankovic and P.V. Kokotovic, Constructive Nonlinear Control, Springer, London, 2012.
[38] H.K. Khalil, Nonlinear Systems, 3rd edition, Prentice Hall, 2002.
[39] S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, 2004.
[40] D. Shevitz and B. Paden, “Lyapunov stability theory of nonsmooth systems,” IEEE Transactions on Automatic Control, vol. 39, no. 9, pp. 1910–1914, 1994.
