Stability analysis and control of decision-making of miners in blockchain

Kosuke Toda Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan. Naomi Kuze Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan. Toshimitsu Ushio Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan.

Abstract

To maintain blockchain-based services with ensuring its security, it is an important issue how to decide a mining reward so that the number of miners participating in the mining increases. We propose a dynamical model of decision-making for miners using an evolutionary game approach and analyze the stability of equilibrium points of the proposed model. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. Through the analysis of the equilibrium points, we show the transcritical bifurcations and hysteresis phenomena of the equilibrium points. We also design a controller that determines the mining reward based on the number of participating miners to stabilize the state where all miners participate in the mining. Numerical simulation shows that there is a trade-off in the choice of the design parameters.

Keywords— B lockchain, Proof-of-work, Decision-making, Evolutionary Game, Bifurcation, Hysteresis, Feedback control.

1 Introduction

Blockchain is a distributed ledger technology for recording transactions that underlies various fields such as digital currency like Bitcoin [1], data sharing [2], and computer security [3]. Blockchain-based services use cryptography to record transactions as a chain of blocks. A block consists of a block header and transaction data. The block header contains a cryptographic hash of its previous block, which makes blockchain-based services resistant to tampering. In these services, participants called miners create blocks in a distributed manner, and the longest chain of blocks is considered to be legitimate. When a miner succeeds in creating a block, he/she gets a reward called a mining reward.

Blockchain-based services approve transactions through a consensus algorithm. As a consensus algorithm, proof-of-work (PoW) is typically used. In this algorithm, the mining difficulty is set using a scalar value called a nonce in the block header. To create a block, miners must find a nonce such that the cryptographic hash value for the previous block satisfies specific conditions. The process of creating blocks is called a mining. In general, a cryptographic hash value for a block is unique according to the nonce contained in the block. Moreover, a nonce that satisfies the specific conditions cannot be calculated directly. As a result, an exhaustive search imposes a large computational cost on miners, which contributes to the resistance to tampering. Because transaction approvals depend on miner calculations (such calculations are very costly and require a lot of energy [4, 5]), the participation of many miners is needed to maintain blockchain-based services and ensure blockchain system security [6, 7]. Therefore, it is important to analyze the decision-making problem of whether miners participate in the mining according to the energy consumption and mining rewards.

Game theory is used to analyze interactions among rational decision-makers. Many studies have adopted game theory to analyze blockchain-related issues with PoW [8], such as decision-making problems in the mining considering the energy consumption [9, 10]. Evolutionary game theory has been used as a powerful mathematical tool for analyzing dynamical models of evolutionary selection [11]. Dynamical characteristics of the selection process are modeled by replicator dynamics. Control methods for the replicator dynamics have been studied in [12, 13, 14]. Evolutionary game models and replicator dynamics are also used in analyzing blockchain-related issues such as mining pool selection problems [15, 16], and attack scenarios [17].

We previously focused on a decision-making problem of whether miners participate in the mining according to the energy consumption and the mining rewards, and modeled it as a non-cooperative game. Through theoretical and numerical analysis, we showed the property of Nash equilibria [18]. However, in this study, we assumed that, once miners choose a strategy (i.e., participation in the mining or not), they do not change their strategies. Practically, the miners may decide to participate in the mining dynamically based on their current earned mining rewards. It is important to analyze such a dynamical decision-making process.

In this paper, we propose a dynamical model of the decision-making problem for miners, by applying an evolutionary game approach. We analyze the stability of its equilibrium points and show the existence of transcritical bifurcations and hysteresis phenomena with the coexistence of two asymptotically stable equilibrium points: one corresponds to the state where all miners participate in the mining and the other to the state where the number of participating miners is minimum. The former equilibrium point is preferable to maintain blockchain-based services. We propose a controller that determines the mining reward based on the number of current participating miners so as to stabilize the equilibrium point if at least one miner participates in the initial time.

The remainder of this paper is organized as follows. In Section 2, we propose an evolutionary game-based dynamical model of the decision-making process. In Section 3, we analyze the stability of its equilibrium point. In Section 4, we design a state feedback controller to let all miners participate in the mining.

2 Dynamical model of decision-making

We assume that miners in a blockchain network are partitioned into two sets $\mathcal{M}$ and $\mathcal{N}$ , where miners in $\mathcal{M}$ always participate in the mining and those in $\mathcal{N}$ have two strategies, participating in the mining (strategy $s_{k}=1$ ) and not participating in the mining (strategy $s_{k}=0$ ), where $k\in\mathcal{N}$ . Note that $\mathcal{M}\cap\mathcal{N}=\emptyset$ . Denoted by $m$ and $n$ are the cardinalities of $\mathcal{M}$ and $\mathcal{N}$ , respectively (we assume $m\geq 1$ and $n\geq 1$ ). We define $x_{0}$ and $x_{1}$ as the ratios of miners in $\mathcal{N}$ that choose strategies $0$ and $1$ , respectively. Note that

\displaystyle x_{0}+x_{1}=1.

(2.1)

Miners need to find a nonce such that the first $h$ bits of the hash of the block are all $0$ . Then, $D=2^{h}$ is the difficulty parameter and $1/D$ is the probability that a miner creates a block with one hash calculation [19]. When miner $k\in\mathcal{M}\cup\mathcal{N}$ participates in the mining, he/she needs a cost $c$ per unit operating time. The average number $w_{k}=f_{k}(c)$ of hash queries calculated per unit operating time by miner $k$ depends on the cost $c$ , and we assume that $f_{k}(c)$ is the same for all miners. In this paper, for simplicity, we assume $f_{k}(c)=vc\,(v>0)$ for any $k\in\mathcal{M}\cup\mathcal{N}$ .

The mining of blocks can be described as a Poisson process [20, 21]. That is, the block creation time is exponentially distributed [22]. The rate $\lambda_{k}$ of the Poisson process of miner $k\in\mathcal{M}\cup\mathcal{N}$ is given by $\lambda_{k}=w_{k}/D$ [21]¹¹1 Note that a combination of independent Poisson processes is still a Poisson process. Thus, the rate of the Poisson process of all miners is written as $\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i}$ . . If miner $k$ chooses $s_{k}=1$ , then the rate of the Poisson process is $\lambda_{k}=s_{k}f_{k}(c)/D=s_{k}c/d$ (we define $d=D/v$ , in this paper). Let $R$ be the mining reward. Based on the previous work [18], the expected reward $R_{k}$ and the expected cost $CS_{k}$ for the mining of miner $k$ are calculated as follows.

	$\displaystyle R_{k}=\frac{\lambda_{k}}{\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i}}R=\frac{Rs_{k}}{m+nx_{1}},$		(2.2)
	$\displaystyle CS_{k}=\frac{c\lambda_{k}}{(\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i})^{2}}=\frac{ds_{k}}{(m+nx_{1})^{2}}.$		(2.3)

We define the utility function $u_{i}(x_{0},x_{1})$ of miners that choose the strategy $i\in\{0,\ 1\}$ as

\displaystyle u_{i}(x_{0},x_{1})=\begin{cases}0&\mbox{if}\;\;i=0,\\ \frac{1}{m+nx_{1}}\left(R-\frac{d}{m+nx_{1}}\right)&\mbox{if}\;\;i=1,\end{cases}

(2.4)

which means that the utility of a miner who participates in the mining is the difference between the expected reward $R_{k}$ and the expected cost $CS_{k}$ . Based on the principle of the evolutionary game [11], the dynamics of the ratio of miners that choose the strategy $i$ is given by

\displaystyle\frac{\dot{x}_{i}}{x_{i}}=u_{i}(x_{0},x_{1})-\bar{u}(x_{0},x_{1})\;(i=0,1),

(2.5)

where $\bar{u}(x_{0},x_{1})=\sum_{i=0}^{1}x_{i}u_{i}(x_{0},x_{1})$ is the average utility of all miners. According to (2.4), (2.5) is rewritten as

\displaystyle\dot{x}_{1}=-\dot{x}_{0}=\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R-\frac{d}{m+nx_{1}}\right)\eqqcolon\varphi_{R}(x_{1}).

(2.6)

Thus, the dynamics of the decision-making of miners is described by the above 1st-order differential equation and the reward $R$ plays an important role in the decision-making of the miners for the participation in the mining. In the following, we investigate stability and stabilization of equilibrium points of (2.6). For that purpose, the concept of a basin of attraction [23] is important. Let $\xi(t;x_{1}^{\mathrm{init}})$ be the solution of (2.6) that starts from an initial state $x_{1}^{\mathrm{init}}$ at time $t=0$ . For a given asymptotically stable equilibrium point $x_{1}^{\prime}$ of (2.6), the basin of attraction is defined as the set of all initial states $x_{1}^{\mathrm{init}}$ such that $\xi(t;x_{1}^{\mathrm{init}})$ is defined for all $t\geq 0$ and $\lim_{t\to\infty}\xi(t;x_{1}^{\mathrm{init}})=x_{1}^{\prime}$ .

3 Stability analysis

In this section, we investigate the stability of the equilibrium point $x_{1}=0,\ 1,\ x_{1}^{*}$ of (2.6), where

\displaystyle x_{1}^{*}=\frac{1}{n}\left(\frac{d}{R}-m\right).

(3.1)

When $1/(m+n)<R/d<1/m$ , the equilibrium point $x_{1}^{*}$ satisfies $0<x_{1}^{*}<1$ . This equilibrium point is the state where the utility $u_{1}(1-x_{1}^{*},x_{1}^{*})$ is equal to $0$ , that is, the utility for the strategy $0$ is equal to that for the strategy $1$ .

We investigate the local stability of the three equilibrium points. The derivative of $\varphi_{R}(x_{1})$ with respect to $x_{1}$ is

	$\displaystyle\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}=\left(-\frac{x_{1}}{m+nx_{1}}+\frac{1-x_{1}}{m+nx_{1}}-\frac{nx_{1}(1-x_{1})}{(m+nx_{1})^{2}}\right)$
	$\displaystyle\hskip 56.9055pt\times\left(R-\frac{d}{m+nx_{1}}\right)+\frac{dnx_{1}(1-x_{1})}{(m+nx_{1})^{3}}.$		(3.2)

Thus, we obtain

	$\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right\|_{x_{1}=0}=\frac{1}{m}\left(R-\frac{d}{m}\right),$		(3.3)
	$\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right\|_{x_{1}=1}=-\frac{1}{m+n}\left(R-\frac{d}{m+n}\right),$		(3.4)
	$\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right\|_{x_{1}=x_{1}^{*}}=\frac{R^{3}}{nd^{2}}\left(\frac{d}{R}-m\right)\left((m+n)-\frac{d}{R}\right).$		(3.5)

Table 1: The relation between

R/d

and the stability of equilibrium points.

Condition for $R/d$	$x_{1}=0$	$x_{1}=1$	$x_{1}=x_{1}^{*}$
$R/d<1/(m+n)$	S	U	S
$1/(m+n)<R/d<1/m$	S	S	U
$R/d>1/m$	U	S	S

Thus, we have their stability conditions as shown in Table 1, where S (resp. U) represents an asymptotically stable (resp. unstable) point.

Refer to caption — Figure 1: The $m-R/d$ parameter plane where $n$ is fixed to $n=2$ .

Fig. 1 shows the $m-R/d$ parameter plane with $n=2$ where the meaning of each region is as follows. In the region $(A)$ , both $x_{1}=1$ and $x_{1}^{*}<0$ are asymptotically stable equilibrium points, and the basin of attraction of $x_{1}=1$ is $(0,\ \infty)$ , that is, every solution of (2.6) starting in $(0,1]$ converges to $1$ . In the region $(B)$ , both $x_{1}=0$ and $x_{1}=1$ are asymptotically stable equilibrium points, and basins of attraction of $x_{1}=0$ and $x_{1}=1$ are $(-\infty,\ x_{1}^{*})$ and $(x_{1}^{*},\ \infty)$ , respectively, that is, every solution of (2.6) starting in $[0,\ x_{1}^{*})$ converges to $0$ , and every solution of (2.6) starting in $(x_{1}^{*},\ 1]$ converges to $1$ . In the region $(C)$ , both $x_{1}=0$ and $x_{1}^{*}>0$ are asymptotically stable equilibrium points, and the basin of attraction of $x_{1}=0$ is $(-\infty,\ 1)$ , that is, every solution of (2.6) starting in $[0,\ 1)$ converges to $0$ .

Shown in Fig. 2 is a bifurcation diagram with respect to the bifurcation parameter $R/d$ , where $m=n=2$ . The solid (resp. dashed) line represents an asymptotically stable (resp. unstable) equilibrium point. Two curves of equilibrium points pass through $(x_{1},R)=(1,d/(m+n))$ (resp. $(x_{1},R)=(0,d/m)$ ), one given by $x_{1}=x_{1}^{*}$ , the other by $x_{1}=1$ (resp. $x_{1}=0$ ). Both curves exist on both sides of $R=d/(m+n)$ (resp. $R=d/m$ ). The stability along each curve exchanges on passing through $R=d/(m+n)$ (resp. $R=d/m$ ). Thus, the exchange of stability (known as a transcritical bifurcation) [24] is observed when $(x_{1},R)=(0,d/m),(1,d/(m+n))$ . We show in A that the vector field (2.6) satisfies the condition of the transcritical bifurcation shown in [24].

Since the values of $x_{i}\,(i=0,1)$ satisfy $0\leq x_{i}\leq 1$ , we observe jump phenomena owing to these transcritical bifurcations. Moreover, $R>d/m$ needs to be satisfied so that all miners in $\mathcal{N}$ participate in the mining. It is noted that, once the miners participate in the mining, they continue to participate in the mining until the reward $R$ becomes $d/(m+n)$ . Thus, a hysteresis phenomenon of the equilibrium points is observed.

Fig. 3 shows trajectories of (2.6) from an initial state $x_{1}^{\mathrm{init}}=0.1$ (blue) and $x_{1}^{\mathrm{init}}=0.9$ (red). When $d/(m+n)<R<d/m$ , both $x_{1}=0$ and $x_{1}=1$ are asymptotically stable points whose basins of attraction are $(-\infty,\ x_{1}^{*})$ and $(x_{1}^{*},\ \infty)$ , respectively. Thus, the number of miners who participate in the mining converges to $0$ if the initial ratio is less than $x_{1}^{*}$ because their utility is negative and they prefer non-participation.

4 Stabilization

The result in Section 3 implies that no miner in $\mathcal{N}$ participates in the mining in the steady state when the mining reward $R^{*}$ satisfies $R^{*}<d/(m+n)$ . When the mining reward $R^{*}$ satisfies $d/(m+n)<R^{*}<d/m$ , miners’ behaviors depend on the initial state $x_{1}^{\mathrm{init}}$ , i.e., no miner in $\mathcal{N}$ participates in the mining in the steady state when $x_{1}^{\mathrm{init}}<x_{1}^{*}$ . We propose a state feedback controller to adjust the reward based on the ratio $x_{1}$ so that all miners in $\mathcal{N}$ participate in the mining, i.e., let every trajectory of $x_{1}$ with its initial state in $(0,\ 1]$ converge to $1$ .

4.1 Case where $R^{*}<d/(m+n)$

First, we show that $x_{1}=1$ cannot be stabilized when $R^{*}<d/(m+n)$ . We consider the following state feedback controller $R_{1}(x_{1})$ that adjusts the reward based on the ratio $x_{1}$ .

\displaystyle R=R_{1}(x_{1}),\;R_{1}(1)=R^{*}<\frac{d}{m+n}.

(4.1)

The controlled trajectory of $x_{1}$ by (4.1) is described by

\displaystyle\dot{x}_{1}=\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R_{1}(x_{1})-\frac{d}{m+n}\right)\eqqcolon\psi_{R}(x_{1}).

(4.2)

The derivative of $\psi_{R}(x_{1})$ with respect to $x_{1}$ is

	$\displaystyle\frac{\partial\psi_{R}(x_{1})}{\partial x_{1}}=\left(-\frac{x_{1}}{m+nx_{1}}+\frac{1-x_{1}}{m+nx_{1}}-\frac{nx_{1}(1-x_{1})}{(m+nx_{1})^{2}}\right)$
	$\displaystyle\hskip 42.67912pt\times\left(R_{1}(x_{1})-\frac{d}{m+nx_{1}}\right)$
	$\displaystyle\hskip 48.36967pt+\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(\frac{\partial R_{1}(x_{1})}{\partial x_{1}}+\frac{dn}{(m+nx_{1})^{2}}\right).$		(4.3)

We obtain

\displaystyle\left.\frac{\partial\psi_{R}(x_{1})}{\partial x_{1}}\right|_{x_{1}=1}=-\frac{1}{m+n}\left(R_{1}(1)-\frac{d}{m+n}\right)>0.

(4.4)

Therefore, the unstable equilibrium point $x_{1}=1$ cannot be stabilized even if the feedback controller is used.

4.2 Case where $d/(m+n)<R^{*}<d/m$

Next, we show that $x_{1}=1$ can be an asymptotically stable equilibrium point whose basin of attraction is $(0,\ 1]$ with a state feedback controller. We introduce the following state feedback controller $R_{2}(x_{1})$ to adjust the reward based on the ratio $x_{1}$ ,

\displaystyle R=R_{2}(x_{1})=R^{*}+\Delta R(x_{1}),\;\Delta R(1)=0,

(4.5)

and let every trajectory of $x_{1}$ with its initial state in $(0,\ 1]$ converge to $1$ .

4.2.1 The condition of the feedback gain

We give $\bar{x}_{1}$ satisfying $x_{1}^{*}<\bar{x}_{1}\leq 1$ and $\varepsilon>0$ . For a given reward $R^{*}\in(d/(m+n),d/m)$ , let $\Delta R(x_{1})$ be

\displaystyle\Delta R(x_{1})=\begin{cases}K(\bar{x}_{1}-x_{1})&\mbox{if}\;\;x_{1}<x_{1}^{*}+\varepsilon,\\ 0&\mbox{otherwise},\end{cases}

(4.6)

where $K>0$ is a feedback gain. We obtain a condition for the gain $K$ and $\varepsilon$ such that every trajectory of $x_{1}$ with its initial state in $(0,\ 1]$ converges to $1$ as in Proposition 1.

Proposition 1.

Assume $d/(m+n)<R^{*}<d/m$ . Let $\zeta_{R}(x_{1})$ be

	$\displaystyle\zeta_{R}(x_{1})$	$\displaystyle\coloneqq-Knx_{1}^{2}+(Kn\bar{x}_{1}-Km+R^{*}n)x_{1}$
		$\displaystyle\hskip 71.13188pt+(R^{*}m+Km\bar{x}_{1}-d),$		(4.7)

and let $\alpha,\beta\;(\alpha<\beta)$ be real solutions of the quadratic equation $\zeta_{R}(x_{1})=0$ . Then, every trajectory of $x_{1}$ with its initial state in $(0,\ 1]$ converges to $1$ if the gain $K$ and $\varepsilon$ satisfy

	$\displaystyle K>\frac{d-R^{*}m}{m\bar{x}_{1}}\ (>0),$		(4.8)
	$\displaystyle 0<\varepsilon\begin{cases}<\beta-x_{1}^{}&{\rm if}\;\;\beta<1,\\ \leq 1-x_{1}^{}&{\rm if}\;\;\beta\geq 1.\end{cases}$		(4.9)

Proof.

With the controller (4.5) and (4.6), the dynamics of $x_{1}$ ( $x_{1}<x_{1}^{*}+\varepsilon$ ) is given by

	$\displaystyle\dot{x}_{1}=\eta_{R}(x_{1}),$		(4.10)
	$\displaystyle\eta_{R}(x_{1})\coloneqq\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R^{*}+K(\bar{x}_{1}-x_{1})-\frac{d}{m+nx_{1}}\right).$		(4.11)

According to (4.7), $\eta_{R}(x_{1})$ can be rewritten as

\displaystyle\eta_{R}(x_{1})=\frac{x_{1}(1-x_{1})\zeta_{R}(x_{1})}{(m+nx_{1})^{2}}.

(4.12)

First, we prove that the quadratic equation $\zeta_{R}(x_{1})=0$ has two distinct real solutions under (4.8). We have

\displaystyle\zeta_{R}(x_{1}^{*})=K(\bar{x}_{1}-x_{1}^{*})(m+nx_{1})>0,

(4.13)

which implies with (4.8) that the quadratic equation $\zeta_{R}(x_{1})=0$ has two distinct real solutions $\alpha,\beta$ satisfying $\alpha<x_{1}^{*}<\beta$ .

Next, we prove $\alpha<0$ under (4.8). We obtain

	$\displaystyle\zeta_{R}(0)$	$\displaystyle=m\bar{x}_{1}K-(d-R^{*}m)$
		$\displaystyle>m\bar{x}_{1}\frac{d-R^{}m}{m\bar{x}_{1}}-(d-R^{}m)=0,$		(4.14)

from (4.7) and (4.8). Since $\zeta_{R}(x_{1})$ is a convex upward quadratic function, the smaller solution $\alpha$ of $\zeta_{R}(x_{1})=0$ satisfies $\alpha<0$ .

Finally, we prove that the system controlled by (4.5) satisfies $\dot{x}_{1}>0$ for any $x_{1}\in(0,\ 1)$ under (4.8) and (4.9). When $\beta<1$ , $\eta_{R}(x_{1})>0$ for any $x_{1}\in(0,\ \beta)$ from (4.12). It is obvious that $\dot{x}_{1}>0$ for any $x_{1}\in(0,x_{1}^{*}+\varepsilon)$ from (4.9). For any $x_{1}\in[x_{1}^{*}+\varepsilon,1)$ , $\dot{x}_{1}>0$ because $K=0$ . Thus,

\displaystyle\dot{x}_{1}>0\;\mbox{for}\;\mbox{any}\;x_{1}\in(0,\;1).

(4.15)

Similary, it is also shown by (4.9) that (4.15) holds for any $\beta\geq 1$ . Therefore, every trajectory of $x_{1}$ with its initial state in $(0,\ 1]$ converges to $1$ under (4.8) and (4.9). ∎

It is noted that $\bar{x}_{1}<\beta$ since $d/(m+n)<R^{*}$ . So, (4.6) is continuous if $\varepsilon=\bar{x}_{1}-x_{1}^{*}$ .

4.2.2 Performance evaluation

In this section, we provide the numerical analysis of the controller. We consider the case where $m=n=2$ , $d=100$ , and $R^{*}=40$ . Then, we have $x_{1}^{*}=0.25$ from (3.1). Let the initial state of $x_{1}$ be $x_{1}^{\mathrm{init}}=0.1$ . We consider the following two cases where $K$ and $\varepsilon$ satisfy (4.8) and (4.9).

Case 1): $\bar{x}_{1}=0.26,\;\varepsilon=0.005,\;K=56.8125$ .
Case 2): $\bar{x}_{1}=1,\;\varepsilon=0.75,\;K=10.1$ .

Fig. 4 shows trajectories of the state and the reward. The red and blue lines represent the trajectories of Cases 1) and 2), respectively. In Case 1), it takes a longer time than Case 2) for the state $x_{1}$ to converge to $1$ , but the reward $R_{2}(x_{1})$ returns to the original value $R^{*}$ quickly. Note that $R_{2}(x_{1})$ in Case 1) is not continuous because we switch the input $\Delta R(x_{1})$ to $0$ when $x_{1}=x_{1}^{*}+\varepsilon$ (see (4.6)). In Case 2), the state $x_{1}$ converges to $1$ quickly, but it takes longer time than Case 1) for the reward $R_{2}(x_{1})$ to return to its original value $R^{*}$ . Thus, there is a trade-off in the choice of the design parameters $\bar{x}_{1}$ and $\varepsilon$ .

5 Conclusion

We proposed a dynamical model of the decision-making of miners in the blockchain. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. We analyzed the stability of its equilibrium points. We showed the occurrence of the transcritical bifurcations and observed a hysteresis phenomenon. We also proposed a feedback controller and showed that it can stabilize the state where all miners participate in the mining from any non-zero initial participation ratio of the miners. Our future work is to extend our model to the case where miners’ computational performances are different from each other.

Acknowledgements

This research was supported by JST ERATO JPMJER1603.

Appendix A Transcritical bifurcation

We consider the following system.

\displaystyle\dot{x}=f(x,\mu),\;\;x\in\mathbb{R},\;\;\mu\in\mathbb{R}.

(A.1)

We assume that

\displaystyle f(x,\mu)=xF(x,\mu),

(A.2)

where $F:\mathbb{R}\times\mathbb{R}\to\mathbb{R}$ satisfies the following condition.

\displaystyle F(x,\mu)\coloneqq\begin{cases}\frac{f(x,\mu)}{x}&x\neq 0,\\ \frac{\partial f(0,\mu)}{\partial x}&x=0.\end{cases}

(A.3)

Then, it is shown in [24] that (A.1) undergoes a transcritical bifurcation at $(x,\mu)=(0,0)$ if the following three conditions hold.

(T1): $f(0,0)=0,\;\frac{\partial f(0,0)}{\partial x}=0$ ,
(T2): $\frac{\partial f(0,0)}{\partial\mu}=0$ ,
(T3): $\frac{\partial^{2}f(0,0)}{\partial x\partial\mu}\neq 0,\;\frac{\partial^{2}f(0,0)}{\partial x^{2}}\neq 0$ .

Thus we will show that (2.6) satisfies the above three conditions at $(x_{1},R)=(0,d/m),\ (1,d/(m+n))$ .

A.1 Case where $(x_{1},R)=(0,d/m)$

First, we consider the following coordination transformation by which $(x_{1},R)=(0,d/m)$ is transformed to $(x,\mu)=(0,0)$ .

\displaystyle\left(\begin{array}[]{c}x_{1}\\ R\end{array}\right)=\left(\begin{array}[]{c}x\\ \mu\end{array}\right)+\left(\begin{array}[]{c}0\\ \frac{d}{m}\end{array}\right).

(A.10)

Then, we define

	$\displaystyle f(x,\mu)$	$\displaystyle\coloneqq\varphi_{\mu+\frac{d}{m}}(x)$
		$\displaystyle=\frac{x(1-x)}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)=xF(x,\mu),$		(A.11)

where the function $F$ is defined by

\displaystyle F(x,\mu)\coloneqq\frac{1-x}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right).

(A.12)

Then, we have

$\displaystyle\frac{\partial f(x,\mu)}{\partial x}$	$\displaystyle=F(x,\mu)+x\frac{\partial F(x,\mu)}{\partial x}$
	$\displaystyle=\left(-\frac{x}{m+nx}+\frac{1-x}{m+nx}-\frac{nx(1-x)}{(m+nx)^{2}}\right)$
	$\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)+\frac{dnx(1-x)}{(m+nx)^{3}}.$	(A.13)

It is obvious that

\displaystyle F(x,\mu)=\frac{f(x,\mu)}{x}

(A.14)

when $x\neq 0$ and

\displaystyle F(0,\mu)=\frac{\mu}{m}=\frac{\partial f(0,\mu)}{\partial x}\;\;(\because\eqref{eq:A_phi_deriv_x})

(A.15)

when $x=0$ . Thus, the function $f$ defined by (A.11) satisfies (A.2) and (A.3).

Next, we show that $f(x,\mu)$ satisfies the conditions (T1) – (T3). We obtain

	$\displaystyle\frac{\partial f(x,\mu)}{\partial\mu}=\frac{x(1-x)}{m+nx},$		(A.16)
	$\displaystyle\frac{\partial^{2}f(x,\mu)}{\partial x\partial\mu}=-\frac{x}{m+nx}+\frac{1-x}{m+nx}-\frac{nx(1-x)}{(m+nx)^{2}},$		(A.17)
	$\displaystyle\frac{\partial^{2}f(x,\mu)}{\partial x^{2}}=\left(\frac{n^{2}x(1-x)}{(m+nx)^{2}}-\frac{n(1-x)}{m+nx}+\frac{nx}{m+nx}-1\right)$
	$\displaystyle\hskip 14.22636pt\times\frac{2}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)+\frac{2dn}{(m+nx)^{3}}$
	$\displaystyle\hskip 28.45274pt\times\left(\frac{-2nx(1-x)}{m+nx}-x+(1-x)\right).$		(A.18)

Thus, $f(x,\mu)$ satisfies the conditions (T1) – (T3) because

$\displaystyle f(0,0)$	$\displaystyle=0,$	(A.19)
$\displaystyle\frac{\partial f(0,0)}{\partial x}$	$\displaystyle=0,$	(A.20)
$\displaystyle\frac{\partial f(0,0)}{\partial\mu}$	$\displaystyle=0,$	(A.21)
$\displaystyle\frac{\partial^{2}f(0,0)}{\partial x\partial\mu}$	$\displaystyle=\frac{1}{m}\neq 0,$	(A.22)
$\displaystyle\frac{\partial^{2}f(0,0)}{\partial x^{2}}$	$\displaystyle=\frac{2dn}{m^{3}}\neq 0.$	(A.23)

A.2 Case where $(x_{1},R)=(1,d/(m+n))$

It is noted that the dynamics of $x_{0}$ is written as follows.

	$\displaystyle\dot{x}_{0}$	$\displaystyle=-\frac{x_{0}(1-x_{0})}{m+n(1-x_{0})}\left(R-\frac{d}{m+n(1-x_{0})}\right)$
		$\displaystyle=-\varphi_{R}(1-x_{0})$		(A.24)

because $x_{0}$ and $x_{1}$ satisfies (2.1). Thus, it is sufficient to show that (A.24) undergoes a transcirtical bifurcation at $(x_{0},R)=(0,d/(m+n))$ . We consider the following coordination transformation by which $(x_{0},R)=(0,d/(m+n))$ is transformed to $(x,\mu)=(0,0)$ .

\displaystyle\left(\begin{array}[]{c}x_{0}\\ R\end{array}\right)=\left(\begin{array}[]{c}x\\ \mu\end{array}\right)+\left(\begin{array}[]{c}0\\ \frac{d}{m+n}\end{array}\right).

(A.31)

Then, we define

$\displaystyle f(x,\mu)$	$\displaystyle\coloneqq-\varphi_{\mu+\frac{d}{m+n}}(1-x)$
	$\displaystyle=-\frac{x(1-x)}{m+n(1-x)}$
	$\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right)$
	$\displaystyle=xF(x,\mu),$	(A.32)

where the function $F$ is defined by

	$\displaystyle F(x,\mu)$	$\displaystyle\coloneqq-\frac{1-x}{m+n(1-x)}$
		$\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right).$		(A.33)

Then, we have

$\displaystyle\frac{\partial f(x,\mu)}{\partial x}$	$\displaystyle=-\left(\frac{-x+(1-x)}{m+n(1-x)}+\frac{nx(1-x)}{(m+n(1-x))^{2}}\right)$
	$\displaystyle\hskip 14.22636pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right)$
	$\displaystyle\hskip 28.45274pt+\frac{dnx(1-x)}{(m+n(1-x))^{3}}.$	(A.34)

It is obvious that

\displaystyle F(x,\mu)=\frac{f(x,\mu)}{x}

(A.35)

when $x\neq 0$ and

\displaystyle F(0,\mu)=-\frac{\mu}{m+n}=\frac{\partial f(0,\mu)}{\partial x}\;\;(\because\eqref{eq:A_phi_1_deriv_x})

(A.36)

when $x=0$ . Thus, the function $f$ defined by (A.32) satisfies (A.2) and (A.3).

In the same way as Appendix A.1, we obtain the partial derivatives of $f(x,\mu)$ and show that $f(x,\mu)$ defined by (A.24) satisfies the conditions (T1) – (T3) because

$\displaystyle f(0,0)$	$\displaystyle=0,$	(A.37)
$\displaystyle\frac{\partial f(0,0)}{\partial x}$	$\displaystyle=0,$	(A.38)
$\displaystyle\frac{\partial f(0,0)}{\partial\mu}$	$\displaystyle=0,$	(A.39)
$\displaystyle\frac{\partial^{2}f(0,0)}{\partial x\partial\mu}$	$\displaystyle=-\frac{1}{m+n}\neq 0,$	(A.40)
$\displaystyle\frac{\partial^{2}f(0,0)}{\partial x^{2}}$	$\displaystyle=\frac{2dn}{(m+n)^{3}}\neq 0.$	(A.41)

Therefore, (2.6) undergoes transcritical bifurcations at $(x_{1},R)=(0,d/m),\ (1,d/(m+n))$ .

References

[1] S. Nakamoto, “Bitcoin: A peer-to-peer electronic cash system,” 2008. [Online]. Available: http://bitcoin.org/bitcoin.pdf
[2] Q. Xia, E. B. Sifah, K. O. Asamoah, J. Gao, X. Du, and M. Guizani, “MeDShare: Trust-less medical data sharing among cloud service providers via blockchain,” IEEE Access, vol. 5, pp. 14 757–14 767, 2017.
[3] A. Ouaddah, A. A. Elkalam, and A. A. Ouahman, “Fairaccess: a new blockchain-based access control framework for the internet of things,” Security and Communication Networks, vol. 9, no. 18, pp. 5943–5964, 2016.
[4] J. Truby, “Decarbonizing bitcoin: Law and policy choices for reducing the energy consumption of blockchain technologies and digital currencies,” Energy Research & Social Science, vol. 44, pp. 399–410, 2018.
[5] “Cambridge bitcoin electricity consumption index,” (accessed on 1 March 2021). [Online]. Available: https://www.cbeci.org/
[6] W. Cai, Z. Wang, J. B. Ernst, Z. Hong, C. Feng, and V. C. M. Leung, “Decentralized applications: The blockchain-empowered software system,” IEEE Access, vol. 6, pp. 53 019–53 033, 2018.
[7] Y. Liu, Z. Fang, M. H. Cheung, W. Cai, and J. Huang, “A social welfare maximization mechanism for blockchain storage,” arXiv preprint arXiv:2103.05866, 2021.
[8] Z. Liu, N. C. Luong, W. Wang, D. Niyato, P. Wang, Y. Liang, and D. I. Kim, “A survey on blockchain: A game theoretical perspective,” IEEE Access, vol. 7, pp. 47 615–47 643, 2019.
[9] N. Dimitri, “Bitcoin mining as a contest,” Ledger, vol. 2, pp. 31–37, 2017.
[10] A. Fiat, A. Karlin, E. Koutsoupias, and C. Papadimitriou, “Energy equilibria in proof-of-work mining,” in Proceedings of the 2019 ACM Conference on Economics and Computation, 2019, pp. 489–502.
[11] J. W. Weibull, Evolutionary game theory. MIT press, 1997.
[12] T. Kanazawa, H. Goto, and T. Ushio, “Replicator dynamics with dynamic payoff reallocation based on the government’s payoff,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E91-A, no. 9, pp. 2411–2418, 2008.
[13] T. Kanazawa, Y. Fukumoto, T. Ushio, and T. Misaka, “Replicator dynamics with pigovian subsidy and capitation tax,” Nonlinear Analysis, Theory, Methods and Applications, vol. 71, no. 12, pp. e818–e826, 2009.
[14] T. Morimoto, T. Kanazawa, and T. Ushio, “Subsidy-based control of heterogeneous multiagent systems modeled by replicator dynamics,” IEEE Transactions on Automatic Control, vol. 61, no. 10, pp. 3158–3163, 2016.
[15] X. Liu, W. Wang, D. Niyato, N. Zhao, and P. Wang, “Evolutionary game for mining pool selection in blockchain networks,” IEEE Wireless Communications Letters, vol. 7, no. 5, pp. 760–763, 2018.
[16] K. Fujita, Y. Zhang, M. Sasabe, and S. Kasahara, “Mining pool selection problem in the presence of block withholding attack,” in Proceedings of 2020 IEEE International Conference on Blockchain, 2020, pp. 321–326.
[17] S. Kim and S. G. Hahn, “Mining pool manipulation in blockchain network over evolutionary block withholding attack,” IEEE Access, vol. 7, pp. 144 230–144 244, 2019.
[18] K. Toda, N. Kuze, and T. Ushio, “Game-theoretic approach to a decision-making problem for blockchain mining,” IEEE Control Systems Letters, vol. 5, no. 5, pp. 1783–1788, 2021.
[19] J. Debus, “Consensus methods in blockchain systems,” FSBC Working Paper, 2017. [Online]. Available: http://www.fs-blockchain.de/
[20] N. Houy, “The bitcoin mining game,” Ledger, vol. 1, pp. 53–68, 2016.
[21] D. Kraft, “Difficulty control for blockchain-based consensus systems,” Peer-to-Peer Networking and Applications, vol. 9, no. 2, pp. 397–413, 2016.
[22] S. Kasahara and J. Kawahara, “Effect of bitcoin fee on transaction-confirmation process,” Journal of Industrial & Management Optimization, vol. 15, no. 1, pp. 365–386, 2019.
[23] H. K. Khalil, Nonlinear systems, 3rd ed. Prentice Hall, 2002.
[24] S. Wiggins, Introduction to applied nonlinear dynamical systems and chaos, 2nd ed. Springer, 2003.