This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Stability analysis and control of decision-making of miners in blockchain

Kosuke Toda Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan. Naomi Kuze Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan. Toshimitsu Ushio Graduate School of Engineering Science, Osaka University, Machikaneyama 1-3, Toyonaka-shi, Osaka, 560–8531, Japan.
Abstract

To maintain blockchain-based services with ensuring its security, it is an important issue how to decide a mining reward so that the number of miners participating in the mining increases. We propose a dynamical model of decision-making for miners using an evolutionary game approach and analyze the stability of equilibrium points of the proposed model. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. Through the analysis of the equilibrium points, we show the transcritical bifurcations and hysteresis phenomena of the equilibrium points. We also design a controller that determines the mining reward based on the number of participating miners to stabilize the state where all miners participate in the mining. Numerical simulation shows that there is a trade-off in the choice of the design parameters.

Keywords— B lockchain, Proof-of-work, Decision-making, Evolutionary Game, Bifurcation, Hysteresis, Feedback control.

1 Introduction

Blockchain is a distributed ledger technology for recording transactions that underlies various fields such as digital currency like Bitcoin [1], data sharing [2], and computer security [3]. Blockchain-based services use cryptography to record transactions as a chain of blocks. A block consists of a block header and transaction data. The block header contains a cryptographic hash of its previous block, which makes blockchain-based services resistant to tampering. In these services, participants called miners create blocks in a distributed manner, and the longest chain of blocks is considered to be legitimate. When a miner succeeds in creating a block, he/she gets a reward called a mining reward.

Blockchain-based services approve transactions through a consensus algorithm. As a consensus algorithm, proof-of-work (PoW) is typically used. In this algorithm, the mining difficulty is set using a scalar value called a nonce in the block header. To create a block, miners must find a nonce such that the cryptographic hash value for the previous block satisfies specific conditions. The process of creating blocks is called a mining. In general, a cryptographic hash value for a block is unique according to the nonce contained in the block. Moreover, a nonce that satisfies the specific conditions cannot be calculated directly. As a result, an exhaustive search imposes a large computational cost on miners, which contributes to the resistance to tampering. Because transaction approvals depend on miner calculations (such calculations are very costly and require a lot of energy [4, 5]), the participation of many miners is needed to maintain blockchain-based services and ensure blockchain system security [6, 7]. Therefore, it is important to analyze the decision-making problem of whether miners participate in the mining according to the energy consumption and mining rewards.

Game theory is used to analyze interactions among rational decision-makers. Many studies have adopted game theory to analyze blockchain-related issues with PoW [8], such as decision-making problems in the mining considering the energy consumption [9, 10]. Evolutionary game theory has been used as a powerful mathematical tool for analyzing dynamical models of evolutionary selection [11]. Dynamical characteristics of the selection process are modeled by replicator dynamics. Control methods for the replicator dynamics have been studied in [12, 13, 14]. Evolutionary game models and replicator dynamics are also used in analyzing blockchain-related issues such as mining pool selection problems [15, 16], and attack scenarios [17].

We previously focused on a decision-making problem of whether miners participate in the mining according to the energy consumption and the mining rewards, and modeled it as a non-cooperative game. Through theoretical and numerical analysis, we showed the property of Nash equilibria [18]. However, in this study, we assumed that, once miners choose a strategy (i.e., participation in the mining or not), they do not change their strategies. Practically, the miners may decide to participate in the mining dynamically based on their current earned mining rewards. It is important to analyze such a dynamical decision-making process.

In this paper, we propose a dynamical model of the decision-making problem for miners, by applying an evolutionary game approach. We analyze the stability of its equilibrium points and show the existence of transcritical bifurcations and hysteresis phenomena with the coexistence of two asymptotically stable equilibrium points: one corresponds to the state where all miners participate in the mining and the other to the state where the number of participating miners is minimum. The former equilibrium point is preferable to maintain blockchain-based services. We propose a controller that determines the mining reward based on the number of current participating miners so as to stabilize the equilibrium point if at least one miner participates in the initial time.

The remainder of this paper is organized as follows. In Section 2, we propose an evolutionary game-based dynamical model of the decision-making process. In Section 3, we analyze the stability of its equilibrium point. In Section 4, we design a state feedback controller to let all miners participate in the mining.

2 Dynamical model of decision-making

We assume that miners in a blockchain network are partitioned into two sets \mathcal{M} and 𝒩\mathcal{N}, where miners in \mathcal{M} always participate in the mining and those in 𝒩\mathcal{N} have two strategies, participating in the mining (strategy sk=1s_{k}=1) and not participating in the mining (strategy sk=0s_{k}=0), where k𝒩k\in\mathcal{N}. Note that 𝒩=\mathcal{M}\cap\mathcal{N}=\emptyset. Denoted by mm and nn are the cardinalities of \mathcal{M} and 𝒩\mathcal{N}, respectively (we assume m1m\geq 1 and n1n\geq 1). We define x0x_{0} and x1x_{1} as the ratios of miners in 𝒩\mathcal{N} that choose strategies 0 and 11, respectively. Note that

x0+x1=1.\displaystyle x_{0}+x_{1}=1. (2.1)

Miners need to find a nonce such that the first hh bits of the hash of the block are all 0. Then, D=2hD=2^{h} is the difficulty parameter and 1/D1/D is the probability that a miner creates a block with one hash calculation [19]. When miner k𝒩k\in\mathcal{M}\cup\mathcal{N} participates in the mining, he/she needs a cost cc per unit operating time. The average number wk=fk(c)w_{k}=f_{k}(c) of hash queries calculated per unit operating time by miner kk depends on the cost cc, and we assume that fk(c)f_{k}(c) is the same for all miners. In this paper, for simplicity, we assume fk(c)=vc(v>0)f_{k}(c)=vc\,(v>0) for any k𝒩k\in\mathcal{M}\cup\mathcal{N}.

The mining of blocks can be described as a Poisson process [20, 21]. That is, the block creation time is exponentially distributed [22]. The rate λk\lambda_{k} of the Poisson process of miner k𝒩k\in\mathcal{M}\cup\mathcal{N} is given by λk=wk/D\lambda_{k}=w_{k}/D [21]111 Note that a combination of independent Poisson processes is still a Poisson process. Thus, the rate of the Poisson process of all miners is written as i𝒩λi\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i}. . If miner kk chooses sk=1s_{k}=1, then the rate of the Poisson process is λk=skfk(c)/D=skc/d\lambda_{k}=s_{k}f_{k}(c)/D=s_{k}c/d (we define d=D/vd=D/v, in this paper). Let RR be the mining reward. Based on the previous work [18], the expected reward RkR_{k} and the expected cost CSkCS_{k} for the mining of miner kk are calculated as follows.

Rk=λki𝒩λiR=Rskm+nx1,\displaystyle R_{k}=\frac{\lambda_{k}}{\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i}}R=\frac{Rs_{k}}{m+nx_{1}}, (2.2)
CSk=cλk(i𝒩λi)2=dsk(m+nx1)2.\displaystyle CS_{k}=\frac{c\lambda_{k}}{(\sum_{i\in\mathcal{M}\cup\mathcal{N}}\lambda_{i})^{2}}=\frac{ds_{k}}{(m+nx_{1})^{2}}. (2.3)

We define the utility function ui(x0,x1)u_{i}(x_{0},x_{1}) of miners that choose the strategy i{0, 1}i\in\{0,\ 1\} as

ui(x0,x1)={0ifi=0,1m+nx1(Rdm+nx1)ifi=1,\displaystyle u_{i}(x_{0},x_{1})=\begin{cases}0&\mbox{if}\;\;i=0,\\ \frac{1}{m+nx_{1}}\left(R-\frac{d}{m+nx_{1}}\right)&\mbox{if}\;\;i=1,\end{cases} (2.4)

which means that the utility of a miner who participates in the mining is the difference between the expected reward RkR_{k} and the expected cost CSkCS_{k}. Based on the principle of the evolutionary game [11], the dynamics of the ratio of miners that choose the strategy ii is given by

x˙ixi=ui(x0,x1)u¯(x0,x1)(i=0,1),\displaystyle\frac{\dot{x}_{i}}{x_{i}}=u_{i}(x_{0},x_{1})-\bar{u}(x_{0},x_{1})\;(i=0,1), (2.5)

where u¯(x0,x1)=i=01xiui(x0,x1)\bar{u}(x_{0},x_{1})=\sum_{i=0}^{1}x_{i}u_{i}(x_{0},x_{1}) is the average utility of all miners. According to (2.4), (2.5) is rewritten as

x˙1=x˙0=x1(1x1)m+nx1(Rdm+nx1)φR(x1).\displaystyle\dot{x}_{1}=-\dot{x}_{0}=\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R-\frac{d}{m+nx_{1}}\right)\eqqcolon\varphi_{R}(x_{1}). (2.6)

Thus, the dynamics of the decision-making of miners is described by the above 1st-order differential equation and the reward RR plays an important role in the decision-making of the miners for the participation in the mining. In the following, we investigate stability and stabilization of equilibrium points of (2.6). For that purpose, the concept of a basin of attraction [23] is important. Let ξ(t;x1init)\xi(t;x_{1}^{\mathrm{init}}) be the solution of (2.6) that starts from an initial state x1initx_{1}^{\mathrm{init}} at time t=0t=0. For a given asymptotically stable equilibrium point x1x_{1}^{\prime} of (2.6), the basin of attraction is defined as the set of all initial states x1initx_{1}^{\mathrm{init}} such that ξ(t;x1init)\xi(t;x_{1}^{\mathrm{init}}) is defined for all t0t\geq 0 and limtξ(t;x1init)=x1\lim_{t\to\infty}\xi(t;x_{1}^{\mathrm{init}})=x_{1}^{\prime}.

3 Stability analysis

In this section, we investigate the stability of the equilibrium point x1=0, 1,x1x_{1}=0,\ 1,\ x_{1}^{*} of (2.6), where

x1=1n(dRm).\displaystyle x_{1}^{*}=\frac{1}{n}\left(\frac{d}{R}-m\right). (3.1)

When 1/(m+n)<R/d<1/m1/(m+n)<R/d<1/m, the equilibrium point x1x_{1}^{*} satisfies 0<x1<10<x_{1}^{*}<1. This equilibrium point is the state where the utility u1(1x1,x1)u_{1}(1-x_{1}^{*},x_{1}^{*}) is equal to 0, that is, the utility for the strategy 0 is equal to that for the strategy 11.

We investigate the local stability of the three equilibrium points. The derivative of φR(x1)\varphi_{R}(x_{1}) with respect to x1x_{1} is

φR(x1)x1=(x1m+nx1+1x1m+nx1nx1(1x1)(m+nx1)2)\displaystyle\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}=\left(-\frac{x_{1}}{m+nx_{1}}+\frac{1-x_{1}}{m+nx_{1}}-\frac{nx_{1}(1-x_{1})}{(m+nx_{1})^{2}}\right)
×(Rdm+nx1)+dnx1(1x1)(m+nx1)3.\displaystyle\hskip 56.9055pt\times\left(R-\frac{d}{m+nx_{1}}\right)+\frac{dnx_{1}(1-x_{1})}{(m+nx_{1})^{3}}. (3.2)

Thus, we obtain

φR(x1)x1|x1=0=1m(Rdm),\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right|_{x_{1}=0}=\frac{1}{m}\left(R-\frac{d}{m}\right), (3.3)
φR(x1)x1|x1=1=1m+n(Rdm+n),\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right|_{x_{1}=1}=-\frac{1}{m+n}\left(R-\frac{d}{m+n}\right), (3.4)
φR(x1)x1|x1=x1=R3nd2(dRm)((m+n)dR).\displaystyle\left.\frac{\partial\varphi_{R}(x_{1})}{\partial x_{1}}\right|_{x_{1}=x_{1}^{*}}=\frac{R^{3}}{nd^{2}}\left(\frac{d}{R}-m\right)\left((m+n)-\frac{d}{R}\right). (3.5)
Table 1: The relation between R/dR/d and the stability of equilibrium points.
Condition for R/dR/d x1=0x_{1}=0 x1=1x_{1}=1 x1=x1x_{1}=x_{1}^{*}
R/d<1/(m+n)R/d<1/(m+n) S U S
1/(m+n)<R/d<1/m1/(m+n)<R/d<1/m S S U
R/d>1/mR/d>1/m U S S

Thus, we have their stability conditions as shown in Table 1, where S (resp. U) represents an asymptotically stable (resp. unstable) point.

Refer to caption
Figure 1: The mR/dm-R/d parameter plane where nn is fixed to n=2n=2.

Fig. 1 shows the mR/dm-R/d parameter plane with n=2n=2 where the meaning of each region is as follows. In the region (A)(A), both x1=1x_{1}=1 and x1<0x_{1}^{*}<0 are asymptotically stable equilibrium points, and the basin of attraction of x1=1x_{1}=1 is (0,)(0,\ \infty), that is, every solution of (2.6) starting in (0,1](0,1] converges to 11. In the region (B)(B), both x1=0x_{1}=0 and x1=1x_{1}=1 are asymptotically stable equilibrium points, and basins of attraction of x1=0x_{1}=0 and x1=1x_{1}=1 are (,x1)(-\infty,\ x_{1}^{*}) and (x1,)(x_{1}^{*},\ \infty), respectively, that is, every solution of (2.6) starting in [0,x1)[0,\ x_{1}^{*}) converges to 0, and every solution of (2.6) starting in (x1, 1](x_{1}^{*},\ 1] converges to 11. In the region (C)(C), both x1=0x_{1}=0 and x1>0x_{1}^{*}>0 are asymptotically stable equilibrium points, and the basin of attraction of x1=0x_{1}=0 is (, 1)(-\infty,\ 1), that is, every solution of (2.6) starting in [0, 1)[0,\ 1) converges to 0.

Refer to caption
Figure 2: The relation between R/dR/d and the stability of equilibrium points when m=n=2m=n=2.

Shown in Fig. 2 is a bifurcation diagram with respect to the bifurcation parameter R/dR/d, where m=n=2m=n=2. The solid (resp. dashed) line represents an asymptotically stable (resp. unstable) equilibrium point. Two curves of equilibrium points pass through (x1,R)=(1,d/(m+n))(x_{1},R)=(1,d/(m+n)) (resp. (x1,R)=(0,d/m)(x_{1},R)=(0,d/m)), one given by x1=x1x_{1}=x_{1}^{*}, the other by x1=1x_{1}=1 (resp. x1=0x_{1}=0). Both curves exist on both sides of R=d/(m+n)R=d/(m+n) (resp. R=d/mR=d/m). The stability along each curve exchanges on passing through R=d/(m+n)R=d/(m+n) (resp. R=d/mR=d/m). Thus, the exchange of stability (known as a transcritical bifurcation[24] is observed when (x1,R)=(0,d/m),(1,d/(m+n))(x_{1},R)=(0,d/m),(1,d/(m+n)). We show in A that the vector field (2.6) satisfies the condition of the transcritical bifurcation shown in [24].

Since the values of xi(i=0,1)x_{i}\,(i=0,1) satisfy 0xi10\leq x_{i}\leq 1, we observe jump phenomena owing to these transcritical bifurcations. Moreover, R>d/mR>d/m needs to be satisfied so that all miners in 𝒩\mathcal{N} participate in the mining. It is noted that, once the miners participate in the mining, they continue to participate in the mining until the reward RR becomes d/(m+n)d/(m+n). Thus, a hysteresis phenomenon of the equilibrium points is observed.

Refer to caption
Figure 3: Trajectories of (2.6) when m=n=2m=n=2, d=100d=100, and R=40R=40, from x1init=0.1x_{1}^{\mathrm{init}}=0.1 (blue) and x1init=0.9x_{1}^{\mathrm{init}}=0.9 (red).

Fig. 3 shows trajectories of (2.6) from an initial state x1init=0.1x_{1}^{\mathrm{init}}=0.1 (blue) and x1init=0.9x_{1}^{\mathrm{init}}=0.9 (red). When d/(m+n)<R<d/md/(m+n)<R<d/m, both x1=0x_{1}=0 and x1=1x_{1}=1 are asymptotically stable points whose basins of attraction are (,x1)(-\infty,\ x_{1}^{*}) and (x1,)(x_{1}^{*},\ \infty), respectively. Thus, the number of miners who participate in the mining converges to 0 if the initial ratio is less than x1x_{1}^{*} because their utility is negative and they prefer non-participation.

4 Stabilization

The result in Section 3 implies that no miner in 𝒩\mathcal{N} participates in the mining in the steady state when the mining reward RR^{*} satisfies R<d/(m+n)R^{*}<d/(m+n). When the mining reward RR^{*} satisfies d/(m+n)<R<d/md/(m+n)<R^{*}<d/m, miners’ behaviors depend on the initial state x1initx_{1}^{\mathrm{init}}, i.e., no miner in 𝒩\mathcal{N} participates in the mining in the steady state when x1init<x1x_{1}^{\mathrm{init}}<x_{1}^{*}. We propose a state feedback controller to adjust the reward based on the ratio x1x_{1} so that all miners in 𝒩\mathcal{N} participate in the mining, i.e., let every trajectory of x1x_{1} with its initial state in (0, 1](0,\ 1] converge to 11.

4.1 Case where R<d/(m+n)R^{*}<d/(m+n)

First, we show that x1=1x_{1}=1 cannot be stabilized when R<d/(m+n)R^{*}<d/(m+n). We consider the following state feedback controller R1(x1)R_{1}(x_{1}) that adjusts the reward based on the ratio x1x_{1}.

R=R1(x1),R1(1)=R<dm+n.\displaystyle R=R_{1}(x_{1}),\;R_{1}(1)=R^{*}<\frac{d}{m+n}. (4.1)

The controlled trajectory of x1x_{1} by (4.1) is described by

x˙1=x1(1x1)m+nx1(R1(x1)dm+n)ψR(x1).\displaystyle\dot{x}_{1}=\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R_{1}(x_{1})-\frac{d}{m+n}\right)\eqqcolon\psi_{R}(x_{1}). (4.2)

The derivative of ψR(x1)\psi_{R}(x_{1}) with respect to x1x_{1} is

ψR(x1)x1=(x1m+nx1+1x1m+nx1nx1(1x1)(m+nx1)2)\displaystyle\frac{\partial\psi_{R}(x_{1})}{\partial x_{1}}=\left(-\frac{x_{1}}{m+nx_{1}}+\frac{1-x_{1}}{m+nx_{1}}-\frac{nx_{1}(1-x_{1})}{(m+nx_{1})^{2}}\right)
×(R1(x1)dm+nx1)\displaystyle\hskip 42.67912pt\times\left(R_{1}(x_{1})-\frac{d}{m+nx_{1}}\right)
+x1(1x1)m+nx1(R1(x1)x1+dn(m+nx1)2).\displaystyle\hskip 48.36967pt+\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(\frac{\partial R_{1}(x_{1})}{\partial x_{1}}+\frac{dn}{(m+nx_{1})^{2}}\right). (4.3)

We obtain

ψR(x1)x1|x1=1=1m+n(R1(1)dm+n)>0.\displaystyle\left.\frac{\partial\psi_{R}(x_{1})}{\partial x_{1}}\right|_{x_{1}=1}=-\frac{1}{m+n}\left(R_{1}(1)-\frac{d}{m+n}\right)>0. (4.4)

Therefore, the unstable equilibrium point x1=1x_{1}=1 cannot be stabilized even if the feedback controller is used.

4.2 Case where d/(m+n)<R<d/md/(m+n)<R^{*}<d/m

Next, we show that x1=1x_{1}=1 can be an asymptotically stable equilibrium point whose basin of attraction is (0, 1](0,\ 1] with a state feedback controller. We introduce the following state feedback controller R2(x1)R_{2}(x_{1}) to adjust the reward based on the ratio x1x_{1},

R=R2(x1)=R+ΔR(x1),ΔR(1)=0,\displaystyle R=R_{2}(x_{1})=R^{*}+\Delta R(x_{1}),\;\Delta R(1)=0, (4.5)

and let every trajectory of x1x_{1} with its initial state in (0, 1](0,\ 1] converge to 11.

4.2.1 The condition of the feedback gain

We give x¯1\bar{x}_{1} satisfying x1<x¯11x_{1}^{*}<\bar{x}_{1}\leq 1 and ε>0\varepsilon>0. For a given reward R(d/(m+n),d/m)R^{*}\in(d/(m+n),d/m), let ΔR(x1)\Delta R(x_{1}) be

ΔR(x1)={K(x¯1x1)ifx1<x1+ε,0otherwise,\displaystyle\Delta R(x_{1})=\begin{cases}K(\bar{x}_{1}-x_{1})&\mbox{if}\;\;x_{1}<x_{1}^{*}+\varepsilon,\\ 0&\mbox{otherwise},\end{cases} (4.6)

where K>0K>0 is a feedback gain. We obtain a condition for the gain KK and ε\varepsilon such that every trajectory of x1x_{1} with its initial state in (0, 1](0,\ 1] converges to 11 as in Proposition 1.

Proposition 1.

Assume d/(m+n)<R<d/md/(m+n)<R^{*}<d/m. Let ζR(x1)\zeta_{R}(x_{1}) be

ζR(x1)\displaystyle\zeta_{R}(x_{1}) Knx12+(Knx¯1Km+Rn)x1\displaystyle\coloneqq-Knx_{1}^{2}+(Kn\bar{x}_{1}-Km+R^{*}n)x_{1}
+(Rm+Kmx¯1d),\displaystyle\hskip 71.13188pt+(R^{*}m+Km\bar{x}_{1}-d), (4.7)

and let α,β(α<β)\alpha,\beta\;(\alpha<\beta) be real solutions of the quadratic equation ζR(x1)=0\zeta_{R}(x_{1})=0. Then, every trajectory of x1x_{1} with its initial state in (0, 1](0,\ 1] converges to 11 if the gain KK and ε\varepsilon satisfy

K>dRmmx¯1(>0),\displaystyle K>\frac{d-R^{*}m}{m\bar{x}_{1}}\ (>0), (4.8)
0<ε{<βx1ifβ<1,1x1ifβ1.\displaystyle 0<\varepsilon\begin{cases}<\beta-x_{1}^{*}&{\rm if}\;\;\beta<1,\\ \leq 1-x_{1}^{*}&{\rm if}\;\;\beta\geq 1.\end{cases} (4.9)
Proof.

With the controller (4.5) and (4.6), the dynamics of x1x_{1} (x1<x1+εx_{1}<x_{1}^{*}+\varepsilon) is given by

x˙1=ηR(x1),\displaystyle\dot{x}_{1}=\eta_{R}(x_{1}), (4.10)
ηR(x1)x1(1x1)m+nx1(R+K(x¯1x1)dm+nx1).\displaystyle\eta_{R}(x_{1})\coloneqq\frac{x_{1}(1-x_{1})}{m+nx_{1}}\left(R^{*}+K(\bar{x}_{1}-x_{1})-\frac{d}{m+nx_{1}}\right). (4.11)

According to (4.7), ηR(x1)\eta_{R}(x_{1}) can be rewritten as

ηR(x1)=x1(1x1)ζR(x1)(m+nx1)2.\displaystyle\eta_{R}(x_{1})=\frac{x_{1}(1-x_{1})\zeta_{R}(x_{1})}{(m+nx_{1})^{2}}. (4.12)

First, we prove that the quadratic equation ζR(x1)=0\zeta_{R}(x_{1})=0 has two distinct real solutions under (4.8). We have

ζR(x1)=K(x¯1x1)(m+nx1)>0,\displaystyle\zeta_{R}(x_{1}^{*})=K(\bar{x}_{1}-x_{1}^{*})(m+nx_{1})>0, (4.13)

which implies with (4.8) that the quadratic equation ζR(x1)=0\zeta_{R}(x_{1})=0 has two distinct real solutions α,β\alpha,\beta satisfying α<x1<β\alpha<x_{1}^{*}<\beta.

Next, we prove α<0\alpha<0 under (4.8). We obtain

ζR(0)\displaystyle\zeta_{R}(0) =mx¯1K(dRm)\displaystyle=m\bar{x}_{1}K-(d-R^{*}m)
>mx¯1dRmmx¯1(dRm)=0,\displaystyle>m\bar{x}_{1}\frac{d-R^{*}m}{m\bar{x}_{1}}-(d-R^{*}m)=0, (4.14)

from (4.7) and (4.8). Since ζR(x1)\zeta_{R}(x_{1}) is a convex upward quadratic function, the smaller solution α\alpha of ζR(x1)=0\zeta_{R}(x_{1})=0 satisfies α<0\alpha<0.

Finally, we prove that the system controlled by (4.5) satisfies x˙1>0\dot{x}_{1}>0 for any x1(0, 1)x_{1}\in(0,\ 1) under (4.8) and (4.9). When β<1\beta<1, ηR(x1)>0\eta_{R}(x_{1})>0 for any x1(0,β)x_{1}\in(0,\ \beta) from (4.12). It is obvious that x˙1>0\dot{x}_{1}>0 for any x1(0,x1+ε)x_{1}\in(0,x_{1}^{*}+\varepsilon) from (4.9). For any x1[x1+ε,1)x_{1}\in[x_{1}^{*}+\varepsilon,1), x˙1>0\dot{x}_{1}>0 because K=0K=0. Thus,

x˙1>0foranyx1(0, 1).\displaystyle\dot{x}_{1}>0\;\mbox{for}\;\mbox{any}\;x_{1}\in(0,\;1). (4.15)

Similary, it is also shown by (4.9) that (4.15) holds for any β1\beta\geq 1. Therefore, every trajectory of x1x_{1} with its initial state in (0, 1](0,\ 1] converges to 11 under (4.8) and (4.9). ∎

It is noted that x¯1<β\bar{x}_{1}<\beta since d/(m+n)<Rd/(m+n)<R^{*}. So, (4.6) is continuous if ε=x¯1x1\varepsilon=\bar{x}_{1}-x_{1}^{*}.

4.2.2 Performance evaluation

In this section, we provide the numerical analysis of the controller. We consider the case where m=n=2m=n=2, d=100d=100, and R=40R^{*}=40. Then, we have x1=0.25x_{1}^{*}=0.25 from (3.1). Let the initial state of x1x_{1} be x1init=0.1x_{1}^{\mathrm{init}}=0.1. We consider the following two cases where KK and ε\varepsilon satisfy (4.8) and (4.9).

Case 1)

x¯1=0.26,ε=0.005,K=56.8125\bar{x}_{1}=0.26,\;\varepsilon=0.005,\;K=56.8125.

Case 2)

x¯1=1,ε=0.75,K=10.1\bar{x}_{1}=1,\;\varepsilon=0.75,\;K=10.1.

Refer to caption
(a)
Refer to caption
(b)
Figure 4: Trajectories of (a) the state x1x_{1} and (b) the reward R2(x1)R_{2}(x_{1}) with a feedback controller satisfying Proposition 1, when m=n=2m=n=2, d=100d=100, R=40R^{*}=40, x1=0.25x_{1}^{*}=0.25 from x1init=0.1x_{1}^{\mathrm{init}}=0.1, where x¯1=0.26,ε=0.005,K=56.8125\bar{x}_{1}=0.26,\varepsilon=0.005,K=56.8125 (red) and x¯1=1,ε=0.75,K=10.1\bar{x}_{1}=1,\varepsilon=0.75,K=10.1 (blue).

Fig. 4 shows trajectories of the state and the reward. The red and blue lines represent the trajectories of Cases 1) and 2), respectively. In Case 1), it takes a longer time than Case 2) for the state x1x_{1} to converge to 11, but the reward R2(x1)R_{2}(x_{1}) returns to the original value RR^{*} quickly. Note that R2(x1)R_{2}(x_{1}) in Case 1) is not continuous because we switch the input ΔR(x1)\Delta R(x_{1}) to 0 when x1=x1+εx_{1}=x_{1}^{*}+\varepsilon (see (4.6)). In Case 2), the state x1x_{1} converges to 11 quickly, but it takes longer time than Case 1) for the reward R2(x1)R_{2}(x_{1}) to return to its original value RR^{*}. Thus, there is a trade-off in the choice of the design parameters x¯1\bar{x}_{1} and ε\varepsilon.

5 Conclusion

We proposed a dynamical model of the decision-making of miners in the blockchain. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. We analyzed the stability of its equilibrium points. We showed the occurrence of the transcritical bifurcations and observed a hysteresis phenomenon. We also proposed a feedback controller and showed that it can stabilize the state where all miners participate in the mining from any non-zero initial participation ratio of the miners. Our future work is to extend our model to the case where miners’ computational performances are different from each other.

Acknowledgements

This research was supported by JST ERATO JPMJER1603.

Appendix A Transcritical bifurcation

We consider the following system.

x˙=f(x,μ),x,μ.\displaystyle\dot{x}=f(x,\mu),\;\;x\in\mathbb{R},\;\;\mu\in\mathbb{R}. (A.1)

We assume that

f(x,μ)=xF(x,μ),\displaystyle f(x,\mu)=xF(x,\mu), (A.2)

where F:×F:\mathbb{R}\times\mathbb{R}\to\mathbb{R} satisfies the following condition.

F(x,μ){f(x,μ)xx0,f(0,μ)xx=0.\displaystyle F(x,\mu)\coloneqq\begin{cases}\frac{f(x,\mu)}{x}&x\neq 0,\\ \frac{\partial f(0,\mu)}{\partial x}&x=0.\end{cases} (A.3)

Then, it is shown in [24] that (A.1) undergoes a transcritical bifurcation at (x,μ)=(0,0)(x,\mu)=(0,0) if the following three conditions hold.

(T1)

f(0,0)=0,f(0,0)x=0f(0,0)=0,\;\frac{\partial f(0,0)}{\partial x}=0,

(T2)

f(0,0)μ=0\frac{\partial f(0,0)}{\partial\mu}=0,

(T3)

2f(0,0)xμ0,2f(0,0)x20\frac{\partial^{2}f(0,0)}{\partial x\partial\mu}\neq 0,\;\frac{\partial^{2}f(0,0)}{\partial x^{2}}\neq 0.

Thus we will show that (2.6) satisfies the above three conditions at (x1,R)=(0,d/m),(1,d/(m+n))(x_{1},R)=(0,d/m),\ (1,d/(m+n)).

A.1 Case where (x1,R)=(0,d/m)(x_{1},R)=(0,d/m)

First, we consider the following coordination transformation by which (x1,R)=(0,d/m)(x_{1},R)=(0,d/m) is transformed to (x,μ)=(0,0)(x,\mu)=(0,0).

(x1R)=(xμ)+(0dm).\displaystyle\left(\begin{array}[]{c}x_{1}\\ R\end{array}\right)=\left(\begin{array}[]{c}x\\ \mu\end{array}\right)+\left(\begin{array}[]{c}0\\ \frac{d}{m}\end{array}\right). (A.10)

Then, we define

f(x,μ)\displaystyle f(x,\mu) φμ+dm(x)\displaystyle\coloneqq\varphi_{\mu+\frac{d}{m}}(x)
=x(1x)m+nx(μ+dmdm+nx)=xF(x,μ),\displaystyle=\frac{x(1-x)}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)=xF(x,\mu), (A.11)

where the function FF is defined by

F(x,μ)1xm+nx(μ+dmdm+nx).\displaystyle F(x,\mu)\coloneqq\frac{1-x}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right). (A.12)

Then, we have

f(x,μ)x\displaystyle\frac{\partial f(x,\mu)}{\partial x} =F(x,μ)+xF(x,μ)x\displaystyle=F(x,\mu)+x\frac{\partial F(x,\mu)}{\partial x}
=(xm+nx+1xm+nxnx(1x)(m+nx)2)\displaystyle=\left(-\frac{x}{m+nx}+\frac{1-x}{m+nx}-\frac{nx(1-x)}{(m+nx)^{2}}\right)
×(μ+dmdm+nx)+dnx(1x)(m+nx)3.\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)+\frac{dnx(1-x)}{(m+nx)^{3}}. (A.13)

It is obvious that

F(x,μ)=f(x,μ)x\displaystyle F(x,\mu)=\frac{f(x,\mu)}{x} (A.14)

when x0x\neq 0 and

F(0,μ)=μm=f(0,μ)x((A.13))\displaystyle F(0,\mu)=\frac{\mu}{m}=\frac{\partial f(0,\mu)}{\partial x}\;\;(\because\eqref{eq:A_phi_deriv_x}) (A.15)

when x=0x=0. Thus, the function ff defined by (A.11) satisfies (A.2) and (A.3).

Next, we show that f(x,μ)f(x,\mu) satisfies the conditions (T1) – (T3). We obtain

f(x,μ)μ=x(1x)m+nx,\displaystyle\frac{\partial f(x,\mu)}{\partial\mu}=\frac{x(1-x)}{m+nx}, (A.16)
2f(x,μ)xμ=xm+nx+1xm+nxnx(1x)(m+nx)2,\displaystyle\frac{\partial^{2}f(x,\mu)}{\partial x\partial\mu}=-\frac{x}{m+nx}+\frac{1-x}{m+nx}-\frac{nx(1-x)}{(m+nx)^{2}}, (A.17)
2f(x,μ)x2=(n2x(1x)(m+nx)2n(1x)m+nx+nxm+nx1)\displaystyle\frac{\partial^{2}f(x,\mu)}{\partial x^{2}}=\left(\frac{n^{2}x(1-x)}{(m+nx)^{2}}-\frac{n(1-x)}{m+nx}+\frac{nx}{m+nx}-1\right)
×2m+nx(μ+dmdm+nx)+2dn(m+nx)3\displaystyle\hskip 14.22636pt\times\frac{2}{m+nx}\left(\mu+\frac{d}{m}-\frac{d}{m+nx}\right)+\frac{2dn}{(m+nx)^{3}}
×(2nx(1x)m+nxx+(1x)).\displaystyle\hskip 28.45274pt\times\left(\frac{-2nx(1-x)}{m+nx}-x+(1-x)\right). (A.18)

Thus, f(x,μ)f(x,\mu) satisfies the conditions (T1) – (T3) because

f(0,0)\displaystyle f(0,0) =0,\displaystyle=0, (A.19)
f(0,0)x\displaystyle\frac{\partial f(0,0)}{\partial x} =0,\displaystyle=0, (A.20)
f(0,0)μ\displaystyle\frac{\partial f(0,0)}{\partial\mu} =0,\displaystyle=0, (A.21)
2f(0,0)xμ\displaystyle\frac{\partial^{2}f(0,0)}{\partial x\partial\mu} =1m0,\displaystyle=\frac{1}{m}\neq 0, (A.22)
2f(0,0)x2\displaystyle\frac{\partial^{2}f(0,0)}{\partial x^{2}} =2dnm30.\displaystyle=\frac{2dn}{m^{3}}\neq 0. (A.23)

A.2 Case where (x1,R)=(1,d/(m+n))(x_{1},R)=(1,d/(m+n))

It is noted that the dynamics of x0x_{0} is written as follows.

x˙0\displaystyle\dot{x}_{0} =x0(1x0)m+n(1x0)(Rdm+n(1x0))\displaystyle=-\frac{x_{0}(1-x_{0})}{m+n(1-x_{0})}\left(R-\frac{d}{m+n(1-x_{0})}\right)
=φR(1x0)\displaystyle=-\varphi_{R}(1-x_{0}) (A.24)

because x0x_{0} and x1x_{1} satisfies (2.1). Thus, it is sufficient to show that (A.24) undergoes a transcirtical bifurcation at (x0,R)=(0,d/(m+n))(x_{0},R)=(0,d/(m+n)). We consider the following coordination transformation by which (x0,R)=(0,d/(m+n))(x_{0},R)=(0,d/(m+n)) is transformed to (x,μ)=(0,0)(x,\mu)=(0,0).

(x0R)=(xμ)+(0dm+n).\displaystyle\left(\begin{array}[]{c}x_{0}\\ R\end{array}\right)=\left(\begin{array}[]{c}x\\ \mu\end{array}\right)+\left(\begin{array}[]{c}0\\ \frac{d}{m+n}\end{array}\right). (A.31)

Then, we define

f(x,μ)\displaystyle f(x,\mu) φμ+dm+n(1x)\displaystyle\coloneqq-\varphi_{\mu+\frac{d}{m+n}}(1-x)
=x(1x)m+n(1x)\displaystyle=-\frac{x(1-x)}{m+n(1-x)}
×(μ+dm+ndm+n(1x))\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right)
=xF(x,μ),\displaystyle=xF(x,\mu), (A.32)

where the function FF is defined by

F(x,μ)\displaystyle F(x,\mu) 1xm+n(1x)\displaystyle\coloneqq-\frac{1-x}{m+n(1-x)}
×(μ+dm+ndm+n(1x)).\displaystyle\hskip 28.45274pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right). (A.33)

Then, we have

f(x,μ)x\displaystyle\frac{\partial f(x,\mu)}{\partial x} =(x+(1x)m+n(1x)+nx(1x)(m+n(1x))2)\displaystyle=-\left(\frac{-x+(1-x)}{m+n(1-x)}+\frac{nx(1-x)}{(m+n(1-x))^{2}}\right)
×(μ+dm+ndm+n(1x))\displaystyle\hskip 14.22636pt\times\left(\mu+\frac{d}{m+n}-\frac{d}{m+n(1-x)}\right)
+dnx(1x)(m+n(1x))3.\displaystyle\hskip 28.45274pt+\frac{dnx(1-x)}{(m+n(1-x))^{3}}. (A.34)

It is obvious that

F(x,μ)=f(x,μ)x\displaystyle F(x,\mu)=\frac{f(x,\mu)}{x} (A.35)

when x0x\neq 0 and

F(0,μ)=μm+n=f(0,μ)x((A.34))\displaystyle F(0,\mu)=-\frac{\mu}{m+n}=\frac{\partial f(0,\mu)}{\partial x}\;\;(\because\eqref{eq:A_phi_1_deriv_x}) (A.36)

when x=0x=0. Thus, the function ff defined by (A.32) satisfies (A.2) and (A.3).

In the same way as Appendix A.1, we obtain the partial derivatives of f(x,μ)f(x,\mu) and show that f(x,μ)f(x,\mu) defined by (A.24) satisfies the conditions (T1) – (T3) because

f(0,0)\displaystyle f(0,0) =0,\displaystyle=0, (A.37)
f(0,0)x\displaystyle\frac{\partial f(0,0)}{\partial x} =0,\displaystyle=0, (A.38)
f(0,0)μ\displaystyle\frac{\partial f(0,0)}{\partial\mu} =0,\displaystyle=0, (A.39)
2f(0,0)xμ\displaystyle\frac{\partial^{2}f(0,0)}{\partial x\partial\mu} =1m+n0,\displaystyle=-\frac{1}{m+n}\neq 0, (A.40)
2f(0,0)x2\displaystyle\frac{\partial^{2}f(0,0)}{\partial x^{2}} =2dn(m+n)30.\displaystyle=\frac{2dn}{(m+n)^{3}}\neq 0. (A.41)

Therefore, (2.6) undergoes transcritical bifurcations at (x1,R)=(0,d/m),(1,d/(m+n))(x_{1},R)=(0,d/m),\ (1,d/(m+n)).

References

  • [1] S. Nakamoto, “Bitcoin: A peer-to-peer electronic cash system,” 2008. [Online]. Available: http://bitcoin.org/bitcoin.pdf
  • [2] Q. Xia, E. B. Sifah, K. O. Asamoah, J. Gao, X. Du, and M. Guizani, “MeDShare: Trust-less medical data sharing among cloud service providers via blockchain,” IEEE Access, vol. 5, pp. 14 757–14 767, 2017.
  • [3] A. Ouaddah, A. A. Elkalam, and A. A. Ouahman, “Fairaccess: a new blockchain-based access control framework for the internet of things,” Security and Communication Networks, vol. 9, no. 18, pp. 5943–5964, 2016.
  • [4] J. Truby, “Decarbonizing bitcoin: Law and policy choices for reducing the energy consumption of blockchain technologies and digital currencies,” Energy Research & Social Science, vol. 44, pp. 399–410, 2018.
  • [5] “Cambridge bitcoin electricity consumption index,” (accessed on 1 March 2021). [Online]. Available: https://www.cbeci.org/
  • [6] W. Cai, Z. Wang, J. B. Ernst, Z. Hong, C. Feng, and V. C. M. Leung, “Decentralized applications: The blockchain-empowered software system,” IEEE Access, vol. 6, pp. 53 019–53 033, 2018.
  • [7] Y. Liu, Z. Fang, M. H. Cheung, W. Cai, and J. Huang, “A social welfare maximization mechanism for blockchain storage,” arXiv preprint arXiv:2103.05866, 2021.
  • [8] Z. Liu, N. C. Luong, W. Wang, D. Niyato, P. Wang, Y. Liang, and D. I. Kim, “A survey on blockchain: A game theoretical perspective,” IEEE Access, vol. 7, pp. 47 615–47 643, 2019.
  • [9] N. Dimitri, “Bitcoin mining as a contest,” Ledger, vol. 2, pp. 31–37, 2017.
  • [10] A. Fiat, A. Karlin, E. Koutsoupias, and C. Papadimitriou, “Energy equilibria in proof-of-work mining,” in Proceedings of the 2019 ACM Conference on Economics and Computation, 2019, pp. 489–502.
  • [11] J. W. Weibull, Evolutionary game theory.   MIT press, 1997.
  • [12] T. Kanazawa, H. Goto, and T. Ushio, “Replicator dynamics with dynamic payoff reallocation based on the government’s payoff,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E91-A, no. 9, pp. 2411–2418, 2008.
  • [13] T. Kanazawa, Y. Fukumoto, T. Ushio, and T. Misaka, “Replicator dynamics with pigovian subsidy and capitation tax,” Nonlinear Analysis, Theory, Methods and Applications, vol. 71, no. 12, pp. e818–e826, 2009.
  • [14] T. Morimoto, T. Kanazawa, and T. Ushio, “Subsidy-based control of heterogeneous multiagent systems modeled by replicator dynamics,” IEEE Transactions on Automatic Control, vol. 61, no. 10, pp. 3158–3163, 2016.
  • [15] X. Liu, W. Wang, D. Niyato, N. Zhao, and P. Wang, “Evolutionary game for mining pool selection in blockchain networks,” IEEE Wireless Communications Letters, vol. 7, no. 5, pp. 760–763, 2018.
  • [16] K. Fujita, Y. Zhang, M. Sasabe, and S. Kasahara, “Mining pool selection problem in the presence of block withholding attack,” in Proceedings of 2020 IEEE International Conference on Blockchain, 2020, pp. 321–326.
  • [17] S. Kim and S. G. Hahn, “Mining pool manipulation in blockchain network over evolutionary block withholding attack,” IEEE Access, vol. 7, pp. 144 230–144 244, 2019.
  • [18] K. Toda, N. Kuze, and T. Ushio, “Game-theoretic approach to a decision-making problem for blockchain mining,” IEEE Control Systems Letters, vol. 5, no. 5, pp. 1783–1788, 2021.
  • [19] J. Debus, “Consensus methods in blockchain systems,” FSBC Working Paper, 2017. [Online]. Available: http://www.fs-blockchain.de/
  • [20] N. Houy, “The bitcoin mining game,” Ledger, vol. 1, pp. 53–68, 2016.
  • [21] D. Kraft, “Difficulty control for blockchain-based consensus systems,” Peer-to-Peer Networking and Applications, vol. 9, no. 2, pp. 397–413, 2016.
  • [22] S. Kasahara and J. Kawahara, “Effect of bitcoin fee on transaction-confirmation process,” Journal of Industrial & Management Optimization, vol. 15, no. 1, pp. 365–386, 2019.
  • [23] H. K. Khalil, Nonlinear systems, 3rd ed.   Prentice Hall, 2002.
  • [24] S. Wiggins, Introduction to applied nonlinear dynamical systems and chaos, 2nd ed.   Springer, 2003.