Stability analysis and control of decision-making of miners in blockchain
Abstract
To maintain blockchain-based services with ensuring its security, it is an important issue how to decide a mining reward so that the number of miners participating in the mining increases. We propose a dynamical model of decision-making for miners using an evolutionary game approach and analyze the stability of equilibrium points of the proposed model. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. Through the analysis of the equilibrium points, we show the transcritical bifurcations and hysteresis phenomena of the equilibrium points. We also design a controller that determines the mining reward based on the number of participating miners to stabilize the state where all miners participate in the mining. Numerical simulation shows that there is a trade-off in the choice of the design parameters.
Keywords— B lockchain, Proof-of-work, Decision-making, Evolutionary Game, Bifurcation, Hysteresis, Feedback control.
1 Introduction
Blockchain is a distributed ledger technology for recording transactions that underlies various fields such as digital currency like Bitcoin [1], data sharing [2], and computer security [3]. Blockchain-based services use cryptography to record transactions as a chain of blocks. A block consists of a block header and transaction data. The block header contains a cryptographic hash of its previous block, which makes blockchain-based services resistant to tampering. In these services, participants called miners create blocks in a distributed manner, and the longest chain of blocks is considered to be legitimate. When a miner succeeds in creating a block, he/she gets a reward called a mining reward.
Blockchain-based services approve transactions through a consensus algorithm. As a consensus algorithm, proof-of-work (PoW) is typically used. In this algorithm, the mining difficulty is set using a scalar value called a nonce in the block header. To create a block, miners must find a nonce such that the cryptographic hash value for the previous block satisfies specific conditions. The process of creating blocks is called a mining. In general, a cryptographic hash value for a block is unique according to the nonce contained in the block. Moreover, a nonce that satisfies the specific conditions cannot be calculated directly. As a result, an exhaustive search imposes a large computational cost on miners, which contributes to the resistance to tampering. Because transaction approvals depend on miner calculations (such calculations are very costly and require a lot of energy [4, 5]), the participation of many miners is needed to maintain blockchain-based services and ensure blockchain system security [6, 7]. Therefore, it is important to analyze the decision-making problem of whether miners participate in the mining according to the energy consumption and mining rewards.
Game theory is used to analyze interactions among rational decision-makers. Many studies have adopted game theory to analyze blockchain-related issues with PoW [8], such as decision-making problems in the mining considering the energy consumption [9, 10]. Evolutionary game theory has been used as a powerful mathematical tool for analyzing dynamical models of evolutionary selection [11]. Dynamical characteristics of the selection process are modeled by replicator dynamics. Control methods for the replicator dynamics have been studied in [12, 13, 14]. Evolutionary game models and replicator dynamics are also used in analyzing blockchain-related issues such as mining pool selection problems [15, 16], and attack scenarios [17].
We previously focused on a decision-making problem of whether miners participate in the mining according to the energy consumption and the mining rewards, and modeled it as a non-cooperative game. Through theoretical and numerical analysis, we showed the property of Nash equilibria [18]. However, in this study, we assumed that, once miners choose a strategy (i.e., participation in the mining or not), they do not change their strategies. Practically, the miners may decide to participate in the mining dynamically based on their current earned mining rewards. It is important to analyze such a dynamical decision-making process.
In this paper, we propose a dynamical model of the decision-making problem for miners, by applying an evolutionary game approach. We analyze the stability of its equilibrium points and show the existence of transcritical bifurcations and hysteresis phenomena with the coexistence of two asymptotically stable equilibrium points: one corresponds to the state where all miners participate in the mining and the other to the state where the number of participating miners is minimum. The former equilibrium point is preferable to maintain blockchain-based services. We propose a controller that determines the mining reward based on the number of current participating miners so as to stabilize the equilibrium point if at least one miner participates in the initial time.
The remainder of this paper is organized as follows. In Section 2, we propose an evolutionary game-based dynamical model of the decision-making process. In Section 3, we analyze the stability of its equilibrium point. In Section 4, we design a state feedback controller to let all miners participate in the mining.
2 Dynamical model of decision-making
We assume that miners in a blockchain network are partitioned into two sets and , where miners in always participate in the mining and those in have two strategies, participating in the mining (strategy ) and not participating in the mining (strategy ), where . Note that . Denoted by and are the cardinalities of and , respectively (we assume and ). We define and as the ratios of miners in that choose strategies and , respectively. Note that
(2.1) |
Miners need to find a nonce such that the first bits of the hash of the block are all . Then, is the difficulty parameter and is the probability that a miner creates a block with one hash calculation [19]. When miner participates in the mining, he/she needs a cost per unit operating time. The average number of hash queries calculated per unit operating time by miner depends on the cost , and we assume that is the same for all miners. In this paper, for simplicity, we assume for any .
The mining of blocks can be described as a Poisson process [20, 21]. That is, the block creation time is exponentially distributed [22]. The rate of the Poisson process of miner is given by [21]111 Note that a combination of independent Poisson processes is still a Poisson process. Thus, the rate of the Poisson process of all miners is written as . . If miner chooses , then the rate of the Poisson process is (we define , in this paper). Let be the mining reward. Based on the previous work [18], the expected reward and the expected cost for the mining of miner are calculated as follows.
(2.2) | |||
(2.3) |
We define the utility function of miners that choose the strategy as
(2.4) |
which means that the utility of a miner who participates in the mining is the difference between the expected reward and the expected cost . Based on the principle of the evolutionary game [11], the dynamics of the ratio of miners that choose the strategy is given by
(2.5) |
where is the average utility of all miners. According to (2.4), (2.5) is rewritten as
(2.6) |
Thus, the dynamics of the decision-making of miners is described by the above 1st-order differential equation and the reward plays an important role in the decision-making of the miners for the participation in the mining. In the following, we investigate stability and stabilization of equilibrium points of (2.6). For that purpose, the concept of a basin of attraction [23] is important. Let be the solution of (2.6) that starts from an initial state at time . For a given asymptotically stable equilibrium point of (2.6), the basin of attraction is defined as the set of all initial states such that is defined for all and .
3 Stability analysis
In this section, we investigate the stability of the equilibrium point of (2.6), where
(3.1) |
When , the equilibrium point satisfies . This equilibrium point is the state where the utility is equal to , that is, the utility for the strategy is equal to that for the strategy .
We investigate the local stability of the three equilibrium points. The derivative of with respect to is
(3.2) |
Thus, we obtain
(3.3) | |||
(3.4) | |||
(3.5) |
Condition for | |||
---|---|---|---|
S | U | S | |
S | S | U | |
U | S | S |
Thus, we have their stability conditions as shown in Table 1, where S (resp. U) represents an asymptotically stable (resp. unstable) point.

Fig. 1 shows the parameter plane with where the meaning of each region is as follows. In the region , both and are asymptotically stable equilibrium points, and the basin of attraction of is , that is, every solution of (2.6) starting in converges to . In the region , both and are asymptotically stable equilibrium points, and basins of attraction of and are and , respectively, that is, every solution of (2.6) starting in converges to , and every solution of (2.6) starting in converges to . In the region , both and are asymptotically stable equilibrium points, and the basin of attraction of is , that is, every solution of (2.6) starting in converges to .

Shown in Fig. 2 is a bifurcation diagram with respect to the bifurcation parameter , where . The solid (resp. dashed) line represents an asymptotically stable (resp. unstable) equilibrium point. Two curves of equilibrium points pass through (resp. ), one given by , the other by (resp. ). Both curves exist on both sides of (resp. ). The stability along each curve exchanges on passing through (resp. ). Thus, the exchange of stability (known as a transcritical bifurcation) [24] is observed when . We show in A that the vector field (2.6) satisfies the condition of the transcritical bifurcation shown in [24].
Since the values of satisfy , we observe jump phenomena owing to these transcritical bifurcations. Moreover, needs to be satisfied so that all miners in participate in the mining. It is noted that, once the miners participate in the mining, they continue to participate in the mining until the reward becomes . Thus, a hysteresis phenomenon of the equilibrium points is observed.

Fig. 3 shows trajectories of (2.6) from an initial state (blue) and (red). When , both and are asymptotically stable points whose basins of attraction are and , respectively. Thus, the number of miners who participate in the mining converges to if the initial ratio is less than because their utility is negative and they prefer non-participation.
4 Stabilization
The result in Section 3 implies that no miner in participates in the mining in the steady state when the mining reward satisfies . When the mining reward satisfies , miners’ behaviors depend on the initial state , i.e., no miner in participates in the mining in the steady state when . We propose a state feedback controller to adjust the reward based on the ratio so that all miners in participate in the mining, i.e., let every trajectory of with its initial state in converge to .
4.1 Case where
First, we show that cannot be stabilized when . We consider the following state feedback controller that adjusts the reward based on the ratio .
(4.1) |
The controlled trajectory of by (4.1) is described by
(4.2) |
The derivative of with respect to is
(4.3) |
We obtain
(4.4) |
Therefore, the unstable equilibrium point cannot be stabilized even if the feedback controller is used.
4.2 Case where
Next, we show that can be an asymptotically stable equilibrium point whose basin of attraction is with a state feedback controller. We introduce the following state feedback controller to adjust the reward based on the ratio ,
(4.5) |
and let every trajectory of with its initial state in converge to .
4.2.1 The condition of the feedback gain
We give satisfying and . For a given reward , let be
(4.6) |
where is a feedback gain. We obtain a condition for the gain and such that every trajectory of with its initial state in converges to as in Proposition 1.
Proposition 1.
Assume . Let be
(4.7) |
and let be real solutions of the quadratic equation . Then, every trajectory of with its initial state in converges to if the gain and satisfy
(4.8) | |||
(4.9) |
Proof.
With the controller (4.5) and (4.6), the dynamics of () is given by
(4.10) | |||
(4.11) |
According to (4.7), can be rewritten as
(4.12) |
First, we prove that the quadratic equation has two distinct real solutions under (4.8). We have
(4.13) |
which implies with (4.8) that the quadratic equation has two distinct real solutions satisfying .
Next, we prove under (4.8). We obtain
(4.14) |
from (4.7) and (4.8). Since is a convex upward quadratic function, the smaller solution of satisfies .
Finally, we prove that the system controlled by (4.5) satisfies for any under (4.8) and (4.9). When , for any from (4.12). It is obvious that for any from (4.9). For any , because . Thus,
(4.15) |
Similary, it is also shown by (4.9) that (4.15) holds for any . Therefore, every trajectory of with its initial state in converges to under (4.8) and (4.9). ∎
It is noted that since . So, (4.6) is continuous if .
4.2.2 Performance evaluation
In this section, we provide the numerical analysis of the controller. We consider the case where , , and . Then, we have from (3.1). Let the initial state of be . We consider the following two cases where and satisfy (4.8) and (4.9).
- Case 1)
-
.
- Case 2)
-
.


Fig. 4 shows trajectories of the state and the reward. The red and blue lines represent the trajectories of Cases 1) and 2), respectively. In Case 1), it takes a longer time than Case 2) for the state to converge to , but the reward returns to the original value quickly. Note that in Case 1) is not continuous because we switch the input to when (see (4.6)). In Case 2), the state converges to quickly, but it takes longer time than Case 1) for the reward to return to its original value . Thus, there is a trade-off in the choice of the design parameters and .
5 Conclusion
We proposed a dynamical model of the decision-making of miners in the blockchain. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. We analyzed the stability of its equilibrium points. We showed the occurrence of the transcritical bifurcations and observed a hysteresis phenomenon. We also proposed a feedback controller and showed that it can stabilize the state where all miners participate in the mining from any non-zero initial participation ratio of the miners. Our future work is to extend our model to the case where miners’ computational performances are different from each other.
Acknowledgements
This research was supported by JST ERATO JPMJER1603.
Appendix A Transcritical bifurcation
We consider the following system.
(A.1) |
We assume that
(A.2) |
where satisfies the following condition.
(A.3) |
Then, it is shown in [24] that (A.1) undergoes a transcritical bifurcation at if the following three conditions hold.
- (T1)
-
,
- (T2)
-
,
- (T3)
-
.
Thus we will show that (2.6) satisfies the above three conditions at .
A.1 Case where
First, we consider the following coordination transformation by which is transformed to .
(A.10) |
Then, we define
(A.11) |
where the function is defined by
(A.12) |
Then, we have
(A.13) |
It is obvious that
(A.14) |
when and
(A.15) |
when . Thus, the function defined by (A.11) satisfies (A.2) and (A.3).
Next, we show that satisfies the conditions (T1) – (T3). We obtain
(A.16) | |||
(A.17) | |||
(A.18) |
Thus, satisfies the conditions (T1) – (T3) because
(A.19) | ||||
(A.20) | ||||
(A.21) | ||||
(A.22) | ||||
(A.23) |
A.2 Case where
It is noted that the dynamics of is written as follows.
(A.24) |
because and satisfies (2.1). Thus, it is sufficient to show that (A.24) undergoes a transcirtical bifurcation at . We consider the following coordination transformation by which is transformed to .
(A.31) |
Then, we define
(A.32) |
where the function is defined by
(A.33) |
Then, we have
(A.34) |
It is obvious that
(A.35) |
when and
(A.36) |
when . Thus, the function defined by (A.32) satisfies (A.2) and (A.3).
In the same way as Appendix A.1, we obtain the partial derivatives of and show that defined by (A.24) satisfies the conditions (T1) – (T3) because
(A.37) | ||||
(A.38) | ||||
(A.39) | ||||
(A.40) | ||||
(A.41) |
Therefore, (2.6) undergoes transcritical bifurcations at .
References
- [1] S. Nakamoto, “Bitcoin: A peer-to-peer electronic cash system,” 2008. [Online]. Available: http://bitcoin.org/bitcoin.pdf
- [2] Q. Xia, E. B. Sifah, K. O. Asamoah, J. Gao, X. Du, and M. Guizani, “MeDShare: Trust-less medical data sharing among cloud service providers via blockchain,” IEEE Access, vol. 5, pp. 14 757–14 767, 2017.
- [3] A. Ouaddah, A. A. Elkalam, and A. A. Ouahman, “Fairaccess: a new blockchain-based access control framework for the internet of things,” Security and Communication Networks, vol. 9, no. 18, pp. 5943–5964, 2016.
- [4] J. Truby, “Decarbonizing bitcoin: Law and policy choices for reducing the energy consumption of blockchain technologies and digital currencies,” Energy Research & Social Science, vol. 44, pp. 399–410, 2018.
- [5] “Cambridge bitcoin electricity consumption index,” (accessed on 1 March 2021). [Online]. Available: https://www.cbeci.org/
- [6] W. Cai, Z. Wang, J. B. Ernst, Z. Hong, C. Feng, and V. C. M. Leung, “Decentralized applications: The blockchain-empowered software system,” IEEE Access, vol. 6, pp. 53 019–53 033, 2018.
- [7] Y. Liu, Z. Fang, M. H. Cheung, W. Cai, and J. Huang, “A social welfare maximization mechanism for blockchain storage,” arXiv preprint arXiv:2103.05866, 2021.
- [8] Z. Liu, N. C. Luong, W. Wang, D. Niyato, P. Wang, Y. Liang, and D. I. Kim, “A survey on blockchain: A game theoretical perspective,” IEEE Access, vol. 7, pp. 47 615–47 643, 2019.
- [9] N. Dimitri, “Bitcoin mining as a contest,” Ledger, vol. 2, pp. 31–37, 2017.
- [10] A. Fiat, A. Karlin, E. Koutsoupias, and C. Papadimitriou, “Energy equilibria in proof-of-work mining,” in Proceedings of the 2019 ACM Conference on Economics and Computation, 2019, pp. 489–502.
- [11] J. W. Weibull, Evolutionary game theory. MIT press, 1997.
- [12] T. Kanazawa, H. Goto, and T. Ushio, “Replicator dynamics with dynamic payoff reallocation based on the government’s payoff,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E91-A, no. 9, pp. 2411–2418, 2008.
- [13] T. Kanazawa, Y. Fukumoto, T. Ushio, and T. Misaka, “Replicator dynamics with pigovian subsidy and capitation tax,” Nonlinear Analysis, Theory, Methods and Applications, vol. 71, no. 12, pp. e818–e826, 2009.
- [14] T. Morimoto, T. Kanazawa, and T. Ushio, “Subsidy-based control of heterogeneous multiagent systems modeled by replicator dynamics,” IEEE Transactions on Automatic Control, vol. 61, no. 10, pp. 3158–3163, 2016.
- [15] X. Liu, W. Wang, D. Niyato, N. Zhao, and P. Wang, “Evolutionary game for mining pool selection in blockchain networks,” IEEE Wireless Communications Letters, vol. 7, no. 5, pp. 760–763, 2018.
- [16] K. Fujita, Y. Zhang, M. Sasabe, and S. Kasahara, “Mining pool selection problem in the presence of block withholding attack,” in Proceedings of 2020 IEEE International Conference on Blockchain, 2020, pp. 321–326.
- [17] S. Kim and S. G. Hahn, “Mining pool manipulation in blockchain network over evolutionary block withholding attack,” IEEE Access, vol. 7, pp. 144 230–144 244, 2019.
- [18] K. Toda, N. Kuze, and T. Ushio, “Game-theoretic approach to a decision-making problem for blockchain mining,” IEEE Control Systems Letters, vol. 5, no. 5, pp. 1783–1788, 2021.
- [19] J. Debus, “Consensus methods in blockchain systems,” FSBC Working Paper, 2017. [Online]. Available: http://www.fs-blockchain.de/
- [20] N. Houy, “The bitcoin mining game,” Ledger, vol. 1, pp. 53–68, 2016.
- [21] D. Kraft, “Difficulty control for blockchain-based consensus systems,” Peer-to-Peer Networking and Applications, vol. 9, no. 2, pp. 397–413, 2016.
- [22] S. Kasahara and J. Kawahara, “Effect of bitcoin fee on transaction-confirmation process,” Journal of Industrial & Management Optimization, vol. 15, no. 1, pp. 365–386, 2019.
- [23] H. K. Khalil, Nonlinear systems, 3rd ed. Prentice Hall, 2002.
- [24] S. Wiggins, Introduction to applied nonlinear dynamical systems and chaos, 2nd ed. Springer, 2003.