Pólya urn with memory kernel and
asymptotic behaviors of autocorrelation function
Abstract
Pólya urn is a stochastic process in which balls are randomly drawn from an urn of red and blue balls, and balls of the same color as the drawn balls are added. The probability of a ball of a certain color being drawn is equal to the percentage of balls of that color in the urn. We introduce arbitrary memory kernels to modify this probability. If the memory kernel decays exponentially, it is a stationary process and is mean-reverting. If the memory kernel decays by a power-law, a phase transition occurs and the asymptotic behavior of the autocorrelation function changes. An auxiliary field variable is introduced to transform the process Markovian and the field obeys a multivariate Ornstein-Uhlenbeck process. The exponents of the power law are estimated for the decay of the leading and subleading terms of the autocorrelation function. It is shown that the power law exponents changes discontinuously at the critical point.
pacs:
05.70.Fh,89.65.GhI Introduction
Econophysics and socio-physics are fascinating research fields in statistical physicsMantegna and Stanley (2007); Galam (2008); Castellano et al. (2009). In particular, herding or the tendency to follow the majority has attracted many researchers as it plays a crucial role in the understanding of the phenomenaArthur (1989); Bikhchandani et al. (1992); Cont and Bouchaud (2000); Mori et al. (2016). Many types of probabilistic models have been proposed to model herding and one example is the ant recruitment model, which describes the intermittent oscillation of the ants faced with two identical food sourcesKirman (1993). It incorporates a simple herding mechanism where a randomly chosen ant chooses one of the two food sources with a probability that is proportional to the numbers of ants who have chosen the food sources.
A variant of the ant recruitment model has been adopted as a model for the intermittent switching phenomena of trading strategy, trend-follower and contrarian, in the modeling of the time-series data of stock and foreign exchange marketsLux (1995). Another application is the collective bankruptcy of assetsHisakado and Mori (2020). In the modeling of the time-series data of defaults of assets, the estimation of the default probability (PD) and the default correlation is crucial as it affects the risk estimation of the investment. In addition to the default correlation for the defaults in the same year, it is necessary to model the correlation of the defaults in different years. The beta-binomial distribution model with memory kernel is a model for both correlations and it is tractable compared to the established model based on the correlation of asset valuesPluto and Tasche (2011); Hisakado and Mori (2021).
The fascinating point of the beta-binomial distribution model with memory kernel is its phase transition, and its essence is grasped by a variant of the Pólya urnG.Pólya (1931), Pólya urn with memory kernelHisakado and Mori (2020). If the memory kernel decays with the power law, the power-law index determines the phases of the model. If , the autocorrelation function (ACF) shows power-law decay with exponent . For , the process becomes nonstationary and . If approaches 1, changes discontinuously to a value . The order parameter of the phase transition is the limit value of the ACF. In the nonstationary phase with for , the memory of the history remains forever and the estimation of the PD and other parameters becomes impossible even if the length of the time series is infinite.
This study addresses the phase transition of the Pólya urn with an arbitrary memory kernel using the stochastic differential equation (SDE). The original Pólya urn is a Markov process as the probability that the color of a newly added ball is red depends only on the present number of red balls in the urn. The Pólya urn with exponential decay memory kernel can also be described as a Markov process when one focuses on the probability that the color of a newly added ball is red. However, if the memory kernel is a power-law decay one, it is necessary to introduce an infinite number of auxiliary variables to make the process Markov. The auxiliary field obeys the SDE with a common Wiener process. Using the formulation, we derive the self-consistent equation for the ACF. By studying the equation, we succeed to derive the power-law exponents of the leading and subleading terms of the ACF.
Our analysis is very similar to the one that studies the Hawkes process using a stochastic differential equationKanazawa and D.Sornette (2020a, b). The Hawkes process is a non-Markov self-excited point processHawkes (1971). The Hawkes process shows a stationary–nonstationary phase transition when the branching ratio exceeds one. The critical and near-critical behaviors of the probability density function of the intensity have been studied by solving the Master equationKanazawa and D.Sornette (2020a).
In our study of the Pólya urn with memory kernel, we focus on the phase transition. Two universality classes are known for the phase transitions of non-linear Pólya urnsMori and Hisakado (2015a); Nakayama and Mori (2021), which have the same scaling structure of order parameter as that of the absorption state phase transitionsHinrichsen (2000). Our goal is to understand the nature of the phase transition of the ”linear” Pólya urn with power-law memory kernel based on the behaviors of the order parameter. The organization of the paper is as follows. Section II defines the model. We review some results of the Pólya urn. In section III, we study the model using the SDE. We start from the exponential decay memory kernel model. The ACF decays exponentially and the process is stationary. Next we study the power-law decay memory kernel model. We introduce an auxiliary field to make the process Markovian and we derive the self-consistent equation for the ACF. Section IV shows the results about the asymptotic behavior of the ACF for the power-law decay memory kernel model. We study the power-law exponents of the leading and sub-leading terms of the ACF. In section V, we summarize the results.
II Model
The Pólya urn is a binary stochastic process and the probability that takes the value 1 is
(1) |
are parameters and can be interpreted as the initial numbers of red (1) and blue (0) balls in the urnG.Pólya (1931). They are parameters and are not necessarily restricted to be integers. We denote as .
We define the Pólya urn with an arbitrary memory kernel Hisakado and Mori (2020). We denote the cumulative sum of as
and the weighted sum of with weights as ,
is then defined as
The probability that takes the value 1 is
(2) |
If one chooses , and , so that (2) reduces to (1). The probabilistic rule of (2) is linear in , the expected value of is .
We are interested in the response of by the change in the initial value . The ACF is defined as the covariance of and divided by the variance of as,
We have another expression for as Mori and Hisakado (2015a),
depends on and , we have
The order parameter of the phase transition of the stochastic process is defined as the limit value of , . The conditional expected value of under the condition for is the average probability that takes the value 1 under the same condition.
As , we have the recursive equation for as,
(3) |
One can compute using this equation and the initial condition .
There are several choices for the memory kernel .
-
1.
Pólya urn, .
As we have noted before, if one choose , the model is the Pólya urn. We denote for this case as . obeys a binomial distribution with shape parameters G.Pólya (1931). The expected value of is . obeys the beta distribution and the variance is . When one is interested in the dynamics of for the initial condition , it is only necessary to alter the parameters and as and , respectively. The conditional expected value is for and ACF becomes constant.
(4) The process is not stationary and not mean-reverting. The conditional variance of for is
(5) -
2.
Finite memory kernel, and .
There is a cutoff in the memory kernel, and for , and for . For , is influenced by the most recent variables . For , are influenced from all past variables . is defined as
The probability that changes by for the condition is given as
If one adopts the mean field approximation as and , the probabilities can be written as
By the approximation, the model becomes the Kirman’s ant colony modelKirman (1993). In the Kirman’s ant colony model, there are two food sources: and ants. The numbers of ants that have chosen food sources 1 and 0 are and , respectively. A randomly selected ant chooses food sources 1 and 0 with a probability that is linear in and among ants. The probability that the ratio changes is given as
The prefactors in the right hand side of the equations are the probabilities that the randomly selected ants are from food sources 0 and 1, respectively. The remaining terms are the probabilities that the selected ant chooses food sources 1 and 0, respectively. The are noises in the decision of the ants. The Pólya urn with finite memory kernel corresponds to the ant colony model in the mean filed approximation.
The stationary distribution of is a beta binomial distribution, with shape parameters . decays exponentially as , which defines the correlation length . depends on the length of the memory and diverges in the limit Mori and Hisakado (2015b).
-
3.
Exponential decay memory kernel, .
-
4.
Power-law decay memory kernel, .
The model shows a phase transition at Hisakado and Mori (2020). By the numerical study of the ACF , it was conjectured that the asymptotic behavior of is
The critical power-law exponent at is given as the solution of the next equation,
(6)
III Stochastic differential equation
We study the Pólya urn with memory kernel using the method of stochastic approximation Gardiner (2009); Renlund (2010). We start from the exponential decay memory kernel model. For the reader’s convenience, we summarize the results of the analysis of the original Pólya urn using the SDE in Appendix A.
III.1 Exponential decay memory kernel model :
We denote for as ,, , respectively. We decompose and as
can be decomposed as
(7) |
For , and we write . The conditional expected value and the variance of for is
Based on the assumption that ACF decays with rapidly, fluctuates around its mean value and , we approximate the conditional variance of for by replacing with as,
One reads the drift and diffusion terms from the conditional expected value and the variance, and the SDE becomes
are defined as,
It is the Ornstein-Uhlenbeck process. The condition of mean-reverting is satisfied and fluctuates around Gardiner (2009); Vatiwutipong and Phewchean (2019).
The solution for the SDE with the initial condition is
The conditional expected value and the variance of for the initial condition are
The conditional variance does not depend on the initial condition. For , , and . The ACF is estimated as
decays exponentially and for . forgets the initial value and fluctuates around with variance for . In the limit , , and becomes constant. The stochastic process remembers its initial condition and the process is not stationary. A discontinuous phase transition occurs at .
III.2 Power-law decay memory kernel model :
We denote for as , , , respectively. Different from the exponential decay memory kernel model, it is impossible to decompose with and directly. We introduce an auxiliary field and express as the superposition of the field. Using the decomposition of with and of (7), we derive the SDE for the auxiliary field and estimate .
First, we express as the superposition of with the gamma distribution as
(8) |
We also express and as the superposition of and with .
is then written as
We denote the average of function with the weight by ,
The denominator ensures the identity and is written as .
Next we derive the SDE for the auxiliary field . When , is estimated as . The probability that takes the value 1 with the condition is then given as
As in (7), we have,
(9) | |||||
Here, we define . We denote the weighted average of and with the weights and as ,
(10) |
Then, the conditional expected value is written as
Likewise, we estimate the conditional variance as
We assume that z(t) does not converge to 0 nor 1 and the numerator of the variance does not vanish. We can replace the numerator of the variance with constant for simplicity without affecting the essence of the process. One reads the drift and the diffusion terms and the SDE for the auxiliary field becomes
(11) |
We note that the auxiliary field shares the common Wiener process as it emerges from .
The ”formal” solution for the auxiliary field to the initial value problem is
Here, is defined as
The reason to call the solution ”formal” is that in the integral of the second term of the solution is calculated using . We consider the deviation of from as , i.e., . is also defined as . is then written as
The formal solution for becomes
is then estimated as
To estimate the average in the right hand side of the solution for , we derive a useful approximation integral formula for the arbitrary function , which is valid for large .
It replaces the definite integral of multiplied by over with the definite integral of multiplied by over . For large , for and with dominates the asymptotic behavior. In the limit , can be replaced with . Here, we adopt instead of to ensure that the approximation formula gives for , which is the identity of (8). The formula is useful to study the asymptotic (large ) behavior of the left-hand side of the equation.
Then, we have two approximation formulas,
Using the results, we have
The conditional expected value of for is
is then given as
(12) |
We derive the integral equation for , which is the continuous approximation of (3).
IV Asymptotic behavior of Correlation function
We study the asymptotic behavior of the ACF for the power-law decay memory kernel model. We assume the power-law asymptotic behavior of the leading and sub-leading terms of the ACF as
(13) |
We multiply on both sides of (12) and we obtain
(14) |
We substitute (13) into (12) and compare the power-law exponent of the leading and subleading terms on both sides of the equation.
The asymptotic behaviors of are and for and , respectively. We put in and estimate the integral of the right hand side of (14).
After changing the variable from to , the first term of the integral becomes
Here we write . If the exponents of in the integrand are smaller than 1, the integral converges to the beta function, , in the limit . If or , it is necessary to regularize the integral. We denote the integral as ,
We write the finite term of in the limit as .
When , the integral converges and is .
The regularization procedure is simple. When and , the integral of diverges when the upper bound goes to 1. We regularize the integral as
Here we neglect a term that vanishes in the limit . The second term diverges as . Likewise for , the divergence of the integral of comes from both bounds of the integral. We regularize the integral as
When and , a logarithmic divergence appears, and we have
Using , (14) becomes
(15) |
-
1.
case.
and we assume . (15) becomes,
The last term does not appear when . The leading term of the left-hand side is . The leading term of the right-hand side is as and . By equating the coefficients of the leading terms, we obtain an identity
As , we obtain the result . The subleading term of the left-hand side is as , we have . We obtain . If one assume , we cannot match terms of both sides of (15). We can exclude the possibility for .
-
2.
case.
and we assume . (15) becomes,
The leading term is and we obtain a trivial identity . The subleading term is and we obtain the identity
(16) By solving the identity, we estimate , the critical exponent of the ACF, . The estimation of is obscure, as t The same identity with holds for by comparing the terms. It suggests that at .
-
3.
case.
and we assume . (15) becomes,
The leading term is and we obtain . The subleading term is and we obtain .
The power-law exponents of the leading and subleading terms of the ACF are summarized as
(20) |
We denote the values by to distinguish them from the numerically estimated values, which are denoted as .
We solve the recursive relation of (3) numerically and estimate for . We set and . For large , the leading term of is , and we have the relation for ,
(21) |
We estimate by the formula with . About , we use the relation
(22) |
The relation is based on the relation for the subleading term of , for . We estimate using the formula .

Figure 1 shows vs. . For reference, we also plot . As can be clearly observed, if is far from the critical value , the numerically estimated values agree with the theoretical results. However, near the critical point, the discrepancy becomes large. At the critical point, both coincide with .
V Conclusion
We study the Pólya urn with an arbitrary memory kernel. Previously, we have shown that the process shows a phase transition for the power-law decay memory kernelHisakado and Mori (2020). We formulate the problem using the method of stochastic approximation. It is necessary to introduce an auxiliary field to reduce the process Markovian. The SDE for the auxiliary field are coupled as the field is driven by a common Wiener process. We obtain the formal solution for the initial value problem and derive the self-consistent equation for the ACF in the framework. We estimate the power-law exponents for the leading and subleading terms of the ACF. The results for the power-law exponents are new except for at the critical point.
The essential difference from the continuous phase transitions of the nonlinear Pólya urn is the lack of the universal function. The transition of nonlinear Pólya urns has the correlation length scale for the decay of the ACF and the asymptotic behavior of the ACF obeys a scaling relation. For the existence of the scaling relation for ACF, the continuity of the power-law exponent and the logarithmic behavior of the ACF at the critical point is crucialMori and Hisakado (2015a); Nakayama and Mori (2021). In the phase transition of the Pólya urn with power-law decay, the power-law exponent is discontinuous at and the ACF does not show logarithmic behavior at the critical point. The phase transition of the Pólya urn with power-law memory kernel is completely different from that of the nonlinear Pólya urn. The nature of the former is the phase transition of the stationary-nonstationary phase transition. The nature of the latter is the absorption-state phase transition where there exists the scaling relation for the order parameter.
This study also shows the importance of the correlation of the Wiener process. The auxiliary field , which is introduced for the Pólya urn with power-law decay memory kernel, shares the same Wiener process and it induces the coupling among the modes of the field. If are driven by independent Wiener processes , they obey the Ornstein-Uhlenbeck process for the exponential decay memory kernel independently from each other. They are mean-reverting and converge to the same equilibrium . Even though with small should have long relaxation time, also converges to the same equilibrium. The phase transition does not occur in the case. The condition of the mean-revering for the multivariate Ornstein-Uhlenbeck process is knownVatiwutipong and Phewchean (2019). By the correlation of the Wiener process, the condition should be broken. To understand the importance of the correlation of the Wiener process for the asymptotic behavior of the process, we need to solve the initial value problem for the SDE of the coupled auxiliary field.
References
- Mantegna and Stanley (2007) R. N. Mantegna and H. E. Stanley, Introduction to Econophysics: Correlations and Complexity in Finance (Cambridge University Press, Cambridge, 2007).
- Galam (2008) S. Galam, Int. J. Mod. Phys. C 19, 409 (2008).
- Castellano et al. (2009) C. Castellano, S. Fortunato, and V. Loreto, Rev.Mod.Phys. 81, 591 (2009).
- Arthur (1989) W. B. Arthur, Econ. Jour. 99, 116 (1989).
- Bikhchandani et al. (1992) S. Bikhchandani, D. Hirshleifer, and I. Welch, J. Polit. Econ. 100, 992 (1992).
- Cont and Bouchaud (2000) R. Cont and J. Bouchaud, Macroecon. Dynam. 4, 170 (2000).
- Mori et al. (2016) S. Mori, K. Nakayama, and M. Hisakado, Phys.Rev. E 94, 052301 (2016).
- Kirman (1993) A. Kirman, Q. J. Econ. 108, 137 (1993).
- Lux (1995) T. Lux, Econ. J. 105, 881 (1995).
- Hisakado and Mori (2020) M. Hisakado and S. Mori, Physica A 544, 123480 (2020).
- Pluto and Tasche (2011) K. Pluto and D. Tasche, in The Basel II Risk Parameters, edited by B.Engelmann. and R. Rauhmeier (Springer, Berlin, 2011) pp. 75–101.
- Hisakado and Mori (2021) M. Hisakado and S. Mori, Physica A 563, 125435 (2021).
- G.Pólya (1931) G.Pólya, Ann. Inst. Henri Poincaré 1, 117 (1931).
- Kanazawa and D.Sornette (2020a) K. Kanazawa and D.Sornette, Phys. Rev. Lett. 125, 138301 (2020a).
- Kanazawa and D.Sornette (2020b) K. Kanazawa and D.Sornette, Phys. Rev. Research. 2, 033442 (2020b).
- Hawkes (1971) A. Hawkes, J. R. Stat. Soc. Ser. B 33, 438 (1971).
- Mori and Hisakado (2015a) S. Mori and M. Hisakado, Phys.Rev. E 92, 052112 (2015a).
- Nakayama and Mori (2021) K. Nakayama and S. Mori, Phys. Rev.E 104, 014109 (2021).
- Hinrichsen (2000) H. Hinrichsen, Adv.Phys. 49, 815 (2000).
- Mori and Hisakado (2015b) S. Mori and M. Hisakado, J.Phys.Soc.Jpn. 84, 054001 (2015b).
- Bhm (2000) W. Bhm, J. Appl. Prob. 37, 470 (2000).
- Gardiner (2009) C. Gardiner, Stochastic Methods: A handbook for the Natural and Social Science, 4th ed. (Springer, Berlin, 2009).
- Renlund (2010) H. Renlund, “Generalized pólya urns via stochastic approximation,” (2010), arXiv:1002.3716 [math.PR] .
- Vatiwutipong and Phewchean (2019) P. Vatiwutipong and N. Phewchean, Adv.Differ.Equ. , 276 (2019).
Appendix A SDE analysis of Pólya urn
We study the Pólya urn using the SDE for the reader’s convenience. We denote for the infinite memory kernel as ,,, respectively. To estimate the expected value and the variance of with the condition , we decompose as
We estimate the conditional expected value and conditional variance of as follows:
We read the drift and variance from them and derive the SDE Gardiner (2009) as
are defined as follows:
As for the Pólya urn, we take the limit by replacing with and approximate as
The SDE is the Ornstein–Uhlenbeck type SDEGardiner (2009). The solution with the initial condition is
We have , and the solution is written as follows:
The conditional expected value of for is
The ACF is then estimated as
The result agrees with (4). We estimate the conditional variance of for as
Apart from the denominator of the prefactor, it agrees with the exact result of (5).