This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

From the multi-terms urn model to the self-exciting negative binomial distribution
and
Hawkes processes

Masato Hisakado [email protected] Nomura Holdings, Inc., Otemachi 2-2-2, Chiyoda-ku, Tokyo 100-8130, Japan.    Kodai Hattori [email protected]    Shintaro Mori [email protected] †Department of Mathematics and Physics, Graduate School of Science and Technology, Hirosaki University
Bunkyo-cho 3, Hirosaki, Aomori 036-8561, Japan
Abstract

This study considers a new multi-term urn process that has a correlation in the same term and temporal correlation. The objective is to clarify the relationship between the urn model and the Hawkes process. Correlation in the same term is represented by the Pólya urn model and the temporal correlation is incorporated by introducing the conditional initial condition. In the double-scaling limit of this urn process, the self-exciting negative binomial distribution (SE-NBD) process, which is a marked Hawkes process, is obtained. In the standard continuous limit, this process becomes the Hawkes process, which has no correlation in the same term. The difference is the variance of the intensity function in that the phase transition from the steady to the non-steady state can be observed. The critical point, at which the power law distribution is obtained, is the same for the Hawkes and the urn processes. These two processes are used to analyze empirical data of financial default to estimate the parameters of the model. For the default portfolio, the results produced by the urn process are superior to those obtained with the Hawkes process and confirm self-excitation.

Hawkes process, Phase transition, Pólya urn model, Power law

I I. Introduction

Anomalous diffusion is one of the most interesting topics in sociophysics and econophysics galam ; galam2 ; Man . Models describing these phenomena have correlations Bro ; W2 ; G ; M ; hod ; hui ; sch , and show several types of phase transitions. In our previous work, we investigated voting models for an information cascade Mori ; Hisakado2 ; Hisakado3 ; Hisakado35 ; Hisakado4 ; Hisakado5 ; Hisakado6 ; Mori6 . This model is a type of urn process that represents the correlations and has two types of phase transitions. One is the information cascade transition, which is one of the non-equilibrium phase transitions Hisakado3 . The other is the convergence transition of the super-normal diffusion that corresponds to an anomalous diffusion Hod ; Hisakado2 .

Financial engineering has led to the development of several products to hedge risks. These products protect against a subset of the total loss on a credit portfolio in exchange for payments, and provide valuable insights into the market implications of default dependencies and clustering of defaults M2010 ; M2008 ; Sch . This aspect is important, because the difficulties in managing credit events depend on these correlations. The Hawkes process was recently used to represent the clustering defaults of time series Haw ; kir ; qp ; err ; KZ ; KZ1 . The clustering defaults mean that as the number of events increases, the probability of the events increases. This phenomenon corresponds to self-excitation and constitutes a temporal correlation. From the physical view point this process is characterized by a phase transition from the steady state to the non-steady state. It is one kind of the non-equilibrium phase transitions and the relation between the Hawkes process and the branching process is shown in Haw2 . In fact the extinction phase of the branching process corresponds to the steady state in Hawkess process Hisakado8 . Confirmation of the steady state is important for finance and risk management to hedge risks because it is not possible to manage the non-steady state.

In our previous study, we discussed the parameter estimation of the urn process, which has a correlation in the same term, and considered a multi-year case with a temporal correlation Hisakado6 ; Mori6 . In this work, we introduce a new extended multi-term urn process and discuss the relationship between the new urn process and the Hawkes process. In the continuous time limit one can obtain the self-exciting NBD (SE-NBD) process and Hawkes process as the parameters approach certain limits.

We study the properties of the phase transitions of these processes. In fact the simultaneous and temporal effects of the correlation were confirmed by analyzing empirical default data. The results confirmed that the urn process fits more accurately than the Hawkes process. The reason is the variance of the intensity function is nonzero. In fact some firms effect many companies and some firms do not effect the companies. It is one of the effects of the network, because the network with hubs have the large variance of the degree distributions Hisakado7 .

The remainder of this article is organized as follows. In Section II, we introduce the multi-term urn process. We discuss the relationship between the Hawkes process and phase transition. In Section III we present our study of the phase transition of this model. In Section IV, we present the power law of the distribution function at the critical point. In Section V we discuss the application of the process to empirical data of historical defaults and confirm its parameters. Finally, the conclusions are presented in Section VI.

II II. From the multi-term urn process to the Hawkes process

In this section, we consider a multi-term urn process that has correlations. In the first term, the urn contains θ0\theta_{0} red balls and n0θ0n_{0}-\theta_{0} white balls. Then, balls are sequentially taken out from the urn. For example, a single ball is taken out, after which we return the ball taken out to the urn and add ω\omega balls of the same color to the urn. Thus, the total number of balls increases by ω\omega a process, which is a correlation parameter in the same term, Hisakado . In fact as we take out more red balls, the probability that a red ball is taken out increases, because the number of red balls in urn increases. This is the correlation in the same term. We repeat the process NN times in the first term. Hence, the number of the total balls is n0+Nωn_{0}+N\omega in the end of the first term. This is nothing but the Pólya urn model, which has a beta binomial distribution (BBD). We summarize the parameters for this urn model in Appendix A.

Next we extend the process to the multi-term process. We repeat the same process as the first process with the updated parameter θt\theta_{t} at the t+1t+1-th term. In the t+1t+1-th term the urn contains θt=θ0+itkid^ti+1ω~\theta_{t}=\theta_{0}+\sum^{t}_{i}k_{i}\hat{d}_{t-i+1}\tilde{\omega} red balls, and n0θtn_{0}-\theta_{t} white balls. Here, the total number of initial balls is n0n_{0}, d^i\hat{d}_{i} denotes the kernel function or discount factor that represents the decrease of the effects of self excitation. Note that θt\theta_{t} depends on the history of number of the taken out red balls. For the normalization we set d^0=1\hat{d}_{0}=1. Here we introduce the variable XiX_{i} which is the number of red balls taken out in the ii-th term and its value is Xi=kiX_{i}=k_{i}. ω~\tilde{\omega} is one of the parameters for temporal correlation, where ω\omega denotes the scale parameter for the added red balls. ω~\tilde{\omega} is the ratio of the number of balls placed back to the urn to the number of balls drawn from the urn, when d^0=1\hat{d}_{0}=1 and d^i=0,i1\hat{d}_{i}=0,i\geq 1. In this case only the previous term affects the present term. In summary we update only the parameter θt\theta_{t} and other parameters are fixed in each term.

As the first term we sequentially take out balls from the urn. After that we return the taken out ball to the urn and ω\omega balls of the same color are added to the urn. We repeat the process NN times in this t+1t+1-th term. It is the definition of the one term. After the process of t+1t+1 term, we set the initial condition of t+2t+2-term and continue the t+2t+2-th process. Each term is also the Pólya urn model. In this model we use two kinds of correlations: the first is a correlation in the same term using the parameter ω\omega, and the other is the temporal correlation using the parameters ω~\tilde{\omega} which is independent from ω\omega and d^i\hat{d}_{i}, which is the kernel function. The temporal correlation decays as a function of time using the parameter d^i\hat{d}_{i}.

In Hisakado6 , we introduced a similar urn process with a correlation in the same term and temporal correlation. The difference is the initial condition of the t+1t+1-th term. The number of red balls is the same, but the number of white balls is n0θ0+it(Nki)d^ti+1ω~n_{0}-\theta_{0}+\sum_{i}^{t}(N-k_{i})\hat{d}_{t-i+1}\tilde{\omega}, where Xi=kiX_{i}=k_{i}, and the total number of balls in the initial condition of each term is not constant, n0n_{0}.

Refer to caption
Figure 1: Illustration of the multi-term urn process. The process in each term is a Pólya urn model. In each term we repeat the process that we take out a ball and place back ω\omega same collar balls, NN times. In this figure two balls are taken out in one term, N=2N=2. The process corresponds the black arrow. The conclusion of each term affects the initial condition of the next term as the temporal correlation. It corresponds to the white arrow. The total number of balls in the initial condition of each term is n0n_{0} and the number of the red balls in the initial condition of tt-th term is θt=θ0+itkid^ti+1ω~\theta_{t}=\theta_{0}+\sum^{t}_{i}k_{i}\hat{d}_{t-i+1}\tilde{\omega}. , d^i\hat{d}_{i} denotes the kernel function or discount factor. The number of red balls taken out in the ii-th is Xi=kiX_{i}=k_{i}. ω~\tilde{\omega} is the scale factor for the adjustment for the temporal correlation.

When k1k_{1} red balls are taken out in the 1-st term, X1=k1X_{1}=k_{1}, the probability in the 1-st term can be calculated as

P(X1=k1)\displaystyle P(X_{1}=k_{1}) =\displaystyle= N!k1!(Nk1)![θ0n0θ0+(k11)ωn0+(k11)ω][n0θ0n0+k1ωn0θ0+(Nk11)ωn0+(N1)ω]\displaystyle\frac{N!}{k_{1}!(N-k_{1})!}[\frac{\theta_{0}}{n_{0}}\cdots\frac{\theta_{0}+(k_{1}-1)\omega}{n_{0}+(k_{1}-1)\omega}][\frac{n_{0}-\theta_{0}}{n_{0}+k_{1}\omega}\cdots\frac{n_{0}-\theta_{0}+(N-k_{1}-1)\omega}{n_{0}+(N-1)\omega}] (1)
=\displaystyle= N!k1!(Nk1)!Πi=0k11(θ0+iω)Πj=0Nk11(n0θ0+jω)Πl=0N1(n0+lω)\displaystyle\frac{N!}{k_{1}!(N-k_{1})!}\frac{\Pi_{i=0}^{k_{1}-1}(\theta_{0}+i\omega)\Pi_{j=0}^{N-k_{1}-1}(n_{0}-\theta_{0}+j\omega)}{\Pi_{l=0}^{N-1}(n_{0}+l\omega)}
=\displaystyle= N!k1!(Nk1)!Πi=0k11(p+iρ/(1ρ))Πj=0Nk11(q+jρ/(1ρ))Πl=0N1(1+lρ/(1ρ)),\displaystyle\frac{N!}{k_{1}!(N-k_{1})!}\frac{\Pi_{i=0}^{k_{1}-1}(p+i\rho/(1-\rho))\Pi_{j=0}^{N-k_{1}-1}(q+j\rho/(1-\rho))}{\Pi_{l=0}^{N-1}(1+l\rho/(1-\rho))},

where p=θ0/n0p=\theta_{0}/n_{0}, q=1θ0/n0q=1-\theta_{0}/n_{0}, and ω/n0=ρ/(1ρ)\omega/n_{0}=\rho/(1-\rho). This is known as the beta binomial distribution BBD(α,β\mbox{BBD}(\alpha,\beta,NN) where p=α/(α+β)p=\alpha/(\alpha+\beta) and q=β/(α+β)q=\beta/(\alpha+\beta). Here, α\alpha and β\beta correspond to the parameters of the beta distribution in the continuous limit of BBD. Note that ρ=1/(1+α+β)\rho=1/(1+\alpha+\beta) plays the role of a correlation in this term Hisakado . In fact the variance of BBD is Npq+N(N1)pqρNpq+N(N-1)pq\rho and the second term corresponds to the correlation in the same term. Hence, ω\omega is the correlation parameter.

Herein we consider the temporal correlation by adjusting the initial conditions of the red balls, θt\theta_{t} in each term. The temporal correlation is the decay of the correlation at different times, Cor(Xt,Xt+τ)\mbox{Cor}(X_{t},X_{t+\tau}) as τ\tau increases. The total number of balls in the initial condition of the each term is n0n_{0}. The expected value of the ratio of the taken out red balls in t+1t+1-th term is θt/n0\theta_{t}/n_{0}, where θt=θ0+itkid^ti+1ω~\theta_{t}=\theta_{0}+\sum^{t}_{i}k_{i}\hat{d}_{t-i+1}\tilde{\omega}. This implies that the events of the previous terms affect the present events. Here the number of the events is the number of the red balls taken out. As the number of the events increases, the expected value of the number of events in the next term increases. Hence, we can introduce the temporal correlation by adjusting the initial condition of each term. When θt=θ0\theta_{t}=\theta_{0}, there is no temporal correlation, because each term is independent.

Here we set the first double scaling limit N/n0=ΔN/n_{0}=\Delta, NN\rightarrow\infty and n0n_{0}\rightarrow\infty. We can obtain

P(X1=k1)NBD(X1=k1|K0,M0/K0)=(K0+k11)!k1!(K01)!(K0K0+M0)K0(M0K0+M0)k1,P(X_{1}=k_{1})\sim\mbox{NBD}(X_{1}=k_{1}|K_{0},M_{0}/K_{0})=\frac{(K_{0}+k_{1}-1)!}{k_{1}!(K_{0}-1)!}(\frac{K_{0}}{K_{0}+M_{0}})^{K_{0}}(\frac{M_{0}}{K_{0}+M_{0}})^{k_{1}}, (2)

where M0=θ0N/n0=θ0ΔM_{0}=\theta_{0}N/n_{0}=\theta_{0}\Delta and K0=θ0/ωK_{0}=\theta_{0}/\omega. Here, \sim means that the stochastic variable on the LHS obeys the probability distribution on RHS. This is a negative binomial distribution (NBD), NBD(X1=k1|K0,M0/K0)\mbox{NBD}(X_{1}=k_{1}|K_{0},M_{0}/K_{0}). Parameter M0/K0=ωN/n0=ωΔM_{0}/K_{0}=\omega N/n_{0}=\omega\Delta is related to the correlation in this term. The mean and the variance of NBD is M0M_{0} and M0+M02/K0M_{0}+M_{0}^{2}/K_{0}, respectively.

The negative binomial distribution NBD(X1=k1|K0,M0/K0)\mbox{NBD}(X_{1}=k_{1}|K_{0},M_{0}/K_{0}) has another face:

NBD(X1=k1|K0,M0/K0)\displaystyle\mbox{NBD}(X_{1}=k_{1}|K_{0},M_{0}/K_{0}) =\displaystyle= 0Poisson(k1|λ)Gamma(λ|K0,M0/K0)𝑑λ,\displaystyle\int_{0}^{\infty}\mbox{Poisson}(k_{1}|\lambda)\cdot\mbox{Gamma}(\lambda|K_{0},M_{0}/K_{0})d\lambda, (3)
=\displaystyle= 0λk1eλk1!λK01Γ(K0)(M0/K0)K0˙eλK0/M0𝑑λ,\displaystyle\int_{0}^{\infty}\frac{\lambda^{k_{1}}e^{-\lambda}}{k_{1}!}\dot{\frac{\lambda^{K_{0}-1}}{\Gamma(K_{0})(M_{0}/K_{0})^{K_{0}}}}e^{-\lambda K_{0}/M_{0}}d\lambda,

where Poisson(k1|λ)\mbox{Poisson}(k_{1}|\lambda) is the Poisson process Gamma(λ|K0,M0/K0)\mbox{Gamma}(\lambda|K_{0},M_{0}/K_{0}) is the gamma distribution. We show the proof Eq.(3) in Appendix B. From this result NBD is the Poisson process with the intensity function λ\lambda which obeys the gamma distribution. In the multi-term model, we can decompose the process in the same way. Hence, λt\lambda_{t} is the intensity function of the tt-th term and obeying the gamma distribution. Gamma(λ|K0,M0/K0)\mbox{Gamma}(\lambda|K_{0},M_{0}/K_{0}) has an average of M0M_{0} and a variance of M02/K0M_{0}^{2}/K_{0}. This means that the urn process in the 1-st term corresponds to the Poisson process with intensity function λ\lambda, which has a gamma distribution in the double scaling limit. The intensity function yields the variance of comparing the Poisson process, which has a constant intensity function. We refer to this as the NBD process.

Next, we define the t+1t+1-th term with the temporal correlation Hisakado6 using the conditional probability. We define the conditional probability of t+1t+1-th term,

P(Xt+1=kt+1|X0=k0,,Xt=kt)=NBD(Xt+1=kt+1|Kt,Mt/Kt),P(X_{t+1}=k_{t+1}|X_{0}=k_{0},\cdots,X_{t}=k_{t})=\mbox{NBD}(X_{t+1}=k_{t+1}|K_{t},M_{t}/K_{t}), (4)

where kik_{i} is the history of the number of red balls taken out. The conditional probability is defined by updating parameters KtK_{t} and MtM_{t}. The only difference between the 1-st term and the t+1t+1-th term is the number of white balls in the initial condition of the term. The other conditions are the same as those in the 1-st term. We can obtain the parameters at the t+1t+1-th term for the intensity function:

Mt\displaystyle M_{t} =\displaystyle= θtN/n0=θ0+itXid^t+1iω~n0N=(θ0+itXid^t+1iω~)Δ\displaystyle\theta_{t}N/n_{0}=\frac{\theta_{0}+\sum_{i}^{t}X_{i}\hat{d}_{t+1-i}\tilde{\omega}}{n_{0}}N=(\theta_{0}+\sum_{i}^{t}X_{i}\hat{d}_{t+1-i}\tilde{\omega})\Delta (5)
=\displaystyle= M0+M0/L0itXid^t+1i,\displaystyle M_{0}+M_{0}/L_{0}\sum_{i}^{t}X_{i}\hat{d}_{t+1-i},

where M0=θ0N/n0=θ0ΔM_{0}=\theta_{0}N/n_{0}=\theta_{0}\Delta, K0=θ0/ωK_{0}=\theta_{0}/\omega, L0=θ0/ω~L_{0}=\theta_{0}/\tilde{\omega} and ωN/n0=ωΔ=M0/K0\omega N/n_{0}=\omega\Delta=M_{0}/K_{0}. The other parameters are obtained as follows:

Kt=θt/ω=θ0+itXid^t+1iω~ω=K0+K0/L0itkid^t+1i,K_{t}=\theta_{t}/\omega=\frac{\theta_{0}+\sum_{i}^{t}X_{i}\hat{d}_{t+1-i}\tilde{\omega}}{\omega}=K_{0}+K_{0}/L_{0}\sum_{i}^{t}k_{i}\hat{d}_{t+1-i}, (6)

and

Mt/Kt=ωN/n0=ωΔ=M0/K0.M_{t}/K_{t}=\omega N/n_{0}=\omega\Delta=M_{0}/K_{0}. (7)

We refer to this process as the discrete self-exciting negative binomial distribution (SE-NBD) process. The self-exciting is introduced by the conditional probability, Eq. (4). Note that for all the process parameters, Mt/KtM_{t}/K_{t} is a constant M0/K0M_{0}/K_{0}. This signifies that the correlation in the same term does not depend on the term tt. By this condition the process has the reproductive property of NBD. The mean of the intensity function is MtM_{t} and the variance is Mt2/KtM_{t}^{2}/K_{t}. As ω\omega increases, KtK_{t} decreases, and the variance of the intensity function increases. In this mean the correlation in the same term affects the variance of the intensity function. In the limit K0(ω0)K_{0}\rightarrow\infty(\omega\rightarrow 0) with Δ\Delta fixed, the intensity function has a vanishing variance and thus its distribution converges to the delta function. This is known as the discrete Hawkes process kir . The intensity function is illustrated in Fig.2 (a). L0L_{0} acts as a parameter for the temporal correlation as ω~\tilde{\omega}.

In summary, we obtained that the discrete SE-NBD process XtX_{t} obeys NBD for MtM_{t} from the urn process,

Xt+1NBD(Kt,M0/K0),t0,X_{t+1}\sim\mbox{NBD}\left(K_{t},M_{0}/K_{0}\right),t\geq 0, (8)

where

Mt=M0+M0/L0s=1tXsd^t+1s,t1,M_{t}=M_{0}+M_{0}/L_{0}\sum_{s=1}^{t}X_{s}\hat{d}_{t+1-s},t\geq 1, (9)

and

Kt=K0+K0/L0s=1tXsd^t+1s,t1.K_{t}=K_{0}+K_{0}/L_{0}\sum_{s=1}^{t}X_{s}\hat{d}_{t+1-s},t\geq 1. (10)

In the limit K0(ω=0)K_{0}\rightarrow\infty(\omega=0), the discrete Hawkes process, XtX_{t}, is shown to obey the Poisson process for MtM_{t} from the urn process,

Xt+1Poisson(Mt),t0,X_{t+1}\sim\mbox{Poisson}\left(M_{t}\right),t\geq 0, (11)

where

Mt=M0+M0/L0s=1tXsd^t+1s,t1.M_{t}=M_{0}+M_{0}/L_{0}\sum_{s=1}^{t}X_{s}\hat{d}_{t+1-s},t\geq 1. (12)

In the limit L0(ω~=0)L_{0}\rightarrow\infty(\tilde{\omega}=0), the process becomes an NBD process, which does not have self-excitation capabilities. Δ\Delta corresponds to the number of balls taken out in a term, and represents the interval between the terms. Here we introduce the counting process, X~t=iXi\tilde{X}_{t}=\sum_{i}X_{i}. We set the second double scaling limit Δ=N/n00\Delta=N/n_{0}\rightarrow 0, ω\omega\rightarrow\infty with ωΔ=ω\omega\Delta=\omega^{\prime}, as the continuous limit of the process X~t\tilde{X}_{t}. Note that in this limit ωΔ=ω\omega\Delta=\omega^{\prime} is constant and the process has the reproductive property as a discrete SE-NBD process. We can obtain the mean of the intensity function at tt, λt\lambda_{t}

E(λt|Ft)=limΔ0E[X~t+ΔX~t|Ft]Δ=limΔ0MtΔ=(θ0+ω~i<tXid^ti),E(\lambda_{t}|F_{t})=\lim_{\Delta\rightarrow 0}\frac{E[\tilde{X}_{t+\Delta}-\tilde{X}_{t}|F_{t}]}{\Delta}=\lim_{\Delta\rightarrow 0}\frac{M_{t}}{\Delta}=(\theta_{0}+\tilde{\omega}\sum_{i<t}X_{i}\hat{d}_{t-i}), (13)

where FtF_{t} is the history of the number of red balls and X1=k1,,Xt=ktX_{1}=k_{1},\cdots,X_{t}=k_{t} and it corresponds to the Hazard function sch . The variance of the intensity of the distribution at time tt is

Var(λt|Ft)=limΔ0Mt2/KtΔ=ω(θ0+ω~i<td^tiXi).Var(\lambda_{t}|F_{t})=\lim_{\Delta\rightarrow 0}\frac{M_{t}^{2}/K_{t}}{\Delta}=\omega^{\prime}(\theta_{0}+\tilde{\omega}\sum_{i<t}\hat{d}_{t-i}X_{i}). (14)

In the continuous SE-NBD process, the intensity function exhibits a gamma distribution as a discrete SE-NBD process. We can confirm that the intensity function becomes a delta function in the limit ω0\omega\rightarrow 0, which corresponds to the continuous Hawkes process. In summary, we can obtain in the continuous limit,

X~t+ΔX~tNBD(θtΔ/ω,ω),t0,\tilde{X}_{t+\Delta}-\tilde{X}_{t}\sim\mbox{NBD}\left(\theta_{t}\Delta/\omega^{\prime},\omega^{\prime}\right),t\geq 0, (15)

where

θt=θ0+ω~s<tXsd^ts,t0.\theta_{t}=\theta_{0}+\tilde{\omega}\sum_{s<t}X_{s}\hat{d}_{t-s},t\geq 0. (16)

In the limit ω0\omega^{\prime}\rightarrow 0, the continuous SE-NBD process becomes the Hawkes process.

X~t+ΔX~tPoisson(θtΔ),t0,\tilde{X}_{t+\Delta}-\tilde{X}_{t}\sim\mbox{Poisson}\left(\theta_{t}\Delta\right),t\geq 0, (17)

where

θt=θ0+ω~s<tXsd^ts,t0.\theta_{t}=\theta_{0}+\tilde{\omega}\sum_{s<t}X_{s}\hat{d}_{t-s},t\geq 0. (18)

We show the path form the discrete Hawkes process to the Hawkes process in Appendix C. Finally, we show the path from the urn process to SE-NBD process and the Hawkes process in Fig. 2 (b).

Refer to caption (a) Refer to caption (b)
Figure 2: Difference between the continuous SE-NBD and Hawkes processes. (a) Intensity function of the continuous SE-NBD process, which obeys the gamma function (dotted line) and Hawkes process, which is the delta function (solid line). It is the intensity function at tΔt-\Delta, tt, and Δ\Delta. (b) We can confirm the flow from the BBD to the Hawkes process through the NBD process.

III III. Phase transition of the new urn process

In this section we consider the phase transition of the SE-NBD process. Here we apply the mean field approximation that we set v¯=Xt\bar{v}=X_{t}. We consider the increase of the process in Δ\Delta,

E[X~t+ΔX~t]=E[λt|Ft]=[θ0+v¯ω~id^i]Δ,E[\tilde{X}_{t+\Delta}-\tilde{X}_{t}]=E[\lambda_{t}|F_{t}]=[\theta_{0}+\bar{v}\tilde{\omega}\sum_{i}\hat{d}_{i}]\Delta, (19)

where we use Eq.(16). We set the average, v¯\bar{v}, of the intensity function and LHS of Eq.(19) is v¯Δ\bar{v}\Delta. Then we obtain,

v¯=θ0/(1ω~T^),\bar{v}=\theta_{0}/(1-\tilde{\omega}\hat{T}), (20)

where T^=d^i\hat{T}=\sum^{\infty}\hat{d}_{i}.

In the limit ω~0\tilde{\omega}\rightarrow 0, the temporal correlation is zero and the process is the NBD process, where the phase transition disappears.

The SE-NBD includes the Hawkes process as a branching process. The branching ratio is

ν=ω~T^,\nu=\tilde{\omega}\hat{T}, (21)

and the condition for the steady state is

ν=ω~T^<1.\nu=\tilde{\omega}\hat{T}<1. (22)

The phase transition between the steady state and non-steady state occurs at ν=1\nu=1, which is the critical point. The transition point is the same as that in the Hawkes process KZ ; KZ1 .

The parameter ν\nu is termed the effective reproduction number with regard to an infectious disease. It is the number of patients infected by one patient in the infection model. If the effective reproduction number is above 1, the number of patients increases to infinity indicating the non-steady state.

In the SE-NBD process, the distribution of the intensity function has variance. By contrast, in the Hawkes process, the intensity function is a delta function. The variance of the intensity function is the origin of the variance of the branching ratios. This means that the SE-NBD process has the variance of branching ratios, because the intensity function has the variance. In fact, nishi demonstrated that the effective reproduction number depends on the COVID-19 environment. The mixture of branching ratios affects not only the expected value of the intensity function but also the variance of the intensity function. Hence, the SE-NBD process, which has gamma distribution as the intensity function, may be useful. We confirm this in Section V.

We consider the exponential and power decay cases as the kernel function. These correspond to short and long memories Long as the kernel function. When we consider the exponential decay case d^=eβt\hat{d}=e^{-\beta t}, the condition for the steady state is M0/L0<βM_{0}/L_{0}<\beta. When we consider the power decay case d^=1/(1+t)γ\hat{d}=1/(1+t)^{\gamma}, the condition for the steady state is M0/L0<γ1M_{0}/L_{0}<\gamma-1. In Section V we use the exponential kernel for the empirical default data.

IV IV. Power-law distribution at critical point

We start with a discrete SE-NBD process {Xt},t=1,\{X_{t}\},t=1,\cdots. Here, Xt{0,1,}X_{t}\in\{0,1,\cdots\} represents the size of an event at time tt. This is the process we obtained from the urn process. This event corresponds to the taken out of the red ball from the urn. XtX_{t} obeys NBD for MtM_{t}.

Xt+1\displaystyle X_{t+1} \displaystyle\sim NBD(Mtω=Kt,ωΔ),t0,\displaystyle\mbox{NBD}\left(\frac{M_{t}}{\omega}=K_{t},\omega\Delta\right),t\geq 0,
Mt\displaystyle M_{t} =\displaystyle= M0+ni=1tXsh(ti),t1,\displaystyle M_{0}+n\sum_{i=1}^{t}X_{s}h(t-i),t\geq 1,

where n=M0/(1r)L0n=M_{0}/(1-r)L_{0}. We adopt the exponential decay kernel function, h(t)=(1r)d^=(1r)rt,0r<1h(t)=(1-r)\hat{d}=(1-r)r^{t},0\leq r<1. In addition, we replace the normalization factor (1r)(1-r) of h(t)h(t) with 1/τ1/\tau to ensure that 0h(t)𝑑t=1τ0et/τ𝑑t=1\int_{0}^{\infty}h(t)dt=\frac{1}{\tau}\int_{0}^{\infty}e^{-t/\tau}dt=1.

The stochastic process {Xt},t=1,\{X_{t}\},t=1,\cdots is non-Markovian. We focus on the time evolution of the intensity function MtM_{t}, which satisfies the following recursive equation.

Mt+1=r(MtM0)+M0+nh(0)Xt+1.M_{t+1}=r(M_{t}-M_{0})+M_{0}+nh(0)X_{t+1}.

Here, we use the relationship i=1t+1Xih(t+1i)=Xt+1h(0)+ri=1tXsh(ti)\sum_{i=1}^{t+1}X_{i}h(t+1-i)=X_{t+1}h(0)+r\sum_{i=1}^{t}X_{s}h(t-i). The stochastic difference equation for the excess intensity z^tMtM0\hat{z}_{t}\equiv M_{t}-M_{0} is

z^t+1z^t=(r1)z^t+nh(0)Xt+1.\hat{z}_{t+1}-\hat{z}_{t}=(r-1)\hat{z}_{t}+nh(0)X_{t+1}. (23)

We take the continuous time limit as in Section II. We divided the unit time interval by the infinitesimal time intervals with width dt=Δdt=\Delta. The decreasing factor rtr^{t} during the interval dtdt is replaced with rdt=edt/τ1dt/τ+o(dt/τ)r^{dt}=e^{-dt/\tau}\simeq 1-dt/\tau+o(dt/\tau).

Xt+1X_{t+1} is the noise for time interval [t,t+1][t,t+1], and it is necessary to prepare the noise for the infinitesimal interval [t,t+dt)[t,t+dt). For the purpose, we use the reproductive property of NBD. If Xt+1X_{t+1}, the noise for the interval [t,t+1)[t,t+1) obeys NBD(θt/ω,ω)\mbox{NBD}(\theta_{t}/\omega,\omega). The noise for [t,t+dt)[t,t+dt) is denoted by dξ^(θt/ω,ω)NB(t)d\hat{\xi}^{NB}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})}(t). Here, the double scaling limit is applied to change the parameter from ω\omega to ω\omega^{\prime}. As Eq.(15) the stochastic difference equation (SDE) then becomes

dz^t=z^t+dtz^t=1τz^tdt+nτdξ^(θ0+z^tω,ω)NBD(t).d\hat{z}_{t}=\hat{z}_{t+dt}-\hat{z}_{t}=-\frac{1}{\tau}\hat{z}_{t}dt+\frac{n}{\tau}d\hat{\xi}^{NBD}_{(\frac{\theta_{0}+\hat{z}_{t}}{\omega^{\prime}},\omega^{\prime})}(t). (24)

The state-dependent NBD noise ξ^θt/ω,ωNBD\hat{\xi}^{NBD}_{\theta_{t}/\omega,\omega} defines the noise for the infinitesimal time interval [t,t+dt)[t,t+dt) with the following probabilistic rules:

dξ^(θt/ω,ω)NBD(t)ξ^(θt/ω,ω)NBD(t+dt)ξ^(θt/ω,ω)NBD(t)NBD(θtdt/ω,ω).d\hat{\xi}^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})}(t)\equiv\hat{\xi}^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})}(t+dt)-\hat{\xi}^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})}(t)\sim\mbox{NBD}(\theta_{t}dt/\omega^{\prime},\omega^{\prime}).

The characteristic function for dξ^(θt/ω,ω)NBDd\hat{\xi}^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})} is ϕ(s)=(11+ωωeis)θtdt/ω\phi(s)=(\frac{1}{1+\omega^{\prime}-\omega^{\prime}e^{is}})^{\theta_{t}dt/\omega^{\prime}}. The infinite divisibility is clear from the functional form. In addition, dξ^(θt/ω,ω)NBDd\hat{\xi}^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})} can be written as the superposition of the Poisson noise with the state-dependent random intensity λt\lambda_{t},

dξ(θt/ω,ω)NBDPoisson(λtdt),λtGamma(θt/ω,ω).d\xi^{NBD}_{(\theta_{t}/\omega^{\prime},\omega^{\prime})}\sim\mbox{Poisson}(\lambda_{t}dt),\lambda_{t}\sim\mbox{Gamma}(\theta_{t}/\omega^{\prime},\omega^{\prime}).

as we discussed in section II. When we denote the timing and size of the ii-th event as tit_{i} and kik_{i}, and the number of events before tt as N^(t)\hat{N}(t), we can rewrite the state-dependent NBD noise ξ^(θ0+z^tω,ω)NBD(t)\hat{\xi}^{NBD}_{(\frac{\theta_{0}+\hat{z}_{t}}{\omega^{\prime}},\omega^{\prime})}(t) as

ξ^(θ0+z^tω,ω)NBD(t)=i=1N^(t)kiδ(tti).\hat{\xi}^{NBD}_{(\frac{\theta_{0}+\hat{z}_{t}}{\omega^{\prime}},\omega^{\prime})}(t)=\sum_{i=1}^{\hat{N}(t)}k_{i}\delta(t-t_{i}).

The probability of the occurrence and non-occurrence of an event of size kk during time interval dtdt is given as,

P(dξ^(θ0+z^tω,ω)NBD=k)={1z^t+θ0ωln(ω+1)dtk=01k(z^t+θ0ω)(ωω+1)kdtk1.P\left(d\hat{\xi}^{NBD}_{(\frac{\theta_{0}+\hat{z}_{t}}{\omega^{\prime}},\omega^{\prime})}=k\right)=\left\{\begin{array}[]{cc}1-\frac{\hat{z}_{t}+\theta_{0}}{\omega^{\prime}}\ln(\omega^{\prime}+1)dt&k=0\\ \frac{1}{k}\left(\frac{\hat{z}_{t}+\theta_{0}}{\omega^{\prime}}\right)\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}dt&k\geq 1.\end{array}\right. (25)

In the limit ω0\omega\to 0, the probabilities becomes

limω0P(dξ^(θ0+z^tω,ω)NBD=k)={1(z^t+θ0)dtk=0(z^t+θ0)dtk=10k2.\lim_{\omega\to 0}P\left(d\hat{\xi}^{NBD}_{(\frac{\theta_{0}+\hat{z}_{t}}{\omega^{\prime}},\omega^{\prime})}=k\right)=\left\{\begin{array}[]{cc}1-(\hat{z}_{t}+\theta_{0})dt&k=0\\ (\hat{z}_{t}+\theta_{0})dt&k=1\\ 0&k\geq 2.\end{array}\right.

kik_{i} is restricted to be one or zero, and the state dependent noise becomes the Poisson noise. We discuss the relation between the SE-NBD and the marked Hawkes processes in Appendix D.

The SDE (24) is interpreted as

z^(t+dt)z^(t)={1τz^tProb.=1z^t+θ0ωln(ω+1)dtnkτProb.=1k(z^t+θ0ω)(ωω+1)kdt,k=1,.\hat{z}(t+dt)-\hat{z}(t)=\left\{\begin{array}[]{cc}-\frac{1}{\tau}\hat{z}_{t}&\mbox{Prob.}=1-\frac{\hat{z}_{t}+\theta_{0}}{\omega^{\prime}}\ln(\omega^{\prime}+1)dt\\ \frac{nk}{\tau}&\mbox{Prob.}=\frac{1}{k}\left(\frac{\hat{z}_{t}+\theta_{0}}{\omega^{\prime}}\right)\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}dt,k=1,\cdots.\end{array}\right.

The same procedure is adopted to derive the master equation for the probability density function (PDF) of z^t\hat{z}_{t} in KZ ; KZ1 . This yields

tPt(z)=1τzzPt(z)+k=11ωk(ωω+1)k{(θ0+znkτ)Pt(znkτ)(θ0+z)Pt(z)}.\frac{\partial}{\partial t}P_{t}(z)=\frac{1}{\tau}\frac{\partial}{\partial z}zP_{t}(z)+\sum_{k=1}^{\infty}\frac{1}{\omega^{\prime}k}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}\left\{(\theta_{0}+z-\frac{nk}{\tau})P_{t}(z-\frac{nk}{\tau})-(\theta_{0}+z)P_{t}(z)\right\}. (26)

The Laplacian representation of the steady state is denoted as PSS(z)P_{SS}(z) as P~SS(s)\tilde{P}_{SS}(s).

P~SS(s)0PSS(z)esz𝑑z.\tilde{P}_{SS}(s)\equiv\int_{0}^{\infty}P_{SS}(z)e^{-sz}dz.

The master equation for P~SS(s)\tilde{P}_{SS}(s) is

[k=11ωk(ωω+1)k(enkτs1)+sτ]ddsP~SS(s)=k=1n1ωk(ωω+1)kθ0(enkτs1)P~SS(s).\displaystyle\left[\sum_{k=1}^{\infty}\frac{1}{\omega^{\prime}k}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}\left(e^{-\frac{nk}{\tau}s}-1\right)+\frac{s}{\tau}\right]\frac{d}{ds}\tilde{P}_{SS}(s)=\sum_{k=1}^{n}\frac{1}{\omega k^{\prime}}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}\theta_{0}\left(e^{-\frac{nk}{\tau}s}-1\right)\tilde{P}_{SS}(s).

Thus,

ddslnP~SS(s)=θ0θ0s/τk=11ωk(ωω+1)k(enkτs1)+sτ.\frac{d}{ds}\ln\tilde{P}_{SS}(s)=\theta_{0}-\frac{\theta_{0}s/\tau}{\sum_{k=1}^{\infty}\frac{1}{\omega^{\prime}k}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}(e^{-\frac{nk}{\tau}s}-1)+\frac{s}{\tau}}.

We integrate the equation with the initial condition P~SS(0)=1\tilde{P}_{SS}(0)=1.

lnP~SS(s)=θ0s0sθ0s/τk=11ωk(ωω+1)k(enkτs1)+sτ𝑑s.\ln\tilde{P}_{SS}(s)=\theta_{0}s-{\displaystyle\int_{0}^{s}\frac{\theta_{0}s^{\prime}/\tau}{\sum_{k=1}^{\infty}\frac{1}{\omega^{\prime}k}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k}(e^{-\frac{nk}{\tau}s^{\prime}}-1)+\frac{s^{\prime}}{\tau}}}ds^{\prime}.

Here, the large zz behavior of PSS(z)P_{SS}(z) near the critical point n=1n=1 is of interest, and we study the integral at s0s\simeq 0 and n=1ϵ,ϵ<<1n=1-\epsilon,\epsilon<<1. We expand enkτs=1nksτ+12(nksτ)2+o(s2)e^{\frac{nk}{\tau}s}=1-\frac{nks}{\tau}+\frac{1}{2}(\frac{nks}{\tau})^{2}+o(s^{2}) and calculate the summation over kk in the denominator of the integral. Therefore, we have

lnP~SS(s)θ0s0s2θ0τω+12τω+1ϵ+s𝑑s.\ln\tilde{P}_{SS}(s)\simeq\theta_{0}s-\int_{0}^{s}\frac{\frac{2\theta_{0}\tau}{\omega^{\prime}+1}}{\frac{2\tau}{\omega^{\prime}+1}\epsilon+s^{\prime}}ds^{\prime}.

Near the critical point, the excess intensity distribution shows power-law behavior with a non-universal exponent, up to an exponential truncation:

PSS(z)z1+2θ0τω+1e2τϵω+1z.P_{SS}(z)\propto z^{-1+2\frac{\theta_{0}\tau}{\omega^{\prime}+1}}e^{-\frac{2\tau\epsilon}{\omega^{\prime}+1}z}. (28)

The power-law exponent of the PDF of the excess intensity is 12θ0τω+11-\frac{2\theta_{0}\tau}{\omega^{\prime}+1}, and depends on ω\omega^{\prime}, which is the correlation simultaneously. In the limit ω0\omega^{\prime}\to 0, the result coincides with that in KZ ; KZ1 . The power-law exponent increases with ω\omega^{\prime} and it converges to 1 in the limit ω\omega^{\prime}\to\infty. ω0\omega^{\prime}\rightarrow 0 is the no correlation in the same time and the intensity function becomes the delta function. ω\omega^{\prime}\rightarrow\infty is the strong correlation limit in the same time and the intensity function has the large variance. The correlation in the same term alters the critical behavior. In addition, the length scale of the exponential decay for the off-critical case is (ω+1)/(2τϵ)(\omega^{\prime}+1)/(2\tau\epsilon), which is also an increasing function of ω\omega^{\prime}.

V V. Parameter estimation using default data

We use empirical data pertaining to financial default to estimate the parameters. First, the S&P default data from 1981 to 2020 are used. A speculative grade (SG) rating represents ratings below BBB-(Baa3), whereas an investment grade (IG) rating indicates those above BBB-(Baa3). We also use Moody’s default data from 1920 to 2020, a period of 100 years that includes the Great Depression of 1929 and the Great Recession of 2008.

We estimate the parameters using Bayes’ formula

P(K0,M0,L0,β|k0,k1,,kT)\displaystyle P(K_{0},M_{0},L_{0},\beta|k_{0},k_{1},\cdots,k_{T}) =\displaystyle= P(rT|K0,M0,L0,β,))P(kT)P(k0|K0,M0,L0,β)P(k0)\displaystyle\frac{P(r_{T}|K_{0},M_{0},L_{0},\beta,))}{P(k_{T})}\cdots\frac{P(k_{0}|K_{0},M_{0},L_{0},\beta)}{P(k_{0})} (29)
×\displaystyle\times f(K0,M0,L0,β),\displaystyle f(K_{0},M_{0},L_{0},\beta),

where f(K0,M0,L0,β)f(K_{0},M_{0},L_{0},\beta) is a prior distribution Hisakado6 , for which we used a uniform distribution. We apply the maximum a posteriori (MAP) estimation expressed by Eq.(29). The use of the NBD to determine the distribution PP would be the parameter estimation for the discrete SE-NBD process introduced in Section II. The use of the Poisson distribution instead of the NBD would be the parameter estimation for the discrete Hawkes process. We present the outcome of the optimization using the discrete SE-NBD, discrete Hawkes, and NBD processes in Tables 1, 2, and 3. NBD was employed to confirm whether self-excitation existed. For the Hawkes process, K0(ω=0)K_{0}\rightarrow\infty(\omega=0), and for the NBD process, L0(ω~=0)L_{0}\rightarrow\infty(\tilde{\omega}=0), as discussed in Section II.

When K0K_{0} is large, it is nearly a Hawkes process. When L0L_{0} is large, it is nearly an NBD process, which has no self-excitation. The estimated K0K_{0} is small for the SE-NBD process and, particularly, K0K_{0} is small for IG. As in Fig. 3, we can obtain a much better AIC for the SE-NBD process. This implies an intensity function of which the variance is not a delta function, as in the Hawkes process. In fact, certain defaulted obligors affect other obligors, whereas this does not occur in the case of other obligors. The former case corresponds to the situation of chain bankruptcy and may be considered to depend on network effects. An obligor who is connected to many obligors affects many other obligors. In fact, the AIC for the SE-NBD process was smaller than that for the NBD process. Hence, we can confirm the existence of self-excitation in this historical credit dataset.

Table 1: MAP estimation of the parameters for the discrete SE-NBD and discrete Hawkes processes
SE-NBD Hawkes
No. Model K0K_{0} L0L_{0} M0/K0M_{0}/K_{0} β\beta v¯\bar{v} M0M_{0} L0L_{0} β\beta v¯\bar{v}
1 Moody’s 1920-2020 0.28 6.17 18.89 2.94 58.35 3.4 3.55 15.98 86.85
2 S&P 1981-2020 1.06 27.80 18.95 16.08 71.22 13.3 15.76 18.40 83.65
3 Moody’s 1981- 2020 1.03 32.12 22.55 15.97 82.79 17.2 21.09 16.69 91.72
4 S&P 1990-2020 1.51 62.52 23.04 16.23 78.64 29.2 45.37 13.01 81.82
5 Moody’s 1990-2020 1.58 86.55 27.92 14.40 90.00 40.6 73.03 19.19 91.38
6 Moody’s 1920-2020 SG 0.29 5.91 17.81 3.03 56.04 2.9 3.00 13.57 105.32
7 S&P 1981-2020 SG 1.05 25.66 17.90 16.22 69.90 12.0 13.93 17.57 85.29
8 Moody’s 1981-2020 SG 1.02 30.57 21.65 15.99 80.65 15.4 18.53 15.71 92.38
9 S&P 1990-2020 SG 1.54 60.05 21.82 16.02 76.39 28.2 43.74 18.97 79.59
10 Moody’s 1990-2020 SG 1.62 86.79 26.76 15.44 87.05 39.6 71.53 14.57 88.57
11 Moody’s 1920-2020 IG 0.13 1.10 4.06 0.99 2.13 1.14 0.40 0.98 1.24
12 S&P 1981-2020 IG 0.39 2.87 3.06 15.32 2.05 1.2 2.67 14.68 2.05
13 Moody’s 1981-2020 IG 0.28 6.07 5.37 1.26 2.36 1.7 6.02 14.63 2.34
14 S&P 1990-2020 IG 0.33 2.73 3.75 16.80 2.32 1.2 2.65 17.46 2.32
15 Moody’s 1990-2020 IG 0.28 4.13 5.80 13.27 2.69 1.8 5.58 16.26 2.68
Table 2: MAP estimation of the parameters for the NBD process
NBD
No. Model K0K_{0} M0/K0M_{0}/K_{0} v¯\bar{v}
1 Moody’s 1920-2020 0.47 80.64 37.86
2 S&P 1981-2020 1.54 38.61 59.62
3 Moody’s 1981- 2020 1.52 45.76 69.78
4 S&P 1990-2020 2.07 34.37 71.29
5 Moody’s 1990-2020 2.14 38.86 83.23
6 Moody’s 1920-2020 SG 0.47 76.28 35.81
7 S&P 1981-2020 SG 1.55 37.14 57.57
8 Moody’s 1981-2020 SG 1.53 44.00 67.45
9 S&P 1990-2020 SG 2.12 32.46 68.97
10 Moody’s 1990-2020 SG 2.20 36.65 80.58
11 Moody’s 1920-2020 IG 0.29 7.18 2.05
12 S&P 1981-2020 IG 0.46 4.42 2.05
13 Moody’s 1981-2020 IG 0.37 6.30 2.33
14 S&P 1990-2020 IG 0.41 5.63 2.32
15 Moody’s 1990-2020 IG 0.36 7.32 2.65
Table 3: AIC for the discrete SE-NBD, discrete Hawkes, and NBD processes
SE-NBD process Hawkes process NBD process
No. Model AIC AIC AIC
1 Moody’s 1920-2020 791.9 2193.1 904.0
2 S&P 1981- 2020 386.7 1010.6 407.9
3 Moody’s 1981-2020 399.3 1186.1 420.6
4 S&P 1990-2020 316.3 923.0 323.5
5 Moody’s 1990-2020 327.1 1098.6 332.4
6 Moody’s 1920-2020 SG 781.5 2060.1 893.0
7 S&P 1981-2020 SG 383.2 975.8 405.1
8 Moody’s 1981-2020 SG 396.5 1140.0 417.8
9 S&P 1990-2020 SG 313.9 894.0 321.0
10 Moody’s 1990-2020 SG 325.1 1062.4 329.9
11 Moody’s 1920-2020 IG 321.7 490.1 360.3
12 S&P 1981-2020 IG 150.0 197.6 153.2
13 Moody’s 1981-2020 IG 156.8 257.1 156.8
14 S&P 1990-2020 IG 121.3 168.3 124.2
15 Moody’s 1990-2020 IG 127.4 219.7 127.9

VI VI. Concluding Remarks

In this study, we considered a multi-term urn process that has a correlation in the same term and temporal correlation. Each term is the Pólya urn model, which represents the correlation in the same time. The temporal correlation represents the correlation effects from the previous terms. When the number of red balls is much smaller, we can obtain the Poisson process with the gamma distribution intensity function, the NBD process. We introduced the temporal correlation as the conditional distribution for the intensity function. This is equivalent to a self-exciting negative binomial distribution (SE-NBD) with conditional parameters. We referred to this process as the discrete SE-NBD process. This process becomes a discrete Hawkes process without correlation in the same term but with the temporal correlation between two different terms. In the standard continuous limit of the discrete SE-NBD process, we obtained the Hawkes process. On the other hand, taking the double-scaling limit enabled us to obtain the continuous SE-NBD process. The difference between the continuous SE-NBD and Hawkes processes is the variance in the intensity function. In other words, at the limit where the intensity function becomes the delta function, the continuous SE-NBD process becomes the Hawkes process. The continuous SE-NBD process is a marked point process.

We observed a phase transition from the steady to the non-steady state, which is the same type of phase transition as that in the Hawkes process. We can observe a difference in the distribution of the intensity function at the critical point. The distribution functions of both models obey the power law at the critical point and have different indexes. We applied the process to the default data to estimate the parameters. According to our observation, the urn process is more effective for a default portfolio because of network effects.

Appendix A Appendix A. Parameters for a urn process

In this appendix we summarize the parameters for a urn process.

θ0\theta_{0}: Number of the red balls at the 1-st term

n0n_{0}: Number of the total balls in the initial condition of each term

NN:Number of the balls taken out in each term

ω\omega: In crease of the balls when we take put a ball. It is related the correlation in the same term.

did_{i}: Weight for the red balls taken out at the previous ii-th terms (discount factor or the kernel function). It is one of the parameter for the temporal correlation.

kik_{i}: Number of the red balls taken out in ii-th term

ω~\tilde{\omega}: Scale parameter for the initial condition in each term. It is one of the parameter for the temporal correlation.

Appendix B Appendix B. Proof of Eq.(3)

0Poisson(k1|λ)Gamma(λ|K0,M0/K0)𝑑λ=0λk1eλk1!λK01Γ(K0)(M0/K0)K0˙eλK0/M0𝑑λ,\displaystyle\int_{0}^{\infty}\mbox{Poisson}(k_{1}|\lambda)\cdot\mbox{Gamma}(\lambda|K_{0},M_{0}/K_{0})d\lambda=\int_{0}^{\infty}\frac{\lambda^{k_{1}}e^{-\lambda}}{k_{1}!}\dot{\frac{\lambda^{K_{0}-1}}{\Gamma(K_{0})(M_{0}/K_{0})^{K_{0}}}}e^{-\lambda K_{0}/M_{0}}d\lambda, (30)
=\displaystyle= (M0/K0)K0k1!Γ(K0)0Γ(K0+k1)Γ(K0+k1)(M0/(M0+K0))K0+k1(M0/(M0+K0))K0+k1λK0+k11eλ/(M0/(M0+K0))𝑑λ\displaystyle\frac{(M_{0}/K_{0})^{-K_{0}}}{k_{1}!\Gamma(K_{0})}\int_{0}^{\infty}\frac{\Gamma(K_{0}+k_{1})}{\Gamma(K_{0}+k_{1})}\frac{(M_{0}/(M_{0}+K_{0}))^{K_{0}+k_{1}}}{(M_{0}/(M_{0}+K_{0}))^{K_{0}+k_{1}}}\lambda^{K_{0}+k_{1}-1}e^{-\lambda/(M_{0}/(M_{0}+K_{0}))}d\lambda
=\displaystyle= (M0/K0)K0k1!Γ(K0)Γ(K0+k1)(M0M0+K0)K0+k1\displaystyle\frac{(M_{0}/K_{0})^{-K_{0}}}{k_{1}!\Gamma(K_{0})}\Gamma(K_{0}+k_{1})(\frac{M_{0}}{M_{0}+K_{0}})^{K_{0}+k_{1}}
=\displaystyle= Γ(K0+k1)k1!Γ(K0)(M0M0+K0)k1(K0M0+K0)K0=NBD(X1=k1|K0,M0/K0),\displaystyle\frac{\Gamma(K_{0}+k_{1})}{k_{1}!\Gamma(K_{0})}(\frac{M_{0}}{M_{0}+K_{0}})^{k_{1}}(\frac{K_{0}}{M_{0}+K_{0}})^{K_{0}}=\mbox{NBD}(X_{1}=k_{1}|K_{0},M_{0}/K_{0}),

where we use the relation at the third equal,

01Γ(K0+k1)1(M0/(M0+K0))K0+k1λK0+k11eλ/(M0/(M0+K0))𝑑λ=1,\int_{0}^{\infty}\frac{1}{\Gamma(K_{0}+k_{1})}\frac{1}{(M_{0}/(M_{0}+K_{0}))^{K_{0}+k_{1}}}\lambda^{K_{0}+k_{1}-1}e^{-\lambda/(M_{0}/(M_{0}+K_{0}))}d\lambda=1, (31)

because it is the integral of Gamma(λ|K0+k1,M0/(M0+K0))\mbox{Gamma}(\lambda|K_{0}+k_{1},M_{0}/(M_{0}+K_{0})).

Appendix C Appendix C. From discrete Hawkes process to Hawkes process

This is the discrete Hawkes process,

Xt+1Poisson(Mt),t0,X_{t+1}\sim\mbox{Poisson}\left(M_{t}\right),t\geq 0, (32)

where

Mt=M0+M0/L0s=1tXsd^t+1s,t1.M_{t}=M_{0}+M_{0}/L_{0}\sum_{s=1}^{t}X_{s}\hat{d}_{t+1-s},t\geq 1. (33)

Here we use the counting process, X~t=iXi\tilde{X}_{t}=\sum_{i}X_{i}. The limit Δ=N/n00\Delta=N/n_{0}\rightarrow 0 is set, as the continuous limit of process X~t\tilde{X}_{t}. The intensity function λt\lambda_{t},

λt=limΔ0E[X~t+ΔX~t|Ft]Δ=limΔ0MtΔ=(θ0+ω~i<tkid^ti),\lambda_{t}=\lim_{\Delta\rightarrow 0}\frac{E[\tilde{X}_{t+\Delta}-\tilde{X}_{t}|F_{t}]}{\Delta}=\lim_{\Delta\rightarrow 0}\frac{M_{t}}{\Delta}=(\theta_{0}+\tilde{\omega}\sum_{i<t}k_{i}\hat{d}_{t-i}), (34)

We can then obtain the Hawkes process,

X~t+ΔX~tPoisson(θtΔ),t0,\tilde{X}_{t+\Delta}-\tilde{X}_{t}\sim\mbox{Poisson}\left(\theta_{t}\Delta\right),t\geq 0, (35)

where

θt=θ0+ω~s<tXsd^ts,t0.\theta_{t}=\theta_{0}+\tilde{\omega}\sum_{s<t}X_{s}\hat{d}_{t-s},t\geq 0. (36)

Appendix D Appendix D. Marked Hawkes process

Here we consider the conditional probability in the condition that the event occurs, ρ(k)\rho(k), using Eq.(25). ρ(k)\rho(k) is given as

ρ(k)=1kln(ω+1)(ωω+1)k,\rho(k)=\frac{1}{k\ln(\omega^{\prime}+1)}\left(\frac{\omega^{\prime}}{\omega^{\prime}+1}\right)^{k},

where k=1,2,.k=1,2,\cdots. Note that ρ(k)\rho(k) is the gamma function with the shape parameter 0 and does not depend on time tt. The probability that an event occurs during the period [t,t+dt][t,t+dt] is

θtωln(ω+1)dt.\frac{\theta_{t}}{\omega^{\prime}}\ln(\omega^{\prime}+1)dt.

Hence, the number of events, the marks, is considered IID random numbers. In the limit ω0\omega^{\prime}\to 0, the distribution of the markes is

limω0ρ(k)=δ(k1),\lim_{\omega\to 0}\rho(k)=\delta(k-1),

where δ(x)\delta(x) is the delta function and the process reduces to the Hawkes process. Hence, SE-NBD is the marked Hawkes process oga .

References

  • (1) G. Galam, Stat. Phys. 61, 943 (1990).
  • (2) G. Galam, Int. J. Mod. Phys. C 19(03), 409 (2008).
  • (3) N. M. Mantegna and H. E. Stanley, Introduction to Econophysics: Correlations and Complexity in Finance, Cambridge University Press (2000).
  • (4) D. Brockmann, L. Hufinage, and T. Geisel, Nature 439, 462 (2006).
  • (5) I. T. Wong, M. L. Gardel, D. R. Reichman, E. R. Weeks, M. T. Valentine, A. R. Bausch, and D. A. Weitz, Phys. Rev. Lett. 92, 178101 (2004).
  • (6) Y. Gefen, A. Aharony, and S. Alexander, Phys. Rev. Lett. 50, 77 (1983).
  • (7) R. Metzler and J. Klafter, Phys. Rep. 339, 1 (2000).
  • (8) S. Hod and U. Keshet, Phys. Rev. E 70 11006 (2004).
  • (9) T. Huillet, J. Phys. A 41 505005 (2008).
  • (10) G. Schütz and S. Trimper, Phys. Rev. E 70 045101 (2004).
  • (11) S Mori and M. Hisakado, J. Phys. Soc. Jpn. 79, 034001 (2010).
  • (12) M. Hisakado and S. Mori, J. Phys. A 43, 31527 (2010).
  • (13) M. Hisakado and S. Mori, J. Phys. A 44, 275204 (2011).
  • (14) M. Hisakado and S. Mori, J. Phys. A 45, 345002 (2012).
  • (15) M. Hisakado and S. Mori, Physica A 417, 63 (2015).
  • (16) M. Hisakado and S. Mori, Physica A 108, 570 (2016).
  • (17) M. Hisakado and S. Mori, Physica A, 544 123480 (2020).
  • (18) S. Mori, M.Hisakado, and K. Nakayama, J. Phys Soc of Jpn 90(11) (2021).
  • (19) S. Hod and U. Keshet, Phys. Rev. E 70, 11006 (2004).
  • (20) S. Mori, K. Kitsukawa, and M. Hisakado, Quant. Fin. 10, 1469 (2010).
  • (21) S. Mori, K. Kitsukawa, and M. Hisakado, J. Phys. Sco.Jpn. 77, 114802 (2008).
  • (22) P. J. Schönbucher, Credit Derivatives Pricing Models: Models, Pricing, and Implementation John Wiley & Sons, Ltd. (2003).
  • (23) A. G. Hawkes, Biometrica 58 83 (1971).
  • (24) M. Kirchner, Quant. Fin. 17 57 (2017).
  • (25) P. Blanc, J. Donier, and J. -P. Bouchard, Quant. Fin. 17 171 (2017).
  • (26) E. Errais, K. Giesecke, L.R. Goldberg, SIAM J. Fin Math 1 642 (2010)
  • (27) K. Kanazawa and D. Sornett Phys. Rev. Lett. 125 138301 (2020)
  • (28) K. Kanazawa and D. Sornett Phys. Rev. Research 2 033442 (2020).
  • (29) A. G. Hawkes and D. Oakes J App Prob 11 493 (1974)
  • (30) M.Hisakado, K. KHattori, adn S. MoriarXiv2205.14146 (2022)
  • (31) M.Hisakado and S. Mori,J. Phys. Soc of Jpn 90(8) 084801 2021.
  • (32) M. Hisakado, K. Kitsukawa, S. and Mori, J. Phys. A 39, 15365 (2006).
  • (33) I. Florescu, M. C. Mariani, H. E. Stanley, and F. G. Viens (Eds.) Handbook of high-frequency trading and modeling in finance , John Wiley and Sons (2016).
  • (34) H. Nishiura, et al. Closed environments facilitate secondary transmission of coronavirus disease 2019 (COVID-19) MedRxiv (2020)
  • (35) Y. Ogata J. American Stat Association 83, 9, (1988)