This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

First–order integer–valued autoregressive processes with Generalized Katz innovations

Ovielt Baltodano Lopez [email protected] Federico Bassetti [email protected] Giulia Carallo [email protected] Roberto Casarin [email protected] Polytechnic University of Milan, Italy Ca’ Foscari University of Venice, Fondamenta San Giobbe, 873, 30121 Venice, Italy.
Abstract

A new integer–valued autoregressive process (INAR) with Generalised Lagrangian Katz (GLK) innovations is defined. This process family provides a flexible modelling framework for count data, allowing for under and over–dispersion, asymmetry, and excess of kurtosis and includes standard INAR models such as Generalized Poisson and Negative Binomial as special cases. We show that the GLK–INAR process is discrete semi–self–decomposable, infinite divisible, stable by aggregation and provides stationarity conditions. Some extensions are discussed, such as the Markov–Switching and the zero–inflated GLK–INARs. A Bayesian inference framework and an efficient posterior approximation procedure are introduced. The proposed models are applied to 130 time series from Google Trend, which proxy the worldwide public concern about climate change. New evidence is found of heterogeneity across time, countries and keywords in the persistence, uncertainty, and long–run public awareness level.

keywords:
Bayesian inference , Big data , Counts time series , Climate Risk , Generalized Lagrangian Katz distribution , MCMC

1 Introduction

In the recent years there has been a large interest in discrete–time integer–valued models, also due to increased availability of count data in very diverse fields including finance [Liesenfeld et al., 2006, Aknouche et al., 2021], economics [Freeland & McCabe, 2004, Berry & West, 2020], social sciences [Pedeli & Karlis, 2011], sports [Shahtahmassebi & Moyeed, 2016], image processing [Afrifa-Yamoah & Mueller, 2022] and oceanography [Cunha et al., 2018]. Among the modelling approaches, integer–valued autoregressive processes (INAR), introduced independently by Al-Osh & Alzaid [1987] and McKenzie [1985], become very popular. The stochastic construction of the INAR relies on the binomial thinning operator and the properties of the model on the discrete self–decomposability of the stationary distribution of the process [Steutel & van Harn, 1979]. See Scotto et al. [2015] for a review.

The original INAR model has been studied further in Al-Osh & Alzaid [1987] and extended in different directions. [McKenzie, 1986] introduced an INAR model with negative–Binomial and geometric marginal distributions, Jin-Guan & Yuan [1991] extended the INAR(1) model of Al-Osh & Alzaid [1987] to the higher order INAR(p)(p). Al-Osh & Aly [1992] introduced a negative–binomial INAR with a new iterated thinning operator. Other extensions of the INAR process have been made to include a seasonal structure in the model [e.g., see Bourguignon et al., 2016]. INAR models with values in the set of signed integers have been propose firstly by Kim & Park [2008] and generalised by Alzaid & Omair [2014] and Andersson & Karlis [2014]. Freeland [2010] proposed a true integer–valued autoregressive model (TINAR(1)). More flexible INAR models have been introduced by assuming more flexible distributions for the innovations terms. Alzaid & Al-Osh [1993] propose integer–valued ARMA process with Generalized Poisson marginals and Kim & Lee [2017] introduced INAR with Katz innovations.

This paper introduces a general class of INARs with Generalized Lagrangian Katz innovations. The Lagrangian Katz family is a flexible distribution and naturally arises as first crossing probabilities, which is a common problem in actuarial mathematics, e.g. claim number distribution in cascading processes or ruin probability in discrete–time risk models [e.g., see Consul & Famoye, 2006, ch. 12]. It has been extended further by Janardan [1998] and Janardan [1999], which introduced the four–parameter generalized Pólya–Eggenberger (GPED) distributions of the first and second kind. Janardan [1998] showed that both families contain the Lagrangian Katz distribution as a special case. We consider the four-parameters GPED of the first kind, also known as Generalized Lagrangian Katz (GLK). The resulting process family provides a flexible modelling framework for count data, allowing for under and over–dispersion, asymmetry, and excess of kurtosis and includes standard INAR models such as Generalized Poisson and Negative Binomial as special cases. Further extensions are provided, such as the Markov–Switching and the zero–inflated GLK–INARs, to account for different sources of model instability and excess of zeros.

Various approaches to inference have traditionally been presented for count data models, such as the conditional likelihood approach, generalized method of moments and Yule–Walker approach. See Weiß & Kim [2013] for a review. Despite the popularity gained in recent years by Bayesian methods, the applications to count data models are still limited [e.g., see McCabe & Martin, 2005, Neal & Subba Rao, 2007, Drovandi et al., 2016, Shang & Zhang, 2018, Garay et al., 2020b]. Thus, we provide a Bayesian inference procedure for our model and illustrate the procedure’s efficiency on a synthetic dataset. The Bayesian approach to inference entirely considers parameter uncertainty in the prior knowledge about a random process. It allows for imposing parameter restrictions by specifying the prior distribution [Chen & Lee, 2016]. The posterior distribution of the parameters quantifies uncertainty in the estimation [Chen & Lee, 2017], which can be included in the prediction. The inference from the Bayesian perspective may result in richer inferences in the case of small samples [Garay et al., 2020a] and extra–sample information and in robust inference in the presence of outliers [Fried et al., 2015]. Finally, model selection for both nested and non–nested models can be easily carried out.

We illustrate the model’s flexibility with an application to an original Google Trend dataset of 130 time–series measuring the public concern about climate change in different countries. The contrasting features of the series, such as excess of zeros, outliers, and regimes, are common in count data and provide a challenging and diversified ground for illustrating the robustness and flexibility of the GLK–INAR model. Assessing public awareness and knowledge of a specific topic and understanding the dynamics of social consciousness allows for designing more effective public policies. For this reason, researchers measured and studied the level of awareness about the effects of climate change in different sectors of society such as households [Frondel et al., 2017], winegrowers [Battaglini et al., 2009], farmers [Fahad & Wang, 2018], mountain peoples [Ullah et al., 2018]. Most of these studies rely on surveys conducted in a specific geographical area and sector of society, with a few exceptions. For example, Ziegler [2017] proposed a cross–country analysis of climate change beliefs and attitudes. Lineman et al. [2015] provided a broader and global perspective by exploiting the potentiality of big data provided by Google Trend. This extended climate change perception literature along two lines. First, we consider a multi–country dataset, including country–specific measures to capture worldwide heterogeneity in public awareness. Moreover, we offer a model–based approach and an inference procedure to analyze these measures.

The paper is organized as follows. Section 2 introduces the GLK family and INAR process with some extensions such as the Markov–Switching GLK–INAR. Section 3 proposes a Bayesian inference procedure and provides some simulation results. Section 4 provides some illustrations on a multi–country Google Trend dataset related to climate change. Section 5 concludes.

2 INAR(1) with generalized Katz innovations

2.1 Generalized Lagrangian Katz family

The probability mass function (pmf), P(X=x)=pxP(X=x)=p_{x}, of the Generalized Lagrangian Katz (GLK) is

px=1x!βxac1(ac+xbc+x)(1β)ac+xbc(ac+xbc+1)xp_{x}=\frac{1}{x!}\beta^{x}\frac{a}{c}\frac{1}{(\frac{a}{c}+x\frac{b}{c}+x)}(1-\beta)^{\frac{a}{c}+x\frac{b}{c}}\left(\frac{a}{c}+x\frac{b}{c}+1\right)_{{x}\uparrow} (1)

x=0,1,2,x=0,1,2,\ldots, where (x)k=x(x+1)(x+k1)(x)_{{k}\uparrow}=x(x+1)\ldots(x+k-1) is the rising factorial with the convention that (x)0=1(x)_{0}=1, and a>0a>0, c>0c>0, bcb\geq-c and 0<β<10<\beta<1 are the parameters [Consul & Famoye, 2006]. We denote the distribution with 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta). We notice that for c<b<0-c<b<0 some additional constraints on the parameters are needed to have all the px0p_{x}\geq 0. See the discussion at the beginning of Subsection 3.1 and Appendix B in the Supplementary. GLK distributions have probability generating function (pgf)

H(u)=x=0pxuxH(u)=\sum_{x=0}^{\infty}p_{x}u^{x}

which satisfies:

H(u)=(1β+βz)a/c,z=u(1β+βz)b/c+1,H(u)=(1-\beta+\beta z)^{a/c},\,\,z=u(1-\beta+\beta z)^{b/c+1}, (2)

or alternatively

H(u)=((1β)/(1βz))a/c,z=u((1βz)/(1β))b/c,H(u)=\left((1-\beta)/(1-\beta z)\right)^{a/c},\,\,z=u\left((1-\beta z)/(1-\beta)\right)^{b/c}, (3)

see Janardan [1998].

Remark 1.

Building on the Lagrangian expansion, Janardan [1998] introduced the Generalized Polya Eggenberger distribution. [Consul & Famoye, 2006] argued that since the distribution is unrelated to the Polya, it should be named Generalized Lagrangian Katz distribution. As shown in Appendix B in the Supplementary Material, it is possible to derive the Generalized Polya Eggenberger / Generalized Lagrangian Katz as a particular ”generalized Lagrangian distribution”.

The GLK distribution family is very general and includes some well–known distributions and new distributions that have yet to be used in count data modelling.

  • The Lagrangian Katz distribution 𝒦(a,b,β)\mathcal{LK}(a,b,\beta) is obtained by replacing cc with β\beta (which is called Generalized Katz in [Consul & Famoye, 2006]).

  • The Katz distribution 𝒦(a,β)\mathcal{K}(a,\beta) is obtained for b=0b=0 and by replacing cc with β\beta, [Katz, 1965].

  • The Polya–Eggenberger distribution 𝒫(a,c,β)\mathcal{PE}(a,c,\beta) is obtained for b=0b=0, [Janardan, 1998]. Note that the Katz distribution in Consul & Famoye [2006], Tab. 2.1, is not the Katz distribution of Katz [1965], it corresponds instead to the Generalized Polya Eggenberger of the first type (GPED1–I) of Janardan [1998] and can be obtain as the limit of the zero–truncated GLK for aca\rightarrow-c.

  • The Generalized Negative Binomial distribution 𝒢𝒩(r,γ,p)\mathcal{GNB}(r,\gamma,p) is obtained for c=1c=1, a=ra=r, b=γ1b=\gamma-1 and β=p\beta=p .

  • The Negative Binomial distribution 𝒩(r,p)\mathcal{NB}(r,p) is obtained for b=0b=0 β=1p\beta=1-p and r=a/cr=a/c.

  • The Binomial distribution in(n,p)\mathcal{B}in(n,p) is obtained for c=1c=1, b=1b=-1, a=na=n\in\mathbb{N} and β=p\beta=p

  • The Generalized Poisson (GP) distribution 𝒢𝒫(θ,λ)\mathcal{GP}(\theta,\lambda) for c0c\rightarrow 0 s.t. b/a=λb/a=\lambda and aβ/c=θ>0a\beta/c=\theta>0 with 0<λ<θ10<\lambda<\theta^{-1}. The GP limit of the GLK distribution is stated in [Consul & Famoye, 2006] without proof. In Appendix B.2 in the Supplementary Material, we provide a proof.

  • The Poisson distribution 𝒫(θ)\mathcal{P}(\theta) for c0c\rightarrow 0, b0b\rightarrow 0 s.t. aβ/c=θa\beta/c=\theta.

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 1: Probability mass function of the Generalized Lagrangian Katz for different parameter settings. Top left: comparison between 𝒦(a,c)\mathcal{LK}(a,c), 𝒦(a,b,β)\mathcal{LK}(a,b,\beta) and 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta). Top right: sensitivity of 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) with respect to the parameters. Bottom: effect of the parameters on the tails (log scale) for a 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) with over–dispersion (VMR=50/15VMR=50/15, left) and under–dispersion (VMR=13/15VMR=13/15, right). In each plot the distribution mean (vertical dashed line).

The probability mass function of the GLK for different parameter settings is given in Fig. 1. In the top–left plot, we compare 𝒦(a,c)\mathcal{K}(a,c), 𝒦(a,b,β)\mathcal{LK}(a,b,\beta) and 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) with the same mean. The top–right plot illustrates the sensitivity of the 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) pmf with respect to the different parameters. All distributions have the same mean (vertical dashed line). The bottom plots illustrate the effects of the parameters on the tails (log–scale) for a 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) with over–dispersion VMR=50/15VMR=50/15 (left) and under–dispersion VMR=13/15VMR=13/15 (right).

We provide in Appendix A in the Supplementary Material some useful moments of the GLK distributions, which can be used to derive the following measures of dispersion. The standard deviation to the mean ratio returns the coefficient of variation. From the results in Appendix A in the Supplementary Material it follows that the coefficient of variation is CV=((1β)/(aθκ))1/2CV=\left((1-\beta)/(a\theta\kappa)\right)^{1/2} where κ=1βbβ/c\kappa=1-\beta-b\beta/c and θ=β/c\theta=\beta/c, assuming κ>0\kappa>0. The Fisher index is given by the variance–to–mean ratio VMR=(1β)/(κ2)VMR=(1-\beta)/(\kappa^{2}) which does not depend on the parameter aa. For a given β\beta, following the values of κ\kappa (bb and cc), the distribution allows for various degrees of dispersion: not dispersed (VMR=0VMR=0), under–dispersed (VMR<1VMR<1), equally dispersed (VMR=1VMR=1) and over–dispersed (VMR>1VMR>1).

We conclude this section with another important property.

Proposition 1.

A random variable X𝒢𝒦(a,b,c,β)X\sim\mathcal{GLK}(a,b,c,\beta) is infinite divisible, in particular X=j=1nXjnX\stackrel{{\scriptstyle\mathcal{L}}}{{=}}\sum_{j=1}^{n}X_{jn} where Xjniid𝒢𝒦(a/n,b,c,β)X_{jn}\stackrel{{\scriptstyle iid}}{{\sim}}\mathcal{GLK}(a/n,b,c,\beta).

Proof.

From the pgf of a GLK given in Eq. 3

𝔼(X)=𝔼(uX)=(1β1βz)ac=j=1n(1β1βz)anc\mathbb{E}(X)=\mathbb{E}\left(u^{X}\right)=\left(\frac{1-\beta}{1-\beta z}\right)^{\frac{a}{c}}=\prod_{j=1}^{n}\left(\frac{1-\beta}{1-\beta z}\right)^{\frac{a}{nc}} (4)

which is the pgf of the sum of nn independent GLKs with distribution 𝒢𝒦(a/n,b,c,β)\mathcal{GLK}(a/n,b,c,\beta) where a/n>0a/n>0 according to the definition of GLK. ∎

2.2 A INAR(1) process

The Generalized Katz INAR(1) process (GLK–INAR(1)) is defined using the binomial thinning operator, \circ. The binomial thinning for a non–negative discrete random variable XX is defined as

αX=i=1XBi(α)\alpha\circ X=\sum_{i=1}^{X}B_{i}(\alpha)

where Bi(α)B_{i}(\alpha) are iid Bernoulli r.v.s with success probability P(Bi(α)=1)=αP(B_{i}(\alpha)=1)=\alpha.

Definition 1 (GLK–INAR process).

For α(0,1)\alpha\in(0,1), the GLK–INAR(1) process is defined by

Xt=αXt1+εt,tX_{t}=\alpha\circ X_{t-1}+\varepsilon_{t},\qquad t\in\mathbb{Z}

where εt\varepsilon_{t} are iid random variables with Generalized Lagrangian Katz distribution 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta), independent of XsX_{s}, st1s\leq t-1.

Figure 2 provides some trajectories of T=100T=100 points each, simulated from a GLK–INAR(1) with innovation distributions given by the solid lines in the bottom plots of Fig. 1, that are 𝒢𝒦(3.86\mathcal{GLK}(3.86, 0, 0.600.60, 0.70)0.70) (overdispersion) and 𝒢𝒦(25.00\mathcal{GLK}(25.00, 0.000.00, 0.700.70, 0.42)0.42) (underdispersion). The trajectories correspond to the two parameter settings we find the empirical application to climate change discussed in Section 4, that are: (i) high persistence setting (α=0.7\alpha=0.7, left); (ii) low persistence setting (α=0.3\alpha=0.3, right). In all plots, the empirical mean of the observations is reported (dashed line) as a reference to illustrate the different levels of persistence in the trajectories.

Thanks to the general parametric family assumed, by setting b=0b=0, c=β=θ1c=\beta=\theta_{1} and a=θ2a=\theta_{2}, our GLK–INAR(1) nests the INARKF(1) of Kim & Lee [2017] as special case. The GLK–INAR(1) naturally nests the Poisson INAR(1) of Al-Osh & Alzaid [1987], the Negative Binomial INAR(1) of Al-Osh & Aly [1992] (NBINAR(1)), and the Generalized Poisson INAR(1) of Alzaid & Al-Osh [1993].

(a) Over–dispersed innovations
Refer to caption Refer to caption
(a) Under-dispersed innovations
Refer to caption Refer to caption
Figure 2: Trajectories of the GLK–INAR(1) in the high (α=0.7\alpha=0.7, left column) and low persistence (α=0.3\alpha=0.3, right column) regimes. The trajectories in the over- and under-dispersion settings are in the rows. In all plots, the empirical mean of the observations (dashed line).

As for any INAR process, the GLK–INAR(1) has the following representation

Xt+k=αkXt+j=0k1αjεt+1jX_{t+k}=\alpha^{k}\circ X_{t}+\sum_{j=0}^{k-1}\alpha^{j}\circ\varepsilon_{t+1-j}

and its conditional pgf can be written as

HXt+k|Xt(u)=(1αk+αku)Xtj=0k1H(1αj+αju)H_{X_{t+k}|X_{t}}(u)=(1-\alpha^{k}+\alpha^{k}u)^{X_{t}}\prod_{j=0}^{k-1}H(1-\alpha^{j}+\alpha^{j}u)

where H(u)H(u) is defined in Eq. 2 or in Eq. 3. Starting from the general results on INAR processes given in Alzaid & Al-Osh [1988], one easily obtains explicit expressions for the conditional mean and variance of the GLK–INAR(1):

𝔼(Xt+k|Xt)=αkXt+1αk11αaθκ\displaystyle\mathbb{E}(X_{t+k}|X_{t})=\alpha^{k}X_{t}+\frac{1-\alpha^{k-1}}{1-\alpha}\frac{a\theta}{\kappa} (5)
𝕍(Xt+k|Xt)=1α2k1α2(a(1β)θκ3aθκ)\displaystyle\mathbb{V}(X_{t+k}|X_{t})=\frac{1-\alpha^{2k}}{1-\alpha^{2}}\left(\frac{a(1-\beta)\theta}{\kappa^{3}}-\frac{a\theta}{\kappa}\right)
+(αkα2k)Xt+1αk1αaθκ\displaystyle\hskip 70.0pt+(\alpha^{k}-\alpha^{2k})X_{t}+\frac{1-\alpha^{k}}{1-\alpha}\frac{a\theta}{\kappa} (6)

where κ=1βbβ/c\kappa=1-\beta-b\beta/c and θ=β/c\theta=\beta/c.

Remark 2.

Setting b=0b=0, c=β=θ1c=\beta=\theta_{1} and a=θ2a=\theta_{2} the results in Kim & Lee [2017] Th. 2.2 are obtained.

Remark 3.

Since α<1\alpha<1, limk𝔼(Xt+k|Xt)=aθ/(κ(1α)\underset{k\rightarrow\infty}{\lim}\mathbb{E}(X_{t+k}|X_{t})=a\theta/(\kappa(1-\alpha) and limk𝕍(Xt+k|Xt)=aθ((1β)+ακ2)/((1α2)κ3)\underset{k\rightarrow\infty}{\lim}\mathbb{V}(X_{t+k}|X_{t})=a\theta((1-\beta)+\alpha\kappa^{2})/((1-\alpha^{2})\kappa^{3}).

The process {Xt}t\{X_{t}\}_{t\in\mathbb{Z}} is a Markov Chain on \mathbb{N} and the transition probability Pi,j=(Xt=j|Xt1=i)P_{i,j}=\mathbb{P}(X_{t}=j|X_{t-1}=i) satisfies

Pi,j=k=0min(i,j)(αXt1=k|Xt1=i)(ε=jk)\displaystyle P_{i,j}=\sum_{k=0}^{\min(i,j)}\mathbb{P}(\alpha\circ X_{t-1}=k|X_{t-1}=i)\mathbb{P}(\varepsilon=j-k)
=k=0min(i,j)(ik)αk(1α)ikpjk\displaystyle=\sum_{k=0}^{\min(i,j)}\binom{i}{k}\alpha^{k}(1-\alpha)^{i-k}p_{j-k}

where pxp_{x} is the pmf given in Eq. 1.

In the next Proposition, we summarize some of the asymptotic properties of a GLK–INAR(1). These properties follow from general results in Alzaid & Al-Osh [1988] and Schweer & Weiß [2014].

Proposition 2.

Assume that {Xt}t\{X_{t}\}_{t\in\mathbb{Z}} is a GLK–INAR(1).

  • (i)

    The process {Xt}t\{X_{t}\}_{t\in\mathbb{Z}} is an irreducible, aperiodic and positive recurrent Markov chain. Hence there is a unique stationary distribution for the process {Xt}t\{X_{t}\}_{t\in\mathbb{Z}}.

  • (ii)

    The marginal distribution of the stationary process {Xt}t\{X_{t}\}_{t\in\mathbb{Z}} is infinitely divisible.

Proof.

By Proposition 1, the distribution of the innovations is infinitely divisible, and hence, it is a compound Poisson distribution, see, e.g. Lemma 2.1 in Steutel & van Harn [1979]. Hence, both (i) and (ii) follow from Theorem 3.2.1 in Schweer & Weiß [2014]. In point of fact, (i) is true for any INAR process, see Al-Osh & Alzaid [1987]. An alternative derivation of (ii) is as follows. Since at stationarity the process satisfies X=αX+εX=\alpha\circ X+\varepsilon, where ε𝒢𝒦(a,b,c,β)\varepsilon\sim\mathcal{GLK}(a,b,c,\beta), and the innovation terms are infinite divisible by Proposition 1, the stationary distribution satisfies the definition of discrete semi–self–decomposability given in Bouzar [2008]. Theorem 2 in Bouzar [2008] yields that it is also infinitely divisible. ∎

Since the GLK distribution satisfies the convolution property [see Janardan, 1998, Th. 8], then the GLK–INAR(1) is stable by aggregation as stated in the following

Proposition 3.

Let {Xjt}t\{X_{jt}\}_{t\in\mathbb{Z}} with j=1,2,,Jj=1,2,\ldots,J be a sequence of independent GLK–INAR(1) which satisfy:

Xjt=αXjt1+εjt,εjt𝒢𝒦(aj,b,c,β)X_{jt}=\alpha\circ X_{jt-1}+\varepsilon_{jt},\quad\varepsilon_{jt}\sim\mathcal{GLK}(a_{j},b,c,\beta)

The process Yt=X1t++XJtY_{t}=X_{1t}+\ldots+X_{Jt} is GLK–INAR(1) which satisfies:

Yt=αYt1+εt,εt𝒢𝒦(a1++aJ,b,c,β)Y_{t}=\alpha\circ Y_{t-1}+\varepsilon_{t},\quad\varepsilon_{t}\sim\mathcal{GLK}(a_{1}+\ldots+a_{J},b,c,\beta)

Below, we state explicit closed-form expressions for unconditional moments of the process.

Proposition 4.

Let με\mu_{\varepsilon}, με(2)\mu_{\varepsilon}^{(2)} and σε2\sigma^{2}_{\varepsilon} the mean, second order non-central moment and variance given in Prop. 5 for a 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta). For a GLK–INAR(1) process, the following unconditional moments can be derived:

  1. (i)

    μX=𝔼(Xt)=με/(1α)\mu_{X}=\mathbb{E}(X_{t})=\mu_{\varepsilon}/(1-\alpha)

  2. (ii)

    μX(2)=𝔼(Xt2)=(αμε+2αμε2/(1α)+με(2))/(1α2)\mu_{X}^{(2)}=\mathbb{E}(X_{t}^{2})=(\alpha\mu_{\varepsilon}+2\alpha\mu_{\varepsilon}^{2}/(1-\alpha)+\mu_{\varepsilon}^{(2)})/(1-\alpha^{2})

  3. (iii)

    𝔼(XtXtk)=α𝔼(Xt1Xtk)+μεμX\mathbb{E}(X_{t}X_{t-k})=\alpha\mathbb{E}(X_{t-1}X_{t-k})+\mu_{\varepsilon}\mu_{X}

  4. (iv)

    Higher-order non-central moments can be derived using the formula:

    μX(m)=i=0mk=0i1l=0ik(ik)(1αi)1S(m,i)s(ik,l)αkμX(k)με(l)\mu_{X}^{(m)}=\sum_{i=0}^{m}\sum_{k=0}^{i-1}\sum_{l=0}^{i-k}{i\choose k}(1-\alpha^{i})^{-1}S(m,i)s(i-k,l)\alpha^{k}\mu_{X}^{({k})}\mu_{\varepsilon}^{(l)}

    where s(m,k)s(m,k) and S(m,k)S(m,k) denote the Stirling’s numbers of the I and II kind, respectively.

Proof.

First- and second-order moments are known from Al-Osh & Alzaid [1987] for general INAR. Specifying the GLK innovations gives (i)–(iii). High-order moments can be computed similarly. See e.g. Weiß [2013]. For the sake of completeness, details are given in Appendix B.3 in the Supplementary Material. ∎

From the previous proposition, under the assumption κ=1βbβ/c>0\kappa=1-\beta-b\beta/c>0 one obtains the unconditional variance of the process σX2=𝕍(Xt)=(σε2+αμε)/(1α2)\sigma^{2}_{X}=\mathbb{V}(X_{t})=(\sigma_{\varepsilon}^{2}+\alpha\mu_{\varepsilon})/(1-\alpha^{2}) and the dispersion index of the process

VMRX=σX2μX=VMRε+α1+α=11+α(α+1β(1βbβ/c)2)VMR_{X}=\frac{\sigma^{2}_{X}}{\mu_{X}}=\frac{VMR_{\varepsilon}+\alpha}{1+\alpha}=\frac{1}{1+\alpha}\left(\alpha+\frac{1-\beta}{(1-\beta-b\beta/c)^{2}}\right)

where VMRε=σε2/μεVMR_{\varepsilon}=\sigma_{\varepsilon}^{2}/\mu_{\varepsilon} is the innovation index of dispersion. It follows that there is under– or over–dispersion in the marginal distribution, VMRX<1VMR_{X}<1 and VMRX>1VMR_{X}>1, if and only if there is under- or over–dispersion in the innovation, VMRε<1VMR_{\varepsilon}<1 or VMRε<1VMR_{\varepsilon}<1 respectively.

The autocorrelation function is

γk=ov(Xt,Xtk)=𝔼(XtXtk)μX2=αkσX2\gamma_{k}=\mathbb{C}ov(X_{t},X_{t-k})=\mathbb{E}(X_{t}X_{t-k})-\mu_{X}^{2}=\alpha^{k}\sigma^{2}_{X}

as in the INAR(1) process [e.g., see Al-Osh & Alzaid, 1987].

2.3 A Markov–switching GLK–INAR(1) process

The GLK–INAR(1) process can be extended to account for various sources of model instability such as structural breaks, regimes and outliers by introducing a time–varying parameter setting [see for example Malyshkina et al., 2009]. A parsimonious approach is to assume a finite set of regimes k=1,,Kk=1,\ldots,K corresponding to different parameter configurations, i.e.

Xt=α(St)Xt1+εt,t,X_{t}=\alpha(S_{t})\circ X_{t-1}+\varepsilon_{t},\qquad t\in\mathbb{Z}, (7)

with εt|St𝒢𝒦(a(St),b(St),c(St),β(St))\varepsilon_{t}|S_{t}\sim\mathcal{GLK}(a(S_{t}),b(S_{t}),c(S_{t}),\beta(S_{t})), where the thinning coefficient and the GLKGLK parameters of the error term ψ(St)=(α(St),\psi(S_{t})=(\alpha(S_{t}), a(St),a(S_{t}), b(St),c(St),b(S_{t}),c(S_{t}), β(St))\beta(S_{t}))^{\prime} are time–varying ψ(St)=k=1K𝕀(St=k)ψk\psi(S_{t})=\sum_{k=1}^{K}\mathbb{I}(S_{t}=k)\psi_{k}, where ψk=(a(k),b(k),\psi_{k}=(a(k),b(k), c(k),β(k))c(k),\beta(k))^{\prime}. The St{1,,K}S_{t}\in\{1,\ldots,K\} for tt\in\mathbb{Z} denotes a hidden Markov–chain process with transition probabilities (St=j|St1=i)=πij\mathbb{P}(S_{t}=j|S_{t-1}=i)=\pi_{ij} for i,j{1,,K}i,j\in\{1,\ldots,K\}. From now on, this extension is denoted with MS–GLK–INAR(1).

A special case, which is relevant for a common issue in count data series, is the large proportion of zeros [e.g., Maiti et al., 2015]. The excess of zero, which leads to over–dispersion, can be handled by assuming a zero–inflated GLK–INAR(1). This can be defined by assuming that in one of the regimes, e.g. St=1S_{t}=1, there is complete thinning α10\alpha_{1}\to 0, and the error distribution is a Dirac centred at zero. Some alternatives where the GLK collapses to a Dirac include the (Negative) Binomial distribution with parameters c1=1c_{1}=1, b1=1b_{1}=-1 (b1=0b_{1}=0), a1=1a_{1}=1 and β10\beta_{1}\to 0.

The transition probabilities provide information on the persistence of these events. If the probability is independent on past information, i.e. (St=j|St1=i)=πj\mathbb{P}(S_{t}=j|S_{t-1}=i)=\pi_{j} for all i,j{1,2}i,j\in\{1,2\}, then the zero–inflated regime is transitory. If the zero–inflated regime is persistent, then the duration of the regime is captured by a large π11\pi_{11}. Other regimes (St1S_{t}\neq 1) with low mean and/or large variance can also generate zeroes. This is convenient in some applications, such as in epidemiology where zeroes from St1S_{t}\neq 1 can be interpreted as under–reported cases of a particular disease [e.g., Douwes-Schultz & Schmidt, 2022].

2.4 Possible extensions of the GLK–INAR

The GLK–INAR can be extended to include more general auto–correlation structures and to the multivariate setting. The process can be modified to allow for multiple lags building on the specification strategy used in Neal & Subba Rao [2007]. In particular, a GLK integer–valued ARMA of order pp and qq, i.e. GLK–INARMA(p,q)(p,q) can be specified using independent thinning operators.

Definition 2 (GLK–INARMA(p,q)(p,q) process).

Let α(0,1)\alpha_{\ell}\in(0,1) for =1,,p\ell=1,\ldots,p and ζr(0,1)\zeta_{r}\in(0,1) for r=1,,qr=1,\ldots,q, the GLK–INARMA(p,q) process is defined as

Xt==1pαXt+r=1qζrεtr+εt,tX_{t}=\sum_{\ell=1}^{p}\alpha_{\ell}\circ X_{t-\ell}+\sum_{r=1}^{q}\zeta_{r}\circ\varepsilon_{t-r}+\varepsilon_{t},\qquad t\in\mathbb{Z}

where εt\varepsilon_{t} are iid random variables with Generalized Lagrangian Katz distribution 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) and =1pα<1\sum_{\ell=1}^{p}\alpha_{\ell}<1 and r=1qζr<1\sum_{r=1}^{q}\zeta_{r}<1.

Compared to GLK–INAR(1), in the GLK–INARMA(p,q)(p,q), a further restriction on the autoregressive parameters is required for stationarity, although a weaker condition can be used. For alternatives specification strategies such as combined INAR (CINAR) see for example McKenzie [2003], Weiß [2008].

For the case of random integer vectors, a multivariate GLK–INAR(1) (GLK–MINAR(1)) can be used by introducing a thinning matrix operator.

Definition 3 (GLK–MINAR(1) process).

Let 𝐗t=(X1t,,XJt)\mathbf{X}_{t}=(X_{1t},\ldots,X_{Jt})^{\prime} be a random integer vector and 𝐀=(αij)i,j=1J\mathbf{A}=(\alpha_{ij})_{i,j=1}^{J}, where αij(0,1)\alpha_{ij}\in(0,1), the GLK–MINAR(1) process can be defined by

𝐗t=𝐀𝐗t1+𝜺t,t\mathbf{X}_{t}=\mathbf{A}\circ\mathbf{X}_{t-1}+\bm{\varepsilon}_{t},\qquad t\in\mathbb{Z}

where 𝛆t=(ε1t,,εJt)\bm{\varepsilon}_{t}=(\varepsilon_{1t},\ldots,\varepsilon_{Jt})^{\prime} is iid Generalized Lagrangian Katz distributions, and 𝐀𝐗t1\mathbf{A}\circ\mathbf{X}_{t-1} is the thinning matrix operator, such that for each i=1,,Ji=1,\ldots,J,

Xit=j=1JαijXjt1+εit,X_{it}=\sum_{j=1}^{J}\alpha_{ij}\circ X_{jt-1}+\varepsilon_{it},

and αijXjt1\alpha_{ij}\circ X_{jt-1} refers to the binomial thinning operator.

The independence between the innovation terms of the different equations allows us to estimate them separately. This assumption can be relaxed by adding common GLK errors to the equations, or under some special cases of the GLK, a joint distribution can be introduced, such as the bivariate Katz’s or Poisson distribution [Pedeli & Karlis, 2011, Diafouka et al., 2022].

3 Bayesian inference

3.1 Prior distribution

With the construction in Eq. 1, the constraint x0px=1\sum_{x\geq 0}p_{x}=1 is guaranteed by the condition H(1)=1H(1)=1. Nevertheless, some constraints on the parameters are needed to have all the px:>0p_{x}:>0. Three different cases are discussed below (details are given in Appendix B in the Supplementary Material.).

  • For parameter values a>0,b0,c>0a>0,b\geq 0,c>0 the pmf are positive. Moreover, for a/c,b/ca/c,b/c\in\mathbb{N} the extended binomial coefficient ((a+bx)/c+1)x/x!((a+bx)/c+1)_{x\uparrow}/x! coincides with the standard binomial coefficient (a+bxc+xx){\frac{a+bx}{c}+x\choose x} [Consul & Famoye, 2006, p. 8].

  • For c<b<0-c<b<0, a/c,b/ca/c,b/c\in\mathbb{N} and (ca)/(c+b)(a+c)/|b|(c-a)/(c+b)\leq(a+c)/|b|, the pmf are positive for x<x=(a+c)/|b|x<x^{*}=(a+c)/|b|, while px=0p_{x}=0 for xxx\geq x^{*}.

  • If c<b<0-c<b<0 but the additional constraints of the previous point are not satisfied, the terms appearing in the product ((a+bx)/c+1)x({(a+bx)}/{c}+1)_{x\uparrow} change sign, and there is no guarantee that the result is positive. Indeed, for all the xx such that x>max{(a+1)/|b|,(ca)/(c+b),2}x>\max\{(a+1)/|b|,(c-a)/(c+b),2\} one has (a+bx)/c+1<0(a+bx)/c+1<0 and (a+bx)/c+x1>0(a+bx)/c+x-1>0 and hence there is an integer q=qxq=q_{x} such that (a+bx)/c+m<0(a+bx)/c+m<0 for 1mq1\leq m\leq q and (a+bx)/c+m>0(a+bx)/c+m>0 for q+1mk1q+1\leq m\leq k-1. Hence

    1a+bxc+x((a+bx)c+1)x=(1)qm=1q|(a+bx)c\displaystyle\frac{1}{\frac{a+bx}{c}+x}\Big{(}\frac{(a+bx)}{c}+1\Big{)}_{x\uparrow}=(-1)^{q}\prod_{m=1}^{q}\Big{|}\frac{(a+bx)}{c}
    +m|m=q+1k1|(a+bx)c+m|\displaystyle+m\Big{|}\prod_{m=q+1}^{k-1}\Big{|}\frac{(a+bx)}{c}+m\Big{|}

    which is negative whenever qxq_{x} is odd. For example take a=10a=10, b=1b=-1 and c=2c=2, for x=20x=20 one has q20=5q_{20}=5, which shows that p20<0p_{20}<0 which clearly is impossible.

Remark 4.

It should be noted that alternative definitions for c<b<0-c<b<0 can be considered. For example one can set to 0 the px<0p_{x}<0, i.e. when x>max{(a+1)/|b|x>\max\{(a+1)/|b|. In this case re-scaling the pxp_{x} is necessary to get x=0xpx=1\sum_{x=0}^{x^{*}}p_{x}=1. The resulting pmf is not a generalized Lagrangian distribution (due to the truncation and rescaling), and the normalizing constant is not in closed form. See, for example, McCabe & Skeels [2020] for a discussion on the parameter values for the Katz distributions.

In a Bayesian framework, the parameter constraints can be easily included in the inference process through a suitable choice of the prior distributions. We assume:

αe(κα,τα),a𝒢a(κa,τa),b𝒢a(κb,τb),\displaystyle\alpha\sim\mathcal{B}e(\kappa_{\alpha},\tau_{\alpha}),\quad a\sim\mathcal{G}a(\kappa_{a},\tau_{a}),\quad b\sim\mathcal{G}a(\kappa_{b},\tau_{b}),
c𝒢a(κc,τc),βe(κβ,τβ)\displaystyle c\sim\mathcal{G}a(\kappa_{c},\tau_{c}),\quad\beta\sim\mathcal{B}e(\kappa_{\beta},\tau_{\beta})

where e(κ,τ)\mathcal{B}e(\kappa,\tau) is the beta distribution with shape parameters κ\kappa and τ\tau and 𝒢a(κ,τ)\mathcal{G}a(\kappa,\tau) the gamma distribution with shape and scale parameters κ\kappa and τ\tau, respectively. In the empirical applications we assume a non-informative hyper-parameter setting for α\alpha and β\beta, that is κα=τα=κβ=τβ=1\kappa_{\alpha}=\tau_{\alpha}=\kappa_{\beta}=\tau_{\beta}=1 and an informative prior for a,ba,b and cc with κa=τa=1\kappa_{a}=\tau_{a}=1, κb=κc=2\kappa_{b}=\kappa_{c}=2 and τb=τc=1/2\tau_{b}=\tau_{c}=1/2.

In the case of Markov–switching specification of the GLK–INAR(1), the same prior is assumed for the regime–specific parameters h(ψk)=e(κα,τα)h(\psi_{k})=\mathcal{B}e(\kappa_{\alpha},\tau_{\alpha}) 𝒢a(κa,τa)\mathcal{G}a(\kappa_{a},\tau_{a}) 𝒢a(κb,τb)\mathcal{G}a(\kappa_{b},\tau_{b}) 𝒢a(κc,τc)e(κβ,τβ)\mathcal{G}a(\kappa_{c},\tau_{c})\mathcal{B}e(\kappa_{\beta},\tau_{\beta}) for k=1,,Kk=1,\ldots,K. For the transition probabilities of the allocation variable, we assume a symmetric Dirichlet prior for each row of the transition matrix, i.e. πi𝒟(1/K,,1/K)\pi_{i\cdot}\sim\mathcal{D}(1/K,\ldots,1/K) with concentration parameter 1/K1/K, where πi=(πi1,,πiK)\pi_{i\cdot}=(\pi_{i1},\ldots,\pi_{iK}) for i=1,,Ki=1,\ldots,K.

3.2 Posterior distribution

Let x1,,xTx_{1},\ldots,x_{T} be a sequence of observations for the GLK–INAR(1) process, then the joint posterior distribution is given by

f(ψ|x1,,xT)f(ψ)t=1Ti=0j=0Pij(ψ)𝕀(xtj)𝕀(xt1i)f(\psi|x_{1},\ldots,x_{T})\propto f(\psi)\prod_{t=1}^{T}\prod_{i=0}^{\infty}\prod_{j=0}^{\infty}P_{ij}(\psi)^{\mathbb{I}(x_{t}-j)\mathbb{I}(x_{t-1}-i)}

where ψ=(α,a,b,c,β)\psi=(\alpha,a,b,c,\beta) is the parameter vector f(ψ)f(\psi) the joint prior and

Pij(ψ)=k=0min(i,j)dijk(a+b(jk)c+jkjk)αk(1α)ikβjk(1β)a+b(jk)cP_{ij}(\psi)=\sum_{k=0}^{\min(i,j)}d_{ijk}\binom{\frac{a+b(j-k)}{c}+j-k}{j-k}\alpha^{k}(1-\alpha)^{i-k}\beta^{j-k}(1-\beta)^{\frac{a+b(j-k)}{c}}

where dijk=(ik)((a/c)/((a+bx)/(c)+jk)d_{ijk}=\binom{i}{k}((a/c)/((a+bx)/(c)+j-k).

Following the discussion above in this section, if the parameter constraint c>0c>0 is not imposed, the coefficients of the Lagrangian expansion can be negative. In this case, a truncated GLK can be used, similarly to what is proposed in McCabe & Skeels [2020] for the Katz distribution, and the inference procedure can be easily extended to include this type of distribution. The truncation can be imposed by using the following recursion for the transition probability:

pi(ψ)=p0j=0i1max{0,U(ψ)+V(ψ)ja+j}p_{i}(\psi)=p_{0}\prod_{j=0}^{i-1}\max\left\{0,\frac{U(\psi)+V(\psi)j}{a+j}\right\}

where U(ψ)=aβ/cU(\psi)=a\beta/c, V(ψ)=U(b+c)/(a+b)V(\psi)=U(b+c)/(a+b) and

p0=(1+j=1k=0j1max{0,U(ψ)+V(ψ)ja+j})1.p_{0}=\left(1+\sum_{j=1}^{\infty}\prod_{k=0}^{j-1}\max\left\{0,\frac{U(\psi)+V(\psi)j}{a+j}\right\}\right)^{-1}.

The probability pip_{i} becomes null for i>ji>j if U(ψ)+V(ψ)j<0U(\psi)+V(\psi)j<0 at jj.

Since the joint posterior is not tractable, we follow a Markov Chain Monte Carlo (MCMC) framework for posterior approximation. See Robert & Casella [2013] for an introduction to MCMC methods. We overcome the difficulties in tuning the parameters of the MCMC procedure by applying the Adaptive MCMC sampler (AMCMC) proposed in Andrieu & Thoms [2008]. Following a standard procedure, the following reparametrization is considered to impose constraints on the parameters of the GLK–INAR(1). Let η=(η1,,η5)\eta=(\eta_{1},\ldots,\eta_{5}) be the 5-dimensional parameter vector obtained by the transformation η=φ(ψ)\eta=\varphi(\psi) with η1=log(ψ1/(1ψ1))\eta_{1}=\log(\psi_{1}/(1-\psi_{1})), η2=log(ψ2)\eta_{2}=\log(\psi_{2}), η3=log(ψ3)\eta_{3}=\log(\psi_{3}), η4=log(ψ4)\eta_{4}=\log(\psi_{4}), and η5=log(ψ5/(1ψ5))\eta_{5}=\log(\psi_{5}/(1-\psi_{5})) and let f(η|x1,,xT)=f(φ1(η)|x1,,xT)J(η)f(\eta|x_{1},\ldots,x_{T})=f(\varphi^{-1}(\eta)|x_{1},\ldots,x_{T})J(\eta) be the posterior of η\eta, with J(η)=ψ1ψ2ψ3ψ4ψ5(1ψ1)(1ψ5)J(\eta)=\psi_{1}\psi_{2}\psi_{3}\psi_{4}\psi_{5}(1-\psi_{1})(1-\psi_{5}) the Jacobian of the transformation φ\varphi given above. Given the adaptation parameters μj\mu^{j} and Σ(j)\Sigma^{(j)}, at the jj-th iterations, the AMCMC consists of the following three steps. First, a candidate η\eta^{\ast} is generated from the random walk proposal: η=η(j1)+λ(j)w(j),w(j)𝒩q(𝟎,Σ(j))\eta^{\ast}=\eta^{(j-1)}+\lambda^{(j)}w^{(j)},\quad w^{(j)}\sim\mathcal{N}_{q}(\mathbf{0},\Sigma^{(j)}). Second, the candidate is accepted with probability ρ(j)=ρ(η(j1),η)\rho^{(j)}=\rho(\eta^{(j-1)},\eta^{\ast}), where

ρ(η(j1),η)=min(1,f(φ1(η)|x1,,xT)J(η)f(φ1(η(j1))|x1,,xT)J(η(j1)))\rho(\eta^{(j-1)},\eta^{\ast})=\min\left(1,\frac{f(\varphi^{-1}(\eta^{\ast})|x_{1},\ldots,x_{T})J(\eta^{\ast})}{f(\varphi^{-1}(\eta^{(j-1)})|x_{1},\ldots,x_{T})J(\eta^{(j-1)})}\right)

and third, the adaptive parameters are updated as follows:

μ(j+1)\displaystyle\mu^{(j+1)} =\displaystyle= μ(j)+γ(j)(μ(j)η(j))\displaystyle\mu^{(j)}+\gamma^{(j)}(\mu^{(j)}-\eta^{(j)})
Σ(j+1)\displaystyle\Sigma^{(j+1)} =\displaystyle= Σ(j)+γ(j)((μ(j)η(j))(μ(j)η(j))Σ(j))\displaystyle\Sigma^{(j)}+\gamma^{(j)}((\mu^{(j)}-\eta^{(j)})(\mu^{(j)}-\eta^{(j)})^{\prime}-\Sigma^{(j)})
logλ(j+1)\displaystyle\log\lambda^{(j+1)} =\displaystyle= logλ(j)+γ(j)(ρ(j)ρ),\displaystyle\log\lambda^{(j)}+\gamma^{(j)}(\rho^{(j)}-\rho^{\ast}),

where ρ\rho^{*} is the target acceptance probability and γ(j)=ja\gamma^{(j)}=j^{-a}, a>0a>0 is the adaptive scale [Andrieu & Thoms, 2008, , Algorithm 4]. Following the suggestions in Roberts et al. [1997] we set ρ=0.44\rho^{*}=0.44.

The latent allocation variables in the Markov–switching specification of the GLK–INAR(1) are sampled the Forward–Filtering Backward sampling procedure (FFBS). The prediction and filtering probabilities are given by

(St=k|𝒳t1)==1Kπk(St1=|𝒳t1)\displaystyle\mathbb{P}(S_{t}=k|\mathcal{X}_{t-1})=\sum_{\ell=1}^{K}\pi_{\ell k}\mathbb{P}(S_{t-1}=\ell|\mathcal{X}_{t-1})
(St=k|𝒳t)f(xt|ψk,xt1,St=k)(St=k|𝒳t1),\displaystyle\mathbb{P}(S_{t}=k|\mathcal{X}_{t})\propto f(x_{t}|\psi_{k},x_{t-1},S_{t}=k)\mathbb{P}(S_{t}=k|\mathcal{X}_{t-1}),

where 𝒳t1=(x1,,xt1)\mathcal{X}_{t-1}=(x_{1},\ldots,x_{t-1})^{\prime}, f(xt|ψk,xt1,St=k)=f(x_{t}|\psi_{k},x_{t-1},S_{t}=k)= i=0\prod_{i=0}^{\infty} j=0\prod_{j=0}^{\infty} Pij(ψk)𝕀(xtj)𝕀(xt1i)P_{ij}(\psi_{k})^{\mathbb{I}(x_{t}-j)\mathbb{I}(x_{t-1}-i)} for k,{1,,K}k,\ell\in\{1,\ldots,K\}. Notice that the conditioning on the parameters ψ\psi is included only in the likelihood but not in the probabilities to simplify the notation. The filtered probabilities can be smoothed by considering all the information available, i.e.

(S1:T|𝒳T)=(ST|𝒳T)t=1T1(St|St+1,𝒳t)\mathbb{P}\left(S_{1:T}|\mathcal{X}_{T}\right)=\mathbb{P}\left(S_{T}|\mathcal{X}_{T}\right)\prod_{t=1}^{T-1}\mathbb{P}\left(S_{t}|S_{t+1},\mathcal{X}_{t}\right)

where (St|St+1,𝒳t)πStSt+1(St|𝒳t)\mathbb{P}\left(S_{t}|S_{t+1},\mathcal{X}_{t}\right)\propto\pi_{S_{t}S_{t+1}}\mathbb{P}\left(S_{t}|\mathcal{X}_{t}\right) and S1:T=(S1,,ST)S_{1:T}=(S_{1},\ldots,S_{T})^{\prime}. The allocation variables are sampled directly from these smoothed probabilities.

The conditional posterior distribution of the transition probabilities of the Markov chain StS_{t} is conditionally conjugate and can be sampled directly from πk|S1:T𝒟(d1,,dK),\pi_{\cdot k}|S_{1:T}\sim\mathcal{D}(d_{1},\ldots,d_{K}), where dk=1/K+t=1T𝕀k(St)d_{k}=1/K+\sum_{t=1}^{T}\mathbb{I}_{k}(S_{t}) for k=1,,Kk=1,\ldots,K.

For the possible extensions of the GLK–INAR, such as the GLK–INRMA(p,q) and the GLK–MINAR(1), data augmentation techniques can be used to improve the efficiency of the MCMC [Neal & Subba Rao, 2007, Marques et al., 2022]. For instance, in the case of the GLK–INRMA(p,q), conditional conjugacy of the thinning parameters can be obtained by assuming each autoregressive (moving average) component is a latent variable following a binomial distribution. In the case of GLK–MINAR(1), a similar strategy can be followed, see for instance Soyer & Zhang [2022].

3.3 Simulation results

We illustrate the Bayesian procedure’s effectiveness in recovering the parameters’ true value and the MCMC procedure’s efficiency through some simulation experiments. We test the algorithm’s efficiency in two different settings, commonly found in the data: low persistence and high persistence (see trajectories in Fig. 2). The true values of the parameters are: α=0.3\alpha=0.3, a=5.3239a=5.3239, b=0.0592b=0.0592, c=0.6c=0.6, β=0.5917\beta=0.5917 in the low persistence setting, and α=0.7\alpha=0.7, a=5.3239a=5.3239,b=0.0592b=0.0592, c=0.6c=0.6, β=0.5917\beta=0.5917 in the high persistence setting. For each setting, we run the Gibbs sampler for 50,000 iterations on each dataset, discard the first 10,000 draws to remove dependence on initial conditions, and apply a thinning procedure with a factor of 10 to reduce the dependence between consecutive draws.

For illustrative purposes, in Figure C.1 in the Supplementary Material we show the MCMC posterior approximation for the parameter α\alpha (first row), the unconditional mean of the process (second row), and the marginal likelihood (last row), in one of our experiments for the high- and low-persistence settings. Each plot represents the true value (solid black line) and the Bayesian estimates. Posterior estimated are approximated by using 4,000 MCMC samples after thinning and burn-in removal (dashed red line). Figures C.2-C.3 and C.4-C.5 in the Supplementary Material exhibit 10,000 MCMC posterior draws and the MCMC approximation of the posterior distribution for all the parameters, in the high- and low-persistence settings.

In our experiments, the acceptance rate is in the range of 40%-53% for both parameter settings (see Figure C.6 in the Supplementary Material). Table C.1 in the Supplementary Material shows, for all the parameters the autocorrelation function (ACF), effective sample size (ESS), inefficiency factor (INEFF) and Geweke’s convergence diagnostic (CD) before (BT subscript) and after thinning (AT subscript). The numerical standard errors are evaluated using the nse package [Geyer, 1992, Ardia & Bluteau, 2017, Ardia et al., 2018].

The thinning procedure is effective in reducing the autocorrelation levels and in increasing the ESS. The p-values of the CD statistics indicate that the null hypothesis that two sub-samples of the MCMC draws have the same distribution is always accepted. The efficiency of the MCMC after the thinning procedure is generally improved. After thinning, on average, the inefficiency measures (5.83), the p-values of the CD statistics (0.36) and the NSE (0.02) achieved the values recommended in the literature [e.g., see Roberts et al., 1997].

It is important to underline that the persistence parameter estimation and the forecast are highly sensitive to the innovation distributional assumption. An illustration is presented in the left plot of Figure 3, where the data generating process corresponds to a GLK–INAR(1) with large overdispersion (VRM=8.6). The standard model for count data is the Poisson INAR(1) model (PINAR(1)), which cannot capture overdispersion. This misspecified model entails an underestimation of the persistence parameter (medium gray histogram). The NBINAR(1) captures the overdispersion and provides reliable persistence estimates (light gray) comparable with the one of GLK–INAR (dark gray). Nevertheless, in the case of underdispersion (VRM=0.4, right plot of Figure 3), both NBINAR(1) and PINAR(1) return an estimation bias in the persistence parameter, while the INAR–GLK gives a good approximation of the true persistence. In summary, the INAR–GLK(1) model nests standard models, such as Generalized Poisson and Negative Binomial INARs, and allows for different degrees of underdispersion and overdispersion. Hence, it can be used without preliminary testing of the dispersion features of the series.

Refer to caption Refer to caption
Figure 3: Posterior approximation of the persistence parameter under a PINAR(1) (medium gray), NBINAR(1) (light gray) and GLK–INAR(1) (dark gray) for over (left plot) and under (right) dispersion scenarios. The red line indicates the true level of persistence.

Similarly, to exemplify the effectiveness and efficiency of the estimation procedure in different scenarios we considered: i) high and low persistence regimes with the same parameter configuration of the settings presented before and ii) a large mean regime and an zero–inflated regime where α0\alpha\to 0, a=1a=1, b0b\to 0, c=1c=1, β0\beta\to 0. The simulated trajectories are shown in Figure C.7 (C.8) in the Supplementary Material together with the estimated allocations of the regimes, represented by the shaded areas, with an accuracy of 97% (100%) for the two regimes (zero–inflated) scenario. Moreover, the parameters are successfully retrieved, see in Figures C, C and C in the Supplementary Material. Notice that the zero–inflated parameters are not estimated but set by default to approximate the Dirac distribution.

In conclusion, the Gibbs sampler is computationally efficient and can retrieve the true parameter values of the MS–GLK–INAR in different settings, including the single–regime and the zero–inflated specifications. The MCMC for the GLK–INAR takes 0.5 minutes for a sample size of T=260T=260 observations and for 30,000 MCMC iterations. This is comparable with the Negative Binomial INAR (0.4 minutes). The method is scalable and can be applied to datasets with thousands of observations. For larger-size datasets, the theoretical moment of the process can be used to devise alternative estimation procedures, such as the method of moments. The moments of the distribution are provided in closed form in Proposition 5 in the Supplementary Material.

4 Application to climate change

4.1 Data description

We used Google Trends data to measure the changes in public concern about climate change. Google Trend represents a source of big data [Choi & Varian, 2012, Scott & Varian, 2014] which have been used in many studies, for example, Anderberg et al. [2021] studied domestic violence during covid-19, Yang et al. [2021] studied influenza trends, Schiavoni et al. [2021] and Yi et al. [2021] presented applications to unemployment and Yu et al. [2019] studied oil consumption. In this study, we follow Lineman et al. [2015] and use Google search volumes as a proxy for public concern about “Climate Change” (CC) and “Global Warming” (GW). The search volume is the traffic for the specific combination of keywords relative to all queries submitted in Google Search in the world or a given region over a defined period. The indicator ranges from 0 to 100, with 100 corresponding to the largest relative search volume during the period of interest. The search volume is sampled weekly from 4th December 2016 until 21st November 2021. We analysed the dynamics at the global and country level. Countries with an excess of zeros above 95% in the search volume series have been excluded. The final dataset includes 65 countries of the about 200 countries provided by Google Trends. For illustration purposes, we report in the top plots of Fig. 4 the series of the world volume. The CC global volume exhibits overdispersion with VMR^=102/27.33=3.73\widehat{VMR}=102/27.33=3.73, skewness and kurtosis S^=2.09\widehat{S}=2.09 and K^=13.47\widehat{K}=13.47, respectively. The GW global volume has over–dispersion VMR^=170.42/48.56=3.51\widehat{VMR}=170.42/48.56=3.51, skewness S^=0.27\widehat{S}=0.27 and kurtosis K^=3.22\widehat{K}=3.22 (see also the histograms in the bottom plots). The country-specific indexes exhibit different levels of persistence and over–dispersion.

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 4: Time series (top) and histograms (bottom) of the global Google search of the words “Climate Change” (left) and “Global Warming” (right). Weekly frequency from 4th December 2016 to 21st November 2021. Empirical mean (dashed line).
(a) Google search dataset “Climate Change”
Refer to caption Refer to caption
(b) Google search dataset “Global Warming”
Refer to caption Refer to caption
Figure 5: Posterior approximation of the persistence parameter α\alpha (left) and the unconditional moment με/(1α)\mu_{\varepsilon}/(1-\alpha) (right) for the global search volume.
(a) Google search dataset “Climate Change”
Refer to caption Refer to caption Refer to caption
(b) Google search dataset “Global Warming”
Refer to caption Refer to caption Refer to caption
Figure 6: Posterior approximation of the persistence parameter for a three–state Markov Switching model: high (right plot), medium (middle) and low (left) persistence, for the Climate Change (top) and Global Warming (bottom) datasets.

4.2 Estimation results

The posterior distribution of the autoregressive coefficient is given in Fig. 5. The coefficient estimate and posterior credible interval (in parenthesis) are α^=0.56\widehat{\alpha}=0.56 (0.50,0.62)(0.50,0.62) and α^=0.62\widehat{\alpha}=0.62 (0.56,0.67)(0.56,0.67) for the GW and the CC dataset, respectively (see also the approximation to the posterior distribution of the parameters in Figures D.1 and D.2 in the Supplementary Material). This result indicates that the public concern about climate risk is persistent over time worldwide at an aggregate level. The estimated parameter of the innovation process and their 0.95% credible intervals (in parenthesis) are a^=3.53\widehat{a}=3.53 (1.56,6.08)(1.56,6.08), b^=0.04\widehat{b}=0.04 (0.01,0.11)(0.01,0.11), c^=0.21\widehat{c}=0.21 (0.05,0.47)(0.05,0.47) and β^=0.48\widehat{\beta}=0.48 (0.20,0.65)(0.20,0.65) for the GW dataset and a^=3.26\widehat{a}=3.26 (1.44,5.72)(1.44,5.72), b^=0.12\widehat{b}=0.12 (0.021,0.310)(0.021,0.310), c^=0.26\widehat{c}=0.26 (0.032,0.726)(0.032,0.726) and β^=0.35\widehat{\beta}=0.35 (0.067,0.623)(0.067,0.623) for the CC one.

4.3 Model comparison

The results indicate a deviation from the Negative Binomial model. Thus we apply the DIC criterion DIC=4𝔼(logf(X|ψ)|y)+2logf(X|ψ^)DIC=-4\mathbb{E}(\log f(X|\psi)|y)+2\log f(X|\widehat{\psi}) to compare GLK–INAR(1) and NB–INAR(1). The DICDIC is computed following [Spiegelhalter et al., 2002]:

DIC=41Nj=1Nlogf(X|ψ(j))+2logf(X|ψ^)\hbox{DIC}=-4\frac{1}{N}\sum_{j=1}^{N}\log f(X|\psi^{(j)})+2\log f(X|\widehat{\psi}) (8)

where f(X|ψ)f(X|\psi) is the likelihood of the model, ψ(j)\psi^{(j)} j=1,,Nj=1,\ldots,N the MCMC draws after thinning and burn-in sample removal, and ψ^\widehat{\psi} is the parameter estimate. The DICs for the GLK (NB) INARs fitted on the aggregate CC and GW series are 1.67431031.6743\cdot 10^{3} (1.6862103)(1.6862\cdot 10^{3}) and 1.87351031.8735\cdot 10^{3} (1.8834103)(1.8834\cdot 10^{3}), respectively.

Given the high kurtosis levels and the multi–modality in the empirical distribution of both series (see Figure 4), the Markov Switching INAR is used to deal with outliers and parameter instability. We use DIC and RMSE to select the number of regimes and the model (see Table D.3 in the Supplementary Material). We find that GLK–INAR with two or three regimes present the best fit in–sample and out–sample for both the CC and GW datasets. The results with three–regimes are presented in Figure 6. The three regimes identify different persistence levels: high (right plot), medium (middle) and low (left). Some of the regimes also have different unconditional mean levels (see Figure 7 left plots). In terms of one–step–ahead forecasting, in both datasets, the model can reproduce the upward trend at the end of the sample and effectively cover the true values within their 90% credible intervals (see Figure 7 right plots).

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 7: MS–GLK–INAR(1) results with three regimes (left) and one–step forecast (right) including point (dashed–blue line) and 90% credible intervals (shaded region) for the Climate Change (top) and Global Warming (bottom) database

4.4 Disaggregate analysis

We run the analysis at a disaggregate level. The results are given in Figures 8-9 and Tables D.1-D.2 in the Supplementary Material. Figure 8 provides evidence of an inverse relationship between estimated persistence α^\widehat{\alpha} and dispersion VMR^\widehat{VMR} cross countries (reference lines in the left plot). There is evidence of this inverse relationship in both the CC (blue dots) and GW (red dots) datasets. The plot on the right indicates an inverse (direct) relationship between the estimated unconditional mean με^/(1α^)\widehat{\mu_{\varepsilon}}/(1-\widehat{\alpha}) and the dispersion index VMR^\widehat{VMR} for the GW (CC). In the same picture, we indicate the parameter estimates for the world volume of searches (stars).

Refer to caption Refer to caption
Figure 8: Persistence-dispersion (α^\widehat{\alpha} and VMR^\widehat{VMR}, left) and unconditional mean and dispersion (με^/(1α^)\widehat{\mu_{\varepsilon}}/(1-\widehat{\alpha}) and VMR^\widehat{VMR}, right) scatter plots for all countries in the “Climate Change” (\bullet) and “Global Warming” (\bullet) datasets. Only countries with less than 21% of zeros are reported. Stars indicate the parameters of the world’s volume of searches. “*” indicates the parameter estimates for the aggregated search volume.
Refer to caption Refer to caption
Figure 9: Unconditional mean (left) and dispersion index (right) of the GW (horizontal) and CC (vertical) for countries with more than 21% of zeros (\bullet) less than 21% (\bullet, values rescaled by five for visualization purposes) in the number of searches. In each plot, the 45 reference line.

The terms “Climate Change” and “Global Warming” are used interchangeably. Nevertheless, they describe different phenomena and can be used to determine the public’s level of understanding about these two parallel concepts Lineman et al. [2015]. We investigate the relationships in the search volumes through the lens of our GLK–INAR(1) model. The left plot in Fig. 9 shows the unconditional mean of the search volumes for the two concepts in all countries (dots). In public attention, the two concepts are connected in the long run. We find a positive association for both countries with large (percentage of zeros << 21%) and low search volumes (percentage of zeros >> 21%). There is an asymmetric effect in the overdispersion (right plot), and in all countries, the GW search volume has a larger VMR than the CC volume. This can be explained by the larger variability induced by the changes in the use of the GW term in official communications.

Comparing the coefficients across the rows of Tables D.1-D.2 in the Supplementary Material, we find evidence of two types of series, one with high persistence and the other with low persistence. Moreover, for each country, the level of persistence is similar across the two datasets (compare columns of Tables D.1-D.2 in the Supplementary Material).

Tables D.1-D.2 in the Supplementary Material report the marginal likelihood of the GLK–INAR(1) and Lagrangian Katz INAR(1) in columns GLK and LK, respectively. We find evidence of a better fitting of the GLK–INAR(1) for some countries and variables, e.g. CC searches in India and CC and GW searches in South Africa. To get further insights into the results, we study the relationship between the dynamic and dispersion properties of the series and the actual level of climate risk of the countries. We consider the Global Climate Risk Index (CRI), which ranks countries and regions following the impacts of extreme weather events (such as storms, hurricanes, floods, heatwaves, etc.). The lower the index value, the larger the climate risk is. Following the values of the CRI for 2021, based on the events recorded from 2000 to 2019, our dataset includes some of the countries most exposed to climate risk, such as Japan, Philippines, Germany, South Africa, India, Sri Lanka and Canada [Eckstein et al., 2021, see].

The left plot in Fig. 10 shows the unconditional mean against the CRI. There is evidence of a positive relationship between the public interest in climate-related topics and the actual level of climatic risk. The lower the CRI level, the larger the Google search volumes are (see dashed lines). For example, India has a high risk (CRI equal to 7) and a very high long-run level of public attention.

The right plot reports the coefficient of variation against the CRI for all countries in the “Climate Change” (blue) and “Global Warming” (red) datasets. The dashed lines represent linear regressions estimated on the data. There is evidence of a negative relationship between the dispersion of public concern and climatic risk; in countries with more significant risk levels, the Google search volumes are less over–dispersed.

Refer to caption Refer to caption
Figure 10: Climate Risk Index and unconditional mean scatter plot (CRI-με/(1α)\mu_{\varepsilon}/(1-\alpha), left) and Climate Risk Index and dispersion scatter plot (CRI-CVCV, right) scatter plots for all countries in the “Climate Change” (\bullet) and “Global Warming” (\bullet) datasets. Dashed lines represent the linear regression estimated on the data.

To deal with the excess of zeros, which are very frequent in more than 62% of the series, we apply the MS–GLK–INAR(1) with two states, where the first state represents a prolonged absence of searches on Google and the second a persistent search activity. The MS–GLK–INAR(1) performs better than MS–NBINAR(1) in 119 out of the 130 CC and GW time series following the DIC (see Table D.5 and D.4 in the Supplementary Material). Accounting for the excess of zeros allows for improving the estimation of the persistence and provides an estimate of the probability π^11\hat{\pi}_{11} to stay in an inactive search regime. The findings on the persistence parameter discussed in this section for the GLK–INAR(1) are confirmed by the MS–GLK–INAR. Furthermore, there is evidence of a positive relationship between the probability π^11\hat{\pi}_{11} and the CRI, consistent with the results on the Google search persistence.

5 Conclusion

A novel integer–valued autoregressive process is proposed with Generalized Lagrangian Katz innovations (GLK–INAR). Theoretical properties of the model, such as stationarity, moments, and semi–self–decomposability, are provided. To deal with parameter instability and excess of zeroes, we also propose a Markov–Switching GLK–INAR. A Bayesian approach to inference and an efficient Gibbs sampling procedure have been proposed, which naturally account for uncertainty when forecasting. The modelling framework is applied to a Google Trend dataset measuring the public concern about climate change in 65 countries. The greater flexibility of the GLK–INAR allows for a superior fitting compared to the standard INAR models and a better comprehension of the heterogeneity in public perception. More specifically, new evidence is provided about the long-run level of public attention, its persistence and dispersion in countries with low and high levels of climate risk. The Markov-switching GLK-INAR identified regimes with the absence of searches and changes in the dynamic features of the series.

Acknowledgment

The authors acknowledge support from: the MUR– PRIN project ‘Discrete random structures for Bayesian learning and prediction’ under g.a. n. 2022CLTYP4 and the Next Generation EU – ‘GRINS– Growing Resilient, INclusive and Sustainable’ project (PE0000018), National Recovery and Resilience Plan (NRRP)– PE9 – Mission 4, C2, Intervention 1.3. The views and opinions expressed are only those of the authors and do not necessarily reflect those of the European Union or the European Commission. Neither the European Union nor the European Commission can be held responsible for them.

References

  • Afrifa-Yamoah & Mueller [2022] Afrifa-Yamoah, E., & Mueller, U. (2022). Modeling digital camera monitoring count data with intermittent zeros for short-term prediction. Heliyon, 8, e08774.
  • Aknouche et al. [2021] Aknouche, A., Almohaimeed, B. S., & Dimitrakopoulos, S. (2021). Forecasting transaction counts with integer-valued garch models. Studies in Nonlinear Dynamics & Econometrics, 26, 529–539.
  • Al-Osh & Alzaid [1987] Al-Osh, M., & Alzaid, A. A. (1987). First-order integer-valued autoregressive (INAR (1)) process. Journal of Time Series Analysis, 8, 261–275.
  • Al-Osh & Aly [1992] Al-Osh, M. A., & Aly, E.-E. A. (1992). First order autoregressive time series with negative binomial and geometric marginals. Communications in Statistics - Theory and Methods, 21, 2483–2492.
  • Alzaid & Al-Osh [1988] Alzaid, A., & Al-Osh, M. (1988). First-order integer-valued autoregressive (INAR (1)(1)) process: distributional and regression properties. Statist. Neerlandica, 42, 53–61.
  • Alzaid & Al-Osh [1993] Alzaid, A., & Al-Osh, M. (1993). Generalized Poisson ARMA processes. Annals of the Institute of Statistical Mathematics, 45, 223–232.
  • Alzaid & Omair [2014] Alzaid, A. A., & Omair, M. A. (2014). Poisson difference integer valued autoregressive model of order one. Bulletin of the Malaysian Mathematical Sciences Society, 37, 465–485.
  • Anderberg et al. [2021] Anderberg, D., Rainer, H., & Siuda, F. (2021). Quantifying domestic violence in times of crisis: An internet search activity-based measure for the covid-19 pandemic. Journal of the Royal Statistical Society: Series A (Statistics in Society), .
  • Andersson & Karlis [2014] Andersson, J., & Karlis, D. (2014). A parametric time series model with covariates for integers in Z. Statistical Modelling, 14, 135–156.
  • Andrieu & Thoms [2008] Andrieu, C., & Thoms, J. (2008). A tutorial on adaptive MCMC. Statistics and computing, 18, 343–373.
  • Ardia & Bluteau [2017] Ardia, D., & Bluteau, K. (2017). nse: Computation of numerical standard errors in r. Journal of Open Source Software, 2, 172.
  • Ardia et al. [2018] Ardia, D., Bluteau, K., & Hoogerheide, L. F. (2018). Methods for computing numerical standard errors: Review and application to value-at-risk estimation. Journal of Time Series Econometrics, 10.
  • Battaglini et al. [2009] Battaglini, A., Barbeau, G., Bindi, M., & Badeck, F.-W. (2009). European winegrowers’ perceptions of climate change impact and options for adaptation. Regional Environmental Change, 9, 61–73.
  • Berry & West [2020] Berry, L. R., & West, M. (2020). Bayesian forecasting of many count-valued time series. Journal of Business & Economic Statistics, 38, 872–887.
  • Bourguignon et al. [2016] Bourguignon, M., Vasconcellos, K. L., Reisen, V. A., & Ispány, M. (2016). A Poisson INAR(1) process with a seasonal structure. Journal of Statistical Computation and Simulation, 86, 373–387.
  • Bouzar [2008] Bouzar, N. (2008). Semi-self-decomposable distributions on 𝐙+\mathbf{Z}_{+}. Annals of the Institute of Statistical Mathematics, 60, 901–917.
  • Chen & Lee [2016] Chen, C. W., & Lee, S. (2016). Generalized Poisson autoregressive models for time series of counts. Computational Statistics & Data Analysis, 99, 51–67.
  • Chen & Lee [2017] Chen, C. W., & Lee, S. (2017). Bayesian causality test for integer-valued time series models with applications to climate and crime data. Journal of the Royal Statistical Society: Series C (Applied Statistics), 66, 797–814.
  • Choi & Varian [2012] Choi, H., & Varian, H. (2012). Predicting the present with Google Trends. Economic Record, 88, 2–9.
  • Consul & Famoye [2006] Consul, P. C., & Famoye, F. (2006). Lagrangian probability distributions. Springer.
  • Cunha et al. [2018] Cunha, E. T. d., Vasconcellos, K. L., & Bourguignon, M. (2018). A skew integer-valued time-series process with generalized Poisson difference marginal distribution. Journal of Statistical Theory and Practice, 12, 718–743.
  • Diafouka et al. [2022] Diafouka, M. K., Louzayadio, C. G., Malouata, R. O., Ngabassaka, N. R., & Bidounga, R. (2022). On a bivariate katz’s distribution. Advances in Mathematics: Scientific Journal, 11, 955–968.
  • Douwes-Schultz & Schmidt [2022] Douwes-Schultz, D., & Schmidt, A. M. (2022). Zero-state coupled markov switching count models for spatio-temporal infectious disease spread. Journal of the Royal Statistical Society Series C: Applied Statistics, 71, 589–612.
  • Drovandi et al. [2016] Drovandi, C. C., Pettitt, A. N., & McCutchan, R. A. (2016). Exact and Approximate Bayesian Inference for Low Integer-Valued Time Series Models with Intractable Likelihoods. Bayesian Analysis, 11, 325 – 352.
  • Eckstein et al. [2021] Eckstein, D., Künzel, V., & Schäfer, L. (2021). Global climate risk index 2021. Who suffers most from extreme weather events? Weather-related loss events in 2019 and 2000-2019. Bonn: Germanwatch, 2021.
  • Fahad & Wang [2018] Fahad, S., & Wang, J. (2018). Farmers’ risk perception, vulnerability, and adaptation to climate change in rural pakistan. Land Use Policy, 79, 301–309.
  • Freeland & McCabe [2004] Freeland, R., & McCabe, B. P. (2004). Analysis of low count time series data by Poisson autoregression. Journal of Time Series Analysis, 25, 701–722.
  • Freeland [2010] Freeland, R. K. (2010). True integer value time series. AStA Advances in Statistical Analysis, 94, 217–229.
  • Fried et al. [2015] Fried, R., Agueusop, I., Bornkamp, B., Fokianos, K., Fruth, J., & Ickstadt, K. (2015). Retrospective Bayesian outlier detection in INGARCH series. Statistics and Computing, 25, 365–374.
  • Frondel et al. [2017] Frondel, M., Simora, M., & Sommer, S. (2017). Risk perception of climate change: Empirical evidence for Germany. Ecological Economics, 137, 173–183.
  • Garay et al. [2020a] Garay, A. M., Medina, F. L., Cabral, C. R., & Lin, T.-I. (2020a). Bayesian analysis of the p-order integer-valued ar process with zero-inflated poisson innovations. Journal of Statistical Computation and Simulation, 90, 1943–1964.
  • Garay et al. [2020b] Garay, A. M., Medina, F. L., Cabral, C. R. B., & Lin, T.-I. (2020b). Bayesian analysis of the p-order integer-valued ar process with zero-inflated poisson innovations. Journal of Statistical Computation and Simulation, 90, 1943–1964.
  • Geyer [1992] Geyer, C. J. (1992). Practical Markov chain Monte Carlo. Statistical Science, (pp. 473–483).
  • Janardan [1998] Janardan, K. (1998). Generalized Polya Eggenberger family of distributions and its relation to Lagrangian Katz family. Communications in Statistics-Theory and Methods, 27, 2423–2442.
  • Janardan [1999] Janardan, K. (1999). Estimation of parameters of the GPED. Communications in Statistics-Theory and Methods, 28, 2167–2179.
  • Jin-Guan & Yuan [1991] Jin-Guan, D., & Yuan, L. (1991). The integer-valued autoregressive (INAR (p)) model. Journal of Time Series Analysis, 12, 129–142.
  • Katz [1965] Katz, L. (1965). Unified treatment of a broad class of discrete probability distributions. Classical and contagious discrete distributions, 1, 175–182.
  • Kim & Lee [2017] Kim, H., & Lee, S. (2017). On first-order integer-valued autoregressive process with Katz family innovations. Journal of Statistical Computation and Simulation, 87, 546–562.
  • Kim & Park [2008] Kim, H.-Y., & Park, Y. (2008). A non-stationary integer-valued autoregressive model. Statistical Papers, 49, 485.
  • Liesenfeld et al. [2006] Liesenfeld, R., Nolte, I., & Pohlmeier, W. (2006). Modelling financial transaction price movements: A dynamic integer count data model. Empirical Economics, 30, 795–825.
  • Lineman et al. [2015] Lineman, M., Do, Y., Kim, J. Y., & Joo, G.-J. (2015). Talking about climate change and global warming. PloS one, 10, e0138996.
  • Maiti et al. [2015] Maiti, R., Biswas, A., & Das, S. (2015). Time series of zero-inflated counts and their coherent forecasting. Journal of Forecasting, 34, 694–707.
  • Malyshkina et al. [2009] Malyshkina, N. V., Mannering, F. L., & Tarko, A. P. (2009). Markov switching negative binomial models: an application to vehicle accident frequencies. Accident Analysis & Prevention, 41, 217–226.
  • Marques et al. [2022] Marques, P., Graziadei, H., & Lopes, H. F. (2022). Bayesian generalizations of the integer-valued autoregressive model. Journal of Applied Statistics, 49, 336–356.
  • McCabe & Martin [2005] McCabe, B., & Martin, G. (2005). Bayesian predictions of low count time series. International Journal of Forecasting, 21, 315–330.
  • McCabe & Skeels [2020] McCabe, B. P., & Skeels, C. L. (2020). Distributions you can count on… But what’s the point? Econometrics, 8, 9.
  • McKenzie [1985] McKenzie, E. (1985). Some simple models for discrete variate time series. Water Resources Bulletin, 21, 645–650.
  • McKenzie [1986] McKenzie, E. (1986). Autoregressive moving-average processes with negative-binomial and geometric marginal distributions. Advances in Applied Probability, 18, 679?705.
  • McKenzie [2003] McKenzie, E. (2003). Discrete variate time series. In D. N. Shanbhag, & C. R. Rao (Eds.), Stochastic Processes: Modelling and Simulation (pp. 573–606). Elsevier.
  • Neal & Subba Rao [2007] Neal, P., & Subba Rao, T. (2007). MCMC for integer-valued ARMA processes. Journal of Time Series Analysis, 28, 92–110.
  • Pedeli & Karlis [2011] Pedeli, X., & Karlis, D. (2011). A bivariate INAR(1) process with application. Statistical Modelling, 11, 325–349.
  • Robert & Casella [2013] Robert, C., & Casella, G. (2013). Monte Carlo statistical methods. Springer Science & Business Media.
  • Roberts et al. [1997] Roberts, G. O., Gelman, A., Gilks, W. R. et al. (1997). Weak convergence and optimal scaling of random walk metropolis algorithms. The Annals of Applied Probability, 7, 110–120.
  • Schiavoni et al. [2021] Schiavoni, C., Palm, F., Smeekes, S., & van den Brakel, J. (2021). A dynamic factor model approach to incorporate big data in state space models for official statistics. Journal of the Royal Statistical Society: Series A (Statistics in Society), 184, 324–353.
  • Schweer & Weiß [2014] Schweer, S., & Weiß, C. H. (2014). Compound Poisson INAR(1) processes: stochastic properties and testing for overdispersion. Comput. Statist. Data Anal., 77, 267–284.
  • Scott & Varian [2014] Scott, S. L., & Varian, H. R. (2014). Predicting the present with Bayesian structural time series. International Journal of Mathematical Modelling and Numerical Optimisation, 5, 4–23.
  • Scotto et al. [2015] Scotto, M. G., Weiß, C. H., & Gouveia, S. (2015). Thinning-based models in the analysis of integer-valued time series: A review. Statistical Modelling, 15, 590–618.
  • Shahtahmassebi & Moyeed [2016] Shahtahmassebi, G., & Moyeed, R. (2016). An application of the generalized Poisson difference distribution to the Bayesian modelling of football scores. Statistica Neerlandica, 70, 260–273.
  • Shang & Zhang [2018] Shang, H., & Zhang, B. (2018). Outliers detection in INAR (1) time series. Journal of Physics: Conference Series, 1053, 012094.
  • Soyer & Zhang [2022] Soyer, R., & Zhang, D. (2022). Bayesian modeling of multivariate time series of counts. Wiley Interdisciplinary Reviews: Computational Statistics, 14, e1559.
  • Spiegelhalter et al. [2002] Spiegelhalter, D. J., Best, N. G., Carlin, B. P., & Van Der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64, 583–639.
  • Steutel & van Harn [1979] Steutel, F. W., & van Harn, K. (1979). Discrete analogues of self-decomposability and stability. The Annals of Probability, (pp. 893–899).
  • Ullah et al. [2018] Ullah, H., Rashid, A., Liu, G., & Hussain, M. (2018). Perceptions of mountainous people on climate change, livelihood practices and climatic shocks: A case study of Swat District, Pakistan. Urban Climate, 26, 244–257.
  • Weiß [2008] Weiß, C. H. (2008). The combined INAR (p) models for time series of counts. Statistics & probability letters, 78, 1817–1822.
  • Weiß [2013] Weiß, C. H. (2013). Integer-valued autoregressive models for counts showing underdispersion. J. Appl. Stat., 40, 1931–1948.
  • Weiß & Kim [2013] Weiß, C. H., & Kim, H.-Y. (2013). Parameter estimation for binomial AR(1) models with applications in finance and industry. Statistical Papers, 54, 563–590.
  • Yang et al. [2021] Yang, S., Ning, S., & Kou, S. C. (2021). Use internet search data to accurately track state level influenza epidemics. Scientific Reports, 11, 4023.
  • Yi et al. [2021] Yi, D., Ning, S., Chang, C.-J., & Kou, S. C. (2021). Forecasting unemployment using internet search data via prism. Journal of the American Statistical Association, 116, 1662–1673.
  • Yu et al. [2019] Yu, L., Zhao, Y., Tang, L., & Yang, Z. (2019). Online big data-driven oil consumption forecasting with Google trends. International Journal of Forecasting, 35, 213–223.
  • Ziegler [2017] Ziegler, A. (2017). Political orientation, environmental values, and climate change beliefs and attitudes: An empirical cross country analysis. Energy Economics, 63, 144–153.

First–order integer–valued autoregressive processes with Generalized Katz innovations

Supplement

This supplement consists of four Appendices. Appendix A provides some properties of the GLK distributions. Appendix B provides proof of the paper’s results. Appendix C contains the simulation results, while Appendix D includes more details on the empirical application.

Appendix A Properties of the GLK distributions

Studying the moments allows for a better understanding of the flexibility of the GLK distribution. The following are four moments relevant to our analysis.

Proposition 5.

Let X𝒢𝒦(a,b,c,β)X\sim\mathcal{GLK}(a,b,c,\beta), define μk=𝔼((X𝔼(X))k)\mu_{k}^{\prime}=\mathbb{E}((X-\mathbb{E}(X))^{k}) and μk=𝔼(Xk)\mu_{k}=\mathbb{E}(X^{k}) then

μ1\displaystyle\mu_{1} =\displaystyle= aθκ,μ2=(1β)aθκ3,\displaystyle\frac{a\theta}{\kappa},\quad\quad\mu_{2}^{\prime}=\frac{(1-\beta)a\theta}{\kappa^{3}},\quad\quad
μ3\displaystyle\mu_{3}^{\prime} =\displaystyle= aθ(12β)(1β)κ4+3aθ2(1β)2(b+c)κ5\displaystyle\frac{a\theta(1-2\beta)(1-\beta)}{\kappa^{4}}+\frac{3a\theta^{2}(1-\beta)^{2}(b+c)}{\kappa^{5}}
μ4\displaystyle\mu_{4}^{\prime} =\displaystyle= aθ(1β)(1+2θb(b+c)βθ)(1ββ2κ6\displaystyle a\theta(1-\beta)(1+2\theta b-(b+c)\beta\theta)\left(\frac{1-\beta-\beta^{2}}{\kappa^{6}}\right.
+5aθ(1β)(b+c)κ7)+3(μ2)2,\displaystyle\left.+\frac{5a\theta(1-\beta)(b+c)}{\kappa^{7}}\right)+3(\mu_{2}^{\prime})^{2},

where κ=1βbβ/c>0\kappa=1-\beta-b\beta/c>0 and θ=β/c\theta=\beta/c.

For a proof, see Janardan [1998] Theorems 1–3.

(a) Mean (μ1\mu_{1}) (b) Index of Dispersion (VMRVMR)
Refer to caption Refer to caption
(c) Skewness (SS) (d) Kurtosis (KK)
Refer to caption Refer to caption
Figure A.1: GLK moments when increasing the value of β\beta (horizontal axis) for different values of bb (lines).

The skewness and the kurtosis of the distribution are

S\displaystyle S =\displaystyle= (12β)κ1/2((1β)aθ)1/2+3((1β)θ)1/2(b+c)aκ1/2,\displaystyle\frac{(1-2\beta)\kappa^{1/2}}{((1-\beta)a\theta)^{1/2}}+\frac{3((1-\beta)\theta)^{1/2}(b+c)}{a\kappa^{1/2}},
K\displaystyle K =\displaystyle= (1+2bθ(b+c)cθ2)((1ββ2)aθ(1β)+5(b+c)κ)+3,\displaystyle(1+2b\theta-(b+c)c\theta^{2})\left(\frac{(1-\beta-\beta^{2})}{a\theta(1-\beta)}+\frac{5(b+c)}{\kappa}\right)+3,

respectively. For a given value of θ\theta, there is negative skewness if β<(1+ξ)/(2κ+ξ)\beta<(1+\xi)/(2\kappa+\xi) with ξ=3θ(b+c)a1/2\xi=3\theta(b+c)a^{-1/2} and positive otherwise.

Figure A.1 illustrates the effect of the parameter values on the mean, dispersion index, skewness and kurtosis. Increasing the value of β\beta (horizontal axis) the 𝒢𝒦(a,b,c,β)\mathcal{GLK}(a,b,c,\beta) distribution allows for different types of dispersion (panel b), for both negative and positive skewness (panel c) and various degrees of excess of kurtosis (panel d).

Appendix B Details of some statements of Section 2

B.1 Connection with generalized Lagrangian distributions

As stated in Remark 1 it is possible to derive the Generalized Lagrangian Katz distribution as a ”generalized Lagrangian distribution”. Let f(z)f(z) and g(z)g(z) be two analytic functions of zz, which are infinitely differentiable in [1,1][-1,1] with g(0)0g(0)\not=0. Following [Consul & Famoye, 2006, p. 10-11] the general Lagrangian expansion of ff is

f(z)1zg/g(z)=j=0ujj!|j(gj(z)f(z))|z=0,\frac{f(z)}{1-zg^{\prime}/g(z)}=\sum_{j=0}^{\infty}\frac{u^{j}}{j!}\left|\partial^{j}(g^{j}(z)f(z))\right|_{z=0}, (B.1)

where uu satisfies z=ug(z)z=ug(z). The definition of Lagrangian distribution given in Janardan [1998] uses a slightly different expansion, which is obtained from the one given above by replacing f(z)f(z) with f(z)(1zg(z)/g(z)))f(z)(1-zg^{\prime}(z)/g(z))). By applying iteratively the derivative \partial to the product of functions, we obtain the coefficient in the jj-th term of the expansion

1j!|j(g(z)jf(z)(1zg(z)/g(z)))|z=0\displaystyle\frac{1}{j!}|\partial^{j}\left(g(z)^{j}f(z)(1-zg^{\prime}(z)/g(z))\right)|_{z=0}
=1j!|j1(g(z)jf(z)\displaystyle=\frac{1}{j!}|\partial^{j-1}(g(z)^{j}f^{\prime}(z)
+(j1)g(z)g(z)j1f(z)zgj1(z)g(z)f(z))|z=0=\displaystyle+(j-1)g^{\prime}(z)g(z)^{j-1}f(z)-z\partial g^{j-1}(z)g^{\prime}(z)f(z))|_{z=0}=\ldots
=1j!|j1(g(z)jf(z))|z=0+|j((j)1(g(z)g(z)j1f(z))\displaystyle=\frac{1}{j!}|\partial^{j-1}(g(z)^{j}f^{\prime}(z))|_{z=0}+|\partial^{j-\ell}\left((j-\ell)\partial^{\ell-1}(g^{\prime}(z)g(z)^{j-1}f(z))\right.
z(gj1(z)g(z)f(z)))|z=0=1j!|j1(g(z)jf(z))|z=0,\displaystyle\left.-z\partial^{\ell}(g^{j-1}(z)g^{\prime}(z)f(z))\right)|_{z=0}=\frac{1}{j!}|\partial^{j-1}(g(z)^{j}f^{\prime}(z))|_{z=0},

where we set =j\ell=j to get the result and the following equivalent Lagrangian expansion used in Janardan [1998]

f(z)1zg/g(z)=j=0ujj!|j(gj(z)f(z))|z=0\displaystyle\frac{f(z)}{1-zg^{\prime}/g(z)}=\sum_{j=0}^{\infty}\frac{u^{j}}{j!}\left|\partial^{j}(g^{j}(z)f(z))\right|_{z=0} (B.2)
f(z)=j=0ujj!|j1(gj(z)f(z))|z=0\displaystyle\Leftrightarrow f(z)=\sum_{j=0}^{\infty}\frac{u^{j}}{j!}\left|\partial^{j-1}(g^{j}(z)f^{\prime}(z))\right|_{z=0} (B.3)

In particular, if f(1)=g(1)=1f(1)=g(1)=1, the function uf(z(u))u\mapsto f(z(u)) defines the pgf of the ”generalized Lagrangian distribution” pj=1j!|j1(gj(z)f(z))|z=0p_{j}=\frac{1}{j!}\left|\partial^{j-1}(g^{j}(z)f^{\prime}(z))\right|_{z=0} provided that pj0p_{j}\geq 0 for j=0,1,j=0,1,\dots. Assuming f(z)=(1β1βz)a/cf(z)=\left(\frac{1-\beta}{1-\beta z}\right)^{a/c} and g(z)=(1β1βz)b/cg(z)=\left(\frac{1-\beta}{1-\beta z}\right)^{b/c}, the expressions in (2) and (1) follows after some algebra as detailed in the following. The expansion coefficients become

f(z)=ac(1β1βz)ac+1β1β,f^{\prime}(z)=\frac{a}{c}\left(\frac{1-\beta}{1-\beta z}\right)^{\frac{a}{c}+1}\frac{\beta}{1-\beta},
gk(z)f(z)=ac(1β1βz)ac+kbc+1β1β.g^{k}(z)f^{\prime}(z)=\frac{a}{c}\left(\frac{1-\beta}{1-\beta z}\right)^{\frac{a}{c}+k\frac{b}{c}+1}\frac{\beta}{1-\beta}.

Hence

p0=f(0)=(1β)acp1=g1(0)f(0)=ac(1β)ac+bcβ.p_{0}=f(0)=\left({1-\beta}\right)^{\frac{a}{c}}\quad p_{1}=g^{1}(0)f^{\prime}(0)=\frac{a}{c}\left({1-\beta}\right)^{\frac{a}{c}+\frac{b}{c}}{\beta}.

while, for k2k\geq 2, the kk-th coefficient of the Lagrangian expansion in Eq. B.2 is

pk=1k!|k1(g(z))kf(z)|z=0=\displaystyle p_{k}=\frac{1}{k!}|\partial^{k-1}(g(z))^{k}f^{\prime}(z)|_{z=0}=
=1k!k2|(acβ2(1β)2ξk(1β1βz)ξk+1)|z=0\displaystyle=\frac{1}{k!}\partial^{k-2}|(\frac{a}{c}\frac{\beta^{2}}{(1-\beta)^{2}}\xi_{k}\left(\frac{1-\beta}{1-\beta z}\right)^{\xi_{k}+1})|_{z=0}
=1k!|k3(acβ3(1β)3(ξk(ξk+1))(1β1βz)ξk+2)|z=0\displaystyle=\frac{1}{k!}|\partial^{k-3}(\frac{a}{c}\frac{\beta^{3}}{(1-\beta)^{3}}(\xi_{k}(\xi_{k}+1))\left(\frac{1-\beta}{1-\beta z}\right)^{\xi_{k}+2})|_{z=0}
=\displaystyle=...
=1k!βkac(1β)ac+kbcm=0k2(ξk+m)\displaystyle=\frac{1}{k!}\beta^{k}\frac{a}{c}(1-\beta)^{\frac{a}{c}+k\frac{b}{c}}\prod_{m=0}^{k-2}\left(\xi_{k}+m\right)
=1k!βkac(1β)ac+kbcm=1k1(ac+kbc+m)\displaystyle=\frac{1}{k!}\beta^{k}\frac{a}{c}(1-\beta)^{\frac{a}{c}+k\frac{b}{c}}\prod_{m=1}^{k-1}\left(\frac{a}{c}+k\frac{b}{c}+m\right)
=1k!βkac(1β)ac+kbc(ac+kbc+1)k1\displaystyle=\frac{1}{k!}\beta^{k}\frac{a}{c}(1-\beta)^{\frac{a}{c}+k\frac{b}{c}}\left(\frac{a}{c}+k\frac{b}{c}+1\right)_{{k-1}\uparrow}
=1k!βkac1(ac+kbc+k)(1β)ac+kbc(ac+kbc+1)k\displaystyle=\frac{1}{k!}\beta^{k}\frac{a}{c}\frac{1}{(\frac{a}{c}+k\frac{b}{c}+k)}(1-\beta)^{\frac{a}{c}+k\frac{b}{c}}\left(\frac{a}{c}+k\frac{b}{c}+1\right)_{{k}\uparrow}

where ξk=ac+kbc+1\xi_{k}=\frac{a}{c}+k\frac{b}{c}+1 and (x)k=x(x+1)(x+k1)(x)_{{k}\uparrow}=x(x+1)\ldots(x+k-1) is the rising factorial.

We now discuss conditions for which pk0p_{k}\geq 0 for all k0k\geq 0.

  • If a>0,b>0,c>0a>0,b>0,c>0 one has pk>0p_{k}>0 for every k1k\geq 1.

  • If c<b<0-c<b<0, a/c,b/ca/c,b/c\in\mathbb{N} and (ca)/(c+b)(a+c)/|b|(c-a)/(c+b)\leq(a+c)/|b|, then for k<k=(a+c)/|b|k<k^{*}=(a+c)/|b| one has

    a+bkc+1>0\frac{a+bk}{c}+1>0

    and hence also

    m=1k1(ac+kbc+m)>0\prod_{m=1}^{k-1}\left(\frac{a}{c}+k\frac{b}{c}+m\right)>0

    proving that pk>0p_{k}>0. For kk(ca)/(c+b)k\geq k^{*}\geq(c-a)/(c+b) one has that mk=(|b|ka)/cm_{k}=(|b|k-a)/c is an integer with 1mkk11\leq m_{k}\leq k-1 and hence the product m=1k1(ac+kbc+m)=0\prod_{m=1}^{k-1}\left(\frac{a}{c}+k\frac{b}{c}+m\right)=0 since for m=mkm=m_{k} one has ac+kbc+m=0\frac{a}{c}+k\frac{b}{c}+m=0. This shows that pk=0p_{k}=0 for every kkk\geq k^{*}.

B.2 Generalized Poisson as limit

One obtains a Lagrangian Katz distribution by replacing cc by β\beta. The LK is one of the few distributions which admit more pgfs. Let us consider the following definition of pgf for a 𝒦(a,b,β)\mathcal{LK}(a,b,\beta)

G(u,a,b,β)=(1βz1β)aβ,withz(a,b,β)=u(1βz1β)bβ.G(u,a,b,\beta)=\left(\frac{1-\beta z}{1-\beta}\right)^{-\frac{a}{\beta}},\quad\hbox{with}\,\,z(a,b,\beta)=u\left(\frac{1-\beta z}{1-\beta}\right)^{-\frac{b}{\beta}}. (B.4)

given in [Consul & Famoye, 2006, p. 241]. Defining n=(1βz)/(β(z1))n=(1-\beta z)/(\beta(z-1)) and 1/β=n(z1)+z1/\beta=n(z-1)+z the limiting pgf becomes

limβ0+G(u;a,b,β)=limn+(1+1n)n(z1)+za=ea(z1),\underset{\beta\rightarrow 0^{+}}{\lim}G(u;a,b,\beta)=\underset{n\rightarrow+\infty}{\lim}\left(1+\frac{1}{n}\right)^{\frac{n(z-1)+z}{a}}=e^{a(z-1)}, (B.5)

with

limβ0+z(a,b,β)=limn+(1+1n)n(z1)+zb=eb(z1)\underset{\beta\rightarrow 0^{+}}{\lim}z(a,b,\beta)=\underset{n\rightarrow+\infty}{\lim}\left(1+\frac{1}{n}\right)^{\frac{n(z-1)+z}{b}}=e^{b(z-1)} (B.6)

which is the pgf of the Generalized Poisson given in [Consul & Famoye, 2006, pp. 166].

B.3 Proof of the results in Theorem 4

(i) Under stationarity assumption one has μX=E(Xs)\mu_{X}=E(X_{s}) for all ss\in\mathbb{Z}, thus μX=αμX+𝔼(εt)\mu_{X}=\alpha\mu_{X}+\mathbb{E}(\varepsilon_{t}) which implies μX=με/(1α)\mu_{X}=\mu_{\varepsilon}/(1-\alpha).

(ii) Let μX(2)=E(Xs2)\mu_{X}^{(2)}=E(X_{s}^{2}) for all ss\in\mathbb{Z}, then 𝔼(Xt2)=𝔼((αXt1)2)+𝔼(εt2)+𝔼(2(αXt1)εt)=\mathbb{E}(X_{t}^{2})=\mathbb{E}((\alpha\circ X_{t-1})^{2})+\mathbb{E}(\varepsilon_{t}^{2})+\mathbb{E}(2(\alpha\circ X_{t-1})\varepsilon_{t})= 𝕍((αXt1)2)+(𝔼(αXt1))2+𝔼(εt2)+𝔼(2(αXt1)εt)\mathbb{V}((\alpha\circ X_{t-1})^{2})+(\mathbb{E}(\alpha\circ X_{t-1}))^{2}+\mathbb{E}(\varepsilon_{t}^{2})+\mathbb{E}(2(\alpha\circ X_{t-1})\varepsilon_{t}). By the law of iterated expectation

μX(2)=α2(μX(2)μX2)+α(1α)μX+α2μX2+με(2)+αμXμε\begin{split}\mu_{X}^{(2)}=&\alpha^{2}(\mu_{X}^{(2)}-\mu_{X}^{2})+\alpha(1-\alpha)\mu_{X}+\alpha^{2}\mu_{X}^{2}\\ &+\mu_{\varepsilon}^{(2)}+\alpha\mu_{X}\mu_{\varepsilon}\\ \end{split}

and hence

μX(2)=11α2(αμε+με(2)+2α1αμε2)\mu_{X}^{(2)}=\frac{1}{1-\alpha^{2}}\left(\alpha\mu_{\varepsilon}+\mu_{\varepsilon}^{(2)}+\frac{2\alpha}{1-\alpha}\mu_{\varepsilon}^{2}\right)

(iii) One has

𝔼(XtXtk)=𝔼((αXt1+εt)Xtk)=𝔼(𝔼((αXt1)Xtk|Xtk,Xt1))+𝔼(εt)𝔼(Xtk)=α𝔼(Xt1Xtk)+μεμX.\begin{split}&\mathbb{E}(X_{t}X_{t-k})=\mathbb{E}((\alpha\circ X_{t-1}+\varepsilon_{t})X_{t-k})=\\ &\mathbb{E}(\mathbb{E}((\alpha\circ X_{t-1})X_{t-k}|X_{t-k},X_{t-1}))+\mathbb{E}(\varepsilon_{t})\mathbb{E}(X_{t-k})\\ &=\alpha\mathbb{E}(X_{t-1}X_{t-k})+\mu_{\varepsilon}\mu_{X}.\\ \end{split}

(iv) Let us denote with (x)m=x(x1)(xm+1)(x)_{m}=x(x-1)\ldots(x-m+1) the falling factorial and with μ(k¯)=𝔼((X)k)\mu^{(\underline{k})}=\mathbb{E}((X)_{k}) the mm-order falling factorial moment of a random variable XX. The following two results will be used. The relationships between non-central moments and falling factorial moments are

𝔼(Xtm)\displaystyle\mathbb{E}(X_{t}^{m}) =\displaystyle= k=0mS(m,k)𝔼((Xt)k)\displaystyle\sum_{k=0}^{m}S(m,k)\mathbb{E}((X_{t})_{k}) (B.7)
𝔼((Xt)m)\displaystyle\mathbb{E}((X_{t})_{m}) =\displaystyle= k=0ms(m,k)𝔼(Xtk)\displaystyle\sum_{k=0}^{m}s(m,k)\mathbb{E}(X_{t}^{k}) (B.8)

where s(m,k)s(m,k) and S(m,k)S(m,k) are the Stirling numbers of the I and II kind, respectively [e.g., see Consul & Famoye, 2006, p. 18]. Let XX and YY be two random variables then

𝔼((X+Y)m)=k=0m(mk)𝔼((X)k)𝔼((X)mk)\mathbb{E}((X+Y)_{m})=\sum_{k=0}^{m}{m\choose k}\mathbb{E}((X)_{k})\mathbb{E}((X)_{m-k}) (B.9)

which can be proved by induction. Let αX\alpha\circ X a binomial thinning with XX a discrete random variable, then

𝔼((αX)k)=𝔼((j=1XBj)k)=𝔼(|κ|=kj=1XBjκj)=𝔼((Xk)k!αk)=αk𝔼((X)k)\begin{split}\mathbb{E}((\alpha\circ X)_{k})&=\mathbb{E}((\sum_{j=1}^{X}B_{j})_{k})=\mathbb{E}\left(\sum_{|\kappa|=k}\prod_{j=1}^{X}B_{j}^{\kappa_{j}}\right)\\ &=\mathbb{E}\left({X\choose k}k!\alpha^{k}\right)=\alpha^{k}\mathbb{E}((X)_{k})\\ \end{split} (B.10)

where |κ|=κ1++κX|\kappa|=\kappa_{1}+\ldots+\kappa_{X}. Using the results given above and stationarity (i.e. 𝔼((Xt)m)=μX(m¯)\mathbb{E}((X_{t})_{m})=\mu_{X}^{(\underline{m})} one obtains

𝔼((Xt)m)=𝔼((αXt1+εt)m)\displaystyle\mathbb{E}((X_{t})_{m})=\mathbb{E}((\alpha\circ X_{t-1}+\varepsilon_{t})_{m}) (B.11)
=k=0m(mk)𝔼((αXt1)k)𝔼((εt)mk)\displaystyle=\sum_{k=0}^{m}{m\choose k}\mathbb{E}\left(\left(\alpha\circ X_{t-1}\right)_{k}\right)\mathbb{E}((\varepsilon_{t})_{m-k}) (B.12)
=k=0m(mk)αk𝔼((Xt1)k)με(mk¯)\displaystyle=\sum_{k=0}^{m}{m\choose k}\alpha^{k}\mathbb{E}((X_{t-1})_{k})\mu_{\varepsilon}^{(\underline{m-k})} (B.13)

which implies the mm-order falling factorial moment of a INAR(1) is

μX(m¯)=11αmk=0m1(mk)αkμX(k¯)με(mk¯)\displaystyle\mu_{X}^{(\underline{m})}=\frac{1}{1-\alpha^{m}}\sum_{k=0}^{m-1}{m\choose k}\alpha^{k}\mu_{X}^{(\underline{k})}\mu_{\varepsilon}^{(\underline{m-k})} (B.14)
=11αmk=0m1l=0mk(mk)s(mk,l)αkμX(k¯)με(l)\displaystyle=\frac{1}{1-\alpha^{m}}\sum_{k=0}^{m-1}\sum_{l=0}^{m-k}{m\choose k}s(m-k,l)\alpha^{k}\mu_{X}^{(\underline{k})}\mu_{\varepsilon}^{(l)} (B.15)

and the mm-order moment is

μX(m)=i=0mS(m,i)11αik=0i1l=0ik(ik)s(ik,l)αkμX(k¯)με(l)\mu_{X}^{(m)}=\sum_{i=0}^{m}S(m,i)\frac{1}{1-\alpha^{i}}\sum_{k=0}^{i-1}\sum_{l=0}^{i-k}{i\choose k}s(i-k,l)\alpha^{k}\mu_{X}^{(\underline{k})}\mu_{\varepsilon}^{(l)} (B.16)

Appendix C Further simulation results

Refer to caption Refer to caption
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure C.1: MCMC approximation of the posterior distribution (histogram) of the parameters α\alpha (top), the unconditional mean με/(1α)\mu_{\varepsilon}/(1-\alpha) (middle) and the marginal likelihood (bottom) of the GLK–INAR(1) in the high–persistence (left) and low–persistence (right) setting. In all plots, the true parameter value (red dashed) and the estimated one (black solid).
Table C.1: Autocorrelation function (ACF), effective sample size (ESS), inefficiency factor (INEFF), numerical standard errors (NSE) and Geweke’s convergence diagnostic (CD) of the posterior MCMC samples for the two settings: low persistence and high persistence. We ran the proposed MCMC algorithm for 50,000 iterations and evaluate the statistics before (subscript BT) and after (subscript AT) removing the first 10,000 burn-in samples, and applying a thinning procedure with a factor of 10. In parenthesis the p-values of the Geweke’s convergence diagnostic.
Low persistence High persistence
α=0.3\alpha=0.3, a=5.3239a=5.3239, b=0.0592b=0.0592, c=0.6c=0.6, β=0.5917\beta=0.5917 α=0.7\alpha=0.7, a=5.3239a=5.3239, b=0.0592b=0.0592, c=0.6c=0.6, β=0.5917\beta=0.5917
α\alpha aa bb cc β\beta α\alpha aa bb cc β\beta
ACF(1)BTACF(1)_{BT} 0.93 0.92 0.93 0.92 0.94 0.94 0.92 0.93 0.92 0.94
ACF(5)BTACF(5)_{BT} 0.71 0.68 0.72 0.66 0.74 0.72 0.68 0.73 0.65 0.74
ACF(10)BTACF(10)_{BT} 0.52 0.47 0.54 0.45 0.56 0.53 0.48 0.55 0.44 0.55
ACF(1)ATACF(1)_{AT} 0.54 0.47 0.55 0.44 0.58 0.53 0.48 0.56 0.45 0.57
ACF(5)ATACF(5)_{AT} 0.08 0.06 0.13 0.08 0.16 0.06 0.05 0.11 0.02 0.08
ACF(10)ATACF(10)_{AT} -0.01 0.02 0.02 0.04 0.06 0.01 -0.003 -0.004 0.06 0.03
ESSBTESS_{BT} 0.06 0.06 0.06 0.06 0.06 0.06 0.06 0.06 0.06 0.06
ESSATESS_{AT} 0.17 0.18 0.15 0.18 0.14 0.19 0.20 0.15 0.20 0.16
INEFFBTINEFF_{BT} 17.05 16.43 17.20 16.12 17.61 17.30 16.51 17.46 16.00 17.57
INEFFATINEFF_{AT} 5.84 5.51 6.45 5.54 6.97 5.35 5.07 6.58 4.87 6.09
NSEBTNSE_{BT} 0.002 0.05 0.002 0.006 0.002 0.001 0.06 0.004 0.01 0.004
NSEATNSE_{AT} 0.002 0.09 0.003 0.01 0.004 0.002 0.09 0.003 0.01 0.004
CDBTCD_{BT} 1.12 -2.08 -0.42 0.31 1.72 -0.48 -0.92 -0.13 -1.48 -0.83
(0.26) (0.04) (0.68) (0.76) (0.09) (0.63) (0.36) (0.89) (0.14) (0.41)
CDATCD_{AT} -1.047 0.57 -1.32 0.68 1.12 -1.05 0.57 -1.32 0.68 1.12
(0.30) (0.57) (0.19) (0.50) (0.26) (0.30) (0.57) (0.19) (0.50) (0.26)
High-persistence setting
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure C.2: MCMC output for the parameters of the GLK–INAR(1). In all plots, the MCMC draws (gray solid), the progressive MCMC average (dashed black) over the iterations (horizontal axis in thousands), and the true value of the parameter (horizontal red dashed).
High–persistence setting
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure C.3: MCMC approximation of the posterior distribution (histogram) of the parameters. In all plots, the estimated value (vertical black solid), the true value (vertical red dotted) and the prior density (dashed).
Low–persistence setting
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure C.4: MCMC output for the parameters of the GLK–INAR(1). In all plots, the MCMC draws (gray solid), the progressive MCMC average (dashed black) over the iterations (horizontal axis in thousands), and the true value of the parameter (horizontal red dashed).
Low-persistence setting
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure C.5: MCMC approximation of the posterior distribution (histogram) of the parameters. In all plots, the estimated value (vertical black solid), the true value (vertical red dotted) and the prior density (dashed).
High–persistence setting
Refer to caption Refer to caption
Low–persistence setting
Refer to caption Refer to caption
Figure C.6: MCMC acceptance rate (left) and adaptive log-scales (right) over the iterations (horizontal axis in thousands).
Refer to caption
Figure C.7: Trajectory of the MS–GLK–INAR(1) (right subplot) with two regimes high (gray) and low (white) persistence and unconditional mean with their corresponding histogram (left subplot) and the estimated allocation variable in shaded rectangles.
Refer to caption
Figure C.8: Trajectory of the MS–GLK–INAR(1) (right subplot) with three regimes: inflated–zero (dark gray), high (gray) and low (white) persistence and unconditional mean with their corresponding histogram (left subplot) and the estimated allocation variable in shaded rectangles.
a) Low persistence regime b) High persistence regime
α\alpha Refer to caption Refer to caption
aa Refer to caption Refer to caption
bb Refer to caption Refer to caption
cc Refer to caption Refer to caption
β\beta Refer to caption Refer to caption
Figure C.9: MCMC output for the parameters of the MS–GLK–INAR(1) with two regimes: High and low persistence and unconditional mean. In all plots, the MCMC draws (gray solid), the progressive MCMC average (dashed black) over the iterations (horizontal axis in thousands), and the true value of the parameter (horizontal red dashed).
a) Low persistence regime b) High persistence regime
α\alpha Refer to caption Refer to caption
aa Refer to caption Refer to caption
bb Refer to caption Refer to caption
cc Refer to caption Refer to caption
β\beta Refer to caption Refer to caption
Figure C.10: MCMC approximation of the posterior distribution (histogram)of the MS–GLK–INAR(1) parameters with two regimes: High and low persistence and unconditional mean. In all plots, the estimated value (vertical black solid), the true value (vertical red dotted) and the prior density (dashed).
α\alpha Refer to caption Refer to caption
aa Refer to caption Refer to caption
bb Refer to caption Refer to caption
cc Refer to caption Refer to caption
β\beta Refer to caption Refer to caption
Figure C.11: MCMC trace plot (left column) and posterior approximation (histograms right column) for the parameters of the MS–GLK–INAR(1) with two regimes: Inflated–zero and High persistence and unconditional mean. In all plots, the MCMC draws (gray solid), the progressive MCMC average (dashed black) over the iterations (horizontal axis in thousands), and the true value of the parameter (red line).

Appendix D Further real data results

Google search dataset “Climate Change”
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure D.1: MCMC approximation of the posterior distribution (histogram) of the parameters. In all plots, the estimated value (vertical black solid).
Google search dataset “Global Warming”
α\alpha aa
Refer to caption Refer to caption
bb cc
Refer to caption Refer to caption
β\beta aβ/(ccββb)a\beta/(c-c\beta-\beta b)
Refer to caption Refer to caption
Figure D.2: MCMC approximation of the posterior distribution (histogram) of the parameters. In all plots, the estimated value (vertical black solid).
Table D.1: Estimated GLK–INAR(1) autoregressive coefficient (α^\widehat{\alpha}) and its 95% credible interval (CI), and marginal likelihood of the GLK–INAR(1) and the NBINAR(1) models, for the “Climate Change” and “Global Warming” search volumes in different countries. Countries with less than 21% of zeros in the two series. “*” indicate the model with the largest marginal likelihood.
Climate Change dataset Climate Warming dataset
Country α^\widehat{\alpha} CICI GLK NB α^\widehat{\alpha} CICI GLK NB
Australia 0.547 (0.501,0.589) -882.43 -885.87 0.343 (0.287,0.397) -1125.99 -1120.90
Bangladesh 0.018 (0.002,0.053) -1118.68 -1111.99 0.001 (0.001,0.005) -1166.74 -1162.50
Brazil 0.011 (0.001,0.029) -1187.96 -1165.17 0.001 (0.001,0.002) -1027.29 -1024.77
Canada 0.672 (0.615,0.714) -711.55 -716.70 0.512 (0.468,0.554) -1003.35 -1000.60
Emirates 0.010 (0.001,0.029) -1200.72 -1182.74 0.005 (0.001,0.020) -1158.09 -1149.59
France 0.184 (0.127,0.248) -1069.66 -1068.36 0.006 (0.001,0.019) -1140.59 -1130.91
Germany 0.298 (0.209,0.373) -1023.87 -1022.87 0.021 (0.001,0.060) -1099.04 -1094.57
India 0.499 (0.420,0.562) -922.74 -923.57 0.482 (0.417,0.541) -1019.07 -1018.57
Indonesia 0.021 (0.003,0.064) -1123.06 -1108.15 0.253 (0.193,0.311) -1195.34 -1172.30
Ireland 0.333 (0.279,0.388) -953.35 -952.64 0.001 (0.001,0.002) -1093.97 -1094.39
Italy 0.169 (0.090,0.241) -983.83 -982.26 0.001 (0.001,0.004) -1036.21 -1034.28
Malaysia 0.002 (0.001,0.008) -1171.22 -1167.40 0.006 (0.001,0.031) -1135.02 -1129.73
Mexico 0.009 (0.001,0.031) -1160.49 -1148.85 0.001 (0.001,0.001) -1098.96 -1097.73
Netherlands 0.148 (0.075,0.218) -1064.29 -1061.30 0.001 (0.001,0.005) -1066.26 -1059.54
NewZealand 0.353 (0.283,0.412) -894.23 -895.29 0.013 (0.001,0.044) -1151.01 -1136.31
Nigeria 0.129 (0.068,0.191) -1043.77 -1041.80 0.017 (0.001,0.062) -974.18 -966.83
Pakistan 0.212 (0.148,0.272) -1131.60 -1124.74 0.023 (0.001,0.064) -1135.66 -1128.55
Philippine 0.409 (0.354,0.460) -1069.39 -1066.66 0.383 (0.327,0.432) -1150.86 -1138.26
Singapore 0.159 (0.095,0.222) -1099.02 -1094.02 0.006 (0.001,0.023) -1132.94 -1122.04
SouthAfrica 0.413 (0.353,0.467) -923.33 -925.21 0.334 (0.274,0.391) -865.23 -868.48
Spain 0.253 (0.193,0.320) -883.65 -886.68 0.001 (0.001,0.002) -1049.48 -1041.62
Thailand 0.008 (0.001,0.035) -1082.74 -1078.49 0.004 (0.001,0.012) -1218.44 -1193.67
UK 0.535 (0.486,0.587) -932.80 -937.25 0.319 (0.246,0.388) -1084.84 -1081.81
US 0.601 (0.549,0.649) -867.57 -871.12 0.606 (0.558,0.649) -941.39 -942.22
Vietnam 0.003 (0.001,0.012) -1189.79 -1178.82 0.001 (0.001,0.001) -987.59 -986.21
Table D.2: Estimated GLK–INAR(1) autoregressive coefficient (α^\widehat{\alpha}) and its 95% credible interval (CI), and marginal likelihood of the GLK–INAR(1) and the NBINAR(1) models, for the “Climate Change” and “Global Warming” search volumes in different countries. Countries with more than 21% of zeros in the two series. “*” indicate the model with the largest marginal likelihood.
Climate Change dataset Climate Warming dataset
Country α^\widehat{\alpha} CICI GLK NB α^\widehat{\alpha} CICI GLK NB
Argentina 0.001 (0.001,0.001) -1022.22 -1044.08 0.001 (0.001,0.001) -803.30 -850.10
Austria 0.001 (0.001,0.001) -1085.17 -1082.50 0.001 (0.001,0.001) -717.08 -754.18
Belgium 0.002 (0.001,0.012) -1141.59 -1136.92 0.001 (0.001,0.001) -879.09 -944.73
Colombia 0.001 (0.001,0.002) -1040.35 -1053.91 0.001 (0.001,0.001) -848.15 -882.50
Denmark 0.008 (0.001,0.023) -1123.97 -1101.34 0.002 (0.001,0.005) -973.88 -950.20
Egypt 0.001 (0.001,0.002) -1103.36 -1114.87 0.001 (0.001,0.002) -855.84 -867.85
Ethiopia 0.002 (0.001,0.011) -1089.74 -1081.68 0.001 (0.001,0.001) -876.11 -914.27
Finland 0.027 (0.001,0.084) -987.30 -983.16 0.001 (0.001,0.001) -670.46 -679.12
Ghana 0.003 (0.001,0.012) -992.09 -980.83 0.001 (0.001,0.001) -854.44 -922.58
Jamaica 0.001 (0.001,0.005) -995.13 -994.24 0.001 (0.001,0.001) -900.84 -947.66
Greece 0.001 (0.001,0.001) -1070.08 -1095.69 0.001 (0.001,0.001) -620.97 -687.79
HongKong 0.005 (0.001,0.040) -1116.20 -1104.70 0.001 (0.001,0.001) -1070.33 -1076.02
Iran 0.001 (0.001,0.002) -1046.05 -1107.53 0.001 (0.001,0.002) -945.79 -992.93
Israel 0.001 (0.001,0.001) -914.20 -959.02 0.001 (0.001,0.001) -726.50 -797.23
Japan 0.008 (0.001,0.026) -1209.16 -1193.53 0.002 (0.001,0.004) -1099.15 -1109.15
Kenya 0.104 (0.049,0.170) -1100.01 -1092.16 0.001 (0.001,0.001) -1073.55 -1096.71
Lebanon 0.001 (0.001,0.001) -761.11 -776.44 0.001 (0.001,0.001) -800.14 -850.50
Morocco 0.001 (0.001,0.001) -755.97 -839.02 0.001 (0.001,0.002) -471.85 -534.09
Mauritius 0.001 (0.001,0.002) -887.72 -926.89 0.001 (0.001,0.001) -601.12 -655.86
Myanmar 0.001 (0.001,0.001) -917.99 -979.43 0.001 (0.001,0.002) -602.14 -665.81
Nepal 0.001 (0.001,0.001) -1148.05 -1145.39 0.001 (0.001,0.001) -1027.36 -1060.00
Norway 0.016 (0.003,0.051) -1121.71 -1105.17 0.001 (0.001,0.001) -1002.83 -1004.64
Peru 0.001 (0.001,0.001) -915.67 -950.87 0.001 (0.001,0.001) -666.93 -712.80
Polish 0.001 (0.001,0.001) -1078.86 -1090.00 0.001 (0.001,0.001) -1000.76 -1063.99
Portugal 0.002 (0.001,0.010) -1035.62 -1030.57 0.001 (0.001,0.001) -800.35 -851.13
Qatar 0.001 (0.001,0.001) -879.41 -917.09 0.001 (0.001,0.001) -674.32 -701.04
Romania 0.001 (0.001,0.001) -880.49 -901.20 0.001 (0.001,0.001) -819.31 -896.17
Russia 0.001 (0.001,0.003) -1038.70 -1050.47 0.001 (0.001,0.001) -984.09 -1023.75
StHelena 0.001 (0.001,0.001) -873.70 -914.40 0.001 (0.001,0.002) -374.81 -394.04
SouthKorea 0.004 (0.001,0.019) -1149.02 -1142.78 0.001 (0.001,0.003) -1051.25 -1057.01
SriLanka 0.001 (0.001,0.001) -1086.31 -1109.74 0.001 (0.001,0.003) -842.37 -863.17
Sweden 0.136 (0.067,0.205) -1031.72 -1026.82 0.001 (0.001,0.001) -1078.30 -1089.70
Swiss 0.028 (0.005,0.063) -1055.02 -1046.17 0.001 (0.001,0.001) -835.78 -878.17
Taiwan 0.001 (0.001,0.001) -1074.92 -1085.60 0.001 (0.001,0.001) -794.34 -815.63
TrinidadTobago 0.001 (0.001,0.001) -920.81 -955.62 0.001 (0.001,0.001) -809.94 -858.50
Turkey 0.004 (0.001,0.019) -1095.99 -1091.08 0.001 (0.001,0.001) -1085.46 -1091.38
Ukraine 0.001 (0.001,0.001) -902.24 -949.78 0.001 (0.001,0.002) -709.80 -745.82
Hungary 0.001 (0.001,0.001) -935.84 -953.58 0.001 (0.001,0.001) -642.03 -717.52
Zambia 0.001 (0.001,0.003) -1075.30 -1053.65 0.001 (0.001,0.001) -746.07 -789.78
Zimbabwe 0.001 (0.001,0.002) -1130.16 -1128.73 0.001 (0.001,0.002) -658.24 -690.68
Table D.3: In and out–of–sample performance of the Markov–Switching INAR(1) with two and three regimes (K) for the Climate change and Global warming datasets under GLK, Negative Binomial (NB) and Poisson (P) distribution including the Deviance Information Criterion (DIC), Root Mean Squared Error (RMSE), 90% Credible Interval Coverage (CIcov) and 90% Credible interval width (CIwidth)
Data Distr. K In–sample Out–of–sample
DIC RMSE CIcov CIwidth
Climate Change GLK 2 1400.81 2.65 1 14.00
3 1365.68 2.68 1 14.50
NB 2 1399.19 2.68 1 13.10
3 1379.05 2.68 1 13.80
P 2 1661.23 4.36 1 16.80
3 1454.00 2.65 1 12.70
Global Warming GLK 2 1644.95 4.86 1 22.80
3 1377.39 4.67 1 21.70
NB 2 1647.50 4.80 1 21.40
3 1590.23 4.69 1 20.60
P 2 1653.30 5.35 1 16.80
3 1633.60 4.98 1 16.90
Table D.4: Estimated MS–GLK–INAR(1) autoregressive coefficient for the non–zero–inflated regime (α^\widehat{\alpha}) and the persistence of the zero–inflated state (π^11\widehat{\pi}_{11}), and Deviance Information Criteria of the MS–GLK–INAR(1) and the MS–NBINAR(1) models, for the “Climate Change” and “Global Warming” search volumes in different countries. “*” indicate the model with the largest marginal likelihood.
Country Climate Change dataset Global Warming dataset
GLK NB GLK NB
α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC
Argentina 0.089 0.345 1619.585 0.099 0.345 1780.791 0.039 0.607 1119.311 0.037 0.607 1396.970
Australia 0.598 0.509 1596.879 0.600 0.466 1602.322 0.352 0.170 2078.651 0.371 0.093 2070.579
Austria 0.090 0.108 1872.957 0.100 0.099 1933.522 0.122 0.626 1074.436 0.127 0.627 1363.791
Bangladesh 0.036 0.070 2045.797 0.043 0.071 2055.200 0.074 0.097 1965.099 0.086 0.095 2033.827
Belgium 0.148 0.169 2007.494 0.170 0.160 2039.671 0.070 0.550 1294.334 0.072 0.550 1544.908
Brazil 0.093 0.223 2053.698 0.105 0.220 2079.390 0.074 0.193 1731.327 0.079 0.188 1826.657
Canada 0.672 0.508 1327.845 0.664 0.495 1340.503 0.522 0.499 1879.075 0.524 0.472 1876.222
Colombia 0.090 0.398 1667.121 0.000 0.397 1803.969 0.192 0.553 1259.727 0.198 0.552 1498.598
Denmark 0.107 0.255 1898.035 0.116 0.249 1948.963 0.109 0.519 1462.477 0.111 0.518 1659.000
Egypt 0.066 0.244 1780.555 0.071 0.241 1891.883 0.137 0.414 1347.583 0.138 0.413 1563.709
Emirates 0.160 0.071 2074.214 0.173 0.063 2105.633 0.135 0.256 1996.096 0.143 0.248 2039.387
Ethiopia 0.104 0.153 1867.472 0.106 0.150 1924.091 0.095 0.476 1337.687 0.092 0.475 1572.360
Finland 0.187 0.314 1724.529 0.204 0.291 1784.135 0.057 0.645 1069.114 0.055 0.646 1340.069
France 0.168 0.496 1935.826 0.179 0.491 1933.377 0.114 0.244 1951.947 0.127 0.239 2007.952
Germany 0.293 0.514 1896.652 0.294 0.498 1895.353 0.048 0.098 2020.428 0.053 0.097 2025.436
Ghana 0.151 0.240 1631.274 0.163 0.221 1711.621 0.092 0.556 1253.993 0.091 0.556 1512.884
Greece 0.125 0.276 1692.857 0.143 0.273 1837.323 0.042 0.717 860.269 0.533 0.006 1225.069
HongKong 0.101 0.311 1901.765 0.114 0.307 1954.842 0.163 0.315 1781.510 0.183 0.307 1896.437
Hungary 0.128 0.405 1476.092 0.132 0.406 1663.531 0.033 0.663 931.925 0.018 0.665 1256.010
India 0.534 0.491 1714.041 0.531 0.508 1714.829 0.636 0.511 1835.222 0.635 0.505 1839.658
Indonesia 0.088 0.115 1977.859 0.099 0.108 1997.493 0.361 0.162 2108.212 0.359 0.424 2102.328
Iran 0.105 0.362 1619.937 0.111 0.361 1806.291 0.092 0.497 1391.738 0.092 0.495 1625.134
Ireland 0.395 0.114 1703.061 0.431 0.047 1702.772 0.162 0.218 1872.783 0.182 0.197 1947.321
Israel 0.059 0.499 1399.355 0.059 0.499 1623.913 0.045 0.620 1070.880 0.045 0.619 1364.546
Italy 0.186 0.069 1779.219 0.211 0.050 1780.901 0.149 0.252 1787.010 0.169 0.233 1864.037
Jamaica 0.105 0.413 1646.154 0.121 0.408 1765.751 0.068 0.573 1313.182 0.067 0.573 1557.623
Japan 0.094 0.016 2097.733 0.105 0.017 2120.688 0.071 0.264 1773.651 0.076 0.263 1887.743
Kenya 0.175 0.181 1929.977 0.183 0.178 1938.944 0.103 0.361 1655.868 0.106 0.362 1807.085
Lebanon 0.104 0.538 1165.567 0.109 0.538 1420.193 0.038 0.647 1187.075 0.036 0.649 1455.428
Malaysia 0.182 0.158 2070.368 0.187 0.155 2079.151 0.084 0.027 2027.732 0.095 0.029 2042.069
Mauritius 0.032 0.500 1326.807 0.029 0.500 1566.223 0.131 0.704 945.556 0.132 0.704 1272.587
Mexico 0.099 0.097 2041.225 0.107 0.096 2063.796 0.134 0.266 1873.485 0.142 0.263 1942.329
Morocco 0.036 0.671 1059.084 0.037 0.672 1363.009 0.114 0.832 625.621 0.127 0.832 1009.280
Table D.5: Estimated MS–GLK–INAR(1) autoregressive coefficient for the non–zero–inflated regime (α^\widehat{\alpha}) and the persistence of the zero–inflated state (π^11\widehat{\pi}_{11}), and Deviance Information Criteria of the MS–GLK–INAR(1) and the MS–NBINAR(1) models, for the “Climate Change” and “Global Warming” search volumes in different countries. “*” indicate the model with the largest marginal likelihood.
Country Climate Change dataset Global Warming dataset
GLK NB GLK NB
α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC α^\hat{\alpha} π^11\hat{\pi}_{11} DIC
Myanmar 0.096 0.488 1371.038 0.093 0.487 1609.242 0.066 0.738 813.777 0.066 0.738 1166.376
Nepal 0.109 0.136 1965.494 0.116 0.132 2023.832 0.104 0.432 1593.188 0.115 0.430 1769.996
Netherlands 0.147 0.496 1972.367 0.152 0.488 1969.887 0.055 0.315 1792.008 0.000 0.314 1890.708
NewZealand 0.362 0.492 1659.492 0.370 0.356 1662.115 0.158 0.322 1980.746 0.167 0.320 2033.827
Nigeria 0.161 0.097 1932.834 0.170 0.084 1934.833 0.106 0.122 1719.025 0.120 0.085 1766.515
Norway 0.137 0.259 1890.741 0.151 0.255 1943.169 0.067 0.413 1639.325 0.073 0.412 1794.887
Pakistan 0.252 0.099 2043.832 0.259 0.093 2043.666 0.114 0.068 2047.780 0.116 0.064 2052.311
Peru 0.123 0.465 1387.617 0.115 0.463 1602.407 0.054 0.687 937.838 0.052 0.688 1255.208
Philippine 0.497 0.493 1921.648 0.498 0.494 1917.078 0.409 0.065 2064.834 0.411 0.060 2063.957
Polish 0.069 0.229 1767.794 0.077 0.229 1882.671 0.090 0.401 1560.757 0.096 0.400 1752.885
Portugal 0.041 0.277 1727.197 0.041 0.273 1827.148 0.031 0.587 1215.954 0.030 0.586 1478.256
Qatar 0.058 0.520 1291.855 0.064 0.521 1539.752 0.042 0.554 1006.056 0.041 0.551 1307.701
Romania 0.097 0.458 1351.055 0.098 0.457 1558.692 0.053 0.626 1211.659 0.052 0.625 1477.310
Russia 0.075 0.368 1553.053 0.075 0.370 1720.946 0.082 0.449 1524.936 0.081 0.448 1715.720
Singapore 0.199 0.165 2010.912 0.207 0.158 2009.881 0.072 0.073 2002.081 0.087 0.066 2035.269
SouthAfrica 0.522 0.388 1630.689 0.530 0.435 1632.432 0.418 0.120 1570.317 0.441 0.084 1571.766
SouthKorea 0.054 0.020 2025.651 0.061 0.021 2046.260 0.067 0.415 1599.901 0.071 0.415 1755.566
Spain 0.246 0.518 1651.155 0.253 0.414 1655.710 0.126 0.312 1759.649 0.133 0.306 1850.803
SriLanka 0.050 0.345 1670.538 0.050 0.341 1818.467 0.044 0.517 1268.525 0.042 0.516 1515.381
StHelena 0.132 0.484 1338.414 0.133 0.482 1572.449 0.080 0.817 594.577 0.583 0.007 860.226
Sweden 0.160 0.262 1883.905 0.169 0.257 1899.062 0.060 0.347 1829.726 0.066 0.345 1935.821
Swiss 0.083 0.087 1888.159 0.090 0.075 1913.949 0.098 0.551 1283.276 0.098 0.550 1529.698
Taiwan 0.116 0.324 1714.102 0.119 0.322 1839.746 0.126 0.460 1245.494 0.119 0.460 1483.356
Thailand 0.132 0.022 1928.947 0.143 0.023 1948.096 0.027 0.154 2014.468 0.029 0.152 2072.000
TrinidadTobago 0.077 0.447 1359.122 0.076 0.446 1587.225 0.114 0.585 1217.544 0.109 0.586 1477.778
Turkey 0.096 0.373 1854.282 0.106 0.370 1927.291 0.135 0.480 1793.179 0.148 0.478 1905.162
UK 0.571 0.510 1696.845 0.568 0.454 1701.288 0.322 0.507 2016.025 0.324 0.498 2012.407
Ukraine 0.111 0.457 1349.152 0.113 0.458 1578.258 0.049 0.597 1002.874 0.048 0.595 1311.657
US 0.632 0.520 1623.722 0.624 0.506 1634.120 0.636 0.509 1761.454 0.632 0.511 1765.975
Vietnam 0.100 0.149 2053.407 0.109 0.147 2091.799 0.212 0.180 1777.152 0.336 0.035 1804.133
Zambia 0.102 0.262 1736.355 0.111 0.254 1849.733 0.131 0.589 1111.358 0.123 0.588 1396.448
Zimbabwe 0.170 0.308 1808.413 0.192 0.303 1913.692 0.050 0.659 862.586 0.049 0.661 1197.766