Modelling Volatility of Spatio-temporal Integer-valued Data
with Network Structure and Asymmetry
Abstract
This paper proposes a spatial threshold GARCH-type model for dynamic spatio-temporal integer-valued data with network structure. The proposed model can simplify the parameterization by using network structure in data, and can capture the asymmetric property in dynamic volatility by adopting a threshold structure. The proposed model assumes the conditional distribution is Poisson distribution. Asymptotic theory of maximum likelihood estimation (MLE) for the spatial model is derived when both sample size and network dimension are large. We obtain asymptotic statistical inferences via investigation of the weak dependence of components of the model and application of limit theorems for weakly dependent random fields. Simulation studies and a real data example are presented to support our methodology.
Keywords: Conditional variance/mean, high-dimensional count time series, asymmetry of volatility, spatial threshold GARCH, network structure.
MSC 2020 Subject Classification: Primary 62M10, 91B05; Secondary 60G60, 60F05.
1 Introduction
Integer-valued time series can be observed in a wide range of scientific fields, such as the yearly trading volume of houses on real estate market (De Wit et al., , 2013), number of transactions of stocks (Jones et al., , 1994), daily mortality from Covid-19 (Pham, , 2020) and daily new cases of Covid-19 (Jiang et al., , 2022). A first idea to model integer-valued time series is using a simple first-order autoregressive model (AR)
(1.1) |
where is a parameter. However in (1.1) is not necessarily an integer given integer-valued and , due to the multiplication structure . To circumvent such problem, McKenzie, (1985) and Al-Osh and Alzaid, (1987) proposed an integer-valued counterpart of AR model (INAR). They replaced the ordinary multiplication by the binomial thinning operation , where . This was ground-breaking and led to various extensions of thinning-based linear models including integer-valued moving average model (INMA) (Al-Osh and Alzaid, , 1988) and INARMA model (McKenzie, , 1988). An alternative approach to the multiplication problem is to consider the regression of the conditional mean , where is the -algebra generated by historical information up to . Based on this idea, integer-valued GARCH-type models (INGARCH) were proposed by Heinen, (2003), Ferland et al., (2006) and Fokianos et al., (2009) with conditional Poisson distribution of , i.e.
(1.2) | ||||
where parameters satisfy . Other variations of INGARCH models with different specifications of conditional distribution include negative binomial INGARCH (Zhu, , 2010; Xu et al., , 2012) and generalized Poisson INGARCH (Zhu, , 2012).
Integer-valued models mentioned above are all limited to one-dimensional time series. The development of multivariate integer-valued GARCH-type models is still at its early stage. For example, the bivariate INGARCH models (Lee et al., , 2018; Cui and Zhu, , 2018; Cui et al., , 2020), multivariate INGARCH models (Fokianos et al., , 2020; Lee et al., , 2023) and some counterparts of the continuous-valued network GARCH model (Zhou et al., , 2020) are on fixed low-dimensional time series of counts. For high-dimensional integer-valued time series with increasing dimension, there is the Poisson network autoregressive model (PNAR) by Armillotta and Fokianos, (2024). The PNAR allows for integer-valued time series with increasing network dimension. However, it adopted a ARCH-type structure without considering the autoregressive term on the conditional mean/variance to describe persistence in volatility, and it can not capture asymmetric characteristics of volatility. Tao et al., (2024) proposed a grouped PNAR model which has a GARCH structure, but its network dimension is fixed and not suitable to spatio-temporal integer-valued data. In this paper, we propose a Poisson threshold network GARCH model (PTNGARCH) and discuss its asymptotic inferences. The specific contributions are as follows.
-
1.
A threshold structure is designed so that it is able to capture asymmetric property of high-dimensional volatility for integer-valued data. The threshold effect can also be tested under such a framework.
-
2.
Our PTNGARCH includes an autoregressive term on the conditional mean/variance so that it provides a parsimonious description of dynamic volatility persistence of high-dimensional integer-valued time series.
-
3.
Asymptotic theory, when both sample size () and network dimension () are large, of maximum likelihood estimation for the proposed model is established by the limit theorems for arrays of weakly dependent random fields in Pan and Pan, (2024). This enables the proposed model to be suitable for fitting dynamic spatio-temporal integer-valued data (or the so-called ultra-high dimensional integer-valued time series).
The remainder of this paper is arranged as follows. The PTNGARCH model is introduced and its stationarity over time is discussed under fixed network dimension in section 2. Section 3 presents the MLE for parameters, including the threshold, and the consistency and asymptotic normality for estimates of coefficients under large sample size () and large network dimension (). Section 4 gives a Wald test which can be applied to testing of the existence of threshold effect (i.e. asymmetric property), GARCH effect and network effect. Section 5 conducts simulation studies to verify the asymptotic properties of the MLE, and applies the proposed model to daily numbers of car accidents that occurred in 41 neighbourhoods in New York City, with interpretation of the results of analysis. All proofs of theoretical results are postponed to the appendix.
2 Model setting and stationarity over time
Consider an non-directed and weightless network with nodes. Define adjacency matrix , where if there is a connection between node and , otherwise . In addition, self-connection is not allowed for any node by letting . As an interpretation of the network structure, is symmetric since , hence for any node , the out-degree is equal to the in-degree , and we use to denote both for convenience. To embed a network into statistical models, it is often convenient to use the row normalized adjacency matrix with its element .
For any node in this network, let be an non-negative integer-valued observation at time , and denotes the -algebra consisting of all available information up to . In our Poisson threshold network GARCH model, for each and , is assumed to follow a conditional (on ) Poisson distribution with -varying variance (mean) . A PTNGARCH(1,1) model has the following form:
(2.1) | ||||
The threshold parameter is a positive integer, and denotes an indicator function. To assure the positiveness of conditional variance, we need to assume positiveness of the base parameter , and non-negativeness of all the coefficients , , , .
Remark.
Notice that in (2.1) we model the dynamics of conditional mean , which is the reason why the name “Poisson autoregression” is sometimes used in the literature (Fokianos et al., , 2009; Wang et al., , 2014); Some authors still use the name “GARCH” since the mean is equal to the variance under Poisson distribution, and the dynamics of conditional mean are GARCH-like.
Let be independent Poisson processes with unit intensities. Depending on , can be interpreted as a Poisson distributed random variable , which is the number of occurrences during the time interval , i.e. We could rewrite (2.1) in a vectorized form as follows:
(2.2) |
where
Note that with dimension and . For a fixed dimension , the stationarity condition of time series can be obtained. The proof of the following theorem is given in the appendix.
3 Parameter estimation with and
Denote parameter vector of model (2.1) by with . Let and . For a given threshold , the reasonable parameter space for is a sufficiently large compact subset of , such as with sufficient small positive real number .
Let be the index set. Suppose that the samples are generated by model (2.1) with respect to true parameters .
The log-likelihood function (ignoring constants) is
(3.1) |
where is generated from model (2.1) as
(3.2) | ||||
In practice, (3.1) can not be evaluated without knowing the true values of for . We need to approximate (3.1) by (3.3) below, using specified initial values :
(3.3) |
Therefore, the maximum likelihood estimate (MLE) of parameter is defined as
(3.4) |
However, the solution that maximizes the target function can not be directly obtained by solving since is discrete-valued, therefore the partial derivative of w.r.t. is invalid. According to Wang et al., (2014), such optimization problem with integer-valued parameter could be broken up into two steps as follows:
-
1.
Find
for each in a predetermined range ;
-
2.
Find
Then would be the maximizer of .
Assumption 3.1 below is a regular condition on the parameter space. Assumptions 3.2 and 3.3 are necessary for obtaining -weak dependence of and their derivatives. For the definition of -weak dependence for random fields, see Pan and Pan, (2024). Then the consistency of MLE in Theorem 2 can be proved based on the law of large numbers (LLN) of -weakly dependent arrays of random fields in Pan and Pan, (2024).
Assumption 3.1.
For a given threshold ,
-
(a)
The parameter space for is a compact subset of and includes the true parameter as its interior point;
-
(b)
Condition (2.3) holds.
Assumption 3.2.
-
(a)
for some ;
-
(b)
The array of random fields is -weakly dependent with coefficient for some .
Assumption 3.3.
For any and , there exist constants and such that That is, the power of connection between two nodes and decays as the distance grows.
Theorem 2.
Since is an integer-valued consistent estimate of , will eventually be equal to when the sample size becomes sufficiently large. Therefore, is asymptotically equal to . In this way, the problem of investigating the asymptotic distribution of degenerates to investigating the asymptotic distribution of .
Theorem 3.
4 Hypothesis testing with and
Based on Theorem 2 and Theorem 3, for sufficiently large sample region such that , we are able to design a Wald test for the null hypothesis
(4.1) |
where is an matrix with rank ( ) and is an -dimensional vector. For example, to test the existence of threshold effect, simply let and , and the null hypothesis (4.1) becomes
To see if the autoregressive term is necessary (i.e. GARCH effect), take and , and the hypothesis becomes
To test the network effect, just let and , and the question becomes testing the hypothesis
5 Simulation studies and empirical data analysis
5.1 Simulation studies
We intend to use four different mechanisms of simulating the network structure in model (2.1). The network structure in Example 5.1 satisfies Assumption 3.3. Simulation mechanisms in Examples 5.2 – 5.4 are for testing the robustness of our estimation, against network structures that may violate Assumption 3.3.
Example 5.1.
(D-neighbourhood network) For each node , it is connected to node only if is inside ’s -neighbourhood. That is, in the adjacency matrix, if and otherwise. Figure 1(a) is a visualization of such a network with and .
Example 5.2.
(Random network) For each node , we generate from uniform distribution , and then draw samples randomly from to form a set ( denotes the integer part of ). could be generated by letting if and otherwise. In a network simulated with such mechanism, as it is indicated in Figure 1(b), there is no significantly influential node (i.e. node with extremely large in-degree).
Example 5.3.
(Network with power-law) According to Clauset et al., (2009), for each node in such a network, is generated the same way as in Example 5.2. Instead of uniformly selecting samples from , these samples are collected w.r.t. probability where is generated from a discrete power-law distribution with scaling parameter . As shown in Figure 1(c), a few nodes have much larger in-degrees while most of them have less than 2. Compared to Example 5.2, network structure with power-law distribution exhibits larger gaps between the influences of different nodes. This type of network is suitable for modeling social media such as Twitter and Instagram, where celebrities have huge influence while the ordinary majority has little.
Example 5.4.
(Network with K-blocks) As it was proposed in Nowicki and Snijders, (2001), in a network with stochastic block structure, all nodes are divided into blocks and nodes from the same block are more likely to be connected comparing to those from different blocks. To simulate such structure, these nodes are randomly divided into groups by assigning labels to every nodes with equal probability. For any two nodes and from the same group, let while for those two from different groups, . Hence, it is very unlikely for nodes to be connected across groups. Our simulated network successfully mimics this characteristic as Figure 1(d) shows clear boundaries between groups. Block network also has its advantage in practical perspective. For instance, the price of one stock is highly relevant to those in the same industry sector.
Set the true parameters of the data generating process (2.1). As for the sample region , let increases from 200 to 2000, while also increases at relatively slower rates of and respectively, as it is showed in the following table:
200 | 500 | 1000 | 2000 | |
---|---|---|---|---|
14 | 22 | 31 | 44 | |
37 | 80 | 144 | 263 |
For each network size , the adjacency matrix is simulated according to four different mechanisms in Example 5.1 to Example 5.4.
Remark.
Particularly in the empirical analysis we will study the dataset of car collisions across different neighbourhoods that are distributed on five boroughs of New York City. These boroughs are separated by rivers (except for Brooklyn and Queens), and neighbourhoods within the same borough are more likely to share a borderline while cross-borough connections are very rare. Therefore the network constructed with New York City neighbourhoods follows the block structure in Example 5.4 with and .
Based on a simulated network, the data is generated according to (2.1), and the true parameters are estimated by the MLE (3.4). To monitor the finite performance of MLE, data generation and parameter estimation are repeated for times, for each combination of sample size . The -th replication produces the estimates and . We use the following two measurements to evaluate the performance of simulation results:
-
•
Root-mean-square error: ,
-
•
Coverage probability: ,
where represents th component of , is the 95% confidence interval defined as
Here the estimated standard error is the square root of -th diagonal element of , and is the 0.975th quantile of standard normal distribution. To eliminate the effect of starting points, a different initial values of is used for each . RMSEs and CPs with different sample sizes and network simulation mechanisms are reported in Tables 1 and 2; We also report the mean estimates of the threshold in the last columns of both tables.
Example 5.1 | 200 | 14 | 0.0696 (0.94) | 0.0203 (0.94) | 0.0278 (0.93) | 0.0170 (0.95) | 0.0256 (0.93) | 5.028 |
500 | 22 | 0.0367 (0.96) | 0.0100 (0.95) | 0.0138 (0.95) | 0.0101 (0.93) | 0.0127 (0.95) | 5 | |
1000 | 31 | 0.0238 (0.95) | 0.0058 (0.95) | 0.0081 (0.95) | 0.0062 (0.97) | 0.0074 (0.95) | 5 | |
2000 | 44 | 0.0153 (0.95) | 0.0035 (0.95) | 0.0047 (0.95) | 0.0041 (0.96) | 0.0045 (0.95) | 5 | |
Example 5.2 | 200 | 14 | 0.0454 (0.95) | 0.0200 (0.95) | 0.0264 (0.94) | 0.0119 (0.96) | 0.0245 (0.94) | 5.045 |
500 | 22 | 0.0284 (0.95) | 0.0101 (0.95) | 0.0134 (0.95) | 0.0072 (0.94) | 0.0126 (0.95) | 5.002 | |
1000 | 31 | 0.0162 (0.97) | 0.0059 (0.96) | 0.0077 (0.97) | 0.0044 (0.94) | 0.0074 (0.95) | 5 | |
2000 | 44 | 0.0112 (0.96) | 0.0034 (0.96) | 0.0047 (0.95) | 0.0029 (0.94) | 0.0043 (0.96) | 5 | |
Example 5.3 | 200 | 14 | 0.0511 (0.96) | 0.0200 (0.95) | 0.0272 (0.94) | 0.0131 (0.95) | 0.0246 (0.95) | 5.034 |
500 | 22 | 0.0349 (0.95) | 0.0102 (0.95) | 0.0135 (0.96) | 0.0084 (0.95) | 0.0127 (0.96) | 5.001 | |
1000 | 31 | 0.0146 (0.95) | 0.0060 (0.95) | 0.0079 (0.95) | 0.0038 (0.95) | 0.0077 (0.94) | 5 | |
2000 | 44 | 0.0104 (0.95) | 0.0035 (0.95) | 0.0048 (0.94) | 0.0025 (0.95) | 0.0043 (0.96) | 5 | |
Example 5.4 | 200 | 14 | 0.0882 (0.95) | 0.0205 (0.95) | 0.0273 (0.95) | 0.0227 (0.94) | 0.0256 (0.93) | 5.013 |
500 | 22 | 0.0379 (0.94) | 0.0102 (0.95) | 0.0136 (0.95) | 0.0096 (0.95) | 0.0124 (0.95) | 5 | |
1000 | 31 | 0.0218 (0.95) | 0.0060 (0.95) | 0.0078 (0.95) | 0.0055 (0.95) | 0.0073 (0.96) | 5 | |
2000 | 44 | 0.0118 (0.94) | 0.0035 (0.96) | 0.0047 (0.95) | 0.0029 (0.95) | 0.0043 (0.96) | 5 |
Example 5.1 | 200 | 37 | 0.0537 (0.95) | 0.0124 (0.95) | 0.0164 (0.95) | 0.0143 (0.94) | 0.0158 (0.94) | 5.002 |
500 | 80 | 0.0287 (0.96) | 0.0054 (0.94) | 0.0071 (0.95) | 0.0078 (0.95) | 0.0066 (0.95) | 5 | |
1000 | 144 | 0.0201 (0.95) | 0.0029 (0.94) | 0.0040 (0.93) | 0.0055 (0.95) | 0.0036 (0.94) | 5 | |
2000 | 263 | 0.0136 (0.95) | 0.0015 (0.94) | 0.0019 (0.95) | 0.0038 (0.95) | 0.0019 (0.93) | 5 | |
Example 5.2 | 200 | 37 | 0.0347 (0.95) | 0.0121 (0.95) | 0.0170 (0.95) | 0.0089 (0.95) | 0.0161 (0.93) | 5.008 |
500 | 80 | 0.0140 (0.95) | 0.0053 (0.95) | 0.0070 (0.95) | 0.0035 (0.95) | 0.0066 (0.95) | 5 | |
1000 | 144 | 0.0073 (0.95) | 0.0029 (0.93) | 0.0036 (0.95) | 0.0020 (0.94) | 0.0036 (0.93) | 5 | |
2000 | 263 | 0.0041 (0.95) | 0.0014 (0.95) | 0.0020 (0.94) | 0.0011 (0.95) | 0.0018 (0.96) | 5 | |
Example 5.3 | 200 | 37 | 0.0385 (0.95) | 0.0124 (0.94) | 0.0168 (0.95) | 0.0092 (0.95) | 0.0152 (0.95) | 5.003 |
500 | 80 | 0.0144 (0.95) | 0.0054 (0.95) | 0.0071 (0.94) | 0.0036 (0.95) | 0.0067 (0.95) | 5 | |
1000 | 144 | 0.0073 (0.94) | 0.0029 (0.94) | 0.0035 (0.96) | 0.0019 (0.94) | 0.0035 (0.95) | 5 | |
2000 | 263 | 0.0037 (0.95) | 0.0015 (0.95) | 0.0019 (0.96) | 0.0009 (0.95) | 0.0018 (0.95) | 5 | |
Example 5.4 | 200 | 37 | 0.0498 (0.95) | 0.0120 (0.95) | 0.0165 (0.94) | 0.0129 (0.94) | 0.0148 (0.96) | 5.011 |
500 | 80 | 0.0176 (0.94) | 0.0055 (0.94) | 0.0071 (0.94) | 0.0045 (0.94) | 0.0069 (0.94) | 5 | |
1000 | 144 | 0.0083 (0.97) | 0.0028 (0.95) | 0.0036 (0.96) | 0.0022 (0.96) | 0.0034 (0.95) | 5 | |
2000 | 263 | 0.0048 (0.95) | 0.0015 (0.95) | 0.0019 (0.95) | 0.0012 (0.96) | 0.0019 (0.95) | 5 |
From Tables 1 and 2 it can be seen that the RMSEs of decrease asymptotically toward zero, and the mean of is equal to for sufficiently large sample size. These results support the consistency of MLE (3.4) in Theorem 2. The reported CPs are close to the theoretical value . This shows that provides a reliable estimation of the true standard error of . Moreover, normal Q-Q plots for the estimation results, Figures 2 to 5, are presented when and respectively, under different network structures. These Q-Q plots provide additional evidence for the asymptotic normality of in Theorem 3.








5.2 Analysis of daily numbers of car accidents in New York City
New York City Police Department (NYPD) publishes and updates regularly the detailed data of motor vehicle collisions that have occurred city-wide. These data are openly accessible on NYPD website111https://www1.nyc.gov/site/nypd/stats/traffic-data/traffic-data-collision.page and contain sufficient information for us to apply our model. We collect all records from 16th February 2021 to 30th June 2022, each record includes the date when an accident happened, and the zip code of where it happened. We classified all records into 41 neighbourhoods according to the correspondence between zip codes and the geometric locations they represent. Re-grouping the data by neighbourhoods and dates of occurrence, we obtain a high-dimensional time series with dimension and sample size .
Two neighbourhoods are regarded as connected nodes if they share a borderline. Therefore, based on the geometric information provided by the data, we are able to construct a reasonable network with 41 nodes, which is visualized in Figure 6. In Figure 7 we plot histograms of daily numbers of car accidents in 9 randomly selected neighbourhoods. The shapes of histograms show potential Poisson distribution. Moreover, in Figure 8 we can easily observe volatility clustering in daily numbers of car accident in four selected neighbourhoods of NYC, indicating potential autoregressive structure in the conditional heteroscedasticity of the data.



Our model was fitted to this dataset by the method proposed in Section 3. The results of parameter estimation are reported in Table 3 below.
Estimation | 0.018693 | 0.126472 | 0.135026 | 0.002727 | 0.862244 | 10 |
SE | 4.12e-03 | 4.40e-03 | 4.68e-03 | 1.09e-03 | 4.73e-03 | \ |
Now we try to interpret these results. Firstly, it is worthy noting that is slightly smaller than , which means that the conditional variance of the number of car accidents in these neighbourhoods are less affected by previous day’s number if it is above the threshold . Secondly, the volatility in the number of car accidents in one area is also affected by its geometrically neighboured areas. In addition, the estimated value of is significantly larger than other coefficients, indicating a strong persistence in volatility that leads to volatility clustering.
At last, we utilize the Wald test in Section 4 to investigate the following important properties of this real dataset, and these properties provide substantial evidence to support our model setting:
(i) The existence of threshold effect (i.e asymmetric property) for volatility. The null hypothesis is (by taking and in (4.1)). In this case, the Wald statistic (4.2) , which suggests the rejection of at significant level below according to Theorem 4. This testing shows that the proposed model with threshold is essentially useful for capturing the nature of daily numbers of car accidents in New York City.
(ii) The existence of GARCH effect (persistence in volatility). For the null hypothesis is , the Wald statistic (4.2) and -value is very close . This strongly suggests that the autoregressive term (i.e. ) should be included in the model. This can be seen from the large value of estimate of .
(iii) The existence of network effect. The Wald statistic (4.2) and -value is in this case. This indicate that there is significant network effect among the daily numbers of car accidents.
6 Conclusion
We have proposed a model to describe spatio-temporal integer-valued data which are observed at nodes of a network and have asymmetric property. Although the proposed model is called GARCH-type for volatility (conditional variance), it also can be used to model the conditional mean, because we assume that the conditional distribution is Poisson distribution. Asymptotic inferences, parameter estimation and hypothesis testing of the proposed model, are discussed by applying limit theorems for nonstationary arrays of random fields. Simulations for different network structures and application to a real dataset show that our methodology is useful for modelling dynamic spatio-temporal integer-valued data.
Appendix A Proofs of theoretical results
In this appendix, we give details of proofs for our theoretical results.
Lemma A.1.
If , and for all , then
(A.1) |
with probability one for all and , where .
Proof.
When , (A.1) obviously holds. Now we consider the case when . Let if and 0 otherwise, . By Jensen’s inequality we have
By Lemma 2.2 in Berkes et al., (2003) we have for any . Therefore, by the Borel-Cantelli lemma, almost surely. Letting , we can prove that the right-hand-side of (A.1) converges almost surely.
A.1 Proof of Theorem 1
Our proof of Theorem 1 relies on the arguments given by Ferland et al., (2006) in their proof of Corollary 1. Let
where are IID positive random variables with mean 1. For each , we define and through following recursion:
(A.2) | ||||
Claim A.1.
is strictly stationary for each .
Proof.
Since are independent Poisson processes with unit intensity, then for any and we have
(A.3) | ||||
When , is -invariant for any and , by (A.3) and the IID of . Therefore is strictly stationary. Assume that and are strictly stationary, then is also strictly stationary since . According to (A.3) and the strict stationarity of , we have being strictly stationary too. Claim A.1 can be proved by induction.
∎
Let for vector . In following claim we prove the convergence of as .
Claim A.2.
for some constants and .
Proof.
Since is a Poisson process with unit intensity, is Poisson distributed with parameter assuming that . Then it is easy to verify that
Recall that, from (A.2),
Then
(A.4) | ||||
To estimate the right-side terms of the above inequality, we define function for . For any , we have
-
•
If and , we have where ;
-
•
If and , we have .
-
•
If and are on different sides of , we assume that and without loss of generality. Notice that
When , we see
When , we see
Combining above cases, we obtain that
(A.5) |
for any .
Therefore,
(A.6) | ||||
for , where is the -th element of .
Combining (A.4) and (A.6), we get
where denotes the spectral radius, and the last inequality is due to the Gershgorin circle theorem. According to (2.3), we can find such that
for .
∎
By Claim A.2,
This implies that and
according to the Borel-Cantelli lemma. This means that, with probability one, there exists such that for all , equals to some with integer components. That is, exists almost surely. Apparently, is strictly stationary since is strictly stationary for each , according to Claim A.1.
A.2 Proof of Theorem 2
From Lemma A.1, we have
and
(A.7) |
with probability one, where . Given initial values for , we can replace with and get
for and . Hence
(A.8) |
Now we are ready to prove the consistency of when and . The proof is broken up into Claim A.3 to Claim A.6 below: Claim A.3 shows that the choice of initial values is asymptotically negligible; Claims A.4 and A.5 verify the weak dependence of , and facilitate the adoption of LLN; Claim A.6 is concerned with the identifiability of the true parameters .
Claim A.3.
For any , as and .
Proof.
In the following proofs, for a random variable , we denote its -norm by .
Claim A.4.
The functions are uniformly -bounded for some , i.e.
Proof.
Claim A.5.
For any element in the parameter space satisfying Assumption 3.1, the array of random fields is -weakly dependent with coefficient where .
Proof.
Claim A.6.
for all if and only if .
Proof.
The if part is obvious, it remains for us to prove the only if part. Observe that
where stands for the back-shift operator in the sense that , and represents either or according to the value of at time . Therefore we have
The polynomial has a root , which lies outside the unit circle since . Therefore the inverse is well-defined for any , and we have
with As for each ,
We can deduce from above equation that for any , otherwise will be degenerated to a deterministic vector given . implies that
The diagonal elements of are all zeros while the matrix on the left side of above equation has non-zero diagonal elements, so we have
which imply , and . Besides, can be easily derived from . ∎
With Claim A.4 and Claim A.5, we can apply Theorem 1 in Pan and Pan, (2024) and obtain that
(A.11) |
for any . Therefore, we have
(A.12) | ||||
with equality if and only if for all , which is equivalent to by Claim A.6.
A.3 Proof of Theorem 3
With a fixed threshold parameter , we rewrite , and etc., in succeeding proofs for simplicity of notations. To prove the asymptotic normality, we need to derive some intermediate results regarding the first, second and third order derivatives of .
Since
almost surely, the partial derivative of are
(A.14) | |||
where
Note that
(A.15) |
where .
For the second order derivatives, we get that, for any ,
and
(A.16) | |||
We also have
(A.17) |
where .
For the third order derivatives of ,
(A.18) | |||
Based on the consistency of , we are now ready to prove asymptotic normality. We split the proof into Claim A.7 to Claim A.10 below.
Claim A.7.
For any , as and .
Proof.
Note that
(A.19) |
where
and similarly
(A.20) |
Therefore, we have
But, by Assumption 3.2(a) and (A.8), we have
(A.21) | ||||
when and . Then, in view of (A.15),
(A.22) | ||||
when and . In light of (A.21) and (A.22), we can prove that
The proofs regarding partial derivatives w.r.t. , , and follow similar arguments and are therefore omitted here. ∎
Claim A.8.
For any , as and .
Proof.
For any , we have
(A.23) | ||||
and
(A.24) | ||||
Note that
(A.25) | ||||
We will handle above two terms separately next.
For the first term on the right-hand-side of (A.25), we see
(A.26) | ||||
Analogous to the proof of (A.21), we can show that and as and . Using (A.17), we can also verify that
Then as well. Similarly, using (A.15), we obtain that and .
Claim A.9.
-
(a)
for some ;
-
(b)
For each such that , are -weakly dependent, with dependence coefficient where .
Proof.
Now we verify (b). In the proof of Claim A.5, for each and , we defined such that if and only if . At first, we verify that satisfies condition (2.7) in Pan and Pan, (2024). Note that
(A.28) | ||||
and
where
Following analogous arguments in (A.10), we obtain that
(A.29) | ||||
Combining (A.10), (A.28) and (A.29), we can verify that satisfies condition (2.7) in Pan and Pan, (2024) with and . Partial derivatives of with respect to other parameters in follows similarly. Therefore satisfies condition (2.7) in Pan and Pan, (2024) with and for each .
According to Proposition 2 and Example 2.1 in Pan and Pan, (2024), the array of random fields is -weakly dependent with coefficient . But because . So (b) is verified. ∎
Claim A.10.
-
(a)
for some ;
-
(b)
With respect to all , are -weakly dependent, with dependence coefficient where .
Proof.
Following the same idea as in previous proofs, for each and , we define such that if and only if . To prove (b), we verify that satisfies condition (2.7) in Pan and Pan, (2024). Firstly we have
(A.30) | ||||
Taking the second order derivative with respect to and as an example, analogous to (A.10) and (A.29), we get
(A.31) | ||||
Proofs regarding second order derivatives with respect to other parameters follow similar arguments and then are omitted here. Substituting (A.10), (A.29) and (A.31) to (A.30), we have that satisfies condition (2.7) in Pan and Pan, (2024) with and .
According to Proposition 2 and Example 2.1 in Pan and Pan, (2024), the array of random fields is -weakly dependent with coefficient , and .
∎
By the Taylor expansion, for some between and we have
Since , we have
(A.32) | ||||
Note that is Poisson distributed with mean conditioning on historical information , with being IID Poisson point processes with intensity 1. Therefore we have
By Claim A.10, we apply Theorem 1 in Pan and Pan, (2024) and obtain that
(A.33) |
According to condition (3.5) we can further prove that
(A.34) |
When or , we have
assuming . Then we can verify that
For each , By (3.5) and the symmetry of , it is implied that
Then, by Claim A.9 and Theorem 2 in Pan and Pan, (2024), we can prove that
According to the Cramér-Wold theorem, we have
(A.35) |
Combining (A.32), (A.34) and (A.35), we complete the proof of Theorem 3.
A.4 Proof of Theorem 4
Recalling from (4.2), the Wald statistic is
where
It is sufficient to show that
(A.36) |
Note that
Similar to the proof of Claim A.10, we can verify that the LLN Theorem 1 in Pan and Pan, (2024) applies to and therefore . can be further decomposed as follows
But, since . The proof of is similar to the proof of (A.26), therefore omitted here. So the proof of Theorem 4 is completed.
References
- Al-Osh and Alzaid, (1987) Al-Osh, M. A. and Alzaid, A. A. (1987). First-order integer-valued autoregressive (INAR (1)) process. Journal of Time Series Analysis, 8(3):261–275.
- Al-Osh and Alzaid, (1988) Al-Osh, M. A. and Alzaid, A. A. (1988). Integer-valued moving average (INMA) process. Statistical Papers, 29:281–300.
- Armillotta and Fokianos, (2024) Armillotta, M. and Fokianos, K. (2024). Count network autoregression. Journal of Time Series Analysis, 45(4):584–612.
- Berkes et al., (2003) Berkes, I., Horváth, L., and Kokoszka, P. (2003). GARCH processes: structure and estimation. Bernoulli, 9(2):201–227.
- Clauset et al., (2009) Clauset, A., Shalizi, C. R., and Newman, M. E. J. (2009). Power-law distributions in empirical data. SIAM Review, 51(4):661–703.
- Cui et al., (2020) Cui, Y., Li, Q., and Zhu, F. (2020). Flexible bivariate Poisson integer-valued GARCH model. Annals of the Institute of Statistical Mathematics, 72(6):1449–1477.
- Cui and Zhu, (2018) Cui, Y. and Zhu, F. (2018). A new bivariate integer-valued GARCH model allowing for negative cross-correlation. Test, 27:428–452.
- De Wit et al., (2013) De Wit, E. R., Englund, P., and Francke, M. K. (2013). Price and transaction volume in the Dutch housing market. Regional Science and Urban Economics, 43(2):220–241.
- Ferland et al., (2006) Ferland, R., Latour, A., and Oraichi, D. (2006). Integer-valued GARCH process. Journal of time series analysis, 27(6):923–942.
- Fokianos et al., (2009) Fokianos, K., Rahbek, A., and Tjøstheim, D. (2009). Poisson autoregression. Journal of the American Statistical Association, 104(488):1430–1439.
- Fokianos et al., (2020) Fokianos, K., Støve, B., Tjøstheim, D., and Doukhan, P. (2020). Multivariate count autoregression. Bernoulli, 26(1):471 – 499.
- Heinen, (2003) Heinen, A. (2003). Modelling time series count data: an autoregressive conditional Poisson model. Available at SSRN 1117187.
- Jiang et al., (2022) Jiang, F., Zhao, Z., and Shao, X. (2022). Modelling the covid-19 infection trajectory: A piecewise linear quantile trend model. Journal of Royal Statistical Society (Series B), 84(5):1589–1607.
- Jones et al., (1994) Jones, C. M., Kaul, G., and Lipson, M. L. (1994). Transactions, volume, and volatility. The Review of Financial Studies, 7(4):631–651.
- Lee et al., (2023) Lee, S., Kim, D., and Kim, B. (2023). Modeling and inference for multivariate time series of counts based on the INGARCH scheme. Computational Statistics & Data Analysis, 177:107579.
- Lee et al., (2018) Lee, Y., Lee, S., and Tjøstheim, D. (2018). Asymptotic normality and parameter change test for bivariate Poisson INGARCH models. Test, 27:52–69.
- McKenzie, (1985) McKenzie, E. (1985). Some simple models for discrete variate time series. Journal of the American Water Resources Association, 21(4):645–650.
- McKenzie, (1988) McKenzie, E. (1988). Some ARMA models for dependent sequences of Poisson counts. Advances in Applied Probability, 20(4):822–835.
- Nowicki and Snijders, (2001) Nowicki, K. and Snijders, T. A. B. (2001). Estimation and prediction for stochastic block structures. Journal of the American Statistical Association, 96(455):1077–1087.
- Pan and Pan, (2024) Pan, Y. and Pan, J. (2024). Limit theorems for weakly dependent non-stationary random field arrays and asymptotic inference of dynamic spatio-temporal models. Available at arXiv:2408.07429.
- Pham, (2020) Pham, H. (2020). On estimating the number of deaths related to Covid-19. Mathematics, 8(5):655.
- Tao et al., (2024) Tao, Y., Li, D., and Niu, X. (2024). Grouped network Poisson autoregressive model. Statistica Sinica, 34:1603–1624.
- Wang et al., (2014) Wang, C., Liu, H., Yao, J.-F., Davis, R. A., and Li, W. K. (2014). Self-excited threshold Poisson autoregression. Journal of the American Statistical Association, 109(506):777–787.
- Xu et al., (2012) Xu, H.-Y., Xie, M., Goh, T. N., and Fu, X. (2012). A model for integer-valued time series with conditional overdispersion. Computational Statistics & Data Analysis, 56(12):4229–4242.
- Zhou et al., (2020) Zhou, J., Li, D., Pan, R., and Wang, H. (2020). Network GARCH model. Statistica Sinica, 30:1–18.
- Zhu, (2010) Zhu, F. (2010). A negative binomial integer-valued GARCH model. Journal of Time Series Analysis, 32(1):54–67.
- Zhu, (2012) Zhu, F. (2012). Modeling overdispersed or underdispersed count data with generalized Poisson integer-valued GARCH models. Journal of Mathematical Analysis and Applications, 389(1):58–71.