Exponential GARCH-Itô Volatility Models

Donggyu Kim
College of Business,
Korea Advanced Institute of Science and Technology (KAIST)

Abstract

This paper introduces a novel Itô diffusion process to model high-frequency financial data, which can accommodate low-frequency volatility dynamics by embedding the discrete-time non-linear exponential GARCH structure with log-integrated volatility in a continuous instantaneous volatility process. The key feature of the proposed model is that, unlike existing GARCH-Itô models, the instantaneous volatility process has a non-linear structure, which ensures that the log-integrated volatilities have the realized GARCH structure. We call this the exponential realized GARCH-Itô (ERGI) model. Given the auto-regressive structure of the log-integrated volatility, we propose a quasi-likelihood estimation procedure for parameter estimation and establish its asymptotic properties. We conduct a simulation study to check the finite sample performance of the proposed model and an empirical study with 50 assets among the S&P 500 constituents. The numerical studies show the advantages of the newly proposed model.

Keywords: High-frequency financial data, non-linear GARCH, stochastic differential equation, volatility estimation and prediction

1 Introduction

In financial practice, volatility plays a pivotal role. Low-frequency and high-frequency financial data are widely used to analyze volatility dynamics. For example, generalized auto-regressive conditional heteroskedasticity (GARCH) models were introduced to capture low-frequency volatility dynamics, such as volatility clustering, by employing the squared low-frequency log-return as the innovation (Bollerslev, 1986; Engle, 1982). However, when the volatility changes rapidly, it is often difficult to capture the change using only the low-frequency log-returns as the innovations (Andersen et al., 2003). On the other hand, high-frequency financial data are available to construct the so-called realized volatility for estimating daily integrated volatility. Examples include two-time scale realized volatility (TSRV) (Zhang et al., 2005), multi-scale realized volatility (MSRV) (Zhang, 2006), kernel realized volatility (KRV) (Barndorff-Nielsen et al., 2008), quasi-maximum likelihood estimator (QMLE) (Aït-Sahalia et al., 2010; Xiu, 2010), pre-averaging realized volatility (PRV) (Jacod et al., 2009), and robust pre-averaging realized volatility (Fan and Kim, 2018; Shin et al., 2021). These realized volatility estimators contain high-frequency information about the market volatility, and many studies show that incorporating high-frequency information helps account for low-frequency market dynamics (Corsi, 2009; Hansen et al., 2012; Kim and Wang, 2016; Shephard and Sheppard, 2010). Several conditional volatility models have been developed to combine high-frequency and low-frequency data and enhance volatility estimation and prediction by employing realized volatility as the volatility proxy.
Examples include the realized volatility based modeling approaches (Andersen and Bollerslev, 1997a, 1997b, 1998a, 1998b; Andersen et al., 2003), the heterogeneous auto-regressive (HAR) models (Corsi, 2009), the realized GARCH models (Hansen et al., 2012), the high-frequency based volatility (HEAVY) models (Shephard and Sheppard, 2010), and the unified GARCH/SV-Itô models (Kim and Fan, 2019; Kim and Wang, 2016; Song et al., 2021). These models have been developed based on the linear auto-regressive structure of realized volatilities. However, we often observe that non-linear auto-regressive structures, such as exponential functions, better capture the volatility dynamics (Nelson, 1991; Hansen and Huang, 2016; Kawakatsu, 2006). This may be because log-volatilities often have a stronger linear auto-regressive relationship. In fact, when variables are close to normal distributions, linear models work well. To check the normality of realized volatilities, we draw normal QQ-plots of realized volatilities and log realized volatilities for AAPL stock. Figure 1 shows that the log-transformation makes the realized volatilities closer to a normal distribution. Most assets exhibit a similar pattern. Thus, we can conjecture that log-volatilities better explain volatility dynamics. To harness this feature, Hansen and Huang (2016) employed the exponential GARCH structure with the log-realized volatility as the innovation, and their empirical study indicates that the non-linear GARCH structure helps account for market dynamics.
Although, as discussed above, empirical studies support that incorporating high-frequency data with a non-linear auto-regressive structure better captures the market dynamics, the mathematical gap between the empirical low-frequency discrete-time non-linear volatility models, such as the exponential realized GARCH, and the high-frequency-based continuous-time diffusion process has not been well studied. In fact, several studies have been conducted to fill the gap between the discrete-time volatility models and the continuous-time diffusion process (Kim and Fan, 2019; Kim and Wang, 2016; Song et al., 2021). However, these studies are based on the linear auto-regressive structure, and the extension from linear to non-linear structures is not straightforward. This motivates the development of continuous-time diffusion process-based models that provide a rigorous mathematical formulation for the non-linear auto-regressive structure of realized volatilities.

Figure 1: Normal QQ-plots of realized volatilities and log realized volatilities for AAPL. The red solid line denotes the best linear fit, which corresponds to a perfect normal distribution.

In this paper, we develop a novel diffusion process to model high-frequency financial data, which can accommodate a non-linear GARCH structure of the realized volatilities. From the empirical study, we often observe that the log-realized volatility has a stronger auto-regressive structure than the original realized volatility. To reflect this, we employ the exponential GARCH structure as the non-linear function. Specifically, the log-realized volatility follows the realized GARCH structure. To connect this low-frequency volatility structure with the continuous-time diffusion process, we develop a novel continuous instantaneous volatility process. Since the volatility process has a non-linear structure, the linear structure of the unified GARCH-Itô (Kim and Wang, 2016) is not applicable. Furthermore, usual log-diffusion processes for instantaneous volatility processes cannot provide the solution. To tackle this issue, we propose a novel instantaneous volatility process, based on the average integrated volatility process. Then, the proposed instantaneous volatility process is continuous with respect to time, and its daily integrated volatility is decomposed into the exponential realized GARCH with the daily log-integrated volatility as the innovation and an exponential martingale difference. We call it the exponential realized GARCH-Itô (ERGI) model. Unlike the realized volatility in the linear realized GARCH model, the log-realized volatility can take negative values. Thus, we allow model parameters to be negative. To estimate the model parameters, we propose a quasi-maximum likelihood estimation procedure. Specifically, we adopt the Gaussian quasi-likelihood function and use the realized volatility as the proxy of the conditional expected value. Furthermore, we establish its asymptotic properties. To illustrate the benefit of the proposed model, we apply the ERGI model to real trading high-frequency data and find that the exponential structure helps account for the volatility dynamics.

The rest of the paper is organized as follows. In Section 2, we propose the ERGI model and investigate its properties. In Section 3, we suggest the quasi-maximum likelihood estimation procedure and study its asymptotic behaviors. In Section 4, we conduct a simulation study to check the finite sample performance of the ERGI model. In Section 5, we apply the ERGI model to the top 50 high trading volume assets among the S&P 500 constituents. Section 6 contains the conclusions. All the technical proofs are collected in Section 7.

2 Exponential realized GARCH-Itô models

In this section, we develop an exponential realized GARCH-Itô (ERGI) model as follows.

Definition 1.

We say that the log-price $X_{t}$ follows the ERGI process if it satisfies

	$\displaystyle dX_{t}=\mu_{t}dt+\sigma_{t}(\theta)dB_{t}+J_{t}d\Lambda_{t},$
	$\displaystyle\sigma_{t}^{2}(\theta)=\bar{\sigma}_{t}^{2}(\theta)\{1+(t-[t])b_{t}(\theta)\},$
	$\displaystyle b_{t}(\theta)=b_{[t]}(\theta)+(t-[t])(\omega+(\gamma-1)b_{[t]}(\theta))+\beta\log\bar{\sigma}_{t}^{2}(\theta)$
	$\displaystyle\qquad\qquad-(1-t+[t])\left(\beta+\beta^{*}(t-[t])\right)\log\sigma_{[t]}^{2}+\nu(1-t+[t])Z_{t}^{2},$

where $\bar{\sigma}_{t}^{2}(\theta)=(t-[t])^{-1}\int_{[t]}^{t}\sigma^{2}_{s}(\theta)ds$ , $[t]$ denotes the integer part of $t$ , except that $[t]=t-1$ when $t$ is an integer, $Z_{t}=\int_{[t]}^{t}dW_{s}$ , and $\theta=(\omega,\gamma,\beta,\beta^{*},\nu)$ is the model parameter. For the jump part, $J_{t}$ is the jump size and $\Lambda_{t}$ is the Poisson process with the intensity $\lambda_{t}$ .

We note that $\beta^{*}$ is an adjustment term that handles some remaining terms. Specifically, we set $\beta^{*}=\frac{1+\beta\varrho_{2}}{\varrho_{2}-2\varrho_{3}}$ , where $\varrho$ ’s are functions of $\beta$ , which are defined in Theorem 1. The log-average integrated volatility, $\log\bar{\sigma}_{t}^{2}(\theta)$ , provides the innovation term, and $Z_{t}$ is the random fluctuation. Since the process $b_{t}(\theta)$ is continuous with respect to time $t$ , the instantaneous volatility process $\sigma_{t}^{2}(\theta)$ is also continuous. At the integer time points, we have

b_{n}(\theta)=\omega+\gamma b_{n-1}(\theta)+\beta\log\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt.

That is, $b_{n}(\theta)$ can be explicitly expressed by the past log-integrated volatilities, and $b_{t}(\theta)$ has the form of an interpolation between these values. Thus, in terms of $b_{t}(\theta)$ , the ERGI model has a structure similar to that of the realized GARCH-Itô model (Song et al., 2021) with the log-integrated volatility as the innovation. However, unlike the realized GARCH-Itô model, to obtain the non-linear exponential realized GARCH form, the instantaneous volatility process has a non-linear structure, namely $\bar{\sigma}_{t}^{2}(\theta)\{1+(t-[t])b_{t}(\theta)\}$ . The solution for this structure is

\frac{1}{t-n+1}\int_{n-1}^{t}\sigma_{s}^{2}(\theta)ds=\sigma_{n-1}^{2}(\theta)e^{\int_{n-1}^{t}b_{s}(\theta)ds}\text{ a.s.}
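
The identity above can be motivated by a short heuristic calculation. This is only a sketch, under the assumptions that $\bar{\sigma}_{t}^{2}(\theta)$ is differentiable in $t$ on $(n-1,n]$ and that $\bar{\sigma}_{t}^{2}(\theta)\to\sigma_{n-1}^{2}(\theta)$ as $t\downarrow n-1$ ; it is not a substitute for the formal proof. For $t\in(n-1,n]$ , the definition of $\bar{\sigma}_{t}^{2}(\theta)$ gives $\int_{n-1}^{t}\sigma_{s}^{2}(\theta)ds=(t-n+1)\bar{\sigma}_{t}^{2}(\theta)$ . Differentiating both sides with respect to $t$ and comparing with the second equation of Definition 1 yields

\sigma_{t}^{2}(\theta)=\bar{\sigma}_{t}^{2}(\theta)+(t-n+1)\frac{d}{dt}\bar{\sigma}_{t}^{2}(\theta)=\bar{\sigma}_{t}^{2}(\theta)\{1+(t-n+1)b_{t}(\theta)\},

so that

\frac{d}{dt}\log\bar{\sigma}_{t}^{2}(\theta)=b_{t}(\theta)\quad\text{and hence}\quad\bar{\sigma}_{t}^{2}(\theta)=\sigma_{n-1}^{2}(\theta)e^{\int_{n-1}^{t}b_{s}(\theta)ds}.

Taking $t=n$ gives $\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt=\sigma_{n-1}^{2}(\theta)e^{\int_{n-1}^{n}b_{s}(\theta)ds}$ .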

Details can be found in Lemma 1. Using the above solution, we can measure the integrated volatility $\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt$ with $\sigma_{n-1}^{2}(\theta)$ and $\int_{n-1}^{n}b_{s}(\theta)ds$ , which has the realized GARCH form with the log-integrated volatilities. Specific properties of the integrated volatility are shown in the following theorem.

Theorem 1.

Under the ERGI model, for $|\beta|<1$ and $n\in\mathbb{N}$ , the integrated volatilities have the following properties.

(a)

We have

	$\displaystyle\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt=\exp\left(h_{n}(\theta)+D_{n}\right)\quad\text{a.s.},$
	$\displaystyle h_{n}(\theta)=\omega^{*}+\gamma h_{n-1}(\theta)+\beta^{g}\log\int_{n-2}^{n-1}\sigma_{t}^{2}(\theta)dt,$

where

	$\displaystyle\omega^{*}=\{(1-\gamma)\varrho_{2}+\varrho\}\omega+(1-\gamma)\nu(\varrho_{2}-2\varrho_{3}),\quad\beta^{g}=\varrho\beta,\quad\varrho=\varrho_{1}+(\gamma-1)\varrho_{2},$
	$\displaystyle\varrho_{1}=\beta^{-1}(e^{\beta}-1),\quad\varrho_{2}=\beta^{-2}(e^{\beta}-1-\beta),\quad\varrho_{3}=\beta^{-3}(e^{\beta}-1-\beta-\beta^{2}/2),$

and

\displaystyle D_{n}=2\nu\int_{n-1}^{n}\{(n-t)\beta^{-1}e^{\beta(n-t)}-(e^{\beta(n-t)}-1)\beta^{-2}\}Z_{t}dW_{t}

is a martingale difference.

(b)

We have

	$\displaystyle\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt=\exp\left(H_{n}(\theta)\right)M_{n},$		(2.1)
	$\displaystyle\mathbb{E}\left[\int_{n-1}^{n}\sigma^{2}_{t}(\theta)dt\middle|\mathcal{F}_{n-1}\right]=\exp\left(H_{n}(\theta)\right)\text{ a.s.},$		(2.2)

where

	$\displaystyle H_{n}(\theta)=\omega^{g}+\gamma H_{n-1}(\theta)+\beta^{g}\log\ \int_{n-2}^{n-1}\sigma_{t}^{2}(\theta)dt,$
	$\displaystyle\omega^{g}=\omega^{*}+(1-\gamma)\log\mathbb{E}[\exp(D_{n})],$

and

M_{n}=\exp(D_{n}-\log\mathbb{E}[\exp(D_{n})])

is an exponential martingale difference.

Theorem 1(a) shows that the log-integrated volatility is decomposed into the realized GARCH with the log-integrated volatility innovations, $h_{n}(\theta)$ , and the martingale difference $D_{n}$ . Thus, the log-realized GARCH, $h_{n}(\theta)$ , is the conditional expected value of the log-integrated volatility, but it is not the conditional expected value of the original integrated volatility. In Theorem 1(b), we show that the integrated volatility is decomposed into the exponential function of the realized GARCH with the log-integrated volatility innovations, $H_{n}(\theta)$ , which has an additional intercept term arising from the martingale difference term $D_{n}$ , and the exponential martingale difference $M_{n}$ . Since the expectation is a linear function, the additional intercept term does not appear in the linear realized GARCH. However, the ERGI is non-linear, and, thus, we have the additional intercept term. The main purpose of this paper is to develop a model for analyzing the original integrated volatility, so we develop a statistical inference based on (2.2). Theorem 1 indicates that the proposed model has a non-linear exponential GARCH structure. From the empirical study, we find that this non-linear structure helps explain the volatility dynamics as compared to the usual linear realized GARCH. Details can be found in Section 5.
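
To make the parameter mapping in Theorem 1 concrete, the following minimal sketch evaluates $\varrho_{1},\varrho_{2},\varrho_{3},\varrho$ , $\beta^{*}$ , $\omega^{*}$ , and $\beta^{g}$ from $(\omega,\gamma,\beta,\nu)$ . The function name `ergi_to_garch` and the dictionary layout are illustrative choices, not part of the paper. The intercept $\omega^{g}$ is not computed, because it additionally requires $\log\mathbb{E}[\exp(D_{n})]$ , for which no closed form is given here.

```python
import math


def ergi_to_garch(omega, gamma, beta, nu):
    """Evaluate the Theorem 1 quantities from ERGI parameters (hypothetical helper).

    Assumes beta != 0, since the rho's divide by powers of beta.
    """
    e = math.exp(beta)
    rho1 = (e - 1.0) / beta
    rho2 = (e - 1.0 - beta) / beta**2
    rho3 = (e - 1.0 - beta - beta**2 / 2.0) / beta**3
    rho = rho1 + (gamma - 1.0) * rho2
    # Adjustment term beta* as stated after Definition 1.
    beta_star = (1.0 + beta * rho2) / (rho2 - 2.0 * rho3)
    # omega* and beta^g as stated in Theorem 1(a).
    omega_star = ((1.0 - gamma) * rho2 + rho) * omega \
        + (1.0 - gamma) * nu * (rho2 - 2.0 * rho3)
    beta_g = rho * beta
    return {
        "rho1": rho1, "rho2": rho2, "rho3": rho3, "rho": rho,
        "beta_star": beta_star, "omega_star": omega_star, "beta_g": beta_g,
    }


if __name__ == "__main__":
    # Parameter values used in the simulation study of Section 4.
    print(ergi_to_garch(omega=-0.1, gamma=0.3, beta=0.5, nu=2.0))
```

For the Section 4 values $(\omega,\gamma,\beta,\nu)=(-0.1,0.3,0.5,2)$ , this gives $\beta^{g}\approx 0.4405$ , which agrees with the value reported there.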

2.1 Relationship with the daily log-returns

The traditional discrete GARCH models are models of close-to-close volatilities for log-returns. In this section, we discuss the relationship between the proposed ERGI and close-to-close volatilities for log-returns.

We first consider the continuous part. That is, we assume that the log-price process does not have the jump part. Then, by Itô’s lemma and Theorem 1(b), we have

\displaystyle\mathbb{E}\left[\left(X_{n}-X_{n-1}-\int_{n-1}^{n}\mu_{t}dt\right)^{2}\middle|\mathcal{F}_{n-1}\right]=\mathbb{E}\left[\left(\int_{n-1}^{n}\sigma_{t}(\theta)dB_{t}\right)^{2}\middle|\mathcal{F}_{n-1}\right]=\exp\left(H_{n}(\theta)\right)\text{ a.s.}

Thus, the exponential GARCH volatility, $\exp\left(H_{n}(\theta)\right)$ , is the conditional volatility of the daily log-return. Unfortunately, in practice, we do not have observations during the close-to-open period. Thus, in order to investigate the close-to-close volatility, we need to impose a structure on the overnight period. For example, we can simply use squared close-to-open log-returns as the proxy of the integrated volatility for the close-to-open period. Then, we can apply the proposed ERGI model to the realized volatility plus the squared close-to-open log-return. Alternatively, we can assume that the close-to-open volatility dynamics are the same as those of the open-to-close period. Then, we only need to match the scale. To do this, we can calculate the averages of the open-to-close realized volatilities and the squared close-to-open log-returns, and multiply the realized volatility by the inverse of the proportion of the average of the realized volatility. The above methods are practical solutions without theoretical justifications. Thus, it is interesting and important to develop a diffusion process which can accommodate the close-to-close period. We leave this for future study.

To investigate the jump diffusion process, we assume that the jump sizes $J_{t}$ ’s are i.i.d. with mean $\mu_{J}$ and variance $\sigma_{J}^{2}$ , and the intensity is constant over time; that is, $\lambda_{t}=\lambda$ . Furthermore, we assume the continuous part and jump part are not correlated. Then, the conditional volatility of the daily log-return is

\displaystyle\mathbb{E}\left[\left(X_{n}-X_{n-1}-\int_{n-1}^{n}\mu_{t}dt\right)^{2}\middle|\mathcal{F}_{n-1}\right]=\exp\left(H_{n}(\theta)\right)+\lambda\sigma_{J}^{2}+\lambda^{2}\mu_{J}^{2}\text{ a.s.}

The squared log-return has the exponential GARCH, $\exp\left(H_{n}(\theta)\right)$ , and additional expected jump variation. The jump variation part depends on the assumption of the jump structure. Thus, it is interesting and important to investigate the jump variation dynamics and to model the jump part. We leave this for future study.

3 Estimation procedure

3.1 A model setup

We assume that the log-prices follow the ERGI process defined in Definition 1. The intra-day log-prices for the $d$ th day are observed at $t_{d,i},i=1,\ldots,m_{d}$ , where $d-1=t_{d,0}<t_{d,1}<\cdots<t_{d,m_{d}}=d$ . We denote by $m$ the average number of the high-frequency observations; that is, $m=\frac{1}{n}\sum_{d=1}^{n}m_{d}$ . Unfortunately, the true high-frequency log-prices, $X_{t_{d,i}}$ ’s, are not observed due to market micro-structure noises. To accommodate the market micro-structure noises, we assume that the observed log-prices $Y_{t_{d,i}}$ have the following structure:

Y_{t_{d,i}}=X_{t_{d,i}}+\epsilon_{t_{d,i}},\quad\text{for }d=1,\ldots,n,i=1,\ldots,m_{d},

where $X_{t}$ is the true log-price and $\epsilon_{t_{d,i}}$ ’s are micro-structure noises with mean zero.

In the absence of price jumps, several nonparametric realized volatility estimators have been constructed that take advantage of sub-sampling and local-averaging techniques to remove the effect of market micro-structure noises so that the integrated volatility can be estimated consistently and efficiently (Barndorff-Nielsen et al., 2008; Fan and Kim, 2018; Jacod et al., 2009; Shin et al., 2021; Zhang, 2006; Xiu, 2010). To identify the jump locations given noisy high-frequency data, Fan and Wang (2007) and Zhang et al. (2016) proposed wavelet methods to detect jumps and applied the MSRV method to jump-adjusted data. They showed that the estimator of jump variation has the convergence rate of $m^{-1/4}$ , and the estimator of integrated volatility achieves the optimal convergence rate of $m^{-1/4}$ . On the other hand, Aït-Sahalia and Xiu (2016) proposed jump robust pre-averaging methods by employing a truncation method. They also demonstrated that the estimators of jump variation and integrated volatility achieve the optimal convergence rate of $m^{-1/4}$ . In this paper, for the $i$ th day, we let $RV_{i}$ be the corresponding estimator of daily integrated volatility that is robust to micro-structure noises and price jumps. In the numerical study, we employ the jump robust pre-averaging method.

3.2 GARCH parameters estimation

We first fix notations. For a given vector $\mathbf{x}=(x_{i})_{i=1,\ldots,k}$ , we define $\|\mathbf{x}\|_{\max}=\max_{i}|x_{i}|$ . Let $C$ ’s be generic constants whose values do not depend on $\theta^{g},n$ , and $m$ and may change from occurrence to occurrence. In this section, we develop an estimation procedure for the true GARCH model parameters $\theta^{g}_{0}=(\omega^{g}_{0},\gamma_{0},\beta^{g}_{0})$ .

Theorem 1 indicates that the integrated volatility is decomposed into the exponential GARCH term $\exp\left(H_{n}(\theta^{g})\right)$ and the exponential martingale difference term $M_{n}$ , which implies

\frac{\int_{n-1}^{n}\sigma_{t}^{2}(\theta_{0}^{g})dt-\exp\left(H_{n}(\theta^{g}_{0})\right)}{\exp\left(H_{n}(\theta^{g}_{0})\right)}=M_{n}-1\text{ a.s.}

Since $M_{n}$ is the exponential martingale difference, $M_{n}-1$ is a martingale difference. This inspires us to use integrated volatility as a proxy for exponential GARCH volatility, and we define a quasi-likelihood function as follows:

\widehat{L}_{n}(\theta^{g})=-\frac{1}{n}\sum_{i=1}^{n}\left\{H_{i}(\theta^{g})+\frac{\int_{i-1}^{i}\sigma_{t}^{2}(\theta^{g}_{0})dt}{\exp\left(H_{i}(\theta^{g})\right)}\right\}.

We can estimate the parameter $\theta^{g}$ by maximizing the above quasi-likelihood function. However, in practice, the integrated volatility is not observable, so we need to estimate it first. We employ the jump robust realized volatility estimator (Aït-Sahalia and Xiu, 2016; Fan and Wang, 2007; Zhang et al., 2016). Then, we estimate the log-conditional expectation of the integrated volatilities as follows:

\widehat{H}_{i}(\theta^{g})=\omega^{g}+\gamma\widehat{H}_{i-1}(\theta^{g})+\beta^{g}\log RV_{i-1},		(3.1)

where the initial value $\widehat{H}_{1}(\theta^{g})$ is set to be $\log RV_{1}$ . The effect of the initial value is negligible with the rate of $n^{-1}$ (see Lemma 1 in Kim and Wang (2016)), so its choice does not have a significant effect on the parameter estimation. With the estimated conditional expected volatility function, we define the following quasi-likelihood function:

\widehat{L}_{n,m}(\theta^{g})=-\frac{1}{n}\sum_{i=1}^{n}\left\{\widehat{H}_{i}(\theta^{g})+\frac{RV_{i}}{\exp\left(\widehat{H}_{i}(\theta^{g})\right)}\right\}.

Then, we obtain the estimator for the GARCH parameters $\theta^{g}_{0}$ by maximizing the above quasi-likelihood function,

\displaystyle\widehat{\theta^{g}}=\arg\max_{\theta^{g}\in\Theta}\widehat{L}_{n,m}(\theta^{g}),

where $\Theta$ is the parameter space of $\theta^{g}$ . To establish its asymptotic properties, we need the following technical conditions.

Remark 1.

Even if the effect of the initial value is asymptotically negligible, in finite samples the random variable $\log RV_{1}$ may be far from the true initial value. To handle this practical issue, we can assume that the initial value is a long-term average. Under this condition, we can use the theoretical average value $\frac{\omega^{g}}{1-\gamma-\beta^{g}}$ as the initial value. That is, we additionally assume that the initial value is $H_{1}(\theta^{g}_{0})=\frac{\omega_{0}^{g}}{1-\gamma_{0}-\beta_{0}^{g}}$ . With this condition, we can obtain the same asymptotic result derived in Theorem 2.
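
The recursion (3.1) and the quasi-likelihood function $\widehat{L}_{n,m}(\theta^{g})$ can be sketched in a few lines. The code below is a minimal illustration, assuming the daily realized volatilities are supplied as a list of positive numbers `rv` with `rv[0]` playing the role of $RV_{1}$ ; the function names `h_hat` and `quasi_log_likelihood` and the optional argument `h1` (for the long-term average initial value of Remark 1) are hypothetical. The maximization over $\Theta$ is left to a numerical optimizer of the reader's choice and is not shown.

```python
import math


def h_hat(theta_g, rv, h1=None):
    """Recursion (3.1): H_i = omega_g + gamma * H_{i-1} + beta_g * log RV_{i-1}."""
    omega_g, gamma, beta_g = theta_g
    # Default initial value is log RV_1; pass h1 to use another choice,
    # e.g. omega_g / (1 - gamma - beta_g) as discussed in Remark 1.
    h = [math.log(rv[0]) if h1 is None else h1]
    for i in range(1, len(rv)):
        h.append(omega_g + gamma * h[-1] + beta_g * math.log(rv[i - 1]))
    return h


def quasi_log_likelihood(theta_g, rv, h1=None):
    """Gaussian quasi-likelihood: -(1/n) * sum_i { H_i + RV_i / exp(H_i) }."""
    h = h_hat(theta_g, rv, h1)
    return -sum(hi + r / math.exp(hi) for hi, r in zip(h, rv)) / len(rv)


if __name__ == "__main__":
    rv = [1.0, 1.5, 0.8, 1.2, 2.0, 0.9]  # toy values, for illustration only
    print(quasi_log_likelihood((0.1, 0.3, 0.4), rv))
```

Each summand has the form $x+RV_{i}e^{-x}$ with $x=\widehat{H}_{i}(\theta^{g})$ , which is minimized at $x=\log RV_{i}$ , so the criterion rewards parameter values whose fitted $\exp(\widehat{H}_{i}(\theta^{g}))$ tracks the realized volatility.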

Assumption 1.

(a)

$\theta^{g}_{0}\in\Theta=\{(\omega^{g},\gamma,\beta^{g});\omega_{l}<|\omega^{g}|<\omega_{u},\gamma_{l}<|\gamma|<\gamma_{u}<1,\beta_{l}<|\beta^{g}|<\beta_{u}<1,|\gamma+\beta^{g}|<1\}$ , where $\omega_{l},\omega_{u},\gamma_{l},\gamma_{u},\beta_{l},\beta_{u}$ are some known constants.
(b)

$\sup_{i}\mathbb{E}\left[\left|RV_{i}-\int_{i-1}^{i}\sigma_{t}^{2}(\theta^{g}_{0})dt\right|^{4}\right]^{1/4}\leq Cm^{-1/4}$ and $\sup_{i}\mathbb{E}\left[\left|\log RV_{i}-\log\int_{i-1}^{i}\sigma_{t}^{2}(\theta^{g}_{0})dt\right|^{4}\right]^{1/4}\leq Cm^{-1/4}$
(c)

$\mathbb{E}\left[\left|\int_{i-1}^{i}\sigma_{t}^{2}(\theta^{g}_{0})dt\right|^{4}\right]\leq C$ , $\mathbb{E}\left[\sup_{\theta^{g}\in\Theta}\exp(4|H_{i}(\theta^{g})|)\right]\leq C$ , $\mathbb{E}\left[\sup_{\theta^{g}\in\Theta}\exp(4|\widehat{H}_{i}(\theta^{g})|)\right]\leq C$ , and $\mathbb{E}\left[M_{i}^{4}\right]\leq C$ for all $i$ .

Remark 2.

Under Assumption 1(a), unlike the linear GARCH-Itô models (Kim and Wang, 2016; Song et al., 2021), we allow the parameters to be negative. The condition $|\gamma+\beta^{g}|<1$ ensures the stationarity of the conditional volatilities. There exist realized volatility estimators satisfying Assumption 1(b) under some finite moment condition (see Kim et al. (2016); Tao et al. (2011)). A sufficient condition for Assumption 1(c) is that $\mathbb{E}\left[\exp(s|\log RV_{i}|)\right]\leq C$ and $\mathbb{E}\left[\exp(s|\log\int_{i-1}^{i}\sigma_{t}^{2}(\theta^{g}_{0})dt|)\right]\leq C$ for $s\geq 4/(1-\gamma_{u})$ .

In the following theorem, we establish the asymptotic properties of the proposed quasi-maximum likelihood estimator (QMLE).

Theorem 2.

Under Assumption 1, we have

\|\widehat{\theta^{g}}-\theta^{g}_{0}\|_{\max}=O_{p}(n^{-1/2}+m^{-1/4}).

Furthermore, we suppose that $nm^{-1/2}\to 0$ and Assumption 1 is satisfied. Then, we have

\sqrt{n}(\widehat{\theta^{g}}-\theta^{g}_{0})\overset{d}{\to}N(0,AV^{-1}),

where $A$ and $V$ are defined in Section 3.3.

Remark 3.

Theorem 2 shows that the QMLE $\widehat{\theta^{g}}$ has the convergence rate $n^{-1/2}+m^{-1/4}$ . The $n^{-1/2}$ term is the usual convergence rate due to the low-frequency errors, $M_{i}-1$ . The $m^{-1/4}$ term is the cost of estimating the integrated volatility, which is known to be the optimal rate in the presence of micro-structure noises. Specifically, by Theorem 1, we have the following relationship:

\int_{n-1}^{n}\sigma_{t}^{2}(\theta^{g})dt=\exp\left(H_{n}(\theta^{g})\right)M_{n}\text{ a.s.},

and additionally, due to the estimation error of the latent integrated volatility, we have

RV_{n}=\int_{n-1}^{n}\sigma_{t}^{2}(\theta^{g})dt+E_{n}=\exp\left(H_{n}(\theta^{g})\right)M_{n}+E_{n}\text{ a.s.},

where $E_{n}$ is the estimation error of the latent integrated volatility. The error rate of $E_{n}$ is $m^{-1/4}$ , and its asymptotic variance is specified in the literature on estimating integrated volatility (Aït-Sahalia et al., 2010; Barndorff-Nielsen et al., 2008; Jacod et al., 2009; Xiu, 2010; Zhang, 2006). Then, the asymptotic variance of $\widehat{\theta^{g}}$ in Theorem 2 has an additional term that is a function of the variance of $E_{n}$ . For example, we have

\|\widehat{\theta^{g}}-\theta^{g}_{0}\|_{\max}\approx C_{1}\frac{\sqrt{A}}{n^{1/2}}+C_{2}\frac{\sqrt{Avar_{RV}}}{m^{1/4}},

where $Avar_{RV}$ is the asymptotic variance of $RV$ , and $C_{1}$ and $C_{2}$ are functions of $H_{i}(\theta^{g}_{0})$ .

Remark 4.

The condition $nm^{-1/2}\to 0$ is required to remove the effect of the estimation error of the realized volatility when establishing the asymptotic normality. However, as in the realized GARCH model (Hansen et al., 2012), if we assume that the conditional volatility is a function of the realized volatility estimator $RV_{i}$ , this assumption is not required.

3.3 Hypothesis tests

In financial practices, we are interested in statistical inferences about the GARCH parameters $(\omega^{g},\gamma,\beta^{g})$ , such as hypothesis tests. In this section, we discuss how to conduct hypothesis tests for the GARCH parameters.

Theorem 2 implies that

\sqrt{n}(\widehat{\theta^{g}}-\theta^{g}_{0})\overset{d}{\to}N(0,AV^{-1}),

where $A=\mathbb{E}\left[(1-M_{i})^{2}\right]$ and $V=\mathbb{E}\left[\frac{\partial H_{i}(\theta^{g})}{\partial\theta^{g}}\frac{\partial H_{i}(\theta^{g})}{\partial(\theta^{g})^{\top}}\big{|}_{\theta^{g}=\theta^{g}_{0}}\right]$ . To evaluate the asymptotic variances of the GARCH parameter estimators, we first need to estimate $A$ and $V$ . We use the following estimators,

\widehat{A}=\frac{1}{n}\sum_{i=1}^{n}\left(\frac{RV_{i}-\exp\left(\widehat{H}_{i}(\widehat{\theta^{g}})\right)}{\exp\left(\widehat{H}_{i}(\widehat{\theta^{g}})\right)}\right)^{2}\quad\text{and}\quad\widehat{V}(\theta^{g})=\frac{1}{n}\sum_{i=1}^{n}\frac{\partial\widehat{H}_{i}(\theta^{g})}{\partial\theta^{g}}\frac{\partial\widehat{H}_{i}(\theta^{g})}{\partial(\theta^{g})^{\top}},

where $\widehat{H}_{i}(\theta^{g})$ is defined in (3.1), and the summand of $\widehat{A}$ is the sample analogue of $M_{i}-1$ in Section 3.2. Under some stationarity condition, we can establish their consistency. Then, by Slutsky’s theorem, we can obtain

T_{i,n}=\frac{\sqrt{n}(\widehat{\theta^{g}}_{i}-\theta^{g}_{0i})}{\sqrt{\widehat{A}\widehat{V}_{ii}^{-1}(\widehat{\theta^{g}})}}\overset{d}{\to}N(0,1),

where $\widehat{\theta^{g}}_{i}$ and $\theta^{g}_{0i}$ are the $i$ th elements of $\widehat{\theta^{g}}$ and $\theta^{g}_{0}$ , respectively, and $\widehat{V}_{ii}^{-1}(\widehat{\theta^{g}})$ is the $i$ th diagonal element of $\widehat{V}^{-1}(\widehat{\theta^{g}})$ . Thus, using the proposed Z-statistics $T_{i,n}$ , we can conduct the hypothesis tests based on the standard normal distribution.
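
The Z-statistics can be computed directly from the recursion (3.1), since the gradient of $\widehat{H}_{i}(\theta^{g})$ satisfies a recursion of the same form. The sketch below is illustrative only: the function names are hypothetical, the initial value $\log RV_{1}$ is treated as parameter-free (so its gradient is zero), and $\widehat{A}$ is computed from the relative residual $(RV_{i}-\exp(\widehat{H}_{i}))/\exp(\widehat{H}_{i})$ , which is assumed here as the sample analogue of $M_{i}-1$ .

```python
import math


def h_and_grad(theta_g, rv):
    """Return H_i from (3.1) and its gradient w.r.t. (omega_g, gamma, beta_g)."""
    omega_g, gamma, beta_g = theta_g
    h, grad = [math.log(rv[0])], [(0.0, 0.0, 0.0)]  # log RV_1 has zero gradient
    for i in range(1, len(rv)):
        x = math.log(rv[i - 1])
        g = grad[-1]
        # Differentiate H_i = omega_g + gamma * H_{i-1} + beta_g * log RV_{i-1}.
        grad.append((1.0 + gamma * g[0], h[-1] + gamma * g[1], x + gamma * g[2]))
        h.append(omega_g + gamma * h[-1] + beta_g * x)
    return h, grad


def inverse_3x3(m):
    """Invert a 3x3 matrix via the adjugate formula."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adj = [[e * i - f * h, c * h - b * i, b * f - c * e],
           [f * g - d * i, a * i - c * g, c * d - a * f],
           [d * h - e * g, b * g - a * h, a * e - b * d]]
    return [[v / det for v in row] for row in adj]


def z_statistics(theta_hat, theta_null, rv):
    """Z-statistic for each component of theta_g under the null value theta_null."""
    n = len(rv)
    h, grad = h_and_grad(theta_hat, rv)
    # A_hat: average squared relative residual (assumed analogue of M_i - 1).
    a_hat = sum(((r - math.exp(hi)) / math.exp(hi)) ** 2 for r, hi in zip(rv, h)) / n
    # V_hat: average outer product of the gradients.
    v_hat = [[sum(g[j] * g[k] for g in grad) / n for k in range(3)] for j in range(3)]
    v_inv = inverse_3x3(v_hat)
    return [math.sqrt(n) * (theta_hat[j] - theta_null[j])
            / math.sqrt(a_hat * v_inv[j][j]) for j in range(3)]


if __name__ == "__main__":
    rv = [1.0, 1.5, 0.8, 1.2, 2.0, 0.9]  # toy values, for illustration only
    print(z_statistics((0.1, 0.3, 0.4), (0.0, 0.0, 0.0), rv))
```

Each returned value can be compared with standard normal quantiles; for example, a two-sided test at the 5% level rejects when the absolute value exceeds 1.96.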

4 A simulation study

We conducted Monte-Carlo simulations to check the finite sample performance of the ERGI model. The log-prices were generated from the ERGI model given in Definition 1 for $n$ days with $m$ high-frequency observations. The model parameters were set to be $(\omega_{0},\gamma_{0},\beta_{0},\nu_{0})=(-0.1,0.3,0.5,2)$ and $\mu_{t}=0$ . Then, the GARCH parameters are $(\omega_{0}^{g},\gamma_{0},\beta_{0}^{g})=(0.3207,0.3,0.4405)$ . For the jump part, we set the intensity $\lambda_{t}=10$ and the jump size $|J_{t}|=0.05$ . The signs of the jump sizes were randomly generated. Let $t_{d,j}=d-1+j/m$ for $d=1,\ldots,n$ and $j=0,\ldots,m$ . We generated the noisy observations as follows:

\displaystyle Y_{t_{d,j}}=X_{t_{d,j}}+\epsilon_{t_{d,j}},\qquad\text{for }d=1,\ldots,n\text{ and }j=0,\ldots,m,

where $\epsilon_{t_{d,j}}$ ’s are i.i.d. normal random variables with mean zero and standard deviation $0.01\sqrt{\int_{d-1}^{d}\sigma_{t}^{2}(\theta^{g})dt}$ . To generate the true process, we chose $m=11700$ . We varied $n$ from 100 to 500 and $m$ from 390 to 11700, which correspond to the numbers of 1-minute and 2-second intervals during the open-to-close period, respectively. We used $Y_{t_{d,j}}$ as the high-frequency observations. To estimate the integrated volatilities, we used the jump robust pre-averaging method (Aït-Sahalia and Xiu, 2016; Jacod et al., 2009) as follows:

RV_{d}=\frac{1}{\psi K}\sum_{k=1}^{m-K+1}\left\{\bar{Y}^{2}(t_{d,k})-\frac{1}{2}\,\widehat{Y}^{2}(t_{d,k})\right\}\boldsymbol{1}_{\{|\bar{Y}(t_{d,k})|\leq\tau_{m}\}},

where we take the weight function $g(x)=x\wedge(1-x)$ , the bandwidth size $K=\lfloor m^{1/2}\rfloor$ ,

	$\displaystyle\bar{Y}(t_{d,k})=\sum_{l=1}^{K-1}g\left(\frac{l}{K}\right)\left(Y_{t_{d,k+l}}-Y_{t_{d,k+l-1}}\right),\quad\psi=\int_{0}^{1}g(t)^{2}dt,$
	$\displaystyle\widehat{Y}^{2}(t_{d,k})=\sum_{l=1}^{K}\left\{g\left(\frac{l}{K}\right)-g\left(\frac{l-1}{K}\right)\right\}^{2}\left(Y_{t_{d,k+l-1}}-Y_{t_{d,k+l-2}}\right)^{2},$

$\boldsymbol{1}_{\{\cdot\}}$ is an indicator function, and $\tau_{m}=c_{\tau}m^{-0.235}$ is a truncation level for the constant $c_{\tau}$ . We chose $c_{\tau}$ as four times the sample standard deviation of the pre-averaged prices $m^{1/4}\bar{Y}(t_{d,k})$ . We estimated the parameters using the procedure in Section 3. We repeated the whole procedure 500 times.
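
The estimator above can be sketched for a single day as follows. This is a minimal illustration under stated assumptions, not the code used for the reported results: the function name `pre_averaging_rv` and its arguments are hypothetical, the input `y` is the list $[Y_{t_{d,0}},\ldots,Y_{t_{d,m}}]$ , $\psi=1/12$ is the exact value of $\int_{0}^{1}g(t)^{2}dt$ for $g(x)=x\wedge(1-x)$ , and, when `tau` is not supplied, the truncation level is set by the four-standard-deviation rule described in the text.

```python
import math
import statistics


def g(x):
    """Weight function g(x) = min(x, 1 - x)."""
    return min(x, 1.0 - x)


PSI = 1.0 / 12.0  # integral of g(t)^2 over [0, 1]


def pre_averaging_rv(y, K=None, tau=None):
    """Jump robust pre-averaging estimator for one day; y = [Y_0, ..., Y_m]."""
    m = len(y) - 1
    if K is None:
        K = math.isqrt(m)  # bandwidth floor(m^{1/2})
    r = [y[j] - y[j - 1] for j in range(1, m + 1)]  # r[j-1] = Y_j - Y_{j-1}
    w = [g(l / K) for l in range(K + 1)]
    ks = range(1, m - K + 2)  # k = 1, ..., m - K + 1
    # Y_bar(t_k) = sum_{l=1}^{K-1} g(l/K) (Y_{k+l} - Y_{k+l-1})
    y_bar = [sum(w[l] * r[k + l - 1] for l in range(1, K)) for k in ks]
    # Y_hat^2(t_k) = sum_{l=1}^{K} {g(l/K) - g((l-1)/K)}^2 (Y_{k+l-1} - Y_{k+l-2})^2
    y_hat2 = [sum((w[l] - w[l - 1]) ** 2 * r[k + l - 2] ** 2 for l in range(1, K + 1))
              for k in ks]
    if tau is None:
        # c_tau = 4 * sample std of m^{1/4} * Y_bar; needs at least two windows.
        c_tau = 4.0 * statistics.stdev([m ** 0.25 * v for v in y_bar])
        tau = c_tau * m ** (-0.235)
    # Keep only windows whose pre-averaged increment is below the truncation level.
    total = sum(b * b - 0.5 * h2 for b, h2 in zip(y_bar, y_hat2) if abs(b) <= tau)
    return total / (PSI * K)


if __name__ == "__main__":
    # Tiny deterministic example with truncation switched off.
    print(pre_averaging_rv([0, 1, 3, 6, 10], K=2, tau=float("inf")))
```

Passing `tau=float("inf")` disables the truncation, which recovers the pre-averaging estimator without jump adjustment and can be useful for checking the effect of the indicator term.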

To check the performance of the realized volatility estimator, we calculated squared relative errors as follows:

\frac{1}{n}\sum_{i=1}^{n}\left(\frac{RV_{i}-\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt}{RV_{i}}\right)^{2}.

Then, we calculated the sample average of squared relative errors over 500 simulations. We have the average errors 0.0117, 0.0463, and 0.10751 for $m=11700,1170,$ and $390$ , respectively. As $m$ increases, the average error decreases. This result supports the theoretical findings in the realized volatility estimator literature (Aït-Sahalia and Xiu, 2016; Jacod et al., 2009).

Table 1: MSEs for the parameter estimates with $n=100,\,200,\,500$ and $m=390,\,1170,\,11700$

$n$	$m$	$\omega^{g}$	$\gamma$	$\beta^{g}$
100	390	0.0854	0.1312	0.0468
	1170	0.0852	0.1217	0.0415
	11700	0.0865	0.1204	0.0408
200	390	0.0435	0.0714	0.0309
	1170	0.0428	0.0690	0.0280
	11700	0.0453	0.0720	0.0274
500	390	0.0296	0.0484	0.0213
	1170	0.0249	0.0395	0.0177
	11700	0.0244	0.0367	0.0171

Table 1 reports the mean squared errors (MSE) of the parameter estimates $\widehat{\theta^{g}}$ with $n=100,\,200,\,500$ and $m=390,\,1170,11700$ . In Table 1, MSEs usually decrease as the number of high-frequency observations or daily observations increases. This result supports the theoretical findings in Section 3.

To check the asymptotic normality of the GARCH parameters $(\omega^{g},\gamma,\beta^{g})$ , we calculated the Z-statistics defined in Section 3.3. In Figure 2, we draw the standard normal quantile-quantile plots (QQ-plots) of the Z-statistics estimates of $\omega^{g}$ , $\gamma$ , and $\beta^{g}$ for $m=390$ and $n=100,200,500$ . In Figure 2, we find that the Z-statistics become close to the standard normal distribution as the sample period increases. This result supports the theoretical findings in Section 3. Thus, based on the proposed Z-statistics, we can conduct hypothesis tests using the standard normal distribution.

We examined the out-of-sample performance of estimating the one-day-ahead GARCH volatility $\exp(H_{n+1}(\theta^{g}_{0}))$ . To estimate future GARCH volatility, we employed the proposed conditional ERGI estimator $\exp(\widehat{H}_{n+1}(\widehat{\theta^{g}}))$ , realized GARCH volatility estimator (Hansen et al.,, 2012; Song et al.,, 2021), and PRV of the previous day. For example, the realized GARCH volatility estimator is estimated based on the following conditional volatility,

h_{n}(\theta^{g})=\omega+\gamma h_{n-1}(\theta^{g})+\beta RV_{n-1}.

That is, the realized GARCH volatility estimator has the usual linear GARCH structure with the realized volatilities. We measured the mean squared errors with the one-day-ahead sample period over 500 samples as follows:

\frac{1}{500}\sum_{i=1}^{500}\left[\widehat{\mbox{var}}_{n+1,i}-\exp\left(H_{n+1,i}(\theta^{g}_{0})\right)\right]^{2},

where $\widehat{\mbox{var}}_{n+1,i}$ is one of the above future volatility estimators at the $i$ th sample path given the available information at time $n$ . Figure 3 depicts the mean squared errors for the ERGI, realized GARCH, and PRV for varying numbers of low- and high-frequency observations, $n$ and $m$ . In Figure 3, we find that the ERGI model shows the best performance. An interesting finding is that the realized GARCH model can also capture some of the volatility dynamics. This may be because, even if the volatility dynamic structure is non-linear, it can have some linear dynamics. In particular, when the log-volatility quantities are small, by Taylor’s expansion, the linear model can capture some of the non-linear dynamics. However, a linear structure alone cannot fully explain the non-linear dynamic structure. From these results, we can conjecture that modeling an appropriate dynamic structure helps account for market dynamics.
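One way to make the Taylor-expansion remark concrete is the following first-order sketch. It assumes the ERGI recursion $H_{n+1}=\omega^{g}+\gamma H_{n}+\beta^{g}\log RV_{n}$ and, purely for illustration, that the volatility is scaled so that $H_{n}$ and $\log RV_{n}$ are close to zero, so that $e^{x}\approx 1+x$ and $\log x\approx x-1$ apply.

```latex
\begin{aligned}
\exp(H_{n+1}) &\approx 1+H_{n+1} = 1+\omega^{g}+\gamma H_{n}+\beta^{g}\log RV_{n} \\
&\approx 1+\omega^{g}+\gamma\{\exp(H_{n})-1\}+\beta^{g}(RV_{n}-1) \\
&= (1+\omega^{g}-\gamma-\beta^{g})+\gamma\exp(H_{n})+\beta^{g}RV_{n}.
\end{aligned}
```

To first order, this has the linear realized GARCH form with $h_{n}=\exp(H_{n})$ ; the discarded second-order terms are the part of the dynamics that a linear recursion cannot reproduce.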

5 Empirical study

We applied the proposed ERGI model to real high-frequency trading data. We obtained intra-day data for the 50 assets with the largest trading volume from January 2010 to December 2016 from the TAQ database in the Wharton Research Data Services (WRDS) system. We used the log-prices and employed the jump robust PRV estimation procedure defined in Section 4 to estimate open-to-close integrated volatility. In the empirical study, we chose the tuning parameter $c_{\tau}$ as 10 times the sample standard deviation of pre-averaged prices $m^{1/4}\bar{Y}(t_{d,k})$ . To check the accuracy of the PRV estimator, we calculated standard errors (SE) as follows. We first calculated the asymptotic variance estimator proposed by Aït-Sahalia and Xiu, (2016) and then divided its square root by the square root of the number of high-frequency observations. We report a data summary in Table 2. The number of high-frequency observations ranges from about 16,000 to 90,000 on average, and we find that the proportion of the jump variation is about 8% to 40% of the total variation on average. The standard error is less than 10% of the realized volatility.

Table 2: Averages of the number of high-frequency observations, realized volatility (RV), standard error of the realized volatility estimator (SE), and jump variation (JV).

Stock	$\#$ of obs	RV $\times 10^{4}$	SE $\times 10^{4}$	JV $\times 10^{4}$	Stock	$\#$ of obs	RV $\times 10^{4}$	SE $\times 10^{4}$	JV $\times 10^{4}$
PG	16912.4	0.5637	0.0414	0.4000	MO	23667.5	0.6886	0.0075	0.2074
HBAN	15941.4	2.7678	0.0183	0.3550	QCOM	40986.1	1.3487	0.0059	0.2306
FCX	37196.1	5.4242	0.0222	0.8249	MRK	36710.1	0.9756	0.0198	0.2643
MRO	31241.5	3.8938	0.0203	0.6569	GILD	43938.5	1.8619	0.0132	0.4030
ORCL	44489.4	1.3062	0.0062	0.1915	DAL	35653.1	4.2791	0.0219	0.6506
AMD	22373.9	5.9256	0.0345	0.9735	LUV	23756.9	2.3442	0.0120	0.6057
AMAT	33556.6	2.0554	0.0079	0.3043	T	38606.7	0.7138	0.0037	0.1288
XRX	18877.3	2.3137	0.0172	1.4115	CSCO	40328.4	1.2632	0.0079	0.2128
WFC	44023.8	1.4846	0.0073	0.3143	DIS	16682.3	1.0435	0.0108	0.1842
NFLX	30613.5	5.4658	0.0293	0.5557	NVDA	27743.1	3.2549	0.0151	0.4210
F	34610.1	2.2033	0.0177	0.4130	SLB	31761.7	2.0279	0.0094	0.4409
GE	46327.4	1.2444	0.0153	0.2523	BMY	27565.6	1.1346	0.0157	0.2655
INTC	45515.7	1.4031	0.0076	0.2464	ATVI	25432.0	2.0102	0.0100	0.3804
XOM	45802.3	0.9642	0.0072	0.1490	MU	38907.5	5.1948	0.0301	0.9009
RF	19662.3	3.7024	0.0236	0.4208	JPM	40898.6	1.6448	0.0729	0.4199
DOW	28093.0	1.9877	0.0190	0.3006	CVX	32592.7	1.1472	0.0067	0.2205
NEM	29263.8	3.3862	0.0118	0.4225	MSFT	61219.5	1.2213	0.0035	0.1855
CSX	16106.8	1.6852	0.0153	0.4427	BAC	63492.7	2.4282	0.0098	0.3718
TXN	25727.4	1.3822	0.0072	0.3163	WMT	30549.0	0.6412	0.0033	0.1288
JNJ	31877.1	0.5137	0.0062	0.1689	WMB	26970.9	3.8400	0.0394	0.8323
VZ	32309.9	0.7749	0.0185	0.2155	AAPL	90386.9	1.4419	0.0118	0.3017
HST	21239.2	2.2687	0.0135	0.2591	BSX	24163.9	2.3119	0.0145	0.4101
MGM	16642.6	4.5845	0.0309	0.6648	PFE	43600.5	1.0821	0.0160	0.2296
KO	37024.6	0.6041	0.0034	0.1091	HAL	39739.1	3.2834	0.0136	0.4913
SCHW	27159.3	2.0437	0.1963	0.3522	GLW	25151.0	1.8426	0.0155	0.3186

We first estimated the model parameters using the most recent 1000 days of data. From the estimated model parameters, we obtained the following conditional expected volatility for each asset:

\exp(\widehat{H}_{n+1}(\widehat{\theta^{g}}))\quad\text{and}\quad\widehat{H}_{n+1}(\widehat{\theta^{g}})=\widehat{\omega}^{g}+\widehat{\gamma}\widehat{H}_{n}(\widehat{\theta^{g}})+\widehat{\beta}^{g}\log(RV_{n}).

Table 3 reports the estimation results. From Table 3, we show that dynamic structures can be explained by the past log-PRV, and the coefficients of realized volatilities are statistically significant. Thus, the proposed exponential model is valid.
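The one-day-ahead conditional expected volatility can be computed by running the log-linear recursion over the realized volatilities. The sketch below is only illustrative: the function name, the starting value `h_start`, and the toy inputs are assumptions introduced here, not part of the estimation procedure.

```python
import math

def ergi_one_step_forecast(rv, omega_g, gamma, beta_g, h_start):
    """Iterate H_{i+1} = omega_g + gamma * H_i + beta_g * log(RV_i) over
    RV_1, ..., RV_n and return exp(H_{n+1})."""
    h = h_start  # value used for H_1 (a user-supplied assumption)
    for rv_i in rv:
        if rv_i <= 0:
            raise ValueError("realized volatilities must be positive")
        h = omega_g + gamma * h + beta_g * math.log(rv_i)
    return math.exp(h)

# Toy check with hypothetical parameter values:
# H_2 = 0.1 + 0.5 * 0.2 + 0.4 * log(e) = 0.6.
forecast_toy = ergi_one_step_forecast([math.e], 0.1, 0.5, 0.4, 0.2)
flat_toy = ergi_one_step_forecast([1.0, 1.0], 0.0, 0.5, 0.5, 0.0)
```

Because the recursion is linear in the logarithm, the forecast is positive for any real-valued coefficients.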

Table 3: ERGI model estimation results. The p-values are reported in parentheses.

Stock	$\omega^{g}$	$\gamma$	$\beta^{g}$	Stock	$\omega^{g}$	$\gamma$	$\beta^{g}$
PG	-1.18 (0.00)	0.33 (0.00)	0.54 (0.00)	MO	-1.37 (0.00)	0.33 (0.00)	0.51 (0.00)
HBAN	-1.06 (0.00)	0.35 (0.00)	0.52 (0.00)	QCOM	-1.67 (0.00)	0.21 (0.00)	0.59 (0.00)
FCX	-0.07 (0.00)	0.48 (0.00)	0.50 (0.00)	MRK	-0.98 (0.00)	0.33 (0.00)	0.55 (0.00)
MRO	-0.12 (0.00)	0.39 (0.00)	0.58 (0.00)	GILD	-0.86 (0.00)	0.33 (0.00)	0.56 (0.00)
ORCL	-1.60 (0.00)	0.18 (0.00)	0.64 (0.00)	DAL	-1.91 (0.00)	0.18 (0.02)	0.57 (0.00)
AMD	-0.50 (0.00)	0.50 (0.00)	0.42 (0.00)	LUV	-1.45 (0.00)	0.33 (0.00)	0.49 (0.00)
AMAT	-1.83 (0.26)	0.19 (0.00)	0.59 (0.00)	T	-1.98 (0.01)	0.28 (0.00)	0.50 (0.00)
XRX	-0.57 (0.00)	0.47 (0.00)	0.45 (0.00)	CSCO	-1.83 (0.00)	0.18 (0.00)	0.61 (0.00)
WFC	-1.17 (0.00)	0.24 (0.00)	0.62 (0.00)	DIS	-1.41 (0.00)	0.23 (0.00)	0.61 (0.00)
NFLX	-0.69 (0.05)	0.29 (0.00)	0.60 (0.02)	NVDA	-1.31 (0.02)	0.29 (0.00)	0.54 (0.00)
F	-1.56 (0.00)	0.26 (0.03)	0.56 (0.00)	SLB	-0.62 (0.00)	0.36 (0.00)	0.55 (0.00)
GE	-1.40 (0.00)	0.19 (0.10)	0.66 (0.00)	BMY	-1.71 (0.00)	0.23 (0.00)	0.57 (0.00)
INTC	-1.14 (0.00)	0.27 (0.00)	0.60 (0.00)	ATVI	-0.82 (0.00)	0.42 (0.00)	0.47 (0.00)
XOM	-0.62 (0.00)	0.34 (0.00)	0.59 (0.00)	MU	-1.10 (0.00)	0.34 (0.07)	0.51 (0.00)
RF	-0.98 (0.00)	0.42 (0.00)	0.46 (0.00)	JPM	-1.42 (0.00)	0.15 (0.00)	0.68 (0.00)
DOW	-1.06 (0.00)	0.32 (0.00)	0.55 (0.00)	CVX	-0.39 (0.00)	0.38 (0.00)	0.57 (0.00)
NEM	-0.74 (0.00)	0.35 (0.00)	0.55 (0.00)	MSFT	-1.29 (0.00)	0.29 (0.00)	0.55 (0.00)
CSX	-0.83 (0.00)	0.32 (0.00)	0.58 (0.00)	BAC	-1.26 (0.00)	0.27 (0.00)	0.58 (0.00)
TXN	-1.31 (0.00)	0.23 (0.00)	0.61 (0.04)	WMT	-0.71 (0.00)	0.52 (0.00)	0.40 (0.00)
JNJ	-1.02 (0.01)	0.38 (0.00)	0.51 (0.00)	WMB	-0.09 (0.00)	0.41 (0.00)	0.56 (0.00)
VZ	-1.91 (0.00)	0.21 (0.00)	0.58 (0.00)	AAPL	-2.01 (0.00)	0.08 (0.00)	0.68 (0.00)
HST	-0.62 (0.10)	0.46 (0.00)	0.46 (0.00)	BSX	-1.40 (0.00)	0.29 (0.00)	0.54 (0.00)
MGM	-1.04 (0.00)	0.31 (0.00)	0.55 (0.00)	PFE	-0.89 (0.00)	0.33 (0.00)	0.56 (0.00)
KO	-1.46 (0.00)	0.26 (0.00)	0.58 (0.00)	HAL	-0.70 (0.00)	0.30 (0.00)	0.60 (0.03)
SCHW	-1.14 (0.00)	0.33 (0.00)	0.53 (0.00)	GLW	-1.92 (0.00)	0.18 (0.00)	0.59 (0.00)

For comparison, we employed the realized GARCH (Hansen et al.,, 2012; Song et al.,, 2021), unified GARCH-Itô (UGARCH) (Kim and Wang,, 2016), and HAR (Corsi,, 2009) models. To measure the performance of the volatility, we used the mean squared prediction errors (MSPE) and relative mean squared prediction errors (RMSPE) as follows:

\displaystyle MSPE=\frac{1}{n}\sum_{i=1}^{n}\left(Vol_{i}-RV_{i}\right)^{2}\quad\text{and}\quad RMSPE=\frac{1}{n}\sum_{i=1}^{n}\left(\frac{Vol_{i}-RV_{i}}{RV_{i}}\right)^{2},

where $Vol_{i}$ is one of the ERGI, realized GARCH, HAR, and UGARCH. We used $RV_{i}$ as the nonparametric daily volatility estimator. Furthermore, we calculated the out-of-sample R-square (OSR) (Campbell and Thompson,, 2008) as follows:

OSR=1-\frac{\sum_{i=1}^{n}\left(RV_{i}-Vol_{i}^{*}\right)^{2}}{\sum_{i=1}^{n}\left(RV_{i}-Vol_{i}\right)^{2}},

where $Vol_{i}^{*}$ is the proposed ERGI, and $Vol_{i}$ is one of the realized GARCH, HAR, unified GARCH-Itô, and sample mean of the in-sample $RV_{i}$ ’s. We predicted the one-day-ahead conditional expected volatility by the ERGI, realized GARCH, HAR, and UGARCH models using the in-sample period data. We fixed the in-sample period as 500 days and used the rolling window scheme to estimate the parameters. The number of out-of-sample days was 1,262. To check the period dependency, we split the period into two equal parts. We denote the two periods as Period 1 and Period 2. Table 4 reports the average rank and the number of the first rank of MSPEs and RMSPEs for the ERGI, realized GARCH, HAR and UGARCH for Period 1, Period 2, and the whole period over the 50 assets. Figure 4 depicts the relative MSPE and RMSPE for the realized GARCH, HAR, and UGARCH with respect to the ERGI for Period 1, Period 2, and the whole period. Figure 5 draws the OSR for the ERGI with respect to realized GARCH, HAR, UGARCH, and sample mean for Period 1, Period 2, and the whole period. From Table 4 and Figures 4–5, we find that the realized volatility-based models, such as the ERGI, realized GARCH, and HAR models, perform better than the UGARCH model, which incorporates the squared open-to-close returns as the innovation. That is, incorporating the realized volatility helps account for the volatility dynamics. Among the realized volatility-based models, the proposed ERGI model shows the best performance. From this result, we can conjecture that the non-linear exponential form with realized volatilities helps explain the market dynamics.
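The three evaluation measures defined above can be written compactly as follows. This is a minimal sketch; the function names and the toy series are hypothetical and serve only to show the arithmetic.

```python
import numpy as np

def mspe(vol, rv):
    """Mean squared prediction error of forecasts vol against RV."""
    vol, rv = np.asarray(vol, dtype=float), np.asarray(rv, dtype=float)
    return float(np.mean((vol - rv) ** 2))

def rmspe(vol, rv):
    """Relative mean squared prediction error (errors scaled by RV)."""
    vol, rv = np.asarray(vol, dtype=float), np.asarray(rv, dtype=float)
    return float(np.mean(((vol - rv) / rv) ** 2))

def out_of_sample_r2(vol_star, vol_bench, rv):
    """OSR = 1 - SSE(proposed model) / SSE(benchmark model)."""
    vol_star = np.asarray(vol_star, dtype=float)
    vol_bench = np.asarray(vol_bench, dtype=float)
    rv = np.asarray(rv, dtype=float)
    return float(1.0 - np.sum((rv - vol_star) ** 2) / np.sum((rv - vol_bench) ** 2))

# Toy series (hypothetical numbers, not from the empirical study).
rv_toy = [1.0, 2.0, 4.0]
proposed_toy = [1.0, 2.0, 3.0]
bench_toy = [2.0, 2.0, 2.0]
osr_toy = out_of_sample_r2(proposed_toy, bench_toy, rv_toy)
```

A positive OSR means the proposed forecasts have a smaller sum of squared errors than the benchmark forecasts.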

Table 4: Average rank of MSPEs and RMSPEs for the ERGI, realized GARCH, HAR and UGARCH for Period 1, Period 2, and the whole period. The number of times each model ranks first among the competitors is reported in parentheses.

	Period 1				Period 2				Whole period
	ERGI	Real	HAR	UGARCH	ERGI	Real	HAR	UGARCH	ERGI	Real	HAR	UGARCH
MSPE	1.4 (33)	2.3 (5)	2.2 (12)	4.0 (0)	1.1 (42)	2.5 (2)	2.6 (4)	3.6 (2)	1.2 (41)	2.5 (2)	2.4 (6)	3.7 (1)
RMSPE	1.2 (38)	2.4 (3)	2.3 (9)	4.0 (0)	1.0 (48)	2.2 (1)	2.8 (0)	3.8 (1)	1.0 (47)	2.3 (0)	2.6 (3)	3.9 (0)

To further compare the predictive accuracy among the ERGI, realized GARCH, HAR, and UGARCH models, we conducted Diebold-Mariano tests (Diebold and Mariano,, 2002) as follows. We first calculated the residuals for the four models:

e_{i}=RV_{i}-Vol_{i},

where $Vol_{i}$ is one of the ERGI, realized GARCH, HAR, and UGARCH, and $RV_{i}$ is the non-parametric realized volatility. We define

d_{i}=e_{i}^{*2}-e_{i}^{2},

where $e_{i}^{*}$ is the residuals from the ERGI and $e_{i}$ is the residuals from one of realized GARCH, HAR, and UGARCH. Then, we conducted hypothesis tests for

H_{0}:\mathbb{E}[d_{i}]=0\quad\text{vs.}\quad H_{1}:\mathbb{E}[d_{i}]<0\text{ (or }\mathbb{E}[d_{i}]>0).

The first alternative statement ( $\mathbb{E}[d_{i}]<0$ ) is to test whether the ERGI is better than the other model, while the second alternative statement ( $\mathbb{E}[d_{i}]>0$ ) is to test whether the other model is better than the ERGI. We call them “less” and “greater” tests, respectively. Figure 6 depicts box plots of p-values of the less and greater DM tests for the ERGI versus one of the realized GARCH, HAR, and UGARCH for Period 1, Period 2, and the whole period. From Figure 6, the less tests show that the p-values were below 10% for 22, 16, and 46 assets against the realized GARCH, HAR, and UGARCH models, respectively, over the whole period. In contrast, the greater tests indicate that a couple of assets for the HAR model have significant p-values over the whole period. From these results, although the ERGI does not give significantly better predictive accuracy for all assets, we can conclude that for most assets, the ERGI is at least not worse than the other models, and, for some assets, the ERGI shows significantly better performance than the other models.
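The less and greater tests can be sketched as below. The text does not state which variance estimator was used, so this example makes a simplifying assumption: the loss differentials $d_{i}$ are treated as serially uncorrelated and the plain sample variance is used, with a standard normal reference distribution. All names and numbers are hypothetical.

```python
import math
import numpy as np

def normal_cdf(x):
    """Standard normal CDF via the complementary error function."""
    return 0.5 * math.erfc(-x / math.sqrt(2.0))

def dm_test(e_star, e_other):
    """One-sided Diebold-Mariano style tests for d_i = e*_i^2 - e_i^2.

    Returns (statistic, p_less, p_greater)."""
    e_star = np.asarray(e_star, dtype=float)
    e_other = np.asarray(e_other, dtype=float)
    d = e_star ** 2 - e_other ** 2
    n = d.size
    # Assumption: no autocorrelation correction; a long-run (HAC) variance
    # is a common alternative when d_i is serially dependent.
    se = math.sqrt(d.var(ddof=1) / n)
    if se == 0.0:
        raise ValueError("loss differentials have zero variance")
    stat = d.mean() / se
    p_less = normal_cdf(stat)       # alternative: E[d_i] < 0
    p_greater = normal_cdf(-stat)   # alternative: E[d_i] > 0
    return stat, p_less, p_greater

# Toy residuals (hypothetical): the first model has much smaller errors.
stat_toy, p_less_toy, p_greater_toy = dm_test(
    [0.1, -0.2, 0.1, 0.2], [1.0, -1.1, 0.9, 1.2]
)
```

A small `p_less` favors the first model, and a small `p_greater` favors the competitor.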

To check the volatility persistence of the nonparametric volatility, we studied the residuals between the nonparametric volatility and estimated conditional volatilities, $Vol_{i}-RV_{i}$ , where $Vol_{i}$ is the predicted one-day-ahead conditional expected volatility by ERGI, realized GARCH, HAR, and UGARCH using the in-sample period data. Then, we checked their autocorrelations. Table 5 reports the average rank and number of the first rank of the first order autocorrelation for the ERGI, realized GARCH, HAR, and UGARCH for Period 1, Period 2, and the whole period over the 50 assets. Figure 7 provides the box plots of the first order autocorrelation for the ERGI, realized GARCH, HAR, and UGARCH for Period 1, Period 2, and the whole period over the 50 assets. From Table 5 and Figure 7, we find that the ERGI has relatively small autocorrelations. That is, the ERGI model can reduce the volatility persistence. These numerical results provide evidence to conclude that the non-linear exponential auto-regressive structure helps explain the market dynamics in the volatility time series.
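The first order autocorrelation of a residual series can be computed as in the following sketch. The function name and the toy series are hypothetical, and the normalization by the full-sample sum of squares is one common convention rather than a detail taken from the text.

```python
import numpy as np

def lag1_autocorrelation(x):
    """Sample first order autocorrelation of the series x."""
    x = np.asarray(x, dtype=float)
    xc = x - x.mean()
    denom = np.sum(xc ** 2)
    if denom == 0.0:
        raise ValueError("series is constant")
    return float(np.sum(xc[1:] * xc[:-1]) / denom)

# Toy residuals Vol_i - RV_i (hypothetical): an alternating series.
acf1_toy = lag1_autocorrelation([1.0, -1.0, 1.0, -1.0])
```

Values close to zero indicate that little persistence is left in the residuals.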

Table 5: Average rank of the first order autocorrelation for the ERGI, realized GARCH, HAR and UGARCH for Period 1, Period 2, and the whole period. The number of times each model ranks first among the competitors is reported in parentheses.

Period 1					Period 2					Whole period
ERGI	Real	HAR	UGARCH		ERGI	Real	HAR	UGARCH		ERGI	Real	HAR	UGARCH
1.2 (40)	1.9 (9)	2.9 (1)	3.9 (0)		1.8 (18)	2.3 (14)	2.3 (13)	3.6 (5)		1.6 (28)	2.2 (12)	2.5 (8)	3.6 (2)

6 Conclusions

In this paper, we propose a novel jump diffusion process to model the non-linear auto-regressive structure of the realized volatility. We employ the exponential GARCH structure. By introducing a continuous instantaneous volatility process whose integrated volatility follows the exponential realized GARCH structure, we fill the gap between the empirical discrete-time non-linear volatility model with the realized volatility and high-frequency based continuous-time diffusion process. That is, this paper provides rigorous mathematical background to understand the exponential realized GARCH structure. To estimate the model parameter, we propose the quasi-maximum likelihood estimation procedure and establish its asymptotic properties. From the empirical study, we find the benefits of incorporating the non-linear exponential realized GARCH.

In this paper, we focus on the continuous part of log-return processes for the open-to-close period. However, it is important and interesting to study dynamic structures of the jump variation and close-to-open returns. We leave this for future study.

7 Proofs

7.1 Proof of Theorem 1

Lemma 1.

Under the ERGI model in Definition 1, we have for $t\in(n-1,n]$ ,

\displaystyle\frac{1}{t-n+1}\int_{n-1}^{t}\sigma_{s}^{2}(\theta)ds=\sigma_{n-1}^{2}(\theta)e^{\int_{n-1}^{t}b_{s}(\theta)ds}\text{ a.s.}

(7.1)

Proof of Lemma 1. We have

$\displaystyle d((t-[t])\bar{\sigma}_{t}^{2}(\theta))$	$\displaystyle=$	$\displaystyle(t-[t])d\bar{\sigma}_{t}^{2}(\theta)+\bar{\sigma}_{t}^{2}(\theta)dt$
	$\displaystyle=$	$\displaystyle\sigma_{t}^{2}(\theta)dt$
	$\displaystyle=$	$\displaystyle\bar{\sigma}_{t}^{2}(\theta)\left(1+(t-[t])b_{t}(\theta)\right)dt\text{ a.s.}$

Thus, we have

	$\displaystyle(t-[t])d\bar{\sigma}_{t}^{2}(\theta)$	$\displaystyle=$	$\displaystyle\bar{\sigma}_{t}^{2}(\theta)\left(1+(t-[t])b_{t}(\theta)\right)dt-\bar{\sigma}_{t}^{2}(\theta)dt$
		$\displaystyle=$	$\displaystyle(t-[t])\bar{\sigma}_{t}^{2}(\theta)b_{t}(\theta)dt\text{ a.s.}$

This implies that

\displaystyle d\bar{\sigma}_{t}^{2}(\theta)=\bar{\sigma}_{t}^{2}(\theta)b_{t}(\theta)dt\text{ a.s.}

and

\frac{1}{t-n+1}\int_{n-1}^{t}\sigma_{s}^{2}(\theta)ds=\sigma_{n-1}^{2}(\theta)e^{\int_{n-1}^{t}b_{s}(\theta)ds}\text{ a.s.}\qquad\text{for }t\in(n-1,n].

$\blacksquare$
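Lemma 1 can be checked numerically in a simple special case. The sketch below assumes, for illustration only, a constant drift $b_{t}\equiv c$ on a day starting at time $0$ , so that $\bar{\sigma}_{t}^{2}=\sigma_{0}^{2}e^{ct}$ and $\sigma_{t}^{2}=\bar{\sigma}_{t}^{2}(1+tc)$ as in the first display of the proof. In the model $b_{t}$ is random, so this is a sanity check and not a proof.

```python
import numpy as np

def averaged_variance(t, c, s0, grid_size=200_001):
    """(1/t) * integral_0^t sigma_s^2 ds by the trapezoidal rule, where
    sigma_s^2 = s0 * exp(c * s) * (1 + c * s) (constant-drift assumption)."""
    s = np.linspace(0.0, t, grid_size)
    sigma2 = s0 * np.exp(c * s) * (1.0 + c * s)
    integral = np.sum((sigma2[1:] + sigma2[:-1]) * np.diff(s)) / 2.0
    return float(integral / t)

# Hypothetical values for the drift, initial variance, and time point.
c, s0, t = -0.3, 2.0, 0.7
lhs = averaged_variance(t, c, s0)   # left-hand side of (7.1)
rhs = float(s0 * np.exp(c * t))     # right-hand side of (7.1)
```

The two sides agree up to the discretization error of the quadrature.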

Proof of Theorem 1. First, we consider $(a)$ . By the Itô’s lemma, we have

$\displaystyle R(k)$	$\displaystyle=$	$\displaystyle\int_{n-1}^{n}\frac{(n-t)^{k}}{k!}b_{t}(\theta)dt$
	$\displaystyle=$	$\displaystyle\frac{\omega}{(k+2)!}+\nu\left(\frac{1}{(k+2)!}-\frac{2}{(k+3)!}\right)$
		$\displaystyle+\left\{\frac{1}{(k+1)!}+\frac{\gamma-1}{(k+2)!}\right\}b_{n-1}(\theta)$
		$\displaystyle-\left\{\frac{\beta}{(k+1)!}+\frac{\beta^{*}-\beta}{(k+2)!}-\frac{2\beta^{*}}{(k+3)!}\right\}\log\sigma_{n-1}^{2}(\theta)$
		$\displaystyle+2\nu\int_{n-1}^{n}\frac{(n-t)^{k+2}}{k!(k+2)}Z_{t}dW_{t}+\beta\int_{n-1}^{n}\frac{(n-t)^{k}}{k!}\log\bar{\sigma}_{t}^{2}(\theta)dt\text{ a.s.},$

and, by Lemma 1, we have

	$\displaystyle\int_{n-1}^{n}\frac{(n-t)^{k}}{k!}\log\bar{\sigma}_{t}^{2}(\theta)dt$
	$\displaystyle=\int_{n-1}^{n}\frac{(n-t)^{k}}{k!}\left(\int_{n-1}^{t}b_{s}(\theta)ds+\log\sigma_{n-1}^{2}(\theta)\right)dt$
	$\displaystyle=\frac{1}{(k+1)!}\log\sigma_{n-1}^{2}(\theta)+\int_{n-1}^{n}b_{s}(\theta)\int_{s}^{n}\frac{(n-t)^{k}}{k!}dtds$
	$\displaystyle=\frac{1}{(k+1)!}\log\sigma_{n-1}^{2}(\theta)+R(k+1)\text{ a.s.}$
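The passage from the expression for $R(k)$ to a series for $R(0)$ can be spelled out as follows. Here $A_{k}$ is shorthand introduced only for this sketch: it denotes the right-hand side of the display for $R(k)$ without its final integral and without the $\frac{\beta}{(k+1)!}\log\sigma_{n-1}^{2}(\theta)$ term, which cancels against the identity just derived. The sketch also assumes that $\int_{n-1}^{n}|b_{t}(\theta)|dt$ is finite almost surely.

```latex
\begin{aligned}
R(k) &= A_{k}+\beta R(k+1), \\
R(0) &= \sum_{k=0}^{K-1}\beta^{k}A_{k}+\beta^{K}R(K), \\
\left|\beta^{K}R(K)\right| &\leq \frac{|\beta|^{K}}{K!}\int_{n-1}^{n}|b_{t}(\theta)|\,dt\longrightarrow 0\quad\text{as }K\to\infty .
\end{aligned}
```

Letting $K\to\infty$ gives $R(0)=\sum_{k=0}^{\infty}\beta^{k}A_{k}$ , and writing $\beta^{k}=\beta^{-j}\beta^{k+j}$ matches each factorial with the corresponding power of $\beta$ .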

Thus, we have

$\displaystyle R(0)$	$\displaystyle=$	$\displaystyle\int_{n-1}^{n}b_{t}(\theta)dt$
	$\displaystyle=$	$\displaystyle\sum_{k=0}^{\infty}\left\{\omega\beta^{-2}\frac{\beta^{k+2}}{(k+2)!}+\nu\left(\beta^{-2}\frac{\beta^{k+2}}{(k+2)!}-2\beta^{-3}\frac{\beta^{k+3}}{(k+3)!}\right)\right\}$
		$\displaystyle+\sum_{k=0}^{\infty}b_{n-1}(\theta)\left\{\beta^{-1}\frac{\beta^{k+1}}{(k+1)!}+(\gamma-1)\beta^{-2}\frac{\beta^{k+2}}{(k+2)!}\right\}$
		$\displaystyle-\sum_{k=0}^{\infty}\log\sigma_{n-1}^{2}(\theta)\left\{\left(\beta^{*}-\beta\right)\beta^{-2}\frac{\beta^{k+2}}{(k+2)!}-2\beta^{*}\beta^{-3}\frac{\beta^{k+3}}{(k+3)!}\right\}$
		$\displaystyle+\sum_{k=0}^{\infty}2\nu\int_{n-1}^{n}\beta^{k}\frac{(n-t)^{k+2}(k+1)}{(k+2)!}Z_{t}dW_{t}$
	$\displaystyle=$	$\displaystyle\omega\varrho_{2}+\nu(\varrho_{2}-2\varrho_{3})-\{(\beta^{*}-\beta)\varrho_{2}-2\beta^{*}\varrho_{3}\}\log\sigma_{n-1}^{2}(\theta)$
		$\displaystyle+\{\varrho_{1}+(\gamma-1)\varrho_{2}\}b_{n-1}(\theta)+D_{n}$
	$\displaystyle=$	$\displaystyle\omega\varrho_{2}+\nu(\varrho_{2}-2\varrho_{3})-\log\sigma_{n-1}^{2}(\theta)+\varrho b_{n-1}(\theta)+D_{n}$
	$\displaystyle=$	$\displaystyle h_{n}(\theta)-\log\sigma_{n-1}^{2}(\theta)+D_{n}\text{ a.s.}$

Then, by (7.1), we have

	$\displaystyle\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt$	$\displaystyle=$	$\displaystyle\sigma_{n-1}^{2}\exp\left(\int_{n-1}^{n}b_{t}(\theta)dt\right)$
		$\displaystyle=$	$\displaystyle\exp\left(h_{n}(\theta)+D_{n}\right)\text{ a.s.}$

For $(b)$ , since $\mathbb{E}[\exp(D_{n})]$ is a constant, we have

\displaystyle\int_{n-1}^{n}\sigma_{t}^{2}(\theta)dt=\exp\left(H_{n}(\theta)\right)M_{n},

and we obtain $(b)$ . $\blacksquare$

7.2 Proof of Theorem 2

To simplify the notation, we use $\theta$ for the GARCH model parameters $\theta^{g}$ . For derivatives of any given function $f$ at $x_{0}$ , we denote $\frac{\partial f(x_{0})}{\partial x}=\frac{\partial f(x)}{\partial x}\bigr{|}_{x=x_{0}}$ . Define

	$\displaystyle\widehat{L}_{n,m}(\theta)=-\frac{1}{n}\sum_{i=1}^{n}\left\{\widehat{H}_{i}(\theta)+\frac{RV_{i}}{\exp\left(\widehat{H}_{i}(\theta)\right)}\right\}\quad\text{ and }\quad\widehat{s}_{n,m}(\theta)=\frac{\partial\widehat{L}_{n,m}(\theta)}{\partial\theta};$
	$\displaystyle\widehat{L}_{n}(\theta)=-\frac{1}{n}\sum_{i=1}^{n}\left\{H_{i}(\theta)+\frac{\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt}{\exp\left(H_{i}(\theta)\right)}\right\}\quad\text{ and }\quad\widehat{s}_{n}(\theta)=\frac{\partial\widehat{L}_{n}(\theta)}{\partial\theta};$
	$\displaystyle L_{n}(\theta)=-\frac{1}{n}\sum_{i=1}^{n}\left\{H_{i}(\theta)+\frac{\exp\left(H_{i}(\theta_{0})\right)}{\exp\left(H_{i}(\theta)\right)}\right\}\quad\text{ and }\quad s_{n}(\theta)=\frac{\partial L_{n}(\theta)}{\partial\theta}.$

Since the dependence of $H_{i}(\theta)$ on the initial value decays exponentially (Kim and Wang,, 2016), we use the true initial value $H_{0}(\theta_{0})$ for the rest of the proofs without loss of generality.
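To illustrate the quasi-likelihood $\widehat{L}_{n,m}(\theta)$ defined above, the sketch below evaluates it with the recursion $\widehat{H}_{i+1}(\theta)=\omega^{g}+\gamma\widehat{H}_{i}(\theta)+\beta^{g}\log RV_{i}$ . The function name, the starting value `h_start`, and the noise-free toy data (in which $RV_{i}=\exp(H_{i}(\theta_{0}))$ exactly) are assumptions made only for this example.

```python
import math

def quasi_log_likelihood(theta, rv, h_start):
    """-(1/n) * sum_i { H_i(theta) + RV_i / exp(H_i(theta)) }."""
    omega_g, gamma, beta_g = theta
    h = h_start  # value used for H_1 (assumed known here)
    total = 0.0
    for rv_i in rv:
        total += h + rv_i / math.exp(h)
        h = omega_g + gamma * h + beta_g * math.log(rv_i)  # next-day H
    return -total / len(rv)

# Noise-free toy data generated from hypothetical "true" parameters.
theta_true = (-0.1, 0.3, 0.5)
h = 0.0
rv_toy = []
for _ in range(50):
    rv_toy.append(math.exp(h))
    h = theta_true[0] + theta_true[1] * h + theta_true[2] * math.log(rv_toy[-1])

ll_true = quasi_log_likelihood(theta_true, rv_toy, 0.0)
ll_shifted = quasi_log_likelihood((-0.2, 0.3, 0.5), rv_toy, 0.0)
```

In this noise-free setting each summand is maximized when $H_{i}(\theta)=H_{i}(\theta_{0})$ , so the value at the generating parameters exceeds the value at the shifted ones, which mirrors the identification argument in the proof of Lemma 2.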

Lemma 2.

Under the assumptions of Theorem 2, we have $\widehat{\theta}\xrightarrow{p}\theta_{0}$ .

Proof of Lemma 2. We first show the uniform convergence of $\widehat{L}_{n,m}(\theta)$ . That is, we need to show

	$\displaystyle\sup_{\theta\in\Theta}\left\|\widehat{L}_{n,m}(\theta)-L_{n}(\theta)\right\|$	$\displaystyle\leq\sup_{\theta\in\Theta}\left\|\widehat{L}_{n,m}(\theta)-\widehat{L}_{n}(\theta)\right\|+\sup_{\theta\in\Theta}\left\|\widehat{L}_{n}(\theta)-L_{n}(\theta)\right\|$
		$\displaystyle=o_{p}(1).$

For $\sup_{\theta\in\Theta}\left|\widehat{L}_{n,m}(\theta)-\widehat{L}_{n}(\theta)\right|$ , we have

	$\displaystyle\sup_{\theta\in\Theta}\left\|\widehat{L}_{n,m}(\theta)-\widehat{L}_{n}(\theta)\right\|$
	$\displaystyle\leq\sup_{\theta\in\Theta}\Bigg{\{}\frac{1}{n}\left\|\sum_{i=1}^{n}\widehat{H}_{i}(\theta)-H_{i}(\theta)\right\|+\frac{1}{n}\left\|\sum_{i=1}^{n}\frac{RV_{i}-\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt}{\exp\left(\widehat{H}_{i}(\theta)\right)}\right\|$
	$\displaystyle\qquad\qquad+\frac{1}{n}\left\|\sum_{i=1}^{n}\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt\left(e^{-\widehat{H}_{i}(\theta)}-e^{-H_{i}(\theta)}\right)\right\|\Bigg{\}}$
	$\displaystyle=\text{(I)}+\text{(II)}+\text{(III)}.$

For (I), we have

$\displaystyle\mathbb{E}[\text{(I)}]$	$\displaystyle\leq$	$\displaystyle\frac{1}{n}\sum_{i=1}^{n}\mathbb{E}\left[\Bigg{\|}\sup_{\theta\in\Theta}\sum_{k=1}^{i-1}\beta\gamma^{k-1}\left(\log RV_{i-k}-\log IV_{i-k}\right)\Bigg{\|}\right]$	(7.2)
	$\displaystyle\leq$	$\displaystyle\frac{C}{n}\sum_{i=1}^{n}\sum_{k=1}^{i-1}\gamma^{k-1}_{u}\mathbb{E}\left[\|\log RV_{i-k}-\log IV_{i-k}\|\right]$	(7.3)
	$\displaystyle\leq$	$\displaystyle Cm^{-1/4},$	(7.4)

where $IV_{i}=\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt$ , and the last inequality is due to Assumption 1 (b). Consider (II). By Assumption 1 (b) and (c), we have

\displaystyle\mathbb{E}[\text{(II)}]\leq\frac{C}{n}\sum_{i=1}^{n}\mathbb{E}\left[\left|RV_{i}-IV_{i}\right|^{2}\right]\mathbb{E}\left[\sup_{\theta\in\Theta}\left|\exp\left(|\widehat{H}_{i}(\theta)|\right)\right|^{2}\right]\leq Cm^{-1/4}.

For (III), we have

$\displaystyle\mathbb{E}[\text{(III)}]$	$\displaystyle\leq$	$\displaystyle\frac{C}{n}\sum_{i=1}^{n}\mathbb{E}\left[\sup_{\theta\in\Theta}\left(e^{-\widehat{H}_{i}(\theta)}-e^{-H_{i}(\theta)}\right)^{2}\right]^{1/2}$
	$\displaystyle\leq$	$\displaystyle\frac{C}{n}\sum_{i=1}^{n}\mathbb{E}\left[\sup_{\theta\in\Theta}\left(e^{4\|\widehat{H}_{i}(\theta)\|}+e^{4\|H_{i}(\theta)\|}\right)\right]^{1/4}\mathbb{E}\left[\sup_{\theta\in\Theta}\left(\widehat{H}_{i}(\theta)-H_{i}(\theta)\right)^{4}\right]^{1/4}$
	$\displaystyle\leq$	$\displaystyle Cm^{-1/4},$

where the first and second inequalities are due to Hölder’s inequality and Taylor’s expansion, respectively, and the last inequality can be shown similarly to the proof of (7.2) with Assumption 1(c). Thus, we have

\sup_{\theta\in\Theta}\left|\widehat{L}_{n,m}(\theta)-\widehat{L}_{n}(\theta)\right|=o_{p}(1).

(7.5)

We consider $\sup_{\theta\in\Theta}\left|\widehat{L}_{n}(\theta)-L_{n}(\theta)\right|$ . We have

\displaystyle\widehat{L}_{n}(\theta)-L_{n}(\theta)=-\frac{1}{n}\sum_{i=1}^{n}\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta)}}(M_{i}-1),

which is a martingale process for any given $\theta$ . Thus, by the martingale convergence theorem, we can show its pointwise convergence. To show its uniform convergence, we need to show the stochastic continuity of $G_{n}(\theta)=\widehat{L}_{n}(\theta)-L_{n}(\theta)$ . By Taylor’s expansion and the mean value theorem, there exists $\theta^{*}$ between $\theta$ and $\theta^{\prime}$ such that

	$\displaystyle\|G_{n}(\theta)-G_{n}(\theta^{\prime})\|$	$\displaystyle=$	$\displaystyle\left\|\frac{1}{n}\sum_{i=1}^{n}\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta^{*})}}\frac{\partial H_{i}(\theta^{*})}{\partial\theta}(M_{i}-1)(\theta-\theta^{\prime})\right\|$
		$\displaystyle\leq$	$\displaystyle C\frac{1}{n}\sum_{i=1}^{n}\sup_{\theta^{*}\in\Theta}\left\|\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta^{*})}}\frac{\partial H_{i}(\theta^{*})}{\partial\theta}(M_{i}-1)\right\|_{\max}\|\theta-\theta^{\prime}\|_{\max}.$

By Assumption 1(c), we have $\mathbb{E}\left[e^{4|H_{i}(\theta)|}\right]\leq C$ . Then, similar to the proofs of Lemma 3 in Kim and Wang, (2016), we can show

\frac{1}{n}\sum_{i=1}^{n}\sup_{\theta^{*}\in\Theta}\left\|\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta^{*})}}\frac{\partial H_{i}(\theta^{*})}{\partial\theta}(M_{i}-1)\right\|_{\max}=O_{p}(1).

Thus, $G_{n}(\theta)$ satisfies the weak Lipschitz condition, so, by Theorem 4 in Andrews, (1992), we can show the uniform convergence. Therefore, we have

\sup_{\theta\in\Theta}\left|\widehat{L}_{n}(\theta)-L_{n}(\theta)\right|=o_{p}(1).

(7.6)

By (7.5) and (7.6), we show the uniform convergence of $\widehat{L}_{n,m}(\theta)$ .

When $\exp(H_{i}(\theta_{0}))=\exp(H_{i}(\theta))$ for all $i$ , $L_{n}(\theta)$ is maximized. Obviously, $\theta_{0}$ is one of the solutions. Suppose that there exists $\theta_{*}$ such that $\exp(H_{i}(\theta_{0}))=\exp(H_{i}(\theta_{*}))$ a.s. for all $i$ . Since the exponential function is a strictly increasing function, we have $H_{i}(\theta_{0})-H_{i}(\theta_{*})=0$ a.s. for all $i$ . Thus, $\theta_{0}$ and $\theta_{*}=(\omega_{*}^{g},\gamma_{*},\beta_{*}^{g})$ satisfy

\begin{pmatrix}1&H_{1}(\theta_{0})&\log IV_{1}\\ 1&H_{2}(\theta_{0})&\log IV_{2}\\ \vdots&\vdots&\vdots\\ 1&H_{n-1}(\theta_{0})&\log IV_{n-1}\end{pmatrix}\begin{pmatrix}\omega_{*}^{g}-\omega_{0}^{g}\\ \gamma_{*}-\gamma_{0}\\ \beta_{*}^{g}-\beta_{0}^{g}\end{pmatrix}\equiv M\begin{pmatrix}\omega_{*}^{g}-\omega_{0}^{g}\\ \gamma_{*}-\gamma_{0}\\ \beta_{*}^{g}-\beta_{0}^{g}\end{pmatrix}=0\text{ a.s.}

Since $IV_{i}$ ’s are non-degenerating, $M$ is of full rank. Then, $M^{\top}M$ is invertible, which implies $\theta_{0}=\theta_{*}$ a.s. Therefore, $L_{n}(\theta)$ has the unique maximizer $\theta_{0}$ . Then, by Theorem 1 in Xiu, (2010), with the uniform convergence of $\widehat{L}_{n,m}(\theta)$ , we can show $\widehat{\theta}\overset{p}{\to}\theta_{0}$ . $\blacksquare$

Proof of Theorem 2. By the mean value theorem and Taylor’s expansion, there exists $\theta^{*}$ between $\widehat{\theta}$ and $\theta_{0}$ such that

\widehat{s}_{n,m}(\widehat{\theta})-\widehat{s}_{n,m}(\theta_{0})=-\widehat{s}_{n,m}(\theta_{0})=\triangledown\widehat{s}_{n,m}(\theta^{*})(\widehat{\theta}-\theta_{0}),

where $\triangledown\widehat{s}_{n,m}(\theta^{*})=\frac{\partial\widehat{s}_{n,m}(\theta^{*})}{\partial\theta^{\top}}$ . We first consider $\widehat{s}_{n,m}(\theta_{0})$ . We have

$\displaystyle-\widehat{s}_{n,m}(\theta_{0})$	$\displaystyle=$	$\displaystyle\frac{1}{n}\sum_{i=1}^{n}\left\{1-e^{-\widehat{H}_{i}(\theta_{0})}RV_{i}\right\}\frac{\partial\widehat{H}_{i}(\theta_{0})}{\partial\theta}$	(7.7)
	$\displaystyle=$	$\displaystyle\frac{1}{n}\sum_{i=1}^{n}\left\{1-e^{-H_{i}(\theta_{0})}IV_{i}\right\}\frac{\partial H_{i}(\theta_{0})}{\partial\theta}+O_{p}(m^{-1/4})$	(7.8)
	$\displaystyle=$	$\displaystyle\frac{1}{n}\sum_{i=1}^{n}(1-M_{i})\frac{\partial H_{i}(\theta_{0})}{\partial\theta}+O_{p}(m^{-1/4})$	(7.9)
	$\displaystyle=$	$\displaystyle O_{p}(n^{-1/2}+m^{-1/4}),$	(7.10)

where the second equality can be shown similarly to the proof of Lemma 2, and the last equality is due to the martingale convergence theorem.

We consider $\triangledown\widehat{s}_{n,m}(\theta^{*})$ . Similar to the proofs of (7.7) with the consistency of $\widehat{\theta}$ , we can show

$\displaystyle\triangledown\widehat{s}_{n,m}(\theta^{*})$	$\displaystyle=$	$\displaystyle-\frac{1}{n}\sum_{i=1}^{n}\left\{1-e^{-\widehat{H}_{i}(\theta^{*})}RV_{i}\right\}\frac{\partial^{2}\widehat{H}_{i}(\theta^{*})}{\partial\theta\partial\theta^{\top}}-\frac{1}{n}\sum_{i=1}^{n}e^{-\widehat{H}_{i}(\theta^{*})}RV_{i}\frac{\partial\widehat{H}_{i}(\theta^{*})}{\partial\theta}\frac{\partial\widehat{H}_{i}(\theta^{*})}{\partial\theta^{\top}}$
	$\displaystyle=$	$\displaystyle-\frac{1}{n}\sum_{i=1}^{n}e^{-\widehat{H}_{i}(\theta^{*})}RV_{i}\frac{\partial\widehat{H}_{i}(\theta^{*})}{\partial\theta}\frac{\partial\widehat{H}_{i}(\theta^{*})}{\partial\theta^{\top}}+o_{p}(1)$
	$\displaystyle=$	$\displaystyle-\frac{1}{n}\sum_{i=1}^{n}M_{i}\frac{\partial H_{i}(\theta_{0})}{\partial\theta}\frac{\partial H_{i}(\theta_{0})}{\partial\theta^{\top}}+o_{p}(1).$

Since $IV_{i}$ ’s and $M_{i}$ ’s are non-degenerating, $\frac{1}{n}\sum_{i=1}^{n}M_{i}\frac{\partial H_{i}(\theta_{0})}{\partial\theta}\frac{\partial H_{i}(\theta_{0})}{\partial\theta^{\top}}$ is positive definite. Thus, by (7.7), we have

\widehat{\theta}-\theta_{0}=O_{p}(n^{-1/2}+m^{-1/4}).

Now, we show the asymptotic normality. By Theorem 1(b), we have

	$\displaystyle H_{n}(\theta)$	$\displaystyle=$	$\displaystyle\omega^{g}+(\gamma+\beta^{g})H_{n-1}(\theta)+\beta^{g}\log M_{n-1}$
		$\displaystyle=$	$\displaystyle\frac{\omega^{g}}{1-\gamma-\beta^{g}}+\sum_{i=1}^{\infty}\beta^{g}(\gamma+\beta^{g})^{i-1}\log M_{n-i}.$

Since $\log M_{i}$ ’s are i.i.d., $(H_{n}(\theta),M_{n})$ is strictly stationary. By Theorem 2.1 (Francq et al.,, 2013) and Theorem 2.5 (Bougerol and Picard,, 1992), $(H_{n}(\theta),M_{n})$ is ergodic. Then, applying the martingale central limit theorem, we obtain

\frac{1}{\sqrt{n}}\sum_{i=1}^{n}(1-M_{i})\frac{\partial H_{i}(\theta_{0})}{\partial\theta}\overset{d}{\to}N(0,AV).

By the ergodic theorem, we can show

-\triangledown\widehat{s}_{n,m}(\theta^{*})\overset{p}{\to}V.

Thus, by the Slutsky theorem, we have

\sqrt{n}(\widehat{\theta}-\theta_{0})\overset{d}{\to}N(0,AV^{-1}).
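The last step combines the mean value expansion with the two limits above. As a sketch, and under two readings that are not restated in this proof, namely that $A$ is a scalar and that $\sqrt{n}\,m^{-1/4}\to 0$ so that the discretization error in (7.9) is negligible after scaling, rearranging the expansion gives

```latex
\begin{aligned}
\sqrt{n}(\widehat{\theta}-\theta_{0})
&= \left\{-\triangledown\widehat{s}_{n,m}(\theta^{*})\right\}^{-1}\sqrt{n}\,\widehat{s}_{n,m}(\theta_{0}) \\
&= -\left\{-\triangledown\widehat{s}_{n,m}(\theta^{*})\right\}^{-1}\frac{1}{\sqrt{n}}\sum_{i=1}^{n}(1-M_{i})\frac{\partial H_{i}(\theta_{0})}{\partial\theta}+o_{p}(1) \\
&\overset{d}{\to} N\left(0,\,V^{-1}(AV)V^{-1}\right)=N(0,AV^{-1}).
\end{aligned}
```

The final equality uses the symmetry of $V$ and the fact that a scalar $A$ commutes with $V^{-1}$ .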

$\blacksquare$

Acknowledgments

The author thanks the Editor, Professor D. Kristensen, and two anonymous referees for their careful reading of this paper and valuable comments. The research of Donggyu Kim was supported in part by the National Research Foundation of Korea (NRF) (2021R1C1C1003216).

References

Aït-Sahalia et al., (2010) Aït-Sahalia, Y., Fan, J., and Xiu, D. (2010). High-frequency covariance estimates with noisy and asynchronous financial data. Journal of the American Statistical Association, 105(492):1504–1517.
Aït-Sahalia and Xiu, (2016) Aït-Sahalia, Y. and Xiu, D. (2016). Increased correlation among asset classes: Are volatility or jumps to blame, or both? Journal of Econometrics, 194(2):205–219.
(3) Andersen, T. G. and Bollerslev, T. (1997a). Heterogeneous information arrivals and return volatility dynamics: Uncovering the long-run in high frequency returns. The journal of Finance, 52(3):975–1005.
(4) Andersen, T. G. and Bollerslev, T. (1997b). Intraday periodicity and volatility persistence in financial markets. Journal of empirical finance, 4(2-3):115–158.
(5) Andersen, T. G. and Bollerslev, T. (1998a). Answering the skeptics: Yes, standard volatility models do provide accurate forecasts. International Economic Review, 39(4):885–905.
(6) Andersen, T. G. and Bollerslev, T. (1998b). Deutsche mark-dollar volatility: Intraday activity patterns, macroeconomic announcements, and longer run dependencies. The journal of Finance, 53(1):219–265.
Andersen et al., (2003) Andersen, T. G., Bollerslev, T., Diebold, F. X., and Labys, P. (2003). Modeling and forecasting realized volatility. Econometrica, 71(2):579–625.
Andrews, (1992) Andrews, D. W. (1992). Generic uniform convergence. Econometric theory, 8(2):241–257.
Barndorff-Nielsen et al., (2008) Barndorff-Nielsen, O. E., Hansen, P. R., Lunde, A., and Shephard, N. (2008). Designing realized kernels to measure the ex post variation of equity prices in the presence of noise. Econometrica, 76(6):1481–1536.
Bollerslev, (1986) Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31(3):307–327.
Bougerol and Picard, (1992) Bougerol, P. and Picard, N. (1992). Strict stationarity of generalized autoregressive processes. The Annals of Probability, 20(4):1714–1730.
Campbell and Thompson, (2008) Campbell, J. Y. and Thompson, S. B. (2008). Predicting excess stock returns out of sample: Can anything beat the historical average? The Review of Financial Studies, 21(4):1509–1531.
Corsi, (2009) Corsi, F. (2009). A simple approximate long-memory model of realized volatility. Journal of Financial Econometrics, 7(2):174–196.
Diebold and Mariano, (2002) Diebold, F. X. and Mariano, R. S. (2002). Comparing predictive accuracy. Journal of Business & Economic Statistics, 20(1):134–144.
Engle, (1982) Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica: Journal of the Econometric Society, pages 987–1007.
Fan and Kim, (2018) Fan, J. and Kim, D. (2018). Robust high-dimensional volatility matrix estimation for high-frequency factor model. Journal of the American Statistical Association, 113(523):1268–1283.
Fan and Wang, (2007) Fan, J. and Wang, Y. (2007). Multi-scale jump and volatility analysis for high-frequency financial data. Journal of the American Statistical Association, 102(480):1349–1362.
Francq et al., (2013) Francq, C., Wintenberger, O., and Zakoian, J.-M. (2013). GARCH models without positivity constraints: Exponential or log GARCH? Journal of Econometrics, 177(1):34–46.
Hansen and Huang, (2016) Hansen, P. R. and Huang, Z. (2016). Exponential GARCH modeling with realized measures of volatility. Journal of Business & Economic Statistics, 34(2):269–287.
Hansen et al., (2012) Hansen, P. R., Huang, Z., and Shek, H. H. (2012). Realized GARCH: a joint model for returns and realized measures of volatility. Journal of Applied Econometrics, 27(6):877–906.
Jacod et al., (2009) Jacod, J., Li, Y., Mykland, P. A., Podolskij, M., and Vetter, M. (2009). Microstructure noise in the continuous case: the pre-averaging approach. Stochastic Processes and their Applications, 119(7):2249–2276.
Kawakatsu, (2006) Kawakatsu, H. (2006). Matrix exponential GARCH. Journal of Econometrics, 134(1):95–128.
Kim and Fan, (2019) Kim, D. and Fan, J. (2019). Factor GARCH-Itô models for high-frequency data with application to large volatility matrix prediction. Journal of Econometrics, 208(2):395–417.
Kim and Wang, (2016) Kim, D. and Wang, Y. (2016). Unified discrete-time and continuous-time models and statistical inferences for merged low-frequency and high-frequency financial data. Journal of Econometrics, 194:220–230.
Kim et al., (2016) Kim, D., Wang, Y., and Zou, J. (2016). Asymptotic theory for large volatility matrix estimation based on high-frequency financial data. Stochastic Processes and their Applications, 126:3527–3577.
Nelson, (1991) Nelson, D. B. (1991). Conditional heteroskedasticity in asset returns: A new approach. Econometrica: Journal of the Econometric Society, pages 347–370.
Shephard and Sheppard, (2010) Shephard, N. and Sheppard, K. (2010). Realising the future: forecasting with high-frequency-based volatility (HEAVY) models. Journal of Applied Econometrics, 25(2):197–231.
Shin et al., (2021) Shin, M., Kim, D., and Fan, J. (2021). Adaptive robust large volatility matrix estimation based on high-frequency financial data. Available at SSRN 3793394.
Song et al., (2021) Song, X., Kim, D., Yuan, H., Cui, X., Lu, Z., Zhou, Y., and Wang, Y. (2021). Volatility analysis with realized GARCH-Itô models. Journal of Econometrics, 222(1):393–410.
Tao et al., (2011) Tao, M., Wang, Y., Yao, Q., and Zou, J. (2011). Large volatility matrix inference via combining low-frequency and high-frequency approaches. Journal of the American Statistical Association, 106(495):1025–1040.
Xiu, (2010) Xiu, D. (2010). Quasi-maximum likelihood estimation of volatility with high frequency data. Journal of Econometrics, 159(1):235–250.
Zhang, (2006) Zhang, L. (2006). Efficient estimation of stochastic volatility using noisy observations: A multi-scale approach. Bernoulli, 12(6):1019–1043.
Zhang et al., (2005) Zhang, L., Mykland, P. A., and Aït-Sahalia, Y. (2005). A tale of two time scales: Determining integrated volatility with noisy high-frequency data. Journal of the American Statistical Association, 100(472):1394–1411.
Zhang et al., (2016) Zhang, X., Kim, D., and Wang, Y. (2016). Jump variation estimation with noisy high frequency financial data via wavelets. Econometrics, 4(3):34.

\begin{align*}
\sup_{\theta\in\Theta}\left\|\widehat{L}_{n,m}(\theta)-\widehat{L}_{n}(\theta)\right\|
&\leq\sup_{\theta\in\Theta}\Bigg\{\frac{1}{n}\left\|\sum_{i=1}^{n}\left(\widehat{H}_{i}(\theta)-H_{i}(\theta)\right)\right\|+\frac{1}{n}\left\|\sum_{i=1}^{n}\frac{RV_{i}-\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt}{\exp\left(\widehat{H}_{i}(\theta)\right)}\right\| \\
&\qquad\qquad+\frac{1}{n}\left\|\sum_{i=1}^{n}\int_{i-1}^{i}\sigma_{t}^{2}(\theta_{0})dt\left(e^{-\widehat{H}_{i}(\theta)}-e^{-H_{i}(\theta)}\right)\right\|\Bigg\} \\
&=\text{(I)}+\text{(II)}+\text{(III)}.
\end{align*}

\begin{align*}
\|G_{n}(\theta)-G_{n}(\theta^{\prime})\|
&=\left\|\frac{1}{n}\sum_{i=1}^{n}\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta^{*})}}\frac{\partial H_{i}(\theta^{*})}{\partial\theta}(M_{i}-1)(\theta-\theta^{\prime})\right\| \\
&\leq C\frac{1}{n}\sum_{i=1}^{n}\sup_{\theta^{*}\in\Theta}\left\|\frac{e^{H_{i}(\theta_{0})}}{e^{H_{i}(\theta^{*})}}\frac{\partial H_{i}(\theta^{*})}{\partial\theta}(M_{i}-1)\right\|_{\max}\|\theta-\theta^{\prime}\|_{\max}.
\end{align*}