This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Permutation-based tests for discontinuities in event studiesthanks: We thank the Coeditor and two anonymous referees for comments and suggestions that have greatly improved the manuscript.

Federico A. Bugni
Department of Economics
Northwestern University
[email protected]
   Jia Li
School of Economics
Singapore Management University
[email protected]
   Qiyuan Li
School of Economics
Singapore Management University
[email protected]
Abstract

We propose using a permutation test to detect discontinuities in an underlying economic model at a known cutoff point. Relative to the existing literature, we show that this test is well suited for event studies based on time-series data. The test statistic measures the distance between the empirical distribution functions of observed data in two local subsamples on the two sides of the cutoff. Critical values are computed via a standard permutation algorithm. Under a high-level condition that the observed data can be coupled by a collection of conditionally independent variables, we establish the asymptotic validity of the permutation test, allowing the sizes of the local subsamples to be either be fixed or grow to infinity. In the latter case, we also establish that the permutation test is consistent. We demonstrate that our high-level condition can be verified in a broad range of problems in the infill asymptotic time-series setting, which justifies using the permutation test to detect jumps in economic variables such as volatility, trading activity, and liquidity. These potential applications are illustrated in an empirical case study for selected FOMC announcements during the ongoing COVID-19 pandemic.

KEYWORDS: event study, infill asymptotics, jump, permutation tests, randomization tests, semimartingale.

JEL classification codes: C12, C14, C22, C32.

1 Introduction

Many econometric problems can be expressed in terms of the continuity or the discontinuity of certain component in the underlying economic model. In an influential paper, Chow (1960) tested the temporal stability in the demand for automobiles, and subsequently stimulated a large literature on structural breaks in time series analysis; see, for example, Andrews (1993), Stock (1994), Bai and Perron (1998), and many references therein. In microeconometrics, the regression discontinuity design (RDD) has been extensively used for causal inference. This literature identifies and estimates an average treatment effect by evaluating discontinuities of conditional expectation functions of outcome and treatment variables at a cutoff point of the running variable; see Imbens and Lemieux (2008) and Lee and Lemieux (2010) for comprehensive reviews.111Coincidentally, the RDD was first proposed by Thistlethwaite and Campbell (1960) around the same time as the Chow test. Meanwhile, a more recent high-frequency financial econometrics literature has been devoted to studying discontinuities, or jumps, in various financial time series (e.g., price, volatility, trading activity, etc.). The high-frequency jump literature is pioneered by Barndorff-Nielsen and Shephard (2006), who propose the first nonparametric test for asset price jumps using high-frequency data in an infill asymptotic setting. More recently, Bollerslev et al. (2018) study the jumps of volatility and trading intensity in high-frequency jump regressions (Li et al. (2017)) that closely resemble the classical RDD.

Although these strands of literature involve apparently different terminology and technical tools, they share a common theme: The econometric goal is to learn about differences in the data generating processes between two subsamples separated by the cutoff. Imbens and Kalyanaraman (2011) emphasize that these subsamples should be “local” to the cutoff point, which is quite natural given the nonparametric nature of discontinuity inference (Hahn et al. (2001)). The issue under study is thus a local version of the classical two-sample problem. Correspondingly, the related inference is often carried out using nonparametric two-sample t-tests, which are based on kernel regressions in the RDD (Hahn et al. (2001), Imbens and Kalyanaraman (2011), Calonico et al. (2014)) or, in the same spirit, spot high-frequency estimators (Foster and Nelson (1996), Comte and Renault (1998), Jacod and Protter (2012), Li et al. (2017), Bollerslev et al. (2018)) in the infill time-series setting.

In an ideal scenario in which the subsamples separated by the cutoff are i.i.d. and independent of each other, the permutation test is an excellent tool to detect differences in their distributions. In particular, standard results for randomization inference (Lehmann and Romano (2005, Chapter 15.2)) indicate that a permutation test implemented with any arbitrary test statistic is finite-sample valid under these conditions. The recent literature has investigated the properties of permutation tests to detect differences between two samples under less ideal conditions. One example is Canay and Kamat (2017), who consider an RDD and show that permutation-based inference is asymptotically valid to detect discontinuities in the distribution of the baseline covariates at the cutoff. These authors implement their test with a finite number of observations that are located closest to the cutoff, effectively forcing them to concentrate on a small neighborhood of the cutoff as the sample size grows. In the same spirit, Cattaneo et al. (2017) propose using permutation-based inference to detect discontinuities at the cutoff under the “local randomization framework” introduced in Cattaneo et al. (2015). Outside of the RDD literature, Chung and Romano (2013) and DiCiccio and Romano (2017) investigate the asymptotic properties of permutation-based inference to test for differences in specific distributional features of two samples, such as the mean or the correlation coefficient. It is important to note that all of the references mentioned in this paragraph presume cross-sectional data.

In the context of time-series applications, there is an active literature on change-point tests implemented via permutations. This approach was first suggested by Antoch and Hušková (2001) and later pursued by other authors. See Hušková (2004) or Horváth and Rice (2014) for surveys of this literature. While most of this literature imposes independent errors, some allow for limited forms of weak dependence (Kirch and Steinebach (2006), Kirch (2007), and Jentsch and Pauly (2015)). In contrast, our econometric setting accommodates essentially unrestricted persistence and nonstationarity in the underlying state processes (e.g., volatility), which better suits our interest on their dynamics over short time windows around economic news events. In the context of machine-learning methods, Chernozhukov et al. (2018) propose using permutations to implement conformal inference that allows for time series data.

Set against this background, our main goal in this paper is to establish a general theory for permutation-based discontinuity tests, with a special emphasis on event studies based on time-series data. To capture the “local” nature of this problem, we adopt an infill asymptotic framework, under which the inference concentrates on observations “close” to the event time. Specifically, we consider the Cramér-von Mises test statistic formed as the squared L2L_{2} distance between the empirical cumulative distribution functions for the two local subsamples near the cutoff, and compute the critical value via a standard permutation algorithm. As explained earlier, if the data were i.i.d., the behavior of this permutation test would follow directly from standard results for randomization inference. This “off-the-shelf” theory, however, is not applicable here because time-series data observed in a short event window can be serially highly dependent.

The main theoretical contribution of the present paper is to establish the asymptotic validity of the permutation test in this non-standard setting. The theory has two components. The first is a new general result for permutation test. Specifically, we link the (feasible) permutation test formed using the original data with an infeasible test constructed in a “coupling” problem that involves conditionally i.i.d. coupling variables. Since the latter resembles the classical two-sample problem, the infeasible test controls size exactly under the coupling null hypothesis (i.e., coupling variables in the two subsamples are homogeneous), and is consistent under the complementary alternative hypothesis. Under a proper notion of coupling, which is customized for the permutation test, we show that the feasible test inherits the same asymptotic rejection properties from the infeasible one. Since this result is of independent theoretical interest that is well beyond our subsequent analysis in the infill time-series setting, we frame the theory under general high-level conditions so as to facilitate other types of applications.

The second component of our analysis pertains to specializing the general result to the infill time-series setting designed for event-study applications. The event-study framework is particularly relevant for studying macroeconomic and financial shocks, including monetary shocks triggered by FOMC announcements (Cochrane and Piazzesi (2002), Nakamura and Steinsson (2018a)), or “natural disasters” such as the ongoing COVID-19 pandemic. Following Li and Xiu (2016) and Bollerslev et al. (2018), we model observed data using a general state-space framework, in which the observations are discretely sampled from a latent state process “contaminated” by random disturbances. This model has been used to model variables such as asset returns, trading volume, duration, and bid-ask spread, and readily accommodates both continuously and discretely valued variables. Under this state-space model, the temporal discontinuity in the data’s distribution is mainly driven by the jump of the latent state process (e.g., asset volatility, trading intensity, and propensity of informed trading), which can be detected by the permutation test. Under easy-to-verify primitive conditions, we construct coupling variables and apply the aforementioned general theory to establish the permutation test’s asymptotic validity.

We recognize two advantages of the proposed permutation test in comparison with the standard approach based on the nonparametric “spot” estimation of the underlying state process. Firstly, the permutation test attains asymptotic size control even if the number of observations in each subsample is fixed.222For similar type of results in the context of RDD; see Cattaneo et al. (2015), Cattaneo et al. (2017), Canay and Kamat (2017), and Bugni and Canay (2021). This remarkable property is reminiscent of the finite-sample exactness of the permutation test in the classical two-sample problem for i.i.d. data. In contrast, the nonparametric estimation approach works in a fundamentally different way, as it relies on the asymptotic (mixed) normality of the estimator, which in turn requires the sizes of the local subsamples to grow to infinity. In empirical applications, however, it is often desirable to use a short time window, either to reduce the effect of confounding factors in the background, or simply because of the lack of observations soon after the occurrence of the economic event (say, in a real-time research situation). Not surprisingly, the conventional inference based on asymptotic Gaussianity often results in large size distortions in this “small-sample” scenario, as we demonstrate concretely in a realistically calibrated Monte Carlo experiment (see Section 3). Meanwhile, the permutation test exhibits much more robust size control in finite samples.

The second advantage of the permutation test is its versatility: The same test can be applied in many different empirical contexts without any modification. On the other hand, the nonparametric estimation approach often relies on specific features of the problem, and needs to be designed on a case-by-case basis. Therefore, the proposed permutation test may be particularly attractive in new empirical environments for which tests based on the conventional approach are not yet developed or not yet well-understood. In Section 2.2, we illustrate this point more concretely in the context of testing for volatility jumps. In that case, the standard approach relies crucially on the assumption that the price shocks are Brownian in its design of the spot volatility estimator and the associated t-statistic, and it cannot be adapted easily to accommodate a more general setting with Lévy-driven shocks.333As explained by Barndorff-Nielsen and Shephard (2001), these more general processes offer the possibility of capturing important deviations from Brownian shocks and for flexible modelling of dependence structures. However, to the best of our knowledge, the estimation and inference of the spot volatility (i.e., the scaling process) in the non-Brownian case remains to be an open question in the literature. There is some limited work on the inference of integrated volatility functionals for the non-Brownian case (see Todorov and Tauchen (2012)) which demonstrates various distinct complications in the non-Brownian setting. The permutation test, on the other hand, is valid even in the latter, more general, setting.

That being said, we stress that the proposed permutation test is a complement, rather than substitute, for the conventional nonparametric estimation method, because it has two limitations. One is that the permutation test focuses exclusively on hypothesis testing, without producing a point estimate for the jump of the state process (e.g., volatility) of interest, whereas the estimate is a by-product of the conventional approach. In addition, the proposed permutation test is purely nonparametric and it does not exploit any parametric structure that one may be willing to impose. It is therefore conceivable that in certain semiparametric settings, more efficient tests may be designed to exploit a priori model restrictions. Put differently, the aforementioned versatility of the permutation test may come with an efficiency cost. A better understanding about the robustness-efficiency tradeoff might be an interesting topic for future research.

In an empirical illustration, we apply the permutation test to a recent sample of high-frequency intraday returns of the SPY ETF for the S&P 500 index. Specifically, we focus on four important FOMC announcements during the ongoing COVID-19 pandemic, and test whether each announcement induces discontinuities in volatility, trading activity, and two measures of market illiquidity. We document robust empirical evidence for discontinuities in volatility and trading activity. We also find evidence for announcement-induced discontinuity in transaction cost (measured by bid-ask spread), but not in market impact (gauged by Amihud’s measure). This application highlights one of the main advantages of the proposed test, namely, it is applicable for a broad variety of high-frequency observations modeled in distinct ways, which is unlike, for example, the conventional t-test designed specifically for testing volatility jumps in the Brownian setting.

The rest of the paper is organized as follows. We present the asymptotic theory for the permutation test in Section 2. Section 3 reports the test’s finite-sample performance in Monte Carlo experiments, and Section 4 presents the empirical illustration. Section 5 concludes. The appendix contains all proofs.

Notation. We use x\left\|x\right\| to denote the Euclidean norm of a vector xx. For any real number aa, we use a\lceil a\rceil to denote the smallest integer that is larger than aa. For any constant p1p\geq 1, p\left\|\,\cdot\,\right\|_{p} denotes the LpL_{p} norm for random variables. For two real sequences ana_{n} and bnb_{n}, we write anbna_{n}\asymp b_{n} if an/CbnCana_{n}/C\leq b_{n}\leq Ca_{n} for some finite constant C1C\geq 1.

2 Theory

2.1 A general result for the asymptotic validity of permutation tests

We first prove a new result that is broadly useful for establishing the asymptotic validity of permutation tests. Because of its independent theoretical interest, we develop the theory under high-level conditions. In Section 2.2, below, we shall specialize this general result in event-study applications under a more specific infill time-series setting, for which the existing theory on permutation tests is not applicable.

Consider an array (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} of \mathbb{R}-valued observed variables defined on a probability space (Ω,,)\left(\Omega,\mathcal{F},\mathbb{P}\right), which may be either “raw” data or preliminary estimators. Our econometric goal is to decide whether two subsamples (Yn,i)i1,n(Y_{n,i})_{i\in\mathcal{I}_{1,n}} and (Yn,i)i2,n\left(Y_{n,i}\right)_{i\in\mathcal{I}_{2,n}} have “significantly” different distributions, where (1,n,2,n)(\mathcal{I}_{1,n},\mathcal{I}_{2,n}) is a partition of n\mathcal{I}_{n}\subseteq\mathbb{Z}. For ease of exposition, we assume that 1,n\mathcal{I}_{1,n} and 2,n\mathcal{I}_{2,n} contain the same number of observations, denoted by knk_{n}.444 All of our results can be easily extended to the case when 1,n\mathcal{I}_{1,n} and 2,n\mathcal{I}_{2,n} have different sizes, but with the same order of magnitude. We stress from the outset that knk_{n} may either be fixed or grow to infinity in the subsequent analysis. As such, our analysis speaks to not only the classical finite-sample analysis of permutation tests, but also the large-sample analysis routinely used in econometrics.

To implement the test, we first estimate the empirical cumulative distribution functions (CDF) for the two subsamples using

F^j,n(x)1knij,n1{Yn,ix},j{1,2}.\widehat{F}_{j,n}\left(x\right)\equiv\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{j,n}}1{\left\{Y_{n,i}\leq x\right\}},\quad j\in\{1,2\}.

We then measure their difference via the Cramér–von Mises statistic, given by

T^n12knin(F^1,n(Yn,i)F^2,n(Yn,i))2.\widehat{T}_{n}\equiv\frac{1}{2k_{n}}\sum_{i\in\mathcal{I}_{n}}\left(\widehat{F}_{1,n}\left(Y_{n,i}\right)-\widehat{F}_{2,n}\left(Y_{n,i}\right)\right)^{2}. (1)

For a significance level α(0,1)\alpha\in\left(0,1\right), we compute the critical value via a standard permutation algorithm as in Lehmann and Romano (2005, page 633), which we specify in Algorithm 1 below. We use π\pi to denote a permutation of the elements of n\mathcal{I}_{n}, that is, a bijective mapping from n\mathcal{I}_{n} to itself. Let GnG_{n} denote the collection of all possible permutations of n\mathcal{I}_{n}, with MnM_{n} being its cardinality.

Algorithm 1. Step 1. For each permutation πGn\pi\in G_{n}, compute the permuted test statistic T^n(π)\widehat{T}_{n}(\pi) as T^n\widehat{T}_{n}, but with (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} replaced by (Yn,π(i))in(Y_{n,\pi\left(i\right)})_{i\in\mathcal{I}_{n}}.

Step 2. Order {T^n(π):πGn}\{\widehat{T}_{n}(\pi):\pi\in G_{n}\} as T^n(1)T^n(Mn)\widehat{T}_{n}^{(1)}\leq\cdots\leq\widehat{T}_{n}^{(M_{n})}. Set T^n=T^n(k)\widehat{T}_{n}^{\ast}=\widehat{T}_{n}^{(k)} for k=Mn(1α)k=\left\lceil M_{n}(1-\alpha)\right\rceil.

Step 3. If T^n>T^n\widehat{T}_{n}>\widehat{T}_{n}^{\ast}, reject the null hypothesis. If T^n<T^n\widehat{T}_{n}<\widehat{T}_{n}^{\ast}, do not reject the null hypothesis. If T^n=T^n\widehat{T}_{n}=\widehat{T}_{n}^{\ast}, reject the null hypothesis with probability p^n(MnαM^n+)/M^n0\hat{p}_{n}\equiv(M_{n}\alpha-\widehat{M}_{n}^{+})/\widehat{M}_{n}^{0}, where M^n+\widehat{M}_{n}^{+} and M^n0\widehat{M}_{n}^{0} are the cardinalities of {j:T^n(j)>T^n}\{j:\widehat{T}_{n}^{(j)}>\widehat{T}_{n}^{\ast}\} and {j:T^n(j)=T^n}\{j:\widehat{T}_{n}^{(j)}=\widehat{T}_{n}^{\ast}\}, respectively. The resulting test then rejects according to the critical function ϕ^n1{T^n>T^n}+p^n1{T^n=T^n}\hat{\phi}_{n}\equiv 1\{\widehat{T}_{n}>\widehat{T}_{n}^{\ast}\}+\hat{p}_{n}1\{\widehat{T}_{n}=\widehat{T}_{n}^{\ast}\}. \Box

Remark 2.1.

The test specified in Algorithm 1 is a randomized test and has a random outcome when T^n=T^n\widehat{T}_{n}=\widehat{T}_{n}^{\ast} (i.e., it rejects the null hypothesis with probability p^n\hat{p}_{n} independent of the data); see Section 3.1 in Lehmann and Romano (2005) for details about randomized tests. One can construct a non-randomized (and more conservative) version by replacing p^n\hat{p}_{n} with zero. Also, in practice, MnM_{n} may be too large to consider GnG_{n} in its entirety. In such cases, we could replace Gn{G}_{n} with a random subset of it, denoted by G^n\widehat{G}_{n}, and composed of the identity permutation and an i.i.d. sample of permutations in Gn{G}_{n}. All of the formal results in this paper apply if we use G^n\widehat{G}_{n} instead of Gn{G}_{n} in Algorithm 1.   

Remark 2.2.

In this paper, we use the Cramér-von Mises statistic in (1) for concreteness and simplicity of exposition. However, the supplement of this paper shows that our main results extend well beyond this particular statistic. In particular, the asymptotic validity of the permutation test extends to any other rank statistic, i.e., a statistic that only depends on the rank of the observations. Furthermore, our consistency result applies to any other rank statistic under mild regularity conditions. For example, both of these results hold if we replace (1) with the Kolmogorov-Smirnov statistic, given by

T^nmaxin|F^1,n(Yn,i)F^2,n(Yn,i)|.\widehat{T}_{n}\equiv\max_{i\in\mathcal{I}_{n}}\left|\widehat{F}_{1,n}\left(Y_{n,i}\right)-\widehat{F}_{2,n}\left(Y_{n,i}\right)\right|.

For details, see the supplement of this paper.   

If the data (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d., then the null hypothesis of the classical two-sample problem holds, and Lehmann and Romano (2005, Theorem 15.2.1) implies that the aforementioned permutation test has exact size control in finite samples. This is a remarkable property of the permutation test, as it holds without requiring any specific distributional assumptions on the data. In contrast to the classical two-sample problem, however, we shall not assume that the data are independent, or even “weakly” dependent (e.g., mixing). As mentioned in the Introduction, the main goal of this paper is to study the permutation test for time-series data observed within a short event window (say, a few days or hours), which can be serially highly dependent in practice. Our key theoretical insight is that the permutation test is still asymptotically valid if the data (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} can be approximated, or “coupled,” by another collection of variables that are conditionally independent, as formalized by the following assumption.

Assumption 2.1.

There exists a collection of variables (Un,i)in\left(U_{n,i}\right)_{i\in\mathcal{I}_{n}} such that the following conditions hold for a sequence (𝒢n)n1(\mathcal{G}_{n})_{n\geq 1} of σ\sigma-fields: (i) for each n1n\geq 1, the variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} are 𝒢n\mathcal{G}_{n}-conditionally independent, and Un,iU_{n,i} has the same 𝒢n\mathcal{G}_{n}-conditional distribution as Un,jU_{n,j} if i,ji,j belong to the same subsample (i.e., 1,n\mathcal{I}_{1,n} or 2,n\mathcal{I}_{2,n}); (ii) for any real sequence ηn=o(1)\eta_{n}=o(1), we have supx(|Un,ix|ηn|𝒢n)=Op(ηn)\sup_{x\in\mathbb{R}}\mathbb{P}\left(\left|U_{n,i}-x\right|\leq\eta_{n}|\mathcal{G}_{n}\right)=O_{p}\left(\eta_{n}\right); (iii) maxin|Y~n,iUn,i|=op(kn2)\max_{i\in\mathcal{I}_{n}}|\widetilde{Y}_{n,i}-U_{n,i}|=o_{p}(k_{n}^{-2}), where (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}} is an identical copy of (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} in 𝒢n\mathcal{G}_{n}-conditional distribution.

Assumption 2.1 lays out the high-level structure for bridging our analysis with the classical theory on permutation tests, which we carry out in Theorem 2.1 below. Condition (i) sets up the “coupling” problem, which corresponds to a conditional version of the classical two-sample problem, treating the (Un,i)i1,n(U_{n,i})_{i\in\mathcal{I}_{1,n}} and (Un,i)i2,n(U_{n,i})_{i\in\mathcal{I}_{2,n}} variables as “data”. In part (a) of Theorem 2.1, we consider the situation in which both subsamples have the same conditional distribution. In this case, our coupling variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} give rise to an infeasible permutation test that can be analyzed as a classical two-sample problem. In particular, this infeasible permutation test attains the exact finite-sample size under our conditions.

This infeasible test, however, only plays an auxiliary role in our analysis, because our interest is on the feasible test ϕ^n\hat{\phi}_{n} formed using the original (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} data. Therefore, a key component of our theoretical argument in Theorem 2.1 is to show that the feasible test for the original data inherits asymptotically the same rejection properties from the infeasible test. Conditions (ii) and (iii) in Assumption 2.1 are introduced for this purpose. Specifically, condition (ii) requires the variable Un,iU_{n,i} to be non-degenerate, in the sense that its conditional probability mass within any small [xη,x+η]\left[x-\eta,x+\eta\right] interval is of order O(η)O\left(\eta\right) in probability.555Condition (ii) is satisfied if Un,iU_{n,i} has a conditional probability density that is uniformly bounded in probability. Condition (iii) specifies the requisite approximation accuracy of the coupling variables. We note that this condition is easier to hold when knk_{n} is smaller, because the joint coupling requirement would involve a smaller number of variables and the op(kn2)o_{p}(k_{n}^{-2}) error bound is easier to attain. This condition can be verified under more primitive conditions pertaining to the smoothness of underlying processes and an upper bound on the growth rate of knk_{n}, as detailed in Section 2.2.666In our applications, we can often verify condition (iii) with Y~n,i=Yn,i\widetilde{Y}_{n,i}=Y_{n,i}. Nonetheless, allowing Y~n,iYn,i\widetilde{Y}_{n,i}\neq Y_{n,i} is useful when Yn,iY_{n,i} is itself an estimator. For example, if (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} is a finite collection of estimators that converge jointly in distribution, then the coupling required in Assumption 2.1(iii) can be obtained via Skorokhod representation. As such, the theory developed in Canay and Kamat (2017) can be cast in our framework with 𝒢n\mathcal{G}_{n} being the trivial information set and knk_{n} being a fixed constant, although our anti-concentration condition in Assumption 2.1(ii) is slightly stronger than the continuity condition in Canay and Kamat (2017, Assumption 4.2).

Theorem 2.1.

Under Assumption 2.1, the following statements hold for the permutation test ϕ^n\hat{\phi}_{n} described in Algorithm 1:

(a) If the variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} have the same 𝒢n\mathcal{G}_{n}-conditional distribution, we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\rightarrow\alpha.

(b) Let Qj,n()Q_{j,n}\left(\cdot\right) denote the 𝒢n\mathcal{G}_{n}-conditional distribution function of Un,iU_{n,i} for ij,ni\in\mathcal{I}_{j,n} and j{1,2}j\in\left\{1,2\right\}, and Q¯n=(Q1,n+Q2,n)/2\overline{Q}_{n}=(Q_{1,n}+Q_{2,n})/2. If knk_{n}\rightarrow\infty and ((Q1,n(x)Q2,n(x))2𝑑Q¯n(x)>δn)1\mathbb{P}(\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right)>\delta_{n})\rightarrow 1 for any real sequence δn=o(1)\delta_{n}=o(1), we have 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\rightarrow 1.

Theorem 2.1 characterizes the asymptotic rejection probabilities of the feasible test ϕ^n\hat{\phi}_{n} under the null and alternative hypotheses of the two-sample problem for the coupling variables. Part (a) pertains to the situation in which the two subsamples of coupling variables, (Un,i)i1,n(U_{n,i})_{i\in\mathcal{I}_{1,n}} and (Un,i)i2n(U_{n,i})_{i\in\mathcal{I}_{2n}}, have the same conditional distribution, which corresponds to the null hypothesis. In this case, the theorem shows that the asymptotic rejection probability of the feasible test is equal to the nominal level α\alpha. It is relevant to note that this result holds whether knk_{n} is fixed or divergent. This property is clearly reminiscent of the permutation test’s finite-sample exactness in the classical setting.

Part (b) of Theorem 2.1 concerns the power of the feasible test ϕ^n\hat{\phi}_{n}. It shows that the feasible test rejects with probability approaching one when the conditional distributions of the two coupling subsamples, Q1,nQ_{1,n} and Q2,nQ_{2,n}, are different, in the sense that their “distance” measured by (Q1,n(x)Q2,n(x))2𝑑Q¯n(x)\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right) is asymptotically non-degenerate, where the mixture distribution Q¯n\overline{Q}_{n} captures approximately the distribution of the permuted data.777When the two subsamples have different sizes, the same result goes through with Q¯n\overline{Q}_{n} defined as the sample-size weighted average between Q1,nQ_{1,n} and Q2,nQ_{2,n}, given by Q¯n=(Q1,n|1,n|+Q2,n|2,n|)/(|1,n|+|2,n|)\overline{Q}_{n}=(Q_{1,n}|\mathcal{I}_{1,n}|+Q_{2,n}|\mathcal{I}_{2,n}|)/(|\mathcal{I}_{1,n}|+|\mathcal{I}_{2,n}|). This consistency-type result requires that the information available from each subsample grows with the sample size, i.e., knk_{n}\rightarrow\infty. This result appears to be new in the context of permutation-based tests under a fixed alternative for the coupling variables. In particular, we note that an analogous result is unavailable in Canay and Kamat (2017), as they restrict attention to an asymptotic framework with a fixed knk_{n}, which makes a consistency-type result unavailable. Our proof relies on applying Lehmann and Romano (2005, Theorem 15.2.3) to the infeasible test, for which we use the coupling construction developed by Chung and Romano (2013) to show that the so-called Hoeffding (1952) condition is satisfied. We note that this argument is used to establish the consistency of the permutation test rather than its asymptotic size property.

Theorem 2.1 establishes the relation between the rejection probability of the feasible test ϕ^n\hat{\phi}_{n} and the homogeneity (or the lack of it) across the two coupling subsamples (Un,i)i1,n(U_{n,i})_{i\in\mathcal{I}_{1,n}} and (Un,i)i2,n(U_{n,i})_{i\in\mathcal{I}_{2,n}}. This result does not speak directly to hypotheses formulated in terms of the original (Yn,i)in\left(Y_{n,i}\right)_{i\in\mathcal{I}_{n}} observations. Rather, its theoretical significance is to “absorb” all generic technicalities stemming from the (feasible) permutation test, which in turn considerably simplifies our overall analysis. The residual issue for any specific application is to explicitly construct the coupling variables and translate their homogeneity in terms of the primitive structures of the original empirical problem, which can be done using domain-specific techniques. We provide general results along this line in the infill time-series context, as detailed in Section 2.2 below.

To help anticipate the general discussion, it is instructive to sketch the scheme in a basic running example. Let Yn,i=Δn1/2(P(i+1)ΔnPiΔn)Y_{n,i}=\Delta_{n}^{-1/2}(P_{\left(i+1\right)\Delta_{n}}-P_{i\Delta_{n}}) be the scaled increment of the asset price process PtP_{t} over the iith sampling interval. Let τ\tau be a “cutoff” time point of interest that is known to the researcher (e.g., the announcement time of a news release) and ii^{\ast} be the unique integer such that τ[iΔn,(i+1)Δn)\tau\in[i^{\ast}\Delta_{n},(i^{\ast}+1)\Delta_{n}).888The integer ii^{\ast} depends on nn. We suppress this dependence in our notation for simplicity and to avoid having nested subscripts. We consider two index sets 1,n={ikn,,i1}\mathcal{I}_{1,n}=\{i^{\ast}-k_{n},\ldots,i^{\ast}-1\} and 2,n={i+1,,i+kn}\mathcal{I}_{2,n}=\{i^{\ast}+1,\ldots,i^{\ast}+k_{n}\}, which collect observations before and after the cutoff, respectively. We consider an asymptotic setting in which these subsamples are “local” in calendar time, that is, knΔn0k_{n}\Delta_{n}\rightarrow 0. Note that this implies that Δn0\Delta_{n}\rightarrow 0, which means that we are considering an infill asymptotic setting. If PtP_{t} is an Itô process with respect to an information filtration (t)t0(\mathcal{F}_{t})_{t\geq 0}, we may represent Yn,iY_{n,i} as

Yn,i=Δn1/2iΔn(i+1)Δnbs𝑑s+Δn1/2iΔn(i+1)Δnσs𝑑Ws,forin,Y_{n,i}=\Delta_{n}^{-1/2}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}b_{s}ds+\Delta_{n}^{-1/2}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}\sigma_{s}dW_{s},\quad\text{for}\quad i\in\mathcal{I}_{n}, (2)

where btb_{t} is the drift process, σt\sigma_{t} is the stochastic volatility process, and WtW_{t} is a standard Brownian motion.999Note that n\mathcal{I}_{n} does not include the ii^{\ast}th return observation. Therefore, although the returns in (2) do not contain price jumps, an event-induced price jump is allowed to occur at time τ\tau. If the σt\sigma_{t} process is smooth (e.g., Hölder continuous) in a local neighborhood before τ\tau, then the volatility throughout the pre-event subsample 1,n\mathcal{I}_{1,n} is approximately σ(ikn)Δn\sigma_{(i^{\ast}-k_{n})\Delta_{n}}. Further recognizing that the drift term is negligible relative to the Brownian component, we can approximate Yn,iY_{n,i} for each i1,ni\in\mathcal{I}_{1,n} using the coupling variables

Un,i=σ(ikn)ΔnΔn1/2(W(i+1)ΔnWiΔn)𝒩(0,σ(ikn)Δn2),U_{n,i}=\sigma_{(i^{\ast}-k_{n})\Delta_{n}}\Delta_{n}^{-1/2}(W_{\left(i+1\right)\Delta_{n}}-W_{i\Delta_{n}})\sim\mathcal{MN}\left(0,\sigma_{(i^{\ast}-k_{n})\Delta_{n}}^{2}\right), (3)

where 𝒩\mathcal{MN} denotes the mixed normal distribution. Since the Brownian motion has independent and stationary increments, it is easy to see that the coupling variables (Un,i)i1,n(U_{n,i})_{i\in\mathcal{I}_{1,n}} are (ikn)Δn\mathcal{F}_{(i^{\ast}-k_{n})\Delta_{n}}-conditionally i.i.d. Moreover, if the volatility process σt\sigma_{t} does not jump at the cutoff time τ\tau, we may follow the same logic to extend the approximation in (3) further to i2,ni\in\mathcal{I}_{2,n}. In other words, if the volatility process process does not jump then the coupling variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} are conditionally i.i.d., which corresponds to the situation in part (a) of Theorem 2.1. On the other hand, if the volatility process jumps at time τ\tau, say by a constant c0c\neq 0, then the coupling variables for the 2,n\mathcal{I}_{2,n} subsample will instead take the form Un,i=(σ(ikn)Δn+c)(W(i+1)ΔnWiΔn)U_{n,i}=\left(\sigma_{(i^{\ast}-k_{n})\Delta_{n}}+c\right)(W_{\left(i+1\right)\Delta_{n}}-W_{i\Delta_{n}}). In this case, the two subsamples of Un,iU_{n,i}’s have distinct conditional distributions (i.e., mixed normal with different conditional variances), corresponding to the scenario in part (b) of Theorem 2.1.

Within the context of this illustrative example, we can further clarify a key feature of the proposed test that holds more generally. It is not aimed at detecting “small” time-variations in the distribution of the observed data. In fact, by allowing the drift btb_{t} and the volatility σt\sigma_{t} to be time-varying, a smooth form of heterogeneity is always built in. The test instead detects abrupt changes, or discontinuities, in the evolution of the distribution, which can be more plausibly associated with the “lumpy” information carried by the underlying economic announcement, as emphasized by Nakamura and Steinsson (2018b). Specifically in this example, the asset returns are locally centered Gaussian (due to the assumption that the price is an Itô process), and hence, the temporal discontinuity in the return distribution manifests itself as a volatility jump. The empirical scope of our permutation test, however, is far beyond volatility-jump testing depicted in this illustration, as we shall demonstrate in the remainder of the paper.

2.2 Permutation tests for discontinuities in event studies

We now specialize the general Theorem 2.1 into an infill asymptotic time-series setting that is particularly suitable for event studies. By introducing a mild additional econometric structure, we shall establish the asymptotic validity of the permutation test under more primitive conditions that are easy to verify in a variety of concrete empirical settings. As in the running example above, we consider an event occurring at time τ[iΔn,(i+1)Δn)\tau\in[i^{\ast}\Delta_{n},(i^{\ast}+1)\Delta_{n}), which separates two subsamples indexed by 1,n={ikn,,i1}\mathcal{I}_{1,n}=\{i^{\ast}-k_{n},\ldots,i^{\ast}-1\} and 2,n={i+1,,i+kn}\mathcal{I}_{2,n}=\{i^{\ast}+1,\ldots,i^{\ast}+k_{n}\}, respectively. All limits in the sequel are obtained under the infill asymptotic setting with Δn0\Delta_{n}\rightarrow 0.

We suppose that the data are generated from an approximate state-space model of the form

Yn,i=g(ζiΔn,ϵn,i)+Rn,i,in,Y_{n,i}=g\left(\zeta_{i\Delta_{n}},\epsilon_{n,i}\right)+R_{n,i},\quad i\in\mathcal{I}_{n}, (4)

where the state process ζt\zeta_{t} is càdlàg, adapted to a filtration t\mathcal{F}_{t}, and takes values in an open set 𝒵dim(ζ)\mathcal{Z}\subseteq\mathbb{R}^{\dim(\zeta)}; (ϵn,i)in(\epsilon_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d. random disturbances taking values in some (possibly abstract) space \mathcal{E}; g(,)g\left(\cdot,\cdot\right) is a “smooth” transform; and Rn,iR_{n,i} is a residual term that is negligible relative to the leading term g(ζiΔn,ϵn,i)g\left(\zeta_{i\Delta_{n}},\epsilon_{n,i}\right) in a proper sense detailed below. A simpler version of this state-space model without the Rn,iR_{n,i} residual term has been used by Li and Xiu (2016) and Bollerslev et al. (2018), among others, for modeling market variables such as trading volume and bid-ask spread. By introducing the Rn,iR_{n,i} residual term, we can use a unified framework to accommodate a broader class of models, which in particular include increments of an Itô semimartingale. We now revisit the model in (2) as the first illustration.


Example 1 (Brownian Asset Returns). We represent the Itô-process model (2) for asset returns in the form of (4) by setting ζt=σt\zeta_{t}=\sigma_{t}, ϵn,i=Δn1/2(W(i+1)ΔnWiΔn)\epsilon_{n,i}=\Delta_{n}^{-1/2}(W_{\left(i+1\right)\Delta_{n}}-W_{i\Delta_{n}}), and g(z,ϵ)=zϵg\left(z,\epsilon\right)=z\epsilon. The resulting residual term has the form

Rn,i=Δn1/2iΔn(i+1)Δnbs𝑑s+Δn1/2iΔn(i+1)Δn(σsσiΔn)𝑑Ws,in.R_{n,i}=\Delta_{n}^{-1/2}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}b_{s}ds+\Delta_{n}^{-1/2}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}(\sigma_{s}-\sigma_{i\Delta_{n}})dW_{s},\quad i\in\mathcal{I}_{n}. (5)

Under mild and fairly standard regularity conditions, it is easy to show that maxin|Rn,i|\max_{i\in\mathcal{I}_{n}}|R_{n,i}| is op(1)o_{p}(1). On the other hand, the leading term g(ζiΔn,ϵn,i)g\left(\zeta_{i\Delta_{n}},\epsilon_{n,i}\right) has a non-degenerate centered mixed Gaussian distribution with conditional variance σiΔn2\sigma_{i\Delta_{n}}^{2}. \square


This running example further illustrates the distinct roles played by ζt\zeta_{t}, ϵn,i\epsilon_{n,i}, and Rn,iR_{n,i} in our state-space model (4). The leading term g(ζiΔn,ϵn,i)g\left(\zeta_{i\Delta_{n}},\epsilon_{n,i}\right) captures the “main feature” of the observed data; in addition, since the ϵn,i\epsilon_{n,i} disturbance terms are i.i.d., any “large” change in the empirical distribution across the two subsamples must be attributed to the time-τ\tau discontinuity in the state process ζt\zeta_{t}. From this description, it follows that the hypothesis test for the continuity of the distribution of the main feature of the observed data can be formulated as

H0:Δζτ=0versusHa:Δζτ0,H_{0}:\Delta\zeta_{\tau}=0\quad\text{versus}\quad H_{a}:\Delta\zeta_{\tau}\neq 0, (6)

where Δζτζτζτζτlimsτζs\Delta\zeta_{\tau}\equiv\zeta_{\tau}-\zeta_{\tau-}\equiv\zeta_{\tau}-\lim_{s\uparrow\tau}\zeta_{s} denotes the jump of the state process at time τ\tau.

With the state-space model (4) in place, we can design more primitive sufficient conditions for establishing the asymptotic validity of the permutation test under the hypotheses in (6). We need some additional notation to describe these conditions. For each fixed z𝒵z\in\mathcal{Z}, let fz()f_{z}\left(\cdot\right) and Fz()F_{z}\left(\cdot\right) denote the probability density function (PDF) and the CDF of the random variable g(z,εn,i)g\left(z,\varepsilon_{n,i}\right), respectively. It is also convenient to introduce a “shifted” version of ζt\zeta_{t} defined as ζ~tζtΔζτ1{tτ}\tilde{\zeta}_{t}\equiv\zeta_{t}-\Delta\zeta_{\tau}1\{t\geq\tau\}, which has the same increments as ζt\zeta_{t} over time intervals not containing τ\tau.

Assumption 2.2.

(i) The collection of variables (ϵn,i)in(\epsilon_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d. and, for each knk\in\mathcal{I}_{n}, the variables (ϵn,i)ik(\epsilon_{n,i})_{i\geq k} are independent of kΔn\mathcal{F}_{k\Delta_{n}}. Moreover, for any compact subset 𝒦𝒵\mathcal{K}\subseteq\mathcal{Z}, we have (ii) supx,z𝒦fz(x)<\sup_{x\in\mathbb{R},z\in\mathcal{K}}f_{z}(x)<\infty; and (iii) infz𝒦(Fz(x)Fz+c(x))2𝑑Fz(x)>0\inf_{z\in\mathcal{K}}\int_{\mathbb{R}}\left(F_{z}(x)-F_{z+c}(x)\right)^{2}dF_{z}(x)>0 whenever c0c\neq 0.

Assumption 2.3.

There exist a sequence (Tm)m1\left(T_{m}\right)_{m\geq 1} of stopping times increasing to infinity, a sequence of compact subsets (𝒦m)m1(\mathcal{K}_{m})_{m\geq 1} of 𝒵\mathcal{Z}, and a sequence (Km)m1\left(K_{m}\right)_{m\geq 1} of constants such that for some real sequence an1a_{n}\geq 1 and each m1m\geq 1: (i) g(z,ϵn,i)g(z,ϵn,i)2Kmanzz\left\|g\left(z,\epsilon_{n,i}\right)-g(z^{\prime},\epsilon_{n,i})\right\|_{2}\leq K_{m}a_{n}\left\|z-z^{\prime}\right\| for all z,z𝒦mz,z^{\prime}\in\mathcal{K}_{m}; (ii) ζt\zeta_{t} takes values in 𝒦m\mathcal{K}_{m} for all tTmt\leq T_{m}, and ζ~tTmζ~sTm2Km|ts|1/2\|\tilde{\zeta}_{t\wedge T_{m}}-\tilde{\zeta}_{s\wedge T_{m}}\|_{2}\leq K_{m}\left|t-s\right|^{1/2} for all t,st,s in some fixed neighborhood of τ\tau; (iii) maxin|Rn,i|=op(kn2)\max_{i\in\mathcal{I}_{n}}\left|R_{n,i}\right|=o_{p}(k_{n}^{-2}).

Assumption 2.2 entails regularity conditions pertaining to the random disturbance terms, which are often easy to verify in concrete examples as demonstrated later in this subsection. Assumption 2.3 imposes a set of smoothness conditions that permits the approximation of the observed data using properly constructed coupling variables.101010Note that the assumption is framed in a localized fashion using the stopping times (Tm)m1(T_{m})_{m\geq 1}, which is a standard technique for weakening the regularity condition in the infill asymptotic setting. See Jacod and Protter (2012, Section 4.4.1) for a comprehensive discussion on the localization technique. Specifically, condition (i) requires that the random function zg(z,ϵn,i)z\mapsto g\left(z,\epsilon_{n,i}\right) is Lipschitz in zz over compact sets under the L2L_{2} distance. The ana_{n} sequence captures the scale of the Lipschitz coefficient. In many applications, we can verify this condition simply with an1a_{n}\equiv 1, but allowing ana_{n} to diverge to infinity is sometimes necessary (see Example 2 below). Condition (ii) states that the ζt\zeta_{t} process is locally compact (up to each stopping time TmT_{m}) and, upon removing the fixed-time discontinuity at τ\tau, it is (1/2)(1/2)-Hölder continuous under the L2L_{2} norm. This Hölder-continuity requirement can be easily verified using well-known results provided that the ζ~\tilde{\zeta} process is an Itô semimartingle or a long-memory process (see Jacod and Protter (2012, Chapter 2) and Li and Liu (2020)). Condition (iii) imposes the requisite assumptions on the residual terms. In some applications, this condition holds trivially with Rn,i0R_{n,i}\equiv 0, but, more generally, it needs to be verified on a case-by-case basis using (relatively standard) infill asymptotic techniques. Theorem 2.2, below, establishes the size and power properties of the permutation test under the hypotheses described in (6).

Theorem 2.2.

In the state-space model (4), suppose that Assumptions 2.2 and 2.3 hold, and that ankn3Δn1/2=o(1)a_{n}k_{n}^{3}\Delta_{n}^{1/2}=o(1). Then, the following statements hold for the permutation test ϕ^n\hat{\phi}_{n} described in Algorithm 1:

(a) Under the null hypothesis in (6), i.e., Δζτ=0\Delta\zeta_{\tau}=0, we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\rightarrow\alpha;

(b) Under a fixed alternative hypothesis in (6), i.e., Δζτ=c\Delta\zeta_{\tau}=c for some (unknown) constant c0c\neq 0 and if knk_{n}\rightarrow\infty, we have 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\rightarrow 1.

This theorem is proved by verifying the high-level conditions in Theorem 2.1 with properly constructed coupling variables analogous to those in equation (3). The condition ankn3Δn1/2=o(1)a_{n}k_{n}^{3}\Delta_{n}^{1/2}=o(1) mainly requires that the window size knk_{n} does not grow too fast, which ensures the closeness between the coupling variables and the original data. In the typical case with an=1a_{n}=1, it reduces to kn=o(Δn1/6)k_{n}=o(\Delta_{n}^{-1/6}).111111This sufficient condition for the growth rate of knk_{n} is different from the conditions needed for conventional asymptotic-Gaussian-based spot inference, which requires knΔnιk_{n}\asymp\Delta_{n}^{-\iota} for some ι(0,1/2)\iota\in(0,1/2). For the permutation test, knk_{n} may be fixed or grow to infinity. However, in the latter case, our condition on the growth rate of knk_{n} is more stringent than what is needed for the conventional spot inference theory. In general, a larger knk_{n} allows one to utilize more data, but the associated longer event window may also lead to a larger nonparametric bias, and hence, a more severe size distortion. Part (a) shows that the permutation test attains the desired asymptotic level under the null hypothesis in (6). Again, we stress that the test has valid asymptotic size control even in the “small-sample” case with fixed knk_{n}. As in Theorem 2.1, the “large-sample” condition knk_{n}\rightarrow\infty is needed for establishing the consistency of the test under the alternative, as shown in part (b).

In the remainder of this subsection, we use a few prototype examples to demonstrate how the proposed test may be used in various empirical settings. In particular, we show how to cast the specific problems into the approximate state-space model (4), and discuss how to verify our sufficient regularity conditions. We start by revisiting the running example.


Example 1 (Brownian Asset Returns, Continued). Recall that ϵn,iΔn1/2(W(i+1)ΔnWiΔn)\epsilon_{n,i}\equiv\Delta_{n}^{-1/2}(W_{\left(i+1\right)\Delta_{n}}-W_{i\Delta_{n}}), ζt=σt\zeta_{t}=\sigma_{t}, and g(z,ϵ)=zϵg\left(z,\epsilon\right)=z\epsilon. In this context, the hypothesis testing problem in (6) represents a test of the continuity of the volatility process σt\sigma_{t} at time t=τt=\tau, i.e.,

H0:Δστ=0versusHa:Δστ0.H_{0}:\Delta\sigma_{\tau}=0\quad\text{versus}\quad H_{a}:\Delta\sigma_{\tau}\neq 0.

We suppose that the volatility process σt\sigma_{t} is non-degenerate by setting its domain to 𝒵=(0,)\mathcal{Z}=\left(0,\infty\right). Since the Brownian motion has independent increments with respect to the underlying filtration, the disturbance term ϵn,i\epsilon_{n,i} satisfies Assumption 2.2(i). In addition, for each point z𝒵z\in\mathcal{Z}, the random variable f(z,ϵn,i)f\left(z,\epsilon_{n,i}\right) has an 𝒩(0,z2)\mathcal{N}\left(0,z^{2}\right) distribution. It is then easy to see that conditions (ii) and (iii) in Assumption 2.2 hold for any compact subset 𝒦𝒵\mathcal{K}\subseteq\mathcal{Z} (note that 𝒦\mathcal{K} is necessarily bounded away from zero). To verify Assumption 2.3, first note that g(z,ϵn,i)g(z,ϵn,i)=(zz)ϵn,ig\left(z,\epsilon_{n,i}\right)-g(z^{\prime},\epsilon_{n,i})=(z-z^{\prime})\epsilon_{n,i}, and hence, g(z,ϵn,i)g(z,ϵn,i)2=|zz|\left\|g\left(z,\epsilon_{n,i}\right)-g(z^{\prime},\epsilon_{n,i})\right\|_{2}=|z-z^{\prime}|. Assumption 2.3(i) thus holds for an=1a_{n}=1. It is well known that σt\sigma_{t} is locally (1/2)(1/2)-Hölder continuous under the L2L_{2} norm if it is an Itô semimartingale or a long-memory process; if so, Assumption 2.3(ii) is satisfied if the σt\sigma_{t} and σt1\sigma_{t}^{-1} processes are both locally bounded. Finally, to verify Assumption 2.3(iii), we assume that the drift process btb_{t} is locally bounded. It is then easy to show via routine calculations that maxin|Rn,i|=Op(kn1/2Δn1/2)\max_{i\in\mathcal{I}_{n}}|R_{n,i}|=O_{p}(k_{n}^{1/2}\Delta_{n}^{1/2}). Since the condition ankn3Δn1/2=o(1)a_{n}k_{n}^{3}\Delta_{n}^{1/2}=o(1) in Theorem 2.2 implies that Op(kn1/2Δn1/2)=op(kn2)O_{p}(k_{n}^{1/2}\Delta_{n}^{1/2})=o_{p}(k_{n}^{-2}), we have maxin|Rn,i|=op(kn2)\max_{i\in\mathcal{I}_{n}}|R_{n,i}|=o_{p}(k_{n}^{-2}) as needed in Assumption 2.3(iii). All conditions in Theorem 2.2 are now verified, and this shows that the permutation test ϕ^n\hat{\phi}_{n} is asymptotically valid for testing the null hypothesis Δστ=0\Delta\sigma_{\tau}=0. \square


Example 1 shows that the permutation test ϕ^n\hat{\phi}_{n} is asymptotically valid for testing the presence of a volatility jump. This is a relatively familiar problem in the literature. It is therefore useful to contrast the proposed permutation test with the standard approach, which is based on nonparametric “spot” estimators of the asset price’s instantaneous variances before and after the event time given by, respectively,

σ^τ2=1kni1,nYn,i2,σ^τ2=1kni2,nYn,i2.\hat{\sigma}_{\tau-}^{2}=\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{1,n}}Y_{n,i}^{2},\quad\hat{\sigma}_{\tau}^{2}=\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{2,n}}Y_{n,i}^{2}. (7)

Assuming knk_{n}\rightarrow\infty and kn2Δn0k_{n}^{2}\Delta_{n}\rightarrow 0, it can be shown that (see Jacod and Protter (2012, Chapter 13))

kn1/2(σ^τ2σ^τ2(στ2στ2))2σ^τ4+2σ^τ4𝑑𝒩(0,1).\frac{k_{n}^{1/2}\left(\hat{\sigma}_{\tau}^{2}-\hat{\sigma}_{\tau-}^{2}-(\sigma_{\tau}^{2}-\sigma_{\tau-}^{2})\right)}{\sqrt{2\hat{\sigma}_{\tau}^{4}+2\hat{\sigma}_{\tau-}^{4}}}\overset{d}{\longrightarrow}\mathcal{N}\left(0,1\right). (8)

Thus, we can test H0:Δστ=0H_{0}:\Delta\sigma_{\tau}=0 by comparing the t-statistic kn1/2(σ^τ2σ^τ2)/2σ^τ4+2σ^τ4k_{n}^{1/2}\left(\hat{\sigma}_{\tau}^{2}-\hat{\sigma}_{\tau-}^{2}\right)/\sqrt{2\hat{\sigma}_{\tau}^{4}+2\hat{\sigma}_{\tau-}^{4}} with critical values based on the standard normal distribution.

Two remarks are in order. First, note that the asymptotic size control of the standard approach relies on the asymptotic normal approximation (8), which depends crucially on knk_{n}\rightarrow\infty (in addition to having Δn0\Delta_{n}\to 0) because the underlying central limit theorem is obtained by aggregating a “large” number of martingale differences. Hence, the t-test may suffer from severe size distortion when knk_{n} is relatively small. This issue is empirically relevant because an applied researcher may use a short time window to capture short-lived “impulse-like” dynamics and/or to minimize the impact of other confounding economic factors in the background. Moreover, for “real-time” applications, the researcher may have no choice but to use a small knk_{n} simply because of the limited amount of available data soon after the event time τ\tau. In sharp contrast, the permutation test controls asymptotic size even when knk_{n} is fixed. This remarkable property is inherited from the coupling two-sample problem, in which the permutation test controls size exactly regardless of whether knk_{n} is fixed or grows to infinity.

The second and perhaps practically more important difference between the two tests is that the permutation test is more versatile. Under the spot-estimation-based approach, both the design of the spot estimators in (7) and the convergence in (8) depend heavily on the fact that the increments of the Brownian motion are not only i.i.d., but also Gaussian. Gaussianity is obviously essential for the conventional approach because, among other things, it ensures that the instantaneous variance of the normalized returns are well-defined.121212Recall that many distributions used in continuous-time models do not have finite second moments. For example, within the class of stable distributions, the Gaussian distribution is the only one with a finite second moment. Moreover, Gaussianity also implies that the variance of Δn1(WiΔnW(i1)Δn)2\Delta_{n}^{-1}(W_{i\Delta_{n}}-W_{(i-1)\Delta_{n}})^{2} is 2, which explains the “2” factor in the denominator of the t-statistic. The permutation test, on the other hand, only exploits the i.i.d. property of the Brownian shocks, without relying on their Gaussianity. Therefore, the permutation test readily accommodates a more general model for asset returns with Lévy shocks, as we demonstrate in the following example.


Example 2 (Lévy-driven Asset Returns). We generalize the model in Example 1 by replacing the Brownian motion WW with a Lévy martingale LL, so that the asset return has the form

P(i+1)ΔnPiΔn=iΔn(i+1)Δnbs𝑑s+iΔn(i+1)Δnσs𝑑Ls,in.P_{\left(i+1\right)\Delta_{n}}-P_{i\Delta_{n}}=\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}b_{s}ds+\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}\sigma_{s}dL_{s},\quad i\in\mathcal{I}_{n}.

In this case, we define the random disturbance as ϵn,iΔn1/β(L(i+1)ΔnLiΔn)\epsilon_{n,i}\equiv\Delta_{n}^{-1/\beta}(L_{\left(i+1\right)\Delta_{n}}-L_{i\Delta_{n}}) for some constant β(1,2]\beta\in(1,2]. The more general normalizing sequence Δn1/β\Delta_{n}^{-1/\beta} is used to ensure that ϵn,i\epsilon_{n,i} has a non-degenerate distribution. For instance, if LL is a stable process, we take β\beta to be its jump-activity index, so that ϵn,i\epsilon_{n,i} has a centered stable distribution (recall that the Brownian motion is a stable process with index β=2\beta=2). We treat the value of β\beta as unknown. Since the permutation test is scale-invariant with respect to the data, we can nonetheless regard the normalized return Yn,i=Δn1/β(P(i+1)ΔnPiΔn)Y_{n,i}=\Delta_{n}^{-1/\beta}(P_{\left(i+1\right)\Delta_{n}}-P_{i\Delta_{n}}) as directly observable (because tests implemented for P(i+1)ΔnPiΔnP_{\left(i+1\right)\Delta_{n}}-P_{i\Delta_{n}} and Yn,iY_{n,i} are identical). To apply our theory, we represent Yn,iY_{n,i} using the state-space model (4) with ζt=σt\zeta_{t}=\sigma_{t}, g(z,ϵ)=zϵg\left(z,\epsilon\right)=z\epsilon, and the residual term given by

Rn,i=Δn1/βiΔn(i+1)Δnbs𝑑s+Δn1/βiΔn(i+1)Δn(σsσiΔn)𝑑Ls,in.R_{n,i}=\Delta_{n}^{-1/\beta}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}b_{s}ds+\Delta_{n}^{-1/\beta}\int_{i\Delta_{n}}^{\left(i+1\right)\Delta_{n}}\left(\sigma_{s}-\sigma_{i\Delta_{n}}\right)dL_{s},\quad i\in\mathcal{I}_{n}.

Recognizing that the scaled Lévy increments (ϵn,i)in(\epsilon_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d., we can verify Assumptions 2.2 and 2.3 using similar arguments as in Example 1 but with an=Δn1/21/βa_{n}=\Delta_{n}^{1/2-1/\beta}, which depicts the rate at which ϵn,i2\|\epsilon_{n,i}\|_{2} diverges. In particular, the condition ankn3Δn1/2=o(1)a_{n}k_{n}^{3}\Delta_{n}^{1/2}=o(1) requires knk_{n} to obey kn=o(Δn(1/β1)/3)k_{n}=o(\Delta_{n}^{(1/\beta-1)/3}). Then, we can apply Theorem 2.2 to show that the permutation test ϕ^n\hat{\phi}_{n} is asymptotically valid for testing the discontinuity in the volatility process σt\sigma_{t} at time τ\tau, regardless of whether the driving Lévy process is a Brownian motion or not. To our knowledge, our test is the first in the literature that accommodates Lévy-type shocks in the context of testing for volatility jumps. \square


So far, we have illustrated the use of the permutation test for high-frequency asset returns data. Under the settings of Examples 1 and 2, the distributional change of asset returns is mainly driven by the time-τ\tau discontinuity in volatility, and hence, the permutation test is effectively a test for volatility jumps. Example 2, in particular, highlights the versatility and robustness of the permutation test compared with the conventional approach based on spot estimation. Going one step further, we now illustrate how to apply the permutation test to other types of economic variables.


Example 3 (Location-Scale Model for Volume). Consider a simple model for trading volume, under which the volume within the iith sampling interval is given by Yn,i=μiΔn+viΔnϵn,iY_{n,i}=\mu_{i\Delta_{n}}+v_{i\Delta_{n}}\epsilon_{n,i}. The μt\mu_{t} location process captures the local mean, or trading intensity, and the vtv_{t} scale process captures the time-varying heterogeneity in the order size. This location-scale model fits directly into the state-space model (4) with ζt=(μt,vt)\zeta_{t}=(\mu_{t},v_{t}), g((μ,v),ϵ)=μ+vϵg\left((\mu,v),\epsilon\right)=\mu+v\epsilon, and Rn,i0R_{n,i}\equiv 0. Let t\mathcal{F}_{t} be the filtration generated by the ζt\zeta_{t} process. If ϵn,i\epsilon_{n,i} is independent of the ζt\zeta_{t} process and has finite second moment and bounded PDF, then it is easy to verify Assumptions 2.2 and 2.3 with an=1a_{n}=1. Theorem 2.2 thus implies that the permutation test is valid for testing the discontinuity in ζt=(μt,vt)\zeta_{t}=\left(\mu_{t},v_{t}\right) at time τ\tau. \square


The location-scale structure in Example 3 is by no means essential in applications, because the permutation test is valid provided that the more general conditions in Assumptions 2.2 and 2.3 hold. This illustration is pedagogically convenient, in that it permits a straightforward verification of our high-level conditions. That being said, this example does reveal a limitation of our theory developed so far. That is, the data variable needs to be continuously distributed, as required in Assumption 2.2(ii) (which in turn is related to Assumption 2.1(ii)). Observed data in actual applications are invariably discrete, but this continuous-distribution assumption is often deemed as a reasonable approximation to reality. In some situations, however, the discreteness in the data is more salient. For example, the trading volume of a relatively illiquid asset may take values as small integer multiples of the lot size (e.g., 100 shares).131313This issue has become less important in the equity market as retail investors can now trade a single share, or even a fractional share, of a stock. However, the lot size is still relevant for less liquid assets such as option contracts or for equity data from earlier sample periods. This motivates us to directly confront the discreteness in the data, as detailed in the next subsection.

2.3 Extension: the case with discretely valued data

The extension will be carried out in similar steps as the theory developed above. We start with modifying the general result in Theorem 2.1 to accommodate discretely valued observations; we then specialize the general theory to a high-frequency setting under more primitive conditions. Recall that Qj,n()Q_{j,n}\left(\cdot\right) denotes the 𝒢n\mathcal{G}_{n}-conditional distribution function of the coupling variable Un,iU_{n,i} for ij,ni\in\mathcal{I}_{j,n} and j{1,2}j\in\left\{1,2\right\}, and Q¯n=(Q1,n+Q2,n)/2\overline{Q}_{n}=(Q_{1,n}+Q_{2,n})/2.

Theorem 2.3.

Suppose that there exists a collection of variables (Un,i)in\left(U_{n,i}\right)_{i\in\mathcal{I}_{n}} that satisfies Assumption 2.1(i) for some sequence (𝒢n)n1(\mathcal{G}_{n})_{n\geq 1} of σ\sigma-fields, and (Y~n,iUn,i)=o(kn1)\mathbb{P}(\widetilde{Y}_{n,i}\neq U_{n,i})=o\left(k_{n}^{-1}\right) uniformly in ini\in\mathcal{I}_{n} where (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}} is an identical copy of (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} in 𝒢n\mathcal{G}_{n}-conditional distribution. Then, the following statements hold for the test ϕ^n\hat{\phi}_{n} described in Algorithm 1:

(a) If the variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} have the same 𝒢n\mathcal{G}_{n}-conditional distribution, we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\rightarrow\alpha.

(b) If knk_{n}\rightarrow\infty and ((Q1,n(x)Q2,n(x))2𝑑Q¯n(x)>δn)1\mathbb{P(}\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right)>\delta_{n})\rightarrow 1 for any real sequence δn=o(1)\delta_{n}=o(1), we have 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\rightarrow 1.

Theorem 2.3 establishes exactly the same asymptotic properties for the permutation test as Theorem 2.1, but under different conditions: it does not impose the anti-concentration requirement for the coupling variable (i.e., Assumption 2.1(ii)), and the “distance” between the observed data and the coupling variable is measured by the probability mass of {Y~n,iUn,i}\{\widetilde{Y}_{n,i}\neq U_{n,i}\}. These modifications seem natural for the discrete-data setting.

Next, we specialize the general result in Theorem 2.3 to the state-space model (4), starting with some motivating examples. The first is an alternative model for the trading volume that explicitly features discretely valued data, which shows an interesting contrast to Example 3.


Example 4 (Poisson Model for Volume). Let Yn,iY_{n,i} be the trading volume of an asset within the iith sampling interval. Following Andersen (1996), we model the discretely valued volume using a Poisson distribution with time-varying mean. To form a state-space representation, let (ϵn,i(t))t0(\epsilon_{n,i}(t))_{t\geq 0} be a copy of the standard Poisson process on +\mathbb{R}_{+}, independent across ii, and let ζt\zeta_{t} be the time-varying mean process independent of the ϵn,i\epsilon_{n,i}’s. We then set Yn,i=ϵn,i(ζiΔn)Y_{n,i}=\epsilon_{n,i}\left(\zeta_{i\Delta_{n}}\right), which, conditional on the ζ\zeta process, is Poisson distributed with mean ζiΔn\zeta_{i\Delta_{n}}. This representation is a special case of (4), with g(ζ,ϵ)=ϵ(ζ)g(\zeta,\epsilon)=\epsilon(\zeta) being a time-change and Rn,i=0R_{n,i}=0. We also note that although the ϵn,i\epsilon_{n,i}’s are assumed to be i.i.d., the (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} series can be highly persistent through its dependence on the stochastic mean process ζt\zeta_{t}. \square


To further broaden the empirical scope, we consider another example concerning the bid-ask spread of asset quotes. This example is econometrically interesting because of its resemblance to the discrete-choice models (e.g., probit and logit) commonly used for modeling binary and multinomial data.


Example 5 (Bid-Ask Spread). Let Yn,iY_{n,i} be the bid-ask spread of an asset at time iΔni\Delta_{n}. For a liquid asset, the spread is often maintained at 1 tick (e.g., 1 cent), but it may widen to several ticks due to a higher level of asymmetric information or dealer’s inventory cost. For ease of exposition, we suppose that Yn,iY_{n,i} is a binary variable taking values in {1,2}\{1,2\}, while noting that a multinomial extension is straightforward. Motivated by the classical discrete-choice models, we model the spread as Yn,i=1+1{ζiΔnϵn,i}Y_{n,i}=1+1\left\{\zeta_{i\Delta_{n}}\geq\epsilon_{n,i}\right\}, and suppose that the variables (ϵn,i)in(\epsilon_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d. and independent of the ζt\zeta_{t} process. With the CDF of ϵn,i\epsilon_{n,i} denoted by Fϵ()F_{\epsilon}(\cdot), we have (Yn,i=2|ζiΔn)=Fϵ(ζiΔn)\mathbb{P}\left(Y_{n,i}=2|\zeta_{i\Delta_{n}}\right)=F_{\epsilon}(\zeta_{i\Delta_{n}}). Evidently, upon redefining ζt\zeta_{t} as Fϵ(ζt)F_{\epsilon}\left(\zeta_{t}\right), we can assume that ϵn,i\epsilon_{n,i} is uniformly distributed on the [0,1][0,1] interval without loss of generality. This normalization in turn allows us to interpret ζt\zeta_{t} as the stochastic propensity of a “wide” spread, which may serve as a measure of market illiquidity. \square


We now proceed to establish the asymptotic validity of the permutation test for the hypotheses described in (6) for discretely valued observations; see Theorem 2.4 below. Since the state-space representation (4) holds with the residual term Rn,i=0R_{n,i}=0 in the examples above, it seems reasonable to avoid unnecessary redundancy by restricting our analysis to a simpler version given by

Yn,i=g(ζiΔn,ϵn,i),in.Y_{n,i}=g\left(\zeta_{i\Delta_{n}},\epsilon_{n,i}\right),\quad i\in\mathcal{I}_{n}. (9)

We replace Assumption 2.3 with the following assumption, where we recall that for each z𝒵z\in\mathcal{Z}, Fz()F_{z}\left(\cdot\right) denotes the CDF of the random variable g(z,εn,i)g\left(z,\varepsilon_{n,i}\right) and ζ~t=ζtΔζτ1{tτ}\tilde{\zeta}_{t}=\zeta_{t}-\Delta\zeta_{\tau}1\{t\geq\tau\}.

Assumption 2.4.

There exist a sequence (Tm)m1\left(T_{m}\right)_{m\geq 1} of stopping times increasing to infinity, a sequence of compact subsets (𝒦m)m1(\mathcal{K}_{m})_{m\geq 1} of 𝒵\mathcal{Z}, and a sequence (Km)m1\left(K_{m}\right)_{m\geq 1} of constants such that for each m1m\geq 1: (i) (g(z,ϵn,i)g(z,ϵn,i))Kmzz\mathbb{P}\left(g\left(z,\epsilon_{n,i}\right)\neq g(z^{\prime},\epsilon_{n,i})\right)\leq K_{m}\left\|z-z^{\prime}\right\| for all z,z𝒦mz,z^{\prime}\in\mathcal{K}_{m}; (ii) ζt\zeta_{t} takes values in 𝒦m\mathcal{K}_{m} for all tTmt\leq T_{m}, and ζ~tTmζ~sTm2Km|ts|1/2\|\tilde{\zeta}_{t\wedge T_{m}}-\tilde{\zeta}_{s\wedge T_{m}}\|_{2}\leq K_{m}\left|t-s\right|^{1/2} for all t,st,s in some fixed neighborhood of τ\tau.

Theorem 2.4.

In the state-space model (9), suppose that Assumptions 2.2(i), 2.2(iii) and 2.4 hold, and that kn3Δn=o(1)k_{n}^{3}\Delta_{n}=o(1). Then, the following statements hold for the permutation test ϕ^n\hat{\phi}_{n} described in Algorithm 1:

(a) Under the null hypothesis in (6), i.e., Δζτ=0\Delta\zeta_{\tau}=0, we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\rightarrow\alpha;

(b) Under a fixed alternative hypothesis in (6), i.e., Δζτ=c\Delta\zeta_{\tau}=c for some (unknown) constant c0c\neq 0, and if knk_{n}\rightarrow\infty, we have that 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\rightarrow 1.

Theorem 2.4 depicts the same asymptotic behavior of the permutation test as in Theorem 2.2. The sufficient conditions of these results differ mainly in how to gauge the closeness between the data and the coupling variable, as manifest in the difference between Assumption 2.3(i) and Assumption 2.4(i). The latter is easy to verify under more primitive conditions in concrete settings. Specifically, in Example 4, we note that |g(z,ϵn,i)g(z,ϵn,i)||g(z,\epsilon_{n,i})-g(z^{\prime},\epsilon_{n,i})| is a Poisson random variable with mean |zz|\left|z-z^{\prime}\right|, and hence, (g(z,ϵn,i)g(z,ϵn,i))=1exp(|zz|)|zz|\mathbb{P}(g(z,\epsilon_{n,i})\neq g(z^{\prime},\epsilon_{n,i}))=1-\exp(-\left|z-z^{\prime}\right|)\leq\left|z-z^{\prime}\right| as desired. In Example 5, we can use ϵn,iUniform[0,1]\epsilon_{n,i}\sim\text{Uniform}\left[0,1\right] to deduce that

(g(z,ϵn,i)g(z,ϵn,i))=(1{zϵn,i}1{zϵn,i})=|zz|,\mathbb{P}\left(g\left(z,\epsilon_{n,i}\right)\neq g\left(z^{\prime},\epsilon_{n,i}\right)\right)=\mathbb{P}\left(1\{z\geq\epsilon_{n,i}\}\neq 1\{z^{\prime}\geq\epsilon_{n,i}\}\right)=\left|z-z^{\prime}\right|,

which, again, verifies Assumption 2.4(i). Therefore, in the context of Examples 4 and 5 above, the permutation test is asymptotically valid for detecting discontinuities in trading activity and illiquidity, respectively.

3 Monte Carlo simulations

3.1 Setting

Our Monte Carlo experiment is based on the setting of Example 2. We simulate the (log) price process according to dPt=σtdLtdP_{t}=\sigma_{t}dL_{t} under an Euler scheme on a 1-second mesh, and then resample the data at the Δn=1\Delta_{n}=1 minute frequency. We simulate LL either as a standard Brownian motion or as a (centered symmetric) stable process with index β=1.5\beta=1.5. To avoid unrealistic price path, we truncate the stable distribution so that its normalized increment Δn1/β(LiΔnL(i1)Δn)\Delta_{n}^{-1/\beta}\left(L_{i\Delta_{n}}-L_{(i-1)\Delta_{n}}\right) is supported on [C,C]\left[-C,C\right], and we consider C{10,20,30}C\in\{10,20,30\} to examine the effect of the support. The unit of time is one day.

To simulate the volatility process, we first simulate two volatility factors according to the following dynamics (see Bollerslev and Todorov (2011)):

dV1,t\displaystyle dV_{1,t} =\displaystyle= 0.0116(0.5V1,t)dt+0.1023V1,t(ρdLt+1ρ2dB1,t)+c1{t=τ},\displaystyle 0.0116(0.5-V_{1,t})dt+0.1023\sqrt{V_{1,t}}\left(\rho dL_{t}+\sqrt{1-\rho^{2}}dB_{1,t}\right)+c\cdot 1_{\left\{t=\tau\right\}},
dV2,t\displaystyle dV_{2,t} =\displaystyle= 0.6930(0.5V2,t)dt+0.7909V2,t(ρdLt+1ρ2dB2,t)+c1{t=τ},\displaystyle 0.6930(0.5-V_{2,t})dt+0.7909\sqrt{V_{2,t}}\left(\rho dL_{t}+\sqrt{1-\rho^{2}}dB_{2,t}\right)+c\cdot 1_{\left\{t=\tau\right\}},

where B1,tB_{1,t} and B2,tB_{2,t} are independent standard Brownian motions that are also independent of LtL_{t}, ρ=0.7\rho=-0.7 captures the negative correlation between price and volatility shocks (namely the “leverage” effect). The V1V_{1} volatility factor is highly persistent with a half-life of 2.5 months, while the V2V_{2} volatility factor is quickly mean-reverting with a half-life of only one day. The constant cc determines the size of the volatility jump at the event time τ\tau. In particular, c=0c=0 corresponds to the null hypothesis, and we consider a range of cc values in (0,5](0,5] in order to trace out a power curve for the corresponding alternative hypotheses. The range of the cc parameter is calibrated according to Bollerslev et al.’s (2018) empirical estimates for FOMC announcements.141414Specifically, Bollerslev et al. (2018) estimate the average jump size of log(σt)\log(\sigma_{t}) for the S&P 500 ETF around FOMC announcements to be 1.037 (see Table 3 of that paper). This suggests that στ2/στ2=\sigma_{\tau}^{2}/\sigma_{\tau-}^{2}= (exp(1.037))28(\exp(1.037))^{2}\approx 8 on average, corresponding to c3.5c\approx 3.5 in this Monte Carlo design.

We note that the two volatility factors, V1V_{1} and V2V_{2}, capture the slow- and fast-mean-reverting volatility dynamics, respectively, with the former having “smoother” sample paths than the latter. With this in mind, we simulate σt\sigma_{t} using two models:

{Model A:σt2=2V1,t,Model B:σt2=V1,t+V2,t.\left\{\begin{array}[]{ll}\text{Model A:}&\sigma_{t}^{2}=2V_{1,t},\\ \text{Model B:}&\sigma_{t}^{2}=V_{1,t}+V_{2,t}.\end{array}\right. (10)

In finite samples, Model A features relatively smooth volatility paths, which is close to the “ideal” scenario underlying the infill asymptotic theory. Meanwhile, Model B generates more realistic, and rougher, sample path for σ\sigma, providing a nontrivial challenge for the proposed inference theory.

We implement the permutation test at the 5% significance level, with the window size kn{15,30,60,90}k_{n}\in\{15,30,60,90\}. The six-fold increase from the smallest window size to the largest one represents a considerable range that allows us to explore the robustness of the proposed test with respect to the knk_{n} tuning parameter.151515We also implemented simulations in which the two subsamples, 1,n\mathcal{I}_{1,n} and 2,n\mathcal{I}_{2,n}, have different sample sizes k1,n,k2,n{15,30,60,90}k_{1,n},k_{2,n}\in\{15,30,60,90\}. As anticipated in footnote 4, these results are quantitatively similar to those with a common sample size, i.e., k1,n=k2,nk_{1,n}=k_{2,n}. These additional results are omitted for brevity but are available upon request. The critical value is computed as in Remark 2.1 based on 1,000 i.i.d. permutations. For comparison, we also implement the standard (two-sided) t-test based on (8). Rejection frequencies are computed based on 2,000 Monte Carlo trials.

3.2 Results

Table 1: Rejection Rates under the Null Hypothesis
Permutation Test T-test
(1) (2) (3) (4) (1) (2) (3) (4)
Model A: One-factor Volatility
kn=15k_{n}=15 0.050 0.058 0.053 0.062 0.011 0.017 0.021 0.032
kn=30k_{n}=30 0.054 0.049 0.056 0.057 0.031 0.042 0.044 0.063
kn=60k_{n}=60 0.047 0.048 0.053 0.059 0.046 0.053 0.055 0.077
kn=90k_{n}=90 0.058 0.050 0.053 0.056 0.048 0.056 0.074 0.091
Model B: Two-factor Volatility
kn=15k_{n}=15 0.049 0.055 0.054 0.062 0.014 0.017 0.023 0.037
kn=30k_{n}=30 0.055 0.050 0.055 0.056 0.037 0.041 0.051 0.073
kn=60k_{n}=60 0.049 0.049 0.054 0.064 0.091 0.077 0.078 0.102
kn=90k_{n}=90 0.064 0.052 0.058 0.059 0.136 0.101 0.117 0.147
Note: This table presents rejection frequencies of the permutation test and the t-test under the null hypothesis στ2=στ2\sigma^{2}_{\tau-}=\sigma^{2}_{\tau}. The significance level is fixed at 5%. Column (1) corresponds to the case with LL being a standard Brownian motion, and columns (2)–(4) correspond to cases in which LL is truncated stable with index 1.51.5 and truncation parameter C{10,20,30}C\in\{10,20,30\}. The rejection frequencies are computed based on 2,000 Monte Carlo trials.

We first examine the size properties of the permutation test ϕ^n\hat{\phi}_{n} and the t-test based on (8). Table 1 reports the rejection frequencies of these tests under the null hypothesis (i.e., c=0c=0) for various data generating processes. Column (1) corresponds to the case with LL being a standard Brownian motion, and columns (2), (3), and (4) report results when LL is a truncated stable process with the truncation parameter C=10C=10, 2020, and 3030, respectively.

The top panel of the table shows results from Model A, where the volatility is solely driven by the “slow” factor. Quite remarkably, the rejection frequencies of the permutation test are very close to the 5% nominal level for all specifications of LL and, importantly, for a wide range of the window size knk_{n}. In contrast, the rejection rates of the t-test appear to be far more sensitive to the choice of knk_{n}. As we increase knk_{n} from 15 to 90, the rejection rate increases from 1.1% to 4.8% when LL is a Brownian motion. A similar pattern emerges when LL is a truncated stable process, except that the rejection rates now exceed the nominal level and reach 7.4% and 9.1% in columns (3) and (4), respectively. It is relevant to note that the t-test is not formally justified when LL is not a Brownian motion.

The more challenging case is Model B with the two-factor volatility dynamics. Looking at the bottom panel of Table 1, we find that the permutation test still has rejection rates that are quite close to the nominal level, although we see a slight over-rejection of 6.4% when kn=90k_{n}=90. This is likely due to the fact that the approximation error in the coupling has nontrivial impact when the window size is large. That being said, we note that the benchmark t-test is more severely affected by this bias issue, with rejection rates reaching 9.1% and 13.6% when kn=60k_{n}=60 and kn=90k_{n}=90, respectively.

Figure 1 plots the power curves of the permutation test and the t-test for various knk_{n}’s in Model A and Model B. We note that the four specifications of LL produce qualitatively similar results. We see that the rejection frequencies increase with the window size knk_{n} and the jump size cc, which is expected from our consistency result obtained under knk_{n}\to\infty. The permutation test appears to be less powerful than the t-test under the alternative hypothesis. This is a natural consequence of the typical trade-off between efficiency and robustness to the assumptions. The t-test is based on the spot variance estimator, which is “locally” the maximum-likelihood estimator of the spot variance under the Brownian shocks. The asymptotic validity and efficiency of the t-test rely on the Brownian assumption. In contrast, the permutation test is asymptotically valid regardless of whether the shocks are Brownian or not. As expected, this robustness is costly in terms of power. On the flip side, the power advantage of the t-test comes at the cost of size distortion when shocks are non-Brownian, which can be large, as shown in Table 1.

Refer to caption
Figure 1: The figure plots the rejection frequencies of the permutation test and the t-test. The significance level is fixed at 5% (highlighted by shade). Results for Model A and Model B are presented in the top and bottom rows, respectively. In all the plots, LL is simulated as a standard Brownian motion. The power curves are computed for the jump size parameter c{0,0.5,1,,5}c\in\{0,0.5,1,\ldots,5\}. The rejection frequencies are computed based on 2,000 Monte Carlo trials.

Overall, we find that the permutation test controls size remarkably well under the null hypothesis. Although it appears to be less powerful than the t-test, it does not suffer from the latter’s size distortion which can be severe in the two-factor volatility model. Our results suggest that, given its robustness, the permutation test is a useful complement to the conventional test based on spot estimation and asymptotic Gaussian approximation.

4 An empirical illustration

We apply the proposed permutation test to a recent sample of high-frequency price and volume observations of the S&P 500 ETF (NYSE: SPY) as an empirical illustration. The sampling frequency is one minute; the data source is the TAQ database. With the permutation test, we are interested in testing distributional discontinuities of the ETF’s return, trading volume, and two measures of illiquidity for several FOMC announcements during the COVID-19 pandemic. This setting highlights one of the key merits of the proposed test, namely, it is applicable for a broad variety of high-frequency observations modeled in distinct ways. This is in sharp contrast to the conventional t-test based on (8), which is designed specifically for testing volatility jumps and whose validity relies on the assumption of Brownian shocks. We construct the high-frequency volume series as the total number of shares within each one-minute trading session. The illiquidity measures of interest include Amihud’s measure defined as the ratio between absolute return and dollar trading volume (Amihud (2002)) and the bid-ask spread averaged within each one-minute trading session.

We focus on four important FOMC announcements in the 2020–2021 sample period that are related to distinct aspects of the Federal Reserve’s monetary policy during the COVID-19 pandemic. The first is the announcement made on March 3, 2020, which was also the first FOMC announcement after COVID-19 hit the United States. The Fed stated its decision to lower the federal funds rate by 1/2 percentage point as its first response to counter the pandemic’s negative impact on the economy. The second event occurred on December 16, 2020. At that time, being concerned with the rising long-term yield, many market participants expected that the Fed might implement the so-called “operation twist” to tame the steepening of the yield curve. But this turned out not to be on the Fed’s agenda, and so, the Fed’s “inaction” may be deemed as a shock relative to the market’s anticipation. The third case pertains to the announcement on March 17, 2021. During the press conference, the Fed Chairman suggested that the central bank would be unlikely to raise the rate in the next 2–3 years, which may be regarded as a forward guidance on the target rate. The final example is the announcement on September 22, 2021, when the Fed officially declared its intention to taper the large-scale asset purchase program.

Figure 2 plots the asset return and trading volume of SPY over one-hour windows centered at these announcement times. For ease of comparison, we plot the return and volume data for the four events on the same scale. These plots immediately reveal the highly distinct market conditions at those times. This highlights the usefulness of adopting a high-frequency event-study research design, which allows us to investigate each event separately, rather than pooling information across different announcements under a likely fragile homogeneity assumption. We also observe several interesting patterns regarding how the market responds to the “lumpy” information embedded in the announcements. We generally see a rise in trading activity after the announcement. The price also tends to fluctuate more in the post-announcement window, although the March 3, 2020 event may be an exception as the market was already quite volatile even before the announcement.

Refer to caption
Figure 2: This figure plots the price return (solid) and volume (shaded) of the SPY ETF in one-hour windows around four FOMC announcements.

As mentioned above, we implement the permutation test described in Algorithm 1 on the ETF’s returns, trading volume, Amihud’s measure, and bid-ask spread constructed on the one-minute sampling frequency. For ease of interpretation, we consider the non-randomized version of the test described in Remark 2.1. We consider two event windows: kn=10k_{n}=10 minutes or 3030 minutes. Recall that each FOMC announcement contains two stages. The first is an immediate release of a short summary on the Federal Reserve’s webpage, which will be further detailed in the Fed Chairman’s opening statement during the first (roughly) 10 minutes of the press conference. The second part is a Q&A session in which the Chairman responds to questions from the media, with the first few generally being more important. Given this setup, the shorter kn=10k_{n}=10 window allows us to focus on the immediate impact of the FOMC statement, whereas the longer kn=30k_{n}=30 window further covers the “more subtle” policy information conveyed to the public during the Q&A session. It is worth noting that the 30-minute event window is also adopted in prior work on the high-frequency identification of monetary policy shocks; see Nakamura and Steinsson (2018a).

Table 2: Test Results for Selected FOMC Announcements
kn=10k_{n}=10 kn=30k_{n}=30
Date (τ\tau) Return Volume Amihud Spread Return Volume Amihud Spread
03/03/2020 *** * *** *** ***
12/16/2020 ** *** ** *** ** *** ***
03/17/2021 * *** *** ** *** ***
09/22/2021 ** *** *** *** *** ***
Note: The permutation test is implemented following Algorithm 1. For ease of interpretation, we consider the non-randomized version as described in Remark 2.1 using 100,000 i.i.d. permutations. Rejections at the 10%, 5%, and 1% significance levels are indicated by *, **, and ***, respectively.

Table 2 reports the results of the permutation test. Recall that the permutation test applied to asset returns may be interpreted as a test for volatility jumps, as explained in Example 1 (with Brownian shocks) or Example 2 (with more general Lévy shocks).161616Prior work on volatility jumps (see, e.g., Bollerslev et al. (2018) and Li et al. (2021)) focuses exclusively on the case in which volatility is the scaling factor of Brownian shocks. Our permutation test is related to the prior work, but is valid under a more general notion of volatility with respect to non-Brownian shocks. At a significance level of 10%, the permutation test rejects the null of continuity for all events except the one on March 3, 2020. The latter non-rejection is consistent with our previous observation that the volatility of SPY was high even before the announcement. Meanwhile, the test also strongly rejects the null hypothesis of distributional continuity for the volume series, echoing the burst of trading activity seen in Figure 2.

The permutation test applied to the two illiquidity measures generates mixed results. We find some moderate evidence for the distributional discontinuity of Amihud’s measure shortly after the March 3 and December 16 announcements in 2020. For the case with a 30-minute window, we do not reject the null of continuity for any of the announcements. The overall evidence suggests that the FOMC announcements under study did not lead to an abrupt change in the market impact coefficient gauged by Amihud’s measure. Needless to say, this finding per se does not imply that the liquidity condition is unchanged after the announcement, as the notion of liquidity is a multifaceted concept. Indeed, we see that the permutation test applied to the bid-ask spread always strongly rejects the null of continuity. The post-announcement spread tends to be larger than its pre-announcement level, suggesting that it is significantly more costly to trade during the post-announcement trading session.

All in all, the empirical illustration above demonstrates how the proposed permutation test may be used to test for distributional discontinuities for a variety of market variables. This type of versatility is not easily attained by existing methods in the high-frequency econometrics literature. We also see that interesting empirical findings may be obtained even with a small number of observations, which confirms the practical relevance of allowing the knk_{n} window to be possibly fixed in our asymptotic theory for the permutation test.

5 Concluding remarks

In this paper, we propose using a permutation test to detect discontinuities in an economic model at a cutoff point. Relative to the existing literature, we show that the permutation test is well suited for event studies based on time-series data. While nonparametric t-tests have been widely used for this purpose in various empirical contexts, the permutation test proposed in this paper provides a distinct alternative. Instead of relying on asymptotic (mixed) Gaussianity from central limit theorems, we exploit finite-sample properties of the permutation test in the approximating, or “coupling”, two-sample problem.

We demonstrate that our new theory is broadly useful in a wide range of problems in the infill asymptotic time-series setting, which justifies using the permutation test to detect jumps in economic variables such as volatility, trading activity, and liquidity. Compared with the conventional nonparametric t-test, the proposed permutation test has several distinct features. First, the permutation test provides asymptotic size control regardless of whether the sizes of the local subsamples are fixed or growing to infinity. In the latter case, we also establish that the permutation test is consistent. Second, the permutation test is versatile, as it can be applied without modification to many different contexts and under relatively weak conditions.

Appendix: Proofs

Throughout the proofs, we use KK to denote a positive constant that may change from line to line, and write KpK_{p} to emphasize its dependence on some parameter pp. For any event EE\in\mathcal{F}, we identify it with the associated indicator random variable.

Proof of Theorem 2.1.

Step 1. Define ϕn\phi_{n} in the same way as ϕ^n\hat{\phi}_{n} but with (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} replaced by (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}}. In this step, we show that

𝔼[ϕ^n]=𝔼[ϕn]+o(1).\mathbb{E}[\hat{\phi}_{n}]=\mathbb{E}[\phi_{n}]+o(1). (11)

Let ϕ~n\tilde{\phi}_{n} be defined in the same way as ϕ^n\hat{\phi}_{n}, but with (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} replaced by (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}}, as defined in Assumption 2.1(iii). Since (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}} and (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} have the same (conditional) distribution,

𝔼[ϕ~n]=𝔼[ϕ^n].\mathbb{E}[\tilde{\phi}_{n}]=\mathbb{E}[\hat{\phi}_{n}]. (12)

Let EnE_{n}\in\mathcal{F} be the event where the ordered values of (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} and (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}} correspond to the same permutation of n\mathcal{I}_{n}. Since the test statistic is only a function of the rank of the observations, we have ϕ~n=ϕn\tilde{\phi}_{n}=\phi_{n} in restriction to EnE_{n}. Hence,

|𝔼[ϕ~n]𝔼[ϕn]|=|𝔼[ϕ~nEnc]𝔼[ϕnEnc]|(Enc).|\mathbb{E}[\tilde{\phi}_{n}]-\mathbb{E}[\phi_{n}]|=|\mathbb{E}[\tilde{\phi}_{n}E_{n}^{c}]-\mathbb{E}[\phi_{n}E_{n}^{c}]|\leq\mathbb{P}(E_{n}^{c}). (13)

By (12) and (13), (11) follows from (Enc)=o(1)\mathbb{P}(E_{n}^{c})=o(1), which will be proved below.

Let An,i,j{Un,jUn,i0,Y~n,jY~n,i<0}A_{n,i,j}\equiv\{U_{n,j}-U_{n,i}\geq 0,\widetilde{Y}_{n,j}-\widetilde{Y}_{n,i}<0\} for every (i,j)n×n(i,j)\in\mathcal{I}_{n}\times\mathcal{I}_{n}, and note that Enci,jAn,i,jE_{n}^{c}\subseteq\cup_{i,j}A_{n,i,j}. Recall the elementary fact that if a sequence of random variables Xn=op(1)X_{n}=o_{p}(1), then there exists a real sequence δn=o(1)\delta_{n}=o(1) such that (|Xn|δn)1\mathbb{P}(|X_{n}|\leq\delta_{n})\to 1. Under Assumption 2.1(iii), by applying this result to Xn=2maxin|Y~n,iUn,i|kn2X_{n}=2\max_{i\in\mathcal{I}_{n}}|\widetilde{Y}_{n,i}-U_{n,i}|k_{n}^{2}, we can find a sequence δn=o(1)\delta_{n}=o(1) such that

(maxin|Y~n,iUn,i|δnkn2/2)1.\mathbb{P}\left(\max_{i\in\mathcal{I}_{n}}|\widetilde{Y}_{n,i}-U_{n,i}|\leq\delta_{n}k_{n}^{-2}/2\right)\rightarrow 1. (14)

We then observe that

An,i,j\displaystyle A_{n,i,j} {Un,jUn,iδnkn2,Y~n,jY~n,i<0}{0Un,jUn,i<δnkn2}\displaystyle~{}\subseteq~{}\{U_{n,j}-U_{n,i}\geq\delta_{n}k_{n}^{-2},\widetilde{Y}_{n,j}-\widetilde{Y}_{n,i}<0\}\cup\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\}
{|Y~n,jY~n,i(Un,jUn,i)|>δnkn2}{0Un,jUn,i<δnkn2}\displaystyle~{}\subseteq~{}\{|\widetilde{Y}_{n,j}-\widetilde{Y}_{n,i}-(U_{n,j}-U_{n,i})|>\delta_{n}k_{n}^{-2}\}\cup\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\}
{maxin|Y~n,iUn,i|>δnkn2/2}{0Un,jUn,i<δnkn2}.\displaystyle~{}\subseteq~{}\{\max_{i\in\mathcal{I}_{n}}|\widetilde{Y}_{n,i}-U_{n,i}|>\delta_{n}k_{n}^{-2}/2\}\cup\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\}.

Therefore,

Enci,jAn,i,j{maxin|Y~n,iUn,i|>δnkn2/2}(i,jn{0Un,jUn,i<δnkn2}),E_{n}^{c}~{}\subseteq~{}\cup_{i,j}A_{n,i,j}~{}\subseteq~{}\{\max_{i\in\mathcal{I}_{n}}|\widetilde{Y}_{n,i}-U_{n,i}|>\delta_{n}k_{n}^{-2}/2\}\cup(\cup_{i,j\in\mathcal{I}_{n}}\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\}),

which, together with (14), implies that

(Enc)(i,jn{0Un,jUn,i<δnkn2})+o(1).\mathbb{P}(E_{n}^{c})\leq\mathbb{P}(\cup_{i,j\in\mathcal{I}_{n}}\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\})+o(1). (15)

Next, consider the following argument:

(i,jn{0Un,jUn,i<δnkn2}|𝒢n)\displaystyle\mathbb{P}(\cup_{i,j\in\mathcal{I}_{n}}\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\}|\mathcal{G}_{n}) i,jn(0Un,jUn,i<δnkn2|𝒢n)\displaystyle\leq\sum_{i,j\in\mathcal{I}_{n}}\mathbb{P}(0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}|\mathcal{G}_{n})
2kninsupx(|Un,ix|δnkn2|𝒢n)\displaystyle\leq 2k_{n}\sum_{i\in\mathcal{I}_{n}}\sup_{x\in\mathbb{R}}\mathbb{P}(|U_{n,i}-x|\leq\delta_{n}k_{n}^{-2}|\mathcal{G}_{n})
=Op(δn)=op(1),\displaystyle=O_{p}(\delta_{n})=o_{p}(1), (16)

where the last line holds by Assumption 2.1(ii). By (16) and the bounded convergence theorem,

(i,jn{0Un,jUn,i<δnkn2})=o(1).\mathbb{P}(\cup_{i,j\in\mathcal{I}_{n}}\{0\leq U_{n,j}-U_{n,i}<\delta_{n}k_{n}^{-2}\})=o(1). (17)

By combining (15) and (17), we conclude that (Enc)=o(1)\mathbb{P}(E_{n}^{c})=o(1), as desired.

Step 2. We now prove the assertions in parts (a) and (b) of the theorem. In view of (11), we only need to prove 𝔼[ϕn]α\mathbb{E}[\phi_{n}]\to\alpha and 𝔼[ϕn]1\mathbb{E}[\phi_{n}]\to 1 in these two parts, respectively. For part (a), note that (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} are conditionally i.i.d. and so permutations constitute a group of transformations that satisfy the randomization hypothesis in Lehmann and Romano (2005, Definition 15.2.1). Then, Lehmann and Romano (2005, Theorem 15.2.1) implies that 𝔼[ϕn|𝒢n]=α\mathbb{E}[\phi_{n}|\mathcal{\ G}_{n}]=\alpha, and 𝔼[ϕn]=α\mathbb{E}[\phi_{n}]=\alpha then follows from the law of iterated expectations.

To prove part (b), we need some additional notation. To emphasize the dependence of T^n\widehat{T}_{n}, T^n\widehat{T}_{n}^{\ast}, and ϕ^n\hat{\phi}_{n} on the original data (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}}, we explicitly write them as T^n(Y)\widehat{T}_{n}(Y), T^n(Y)\widehat{T}_{n}^{\ast}(Y), and ϕ^n(Y)\hat{\phi}_{n}(Y). With this notation, we can rewrite ϕn=ϕ^n(U)\phi_{n}=\hat{\phi}_{n}(U), since it is computed in the same way as ϕ^n\hat{\phi}_{n} but with (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} replaced by (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}}.

We first analyze the asymptotic behavior of T^n(U)\widehat{T}_{n}(U). Define the empirical analogue of Qj,n()Q_{j,n}(\cdot) as

Q^j,n(x)1knij,n1{Un,ix}.\widehat{Q}_{j,n}(x)\equiv\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{j,n}}1\{U_{n,i}\leq x\}.

Since the variables (Un,i)ij,n(U_{n,i})_{i\in\mathcal{I}_{j,n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d.,

𝔼[(Q^j,n(x)Qj,n(x))2|𝒢n]O(kn1)=o(1).\mathbb{E}[(\widehat{Q}_{j,n}(x)-Q_{j,n}(x))^{2}|\mathcal{G}_{n}]\leq O(k_{n}^{-1})=o(1).

By Markov’s inequality and the law of iterated expectations, this implies that Q^j,n(x)Qj,n(x)=op(1)\widehat{Q}_{j,n}(x)-Q_{j,n}(x)=o_{p}(1) for each xx\in\mathbb{R}. This and a classical Glivenko–Cantelli theorem (e.g., Davidson (1994, Theorem 21.5)) imply that

supx|Q^j,n(x)Qj,n(x)|=op(1), for j{1,2}.\sup_{x\in\mathbb{R}}|\widehat{Q}_{j,n}(x)-Q_{j,n}(x)|=o_{p}(1),\text{ for }j\in\{1,2\}. (18)

By definition,

T^n(U)12knin(Q^1,n(Un,i)Q^2,n(Un,i))2.\widehat{T}_{n}(U)\equiv\frac{1}{2k_{n}}\sum_{i\in\mathcal{I}_{n}}(\widehat{Q}_{1,n}(U_{n,i})-\widehat{Q}_{2,n}(U_{n,i}))^{2}.

In addition, we define

Sn12knin(Q1,n(Un,i)Q2,n(Un,i))2.S_{n}\equiv\frac{1}{2k_{n}}\sum_{i\in\mathcal{I}_{n}}(Q_{1,n}(U_{n,i})-Q_{2,n}(U_{n,i}))^{2}.

Note that the functions Q^j,n()\widehat{Q}_{j,n}(\cdot) and Qj,n()Q_{j,n}(\cdot) are uniformly bounded. Hence, by the triangle inequality and (18),

|T^n(U)Sn|\displaystyle|\widehat{T}_{n}(U)-S_{n}| 12knin|(Q^1,n(Un,i)Q^2,n(Un,i))2(Q1,n(Un,i)Q2,n(Un,i))2|\displaystyle\leq\frac{1}{2k_{n}}\sum_{i\in\mathcal{I}_{n}}|(\widehat{Q}_{1,n}(U_{n,i})-\widehat{Q}_{2,n}(U_{n,i}))^{2}-(Q_{1,n}(U_{n,i})-Q_{2,n}(U_{n,i}))^{2}|
Kkninj{1,2}|Q^j,n(Un,i)Qj,n(Un,i)|=op(1).\displaystyle\leq\frac{K}{k_{n}}\sum_{i\in\mathcal{I}_{n}}\sum_{j\in\{1,2\}}|\widehat{Q}_{j,n}(U_{n,i})-Q_{j,n}(U_{n,i})|=o_{p}(1). (19)

Conditional on 𝒢n\mathcal{G}_{n}, the bounded random functions Q1,n()Q_{1,n}(\cdot) and Q2,n()Q_{2,n}(\cdot) can be treated as deterministic functions. Next, note that

Sn=12j{1,2}(Q1,n(x)Q2,n(x))2𝑑Qj,n(x)+op(1)=(Q1,n(x)Q2,n(x))2𝑑Q¯n(x)+op(1),S_{n}=\frac{1}{2}\sum_{j\in\{1,2\}}\int(Q_{1,n}(x)-Q_{2,n}(x))^{2}dQ_{j,n}(x)+o_{p}(1)=\int(Q_{1,n}(x)-Q_{2,n}(x))^{2}d\overline{Q}_{n}(x)+o_{p}(1), (20)

where the first equality holds by a law of large numbers for the conditionally i.i.d. variables (Un,i)ij,n(U_{n,i})_{i\in\mathcal{I}_{j,n}} for j=1,2j=1,2, and the second equality holds by the definition of Q¯n\overline{Q}_{n}. By combining (19) and (20), we deduce that

T^n(U)=(Q1,n(x)Q2,n(x))2𝑑Q¯n(x)+op(1).\widehat{T}_{n}(U)=\int(Q_{1,n}(x)-Q_{2,n}(x))^{2}d\overline{Q}_{n}(x)+o_{p}(1). (21)

In turn, by (21) and the condition in part (b), we conclude that

(T^n(U)>δn)1 for any δn=o(1).\mathbb{P}(\widehat{T}_{n}(U)>\delta_{n})\to 1\text{ for any }\delta_{n}=o(1). (22)

Next, we analyze the asymptotic behavior of T^n(U)\widehat{T}_{n}^{\ast}(U). It is useful to consider the following representation of this variable. We denote Uπ~=(Un,π~(i))inU_{\tilde{\pi}}=(U_{n,\tilde{\pi}(i)})_{i\in\mathcal{I}_{n}}, where π~\tilde{\pi} is a random permutation of n\mathcal{I}_{n}, independent from the data, and is drawn uniformly from the set of all permutations of n\mathcal{I}_{n}. By definition, T^n(U)\widehat{T}_{n}^{\ast}(U) is the 1α1-\alpha quantile of T^n(Uπ~)\widehat{T}_{n}(U_{\tilde{\pi}}), conditional on the sample, where the randomness comes from the random realization of π~\tilde{\pi}. To analyze the permutation distribution, we construct an additional coupling sequence of (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} following the method of Chung and Romano (2013, Section 5.3). We note that their coupling construction does not require the null hypothesis to hold, and it is thus suitable for our current purposes. The result of their coupling construction is another random sequence (Un,i)in(U_{n,i}^{\prime})_{i\in\mathcal{I}_{n}} such that (i) Un,i=Un,iU_{n,i}=U_{n,i}^{\prime} for all ii in some random subset nn\mathcal{I}_{n}^{\prime}\subseteq\mathcal{I}_{n}; (ii) the cardinality of nn\mathcal{I}_{n}\setminus\mathcal{I}_{n}^{\prime}, denoted DnD_{n}, satisfies 𝔼[Dn]=O(kn1/2)\mathbb{E}[D_{n}]=O(k_{n}^{1/2}); and (iii) (Un,i)in(U_{n,i}^{\prime})_{i\in\mathcal{I}_{n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d. with marginal distribution Q¯n\overline{Q}_{n}.

For any fixed arbitrary permutation π\pi and for j{1,2}j\in\{1,2\}, define

Q^j,n(x;π)1knij,n1{Un,π(i)x}andQ^j,n(x;π)1knij,n1{Un,π(i)x}.\widehat{Q}_{j,n}(x;\pi)\equiv\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{j,n}}1\{U_{n,\pi(i)}\leq x\}\quad\text{and}\quad\widehat{Q}_{j,n}^{\prime}(x;\pi)\equiv\frac{1}{k_{n}}\sum_{i\in\mathcal{I}_{j,n}}1\{U_{n,\pi(i)}^{\prime}\leq x\}.

By repeatedly using the triangle inequality,

|T^n(Uπ)T^n(Uπ)|\displaystyle|\widehat{T}_{n}(U_{\pi})-\widehat{T}_{n}(U_{\pi}^{\prime})|
=12kn|in((Q^1,n(Un,π(i);π)Q^2,n(Un,π(i);π))2(Q^1,n(Un,π(i);π)Q^2,n(Un,π(i);π))2)|\displaystyle=\frac{1}{2k_{n}}\left|\sum_{i\in\mathcal{I}_{n}}\bigg{(}\big{(}\widehat{Q}_{1,n}(U_{n,\pi(i)};\pi)-\widehat{Q}_{2,n}(U_{n,\pi(i)};\pi)\big{)}^{2}-\big{(}\widehat{Q}_{1,n}^{\prime}(U_{n,\pi(i)}^{\prime};\pi)-\widehat{Q}_{2,n}^{\prime}(U_{n,\pi(i)}^{\prime};\pi)\big{)}^{2}\bigg{)}\right|
Kknj{1,2}in|Q^j,n(Un,π(i);π)Q^j,n(Un,π(i);π)|\displaystyle\leq\frac{K}{k_{n}}\sum_{j\in\{1,2\}}\sum_{i\in\mathcal{I}_{n}}\big{|}\widehat{Q}_{j,n}(U_{n,\pi(i)};\pi)-\widehat{Q}_{j,n}^{\prime}(U_{n,\pi(i)}^{\prime};\pi)\big{|}
Kkn2i,kn|1{Un,π(k)Un,π(i)}1{Un,π(k)Un,π(i)}|\displaystyle\leq\frac{K}{k_{n}^{2}}\sum_{i,k\in\mathcal{I}_{n}}\big{|}1\{U_{n,\pi(k)}\leq U_{n,\pi(i)}\}-1\{U_{n,\pi(k)}^{\prime}\leq U_{n,\pi(i)}^{\prime}\}\big{|}
KDn/kn=op(1),\displaystyle\leq KD_{n}/k_{n}=o_{p}(1), (23)

where the last inequality uses the fact that (Un,i,Un,k)=(Un,i,Un,k)(U_{n,i},U_{n,k})=(U_{n,i}^{\prime},U_{n,k}^{\prime}) if (i,k)n×n(i,k)\in\mathcal{I}_{n}^{\prime}\times\mathcal{I}_{n}^{\prime}, and so the summation on the previous line only has (2kn)2(2knDn)24knDn(2k_{n})^{2}-(2k_{n}-D_{n})^{2}\leq 4k_{n}D_{n} bounded terms that can be different from zero; and the op(1)o_{p}(1) statement follows from 𝔼[Dn]=O(kn1/2)\mathbb{E}[D_{n}]=O(k_{n}^{1/2}), knk_{n}\to\infty, and Markov’s inequality.

For any fixed arbitrary permutation π\pi, T^n(Uπ)\widehat{T}_{n}(U_{\pi}^{\prime}) is the Cramér-von Mises statistic for the 𝒢n\mathcal{G}_{n}-conditionally i.i.d. variables (Un,π(i))in(U_{n,\pi(i)}^{\prime})_{i\in\mathcal{I}_{n}}. Hence, by a similar argument leading to (22), we have T^n(Uπ)=op(1)\widehat{T}_{n}(U_{\pi}^{\prime})=o_{p}(1). By combining this with (23), it follows that

T^n(Uπ)=op(1).\widehat{T}_{n}(U_{\pi})=o_{p}(1). (24)

Since this result holds for any arbitrary fixed permutation π\pi, it also holds for any pair of permutations considered at random from the set of all possible permutations of n\mathcal{I}_{n}, independently from the data. By elementary properties of stochastic convergence, this implies the so-called Hoeffding’s condition (e.g., Lehmann and Romano (2005, Equation (15.10))). By this and Lehmann and Romano (2005, Theorem 15.2.3), the permutation distribution associated with the test statistic T^n(U)\widehat{T}_{n}(U), conditional on the data, converges to zero in probability. As a corollary of this,

T^n(U)=op(1).\widehat{T}_{n}^{\ast}(U)=o_{p}(1). (25)

From (22) and (25), it is easy to see that T^n(U)>T^n(U)\widehat{T}_{n}(U)>\widehat{T}_{n}^{\ast}(U) with probability approaching 1. This further implies that 𝔼[ϕn]1\mathbb{E}[\phi_{n}]\to 1, which, together with (11) proves the assertion of part (b).  


Proof of Theorem 2.2.

(a) We prove the assertion of part (a) by applying Theorem 2.1(a). We construct the coupling variable Un,iU_{n,i} as follows:

Un,i=g(ζ(ikn)Δn,ϵn,i),for all in.U_{n,i}=g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i}),\quad\text{for all }i\in\mathcal{I}_{n}\text{.} (26)

We set 𝒢n=(ikn)Δn\mathcal{G}_{n}=\mathcal{F}_{(i^{\ast}-k_{n})\Delta_{n}}. By Assumption 2.2, (ϵn,i)in(\epsilon_{n,i})_{i\in\mathcal{I}_{n}} are i.i.d. and independent of 𝒢n\mathcal{G}_{n}. Since ζ(ikn)Δn\zeta_{(i^{\ast}-k_{n})\Delta_{n}} is 𝒢n\mathcal{G}_{n}-measurable, the variables (Un,i)in\left(U_{n,i}\right)_{i\in\mathcal{I}_{n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d. This verifies the condition in part (a) of Theorem 2.1, which also implies Assumption 2.1(i). It remains to verify conditions (ii) and (iii) in Assumption 2.1.

By a standard localization argument (see Jacod and Protter (2012, Section 4.4.1)), we can strengthen Assumption 2.3 by assuming T1=T_{1}=\infty, 𝒦m=𝒦\mathcal{K}_{m}=\mathcal{K}, and Km=KK_{m}=K for some fixed compact set 𝒦\mathcal{K} and constant K>0K>0. In particular, ζ(ikn)Δn\zeta_{(i^{\ast}-k_{n})\Delta_{n}} takes values in the compact set 𝒦\mathcal{K}. By Assumption 2.2, it is then easy to see that the 𝒢n\mathcal{G}_{n}-conditional probability density of Un,i=g(ζ(ikn)Δn,ϵn,i)U_{n,i}=g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i}) is uniformly bounded (and it does not depend on ii). This implies condition (ii) of Assumption 2.1.

Finally, we verify condition (iii) of Assumption 2.1. By Assumption 2.2(i), for each ini\in\mathcal{I}_{n}, εn,i\varepsilon_{n,i} is independent of iΔn\mathcal{F}_{i\Delta_{n}}. Since ζiΔn\zeta_{i\Delta_{n}} and ζ(ikn)Δn\zeta_{\left(i^{\ast}-k_{n}\right)\Delta_{n}} are iΔn\mathcal{F}_{i\Delta_{n}}-measurable, we deduce from Assumption 2.3(i) that

𝔼[|g(ζiΔn,ϵn,i)g(ζ(ikn)Δn,ϵn,i)|2|iΔn]Kan2ζiΔnζ(ikn)Δn2.\mathbb{E}[|g(\zeta_{i\Delta_{n}},\epsilon_{n,i})-g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})|^{2}|\mathcal{F}_{i\Delta_{n}}]\leq Ka_{n}^{2}\|\zeta_{i\Delta_{n}}-\zeta_{(i^{\ast}-k_{n})\Delta_{n}}\|^{2}. (27)

Note that under the null hypothesis with Δζτ=0\Delta\zeta_{\tau}=0, the processes ζt\zeta_{t} and ζ~t\tilde{\zeta}_{t} are identical. Hence, by Assumption 2.3(ii) and (27),

g(ζiΔn,ϵn,i)g(ζ(ikn)Δn,ϵn,i)2Kankn1/2Δn1/2.\left\|g(\zeta_{i\Delta_{n}},\epsilon_{n,i})-g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})\right\|_{2}\leq Ka_{n}k_{n}^{1/2}\Delta_{n}^{1/2}.

By the maximal inequality under the L2L_{2} norm (see, e.g., van der Vaart and Wellner (1996, Lemma 2.2.2)), we further deduce that

maxin|g(ζiΔn,ϵn,i)g(ζ(ikn)Δn,ϵn,i)|2KanknΔn1/2.\left\|\max_{i\in\mathcal{I}_{n}}\left|g(\zeta_{i\Delta_{n}},\epsilon_{n,i})-g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})\right|\right\|_{2}\leq Ka_{n}k_{n}\Delta_{n}^{1/2}. (28)

Recall that ankn3Δn1/2=o(1)a_{n}k_{n}^{3}\Delta_{n}^{1/2}=o(1) by assumption. Hence,

maxin|g(ζiΔn,ϵn,i)g(ζ(ikn)Δn,ϵn,i)|=op(kn2).\max_{i\in\mathcal{I}_{n}}\left|g(\zeta_{i\Delta_{n}},\epsilon_{n,i})-g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})\right|=o_{p}(k_{n}^{-2}). (29)

Note that, by the definitions in (4) and (26),

Yn,iUn,i=g(ζiΔn,ϵn,i)g(ζ(ikn)Δn,ϵn,i)+Rn,i.Y_{n,i}-U_{n,i}=g(\zeta_{i\Delta_{n}},\epsilon_{n,i})-g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})+R_{n,i}. (30)

Combining (29), (30), and Assumption 2.3(iii), we deduce that maxin|Yn,iUn,i|=op(kn2)\max_{i\in\mathcal{I}_{n}}\left|Y_{n,i}-U_{n,i}\right|=o_{p}\left(k_{n}^{-2}\right), which verifies Assumption 2.1(iii). We have now verified all the conditions needed in Theorem 2.1(a), which proves the assertion of part (a) of Theorem 2.2.

(b) We prove the assertion of part (b) by applying Theorem 2.1(b). Under the maintained alternative hypothesis, we have Δζτ=c\Delta\zeta_{\tau}=c for some constant c0c\neq 0. The coupling variable now takes the following form

Un,i={g(ζ(ikn)Δn,ϵn,i)i1,n,g(ζ(ikn)Δn+c,ϵn,i)i2,n.U_{n,i}=\left\{\begin{array}[]{ll}g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}},\epsilon_{n,i})&i\in\mathcal{I}_{1,n},\\ g(\zeta_{(i^{\ast}-k_{n})\Delta_{n}}+c,\epsilon_{n,i})&i\in\mathcal{I}_{2,n}.\end{array}\right. (31)

Under Assumption 2.2, it is easy to see that, for each j{1,2}j\in\{1,2\}, the variables (Un,i)ij,n(U_{n,i})_{i\in\mathcal{I}_{j,n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d., which verifies Assumption 2.1(i).

We now turn to the remaining conditions in Assumption 2.1. As in part (a), we can invoke the standard localization procedure and assume that the ζt\zeta_{t} process takes value in a compact set 𝒦\mathcal{K}. Note that

ζτ(ζ(ikn)Δn+Δζτ)=ζτζ(ikn)Δn=op(1),\zeta_{\tau}-(\zeta_{(i^{\ast}-k_{n})\Delta_{n}}+\Delta\zeta_{\tau})=\zeta_{\tau-}-\zeta_{(i^{\ast}-k_{n})\Delta_{n}}=o_{p}(1),

where the op(1)o_{p}(1) statement follows from the fact that the ζt\zeta_{t} process is càdlàg and knΔn0k_{n}\Delta_{n}\rightarrow 0. Therefore, by enlarging the compact set 𝒦\mathcal{K} slightly if necessary, we also have ζ(ikn)Δn+c𝒦\zeta_{(i^{\ast}-k_{n})\Delta_{n}}+c\in\mathcal{K} with probability approaching 1. Then, we can verify Assumption 2.1(ii) following the same argument as in part (a). The verification of Assumption 2.1(iii) is also similar.

Finally, we verify the condition in Theorem 2.1(b) pertaining to the conditional CDFs. Note that

Q1,n(x)=Fζ(ikn)Δn(x)andQ2,n(x)=Fζ(ikn)Δn+c(x).Q_{1,n}(x)=F_{\zeta_{(i^{\ast}-k_{n})\Delta_{n}}}(x)\quad\text{and}\quad Q_{2,n}(x)=F_{\zeta_{(i^{\ast}-k_{n})\Delta_{n}}+c}(x).

It is then easy to see that

2(Q1,n(x)Q2,n(x))2𝑑Q¯n(x)(Fζ(ikn)Δn(x)Fζ(ikn)Δn+c(x))2𝑑Fζ(ikn)Δn(x).\displaystyle 2\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right)\geq\int\left(F_{\zeta_{(i^{\ast}-k_{n})\Delta_{n}}}(x)-F_{\zeta_{(i^{\ast}-k_{n})\Delta_{n}}+c}(x)\right)^{2}dF_{\zeta_{(i^{\ast}-k_{n})\Delta_{n}}}(x).

Since ζ(ikn)Δn\zeta_{(i^{\ast}-k_{n})\Delta_{n}} takes values in the compact set 𝒦\mathcal{K}, Assumption 2.2(iii) implies that the lower bound in the above display is bounded away from zero. Hence, (Q1,n(x)Q2,n(x))2𝑑Q¯n(x)>δn\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right)>\delta_{n} for any real sequence δn=o(1)\delta_{n}=o(1). We have now verified all conditions for Theorem 2.1(b), which proves the assertion of part (b) of Theorem 2.2.  


Proof of Theorem 2.3.

This proof follows from similar arguments to those used to prove Theorem 2.1. For the sake of brevity, we focus on the only substantial difference, which is how we establish that (Enc)=o(1)\mathbb{P}(E_{n}^{c})=o(1). Recall that EnE_{n} denotes the event where the ordered values of (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} and (Y~n,i)in(\widetilde{Y}_{n,i})_{i\in\mathcal{I}_{n}} correspond to the same permutation of n\mathcal{I}_{n}. In the case of this proof, this result follows from

(Enc)(in{Y~n,iUn,i})in(Y~n,iUn,i)=o(1),\mathbb{P}(E_{n}^{c})~{}\leq~{}\mathbb{P}(\cup_{i\in\mathcal{I}_{n}}\{\widetilde{Y}_{n,i}\neq U_{n,i}\})~{}\leq~{}\sum_{i\in\mathcal{I}_{n}}\mathbb{P}(\widetilde{Y}_{n,i}\neq U_{n,i})~{}=~{}o(1),

where the first inequality follows from Encin{Y~n,iUn,i}E_{n}^{c}\subseteq\cup_{i\in\mathcal{I}_{n}}\{\widetilde{Y}_{n,i}\neq U_{n,i}\} and the convergence follows from the assumption that (Y~n,iUn,i)=o(kn1)\mathbb{P}(\widetilde{Y}_{n,i}\neq U_{n,i})=o\left(k_{n}^{-1}\right) uniformly in ini\in\mathcal{I}_{n}.  


Proof of Theorem 2.4.

(a) We prove this assertion by applying Theorem 2.3(a). We shall verify the conditions in Theorem 2.3 for Y~n,i=Yn,i\widetilde{Y}_{n,i}=Y_{n,i}, Un,i=g(ζ(ikn)Δn,ϵn,i)U_{n,i}=g(\zeta_{\left(i^{\ast}-k_{n}\right)\Delta_{n}},\epsilon_{n,i}), and 𝒢n=(ikn)Δn\mathcal{G}_{n}=\mathcal{F}_{\left(i^{\ast}-k_{n}\right)\Delta_{n}}. By assumption, the variables (ϵn,i)in\left(\epsilon_{n,i}\right)_{i\in\mathcal{I}_{n}} are i.i.d. and independent of 𝒢n\mathcal{G}_{n}. Hence, the variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d.

It remains to verify that (Yn,iUn,i)=o(kn1)\mathbb{P}\left(Y_{n,i}\neq U_{n,i}\right)=o\left(k_{n}^{-1}\right) uniformly in ini\in\mathcal{I}_{n}. By repeating the localization argument used in the proof of Theorem 2.2, we can strengthen Assumption 2.4 with T1=T_{1}=\infty without loss of generality. In particular, ζt\zeta_{t} takes values in some compact subset 𝒦𝒵\mathcal{K}\subseteq\mathcal{Z}. Note that for each ini\in\mathcal{I}_{n}, ϵn,i\epsilon_{n,i} is independent of (ζiΔn,ζ(ikn)Δn)\left(\zeta_{i\Delta_{n}},\zeta_{(i^{\ast}-k_{n})\Delta_{n}}\right). By Assumption 2.4(i), we thus have (Yn,iUn,i|𝒢n)KζiΔnζ(ikn)Δn\mathbb{P}\left(Y_{n,i}\neq U_{n,i}|\mathcal{G}_{n}\right)\leq K\left\|\zeta_{i\Delta_{n}}-\zeta_{\left(i^{\ast}-k_{n}\right)\Delta_{n}}\right\|. Then, by Assumption 2.4(ii), we further have (Yn,iUn,i)K(knΔn)1/2\mathbb{P}\left(Y_{n,i}\neq U_{n,i}\right)\leq K\left(k_{n}\Delta_{n}\right)^{1/2}. The condition (Yn,iUn,i)=o(kn1)\mathbb{P}\left(Y_{n,i}\neq U_{n,i}\right)=o\left(k_{n}^{-1}\right) then follows from kn3Δn=o(1)k_{n}^{3}\Delta_{n}=o(1). By Theorem 2.3(a), we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\rightarrow\alpha as asserted.

(b) We prove this assertion by applying Theorem 2.3(b). We verify the conditions in Theorem 2.3 for Y~n,i=Yn,i\widetilde{Y}_{n,i}=Y_{n,i}, 𝒢n=(ikn)Δn\mathcal{G}_{n}=\mathcal{F}_{\left(i^{\ast}-k_{n}\right)\Delta_{n}}, and

Un,i={g(ζ(ikn)Δn,ϵn,i)if i1,n,g(ζ(ikn)Δn+c,ϵn,i)if i2,n.U_{n,i}=\left\{\begin{array}[]{ll}g(\zeta_{\left(i^{\ast}-k_{n}\right)\Delta_{n}},\epsilon_{n,i})&\text{if }i\in\mathcal{I}_{1,n},\\ g(\zeta_{\left(i^{\ast}-k_{n}\right)\Delta_{n}}+c,\epsilon_{n,i})&\text{if }i\in\mathcal{I}_{2,n}.\end{array}\right.

Following the same argument as in part (a), we see that (Un,i)ij,n(U_{n,i})_{i\in\mathcal{I}_{j,n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d. for each j{1,2}j\in\{1,2\}, and (Yn,iUn,i)=o(kn1)\mathbb{P}(Y_{n,i}\neq U_{n,i})=o\left(k_{n}^{-1}\right) uniformly in ini\in\mathcal{I}_{n}. Assumption 2.2(iii) also ensures that ((Q1,n(x)Q2,n(x))2𝑑Q¯n(x)>δn)1\mathbb{P}(\int\left(Q_{1,n}(x)-Q_{2,n}(x)\right)^{2}d\overline{Q}_{n}\left(x\right)>\delta_{n})\rightarrow 1 for any real sequence δn=o(1)\delta_{n}=o(1). By Theorem 2.3(b), we have that 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\rightarrow 1, as asserted.  

Supplement: Extension to other test statistics

As explained in Remark 2.2, our main results extend beyond the Cramér-von Mises statistic in (1). We characterize the relevant class of test statistics by the following high-level assumption.

Assumption 5.1.

T^nΨn((Yn,i)in)\widehat{T}_{n}\equiv\Psi_{n}((Y_{n,i})_{i\in\mathcal{I}_{n}}), where (Ψn)n(\Psi_{n})_{n\in\mathbb{N}} is a sequence of functions that satisfies the following conditions:

(a) T^n\widehat{T}_{n} is a rank statistic, i.e., for any (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} and (Yn,i)in(Y_{n,i}^{\prime})_{i\in\mathcal{I}_{n}} with sign(Yn,iYn,j)=sign(Yn,iYn,j)sign(Y_{n,i}-Y_{n,j})=sign(Y_{n,i}^{\prime}-Y_{n,j}^{\prime}) for all i,jni,j\in\mathcal{I}_{n}, Ψn((Yn,i)in)=Ψn((Yn,i)in)\Psi_{n}((Y_{n,i})_{i\in\mathcal{I}_{n}})=\Psi_{n}((Y_{n,i}^{\prime})_{i\in\mathcal{I}_{n}}).

(b) For any (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}} and (Yn,i)in(Y_{n,i}^{\prime})_{i\in\mathcal{I}_{n}}, |Ψn((Yn,i)in)Ψn((Yn,i)in)|=Op(Dn/kn)|\Psi_{n}((Y_{n,i})_{i\in\mathcal{I}_{n}})-\Psi_{n}((Y_{n,i}^{\prime})_{i\in\mathcal{I}_{n}})|=O_{p}(D_{n}/k_{n}) with Dn=|{in:Yn,iYn,i}|D_{n}=|\{i\in\mathcal{I}_{n}:Y_{n,i}\not=Y_{n,i}^{\prime}\}|.

Assumption 5.1 is satisfied for a large class of test statistics, which includes the Cramér-von Mises and Kolmogorov-Smirnov statistics. In fact, the proof of Theorem 2.1 shows that the Cramér-von Mises statistic satisfies Assumption 5.1, and an analogous argument can be used to extend this to the Kolmogorov-Smirnov statistic. We now briefly describe the assumption. Assumption 5.1(a) is an essential ingredient to our methodology. In turn, Assumption 5.1(b) is a mild regularity condition that limits the influence that a few sample observations can have on the test statistic, and is only required to establish our consistency result.

The result proves Theorem 2.1 for any test statistic that satisfies Assumption 5.1. Since Theorem 2.1 is the key to all of the results in the paper, this effectively implies that our findings extend to the class of statistics characterized by Assumption 5.1, as claimed in Remark 2.2.

Theorem 5.1.

Under Assumptions 2.1 and 5.1 (instead of (1)),

(a) If the variables (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} have the same 𝒢n\mathcal{G}_{n}-conditional distribution, we have 𝔼[ϕ^n]α\mathbb{E}[\hat{\phi}_{n}]\to\alpha.

(b) Let T^n(U)\widehat{T}_{n}(U) denote the test statistic but applied to (Un,i)in(U_{n,i})_{i\in\mathcal{I}_{n}} instead of (Yn,i)in(Y_{n,i})_{i\in\mathcal{I}_{n}}. If knk_{n}\to\infty and (T^n(U)>δn)1\mathbb{P}(\widehat{T}_{n}(U)>\delta_{n})\to 1 for any real sequence δn=o(1)\delta_{n}=o(1), we have 𝔼[ϕ^n]1\mathbb{E}[\hat{\phi}_{n}]\to 1.

Proof.

This proof follows closely that of Theorem 2.1, which has two steps. Step 1 remains unchanged, as it only relies on Assumption 2.1 and the fact that T^n\widehat{T}_{n} is a rank statistic, imposed in Assumption 5.1(a). Part (a) of Step 2 also remains unchanged, as it is entirely based on Step 1. To complete this proof, it then suffices to cover the analog of part (b) of Step 2.

We begin by considering the asymptotic behavior of T^n(U)\widehat{T}_{n}^{\ast}(U), i.e., the 1α1-\alpha quantile of T^n(Uπ~)\widehat{T}_{n}(U_{\tilde{\pi}}), conditional on the sample, where the randomness comes from the realization of π~\tilde{\pi}. As in the proof of Theorem 2.1, we rely on the coupling construction based on Chung and Romano (2013, Section 5.3), which produces a random sequence (Un,i)in(U_{n,i}^{\prime})_{i\in\mathcal{I}_{n}} such that (i) Un,i=Un,iU_{n,i}=U_{n,i}^{\prime} for all ii in some random subset nn\mathcal{I}_{n}^{\prime}\subseteq\mathcal{I}_{n}; (ii) Dn=|{in:Un,iUn,i}|D_{n}=|\{i\in\mathcal{I}_{n}:U_{n,i}\not=U_{n,i}^{\prime}\}| satisfies 𝔼[Dn]=O(kn1/2)\mathbb{E}[D_{n}]=O(k_{n}^{1/2}); and (iii) (Un,i)in(U_{n,i}^{\prime})_{i\in\mathcal{I}_{n}} are 𝒢n\mathcal{G}_{n}-conditionally i.i.d. Then, for any fixed arbitrary permutation π\pi,

|T^n(Uπ)T^n(Uπ)|=|Ψn((Un,π(i))in)Ψn((Un,π(i))in)|=Op(Dn/kn1/2+ε)=op(1),|\widehat{T}_{n}(U_{\pi})-\widehat{T}_{n}(U_{\pi}^{\prime})|=|\Psi_{n}((U_{n,\pi(i)})_{i\in\mathcal{I}_{n}})-\Psi_{n}((U_{n,\pi(i)}^{\prime})_{i\in\mathcal{I}_{n}})|=O_{p}(D_{n}/k_{n}^{1/2+\varepsilon})=o_{p}(1), (32)

where the second equality relies on |{in:Un,π(i)Un,π(i)}|Dn|\{i\in\mathcal{I}_{n}:U_{n,\pi(i)}\not=U_{n,\pi(i)}^{\prime}\}|\leq D_{n} and Assumption 5.1(b), and the last equality relies on 𝔼[Dn]=O(kn1/2)\mathbb{E}[D_{n}]=O(k_{n}^{1/2}), knk_{n}\to\infty, and Markov’s inequality. We can then repeat the arguments in the proof of Theorem 2.1 to conclude that T^n(Uπ)=op(1)\widehat{T}_{n}(U_{\pi}^{\prime})=o_{p}(1) for any fixed arbitrary permutation π\pi. By this with (32), we conclude that T^n(Uπ)=op(1)\widehat{T}_{n}(U_{\pi})=o_{p}(1). Since π\pi was arbitrarily chosen, it then follows that

T^n(U)=op(1).\widehat{T}_{n}^{\ast}(U)=o_{p}(1). (33)

From the assumption in part (b) and (33), it is easy to see that T^n(U)>T^n(U)\widehat{T}_{n}(U)>\widehat{T}_{n}^{\ast}(U) with probability approaching one. We can then repeat the remaining arguments in the proof of Theorem 2.1 to complete this proof.  

References

  • Amihud (2002) Amihud, Y. (2002): “Illiquidity and Stock Returns: Cross-Section and Time-Series Effects,” Journal of Financial Markets, 5, 31–56.
  • Andersen (1996) Andersen, T. G. (1996): “Return Volatility and Trading Volume: An Information Flow Interpretation of Stochastic Volatility,” Journal of Finance, 51, 169–204.
  • Andrews (1993) Andrews, D. W. K. (1993): “Tests for Parameter Instability and Structural Change With Unknown Change Point,” Econometrica, 61, 821–856.
  • Antoch and Hušková (2001) Antoch, J. and M. Hušková (2001): “Permutation tests in change point analysis,” Statistics & Probability Letters, 53, 273–291.
  • Bai and Perron (1998) Bai, J. and P. Perron (1998): “Estimating and Testing Linear Models with Multiple Structural Changes,” Econometrica, 66, 47–78.
  • Barndorff-Nielsen and Shephard (2001) Barndorff-Nielsen, O. and N. Shephard (2001): “Non-Gaussian Ornstein–Uhlenbeck-Based Models and some of Their Uses in Financial Economics,” Journal of the Royal Statistical Society, Series B, 63, 167–241.
  • Barndorff-Nielsen and Shephard (2006) Barndorff-Nielsen, O. E. and N. Shephard (2006): “Econometrics of Testing for Jumps in Financial Economics Using Bipower Variation,” Journal of Financial Econometrics, 4, 1–30.
  • Bollerslev et al. (2018) Bollerslev, T., J. Li, and Y. Xue (2018): “Volume, Volatility, and Public News Announcements,” Review of Economic Studies, 85, 2005–2041.
  • Bollerslev and Todorov (2011) Bollerslev, T. and V. Todorov (2011): “Estimation of Jump Tails,” Econometrica, 79, 1727–1783.
  • Bugni and Canay (2021) Bugni, F. A. and I. A. Canay (2021): “Testing Continuity of a Density via g-order statistics in the Regression Discontinuity Design,” Journal of Econometrics, 221, 138–159.
  • Calonico et al. (2014) Calonico, S., M. D. Cattaneo, and R. Titiunik (2014): “Robust Nonparametric Confidence Intervals for Regression-Discontinuity Designs,” Econometrica, 82, 2295–2326.
  • Canay and Kamat (2017) Canay, I. A. and V. Kamat (2017): “Approximate Permutation Tests and Induced Order Statistics in the Regression Discontinuity Design,” The Review of Economic Studies, 85, 1577–1608.
  • Cattaneo et al. (2015) Cattaneo, M. D., B. R. Frandsen, and R. Titiunik (2015): “Randomization inference in the regression discontinuity design: An application to party advantages in the US Senate,” Journal of Causal Inference, 3, 1 – 24.
  • Cattaneo et al. (2017) Cattaneo, M. D., R. Titiunik, and G. Vazquez-Bare (2017): “Comparing inference approaches for RD designs: A reexamination of the effect of Head Start on child mortality,” Journal of Policy Analysis and Management, 36, 643–681.
  • Chernozhukov et al. (2018) Chernozhukov, V., K. Wuthrich, and Y. Zhu (2018): “Exact and Robust Conformal Inference Methods for Predictive Machine Learning With Dependent Data,” Tech. rep., arXiv:1802.06300v3.
  • Chow (1960) Chow, G. C. (1960): “Tests of Equality Between Sets of Coefficients in Two Linear Regressions,” Econometrica, 28, 591–605.
  • Chung and Romano (2013) Chung, E. and J. P. Romano (2013): “Exact and Asymptotically Robust Permutation Tests,” Annals of Statistics, 41, 484–507.
  • Cochrane and Piazzesi (2002) Cochrane, J. H. and M. Piazzesi (2002): “The Fed and Interest Rates - A High-Frequency Identification,” American Economic Review, 92, 90–95.
  • Comte and Renault (1998) Comte, F. and E. Renault (1998): “Long Memory in Continuous Time Stochastic Volatility Models,” Mathematical Finance, 8, 291–323.
  • Davidson (1994) Davidson, J. (1994): Stochastic Limit Theory, Oxford University Press.
  • DiCiccio and Romano (2017) DiCiccio, C. J. and J. P. Romano (2017): “Robust Permutation Tests for Correlation and Regression Coefficients,” Journal of the American Statistical Association, 112, 1211–1220.
  • Foster and Nelson (1996) Foster, D. P. and D. B. Nelson (1996): “Continuous Record Asymptotics for Rolling Sample Variance Estimators,” Econometrica, 64, 139–174.
  • Hahn et al. (2001) Hahn, J., P. Todd, and W. V. der Klaauw (2001): “Identification and Estimation of Treatment Effects with a Regression-Discontinuity Design,” Econometrica, 69, 201–209.
  • Hoeffding (1952) Hoeffding, W. (1952): “The Large-Sample Power of Tests Based on Permutations of Observations,” Annals of Mathematical Statistics, 3, 169–192.
  • Horváth and Rice (2014) Horváth, L. and G. Rice (2014): “Extensions of some classical methods in change point analysis,” Test, 23, 219–255.
  • Hušková (2004) Hušková, M. (2004): “Permutation principle and bootstrap in change point analysis,” Fields Institute Communications, 44, 273–291.
  • Imbens and Kalyanaraman (2011) Imbens, G. and K. Kalyanaraman (2011): “Optimal Bandwidth Choice for the Regression Discontinuity Estimator,” The Review of Economic Studies, 79, 933–959.
  • Imbens and Lemieux (2008) Imbens, G. W. and T. Lemieux (2008): “Regression Discontinuity Designs: A Guide to Practice,” Journal of Econometrics, 142, 615 – 635.
  • Jacod and Protter (2012) Jacod, J. and P. Protter (2012): Discretization of Processes, Springer.
  • Jentsch and Pauly (2015) Jentsch, C. and M. Pauly (2015): “Testing equality of spectral densities using randomization techniques,” Bernoulli, 21, 697–739.
  • Kirch (2007) Kirch, C. (2007): “Block permutation principles for the change analysis of dependent data,” Journal of Statistical Planning and Inference, 137, 2453–2474.
  • Kirch and Steinebach (2006) Kirch, C. and J. Steinebach (2006): “Permutation principles for the change analysis of stochastic processes under strong invariance,” Journal of Computational and Applied Mathematics, 186, 68–88.
  • Lee and Lemieux (2010) Lee, D. S. and T. Lemieux (2010): “Regression Discontinuity Designs in Economics,” Journal of Economic Literature, 48, 281–355.
  • Lehmann and Romano (2005) Lehmann, E. L. and J. P. Romano (2005): Testing Statistical Hypothesis, Springer.
  • Li and Liu (2020) Li, J. and Y. Liu (2020): “Efficient Estimation of Integrated Volatility Functionals under General Volatility Dynamics,” Econometric Theory, Forthcoming.
  • Li et al. (2017) Li, J., V. Todorov, and G. Tauchen (2017): “Jump Regressions,” Econometrica, 85, 173–195.
  • Li et al. (2021) Li, J., V. Todorov, and Q. Zhang (2021): “Testing the Dimensionality of Policy Shocks,” Review of Economics and Statistics, forthcoming.
  • Li and Xiu (2016) Li, J. and D. Xiu (2016): “Generalized Method of Integrated Moments for High-frequency Data,” Econometrica, 84, 1613–1633.
  • Nakamura and Steinsson (2018a) Nakamura, E. and J. Steinsson (2018a): “High-Frequency Identification of Monetary Non-Neutrality: The Information Effect,” Quarterly Journal of Economics, 133, 1283–1330.
  • Nakamura and Steinsson (2018b) ——— (2018b): “Identification in Macroeconomics,” Journal of Economic Perspectives, 32, 59–86.
  • Stock (1994) Stock, J. (1994): “Chapter 46 Unit Roots, Structural Breaks and Trends,” Elsevier, vol. 4 of Handbook of Econometrics, 2739 – 2841.
  • Thistlethwaite and Campbell (1960) Thistlethwaite, D. L. and D. T. Campbell (1960): “Regression-discontinuity Analysis: An alternative to the Ex Post Facto Experiment.” Journal of Educational Psychology, 51, 309–317.
  • Todorov and Tauchen (2012) Todorov, V. and G. Tauchen (2012): “Realized Laplace Transforms for Pure-jump Semimartingales,” The Annals of Statistics, 40, 1233–1262.
  • van der Vaart and Wellner (1996) van der Vaart, A. and J. Wellner (1996): Weak Convergence and Empirical Processes, Springer-Verlag.