Testing exogeneity in the functional linear regression model
Abstract
We propose a novel test statistic for testing exogeneity in the functional linear regression model. In contrast to Hausman-type tests in finite dimensional linear regression setups, a direct extension to the functional linear regression model is not possible. Instead, we propose a test statistic based on the sum of the squared difference of projections of the two estimators for testing the null hypothesis of exogeneity in the functional linear regression model. We derive asymptotic normality under the null and consistency under general alternatives. Moreover, we prove bootstrap consistency results for residual-based bootstraps. In simulations, we investigate the finite sample performance of the proposed testing approach and illustrate the superiority of bootstrap-based approaches. In particular, the bootstrap approaches turn out to be much more robust with respect to the choice of the regularization parameter.
AMS 2010 Classification: Primary: 62R10, 62F05, 62E20, 62J20; Secondary: 62F40
Keywords and Phrases: asymptotic theory, bootstrap inference
1 Introduction
In functional linear regression models, goodness-of-fit tests are much more complicated to construct than, e.g., in the multiple linear setting. This stems, among other things, from the fact that in functional linear regression models the -distance of the slope function estimator to the true function has no proper limiting distribution. This was shown in [Cardot et al. (2006)] and [Ruymgaart et al. (2000)] for two estimators in the classical functional linear regression model under exogeneity. It turns out that the lack of a proper limiting distribution also applies to other estimators under different model assumptions. This phenomenon, inherent to functional data setups, is probably one of the reasons why goodness-of-fit testing is not yet widely developed for functional regression models. In particular, desirable counterparts of standard tests that are well-established in the multiple linear model are still missing in the functional linear setting.
In functional data settings, existing goodness-of-fit tests are described in [Müller and Stadtmüller (2005)], who use a suitable scalar product based on the autocovariance operator to transform the functions to a different space and thereby obtain a test statistic with a proper limiting distribution. Further approaches are given in [Cuesta-Albertos et al. (2019)] and [García-Portugués et al. (2014), García-Portugués et al. (2020)], who use random projections together with empirical process techniques.
In practice, one important model assumption is the exogeneity of the regressor. Especially in economics, this assumption is often violated: the regressors are correlated with the error terms, which leads to endogeneity issues. Estimation in such a model is an inverse problem, and neglecting endogeneity generally results in inconsistent estimators. Hence, it is important to test the data for exogeneity first. If the null hypothesis of exogeneity is rejected, different estimators such as instrumental variable (IV) estimators are required to achieve consistent estimation; see [Johannes (2016)], [Florens et al. (2011)] or [Florens and Van Bellegem (2014)], who consider such IV estimators in functional regression setups and derive asymptotic theory. While in the multiple linear regression model the Hausman test (see [Hausman (1978)] and [Wu (1973)]) is a standard and natural approach for testing exogeneity, this method cannot be transferred directly to the functional linear model, since it is based on the -distance of two slope function estimators. This is due to the following proposition, which transfers the results in [Cardot et al. (2006)] and [Ruymgaart et al. (2000)] to the present setting.
In the following, let denote the estimator of the slope function in the exogenous model described in [Johannes (2013)], which is consistent under exogeneity but inconsistent under endogeneity, and denote by the IV estimator in the endogenous functional linear model given in [Johannes (2016)], which is consistent in both cases. Then, we get the following result.
Proposition 1.1
In the functional linear regression model (2.1) defined below, even under exogeneity, there is no random variable with non-degenerate distribution, such that
for some real sequence with , where denotes the norm of the Hilbert space.
The proof of this result mainly goes along the lines of the one in [Cardot et al. (2006)]; see also [Dorn (2021)] for further details. This is why we just state the result here and use it as motivation for a different approach. Motivated by the fact that, in contrast to the -distance, the projection error typically has an asymptotic distribution (see e.g. [Müller and Stadtmüller (2005)] and [Florens and Van Bellegem (2014)]), we propose to use the sum of the squared difference of projections of the two estimators as test statistic.
The rest of the paper is organized as follows. In Section 2, we state the model assumptions, construct the test statistic and derive its asymptotic distribution. As the limiting distribution turns out to depend on unknown functional nuisance parameters, which are difficult to estimate, we propose residual-based bootstrap methods in Section 3 and prove their consistency. Section 4 discusses generalizations to other estimators and goodness-of-fit testing. The finite sample performance of all discussed tests is investigated in Section 5. All longer proofs are deferred to the Appendix and additional auxiliary results to a supplement.
2 Model and test statistic
We consider the functional linear regression model
(2.1)
where is a real-valued random variable, is a real-valued error term with and , is a functional random variable with values in such that . In this setup, the error variance is unknown, and is an unknown function from the Sobolev space of periodically extendable square integrable functions denoted by
(2.2)
where is the Fourier basis of , and , , see e.g. [Neubauer (1988a)] or [Tsybakov (2004)]. In the setup of (2.1), we will speak of exogeneity (and call an exogenous regressor) if
(2.3)
Otherwise, we will speak of endogeneity (and call an endogenous regressor) if
(2.4)
For consistent estimation in the endogenous case, we assume to additionally have a functional instrumental variable with values in such that and for all . For the sake of simplicity, it is often assumed in the literature that holds for all . However, the general case can be handled along the same lines by centering with the sample mean in a first step, and our results are stated for the general case. For estimating the cross-covariance operator, we also assume that is second-order stationary, see [Johannes (2016)].
Assumption 1
There exist functions , such that
for all , respectively, where is assumed to be continuous.
By imposing continuity of , whenever (2.4) holds for one , this immediately implies on some set with positive Lebesgue measure. This condition ensures that the test statistic proposed in the following can be used to consistently test the null hypothesis (2.3) against alternatives (2.4).
Note that and define the kernels of the covariance operators of and of , respectively, and is the kernel of the cross covariance operator of and . The (joint) weak stationarity of ensures that both covariance operators as well as the cross covariance operator have the same exponential system of eigenfunctions, which we denote by . Hence, let be the eigensystem of , the eigensystem of and the eigensystem of . Furthermore, denote .
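To illustrate the role of the shared exponential eigensystem: under second-order stationarity, the covariance operator is diagonal in the Fourier basis, so its eigenvalues can be estimated by averaging the squared moduli of the Fourier coefficients of the observed curves. The following Python sketch assumes curves sampled on an equidistant grid; the function name and discretization are illustrative and not taken from the paper.

```python
import numpy as np

def empirical_eigenvalues(X):
    """Estimate eigenvalues of the covariance operator of curves X (n, T),
    assuming the operator is diagonal in the Fourier basis (as implied by
    second-order stationarity). Returns averaged squared coefficient moduli."""
    n, T = X.shape
    Xc = X - X.mean(axis=0)               # center with the sample mean
    coefs = np.fft.rfft(Xc, axis=1) / T   # discrete Fourier coefficients
    return (np.abs(coefs) ** 2).mean(axis=0)
```

For white-noise curves, the estimated eigenvalues are roughly flat across frequencies; for smoother processes they decay, mirroring the decay conditions imposed below.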
Assumption 2
Throughout the article, we assume that all eigenvalues are strictly positive and that
Furthermore, we denote by and the expectations of and , respectively. Additionally, we assume that there exists some such that
(2.5)
The last assumption ensures that the linear prediction of with respect to is well defined.
In principle, if they were available, IV estimation would be based on the optimal instrument defined by
and the eigenvalues of the corresponding cross covariance operator . However, this is usually not the case, and the optimal instrument and the corresponding eigenvalues of the cross covariance operator have to be estimated. Note that could be computed exactly from and if the (cross) covariance operators were known, and remark that for all .
In the following, let be independent and identically distributed (i.i.d.) copies of and suppose (2.3) is valid. Then, we can consistently estimate the unknown slope function following [Johannes (2013)] and [Johannes (2016)] in two different ways. For this purpose, let be a sequence of regularization parameters such that for all and . To simplify notation, we will write for the regularization parameter, keeping in mind that it still depends on . Since the covariance operators and therefore the corresponding eigenvalues are unknown, they have to be estimated in a first step. Further, let denote the empirical versions of and , respectively, defined by
for . These estimators as well as the deduced estimators
for the eigenvalues , , and , respectively, are consistent for all . Hence, observations of the optimal linear instrument can be estimated by
and the corresponding cross covariance operator by
(2.6)
This allows us to construct the IV-based estimator of the slope function defined by
(2.7)
where
As shown in [Johannes (2016)], under Assumptions 1 and 2, the estimator is consistent under the exogeneity assumption (2.3) as well as under endogeneity as in (2.4). In contrast, again under Assumptions 1 and 2, the estimator
(2.8)
is only consistent under the exogeneity assumption (2.3) (see [Johannes (2013)]) and inconsistent under endogeneity as in (2.4). Note that, in comparison to the original definition in [Johannes (2013)], for , we use the same indicator function as in . It turns out that the tests perform better if the same regularization is used in both estimators, although it might not be the best choice for estimating by under assumption (2.3).
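Both estimators rely on spectral cut-off regularization: eigenvalues whose modulus falls below the threshold are discarded via an indicator function, and only the remaining ones are inverted. A minimal coefficient-wise sketch of this step (generic form; the names are illustrative, not the paper's notation):

```python
import numpy as np

def spectral_cutoff_estimate(cov_eigvals, cross_coefs, alpha):
    """Divide cross-covariance coefficients by covariance eigenvalues, but
    only where |eigenvalue| exceeds the regularization threshold alpha; all
    other coefficients are set to zero (spectral cut-off)."""
    eig = np.asarray(cov_eigvals, dtype=float)
    cross = np.asarray(cross_coefs, dtype=float)
    keep = np.abs(eig) > alpha            # indicator-based regularization
    coefs = np.zeros_like(eig)
    coefs[keep] = cross[keep] / eig[keep]
    return coefs
```

Using the same indicator function in both estimators, as described above, corresponds to applying the same `keep` mask when forming both coefficient vectors.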
Based on the two estimators (2.7) and (2.8), we construct the test statistic as
(2.9)
The last representation above corresponds to the idea used in [Müller and Stadtmüller (2005)] to construct a goodness-of-fit test. The equivalence of both approaches can be seen by using the singular value decomposition of the estimators and of the covariance operator.
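In coefficient form, the statistic in (2.9) amounts to summing squared differences of finitely many projection coefficients of the two estimators. A hypothetical sketch (the inputs stand for estimated coefficients of the two slope estimators; the scaling and weighting appearing in (2.9) are omitted):

```python
import numpy as np

def projection_statistic(b_exo, b_iv, m):
    """Sum of squared differences of the first m projection coefficients of
    the two slope function estimators (unscaled illustrative sketch)."""
    d = np.asarray(b_exo[:m], dtype=float) - np.asarray(b_iv[:m], dtype=float)
    return float(np.sum(d ** 2))
```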
Assumption 3
For the sequence of regularization parameters, we assume
For the next results, different moment conditions on , and are required. To simplify the notation, we introduce the following sets. In doing so, we assume that all conditions on and mentioned above are fulfilled and define
(2.10)
(2.11)
In the following, for an operator , we denote by the regularized inverse of the operator, that is
and we define
(2.12)
where denotes the Hilbert-Schmidt norm and we set
Now, we are in a position to state an asymptotic result for the test statistic.
Theorem 2.1
Proof. For the sake of simplicity, we assume that is centered. If not, the additional bias term has to be taken into account, as stated in the assertion of the theorem. We give a short overview of the proof; the propositions and lemmas used are stated and proven in the appendix. For the employed decomposition of the test statistic, we need several (modified) correlation operators of the instruments and . We define , by
and set
For the test statistic, the following decomposition holds
(2.13)
where
and
(2.14)
When subtracting , the last term in (2.13) can be further decomposed to get
where , are defined in the appendix. There, we will also see that
converges weakly to a normal distribution with mean 0 and variance due to Theorem A.1, while all remaining terms are shown to be asymptotically negligible using Propositions A.5, A.2, A.3 and A.4 together with standard estimation techniques for the mixed terms. The assertion then follows with Slutsky's lemma.
To apply the above result for testing, the bias and variance term have to be estimated. To this end, note that can be consistently estimated by
(2.15)
due to the law of large numbers and since by calculations similar to those in the derivation of the asymptotic distribution of .
Corollary 2.2
Using Corollary 2.2, it is possible to construct a test for the null hypothesis
(2.16)
against
(2.17)
For given size , we can reject if
(2.18)
where denotes the -quantile of the standard normal distribution. That is, we get a one-sided test for against . In the special case we can neglect the additional bias term, which avoids the use of its plug-in estimator, so that the test has the simpler structure .
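The decision rule (2.18) compares the standardized statistic to a standard normal quantile. Assuming the bias and variance have already been plugged in, the rule can be sketched with only the Python standard library:

```python
from statistics import NormalDist

def reject_exogeneity(standardized_stat, size=0.05):
    """One-sided test: reject the null of exogeneity when the standardized
    statistic exceeds the (1 - size)-quantile of the standard normal law."""
    q = NormalDist().inv_cdf(1.0 - size)  # roughly 1.645 for size = 0.05
    return standardized_stat > q
```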
Corollary 2.3
Proof. We only consider the special case here; the general case can be proven by similar arguments. Under , does not consistently estimate ; instead, it converges in probability to for some with
which is in general not equal to under endogeneity (by the continuity imposed in Assumption 1). Hence, we have
The standardized version of the first part converges in distribution to a standard normal distribution by arguments similar to those in Theorem 2.1 and Corollary 2.2, while the sum of the remainder terms multiplied by goes to infinity for . Consequently, we have
for .
In practice, we do not know whether , so a naive application of the asymptotic test without estimating could result in wrong decisions. In addition, asymptotic tests based on plug-in methods as above usually exhibit lower power compared to other methods, due to the additional estimation step. The bootstrap version of the test discussed in the next section is expected to have better finite sample behavior, since the unknown bias and variance need not be estimated. As a further consequence, we need not distinguish between the cases and , which is a clear advantage of the bootstrap test.
3 Bootstrap Consistency
In this section, we use residual-based bootstrap procedures to estimate the distribution of
under the null of exogeneity . To this end, we first estimate the residuals from the original data set and define
where we use the IV-based estimator, because it is consistent under the null hypothesis as well as under the alternative. Using the classical estimator would also result in a proper bootstrap scheme for approximating the distribution of the test statistic under the null of exogeneity, since the independence of error and regressor in the bootstrap sample is achieved by the (fixed-design) bootstrap procedure itself. However, to obtain bootstrap data that mimics the true distribution under the null hypothesis of exogeneity, given the original sample, as closely as possible, the IV-based estimator turns out to be more natural and performs better in simulations. In the sequel, different versions of residual-based bootstraps are considered. All bootstrap methods follow these steps:
- Step 1.)
Given i.i.d. observations , , we generate a bootstrap sample , , by
where the bootstrap errors are generated from the residuals in such a way that the conditional independence of and is ensured. A thorough discussion of which types of bootstrap are appropriate in this sense follows below.
- Step 2.)
From , , a bootstrap test statistic is calculated.
- Step 3.)
Repeat Steps 1.) and 2.) times, where is large, to get bootstrap realizations of the test statistic and denote by the corresponding empirical -quantile.
As the bootstrap errors are generated such that conditional independence of and is ensured, the bootstrap automatically adopts the exogeneity assumption. For the naive (Efron-type) residual bootstrap, this is trivially the case, because the bootstrap errors are drawn independently with replacement from the residuals; for the wild bootstrap, it holds since suitable bootstrap multiplier variables are drawn independently of and .
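Steps 1.) to 3.) can be sketched as follows for the Efron-type variant. The callable `statistic` is an illustrative placeholder for computing the bootstrap test statistic from the bootstrap responses in Step 2; the fixed-design resampling and the centering follow the description above.

```python
import numpy as np

rng = np.random.default_rng(0)

def residual_bootstrap_quantile(y, fitted, statistic, B=500, size=0.05):
    """Efron-type residual bootstrap (sketch): resample centered residuals
    with replacement, rebuild responses with fixed regressors (via `fitted`),
    recompute the statistic B times, return the empirical (1-size)-quantile."""
    res = y - fitted
    res = res - res.mean()                           # center the residuals
    stats = np.empty(B)
    for b in range(B):
        errors_star = rng.choice(res, size=len(res), replace=True)  # Step 1
        stats[b] = statistic(fitted + errors_star)   # Step 2 on bootstrap data
    return np.quantile(stats, 1.0 - size)            # Step 3
```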
Theorem 3.1
Based on this result, we can again construct a one-sided test for the null hypothesis (2.16) against (2.17), which rejects the null hypothesis if from Step 3, since and both asymptotically have the same bias and variance.
4 Generalization to other estimators and measuring goodness-of-fit
While the above results are stated for the spectral-cut-off estimators as proposed in [Johannes (2013)] and [Johannes (2016)], it is also possible to derive analogous results for other types of estimators, such as the cut-off estimators in [Müller and Stadtmüller (2005)] or estimators based on Tikhonov or ridge-type regularization. A quite general approach is given in [Cardot et al. (2006)] with a sequence of regularization functions such that is decreasing on , where are the eigenvalues of the relevant covariance operator and is a decreasing sequence of positive values with . Furthermore, and is differentiable on , which replaces Assumption 3. While the estimator from (2.8) above does not completely fit this situation, this modification need not be considered if one is only interested in testing goodness-of-fit in an exogenous model (2.1). For the sake of shorter notation, we assume here . If denotes the estimator proposed in [Cardot et al. (2006)], we obtain under Assumption 1 and the moment conditions in Theorem 2.1 the following result
with , as in Theorem 2.1 and
It is straightforward to also generalize the instrumental variable estimator to other regularization schemes. We get an estimator
and, if we are willing to assume exogeneity here, derive by the same arguments as above
with , as in Theorem 2.1 and
The assumption of exogeneity is not realistic in this case, because one would only use the instrumental variable estimator under endogeneity. Proving an analogous result under endogeneity is possible in principle, but the proof differs in several points from the one presented here.
Using the estimators and , we can construct a test statistic similar to the one above. To this end, we need a similar regularization scheme for both estimators. If we allow for a second argument , the estimators involved in the test above can also be written with for and for , and it is straightforward to generalize them, at least to regularization functions of type and , respectively. Under Assumption 1, the moment assumptions of Theorem 2.1 and certain regularity conditions, we derive under the null hypothesis
For all results presented in this section, it is again straightforward to derive empirical versions and bootstrap results.
5 Finite sample properties
In this section, we investigate the finite sample behavior of the tests proposed above under several degrees of endogeneity and for different slope functions. We generate our data from the model
and
for some large enough to approximate the integral sufficiently well. To control all correlations in the model, we generate i.i.d. copies of
with , see [Wong (1996)]. The random variable is uniformly distributed on and independent of . The parameter controls the severity of endogeneity (if we are in the exogenous case, i.e. under the null ) and the strength of the instrument . The standard deviation is assumed to be . In the following, as illustrated in Figure 1, we will use three different slope functions and defined by
(5.1)

where in , and with
and . For all simulations, we generate 1000 Monte Carlo realizations and use bootstrap replications.
Besides an Efron-type residual-based bootstrap, which draws the bootstrap errors , independently with replacement from the residuals , we also consider several versions of a residual-based wild bootstrap, where
and the ’s are i.i.d. with and and independent of . We consider different choices for the distribution of the ’s as commonly used in the literature, see e.g. [Mammen (1993)],
(5.2)
(5.3)
(5.4)
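The concrete multiplier laws in (5.2) to (5.4) are not reproduced here; as a hedged illustration, the following sketches two standard choices with mean 0 and variance 1 that are commonly used for the wild bootstrap (cf. [Mammen (1993)]): the Rademacher law and Mammen's two-point law.

```python
import numpy as np

rng = np.random.default_rng(1)

def rademacher(n):
    """P(v = +1) = P(v = -1) = 1/2; mean 0, variance 1."""
    return rng.choice(np.array([-1.0, 1.0]), size=n)

def mammen_two_point(n):
    """Mammen's two-point law: v = -(sqrt(5)-1)/2 with probability
    (sqrt(5)+1)/(2*sqrt(5)), else v = (sqrt(5)+1)/2; mean 0, variance 1,
    and third moment 1."""
    a = (np.sqrt(5.0) - 1.0) / 2.0
    b = (np.sqrt(5.0) + 1.0) / 2.0
    p = (np.sqrt(5.0) + 1.0) / (2.0 * np.sqrt(5.0))
    return np.where(rng.random(n) < p, -a, b)
```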

In a first step, we try to get an idea of how to choose and, in a next step, how to choose . To this end, we fix the degree of endogeneity with and the strength of the instrument with . In Figure 2, the results for the asymptotic test using as slope parameter and different choices of are shown. We see that the best results are obtained for between and . For smaller , the test does not hold the prescribed level, while for larger the power is comparably small, up to biased tests for larger than . Based on Figure 2, we can find a sequence of good choices for depending on the sample size, varying from for to for larger sample sizes up to . We see that the asymptotic test has only moderate power even for larger sample sizes. This is a well-known effect of asymptotic tests using plug-in estimators.
The way out is typically a bootstrap-based test. The results for the residual-based bootstraps proposed in Section 3 and again are shown in Figure 3.

It turns out that the regularization parameter can be chosen considerably smaller than for the asymptotic test, and the procedure is much more robust with respect to the choice of . Nearly all tests hold the size of for larger sample sizes, and the power increases with the sample size for most choices of , up to a value close to 1 already for . Again, we can get an idea of a good choice of depending on the sample size, varying from for to for and .
Apparently, all bootstrap procedures discussed in Section 3 perform comparably well, as can be seen in Figure 4 for a choice of .

Comparing the performance of the bootstrap test for different slope functions, we discover that the bootstrap test holds the size in all models, while Table 1 shows that the power is similarly good for all settings, with only slight disadvantages for the smoothed indicator function .
| | 25 | 50 | 75 | 100 | 125 | 150 | 175 | 200 | 225 | 250 | 275 | 300 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | 0.111 | 0.507 | 0.773 | 0.901 | 0.960 | 0.980 | 0.992 | 0.997 | 0.998 | 0.998 | 1 | 1 |
| | 0.164 | 0.568 | 0.798 | 0.912 | 0.958 | 0.979 | 0.992 | 0.997 | 0.999 | 0.998 | 1 | 1 |
| | 0.255 | 0.560 | 0.733 | 0.853 | 0.904 | 0.961 | 0.978 | 0.990 | 0.993 | 0.994 | 0.997 | 0.998 |
Finally, we inspect the influence of the degree of endogeneity and the strength of the instrument on the performance of the test. In Figure 5, we see that the power of the bootstrap test increases with an increasing degree of endogeneity, being already acceptable for .

Figure 6 shows that the performance of the test depends strongly on the strength of the instrument. If the instrument is too weak, the power is low and the test does not hold the size. It turns out that, for the setting with slope function , and , the bootstrap test performs best for an instrument strength around .

6 Concluding remarks
This work presents a first approach to testing for endogeneity in a functional regression setup by introducing a modification of the classical Hausman test from the multiple linear regression model. This modification is required because the -distance of two slope function estimators in functional linear regression models is shown to have no proper limiting distribution. We prove asymptotic normality for the proposed modified Hausman-type test statistic, which allows for the construction of asymptotic tests for exogeneity. As the asymptotic test has several drawbacks, such as many nuisance parameters that are cumbersome to estimate, an additional bias term that diverges when multiplied with the rate of convergence, and a high sensitivity to the choice of the regularization parameter, we propose suitable bootstrap versions of the test to approximate the null distribution. This avoids the additional estimation of nuisance parameters and turns out to be much more robust with respect to the choice of the regularization parameter, as demonstrated in a detailed simulation study. Topics of ongoing work are the choice of the instrument, a data-driven choice of the regularization parameter and the transfer to other regression models.
Appendix A Auxiliary Results for the Proof of Theorem 2.1
We assume for the sake of simplicity that for all and recall from Section 2 the decomposition of the test statistic with
(A.1)
(A.2)
and define
(A.3)
(A.4)
(A.5)
The first result establishes the asymptotic distribution of the test statistic.
Theorem A.1
Under the assumptions of Theorem 2.1, under the null hypothesis, and for and for all , we have
The remaining results show that the remainder terms are negligible.
Proposition A.2
Let and . Under the assumptions of Theorem 2.1, we have
Proposition A.3
Under the assumptions of Theorem 2.1 and if and , we have
Proposition A.4
Under the assumptions of Theorem 2.1, and if and , we have
Proposition A.5
Under the assumptions of Theorem 2.1, and if and , we have
Appendix B Auxiliary results
The results in this section are used in several places in the proofs. They follow from Lemma A.1 in [Johannes (2016)].
Lemma B.1
Let and have finite second moments and . Then we have and . If additionally and , we have
Lemma B.2
Let be fixed and suppose and . Then, there is a positive constant such that for , we have
(B.1)
and
(B.2)
Appendix C Proof of Theorem A.1
The proof follows by using a central limit theorem for martingale difference sequences with respect to , where and , see [Hall and Heyde (1980)], Theorem 3.2 and Corollary 3.1, for
where
and
In a first step, we consider the conditional variance of the martingale difference scheme.
Proposition C.1
Under the assumptions of Theorem 2.1, under the null hypothesis and for , we have
Proof. Using that is independent of , we can decompose
We define
and show
by proving the corresponding -convergence. Afterwards we show that converges in probability to . Writing for and
and, observing that for some constant , we get
with
We have
(C.1)
because and are uncorrelated for all and . With Lemma B.1 and (E.6), for all and , we have
(C.2)
as well as
(C.3)
For the mixed terms with and and for all , we get
(C.4)
Using this, we have
with some constant . With similar arguments we obtain
and
which altogether results in
The stochastic convergence of follows by
for . To prove that converges stochastically to 0, we again show the corresponding -convergence. To this end, we bound for all and the term by a constant using the centeredness of and Lemma B.1, to obtain
The detailed arguments can be found in the supplementary material.
The second step is to show the conditional Lindeberg condition by verifying an unconditional Lyapunov condition.
Proposition C.2
Under the assumptions of Theorem 2.1, under the null hypothesis, and with , we have
(C.5)
Proof. It is shown in [Alj et al. (2014)] and [Gänssler et al. (1978)] that the conditional Lindeberg condition follows from the unconditional Lyapunov condition. We will show in the following that
and decompose
where
For , we use that for all , are stochastically independent of and are uncorrelated with . Furthermore, the fourth absolute moment of is uniformly bounded, due to the centeredness of and Lemma B.1. The fourth absolute moment of can be estimated using Assumption 3 and as
(C.6)
Again using similar arguments, we obtain
(C.7)
This results in
(C.8)
Putting these results together, we get
where the first series converges due to Lemma B.1 and the second series either also converges or, if not, can be bounded by .
Considering , we use the stochastic independence of and for all , which results in
The rest of the argument consists of calculating the expectations, using that for all , and are uncorrelated with for all and stochastically independent of . Finally,
(C.9)
and, in the same way, , which gives .
With similar arguments as above, which can be found in the supplementary material, we get
and
All remainder terms can be estimated with similar techniques. We exemplarily show the idea for Proposition A.5, that is for , in the supplementary material.
Appendix D Proof of Theorem 3.1
Let denote the distribution function of the normal distribution with mean zero and variance , the distribution function of and the distribution function of the conditional distribution of given . By bounding
similar to the example in Section 29 of [DasGupta (2008)], it is enough to show the convergence of and . Due to the continuity of , the convergence of follows directly from Theorem 2.1 and Polya’s Theorem, as stated in Section 1.5.3 of [Serfling (1980)]. Again, using Polya’s Theorem, it is enough to show for , that for all
(D.1)
For this, we just imitate the proof of Theorem 2.1. Analogously to (2.13), we decompose
where, similar to the proof of Theorem 2.1, we get
Then, converges weakly in probability to along the lines of Theorem A.1. The remainder terms can be shown to be negligible with the same arguments as for the remainder terms in Theorem 2.1.
Appendix E Supplementary Material
E.1 Proof of Proposition A.5
We give only the proof for . We have
The terms quadratic in can be estimated by Lemma B.1 and (E.6), while the other terms, except the one coming from , vanish
Using the Cauchy-Schwarz inequality (E.3) leads to
The expectations with , and , can be estimated by (E.4). This finally yields
The second part can be shown by using
(E.1)
for all together with Lemma B.1
All the other parts of Proposition A.5 as well as Propositions A.2-A.4 follow by very similar techniques. For details, we refer to [Dorn (2021)] in the main article.
E.2 Details for the proof of Proposition C.1
Using that is independent of , we can decompose
We define
and show
by proving the corresponding -convergence. Afterwards we show that converges in probability to . Writing for and
and, observing that for some constant , we get
with
We have
(E.2)
because and are uncorrelated for all and . With Lemma B.1 and (E.6), for all and , we have
(E.3)
as well as
(E.4)
For the mixed terms with and and for all , we get
(E.5)
Using this, we have
with some constant . With similar arguments, we obtain
which can be further bounded using the Cauchy-Schwarz inequality to get
Using similar arguments as for the first two terms, can also be bounded to get
Altogether, we have
The stochastic convergence of follows by
for . To prove that converges stochastically to 0, we again show the corresponding -convergence. To this end, we bound for all and the term by a constant using the centeredness of and Lemma B.1, to obtain
Since and are stochastically independent for , only the quadratic terms for are relevant
Under the assumptions of Theorem 2.1, this leads to
and therefore
E.3 Details for the proof of Proposition C.2
It is shown in [Alj et al. (2014)] and [Gänssler et al. (1978)] that the conditional Lindeberg condition follows from the unconditional Lyapunov condition. We will show in the following that
and decompose
where
For , we use that for all , are stochastically independent of and are uncorrelated with . Furthermore, the fourth absolute moment of is uniformly bounded, due to the centeredness of and Lemma B.1. The fourth absolute moment of can be estimated using Assumption 3 and as
(E.6)
Again using similar arguments, we obtain
(E.7)
This results in
(E.8)
Putting these results together, for , we get
where the first series converges due to Lemma B.1 and the second series either also converges or, if not, can be bounded by .
Considering , we use the stochastic independence of and for all , which results in
The rest of the argument consists of calculating the expectations, using that for all , and are uncorrelated with for all and stochastically independent of . Finally,
(E.9)
and, in the same way, , which gives .
With similar arguments as above, we get
which can be further bounded by using
and
This results in
using the Hölder inequality and Lemma B.1.
For the summands in , we get
The first expectation is
while
Altogether, we have
The series can be bounded by . Using the Hölder inequality for , we have
Finally, relying again on Assumption 3 and Lemma B.1, also converges to 0 due to
Acknowledgments
The authors would like to thank Jan Johannes for helpful discussions on the different estimation techniques in the functional linear regression model with and without endogeneity.
References
- [Alj et al. (2014)] Alj, Abdelkamel, Azrak, Rajae and Mélard, Guy. On Conditions in Central Limit Theorems for Martingale Difference Arrays Long Version. ECORE Discussion Paper (2014/12).
- [Cardot et al. (2006)] Cardot, Hervé, Mas, André and Sarda, Pascal. CLT in functional linear regression models. Probability Theory and Related Fields (2007) 138:325–361.
- [Cuesta-Albertos et al. (2019)] Cuesta-Albertos, Juan A., García-Portugués, Eduardo, Febrero-Bande, Manuel and González-Manteiga, Wenceslao. Goodness-of-fit tests for the functional linear model based on randomly projected empirical processes. The Annals of Statistics (2019) 47(1):439–467.
- [DasGupta (2008)] DasGupta, Anirban. Asymptotic Theory of Statistics and Probability. Springer Texts in Statistics. Springer, New York 2008.
- [Dorn (2021)] Dorn, Manuela. Tests auf Exogenität im funktionalen linearen Regressionsmodell unter schwacher Stationarität. Dissertation. University of Bayreuth 2021.
- [Florens et al. (2011)] Florens, Jean-Pierre, Johannes, Jan and Van Bellegem, Sébastien. Identification and estimation by penalization in nonparametric instrumental Regression. Econometric Theory (2011) 27:472–496.
- [Florens and Van Bellegem (2014)] Florens, Jean-Pierre and Van Bellegem, Sébastien. Instrumental variable estimation in functional linear models. ECORE Discussion Paper (2014/56).
- [García-Portugués et al. (2014)] García-Portugués, Eduardo, González-Manteiga, Wenceslao and Febrero-Bande, Manuel. A goodness-of-fit test for the functional linear model with scalar response. arXiv:1205.6167v6.
- [García-Portugués et al. (2020)] García-Portugués, Eduardo, Álvarez-Liébana, Javier, Álvarez-Pérez, Gonzalo and González-Manteiga, Wenceslao. Goodness-of-fit tests for functional linear models based on integrated projections. arXiv:2008.09885v1.
- [Gänssler et al. (1978)] Gänssler, Peter, Strobel, J. and Stute, Winfried. On central limit theorems for martingale triangular arrays. Acta Mathematica Academiae Scientiarum Hungaricae (1978) 31(3–4):205–216.
- [Hall and Heyde (1980)] Hall, Peter and Heyde, Christopher C. Martingale Limit Theory and Its Application. Probability and Mathematical Statistics. Academic Press, New York 1980.
- [Hausman (1978)] Hausman, Jerry A. Specification Tests in Econometrics, Econometrica (1978) 46(6):1251–1271.
- [Johannes (2013)] Johannes, Jan. Nonparametric estimation in functional linear models with second order stationary regressors. arXiv:0901.4266v1.
- [Johannes (2016)] Johannes, Jan. Functional linear instrumental regression under second order stationarity. arXiv:1603.01649v1.
- [Mammen (1993)] Mammen, Enno. Bootstrap and wild bootstrap for high dimensional linear models. The Annals of Statistics (1993) 21(1):255–285.
- [Müller and Stadtmüller (2005)] Müller, Hans-Georg and Stadtmüller, Ulrich. Generalized functional linear models. The Annals of Statistics (2005) 33(2):774–805.
- [Neubauer (1988a)] Neubauer, Andreas. An a posteriori parameter choice for Tikhonov regularization in the presence of modeling error. Applied Numerical Mathematics (1988) 4:507–519.
- [Ruymgaart et al. (2000)] Ruymgaart, Frits, Wang, Jing, Wei, Shih-Hsuan and Yu, Li. Some asymptotic theory for functional regression and classification. Texas Tech University, Lubbock 2000.
- [Tsybakov (2004)] Tsybakov, Alexandre B. Introduction à l'estimation non-paramétrique. Mathématiques & Applications 41. Springer, Berlin 2004.
- [Serfling (1980)] Serfling, Robert J. Approximation Theorems of Mathematical Statistics. Wiley Series in Probability and Mathematical Statistics. John Wiley and Sons, New York 1980.
- [Wong (1996)] Wong, Ka-fu. Bootstrapping Hausman’s exogeneity test. Economics Letters (1996) 53:139–143.
- [Wu (1973)] Wu, De-Min. Alternative tests of independence between stochastic regressors and disturbances. Econometrica (1973) 41(4):733–750.