This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Runs of extremes of observables on dynamical systems and applications.

Meagan Carney Meagan Carney
Department of Mathematics
University of Houston
Houston
TX 77204
USA
[email protected] https://smp.uq.edu.au/profile/11136/meagan-carney
Mark Holland Mark Holland
Department of Mathematics & Statistics
Harrison Building (327)
North Park Road
Exeter, EX4 4QF
UK
[email protected] http://empslocal.ex.ac.uk/people/staff/mph204/
Matthew Nicol Matthew Nicol
Department of Mathematics
University of Houston
Houston
TX 77204
USA
[email protected] http://www.math.uh.edu/ nicol/
 and  Phuong Tran Phuong Tran
Department of Mathematics
University of Houston
Houston
TX 77204
USA
[email protected]
Abstract.

We use extreme value theory to estimate the probability of successive exceedances of a threshold value of a time-series of an observable on several classes of chaotic dynamical systems. The observables have either a Fréchet (fat-tailed) or Weibull (bounded) distribution. The motivation for this work was to give estimates of the probabilities of sustained periods of weather anomalies such as heat-waves, cold spells or prolonged periods of rainfall in climate models. Our predictions are borne out by numerical simulations and also analysis of rainfall and temperature data.

MN thanks the NSF for support from NSF-DMS 2009923.

1. Introduction

The impact of successive extreme weather events on populations has become a significant topic of discussion in climate literature. Recent notable examples include the 2019 global heat-waves, which had severe impacts in Australia, Europe, and the United States, and the 2022 heat-wave in western Europe resulting in a nationally declared emergency in the United Kingdom. Other examples include the 2021-22 floods in Eastern Australia, the 2021 flood in the United Kingdom and Western Europe and the 2017 flood in Texas, all attributed to prolonged heavy rainfall. Understanding the returns of successive extreme weather events like these is crucial for preparing and mitigating disastrous effects. For instance, fire retardant materials can be used to prevent wildfires during heat-waves, and earlier evacuation of populations from high-risk flood zones can be initiated prior to episodes of prolonged rainfall.

Accurately predicting the likelihood of weather anomalies, such as heat-waves, cold spells, or prolonged periods of rainfall, is a challenging problem. In this paper, we explore the use of extreme value theory (EVT) to estimate the probability of successive exceedances of a threshold of a time-series of observations on a variety of uniformly hyperbolic dynamical systems. The study of chaotic dynamical systems provides insights into the statistical behavior of complex systems, such as climate models. We test some of the predictions we derive from uniformly hyperbolic systems on climate data and climate models.

Provided the time-series of an observable ϕ\phi, namely (ϕTj)j0(\phi\circ T^{j})_{j\geq 0}, follow the assumptions outlined in [32], an extreme value law holds for the maxima of the time-series, given by Mj=max{ϕ,(ϕTj)}M_{j}=\max\{\phi\dots,(\phi\circ T^{j})\}, which guarantees a limiting probability distribution for the sequence of maxima (Mj)(M_{j}). The traditional measure of successive extremes is the extremal index, a parameter which appears in the limiting distribution of the extreme value law and measures the expected number of over-threshold exceedances observed in a short block of the time-series.

For appropriate scaling choices, the exact limiting distribution is the generalized extreme value distribution (GEV), G(ξ,μ,σ)G(\xi,\mu,\sigma), with shape ξ\xi, location μ\mu and scale σ\sigma parameters. In practice, the block maximum approach is commonly employed which involves dividing the time-series into blocks of a fixed length and performing some likelihood fitting of the parameters of the GEV using the sequence of block maxima. By nature of the block maximum approach, the extremal index is normalized to 1 so that all information on the successive properties of extremes of the data becomes lost. However, a careful choice of observable can preserve some of the properties of successive extremes in this setting.

In climate applications, successive extremes are often thought of in two ways: a high average in a short run, for example, a high daily temperature average over 7 days; or successive over-threshold values occurring in a short run, for example, a high daily temperature observed every day for 7 days. Motivated by [21] and these definitions, we investigate the time-averages of observables over a time-window of integer length kk, Y(x)=1kj=0k1ϕ(Tjx)Y(x)=\frac{1}{k}\sum_{j=0}^{k-1}\phi(T^{j}x), as well the exceedance function Y(x)=min{ϕ(x),,ϕ(Tk1x)}Y(x)=\min\{\phi(x),\ldots,\phi(T^{k-1}x)\}.

This paper’s focus is on whether the scaling constants in the time-series of kk-exceedances or time-averages can be derived from that of the original time-series. We establish scaling constants in some interesting scenarios for hyperbolic systems. Specifically, we consider an observable ϕ:X\phi:X\rightarrow\mathbb{R} on an ergodic dynamical system (T,X,μ)(T,X,\mu), which is maximized on an invariant repelling set SS. The prolonged bouts of extreme weather events are associated with, for example, a weather system being in a quasi-invariant state of extreme rainfall or temperature. This is broadly modeled by the time-series of an observable maximized on an invariant set in a chaotic dynamical system.

We begin our investigation by establishing a general lemma relating the parameters of the GEV for the kk-exceedances and time-averages coming from any system, with a Fréchet or Weibull limiting distribution, satisfying the conditions of [32]. This general result is then used to establish more concrete theorems for climate-relevant observables taken on hyperbolic systems. We end with a detailed illustration of the applications of these lemmas and theorems to numerically model successive extremes of time-series data coming from: certain hyperbolic dynamical systems; temperature data simulated from the general circulation model, PUMA; and real-world temperature and precipitation data taken from weather stations throughout Germany. Finally, we illustrate that our scaling results can be applied to obtain more accurate numerical estimates of the GEV parameters for maxima of kk-exceedance and time-averaged functionals of increasing kk window lengths than those of traditional maximum likelihood estimation.

This research was motivated in part by the interesting paper [21] in which the authors analyzed the time and spatial averages of daily temperature anomalies over a large number of days (heat-waves or cold spells) via a large deviation approach. Large deviation theory was used to estimate probabilities associated to heat-wave or cold-spell occurrences. The averaging period ranged roughly from 10 to 40 days. The main testing ground was the PUMA climate model of mid-atlantic latitudes, 30 degrees to 60 degrees. The important question they addressed is how to estimate the probability of very rare events, such as long runs of extreme weather, from a limited time-series of recordings. Their predictions were favorably compared to those estimated by standard extreme value theory (Peaks over threshold model leading to a generalized Pareto distribution (GPD)) applied to long simulations of the PUMA model. A related approach based on a large deviation algorithm combined with an importance sampling technique was given in [38].

Our dynamical models and numerical results suggest that in certain settings we do not need to average over long time windows to ensure the applicability of extreme value theory and in some sense we may estimate rare prolonged anomalies from the original time-series of data. The advantage of our method is that while data on, for example, prolonged spells of rainfall is sparse, the simple scalings we find enable us to use all available daily maxima data to estimate the probabilities of prolonged rain of a certain duration.

2. Background on EVT for dynamical systems.

Suppose ϕ:X\phi:X\to\mathbb{R} is an observable on an ergodic dynamical system (T,X,μ)(T,X,\mu). Extreme value theory (EVT) considers distributional limits of the maxima process

Mn=max{ϕ,,ϕTn1}M_{n}=\max\{\phi,\ldots,\phi\circ T^{n-1}\}

under suitable scaling constants an(Mnbn)a_{n}(M_{n}-b_{n}).

P. Collet [3] introduced techniques from EVT to establish hitting and return time statistics to ‘generic’ non-recurrent points in some one-dimensional non-uniformly hyperbolic dynamical systems. He considered the observable ϕ(x)=lnd(x,x0)\phi(x)=-\ln d(x,x_{0}), where x0x_{0} is non-recurrent, in particular not periodic. In this setting EVT is related to hitting and return time statistics since MnM_{n} increases as orbits make closer approaches to x0x_{0}. Following Collet’s work there have been many papers developing EVT for hyperbolic dynamical systems. The theory is well-developed for ϕ\phi maximized at a point x0x_{0} and a function of distance, as in Collet’s case of ϕ(x)=lnd(x,x0)\phi(x)=-\ln d(x,x_{0}). The paper [2] extended some of this work to the more general setting of observables maximized on submanifolds in phase space, while  [15] considered observables maximized on Cantor sets. In a series of influential papers Freitas et al [16, 18] adapted techniques from [32] to the context of deterministic dynamical systems and elucidated the close relation between extremes, hitting time statistics and Poisson processes. For these and other developments we refer to the book [33] for more details.

More generally suppose (Xn)(X_{n}) is a stationary process with probability distribution function FX(u):=μ(Xu).F_{X}(u):=\mu(X\leq u). When determining the distributional limit of

Mn=max{X1,,Xn}M_{n}=\max\{X_{1},\ldots,X_{n}\}

it is well-known that to find the appropriate scaling, given τ\tau\in\mathbb{R}, one should define un(τ)u_{n}(\tau) as a sequence satisfying nμ(X0>un(τ))τn\mu(X_{0}>u_{n}(\tau))\to\tau, as nn\to\infty. We say that (Xn)(X_{n}) satisfies an extreme value law if

μ(Mnun(τ))eθτ\mu(M_{n}\leq u_{n}(\tau))\to e^{-\theta\tau} (2.1)

for some θ(0,1]\theta\in(0,1]. The number θ\theta is called the extremal index and 1θ\frac{1}{\theta} roughly measures the average cluster size of exceedances given that one exceedance has occurred. When (Xn)(X_{n}) is iid and has a regularly varying tail it can be shown that under the scaling unu_{n} there exists an extreme value law (EVL) and θ=1\theta=1. If (Xn)(X_{n}) is only assumed stationary rather than iid then the existence of an EVL with θ=1\theta=1 has been shown provided two conditions hold: (1) D(un)D(u_{n}) (a mixing condition) and; (2) D(un)D^{\prime}(u_{n}) (a non-recurrence condition). Thus in the case of an observable on a dynamical system if the time series of observations Xn=ϕTnX_{n}=\phi\circ T^{n} satisfy D(un)D(u_{n}) and D(un)D^{\prime}(u_{n}) (or some variation thereof) then an EVL holds with θ=1\theta=1.

We will use two conditions Д(un)\operatorname{\textrm{Д}}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}^{{}^{\prime}}_{q}(u_{n}), adapted to the dynamical setting, introduced in the work [18] that imply a non-degenerate EVL in the case the EI θ1\theta\not=1 and also give the value of θ\theta.

2.1. Using EVT to estimate the probability of rare events.

For statistical estimation and fitting schemes such as block maxima or peak over thresholds methods [8], it is desirable to get a limit along linear sequences of the form un(y)=y/an+bnu_{n}(y)=y/a_{n}+b_{n}. Here the emphasis is changed and the sequence un(y)u_{n}(y) is now required to be linear in yy.

R. Fisher and L. Tippett [13] showed: If {Xn}n\{X_{n}\}_{n\in\mathbb{N}} a sequence of iid random variables and there exists linear normalizing sequences of constant {an>0}n\left\{a_{n}>0\right\}_{n\in\mathbb{N}} and {bn}n\left\{b_{n}\right\}_{n\in\mathbb{N}} such that

𝒫(Mnbnany)G(y)as n\mathcal{P}\bigg{(}\frac{M_{n}-b_{n}}{a_{n}}\leq y\bigg{)}\to G(y)\qquad\text{as }n\to\infty

where G is a non-degenerate distribution function then G(y)=eτ(y)G(y)=e^{-\tau(y)}, where τ(y)\tau(y) is one of the following three types (for some β,γ>0\beta,\gamma>0), (up to a scale σ\sigma and location μ\mu shift yyμσy\to\frac{y-\mu}{\sigma}):

  • (1)

    τ(y)=ey\tau(y)=e^{-y} for yy\in\mathbb{R}; (Gumbel)

  • (2)

    τ(y)=yβ\tau(y)=y^{-\beta} for y>0y>0 (Fréchet)

  • (3)

    τ(y)=(y)γ\tau(y)=(-y)^{\gamma} for y0y\leq 0 (Weibull).

The three types may be combined in a unified model called the Generalized Extreme Value (GEV) distribution

G(y)=exp[1+ξ(yμσ)]1ξG(y)=\exp{-\bigg{[}1+\xi\,\bigg{(}\frac{y-\mu}{\sigma}\,\bigg{)}\,\bigg{]}^{-\frac{1}{\xi}}}

defined on {y: 1+ξ(yμσ)>0}\left\{y:\,1+\xi\left(\frac{y-\mu}{\sigma}\right)>0\right\}, where ξ\xi\in\mathbb{R} is the shape parameter μ\mu\in\mathbb{R} is the location parameter, and σ>0\sigma>0 the scale parameter. We have the following classification:

  • (1)

    When ξ=0\xi=0, Type I - The Gumbel distibution

    G(y)=exp{e(yμσ)},y;G(y)=\exp\left\{-e^{-(\frac{y-\mu}{\sigma})}\right\}\quad,y\in\mathbb{R};
  • (2)

    ξ>0\xi>0 Type II - The Fréchet distibution;

  • (3)

    ξ<0\xi<0 Type III - The Weibull distribution.

The Gumbel distribution is unbounded, while the Weibull is bounded. The Fréchet distribution is bounded below but not above and has ‘fat tails’, 1F(x)=l(x)xξ1-F(x)=l(x)x^{-\xi} where l(x)l(x) is a slowly varying function. An important point is that numerical fitting schemes for the GEV distribution are renormalized under place and scale transformations so that the extremal index (EI) is 1 [4, Theorem 5.2]. Hence the EI is not given by the GEV but subsumed into its parameter estimation. A technique to estimate the EI has been given by Süveges [42]. The goal of this work is to determine if there is a relation between ξ\xi, σ\sigma and μ\mu for the GEV for ϕ\phi and the corresponding parameters for YY, both in the case of kk-exceedances and kk-averages.

We now describe how the GEV model is used to estimate return levels. Suppose we have a time-series of length mnmn observations. For example daily rainfall over a year gives n=365n=365 daily readings and suppose and we have m=100m=100 years of such daily measurements. Denote Mn,i=max{Xi,,Xn+i1}M_{n,i}=\max\{X_{i},\cdots,X_{n+i-1}\}, i=1,,mi=1,...,m. We block the data into sequences of observations of length nn generating a series of block maxima, Mn,1,,Mn,mM_{n,1},\dots,M_{n,m} to which the GEV distribution can be fitted. This assumes nn is large enough to ensure convergence of Mn,iM_{n,i} to GG in distribution for each ii. Thus we have mm samples from a GEV distribution determining the distribution of maximal daily rainfall in a year. We may then numerically fit a GEV, via method of moments or method of maximum likelihood, and estimate the shape ξ\xi, location μ\mu and scale σ\sigma parameters. The quantiles of the distribution of the annual maximum daily rainfall are obtained by inverting the distribution GG we obtain:

zp={μσξ(1[log(1p)]ξ),ξ0μσlog[log(1p)],ξ=0z_{p}=\begin{cases}\mu-\frac{\sigma}{\xi}\bigg{(}1-[-\log(1-p)]^{-\xi}\bigg{)},&\xi\neq 0\\ \mu-\sigma\log[-\log(1-p)],&\xi=0\end{cases}

where G(zp)=1pG(z_{p})=1-p and zpz_{p} is the return level associated with the return period 1p\frac{1}{p}. Thus, the level zpz_{p} is expected to be exceeded on average once every 1p\frac{1}{p} years.

3. Sufficient conditions for EVT for dynamical systems.

We let (T,X,μ)(T,X,\mu) be an ergodic dynamical system. Here TT is a uniformly hyperbolic measure-preserving map of a Riemannian metric space (X,μ)(X,\mu) equipped with a probability measure μ\mu which is equivalent to Lebesgue measure. We let ϕ:X\phi:X\to\mathbb{R} be an observable which is a function of Euclidean distance and maximized on a set SS. In this section we consider the extreme value theory of kk-exceedances and time-averages of the observable. More precisely we consider the EVT of the derived time-series (YTi)(Y\circ T^{i}) where Y=min{ϕ,ϕT,,ϕTk1}Y=\min\{\phi,\phi\circ T,\ldots,\phi\circ T^{k-1}\} in the case of kk-exceedances or Y=1kj=0k1ϕTjY=\frac{1}{k}\sum_{j=0}^{k-1}\phi\circ T^{j} for time-averages.

In this paper SS is taken to be either a generic non-recurrent point x0Xx_{0}\in X; a fixed or periodic point x0x_{0}; or a line or curve, invariant or not. The case of a periodic orbit of minimal period qq reduces to the case of a fixed point for TqT^{q}, so we only discuss the case of fixed points. We will consider three classes of uniformly hyperbolic dynamical system, namely: uniformly expanding maps of the interval; hyperbolic toral automorphisms; and coupled expanding maps. We aim not for the greatest generality of dynamical system, as the statements and proofs soon become tiresomely technical. We wish rather for simple illustrative models from chaotic dynamics which hopefully shed light on general principles and the behavior of complex physical models. The extreme value theory for observables ϕ:X\phi:X\to\mathbb{R} maximized on such sets SS for these systems, among others, was investigated in [2]. A motivation of [2] was to extend the theory of EVT for dynamical systems beyond that developed for observables maximized at points to more relevant observables to applications. One goal of this paper is to extend this investigation to observables such as kk-exceedances and time-averages. Another goal is to allow predictions of the GEV for kk-exceedances and time averages from the GEV of the original time series, both in dynamical models and climate data.

3.1. Conditions for EVT laws.

We will use two conditions Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}^{{}^{\prime}}_{q}(u_{n}), adapted to the dynamical setting, introduced in the work [18] that show an extreme value distribution holds and also allows a computation of the extremal index in the case θ1\theta\not=1. We change slightly the notation of [18] and write Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) rather than Д)un)\operatorname{\textrm{Д}})u_{n}) to highlight the role of qq. In [2] techniques were developed to verify these conditions in the setting of an observable maximized on an invariant set for the classes of systems we consider.

Let Xn=ϕTnX_{n}=\phi\circ T^{n} and define

An(q):={X0>un,X1un,,Xqun}.A_{n}^{(q)}:=\{X_{0}>u_{n},X_{1}\leq u_{n},\ldots,X_{q}\leq u_{n}\}.

For s,ls,l\in\mathbb{N} and a set BMB\subset M, let

𝒲s,l(B)=i=ss+l1Ti(Bc).\mathscr{W}_{s,l}(B)=\bigcap_{i=s}^{s+l-1}T^{-i}(B^{c}).

Recall that unun(τ)u_{n}\equiv u_{n}(\tau) is a sequence satisfying limnnμ(ϕ>un(τ))=τ\lim_{n\to\infty}n\mu(\phi>u_{n}(\tau))=\tau.

Condition Д(un)\operatorname{\textrm{Д}}(u_{n}): We say that Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) holds for the sequence X0,X1,X_{0},X_{1},\ldots if, for every ,t,n\ell,t,n\in\mathbb{N}

|μ(An(q)𝒲t,(An(q)))μ(An(q))μ(𝒲0,(An(q)))|γ(q,n,t),\left|\mu\left(A_{n}^{(q)}\cap\mathscr{W}_{t,\ell}\left(A_{n}^{(q)}\right)\right)-\mu\left(A_{n}^{(q)}\right)\mu\left(\mathscr{W}_{0,\ell}\left(A_{n}^{(q)}\right)\right)\right|\leq\gamma(q,n,t),

where γ(q,n,t)\gamma(q,n,t) is decreasing in tt and there exists a sequence (tn)n(t_{n})_{n\in\mathbb{N}} such that tn=o(n)t_{n}=o(n) and nγ(q,n,tn)0n\gamma(q,n,t_{n})\to 0 when nn\rightarrow\infty.

We consider the sequence (tn)n(t_{n})_{n\in\mathbb{N}} given by condition Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) and let (kn)n(k_{n})_{n\in\mathbb{N}} be another sequence of integers such that as nn\to\infty,

knandkntn=o(n).k_{n}\to\infty\quad\mbox{and}\quad k_{n}t_{n}=o(n).

Condition Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}): We say that Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}) holds for the sequence X0,X1,X_{0},X_{1},\ldots if there exists a sequence (kn)n(k_{n})_{n\in\mathbb{N}} as above and such that

limnnj=q+1n/knμ(An(q)Tj(An(q)))=0.\lim_{n\rightarrow\infty}\,n\sum_{j=q+1}^{\lfloor n/k_{n}\rfloor}\mu\left(A_{n}^{(q)}\cap T^{-j}\left(A_{n}^{(q)}\right)\right)=0.
Proposition 3.1 [18]).

Let Mn=max{X1,X2,,Xn}M_{n}=\max\{X_{1},X_{2},\ldots,X_{n}\}. Suppose that conditions Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) hold and that the limit

θ=limnθn=limnμ(An(q))μ(Un),\theta=\lim_{n\to\infty}\theta_{n}=\lim_{n\to\infty}\frac{\mu(A^{(q)}_{n})}{\mu(U_{n})},

exists. Then

μ(Mnun(τ))eθτ.\mu(M_{n}\leq u_{n}(\tau))\to e^{-\theta\tau}.

Thus Proposition 3.1 above gives us a route to compute the extremal index θ\theta for a given dynamical system and observable function, provided the conditions Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) hold.

3.2. Uniformly hyperbolic models.

We now briefly describe three classes of hyperbolic dynamical system. In each there is a notion of expansion transverse to the invariant set which is denoted by a parameter λ\lambda in Theorem 5.1. We specify what λ\lambda is in each of the three settings (A)(A), (B)(B) and (C)(C) described below.

3.2.1. (A) Piecewise C2C^{2} uniformly expanding maps of the interval.

T:[0,1][0,1]T:[0,1]\to[0,1] is called a C2C^{2} piecewise expanding map if there there is a finite partition 0=a1<a2<<am=10=a_{1}<a_{2}<\ldots<a_{m}=1 of the interval such that TT is C2C^{2} on the interior of partition elements and there exists κ>1\kappa>1 such that for |DT(x)|>κ|DT(x)|>\kappa for any x(ai,ai+1)x\in(a_{i},a_{i+1}), i=1,,m1i=1,\ldots,m-1. One of the simplest examples of such a map is the doubling map which has form T(x)=2xT(x)=2x (mod 1). Such maps have an invariant measure μ\mu equivalent to Lebesgue and are exponentially mixing in the sense that there exists ρ(0,1)\rho\in(0,1) such that

|ϕTnψ𝑑μϕ𝑑μψ𝑑μ|CϕBVψρn\left|\int\phi\circ T^{n}\psi d\mu-\int\phi d\mu\int\psi d\mu\right|\leq C\lVert{\phi}\rVert_{BV}\lVert{\psi}\rVert_{\infty}\rho^{n}

for all ϕ\phi of bounded variation and bounded ψ\psi. For a description of the ergodic properties of such maps see [1, Chapter 5].

For this map the expansion rate at a fixed point x0x_{0} is λ=|DT(x0)|\lambda=|DT(x_{0})|. The EI for an observable maximized at a fixed point is θ=11λ\theta=1-\frac{1}{\lambda}. At a periodic point of prime period qq the corresponding quantities are λ=|DT(x0)|q\lambda=|DT(x_{0})|^{q} and again θ=11λ\theta=1-\frac{1}{\lambda}.

3.2.2. (B) Hyperbolic toral automorphisms

We consider hyperbolic toral automorphism of the two-dimensional torus 𝕋2\mathbb{T}^{2} induced by a matrix

T=(abcd)T=\begin{pmatrix}a&b\\ c&d\\ \end{pmatrix}

with integer entries, det(T)= ±1\pm 1 and no eigenvalues on the unit circle. In order to simplify the discussion and proofs, we will assume both eigenvalues are positive. These maps preserve Haar measure μ\mu on 𝕋2\mathbb{T}^{2}. A canonical example is the Arnold Cap map

(2111)\begin{pmatrix}2&1\\ 1&1\\ \end{pmatrix}

𝕋2\mathbb{T}^{2} will be considered as the unit square with usual identifications with universal cover 2\mathbb{R}^{2}. T preserves the Haar measure μ\mu on 𝕋2\mathbb{T}^{2} and has exponential decay of correlations for Lipschitz functions, in the sense that there exists ρ(0,1)\rho\in(0,1), such that

|ϕTnψ𝑑μϕ𝑑μψ𝑑μ|CϕLipψLipρn\left|\int\phi\circ T^{n}\psi d\mu-\int\phi d\mu\int\psi d\mu\right|\leq C\lVert{\phi}\rVert_{Lip}\lVert{\psi}\rVert_{Lip}\rho^{n}

where C is a constant independent of ϕ,ψ\phi,\psi and .Lip\lVert.\rVert_{Lip} is the Lipschitz norm. For more details we refer to [36, Chapter 3].

For these maps the expansion rate at a fixed point x0x_{0} is λ=λu\lambda=\lambda_{u}, the eigenvalue associated to the unstable direction. The EI for an observable maximized at a fixed point is 11λu1-\frac{1}{\lambda_{u}}. At a periodic point of prime period qq the corresponding quantities are λ=λuq\lambda=\lambda_{u}^{q} and as before θ=11λ\theta=1-\frac{1}{\lambda}.

3.2.3. (C) Coupled systems of uniformly expanding maps.

Next, we consider a simple class of coupled mixing expanding maps of [0,1][0,1], similar to those examined in  [9]. For β>0\beta>0 let Tβ:[0,1][0,1]T_{\beta}:[0,1]\to[0,1] be defined by Tβ(x)=βxT_{\beta}(x)=\beta x,(modulo 11). Such beta-transformations possess an absolutely continuous invariant probability measure μβ\mu_{\beta} with density hβh_{\beta} which is of bounded variation and bounded away from zero.

We define an all-to-all coupled system, F:[0,1]m[0,1]mF:[0,1]^{m}\to[0,1]^{m}, by

F(x1,x2,,xm):=(F1(x1,x2,,xm),,Fm(x1,x2,,xm)),F(x_{1},x_{2},\dots,x_{m}):=(F_{1}(x_{1},x_{2},\dots,x_{m}),\dots,F_{m}(x_{1},x_{2},\dots,x_{m})),

where

Fj(x1,x2,,xm)=(1γ)Tβ(xj)+γmk=1mTβ(xk),F_{j}(x_{1},x_{2},\dots,x_{m})=(1-\gamma)T_{\beta}(x_{j})+\frac{\gamma}{m}\sum_{k=1}^{m}T_{\beta}(x_{k}), (3.1)

for j[1,,m]j\in[1,\dots,m]. 0<γ<10<\gamma<1 is the coupling constant. All subspaces of the form xi1=xi1==xijx_{i_{1}}=x_{i_{1}}=\ldots=x_{i_{j}} are invariant synchrony sets.

For these maps, if γ>0\gamma>0 is sufficiently small D. Faranda, H. Ghoudi, P. Guirard and S. Vaienti [9] have shown:

  1. (1)

    there exists a mixing invariant measure μ\mu with density h~\tilde{h}, bounded above and below away from zero;

  2. (2)

    the system has exponential mixing for Lipschitz functions versus LL^{\infty} functions.

Subspaces of the form xi1=xi1==xijx_{i_{1}}=x_{i_{1}}=\ldots=x_{i_{j}}, i1<i2<<ijmi_{1}<i_{2}<\ldots<i_{j}\leq m, are repelling invariant synchrony sets. For this class of examples the invariant set SS will be taken to be the hyperplane of synchrony x1=x2==xmx_{1}=x_{2}=\ldots=x_{m} and in this setting λ=(1γ)β\lambda=(1-\gamma)\beta is the expansion in the directions orthogonal to SS at the point xSx\in S. The extremal index is given by θ=11[(1γ)β]m1\theta=1-\frac{1}{[(1-\gamma)\beta]^{m-1}}. Note that the EI depends upon mm, the number of cells in the coupled system, and tends to zero as mm increases.

3.3. Main Results.

We suppose that SS is a subset of XX, where (X,μ)(X,\mu) is a measure and metric space, and that SS has good regularity properties, for example: a point; straight line segment or synchrony subspace. The main phenomena are illustrated by considering 2 classes of observables, giving the Weibull and Fréchet case.

  • (a)

    Frechét case ϕ(x)=d(x,S)α\phi(x)=d(x,S)^{-\alpha} (α>0\alpha>0);

  • (b)

    Weibull case ϕ(x)=Cd(x,S)α\phi(x)=C-d(x,S)^{-\alpha}, where C>0,α<0C>0,\alpha<0.

In applications daily rainfall is typically modeled by a Fréchet distribution and daily temperatures by a Weibull distribution. We do not consider the Gumbel case ϕ(x)=lnd(x,S)\phi(x)=-\ln d(x,S) as it does not arise in the applications we consider. One goal of this paper is to determine to what extent the GEV for a time series of observations {ϕ(Tjx)}\{\phi(T^{j}x)\} determines the GEV for a derived time series of functionals of the observations {Φ(ϕ(Tjx))}\{\Phi(\phi(T^{j}x))\}. The functionals we consider, the minimum and time-averages of a series of kk observations, in applications model phenomena such as heat waves and prolonged spells of excessive rainfall.

4. A scaling lemma.

We now compare the GEV scaling constants for an observable ϕ\phi for the maximal process Mn=max{ϕ,ϕT,,ϕTn1}M_{n}=\max\{\phi,\phi\circ T,\ldots,\phi\circ T^{n-1}\} with those of the functional Y=min{ϕ,ϕT,,ϕTk1}Y=\min\{\phi,\phi\circ T,\ldots,\phi\circ T^{k-1}\} or Y=1kj=0k1ϕTjY=\frac{1}{k}\sum_{j=0}^{k-1}\phi\circ T^{j} with corresponding maximal process Bn=max1jn1YjB_{n}=\max_{1\leq j\leq n-1}Y_{j}.

We first consider the scaling in the case of a Fréchet distribution, a setting in which the scaling constants bnb_{n} in a linear scaling are zero and then the Weibull. The constant g(k,T)g(k,T) in the lemma below is a constant depending upon kk and the mapTT. One of the goals of this paper is to calculate g(k,T)g(k,T) in some simple examples.

Lemma 4.1.

(a) Suppose ϕ\phi has a Fréchet distribution and

μ(Mnun(τ))eθ1τ\mu(M_{n}\leq u_{n}(\tau))\to e^{-\theta_{1}\tau}

and

μ(Bnwn(τ))eθ2τ\mu(B_{n}\leq w_{n}(\tau))\to e^{-\theta_{2}\tau}

If

wn(τ)=g(k,T)un(τ),w_{n}(\tau)=g(k,T)u_{n}(\tau),

then

ξ2=ξ1\xi_{2}=\xi_{1}
μ2=g(k,T)μ1(θ2θ1)ξ1.\mu_{2}=g(k,T)\mu_{1}\bigl{(}\frac{\theta_{2}}{\theta_{1}}\bigr{)}^{\xi_{1}}.

and

σ2=g(k,T)σ1(θ2θ1)ξ1.\sigma_{2}=g(k,T)\sigma_{1}\bigl{(}\frac{\theta_{2}}{\theta_{1}}\bigr{)}^{\xi_{1}}.

(b) Suppose ϕ\phi has a Weibull distribution and

μ(Mnun(τ))eθ1τ\mu(M_{n}\leq u_{n}(\tau))\to e^{-\theta_{1}\tau}
μ(Bnwn(τ))eθ2τ\mu(B_{n}\leq w_{n}(\tau))\to e^{-\theta_{2}\tau}

Let un(τ)=anρ(τ)+Cu_{n}(\tau)=a_{n}\rho(\tau)+C, wn(τ)=αnρ(τ)+Cw_{n}(\tau)=\alpha_{n}\rho(\tau)+C and αn=g(k,T)an\alpha_{n}=g(k,T)a_{n}. Then

ξ2=ξ1\xi_{2}=\xi_{1}
μ2=σ1ξ1(g(k,T)1)+μ1g(k,T)σ1ξ1(1(θ2θ1)ξ1)\mu_{2}=\frac{\sigma_{1}}{\xi_{1}}(g(k,T)-1)+\mu_{1}-\frac{g(k,T)\sigma_{1}}{\xi_{1}}(1-(\frac{\theta_{2}}{\theta_{1}})^{\xi_{1}})

and

σ2=g(k,T)σ1(θ2θ1)ξ1\sigma_{2}=g(k,T)\sigma_{1}\bigl{(}\frac{\theta_{2}}{\theta_{1}}\bigr{)}^{\xi_{1}}

where g(k,T)g(k,T) is a constant depending on kk and the dynamical system (T,X,μ)(T,X,\mu).

Remark 4.2.

In our applications a key calculation is to determine g(k,T)g(k,T) and in particular how this function scales with kk.

Remark 4.3.

In the second part of the proof below, the Weibull case, it is useful to have an example in mind. Suppose ϕ=Cd(x,x0)α\phi=C-d(x,x_{0})^{\alpha} where x0[0,1]x_{0}\in[0,1] equipped with Lebesgue measure μ\mu. It is easy to calculate that the scaling is μ(MnCtαnα)et\mu(M_{n}\leq C-\frac{t^{\alpha}}{n^{\alpha}})\to e^{-t}. Let ρ=(t)α\rho=-(t)^{\alpha} and an=nαa_{n}=n^{-\alpha} (as an>0a_{n}>0, ρ<0\rho<0 in the standard form for the Weibull distribution). Then μ(MnC+anρ)e(ρ)1α\mu(M_{n}\leq C+a_{n}\rho)\to e^{-(-\rho)^{\frac{1}{\alpha}}} so that α=ξ\alpha=-\xi. We then let z=anρ+Cz=a_{n}\rho+C and obtain μ(Mnz)e([zC]/an)1α\mu(M_{n}\leq z)\to e^{-(-[z-C]/a_{n})^{\frac{1}{\alpha}}}. Finally we compare the expression e([zC]/an)1αe^{-(-[z-C]/a_{n})^{\frac{1}{\alpha}}} to the standard GEV to estimate the scale, location and shape parameters for fixed nn.

Proof:

Assume ϕ\phi has a Fréchet distribution, then bn=0b_{n}=0. To obtain linear scaling for the sequence un(τ)u_{n}(\tau) we write t=ρ(τ)=τξ1t=\rho(\tau)=\tau^{-\xi_{1}} where ξ1>0\xi_{1}>0 is the shape parameter for the limiting distribution of MnM_{n}. Thus un(τ)=antu_{n}(\tau)=a_{n}t and wn(τ)=g(k,T)anρ(τ)w_{n}(\tau)=g(k,T)a_{n}\rho(\tau). For simplicity of exposition we define αn=g(k,T)an\alpha_{n}=g(k,T)a_{n}. We obtain

μ(Mnant)eθ1t1ξ1:=Gθ1(t)\mu(M_{n}\leq a_{n}t)\to e^{-\theta_{1}t^{-\frac{1}{\xi_{1}}}}:=G^{\theta_{1}}(t)
μ(Bnαnt)eθ2t1ξ1:=Gθ2(t)\mu(B_{n}\leq\alpha_{n}t)\to e^{-\theta_{2}t^{-\frac{1}{\xi_{1}}}}:=G^{\theta_{2}}(t)

So for fixed large nn,

μ(Mnt)Gθ1(t/an)\mu(M_{n}\leq t)\sim G^{\theta_{1}}(t/a_{n})

and

μ(Bnt)Gθ2(t/αn)=[Gθ1(tan(anαn))]θ2θ1\mu(B_{n}\leq t)\sim G^{\theta_{2}}(t/\alpha_{n})=\left[G^{\theta_{1}}\bigl{(}\frac{t}{a_{n}}\cdot(\frac{a_{n}}{\alpha_{n}}\bigr{)}\bigr{)}\right]^{\frac{\theta_{2}}{\theta_{1}}}

Note that if H(t)H(t) is a generalized EVT distribution with parameters (μ,σ,ξ)(\mu,\sigma,\xi) then H(γt)H(\gamma t), γ>0\gamma>0, has the same shape parameter ξγ=ξ\xi_{\gamma}=\xi while σγ=σγ\sigma_{\gamma}=\frac{\sigma}{\gamma}, μγ=μγ\mu_{\gamma}=\frac{\mu}{\gamma}.

Thus if Gθ1(t/an)G^{\theta_{1}}(t/a_{n}) has parameters ξ1\xi_{1}, μ1\mu_{1} and σ1\sigma_{1} then Gθ1((t/an)(anαn))G^{\theta_{1}}((t/a_{n})\cdot(\frac{a_{n}}{\alpha_{n}})) has GEV parameters ξ2=ξ1\xi^{{}^{\prime}}_{2}=\xi_{1}, σ2=g(k,T)σ1\sigma^{{}^{\prime}}_{2}=g(k,T)\sigma_{1} and μ2=g(k,T)μ1\mu_{2}^{{}^{\prime}}=g(k,T)\mu_{1}.

Furthermore it is known [4, Theorem 5.2, Page 96] that if G(t)G(t) has GEV parameters ξ\xi, μ\mu and σ\sigma then GθG^{\theta} has GEV parameters ξθ=ξ\xi_{\theta}=\xi, μθ=μσξ(1θξ)\mu_{\theta}=\mu-\frac{\sigma}{\xi}(1-\theta^{\xi}) and σθ=σθξ\sigma_{\theta}=\sigma\theta^{\xi}.

Thus [Gθ1((t/an)(anαn)]θ2θ1][G^{\theta_{1}}((t/a_{n})\cdot(\frac{a_{n}}{\alpha_{n}})]^{\frac{\theta_{2}}{\theta_{1}}}] has GEV parameters ξ2=ξ1\xi_{2}=\xi_{1}, σ2=g(k,T)σ1(θ2θ1)ξ1\sigma_{2}=g(k,T)\sigma_{1}(\frac{\theta_{2}}{\theta_{1}})^{\xi_{1}} and μ2=g(k,T)μ1g(k,T)σ1ξ1(1(θ2/θ1)ξ1)\mu_{2}=g(k,T)\mu_{1}-\frac{g(k,T)\sigma_{1}}{\xi_{1}}(1-(\theta_{2}/\theta_{1})^{\xi_{1}}).

Comparing the two forms of Gθ1(tan)G^{\theta_{1}}(\frac{t}{a_{n}}), namely eθ1(tan)1ξ1e^{-\theta_{1}(\frac{t}{a_{n}})^{-\frac{1}{\xi_{1}}}} and exp{θ1[1+ξ1(tμ1σ1)]1ξ1}\exp\bigg{\{}-\theta_{1}\bigg{[}1+\xi_{1}\bigg{(}\frac{t-\mu_{1}}{\sigma_{1}}\bigg{)}\bigg{]}^{\frac{-1}{\xi_{1}}}\bigg{\}}, we see that the relation 1=ξ1μ1σ11=\frac{\xi_{1}\mu_{1}}{\sigma_{1}} holds. Hence,

μ2=g(k,T)μ1g(k,T)σ1ξ1(1(θ2θ1)ξ1)=g(k,T)μ1(θ2θ1)ξ1.\mu_{2}=g(k,T)\mu_{1}-\frac{g(k,T)\sigma_{1}}{\xi_{1}}\bigl{(}1-\bigl{(}\frac{\theta_{2}}{\theta_{1}}\bigr{)}^{\xi_{1}}\bigr{)}=g(k,T)\mu_{1}\bigl{(}\frac{\theta_{2}}{\theta_{1}}\bigr{)}^{\xi_{1}}.

This concludes the proof in the case of a Fréchet distribution.

Now suppose that ϕ\phi has a Weibull distribution, a setting in which the scaling constants bnb_{n} may be taken as CC, the supremum of the range of ϕ\phi. To obtain a linear scaling for the sequence un(τ)u_{n}(\tau) we write t=ρ(τ)=τξ1t=\rho(\tau)=-\tau^{-\xi_{1}} where ξ1<0\xi_{1}<0 is the shape parameter for the limiting distribution of MnM_{n}. Thus un(τ)=ant+Cu_{n}(\tau)=a_{n}t+C and wn(τ)=αnt+Cw_{n}(\tau)=\alpha_{n}t+C where αn=g(k,T)an\alpha_{n}=g(k,T)a_{n} by assumption. We obtain

μ(Mnant+C)eθ1(t)1ξ1:=Gθ1(t)\mu(M_{n}\leq a_{n}t+C)\to e^{-\theta_{1}(-t)^{-\frac{1}{\xi_{1}}}}:=G^{\theta_{1}}(t)
μ(Bnαnt+C)eθ2(t)1ξ1:=Gθ2(t)\mu(B_{n}\leq\alpha_{n}t+C)\to e^{-\theta_{2}(-t)^{-\frac{1}{\xi_{1}}}}:=G^{\theta_{2}}(t)

We now put the distribution into the standard form of the GEV. So for fixed large nn, let z=ant+Cz=a_{n}t+C, so that

μ(Mnz)Gθ1(zCan)\mu(M_{n}\leq z)\sim G^{\theta_{1}}\biggl{(}\frac{z-C}{a_{n}}\biggr{)}

and similarly

μ(Bnz)Gθ2(zCαn)=[Gθ1(zCαn)]θ2θ1.\mu(B_{n}\leq z)\sim G^{\theta_{2}}\biggl{(}\frac{z-C}{\alpha_{n}}\biggr{)}=\left[G^{\theta_{1}}\biggl{(}\frac{z-C}{\alpha_{n}}\biggr{)}\right]^{\frac{\theta_{2}}{\theta_{1}}}.

In Gθ1G^{\theta_{1}} we have

1+ξ1σ1(zμ1)=(zC)an1+\frac{\xi_{1}}{\sigma_{1}}(z-\mu_{1})=\frac{-(z-C)}{a_{n}}

which gives the relations 1ξ1μ1σ1=Can1-\frac{\xi_{1}\mu_{1}}{\sigma_{1}}=\frac{C}{a_{n}} and ξ1σ1=1an\frac{\xi_{1}}{\sigma_{1}}=\frac{-1}{a_{n}}. Similarly if αn=g(k,T)an\alpha_{n}=g(k,T)a_{n} replaces ana_{n} in Gθ1G^{\theta_{1}} we have, as the shape parameter ξ1\xi_{1} does not change, 1ξ1μ2σ2=Cg(k,T)an1-\frac{\xi_{1}\mu_{2}^{{}^{\prime}}}{\sigma_{2}^{{}^{\prime}}}=\frac{C}{g(k,T)a_{n}} and ξ1σ2=1g(k,T)an\frac{\xi_{1}}{\sigma_{2}^{{}^{\prime}}}=\frac{-1}{g(k,T)a_{n}}. We obtain σ2=g(k,T)σ1\sigma_{2}^{{}^{\prime}}=g(k,T)\sigma_{1} and μ2=σ1ξ1(g(k,T)1)+μ1\mu_{2}^{{}^{\prime}}=\frac{\sigma_{1}}{\xi_{1}}(g(k,T)-1)+\mu_{1}.

Now we account for the transformation Gθ1Gθ2G^{\theta_{1}}\to G^{\theta_{2}} by considering the parameters in [Gθ1]θ2θ1[G^{\theta_{1}}]^{\frac{\theta_{2}}{\theta_{1}}}. As in the Fréchet case we use the fact that if G(t)G(t) has GEV parameters ξ\xi, μ\mu and σ\sigma then GθG^{\theta} has GEV parameters ξθ=ξ\xi_{\theta}=\xi, μθ=μσξ(1θξ)\mu_{\theta}=\mu-\frac{\sigma}{\xi}(1-\theta^{\xi}) and σθ=σθξ\sigma_{\theta}=\sigma\theta^{\xi}.

Hence the parameters for BnB_{n} are σ2=g(k,T)σ1(θ2θ1)ξ1\sigma_{2}=g(k,T)\sigma_{1}(\frac{\theta_{2}}{\theta_{1}})^{\xi_{1}} and μ2=σ1ξ1(g(k,T)1)+μ1g(k,T)σ1ξ1(1(θ2θ1)ξ1)\mu_{2}=\frac{\sigma_{1}}{\xi_{1}}(g(k,T)-1)+\mu_{1}-\frac{g(k,T)\sigma_{1}}{\xi_{1}}(1-(\frac{\theta_{2}}{\theta_{1}})^{\xi_{1}}).

5. kk-exceedances: ϕ(x)\phi(x) maximized at an invariant set.

We assume that SS is a repelling fixed point x0x_{0} in the case of (A) or (B) or repelling invariant hyperplane of synchrony in the case of (C)(C). Let

Y(x)=min{ϕ(x),,ϕ(Tk1(x))}Y(x)=\min\{\phi(x),\ldots,\phi(T^{k-1}(x))\}

where ϕ(x)=d(x,S)α\phi(x)=d(x,S)^{-\alpha} (α>0\alpha>0) in the Fréchet case or ϕ(x)=Cd(x,S)α\phi(x)=C-d(x,S)^{-\alpha} (α<0\alpha<0) in the Weibull case . Recall Mn(x):=max{ϕ(x),ϕ(Tx),,ϕ(Tn1x)}M_{n}(x):=\max\{\phi(x),\phi(Tx),\ldots,\phi(T^{n-1}x)\} and Bn=max{Y(x),,Y(Tn1(x))}B_{n}=\max\{Y(x),\ldots,Y(T^{n-1}(x))\}.

Theorem 5.1.

In the setting of (A)(A), λ=|DT(x0)|\lambda=|DT(x_{0})|; in the setting of (B)(B), λ=λu\lambda=\lambda_{u}, the eigenvalue in the expanding direction; and in the case of (C)(C), λ=(1γ)β\lambda=(1-\gamma)\beta.

(a) Suppose ϕ\phi has a Fréchet distribution. If MnM_{n} has GEV with parameters ξ1\xi_{1}, σ1\sigma_{1} and μ1\mu_{1} then BnB_{n} has GEV with parameters μ2=λ(k1)αμ1\mu_{2}=\lambda^{-(k-1)\alpha}\mu_{1}, σ2=λ(k1)ασ1\sigma_{2}=\lambda^{-(k-1)\alpha}\sigma_{1} and ξ2=ξ1=ξ\xi_{2}=\xi_{1}=\xi.

(b) Suppose ϕ\phi has a Weibull distribution. If MnM_{n} has GEV with parameters ξ1\xi_{1}, σ1\sigma_{1} and μ1\mu_{1} then BnB_{n} has GEV with parameters μ2=μ1\mu_{2}=\mu_{1}, σ2=λ(k1)ασ1\sigma_{2}=\lambda^{-(k-1)\alpha}\sigma_{1} and ξ2=ξ1=ξ\xi_{2}=\xi_{1}=\xi.

Proof.

We focus on the Fréchet case. The Weibull case follows the same argument. Define wn(τ)w_{n}(\tau) by nμ(Y>wn(τ))=τn\mu(Y>w_{n}(\tau))=\tau. It is easy to see that Y(x)Y(x) is maximized for those points such that xx is closest to x0x_{0} or SS and then Y(x)=min{ϕ(x),ϕ(Tx),,ϕ(Tk1x)}=ϕ(Tk1x)λ(k1)αϕ(x)Y(x)=\min\{\phi(x),\phi(Tx),\ldots,\phi(T^{k-1}x)\}=\phi(T^{k-1}x)\sim\lambda^{-(k-1)\alpha}\phi(x) by the assumption of uniform repulsion. Thus the set (Y>wn(τ))(Y>w_{n}(\tau)) is a scaled version of and has the same shape as that of (ϕ>un(τ))(\phi>u_{n}(\tau)). Hence the proofs of Д(un)\operatorname{\textrm{Д}}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}_{q}^{{}^{\prime}}(u_{n}) proceed exactly as in the proofs of Theorem 2.1 and Theorem 2.8 in [2].

The relation nμ(Yk>wn(τ))=τn\mu(Y_{k}>w_{n}(\tau))=\tau implies nμ(ϕ(x)>λ(k1)αwn(τ))=τn\mu(\phi(x)>\lambda^{(k-1)\alpha}w_{n}(\tau))=\tau and hence wn(τ)=un(τ)λ(k1)αw_{n}(\tau)=u_{n}(\tau)\lambda^{-(k-1)\alpha}.

The extremal index of YY and the extremal index of ϕ\phi are both easily calculated and equal to θ=11λ\theta=1-\frac{1}{\lambda} in the case of (A)(A), (B)(B) and equal to 11λm11-\frac{1}{\lambda^{m-1}} in the case of (C)(C) [2]. Thus Lemma 4.1 concludes the proof.

6. kk-exceedances: ϕ(x)\phi(x) maximized at a generic non-recurrent point.

We first consider the Fréchet case. Consider Y(x)=min{ϕ(x),,ϕ(Tk1x)}Y(x)=\min\{\phi(x),\ldots,\phi(T^{k-1}x)\}, where ϕ(x)=d(x,x0)α\phi(x)=d(x,x_{0})^{-\alpha} and α>0\alpha>0. Suppose further that x0x_{0} is non-recurrent. Then there exists a δ>0\delta>0 such that at least one of the iterates TjxT^{j}x, j=0,,k1j=0,\ldots,k-1 satisfies d(Tjx,x0)>δd(T^{j}x,x_{0})>\delta and hence Y(Tnx)Y(T^{n}x) is uniformly bounded. Thus the process Y(Tn(x))Y(T^{n}(x)) is in the domain of attraction of a Weibull distribution rather than a Fréchet distribution. In general the form of the Weibull distribution cannot be discerned readily from ϕ\phi, except for some special cases. We illustrate with an example.

Example 6.1.

Consider the doubling map T(x)=2xmod1T(x)=2x\mod 1, x[0,1]x\in[0,1] local observable ϕ(x)=d(x,x0)α\phi(x)=d(x,x_{0})^{-\alpha} and functional Y(x)=min{ϕ(x),ϕ(Tx)}.Y(x)=\min\{\phi(x),\phi(Tx)\}. Then the process Bn(x)=maxknY(Tkx)B_{n}(x)=\max_{k\leq n}Y(T^{k}x) follows a Weibull law with tail index of 1-1, (which is indeed independent of α\alpha). To show this, suppose without loss of generality that x0(0,1/2)x_{0}\in(0,1/2). By elementary analysis, it can be shown that Y(x)Y(x) has global maximum 3αx0α3^{\alpha}x^{-\alpha}_{0} at value x=2x03x=\frac{2x_{0}}{3}. Within a local neighbourhood of this maximum, Y(x)Y(x) is piecewise smooth, and therefore the level set {Y>3αx0αw}\{Y>3^{\alpha}x^{-\alpha}_{0}-w\} has measure asymptotic to cαwc_{\alpha}w, for some cα>0c_{\alpha}>0 and w0w\to 0. Thus, Y(x)Y(x) is in the domain of attraction of the Weibull distribution with tail index -1. Since the level sets for {Y>wn(τ)}\{Y>w_{n}(\tau)\} are shrinking intervals, the relevant Д(un)\operatorname{\textrm{Д}}(u_{n}) and Дq(un)\operatorname{\textrm{Д}}_{q}^{{}^{\prime}}(u_{n}) easily hold [17]. Here, qq is chosen according to the recurrence properties of the orbit of 2x03\frac{2x_{0}}{3} (e.g. periodic versus non-periodic). Hence Bn(x)B_{n}(x) follows a (Weibull) GEV.

We have a similar (non)-result in the Weibull case. The tail index for YY need not be the same as that of ϕ\phi for the same reason as in the Fréchet case, though the distribution will also be a Weibull distribution but not that much more can be said in any generality.

7. kk-averages: ϕ(x)\phi(x) maximized at a non-recurrent point.

In the Weibull case there is little useful that can be said in any generality, except that YY will also have a Weibull distribution but possibly with a different shape parameter. Hence we focus on the Fréchet case. Assume in case (A)(A), (B)(B) or (C)(C) that ϕ(x)=d(x,x0)α\phi(x)=d(x,x_{0})^{-\alpha}, α>0\alpha>0, where x0x_{0} is non-recurrent in the sense of Collet [3]. In the Fréchet or heavy-tailed case the time average is dominated by the maximum value, so we should expect a simple relation for the GEV parameters of Mn=maxjn{ϕ(Tjx)}M_{n}=\max_{j\leq n}\{\phi(T^{j}x)\} and those of Bn=maxjn{Yj}B_{n}=\max_{j\leq n}\{Y_{j}\} where Yj(x)=1ki=jj+k1ϕ(Tix)Y_{j}(x)=\frac{1}{k}\sum_{i=j}^{j+k-1}\phi(T^{i}x). If ξ\xi, σ\sigma and μ\mu are the shape, scale and location parameters of MnM_{n} then ξ\xi, σk\frac{\sigma}{k} and μk\frac{\mu}{k} are the shape, scale and location parameters of BnB_{n}. Assume that ϕ(x)=d(x,x0)α\phi(x)=d(x,x_{0})^{-\alpha}, α>0\alpha>0, where x0x_{0} is non-recurrent.

Theorem 7.1.

Assume in case (A)(A), (B)(B) or (C)(C) that ϕ(x)\phi(x) is a Fréchet observable of form ϕ(x)=d(x,x0)α\phi(x)=d(x,x_{0})^{-\alpha}, α>0\alpha>0, where x0x_{0} is non-recurrent in the sense of Collet [3]. Let Mn=maxjn{ϕ(Tjx)}M_{n}=\max_{j\leq n}\{\phi(T^{j}x)\} and Bn=maxjn{Yj}B_{n}=\max_{j\leq n}\{Y_{j}\} where Yj(x)=1ki=jj+k1ϕ(Tix)Y_{j}(x)=\frac{1}{k}\sum_{i=j}^{j+k-1}\phi(T^{i}x). If MnM_{n} has GEV with parameters ξ1\xi_{1}, σ1\sigma_{1} and μ1\mu_{1} then BnB_{n} has GEV with parameters μ2=μ1k\mu_{2}=\frac{\mu_{1}}{k}, σ2=σ1k\sigma_{2}=\frac{\sigma_{1}}{k} and ξ2=ξ1=ξ\xi_{2}=\xi_{1}=\xi.

Proof.

Let kk\geq and define the time-average

Yi(x)=1kj=ii+k1ϕ(Tjx).Y_{i}(x)=\frac{1}{k}\sum_{j=i}^{i+k-1}\phi(T^{j}x).

Let unu_{n} be an increasing sequence. Let r(un)r(u_{n}) be defined by Br(un)(x0)={x:ϕ(x)>un}B_{r(u_{n})}(x_{0})=\{x:\phi(x)>u_{n}\}.

Let Vn=i=1k1TiBr(un)(x0)V^{-}_{n}=\cup_{i=1}^{k-1}T^{-i}B_{r(u_{n})}(x_{0}) and Vn0=Br(un)(x0)V^{0}_{n}=B_{r(u_{n})}(x_{0}) and note that because of our genericity condition there exists KK such that for all large nn, ϕ|VnK\phi|_{{V^{-}_{n}}}\leq K.

We first note that if unu_{n}\to\infty then Y1>unY_{1}>u_{n} implies that Tj(x)Br(kun)(x0)T^{j}(x)\in B_{r(ku_{n})}(x_{0}) for one (precisely one) jj^{*} such that 0jk10\leq j\leq k-1. Note that

1kjj,0jk1ϕ(Tjx)M\frac{1}{k}\sum_{j\not=j^{*},0\leq j\leq k-1}\phi(T^{j}x)\leq M

and by the existence of a density at x0x_{0},

|μ(Br(M+kun)(x0))μ(Br(kun)(x0))|=o(μ(Br(kun)(x0))).|\mu(B_{r(M+ku_{n})}(x_{0}))-\mu(B_{r(ku_{n})}(x_{0}))|=o(\mu(B_{r(ku_{n})}(x_{0}))).

Since TT preserves μ\mu, μ(TjBr(un)(x0))=μ(Br(un)(x0))\mu(T^{-j}B_{r(u_{n})}(x_{0}))=\mu(B_{r(u_{n})}(x_{0})) for 0jk10\leq j\leq k-1. Hence

μ(Vn0)μ(Vn0)+μ(Vn)=1k.\frac{\mu(V^{0}_{n})}{\mu(V^{0}_{n})+\mu(V_{n}^{-})}=\frac{1}{k}.

Define

Mn=max{ϕ(x),ϕ(Tx),,ϕ(Tn1(x))}.M_{n}=\max\{\phi(x),\phi(Tx),\ldots,\phi(T^{n-1}(x))\}.

We define a sequence un(τ)u_{n}(\tau) by the requirement that

nμ(ϕ>un(τ))=τ.n\mu(\phi>u_{n}(\tau))=\tau. (7.1)

It is well known that conditions D(un)D(u_{n}), D(un)D^{{}^{\prime}}(u_{n}) of [32] (see for example [33]) hold for a generic x0x_{0} in our dynamical settings and hence

μ(Mnun(τ))eτ.\mu(M_{n}\leq u_{n}(\tau))\to e^{-\tau}.

We will now relate the scaling constants for Bn:=maxjn{Yj(x)}B_{n}:=\max_{j\leq n}\{Y_{j}(x)\} where

Yi(x)=1kj=ii+k1ϕ(Tjx)Y_{i}(x)=\frac{1}{k}\sum_{j=i}^{i+k-1}\phi(T^{j}x)

to the scaling constants for MnM_{n}.

Consider a sequence wnw_{n} such that

nμ(Yk>wn(τ))=τ.n\mu(Y_{k}>w_{n}(\tau))=\tau.

Now as x0x_{0} is non-recurrent

μ(Yk>wn(τ))\displaystyle\mu(Y_{k}>w_{n}(\tau)) =μ(j=0k1ϕ(Tjx)>kwn(τ))\displaystyle=\mu\left(\sum_{j=0}^{k-1}\phi(T^{j}x)>kw_{n}(\tau)\right)
=kμ(ϕ>kwn(τ))+o(1/n).\displaystyle=k\mu(\phi>kw_{n}(\tau))+o(1/n).

Since un(τ)u_{n}(\tau) is defined by the requirement nμ(ϕ>un(τ))τn\mu(\phi>u_{n}(\tau))\to\tau it is easy to see that un(τ)=nξτξu_{n}(\tau)=n^{\xi}\tau^{-\xi}. We see that we have the relation

wn(τ)=1kun(τk)w_{n}(\tau)=\frac{1}{k}\ u_{n}\bigl{(}\frac{\tau}{k}\bigr{)}

and hence wn(τ)=kξ1un(τ)w_{n}(\tau)=k^{\xi-1}u_{n}(\tau).

It is clear that ϕ\phi has extremal index 11. The scheme of the proof of Condition Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}) is itself somewhat standard [5, 19] and is a consequence of suitable decay of correlation estimates. Our genericity assumption on x0x_{0} establishes Condition Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}) in a standard way. We will now show that YY has an extremal index θ=1k\theta=\frac{1}{k}. In our setting

limnθn=limnμ(An(q))μ(Un)\lim_{n\to\infty}\theta_{n}=\lim_{n\to\infty}\frac{\mu(A_{n}^{(q)})}{\mu(U_{n})}

exists and equals

μ(Vn0)μ(Vn0)+μ(Vn)=1k.\frac{\mu(V_{n}^{0})}{\mu(V_{n}^{0})+\mu(V_{n}^{-})}=\frac{1}{k}.

Thus we have g(k,T)=kξ1g(k,T)=k^{\xi-1} and θ2=1k\theta_{2}=\frac{1}{k}. We conclude from Lemma 4.1 that

σ2=kξ1(σ1(1/k)ξ)=σ1k\sigma_{2}=k^{\xi-1}(\sigma_{1}(1/k)^{\xi})=\frac{\sigma_{1}}{k}

and

μ2=kξ1(μ1(1/k)ξ)=μ1k.\mu_{2}=k^{\xi-1}(\mu_{1}(1/k)^{\xi})=\frac{\mu_{1}}{k}.

8. kk-averages: ϕ(x)\phi(x) maximized at an invariant set.

Unlike for kk-averages associated to ϕ\phi maximised at a non-recurrent point, there is no general result covering all of cases (A), (B) and (C). The GEV scaling for kk-averages depends on the recurrence properties of the system, as well as on the geometry of the level sets. For simplicity we consider the doubling map T(x)=2xmod1T(x)=2x\mod 1 on the interval [0,1][0,1], and observable ϕ(x)=d(x,0)α\phi(x)=d(x,0)^{-\alpha}, α>0\alpha>0. This is clearly maximized at the fixed point 0 of TT. For k1k\geq 1, consider the time average

Yi(x)=1kj=ii+k1ϕ(Tj(x)).Y_{i}(x)=\frac{1}{k}\sum_{j=i}^{i+k-1}\phi(T^{j}(x)).

We state the following result.

Theorem 8.1.

Consider the map T(x)=2xmod1T(x)=2x\mod 1, and the observable ϕ(x)=d(x,0)α\phi(x)=d(x,0)^{-\alpha}. Let Mn=maxjn{ϕ(Tjx)}M_{n}=\max_{j\leq n}\{\phi(T^{j}x)\} and Bn=maxjn{Yj}B_{n}=\max_{j\leq n}\{Y_{j}\} where Yj(x)=1ki=jj+k1ϕ(Tix)Y_{j}(x)=\frac{1}{k}\sum_{i=j}^{j+k-1}\phi(T^{i}x). If MnM_{n} has GEV with parameters ξ1\xi_{1}, σ1\sigma_{1} and μ1\mu_{1} then BnB_{n} has GEV with parameters μ2=c(k)μ1k\mu_{2}=\frac{c(k)\mu_{1}}{k}, σ2=c(k)σ1k\sigma_{2}=\frac{c(k)\sigma_{1}}{k} and ξ2=ξ1=ξ\xi_{2}=\xi_{1}=\xi. Here c(k)c(k) is a function of kk which satisfies cl<c(k)<cuc_{l}<c(k)<c_{u} for uniform constants cl,cu>0c_{l},c_{u}>0.

Remark 8.2.

This example is clearly within class of systems (A). From the techniques of the proof, it is possible to extend the conclusion of Theorem 8.1 to other examples within class (A). The main steps are to determine the scaling laws for the GEV constants. Given this, the verfication of conditions Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}), Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}) follows the same approach as the proof of Theorem 7.1.

Remark 8.3.

Within the proof we will be more explicit about the functions c(k)c(k), and for the extremal index we obtain the asymptotic result θ12k\theta\sim\frac{1}{2k}, (for large kk).

The case k=2k=2. Before embarking on the proof for general kk, we illustrate for k=2k=2. Here, we just focus on the calculation of the relevant wnw_{n} sequence, and calculation of the extremal index. In the proof of Theorem 8.1 for general kk, we consider verification of conditions Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}), Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}).

Here, we therefore have

Y0(x)=12(d(x,0)α+d(T(x),0)α).Y_{0}(x)=\frac{1}{2}\left(d(x,0)^{-\alpha}+d(T(x),0)^{-\alpha}\right).

For large u>0u>0, consider the set {Y0(x)u}\{Y_{0}(x)\geq u\}. Clearly {Y0(x)=+}={0,1/2}\{Y_{0}(x)=+\infty\}=\{0,1/2\}, and hence there exists u0>0u_{0}>0 such that for all uu0u\geq u_{0} we have the following. The set {Y0(x)u}\{Y_{0}(x)\geq u\} consists of two disjoint neighbourhoods 𝒩0(u)\mathcal{N}_{0}(u), 𝒩12(u)\mathcal{N}_{\frac{1}{2}}(u) of 0, and (resp.) 1/21/2. We choose u0u_{0} so that T2T^{2} is injective on each neighbourhood. Furthermore we suppose that for all x𝒩12(u)x\in\mathcal{N}_{\frac{1}{2}}(u) we have d(x,0)>1/4d(x,0)>1/4.

For uu0u\geq u_{0}, we now estimate μ{Y0>u}\mu\{Y_{0}>u\}. Consider x𝒩0(u)x\in\mathcal{N}_{0}(u) with d(x,0)=rd(x,0)=r. Then

Y0(x)=12(rα+(2r)α).Y_{0}(x)=\frac{1}{2}(r^{-\alpha}+(2r)^{-\alpha}).

By estimating the range of values of rr for which Y0(x)uY_{0}(x)\geq u we obtain,

μ(𝒩0(u))=2(2u)1α(1+2α)1α.\mu(\mathcal{N}_{0}(u))=2(2u)^{-\frac{1}{\alpha}}(1+2^{-\alpha})^{\frac{1}{\alpha}}. (8.1)

Now let x𝒩12(u)x\in\mathcal{N}_{\frac{1}{2}}(u), and suppose d(x,1/2)=rd(x,1/2)=r. Since T2T^{2} is injective on 𝒩12(u)\mathcal{N}_{\frac{1}{2}}(u), we have d(T(x),T(1/2))=2d(x,1/2)=2rd(T(x),T(1/2))=2d(x,1/2)=2r. Since T(1/2)=0T(1/2)=0, and using the assumption 1/4<d(x,0)1/21/4<d(x,0)\leq 1/2, we obtain the following bounds

12(2α+(2r)α)Y0(x)12(4α+(2r)α).\frac{1}{2}(2^{\alpha}+(2r)^{-\alpha})\leq Y_{0}(x)\leq\frac{1}{2}(4^{\alpha}+(2r)^{-\alpha}). (8.2)

Hence

Y0(x)ur12(2u4α)1α,Y_{0}(x)\geq u\;\implies\;r\leq\frac{1}{2}(2u-4^{\alpha})^{-\frac{1}{\alpha}},

and conversely

r12(2u2α)1αY0(x)u.r\leq\frac{1}{2}(2u-2^{\alpha})^{-\frac{1}{\alpha}}\;\implies\;Y_{0}(x)\geq u.

Putting this together leads to

μ(𝒩12(u))[(2u2α)1α,(2u4α)1α].\mu(\mathcal{N}_{\frac{1}{2}}(u))\in[(2u-2^{\alpha})^{-\frac{1}{\alpha}},(2u-4^{\alpha})^{-\frac{1}{\alpha}}]. (8.3)

Hence,

μ{Y0(x)u}=(2u)1α(2(1+2α)1α+(1c0(α)(2u)1)1α),\mu\{Y_{0}(x)\geq u\}=(2u)^{-\frac{1}{\alpha}}\left(2(1+2^{-\alpha})^{\frac{1}{\alpha}}+(1-c_{0}(\alpha)(2u)^{-1})^{-\frac{1}{\alpha}}\right), (8.4)

with 2αc0(α)4α2^{\alpha}\leq c_{0}(\alpha)\leq 4^{\alpha}. Thus for uu0u\geq u_{0} large, we obtain the asymptotic relation,

μ{Y0(x)u}(2u)1α(1+2(1+2α)1α).\mu\{Y_{0}(x)\geq u\}\sim(2u)^{-\frac{1}{\alpha}}\left(1+2(1+2^{-\alpha})^{\frac{1}{\alpha}}\right). (8.5)

Using the notations of Lemma 4.1 we can obtain the scaling relations between un(τ)u_{n}(\tau) and wn(τ)w_{n}(\tau) via wn(τ)=g(k,T)un(τ)w_{n}(\tau)=g(k,T)u_{n}(\tau). Using, μ{ϕ(x)u}2u1α\mu\{\phi(x)\geq u\}\sim 2u^{-\frac{1}{\alpha}} we obtain un=(2n/τ)ξu_{n}=(2n/\tau)^{\xi}, where ξ=α\xi=\alpha is the shape parameter. The corresponding wn(τ)w_{n}(\tau) can be found using equation (8.5). For k=2k=2, this leads to

g(k,T)=12(1+2(1+2α)1α)α.g(k,T)=\frac{1}{2}\left(1+2(1+2^{-\alpha})^{\frac{1}{\alpha}}\right)^{\alpha}.

To consider the extremal index, it suffices to estimate the measure of the set A(u)={Y0(x)u,Y0(T(x))u}A(u)=\{Y_{0}(x)\geq u,Y_{0}(T(x))\leq u\}. Again we take uu0u\geq u_{0}. For u012(12)αu_{0}\geq\frac{1}{2}(12)^{\alpha} we claim that A(u)𝒩12(u)=A(u)\cap\mathcal{N}_{\frac{1}{2}}(u)=\emptyset. To see this, consider x𝒩12(u)x\in\mathcal{N}_{\frac{1}{2}}(u) and d(x,1/2)=rd(x,1/2)=r. Then

Y0(T(x))Y0(x)=(4r)αd(x,1/2)α(4r)α4α.Y_{0}(T(x))-Y_{0}(x)=(4r)^{-\alpha}-d(x,1/2)^{-\alpha}\geq(4r)^{-\alpha}-4^{\alpha}.

This follows by assumption that d(x,0)>1/4d(x,0)>1/4 and that T2T^{2} is injective on 𝒩12(u)\mathcal{N}_{\frac{1}{2}}(u) so that d(T2(x),T2(1/2))=4d(x,1/2)=4rd(T^{2}(x),T^{2}(1/2))=4d(x,1/2)=4r, with T2(1/2)=0T^{2}(1/2)=0. By assumption on u0u_{0}, it follows that Y0(T(x))>Y0(x)uY_{0}(T(x))>Y_{0}(x)\geq u. Hence A(u)=A(u)𝒩0(u)A(u)=A(u)\cap\mathcal{N}_{0}(u).

To find μ(A(u))\mu(A(u)) we proceed as follows. On 𝒩0(u)\mathcal{N}_{0}(u), with d(x,0)=rd(x,0)=r we have

Y0(T(x))=12((2r)α+(4r)α),Y_{0}(T(x))=\frac{1}{2}((2r)^{-\alpha}+(4r)^{-\alpha}),

so that {Y0(T(x))u}\{Y_{0}(T(x))\leq u\} is the set of points xx satisfying

d(x,0)12(2u)1α(1+2α)1α.d(x,0)\leq\frac{1}{2}(2u)^{-\frac{1}{\alpha}}(1+2^{-\alpha})^{\frac{1}{\alpha}}.

Hence, for the extremal index it suffices to study the limit

limnμ(An(1)(wn))μ{Y0wn},\lim_{n\to\infty}\frac{\mu(A^{(1)}_{n}(w_{n}))}{\mu\{Y_{0}\geq w_{n}\}},

with wnw_{n} specified via the asymptotic relation nμ(Y0>wn)τn\mu(Y_{0}>w_{n})\to\tau, (for τ>0\tau>0). We obtain,

limnμ(An(1)(wn))μ{Y0wn}=(1+2α)1α1+2(1+2α)1α.\lim_{n\to\infty}\frac{\mu(A^{(1)}_{n}(w_{n}))}{\mu\{Y_{0}\geq w_{n}\}}=\frac{(1+2^{-\alpha})^{\frac{1}{\alpha}}}{1+2(1+2^{-\alpha})^{\frac{1}{\alpha}}}. (8.6)

This gives the extremal index, and concludes the example in the case k=2k=2.

Proof.

We now prove Theorem 8.1. Following on from the k=2k=2 example, we obtain asymptotics for μ{Y0(x)u}\mu\{Y_{0}(x)\geq u\}, and μ{Y0(x)u,Y0(T(x))u}\mu\{Y_{0}(x)\geq u,Y_{0}(T(x))\leq u\} as uu\to\infty. The argument here is for the run-length kk arbitrary, but fixed. It is clear that the set {Y0=+}\{Y_{0}=+\infty\} corresponds to the set {x:fk1(x)=0}\{x:f^{k-1}(x)=0\}. We subdivide this set up into subsets EjE_{j} defined as follows:

{Y0=+}=0j<k{x:f(x)0,<j,fj(x)=0}:=0j<kEj.\{Y_{0}=+\infty\}=\bigcup_{0\leq j<k}\{x:\,f^{\ell}(x)\neq 0,\,\forall\,\ell<j,\,f^{j}(x)=0\}:=\bigcup_{0\leq j<k}E_{j}. (8.7)

Thus for k4k\geq 4, we have E0={0}E_{0}=\{0\}, E1={12}E_{1}=\{\frac{1}{2}\}, E2={14,34}E_{2}=\{\frac{1}{4},\frac{3}{4}\}, E3={18,38,58,78}E_{3}=\{\frac{1}{8},\frac{3}{8},\frac{5}{8},\frac{7}{8}\}, etc. The general representation can be expressed as

Ej={2i+12j, 0i2j1},(1jk1),E_{j}=\left\{\frac{2i+1}{2^{j}},\,0\leq i\leq 2^{j-1}\right\},\;(1\leq j\leq k-1), (8.8)

and we write xi,j=2j(2i+1)x_{i,j}=2^{-j}(2i+1). Note that the cardinality of EjE_{j} is 2j12^{j-1}. To estimate μ{Y0>u}\mu\{Y_{0}>u\}, we consider the function Y0Y_{0} on neighbourhoods of xi,jx_{i,j}. We state the following lemma.

Lemma 8.4.

For k2k\geq 2, we have the following asymptotic

μ{Y0(x)u}(ku)1α{2(1zk1z)1α+j=1k1(1zj1z)1α,}\mu\{Y_{0}(x)\geq u\}\sim(ku)^{-\frac{1}{\alpha}}\left\{2\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}+\sum_{j=1}^{k-1}\left(\frac{1-z^{j}}{1-z}\right)^{\frac{1}{\alpha}},\right\} (8.9)

where z=2αz=2^{-\alpha}. In particular, the following refinements hold. There exists c1c1(k)>0c_{1}\equiv c_{1}(k)>0 with

μ{Y0(x)u}c1(k)(ku)1α,with 1c1(k)k2(12α)1α.\mu\{Y_{0}(x)\geq u\}\sim c_{1}(k)(ku)^{-\frac{1}{\alpha}},\;\textrm{with}\;1\leq\frac{c_{1}(k)}{k}\leq\frac{2}{(1-2^{-\alpha})^{-\frac{1}{\alpha}}}. (8.10)

Moreover, as kk\to\infty then c1(k)/kc_{1}(k)/k approaches (12α)1α.(1-2^{-\alpha})^{-\frac{1}{\alpha}}.

Proof.

We prove Lemma 8.4 as follows. Following the methods for the case k=2k=2, we see that there exists u0>0u_{0}>0, such that for all uu0u\geq u_{0} the following properties hold.

  • (i)

    The set {Y0>u}\{Y_{0}>u\} consists of a union of 2k12^{k-1} disjoint neighbourhoods 𝒩i,j(u)\mathcal{N}_{i,j}(u) with xi,j𝒩i,j(u)x_{i,j}\in\mathcal{N}_{i,j}(u).

  • (ii)

    The map Tk:𝒩i,j(u)S1T^{k}:\mathcal{N}_{i,j}(u)\to S^{1} is injective. Hence, |T(𝒩i,j(u))|=2|𝒩i,j(u)||T^{\ell}(\mathcal{N}_{i,j}(u))|=2^{\ell}|\mathcal{N}_{i,j}(u)| for all k\ell\leq k.

  • (iii)

    For all x𝒩i,j(u)x\in\mathcal{N}_{i,j}(u) we have that d(T(x),0)>22kd(T^{\ell}(x),0)>2^{-2k} for all <j\ell<j.

Thus from (iii), if x𝒩i,j(u)x\in\mathcal{N}_{i,j}(u) we have that 22kd(T(x),0)1/22^{-2k}\leq d(T^{\ell}(x),0)\leq 1/2 for all <j\ell<j. The choice 22k2^{-2k} is somewhat arbitrary, and any constant less that 2k+12^{-k+1} would suffice. Furthermore if d(x,xi,j)=rd(x,x_{i,j})=r, then combining with (ii) we see that d(T(x),0)=2d(x,xi,j)=2rd(T^{\ell}(x),0)=2^{\ell}d(x,x_{i,j})=2^{\ell}r for all j\ell\geq j. This follows from T(xi,j)=0T^{\ell}(x_{i,j})=0 for all j\ell\geq j.

For j1j\geq 1, let us consider x𝒩i,jx\in\mathcal{N}_{i,j}, and estimate μ(𝒩i,j(u))\mu(\mathcal{N}_{i,j}(u)) for uu0u\geq u_{0}. Suppose that d(x,xi,j)=rd(x,x_{i,j})=r. We obtain bounds on rr in terms of uu by considering {x:Y0(x)u}\{x:Y_{0}(x)\geq u\}. We have

Y0(x)=1k(=0j1d(T(x),0)α+=jk1d(T(x),0)α),=1k(=0j1d(T(x),0)α+=jk1(2r)α).\begin{split}Y_{0}(x)&=\frac{1}{k}\left(\sum_{\ell=0}^{j-1}d(T^{\ell}(x),0)^{-\alpha}+\sum_{\ell=j}^{k-1}d(T^{\ell}(x),0)^{-\alpha}\right),\\ &=\frac{1}{k}\left(\sum_{\ell=0}^{j-1}d(T^{\ell}(x),0)^{-\alpha}+\sum_{\ell=j}^{k-1}(2^{\ell}r)^{-\alpha}\right).\end{split} (8.11)

From (iii), we have 22kd(T(x),0)1/22^{-2k}\leq d(T^{\ell}(x),0)\leq 1/2. Writing z=2αz=2^{-\alpha}, we have the following implications,

Y0(x)ur(kujz2k)1α(=jk1z)1α,Y_{0}(x)\geq u\;\implies\;r\leq(ku-jz^{-2k})^{-\frac{1}{\alpha}}\left(\sum_{\ell=j}^{k-1}z^{\ell}\right)^{\frac{1}{\alpha}},

and conversely

r(2ujz1)1α(=jk1z)1αY0(x)u.r\leq(2u-jz^{-1})^{-\frac{1}{\alpha}}\left(\sum_{\ell=j}^{k-1}z^{\ell}\right)^{\frac{1}{\alpha}}\;\implies\;Y_{0}(x)\geq u.

In the two implications given above, both bounds on rr are asymptotically comparable. This gives

μ(𝒩i,j(u))=2(ku)1α(1h(u,α)jku)1α2j(1zkj1z)1α.\mu(\mathcal{N}_{i,j}(u))=2(ku)^{-\frac{1}{\alpha}}\left(1-\frac{h(u,\alpha)j}{ku}\right)^{-\frac{1}{\alpha}}2^{-j}\left(\frac{1-z^{k-j}}{1-z}\right)^{\frac{1}{\alpha}}. (8.12)

The function h(α,u)h(\alpha,u) satisfies uniform bounds 2αh(u,α)22αk2^{\alpha}\leq h(u,\alpha)\leq 2^{2\alpha k}. Notice that the estimate μ(𝒩i,j(u))\mu(\mathcal{N}_{i,j}(u)) does not depend on ii. The dependence on ii is removed via item (iii). Softening of (iii) to include dependence on ii would not lead to improvement in the asymptotic estimates, except through bounds on the function h(α,u)h(\alpha,u). Now for each j1j\geq 1, there are 2j12^{j-1} such 𝒩i,j\mathcal{N}_{i,j}. Hence, taking union over all ii (for given jj), and by using pairwise disjointness of these neighbourhoods, we obtain

i=02j11μ(𝒩i,j(u))(ku)1α(1zkj1z)1α.\sum_{i=0}^{2^{j-1}-1}\mu(\mathcal{N}_{i,j}(u))\sim(ku)^{-\frac{1}{\alpha}}\left(\frac{1-z^{k-j}}{1-z}\right)^{\frac{1}{\alpha}}. (8.13)

This equation gives the estimate for μ({Y0>u}(i𝒩i,j(u)))\mu(\{Y_{0}>u\}\cap(\cup_{i}\mathcal{N}_{i,j}(u))).

For case j=0j=0, we consider the neighbourhood 𝒩0,0(u)\mathcal{N}_{0,0}(u) of x0,0=0x_{0,0}=0. This estimate is straightforward, and we obtain the precise result

μ(𝒩0,0(u))=2(ku)1α(1zk1z)1α.\mu(\mathcal{N}_{0,0}(u))=2(ku)^{-\frac{1}{\alpha}}\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}. (8.14)

Using equations (8.13), (8.14) we obtain equation (8.9) in Lemma 8.4. To complete the proof of the lemma, we just estimate the geometric series terms in equation (8.9) to quantify the constant c1(k)c_{1}(k). This analysis is elementary. Indeed, Since z=2αz=2^{-\alpha}, we have

1(1zkj1z)1α1(1z)1α=2(2α1)1α.1\leq\left(\frac{1-z^{k-j}}{1-z}\right)^{\frac{1}{\alpha}}\leq\frac{1}{(1-z)^{\frac{1}{\alpha}}}=\frac{2}{(2^{\alpha}-1)^{\frac{1}{\alpha}}}.

Thus each term within the sum of equation (8.13) is bounded above/below accordingly.

To complete the proof the lemma, consider now the asymptotic behaviour as kk\to\infty, so we can refine the estimate on c1(k)c_{1}(k). The following estimates are valid for |z|<1|z|<1, but we have in mind the particular value z=2αz=2^{-\alpha}. We have

(1zj)1α=1zjb(z),(1-z^{j})^{\frac{1}{\alpha}}=1-z^{j}b(z),

where b(z)b(z) uniformly bounded above. Thus the following estimates hold

j=1k1(1zj1z)1α=1(1z)1αj=1k1(1b(z)zj)=k1(1z)1α+O(z(1zk1)1z).\begin{split}\sum_{j=1}^{k-1}\left(\frac{1-z^{j}}{1-z}\right)^{\frac{1}{\alpha}}&=\frac{1}{(1-z)^{\frac{1}{\alpha}}}\sum_{j=1}^{k-1}(1-b(z)z^{j})\\ &=\frac{k-1}{(1-z)^{\frac{1}{\alpha}}}+O\left(\frac{z(1-z^{k-1})}{1-z}\right).\end{split} (8.15)

Thus the constant c1(k)c_{1}(k) in equation (8.10) can be replaced by (1z)1αk+c2(1-z)^{-\frac{1}{\alpha}}k+c_{2}, with |c2||c_{2}| uniformly bounded and independent of kk. This completes the proof of the Lemma. ∎

We now consider the extremal index, and it suffices to estimate μ{Y0(x)>u,Y0(Tx)<u}\mu\{Y_{0}(x)>u,Y_{0}(Tx)<u\} for large thresholds u>0u>0. We state the following lemma.

Lemma 8.5.

Let A(u)={Yu,YT<u}A(u)=\{Y\geq u,Y\circ T<u\}. Then we have the following estimate

limuμ{A(u)}μ{Y0>u}=12c1(k)(1zk1z)1α,\lim_{u\to\infty}\frac{\mu\{A(u)\}}{\mu\{Y_{0}>u\}}=\frac{1}{2c_{1}(k)}\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}, (8.16)

where c1(k)c_{1}(k) is the constant in Lemma 8.4.

Proof.

Suppose u0u_{0} is as specified in the proof of Lemma 8.4. Consider x𝒩i,j(u)x\in\mathcal{N}_{i,j}(u) for uu0u\geq u_{0}. For j1j\geq 1, we claim that 𝒩i,j(u){Y0(Tx)<u}=\mathcal{N}_{i,j}(u)\cap\{Y_{0}(Tx)<u\}=\emptyset. The claim can be established by analysing Y0(x)Y_{0}(x) and Y0(Tx)Y_{0}(Tx) for x𝒩i,j(u)x\in\mathcal{N}_{i,j}(u). Suppose that d(x,xi,j)=rd(x,x_{i,j})=r. Since Tk:𝒩i,j(u)S1T^{k}:\mathcal{N}_{i,j}(u)\to S^{1} is injective, we have

Y0(x)\displaystyle Y_{0}(x) =1k(=0j1d(T(x),0)α+=jk1(2r)α);\displaystyle=\frac{1}{k}\left(\sum_{\ell=0}^{j-1}d(T^{\ell}(x),0)^{-\alpha}+\sum_{\ell=j}^{k-1}(2^{\ell}r)^{-\alpha}\right);
Y0(Tx)\displaystyle Y_{0}(Tx) =1k(=1j1d(T(x),0)α+=jk(2r)α).\displaystyle=\frac{1}{k}\left(\sum_{\ell=1}^{j-1}d(T^{\ell}(x),0)^{-\alpha}+\sum_{\ell=j}^{k}(2^{\ell}r)^{-\alpha}\right).

Hence,

Y0(Tx)Y0(x)=(2kr)α22kα.Y_{0}(Tx)-Y_{0}(x)=(2^{k}r)^{-\alpha}-2^{2k\alpha}. (8.17)

By adjusting u0u_{0} accordingly, the quantity on the right of equation (8.17) is positive for all x𝒩i,j(u)x\in\mathcal{N}_{i,j}(u). Indeed, we just require r<23k.r<2^{3k}. This leads to the claim that 𝒩i,j(u){Y0(Tx)<u}=\mathcal{N}_{i,j}(u)\cap\{Y_{0}(Tx)<u\}=\emptyset. For j=0j=0, the corresponding intersection is not empty and we estimate the measure 𝒩0,0(u){Y0(Tx)<u}\mathcal{N}_{0,0}(u)\cap\{Y_{0}(Tx)<u\}. The measure of 𝒩0,0\mathcal{N}_{0,0} is given in equation (8.14). Now consider points in this set with Y0(Tx)<uY_{0}(Tx)<u. Suppose that d(x,0)=rd(x,0)=r, with Y0(x)=uY_{0}(x)=u. We estimate rr in terms of uu. Since TkT^{k} is injective on 𝒩0,0(u)\mathcal{N}_{0,0}(u) we obtain

1kr1α(z++zk)=kur=(ku)1αz1α(1zk1z)1α.\frac{1}{k}\cdot r^{-\frac{1}{\alpha}}(z+\ldots+z^{k})=ku\;\implies\;r=(ku)^{-\frac{1}{\alpha}}z^{\frac{1}{\alpha}}\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}. (8.18)

Overall, this leads to

μ{Y0(x)>u,Y0(Tx)<u}=12(ku)1α(1zk1z)1α.\mu\{Y_{0}(x)>u,Y_{0}(Tx)<u\}=\frac{1}{2}(ku)^{-\frac{1}{\alpha}}\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}. (8.19)

Hence, we obtain

limuμ(A(u))μ{Y0u}=12c1(k)(1zk1z)1α.\lim_{u\to\infty}\frac{\mu(A(u))}{\mu\{Y_{0}\geq u\}}=\frac{1}{2c_{1}(k)}\left(\frac{1-z^{k}}{1-z}\right)^{\frac{1}{\alpha}}. (8.20)

This completes the proof of the lemma. ∎

We now complete the proof of Theorem 8.1. First we quantify the function g(k,T)g(k,T) in the relation wn(τ)=g(k,T)un(τ)w_{n}(\tau)=g(k,T)u_{n}(\tau). Recall that given τ>0\tau>0, un(τ)u_{n}(\tau) is such that nμ(ϕ(x)>un(τ))τn\mu(\phi(x)>u_{n}(\tau))\to\tau. Thus un=(2n/τ)ξu_{n}=(2n/\tau)^{\xi}, where ξ=α\xi=\alpha is the shape parameter. Using Lemma 8.9, we obtain the corresponding wn(τ)w_{n}(\tau) satisfying nμ(Y0>wn(τ))τn\mu(Y_{0}>w_{n}(\tau))\to\tau. This gives wn=1k(τ/c1(k)n)ξw_{n}=\frac{1}{k}(\tau/c_{1}(k)n)^{-\xi}. Hence

g(k,T)=k1(2c1(k))ξkξ1.g(k,T)=k^{-1}\left(\frac{2}{c_{1}(k)}\right)^{-\xi}\approx k^{\xi-1}.

For kk\to\infty, c1(k)/kc_{1}(k)/k approaches (12α)1α(1-2^{-\alpha})^{-\frac{1}{\alpha}}, and hence,

g(k,T)(2ξ1)kξ1.g(k,T)\sim(2^{\xi}-1)k^{\xi-1}.

The extremal index θ\theta is calculated via

θ=limnμ(An(1)(wn))μ{Y0>wn}.\theta=\lim_{n\to\infty}\frac{\mu(A^{(1)}_{n}(w_{n}))}{\mu\{Y_{0}>w_{n}\}}.

The value of θ\theta is precisely the quantity in the right-hand side of equation (8.16), and we have

θ=12c1(k)(12αk12α)1α.\theta=\frac{1}{2c_{1}(k)}\left(\frac{1-2^{-\alpha k}}{1-2^{-\alpha}}\right)^{\frac{1}{\alpha}}. (8.21)

For kk\to\infty, the value of θ\theta approaches 1/2k1/2k. Thus, from the notations of Lemma 4.1 we obtain θ2/θ11k\theta_{2}/\theta_{1}\to\frac{1}{k}. That is, when we compare the extremal index for the process BnB_{n} relative to the extremal index for the process MnM_{n}.

To conclude the proof, it suffices to verify conditions Дq(un)\operatorname{\textrm{Д}}_{q}(u_{n}), Дq(un)\operatorname{\textrm{Д}}^{\prime}_{q}(u_{n}). This follows a standard approach, see [17]. Indeed the level sets of {Y0>u}\{Y_{0}>u\} are a finite union of intervals which shrink to a finite union of points as Y0(x)Y_{0}(x) approaches its maximum. Hence indicator functions of these sets are of bounded variation type (as necessary to allow verification of these two conditions). This completes the proof.

9. Numerical applications

9.1. Introduction to numerics for chaotic dynamical systems.

In this section common numerical techniques are used to estimate the extremal index and the generalized extreme value parameters for the extremal functionals. We provide a brief introduction on numerical simulation for chaotic dynamical systems; however, for the interested reader, we refer to [33, Chapter 9] for a nice summary of proven numerical techniques that produce reliable statistical properties in this setting.

Numerical simulations to support Theorem 5.1 and Theorem 7.1 are explored in the following sections. In general, every simulation utilizes the following procedure:

  • 1.

    Simulate the trajectories, under iterations of a given dynamical map, beginning with a set of random initial conditions chosen uniformly over XX.

  • 2.

    Take an observable function ϕ(x):X\phi(x):X\rightarrow\mathbb{R} on each trajectory as a measurable observation in \mathbb{R}.

  • 3.

    Block the observable trajectories from 2. and take the maximum over each block to calculate the sequence (Mn)(M_{n}).

  • 4.

    Compute the moving kk windowed exceedance or kk windowed time-average over each trajectory from 2.

  • 5.

    Block the trajectories from 4. with the same block length as 3. to calculate the sequence (Bn)(B_{n}).

  • 6.

    Perform maximum likelihood estimation of the generalized extreme value distribution parameters μ1\mu_{1}, σ1\sigma_{1}, and ξ1\xi_{1}, for the sequence (Mn)(M_{n}) and μ2(k)\mu_{2}(k), σ2(k)\sigma_{2}(k), and ξ2(k)\xi_{2}(k) for the sequence (Bn)(B_{n}).

  • 7.

    Repeat 4–7 for each window size kk.

Trapping of a trajectory near the a fixed point occurs in numerical simulation of piecewise C2C^{2} uniformly example maps, introduced in 3.2.1 (A), due to finite precision. Adding a small ε=O(108)\varepsilon=O(10^{-8}) perturbation to each step in the orbit allows our trajectory to continue to evolve under longer iterations of the interval map without affecting the statistical properties of extremes of the trajectory. Detailed studies on the effects of small additive error for these systems in the context of extreme value theory are described in [33, 7.5 and 9.7] and [11], and employed in many numerical investigations of extremes in dynamical systems such as [12, 11], for interval expanding maps, and [2, 9] for coupled systems of uniformly expanding maps.

9.2. Numerical applications in the Fréchet case.

9.2.1. Numerical applications of Theorem 5.1 (a): kk-exceedances, maximized at an invariant set

(A) Uniformly expanding maps of the interval: kk-exceedances maximized at repelling fixed point. We use the doubling map

T(x)=2xmod1T(x)=2x\mod 1

to simulate 500 different trajectories of length 10410^{4} beginning with 500 randomly chosen (from a uniform distribution) points and calculate the trajectory of the observable ϕ(x)=d(x,x0)1\phi(x)=d(x,x_{0})^{-1} where x0=0x_{0}=0 is the repelling fixed point. Note that with this choice of observable we can expect the sequence (Mn)(M_{n}) to follow a Fréchet distribution, ξ1=1\xi_{1}=1, from [27]. We estimate the generalized extreme value parameters using the procedure described in Section 9.1 with the kk-exceedance functional. From Theorem 5.1, noting that λ=2\lambda=2, we expect that

μ2(k)=2(k1)1μ1,σ2(k)=2(k1)1σ1,ξ2(k)=ξ1=1\mu_{2}(k)=2^{-(k-1)\cdot 1}\mu_{1},\quad\sigma_{2}(k)=2^{-(k-1)\cdot 1}\sigma_{1},\quad\xi_{2}(k)=\xi_{1}=1

We refer to figure 1 for an illustration of the numerical agreement we obtain for Theorem 5.1 (a). We report results for k=1k=1 to 1010, beyond this maximum likelihood estimation of the shape parameter becomes unreliable to compute. This makes computational sense since the kk-exceedance functional limits right-tail sampling, where sampling is vital to estimate the tail decay, by way of taking the minima over an increasing window length.

In terms of Lemma 4.1, our results indicate that g(k,T)=2(k1)1g(k,T)=2^{-(k-1)\cdot 1} by noting that θ2=θ1\theta_{2}=\theta_{1} holds for any kk. Finally, although we illustrate our results for shape ξ1=α=1\xi_{1}=\alpha=1, we remark that our numerical results, without alterations to the number of initial conditions or length of trajectories, also show agreement with Theorem 5.1 (a) for 1α>01\geq\alpha>0 with g(k,T)=2(k1)αg(k,T)=2^{-(k-1)\alpha}.

Refer to caption
  (a)               (b)
Refer to caption
  (c)               (d)
Figure 1. Numerical agreement with Theorem 5.1 (a, b) for the doubling map and (c, d) the coupled system of expanding maps with the kk-exceedance functional, Fréchet observable, and x0=0x_{0}=0 as the invariant fixed point.

(C) Coupled systems of uniformly expanding maps: kk-exceedances maximized at an invariant set.

We consider the 3-coupled system F:[0,1]M[0,1]MF:[0,1]^{M}\to[0,1]^{M}, described in Equation (3.1) of Section 3.2.3, of expanding maps of form Tβ(x)=βxmod1T_{\beta}(x)=\beta x\mod{1} for β=3\beta=3. In this setting, the uniform expanding factor will be λ=3(1γ)\lambda=3(1-\gamma).

For x=(x1,x2,x3)x=(x_{1},x_{2},x_{3}), we define (x1,x2,x3)=x12+x22+x32=x2\|(x_{1},x_{2},x_{3})\|=\sqrt{x_{1}^{2}+x_{2}^{2}+x_{3}^{2}}=\|x\|_{2}. We take ϕ(x)=d(x,L)1\phi(x)=d(x,L)^{-1} as our observable of this system, where LL is the repelling invariant hyperplane of synchrony

L={(x1,x2,x3):x1=x2=x3}.L=\{(x_{1},x_{2},x_{3}):x_{1}=x_{2}=x_{3}\}.

Note that the component of a point x=(x1,x2,x3)x=(x_{1},x_{2},x_{3}) orthogonal to LL is x=(x1x¯,x2x¯,x3x¯)x^{\perp}=(x_{1}-\bar{x},x_{2}-\bar{x},x_{3}-\bar{x}) where x¯=13i=13xi\bar{x}=\frac{1}{3}\sum^{3}_{i=1}x_{i}, so our observable reduces to ϕ(x)=(x2)1\phi(x)=(\|x^{\perp}\|_{2})^{-1} for α=1\alpha=1. From Theorem 5.1, we expect that,

μ2(k)=(3(1γ))(k1)1μ1,σ2(k)=(3(1γ))(k1)1σ1,ξ2(k)=ξ1=1.\mu_{2}(k)=(3(1-\gamma))^{-(k-1)\cdot 1}\mu_{1},\quad\sigma_{2}(k)=(3(1-\gamma))^{-(k-1)\cdot 1}\sigma_{1},\quad\xi_{2}(k)=\xi_{1}=1.

We refer to figure 1 for an illustration of the numerical results for this case which agree well with Theorem 5.1 (a).

9.2.2. Numerical inference using Theorem 5.1 (a): kk-exceedances, maximized at an invariant set

To fit an extremal distribution to data, one often assumes the form of the underlying distribution and fits a Generalized Extreme Value (GEV) for block maxima or a Generalized Pareto for over threshold excesses. From this one estimates the parameters (location, scale, and shape), often by some form of likelihood or moments estimation. Arguably, the most important parameter in extremal fitting is the shape parameter, as it describes the behavior in the tails of the distributions which correspond to the most impactful extreme events.

It is well-known, with research beginning in 1928 with Fisher and Tippett, that estimation of the true shape parameter is the most numerically difficult because of the length of time it takes the extremal distribution to converge in the tails. Advancements in shape parameter estimation, stemming from the work of Leadbetter (1983) and Smith (1982, 1987), have made reasonable estimations of limiting extremal distributions possible, especially for the bounded, Weibull case and the heavy-tailed, Fréchet case. As an example, the optimal rate of convergence to the asymptotic Fréchet extremal distribution was found by Smith (1987) to be O(nβ)O(n^{\beta}) for some β\beta which depends on the scaling sequence (an)(a_{n}) coming from normalizing thresholds un=y/an+bnu_{n}=y/a_{n}+b_{n}. As previously mentioned, for any fixed NN, the standard deviation σ\sigma plays the role of (aN)(a_{N}) by describing the normalization factor for the maxima (MN)(M_{N}) required based on the rate that (Mn)(M_{n})\rightarrow\infty as nn\rightarrow\infty. As a result, the theorems and lemmas for the kk-exceedance and kk-average functional observables which indicate a reduced σ\sigma for increasing kk, illustrate that the value of β\beta, and hence the convergence rate, is reduced as well.

We now illustrate how to apply Theorem 5.1 (a), checked for accuracy in the previous section, to obtain accurate estimates of the extreme value distribution for the kk-exceedance functional and show that these estimates are, for increasing kk values, more accurate than maximum likelihood estimation (MLE). We first explain how we numerically perform fits for Fréchet random variables from the doubling map, using the relationships derived, and illustrate that the fits we obtain using our method are as good as the MLE for smaller window sizes kk and outperform the MLE for increasing window sizes. The benefit of working with a simple numerical model initially, is that we can simulate much more data to obtain more reliable empirical estimates of return probabilities.

To apply the results of Theorem 5.1, we rely on the Ferro-Segers estimate of the extremal index to estimate the expansion parameter λ\lambda. Of course, in theory we know the value of λ\lambda apriori; however, this value will need to be estimated in the data-based case. We estimate the true probabilities and return times of extremes for increasing window size kk through empirical estimates from a Monte Carlo method, using a very long simulation of 5×1065\times 10^{6} iterations, and take the maxima over 5,0005,000 blocks of length 10310^{3}. We then compare the performance of maximum likelihood estimation and our method, using 500500 blocks of length 10310^{3}, against this long-run empirical estimate. We find that return time plots for window size k=1k=1 using our method is, by definition, the same as the maximum likelihood estimation. For increasing window sizes k5k\geq 5 we see an increasing improvement in return time estimates from our method compared with fits using the MLE, see figure 2(a,b) for illustrations corresponding to window size k=5k=5 and k=10k=10. For k10k\geq 10, we observe that the MLE completely fails to estimate a distributional fit, likely due to poor sampling in the tail for increasing kk window sizes.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
      (c)
Figure 2. Doubling Map with ϕ(x)=d(x,0)0.2\phi(x)=d(x,0)^{-0.2}: Return level plots for the block maxima of the kk-exceedance functional over time windows (a) k=5k=5 and (b) k=8k=8 applying the results of Theorem 5.1 (a) verses the maximum likelihood estimated parameters and (c) tail sampling plot and empirical tail probability plot showing tail decay estimate x0.18\propto x^{-0.18} corresponding to k0.18k\approx 0.18.

Instability of the shape parameter from maximum likelihood estimation as tail sampling becomes rarer contributes to the poor performance that we observe for window sizes k>10k>10. Sampling issues are illustrated by the distribution in figure 2(c) reflecting the low tail sampling which causes data-based estimates, like the MLE, to become unreliable. On the other hand, the tail decay remains as expected for the data, even for window sizes as large as k=20k=20. One can check this by fitting a power law to the empirical tail probabilities at high thresholds as seen in figure 2(c). As a result, we can be confident that the shape parameter we estimated in the original time-series provides a good representation of the windowed kk-exceedance for window size k20k\leq 20. To numerically check higher values of kk, we would need to simulate more data for proper estimates of the tail since for k>20k>20 we find the tail no longer has the expected power law decay.

9.2.3. Numerical applications of Theorem 7.1: kk-average maximized at a non-recurrent point

(A) Uniformly expanding maps of the interval: kk-average maximized at a non-recurrent point.

We now consider the average functional where ϕ(x)=d(x,x0)1\phi(x)=d(x,x_{0})^{-1} is maximized at x0=1/πx_{0}=1/\pi, a non-recurrent point for the doubling map T(x)=2xmod1T(x)=2x\mod 1. Before we state the results, we remark that the choice of α=1\alpha=1 holds numerical significance for this case. The authors in [12] found that for Gumbel observables defined by ψ(x)=log(d(x,x0))\psi(x)=-\log(d(x,x_{0})), estimates of bnb_{n} (for some fixed NN, this plays the role of the parameter μ\mu), are x0x_{0} density-inverse affected by noise, [33, Prop 7.5.1]. In other words, if x0x_{0} is highly recurrent, and therefore has a reasonably large local density111”local density” here refers to the measure of an ε\varepsilon ball about x0x_{0} on the map TT, where ε\varepsilon is on the order of the noise. then the noise in the system has a lower effect than it does for those points x0x_{0} that are non-recurrent (or sporadically recurrent). Choosing a shape parameter α=1\alpha=1 provides enough local density around the point x0=1/πx_{0}=1/\pi to ensure that our estimates for bnb_{n} align with the theoretical estimates222bnb_{n} is taken as in equation (9.1.8). An alternative to this is to simulate large amounts of data; however, this comes at an obvious additional numerical cost. Interestingly, estimates of ana_{n}, and hence estimates of the scale parameter σ\sigma for some fixed NN, are unaffected by choice of x0x_{0} and will hold for any 1α>01\geq\alpha>0 readily. Indeed, we see this in our investigation as well. We now state our results.

From Theorem 7.1, we expect that

μ2(k)=μ1k,σ2(k)=σ1k,ξ2(k)=ξ1=1.\mu_{2}(k)=\frac{\mu_{1}}{k},\quad\sigma_{2}(k)=\frac{\sigma_{1}}{k},\quad\xi_{2}(k)=\xi_{1}=1.

We refer to figure 3 for an illustration of the numerical results for this case which agree well with Theorem 7.1. In terms of Lemma 4.1, we remark that θ2(k)=θ1k\theta_{2}(k)=\frac{\theta_{1}}{k} as illustrated in figure 3(c) and therefore g(k,T)=kξ1g(k,T)=k^{\xi-1}, as expected.

Refer to caption
  (a)            (b)            (c)
Figure 3. Numerical agreement with Theorem 7.1 for the doubling map and with the kk-average functional, Fréchet observable, and x0=1/πx_{0}=1/\pi as the non-recurrent point.

9.3. Real-world rainfall data (Germany): Climate applications in the Fréchet case.

In this section, we explore the application of Theorem 5.1 (a) of the kk-exceedance functional and Lemma 4.1 (a) of the kk-average functional for observables with an extremal Fréchet distribution. Interestingly, we have found that although real-world rainfall extremes almost always follow a Fréchet distribution, general circulation models, like PlaSim give Weibull, or more rarely, Gumbel distributions of rainfall extremes. As a result, our investigation begins with an application of real-world rainfall data taken from the Deutscher Wetterdienst for stations throughout Germany [6].

Rainfall from Germany provides the ideal data as it is mostly stationary, in the sense that standard cycles have little effect on the total daily rainfall (on the order of ±2\pm 2 mm per day). Occurring at midlatitudes, the quantity of daily rainfall is also not impacted by climate-oceanic cycles, like the El Niño Southern Oscillation. In many cases, indeed as seen here, we expect daily rainfall extremes to follow a Fréchet distribution. Results on the kk-exceedance functional of window size kk, for example, can tell us how often we expect the minimum daily rainfall will be above some high threshold for all of the kk consecutive days. Quantity of available data on daily rainfall is limited by technology, geography, and time, among other things. With the inclusion of a windowed kk-average or kk-exceedance functional we also reduce the amount of stochasticity in the data, so measurements of extremes in the tails become rarer. This reduction in tail sampling affects our estimates of the parameters of the extreme value distribution. It is an important question to ask whether we can apply the previous scaling results to real-world Fréchet data, for example data representing daily rainfall extremes.

Scaling lemmas and theorems like the ones in this paper can provide apriori expectations on the extremal distributions of these functionals that allow us to estimate the shape parameter more accurately by applying the relation to estimates using the original time series (e.g. k=1k=1) where the rate of convergence is maximized over all kk. How? Since measurements of extremes in the tails become rarer by design of the windowed kk-average or kk-exceedance functional, provided the shape parameter does not change as a function of the window size, it would be optimal to estimate the shape parameter using the extremes of daily rainfall, keep this as the true shape parameter for the windowed average and minimum daily extremes, and then extrapolate the distribution by using the derived relationship between the location and scale parameter and the window size.

First, we show that the expected behavior of the location and scale parameter, with respect to window size, is verified for the rainfall data in both the kk-exceedance and kk-average functional. Then, we illustrate how to numerically perform fits using our results for the kk-exceedance functional. (We do not have enough data to prove empirical fits for the kk-average functional; however, the results would still hold if enough data were available.) Finally, we show that (1) our results are as good as the Maximum Likelihood Estimate (MLE) for smaller window sizes and (2) outperform the Maximum Likelihood Estimate for increasing window sizes. This holds true until the tail can no longer be represented by the time series for large window sizes.

We now check the numerical results shown above for simple, chaotic dynamical systems against the behavior of extremes for daily rainfall data from stations across Germany. Daily rainfall is maximized in a storm system; hence we expect the time-series to behave like an observable ϕ(x)\phi(x) maximized at an invariant set. Since we can reasonably assume that our daily rainfall extremes follow a Fréchet distribution, our expectation is that we are in the setting of Lemma 4.1 and Theorem 5.1 for the kk-average and kk-exceedance functional, respectively. To show this is true, we approximate the relationship between the maximum likelihood estimate of the generalized extreme value distribution corresponding to yearlyblock maxima. In total, we have 23 years (1995-2018) of daily rainfall data taken from 60 stations. That is, we have 23×365×60=503,70023\times 365\times 60=503,700 data points (approximately, with a small number of missing values replaced by interpolation). We perform maximum likelihood estimation of the yearly block maxima for window sizes k=1:10k=1:10, for the kk-average functional and k=1:7k=1:7 for the kk-exceedance functional. Window sizes larger than these values result in low tail sampling and hence, unstable shape estimates which heavily affect the accuracy of the MLE.

From Lemma 4.1, our expectation for the location of the kk-average functional is that

μ2(k)=g(k,T)μ1\mu_{2}(k)=g(k,T)\mu_{1}

where μ1\mu_{1} is the location parameter for the yearly block maximum of daily rainfall and μ2(k)\mu_{2}(k) is the location parameter for the yearly block maximum of the kk-average functional. If we let y=log(μ2(k))y=\log(\mu_{2}(k)) and x=log(k)x=\log(k), then if g(k,T)(O(kα))g(k,T)\sim(O(k^{-\alpha})), we expect a linear fit,

y=mx+log(μ1)y=mx+\log(\mu_{1})

with slope mm and intercept b=log(μ1)b=\log(\mu_{1}). The generalized linear model that is fit to the data using the relationship described by ()(\ast) results in m0.7m\approx-0.7 and b3.37=log(μ1)b\approx 3.37=\log(\mu_{1}), so that g(k,T)g(k,T) is estimated as,

g(k,T)km=k0.7g(k,T)\approx k^{m}=k^{-0.7}

We validate this result on σ2(k)\sigma_{2}(k) by showing that the relation,

σ2=g(k,T)σ1k0.7σ1\sigma_{2}=g(k,T)\sigma_{1}\approx k^{-0.7}\sigma_{1}

holds true for our estimated g(k,T)g(k,T). See figure 4(a,b) for an illustration of the fit results for the kk-average functional of yearly block maximum.

Refer to caption
  (a)                   (b)
Refer to caption
  (c)                   (d)
Figure 4. Rainfall in Germany location and scale relation for (a, b) kk-average functional and (c, d) kk-exceedance functional showing agreement with Lemma 4.1(a) and Theorem 5.1 (a), respectively.

From Theorem 5.1, our expectation for the location of the kk-exceedance functional is that

μ2(k)=λ(k1)αμ1\mu_{2}(k)=\lambda^{-(k-1)\alpha}\mu_{1}

where μ1\mu_{1} is the location parameter for the yearly block maximum of daily rainfall, μ2(k)\mu_{2}(k) is the location parameter for the yearly block maximum of the kk-exceedance functional, and λ\lambda is the expansion rate parameter. Letting y=log(μ2(k))y=\log(\mu_{2}(k)) and x=k1x=k-1, provided our model is correct, we expect a linear fit,

y=αlog(λ)x+log(μ1)y=-\alpha\log(\lambda)x+\log(\mu_{1})

where α=ξ\alpha=\xi and hence our slope is m=ξlog(λ)m=-\xi\log(\lambda) and our intercept b=log(μ1)b=\log(\mu_{1}). In practice, we do not know the expansion rate of the map, λ\lambda, because we do not know the underlying complex map (or flow) that completely describes the climate; however, we can estimate such a λ\lambda by the derived relationship θ=11λ\theta=1-\frac{1}{\lambda}, where θ\theta is the extremal index. Hence, if our model is correct, we expect 11em/ξ=θ1-\frac{1}{e^{-m/\xi}}=\theta and b=log(μ1)b=\log(\mu_{1}). We estimate the extremal index θ\theta from the time series by Ferro-Segers and compare the numerical estimation of extremal index we obtain by estimating λ\lambda from the slope mm of the generalized linear model. See figure 4(c,d) for an illustration of our numerical results. We find our model performs reasonably well with 11em0.351-\frac{1}{e^{-m}}\approx 0.35, (θ=0.34\theta=0.34, approximated by Ferro-Segers) and b4.08b\approx 4.08 (log(μ1)=4.34\log(\mu_{1})=4.34, approximated by the MLE). Similar results are found for the scale parameter.

Refer to caption
(a)
Refer to caption
(b)
Figure 5. Return level plots using Theorem 5.1 (a) verses maximum likelihood estimation for (a) k=5k=5 and (b) k=6k=6. Probabilities for the block maxima of the kk-exceedance functional for k=5k=5 (similarly, k=6k=6) correspond to the yearly maximum of 5 (similarly, 6) consecutive daily rainfall values occurring over some sequence of high rainfall thresholds. *95% confidence intervals do not include error from GLM fits for the function g(k,T)g(k,T).

We now estimate the return level plots of rainfall data from weather stations across Germany in the same way as Section 9.2. Recall that, in total, we have 23 years (1995-2018) of daily rainfall data taken from 60 stations. We use all available data to estimate the empirical tail probability estimates for maxima taken over block sizes of 1 year of daily rainfall, in total we have 1,380 block maxima. We repeat this for increasing window sizes k=1:8k=1:8, where k>8k>8 results in unreliable estimates as there is no longer enough available data to sample the tail. We then limit our number of block maxima to 100 block maxima, run maximum likelihood estimation to estimate the parameters of the generalized extreme value distribution, and use our method to estimate these parameters. We compare the resulting tail estimates (80-99.99% quantiles) to the empirical estimate of the tail probabilities. For window size k3k\geq 3, we obtain significantly better fits using our results, compared to maximum likelihood estimation. One clear reason is the fluctuation in the shape parameter that is observed in for the MLE.

9.4. Numerical Applications in the Weibull case.

Given that we benchmark the numerical theorems in this paper against maximum likelihood estimates of the parameters for the GEV, we limit this discussion to shape parameters ξ>0.5\xi>-0.5, where standard regularity properties, such as asymptotic normality, can be assumed on the maximum likelihood estimator [45]. We consider the observable ϕ(x)=Cd(x,S)α\phi(x)=C-d(x,S)^{-\alpha} for α<0\alpha<0 which guarantee the shape parameter ξ>0.5\xi>-0.5.

9.4.1. Numerical applications of Theorem 5.1 (b): kk-exceedances, maximized at an invariant set

Consider the doubling map T(x)=2xmod1T(x)=2x\mod 1 equipped with the observable ϕ(x)=1d(x,0)0.4\phi(x)=1-d(x,0)^{-0.4}. From Theorem 5.1 (b), we expect,

μ2(k)=σ1ξ1(2(k1)0.41)+μ1,σ2(k)=2(k1)0.4σ1,ξ2(k)=ξ1=1\mu_{2}(k)=\frac{\sigma_{1}}{\xi_{1}}(2^{(k-1)\cdot 0.4}-1)+\mu_{1},\quad\sigma_{2}(k)=2^{(k-1)\cdot 0.4}\sigma_{1},\quad\xi_{2}(k)=\xi_{1}=1

We refer to figure 6 for an illustration of the numerical agreement we obtain for Theorem 5.1 (b). In terms of Lemma 4.1 (b), we remark that θ2(k)=θ1\theta_{2}(k)=\theta_{1} for all kk in this example, as expected.

Refer to caption
  (a)               (b)
Figure 6. Numerical agreement with Theorem 7.1 for the doubling map and with the kk-average functional, Weibull observable, and x0=0x_{0}=0 as the invariant fixed point.

9.5. Simulated temperature data (PlaSim): Climate applications in the Weibull case.

Planet simulator (PlaSim) is a general circulation model (GCM) of intermediate complexity developed by the Universität Hamburg Meteorological Institute [34]. Like most atmospheric models, PlaSim is a simplified model derived from the Navier Stokes equation in a rotating frame of reference. A stripped version of the PlaSim model, known as PUMA, is a simple GCM with all the processes of PlaSim excluding moist processes. The model structure is described by five partial differential equations which allow for the conservation of mass, momentum, and energy.

The system of five differential equations are solved numerically with discretization given by a (variable) horizontal Gaussian grid and a vertical grid of equally spaced levels so that each grid-point has a corresponding latitude, longitude and depth triplet. (The default resolution is 32 latitude grid points, 64 longitude grid points and 5 levels.) At every fixed time step t and each grid point, the atmospheric flow is determined by solving the set of model equations through the spectral transform method which results in a set of time series describing the system; including temperature, pressure, zonal, meridional and horizontal wind velocity, among others. The resulting time series can be converted through the PlaSim interface into a readily accessible data file (such as netcdf) where further analysis can be performed using a variety of platforms. We refer to [34] for more information.

For our purposes, we only consider the extremes of the output of temperature simulated by PUMA with daily and yearly cycles removed before simulation at a single, randomly selected latitude and longitude coordinate pair and a ground-level depth.

We first consider the kk-exceedance functional so that from Theorem 5.1 (b), we expect,

σ2(k)=g(k,T)σ1,\sigma_{2}(k)=g(k,T)\sigma_{1},

where g(k,T)=λ(k1)ξg(k,T)=\lambda^{-(k-1)\xi}. We cannot assume λ\lambda is exactly an expansion parameter for the system; however, we do not need this assumption for estimation. In fact, following the strategy from previous sections, we fit a GLM to the log-transformed data given by,

log(σ2(k))=(k1)ξλ+log(σ1).\log(\sigma_{2}(k))=-(k-1)\xi\lambda+\log(\sigma_{1}).

By letting y=log(σ2(k))y=\log(\sigma_{2}(k)) and x=k1x=k-1, we can expect a linear fit of the data with slope m=ξλm=\xi\lambda and intercept b=log(σ1)b=\log(\sigma_{1}). Our numerical results indicate that (1) a linear fit is indeed a good approximation to this log-transformed data (e.g. the model structure for σ2(k)\sigma_{2}(k) follows what is expected) and (2) the intercept is within a small, O(103)O(10^{-3}) or less, tolerance of the expected intercept.

Next, we apply the estimated g(k,T)g(k,T) to the relationship expected for μ2(k)\mu_{2}(k) given by,

μ2(k)=σξ(g(k,T)1)+μ1\mu_{2}(k)=\frac{\sigma}{\xi}(g(k,T)-1)+\mu_{1}

We find good agreement with the theoretical results stated in Theorem 5.1. These results are illustrated in figure 7.

Refer to caption
  (a)                  (b)
Figure 7. Plasim generated yearly maximum temperature (a) scale and (b) location relation for kk-exceedance functional showing good numerical agreement to Theorem 5.1 (b).

9.6. Real-world temperature data (Germany): Climate applications in the Wei-bull case II

In contrast to the rainfall analysis using Fréchet results, real-world temperature data has a less evident relationship to the Weibull distance observable explored in the dynamical setting. In addition, daily and annual cycles govern the extremes of temperature and while there are ways of including such non-stationarity into a GEV model, an investigation of this magnitude is beyond the scope of this paper. We therefore focus on the application of the scaling Lemma 4.1 (b) and show that this result holds true for yearly maxima of hourly temperature measured at stations throughout Germany [7]. Recalling that Lemma 4.1 (b) assumes that a scaling unu_{n} and wnw_{n} are related by a function g(k,T)g(k,T), depending on the window length kk and the system TT, that the scale parameter σ2(k)\sigma_{2}(k) is given by,

σ2(k)=g(k,T)σ1(θ2(k)θ1)ξ\sigma_{2}(k)=g(k,T)\sigma_{1}\bigg{(}\frac{\theta_{2}(k)}{\theta_{1}}\bigg{)}^{\xi}

where ξ\xi is the shape parameter of the GEV for the original time-series (window length k=1k=1), σ2(k)\sigma_{2}(k) is the scale parameter and θ2(k)\theta_{2}(k) is the extremal index of the time-series with window length kk. Admittedly, the structure of the function g(k,T)g(k,T) is not known apriori; however, taking the natural logarithm of the relationship ()(\ast) can allow us to numerically linearize and estimate the log(g(k,T))\log(g(k,T)). The numerical estimate of the function g(k,T)g(k,T) then amounts to fitting a generalized linear model (GLM) on,

log(g(k,T))=log(σ2(k)σ1)ξlog(θ2(k)θ1).\log(g(k,T))=\log\bigg{(}\frac{\sigma_{2}(k)}{\sigma_{1}}\bigg{)}-\xi\log\bigg{(}\frac{\theta_{2}(k)}{\theta_{1}}\bigg{)}.

In practice, this also requires reliable estimates of θ2(k)\theta_{2}(k) and σ2(k)\sigma_{2}(k) for some sequence of k=1:jk=1:j to fit the GLM on jj points. As a result, our ability to accurately estimate log(g(k,T))\log(g(k,T)) depends on: the stability of the shape parameter for increasing kk; the convergence time of σ2(k)\sigma_{2}(k) for increasing values of kk, which is likely increasing with increasing kk due to lower tail sampling and increased dependence; and the convergence time of the estimate for θ2(k)\theta_{2}(k). Despite these difficulties, we are able to show that the scaling Lemma 4.1 (b) does indeed describe the behavior we observe in the GEV for the kk-exceedance functional taken over windows of length kk.

Calculating the maximum over yearly blocks, we obtain stable maximum likelihood estimates of the shape of the GEV for a sequence of windows of length k=1:10k=1:10, see figure 8 (a). Using the maximum likelihood estimates of the scale σ2(k)\sigma_{2}(k) for k=1:10k=1:10, and the Ferro-Segers estimate of the extremal index θ2(k)\theta_{2}(k), we obtain j=10j=10 points with which to estimate the function log(g(k,T))\log(g(k,T)) as a GLM of the data. Results for this estimate are illustrated in figure 8 (b).

The GLM representing log(g(k,T))\log(g(k,T)) is then given by some y(k)=b0+b1(k1)y(k)=b_{0}+b_{1}(k-1) so that,

g^(k,T)=exp{b0+b1(k1)}.\hat{g}(k,T)=\exp\{b_{0}+b-1(k-1)\}.

Finally, we use this estimate in the following relationship for the location parameter of the kk-exceedance functional,

μk=σ1ξ1(g^(k,T)1)+μ1\mu_{k}=\frac{\sigma_{1}}{\xi_{1}}(\hat{g}(k,T)-1)+\mu_{1}

where μ1\mu_{1}, σ1\sigma_{1}, and ξ1\xi_{1} are the maximum likelihood estimates of the location, scale, and shape, respectively, for the GEV of the yearly maximum of hourly temperature. Estimates from this application of the scaling Lemma 4.1 (b) and the maximum likelihood estimates of the location for increasing sliding windows kk of the kk-exceedance functional are illustrated in figure 8. We obtain good numerical agreements with the theoretical relationship.

Refer to caption
  (a)                 (b)
Figure 8. Germany temperature yearly maximum (a) scale and (b) location relation for kk-exceedance functional showing good numerical agreement to Lemma 4.1(b). Note that θ2(k)\theta_{2}(k) is not constant in this case, so that results on the trend of the location parameter found in Theorem 5.1 (b) are not expected to hold.

Although the results here support the appropriateness of the scaling Lemma 4.1 (b) to applications in temperature data, the practical use in replacing traditional likelihood estimates with the scale σ\sigma and location μ\mu relationships outlined in our lemma needs further investigation. In particular, one would need to consider convergence times of σ\sigma and μ\mu for increasing kk against the prediction horizon for the GLM representing g(k,T)g(k,T) for large kk. In any case, the results shown here support the strength of the scaling lemma, particularly in informing the shape parameter ξ\xi which remains constant for any kk. With such a result, one can also consider keeping ξ\xi constant and optimizing the profile likelihood for the scale, σ\sigma and location μ\mu parameters for any value of kk. We plan to do this investigation in the future for practical data application of these scaling laws.

10. Discussion of numerical results

The implications of these results reach further when one notes that return level plots of either the Generalized Pareto (GP) or the Generalized Extreme Value (GEV) distribution by design, average out the returns of consecutive extremes. In the return level plot for the GP, one must decluster the data by its extremal index prior to fitting a GP and then average the effects of the extremal index back into the model by,

P(X>x)=P(X>u)[1+ξ(xmuσ)]1/ξ=1mθP(X>x)=P(X>u)\bigg{[}1+\xi\bigg{(}\frac{x_{m}-u}{\sigma}\bigg{)}\bigg{]}^{-1/\xi}=\frac{1}{m\theta}

where xmx_{m} is the return level associated to the return period mm with probability 1/m1/m.

As an example, suppose that a location has an extremal index θ=1/2\theta=1/2 so that, on average, once an exceedance above a threshold xmx_{m} is observed, it is followed by another exceedance. Then, for the exceedance threshold xmx_{m} corresponding to probability 1365\frac{1}{365}, that is once per year, the probability occurring at xmx_{m} using the GP distribution after incorporating the extremal index is reduced to 1182.5\frac{1}{182.5}. Although this does translate to two occurrences of daily rainfall exceeding xmx_{m} in a single year, it loses all information on the consecutive property of these occurrences.

GEV fitting of the block maxima takes away our ability to interpret results on consecutive over threshold values occurring on the order of the time-series because the blocking method only allows us to keep a single maximum value over the block. As a result, the extremal index gets factored into the location μ\mu and scale σ\sigma parameters as stated in the proof of Lemma 4.1 and [4, Theorem 5.2, Page 96] which further reduces any practical information on consecutive extremes in the return level plots for the GEV.

On the other hand, the kk-exceedance and kk-average functional, by their structure, preserve information on kk consecutive extremes and extremes of kk averages, respectively.

Arguably the most impactful result of this investigation is the unchanging shape parameter observed in the Fréchet and Weibull cases for both the kk-average and kk-exceedance functionals. Such a result is enough to obtain more accurate estimates on the kk consecutive, and kk averaged, returns of extreme weather events as estimation of the tail behavior for any kk can be done using the original time-series (k=1k=1) which features more tail sampling and hence drastically more accurate estimates of the shape parameter. Some examples on the implications of results in the Fréchet case for more accurate real-world return time estimates of consecutive returns of rainfall extremes are provided and discussed for stations throughout Germany.

Although data from Germany are used for applications of our scaling laws to extreme weather due to the quantity and quality of data available from the Deutscher Wetterdienst, we emphasize that these scaling laws can be applied to a wide range of data. For example, the general Lemma 4.1 (a,b) can be applied to any data having extremes with limiting distributions of the Fréchet and Weibull types, respectively, while Theorem 5.1 can be applied in the non-stationarity setting with appropriate time-dependent adjustments to the GEV parameters. We plan a future numerical investigation using the scaling results introduced in Theorem 5.1 to non-stationary modeling of successive rainfall extremes using Australian rainfall data which has a known dependence on ENSO. Moreover, more accurate estimates of successive extremes in both the temperature and rainfall setting can be investigated by spatial pooling of data from weather stations using similar techniques as those in the time-dependent GEV setting.

References

  • [1] A. Boyarsky and P. Góra. Laws of chaos. Invariant measures and dynamical systems in one dimension. Probability and its Applications. Birkhäuser Boston, Inc., Boston, MA, 1997. ISBN: 0-8176-4003-7
  • [2] M. Carney, M. Holland and M. Nicol. Extremes and extremal indices for level set observables on hyperbolic systems. Nonlinearity 34 (2021), no. 2, 1136-1167.
  • [3] P. Collet. Statistics of closest return for some non-uniformly hyperbolic systems. Ergodic Theory and Dynamical Systems. 21, (2001), 401-420.
  • [4] S. Coles. An Introduction to Statistical Modeling of Extreme Values. Springer, 2004.
  • [5] M. Carvalho, A. C. M. Freitas, J. M. Freitas, M. Holland and M. Nicol. Extremal dichotomy for hyperbolic toral automorphisms. Dynamical Systems: An International Journal, 30, (4), (2015), 383-403.
  • [6] DWD Climate Data Center (CDC): Historical hourly station observations of precipitation for Germany, versionv006, available at:
    https://opendata.dwd.de/climate_environment/CDC/ observations_germany/
    climate/hourly/precipitation/historical/
    
  • [7] DWD Climate Data Center (CDC): Historical hourly station observations of 2m air temperature and humidity for Germany, version v006, available at:
    https: //opendata.dwd.de/climate_environment/CDC/observations_ germany/
    climate/hourly/air_temperature/historical/
    
  • [8] P. Embrechts, C. Klüpperlberg, and T. Mikosch. Modelling extremal events for insurance and finance. Applications of Mathematics (New York), 33, Springer-Verlag, Berlin, 1997.
  • [9] D. Faranda, H. Ghoudi, P. Guirard and S. Vaienti. Extreme value theory for synchronization of coupled map lattices, Nonlinearity, 31, (7), (2018), 3326-3358
  • [10] D. Faranda, J. Freitas, V. Lucarini, G. Turchetti and S. Vaienti. Extreme value statistics for dynamical systems with noise. Nonlinearity, 26, (2013), 2597.
  • [11] D. Faranda, M. Mestre, and G. Turchetti. Analysis of round off errors with reversibility test as a dynamical indicator. International Journal of Bifurcation and Chaos, 22, (9), (2012), 1250215.
  • [12] D. Faranda and S. Vaienti. Extreme value laws for dynamical systems under observed. noise. Physica D: Nonlinear Phenomena, 280-281 (2014), 86-94.
  • [13] R. A. Fisher and L. H. C. Tippett. Limiting forms of the frequency distribution of the largest or smallest member of a sample. Proc. Cambridge Philos. Soc., 24, (1928), 180-190.
  • [14] A. C. M. Freitas and J. M. Freitas. On the link between dependence and independence in extreme value theory for dynamical systems, Stat. Probab. Lett., 78, (2008), 1088-1093.
  • [15] A. Freitas, J. Freitas, F. Rodrigues, and J. Soares. Rare events for Cantor target sets, Preprint 2019, arXiv:1903.07200.
  • [16] J. Freitas, A. Freitas and M. Todd. Hitting Times and Extreme Value Theory. Probab. Theory Related Fields, 147, (3), (2010). 675–710.
  • [17] A. Freitas, J. Freitas, and M. Todd. Extremal Index. Hitting Time Statistics and periodicity. Adv. Math., 231, (5), (2012), 2626-2665.
  • [18] A. C. M. Freitas, J. M. Freitas, and M. Todd. Speed of convergence for laws of rare events and escape rates. Stochastic Process. Appl., 125, (4), (2015), 1653-1687.
  • [19] J. Freitas, N. Haydn and M. Nicol. Convergence of rare events point processes to the Poisson for billiards. Nonlinearity, 27, (2014), 1669-1687.
  • [20] J. Galambos. The Asymptotic Theory of Extreme Order Statistics. John Wiley and Sons, 1978.
  • [21] V. Galfi, V. Lucarini and J. Wouters. A large deviation theory based analysis of heat waves and cold spells in a simplified model of the general circulation of the atmosphere. Journal of Statistical Mechanics: Theory and Experiment. (2019).
  • [22] C. Gupta. Extreme-value distributions for some classes of non-uniformly partially hyperbolic dynamical systems. Ergodic Theory and Dynamical Systems. 30, (3), (2011), 757-771.
  • [23] C. Gupta, M. Holland and M. Nicol. Extreme value theory and return time statistics for dispersing billiard maps and flows, Lozi maps and Lorenz-like maps. Ergodic Theory and Dynamical Systems. 31, (2011), 1363–1390.
  • [24] C. Gupta, M. Nicol and W. Ott. A Borel-Cantelli lemma for non-uniformly expanding dynamical systems. Nonlinearity 23, (8), (2010), 1991–2008.
  • [25] M. Hirata. Poisson Limit Law for Axiom A diffeomorphisms. Ergodic Theory and Dynamical Systems. 13, (3), (1993), 533–556.
  • [26] M. Holland, M. Nicol, A. Török. Extreme value theory for non-uniformly expanding dynamical systems. Trans. Am. Math. Soc., 364, (2012), 661-688.
  • [27] M. Holland, M. Nicol, A. Török. Almost sure convergence of maxima for chaotic dynamical systems. Stochastic Process. Appl. 126, (10), (2016), 3145–3170.
  • [28] M. P. Holland, P. Rabassa, A. E. Sterk. Quantitative recurrence statistics and convergence to an extreme value distribution for non-uniformly hyperbolic dynamical systems. Nonlinearity, 29, (8), (2016).
  • [29] M. Holland, R. Vitolo, P. Rabassa, A. E. Sterk, H. Broer. Extreme value laws in dynamical systems under physical observables. Phys. D, 241, (2012), 497-513.
  • [30] G. Keller. Rare events, exponential hitting times and extremal indices via spectral perturbation. Dynamical Systems: An International Journal. 27, (1), (2012), 11–27.
  • [31] D. Kim. The dynamical Borel-Cantelli lemma for interval maps. Discrete Contin. Dyn. Syst. 17, (4), (2007), 891–900.
  • [32] Leadbetter, M. R., Lindgren, G., and Rootzen, H. Extremes and Related Properties of Random Sequences and Processes. Springer-Verlag, New York, 1983.
  • [33] V. Lucarini, D. Faranda, A.C. Freitas, J.M. Freitas, M. P. Holland, T. Kuna, M. Nicol, M. Todd, S. Vaienti, Extremes and Recurrence in Dynamical Systems, Pure and Applied Mathematics: A Wiley Series of Texts, Monographs, and Tracts, 2016.
  • [34] K. Fraedrich, E. Kirk and F. Lunkeit. PUMA Portable University Model of the Atmosphere. World Data Center for Climate (WDCC) at DKRZ. 2009.
  • [35] V. Lucarini, D. Faranda, J. Wouters J, and T. Kuna. Towards a General Theory of Extremes for Observables of Chaotic Dynamical Systems. Journal of Statistical Physics, 154, (3), (2014), 723–750.
  • [36] Mañé, R. Ergodic theory and differentiable dynamics. Translated from the Portuguese by Silvio Levy. Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], 8. Springer-Verlag, Berlin, 1987. xii+317 pp. ISBN: 3-540-15278-4
  • [37] H. Niederreiter. Random Number Generation and Quasi-Monte Carlo Methods, Soc. Industrial and Applied Mathematics, 1992.
  • [38] F. Ragone, J. Wouters and F. Bouchet. Computation of extreme heat waves in climate models using a large deviation algorithm. PNAS, vol 115, no. 1, 24-29, (2018)
  • [39] F. and Ragone and F. Bouchet. Computation of extreme values of time averaged observables in climate models with large deviation techniques. J. Stat. Phys. 179 (2020), no. 5-6, 1637-1665.
  • [40] F. Péne and B. Saussol. Back to balls in billiards. Comm. Math. Phys. 293, (3), (2010), 837-866.
  • [41] A. E. Sterk, M. P. Holland, P. Rabassa, H. W. Broer, R. Vitolo. Predictability of extreme values in geophysical models. Nonlinear Processes in Geophysics, 19, (2012), 529–539.
  • [42] M. Süveges, Likelihood estimation of the extremal index. Extremes, Springer, 10, (1-2), (2007), 41-55.
  • [43] Fan Yang. Rare event processes and entry times distribution for arbitrary null sets on compact manifolds, arxiv:1905.09956.
  • [44] L.-S. Young. Statistical properties of dynamical systems with some hyperbolicity. Ann. Math. 147, (1998), 585–650.
  • [45] R. L. Smith Maximum likelihood estimaion in a class of nonregular cases. Biometrika 72, (1985), 67-90