This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Wavelet estimation of nonstationary spatial covariance function

Yangyang Chen Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo Pedro A. Morettin Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo Ronaldo Dias Department of Statistics, Institute of Mathematics, Statisctics and Scientific Computing, University of Campinas Chang Chiann Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo
Abstract

This work proposes a new procedure for estimating the non-stationary spatial covariance function for Spatial-Temporal Deformation. The proposed procedure is based on a monotonic function approach. The deformation functions are expanded as a linear combination of the wavelet basis. The estimate of the deformation guarantees an injective transformation. Such that two distinct locations in the geographic plane are not mapped into the same point in the deformation plane. Simulation studies have shown the effectiveness of this procedure. An application to historical daily maximum temperature records exemplifies the flexibility of the proposed methodology when dealing with real datasets.

*Corresponding author.E-mail address: [email protected] (P. A. Morettin).

Keywords Spatio-temporal statistics, Nonstationary, Wavelets

1 Introduction

In geostatistics it is common to assume that the stochastic process is stationary, which means that the distribution does not change when shifted in the origin of the index set, and is isotropic, that is, the process is invariant under rotations around the origin. But in practice, the assumption of stationarity and isotropic are often difficult to hold in real applications; see for instance Guttorp et al. (1994), Schmidt and O’Hagan (2003), and Finley (2011).

If a bijective transformation ff that maps the sampling locations at a geographic domain (G plane) into space representations of the deformation domain (D plane) is built, the spatial correlation can be considered isotropic in the D plan. The injectivity of transformation is one of the most important requirements to guarantee that two distinct locations in the G plane are not mapped into the same point in the D plane. (See Damian et al. (2001)). A sufficient condition for ff being injective is that the Jacobian determinant of ff is non-zero.

To guarantee the injectivity of the mapping function, Choi and Lee (2000) suggested the box constraints for uniform cubic BB-splinespline deformation coefficients. Musse et al. (2001) enforced Jacobian positivity with a novel constrained hierarchical parametric model. Chun and Fessler (2009) provided sufficient conditions that restrict BB-splinessplines based deformation coefficients to ensure that the Jacobian of such transformation is positive which extended the conditions of Kim (2004). Although these methods ensure the positivity of Jacobian, deterministic models were used. Sampson and Guttorp (1992) introduced the topic with a stochastic model but no guarantee that the transformation based on thin-plate splines is injective. Damian et al. (2001) suggested a solution to guarantee the injectivity of the transformation in a stochastic model using a Bayesian approach.

In this paper, we propose a method for nonstationary covariance function modeling, the deformation guarantees the injectivity of the transformation and is based on the monotonic function approach. Note that the wavelet expansion is also used for the deformation.

The plan of the paper is as follows. In section 2 we briefly review wavelets and introduce the spatio-temporal model used in the paper. Then, the deformation proposed is presented in section 3. Some simulations are described in section 4 and an application to historical daily maximum temperature records is given in section 5. Finally, the paper ends with some comments in section 6.

2 Background

2.1 Wavelets

In wavelets analysis, a function fL2()f\in L^{2}(\mathbb{R}) can be approximated by a linear combination of binary dilations 2j2^{j} and dyadic translations k2jk2^{-j} of a function ϕ(t)\phi(t), called scaling function or father wavelet (which is used for capturing the smooth and the low-frequency of the data) and/or of a function ψ(t)\psi(t), called mother wavelet (which is used for capturing the details and the high-frequency of the data). Thus, a wavelet basis is composed of functions {ϕj,k(t)ψj,k(t),j,k}L2()\{\phi_{j,k}(t)\cup\psi_{j,k}(t),j,k\in\mathbb{Z}\}\in L^{2}(\mathbb{R}), where

ϕj,k(t)=2j/2ϕ(2jtk),\phi_{j,k}(t)=2^{j/2}\phi(2^{j}t-k), (1)

and

ψj,k(t)=2j/2ψ(2jtk).\psi_{j,k}(t)=2^{j/2}\psi(2^{j}t-k). (2)

One way of obtaining the scaling function is a solution of the equation

ϕ(t)=2klkϕ(2tk),\phi(t)=\sqrt{2}\sum_{k}l_{k}\phi(2t-k), (3)

and ψ(t)\psi(t) is obtained from ϕ(t)\phi(t) by

ψ(t)=2khkϕ(2tk),\psi(t)=\sqrt{2}\sum_{k}h_{k}\phi(2t-k), (4)

where

hk=(1)kl1kh_{k}=(-1)^{k}l_{1-k} (5)

and

lk=2ϕ(t)ϕ(2tk)𝑑t.l_{k}=\sqrt{2}\int_{-\infty}^{\infty}\phi(t)\phi(2t-k)dt. (6)

In multiresolution expansion, any function fL2()f\in L^{2}(\mathbb{R}) can be represented as

f(t)=kcJ0,kϕJ0,k(t)+jJ0kdj,kψj,k(t),f(t)=\sum_{k}c_{J_{0},k}\phi_{J_{0},k}(t)+\sum_{j\geq J_{0}}\sum_{k}d_{j,k}\psi_{j,k}(t), (7)

for some coarse scale J0J_{0}, usually taken as zero, for more details see Hardle et al. (1997).

There are many families of wavelets, such as the Mexican hat wavelet, Shannon wavelet, Daubechies wavelet, and Haar wavelet, the oldest and the simplest possible wavelet. In this paper, the well-known Mexican hat and Shannon wavelets are employed, they are continuous wavelets and are defined by analytical expressions. Figures 1 and 2 present the scaling (ϕ(t)\phi(t)) and wavelet (ψ(t)\psi(t)) functions from these wavelets.

Refer to caption
Figure 1: Wavelet function from the Mexican hat wavelet.
Refer to caption Refer to caption
Figure 2: Scaling (left) and wavelet (right) functions from the Shannon wavelet.

2.2 Spatio-Temporal Model

As the name suggests, spatio-temporal data are collected across space and time. Many fields, including geoscience, meteorology, neuroscience, and climate science, generate data that have both spatial and temporal components. For a comprehensive overview of spatio-temporal data, see Cressie and Wikle (2011). Let Zi(t)=Z(xi,t)Z_{i}(t)=Z(\textbf{x}_{i},t), i=1,,ni=1,...,n, t=1,,Tt=1,...,T, be a sample from a spatio-temporal process and the model proposed by Damian et al. (2001) will be considered for the underlying process:

Z(xi,t)=μ(xi,t)+ν(xi)Eτ(xi)+Eϵ(xi,t),Z(\textbf{x}_{i},t)=\mu(\textbf{x}_{i},t)+\sqrt{\nu(\textbf{x}_{i})}E_{\tau}(\textbf{x}_{i})+E_{\epsilon}(\textbf{x}_{i},t), (8)

where xi\textbf{x}_{i} denotes location and tt time, μ(xi,t)\mu(\textbf{x}_{i},t) represents the spatio-temporal mean, ν(xi)\nu(\textbf{x}_{i}) refers to the variance of the process observed at location xi\textbf{x}_{i}. Eτ(xi)E_{\tau}(\textbf{x}_{i}) is a Gaussian spatial process and Cov[Eτ(xi)E_{\tau}(\textbf{x}_{i}),Eτ(xj)E_{\tau}(\textbf{x}_{j})] \rightarrow 1 when xixj\textbf{x}_{i}\rightarrow\textbf{x}_{j} and EϵE_{\epsilon} represents measurement error and small-scale spatial variability.

We assume that μ(xi,t)μ(xi)\mu(\textbf{x}_{i},t)\equiv\mu(\textbf{x}_{i}) is constant in time. Then, the covariance of the process at the locations xi\textbf{x}_{i} and xj\textbf{x}_{j}, and time tt, is given by

Cov[Z(xi,t),Z(xj,t)]={ν(xi)ν(xj)Corr[Eτ(xi)Eτ(xj)],ifxixj,ν(xi)+σϵ2,ifxi=xj.\hbox{Cov}[Z(\textbf{x}_{i},t),Z(\textbf{x}_{j},t)]=\left\{\begin{array}[]{rll}\sqrt{\nu(\textbf{x}_{i})\nu(\textbf{x}_{j})}\hbox{Corr}[E_{\tau}(\textbf{x}_{i})E_{\tau}(\textbf{x}_{j})],&\hbox{if}&\textbf{x}_{i}\neq\textbf{x}_{j},\\ \nu(\textbf{x}_{i})+\sigma_{\epsilon}^{2},&\hbox{if}&\textbf{x}_{i}=\textbf{x}_{j}.\end{array}\right. (9)

Following Sampson and Guttorp (1992), the non-stationary spatial correlation of EτE_{\tau} is obtained by:

Corr[Eτ(xi)Eτ(xj)]=ρ(|f(xi)f(xj)|),\hbox{Corr}[E_{\tau}(\textbf{x}_{i})E_{\tau}(\textbf{x}_{j})]=\rho(|f(\textbf{x}_{i})-f(\textbf{x}_{j})|), (10)

where ρ\rho is a correlation function with unknown parameter(s) and ff is a transformation that maps the sampling locations at the G plane into space representations of the D plane, where the spatial correlation can be considered isotropic.

3 Deformation based on Monotonic Functions

3.1 Strictly Monotonic functions

A monotonic function is a function that is either entirely non-increasing or non-decreasing. An important property of this kind of function is that if a function is a strictly monotonic function, then it is injective on its domain. In this paper, we use the monotonic function introduced by Ramsay (1998). It is an arbitrary twice differentiable strictly monotonic function defined on an interval closed on the left.

The strictly monotonic function gg satisfies the conditions: ln(DgDg) is differentiable and D{ln(Dg)}=D2g/DgD\{\ln(Dg)\}=D^{2}g/Dg is square Lebesgue integrable, and DD refers to the operation of taking the derivative and D1D^{-1} means the integration operator. These conditions guarantee the function’s first derivative is smooth and bounded almost everywhere. The following theorem shows that gg is a general solution of a differential equation

Theorem 3.1

(Ramsay (1998)) Every monotonic function gg is representable as either

g(x)=C0+C1D1{exp(D1w(x))}g(x)=C_{0}+C_{1}D^{-1}\{\exp(D^{-1}w(x))\} (11)

or as a solution of the homogeneous linear differential equation

D2g(x)=w(x)Dg(x),D^{2}g(x)=w(x)Dg(x), (12)

where w(x)w(x) is a Lebesgue square integrable function and C0C_{0} and C1C_{1} are arbitrary constants.

Note that the function g(x)g(x) is strictly monotone increasing, Dg(x)=eW(x)Dg(x)=e^{W}(x), where W(x)=D1w(x)+logC1W(x)=D^{-1}w(x)+\log C_{1} were assumed, for more details see Ramsay and Silverman (2006).

3.2 Basis function expansions

By Theorem 3.1, instead of estimating the constrained function gg, the problem becomes computing the unconstrained function ww. Since ww can be positive or negative, we expand it as a linear combination of a set of wavelet basis functions.

As previously mentioned, two types of wavelets will be used:

  • Mexican hat, given by

    ψMex(x)=23π1/4(1x2)ex2/2.\psi^{Mex}(x)=\frac{2}{\sqrt{3}\pi^{1/4}}(1-x^{2})e^{-x^{2}/2}. (13)

    Then we can expand ω\omega as

    ωMex(x)=j=0Jk=02j1cj,kψj,kMex(x).\omega_{Mex}(x)=\sum_{j=0}^{J}\sum_{k=0}^{2^{j}-1}c_{j,k}\psi^{Mex}_{j,k}(x). (14)
  • Shannon wavelet, given by

    ψShan(x)\displaystyle\psi^{Shan}(x) =\displaystyle= sinc(x2)cos(3πx2)=2sinc(2x)sinc(x),\displaystyle\hbox{sinc}(\frac{x}{2})\cos(\frac{3\pi x}{2})=2\hbox{sinc}(2x)-\hbox{sinc}(x), (15)
    ϕShan(x)\displaystyle\phi^{Shan}(x) =\displaystyle= sinc(x),\displaystyle\hbox{sinc}(x), (16)

    where sinc(x)=sinπxπx\hbox{sinc}(x)=\frac{\sin\pi x}{\pi x}. And ω\omega can be expressed by

    ωShan(x)=c0ϕShan(x)+j=0Jk=02j1cj,kψj,kShan(x).\omega_{Shan}(x)=c_{0}\phi^{Shan}(x)+\sum_{j=0}^{J}\sum_{k=0}^{2^{j}-1}c_{j,k}\psi^{Shan}_{j,k}(x). (17)

3.3 Deformation based on Monotonic Functions

Let xi=(xi1,xi2)\textbf{x}_{i}=(x_{i1},x_{i2}), location ii in a G plane and yi=(yi1,yi2)\textbf{y}_{i}=(y_{i1},y_{i2}), its deformed location in a D plane, i=1,,ni=1,...,n. As there is no natural sort order in 2\mathbb{R}^{2}, it’s very difficult to get a bidimensional monotonic function. Adapting the generalized additive model introduced by Hastie e Tibshirani (1990) to estimate the deformation, we can consider each coordinate of the representation of the D plane as the response variable and the coordinates of the location in the G plane as predictor variables and thus we have an additive model as

yil=β0+j=12gjl(xij)+ϵil,l=1,2,y_{il}=\beta_{0}+\sum_{j=1}^{2}g^{l}_{j}(x_{ij})+\epsilon_{il},\ l=1,2, (18)

where β0\beta_{0} represent intercept and ϵil\epsilon_{il} indicates random effect.

Suppose that β0=0\beta_{0}=0 and random effect is null. Therefore, the representation in the D plane, yi=(yi1,yi2)\textbf{y}_{i}=(y_{i1},y_{i2}), can be written as

yi=[yi1yi2]=[g11(xi1)+g21(xi2)g12(xi1)+g22(xi2)],\textbf{y}_{i}=\left[\begin{array}[]{c}y_{i1}\\ y_{i2}\\ \end{array}\right]=\left[\begin{array}[]{c}g^{1}_{1}(x_{i1})+g^{1}_{2}(x_{i2})\\ g^{2}_{1}(x_{i1})+g^{2}_{2}(x_{i2})\\ \end{array}\right], (19)

where g1l(xi1)g_{1}^{l}(x_{i1}) and g2l(xi2)g_{2}^{l}(x_{i2}) are monotonic functions given by (11). Since g1l(xi1)g^{l}_{1}(x_{i1}) and g2l(xi2)g^{l}_{2}(x_{i2}) are strictly monotonic on the range [0,)[0,\infty), yily_{il} is also strictly monotonic in this range. Then, yily_{il} is a injective function. And we can write

yil=C10l+C11lD1exp{D1ωl1(xi1)}+C20l+C21lD1exp{D1ωl2(xi2)}.y_{il}=C^{l}_{10}+C^{l}_{11}D^{-1}\exp\{D^{-1}\omega^{l1}(x_{i1})\}+C^{l}_{20}+C^{l}_{21}D^{-1}\exp\{D^{-1}\omega^{l2}(x_{i2})\}. (20)

According to Theorem 3.1, C0C_{0} and C1C_{1} are arbitrary constants, suppose that C10l=C20l=0C^{l}_{10}=C^{l}_{20}=0 and C11l=C21l=1C^{l}_{11}=C^{l}_{21}=1. Therefore, (20) can be simplified to

yil=D1exp{D1ωl1(xi1)}+D1exp{D1ωl2(xi2)}.y_{il}=D^{-1}\exp\{D^{-1}\omega^{l1}(x_{i1})\}+D^{-1}\exp\{D^{-1}\omega^{l2}(x_{i2})\}. (21)

Note that ωl1(xi1)\omega^{l1}(x_{i1}) and ωl2(xi2)\omega^{l2}(x_{i2}) can be written in the form (14) or (17). Thus, the estimated deformations using Mexican hat and Shannon wavelets can be written as

y^ilMex=D1exp{ωMexl1(xi1)}+D1exp{ωMexl2(xi2)},\displaystyle\hat{y}^{Mex}_{il}=D^{-1}\exp\{\omega^{l1}_{Mex}(x_{i1})\}+D^{-1}\exp\{\omega^{l2}_{Mex}(x_{i2})\}, (22)
y^ilShan=D1exp{ωShanl1(xi1)}+D1exp{ωShanl2(xi2)},\displaystyle\hat{y}^{Shan}_{il}=D^{-1}\exp\{\omega^{l1}_{Shan}(x_{i1})\}+D^{-1}\exp\{\omega^{l2}_{Shan}(x_{i2})\}, (23)

for i=1,,n,l=1,2i=1,...,n,\ l=1,2.

3.4 Process Optimization

Let xi=(xi1,xi2)G2\textbf{x}_{i}=(x_{i1},x_{i2})\in G\subset\mathbb{R}^{2} coordinates of location ii in a G plane and suppose that a collection of nn locations ((x11,x12),,(xn1,xn2))((x_{11},x_{12}),...,(x_{n1},x_{n2})) was obtained. The optimization procedure is used to estimate the deformation yi\textbf{y}_{i}’s, the spatial variance ν=ν(x)\nu=\nu(\textbf{x}) and the parameters 𝜽\theta of the correlation function ρ\rho, maximizing the likelihood function of the samples zit=Z(xi,t)z_{it}=Z(\textbf{x}_{i},t), i=1,..,ni=1,..,n, t=1,..,Tt=1,..,T.

Assuming zt=(z1t,,znt)Nn(𝝁,Σ𝜼)\textbf{z}_{t}=(z_{1t},...,z_{nt})^{\prime}\sim N_{n}(\mbox{\boldmath{$\mu$}},\Sigma_{\mbox{\boldmath{$\eta$}}}), the likelihood function is

L(𝝁,Σ𝜼|z)\displaystyle L(\mbox{\boldmath{$\mu$}},\Sigma_{\mbox{\boldmath{$\eta$}}}|\textbf{z}) =\displaystyle= (2π)nT/2×det(Σ𝜼)T/2×exp{12t=1T(zt𝝁)TΣ𝜼1(zt𝝁)}\displaystyle(2\pi)^{-nT/2}\times det(\Sigma_{\mbox{\boldmath{$\eta$}}})^{-T/2}\times\exp\{-\frac{1}{2}\sum_{t=1}^{T}(\textbf{z}_{t}-\mbox{\boldmath{$\mu$}})^{T}\Sigma_{\mbox{\boldmath{$\eta$}}}^{-1}(\textbf{z}_{t}-\mbox{\boldmath{$\mu$}})\} (24)
=\displaystyle= (2π)nT/2×det(Σ𝜼)T/2×exp{12tr[Σ𝜼1(t=1T(ztz¯)(ztz¯)T+T(z¯𝝁)(z¯𝝁)T)]}\displaystyle(2\pi)^{-nT/2}\times det(\Sigma_{\mbox{\boldmath{$\eta$}}})^{-T/2}\times\exp\{-\frac{1}{2}tr[\Sigma^{-1}_{\mbox{\boldmath{$\eta$}}}(\sum_{t=1}^{T}(\textbf{z}_{t}-\bar{\textbf{z}})(\textbf{z}_{t}-\bar{\textbf{z}})^{T}+T(\bar{\textbf{z}}-\mbox{\boldmath{$\mu$}})(\bar{\textbf{z}}-\mbox{\boldmath{$\mu$}})^{T})]\}
=\displaystyle= (2π)nT/2×det(Σ𝜼)T/2×exp{T2tr[Σ𝜼1S]T2(z¯𝝁)TΣ𝜼1(z¯𝝁)},\displaystyle(2\pi)^{-nT/2}\times det(\Sigma_{\mbox{\boldmath{$\eta$}}})^{-T/2}\times\exp\{-\frac{T}{2}tr[\Sigma_{\mbox{\boldmath{$\eta$}}}^{-1}S]-\frac{T}{2}(\bar{\textbf{z}}-\mbox{\boldmath{$\mu$}})^{T}\Sigma_{\mbox{\boldmath{$\eta$}}}^{-1}(\bar{\textbf{z}}-\mbox{\boldmath{$\mu$}})\},

where z¯\bar{\textbf{z}} is the vector of means at each location, SS is sample spatial covariance matrix whose element Sij=t=1T(zitz¯i.)(zjtz¯j.)S_{ij}=\sum_{t=1}^{T}(z_{it}-\bar{z}_{i.})(z_{jt}-\bar{z}_{j.}) and the covariance matrix

Σ𝜼=(c,ν,𝜽)=(σij),\Sigma_{\mbox{\boldmath{$\eta$}}=(\textbf{c},\nu,\mbox{\boldmath{$\theta$}})}=(\sigma_{ij}), (25)

where

σij=vivjρ𝜽(|yiyj|),\sigma_{ij}=\sqrt{v_{i}v_{j}}\rho_{\mbox{\boldmath{$\theta$}}}(|\textbf{y}_{i}-\textbf{y}_{j}|), (26)

with the coordinates yi\textbf{y}_{i} and yj\textbf{y}_{j} of the representation of the D plane, the spatial variance νi\nu_{i} and νj\nu_{j} at locations ii and jj, respectively, and the parameters 𝜽\theta of the correlation function ρ\rho.

Without loss of generality, set 𝝁=0\mbox{\boldmath{$\mu$}}=\textbf{0}, the optimization problem becomes

max𝜼L(Σ𝜼|z).\underset{\mbox{\boldmath{$\eta$}}}{\hbox{max}}\ L(\Sigma_{\mbox{\boldmath{$\eta$}}}|\textbf{z}). (27)

The optimization procedure to estimate the covariance matrix (25) consists of the following steps:

  • 1.

    Let c0\textbf{c}^{0} be the initial values of c, coefficients of ω\omega and calculate 𝜸0=(ν0,𝜽0)\mbox{\boldmath{$\gamma$}}^{0}=(\nu^{0},\mbox{\boldmath{$\theta$}}^{0}) such that

    𝜸0=arg maxl(Σν,𝜽|z,c0),\mbox{\boldmath{$\gamma$}}^{0}=\hbox{arg max}\ l(\Sigma_{\nu,\mbox{\boldmath{$\theta$}}}|\textbf{z},\textbf{c}^{0}), (28)

    where l(Σν,𝜽|z,c0)=logL(Σν,𝜽|z,c0)l(\Sigma_{\nu,\mbox{\boldmath{$\theta$}}}|\textbf{z},\textbf{c}^{0})=\log L(\Sigma_{\nu,\mbox{\boldmath{$\theta$}}}|\textbf{z},\textbf{c}^{0}).

  • 2.

    Given 𝜸0\mbox{\boldmath{$\gamma$}}^{0} obtained in Step 1, calculate

    c1=arg maxl(c|z,Σν0,𝜽0),\textbf{c}^{1}=\hbox{arg max}\ l(\textbf{c}|\textbf{z},\Sigma_{\nu^{0},\mbox{\boldmath{$\theta$}}^{0}}), (29)

    where l(c|z,Σν0,𝜽0)=logL(c|z,Σν0,𝜽0)l(\textbf{c}|\textbf{z},\Sigma_{\nu^{0},\mbox{\boldmath{$\theta$}}^{0}})=\log L(\textbf{c}|\textbf{z},\Sigma_{\nu^{0},\mbox{\boldmath{$\theta$}}^{0}}).

  • 3.

    Replace c0\textbf{c}^{0} by c1\textbf{c}^{1}, return to Step 1.

  • 4.

    Repeat Step (2) - (3) until convergence.

4 Simulations

This section presents some simulations in order to assess the performance of the proposed procedure. These simulations are carried out in the cases that the deformations are generated by functions linear, quadratic, non-linear and wavelet.

The deformed coordinates yi\textbf{y}_{i}’s, i.e. representations of D plane at location ii, i=1,..,ni=1,..,n, are generated as follows:

  • Linear case:

    yi1=0.75xi1+xi2,\displaystyle y_{i1}=0.75x_{i1}+x_{i2}, (30)
    yi2=xi1+0.25xi2.\displaystyle y_{i2}=x_{i1}+0.25x_{i2}. (31)
  • Quadratic case:

    yi1=0.5(xi10.5)2+(xi20.5)+0.6,\displaystyle y_{i1}=-0.5(x_{i1}-0.5)^{2}+(x_{i2}-0.5)+0.6, (32)
    yi2=(xi10.5)0.5(xi20.5)2+0.6.\displaystyle y_{i2}=(x_{i1}-0.5)-0.5(x_{i2}-0.5)^{2}+0.6. (33)
  • Non-linear case:

    yi1=cos(angle)(xi10.5)+sin(angle)(xi20.5)+0.5,\displaystyle y_{i1}=\cos(\hbox{angle})(x_{i1}-0.5)+\sin(\hbox{angle})(x_{i2}-0.5)+0.5, (34)
    yi2=sin(angle)(xi10.5)+cos(angle)(xi20.5)+0.5,\displaystyle y_{i2}=-\sin(\hbox{angle})(x_{i1}-0.5)+\cos(\hbox{angle})(x_{i2}-0.5)+0.5, (35)

    where angle=2.5exp{(xi10.5)2(xi20.5)2}+3π/2\hbox{angle}=2.5\exp\{-(x_{i1}-0.5)^{2}-(x_{i2}-0.5)^{2}\}+3\pi/2.

  • Wavelet case:

    yi1=D1exp{j=01k=02j1cj,k11ψj,k(xi1)}+D1exp{j=01k=02j1cj,k12ψj,k(xi2)},\displaystyle y_{i1}=D^{-1}\exp\{\sum_{j=0}^{1}\sum_{k=0}^{2^{j}-1}c^{11}_{j,k}\psi_{j,k}(x_{i1})\}+D^{-1}\exp\{\sum_{j=0}^{1}\sum_{k=0}^{2^{j}-1}c^{12}_{j,k}\psi_{j,k}(x_{i2})\}, (36)
    yi2=D1exp{j=01k=02j1cj,k21ψj,k(xi1)}+D1exp{j=01k=02j1cj,k22ψj,k(xi2)},\displaystyle y_{i2}=D^{-1}\exp\{\sum_{j=0}^{1}\sum_{k=0}^{2^{j}-1}c^{21}_{j,k}\psi_{j,k}(x_{i1})\}+D^{-1}\exp\{\sum_{j=0}^{1}\sum_{k=0}^{2^{j}-1}c^{22}_{j,k}\psi_{j,k}(x_{i2})\}, (37)

    where ψ(t)\psi(t) are Mexican hat wavelets, c0,011=0.25c^{11}_{0,0}=0.25, c1,011=0.01c^{11}_{1,0}=0.01, c1,111=0.036c^{11}_{1,1}=-0.036, c0,012=0.37c^{12}_{0,0}=-0.37, c1,012=0.065c^{12}_{1,0}=0.065, c1,112=1.2c^{12}_{1,1}=-1.2, c0,021=0.032c^{21}_{0,0}=-0.032, c1,021=0.043c^{21}_{1,0}=-0.043, c1,121=1c^{21}_{1,1}=-1, c0,022=0.031c^{22}_{0,0}=-0.031, c1,022=0.11c^{22}_{1,0}=0.11, c1,122=0.19c^{22}_{1,1}=0.19.

The simulations consist of the following steps:

  • 1.

    Let nn = 50 sample locations generated in the geographical domain G = [0,1]×[0,1][0,1]\times[0,1], the coordinates (longitude and latitude) are generated following a uniform dispersion in (0,1)(0,1). Figure 3 shows the sampling locations, and the deformed locations are presented in Figure 4.

  • 2.

    The sample data (z = (z1,,zT\textbf{z}_{1},...,\textbf{z}_{T})) are simulated from a Gaussian random field with mean function μ(z)=0\mu(\textbf{z})=\textbf{0} and covariance function

    Cov(zi,zj)=νρ𝜽(|yiyj|)+σϵ2𝕀(zi=zj)=νexp{|yiyj|/θ}+σϵ2𝕀(zi=zj)\hbox{Cov}(\textbf{z}_{i},\textbf{z}_{j})=\nu\rho_{\mbox{\boldmath{$\theta$}}}(|\textbf{y}_{i}-\textbf{y}_{j}|)+\sigma_{\epsilon}^{2}\mathbb{I}_{(\textbf{z}_{i}=\textbf{z}_{j})}=\nu\exp\{-|\textbf{y}_{i}-\textbf{y}_{j}|/\theta\}+\sigma_{\epsilon}^{2}\mathbb{I}_{(\textbf{z}_{i}=\textbf{z}_{j})} (38)

    with parameters ν=1,θ=0.25\nu=1,\ \theta=0.25 and σϵ2=0.05\sigma_{\epsilon}^{2}=0.05. The length of each series was fixed at T=2048T=2048.

  • 3.

    Calculate the estimates of yi\textbf{y}_{i}, y^i=(y^i1,y^i2)\hat{\textbf{y}}_{i}=(\hat{y}_{i1},\hat{y}_{i2}) for i=1,,n=50i=1,...,n=50 and the parameters of the covariance function using the optimization procedure described in section 3.4 with Mexican hat and Shannon wavelets.

  • 4.

    Calculate mean squared error (MSE) of the estimates of the correlation matrix,

    MSE=1n2i=1nj=1n(corrijexp{|y^iy^j|/θ^})2,\hbox{MSE}=\frac{1}{n^{2}}\sum_{i=1}^{n}\sum_{j=1}^{n}(\hbox{corr}_{ij}-\exp\{-|\hat{\textbf{y}}_{i}-\hat{\textbf{y}}_{j}|/\hat{\theta}\})^{2}, (39)

    where corrij\hbox{corr}_{ij} is the element of the iith row and jjth column in the correlation matrix of z, i.e. is the correlation between zi\textbf{z}_{i} and zj\textbf{z}_{j}.

Refer to caption
Figure 3: Simulated locations.
Refer to caption
(a) Linear Deformation
Refer to caption
(b) Quadratic Deformation
Refer to caption
(c) Non-linear Deformation
Refer to caption
(d) Wavelet Deformation
Figure 4: Sampling locations (and regular grid) in G plane, in black, and deformed locations (and deformed grid) in plane D, in red.

Table 1 shows the estimated parameters of the covariance function and the MSE of the estimates of the correlation matrix in several fits. Observe that in the linear, non-linear, and wavelet case, the estimates that used JJ = 3 and Shannon wavelet were closer to the true values of the parameters, whereas in the quadratic case, they were closer when using the Mexican hat wavelet. Note that in three cases the lowest MSE was obtained using the Mexican hat wavelet, however with JJ = 2 in the linear case and JJ = 3 in the non-linear and wavelet case. Only in the quadratic case, the smallest MSE was obtained using the Shannon wavelet and JJ = 4.

Figures 5-12 show the estimated deformation and the scatter plots of the upper-diagonal entries of the estimated correlation matrices for the sample data, versus the true correlation matrices. The comparison of estimated correlation matrices is of great convenience to evaluate the estimation results. According to the MSE’s in Table 1, the scatter plot is more accurate for smaller MSE. Note that in the linear and wavelet case, the estimated deformations are very close to the true one.

Table 1: Estimated parameters of the covariance function and MSE of the correlation matrix of different fits.
True value of the parameters: ν\nu = 1 θ=0.25\theta=0.25 σϵ2\sigma^{2}_{\epsilon} = 0.05
Mexican hat Shannon
Linear Deformation
J = 2 J = 3 J = 4 J = 2 J = 3 J = 4
ν\nu 1.03540 1.04229 1.03757 1.03493 1.02579 1.03192
θ\theta 0.19197 0.18178 0.20000 0.20204 0.23975 0.21140
σϵ2\sigma^{2}_{\epsilon} 0.04117 0.03905 0.03771 0.04380 0.06739 0.04512
MSE 0.00173 0.00505 0.00290 0.00196 0.00409 0.00708
Quadratic Deformation
J = 3 J = 4 J = 5 J = 3 J = 4 J = 5
ν\nu 1.01631 1.03535 1.01537 1.04410 1.04491 1.00799
θ\theta 0.25529 0.26601 0.25003 0.26182 0.25878 0.25973
σϵ2\sigma^{2}_{\epsilon} 0.04272 0.04371 0.04082 0.04140 0.04032 0.04690
MSE 0.00309 0.00165 0.00195 0.00220 0.00124 0.00205
Non-linear Deformation
J = 3 J = 4 J = 5 J = 3 J = 4 J = 5
ν\nu 1.04400 1.04378 1.02342 1.02903 0.99906 1.04291
θ\theta 0.22920 0.21801 0.23132 0.25214 0.24048 0.23826
σϵ2\sigma^{2}_{\epsilon} 0.04117 0.03905 0.03771 0.04183 0.03259 0.03297
MSE 0.00387 0.00539 0.00485 0.00409 0.00436 0.00451
Wavelet Deformation
J = 3 J = 4 J = 5 J = 3 J = 4 J = 5
ν\nu 1.00827 1.02275 1.01732 1.02485 1.00019 1.00496
θ\theta 0.19449 0.19605 0.20986 0.22260 0.20877 0.21827
σϵ2\sigma^{2}_{\epsilon} 0.05629 0.04681 0.05375 0.06022 0.06107 0.06728
MSE 0.00179 0.00279 0.00218 0.00296 0.00222 0.00226
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 5: Estimated deformation (in blue) when J=2,3,4J=2,3,4 (from left to right) for linear deformation case using Mexican hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 6: Comparison of the estimated correlation matrix versus the true correlation matrix when J=2,3,4J=2,3,4 (from left to right) for linear deformation case using Mexican hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 7: Estimated deformation (in blue) when J=3,4,5J=3,4,5 (from left to right) for quadratic deformation case using Mexican hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 8: Comparison of the estimated correlation matrix versus the true correlation matrix when J=3,4,5J=3,4,5 (from left to right) for quadratic deformation case using Mexican hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 9: Estimated deformation (in blue) when J=3,4,5J=3,4,5 (from left to right) for non-linear deformation case using Mexican hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 10: Comparison of the estimated correlation matrix versus the true correlation matrix when J=3,4,5J=3,4,5 (from left to right) for non-linear deformation case using the Mexican hat (upper) and the Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 11: Estimated deformation (in blue) when J=3,4,5J=3,4,5 (from left to right) for wavelet deformation case using the Mexican hat (upper) and the Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 12: Comparison of the estimated correlation matrix versus the true correlation matrix when J=3,4,5J=3,4,5 (from left to right) for wavelet deformation case using the Mexican hat (upper) and the Shannon (bottom) wavelets.

5 Application

The dataset used to illustrate the optimization procedure described in Section 3.4 provides historical daily maximum temperature records. The data can be obtained directly from GHCN (Global Historical Climatology Network)-Daily, an integrated public database of NOAA (National Oceanic and Atmospheric Administration) using R package rnoaa. The dataset deals with daily maximum temperature records (in tenths of degrees Celsius) from the Midwestern states of the USA. The region consists of 12 states: North Dakota, South Dakota, Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota, Missouri, Nebraska, Ohio, and Wisconsin. We use only climate monitoring stations containing no missing data between 1980 and 1999 (inclusive). The 51 stations selected are shown in Figure 13. Figure 14 shows the maximum temperature recorded at the 4 sampling stations.

Refer to caption
Figure 13: Locations of the stations selected from Midwestern states of the USA.
Refer to caption
Figure 14: Maximum temperature recorded at the 4 sampling stations.

The results were obtained from the optimization procedure described in Section 3.4 and deformation (22), (23) with J = 2, 3, 4 using Mexican hat and Shannon wavelets. And the covariance function used has the formula of equation (38). Figure 15 presents the estimated deformation and Table 2 shows the MSE’s of the correlation matrix for the several fits. We can see that MSE are better for both Mexican hat and Shannon when J = 3. Same conclusions can be seen in the scatter plots (Figure 16). Note that the correlations are strong, maybe because the stations are from the same region.

Table 2: Estimated parameters and MSE of the correlation matrix for the several fits.
Mexican hat Shannon
J = 2 J = 3 J = 4 J = 2 J = 3 J = 4
ν\nu 0,06438 0,06299 0,07400 0,06826 0,06871 0,06746
θ\theta 5,11259 4,24264 5,86402 4,04539 4,11567 4,98565
σϵ2\sigma^{2}_{\epsilon} 0,00024 0,00016 0,00022 0,00015 0,00011 0,00019
MSE 0,00106 0,00103 0,00174 0,00146 0,00109 0,00122
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 15: Estimated deformation (in red) when J=2,3,4J=2,3,4 (from left to right) using Mexican Hat (upper) and Shannon (bottom) wavelets.
Refer to caption Refer to caption Refer to caption
Refer to caption Refer to caption Refer to caption
Figure 16: Comparison of the estimated correlation matrix versus the true correlation matrix when J=2,3,4J=2,3,4 (from left to right) using Mexican Hat (upper) and Shannon (bottom) wavelets.

6 Further comments

In this paper, we have presented a deformation method based on the monotonic function and expansion of wavelet basis that guarantees the injetivity of the transformation. Some simulations and an application were performed with the proposed optimization.

By the inverse function theorem, if the Jacobian determinant of ff, denoted by det(J(f))det(J(f)), is non-zero, the ff is an injective function. The deformation proposed in this paper satisfies the condition that the Jacobian determinant is invertible. When det(J(f))>0det(J(f))>0, the ff is considered as a bijective function, that is, the orientation preserving of the function ff is guaranteed. Then, some methods that guarantee the bijectivity of the transformation under the stochastic model considered could be explored in future studies.

Acknowledgements

Yangyang Chen acknowledges the support of FAPESP, through grant 2019/05917-6. Pedro Alberto Morettin, Ronaldo Dias and Chang Chiann acknowledge the partial support of FAPESP grant 2018/04654-9.

References

Choi, Y. and Lee, S. (2000) Injectivity conditions of 2D and 3D uniform cubic B-spline functions, Graphical Models, vol.62, Issue 6, 411–427.

Chun, S. Y. and Fessler, J. A. (2009) A simple regularizer for B-spline nonrigid image restration that encourages local invertibility, IEEE journal of selected topics in signal processing 3(1): 159-169.

Cressie, N. and Wikle, C. K. (2011) Statistic for Spatio-Temporal Data, New York: Wiley.

Damian, D., Sampson, P. and Guttorp, P. (2001) Bayesian estimation of semiparametric nonstationary spatial covariance structures, Environmentrics 12(2):161-178.

Guttorp, P., Meiring, W. and Sampson, P. D. (1994) A space-time analysis of ground-level ozone data, Environmetrics 5(3):241-254.

Hardle, W., Kerkyacharian, G., Picard, D., Tsybakov, A. (1997) Wavelets Approximations and Statistical Applications, New York: Springer.

Hastie, T.J. and Tibshirani, R.J. (1990) Generalized Additive Models, Chapman and Hall, New York.

Kim, J. (2004) Intensity Based Image Registration Using Robust Similarity Measure and Constrained Optimization: Applications for Radiation Therapy, Ph.D. dissertation, University of Michigan, Ann Arbor, MI.

Musse, O. Heitz, F. and Armspach, J. (2001) Topology preserving deformable image matching using constrained hierarchical parametric models, IEEE Trans. Image Process., vol. 10, Inssue 7, 1081–1093.

Ramsay, J. O. (1998) Estimating smooth monotone functions. Journal of Royal Statistical Society, Series B, 60(2), 365-375.

Ramsay, J. O. and Silverman, B. W. (2006) Functional Data Analysis, 2nd Edition, New York: Springer.

Sampson, P. and Guttorp, P. (1992) Nonparametric estimation of nonstationary spatial covariance structure, Jounal of the American Statistical Association 87(417): 108-119.

Schmidt, A. M. and O’Hagan, A. (2003) Bayesian inference for non-stationary spatial covariance structure via spatial deformations. Journal of the Royal Statistical Society, Series B, 65(3), 743-758.