This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Spatial Confidence Regions for Combinations of Excursion Sets in Image Analysis

Thomas Maullin-Sapey1, Armin Schwartzman2,3 and Thomas E. Nichols1

1Big Data Institute, Li Ka Shing Centre for Health Information and Discovery,
Nuffield Department of Population Health, University of Oxford, Oxford, UK;
2Division of Biostatistics, University of California, San Diego, CA, USA;
3Halicioğlu Data Science Institute, University of California, San Diego, USA
TMS: [email protected]. AS: [email protected]. TEN: [email protected]
Abstract

The analysis of excursion sets in imaging data is essential to a wide range of scientific disciplines such as neuroimaging, climatology and cosmology. Despite growing literature, there is little published concerning the comparison of processes that have been sampled across the same spatial region but which reflect different study conditions. Given a set of asymptotically Gaussian random fields, each corresponding to a sample acquired for a different study condition, this work aims to provide confidence statements about the intersection, or union, of the excursion sets across all fields. Such spatial regions are of natural interest as they directly correspond to the questions “Where do all random fields exceed a predetermined threshold?”, or “Where does at least one random field exceed a predetermined threshold?”. To assess the degree of spatial variability present, we develop a method that provides, with a desired confidence, subsets and supersets of spatial regions defined by logical conjunctions (i.e. set intersections) or disjunctions (i.e. set unions), without any assumption on the dependence between the different fields. The method is verified by extensive simulations and demonstrated using a task-fMRI dataset to identify brain regions with activation common to four variants of a working memory task.

Keywords: Spatial Statistics, Excursion Sets, Confidence Regions, Linear Model

1 Introduction

The collection and analysis of imaging data, modelled as a random field sampled on a spatial domain, is central to a broad range of scientific disciplines such as neuroimaging, climatology, and cosmology. Often, spatial data drawn from nn observations of a random field are combined to obtain an estimate, μ^n\hat{\mu}_{n}, of some spatially varying target function μ\mu. The target function μ\mu is defined on a closed spatial domain, SNS\subset\mathbb{R}^{N}, and maps spatial locations to some variable of interest in \mathbb{R} (e.g. seismic activity, infrared heat, changes in blood oxygenation in the brain, etc.). In such applications, it is typically desirable to ask “At which locations does the target function exceed a certain value, cc?”. For example, “Where has a significant change in temperature occurred?”; “Where do significant heat readings indicate the presence of celestial bodies?”. Such questions are addressed by the study of excursion sets, i.e. sets of the form {sS:μ(s)c}\{s\in S:\mu(s)\geq c\}.

A wealth of literature has previously focused on documenting the geometric and topological properties of excursion sets of random functions (c.f. Adler (1981), Torres (1994), Cao (1999), Azas and Wschebor (2009), Adler and Taylor (2009)). These properties include, for example, the Euler characteristic (a measure of topological structure), the Hausdorff dimension (indicating how fractal the set may be), Lipschitz Killing curvatures (describing high-dimensional volumes and areas) and Betti Numbers (describing how many stationary points appear above the threshold, cc) (Worsley (1996), Adler (1977), Adler et al. (2017), Pranav et al. (2019)). Much of this work, though, is limited to homogeneous stationary processes, i.e. those with zero mean and a spatial covariance structure which is dependent upon only distance. Only the most recent literature, including the present paper, concerns processes with a non-zero mean function and a potentially heterogeneous covariance function.

In addition, at present there is little published concerning how excursion sets may be compared to one another. In many applications, it is typical for a researcher to collect data from two or more study conditions and contrast the results to draw some meaningful inference (e.g. temperature changes in winter vs summer, heat measurements gathered using different imaging modalities, brain activity in healthy subjects across a range of tasks, etc.). Such settings are akin to having MM spatially aligned, potentially correlated, estimates, μ^n1,μ^n2,μ^nM:S\hat{\mu}_{n}^{1},\hat{\mu}_{n}^{2},...\hat{\mu}_{n}^{M}:S\rightarrow\mathbb{R}, of MM target functions, μ1,μ2,μM\mu^{1},\mu^{2},...\mu^{M}.

Often, pertinent questions that arise in such settings may be formally expressed using logical statements which involve conjunctions (logical ‘and’s), negations (logical ‘not’s) and disjunctions (logical ‘or’s). For example, a climatologist may ask where significant changes in temperature were observed during either winter or summer (i.e. “Where does μ1\mu^{1} or μ2\mu^{2} exceed cc?’). Alternatively, a neuroscientist may ask “At which locations in the brain did subjects exhibit signs of cognitive behaviour during one task and not another?” (e.g. “At which locations does μ1\mu^{1}, and not μ2\mu^{2}, exceed cc?”).

One approach to answering such questions is to employ null-hypothesis testing. A hypothesis test of a disjunction of nulls to assess evidence for a conjunction of alternative hypotheses can be conducted with an ‘intersection-union’ test. An intersection-union test rejects at level α\alpha when all of the individual nulls are rejected at level α\alpha. This approach has proven particularly useful for tests of bioequivalence (Berger and Hsu (1996), Berger (1997)), but does not consider the spatial aspect that is our focus here.

It is common practice for researchers to compare images corresponding to different study conditions via rudimentary visual inspection. (In fMRI, just a few recent examples of qualitative assessment of ‘overlap’ and ‘differences’ are Zhang et al. (2021), Dijkstra et al. (2021), Ferreira et al. (2021)). Such a subjective practice may lead to biased or misleading results. In recent years, much literature has been published on the theory of confidence regions, which focuses on quantifying spatial uncertainty by providing, with a fixed confidence, sub- and super-sets bounding the excursion set of a single target function (Sommerfeld et al. (2018), Bowring et al. (2019), Bowring et al. (2021)). However, the theory of confidence regions does not currently offer a means for investigating logical conjunctions and disjunctions of exceedance statements (i.e. intersections and unions of excursion sets). This work addresses this issue by first proposing a theory of spatial confidence regions for ‘conjunction inference’ (i.e. inference for logical statements involving conjunctions) in image analysis. Following this, the proposed method is extended to allow the investigation of statements containing disjunctions and negations.

Refer to caption
Figure 1: Illustration of the set c\mathcal{F}_{c} for a setting in which M=2M=2. Shown in (a)(a) are two spatially-varying target functions, μ1(s)\mu^{1}(s) (blue) and μ2(s)\mu^{2}(s) (green), overlaid and thresholded at the level cc. In (b)(b), the regions at which μ1(s)c\mu^{1}(s)\geq c and μ2(s)c\mu^{2}(s)\geq c are displayed as red and yellow circles, respectively, with the intersection set, c\mathcal{F}_{c}, illustrated in purple. In (c)(c), potential confidence regions, ^c+\hat{\mathcal{F}}_{c}^{+} (grey) and ^c\hat{\mathcal{F}}_{c}^{-} (orange), for c\mathcal{F}_{c} are illustrated.

Formally, our work primarily focuses on the intersection of MM excursion sets (i.e. the region over which the conjunction statement “μ1(s)c\mu^{1}(s)\geq c and μ2(s)c\mu^{2}(s)\geq c andand μM(s)c\mu^{M}(s)\geq c” holds). We denote ={1,,M}\mathcal{M}=\{1,...,M\}, and for each ii\in\mathcal{M} (i.e. for each study condition) define the ithi^{th} true excursion set and ithi^{th} estimated excursion set as:

𝒜ci={sS:μi(s)c} and 𝒜^ci={sS:μ^ni(s)c},\mathcal{A}^{i}_{c}=\{s\in S:\mu^{i}(s)\geq c\}\quad\text{ and }\quad\hat{\mathcal{A}}^{i}_{c}=\{s\in S:\hat{\mu}_{n}^{i}(s)\geq c\},

respectively. Next, we define c\mathcal{F}_{c} to be the intersection of the sets {𝒜ci}i\{\mathcal{A}_{c}^{i}\}_{i\in\mathcal{M}}, and ^c\hat{\mathcal{F}}_{c} to be the intersection of the sets {𝒜^ci}i\{\hat{\mathcal{A}}_{c}^{i}\}_{i\in\mathcal{M}}. In other words, the spatial region we are interested in, and our estimate of that region, are given by:

c={sS:miniμi(s)c}and^c={sS:miniμ^ni(s)c},\mathcal{F}_{c}=\bigg{\{}s\in S:\min_{i\in\mathcal{M}}\mu^{i}(s)\geq c\bigg{\}}\quad\text{and}\quad\hat{\mathcal{F}}_{c}=\bigg{\{}s\in S:\min_{i\in\mathcal{M}}\hat{\mu}_{n}^{i}(s)\geq c\bigg{\}},

respectively. An illustration of c\mathcal{F}_{c} in a setting in which M=2M=2 is provided by Fig. 1.

We note here that the above construction can be adjusted to allow cc to vary across study conditions (i.e. “μ1(s)c1\mu^{1}(s)\geq c_{1} andand μM(s)cM\mu^{M}(s)\geq c_{M}” for c1,,cMc_{1},...,c_{M}\in\mathbb{R}). Such an adjustment would involve simply translating each field upwards or downwards (e.g. substituting {μi}i\{\mu^{i}\}_{i\in\mathcal{M}} for {μic+ci}i\{\mu^{i}-c+c_{i}\}_{i\in\mathcal{M}}). For ease in the proceeding text, we shall assume cc is equal for all study conditions.

Our goal is to obtain confidence regions ^c+\hat{\mathcal{F}}^{+}_{c} and ^c\hat{\mathcal{F}}^{-}_{c} such that the below probability holds asymptotically;

[^c+c^c]=1α\mathbb{P}\bigg{[}\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c}\bigg{]}=1-\alpha (1)

for a predefined tolerance level α\alpha (α=0.05\alpha=0.05, for example). To generate such regions, we build on the theory of Sommerfeld et al. (2018), to show that under an appropriate definition of ^c+\hat{\mathcal{F}}^{+}_{c} and ^c\hat{\mathcal{F}}^{-}_{c}, the above probability may be approximated using quantiles of a well-defined random variable. Using a wild tt-bootstrapping procedure, we will then demonstrate that the relationship between this random variable and the above probability can be used to generate ^c+\hat{\mathcal{F}}^{+}_{c} and ^c\hat{\mathcal{F}}^{-}_{c} for any desired value of α\alpha. Unlike much of the previous literature on random excursion sets, in this work no assumption is made on the processes’ means or spatial covariance structure, and we do not make any assumption of between-‘study condition’ independence (i.e. {μ^ni}i\{\hat{\mu}_{n}^{i}\}_{i\in\mathcal{M}} may be correlated with one another).

In the following sections, we first describe the notation and assumptions upon which our theory relies. Following this, we provide the central theoretical result of this work, which relates Equation (1) to an exceedance statement for a well-defined random variable. Next, via the use of the wild tt-bootstrap, we detail how this result may be employed to obtain confidence regions for conjunction inference in the setting of linear regression modelling. Finally, we validate our results with simulations and a real data example based on fMRI data taken from the Human Connectome Project (Van Essen et al. (2013)). Proofs of the theory presented in this work, alongside further illustration and extensive simulation results, are provided as supplementary material.

2 Confidence Regions for Excursion Set Combinations

2.1 Notation

In the following sections, we shall need notation to describe the numerous possible low dimensional sub-manifolds which can arise from intersecting the boundaries of MM excursion sets. To do so, we first denote the power set of a finite set, AA, as 𝒫(A)\mathcal{P}(A), and define 𝒫+(A)\mathcal{P}^{+}(A) as 𝒫+(A)=𝒫(A){}\mathcal{P}^{+}(A)=\mathcal{P}(A)\setminus\{\emptyset\}. For each ii\in\mathcal{M}, we define 𝒜ci\partial\mathcal{A}_{c}^{i} as the level set {sS:μi(s)=c}\{s\in S:\mu^{i}(s)=c\}. Similarly, we define c\partial\mathcal{F}_{c} as the level set {sS:miniμi(s)=c}\{s\in S:\min_{i\in\mathcal{M}}\mu^{i}(s)=c\}. For a spatial set, BSB\subseteq S, its complement in SS is denoted Bc=SBB^{c}=S\setminus B, its topological closure is denoted B¯\overline{B}, its interior is denoted BB^{\circ} and its boundary is denoted B\partial B. In addition, for closed BB, the notation B1B^{1} and B1B^{-1} shall represent BB and Bc¯\overline{B^{c}}, respectively. We note here that the notation 𝒜ci\partial\mathcal{A}^{i}_{c} and B\partial B may conflict with one another. This conflict is resolved, however, by the assumptions of Section 2.2 which ensure that the boundaries of {𝒜ci}i\{\mathcal{A}_{c}^{i}\}_{i\in\mathcal{M}} and c\mathcal{F}_{c} are equal to the level sets {𝒜ci}i\{\partial\mathcal{A}_{c}^{i}\}_{i\in\mathcal{M}} and c\partial\mathcal{F}_{c} (at least locally, in the vicinity of c\partial\mathcal{F}_{c}, which is sufficient for our purposes).

We can now partition the level set c\partial\mathcal{F}_{c} into sub-manifolds using the set of possible intersections of the MM excursion set boundaries. For all α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}), we define the boundary segment αc\partial^{\alpha}\mathcal{F}_{c} as follows;

αc=c(iα𝒜ci)(jα(𝒜cj)c).\partial^{\alpha}\mathcal{F}_{c}=\partial\mathcal{F}_{c}\cap\bigg{(}\bigcap_{i\in\alpha}\partial\mathcal{A}_{c}^{i}\bigg{)}\cap\bigg{(}\bigcap_{j\in\mathcal{M}\setminus\alpha}(\partial\mathcal{A}_{c}^{j})^{c}\bigg{)}.

We note it is possible that, for some α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}), the boundary segment αc\partial^{\alpha}\mathcal{F}_{c} is empty (in fact, this is often the case in practice). This notation is crucial to the statement of Theorem 2.1 and is illustrated by Fig. 2.

Refer to caption
Figure 2: Annotated images of ‘overlapping’ excursion sets (top) with corresponding illustrations of boundary partitions (bottom) for settings in which the number of study conditions, MM, is equal to 22 (left) or 33 (right).

2.2 Assumptions

In this section, we outline and discuss the assumptions upon which our theory relies. Additional discussion of why each assumption is required, alongside illustration, is given in Supplementary Theory Section S2.

Assumption 2.2.1.

For each ii\in\mathcal{M}, there exists a bounded function σi:S+\sigma^{i}:S\rightarrow\mathbb{R}^{+} and positive sequence τn0\tau_{n}\rightarrow 0, such that the below Central Limit Theorem (CLT) holds:

{μ^ni(s)μi(s)τnσi(s)}sS,i𝑑{Gi(s)}sS,i\bigg{\{}\frac{\hat{\mu}^{i}_{n}(s)-\mu^{i}(s)}{\tau_{n}\sigma^{i}(s)}\bigg{\}}_{s\in S,i\in\mathcal{M}}\xrightarrow{d}\{G^{i}(s)\}_{s\in S,i\in\mathcal{M}}

where {Gi(s)}sS,i\{G^{i}(s)\}_{s\in S,i\in\mathcal{M}} is a well-defined multi-variate random field with continuous sample paths in SS and 𝑑\xrightarrow{d} represents convergence in distribution.

Remark.

This assumption introduces the notation σi(s)\sigma^{i}(s) and τn\tau_{n}. In typical applications, σi(s)\sigma^{i}(s) represents standard deviation across observations (for study condition ii, at spatial location ss) and τn\tau_{n} is a decreasing function of sample size. For conciseness in the following text, we now define {gi}i\{g^{i}\}_{i\in\mathcal{M}} and {g^ni}i\{\hat{g}_{n}^{i}\}_{i\in\mathcal{M}} as;

gi(s)=μi(s)cσi(s) and g^ni(s)=μ^ni(s)cσi(s).g^{i}(s)=\frac{\mu^{i}(s)-c}{\sigma^{i}(s)}\quad\text{ and }\quad\hat{g}_{n}^{i}(s)=\frac{\hat{\mu}_{n}^{i}(s)-c}{\sigma^{i}(s)}.
Assumption 2.2.2.

We assume that:

  1. (a)

    The functions {gi}i\{g^{i}\}_{i\in\mathcal{M}} are continuous on SS.

  2. (b)

    For nn large enough, the functions {g^ni}i\{\hat{g}^{i}_{n}\}_{i\in\mathcal{M}} are continuous on SS.

Remark.

In practice, assumptions of this form are already common in the imaging literature, as in many applications {μi}i\{\mu^{i}\}_{i\in\mathcal{M}}, {μ^ni}i\{\hat{\mu}_{n}^{i}\}_{i\in\mathcal{M}} and {σi}i\{\sigma^{i}\}_{i\in\mathcal{M}} are assumed to be continuous across space. For further remarks, see Supplementary Theory Section S2.2.

Assumption 2.2.3.

We assume that:

  1. (a)

    Every open ball around a point on the boundary c\partial\mathcal{F}_{c} has a non-empty intersection with (c)(\mathcal{F}_{c})^{\circ}. In addition, for all α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}) every open ball around a point in αc\partial^{\alpha}\mathcal{F}_{c} has a non-empty intersection with the set:

    𝒥cα:=(iα(𝒜ci)c)(jα(𝒜cj))\mathcal{J}_{c}^{\alpha}:=\bigg{(}\bigcap_{i\in\alpha}(\mathcal{A}^{i}_{c})^{c}\bigg{)}\cap\bigg{(}\bigcap_{j\in\mathcal{M}\setminus\alpha}(\mathcal{A}^{j}_{c})^{\circ}\bigg{)}

    where, if α\mathcal{M}\setminus\alpha is empty, the last term is defined to be equal to SS.

  2. (b)

    There is an open neighbourhood of c\partial\mathcal{F}_{c} over which the functions {gi}i\{g^{i}\}_{i\in\mathcal{M}} and, for nn large enough, {g^ni}i\{\hat{g}^{i}_{n}\}_{i\in\mathcal{M}} are C1C^{1} with finite, non-zero, gradients.

  3. (c)

    For every point scs\in\partial\mathcal{F}_{c} and α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}), if s(iα𝒜ci)s\in\partial\big{(}\bigcap_{i\in\alpha}\mathcal{A}_{c}^{i}\big{)} then the set (iα𝒜ci)\partial\big{(}\bigcap_{i\in\alpha}\mathcal{A}_{c}^{i}\big{)} partitions every sufficiently small open ball around ss into exactly two components, each of which is path connected.

Remark.

The above statements ensure that c\mathcal{F}_{c} is non-empty and has a well defined boundary which is equal to the level set c={sS:minigi(s)=0}\partial\mathcal{F}_{c}=\{s\in S:\min_{i\in\mathcal{M}}g^{i}(s)=0\}. All three statements ensure that no changes in topology occur at the level cc by requiring that c\partial\mathcal{F}_{c} does not contain plateaus (spatial regions of non-zero measure over which minigi(s)=0\min_{i\in\mathcal{M}}g^{i}(s)=0 exactly), local minima or maxima. In addition, each statement handles pathological cases which the other statements do not. For brevity, we do not list such cases here but instead provide further discussion in Supplementary Theory Section S2.3.

Each of the examples presented in Fig. 2 satisfy Assumptions 2.2.2 and 2.2.3 and highlight several features worthy of note. Firstly, we do not assume c\partial\mathcal{F}_{c} is a C1C^{1} curve, but rather piece-wise C1C^{1} (c.f. examples (d)(d) and (e)(e)). Secondly, the assumptions allow for the possibility that the excursion sets could be nested within one another (c.f. examples (c)(c) and (f)(f)). Thirdly, the assumptions permit the excursion sets to share common boundaries (see examples (b)(b), (c)(c) and (f)(f)). The last two observations are of practical relevance as, in many applications, it may be expected that the target functions for different study conditions exhibit a strong similarity. For instance, in the neuroimaging example, it may be expected that some anatomical regions of the brain display evidence of activation for all of the study conditions. We note here that there is no requirement that c\mathcal{F}_{c} be (globally) path-connected; c\mathcal{F}_{c} may consist of several disconnected components without violating the assumptions of this section.

2.3 Theory

In this section, we present the central result of this work, Theorem 2.1, alongside two corollaries, Corollary 2.1 and Corollary 2.2. To do so, we now define the nested sets ^c\hat{\mathcal{F}}^{-}_{c} and ^c+\hat{\mathcal{F}}^{+}_{c} by thresholding the statistic τn1minig^ni(s)\tau_{n}^{-1}\min_{i\in\mathcal{M}}\hat{g}^{i}_{n}(s):

^c:=^c(a)={sS:τn1minig^ni(s)a},\hat{\mathcal{F}}^{-}_{c}:=\hat{\mathcal{F}}^{-}_{c}(a)=\{s\in S:\tau_{n}^{-1}\min_{i\in\mathcal{M}}\hat{g}^{i}_{n}(s)\geq-a\},
^c+:=^c+(a)={sS:τn1minig^ni(s)+a},\hat{\mathcal{F}}^{+}_{c}:=\hat{\mathcal{F}}^{+}_{c}(a)=\{s\in S:\tau_{n}^{-1}\min_{i\in\mathcal{M}}\hat{g}^{i}_{n}(s)\geq+a\},

using some constant a+a\in\mathbb{R}^{+}. To find an appropriate value of aa, such that Equation (1) holds for a desired tolerance level (e.g. α=0.05\alpha=0.05), Theorem 2.1 is required.

Theorem 2.1.

Under the assumptions of Section 2.2, the below holds:

limn[^c+c^c]=[Ha],\lim_{n\rightarrow\infty}\mathbb{P}[\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c}]=\mathbb{P}[H\leq a], (2)

where the variable HH is defined as follows;

H=maxα𝒫+()(supsαc|miniα(Gi(s))|).H=\max_{\alpha\in\mathcal{P}^{+}(\mathcal{M})}\bigg{(}\sup_{s\in\partial^{\alpha}\mathcal{F}_{c}}\big{|}\min_{i\in\alpha}(G^{i}(s))\big{|}\bigg{)}.

Conceptually, the event {H>a}\{H>a\} may be thought of as the situation in which, at some point, ss, inside some boundary segment, αc\partial^{\alpha}\mathcal{F}_{c}, a large value was observed for the statistic |miniα(Gi(s))||\min_{i\in\alpha}(G^{i}(s))|. Therefore, Theorem 2.1 tells us the probability that the statement ^c+c^c\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c} is violated is directly determined by such points. If we can find a value of aa such that [Ha]=1α\mathbb{P}[H\leq a]=1-\alpha then asymptotically, the inclusion probability, [^c+c^c]\mathbb{P}[\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c}], will also equal 1α1-\alpha. In other words, if the (1α)th(1-\alpha)^{th} quantile of the distribution of HH is known, then confidence regions ^c\hat{\mathcal{F}}^{-}_{c} and ^c+\hat{\mathcal{F}}^{+}_{c} may be constructed which satisfy Equation (1). We shall take up the task of evaluating the quantiles of HH in Section 3.

Theorem 2.1 is key to generating confidence regions for conjunction inference (intersections of excursion sets). However, Theorem 2.1 may also be extended to allow investigation of other logical statements involving negations or disjunctions. Below we provide two corollaries, each of which is illustrated via a worked example. Corollary 2.1 extends Theorem 2.1 to account for logical negations, whilst Corollary 2.2 provides an equivalent statement for inference performed upon logical disjunctions (unions of excursion sets).

Example 2.1.

Suppose, given two target functions, μ1\mu^{1} and μ2\mu^{2}, we were interested in the region defined by the logical conjunction ‘μ1c\mu^{1}\geq c and μ2c\mu^{2}\leq c’ (i.e. “Where did a variable of interest exceed a threshold under one study condition and not under another?”). In the notation of Section 2.1, this region is readily seen to be given by (𝒜c1)1(𝒜c2)1(\mathcal{A}_{c}^{1})^{1}\cap(\mathcal{A}_{c}^{2})^{-1}.

To apply the result of Theorem 2.1, we first define μ~1=μ1\tilde{\mu}^{1}=\mu^{1} and μ~2=2cμ2\tilde{\mu}^{2}=2c-\mu^{2} and note that the logical statement ‘μ1c\mu^{1}\geq c and μ2c\mu^{2}\leq c’ is identical to the statement ‘μ~1c\tilde{\mu}^{1}\geq c and μ~2c\tilde{\mu}^{2}\geq c’. By using the latter statement, Theorem 2.1 may now be applied to obtain the desired confidence regions (assuming that an appropriate mechanism exists for evaluating the quantiles of HH). However, during this process, the limiting variables {Gi}i{1,2}\{G^{i}\}_{i\in\{1,2\}}, which correspond to the target functions {μi}i{1,2}\{\mu^{i}\}_{i\in\{1,2\}}, must be replaced by variables corresponding to the functions {μ~i}i{1,2}\{\tilde{\mu}^{i}\}_{i\in\{1,2\}}, {G~i}i{1,2}\{\tilde{G}^{i}\}_{i\in\{1,2\}}. By noting the definitions of {μ~i}i{1,2}\{\tilde{\mu}^{i}\}_{i\in\{1,2\}} and {G~i}i{1,2}\{\tilde{G}^{i}\}_{i\in\{1,2\}}, it can be seen that G~1=G1\tilde{G}^{1}=G^{1} and G~2=G2\tilde{G}^{2}=-G^{2}.

In summary, a procedure for generating confidence regions corresponding to the logical conjunction ‘μ1c\mu^{1}\geq c and μ2c\mu^{2}\leq c’ would be identical to that previously described, except that G2G^{2} would be substituted for G2-G^{2} and c\partial\mathcal{F}_{c} would be substituted for ((𝒜c1)1(𝒜c2)1)\partial((\mathcal{A}^{1}_{c})^{1}\cap(\mathcal{A}^{2}_{c})^{-1}). In general, this example may be extended to allow for inference to be performed upon arbitrary conjunctions of statements and negated statements. To achieve this, the definitions of c\mathcal{F}_{c}, ^c\hat{\mathcal{F}}^{-}_{c} and ^c+\hat{\mathcal{F}}^{+}_{c} must be extended as follows.

Corollary 2.1.

Let {δi}i\{\delta_{i}\}_{i\in\mathcal{M}} be an arbitrary sequence of integers in {1,1}\{-1,1\} and define c\mathcal{F}_{c} as c=i(𝒜ci)δi\mathcal{F}_{c}=\bigcap_{i\in\mathcal{M}}(\mathcal{A}_{c}^{i})^{\delta_{i}}. Similarly, extend the definition of the sets ^c+\hat{\mathcal{F}}^{+}_{c} and ^c\hat{\mathcal{F}}^{-}_{c} to:

^c+=i{sS:τn1g^ni(s)+a}δiand^c=i{sS:τn1g^ni(s)a}δi,\hat{\mathcal{F}}^{+}_{c}=\bigcap_{i\in\mathcal{M}}\{s\in S:\tau_{n}^{-1}\hat{g}^{i}_{n}(s)\geq+a\}^{\delta_{i}}\quad\text{and}\quad\hat{\mathcal{F}}^{-}_{c}=\bigcap_{i\in\mathcal{M}}\{s\in S:\tau_{n}^{-1}\hat{g}^{i}_{n}(s)\geq-a\}^{\delta_{i}},

respectively and, for each α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}), define αc\partial^{\alpha}\mathcal{F}_{c} as before. If c\mathcal{F}_{c} satisfies the assumptions of Section 2.2, then the below statement holds asymptotically:

limn[^c+c^c]=[maxα𝒫+()(supsαc|miniα(δiGi(s))|)a]\lim_{n\rightarrow\infty}\mathbb{P}[\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c}]=\mathbb{P}\bigg{[}\max_{\alpha\in\mathcal{P}^{+}(\mathcal{M})}\bigg{(}\sup_{s\in\partial^{\alpha}\mathcal{F}_{c}}\big{|}\min_{i\in\alpha}(\delta_{i}G^{i}(s))\big{|}\bigg{)}\leq a\bigg{]}
Example 2.2.

Suppose, given two target functions, μ1\mu^{1} and μ2\mu^{2}, interest lay in the logical disjunction of ‘μ1c\mu^{1}\geq c or μ2c\mu^{2}\geq c’ (i.e. “Where did at least one of the variables of interest exceed the threshold?”). The region of space over which this statement holds is given by 𝒜c1𝒜c2\mathcal{A}_{c}^{1}\cup\mathcal{A}_{c}^{2}, and shall be denoted 𝒢c\mathcal{G}_{c}.

To generate confidence regions for 𝒢c\mathcal{G}_{c}, we first assume that 𝒢c\mathcal{G}_{c} satisfies Assumptions 2.2.1-2.2.3. By Assumptions 2.2.2 and 2.2.3 and De Morgan’s law, it can be seen that 𝒢c=((𝒜c1)1(𝒜c2)1)1\mathcal{G}_{c}=((\mathcal{A}_{c}^{1})^{-1}\cap(\mathcal{A}_{c}^{2})^{-1})^{-1}. It must be noted that Assumptions 2.2.2 and 2.2.3 are necessary for this statement to hold due to the fact that (𝒜ci)1=(𝒜ci)c¯(\mathcal{A}_{c}^{i})^{-1}=\overline{(\mathcal{A}_{c}^{i})^{c}} and not (𝒜ci)c(\mathcal{A}_{c}^{i})^{c}. By employing Corollary 2.1, for a fixed value of α\alpha, confidence regions, ^c\hat{\mathcal{F}}^{-}_{c} and ^c+\hat{\mathcal{F}}^{+}_{c}, may be obtained which satisfy the following;

limn[(^c)c((𝒜c1)1(𝒜c2)1)c(^c+)c]=1α.\lim_{n\rightarrow\infty}\mathbb{P}[(\hat{\mathcal{F}}^{-}_{c})^{c}\subseteq((\mathcal{A}_{c}^{1})^{-1}\cap(\mathcal{A}_{c}^{2})^{-1})^{c}\subseteq(\hat{\mathcal{F}}^{+}_{c})^{c}]=1-\alpha.

By taking the closure of each of the sets in the above probability, and again via careful consideration of Assumptions 2.2.2 and 2.2.3, the below is obtained;

limn[(^c)1𝒢c(^c+)1]=1α.\lim_{n\rightarrow\infty}\mathbb{P}[(\hat{\mathcal{F}}^{-}_{c})^{-1}\subseteq\mathcal{G}_{c}\subseteq(\hat{\mathcal{F}}^{+}_{c})^{-1}]=1-\alpha.

In other words, the sets (^c)1(\hat{\mathcal{F}}^{-}_{c})^{-1} and (^c+)1(\hat{\mathcal{F}}^{+}_{c})^{-1} serve as confidence regions for disjunction inference on μ1\mu^{1} and μ2\mu^{2}. Note that, as in the previous example, great attention must paid to the signs which appear in front of the variables {Gi}i\{G^{i}\}_{i\in\mathcal{M}} in the definition of HH. As this example began by generating confidence regions for (𝒜c1)1(𝒜c2)1(\mathcal{A}_{c}^{1})^{-1}\cap(\mathcal{A}_{c}^{2})^{-1}, it can be seen that the variables G1G^{1} and G2G^{2} must be appended with negative signs (i.e. when applying the result of Corollary 2.1, both δ1\delta_{1} and δ2\delta_{2} were set to 1-1). This example may be expanded upon to obtain similar results for larger values of MM, as shown by Corollary 2.2.

Corollary 2.2.

Let {δi}i\{\delta_{i}\}_{i\in\mathcal{M}} be an arbitrary sequence of integers in {1,1}\{-1,1\} and define 𝒢c\mathcal{G}_{c} as 𝒢c=i(𝒜ci)δi\mathcal{G}_{c}=\bigcup_{i\in\mathcal{M}}(\mathcal{A}_{c}^{i})^{\delta_{i}}. Define the sets 𝒢^c+\hat{\mathcal{G}}^{+}_{c} and 𝒢^c\hat{\mathcal{G}}^{-}_{c} as:

𝒢^c+=(i{sS:τn1g^ni(s)a}δi)1and𝒢^c=(i{sS:τn1g^ni(s)+a}δi)1,\hat{\mathcal{G}}^{+}_{c}=\bigg{(}\bigcap_{i\in\mathcal{M}}\{s\in S:\tau_{n}^{-1}\hat{g}^{i}_{n}(s)\geq-a\}^{\delta_{i}}\bigg{)}^{-1}\quad\text{and}\quad\hat{\mathcal{G}}^{-}_{c}=\bigg{(}\bigcap_{i\in\mathcal{M}}\{s\in S:\tau_{n}^{-1}\hat{g}^{i}_{n}(s)\geq+a\}^{\delta_{i}}\bigg{)}^{-1},

respectively and, for each α𝒫+()\alpha\in\mathcal{P}^{+}(\mathcal{M}), define α𝒢c\partial^{\alpha}\mathcal{G}_{c} analogously to αc\partial^{\alpha}\mathcal{F}_{c} in the previous sections. If 𝒢c\mathcal{G}_{c} satisfies the assumptions of Section 2.2, then the below statement holds asymptotically:

limn[𝒢^c+𝒢c𝒢^c]=[maxα𝒫+()(supsα𝒢c|miniα(δiGi(s))|)a].\lim_{n\rightarrow\infty}\mathbb{P}[\hat{\mathcal{G}}^{+}_{c}\subseteq\mathcal{G}_{c}\subseteq\hat{\mathcal{G}}^{-}_{c}]=\mathbb{P}\bigg{[}\max_{\alpha\in\mathcal{P}^{+}(\mathcal{M})}\bigg{(}\sup_{s\in\partial^{\alpha}\mathcal{G}_{c}}\big{|}\min_{i\in\alpha}(-\delta_{i}G^{i}(s))\big{|}\bigg{)}\leq a\bigg{]}.

3 Spatial Conjunction Inference for the Linear Model

In this section, we build upon the work of Sommerfeld et al. (2018) and Bowring et al. (2019), in which methods for generating confidence regions were presented for a single target function, μ(s)\mu(s), derived from a linear regression model. In Section 3.1 we formally describe the spatially-varying Linear Model (LM) for MM study conditions and, in Section 3.2, we explain how the wild tt-bootstrap may be applied to estimate quantiles of the variable HH. Further implementation details, for when data are sampled from a discrete lattice rather than across a continuous space, alongside pseudocode, are provided in Supplementary Results Section S22. This section focuses solely on conjunction inference. However, the methods described here may equally be applied to perform inference on other logical statements via the use of the corollaries and examples presented in Section 2.3.

3.1 Model Specification

For each ii\in\mathcal{M} (i.e. for each study condition), we assume that the data follow a spatially-varying Linear Model (LM) defined at location sSs\in S as:

Yi(s)=Xiβi(s)+ϵi(s),ϵi(s)N(0,Σi(s)).Y^{i}(s)=X^{i}\beta^{i}(s)+\epsilon^{i}(s),\quad\epsilon^{i}(s)\sim N(0,\Sigma^{i}(s)). (3)

The known quantities in the model are; the (n×1)(n\times 1) vector of responses, Yi(s)Y^{i}(s), and the (n×p)(n\times p) design matrix, XiX^{i}. The unknown model parameters are; the (p×1)(p\times 1) vector of regression coefficients, βi(s)\beta^{i}(s), and the (n×n)(n\times n) random error covariance matrix, Σi(s)\Sigma^{i}(s). For each spatial location, sSs\in S, the Generalized Least Squares (GLS) estimator of the parameter vector βi(s)\beta^{i}(s), β^i(s)\hat{\beta}^{i}(s), is given by:

β^i(s)=(XiΣi(s)1Xi)1XiΣi(s)1Yi(s).\hat{\beta}^{i}(s)=(X^{i^{\prime}}\Sigma^{i}(s)^{-1}X^{i})^{-1}X^{i^{\prime}}\Sigma^{i}(s)^{-1}Y^{i}(s).

In this context, under the ithi^{th} study condition, at each spatial location ss, our interest lies in assessing whether linear relationships hold between the elements of the parameter vector βi(s)\beta^{i}(s). Such relationships may be expressed using a contrast vector LiL^{i} of dimension (p×1)(p\times 1). In the notation of the previous sections, the target and estimator functions for the spatially-varying LM are given as μi(s)=Liβi(s)\mu^{i}(s)=L^{i^{\prime}}\beta^{i}(s) and μ^ni(s)=Liβ^i(s)\hat{\mu}^{i}_{n}(s)=L^{i^{\prime}}\hat{\beta}^{i}(s) respectively, and τn1σi(s)\tau_{n}^{-1}\sigma^{i}(s) is the contrast standard error, given as:

τn1σi(s)=[Li(XiΣi(s)1Xi)1Li]12.\tau_{n}^{-1}\sigma^{i}(s)=\big{[}L^{i^{\prime}}(X^{i^{\prime}}\Sigma^{i}(s)^{-1}X^{i})^{-1}L^{i}\big{]}^{\frac{1}{2}}.

Whilst in the above notation we have presented {β^i(s)}i\{\hat{\beta}^{i}(s)\}_{i\in\mathcal{M}} as being estimated separately using MM different models, we note that nothing in our theory prevents the same model being used across all MM study conditions. For example, it is possible for the model matrices, {Yi(s)}i\{Y^{i}(s)\}_{i\in\mathcal{M}} and {Xi}i\{X^{i}\}_{i\in\mathcal{M}}, and parameter vector, {βi(s)}i\{\beta^{i}(s)\}_{i\in\mathcal{M}}, to be equal for all ii\in\mathcal{M}, with the only distinction between study conditions being represented by the contrast vector LiL^{i}. This observation is noteworthy as, in many applications, it is common for researchers to compile all study conditions into a single LM to obtain a heightened statistical power.

To apply Theorem 2.1 to the above LM, we now assume that Assumptions 2.2.1-2.2.3 hold. Such assumptions are commonly satisfied by standard conditions placed on the continuity and boundedness of the increments and moments of {βi(s)}sS\{\beta^{i}(s)\}_{s\in S} and {ϵi(s)}sS\{\epsilon^{i}(s)\}_{s\in S} (see Sommerfeld et al. (2018) for further detail). We note here that the covariance matrix, Σi(s)\Sigma^{i}(s), is usually unknown in practice, meaning the function τn1σi(s)\tau_{n}^{-1}\sigma^{i}(s) cannot be calculated. As demonstrated in Sommerfeld et al. (2018), however, Σi(s)\Sigma^{i}(s) can be replaced by any consistent estimator Σ^i(s)\hat{\Sigma}^{i}(s) and the assumptions given in Section 2.2 are still satisfied. Such estimation is common in the statistics literature and is typically achieved by assuming that some fixed correlation structure applies to Σi(s)\Sigma^{i}(s) (e.g. diagonal independence, auto-regressive, etc.) so that the number of independent variance parameters which must be estimated is small.

3.2 The Wild tt-bootstrap

In practice, the distribution of HH is unknown and must be estimated. In this section, for the model specification described in Section 3.1, we employ a wild tt-bootstrap resampling scheme to obtain the quantile, aa, of HH which satisfies [Ha]=1α\mathbb{P}[H\leq a]=1-\alpha.

To do so, we first define the (n×1)(n\times 1)-dimensional residual vector, RiR^{i}, for the LM of the ithi^{th} study condition (ii\in\mathcal{M}) as:

[R1i(s),,Rni(s)]=Ri(s)=[Σ^i(s)]12(Yi(s)Xiβ^i(s))[R^{i}_{1}(s),...,R^{i}_{n}(s)]^{\prime}=R^{i}(s)=[\hat{\Sigma}^{i}(s)]^{-\frac{1}{2}}(Y^{i}(s)-X^{i}\hat{\beta}^{i}(s))

The wild tt-bootstrap proceeds by, in each bootstrap instance, generating nn i.i.d. Rademacher random variables, {r1,,rn}\{r_{1},...,r_{n}\}, (random variables which take the values 1-1 and +1+1 with equal probability) independently of the data. For the ithi^{th} study condition, at spatial location ss, a new bootstrap sample is then obtained by multiplying the elements of the residual vector by the Rademacher variables (i.e. the new sample is given by {r1R1i(s),,rnRni(s)}\{r_{1}R^{i}_{1}(s),...,r_{n}R^{i}_{n}(s)\}). Once the boostrap sample has been generated, the standard deviation of the bootstrap sample, σ^,i(s)\hat{\sigma}^{*,i}(s), is calculated. A bootstrap instance of the variable GiG^{i}, G~i\tilde{G}^{i}, is now generated as follows;

G~i(s)=n12l=1nrlRli(s)σ^,i(s).\tilde{G}^{i}(s)=n^{-\frac{1}{2}}\sum_{l=1}^{n}\frac{r_{l}R^{i}_{l}(s)}{\hat{\sigma}^{*,i}(s)}.

A bootstrap instance of the variable HH, H~\tilde{H}, may then be computed using the bootstrap variables {G~i(s)}sS,i\{\tilde{G}^{i}(s)\}_{s\in S,i\in\mathcal{M}} as follows:

H~=maxα𝒫+()(supsα^c|miniα(G~i(s))|)\tilde{H}=\max_{\alpha\in\mathcal{P}^{+}(\mathcal{M})}\bigg{(}\sup_{s\in\partial^{\alpha}\hat{\mathcal{F}}_{c}}\big{|}\min_{i\in\alpha}(\tilde{G}^{i}(s))\big{|}\bigg{)} (4)

Quantiles of the distribution of HH may now be estimated empirically from the observed sampling distribution of H~\tilde{H}. Discussion of how the above supremum, taken across continuous space, is evaluated in practice is deferred to Supplementary Results Section S2.12.1. However, we do briefly note here that, as the location of the true boundary segment αc\partial^{\alpha}\mathcal{F}_{c} is in practice unknown, the boundary segment αc\partial^{\alpha}\mathcal{F}_{c} has been replaced by an estimate α^c\partial^{\alpha}\hat{\mathcal{F}}_{c}. It should be noted that, as a result of this substitution, our proposed method may not be applied when ^c\hat{\mathcal{F}}_{c} is empty.

Several strong references exist that argue and verify the correctness of using the wild tt-bootstrap for estimating quantiles of maxima distributions in the above manner. Of particular note are the works of Chang et al. (2017) and Bowring et al. (2019), in which the wild tt-bootstrap is proposed for the estimation of Gaussian maxima distributions and the generation of confidence regions, respectively. Discussion of the wild tt-bootstrap in an imaging context may also be found in Telschow and Schwartzman (2021). Other significant references which serve as precursors to this work include Chernozhukov et al. (2013) and Sommerfeld et al. (2018), in which the Gaussian multiplier bootstrap was investigated for estimating Gaussian maxima distributions and generating confidence regions, respectively. More general introductions to such bootstrapping procedures may be found in Wu (1986), Efron and Tibshirani (1994) and Hesterberg (2014).

In the form that it has been presented here, the wild tt-bootstrap requires that {Gi}i\{G^{i}\}_{i\in\mathcal{M}} be symmetric. Further, as noted above, the wild tt-bootstrap has been extensively verified for estimating maxima distributions only in settings in which {Gi}i\{G^{i}\}_{i\in\mathcal{M}} are Gaussian. Here we stress that, whilst the theory presented in Section 2 makes no assumptions on the distribution of {Gi}i\{G^{i}\}_{i\in\mathcal{M}}, the bootstrap theory presented throughout Section 3 requires {Gi}i\{G^{i}\}_{i\in\mathcal{M}} to be Gaussian. In order to utilise the results of Section 2 when the random variables {Gi}i\{G^{i}\}_{i\in\mathcal{M}} are not Gaussian, alternative methods must be employed for estimating the quantiles of HH.

4 Simulations

4.1 Simulation Settings

To assess the correctness and performance of the method, a range of simulations were conducted using synthetic data. Each simulation was designed to investigate how the empirical coverage (i.e. the proportion of simulation instances in which the inclusion ^c+c^c\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c} held) was affected by various secondary factors of practical interest. In this section, we describe three simulations designed to investigate how the methods’ performance was influenced by; (1)(1) the degree of overlap between excursion sets, (2)(2) the presence of correlation between the noise fields for different study conditions, and (3)(3) the magnitude of the spatial rate of change of the signal.

In all simulations, for ii\in\mathcal{M}, the model used to generate the synthetic data took the form;

Yi(s)=μi(s)+ϵi(s),ϵi(s)N(0,σi(s)In).Y^{i}(s)=\mu^{i}(s)+\epsilon^{i}(s),\quad\epsilon^{i}(s)\sim N(0,\sigma^{i}(s)I_{n}). (5)

where nn was allowed to vary from 4040 to 500500 (in increments of 2020). The aim, across all simulations, was to assess the performance of the method when employed for conjunction inference on the true signal {μi}i\{\mu^{i}\}_{i\in\mathcal{M}} (i.e. when asked the question “Where do all of the {μi}i\{\mu^{i}\}_{i\in\mathcal{M}} exceed a predefined threshold?”).

The mechanism used to generate the true signal, {μi}i\{\mu^{i}\}_{i\in\mathcal{M}}, varied across simulations. In Simulations 1 and 2, {μi}i\{\mu^{i}\}_{i\in\mathcal{M}} were generated using two binary images of circles and squares, respectively, positioned in a ‘Venn diagram’ arrangement. Each binary image was scaled by a predefined amount and then smoothed using an isotropic Gaussian filter with a Full-Width Half Maximum (FWHM) of 55 pixels. In Simulation 33, μ1\mu^{1} and μ2\mu^{2} were simulated as a horizontal and vertical linear ramp, respectively, with predefined gradients.

Each simulation was performed twice, once using noisier ‘low-Signal-to-Noise Ratio (SNR)’ synthetic data and again using less noisy ‘high-SNR’ synthetic data. To generate the high-SNR synthetic data for Simulations 11 and 22, all simulated {μi}i\{\mu^{i}\}_{i\in\mathcal{M}} were scaled by a factor of 33 prior to smoothing. This data generation process is identical to that employed in Bowring et al. (2019), in which it is noted that synthetic data generated using these parameters strongly resembles that of an fMRI analysis when voxels are of dimension 22mm3. In Simulation 33, for the high-SNR data, the linear ramps were initially simulated with a gradient of 88 per 5050 pixels. Across all simulations, the low-SNR data was generated by reducing the signal magnitude of the high-SNR data by a factor of 44. In all simulations, thresholds of c=1/2c=1/2 and c=2c=2 were employed for the low-SNR data and high-SNR data, respectively.

As Simulation 11 aimed to investigate the method’s performance as the overlap between excursion sets increased, in this simulation, the distance between the centre of the circles was varied, ranging from 0 pixels (full overlap) to 5050 pixels (barely overlapping) in increments of 22 pixels. Similarly, as Simulation 22 aimed to assess the effect of correlation between ϵ1\epsilon^{1} and ϵ2\epsilon^{2}, in this simulation, this correlation was varied between 1-1 and 11 in increments of 0.10.1 (in all other simulations, the noise fields were generated independently). As Simulation 33 served to assess how sensitive the empirical coverage was to the rate of change of μ1\mu^{1} and μ2\mu^{2} over space, in this simulation, the initial ramp gradients were multiplied by a factor of kk, where kk was varied from 0.250.25 to 1.751.75 in 0.050.05 increments.

All simulation results were obtained as averages taken across 25002500 simulation instances and compared to the nominal coverage using 95%95\% binomial confidence intervals. In all simulation instances, the image dimensions were (100×100)(100\times 100) pixels, confidence regions were computed for a tolerance level of α=0.05\alpha=0.05 using 50005000 bootstrap realizations, σi(s)\sigma^{i}(s) was computed using the ordinary least squares estimator and τn\tau_{n} was computed as τn=n0.5\tau_{n}=n^{-0.5}. In each simulation instance, the assessment of whether the inclusion statement, ^c+c^c\hat{\mathcal{F}}^{+}_{c}\subseteq\mathcal{F}_{c}\subseteq\hat{\mathcal{F}}^{-}_{c}, held was verified using the interpolation-based methods of Bowring et al. (2019). In this approach, the inclusion statement was deemed to hold if, and only if, the sets of pixels which were identified as ^c\hat{\mathcal{F}}_{c}^{-}, c\mathcal{F}_{c} and ^c+\hat{\mathcal{F}}_{c}^{+} were appropriately nested and, in addition, interpolation along the boundary c\partial\mathcal{F}_{c} confirmed that no violations of the inclusion statement had occurred.

A further six simulations which investigated the influence of other secondary factors of interest were also conducted. The secondary factors considered by these simulations include; the presence of spatial structure in the noise, the effect of {1}c\partial^{\{1\}}\mathcal{F}_{c} and {2}c\partial^{\{2\}}\mathcal{F}_{c} having differing lengths, the effect of ϵ1\epsilon^{1} having a much larger variance than ϵ2\epsilon^{2} and the impact of varying the value of MM. A full discussion of these simulations is deferred to Supplementary Results Section S44. In addition, in keeping with the previous literature, tolerance levels of α=0.1\alpha=0.1 and α=0.2\alpha=0.2, as well as the substitution of c\partial\mathcal{F}_{c} for ^c\partial\hat{\mathcal{F}}_{c} in Equation (4), were also considered for simulation (c.f. Sommerfeld et al. (2018), Bowring et al. (2019), Bowring et al. (2021)). Again, the results of these simulations may be found in the Supplementary Results document in Sections S55 and S66.

4.2 Simulation Results

For a majority of the simulations conducted, the empirical coverage estimates were tightly distributed around the expected nominal coverage. In particular, for the high-SNR data, across all simulations, the 95%95\% binomial confidence intervals consistently captured the nominal coverage at the predicted rate. However, when the simulations employed the low-SNR data, it can be seen that the confidence regions produced by the method were conservative when the data were generated under certain unfavourable conditions.

Refer to caption
Figure 3: Simulation results generated using the low-SNR synthetic data. Nominal coverage is shown as a dashed line, with a corresponding binomial confidence interval also shaded. All data points are averages taken across 25002500 simulation instances.

For example, in Fig. 3, it can be seen that some over-coverage was observed for Simulation 11 when the data was noisy, and the number of subjects was low. This observation is corroborated by the results of the remaining low-SNR synthetic data simulations (c.f. Supplementary Results Section S55). However, in all cases, the degree of agreement between the empirical and nominal coverage improved as the sample size increased. Furthermore, no such over-coverage was observed for the equivalent high-SNR simulations. This observation matches expectation, as Theorem 2.1 holds only asymptotically, and is supported by the previous literature on confidence regions, in which similar results were observed for the single-‘study condition’ setting (c.f. Sommerfeld et al. (2018), Bowring et al. (2019)).

The presence of strong negative correlation between the study conditions and the rate at which the signal varied across space both also appeared to influence the methods’ performance (c.f. Fig 3, Simulations 2 and 3). In both instances, it is likely that the observed over-coverage is due to difficulties in estimating the true location of the boundary c\partial\mathcal{F}_{c}, as each of these factors make it harder to identify the locations at which small changes in miniμi\min_{i\in\mathcal{M}}\mu^{i} occur. When high-SNR data was used in the place of low-SNR data, both of these factors had a substantially reduced impact on the empirical coverage.

Despite the above observations, the simulation results overwhelmingly supported the claim that the proposed method is robust to a range of secondary factors. Included in the list of such factors are; variation in the shape and size of c\mathcal{F}_{c}, spatial correlation in the noise, variation in the lengths of individual boundary segments, between-‘study condition’ correlation, realistic levels of variation in the magnitude of the noise, large numbers of study conditions and situations in which boundary segments are shared by many excursion sets. For further detail, see Supplementary Results Sections S55 and S66.

In terms of time efficiency, the wild tt-bootstrap provided extremely fast computational performance. For instance, using an Intel(R) Xeon(R) Gold 6126 2.60GHz processor with 16GB RAM, and averaging over the 25002500 simulation instances conducted for n=500n=500 high-SNR observations, in Simulation 11, the time taken to generate confidence regions for a circle separation of 2020 pixels was 5.985.98 seconds. For a comprehensive summary of computation times, see Supplementary Results Section S77.

5 Real Data Application

To demonstrate the method in practice, we apply it to fMRI data drawn from the Human Connectome Project (HCP) dataset (Van Essen et al. (2013)). In this example, task fMRI data were collected as part of a block design from 8080 healthy, unrelated, young adults as part of a working memory task that had four distinct components, each using a different stimuli type: pictures of places, tools, faces and body parts. In Supplementary Results Section S33, we provide a brief overview of the imaging acquisition protocol, task paradigm, preprocessing stages and first-level analysis employed for generating this dataset (for exhaustive detail, see Barch et al. (2013) and Glasser et al. (2013)). Following first-level analysis, the data consisted of 44 images for each of the 8080 subjects, measuring the %BOLD response to each of the 44 stimuli types. In this example, our interest lies in identifying, at the group-level, which regions of the brain are associated with working memory regardless of stimuli type (i.e. “Which regions are active in response to all four working memory stimuli types?”).

As this work has predominantly focused on two-dimensional excursion sets, a single slice of the brain was chosen for analysis (z=46z=46mm, covering portions of the frontal gyrus involved in working memory). For each stimuli type, an n=80n=80 group-level linear model was constructed. Using the proposed method, confidence regions were then generated at the 5%5\% confidence level to assess where the group-level percentage BOLD change exceeded 1%1\% for all four stimuli types. To compare the proposed method to standard fMRI inference procedures, a group-level contrast was also generated using the Big Linear Model toolbox for each of the four stimuli types. In addition, single-‘study condition’ confidence regions were also generated using the methods of Sommerfeld et al. (2018) for each of the four stimuli types.

The results are shown in Fig. 4. The conjunction inference identified several localized regions, including the Superior Frontal Gyrus, which is well known for its involvement in working memory (c.f. Boisgueheneuc et al. (2006), Vogel et al. (2016), Alagapan et al. (2019)), and the Angular Gyri, which are well known to be involved in a range of tasks including memory retrieval, attention and spatial cognition (c.f. Seghier (2013), Bréchet et al. (2018)).

Refer to caption
Figure 4: %\%BOLD images (top) and 95% confidence regions (bottom left) for the HCP working memory task for each of the four visual stimuli types, alongside conjunction confidence regions assessing the overlap of the four thresholded %\%BOLD images (bottom right). Displayed is axial slice z=46z=46mm. The thresholds employed were c=1%c=1\% BOLD change, for all images, and α=0.05\alpha=0.05, for the confidence regions. Upper and lower confidence regions are displayed in red and blue, respectively. Across the bottom row, the yellow sets are the point estimates 𝒜^c1,𝒜^c4\hat{\mathcal{A}}_{c}^{1},...\hat{\mathcal{A}}_{c}^{4} and ^c\hat{\mathcal{F}}_{c} respectively. The red conjunction set has localized regions in the Superior Frontal Gyrus (top) and the Angular Gyri (left/right), for which we can assert with 95%95\% confidence that, for all four study conditions, there was (at least) a 1.0%1.0\% change in BOLD response.

Whilst the confidence regions for conjunction inference illustrated in Fig. 4 exhibit a clear resemblance to the four contrast images, we note that the red set, ^c+\hat{\mathcal{F}}_{c}^{+}, is very small in comparison to the blue set, ^c\hat{\mathcal{F}}_{c}^{-}, which is large and diffuse. This observation conveys important information about the spatial variability of the regions identified. In particular, it can be seen that, in the regions immediately surrounding the Superior Frontal Gyrus and Angular Gyri, there is a much higher resemblance between the blue (^c\hat{\mathcal{F}}_{c}^{-}) and yellow (^c\hat{\mathcal{F}}_{c}) sets than is seen across the rest of the brain. This resemblance provides an insight into the degree of spatial variation present surrounding these regions. In this case, the strength and localization of this resemblance suggest that the spatial variability of ^c\hat{\mathcal{F}}_{c} is less severe in and around the aforementioned anatomical regions than it is across the rest of the brain. It may be concluded that the estimated yellow ‘blobs’ corresponding to the Superior Frontal Gyrus and the Angular Gyri have been more reliably localized than the other yellow ‘blobs’ appearing in the image.

In general, the single-‘study condition’ confidence regions can be seen to be much larger than those observed for the conjunction inference. This observation matches expectation as the overlap of the four excursion sets is smaller than each set individually. In addition, in this example, the conjunction method employed the entire experimental design for analysis (as opposed to the single-‘study condition’ method which employed only the data concerning the stimuli of interest) and is therefore expected to have a higher statistical power and, thus, tighter confidence bounds.

6 Conclusion and Discussion

In this work, we have produced a method for generating confidence regions for intersections and unions of multiple excursion sets. The confidence regions generated by the method generalise the notion of confidence intervals to arbitrary spatial dimensions and possess the usual frequentist interpretation that, were the 0.950.95 procedure repeated on numerous samples, the proportion of confidence regions, ^c+\hat{\mathcal{F}}^{+}_{c} and ^c\hat{\mathcal{F}}^{-}_{c}, that correctly enclose c\mathcal{F}_{c} would tend to 0.950.95. Such confidence regions serve as an indicator of the reliability of ^c\hat{\mathcal{F}}_{c} as an estimate of c\mathcal{F}_{c}. Confidence regions which closely resemble the set ^c\hat{\mathcal{F}}_{c} may be interpreted as signifying that ^c\hat{\mathcal{F}}_{c} is a reliable estimate of c\mathcal{F}_{c}, whilst little resemblance may suggest that there is a high degree of spatial variability present in the data and that the estimated shape, size and locale of ^c\hat{\mathcal{F}}_{c} are not particularly reliable. Such statements are of particular value in imaging applications where the aim of a statistical analysis is typically to assess how reliably some form of activity may be localized to a particular spatial region.

We stress here that, whilst ^c\hat{\mathcal{F}}_{c} is equal to the intersection of {𝒜^ci}i\{\hat{\mathcal{A}}_{c}^{i}\}_{i\in\mathcal{M}}, the confidence regions ^c±\hat{\mathcal{F}}^{\pm}_{c} are not the intersections of the single-‘study condition’ confidence regions {𝒜^c±,i}i\{\hat{\mathcal{A}}^{\pm,i}_{c}\}_{i\in\mathcal{M}} which would be obtained using the methods of Sommerfeld et al. (2018). This can be seen by noting that {𝒜^c±,i}i\{\hat{\mathcal{A}}^{\pm,i}_{c}\}_{i\in\mathcal{M}} are defined using separate thresholds from different bootstraps and can be heavily influenced by the behavior of {𝒜ci}i\{\partial\mathcal{A}_{c}^{i}\}_{i\in\mathcal{M}} in spatial regions far from c\mathcal{F}_{c}. In fact, treating the naive intersection of the (1α1-\alpha) confidence regions {𝒜^c±,i}i\{\hat{\mathcal{A}}^{\pm,i}_{c}\}_{i\in\mathcal{M}} as confidence regions for c\mathcal{F}_{c} is not a valid method in general and can result in undesirable asymptotic coverage lying anywhere inside the range [1Mα,1][1-M\alpha,1]. This claim is further discussed in Supplementary Theory Section S4. To our knowledge, the only valid approach for generating confidence regions for conjunction inference is that outlined in this work.

One potential limitation of the specific methods of Section 3 is that they allow for confidence regions to be generated only when the target functions, {μi}i\{\mu^{i}\}_{i\in\mathcal{M}}, are linear combinations of regression parameter estimates. As noted by Bowring et al. (2021), it is often preferable for an analysis to consider standardized effect sizes, such as Cohen’s dd (Cohen (2013)) or Hedges’ GG (Hedges (1981)), instead of regression parameter estimates, as standardized effect sizes allow for information about statistical power to be incorporated into the analysis. To this end, Bowring et al. (2021) outlined an approach for generating confidence regions (in the single-‘study condition’ setting) for images of Cohen’s dd effect sizes. Here, we suggest that a potential avenue for future work is to combine the methods of Bowring et al. (2021) and those proposed in this work to allow for conjunction, or disjunction, inference to be performed on standardized effect sizes rather than regression parameter estimates.

Another limitation noted here is that, whilst our proposed method theoretically works for data of arbitrary dimensions, the simulations of Section 4 verify the performance of our method only for two-dimensional data. Whilst it is typical for many practical applications to focus on two-dimensional data (e.g. climate maps, surface images), higher-dimensional datasets are abundant in a wealth of disciplines (e.g. brain images, astrological maps). For this reason, we intend to investigate further the performance of the method in higher dimensions in future work.

Acknowledgment(s)

This work was partially supported by the NIH under Grant [R01EB026859] (A.S., T.M., T.N.).

Data were provided by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. The code used to obtain the results of Sections 4 and 5 is freely available and may be found at:
https://github.com/TomMaullin/ConfSets

The authors would like to thank Dr. Fabian J.E. Telschow for his early involvement in formulating the project, and Professor Gary R. Lawlor for his email correspondence regarding the differentiability requirements for L’hospital’s rule for multivariable functions.

Disclosure statement

The authors report there are no competing interests to declare.

References

  • Adler [1981] R.J. Adler. The geometry of random fields. Wiley series in probability and mathematical statistics. Probability and mathematical statistics. J. Wiley, 1981. ISBN 9780471278443.
  • Adler and Taylor [2009] R.J. Adler and J.E. Taylor. Random Fields and Geometry. Springer Monographs in Mathematics. Springer New York, 2009. ISBN 9780387481166.
  • Adler [1977] Robert J. Adler. Hausdorff Dimension and Gaussian Fields. The Annals of Probability, 5(1):145 – 151, 1977. doi: 10.1214/aop/1176995900.
  • Adler et al. [2017] Robert J. Adler, Kevin Bartz, Sam C. Kou, and Anthea Monod. Estimating thresholding levels for random fields via euler characteristics, 2017.
  • Alagapan et al. [2019] Sankaraleengam Alagapan, Eldad Lustenberger, Caroline; Hadar, Hae Won Shin, and Flavio Fröhlich. Low-frequency direct cortical stimulation of left superior frontal gyrus enhances working memory performance. NeuroImage, 184, 01 2019. doi: 10.1016/j.neuroimage.2018.09.064.
  • Azas and Wschebor [2009] Jean-Marc Azas and Mario Wschebor. Level sets and extrema of random processes and fields. Level Sets and Extrema of Random Processes and Fields, 01 2009. doi: 10.1002/9780470434642.
  • Barch et al. [2013] Deanna M. Barch, Gregory C. Burgess, Michael P. Harms, Steven E. Petersen, Bradley L. Schlaggar, Maurizio Corbetta, Matthew F. Glasser, Sandra Curtiss, Sachin Dixit, Cindy Feldt, Dan Nolan, Edward Bryant, Tucker Hartley, Owen Footer, James M. Bjork, Russ Poldrack, Steve Smith, Heidi Johansen-Berg, Abraham Z. Snyder, and David C. Van Essen. Function in the human connectome: Task-fmri and individual differences in behavior. NeuroImage, 80:169–189, 2013. ISSN 1053-8119. doi: https://doi.org/10.1016/j.neuroimage.2013.05.033. Mapping the Connectome.
  • Berger [1997] Roger L. Berger. Likelihood Ratio Tests and Intersection-Union Tests, pages 225–237. Birkhäuser Boston, Boston, MA, 1997. ISBN 978-1-4612-2308-5. doi: 10.1007/978-1-4612-2308-5˙15.
  • Berger and Hsu [1996] Roger L. Berger and Jason C. Hsu. Bioequivalence trials, intersection-union tests and equivalence confidence sets. Statistical Science, 11(4):283 – 319, 1996. doi: 10.1214/ss/1032280304.
  • Boisgueheneuc et al. [2006] Foucaud du Boisgueheneuc, Richard Levy, Emmanuelle Volle, Magali Seassau, Hughes Duffau, Serge Kinkingnehun, Yves Samson, Sandy Zhang, and Bruno Dubois. Functions of the left superior frontal gyrus in humans: a lesion study. Brain, 129(12):3315–3328, 09 2006. ISSN 0006-8950. doi: 10.1093/brain/awl244.
  • Bowring et al. [2019] Alexander Bowring, Fabian Telschow, Armin Schwartzman, and Thomas E. Nichols. Spatial confidence sets for raw effect size images. NeuroImage, 203:116187, 2019. ISSN 1053-8119. doi: https://doi.org/10.1016/j.neuroimage.2019.116187.
  • Bowring et al. [2021] Alexander Bowring, Fabian J.E. Telschow, Armin Schwartzman, and Thomas E. Nichols. Confidence sets for cohen’s d effect size images. NeuroImage, 226:117477, 2021. ISSN 1053-8119. doi: https://doi.org/10.1016/j.neuroimage.2020.117477.
  • Bréchet et al. [2018] Lucie Bréchet, Petr Grivaz, Baptiste Gauthier, and Olaf Blanke. Common recruitment of angular gyrus in episodic autobiographical memory and bodily self-consciousness. Frontiers in Behavioral Neuroscience, 12, 11 2018. doi: 10.3389/fnbeh.2018.00270.
  • Cao [1999] J. Cao. The size of the connected components of excursion sets of χ\chi2, t and f fields. Advances in Applied Probability, 31(3):579–595, 1999. doi: 10.1239/aap/1029955192.
  • Chang et al. [2017] Chung Chang, Xuejing Lin, and R. Todd Ogden. Simultaneous confidence bands for functional regression models. Journal of Statistical Planning and Inference, 188:67–81, 2017. ISSN 0378-3758. doi: https://doi.org/10.1016/j.jspi.2017.03.002.
  • Chernozhukov et al. [2013] Victor Chernozhukov, Denis Chetverikov, and Kengo Kato. Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. The Annals of Statistics, 41(6), Dec 2013. ISSN 0090-5364. doi: 10.1214/13-aos1161.
  • Cohen [2013] J. Cohen. Statistical Power Analysis for the Behavioral Sciences. Elsevier Science, 2013. ISBN 9781483276489. URL https://books.google.co.uk/books?id=rEe0BQAAQBAJ.
  • Dijkstra et al. [2021] Nadine Dijkstra, Simon van Gaal, Linda Geerligs, Sander E Bosch, and Marcel van Gerven. No overlap between unconscious and imagined representations, May 2021.
  • Efron and Tibshirani [1994] Bradley Efron and R.J. Tibshirani. An Introduction to the Bootstrap. Chapman and Hall/CRC, 1 edition, 1994.
  • Ferreira et al. [2021] Roberto A. Ferreira, David Vinson, Ton Dijkstra, and Gabriella Vigliocco. Word learning in two languages: Neural overlap and representational differences. Neuropsychologia, 150:107703, 2021. ISSN 0028-3932. doi: https://doi.org/10.1016/j.neuropsychologia.2020.107703.
  • Glasser et al. [2013] Matthew F. Glasser, Stamatios N. Sotiropoulos, J. Anthony Wilson, Timothy S. Coalson, Bruce Fischl, Jesper L. Andersson, Junqian Xu, Saad Jbabdi, Matthew Webster, Jonathan R. Polimeni, David C. Van Essen, and Mark Jenkinson. The minimal preprocessing pipelines for the human connectome project. NeuroImage, 80:105–124, 2013. ISSN 1053-8119. doi: https://doi.org/10.1016/j.neuroimage.2013.04.127. Mapping the Connectome.
  • Hedges [1981] Larry V. Hedges. Distribution theory for glass’s estimator of effect size and related estimators. Journal of Educational Statistics, 6(2):107–128, 1981. ISSN 03629791. URL http://www.jstor.org/stable/1164588.
  • Hesterberg [2014] Tim Hesterberg. What teachers should know about the bootstrap: Resampling in the undergraduate statistics curriculum, 2014.
  • Pranav et al. [2019] Pratyush Pranav, Rien van de Weygaert, Gert Vegter, Bernard J T Jones, Robert J Adler, Job Feldbrugge, Changbom Park, Thomas Buchert, and Michael Kerber. Topology and geometry of gaussian random fields i: on betti numbers, euler characteristic, and minkowski functionals. Monthly Notices of the Royal Astronomical Society, 485(3):4167–4208, Feb 2019. ISSN 1365-2966. doi: 10.1093/mnras/stz541.
  • Seghier [2013] Mohamed L. Seghier. The angular gyrus: multiple functions and multiple subdivisions. The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry, 19(1):43–61, Feb 2013. ISSN 1089-4098. doi: 10.1177/1073858412440596.
  • Sommerfeld et al. [2018] Max Sommerfeld, Stephan Sain, and Armin Schwartzman. Confidence regions for spatial excursion sets from repeated random field observations, with an application to climate. Journal of the American Statistical Association, 113(523):1327–1340, 2018. doi: 10.1080/01621459.2017.1341838. PMID: 31452557.
  • Telschow and Schwartzman [2021] Fabian Telschow and Armin Schwartzman. Simultaneous confidence bands for functional data using the gaussian kinematic formula. Journal of Statistical Planning and Inference, 216, 06 2021. doi: 10.1016/j.jspi.2021.05.008.
  • Torres [1994] Sergio Torres. Topological Analysis of COBE-DMR Cosmic Microwave Background Maps. apjl, 423:L9, March 1994. doi: 10.1086/187223.
  • Van Essen et al. [2013] David Van Essen, Stephen Smith, Deanna Barch, Timothy Behrens, Essa Yacoub, and Kamil Ugurbil. The wu-minn human connectome project: an overview. NeuroImage, 80, 05 2013. doi: 10.1016/j.neuroimage.2013.05.041.
  • Vogel et al. [2016] Tobias Vogel, Renata Smieskova, Anna Schmidt, André; Walter, Fabienne Harrisberger, Anne Eckert, Undine E. Lang, Anita Riecher-Rössler, Marc Graf, and Stefan Borgwardt. Increased superior frontal gyrus activation during working memory processing in psychosis: Significant relation to cumulative antipsychotic medication and to negative symptoms. Schizophrenia Research, 4 2016. doi: 10.1016/j.schres.2016.03.033.
  • Worsley [1996] Keith J. Worsley. The geometry of random images. CHANCE, 9(1):27–40, 1996. doi: 10.1080/09332480.1996.10542483.
  • Wu [1986] C. F. J. Wu. Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis. The Annals of Statistics, 14(4):1261 – 1295, 1986. doi: 10.1214/aos/1176350142.
  • Zhang et al. [2021] Xiaoxia Zhang, Linling Li, Gan Huang, Li Zhang, Zhen Liang, Li Shi, and Zhiguo Zhang. A multisensory fmri investigation of nociceptive-preferential cortical regions and responses. Frontiers in neuroscience, Apr 2021.