This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Analyzing whale calling through Hawkes process modeling

Bokgyeong Kang
Department of Statistical Science, Duke University,

Erin M. Schliep
Department of Statistics, North Carolina State University,

Alan E. Gelfand
Department of Statistical Science, Duke University,

Tina M. Yack
Nicholas School of the Environment, Duke University,

Christopher W. Clark
K. Lisa Yang Center for Conservation Bioacoustics, Cornell University,
and
Robert S. Schick
Southall Environmental Associates
Abstract

Sound is assumed to be the primary modality of communication among marine mammal species. Analyzing acoustic recordings helps to understand the function of the acoustic signals as well as the possible impact of anthropogenic noise on acoustic behavior. Motivated by a dataset from a network of hydrophones in Cape Cod Bay, Massachusetts, utilizing automatically detected calls in recordings, we study the communication process of the endangered North Atlantic right whale. For right whales an “up-call” is known as a contact call, and ensuing counter-calling between individuals is presumed to facilitate group cohesion. We present novel spatiotemporal excitement modeling consisting of a background process and a counter-call process. The background process intensity incorporates the influences of diel patterns and ambient noise on occurrence. The counter-call intensity captures potential excitement, that calling elicits calling behavior. Call incidence is found to be clustered in space and time; a call seems to excite more calls nearer to it in time and space. We find evidence that whales make more calls during twilight hours, respond to other whales nearby, and are likely to remain quiet in the presence of increased ambient noise.

Keywords: Gaussian process, North Atlantic right whales, random time change theorem, spatial process, temporal point patterns

1 Introduction

Sound is assumed to be the primary mode of communication among many marine mammal species (Bass and Clark,, 2003). Scientists have been recording the sounds produced by marine mammals for many decades (Schevill and Watkins,, 1962; Payne and McVay,, 1971; Tyack and Clark,, 2000), and over time have developed more advanced methods to detect (Moore et al.,, 2006; Mellinger et al.,, 2007), classify (Gillespie,, 2004), localize (Watkins and Schevill,, 1972), and understand the utility of acoustic signals produced by these animals (Richardson et al.,, 1995; Au and Hastings,, 2008; Bradbury and Vehrencamp,, 2011).

Our focus is on the North Atlantic right whale (Eubalaena glacialis, hereafter NARW), an endangered baleen whale whose primary habitat is the western North Atlantic Ocean (Kraus and Rolland,, 2007). NARWs face a variety of threats, including ship strikes (Vanderlaan and Taggart,, 2007), entanglement with fishing gear (Knowlton et al.,, 2022), and climate-change induced impacts on their habitat (Meyer-Gutbrod et al.,, 2023). Their population is currently estimated to be 356 (346-363) animals (Linden,, 2023). NARWs have a diverse acoustic repertoire (Clark,, 1982; Matthews and Parks,, 2021), with a variety of different call types that vary by habitat (Cusano et al.,, 2019), behavior (Parks and Tyack,, 2005) and age (Root-Gutteridge et al.,, 2018). One of these call types is a frequency-modulated “up-call” (Clark,, 1982), which is thought to serve as a contact call between individual whales (Clark and Clark,, 1980). The ensuing counter-calling between individuals, in response to the up-calls, is presumed to serve as a mechanism for maintaining group cohesion. Up-calls have been used to document the presence and relative abundance of NARWs across broad spatial (Davis et al.,, 2017) and temporal scales (Davis et al.,, 2023). Our analysis aims to better understand the dynamics of up-calling behavior of NARWs within and across days.

Tracking the calls of individuals through time is challenging. It is often done with a combination of survey platforms (Clark et al.,, 1986), on navy ranges with a dense array of hydrophones (Helble et al.,, 2016), or using short-term movement and acoustic tags on animals (Johnson and Tyack,, 2003; Parks et al.,, 2010, 2019). Though movement and acoustic tags allow you to detect the precise timing of calling by the tagged individual, they are not without limitations; for example, 1) the tag deployments are short-term, typically on the order of hours; 2) they are archival, so they must be retrieved; and 3), as compared to an array of hydrophones, they are spatially limited. In contrast, a distributed array of acoustic recorders can cover a broader spatial area over a longer period of time. The acoustic recorder data analyzed here also have limitations, including the fact that they preclude direct observation of up-calls from known individuals. These data consist of the set of times at which individual NARW up-calls were received at each acoustic recorder in a distributed array of 10 recorders. Each recorder captures a unique sequence of up-call detection times. At the level of the array, each sequence is a marked point pattern, i.e., an event time with an associated recorder label for each fixed recorder location.

Our analysis aims to better understand how diel patterns and ambient noise levels influence up-calling behavior across individuals (Matthews and Parks,, 2021). We also seek to identify evidence of counter-calling behavior, i.e., whether NARWs produce up-calls in response to other up-calling whales. If counter-calling behavior is present, the aggregated sequence will exhibit excitement since the incidence of an up-call tends to encourage more up-calls than otherwise expected in a window of time after the up-call. This excitement could occur both within or across recorders. Lastly, given concerns about levels of anthropogenic noise in the ocean (Radford et al.,, 2014; Hatch et al.,, 2016), there is a need to quantify the potential impacts of anthropogenic sounds on the calling behavior of NARWs. To accomplish these goals, we specify a novel multivariate Hawkes process where the intensity for the up-calls is the summation of a so-called background process that produces contact calls and a counter-call process that can explain the potential excitement.111The term intensity has very different meaning for acousticians, who use it to describe the strength or amplitude of a sound. Throughout the manuscript, we use intensity in the statistical sense. The notion of cross-recorder excitement can potentially provide insight into communication between different individuals. Very recent ecological work (Nicvert et al.,, 2024) has employed a multivariate Hawkes process to study interactions between multiple species from camera trap data across time.

Since each acoustic sensor has a unique spatial location, joint modeling of the event sequences from the recorders requires spatial specification. We assume that neighboring recorders exhibit similar patterns of contact calling. Similarly, we assume that an up-call first detected at a particular recorder is more likely to excite counter-calls at the same or nearby recorders. As such, a model for multi-recorder data requires the introduction of spatial dependence. We introduce spatial dependence into the background process through spatially-varying regression coefficients. We introduce distance-based dependence into the counter-call (CC) process. Further, we expand the customary specification for the multivariate Hawkes process by enriching the background process from a nonhomogeneous Poisson process (NHPP) to a process allowing for local adjustment to the intensity through a Gaussian process (GP). Altogether, we label this model as NHPP+GP+CC. This is our richest specification and we investigate it with regard to its ability to distinguish between the GP and CC components of the overall intensity.

The models developed here differ from customary spatial self-exciting process modeling in the literature (see, e.g., Rathbun,, 1996; Reinhart,, 2018). In the usual setting, the data consist of a marked point pattern—a single event time sequence with a mark denoting the spatial location over the region. This formulation results in a conditional intensity over space and time. In our analysis, we specify a conditional intensity of the event time sequence for each hydrophone, reflecting its spatial location. These conditional intensities are modeled jointly across space. Conceptually similar is the network Hawkes modeling in Linderman and Adams, (2014).

We consider the following ensemble of models: (i) NHPP; (ii) NHPP+GP; (iii) NHPP+CC; (iv) NHPP+GP+CC. Evidently, models (i) to (iii) are special cases of (iv). All model specification and fitting are done in a hierarchical Bayesian framework. We introduce a simulation study in order to learn how well the fitting models can distinguish generating models. To assess model adequacy, we extend the random time change theorem (RTCT) (Brown and Nair,, 1988) to a fully Bayesian modeling setting. For model comparison, we add posterior likelihood comparison along with model dimension penalization through the Deviance Information Criterion (DIC).

Our primary contribution is an analysis of two NARW vocalization datasets, one from a single recorder located just west of Race Point at the Northern edge of Cape Cod Bay and the second from a network of recorders within the central part of Cape Cod Bay (CCB). We employ the event series of received up-call times to explore three features: 1) the periodic/diel behavior of contact calls; 2) evidence of counter-calling behavior in the data; and 3) the role of ambient noise in altering the up-calling behavior in NARW.

The format of the paper is as follows. Section 2 provides details of the data collection as well as some exploratory data analysis for the single and multi-channel data sources. Section 3 elaborates the multi-channel models with the single channel model as a special case. Section 4 introduces model adequacy and model comparison tools. Section 5 offers some simulation work to demonstrate the ability of our modeling to recover the true generative specification. Section 6 provides our findings for both the single channel and the multi-channel. Section 7 concludes with a summary of our contributions, our findings, and some plans for future work.

2 Passive acoustic monitoring data

Researchers have been monitoring and recording the sounds produced by NARWs for several decades, and acoustic detections have provided information about the spatial and temporal distribution of these whales (Davis et al.,, 2017). We focus on one type of sound referred to as the up-call, which serves as a contact call to maintain social cohesion (Clark,, 1982). Acoustic data have been collected with a distributed array of bottom-mounted hydrophones which remain underwater for many months at a time while continuously recording acoustic data (Calupca et al.,, 2000). From these continuous records, bioacousticians use a variety of methods to detect and classify the call made by individual(s) (Clark and Clark,, 1980). This process leads to a temporally explicit series of unique up-call events. We explore up-calling behavior of NARWs recorded from both a single-channel recorder and a distributed network of 10 recorders.

2.1 Single-channel data

Refer to caption
Figure 1: (a) The location of Cape Cod Bay, MA. (b) The locations of the single MARU (blue circle) and the distributed network of 10 MARUs in CCB. The orange and green colors correspond to two highlighted MARUs which will be referred to in subsequent figures.

The single-channel dataset was collected and processed by the National Oceanic and Atmospheric Administration (NOAA) (DCLDE 2013,, 2023). This comprised recordings from a week-long deployment of a bottom-mounted hydrophone system, referred to as a marine autonomous recording unit or MARU (Calupca et al.,, 2000); the MARU was deployed to the west of Race Point on the northern edge of Cape Cod Bay (Figure 1). MARUs were programmed to record sound continuously in the 10-585 Hz low-frequency band (Parks et al.,, 2009) including biotic, abiotic, and anthropogenic sources. Post-processing was done with a software routine designed to automatically detect and classify low-frequency calls of baleen whales, including NARW (Baumgartner and Mussoline,, 2011). Following automated detection, a trained analyst manually reviewed and verified possible NARW up-calls, and the processed dataset was made available to the acoustic processing community as a benchmark dataset to be used in subsequent algorithm development. There were 5,302 unique and verified up-calls detected across the seven days (2009-03-28 to 2009-04-03), and we extracted ambient noise levels (dB re 1 µPa) across the same frequency band (60-400 Hz) as the NARW up-call using the PAMGuard software program (https://www.pamguard.org). Ambient sound was in dB re 1 µ-Pa, i.e., decibel units of Sound Pressure Level, root mean square, with reference to 1 µ-Pascal—a standard metric in the bioacoustics community (Au and Hastings,, 2008). This noise metric was used as a covariate in the modeling.

2.2 Multi-channel CCB data

Refer to caption
Figure 2: (a) 30-minute rolling average time series of ambient noise measured at two example MARUs in CCB (MARU 10 in green; MARU 5 in orange). Units are decibels, Root-Mean-Squared. (b) Event sequence of unique up-calls received for each MARU within CCB, colors as in (a).

The multi-channel dataset consists of recordings from the distributed network of 10 MARUs located within Cape Cod Bay, MA. The distances between MARUs are found in the supplement Figure S1. This bay is seasonally inhabited by NARWs (Mayo et al.,, 2018), with upwards of 70% of the entire species sometimes being observed during aerial surveys and vessel-based surveys (C. Hudak, pers comm). Starting in the early 2000s, researchers deployed passive acoustic arrays of different size and configuration in CCB coincident with the season when NARWs occur in this habitat (Clark et al.,, 2010; Urazghildiiev,, 2014). These arrays use the same type of MARU used in the single-channel dataset described earlier, however the data processing is slightly different. For our multi-channel analysis, we used acoustic data collected from 10 MARUs in a distributed network across 9 days (2010-04-02 to 2010-04-10). The data are archival, so the buoys must be recovered to obtain the sound files (Calupca et al.,, 2000).

Refer to caption
Figure 3: Pairwise correlation between a kernel intensity estimate for the total event sequence at MARU 10 versus the kernel intensity estimates for each of the other nine MARUs.

We used the software PAMGuard to process the raw sound files, detect and classify NARW up-calls, extract ambient noise metrics, and localize individual up-calling NARWs (Gillespie,, 2004; Gillespie et al.,, 2020). Localization algorithms use the time-difference-of-arrival of three or more matched up-calls (Urazghildiiev,, 2014). To determine the times of unique up-calls, i.e., those arriving from a single whale, we found all up-calls associated with each unique localization; from this associated set we extracted the first arrival time, as well as its associated MARU. Across the 9 days, a total of 2,750 unique up-calls were extracted from the data. The arrival times on the MARUs are the data that comprise the analysis. Figure 2 (a) shows the ambient noise for two illustrative MARUs (MARU 5 and MARU 10 in Figure 1)) highlighting variation across MARUs as well as within MARUs across days. Figure 2 (b) depicts the event sequence of unique up-calls received on each of the MARUs within CCB for the 8-day period. The up-call sequences show diel patterns as well as possible excitement in up-call rates, motivating the inclusion of diel harmonics in the intensity of the process as well as Hawkes process modeling.

Finally, we investigate possible spatial dependence in the up-calling behavior captured by the array of MARUs. We might expect nearby hydrophones to capture similar up-calling behavior as a response to noise and temporal harmonics. In addition, we might expect the number of counter-calls received at MARU kk that were excited by up-calls at MARU ll to decrease with the distance between MARUs ll and kk. For each MARU, we compute the kernel intensity estimate for the total event sequence of up-calls. Then, we compute pairwise correlations between these estimates. In Figure 3, we show these pairwise correlations spatially between MARU 10 and each of the other MARUs. Positive correlation is detected between MARU 10 and each of the nearby MARUs, suggesting evidence of spatial dependence across the array.

3 Model specification and inference

3.1 Multivariate Hawkes process

We propose four different multivariate Hawkes process models for studying the behavior of NARW up-calls. Let 𝒯={(ti,mi),i=1,,n}\mathcal{T}=\{(t_{i},m_{i}),i=1,\dots,n\} be the sequence of paired observations where ti(0,T]t_{i}\in(0,T] is an up-call time with ti<ti+1t_{i}<t_{i+1} and mi{1,,K}m_{i}\in\{1,\dots,K\} indicates the MARU which received up-call ii at time tit_{i}. Let t\mathcal{H}_{t} be the history of all up-calls up to time tt, i.e., t={(ti,mi):ti<t}\mathcal{H}_{t}=\{(t_{i},m_{i}):t_{i}<t\}. Such a point process in time can be characterized by a conditional intensity defined as

λ(tt)=limΔt0E{N((t,t+Δt])t}Δt,\displaystyle\lambda(t\mid\mathcal{H}_{t})=\lim_{\Delta t\to 0}\frac{\textup{E}\left\{N\left((t,t+\Delta t]\right)\mid\mathcal{H}_{t}\right\}}{\Delta t},

where N(A)N(A) is the number of up-calls recorded over A(0,T]A\subseteq(0,T].

We label our most general model as NHPP+GP+CC (nonhomogeneous Poisson process + Gaussian process + counter-call process). For this model we define the conditional intensity of up-calls at MARU kk at time tt as

λk(tt;𝜽)\displaystyle\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$}) =μk(t)+i:ti<t,ti𝒯αmieη(tti)eϕdmi,k,\displaystyle=\mu_{k}(t)+\sum_{i:t_{i}<t,t_{i}\in\mathcal{T}}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i},k}}, (1)

where 𝜽\theta is the collection of model parameters. The conditional intensity of up-calls across KK MARUs at time tt is λ(tt;𝜽)=k=1Kλk(tt;𝜽)\lambda(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=\sum_{k=1}^{K}\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$}).

The first term μk(t)\mu_{k}(t) of (1) explains the rate of contact calls at MARU kk at time tt. We use minutes and kilometers (km) for the units of time and space. To capture the impact of noise on contact calls as well as the potential daily pattern of contact calls, we assume

μk(t)=exp{\displaystyle\mu_{k}(t)=\exp\bigg{\{} β0k+β1kNoisek(t)+β2ksin(2πt8×60)+β3kcos(2πt8×60)+β4ksin(2πt12×60)\displaystyle\beta_{0k}+\beta_{1k}\text{Noise}_{k}(t)+\beta_{2k}\sin\left(\frac{2\pi t}{8\times 60}\right)+\beta_{3k}\cos\left(\frac{2\pi t}{8\times 60}\right)+\beta_{4k}\sin\left(\frac{2\pi t}{12\times 60}\right)
+β5kcos(2πt12×60)+β6ksin(2πt24×60)+β7kcos(2πt24×60)+δkw(t)},\displaystyle+\beta_{5k}\cos\left(\frac{2\pi t}{12\times 60}\right)+\beta_{6k}\sin\left(\frac{2\pi t}{24\times 60}\right)+\beta_{7k}\cos\left(\frac{2\pi t}{24\times 60}\right)+\delta_{k}w(t)\bigg{\}},

where Noise(t)k{}_{k}(t) is the ambient noise level at MARU kk at time tt, and the three sine and cosine pairs are the 8-hour, 12-hour, and 24-hour harmonics, respectively. The GP, w()w(\cdot), is mean 0 and variance 1, and is used to account for variability in the contact call process not captured by the other covariates. We assume an exponential correlation function with an effective range of 3 hours to capture local temporal dependence. The δk\delta_{k}’s are MARU-specific coefficients for the GP. Let 𝜷j=(βj1,,βjK)\mbox{\boldmath$\beta$}_{j}=\left(\beta_{j1},\dots,\beta_{jK}\right)^{\top} denote the length KK variable-specific vector of coefficients, where j=0j=0 denotes the intercept. Let 𝜹=(δ1,,δK)\mbox{\boldmath$\delta$}=\left(\delta_{1},\dots,\delta_{K}\right)^{\top} denote a vector of the GP coefficients. To account for spatial dependence, we assume that βjk\beta_{jk}’s are dependent across kk. Let dk,d_{k,\ell} denote the distance between MARUs kk and \ell. To improve Markov chain mixing, we incorporate customary hierarchical centering (Gelfand et al.,, 1996). For j=0,1,,pj=0,1,\dots,p, we specify 𝜷jMVN(β~j1,τjV)\mbox{\boldmath$\beta$}_{j}\sim\text{MVN}(\tilde{\beta}_{j}\textbf{1},\tau_{j}V) where Vk,V_{k,\ell} is e3dk,/maxk,{dk,}e^{-3d_{k,\ell}/\max_{k^{\prime},\ell^{\prime}}\{d_{k^{\prime},\ell^{\prime}}\}} so that its effective range entirely covers the array of MARUs. The GP coefficients δk\delta_{k}’s are also assumed to be spatially dependent and given by log(𝜹)MVN(δ~1,τδV)\log(\mbox{\boldmath$\delta$})\sim\text{MVN}(\tilde{\delta}\textbf{1},\tau_{\delta}V).

The second term of (1) captures the conditional intensity component for counter-calls at MARU kk at time tt. It is natural to assume that the intensity of counter-calls caused by an up-call at tit_{i} is higher in time near tit_{i} and higher in space near mim_{i} and decreases as we move forward in time and away in space. Some motivation for this specification is provided by Figure 3. We assume the αk\alpha_{k}’s are independent with a Gamma prior.

In this manuscript we consider the following four models.

  1. (i)

    NHPP: λk(tt;𝜽)=eβ0k+xk(t)𝜷k\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=e^{\beta_{0k}+\textbf{x}_{k}(t)^{\top}\boldsymbol{\beta}_{k}}

  2. (ii)

    NHPP+GP: λk(tt;𝜽)=eβ0k+xk(t)𝜷k+δkw(t)\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=e^{\beta_{0k}+\textbf{x}_{k}(t)^{\top}\boldsymbol{\beta}_{k}+\delta_{k}w(t)}

  3. (iii)

    NHPP+CC: λk(tt;𝜽)=eβ0k+xk(t)𝜷k+i:ti<tαmieη(tti)eϕdmi,k\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=e^{\beta_{0k}+\textbf{x}_{k}(t)^{\top}\boldsymbol{\beta}_{k}}+\sum_{i:t_{i}<t}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i},k}}

  4. (iv)

    NHPP+GP+CC: λk(tt;𝜽)=eβ0k+xk(t)𝜷k+δkw(t)+i:ti<tαmieη(tti)eϕdmi,k\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=e^{\beta_{0k}+\textbf{x}_{k}(t)^{\top}\boldsymbol{\beta}_{k}+\delta_{k}w(t)}+\sum_{i:t_{i}<t}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i},k}}

Models (i)-(iii) are submodels of Model (iv). Models (i) and (ii) assume all up-calls are contact calls and that there are no counter-calls. Models (i) and (iii) are suitable when the covariate effects are sufficient to explain the rate of contact calls. The likelihood function (White and Gelfand,, 2021) associated with Model (iv) is given by

L(𝜽𝒯)=exp{k=1K0Tλk(tt)𝑑t}i=1nλmi(titi).\displaystyle L(\mbox{\boldmath$\theta$}\mid\mathcal{T})=\exp\left\{-\sum_{k=1}^{K}\int_{0}^{T}\lambda_{k}(t\mid\mathcal{H}_{t})dt\right\}\prod_{i=1}^{n}\lambda_{m_{i}}(t_{i}\mid\mathcal{H}_{t_{i}}). (2)

Model fitting details are found in the supplementary material.

3.2 Novel inference

The compensator for a point process provides the expected number of events under the intensity in the time window (0,t](0,t]. The compensator for a process on (0,T](0,T] with intensity λ(tt;𝜽)\lambda(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$}) is defined as Λ(tt;𝜽)=0tλ(uu;𝜽)𝑑u\Lambda(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=\int_{0}^{t}\lambda(u\mid\mathcal{H}_{u};\mbox{\boldmath$\theta$})du. For any tt, the compensator is a function of the data, i.e., it is a random variable. Here we focus on inference using the compensator. We will employ the compensator for assessing model adequacy in Section 4.1. If we evaluate the compensator at t=Tt=T, i.e., Λ(TT;𝜽)\Lambda(T\mid\mathcal{H}_{T};\mbox{\boldmath$\theta$}), we obtain the expected total number of events given the data. If we separate the integral of the conditional intensity, we obtain the expected number of contact calls and counter-calls received, respectively, in the time window, (0,T](0,T], given the data. Using an earthquake analogy, this separates the expected number of earthquakes and aftershocks.

First, suppose that there is a single MARU. If we integrate the counter-call component, i.e., i:ti𝒯tiTαeη(tti)𝑑t\sum_{i:t_{i}\in\mathcal{T}}\int_{t_{i}}^{T}\alpha e^{-\eta(t-t_{i})}dt, we obtain α/ηi:ti𝒯[1eη(Tti)]\alpha/\eta\sum_{i:t_{i}\in\mathcal{T}}[1-e^{-\eta(T-t_{i})}]. We can interpret the iith term in the sum as the contribution of the event at time tit_{i} to the expectation. From the Bayesian perspective, given the data, using posterior samples of the model parameters, we can obtain the posterior distributions of these functions, i.e., the joint distribution of the expected number of contact calls and the expected number of counter-calls, their marginal distributions, and the distribution of the sum.

Consider an array of KK MARUs. The intensity of up-calls received by each MARU kk is specified by λk(tt;𝜽)=μk(t)+i:ti<tαmieη(tti)eϕdmik\lambda_{k}(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=\mu_{k}(t)+\sum_{i:t_{i}<t}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i}k}}, which splits the conditional intensity into an intensity for contact calls received by MARU kk and an intensity for counter-calls received by MARU kk. An up-call received at any MARU at any time prior to tt provides a contribution to the counter-call intensity for MARU kk at time tt. This is the interpretation of i:ti<tαmieη(tti)eϕdmik\sum_{i:t_{i}<t}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i}k}}. The expected number of contact calls received by MARU kk over (0,T](0,T] is 0Tμk(t)𝑑t\int_{0}^{T}\mu_{k}(t)dt. The expected number of counter-calls received at MARU kk over (0,T](0,T] is 0T\int_{0}^{T} ti<t\sum_{t_{i}<t} αmi\alpha_{m_{i}} eη(tti)e^{-\eta(t-t_{i})} eϕdmi,kdt=i:ti𝒯tiTαmieη(tti)eϕdmi,k𝑑t=i:ti𝒯(αmi/η)[1eη(Tti)]eϕdmi,ke^{-\phi d_{m_{i},k}}dt=\sum_{i:t_{i}\in\mathcal{T}}\int_{t_{i}}^{T}\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i},k}}dt=\sum_{i:t_{i}\in\mathcal{T}}(\alpha_{m_{i}}/\eta)[1-e^{-\eta(T-t_{i})}]e^{-\phi d_{m_{i},k}}. With multiple MARUs, if we sum the contact call component over kk we obtain the overall expected number of contact calls. If we sum the counter-call component over kk we obtain the overall expected number of counter-calls. As above, we can obtain the joint posterior distribution of the expected number of contact calls and the expected number of counter-calls, their marginal distributions, and the posterior distribution of total expected number of up-calls. We can also obtain the expected number of counter-calls at MARU kk that were excited by up-calls at a specific MARU \ell in the time window (0,T](0,T]. This is given by i:ti𝒯𝕀(mi=)(αmi/η)[1eη(Tti)]eϕdmi,k\sum_{i:t_{i}\in\mathcal{T}}\mathbb{I}(m_{i}=\ell)(\alpha_{m_{i}}/\eta)[1-e^{-\eta(T-t_{i})}]e^{-\phi d_{m_{i},k}} where 𝕀()\mathbb{I}(\cdot) is the indicator function.

4 Model assessment

4.1 RTCT and its implementation in a Bayesian framework

We define the compensator associated with our model as Λ(tt;𝜽)=0tλ(uu;𝜽)𝑑u\Lambda(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$})=\int_{0}^{t}\lambda(u\mid\mathcal{H}_{u};\mbox{\boldmath$\theta$})du as in Section 3.2. By the random time change theorem (RTCT) (Brown and Nair,, 1988), if 𝜽\theta is “true” and ti=Λ(titi;𝜽)t^{*}_{i}=\Lambda(t_{i}\mid\mathcal{H}_{t_{i}};\mbox{\boldmath$\theta$}) for i=1,2,,ni=1,2,...,n, then the set {ti}\{t^{*}_{i}\} are a realization from a homogeneous Poisson process with rate 1 (HPP(1)) and the associated di=titi1d^{*}_{i}=t^{*}_{i}-t^{*}_{i-1} with t0=0t^{*}_{0}=0 are an i.i.d. sample from an exponential distribution with rate 1 (Exp(1)). For example, consider data 𝒯\mathcal{T} generated from an HPP(λ)(\lambda). The associated waiting times di=titi1d_{i}=t_{i}-t_{i-1} with t0=0t_{0}=0 are independent and have an Exp(λ)(\lambda) distribution. The compensator associated with HPP(λ\lambda) is ti=0tiλ𝑑u=λtit^{*}_{i}=\int_{0}^{t_{i}}\lambda du=\lambda t_{i}. Then the associated di=titi1=λ(titi+1)=λdid^{*}_{i}=t^{*}_{i}-t^{*}_{i-1}=\lambda(t_{i}-t_{i+1})=\lambda d_{i} has an Exp(1) distribution.

The compensator Λ(tt;𝜽)\Lambda(t\mid\mathcal{H}_{t};\mbox{\boldmath$\theta$}), as a function of the data, is a random function. In the frequentist setting, there is a “true” 𝜽\theta and only 𝒯\mathcal{T} is random. We have to estimate 𝜽\theta and then use the estimated compensator Λ(tt;𝜽^)\Lambda(t\mid\mathcal{H}_{t};\hat{\mbox{\boldmath$\theta$}}). This yields {di(𝒯;𝜽^)}\{d^{*}_{i}(\mathcal{T};\hat{\mbox{\boldmath$\theta$}})\}. The RTCT does not formally apply to {di(𝒯;𝜽^)}\{d^{*}_{i}(\mathcal{T};\hat{\mbox{\boldmath$\theta$}})\} but the argument is that, if 𝜽^\hat{\mbox{\boldmath$\theta$}} is close to the true 𝜽\theta, the result holds approximately.

We consider the RTCT in a fully Bayesian modeling setting. From a frequentist perspective the compensator is a random variable as a function of the data. From the Bayesian perspective it is a random variable as a function of the parameters given the data and we examine it through posterior inference. Consider the simple HPP(λ\lambda) case. If we draw λbp()\lambda_{b}\sim p(\cdot) a priori and 𝒯b\mathcal{T}_{b}\sim HPP(λb)(\lambda_{b}), then the set {db,i}\{d_{b,i}^{*}\} are an i.i.d. sample from Exp(1). This is also true if we generate 𝒯\mathcal{T}\sim HPP(λ)(\lambda) and define λbπ(𝒯)\lambda_{b}\sim\pi(\cdot\mid\mathcal{T}) as the posterior distribution. If the model is correctly specified, the posterior sample {db,i}\{d_{b,i}^{*}\} are a realization from Exp(1). In other words, if the posterior sample {db,i}\{d_{b,i}^{*}\} do not look like Exp(1) samples, then the model is not well specified. The same argument applies to a general point process model.

For the multivariate Hawkes process model, model assessment using RTCT only requires evaluating db,i=tb,itb,i1=ti1tiλ(uu;𝜽b)𝑑ud_{b,i}^{*}=t_{b,i}^{*}-t_{b,i-1}^{*}=\int_{t_{i-1}}^{t_{i}}\lambda(u\mid\mathcal{H}_{u};\mbox{\boldmath$\theta$}_{b})du for i=1,,ni=1,\dots,n and for each posterior sample 𝜽b\mbox{\boldmath$\theta$}_{b}. We can obtain the estimated posterior mean d^i\hat{d}_{i}^{*} of did_{i}^{*} with associated uncertainty and examine if it resembles a sample from Exp(1).

4.2 Model adequacy and model comparison

To examine model adequacy, we use the Q-Q plot for {di}\{d_{i}^{*}\} against an Exp(1) distribution. Specifically, we obtain the order statistics {d^(i)}\{\hat{d}_{(i)}^{*}\} of the estimated posterior means {d^i}\{\hat{d}_{i}^{*}\} and plot them against the theoretical quantiles for an Exp(1), i.e., lognni0.5\log\frac{n}{n-i-0.5}. The Q-Q plot can be overlaid with the associated 95% credible band. We look for fidelity to a 45o45^{o} reference line as an assessment of model adequacy.

In this manuscript we consider four different models: (i) NHPP, (ii) NHPP+GP, (iii) NHPP+CC, (iv) NHPP+GP+CC. Visual comparison of the Q-Q plots, with associated uncertainties provides informal model comparison. Using the metric 1ni(d^(i)lognni0.5)2\frac{1}{n}\sum_{i}\big{(}\hat{d}_{(i)}^{*}-\log\frac{n}{n-i-0.5}\big{)}^{2}, which quantifies the mean squared difference between sample and theoretical quantiles, we can assign a measure of model performance associated with the Q-Q plot.

For model comparison we consider DIC = 2logL^+pD\widehat{-2\log L}+p_{D}. The first term is the estimated posterior mean of 2logL(𝜽𝒯)-2\log L(\mbox{\boldmath$\theta$}\mid\mathcal{T}) and rewards model fit. The second term pD=2logL^+logL(𝜽^𝒯)p_{D}=\widehat{-2\log L}+\log L(\hat{\mbox{\boldmath$\theta$}}\mid\mathcal{T}), where 𝜽^\hat{\mbox{\boldmath$\theta$}} is the estimated posterior mean of 𝜽\theta, is the estimated effective number of parameters and penalizes model complexity. The DIC is easily evaluated using posterior samples in the Bayesian framework.

5 Simulation study

The aims of this simulation experiment are to evaluate the capability of fitting models to identify generating models and to demonstrate the ability to distinguish between the GP and CC components of the intensity.

We simulate a dataset from each of the four models (i) NHPP, (ii) NHPP+GP, (iii) NHPP+CC, and (iv) NHPP+GP+CC. We use the actual locations of K=10K=10 MARUs in CCB and consider 5 consecutive days for the observed time window. For each MARU kk, we use μk(t)\mu_{k}(t) to generate a set of contact call times from an NHPP employing the method of Lewis and Shedler, (1979). This requires sampling from an HPP with intensity of maxt(0,T]{μk(t)}\max_{t\in(0,T]}\{\mu_{k}(t)\} and using rejection sampling to obtain the resulting event sequence. Aggregating across the MARUs, we have a set of contact call times. Each up-call time tit_{i} has a mark mim_{i} indicating the MARU that received the up-call. We can order them as usual to create {(ti,mi)}\{(t_{i},m_{i})\}.

Refer to caption
Figure 4: Q-Q plots for d^i\hat{d}^{\ast}_{i}’s against an Exp(1) distribution. Gray shades represent 95% credible bands. Top and bottom labels denote generating and fitting models, respectively.

For Models (iii) and (iv), we generate counter-calls associated with each of the contact calls. For each {(ti,mi)}\{(t_{i},m_{i})\}, we generate the offspring at MARU kk using the counter-call intensity associated with MARU kk, i.e., αmieη(tti)eϕdmi,k\alpha_{m_{i}}e^{-\eta(t-t_{i})}e^{-\phi d_{m_{i},k}}. To do this, we again simulate from an NHPP and use the Lewis and Shedler method to generate events on (ti,T](t_{i},T]. Note that there is no generation of marks; these events all have mark kk. This process is repeated for each kk and all {(ti,mi)}\{(t_{i},m_{i})\} starting from {(t1,m1)}\{(t_{1},m_{1})\}. The entire resulting collection of events becomes a realization of a multivariate Hawkes process. Importantly, this simulation method provides a generating model that agrees with the fitting model described above.

For all but β0k\beta_{0k}, the true values of the model parameters are set to the estimated posterior means from Model (iv) fitted to the multi-channel dataset (as discussed in Section 6.2). We choose β0k\beta_{0k} such that the total number of up-calls is approximately equal to what is observed in the real data.

We fit Models (i) to (iv) to each of the four simulated datasets using Markov chain Monte Carlo (MCMC) run for 100,000 iterations. The first 10,000 posterior samples are discarded as burn-in and the remaining 90,000 are used for posterior inference. We examined trace plots of chains for convergence and no issues were detected.

Table 1: DIC from Models (i) to (iv) fitted to data generated from each of the models. All entries are of order 10410^{4}. Bolds denote the fitting models with the smallest DIC.
Generating \\backslash Fitting (i) NHPP (ii) NHPP+GP (iii) NHPP+CC (iv) NHPP+GP+CC
(i) NHPP 2.24 2.24 2.24 2.24
(ii) NHPP+GP 1.93 1.84 1.89 1.84
(iii) NHPP+CC 2.15 2.09 2.03 2.03
(iv) NHPP+GP+CC 2.18 2.02 1.96 1.94

We use RTCT and DIC to assess how well fitting models can distinguish generating models. Figure 4 displays the posterior mean empirical Q-Q plots for did^{\ast}_{i}’s against an Exp(1) distribution for each combination of generating and fitting models. The points of (i) NHPP follow the 45 reference line well for data generated from (i) NHPP, however they significantly deviate from the reference line for data from the other models. This implies that Model (i) only fits data generated under Model (i). Models (ii) to (iv) seem to fit every dataset fairly well. Their points generally fall on the reference line. This may imply that the GP and CC components are flexible enough to account for variability from other sources. The 95% credible band (CB) of Models (ii) NHPP+GP and (iv) NHPP+GP+CC are wider than the other models, attributing to the extra uncertainty from the GP. Table 1 shows that DIC correctly selects the true model and model(s) that have the true model as a submodel.

Refer to caption
Figure 5: Posterior distributions of the expected total number of up-calls (top), the expected number of contact calls (middle), and the expected number of counter-calls (bottom) from (iii) NHPP+CC (light gray) and (iv) NHPP+GP+CC (dark gray). Facet labels denote generating models. Red dashed vertical lines are the observed values.

We estimate the expected total number of up-calls, expected number of contact calls, and expected number of counter-calls for each scenario. Each model recovers the true expected total number of up-calls well for every dataset. When there are counter-calls in the generated data, Models (i) and (ii), which have no CC component, overestimate the expected number of contact calls to recover the expected total number of up-calls. Figure 5 displays the empirical posterior distributions for the expectations from Models (iii) NHPP+CC and (iv) NHPP+GP+CC. Model (iii) recovers the simulated truth well for datasets generated from itself and its submodel, i.e., Model (i) and (iii). When there is a GP in the generative model, Model (iii) considerably underestimates the expected number of contact calls and overestimates the expected number of counter-calls. This implies that the CC component of Model (iii) inappropriately accounts for variability from the GP component of the intensity. Model (iv) recovers the truth fairly well for every dataset. This implies that the GP component is well distinguished from the CC component of the intensity. See the supplementary material for additional simulation results.

6 Analysis of North Atlantic right whale up-call data

6.1 Single-channel data

First we analyze NARW up-calls observed by a single recorder (Section 2.1). We set the number of MARUs K=1K=1 and fit Models (i)-(iv) described in Section 3.

Refer to caption
Figure 6: Q-Q plots for d^i\hat{d}^{\ast}_{i}’s against an Exp(1) distribution. Gray shades represent 95% credible bands. Facet labels denote models fitted to the single-channel dataset.

We assess model adequacy using the RTCT. Figure 6 displays the posterior mean empirical Q-Q plots for {di,i=1,,n}\{d^{\ast}_{i},i=1,\dots,n\} against an Exp(1) distribution. For Model (i) the points significantly deviate from the 45 reference line, which is not captured by the 95% credible band (CB). This implies that the NHPP is not adequate for this dataset. The 95% CBs of Models (ii) and (iv) are wider than the other models, attributable to the extra uncertainty from the Gaussian process, and do include the reference line. The points of Models (iii) and (iv) follow the line best although they slightly curve away near the upper tail. As for the mean squared difference (MSD) between sample and theoretical quantiles, Model (iv) provides the smallest MSD = 0.006, followed by Model (iii) with the second smallest MSD = 0.012, Model (ii) with the third smallest MSD = 0.08, and Model (i) with the largest MSD = 24.02.

Table 2: DIC from Models (i) to (iv) fitted to the single-channel dataset. We compare DIC up to three significant digits. Bold represents the model with the smallest DIC.
Model 2logL^\widehat{-2\log L} (95% HPD) DIC
(i) NHPP 14159 (14152, 14166) 14167
(ii) NHPP+GP 7646 (7588, 7695) 7874
(iii) NHPP+SE 7834 (7826, 7843) 7843
(iv) NHPP+GP+SE 7452 (7384, 7530) 7558

We use DIC for model comparison. Table 2 shows that Model (iv) has a significantly small value of 2logL-2\log L compared to Models (i) and (iii). This implies that Model (iv) fits the data significantly better than Models (i) and (iii). Model (ii) fits the data similarly well to Model (iv) but has more than twice as large of a model complexity penalty as Model (iv). This may indicate that the GP component of Model (ii) inappropriately accounts for variability from excitement. DIC provides consistent results with MSD. Model (iv) has the smallest DIC, followed by Model (iii) with the second smallest DIC, Model (ii) with the third smallest DIC, and Model (i) with the largest DIC. Collectively, these metrics support our choice of Model (iv) for this dataset and for which we present further model inference.

Refer to caption
Figure 7: Estimated daily harmonic effect over time. Blue shade indicates 95% credible bands. Gray shade indicates 6 p.m. to 6 a.m. in local time.

The estimated posterior mean of the coefficient β1\beta_{1} of the ambient noise variable is -0.49 with 95% HPD interval of (-0.71, -0.28). This significant effect implies that NARWs are less likely to make up-calls during periods with high levels of ambient noise. The observed responses of NARWs to increased levels of noise include changes in sound level (Parks et al.,, 2010; Palmer et al.,, 2022), frequency content (Urazghildiiev,, 2014) and up-calling rate (CWC pers obs.). Here we quantify the reduction in up-calling behavior as background noise levels increases, a response that is consistent with previous work on NARWs in CCB (Urazghildiiev,, 2014) and bowhead whales (Balaena mysticetus) in the Arctic (Blackwell et al.,, 2013). Figure 7 shows the resulting estimated overall daily effect as captured by the harmonics over time. Shown is the posterior estimate of β^2sin(2πt8×60)+β^3cos(2πt8×60)+β^4sin(2πt12×60)+β^5cos(2πt12×60)+β^6sin(2πt24×60)+β^7cos(2πt24×60)\hat{\beta}_{2}\sin(\frac{2\pi t}{8\times 60})+\hat{\beta}_{3}\cos(\frac{2\pi t}{8\times 60})+\hat{\beta}_{4}\sin(\frac{2\pi t}{12\times 60})+\hat{\beta}_{5}\cos(\frac{2\pi t}{12\times 60})+\hat{\beta}_{6}\sin(\frac{2\pi t}{24\times 60})+\hat{\beta}_{7}\cos(\frac{2\pi t}{24\times 60}) over the 24-hour window, which better depicts the diel behaviors than any of the individual coefficient estimates separately. Contact calls tend to increase during the twilight hours between dusk and sunset, as well as around midnight, which has been well documented in NARW across a broad range of habitats (Matthews et al.,, 2001; Matthews and Parks,, 2021).

From this analysis of NARW up-call times of occurrence, we find strong evidence of excitement. The estimated posterior mean of the excitement parameter α\alpha is 0.34 with 95% HPD interval of (0.31, 0.38), which is much greater than 0. The estimated posterior mean of the decay parameter η\eta is 0.51 with 95% HPD interval of (0.44, 0.58). This implies that the expected median response time for NARW up-calls is approximately log(2)0.51=1.36\frac{\log(2)}{0.51}=1.36 minutes, consistent with direct measurements of up-calls from animals tagged with DTAGs in other NARW habitats (Parks et al.,, 2010). During the study period, the estimated posterior mean for the expected total number of up-calls is 5301 with 95% HPD interval of (5162, 5448), which contains the observed total number of up-calls of 5302. The estimated posterior means of the expected number of contact calls and counter-calls are 1735 with 95% HPD interval of (1364, 2066) and 3567 with 95% HPD interval of (3223, 3947), respectively. We observe approximately twice as many counter-calls as contact calls on average.

6.2 Multi-channel CCB data

To study spatio-temporal patterns of NARW up-calls across CCB, we apply Models (i)-(iv) described in Section 3 to the sequence of up-call times received by the array of ten MARUs within the region (Section 2.2).

Refer to caption
Figure 8: Q-Q plots for d^i\hat{d}^{\ast}_{i}’s against an Exp(1) distribution. Gray shades represent 95% credible bands. Facet labels denote models fitted to the multi-channel dataset.

Model adequacy for the multi-channel data is assessed using the RTCT. Figure 8 shows that Model (i) is again inadequate for these NARW up-call data, as evident by the points and 95% credible band (CB) that significantly deviate from the reference line. The points of Models (ii) to (iv) follow the line better, but deviate slightly near the upper tail. As with the results for the single-channel data, the 95% CBs of Models (ii) and (iv) are wide due to the extra uncertainty from the Gaussian process; only these two models completely include the reference line. As for the mean squared difference (MSD) between sample and theoretical quantiles, Models (ii) to (iv) provide the smallest MSD = 0.02 and Model (i) provides the largest MSD = 1.87.

Table 3: DIC from Models (i) to (iv) fitted to the multi-channel dataset. We compare DIC up to three significant digits. Bold represents the model with the smallest DIC.
Model 2logL^\widehat{-2\log L} (95% HPD) DIC
(i) NHPP 24483 (24460, 24508) 24558
(ii) NHPP+GP 22501 (22435, 22564) 22842
iii) NHPP+SE 22130 (22105, 22155) 22196
(iv) NHPP+GP+SE 21712 (21635, 21792) 21908

Table 3 reports the DIC values for model comparison. Notably, Model (iv) has a significantly smaller value of 2logL-2\log L than the other models. Model (iv) also achieves the smallest DIC, indicating this is the preferred model for these data. We again select Model (iv), that with the GP and CC, as our final model for this dataset and present the associated results.

Refer to caption
Figure 9: (a) Estimated posterior means (shaded dots), 95% HPD intervals (thin horizontal bars), and 90% HPD intervals (thick horizontal bars) for the coefficient of the ambient noise per MARU. (b) Estimated diel harmonic effects over time for each MARU. Blue shades are 95% credible bands. Gray shades indicate 6 p.m. to 6 a.m. in local time.

Figure 9 (a) displays the estimated posterior mean and HPD intervals of the noise coefficient β1k\beta_{1k} for each MARU kk. The posterior mean estimate is negative for each MARU, and the coefficient is significantly negative as evident by the 95% HPD for MARUs located in the bottom edge of the array, i.e., MARUs 3, 4, 5, 9, and 10. The 95% HPD intervals of MARU 1 and 7 include zero but have upper bounds very close to zero. MARU 6 has a very wide HPD interval, which we attribute to the small sample size for this MARU; the expected number of unique up-calls at MARU-6 is estimated to be only 26. Figure 9 (b) shows the estimated overall daily effect as captured by the harmonics over the 24-hour window for each MARU. In general, contact calls tend to increase during the twilight hours between dusk and sunset. We observe a significant increase in up-calling during the night for MARUs 1, 7, 8, and 10. Since MARU 10 is close to a shipping lane, it’s possible that up-calling increases at nighttime with lower shipping. NARW have been known to up-call more at night (Matthews et al.,, 2001; Matthews and Parks,, 2021); however because we lack visual observations during this period, we cannot say for sure if this is in response to non-foraging aggregations or distribution changes within and across the bay.

Refer to caption
Figure 10: (a) Estimated posterior mean for the expected number of within-recorder counter-calls received at each MARU. (b) Estimated posterior mean for the expected number of cross-recorder counter-calls received at each MARU. The number of counter-calls is shown proportionally by the size of the circle.

Except for MARU 10, we find strong evidence of excitement in NARW up-calls; the 95% HPD interval of αk\alpha_{k} is greater than zero for k=1,,9k=1,\dots,9. MARU 10 is located in a region with the highest shipping traffic. This might prevent whales from hearing and responding to up-calls and disturb communication between whales. The estimated posterior mean of the temporal decay parameter η\eta is 0.151 with 95% HPD interval of (0.150, 0.154). This implies that the expected median response time for NARW up-calls is approximately log(2)0.151=4.59\frac{\log(2)}{0.151}=4.59 minutes. The estimated posterior mean of the spatial decay parameter ϕ\phi is 0.32 with 95% HPD interval of (0.29, 0.35). The shortest distance between MARUs is 7.1 km. This implies that the probability of a counter-call being received at a MARU at least 7.1km away is approximately P(Exp(0.32)>7.1)=0.10P(\text{Exp}(0.32)>7.1)=0.10.

During the study period, the estimated posterior mean for the expected total number of up-calls is 2716 with 95% HPD interval of (2613, 2820) which contains the observed total number of up-calls of 2750. The model also recovers the observed number of up-calls recorded by each MARU within the 95% HPD intervals. Figure 10 (a) displays the estimated posterior mean for the expected number of within-recorder counter-calls, i.e., counter-calls at MARU kk that were excited by calls at the MARU. Figure 10 (b) shows the estimated posterior mean for the expected number of cross-recorder counter-calls, i.e., counter-calls at MARU kk that were excited by up-calls at MARU \ell for k\ell\neq k. We see that MARU 10 received a relatively small number of within- and cross-recorder counter-calls. MARU 10 is closest to the path of vessels that transit from the Cape Cod Canal to Boston Harbour. It is the loudest MARU (Figure 2), which may indicate that underwater noise caused by shipping can inhibit whales from being able to communicate with each other. Elevated ambient noise can also impact our ability to detect up-calls (Palmer et al.,, 2022). MARUs 1, 4, 5, and 8 received a substantial number of within-recorder counter-calls compared to the others. In contrast to MARU 10, MARU 5 was one of the quietest MARUs (Figure 2). MARUs 1 and 8 are at the top of the array (Figure 1), and thus are likely hearing up-calls that are made outside of CCB as well. CCB is a relatively small area, and sound can be heard across much of the Bay; the average swim speed of NARW is approximately 3.7 km per hour, and the shortest distance between MARUs is 7.1 km. Thus, the excitement across recorders likely indicates communication between different whales, which is consistent with the role of the contact call (Clark and Clark,, 1980; Clark,, 1982).

Refer to caption
Figure 11: Joint posterior distribution of the expected number contact calls and counter-calls received at each MARU.

Figure 11 shows the joint posterior distribution of the expected number contact calls and counter-calls received at each MARU. The majority of MARUs received a similar or greater number of counter-calls compared to contact calls. Uniquely, MARU 3 received twice as many contact calls as counter-calls and MARU 10 received mostly contact calls and very few counter-calls. More detailed evidence of this behavior is presented in the supplementary material. Figure S5 displays the posterior mean estimates of the background intensity and the counter-call intensity. The panel for MARU 10 further depicts the sparsity of expected counter-calls there, reflecting the estimate of α10\alpha_{10} which is essentially 0.

Figure S6 of the Supplementary Material aggregates the two intensity components for each MARU and overlays, with rug tassels, the observed event sequence (Figure 2 (b)). It offers informal/visual evidence of good model performance for the observed data.

7 Summary and future work

We have developed a multivariate Hawkes process model framework to address the current knowledge gap on the communication behavior between NARWs over space and time. The class of models are informed from sequences of event times for NARW up-calls collected at a small network of recorders in Cape Cod Bay, MA. This is the first network level investigation of such up-calls, enabling different inference than what can be obtained from putting acoustic tags on individual animals. As mentioned in the introduction, acoustic data are often collected from three sources: 1) tags deployed on animals; 2) arrays supplemented with visual tracking data; or 3) towed or fixed arrays. Each provides a different snapshot of acoustic behavior. We believe our inferential approach provides an excellent and needed way to examine inter-whale communication across an array.

Our models consider the incidence of contact calls as well as counter-calls using multi-channel excitement models. We specify conditional intensities for each channel in the network which enable contact calls to be described using fixed and random effects and counter-calls using a mutually exciting conditional component. We find that the model incorporating a nonhomegenous Poisson process, a Gaussian process, and counter-calling to be preferred over simpler models. As such, the richer models allow us to uncover subtler patterns of acoustic behavior among NARWs.

One important limitation of our work is that while we infer about whale communication, what we are modeling are up-calls received at the set of recorders. That is, we consider the behavior of received up-calls to be a proxy for the behavior of individual NARW up-calling. Specifically, we are modeling contact calling and counter-calling in the context of the event sequence of received up-calls for the network of recorders. Given the rapid decay specified in exciting behavior and the spatial locations of the recorders, if self or mutual excitement occurs across recorders, we are imagining that this is most plausibly the result of communication between different whales. Another limitation is that we have developed our dataset to ensure that a counter-call is associated with one and only one recorder and we have eliminated delays in up-calls, i.e., the time an up-call was made compared with the time it was received at a recorder. With manual annotation of up-calls (Palmer et al.,, 2022), it may be able to identify the time and location of an up-call, which would likely enrich our understanding of the behavior. Lastly, we do not account for acoustic detection (Palmer et al.,, 2022), which may mean we are missing up-calls from whales that are either farther away from a MARU, or made during periods of elevated ambient noise. Here we worked only with up-calls that were sufficient in strength and number to be associated and localized. All of these issues present opportunities for future work.

Because NARW are not obligate callers (Parks et al.,, 2011), incorporating auxiliary information about their observed behavior from aerial or vessel-based surveys may enable better understanding of state-specific calling rates (Matthews and Parks,, 2021). Methods for estimating absolute abundance using up-call data have assumed that individual up-calling rates are known (Matthews and Parks,, 2021; Schliep et al.,, 2023). Our work can provide a better understanding of the variation in NARW up-calling behavior across time and space. This may help better estimate whales’ abundance and spatiotemporal distribution when employed with a combination of visual and acoustic observations.

Acknowledgements

We thank the participants who attended a workshop immediately following the 2023 Annual Right Whale Consortium Meeting in Halifax, Nova Scotia for ideas and critiques. We thank Doug Gillespie for help with PAMGuard. This work was supported by the US Office of Naval Research Award N000141712817. Additional funding was provided by NOAA Fisheries under award NA20NMF0080246, and SERDP award RC20-1097.


SUPPLEMENTARY MATERIAL

Supplementary material:

Model fitting details, tables and figures.

References

  • Au and Hastings, (2008) Au, W. W. and Hastings, M. C. (2008). Principles of marine bioacoustics, volume 510. Springer.
  • Bass and Clark, (2003) Bass, A. H. and Clark, C. W. (2003). The physical acoustics of underwater sound communication. In Acoustic communication, pages 15–64. Springer.
  • Baumgartner and Mussoline, (2011) Baumgartner, M. F. and Mussoline, S. E. (2011). A generalized baleen whale call detection and classification system. J. Acoust. Soc. Am., 129(5):2889–2902.
  • Blackwell et al., (2013) Blackwell, S. B., Nations, C. S., McDonald, T. L., Greene Jr., C. R., Thode, A. M., Guerra, M., and Michael Macrander, A. (2013). Effects of airgun sounds on bowhead whale calling rates in the Alaskan Beaufort Sea. Marine Mammal Science, 29(4):E342–E365.
  • Bradbury and Vehrencamp, (2011) Bradbury, J. W. and Vehrencamp, S. L. (2011). Principles of animal communication, Second Edition. Sinauer Associates Sunderland, MA.
  • Brown and Nair, (1988) Brown, T. C. and Nair, M. G. (1988). A simple proof of the multivariate random time change theorem for point processes. Journal of Applied Probability, 25(1):210–214.
  • Calupca et al., (2000) Calupca, T. A., Fristrup, K. M., and Clark, C. W. (2000). A compact digital recording system for autonomous bioacoustic monitoring. The Journal of the Acoustical Society of America, 108(Supplement):2582–2582.
  • Clark, (1982) Clark, C. W. (1982). The acoustic repertoire of the Southern right whale, a quantitative analysis. Anim. Behav., 30(4):1060–1071.
  • Clark et al., (2010) Clark, C. W., Brown, M. W., and Corkeron, P. (2010). Visual and acoustic surveys for North Atlantic right whales, Eubalaena glacialis, in Cape Cod Bay, Massachusetts, 2001–2005: Management implications. Marine Mammal Science, 26(4):837–854.
  • Clark and Clark, (1980) Clark, C. W. and Clark, J. M. (1980). Sound playback experiments with Southern right whales (Eubalaena australis). Science, 207(4431):663–665.
  • Clark et al., (1986) Clark, CW., Ellison, WT., and Beeman, K. (1986). A preliminary account of the acoustic study conducted during the 1985 spring bowhead whale, Balaena mysticetus, migration off Point Barrow, Alaska. Report of the International Whaling Commission, 36:311–316.
  • Cusano et al., (2019) Cusano, D. A., Conger, L. A., Van Parijs, S. M., and Parks, S. E. (2019). Implementing conservation measures for the North Atlantic right whale: Considering the behavioral ontogeny of mother-calf pairs. Anim. Conserv., 22(3):228–237.
  • Davis et al., (2017) Davis, G. E., Baumgartner, M. F., Bonnell, J. M., Bell, J., Berchok, C., Bort Thornton, J., Brault, S., Buchanan, G., Charif, R. A., Cholewiak, D., Clark, C. W., Corkeron, P., Delarue, J., Dudzinski, K., Hatch, L., Hildebrand, J., Hodge, L., Klinck, H., Kraus, S., Martin, B., Mellinger, D. K., Moors-Murphy, H., Nieukirk, S., Nowacek, D. P., Parks, S., Read, A. J., Rice, A. N., Risch, D., Širović, A., Soldevilla, M., Stafford, K., Stanistreet, J. E., Summers, E., Todd, S., Warde, A., and Van Parijs, S. M. (2017). Long-term passive acoustic recordings track the changing distribution of North Atlantic right whales (Eubalaena glacialis) from 2004 to 2014. Sci. Rep., 7(1):13460.
  • Davis et al., (2023) Davis, G. E., Tennant, S. C., and Van Parijs, S. M. (2023). Upcalling behaviour and patterns in North Atlantic right whales, implications for monitoring protocols during wind energy development. ICES Journal of Marine Science, page fsad174.
  • DCLDE 2013, (2023) DCLDE 2013 (2023). The North Atlantic right whale annotations from the detection classification localization and density estimate conference in 2013. NOAA National Centers for Environmental Information. https://data.noaa.gov/metaview/page?xml=NOAA/NESDIS/NGDC/MGG/passive_acoustic//iso/xml/DCLDE_2013.xml&view=getDataView.
  • Gelfand et al., (1996) Gelfand, A. E., Sahu, S. K., and Carlin, B. P. (1996). Efficient parametrizations for generalized linear mixed models. Bayesian Statistics 5: Proceedings of the Fifth Valencia International Meeting J M Bernardo (ed.) et al., pages 165–180.
  • Gillespie, (2004) Gillespie, D. (2004). Detection and classification of right whale calls using an ‘edge’ detector operating on a smoothed spectrogram. Canadian Acoustics, 32(2):39–47.
  • Gillespie et al., (2020) Gillespie, D., Palmer, L., Macaulay, J., Sparling, C., and Hastie, G. (2020). Passive acoustic methods for tracking the 3D movements of small cetaceans around marine structures. PLOS ONE, 15(5):e0229058.
  • Hatch et al., (2016) Hatch, L. T., Wahle, C. M., Gedamke, J., Harrison, J., Laws, B., Moore, S. E., Stadler, J. H., and Van Parijs, S. M. (2016). Can you hear me here? Managing acoustic habitat in US waters. Endanger. Species Res., 30:171–186.
  • Helble et al., (2016) Helble, T. A., Henderson, E. E., Ierley, G. R., and Martin, S. W. (2016). Swim track kinematics and calling behavior attributed to bryde’s whales on the navy’s pacific missile range facility. The Journal of the Acoustical Society of America, 140(6):4170–4177.
  • Johnson and Tyack, (2003) Johnson, M. P. and Tyack, P. L. (2003). A digital acoustic recording tag for measuring the response of wild marine mammals to sound. IEEE J. Oceanic Eng., 28(1):3–12.
  • Knowlton et al., (2022) Knowlton, A. R., Clark, J. S., Hamilton, P. K., Kraus, S. D., Pettis, H. M., Rolland, R. M., and Schick, R. S. (2022). Fishing gear entanglement threatens recovery of critically endangered North Atlantic right whales. Conservation Science and Practice, 4(8):e12736.
  • Kraus and Rolland, (2007) Kraus, S. D. and Rolland, R. M. (2007). The Urban Whale: North Atlantic Right Whales at the Crossroads. Harvard University Press.
  • Lewis and Shedler, (1979) Lewis, P. A. and Shedler, G. S. (1979). Simulation of nonhomogeneous Poisson processes by thinning. Naval research logistics quarterly, 26:403–413.
  • Linden, (2023) Linden, D. (2023). Population size estimation of North Atlantic right whales from 1990-2022. US Dept. Commer. Northeast Fish. Sci. Cent. Tech Memo 314. Technical report, NOAA Fisheries.
  • Linderman and Adams, (2014) Linderman, S. and Adams, R. (2014). Discovering Latent Network Structure in Point Process Data. In Proceedings of the 31st International Conference on Machine Learning, pages 1413–1421. PMLR.
  • Matthews et al., (2001) Matthews, J., Brown, S., Gillespie, D., Johnson, M., McLanaghan, R., Moscrop, A., Nowacek, D., Leaper, R., Lewis, T., Tyack, P., et al. (2001). Vocalisation rates of the north atlantic right whale (eubalaena glacialis). J. Cetacean Res. Manage., 3(3):271–282.
  • Matthews and Parks, (2021) Matthews, L. P. and Parks, S. E. (2021). An overview of North Atlantic right whale acoustic behavior, hearing capabilities, and responses to sound. Marine Pollution Bulletin, 173:113043.
  • Mayo et al., (2018) Mayo, C. A., Ganley, L., Hudak, C. A., Brault, S., Marx, M. K., Burke, E., and Brown, M. W. (2018). Distribution, demography, and behavior of North Atlantic right whales ( Eubalaena glacialis ) in Cape Cod Bay, Massachusetts, 1998-2013 : Right Whales in Cape Cod Bay. Mar. Mamm. Sci., 34(4):979–996.
  • Mellinger et al., (2007) Mellinger, D. K., Stafford, K. M., Moore, S. E., Dziak, R. P., and Matsumoto, H. (2007). An Overview of Fixed Passive Acoustic Observation Methods for Cetaceans. Oceanography, 20(4):36–45.
  • Meyer-Gutbrod et al., (2023) Meyer-Gutbrod, E. L., Davies, K. T. A., Johnson, C. L., Plourde, S., Sorochan, K. A., Kenney, R. D., Ramp, C., Gosselin, J.-F., Lawson, J. W., and Greene, C. H. (2023). Redefining North Atlantic right whale habitat-use patterns under climate change. Limnology and Oceanography, 68(S1):S71–S86.
  • Moore et al., (2006) Moore, S. E., Stafford, K. M., Mellinger, D. K., and Hildebrand, J. A. (2006). Listening for Large Whales in the Offshore Waters of Alaska. Bioscience, 56(1):49–55.
  • Nicvert et al., (2024) Nicvert, L., Donnet, S., Keith, M., Peel, M., Somers, M. J., Swanepoel, L. H., Venter, J., Fritz, H., and Dray, S. (2024). Using the multivariate Hawkes process to study interactions between multiple species from camera trap data. Ecology.
  • Palmer et al., (2022) Palmer, K. J., Wu, G.-M., Clark, C., and Klinck, H. (2022). Accounting for the Lombard effect in estimating the probability of detection in passive acoustic surveys: Applications for single sensor mitigation and monitoring. The Journal of the Acoustical Society of America, 151(1):67–79.
  • Parks et al., (2019) Parks, S. E., Cusano, D. A., Van Parijs, S. M., and Nowacek, D. P. (2019). Acoustic crypsis in communication by North Atlantic right whale mother-calf pairs on the calving grounds. Biol. Lett., 15(10):20190485.
  • Parks et al., (2010) Parks, S. E., Johnson, M., Nowacek, D., and Tyack, P. L. (2010). Individual right whales call louder in increased environmental noise. Biology Letters, 7(1):33–35.
  • Parks et al., (2011) Parks, S. E., Searby, A., Célérier, A., Johnson, M. P., Nowacek, D. P., and Tyack, P. L. (2011). Sound production behavior of individual North Atlantic right whales: Implications for passive acoustic monitoring. Endangered Species Research, 15(1):63–76.
  • Parks and Tyack, (2005) Parks, S. E. and Tyack, P. L. (2005). Sound production by North Atlantic right whales (Eubalaena glacialis) in surface active groups. The Journal of the Acoustical Society of America, 117(5):3297–3306.
  • Parks et al., (2009) Parks, S. E., Urazghildiiev, I., and Clark, C. W. (2009). Variability in ambient noise levels and call parameters of North Atlantic right whales in three habitat areas. J. Acoust. Soc. Am., 125(2):1230–1239.
  • Payne and McVay, (1971) Payne, R. S. and McVay, S. (1971). Songs of humpback whales: Humpbacks emit sounds in long, predictable patterns ranging over frequencies audible to humans. Science, 173(3997):585–597.
  • Radford et al., (2014) Radford, A. N., Kerridge, E., and Simpson, S. D. (2014). Acoustic communication in a noisy world: Can fish compete with anthropogenic noise? Behavioral Ecology, 25(5):1022–1030.
  • Rathbun, (1996) Rathbun, S. L. (1996). Asymptotic properties of the maximum likelihood estimator for spatio-temporal point processes. Journal of Statistical Planning and Inference, 51:55–74.
  • Reinhart, (2018) Reinhart, A. (2018). A review of self-exciting spatio-temporal point processes and their applications. Statistical Science, 33:299–318.
  • Richardson et al., (1995) Richardson, W. J., Greene Jr, C. R., Malme, C. I., and Thomson, D. H. (1995). Marine Mammals and Noise. Academic Press.
  • Root-Gutteridge et al., (2018) Root-Gutteridge, H., Cusano, D. A., Shiu, Y., Nowacek, D. P., Van Parijs, S. M., and Parks, S. E. (2018). A lifetime of changing calls: North Atlantic right whales, Eubalaena glacialis, refine call production as they age. Anim. Behav., 137:21–34.
  • Schevill and Watkins, (1962) Schevill, W. E. and Watkins, W. A. (1962). A photograph record. Technical report, Woods Hole Oceanographic Institution. Woods Hole, Mass.
  • Schliep et al., (2023) Schliep, E. M., Gelfand, A. E., Clark, C. W., Mayo, C. M., McKenna, B., Parks, S. E., Yack, T. M., and Schick, R. S. (2023). Assessing marine mammal abundance: a novel data fusion. arXiv preprint arXiv:2310.08397.
  • Tyack and Clark, (2000) Tyack, P. L. and Clark, C. W. (2000). Communication and acoustic behavior of dolphins and whales. In Hearing by whales and dolphins, pages 156–224. Springer.
  • Urazghildiiev, (2014) Urazghildiiev, I. R. (2014). Statistical analysis of North Atlantic right whale (Eubalaena glacialis) signal trains in Cape Cod Bay, spring 2012. The Journal of the Acoustical Society of America, 136(5):2851–2860.
  • Vanderlaan and Taggart, (2007) Vanderlaan, A. S. M. and Taggart, C. T. (2007). Vessel colisions with whales: The probability of lethal injury based on vessel speed. Mar. Mamm. Sci., 23(1):144–156.
  • Watkins and Schevill, (1972) Watkins, W. A. and Schevill, W. E. (1972). Sound source location by arrival-times on a non-rigid three-dimensional hydrophone array. Deep Sea Research and Oceanographic Abstracts, 19(10):691–706.
  • White and Gelfand, (2021) White, P. A. and Gelfand, A. E. (2021). Generalized evolutionary point processes: model specifications and model comparison. Methodology and Computing in Applied Probability, 23:1001–1021.