This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Likelihood-Free Dynamical Survival Analysis Applied to the COVID-19 Epidemic in Ohio

Colin Klaus [Uncaptioned image]    The Ohio State University
Matthew Wascher [Uncaptioned image]
   University of Dayton
Wasiur R. KhudaBukhsh [Uncaptioned image]
   The University of Nottingham
Grzegorz A. Rempała [Uncaptioned image]
   The Ohio State University
Abstract

The Dynamical Survival Analysis (DSA) is a framework for modeling epidemics based on mean field dynamics applied to individual (agent) level history of infection and recovery. Recently, DSA has been shown to be an effective tool in analyzing complex non-Markovian epidemic processes that are otherwise difficult to handle using standard methods. One of the advantages of DSA is its representation of typical epidemic data in a simple although not explicit form that involves solutions of certain differential equations. In this work we describe how a complex non-Markovian DSA model may be applied to a specific data set with the help of appropriate numerical and statistical schemes. The ideas are illustrated with a data example of the COVID-19 epidemic in Ohio.

1 Introduction

As of July, 2022, the coronavirus disease 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has taken more than six million lives worldwide. In response to the pandemic, scientists from all disciplines have made a concerted effort to address the ever growing analytic challenges of prediction, intervention, and control. The mathematical tools employed range from the purely deterministic Ordinary Differential Equation (ODE)/ Partial Differential Equation (PDE) models to describe population dynamics (at the macro or ecological scale) to fully stochastic agent-based models (at the micro scale); from physics-inspired mechanistic models, both stochastic and deterministic, to purely statistical approaches such as ensemble models. However, despite the longstanding history of the discipline of mathematical epidemiology and the enormous recent efforts, the pandemic has laid bare crucial gaps in the state-of-the-art methodology.

While the macro models are simple to interpret and easy to calibrate, the micro agent-based models provide more flexibility to model elaborate what-if scenarios. In a similar vein, the mechanistic models provide insights into the underlying biology and epidemiology of the disease but are often outperformed by purely statistical methods when it comes to accuracy in prediction and forecasting. While there is no obvious way to completely bypass such trade-offs between these often diametrically opposed modeling approaches, the Dynamical Survival Analysis (DSA) method [5, 19, 6, 36, 18, 35, 21, 2, 32], a survival analytic statistical method derived from dynamical systems, holds some promise. The present paper is about a likelihood-free means of performing DSA in the context of non-Markovian models of infectious disease epidemiology. More specifically, we develop a DSA method for a non-Markovian epidemic model with vaccination based on the Approximate Bayesian Computation (ABC) framework.

Why non-Markov models? The standard compartmental Markovian models, which employ Continuous Time Markov Chains to keep track of counts of individuals in different compartments (e.g., individuals with different immunological statuses), assume the infectious period and the contact interval [17, 6] are exponentially distributed and are thus characterized by a constant hazard function. This simplistic assumption almost always misrepresents the underlying biology of the disease and hence, is untenable. See [6, 34] for a detailed discussion on this point. Also, see [17, Table 1 and Figure 1] for a numerical illustration of the bias in the estimates of model parameters if a Markovian model is wrongly assumed when the underlying model is non-Markovian.

A popular and analytically convenient approach to building more realistic non-Markovian models is to make use of the general theory of measure-valued processes that keep track of not only the population counts but also additional covariates such as individual’s age. Here the word “age” is used as an umbrella term to refer to the physical age of the individual, the age of infection, time since vaccination etc. While the measure-valued processes do require more mathematical sophistication than their CTMC counterparts, as we discuss in Section 2, the age-stratified population densities can be described by a comparatively simple system of PDEs in the limit of a large population [6, 13, 15, 30]. The crux of the DSA method is to interpret this mean-field limiting system of PDEs as describing probability distributions of transfer times (the time required to move from one compartment to another). We shall make this notion clear in Section 2.2 and discuss the statistical benefits of the DSA method in Section 6.

As an illustration of the ABC-based DSA method, we apply it to the COVID-19 epidemic in the state of Ohio, USA. The method is shown to fit the real case count data well and capture nontrivial trends. Detailed numerical results are provided in Section 5. In addition to the introduction of ABC-DSA methodology, our second contribution in this paper is a solution method for the mean-field limiting system of PDEs with nonlocal boundary conditions. For the sake of algorithmic implementation, we also present it in the form of a pseudocode. An implementation in the Julia programming language is made available for the wider community.

The rest of the paper is structured as follows: The stochastic non-Markovian epidemic model is described in Section 2. The mean-field limit in the form of a system of PDEs and the DSA methodology are also described in Section 2. In Section 3, we describe the main technical details of how we solve the limiting mean-field PDE system. We describe the statistical approach to parameter inference in Section 4, followed by numerical results in Section 5. Finally, we conclude with a brief discussion in Section 6. For the sake of completeness, additional mathematical details, numerical results, and figures are provided in the Appendix.

2 Non-markovian mass-action model

Our stochastic model is adapted from [6]. As shown in Figure 1, the model has four compartments: Susceptible, Vaccinated, Infectious, and Removed. Individuals are in exactly one of the four compartments. Upon vaccination, susceptible individuals move to the V compartment. We assume only susceptible individuals are vaccinated. Both (unvaccinated) susceptible and vaccinated individuals can get infected, in which case they move to the I compartment. Finally, infected individuals either recover or are removed. In either case, they move to the terminal compartment R. In addition to the counts of individuals in the four compartments, we keep track of the age distribution. Here, the word “age” refers to the physical age for the susceptible individuals, time since vaccination for the vaccinated individuals, time since infection or age of infection for the infected individuals, and finally, time since recovery or removal for the removed individuals. The age of the removed individuals are of no interest because the removed individuals do not contribute to the dynamics. It is possible to include other important covariates, such as sex, comorbidity, in the model. However, for the sake of simplicity, we only keep track of age. Suppose we have nn individuals.

Susceptible (S)Infectious (I)Removed (R)Vaccinated (V)Υ(t)\Upsilon(t)λ(t,s)\lambda(t,s)γ(s)\gamma(s)α(s)\alpha(s)(1α(s))Υ(t)(1-\alpha(s))\Upsilon(t)
Figure 1: Diagrammatic representation of the non-Markovian compartmental model. At time tt, a susceptible individual with age ss moves either to a vaccinated compartment with instantaneous rate λ(t,s)\lambda(t,s), or to the infectious compartment with instantaneous rate Υ(t)0β(s)yI(t,s)ds\Upsilon(t)\coloneqq\int_{0}^{\infty}\beta(s)y_{I}(t,s)\mathrm{d}s. The vaccinated individuals are also subject to infection. An infected individual at age ss can be removed with instantaneous rate γ(s)\gamma(s). The vaccine efficacy α\alpha is assumed age-dependent.

The age distribution of individuals in different compartments is described in terms of finite, point measure-valued processes whose atoms are individual ages. See [6]. Such an approach has been previously used to model age-stratified Birth-Death (BD) processes [33, 9], spatially stratified populations [12], delays in Chemical Reaction Networks [20]. The instantaneous rates of jump are assumed to depend on the individual ages. In particular, a susceptible individual of age ss has an instantaneous rate λ(t,s)\lambda(t,s) of getting vaccinated at time tt. Moreover, a susceptible individual is also subject to an infection pressure exerted by the infectious individuals. We denote the hazard function for the probability distribution of the contact interval [6, 17] by β\beta. Therefore, an infectious individual of age of infection ss makes an infectious contact with a susceptible or vaccinated individual at an instantaneous rate β(s)\beta(s). The hazard function of the probability law of the infectious period is denoted by γ\gamma. The vaccine efficacy is also assumed to depend on the age of the vaccinated individual (time since vaccination). At age ss, a vaccinated individual moves directly to the R compartment at rate unity with probability α(s)\alpha(s), while with probability (1α(s))(1-\alpha(s)), they are subject to the same infection pressure as the susceptible individuals and can get infected at the same instantaneous rate, which is the population sum total, scaled by 1/n1/n, of the β\beta’s evaluated at the individual ages of infection.

In the limit of a large population (nn\to\infty) and under suitable constraints on the hazard functions β,γ,λ\beta,\gamma,\lambda, the measure-valued processes, when appropriately scaled by n1n^{-1}, can be shown to converge to deterministic continuous measure-valued functions [6, 33, 12, 9, 20]. The densities (with respect to the Lebesgue measure) of those limiting measure-valued functions can be then described in terms of a system of PDEs. Let us define

  • yS(t,s)y_{S}(t,s): The density of susceptible individuals with age ss at time tt;

  • yV(t,s)y_{V}(t,s): The density of vaccinated individuals with age (of vaccination) or time since vaccination ss at time tt;

  • yI(t,s)y_{I}(t,s): The density of infectious individuals with age (of infection) or time since infection ss at time tt;

  • yR(t)y_{R}(t): The proportion of removed individuals.

These quantities are taken over rectangular domains that share a tt-axis but, in general, may have different length ss-axes. In particular, the quantities ySy_{S} and λ\lambda are defined over a common domain RSR_{S}. The quantities yVy_{V} and α\alpha are defined over a common domain RVR_{V}, and the quantities yIy_{I}, γ\gamma, and β\beta are defined over a common domain RIR_{I}. When these distinctions are not needed, we subsume these s-axes into the common interval [0,)\left[0,\infty\right).

Analogous to [6], the limiting system can be described as

(t+s)yS(t,s)\displaystyle(\partial_{t}+\partial_{s})\,y_{S}(t,s) =(λ(t,s)+0β(u)yI(t,u)du)yS(t,s),\displaystyle{}=-\left(\lambda(t,s)+\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\right)y_{S}(t,s), (1)
(t+s)yV(t,s)\displaystyle(\partial_{t}+\partial_{s})\,y_{V}(t,s) =(α(s)+(1α(s))0β(u)yI(t,u)du)yV(t,s),\displaystyle{}=-\left(\alpha(s)+(1-\alpha(s))\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\right)y_{V}(t,s),
(t+s)yI(t,s)\displaystyle(\partial_{t}+\partial_{s})\,y_{I}(t,s) =γ(s)yI(t,s)\displaystyle{}=-\gamma(s)y_{I}(t,s)
ddtyR(t)\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}y_{R}(t) =0(α(s)yV(t,s)+γ(s)yI(t,s))ds,\displaystyle{}=\int_{0}^{\infty}\left(\alpha(s)y_{V}(t,s)+\gamma(s)y_{I}(t,s)\right)\mathrm{d}s, (2)

with initial and boundary conditions

yS(t,0)\displaystyle y_{S}(t,0) =0, for all t0,\displaystyle{}=0\mbox{, for all }t\geq 0, (3)
yS(0,s)\displaystyle y_{S}(0,s) =fS(s),\displaystyle{}=f_{S}(s),
yV(t,0)\displaystyle y_{V}(t,0) =0λ(t,s)yS(t,s)ds\displaystyle{}=\int_{0}^{\infty}\lambda(t,s)y_{S}(t,s)\mathrm{d}s (4)
yV(0,s)\displaystyle y_{V}(0,s) =0, for all s0,\displaystyle{}=0\mbox{, for all }s\geq 0,
yI(t,0)\displaystyle y_{I}(t,0) =0yS(t,s)0β(u)yI(t,u)duds+0(1α(s))yV(t,s)0β(u)yI(t,u)duds,\displaystyle{}=\int_{0}^{\infty}y_{S}(t,s)\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\mathrm{d}s+\int_{0}^{\infty}(1-\alpha(s))y_{V}(t,s)\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\mathrm{d}s, (5)
yI(0,s)\displaystyle y_{I}(0,s) =ρfI(s),,\displaystyle{}=\rho f_{I}(s),,
yR(0)\displaystyle y_{R}(0) =0.\displaystyle{}=0. (6)

We assume there are no vaccinated or removed individuals initially. The nonnegative functions fSf_{S} and fIf_{I} describe the initial age distributions of the susceptible and the infected individuals, and satisfy

0fS(s)ds=1,0fI(s)ds=1.\displaystyle\int_{0}^{\infty}f_{S}(s)\mathrm{d}s=1,\quad\int_{0}^{\infty}f_{I}(s)\mathrm{d}s=1.

The parameter ρ\rho is the initial proportion of infected individuals in the population. It is worthwhile to point out that the system is mass conserved, i.e.,

0(yS(t,s)+yV(t,s)+yI(t,s))ds=1+ρyR(t), for all t0.\displaystyle\int_{0}^{\infty}\left(y_{S}(t,s)+y_{V}(t,s)+y_{I}(t,s)\right)\mathrm{d}s=1+\rho-y_{R}(t),\quad\text{ for all }t\geq 0.

A consequence of the above conservation law is that the equation (2) is in fact redundant.

2.1 Continuity constraints on boundary conditions

When we seek solutions of (1)-(5) in the space of continuous functions, the explicit and implicit boundary data must be assigned continuously at the origin. This criterion is not satisfied freely but imposes additional constraints on the equation coefficients and the initial data. We obtain these constraints by equating the expressions for the explicit and implicit data at the origin and derive

fS(0)\displaystyle f_{S}(0) =0,\displaystyle=0, (7)
λ(0,s)\displaystyle\lambda(0,s) =0, for all s0,\displaystyle=0,\text{ for all }s\geq 0,
fI(0)\displaystyle f_{I}(0) =0β(u)fI(u)du.\displaystyle=\int^{\infty}_{0}\beta(u)f_{I}(u)\mathrm{d}u.

The last equality generally reduces the combined degrees of freedom for β\beta and fIf_{I} by one. We note also that the above constraint on λ\lambda could be relaxed to holding over the support of fSf_{S} only.

2.2 DSA perspective of the mean-field PDEs

Before describing how we solve the limiting mean-field PDE system in the next section, let us briefly discuss how DSA interprets the limiting PDEs as probabilistic quantities.

The DSA method [19, 6, 36, 18, 35, 21, 2, 32] combines classical dynamical systems theory and survival analysis to interpret the mean-field limits of scaled population counts or densities as characterizing probability laws of transfer times between compartments. In the Markovian model, this interpretation boils down to treating the mean-field ODEs as satisfying a time inhomogeneous Chapman–Kolmogorov equation for the marginal probability law of a Markov chain on the state space {S,V,I,R}\{\textbf{S},\textbf{V},\textbf{I},\textbf{R}\} describing the time-evolving status of a single individual embedded in an infinitely large population. See [6, Section 3.4] for the standard Susceptible-Infected-Recovered (SIR) model example. In the non-Markovian case, we construct a Markov process on the state space {S,V,I,R}×[0,)\{\textbf{S},\textbf{V},\textbf{I},\textbf{R}\}\times[0,\infty) to keep track of the time-evolving status as well as the age information of an individual embedded in an infinitely large population. As such, DSA interprets the mean-field limiting PDEs as describing the transition kernel for the Markov process. The transition kernel could be used to simulate individual trajectories.

Note that the individual-based Markov process (or chain in the Markovian case) is entirely characterized by the mean-field limiting PDEs (or ODEs in the Markovian case) describing population-level densities (or counts). It is precisely in this sense that DSA turns an ecological model into an agent-based model! Moreover, this agent-based description gives us the following probability measures

μS(t,A)=AyS(t,s)ds1+ρ,μV(t,A)=AyV(t,s)ds1+ρ,μI(t,A)=AyI(t,s)ds1+ρ,\displaystyle\mu_{S}(t,A)=\frac{\int_{A}y_{S}(t,s)\mathrm{d}s}{1+\rho},\quad\mu_{V}(t,A)=\frac{\int_{A}y_{V}(t,s)\mathrm{d}s}{1+\rho},\quad\mu_{I}(t,A)=\frac{\int_{A}y_{I}(t,s)\mathrm{d}s}{1+\rho}, (8)

for Borel subsets AA of [0,)[0,\infty). Here, the quantity μS(t,A)\mu_{S}(t,A) describes the probability of a randomly chosen individual with age in the set A[0,)A\subset[0,\infty) to be in the S compartment at time tt. The other probability measures are interpreted in a similar fashion. Based on a random sample of infection, vaccination and/or recovery times, which are allowed to be censored, truncated or even aggregated over time or individuals, the above probability measures (and the transition kernel) can be used to write an individual-based product-form likelihood function, called the DSA-likelihood. See [6, Section 3.3] for an explicit example of the DSA-likelihood. The DSA-likelihood can be used for likelihood-based approaches to parameter inference, such as Maximum Likelihood Estimate (MLE), or Bayesian methods employing Markov Chain Monte Carlo (MCMC) techniques. In this paper, we focus on an alternative likelihood-free approach based on the ABC method, which we describe in Section 5. In the next section, we describe how we solve the mean-field limiting PDE system.

3 Solving the limiting PDE system

For the remainder of the paper, we will make a simplifying assumption that the initial data fSf_{S} and fIf_{I} are compactly supported. This assumption holds in almost all practical cases of interest. For example individuals are of bounded age, and infection lasts for at most a bounded length of time. Moreover, by a suitable enlargement of the rectangular domains of ySy_{S}, yVy_{V}, and yIy_{I}, respectively RSR_{S}, RVR_{V}, and RIR_{I}, we may also assume without loss of generality that the solutions stay compactly supported in their domains for the time horizon simulated. This follows from the method of characteristics used in Section 3.1 and the forcing terms on the right-hand side of (1) which determine exponential solutions. For concreteness, let us write RS=[0,T]×[0,LS]R_{S}=\left[0,T\right]\times\left[0,L_{S}\right], RV=[0,T]×[0,LV]R_{V}=\left[0,T\right]\times\left[0,L_{V}\right], and RI=[0,T]×[0,LI]R_{I}=\left[0,T\right]\times\left[0,L_{I}\right].

3.1 Solution method

The governing equations, (1)-(5), constitute a quasi-linear PDE system of nonlocal conservation laws. Since the associated directions of their derivative operators are constant and do not intersect, the solutions into the domain interior are naturally suited for construction by the method of characteristics, which recasts the first order PDEs into equivalent flow equations. For more details on this construction, we refer readers to standard references, such as [8, Ch 3] and [7, Ch 7].

We note that the implicit and nonlocal boundary conditions, (4)-(5), are not standard. In order to handle the boundary conditions, we exploit a special feature of this system that remarkably depends on the nonlocal character of the implicit boundary terms. Specifically, we differentiate the implicit boundary conditions in tt and recover a governing flow for them, which only depends on the solution and not on the solution’s derivatives. For example, consider differentiating (4):

tyV(t,0)\displaystyle\partial_{t}y_{V}(t,0) =0LS(tλyS+λtyS)𝑑s=0LS(tλyS+λ(sySλySyS0LIβ(u)yI(t,u)𝑑u))𝑑s\displaystyle=\int^{L_{S}}_{0}\left(\partial_{t}\lambda y_{S}+\lambda\partial_{t}y_{S}\right)ds=\int^{L_{S}}_{0}\left(\partial_{t}\lambda y_{S}+\lambda\left(-\partial_{s}y_{S}-\lambda y_{S}-y_{S}\int^{L_{I}}_{0}\beta(u)y_{I}(t,u)du\right)\right)ds
=[λyS](t,s)|s=0LS+0LS[(tλ+sλ)yS](t,s)𝑑s0LS[λ2yS](t,s)𝑑s\displaystyle=\left.-\left[\lambda y_{S}\right](t,s)\big{|}^{L_{S}}_{s=0}\right.+\int^{L_{S}}_{0}\left[\left(\partial_{t}\lambda+\partial_{s}\lambda\right)y_{S}\right](t,s)ds-\int^{L_{S}}_{0}\left[\lambda^{2}y_{S}\right](t,s)ds
(0LS[λyS](t,s)𝑑s)(0LI[βyI](t,s)𝑑s).\displaystyle\;\;\;\;-\left(\int^{L_{S}}_{0}\left[\lambda y_{S}\right](t,s)ds\right)\left(\int^{L_{I}}_{0}\left[\beta y_{I}\right](t,s)ds\right).

In the second equality, we have substituted for tyS\partial_{t}y_{S} using the ySy_{S} equation of (1), and in the third equality, we performed an integration by parts. Continuing we may use the compactly supported initial data to conclude that the boundary term at s=LSs=L_{S} is vanishing and substitute (3) into the boundary term at s=0s=0. For the particular case of yVy_{V}, we see both these terms are vanishing. A similar argument works for tyI(t,0)\partial_{t}y_{I}(t,0). Below we present the resulting evolution equations for the solution’s boundary values. Of course, yS(t,0)0y_{S}(t,0)\equiv 0 by (3), so we need only consider the remaining two:

tyV(t,0)\displaystyle\partial_{t}y_{V}(t,0) =0LSyS(t+s)λ𝑑s0LSλ2yS𝑑s(0LSλyS𝑑s)(0LIβyI𝑑s),\displaystyle=\int^{L_{S}}_{0}y_{S}(\partial_{t}+\partial_{s})\lambda ds-\int^{L_{S}}_{0}\lambda^{2}y_{S}ds-\left(\int^{L_{S}}_{0}\lambda y_{S}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right), (9)
tyI(t,0)\displaystyle\partial_{t}y_{I}(t,0) =(0LIβyIds)[0LSλySds(0LSySds)(0LIβyIds)0LVyV(t+s)αds\displaystyle=\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)\left[-\int^{L_{S}}_{0}\lambda y_{S}ds-\left(\int^{L_{S}}_{0}y_{S}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)-\int^{L_{V}}_{0}y_{V}(\partial_{t}+\partial_{s})\alpha ds\right.
+[(1α(t,0))yV(t,0)]0LV(1α)αyVds(0LV(1α)2yVds)(0LIβyIds)]\displaystyle\left.\;\;\;\;\;\;+\left[\big{(}1-\alpha(t,0)\big{)}y_{V}(t,0)\right]-\int^{L_{V}}_{0}\left(1-\alpha\right)\alpha y_{V}ds-\left(\int^{L_{V}}_{0}\left(1-\alpha\right)^{2}y_{V}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)\right]
+(0LSyS𝑑s+0LV(1α)yV𝑑s)(0LIyI(t+s)β𝑑s+β(0)yI(t,0)0LIβγyI𝑑s).\displaystyle\;\;+\left(\int^{L_{S}}_{0}y_{S}ds+\int^{L_{V}}_{0}(1-\alpha)y_{V}ds\right)\left(\int^{L_{I}}_{0}y_{I}(\partial_{t}+\partial_{s})\beta ds+\beta(0)y_{I}(t,0)-\int^{L_{I}}_{0}\beta\gamma y_{I}ds\right).

Note that in (9), it is understood that all integrands are evaluated at (t,s)(t,s) and integrated in ss. In our particular application, we also have (t+s)α=sα\left(\partial_{t}+\partial_{s}\right)\alpha=\partial_{s}\alpha and similarly for the hazard function β\beta. We have chosen to present (9) in generality to highlight the underlying structure of (1)-(5). Note also, the validity of the equations (9) depends on the compact support of the initial data. Otherwise, these flow equations would need to contain additional boundary terms at the right-end points of the s-axes.

Our solution method of (1)-(5) now consists of coupling (1) in the interior domain together with the alternative flow formulation (9) of the implicit boundary conditions at the [s=0]\left[s=0\right] boundary. The explicit boundary data assigned at [t=0]\left[t=0\right] in (3)-(5) becomes this system’s initial data. For the benefit of readers mostly interested in applications, we defer a rigorous analysis of this alternative formulation and its equivalence with the original one to Appendix B.1. Also, in this investigation, we assume equation coefficients in classical function spaces and classical notions of solvability. Future work may investigate deeper questions of equivalence between the original and flow formulations when equation coefficients and the solution belong to suitably identified Sobolev spaces. For more information on the distinctions between classical solutions and weak solutions in Sobolev spaces, we refer again to [8, 7]. We present a numerical demonstration of this method approximating the implicit boundary conditions, (4)-(5), in Section B.2.

3.2 Numerical implementation

For numerically approximating (1) and (9), we take a semi-discrete approach where we discretize the space axis, ss, into a number of uniform intervals whose intersection points are called nodes. The number of nodes determines the mesh resolution. We then approximate each component of the true solution – ySy_{S}, yVy_{V}, and yIy_{I} – by its own nodal basis expansion

y(t,s)i=1nyi(t)ϕi(s).y(t,s)\approx\sum^{n}_{i=1}y_{i}(t)\phi_{i}(s).

The {ϕi(s)}i\left\{\phi_{i}(s)\right\}_{i} functions are piecewise linear, C0C^{0} basis splines determined by a given mesh resolution. Each basis spline is associated with an ss-axis node, sis_{i}, and is defined by taking the value 1 at that node and 0 at all others. At each tt-level, we thus approximate the true solution with a linear, interpolant spline whose value at (t,si)(t,s_{i}) is given by yi(t)y_{i}(t). Integrals of the true solution and its derived quantities are approximated by taking integrals of their corresponding linear, interpolant splines by trapezoidal rule.

The {yi(t)}i\left\{y_{i}(t)\right\}_{i} evolve according to the flows of (1) and (9). We initialize at the t=0t=0 level by projecting the initial data onto the interpolant basis. This is accomplished by sampling those functions at the nodes. We then use an explicit Euler scheme to extend the solution up to the next time step, t+Δtt+\Delta t along the boundary axis where the implicit data is prescribed according to the ODEs of (9). This determines the first value y1(t+Δt)y_{1}(t+\Delta t) at the node s=0s=0. For the remaining nodal values, each of their nodes originate along a characteristic from either the previous tt-level or within that part of the boundary axis that was previously approximated in computing y1y_{1}. We now evolve each of those originating, solution values by the method of characteristics along the ODEs that correspond to (1) by a second explicit Euler scheme to obtain the remaining {yi(t+Δt)}i=2n\left\{y_{i}(t+\Delta t)\right\}^{n}_{i=2}. By iterating this process, we construct a numerical approximation over the desired time interval for simulation.

In practice, we use an adaptive Euler step to improve solution accuracy. We specify tolerances atol and rtol, which are respectively the absolute and the relative tolerances set by the user. The solver approximates the solution using a single Euler step of size Δt\Delta t and two consecutive smaller Euler steps of size Δt/2\Delta t/2. The solver then checks if at each node the difference between the two Euler approximations satisfies either the atol or rtol thresholds. It does this similarly for the values of the nonlocal integrals. It accepts the step if all quantities meet this criterion, or else it halves the step size and repeats this process. Note that in evaluating the absolute error, we take the magnitude difference in the Euler solutions and then scale by the length of their ss-axes. This is done because, in general, ySy_{S}, yVy_{V}, and yIy_{I} are defined on ss-axes of different lengths, while probability densities vary inversely with the length of their domain. By this adjustment, we normalize the errors in ySy_{S}, yVy_{V}, and yIy_{I} to a common scale.

Below we also provide a pseudocode of the algorithm described in this section.

Pseudocode 1.

Numerical solver for (1) and (9)

  1. 1.

    Initialize the solution at t=0t=0 by storing values of the initial data at the nodes,

  2. 2.

    Propagate from y1(t)y_{1}(t) to y1(t+Δt)y_{1}(t+\Delta t) by taking an explicit Euler step of size Δt\Delta t along (9),

  3. 3.

    Propagate from {yi(t)}i=1n\left\{y_{i}(t)\right\}^{n}_{i=1} and y1(t+Δt)y_{1}(t+\Delta t) to {yi(t+Δt)}i=2n\left\{y_{i}(t+\Delta t)\right\}^{n}_{i=2} by taking Euler steps of size at most 2Δt\sqrt{2}\Delta t along the characteristics of (1),

  4. 4.

    Repeat Steps 2-3 for two consecutive Euler steps with Δt=Δt/2\Delta t=\Delta t/2,

  5. 5.

    If the y-nodal values and integral quantities computed by the single and two-half Euler steps satisfy either of the absolute and relative error thresholds, atol and rtol, then accept the Step 4 solution at t+Δtt+\Delta t by storing its nodals and continue with Δt=2Δt\Delta t=2\Delta t. Otherwise, repeat steps 2-4 with Δt=Δt/2\Delta t=\Delta t/2.

Code availability

We provide an implementation of the algorithms described in this paper in the Julia programming language along with jupyter notebooks and html at [22]. Additional information on Julia may be found at [3].

4 Model parameters

For the vaccine efficacy, α\alpha, we begin with a function that linearly increases from 0 to a maximum of αeff\alpha_{\mathrm{eff}} over a period of αL\alpha_{L} days. After αL\alpha_{L}, it was taken to be constant. For the contact interval and infectious period hazard functions, β\beta and γ\gamma, we choose Weibull distributions as the Weibull distribution has been shown to be a flexible choice in modelling epidemics [6]. Therefore, we have

α(s)\displaystyle\alpha(s) min(s,αL)αL×αeff,s0,\displaystyle{}\coloneqq\frac{\min(s,\alpha_{L})}{\alpha_{L}}\times\alpha_{\mathrm{eff}},\quad s\geq 0,
β(s)\displaystyle\beta(s) βαβθ(sβθ)βα1,s0,\displaystyle{}\coloneqq\frac{\beta_{\alpha}}{\beta_{\theta}}\left(\frac{s}{\beta_{\theta}}\right)^{\beta_{\alpha}-1},\quad s\geq 0,
γ(s)\displaystyle\gamma(s) γαγθ(sγθ)γα1,s0.\displaystyle{}\coloneqq\frac{\gamma_{\alpha}}{\gamma_{\theta}}\left(\frac{s}{\gamma_{\theta}}\right)^{\gamma_{\alpha}-1},\quad s\geq 0.

Further, we smoothed α\alpha by a moving integral average to ensure its derivative was everywhere classical. We took the density for the initial infected population, fIf_{I}, to be approximately uniform over a period of fourteen days subject to the further requirement that it should be compactly supported and continuous. For this purpose, fIf_{I} was constant up to day 13.25 and then linearly decayed to 0 by day 13.75. As with α\alpha, we smoothed fIf_{I} by an integral moving average.

We inferred the parameters in (1)-(5) using COVID-19 epidemic data for the US state of Ohio, [28], during the time period Nov 15, 2020- Jan 15, 2021. We directly estimated the hazard rate for vaccination for this same time period from the Ohio Department of Health (ODH) and Centers for Disease Control and Prevention (CDC) data, [28, 10], using b-splines. We also directly estimated the density for the initial susceptible population, fSf_{S}, consistent with US census information on age distributions in Ohio, [4].

4.1 Empirical estimates

4.1.1 The distribution for the initial susceptible population, fSf_{S}

Figure 2 shows an empirical density for age distributions in the state of Ohio partitioned into ten year age groups. This data suggested a simple piecewise linear characterization of fSf_{S}, which is also shown. Additionally, we normalize fSf_{S} so that it integrates to one.

Refer to caption
Figure 2: An empirical density for the age distributions in Ohio partitioned into ten year brackets. From this we extracted a probability density curve, fSf_{S}. Later we additionally make this density decay to 0 at the origin consistent with (7).

4.1.2 The hazard rate for time to vaccination λ\lambda

Figure 3 shows the empirical, cumulative vaccination doses given in Ohio from Nov 15th, 2020 - Jan 15th, 2021 (top panel) broken down by age group. These age-breakdowns were estimated from the vaccination counts provided by the ODH and CDC, [28, 10]. We observe that vaccine rollout did not begin until Dec 15th. It also shows the derived empirical estimations of the hazard rate for vaccination, λ\lambda, from this data (bottom two panels). In order to do this, we measured an empirical survival function for vaccination using the cumulative vaccine doses curves and the age demography in Ohio and then applied to these values the negative ln\ln-transformation. These data points were then fit by least squares with C2C^{2} cubic splines, and their pointwise derivatives corresponded to the desired vaccination hazard rate. For more information on computing with splines, we refer to [29]. These several curves, parameterized in tt, were combined into a single λ(t,s)\lambda(t,s) function by using smoothed linear transitions in the age-variable ss across age-groups.

Refer to caption
Figure 3: (Top) Cumulative vaccine doses administered in Ohio. (Bottom left) From the cumulative vaccine doses and the Ohio population’s age distribution, we log transformed the empirical vaccine survival function. The empirical data is shown as dots. These points were then fit by least-squares cubic splines joined together with C2C^{2} smoothness which are also shown. The slopes of these curves represent the vaccine hazard rate. (Bottom right) The nonparametric hazard rates, computed as the derivatives of the spline curves in the adjacent panel, are shown.

4.2 Estimation by Approximate Bayesian Computation

The remaining model parameters were not explicitly given by data, and we used instead an ABC scheme to estimate their values. The goal of Bayesian inference is to obtain samples from the posterior distribution of the parameters, which is proportional to a prior distribution on the parameters times the likelihood of the data given the parameters. However, in many cases, it is difficult or expensive to compute or even sample from the posterior. In particular, many methods for sampling from the posterior require computing the likelihood, which may itself be intractable.

ABC provides a method to approximate posterior samples without computing the likelihood. The simplest way to perform ABC is the rejection sampling method, which goes as follows: First, one proposes a vector of parameters from the joint prior. Next, one simulates data (or some summary statistic thereof) from the model using this vector of parameters. Finally, one compares this simulated data to the observed data using some distance metric and either accepts the proposed vector of parameters as a sample from an approximation of the posterior or rejects it depending on how close it is to the observed data. What is sufficiently close for acceptance may be determined by an absolute threshold or by retaining some proportion of the best samples. The intuition is that a vector of parameters with higher likelihood will more often produce simulated data that is close to the empirical data, and so ABC approximates the usual acceptance procedure based on the likelihood ratio that is often used in MCMC. For a more thorough discussion of ABC, we direct the reader to [25, 31].

We proposed the vector of parameters (βα,γα,γθ,ρ,αeff)(\beta_{\alpha},\gamma_{\alpha},\gamma_{\theta},\rho,\alpha_{\textrm{eff}}) by drawing each quantity independently from a uniform prior with the bounds given in Table 1. We then accepted it if the associated infectious period was less than three weeks with 99.9% probability and if the mean contact interval was less than the mean infectious period. The parameter αL\alpha_{L} was fixed to be two weeks. The parameter βθ\beta_{\theta} was not proposed, and hence, does not have a prior, since its value was determined from the other parameters. This followed from the last constraint in (7). In fact, for the particular pairing of a Weibull distribution for the contact interval and a uniform distribution for the initial infected, it follows from a little algebra that (7) implies βθ\beta_{\theta} is the length of the support of fIf_{I} and independent of the other parameters.

We then used the PDE model to generate a predicted incidence trajectory for the Nov 15, 2020 - Jan 15, 2021 time period and compared the predicted trajectory to the empirical trajectory using the root mean square error ( Root Mean Square Error (RMSE)). Note that the prevalence at time tt is given by n0yI(t,s)dsn\int_{0}^{\infty}y_{I}(t,s)\mathrm{d}s, where nn is the total population size. We generated 5000 ABC sample trajectories in this fashion and retained the 10%10\% of sample trajectories with the lowest RMSE. The results are shown in Figure 4.

5 Numerical results

Here we demonstrate how the PDE-DSA model can be used in conjunction with with the ABC method to infer model parameters from epidemic data. As discussed in Section 4.2, the ABC method does not require an explicit likelihood but instead uses a computable error between synthetic and true data values to assess the quality of proposed parameter values. Aggregate population-level counts of infection are prototypical data that would be used with DSA, and so we apply this approach to daily reported incidence supplied by the ODH during the period of Nov 15, 2020, - Jan 15, 2021. This period encompassed the epidemic wave promoted by the rise of the COVID-19 α\alpha-variant [11] as well as the start of vaccine roll-out in Ohio.

The primary parameters of interest were for the contact interval characterized by β\beta and the infectious period characterized by γ\gamma, along with estimates of the initial amount of infection ρ\rho. The vaccine efficacy parameter αeff\alpha_{\mathrm{eff}}, while also relevant, would not be identifiable due to still low rate of vaccine administration during the time window of interest. Figure 4 shows the model predictions together with the ODH reported daily incidence. The best fit obtained across all 5000 ABC samples captures the nontrivial empirical trends. Table 1 lists the model parameters with their best fit values and the ABC posterior credible intervals. The posterior distributions correspond to the parameter values that produced trajectories within the shaded bands of Figure 4. In Figure 5, we see that the mean time before transmission was approximately estimated to be between (12.25,13.0)\left(12.25,13.0\right) days while the mean time to recovery was approximately estimated to be between (12.25,15.0)\left(12.25,15.0\right) days. These estimates are in line with estimates from similar studies reported in the literature [6].

Refer to caption
Figure 4: The PDE model fit to the ODH daily incidence data as calibrated by ABC. A seven day moving average of the ODH data is shown as a dashed orange line. The solid blue line corresponds to the model’s best fit observed across 5000 ABC samples, while the blue band represents the best 10% of all ABC sampled trajectories as measured by the RMSE between the model prediction and empirical incidence.

In the second panel of Figure 5, we give the posterior distribution for the basic reproduction number R0R_{0}. This quantity is computed using the formula R0=0Sγ(s)β(s)dsR_{0}=\int^{\infty}_{0}S_{\gamma}(s)\beta(s)\mathrm{d}s as in [19], where Sγ(s)S_{\gamma}(s) is the survival function of the probability distribution characterized by the hazard rate γ\gamma. Note that this formula ignores vaccination, and therefore, is an upper bound on the true basic reproduction number. The model estimated a mean of R0=1.28R_{0}=1.28 with a 95% credible interval of (0.89,1.94)\left(0.89,1.94\right), which is consistent with other studies of basic reproduction numbers associated with the COVID-19.

Parameter Unit Description
Value/
Prior
Best
Fit
ABC Posterior
95% Credible Interval
Contact interval
βα\beta_{\alpha} Weibull shape parameter (5,20)\left(5,20\right)^{*} 5.24 (5.06,9.55)\left(5.06,9.55\right)
βθ\beta_{\theta} day Weibull scale parameter 13.5038 (13.5036,13.5076)\left(13.5036,13.5076\right)
Infectious period
γα\gamma_{\alpha} Weibull shape parameter (2,8)\left(2,8\right)^{\ast} 6.05 (4.98,7.94)\left(4.98,7.94\right)
γθ\gamma_{\theta} day Weibull scale parameter (1,18)\left(1,18\right)^{*} 13.6 (13.43,15.03)\left(13.43,15.03\right)
Starting infection
ρ\rho %
mass of the initial
infected population
relative the size of the
susceptible population
(0.1,5)\left(0.1,5\right) 4.48 (0.79,4.95)\left(0.79,4.95\right)
Vaccine
αL\alpha_{L} day
time for vaccine to achieve
full efficacy
14
αeff\alpha_{\mathrm{eff}} % vaccine blocking efficacy (70,100)\left(70,100\right) 97.0 (70.7,99.2)\left(70.7,99.2\right)
Table 1: Model parameters used to fit the ODH daily case data from Nov 15, 2020 - Jan 15, 2021. We additionally report their assigned ABC priors, best fit values, and ABC posterior credible intervals from the numerical simulations presented in Section 5. The ABC posteriors were obtained by conditioning 5000 ABC samples to the best 10% of observed RMSE between the ODH daily incidence and model prediction. The priors for β\beta and γ\gamma are listed with * since, in addition to these uniform ranges, we further conditioned on parameters where the infectious period lasted no more than 21 days with 99.9% probability and where the mean contact interval was less than the mean infectious period. Moreover, a prior for βθ\beta_{\theta} is not given since (7) determines βθ=βθ(βα,fI)\beta_{\theta}=\beta_{\theta}(\beta_{\alpha},f_{I}). The narrow posterior of βθ\beta_{\theta} was expected as reasoned in Section 4.2.
Refer to caption
Figure 5: (Left) Posterior distribution of the mean contact interval compared with the posterior distribution for the mean infectious period. (Right) Posterior distribution for R0R_{0}.

6 Discussion

The COVID-19 pandemic has spurred the development of a cottage industry of ever-more elaborate mathematical models of epidemic dynamics that have been applied to empirical data across the world with varying degree of success [24, 27, 14, 16, 1]. Since most of the current and historic COVID-19 data have been available in aggregate, the models tend to focus on aggregate behavior which may sometimes lead to erroneous insights [23]. The DSA method discussed here offers a different modeling approach and in particular accounts more fully that a typical compartmental model for the heterogeneity of individual behaviors.

Indeed, even since its introduction in [19] on the eve of an outbreak of the global COVID-19 pandemic, the DSA approach has been shown to be a viable way of analyzing epidemic data in order to predict epidemic progression, evaluate the long term effects of public health policies (testing, vaccination, lock downs, etc.) and individual-level decisions (masking, social distancing, vaccine hesitancy) [32, 35, 21, 36, 18]. As it was argued in [19], the DSA method has several advantages over more traditional approaches to analyzing data in SIR-type epidemic curves. Perhaps the most important one is that the method does not require the knowledge of the full curve trajectory or the size of the total susceptible population. Indeed, such quantities are rarely known in practice and often require ad-hoc adjustments leading to severely biased analysis [17].

In this work we discuss a relatively simple yet powerful non-Markovian extension of the original DSA method formulation introduced in [19]. This extension allows one to take into account the additional heterogeneity of the transmission patterns due to both the changes in infectiousness over the individuals’ infectious periods and the changes in immunity over the individuals’ periods of vaccine-derived protection. However, the practical price to pay for these modeling improvements is that a more elaborate numerical scheme is required to evaluate the DSA model along with its increased computational cost to fit empirical data. Here we have chosen to apply a likelihood-free ABC approach, which allows us to avoid the large numerical overhead usually associated with DSA likelihood-based methods [6]. This is accomplished by pregenerating parameter samples and then running simulations in parallel using modern multi-core capabilities. This capacity of ABC for parallel processing helps also mitigate the computational burden of a non-Markovian DSA, namely evaluating its nonlocal and implicit boundary conditions and associated flow formulation of (9). The analytic properties of (1) and (9) are of intrinsic interest in their own right, as a deeper understanding of solution regularity could help identify numerical schemes of higher and optimal convergence order. We hope to further pursue these lines of investigation in the future. As seen from the numerical examples in Section 5, using ABC we were able to fit the highly irregular model of Ohio COVID-19 epidemic with proper accounting for various sources of uncertainty in all of the relevant model parameters.

Appendix A Acronyms

ABC
Approximate Bayesian Computation
ABM
Agent-based Model
BA
Barabási-Albert
BD
Birth-Death
CDC
Centers for Disease Control and Prevention
CDF
Cumulative Distribution Function
CLT
Central Limit Theorem
CM
Configuration Model
CME
Chemical Master Equation
CRM
Conditional Random Measure
CRN
Chemical Reaction Network
CTBN
Continuous Time Bayesian Network
CTMC
Continuous Time Markov Chain
DSA
Dynamical Survival Analysis
DTMC
Discrete Time Markov Chain
DRC
Democratic Republic of Congo
ER
Erdös-Rényi
ESI
Enzyme-Substrate-Inhibitor
FCLT
Functional Central Limit Theorem
FLLN
Functional Law of Large Numbers
HJB
Hamilton–Jacobi–Bellman
iid
independent and identically distributed
IPS
Interacting Particle System
KL
Kullback-Leibler
LDP
Large Deviations Principle
LLN
Law of Large Numbers
LNA
Linear Noise Approximation
MAPK
Mitogen-activated Protein Kinase
MCMC
Markov Chain Monte Carlo
MGF
Moment Generating Function
MLE
Maximum Likelihood Estimate
MM
Michaelis-Menten
MPI
Message Passing Interface
MSE
Mean Squared Error
ODE
Ordinary Differential Equation
ODH
Ohio Department of Health
PDE
Partial Differential Equation
PDF
Probability Density Function
PGF
Probability Generating Function
PGM
Probabilistic Graphical Model
PMF
Probability Mass Function
psd
positive semi-definite
PT
Poisson-type
QSSA
Quasi-Steady State Approximation
rQSSA
reversible QSSA
RMSE
Root Mean Square Error
SD
Standard Deviation
SEIR
Susceptible-Exposed-Infected-Recovered
SI
Susceptible-Infected
SIR
Susceptible-Infected-Recovered
SIS
Susceptible-Infected-Susceptible
sQSSA
standard QSSA
ssLNA
Slow-scale Linear Noise Approximation
tQSSA
total QSSA
WS
Watts-Strogatz
whp
with high probability

Acknowledgments

WRK acknowledges financial support from the London Mathematical Society (LMS) under a Scheme 4 grant (Ref No: 42118). GAR acknowledges support from the National Science Foundation under grants DMS-2027001 and DMS-1853587.

Conflict of interest

The authors declare no conflict of interest.

References

  • [1] N. Barda, D. Riesel, A. Akriv, J. Levy, U. Finkel, G. Yona, D. Greenfeld, S. Sheiba, J. Somer, E. Bachmat, G. N. Rothblum, U. Shalit, D. Netzer, R. Balicer and N. Dagan, Developing a COVID-19 mortality risk prediction model when individual-level data are not available, Nat Commun, 11 (2020), 4439.
  • [2] C. D. Bastian and G. A. Rempala, Throwing stones and collecting bones: Looking for poisson-like random measures, Mathematical Methods in the Applied Sciences, 43 (2020), 4658–4668.
  • [3] J. Bezanson, A. Edelman, S. Karpinski and V. B. Shah, Julia: A fresh approach to numerical computing, SIAM Review, 59 (2017), 65–98.
  • [4] U. Census, County Population Totals 2010-2019, URL {https://www.census.gov/data/datasets/time-series/demo/popest/2010s-counties-total.html}, Accessed: September 13, 2021.
  • [5] B. Choi, S. Busch, D. Kazadi, B. Ilunga, E. Okitolonda, Y. Dai, R. Lumpkin, O. Saucedo, W. R. KhudaBukhsh, J. Tien, E. Kenah and G. Rempala, Modeling outbreak data: Analysis of a 2012 Ebola virus disease epidemic in DRC, Biomath, 8 (2019).
  • [6] F. Di Lauro, W. R. KhudaBukhsh, I. Z. Kiss, E. Kenah, M. Jensen and G. A. Rempała, Dynamic survival analysis for non-markovian epidemic models, Journal of The Royal Society Interface, 19 (2022), 20220124.
  • [7] E. DiBenedetto, Partial differential equations, 2nd edition, Cornerstones, Birkhäuser Boston, Ltd., Boston, MA, 2010.
  • [8] L. C. Evans, Partial differential equations, vol. 19 of Graduate Studies in Mathematics, 2nd edition, American Mathematical Society, Providence, RI, 2010.
  • [9] R. Ferrière and V. C. Tran, Stochastic and deterministic models for age-structured populations with genetically variable traits, in CANUM 2008, vol. 27 of ESAIM Proc., EDP Sci., Les Ulis, 2009, 289–310.
  • [10] C. for Disease Control and P. (CDC), US Centers for Disease Control and Prevention: COVID-19 vaccinations in the United States, County, URL {https://data.cdc.gov/Vaccinations/COVID-19-Vaccinations-in-the-United-States-County/8xkx-amqh}, Accessed: September 8th, 2021.
  • [11] C. for Disease Control and P. (CDC), US Centers for Disease Control and Prevention: SARS-CoV-2 Variant Classifications and Definitions, URL {https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-classifications.html}, Accessed: July 13, 2022.
  • [12] N. Fournier and S. Méléard, A microscopic probabilistic description of a locally regulated population and macroscopic approximations, The Annals of Applied Probability, 14 (2004), 1880–1919.
  • [13] E. Franco, M. Gyllenberg and O. Diekmann, One dimensional reduction of a renewal equation for a measure-valued function of time describing population dynamics, Acta Applicandae Mathematicae, 175 (2021), 12.
  • [14] I. Holmdahl and C. Buckee, Wrong but useful—what covid-19 epidemiologic models can and cannot tell us, New England Journal of Medicine, 383 (2020), 303–305.
  • [15] J. M. Hyman and J. Li, Infection-age structured epidemic models with behavior change or treatment, Journal of Biological Dynamics, 1 (2007), 109–131.
  • [16] N. P. Jewell, J. A. Lewnard and B. L. Jewell, Predictive Mathematical Models of the COVID-19 Pandemic: Underlying Principles and Value of Projections, JAMA, 323 (2020), 1893–1894, URL https://doi.org/10.1001/jama.2020.6585.
  • [17] E. Kenah, Contact intervals, survival analysis of epidemic data, and estimation of R0R_{0}, Biostatistics, 12 (2011), 548–566.
  • [18] W. R. KhudaBukhsh, C. D. Bastian, M. Wascher, C. Klaus, S. Y. Sahai, M. H. Weir, E. Kenah, E. Root, J. H. Tien and G. A. Rempala, Projecting COVID-19 Cases and Subsequent Hospital Burden in Ohio, 2022, medRxiv, URL https://www.medrxiv.org/content/10.1101/2022.07.27.22278117v1.full.pdf+html.
  • [19] W. R. KhudaBukhsh, B. Choi, E. Kenah and G. A. Rempała, Survival dynamical systems: individual-level survival analysis from population-level epidemic models, Interface Focus, 10 (2020).
  • [20] W. R. KhudaBukhsh, H.-W. Kang, E. Kenah and G. Rempała, Incorporating age and delay into models for biophysical systems, Physical Biology, 18 (2021).
  • [21] W. R. KhudaBukhsh, S. K. Khalsa, E. Kenah, G. A. Rempala and J. H. Tien, COVID-19 dynamics in an Ohio prison, 2021, medRxiv, URL https://www.medrxiv.org/content/early/2021/01/15/2021.01.14.21249782.
  • [22] C. Klaus, PDE-DSA github repository, 2022, URL {https://github.com/klauscj68/PDE-Vax}.
  • [23] C. Klaus, M. Wascher, W. R. KhudaBukhsh, J. H. Tien, G. A. Rempała and E. Kenah, Assortative mixing among vaccination groups and biased estimation of reproduction numbers, The Lancet Infectious Diseases, 22 (2022), 579–581.
  • [24] K. Koelle, M. A. Martin, R. Antia, B. Lopman and N. E. Dean, The changing epidemiology of sars-cov-2, Science, 375 (2022), 1116–1121, URL https://www.science.org/doi/abs/10.1126/science.abm4915.
  • [25] T. Kypraios, P. Neal and D. Prangle, A tutorial introduction to bayesian inference for stochastic epidemic models using approximate bayesian computation, Mathematical biosciences, 287 (2017), 42–53.
  • [26] O. A. Ladyženskaja, V. A. Solonnikov and N. N. Ural’ceva, Linear and quasilinear equations of parabolic type, Translations of Mathematical Monographs, Vol. 23, American Mathematical Society, Providence, R.I., 1968, Translated from the Russian by S. Smith.
  • [27] M. O’Driscoll, G. Ribeiro Dos Santos, L. Wang, D. A. T. Cummings, A. S. Azman, J. Paireau, A. Fontanet, S. Cauchemez and H. Salje, Age-specific mortality and immunity patterns of SARS-CoV-2, Nature, 590 (2021), 140–145.
  • [28] O. D. of Health, Ohio Department of Health COVID Dashboard, URL {https://coronavirus.ohio.gov/wps/portal/gov/covid-19/dashboards/overview}, Accessed: October 29, 2021.
  • [29] L. L. Schumaker, Spline functions: Computational Methods, Society for Industrial and Applied Mathematics, Philadelphia, PA, 2015, doi:10.1137/1.9781611973907.
  • [30] N. Sherborne, J. C. Miller, K. B. Blyuss and I. Z. Kiss, Mean-field models for non-markovian epidemics on networks, Journal of Mathematical Biology, 76 (2018), 755–778.
  • [31] S. A. Sisson, Y. Fan and M. A. Beaumont, Handbook of Approximate Bayesian Computation, CRC Press, Boca Raton, FL, 2020.
  • [32] I. Somekh, W. R. KhudaBukhsh, E. D. Root, G. A. Rempała, E. Simões and E. Somekh, Quantifying the Population-Level Effect of the COVID-19 Mass Vaccination Campaign in Israel: A Modeling Study, Open Forum Infectious Diseases, 9 (2022).
  • [33] V. C. Tran, Large population limit and time behaviour of a stochastic particle model describing an age-structured population, ESAIM. Probability and Statistics, 12 (2008), 345–386.
  • [34] N. van Kampen, Remarks on Non-Markov Processes, Brazilian Journal of Physics, 28 (1998).
  • [35] H. Vossler, P. Akilimali, Y. Pan, W. R. KhudaBukhsh, E. Kenah and G. A. Rempała, Analysis of Individual-level Epidemic Data: Study of 2018-2020 Ebola Outbreak in Democratic Republic of the Congo, Scientific Reports, 12 (2022).
  • [36] M. Wascher, P. M. Schnell, W. R. KhudaBukhsh, M. Quam, J. H. Tien and G. A. Rempała, Monitoring sars-cov-2 transmission and prevalence in populations under repeated testing, 2021, URL https://www.medrxiv.org/content/10.1101/2021.06.22.21259342v1.

Appendix B Appendix

B.1 Analysis of the coupled flow system (1) and (9)

In this section, we begin by assuming the equation coefficients α\alpha, β\beta, γ\gamma, and λ\lambda are continuous and that α\alpha, β\beta, and λ\lambda also have directional derivatives along characteristics which are continuous as well. We then also assume that the initial data fSf_{S} and fIf_{I} are continuous, compactly supported, and satisfying the compatibility conditions (7). As observed in Remark 2 below, we will also come to require that coefficients are bounded.

B.1.1 Structure conditions

First we truncate the domains of the integrals in (1) to span their respective RR’s which we now index from 1 to 3. These RR’s share a common t-axis but have different s-axes respectively spanning [0,Li]\left[0,L_{i}\right]. Thus, we initially solve a separate problem than (1) where instead of integrals over [0,)\left[0,\infty\right) we have finite domains. However, because of the compactness of our initial data, this problem will be equivalent to the original formulation a posteriori. Letting y=(yS,yV,yI)y=\left(y_{S},y_{V},y_{I}\right) and 𝒞(R)\mathcal{C}(R) stand for the set of continuous functions on the domain RR, we now rewrite (1) in the form

(t+s)yS\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{S} =Λ1[y],\displaystyle=-\Lambda_{1}\left[y\right], (10)
(t+s)yV\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{V} =Λ2[y],\displaystyle=-\Lambda_{2}\left[y\right],
(t+s)yI\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{I} =Λ3[y],\displaystyle=-\Lambda_{3}\left[y\right],

for functionals Λi:j=13𝒞(Rj)𝒞(Ri)\Lambda_{i}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\mathcal{C}\left(R_{i}\right). Moreover, for positive quantities C1C_{1} and 1\mathcal{L}_{1}, the Λi\Lambda_{i} satisfy boundedness and Lipschitz conditions

Λi[y]\displaystyle\left\|\Lambda_{i}\left[y\right]\right\|_{\infty} C1(λ,α,γ,LI,y),\displaystyle\leq C_{1}\left(\left\|\lambda\right\|_{\infty},\left\|\alpha\right\|_{\infty},\left\|\gamma\right\|_{\infty},L_{I},\left\|y\right\|_{\infty}\right), (11)
Λi[y1]Λi[y2]\displaystyle\left\|\Lambda_{i}\left[y_{1}\right]-\Lambda_{i}\left[y_{2}\right]\right\|_{\infty} 1(α,β,LI,y1,y2)y1y2.\displaystyle\leq\mathcal{L}_{1}\left(\left\|\alpha\right\|_{\infty},\left\|\beta\right\|_{\infty},L_{I},\left\|y_{1}\right\|_{\infty},\left\|y_{2}\right\|_{\infty}\right)\left\|y_{1}-y_{2}\right\|_{\infty}. (12)

The boundary flow kernels in (9) are more elaborate but they satisfy similar structure conditions. The key point is that the flow equations are algebraic combinations of functionals which themselves satisfy boundedness and Lipschitz conditions. We may rewrite (9) as

tyV(t,0)=Γ2[y],\displaystyle\partial_{t}y_{V}(t,0)=\Gamma_{2}\left[y\right], (13)
tyI(t,0)=Γ3[y],\displaystyle\partial_{t}y_{I}(t,0)=\Gamma_{3}\left[y\right],

for functionals Γi:j=13𝒞(Rj)𝒞([0,T])\Gamma_{i}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\mathcal{C}\left(\left[0,T\right]\right). Their boundedness and Lipschitz conditions are

Γi[y]\displaystyle\left\|\Gamma_{i}\left[y\right]\right\|_{\infty} C2(Dχλ,Dχα,Dχβ,γ,LS,LV,LI,y),\displaystyle\leq C_{2}\left(\left\|D_{\chi}\lambda\right\|_{\infty},\left\|D_{\chi}\alpha\right\|_{\infty},\left\|D_{\chi}\beta\right\|_{\infty},\left\|\gamma\right\|_{\infty},L_{S},L_{V},L_{I},\left\|y\right\|_{\infty}\right), (14)
Γi[y1]Γi[y2]\displaystyle\left\|\Gamma_{i}\left[y_{1}\right]-\Gamma_{i}\left[y_{2}\right]\right\|_{\infty} 2(Dχλ,Dχα,Dχβ,γ,LS,LV,LI,y1,y2)y1y2.\displaystyle\leq\mathcal{L}_{2}\left(\left\|D_{\chi}\lambda\right\|_{\infty},\left\|D_{\chi}\alpha\right\|_{\infty},\left\|D_{\chi}\beta\right\|_{\infty},\left\|\gamma\right\|_{\infty},L_{S},L_{V},L_{I},\left\|y_{1}\right\|_{\infty},\left\|y_{2}\right\|_{\infty}\right)\left\|y_{1}-y_{2}\right\|_{\infty}. (15)

Above we let Dχf\left\|D_{\chi}f\right\|_{\infty} stand for the max supremum norm of both ff and (t+s)f\left(\partial_{t}+\partial_{s}\right)f.

Remark 1.

Without loss of generality, we may assume C1C_{1}, C2C_{2}, 1\mathcal{L}_{1}, and 2\mathcal{L}_{2} are increasing functions of their arguments.

Remark 2.

The argument dependence of the Lipschitz constants in (11)-(12) and (14)-(15) on norms of the equation coefficients quantifies boundedness assumptions needed on equation coefficients for the Lipschitz assumptions to hold.

Remark 3.

Hereafter, we will abbreviate a constant’s dependence on equation coefficients or geometry as data. So C1=C1(data,y)C_{1}=C_{1}(\mathrm{data},\left\|y\right\|_{\infty}). Similar holds for C2C_{2}, 1\mathcal{L}_{1}, and 2\mathcal{L}_{2}.

B.1.2 A fixed point characterization of (10) and (13)

Mimicking the standard approach in ODEs, we now recast the solution of (10) and (13) as a fixed point for an integral equation. To that end, let τ=min(s,t)\tau^{*}=\min(s,t) and define
T:j=13𝒞(Rj)j=13𝒞(Rj)\mathcal{F}_{T}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right) by

T[y](t,s)={fS(sτ)+0τΛ1[y](tτ+χ,sτ+χ)𝑑χ0(tτ)Γ2[y](θ)𝑑θ+0τΛ2[y](tτ+χ,sτ+χ)𝑑χfI(sτ)+0(tτ)Γ3[y](θ)𝑑θ+0τΛ3[y](tτ+χ,sτ+χ)𝑑χ.\mathcal{F}_{T}\left[y\right]\left(t,s\right)=\begin{cases}f_{S}(s-\tau^{*})+\int^{\tau^{*}}_{0}\Lambda_{1}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\\ \int^{\left(t-\tau^{*}\right)}_{0}\Gamma_{2}\left[y\right]\left(\theta\right)d\theta+\int^{\tau^{*}}_{0}\Lambda_{2}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\\ f_{I}(s-\tau^{*})+\int^{\left(t-\tau^{*}\right)}_{0}\Gamma_{3}\left[y\right]\left(\theta\right)d\theta+\int^{\tau^{*}}_{0}\Lambda_{3}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\end{cases}. (16)

Then a fixed point of T\mathcal{F}_{T} is a classical solution of (1) and (9) with the initial conditions matching the prescribed explicit boundary data.

Lemma 1.

Let h:j=13𝒞(R¯j)j=13𝒞(R¯j)\mathcal{F}_{h}:\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right)\rightarrow\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right) where R¯i\bar{R}_{i} has the same s-axis as RiR_{i} but in the t-axis only spans [0,h]\left[0,h\right]. Then M,c,h\exists M,c,h all strictly positive and c<1c<1 which depend only on the data so that

  1. 1.

    The restriction h:B¯M(0)B¯M(0)\mathcal{F}_{h}:\bar{B}_{M}(0)\rightarrow\bar{B}_{M}(0) and so is bounded on a Banach space,

  2. 2.

    For y1,y2B¯M(0)y_{1},y_{2}\in\bar{B}_{M}(0), h(y1)h(y2)cy1y2\left\|\mathcal{F}_{h}(y_{1})-\mathcal{F}_{h}(y_{2})\right\|_{\infty}\leq c\left\|y_{1}-y_{2}\right\|_{\infty}, and so h\mathcal{F}_{h} is a contraction mapping.

Proof: For the first part, begin by picking an MM larger than twice the supremum norm of fSf_{S} and fIf_{I} which we label f\left\|f\right\|_{\infty} for brevity. From the sup bounds (11) and (14), we see that on B¯M(0)\bar{B}_{M}(0) the vector functionals Λ,Γ\Lambda,\Gamma are bounded in terms of this constant M. We may now pick hh so that hmax(Λ,Γ)<f/3h\max(\left\|\Lambda\right\|_{\infty},\left\|\Gamma\right\|_{\infty})<\left\|f\right\|_{\infty}/3. Now examining the sums in (16), we see these can be no more than 53f<M\frac{5}{3}\left\|f\right\|_{\infty}<M owing to the integral domains having length O(h)O(h).

For the second part, we may now use that y1,y2y_{1},y_{2} are supremum bound by MM to reduce the dependence of the Lipschitz constants in (12) and (15) to just the data. But now we conclude in the usual manner by seeing that the integral domains in (16) have length O(h)O(h) and so, by further reduction in hh, we make the product of hh with those Lipschitz constants suitably underneath a threshold c<1c<1.  

Corollary 1.

For initial data (fS,0,fI)\left(f_{S},0,f_{I}\right) and an MM strictly larger than the supremum norm of the ff’s, then over a time span h=h(data,M)h=h(\mathrm{data},M), the coupled flow equations (1) and (9) have a unique bounded solution in the space j=13𝒞(R¯j)\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right). This solution may be further taken to be bounded by M over the time span hh.

Proof: This follows from the last lemma and the uniqueness of fixed points for contraction mappings in Banach space.  

Remark 4.

When instead the initial data has a nonzero density fVf_{V} for the vaccinated compartment, the same arguments and conclusions hold just with MM now also larger than that component’s supremum norm.

Remark 5.

Since the size of hh is in a monotone decreasing relationship with M but otherwise depends on data which are fixed, we see that so long as an a priori global bound can be derived for solutions of (1) and (9), then they will persist globally and be unique. This follows by the usual iteration of extending the solution by patches [kh,(k+1)h]\left[kh,(k+1)h\right] along the t-axis.

B.1.3 On the equivalence of (1) and (9) with the original formulation (1) and (3)-(5)

It remains to show that fixed points of (16) coincide at the t-axis with their intended implicit boundary values (3)-(5). The technical concern is that while (16) is sufficient to imply a fixed point solution is differentiable along characteristics, it does not imply a priori that a solution is differentiable along either the t-direction or s-direction individually. Recall in motivating (9), we treated the s\partial_{s} and t\partial_{t} derivatives separately. Nevertheless, we show here fixed point solutions do take the intended implicit boundary data. For further examples on the uses of difference quotients and summation by parts in the theory of PDEs, we refer to standard references such as [26].

For the fixed point solution yy of (16), Let us notate ϕV(t)=0LSλ(t,s)yS(t,s)𝑑s\phi_{V}(t)=\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t,s)ds. We aim to show ϕV(t)=yV(t,0)\phi_{V}(t)=y_{V}(t,0). We will also notate Δvf(t,s)=f(t+v1,s+v1)\Delta_{v}f(t,s)=f(t+v_{1},s+v_{1}), which is the finite difference of the function ff along the vector vv. We may now compute

Δhe1ϕV(t)\displaystyle\Delta_{he_{1}}\phi_{V}(t) =0LS(Δhe1λ(t,s)yS(t+h,s)+λ(t,s)Δhe1yS(t,s))𝑑s\displaystyle=\int^{L_{S}}_{0}\bigg{(}\Delta_{he_{1}}\lambda(t,s)y_{S}(t+h,s)+\lambda(t,s)\Delta_{he_{1}}y_{S}(t,s)\bigg{)}ds
=0LS(Δhe1λ(t,s)yS(t+h,s)+λ(t,s)Δhe1+he2yS(t,s)λ(t,s)Δhe2yS(t+h,s))𝑑s.\displaystyle=\int^{L_{S}}_{0}\bigg{(}\Delta_{he_{1}}\lambda(t,s)y_{S}(t+h,s)+\lambda(t,s)\Delta_{he_{1}+he_{2}}y_{S}(t,s)-\lambda(t,s)\Delta_{he_{2}}y_{S}(t+h,s)\bigg{)}ds.

We next apply summation by parts to the last term:

0LSλ(t,s)Δhe2yS(t+h,s)𝑑s\displaystyle-\int^{L_{S}}_{0}\lambda(t,s)\Delta_{he_{2}}y_{S}(t+h,s)ds =0LSλ(t,s)yS(t+h,s+h)𝑑s+0LSλ(t,s)yS(t+h,s)𝑑s\displaystyle=-\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s+h)ds+\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s)ds
=hLSλ(t,sh)yS(t+h,s)𝑑s+0LSλ(t,s)yS(t+h,s)𝑑s\displaystyle=-\int^{L_{S}}_{h}\lambda(t,s-h)y_{S}(t+h,s)ds+\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s)ds
=hLSΔhe2λ(t,s)yS(t+h,s)𝑑s+0hλ(t,s)yS(t+h,s)𝑑s.\displaystyle=-\int^{L_{S}}_{h}\Delta_{-he_{2}}\lambda(t,s)y_{S}(t+h,s)ds+\int^{h}_{0}\lambda(t,s)y_{S}(t+h,s)ds.

If we now combine these equations, divide by hh, and then pass to the limit as h0h\rightarrow 0, we obtain from the regularity of all terms involved that

tϕV(t)\displaystyle\partial_{t}\phi_{V}(t) =0LS(yS(t,s)(t+s)λ(t,s)+λ(t,s)(t+s)yS)𝑑s+λ(t,0)yS(t,0)\displaystyle=\int^{L_{S}}_{0}\bigg{(}y_{S}(t,s)\left(\partial_{t}+\partial_{s}\right)\lambda(t,s)+\lambda(t,s)\left(\partial_{t}+\partial_{s}\right)y_{S}\bigg{)}ds+\lambda(t,0)y_{S}(t,0)
=0LS((t+s)λ(t,s)yS(t,s)+λ(t,s)(λ(t,s)0LIβ(u)yI(t,u)𝑑u)yS(t,s))𝑑s.\displaystyle=\int^{L_{S}}_{0}\bigg{(}\left(\partial_{t}+\partial_{s}\right)\lambda(t,s)y_{S}(t,s)+\lambda(t,s)\left(-\lambda(t,s)-\int^{L_{I}}_{0}\beta(u)y_{I}(t,u)du\right)y_{S}(t,s)\bigg{)}ds.

Notice in the last equality that we have used directional derivatives of the solution along characteristics are classical and given by (1). Also, we used that yS(t,0)=0y_{S}(t,0)=0 by (3). Upon inspection, we see this is exactly the flow equation for the boundary data of yVy_{V} in (9). It now follows from the uniqueness of solutions for ODEs that ϕV(t)\phi_{V}(t) and yV(t,0)y_{V}(t,0) must coincide. These arguments may be repeated for the yIy_{I} solution to show its intended implicit boundary data also evolves according to the flow of (9).

B.2 Demonstrating numerical convergence of the implicit boundary conditions

In Fig 6 we demonstrate the PDE numerical solution better satisfying the implicit boundary conditions as the mesh resolution and ode integrator tolerances become increasingly fine. For the figure legend, we note that nnd is the number of mesh nodes along the s-axis while atol and rtol are respectively the adaptive Euler schemes absolute and relative tolerances for refining the step.

Refer to caption
Figure 6: Convergence demonstrations of the PDE DSA solver at the implicit boundary axis as the mesh resolution and ode solver tolerances become increasingly fine when using the best fit parameters obtained by ABC in Fig 4 and Table 1 of the main body. (Top) The absolute error, scaled by the domain length, between the solution’s value evaluated at the boundary where implicit data is prescribed and the implicit integral quantity which it is supposed to equal. These are respectively the left and right hand sides of (4)-(5). The left panel is for yVy_{V}, and the right panel is for yIy_{I}. (Bottom) Similar as the above panels but for the relative error as measured by the magnitude error divided by the solution value at that boundary point.

Recall that, since the magnitude of a probability density varies inversely with the length of its domain, the absolute errors shown are for the solution density multiplied by the size of its domain. In this sense, the absolute error reported is for the solution density if it were first rescaled to a density on the unit interval.

We observe a slight early increase in the relative error of yVy_{V}’s implicit boundary data as resolutions become increasingly fine. We expect this is due to the true magnitude of yVy_{V} becoming increasingly small, which thus leads to division by a smaller number.

B.3 Parameter posteriors

Refer to caption
Figure 7: Posterior distributions for ABC estimated model parameters. The best 10% of all 5000 ABC samples were kept. The parameter descriptions are given in Table 1 of the main body; however, we give the means, μ\mu, and standard deviations, σ\sigma, for the contact interval and infectious period rather than their shape and scale parameters. This amounts to an equivalent parameterization of the two-parameter Weibull distributions.