Likelihood-Free Dynamical Survival Analysis Applied to the COVID-19 Epidemic in Ohio

Colin Klaus

The Ohio State University
Matthew Wascher

University of Dayton
Wasiur R. KhudaBukhsh

The University of Nottingham
Grzegorz A. Rempała

The Ohio State University

Abstract

The Dynamical Survival Analysis (DSA) is a framework for modeling epidemics based on mean field dynamics applied to individual (agent) level history of infection and recovery. Recently, DSA has been shown to be an effective tool in analyzing complex non-Markovian epidemic processes that are otherwise difficult to handle using standard methods. One of the advantages of DSA is its representation of typical epidemic data in a simple although not explicit form that involves solutions of certain differential equations. In this work we describe how a complex non-Markovian DSA model may be applied to a specific data set with the help of appropriate numerical and statistical schemes. The ideas are illustrated with a data example of the COVID-19 epidemic in Ohio.

1 Introduction

As of July, 2022, the coronavirus disease 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has taken more than six million lives worldwide. In response to the pandemic, scientists from all disciplines have made a concerted effort to address the ever growing analytic challenges of prediction, intervention, and control. The mathematical tools employed range from the purely deterministic Ordinary Differential Equation (ODE)/ Partial Differential Equation (PDE) models to describe population dynamics (at the macro or ecological scale) to fully stochastic agent-based models (at the micro scale); from physics-inspired mechanistic models, both stochastic and deterministic, to purely statistical approaches such as ensemble models. However, despite the longstanding history of the discipline of mathematical epidemiology and the enormous recent efforts, the pandemic has laid bare crucial gaps in the state-of-the-art methodology.

While the macro models are simple to interpret and easy to calibrate, the micro agent-based models provide more flexibility to model elaborate what-if scenarios. In a similar vein, the mechanistic models provide insights into the underlying biology and epidemiology of the disease but are often outperformed by purely statistical methods when it comes to accuracy in prediction and forecasting. While there is no obvious way to completely bypass such trade-offs between these often diametrically opposed modeling approaches, the Dynamical Survival Analysis (DSA) method [5, 19, 6, 36, 18, 35, 21, 2, 32], a survival analytic statistical method derived from dynamical systems, holds some promise. The present paper is about a likelihood-free means of performing DSA in the context of non-Markovian models of infectious disease epidemiology. More specifically, we develop a DSA method for a non-Markovian epidemic model with vaccination based on the Approximate Bayesian Computation (ABC) framework.

Why non-Markov models? The standard compartmental Markovian models, which employ Continuous Time Markov Chains to keep track of counts of individuals in different compartments (e.g., individuals with different immunological statuses), assume the infectious period and the contact interval [17, 6] are exponentially distributed and are thus characterized by a constant hazard function. This simplistic assumption almost always misrepresents the underlying biology of the disease and hence, is untenable. See [6, 34] for a detailed discussion on this point. Also, see [17, Table 1 and Figure 1] for a numerical illustration of the bias in the estimates of model parameters if a Markovian model is wrongly assumed when the underlying model is non-Markovian.

A popular and analytically convenient approach to building more realistic non-Markovian models is to make use of the general theory of measure-valued processes that keep track of not only the population counts but also additional covariates such as individual’s age. Here the word “age” is used as an umbrella term to refer to the physical age of the individual, the age of infection, time since vaccination etc. While the measure-valued processes do require more mathematical sophistication than their CTMC counterparts, as we discuss in Section 2, the age-stratified population densities can be described by a comparatively simple system of PDEs in the limit of a large population [6, 13, 15, 30]. The crux of the DSA method is to interpret this mean-field limiting system of PDEs as describing probability distributions of transfer times (the time required to move from one compartment to another). We shall make this notion clear in Section 2.2 and discuss the statistical benefits of the DSA method in Section 6.

As an illustration of the ABC-based DSA method, we apply it to the COVID-19 epidemic in the state of Ohio, USA. The method is shown to fit the real case count data well and capture nontrivial trends. Detailed numerical results are provided in Section 5. In addition to the introduction of ABC-DSA methodology, our second contribution in this paper is a solution method for the mean-field limiting system of PDEs with nonlocal boundary conditions. For the sake of algorithmic implementation, we also present it in the form of a pseudocode. An implementation in the Julia programming language is made available for the wider community.

The rest of the paper is structured as follows: The stochastic non-Markovian epidemic model is described in Section 2. The mean-field limit in the form of a system of PDEs and the DSA methodology are also described in Section 2. In Section 3, we describe the main technical details of how we solve the limiting mean-field PDE system. We describe the statistical approach to parameter inference in Section 4, followed by numerical results in Section 5. Finally, we conclude with a brief discussion in Section 6. For the sake of completeness, additional mathematical details, numerical results, and figures are provided in the Appendix.

2 Non-markovian mass-action model

Our stochastic model is adapted from [6]. As shown in Figure 1, the model has four compartments: Susceptible, Vaccinated, Infectious, and Removed. Individuals are in exactly one of the four compartments. Upon vaccination, susceptible individuals move to the V compartment. We assume only susceptible individuals are vaccinated. Both (unvaccinated) susceptible and vaccinated individuals can get infected, in which case they move to the I compartment. Finally, infected individuals either recover or are removed. In either case, they move to the terminal compartment R. In addition to the counts of individuals in the four compartments, we keep track of the age distribution. Here, the word “age” refers to the physical age for the susceptible individuals, time since vaccination for the vaccinated individuals, time since infection or age of infection for the infected individuals, and finally, time since recovery or removal for the removed individuals. The age of the removed individuals are of no interest because the removed individuals do not contribute to the dynamics. It is possible to include other important covariates, such as sex, comorbidity, in the model. However, for the sake of simplicity, we only keep track of age. Suppose we have $n$ individuals.

Figure 1: Diagrammatic representation of the non-Markovian compartmental model. At time

t

, a susceptible individual with age

s

moves either to a vaccinated compartment with instantaneous rate

\lambda(t,s)

, or to the infectious compartment with instantaneous rate

\Upsilon(t)\coloneqq\int_{0}^{\infty}\beta(s)y_{I}(t,s)\mathrm{d}s

. The vaccinated individuals are also subject to infection. An infected individual at age

s

can be removed with instantaneous rate

\gamma(s)

. The vaccine efficacy

\alpha

is assumed age-dependent.

The age distribution of individuals in different compartments is described in terms of finite, point measure-valued processes whose atoms are individual ages. See [6]. Such an approach has been previously used to model age-stratified Birth-Death (BD) processes [33, 9], spatially stratified populations [12], delays in Chemical Reaction Networks [20]. The instantaneous rates of jump are assumed to depend on the individual ages. In particular, a susceptible individual of age $s$ has an instantaneous rate $\lambda(t,s)$ of getting vaccinated at time $t$ . Moreover, a susceptible individual is also subject to an infection pressure exerted by the infectious individuals. We denote the hazard function for the probability distribution of the contact interval [6, 17] by $\beta$ . Therefore, an infectious individual of age of infection $s$ makes an infectious contact with a susceptible or vaccinated individual at an instantaneous rate $\beta(s)$ . The hazard function of the probability law of the infectious period is denoted by $\gamma$ . The vaccine efficacy is also assumed to depend on the age of the vaccinated individual (time since vaccination). At age $s$ , a vaccinated individual moves directly to the R compartment at rate unity with probability $\alpha(s)$ , while with probability $(1-\alpha(s))$ , they are subject to the same infection pressure as the susceptible individuals and can get infected at the same instantaneous rate, which is the population sum total, scaled by $1/n$ , of the $\beta$ ’s evaluated at the individual ages of infection.

In the limit of a large population ( $n\to\infty$ ) and under suitable constraints on the hazard functions $\beta,\gamma,\lambda$ , the measure-valued processes, when appropriately scaled by $n^{-1}$ , can be shown to converge to deterministic continuous measure-valued functions [6, 33, 12, 9, 20]. The densities (with respect to the Lebesgue measure) of those limiting measure-valued functions can be then described in terms of a system of PDEs. Let us define

•

$y_{S}(t,s)$ : The density of susceptible individuals with age $s$ at time $t$ ;
•

$y_{V}(t,s)$ : The density of vaccinated individuals with age (of vaccination) or time since vaccination $s$ at time $t$ ;
•

$y_{I}(t,s)$ : The density of infectious individuals with age (of infection) or time since infection $s$ at time $t$ ;
•

$y_{R}(t)$ : The proportion of removed individuals.

These quantities are taken over rectangular domains that share a $t$ -axis but, in general, may have different length $s$ -axes. In particular, the quantities $y_{S}$ and $\lambda$ are defined over a common domain $R_{S}$ . The quantities $y_{V}$ and $\alpha$ are defined over a common domain $R_{V}$ , and the quantities $y_{I}$ , $\gamma$ , and $\beta$ are defined over a common domain $R_{I}$ . When these distinctions are not needed, we subsume these s-axes into the common interval $\left[0,\infty\right)$ .

Analogous to [6], the limiting system can be described as

$\displaystyle(\partial_{t}+\partial_{s})\,y_{S}(t,s)$	$\displaystyle{}=-\left(\lambda(t,s)+\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\right)y_{S}(t,s),$	(1)
$\displaystyle(\partial_{t}+\partial_{s})\,y_{V}(t,s)$	$\displaystyle{}=-\left(\alpha(s)+(1-\alpha(s))\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\right)y_{V}(t,s),$
$\displaystyle(\partial_{t}+\partial_{s})\,y_{I}(t,s)$	$\displaystyle{}=-\gamma(s)y_{I}(t,s)$
$\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}y_{R}(t)$	$\displaystyle{}=\int_{0}^{\infty}\left(\alpha(s)y_{V}(t,s)+\gamma(s)y_{I}(t,s)\right)\mathrm{d}s,$	(2)

with initial and boundary conditions

$\displaystyle y_{S}(t,0)$	$\displaystyle{}=0\mbox{, for all }t\geq 0,$	(3)
$\displaystyle y_{S}(0,s)$	$\displaystyle{}=f_{S}(s),$
$\displaystyle y_{V}(t,0)$	$\displaystyle{}=\int_{0}^{\infty}\lambda(t,s)y_{S}(t,s)\mathrm{d}s$	(4)
$\displaystyle y_{V}(0,s)$	$\displaystyle{}=0\mbox{, for all }s\geq 0,$
$\displaystyle y_{I}(t,0)$	$\displaystyle{}=\int_{0}^{\infty}y_{S}(t,s)\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\mathrm{d}s+\int_{0}^{\infty}(1-\alpha(s))y_{V}(t,s)\int_{0}^{\infty}\beta(u)y_{I}(t,u)\mathrm{d}u\mathrm{d}s,$	(5)
$\displaystyle y_{I}(0,s)$	$\displaystyle{}=\rho f_{I}(s),,$
$\displaystyle y_{R}(0)$	$\displaystyle{}=0.$	(6)

We assume there are no vaccinated or removed individuals initially. The nonnegative functions $f_{S}$ and $f_{I}$ describe the initial age distributions of the susceptible and the infected individuals, and satisfy

\displaystyle\int_{0}^{\infty}f_{S}(s)\mathrm{d}s=1,\quad\int_{0}^{\infty}f_{I}(s)\mathrm{d}s=1.

The parameter $\rho$ is the initial proportion of infected individuals in the population. It is worthwhile to point out that the system is mass conserved, i.e.,

\displaystyle\int_{0}^{\infty}\left(y_{S}(t,s)+y_{V}(t,s)+y_{I}(t,s)\right)\mathrm{d}s=1+\rho-y_{R}(t),\quad\text{ for all }t\geq 0.

A consequence of the above conservation law is that the equation (2) is in fact redundant.

2.1 Continuity constraints on boundary conditions

When we seek solutions of (1)-(5) in the space of continuous functions, the explicit and implicit boundary data must be assigned continuously at the origin. This criterion is not satisfied freely but imposes additional constraints on the equation coefficients and the initial data. We obtain these constraints by equating the expressions for the explicit and implicit data at the origin and derive

$\displaystyle f_{S}(0)$	$\displaystyle=0,$	(7)
$\displaystyle\lambda(0,s)$	$\displaystyle=0,\text{ for all }s\geq 0,$
$\displaystyle f_{I}(0)$	$\displaystyle=\int^{\infty}_{0}\beta(u)f_{I}(u)\mathrm{d}u.$

The last equality generally reduces the combined degrees of freedom for $\beta$ and $f_{I}$ by one. We note also that the above constraint on $\lambda$ could be relaxed to holding over the support of $f_{S}$ only.

2.2 DSA perspective of the mean-field PDEs

Before describing how we solve the limiting mean-field PDE system in the next section, let us briefly discuss how DSA interprets the limiting PDEs as probabilistic quantities.

The DSA method [19, 6, 36, 18, 35, 21, 2, 32] combines classical dynamical systems theory and survival analysis to interpret the mean-field limits of scaled population counts or densities as characterizing probability laws of transfer times between compartments. In the Markovian model, this interpretation boils down to treating the mean-field ODEs as satisfying a time inhomogeneous Chapman–Kolmogorov equation for the marginal probability law of a Markov chain on the state space $\{\textbf{S},\textbf{V},\textbf{I},\textbf{R}\}$ describing the time-evolving status of a single individual embedded in an infinitely large population. See [6, Section 3.4] for the standard Susceptible-Infected-Recovered (SIR) model example. In the non-Markovian case, we construct a Markov process on the state space $\{\textbf{S},\textbf{V},\textbf{I},\textbf{R}\}\times[0,\infty)$ to keep track of the time-evolving status as well as the age information of an individual embedded in an infinitely large population. As such, DSA interprets the mean-field limiting PDEs as describing the transition kernel for the Markov process. The transition kernel could be used to simulate individual trajectories.

Note that the individual-based Markov process (or chain in the Markovian case) is entirely characterized by the mean-field limiting PDEs (or ODEs in the Markovian case) describing population-level densities (or counts). It is precisely in this sense that DSA turns an ecological model into an agent-based model! Moreover, this agent-based description gives us the following probability measures

\displaystyle\mu_{S}(t,A)=\frac{\int_{A}y_{S}(t,s)\mathrm{d}s}{1+\rho},\quad\mu_{V}(t,A)=\frac{\int_{A}y_{V}(t,s)\mathrm{d}s}{1+\rho},\quad\mu_{I}(t,A)=\frac{\int_{A}y_{I}(t,s)\mathrm{d}s}{1+\rho},

(8)

for Borel subsets $A$ of $[0,\infty)$ . Here, the quantity $\mu_{S}(t,A)$ describes the probability of a randomly chosen individual with age in the set $A\subset[0,\infty)$ to be in the S compartment at time $t$ . The other probability measures are interpreted in a similar fashion. Based on a random sample of infection, vaccination and/or recovery times, which are allowed to be censored, truncated or even aggregated over time or individuals, the above probability measures (and the transition kernel) can be used to write an individual-based product-form likelihood function, called the DSA-likelihood. See [6, Section 3.3] for an explicit example of the DSA-likelihood. The DSA-likelihood can be used for likelihood-based approaches to parameter inference, such as Maximum Likelihood Estimate (MLE), or Bayesian methods employing Markov Chain Monte Carlo (MCMC) techniques. In this paper, we focus on an alternative likelihood-free approach based on the ABC method, which we describe in Section 5. In the next section, we describe how we solve the mean-field limiting PDE system.

3 Solving the limiting PDE system

For the remainder of the paper, we will make a simplifying assumption that the initial data $f_{S}$ and $f_{I}$ are compactly supported. This assumption holds in almost all practical cases of interest. For example individuals are of bounded age, and infection lasts for at most a bounded length of time. Moreover, by a suitable enlargement of the rectangular domains of $y_{S}$ , $y_{V}$ , and $y_{I}$ , respectively $R_{S}$ , $R_{V}$ , and $R_{I}$ , we may also assume without loss of generality that the solutions stay compactly supported in their domains for the time horizon simulated. This follows from the method of characteristics used in Section 3.1 and the forcing terms on the right-hand side of (1) which determine exponential solutions. For concreteness, let us write $R_{S}=\left[0,T\right]\times\left[0,L_{S}\right]$ , $R_{V}=\left[0,T\right]\times\left[0,L_{V}\right]$ , and $R_{I}=\left[0,T\right]\times\left[0,L_{I}\right]$ .

3.1 Solution method

The governing equations, (1)-(5), constitute a quasi-linear PDE system of nonlocal conservation laws. Since the associated directions of their derivative operators are constant and do not intersect, the solutions into the domain interior are naturally suited for construction by the method of characteristics, which recasts the first order PDEs into equivalent flow equations. For more details on this construction, we refer readers to standard references, such as [8, Ch 3] and [7, Ch 7].

We note that the implicit and nonlocal boundary conditions, (4)-(5), are not standard. In order to handle the boundary conditions, we exploit a special feature of this system that remarkably depends on the nonlocal character of the implicit boundary terms. Specifically, we differentiate the implicit boundary conditions in $t$ and recover a governing flow for them, which only depends on the solution and not on the solution’s derivatives. For example, consider differentiating (4):

	$\displaystyle\partial_{t}y_{V}(t,0)$	$\displaystyle=\int^{L_{S}}_{0}\left(\partial_{t}\lambda y_{S}+\lambda\partial_{t}y_{S}\right)ds=\int^{L_{S}}_{0}\left(\partial_{t}\lambda y_{S}+\lambda\left(-\partial_{s}y_{S}-\lambda y_{S}-y_{S}\int^{L_{I}}_{0}\beta(u)y_{I}(t,u)du\right)\right)ds$
		$\displaystyle=\left.-\left[\lambda y_{S}\right](t,s)\big{\|}^{L_{S}}_{s=0}\right.+\int^{L_{S}}_{0}\left[\left(\partial_{t}\lambda+\partial_{s}\lambda\right)y_{S}\right](t,s)ds-\int^{L_{S}}_{0}\left[\lambda^{2}y_{S}\right](t,s)ds$
		$\displaystyle\;\;\;\;-\left(\int^{L_{S}}_{0}\left[\lambda y_{S}\right](t,s)ds\right)\left(\int^{L_{I}}_{0}\left[\beta y_{I}\right](t,s)ds\right).$

In the second equality, we have substituted for $\partial_{t}y_{S}$ using the $y_{S}$ equation of (1), and in the third equality, we performed an integration by parts. Continuing we may use the compactly supported initial data to conclude that the boundary term at $s=L_{S}$ is vanishing and substitute (3) into the boundary term at $s=0$ . For the particular case of $y_{V}$ , we see both these terms are vanishing. A similar argument works for $\partial_{t}y_{I}(t,0)$ . Below we present the resulting evolution equations for the solution’s boundary values. Of course, $y_{S}(t,0)\equiv 0$ by (3), so we need only consider the remaining two:

$\displaystyle\partial_{t}y_{V}(t,0)$	$\displaystyle=\int^{L_{S}}_{0}y_{S}(\partial_{t}+\partial_{s})\lambda ds-\int^{L_{S}}_{0}\lambda^{2}y_{S}ds-\left(\int^{L_{S}}_{0}\lambda y_{S}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right),$	(9)
$\displaystyle\partial_{t}y_{I}(t,0)$	$\displaystyle=\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)\left[-\int^{L_{S}}_{0}\lambda y_{S}ds-\left(\int^{L_{S}}_{0}y_{S}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)-\int^{L_{V}}_{0}y_{V}(\partial_{t}+\partial_{s})\alpha ds\right.$
	$\displaystyle\left.\;\;\;\;\;\;+\left[\big{(}1-\alpha(t,0)\big{)}y_{V}(t,0)\right]-\int^{L_{V}}_{0}\left(1-\alpha\right)\alpha y_{V}ds-\left(\int^{L_{V}}_{0}\left(1-\alpha\right)^{2}y_{V}ds\right)\left(\int^{L_{I}}_{0}\beta y_{I}ds\right)\right]$
	$\displaystyle\;\;+\left(\int^{L_{S}}_{0}y_{S}ds+\int^{L_{V}}_{0}(1-\alpha)y_{V}ds\right)\left(\int^{L_{I}}_{0}y_{I}(\partial_{t}+\partial_{s})\beta ds+\beta(0)y_{I}(t,0)-\int^{L_{I}}_{0}\beta\gamma y_{I}ds\right).$

Note that in (9), it is understood that all integrands are evaluated at $(t,s)$ and integrated in $s$ . In our particular application, we also have $\left(\partial_{t}+\partial_{s}\right)\alpha=\partial_{s}\alpha$ and similarly for the hazard function $\beta$ . We have chosen to present (9) in generality to highlight the underlying structure of (1)-(5). Note also, the validity of the equations (9) depends on the compact support of the initial data. Otherwise, these flow equations would need to contain additional boundary terms at the right-end points of the s-axes.

Our solution method of (1)-(5) now consists of coupling (1) in the interior domain together with the alternative flow formulation (9) of the implicit boundary conditions at the $\left[s=0\right]$ boundary. The explicit boundary data assigned at $\left[t=0\right]$ in (3)-(5) becomes this system’s initial data. For the benefit of readers mostly interested in applications, we defer a rigorous analysis of this alternative formulation and its equivalence with the original one to Appendix B.1. Also, in this investigation, we assume equation coefficients in classical function spaces and classical notions of solvability. Future work may investigate deeper questions of equivalence between the original and flow formulations when equation coefficients and the solution belong to suitably identified Sobolev spaces. For more information on the distinctions between classical solutions and weak solutions in Sobolev spaces, we refer again to [8, 7]. We present a numerical demonstration of this method approximating the implicit boundary conditions, (4)-(5), in Section B.2.

3.2 Numerical implementation

For numerically approximating (1) and (9), we take a semi-discrete approach where we discretize the space axis, $s$ , into a number of uniform intervals whose intersection points are called nodes. The number of nodes determines the mesh resolution. We then approximate each component of the true solution – $y_{S}$ , $y_{V}$ , and $y_{I}$ – by its own nodal basis expansion

y(t,s)\approx\sum^{n}_{i=1}y_{i}(t)\phi_{i}(s).

The $\left\{\phi_{i}(s)\right\}_{i}$ functions are piecewise linear, $C^{0}$ basis splines determined by a given mesh resolution. Each basis spline is associated with an $s$ -axis node, $s_{i}$ , and is defined by taking the value 1 at that node and 0 at all others. At each $t$ -level, we thus approximate the true solution with a linear, interpolant spline whose value at $(t,s_{i})$ is given by $y_{i}(t)$ . Integrals of the true solution and its derived quantities are approximated by taking integrals of their corresponding linear, interpolant splines by trapezoidal rule.

The $\left\{y_{i}(t)\right\}_{i}$ evolve according to the flows of (1) and (9). We initialize at the $t=0$ level by projecting the initial data onto the interpolant basis. This is accomplished by sampling those functions at the nodes. We then use an explicit Euler scheme to extend the solution up to the next time step, $t+\Delta t$ along the boundary axis where the implicit data is prescribed according to the ODEs of (9). This determines the first value $y_{1}(t+\Delta t)$ at the node $s=0$ . For the remaining nodal values, each of their nodes originate along a characteristic from either the previous $t$ -level or within that part of the boundary axis that was previously approximated in computing $y_{1}$ . We now evolve each of those originating, solution values by the method of characteristics along the ODEs that correspond to (1) by a second explicit Euler scheme to obtain the remaining $\left\{y_{i}(t+\Delta t)\right\}^{n}_{i=2}$ . By iterating this process, we construct a numerical approximation over the desired time interval for simulation.

In practice, we use an adaptive Euler step to improve solution accuracy. We specify tolerances atol and rtol, which are respectively the absolute and the relative tolerances set by the user. The solver approximates the solution using a single Euler step of size $\Delta t$ and two consecutive smaller Euler steps of size $\Delta t/2$ . The solver then checks if at each node the difference between the two Euler approximations satisfies either the atol or rtol thresholds. It does this similarly for the values of the nonlocal integrals. It accepts the step if all quantities meet this criterion, or else it halves the step size and repeats this process. Note that in evaluating the absolute error, we take the magnitude difference in the Euler solutions and then scale by the length of their $s$ -axes. This is done because, in general, $y_{S}$ , $y_{V}$ , and $y_{I}$ are defined on $s$ -axes of different lengths, while probability densities vary inversely with the length of their domain. By this adjustment, we normalize the errors in $y_{S}$ , $y_{V}$ , and $y_{I}$ to a common scale.

Below we also provide a pseudocode of the algorithm described in this section.

Pseudocode 1.

Numerical solver for (1) and (9)

1.

Initialize the solution at $t=0$ by storing values of the initial data at the nodes,
2.

Propagate from $y_{1}(t)$ to $y_{1}(t+\Delta t)$ by taking an explicit Euler step of size $\Delta t$ along (9),
3.

Propagate from $\left\{y_{i}(t)\right\}^{n}_{i=1}$ and $y_{1}(t+\Delta t)$ to $\left\{y_{i}(t+\Delta t)\right\}^{n}_{i=2}$ by taking Euler steps of size at most $\sqrt{2}\Delta t$ along the characteristics of (1),
4.

Repeat Steps 2-3 for two consecutive Euler steps with $\Delta t=\Delta t/2$ ,
5.

If the y-nodal values and integral quantities computed by the single and two-half Euler steps satisfy either of the absolute and relative error thresholds, atol and rtol, then accept the Step 4 solution at $t+\Delta t$ by storing its nodals and continue with $\Delta t=2\Delta t$ . Otherwise, repeat steps 2-4 with $\Delta t=\Delta t/2$ .

Code availability

We provide an implementation of the algorithms described in this paper in the Julia programming language along with jupyter notebooks and html at [22]. Additional information on Julia may be found at [3].

4 Model parameters

For the vaccine efficacy, $\alpha$ , we begin with a function that linearly increases from 0 to a maximum of $\alpha_{\mathrm{eff}}$ over a period of $\alpha_{L}$ days. After $\alpha_{L}$ , it was taken to be constant. For the contact interval and infectious period hazard functions, $\beta$ and $\gamma$ , we choose Weibull distributions as the Weibull distribution has been shown to be a flexible choice in modelling epidemics [6]. Therefore, we have

	$\displaystyle\alpha(s)$	$\displaystyle{}\coloneqq\frac{\min(s,\alpha_{L})}{\alpha_{L}}\times\alpha_{\mathrm{eff}},\quad s\geq 0,$
	$\displaystyle\beta(s)$	$\displaystyle{}\coloneqq\frac{\beta_{\alpha}}{\beta_{\theta}}\left(\frac{s}{\beta_{\theta}}\right)^{\beta_{\alpha}-1},\quad s\geq 0,$
	$\displaystyle\gamma(s)$	$\displaystyle{}\coloneqq\frac{\gamma_{\alpha}}{\gamma_{\theta}}\left(\frac{s}{\gamma_{\theta}}\right)^{\gamma_{\alpha}-1},\quad s\geq 0.$

Further, we smoothed $\alpha$ by a moving integral average to ensure its derivative was everywhere classical. We took the density for the initial infected population, $f_{I}$ , to be approximately uniform over a period of fourteen days subject to the further requirement that it should be compactly supported and continuous. For this purpose, $f_{I}$ was constant up to day 13.25 and then linearly decayed to 0 by day 13.75. As with $\alpha$ , we smoothed $f_{I}$ by an integral moving average.

We inferred the parameters in (1)-(5) using COVID-19 epidemic data for the US state of Ohio, [28], during the time period Nov 15, 2020- Jan 15, 2021. We directly estimated the hazard rate for vaccination for this same time period from the Ohio Department of Health (ODH) and Centers for Disease Control and Prevention (CDC) data, [28, 10], using b-splines. We also directly estimated the density for the initial susceptible population, $f_{S}$ , consistent with US census information on age distributions in Ohio, [4].

4.1 Empirical estimates

4.1.1 The distribution for the initial susceptible population, $f_{S}$

Figure 2 shows an empirical density for age distributions in the state of Ohio partitioned into ten year age groups. This data suggested a simple piecewise linear characterization of $f_{S}$ , which is also shown. Additionally, we normalize $f_{S}$ so that it integrates to one.

Refer to caption — Figure 2: An empirical density for the age distributions in Ohio partitioned into ten year brackets. From this we extracted a probability density curve, $f_{S}$ . Later we additionally make this density decay to 0 at the origin consistent with (7).

4.1.2 The hazard rate for time to vaccination $\lambda$

Figure 3 shows the empirical, cumulative vaccination doses given in Ohio from Nov 15th, 2020 - Jan 15th, 2021 (top panel) broken down by age group. These age-breakdowns were estimated from the vaccination counts provided by the ODH and CDC, [28, 10]. We observe that vaccine rollout did not begin until Dec 15th. It also shows the derived empirical estimations of the hazard rate for vaccination, $\lambda$ , from this data (bottom two panels). In order to do this, we measured an empirical survival function for vaccination using the cumulative vaccine doses curves and the age demography in Ohio and then applied to these values the negative $\ln$ -transformation. These data points were then fit by least squares with $C^{2}$ cubic splines, and their pointwise derivatives corresponded to the desired vaccination hazard rate. For more information on computing with splines, we refer to [29]. These several curves, parameterized in $t$ , were combined into a single $\lambda(t,s)$ function by using smoothed linear transitions in the age-variable $s$ across age-groups.

4.2 Estimation by Approximate Bayesian Computation

The remaining model parameters were not explicitly given by data, and we used instead an ABC scheme to estimate their values. The goal of Bayesian inference is to obtain samples from the posterior distribution of the parameters, which is proportional to a prior distribution on the parameters times the likelihood of the data given the parameters. However, in many cases, it is difficult or expensive to compute or even sample from the posterior. In particular, many methods for sampling from the posterior require computing the likelihood, which may itself be intractable.

ABC provides a method to approximate posterior samples without computing the likelihood. The simplest way to perform ABC is the rejection sampling method, which goes as follows: First, one proposes a vector of parameters from the joint prior. Next, one simulates data (or some summary statistic thereof) from the model using this vector of parameters. Finally, one compares this simulated data to the observed data using some distance metric and either accepts the proposed vector of parameters as a sample from an approximation of the posterior or rejects it depending on how close it is to the observed data. What is sufficiently close for acceptance may be determined by an absolute threshold or by retaining some proportion of the best samples. The intuition is that a vector of parameters with higher likelihood will more often produce simulated data that is close to the empirical data, and so ABC approximates the usual acceptance procedure based on the likelihood ratio that is often used in MCMC. For a more thorough discussion of ABC, we direct the reader to [25, 31].

We proposed the vector of parameters $(\beta_{\alpha},\gamma_{\alpha},\gamma_{\theta},\rho,\alpha_{\textrm{eff}})$ by drawing each quantity independently from a uniform prior with the bounds given in Table 1. We then accepted it if the associated infectious period was less than three weeks with 99.9% probability and if the mean contact interval was less than the mean infectious period. The parameter $\alpha_{L}$ was fixed to be two weeks. The parameter $\beta_{\theta}$ was not proposed, and hence, does not have a prior, since its value was determined from the other parameters. This followed from the last constraint in (7). In fact, for the particular pairing of a Weibull distribution for the contact interval and a uniform distribution for the initial infected, it follows from a little algebra that (7) implies $\beta_{\theta}$ is the length of the support of $f_{I}$ and independent of the other parameters.

We then used the PDE model to generate a predicted incidence trajectory for the Nov 15, 2020 - Jan 15, 2021 time period and compared the predicted trajectory to the empirical trajectory using the root mean square error ( Root Mean Square Error (RMSE)). Note that the prevalence at time $t$ is given by $n\int_{0}^{\infty}y_{I}(t,s)\mathrm{d}s$ , where $n$ is the total population size. We generated 5000 ABC sample trajectories in this fashion and retained the $10\%$ of sample trajectories with the lowest RMSE. The results are shown in Figure 4.

5 Numerical results

Here we demonstrate how the PDE-DSA model can be used in conjunction with with the ABC method to infer model parameters from epidemic data. As discussed in Section 4.2, the ABC method does not require an explicit likelihood but instead uses a computable error between synthetic and true data values to assess the quality of proposed parameter values. Aggregate population-level counts of infection are prototypical data that would be used with DSA, and so we apply this approach to daily reported incidence supplied by the ODH during the period of Nov 15, 2020, - Jan 15, 2021. This period encompassed the epidemic wave promoted by the rise of the COVID-19 $\alpha$ -variant [11] as well as the start of vaccine roll-out in Ohio.

The primary parameters of interest were for the contact interval characterized by $\beta$ and the infectious period characterized by $\gamma$ , along with estimates of the initial amount of infection $\rho$ . The vaccine efficacy parameter $\alpha_{\mathrm{eff}}$ , while also relevant, would not be identifiable due to still low rate of vaccine administration during the time window of interest. Figure 4 shows the model predictions together with the ODH reported daily incidence. The best fit obtained across all 5000 ABC samples captures the nontrivial empirical trends. Table 1 lists the model parameters with their best fit values and the ABC posterior credible intervals. The posterior distributions correspond to the parameter values that produced trajectories within the shaded bands of Figure 4. In Figure 5, we see that the mean time before transmission was approximately estimated to be between $\left(12.25,13.0\right)$ days while the mean time to recovery was approximately estimated to be between $\left(12.25,15.0\right)$ days. These estimates are in line with estimates from similar studies reported in the literature [6].

In the second panel of Figure 5, we give the posterior distribution for the basic reproduction number $R_{0}$ . This quantity is computed using the formula $R_{0}=\int^{\infty}_{0}S_{\gamma}(s)\beta(s)\mathrm{d}s$ as in [19], where $S_{\gamma}(s)$ is the survival function of the probability distribution characterized by the hazard rate $\gamma$ . Note that this formula ignores vaccination, and therefore, is an upper bound on the true basic reproduction number. The model estimated a mean of $R_{0}=1.28$ with a 95% credible interval of $\left(0.89,1.94\right)$ , which is consistent with other studies of basic reproduction numbers associated with the COVID-19.

Parameter

Unit

Description

Value/

Prior

Best

Fit

ABC Posterior

95% Credible Interval

Contact interval

\beta_{\alpha}

–

Weibull shape parameter

\left(5,20\right)^{*}

5.24

\left(5.06,9.55\right)

\beta_{\theta}

day

Weibull scale parameter

–

13.5038

\left(13.5036,13.5076\right)

Infectious period

\gamma_{\alpha}

–

Weibull shape parameter

\left(2,8\right)^{\ast}

6.05

\left(4.98,7.94\right)

\gamma_{\theta}

day

Weibull scale parameter

\left(1,18\right)^{*}

13.6

\left(13.43,15.03\right)

Starting infection

\rho

mass of the initial

infected population

relative the size of the

susceptible population

\left(0.1,5\right)

4.48

\left(0.79,4.95\right)

Vaccine

\alpha_{L}

day

time for vaccine to achieve

full efficacy

–

\alpha_{\mathrm{eff}}

vaccine blocking efficacy

\left(70,100\right)

97.0

\left(70.7,99.2\right)

Table 1: Model parameters used to fit the ODH daily case data from Nov 15, 2020 - Jan 15, 2021. We additionally report their assigned ABC priors, best fit values, and ABC posterior credible intervals from the numerical simulations presented in Section 5. The ABC posteriors were obtained by conditioning 5000 ABC samples to the best 10% of observed RMSE between the ODH daily incidence and model prediction. The priors for

\beta

and

\gamma

are listed with * since, in addition to these uniform ranges, we further conditioned on parameters where the infectious period lasted no more than 21 days with 99.9% probability and where the mean contact interval was less than the mean infectious period. Moreover, a prior for

\beta_{\theta}

is not given since (7) determines

\beta_{\theta}=\beta_{\theta}(\beta_{\alpha},f_{I})

. The narrow posterior of

\beta_{\theta}

was expected as reasoned in Section 4.2.

6 Discussion

The COVID-19 pandemic has spurred the development of a cottage industry of ever-more elaborate mathematical models of epidemic dynamics that have been applied to empirical data across the world with varying degree of success [24, 27, 14, 16, 1]. Since most of the current and historic COVID-19 data have been available in aggregate, the models tend to focus on aggregate behavior which may sometimes lead to erroneous insights [23]. The DSA method discussed here offers a different modeling approach and in particular accounts more fully that a typical compartmental model for the heterogeneity of individual behaviors.

Indeed, even since its introduction in [19] on the eve of an outbreak of the global COVID-19 pandemic, the DSA approach has been shown to be a viable way of analyzing epidemic data in order to predict epidemic progression, evaluate the long term effects of public health policies (testing, vaccination, lock downs, etc.) and individual-level decisions (masking, social distancing, vaccine hesitancy) [32, 35, 21, 36, 18]. As it was argued in [19], the DSA method has several advantages over more traditional approaches to analyzing data in SIR-type epidemic curves. Perhaps the most important one is that the method does not require the knowledge of the full curve trajectory or the size of the total susceptible population. Indeed, such quantities are rarely known in practice and often require ad-hoc adjustments leading to severely biased analysis [17].

In this work we discuss a relatively simple yet powerful non-Markovian extension of the original DSA method formulation introduced in [19]. This extension allows one to take into account the additional heterogeneity of the transmission patterns due to both the changes in infectiousness over the individuals’ infectious periods and the changes in immunity over the individuals’ periods of vaccine-derived protection. However, the practical price to pay for these modeling improvements is that a more elaborate numerical scheme is required to evaluate the DSA model along with its increased computational cost to fit empirical data. Here we have chosen to apply a likelihood-free ABC approach, which allows us to avoid the large numerical overhead usually associated with DSA likelihood-based methods [6]. This is accomplished by pregenerating parameter samples and then running simulations in parallel using modern multi-core capabilities. This capacity of ABC for parallel processing helps also mitigate the computational burden of a non-Markovian DSA, namely evaluating its nonlocal and implicit boundary conditions and associated flow formulation of (9). The analytic properties of (1) and (9) are of intrinsic interest in their own right, as a deeper understanding of solution regularity could help identify numerical schemes of higher and optimal convergence order. We hope to further pursue these lines of investigation in the future. As seen from the numerical examples in Section 5, using ABC we were able to fit the highly irregular model of Ohio COVID-19 epidemic with proper accounting for various sources of uncertainty in all of the relevant model parameters.

Appendix A Acronyms

ABC: Approximate Bayesian Computation
ABM: Agent-based Model
BA: Barabási-Albert
BD: Birth-Death
CDC: Centers for Disease Control and Prevention
CDF: Cumulative Distribution Function
CLT: Central Limit Theorem
CM: Configuration Model
CME: Chemical Master Equation
CRM: Conditional Random Measure
CRN: Chemical Reaction Network
CTBN: Continuous Time Bayesian Network
CTMC: Continuous Time Markov Chain
DSA: Dynamical Survival Analysis
DTMC: Discrete Time Markov Chain
DRC: Democratic Republic of Congo
ER: Erdös-Rényi
ESI: Enzyme-Substrate-Inhibitor
FCLT: Functional Central Limit Theorem
FLLN: Functional Law of Large Numbers
HJB: Hamilton–Jacobi–Bellman
iid: independent and identically distributed
IPS: Interacting Particle System
KL: Kullback-Leibler
LDP: Large Deviations Principle
LLN: Law of Large Numbers
LNA: Linear Noise Approximation
MAPK: Mitogen-activated Protein Kinase
MCMC: Markov Chain Monte Carlo
MGF: Moment Generating Function
MLE: Maximum Likelihood Estimate
MM: Michaelis-Menten
MPI: Message Passing Interface
MSE: Mean Squared Error
ODE: Ordinary Differential Equation
ODH: Ohio Department of Health
PDE: Partial Differential Equation
PDF: Probability Density Function
PGF: Probability Generating Function
PGM: Probabilistic Graphical Model
PMF: Probability Mass Function
psd: positive semi-definite
PT: Poisson-type
QSSA: Quasi-Steady State Approximation
rQSSA: reversible QSSA
RMSE: Root Mean Square Error
SD: Standard Deviation
SEIR: Susceptible-Exposed-Infected-Recovered
SI: Susceptible-Infected
SIR: Susceptible-Infected-Recovered
SIS: Susceptible-Infected-Susceptible
sQSSA: standard QSSA
ssLNA: Slow-scale Linear Noise Approximation
tQSSA: total QSSA
WS: Watts-Strogatz
whp: with high probability

Acknowledgments

WRK acknowledges financial support from the London Mathematical Society (LMS) under a Scheme 4 grant (Ref No: 42118). GAR acknowledges support from the National Science Foundation under grants DMS-2027001 and DMS-1853587.

Conflict of interest

The authors declare no conflict of interest.

References

[1] N. Barda, D. Riesel, A. Akriv, J. Levy, U. Finkel, G. Yona, D. Greenfeld, S. Sheiba, J. Somer, E. Bachmat, G. N. Rothblum, U. Shalit, D. Netzer, R. Balicer and N. Dagan, Developing a COVID-19 mortality risk prediction model when individual-level data are not available, Nat Commun, 11 (2020), 4439.
[2] C. D. Bastian and G. A. Rempala, Throwing stones and collecting bones: Looking for poisson-like random measures, Mathematical Methods in the Applied Sciences, 43 (2020), 4658–4668.
[3] J. Bezanson, A. Edelman, S. Karpinski and V. B. Shah, Julia: A fresh approach to numerical computing, SIAM Review, 59 (2017), 65–98.
[4] U. Census, County Population Totals 2010-2019, URL {https://www.census.gov/data/datasets/time-series/demo/popest/2010s-counties-total.html}, Accessed: September 13, 2021.
[5] B. Choi, S. Busch, D. Kazadi, B. Ilunga, E. Okitolonda, Y. Dai, R. Lumpkin, O. Saucedo, W. R. KhudaBukhsh, J. Tien, E. Kenah and G. Rempala, Modeling outbreak data: Analysis of a 2012 Ebola virus disease epidemic in DRC, Biomath, 8 (2019).
[6] F. Di Lauro, W. R. KhudaBukhsh, I. Z. Kiss, E. Kenah, M. Jensen and G. A. Rempała, Dynamic survival analysis for non-markovian epidemic models, Journal of The Royal Society Interface, 19 (2022), 20220124.
[7] E. DiBenedetto, Partial differential equations, 2nd edition, Cornerstones, Birkhäuser Boston, Ltd., Boston, MA, 2010.
[8] L. C. Evans, Partial differential equations, vol. 19 of Graduate Studies in Mathematics, 2nd edition, American Mathematical Society, Providence, RI, 2010.
[9] R. Ferrière and V. C. Tran, Stochastic and deterministic models for age-structured populations with genetically variable traits, in CANUM 2008, vol. 27 of ESAIM Proc., EDP Sci., Les Ulis, 2009, 289–310.
[10] C. for Disease Control and P. (CDC), US Centers for Disease Control and Prevention: COVID-19 vaccinations in the United States, County, URL {https://data.cdc.gov/Vaccinations/COVID-19-Vaccinations-in-the-United-States-County/8xkx-amqh}, Accessed: September 8th, 2021.
[11] C. for Disease Control and P. (CDC), US Centers for Disease Control and Prevention: SARS-CoV-2 Variant Classifications and Definitions, URL {https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-classifications.html}, Accessed: July 13, 2022.
[12] N. Fournier and S. Méléard, A microscopic probabilistic description of a locally regulated population and macroscopic approximations, The Annals of Applied Probability, 14 (2004), 1880–1919.
[13] E. Franco, M. Gyllenberg and O. Diekmann, One dimensional reduction of a renewal equation for a measure-valued function of time describing population dynamics, Acta Applicandae Mathematicae, 175 (2021), 12.
[14] I. Holmdahl and C. Buckee, Wrong but useful—what covid-19 epidemiologic models can and cannot tell us, New England Journal of Medicine, 383 (2020), 303–305.
[15] J. M. Hyman and J. Li, Infection-age structured epidemic models with behavior change or treatment, Journal of Biological Dynamics, 1 (2007), 109–131.
[16] N. P. Jewell, J. A. Lewnard and B. L. Jewell, Predictive Mathematical Models of the COVID-19 Pandemic: Underlying Principles and Value of Projections, JAMA, 323 (2020), 1893–1894, URL https://doi.org/10.1001/jama.2020.6585.
[17] E. Kenah, Contact intervals, survival analysis of epidemic data, and estimation of $R_{0}$ , Biostatistics, 12 (2011), 548–566.
[18] W. R. KhudaBukhsh, C. D. Bastian, M. Wascher, C. Klaus, S. Y. Sahai, M. H. Weir, E. Kenah, E. Root, J. H. Tien and G. A. Rempala, Projecting COVID-19 Cases and Subsequent Hospital Burden in Ohio, 2022, medRxiv, URL https://www.medrxiv.org/content/10.1101/2022.07.27.22278117v1.full.pdf+html.
[19] W. R. KhudaBukhsh, B. Choi, E. Kenah and G. A. Rempała, Survival dynamical systems: individual-level survival analysis from population-level epidemic models, Interface Focus, 10 (2020).
[20] W. R. KhudaBukhsh, H.-W. Kang, E. Kenah and G. Rempała, Incorporating age and delay into models for biophysical systems, Physical Biology, 18 (2021).
[21] W. R. KhudaBukhsh, S. K. Khalsa, E. Kenah, G. A. Rempala and J. H. Tien, COVID-19 dynamics in an Ohio prison, 2021, medRxiv, URL https://www.medrxiv.org/content/early/2021/01/15/2021.01.14.21249782.
[22] C. Klaus, PDE-DSA github repository, 2022, URL {https://github.com/klauscj68/PDE-Vax}.
[23] C. Klaus, M. Wascher, W. R. KhudaBukhsh, J. H. Tien, G. A. Rempała and E. Kenah, Assortative mixing among vaccination groups and biased estimation of reproduction numbers, The Lancet Infectious Diseases, 22 (2022), 579–581.
[24] K. Koelle, M. A. Martin, R. Antia, B. Lopman and N. E. Dean, The changing epidemiology of sars-cov-2, Science, 375 (2022), 1116–1121, URL https://www.science.org/doi/abs/10.1126/science.abm4915.
[25] T. Kypraios, P. Neal and D. Prangle, A tutorial introduction to bayesian inference for stochastic epidemic models using approximate bayesian computation, Mathematical biosciences, 287 (2017), 42–53.
[26] O. A. Ladyženskaja, V. A. Solonnikov and N. N. Ural’ceva, Linear and quasilinear equations of parabolic type, Translations of Mathematical Monographs, Vol. 23, American Mathematical Society, Providence, R.I., 1968, Translated from the Russian by S. Smith.
[27] M. O’Driscoll, G. Ribeiro Dos Santos, L. Wang, D. A. T. Cummings, A. S. Azman, J. Paireau, A. Fontanet, S. Cauchemez and H. Salje, Age-specific mortality and immunity patterns of SARS-CoV-2, Nature, 590 (2021), 140–145.
[28] O. D. of Health, Ohio Department of Health COVID Dashboard, URL {https://coronavirus.ohio.gov/wps/portal/gov/covid-19/dashboards/overview}, Accessed: October 29, 2021.
[29] L. L. Schumaker, Spline functions: Computational Methods, Society for Industrial and Applied Mathematics, Philadelphia, PA, 2015, doi:10.1137/1.9781611973907.
[30] N. Sherborne, J. C. Miller, K. B. Blyuss and I. Z. Kiss, Mean-field models for non-markovian epidemics on networks, Journal of Mathematical Biology, 76 (2018), 755–778.
[31] S. A. Sisson, Y. Fan and M. A. Beaumont, Handbook of Approximate Bayesian Computation, CRC Press, Boca Raton, FL, 2020.
[32] I. Somekh, W. R. KhudaBukhsh, E. D. Root, G. A. Rempała, E. Simões and E. Somekh, Quantifying the Population-Level Effect of the COVID-19 Mass Vaccination Campaign in Israel: A Modeling Study, Open Forum Infectious Diseases, 9 (2022).
[33] V. C. Tran, Large population limit and time behaviour of a stochastic particle model describing an age-structured population, ESAIM. Probability and Statistics, 12 (2008), 345–386.
[34] N. van Kampen, Remarks on Non-Markov Processes, Brazilian Journal of Physics, 28 (1998).
[35] H. Vossler, P. Akilimali, Y. Pan, W. R. KhudaBukhsh, E. Kenah and G. A. Rempała, Analysis of Individual-level Epidemic Data: Study of 2018-2020 Ebola Outbreak in Democratic Republic of the Congo, Scientific Reports, 12 (2022).
[36] M. Wascher, P. M. Schnell, W. R. KhudaBukhsh, M. Quam, J. H. Tien and G. A. Rempała, Monitoring sars-cov-2 transmission and prevalence in populations under repeated testing, 2021, URL https://www.medrxiv.org/content/10.1101/2021.06.22.21259342v1.

Appendix B Appendix

B.1 Analysis of the coupled flow system (1) and (9)

In this section, we begin by assuming the equation coefficients $\alpha$ , $\beta$ , $\gamma$ , and $\lambda$ are continuous and that $\alpha$ , $\beta$ , and $\lambda$ also have directional derivatives along characteristics which are continuous as well. We then also assume that the initial data $f_{S}$ and $f_{I}$ are continuous, compactly supported, and satisfying the compatibility conditions (7). As observed in Remark 2 below, we will also come to require that coefficients are bounded.

B.1.1 Structure conditions

First we truncate the domains of the integrals in (1) to span their respective $R$ ’s which we now index from 1 to 3. These $R$ ’s share a common t-axis but have different s-axes respectively spanning $\left[0,L_{i}\right]$ . Thus, we initially solve a separate problem than (1) where instead of integrals over $\left[0,\infty\right)$ we have finite domains. However, because of the compactness of our initial data, this problem will be equivalent to the original formulation a posteriori. Letting $y=\left(y_{S},y_{V},y_{I}\right)$ and $\mathcal{C}(R)$ stand for the set of continuous functions on the domain $R$ , we now rewrite (1) in the form

$\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{S}$	$\displaystyle=-\Lambda_{1}\left[y\right],$	(10)
$\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{V}$	$\displaystyle=-\Lambda_{2}\left[y\right],$
$\displaystyle\left(\partial_{t}+\partial_{s}\right)y_{I}$	$\displaystyle=-\Lambda_{3}\left[y\right],$

for functionals $\Lambda_{i}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\mathcal{C}\left(R_{i}\right)$ . Moreover, for positive quantities $C_{1}$ and $\mathcal{L}_{1}$ , the $\Lambda_{i}$ satisfy boundedness and Lipschitz conditions

	$\displaystyle\left\\|\Lambda_{i}\left[y\right]\right\\|_{\infty}$	$\displaystyle\leq C_{1}\left(\left\\|\lambda\right\\|_{\infty},\left\\|\alpha\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{I},\left\\|y\right\\|_{\infty}\right),$		(11)
	$\displaystyle\left\\|\Lambda_{i}\left[y_{1}\right]-\Lambda_{i}\left[y_{2}\right]\right\\|_{\infty}$	$\displaystyle\leq\mathcal{L}_{1}\left(\left\\|\alpha\right\\|_{\infty},\left\\|\beta\right\\|_{\infty},L_{I},\left\\|y_{1}\right\\|_{\infty},\left\\|y_{2}\right\\|_{\infty}\right)\left\\|y_{1}-y_{2}\right\\|_{\infty}.$		(12)

The boundary flow kernels in (9) are more elaborate but they satisfy similar structure conditions. The key point is that the flow equations are algebraic combinations of functionals which themselves satisfy boundedness and Lipschitz conditions. We may rewrite (9) as

	$\displaystyle\partial_{t}y_{V}(t,0)=\Gamma_{2}\left[y\right],$		(13)
	$\displaystyle\partial_{t}y_{I}(t,0)=\Gamma_{3}\left[y\right],$

for functionals $\Gamma_{i}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\mathcal{C}\left(\left[0,T\right]\right)$ . Their boundedness and Lipschitz conditions are

	$\displaystyle\left\\|\Gamma_{i}\left[y\right]\right\\|_{\infty}$	$\displaystyle\leq C_{2}\left(\left\\|D_{\chi}\lambda\right\\|_{\infty},\left\\|D_{\chi}\alpha\right\\|_{\infty},\left\\|D_{\chi}\beta\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{S},L_{V},L_{I},\left\\|y\right\\|_{\infty}\right),$		(14)
	$\displaystyle\left\\|\Gamma_{i}\left[y_{1}\right]-\Gamma_{i}\left[y_{2}\right]\right\\|_{\infty}$	$\displaystyle\leq\mathcal{L}_{2}\left(\left\\|D_{\chi}\lambda\right\\|_{\infty},\left\\|D_{\chi}\alpha\right\\|_{\infty},\left\\|D_{\chi}\beta\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{S},L_{V},L_{I},\left\\|y_{1}\right\\|_{\infty},\left\\|y_{2}\right\\|_{\infty}\right)\left\\|y_{1}-y_{2}\right\\|_{\infty}.$		(15)

Above we let $\left\|D_{\chi}f\right\|_{\infty}$ stand for the max supremum norm of both $f$ and $\left(\partial_{t}+\partial_{s}\right)f$ .

Remark 1.

Without loss of generality, we may assume $C_{1}$ , $C_{2}$ , $\mathcal{L}_{1}$ , and $\mathcal{L}_{2}$ are increasing functions of their arguments.

Remark 2.

The argument dependence of the Lipschitz constants in (11)-(12) and (14)-(15) on norms of the equation coefficients quantifies boundedness assumptions needed on equation coefficients for the Lipschitz assumptions to hold.

Remark 3.

Hereafter, we will abbreviate a constant’s dependence on equation coefficients or geometry as data. So $C_{1}=C_{1}(\mathrm{data},\left\|y\right\|_{\infty})$ . Similar holds for $C_{2}$ , $\mathcal{L}_{1}$ , and $\mathcal{L}_{2}$ .

B.1.2 A fixed point characterization of (10) and (13)

Mimicking the standard approach in ODEs, we now recast the solution of (10) and (13) as a fixed point for an integral equation. To that end, let $\tau^{*}=\min(s,t)$ and define
$\mathcal{F}_{T}:\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)\rightarrow\prod^{3}_{j=1}\mathcal{C}\left(R_{j}\right)$ by

\mathcal{F}_{T}\left[y\right]\left(t,s\right)=\begin{cases}f_{S}(s-\tau^{*})+\int^{\tau^{*}}_{0}\Lambda_{1}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\\ \int^{\left(t-\tau^{*}\right)}_{0}\Gamma_{2}\left[y\right]\left(\theta\right)d\theta+\int^{\tau^{*}}_{0}\Lambda_{2}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\\ f_{I}(s-\tau^{*})+\int^{\left(t-\tau^{*}\right)}_{0}\Gamma_{3}\left[y\right]\left(\theta\right)d\theta+\int^{\tau^{*}}_{0}\Lambda_{3}\left[y\right]\big{(}t-\tau^{*}+\chi,s-\tau^{*}+\chi\big{)}d\chi\end{cases}.

(16)

Then a fixed point of $\mathcal{F}_{T}$ is a classical solution of (1) and (9) with the initial conditions matching the prescribed explicit boundary data.

Lemma 1.

Let $\mathcal{F}_{h}:\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right)\rightarrow\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right)$ where $\bar{R}_{i}$ has the same s-axis as $R_{i}$ but in the t-axis only spans $\left[0,h\right]$ . Then $\exists M,c,h$ all strictly positive and $c<1$ which depend only on the data so that

1.

The restriction $\mathcal{F}_{h}:\bar{B}_{M}(0)\rightarrow\bar{B}_{M}(0)$ and so is bounded on a Banach space,
2.

For $y_{1},y_{2}\in\bar{B}_{M}(0)$ , $\left\|\mathcal{F}_{h}(y_{1})-\mathcal{F}_{h}(y_{2})\right\|_{\infty}\leq c\left\|y_{1}-y_{2}\right\|_{\infty}$ , and so $\mathcal{F}_{h}$ is a contraction mapping.

Proof: For the first part, begin by picking an $M$ larger than twice the supremum norm of $f_{S}$ and $f_{I}$ which we label $\left\|f\right\|_{\infty}$ for brevity. From the sup bounds (11) and (14), we see that on $\bar{B}_{M}(0)$ the vector functionals $\Lambda,\Gamma$ are bounded in terms of this constant M. We may now pick $h$ so that $h\max(\left\|\Lambda\right\|_{\infty},\left\|\Gamma\right\|_{\infty})<\left\|f\right\|_{\infty}/3$ . Now examining the sums in (16), we see these can be no more than $\frac{5}{3}\left\|f\right\|_{\infty}<M$ owing to the integral domains having length $O(h)$ .

For the second part, we may now use that $y_{1},y_{2}$ are supremum bound by $M$ to reduce the dependence of the Lipschitz constants in (12) and (15) to just the data. But now we conclude in the usual manner by seeing that the integral domains in (16) have length $O(h)$ and so, by further reduction in $h$ , we make the product of $h$ with those Lipschitz constants suitably underneath a threshold $c<1$ .

Corollary 1.

For initial data $\left(f_{S},0,f_{I}\right)$ and an $M$ strictly larger than the supremum norm of the $f$ ’s, then over a time span $h=h(\mathrm{data},M)$ , the coupled flow equations (1) and (9) have a unique bounded solution in the space $\prod^{3}_{j=1}\mathcal{C}\left(\bar{R}_{j}\right)$ . This solution may be further taken to be bounded by M over the time span $h$ .

Proof: This follows from the last lemma and the uniqueness of fixed points for contraction mappings in Banach space.

Remark 4.

When instead the initial data has a nonzero density $f_{V}$ for the vaccinated compartment, the same arguments and conclusions hold just with $M$ now also larger than that component’s supremum norm.

Remark 5.

Since the size of $h$ is in a monotone decreasing relationship with M but otherwise depends on data which are fixed, we see that so long as an a priori global bound can be derived for solutions of (1) and (9), then they will persist globally and be unique. This follows by the usual iteration of extending the solution by patches $\left[kh,(k+1)h\right]$ along the t-axis.

B.1.3 On the equivalence of (1) and (9) with the original formulation (1) and (3)-(5)

It remains to show that fixed points of (16) coincide at the t-axis with their intended implicit boundary values (3)-(5). The technical concern is that while (16) is sufficient to imply a fixed point solution is differentiable along characteristics, it does not imply a priori that a solution is differentiable along either the t-direction or s-direction individually. Recall in motivating (9), we treated the $\partial_{s}$ and $\partial_{t}$ derivatives separately. Nevertheless, we show here fixed point solutions do take the intended implicit boundary data. For further examples on the uses of difference quotients and summation by parts in the theory of PDEs, we refer to standard references such as [26].

For the fixed point solution $y$ of (16), Let us notate $\phi_{V}(t)=\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t,s)ds$ . We aim to show $\phi_{V}(t)=y_{V}(t,0)$ . We will also notate $\Delta_{v}f(t,s)=f(t+v_{1},s+v_{1})$ , which is the finite difference of the function $f$ along the vector $v$ . We may now compute

	$\displaystyle\Delta_{he_{1}}\phi_{V}(t)$	$\displaystyle=\int^{L_{S}}_{0}\bigg{(}\Delta_{he_{1}}\lambda(t,s)y_{S}(t+h,s)+\lambda(t,s)\Delta_{he_{1}}y_{S}(t,s)\bigg{)}ds$
		$\displaystyle=\int^{L_{S}}_{0}\bigg{(}\Delta_{he_{1}}\lambda(t,s)y_{S}(t+h,s)+\lambda(t,s)\Delta_{he_{1}+he_{2}}y_{S}(t,s)-\lambda(t,s)\Delta_{he_{2}}y_{S}(t+h,s)\bigg{)}ds.$

We next apply summation by parts to the last term:

	$\displaystyle-\int^{L_{S}}_{0}\lambda(t,s)\Delta_{he_{2}}y_{S}(t+h,s)ds$	$\displaystyle=-\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s+h)ds+\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s)ds$
		$\displaystyle=-\int^{L_{S}}_{h}\lambda(t,s-h)y_{S}(t+h,s)ds+\int^{L_{S}}_{0}\lambda(t,s)y_{S}(t+h,s)ds$
		$\displaystyle=-\int^{L_{S}}_{h}\Delta_{-he_{2}}\lambda(t,s)y_{S}(t+h,s)ds+\int^{h}_{0}\lambda(t,s)y_{S}(t+h,s)ds.$

If we now combine these equations, divide by $h$ , and then pass to the limit as $h\rightarrow 0$ , we obtain from the regularity of all terms involved that

	$\displaystyle\partial_{t}\phi_{V}(t)$	$\displaystyle=\int^{L_{S}}_{0}\bigg{(}y_{S}(t,s)\left(\partial_{t}+\partial_{s}\right)\lambda(t,s)+\lambda(t,s)\left(\partial_{t}+\partial_{s}\right)y_{S}\bigg{)}ds+\lambda(t,0)y_{S}(t,0)$
		$\displaystyle=\int^{L_{S}}_{0}\bigg{(}\left(\partial_{t}+\partial_{s}\right)\lambda(t,s)y_{S}(t,s)+\lambda(t,s)\left(-\lambda(t,s)-\int^{L_{I}}_{0}\beta(u)y_{I}(t,u)du\right)y_{S}(t,s)\bigg{)}ds.$

Notice in the last equality that we have used directional derivatives of the solution along characteristics are classical and given by (1). Also, we used that $y_{S}(t,0)=0$ by (3). Upon inspection, we see this is exactly the flow equation for the boundary data of $y_{V}$ in (9). It now follows from the uniqueness of solutions for ODEs that $\phi_{V}(t)$ and $y_{V}(t,0)$ must coincide. These arguments may be repeated for the $y_{I}$ solution to show its intended implicit boundary data also evolves according to the flow of (9).

B.2 Demonstrating numerical convergence of the implicit boundary conditions

In Fig 6 we demonstrate the PDE numerical solution better satisfying the implicit boundary conditions as the mesh resolution and ode integrator tolerances become increasingly fine. For the figure legend, we note that nnd is the number of mesh nodes along the s-axis while atol and rtol are respectively the adaptive Euler schemes absolute and relative tolerances for refining the step.

Recall that, since the magnitude of a probability density varies inversely with the length of its domain, the absolute errors shown are for the solution density multiplied by the size of its domain. In this sense, the absolute error reported is for the solution density if it were first rescaled to a density on the unit interval.

We observe a slight early increase in the relative error of $y_{V}$ ’s implicit boundary data as resolutions become increasingly fine. We expect this is due to the true magnitude of $y_{V}$ becoming increasingly small, which thus leads to division by a smaller number.

	$\displaystyle\left\\|\Lambda_{i}\left[y\right]\right\\|_{\infty}$	$\displaystyle\leq C_{1}\left(\left\\|\lambda\right\\|_{\infty},\left\\|\alpha\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{I},\left\\|y\right\\|_{\infty}\right),$		(11)
	$\displaystyle\left\\|\Lambda_{i}\left[y_{1}\right]-\Lambda_{i}\left[y_{2}\right]\right\\|_{\infty}$	$\displaystyle\leq\mathcal{L}_{1}\left(\left\\|\alpha\right\\|_{\infty},\left\\|\beta\right\\|_{\infty},L_{I},\left\\|y_{1}\right\\|_{\infty},\left\\|y_{2}\right\\|_{\infty}\right)\left\\|y_{1}-y_{2}\right\\|_{\infty}.$		(12)

	$\displaystyle\left\\|\Gamma_{i}\left[y\right]\right\\|_{\infty}$	$\displaystyle\leq C_{2}\left(\left\\|D_{\chi}\lambda\right\\|_{\infty},\left\\|D_{\chi}\alpha\right\\|_{\infty},\left\\|D_{\chi}\beta\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{S},L_{V},L_{I},\left\\|y\right\\|_{\infty}\right),$		(14)
	$\displaystyle\left\\|\Gamma_{i}\left[y_{1}\right]-\Gamma_{i}\left[y_{2}\right]\right\\|_{\infty}$	$\displaystyle\leq\mathcal{L}_{2}\left(\left\\|D_{\chi}\lambda\right\\|_{\infty},\left\\|D_{\chi}\alpha\right\\|_{\infty},\left\\|D_{\chi}\beta\right\\|_{\infty},\left\\|\gamma\right\\|_{\infty},L_{S},L_{V},L_{I},\left\\|y_{1}\right\\|_{\infty},\left\\|y_{2}\right\\|_{\infty}\right)\left\\|y_{1}-y_{2}\right\\|_{\infty}.$		(15)