This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\jyear

2021

[1]\fnmAnotida \surMadzvamuse

[1]\orgdivSchool of Mathematical and Physical Sciences, \orgaddress\streetDepartment of Mathematics, \orgnameUniversity of Sussex, \cityBrighton, \postcodeBN1 9QH, \countryUK

2]\orgdivSchool of Mathematical, Statistical and Actuarial Sciences, \orgaddress\streetDepartment of Statistical Methodology and Applications, \orgnameUniversity of Kent, \cityCanterbury, \postcodeCT2 7PE, \countryUK

3]\orgdivDepartment of Primary Care and Public Health, \orgaddress\streetBrighton and Sussex Medical School, \cityBrighton, \postcodeBN1 9PH, \countryUK

A hospital demand and capacity intervention approach for COVID-19 in the UK

\fnmJames \surVan Yperen    \fnmEduard \surCampillo-Funollet    \fnmRebecca \surInkpen    \fnmAnjum \surMemon    [email protected] * [ [
Abstract

The mathematical interpretation of interventions for the mitigation of epidemics and pandemics in the literature often involves finding the optimal time to initiate an intervention and/or the use of infections to manage impact. Whilst these methods may work in theory, in order to implement they may require information which is likely not available whilst one is in the midst of an epidemic, or they may require impeccable data about infection levels in the community. In practice, testing and cases data is only as good as the policy of implementation and the compliance of the individuals, which means that understanding the levels of infections becomes difficult or complicated from the data that is provided. In this paper, we aim to develop a different approach to the mathematical modelling of interventions, not based on optimality, but based on demand and capacity of local authorities who have to deal with the epidemic on a day to day basis. In particular, we use data-driven modelling to calibrate an Susceptible Exposed Infectious Recovered-Died (SEIR-D) model to infer parameters that depict the dynamics of the epidemic in a region of the UK. We use the calibrated parameters for forecasting scenarios and understand, given a maximum capacity of hospital healthcare services, how the timing of interventions, severity of interventions, and conditions for the releasing of interventions affect the overall epidemic-picture.

keywords:
COVID-19, mathematical, modelling, interventions, healthcare, data-driven

1 Introduction

The resurgence of the severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) that causes COVID-19, and its mutant variants, are putting national health systems in most countries under significant pressure/strain due to an increase in COVID-19 hospitalisations and the provision of critical care for patients in need. Towards the end of 2020 and the beginning of 2021 the UK was battling the spike of COVID-19 infections across the country, which likely started in the Kent region, due to the spread of the Alpha variant. Since the end of 2021, the UK has been battling the spike of infections across the country, which this time likely started in London, due to the spread of the Omicron variant. The UK government has been pushing a combination of non-pharmaceutical interventions for England, such as the four step roadmap out of lockdown, which relies on the surveillance of the infection rate, and the effectiveness of the national vaccination programme to combat the spread of the Alpha variant. Although the success of previous lockdowns and tiered systems have come under scrutiny DBJal20 , the success of the roadmap interventions and the vaccination programme until approximately late May 2021 can be seen by examining the reported COVID-19 data on the Coronavirus Dashboard produced by the UK Government111https://coronavirus.data.gov.uk/. From May onwards we start to see an increase in the number of cases nationally, which had not materialised into a significant increase in hospitalisations until the Omicron wave of December 2021. Indeed, in July 2021, the reported daily cases in the UK were around the same amount compared to the spike at the beginning of 2021 which, amongst other factors, caused the UK government to initiate “lockdown 3”, whereas daily hospitalisation counts in July 2021 were a quarter of what they were in January 2021. However, as we moved into winter, when infectious disease hospital activity is normally at its peak, with competing infections such as influenza, population healthcare management in hospitals and local authorities needed to be able to forecast the potential impact COVID-19 resurgence on hospital demand and capacity. Elective treatments, such as surgery and chemotherapy, have been and still are substantially backlogged due to the pandemic, and another resurgence of COVID-19 could add more pressure on healthcare systems without proper planning. Given the current situation, and the current state-of-the-art modelling of epidemics, hospital planners need to be able identify indicators in community infections which would cause a surge in hospital activity - in particular, using predictive modelling, they need metrics and rules as to when a potential surge may be occurring given the current data. This concept is particularly important with the reduction in the collection of COVID-19 data at the end of March 2022, with the implication that we are looking to “live with” COVID-19 now, a.k.a the endemic phase, and so healthcare systems need quantitative approaches to be able to identify when an increase in hospitalisations implies a larger incoming wave. What this study aims to contribute to the epidemiological, public health and mathematical communities are ways in which local authorities can use national information in their own trained mathematical models, models which are fitted to the data of their region, and to run plausible scenarios based on effective measures to control their hospital demand and capacity.

In the literature, there is a plethora of pre-prints and peer-reviewed articles that present work on forecasting COVID-19 dynamics, which can be used for future pandemics when they arise CVAal20 ; DZSal20 ; EHMal20 ; FLNal20 ; FGLal20 ; GLBal20 ; JLJ20 ; LTY20 ; LMKal20 ; MG20 . With the rise of data-science over the last decade, data availability and quality has enabled data-driven modelling and research, allowing for clear applications of infectious disease modelling rather than just theoretical work. In particular, the availability of government-led data initiatives making epidemiological and public health data accessible Ar20 . For example, here in the UK, during the COVID-19 pandemic the government produced the Coronavirus Dashboard which gives public access to local, regional and national data concerning testing, cases, hospitalisations, vaccinations and deaths - all major contributions when considering a data-driven model of healthcare demand and capacity. Modelling efforts now should be conducted in such a way which allow them to be used in the future, to which we can use COVID-19 data and policy as an application of the work. However, the majority of mathematical modelling publications are aimed either at national level modelling with an assumption that the reader knows the standard mathematical jargon. The inherent assumptions behind the modelling decisions and the parameters that are adjusted for interventions scenarios are often not made clear. The swift wave of COVID-19 across the globe has identified the need for reliable, sensitive and validated data-driven approaches that are accessible by local authorities to make quantitative and qualitative decisions on policy. To combat this, public-policy in mind, epidemiological research groups across the UK, and in fact across the world, have been producing web-based tools to combat COVID-19 and provide ways for non-mathematicians to picture and understand the data available. A comprehensive review of different web-based tools can be found in MHG21 . Since these models are readily available to be used, and with the conclusion and recommendations of the Goldacre report for public healthcare management to “embrace help from other sections such as academia” goldacre , it is more important now more than ever that the mathematically modelling assumptions are present, visible and understandable and that the scope of the model is clear, something which we look to obtain with our toolkit Halogen222https://www.halogen-health.org/.

Although we focus on COVID-19 and use COVID-19 data to fit our model, we note that the concept behind this study can be applied to any infectious disease, or other application, provided it has an SIR type model describing the mathematics and there is the appropriate data available. Whilst it is reasonably simple to generate an SIR type model for an infectious disease, COVID-19 is seemingly the first infectious disease to have data collected and made publicly available for parameter estimation in the manner we present, and so we can not claim that our approach would work without such detailed data. One may assume that, in the scenario of future pandemics, similar such government-led data initiatives will be available to enable future modelling efforts. Indeed, similar conceptual approaches have been used by modelling groups across the country to understand the impact of the UK Foot and Mouth outbreak of 2001, see K05 and references therein, and the UK SARS H1N1 pandemic of 2009, see BVJFWE10 and references therein. For a mathematical modelling approaches to a generic influenza pandemic at a national or international level, see also FCFCCB06 and references therein.

The objective of this paper is to demonstrate the flexibility of data-driven epidemiological and mathematical modelling for providing robust intervention scenarios. In this study we focus on hospital capacity as the metric to decide if an intervention is initiated. We consider the situation whereby given information about the spread of the disease, in particular the “known” RR number, we demonstrate how using current hospital capacity to lift an intervention can significantly change the epidemic and demonstrate an approach to judging what the thresholds of occupied beds should be given a resource limit (maximum number of beds available for COVID-19 patients) before initiating an intervention. Similar studies have been conducted using ICU capacity as a determinant, such as AFG22 ; MSW20 , best intervention scenario possible given parameters. such as DKM21 ; NPP16 , or only consider intervention scenarios based on the current number of infections, such as FIL21 ; JG22 ; S21 . What we aim to demonstrate in this paper is the ability to conduct decision making that does not require information not available in the moment (such as the peak of epidemic curves).

The outline of this paper is as follows. In Section 2 we demonstrate the flexibility of our approach in CVAal20 by fitting parameters of regions with publicly available data. In Section 3 we apply intervention measures based on hospital occupancy using the parameters previously inferred. In Section 4 we introduce an agent-based equivalent to the mathematical model in Section 2, validate the approach and apply the intervention scenarios in Section 3, conduct a further investigation into how transmission and contacts effect the results of the scenarios and compare the results to the results in Section 3 to understand the role of stochastic perturbations. In Section 5 we outline some limitations of this work with potential solutions and in Section 6 we summarise the main findings of this study.

2 A data-driven SEIR-D model

2.1 Data

Following CVAal20 , in this section we present a simple data-driven susceptible-exposed-infected-recovered-dead (SEIR-D) model as demonstrated in Figure 1. The model breaks down the typical infectious compartment of an SEIR-D model into two compartments, one strand to model individuals who become infectious with COVID-19 and will be going to hospital (II), and the other strand to model individuals who will not need to go to hospital and thus remain undetected by hospital healthcare requirements (UU). Splitting the infectious compartment this way allows us to describe the number of individuals who are in hospital with COVID-19 at any time, which allows us to fit the hospital data to the model to be able to infer information about the number of infectious cases in the community. This approach allows us to circumvent the issue of parameter identifiability and estimation due to the lack of obtainable and usable data regarding those who are infectious and asymptomatic.

In CVAal20 we used specific regional datasets to calibrate the model. As part of the national COVID response, all the National Health Service (NHS) hospitals in England treating COVID-19 patients submitted a Daily Situation Report (SITREP) to NHS England. The data associated to Sussex NHS Trusts were extracted and combined to weekly death counts from the Office for National Statistics (ONS) with COVID-19 reported as the underlying cause of death, which we then received and fitted. To be precise, by Sussex we mean the collective term for geographies pertaining to the counties of East Sussex and West Sussex in South East England. The SITREP contained counts of daily admissions, daily discharges, and the beds occupied daily, whilst the death dataset contained the number of deaths recorded outside of hospitals and the number of deaths recorded within hospitals.

Whilst the ONS death data are publicly available, the SITREP data in general are not, but, given the national need for data, the UK government produced the Coronavirus Dashboard. It provides users with the ability to look at different metrics of COVID-19, such as hospitalisations and deaths, for different regions and provides an API for users to download the datasets themselves. The granularity of the data depends on the size of the region the data are required for, typically all data are available for each nation of the UK, but hospitalisations and deaths are split depending on their geographical location. Indeed, hospitalisations are recorded using NHS regions and NHS trusts, and deaths are recorded using local authority regions. The unfortunate difference between using the Coronavirus dashboard and the SITREP is that the Coronavirus dashboard does not contain the number of daily discharges, and it does not differentiate place of death like the ONS weekly death registry does. In order to apply the approach presented in CVAal20 we adapted the Coronavirus dashboard data in the following way: we calculated the proportion of hospital COVID-19 deaths to the total number of COVID-19 deaths for the region being considered from the ONS weekly death registry and applied this proportion to the deaths dataset acquired from the Coronavirus Dashboard to give us a proxy dataset on deaths in hospital and deaths outside of hospital. Next, we then used the deaths in hospital with the admissions and beds occupied to find a proxy dataset for the discharges, since the offset of beds occupied between each day depends on the number of admissions, discharges and deaths that day. We note that, although the proxy death datasets might closely track the ONS weekly death registry, the discharges dataset is very noisy due to noise accumulating from multiple sources. For this reason, we decided to apply a 7-day rolling average to the discharges. We note here that the Coronavirus dashboard has access to patients in mechanical ventilation beds and so we could include a compartment that describes the high-dependency unit (HDU). We decided not to do this since we do not have access to the number of patients who have died in HDU, we only have access to the number of patients who died in hospital, and so parameter identifiability would be an issue.

For illustrative purposes, we only consider the geographical regions of North West England, South East England and the nation of England. We chose these as both NHS region and local authority region are the same.

Refer to caption
Figure 1: Schematic representation of the compartmental pathways on which the SEIR-D mathematical model is formulated used first principles.

2.2 Model and parameter estimation

The mathematical model is a data-driven generalisation of a simple temporal epidemiological dynamic system of ordinary differential equations, governed by the interactions depicted in Figure 1, supported by non-negative initial conditions

S˙\displaystyle\dot{S} =βU+INS,t(0,T],S(0)=S0,\displaystyle=-\beta\frac{U+I}{N}S,\qquad\quad\,\,\,\quad t\in(0,T],\qquad S(0)=S_{0}, (2.1)
E˙\displaystyle\dot{E} =βU+INSγEE,t(0,T],E(0)=E0,\displaystyle=\beta\frac{U+I}{N}S-\gamma_{E}E,\quad\quad t\in(0,T],\qquad E(0)=E_{0}, (2.2)
U˙\displaystyle\dot{U} =pγEEγUU,t(0,T],U(0)=U0,\displaystyle=p\,\gamma_{E}E-\gamma_{U}U,\qquad\quad\,\,\,\,t\in(0,T],\qquad U(0)=U_{0}, (2.3)
I˙\displaystyle\dot{I} =(1p)γEEγII,t(0,T],I(0)=I0,\displaystyle=(1-p)\gamma_{E}E-\gamma_{I}I,\quad\,\,\,\,t\in(0,T],\qquad I(0)=I_{0}, (2.4)
H˙\displaystyle\dot{H} =γII(γH+μH)H,t(0,T],H(0)=H0\displaystyle=\gamma_{I}I-(\gamma_{H}+\mu_{H})H,\quad\,t\in(0,T],\qquad H(0)=H_{0} (2.5)
R˙U\displaystyle\dot{R}_{U} =(1mU)γUU,t(0,T],RU(0)=RU,0,\displaystyle=(1-m_{U})\gamma_{U}U,\qquad\quad\,\,\,t\in(0,T],\qquad R_{U}(0)=R_{U,0}, (2.6)
R˙H\displaystyle\dot{R}_{H} =γHH,t(0,T],RH(0)=RH,0,\displaystyle=\gamma_{H}H,\qquad\qquad\qquad\quad\,\,t\in(0,T],\qquad R_{H}(0)=R_{H,0}, (2.7)
D˙U\displaystyle\dot{D}_{U} =mUγUU,t(0,T],DU(0)=DU,0,\displaystyle=m_{U}\gamma_{U}U,\qquad\qquad\qquad t\in(0,T],\qquad D_{U}(0)=D_{U,0}, (2.8)
D˙H\displaystyle\dot{D}_{H} =μHH,t(0,T],DH(0)=DH,0.\displaystyle=\mu_{H}H,\qquad\qquad\qquad\quad\,t\in(0,T],\qquad D_{H}(0)=D_{H,0}. (2.9)

Here, the dot above the notation denotes the time derivative. In this setting, NN denotes the total regional population. S(t)S(t) denotes the proportion of the total population NN who are susceptible to the disease, COVID-19. Susceptible individuals become exposed to the disease, i.e. they are carrying the disease but are not currently infectious, to form the E(t)E(t) subpopulation at rate λ(t)\lambda(t) which represents the average infectivity. The rate λ(t)\lambda(t) is the product between β\beta, the average transmission rate, and the probability of meeting an infectious person (U(t)+I(t))N1(U(t)+I(t))N^{-1}. The E(t)E(t) subpopulation is an incubation compartment and further evolves in two ways. A proportion of E(t)E(t) becomes infectious but, in the spirit of the hospital healthcare demand, remains undetected with probability pp, denoted U(t)U(t), at a rate γE\gamma_{E}, or a proportion of E(t)E(t) becomes infectious and will require hospitalisation in the future with a probability of 1p1-p, denoted I(t)I(t), at a rate γE\gamma_{E}. The subpopulation that does not require hospitalisation can either progress to recover with a probability of 1mU1-m_{U}, at rate γU\gamma_{U}, to form the recovered population, denoted by RU(t)R_{U}(t), or die with a probability of mUm_{U}, at rate γU\gamma_{U}, to form the dead population, denoted by DU(t)D_{U}(t). Considering the infectious population that will be going to hospital, these individuals will become hospitalised, denoted by H(t)H(t), and thus be in hospital care at rate γI\gamma_{I}. We assume that once a patient has been admitted into hospital, they no longer transmit to other non-COVID-19 patients, visitors or workers. This could be due to appropriate isolation and use of personal hygiene measures in hospital, however we do note that transmission in hospital is not unheard of. For the model, this means that we assume transmission is only conducted in the community. Once in hospital, patients can evolve in two separate pathways, a proportion of the hospitalised population can fully recover at rate γH\gamma_{H} to form the subpopulation denoted by RH(t)R_{H}(t). Alternatively, if they can not recover, then they die while in hospital at rate μH\mu_{H}, to form the dead population denoted by DH(t)D_{H}(t). We note that γH\gamma_{H} and μH\mu_{H} are closely related to the length of stay in hospitals.

As is standard for epidemiological models of this nature, β\beta denotes the average transmission rate, γE1\gamma_{E}^{-1} denotes the average incubation time, pp denotes the average proportion of infectious individuals who will not require hospital treatment, γU1\gamma_{U}^{-1} denotes the average infectious period for those not needing hospital treatment, mUm_{U} denotes the infected fatality ratio for undetected cases, γI1\gamma_{I}^{-1} denotes the average infectious period from becoming infectious to being admitted to hospital, γH1\gamma_{H}^{-1} denotes the average hospitalisation period for those who recover and μH1\mu_{H}^{-1} represents the average hospitalisation period for those who die. For this model, using the method of next generation matrices DHR10 , we derive the formula for the basic reproduction number 0\mathcal{R}_{0} as the following

0:=β(pγU+1pγI).\displaystyle\mathcal{R}_{0}:=\beta\left(\frac{p}{\gamma_{U}}+\frac{1-p}{\gamma_{I}}\right). (2.10)

Using this, we calculate the effective reproduction number t\mathcal{R}_{t} as

t:=0S(t)N.\displaystyle\mathcal{R}_{t}:=\mathcal{R}_{0}\,\frac{S(t)}{N}. (2.11)

The effective reproduction number t\mathcal{R}_{t} is often referred to as the RR number. We briefly explain the fitting procedure presented in CVAal20 , we refer the interested reader to CVAal20 for further details. First, we utilise the linear relationship between the model description of hospital discharges and hospital deaths and use linear regression analysis to calculate the ratio of discharges to deaths. One can see that, in terms of the model and its parameters, the daily discharges can be written as

Dis(t):=γHt1tH(s)ds,\displaystyle Dis(t):=\gamma_{H}\int_{t-1}^{t}H(s)\,\text{d}s, (2.12)

and the daily hospital deaths can be written as

DthH(t):=μHt1tH(s)ds.\displaystyle Dth_{H}(t):=\mu_{H}\int_{t-1}^{t}H(s)\,\text{d}s. (2.13)

The linear regression allows us to estimate the ratio between γH\gamma_{H} and μH\mu_{H}. Next, we rewrite equations (2.1)–(2.5) in terms of the data, we call this the “observational” model, whereby (2.6)–(2.9) are not considered since they are cumulative representations of the compartments in (2.1)–(2.5). The observational model is formed of compartments that are available from the data. Indeed, one can see that, in terms of the model and its parameters, the daily admissions can be written as

Adm(t):=γIt1tI(s)ds,\displaystyle Adm(t):=\gamma_{I}\int_{t-1}^{t}I(s)\,\text{d}s, (2.14)

the daily deaths outside of hospital is

DthU(t):=γUmUt1tU(s)ds,\displaystyle Dth_{U}(t):=\gamma_{U}m_{U}\int_{t-1}^{t}U(s)\,\text{d}s, (2.15)

and daily discharges as in (2.12) above. Thus, the observational model, as presented in CVAal20 , is

H˙\displaystyle\dot{H} =γIIγH(1+μHγH)H,\displaystyle=\gamma_{I}I-\gamma_{H}\left(1+\frac{\mu_{H}}{\gamma_{H}}\right)H, (2.16)
U˙\displaystyle\dot{U} =p1p(I˙+γII)γUU,\displaystyle=\frac{p}{1-p}\left(\dot{I}+\gamma_{I}I\right)-\gamma_{U}U, (2.17)
I˙˙˙\displaystyle\dddot{I} =(I¨+(γE+γI)I˙+γEγII)[pγII1p+I˙1pγUUβN(U+I)2](U+I)1\displaystyle=\left(\ddot{I}+(\gamma_{E}+\gamma_{I})\dot{I}+\gamma_{E}\gamma_{I}I\right)\left[\frac{p\gamma_{I}I}{1-p}+\frac{\dot{I}}{1-p}-\gamma_{U}U-\frac{\beta}{N}(U+I)^{2}\right](U+I)^{-1}
(γE+γI)I¨γEγII˙.\displaystyle\qquad-(\gamma_{E}+\gamma_{I})\ddot{I}-\gamma_{E}\gamma_{I}\dot{I}. (2.18)

We solve the observational model, compute (2.12), (2.14) and (2.15) and compare against the datasets given. This allows us to use a maximum likelihood estimation approach by minimising a negative log-likelihood with some constraints on the initial conditions and on the effective reproduction number. These constraints are needed to keep the feasible region of the parameters close to the realistic sets. Without these constraints, the peak of the datasets could be explained by being close to herd immunity (i.e., a lot of infections have already occurred before the lockdown) or a large value of t\mathcal{R}_{t} combined with a small value of pp. We note here that (2.13) does not need to be used in the observational model due to the linear regression and the fact that the resulting log-likelihood functions from the regression and observational model are independent. This means that there is one less parameter to infer from solving the observational model.

Since we are considering the data from the first day of lockdown, we also need to infer the initial conditions. We have not conducted a formal investigation into the resulting log-likelihood, but it is clear that there is a continuous dependence between initial conditions and the parameters, see CWVDM21 for a comprehensive discussion. In practice, we see this manifest as an issue to fit pp - if the first guess of initial conditions and parameters is not close to the “true” values, then it is pp which changes in value the most. Although in reality, pp is characterised by how COVID-19 affects individuals, from demographics such as age, ethnicity, gender, however we speculate this value should not change drastically between regions. In view of this, in the parameter estimation process we fix pp to be the same throughout the estimation for the three regions.

From the model and using the fitting procedure described above, we gain the parameters in Table 1 and initial conditions in Table 2, with a demonstration of the fit for beds occupied demonstrated in Figure 3. The equivalent figure for the Sussex region fit can be found in CVAal20 in Figure 2. We note that the infected fatality ratio and average hospitalisation period for those who recover are similar across all regions, but the average hospitalisation period for those who die and the value of t\mathcal{R}_{t} when lockdown commenced are quite different. The varied values of t\mathcal{R}_{t} could be explained by the amount of infections in each region, it was reported at the time that Sussex and the South East escaped quite lightly on the number of infections, whilst the North of England did not. This is shown in Figure 2, whereby we have taken the model output of the beds occupied and divided it by the population size for each region and multiplied it by 100. Proportionally, the North West saw almost double the amount of patients as the South East. The higher value of t\mathcal{R}_{t} for North West can also be seen by looking at the gradient of the beds occupied. There could be a myriad of reasons for this, such as the demographic of the population or the geography of the region. One should also note that, as we came out of the first lockdown and went into the first iteration of the tiered system, it was cities in the north which first started to show signs of resurgence (such as Manchester). Using γH\gamma_{H} and μH\mu_{H} for each region, one can calculate that the estimate of the probability of discharge for England, South East and North West was 58.03%, 61.57%, 55.31% respectively. As for the reason why, one may speculate that this is due to the number of patients in the hospitals being higher in the north which puts pressure on the health system.

Refer to caption
Figure 2: Proportional of beds occupied for England, South East and North West.
Refer to caption
Figure 3: Beds occupied fit for England, South East and North West.
Table 1: Parameters of interest derived using method in CVAal20 for three different regions of the UK.
Parameter England South East North West
t\mathcal{R}_{t} 0.763 0.733 0.835
γH1\gamma_{H}^{-1} 13.11 days 13.20 days 13.28 days
mUm_{U} 0.0013 0.0013 0.0010
μH1\mu_{H}^{-1} 18.13 days 21.16 days 16.43 days
\botrule
Table 2: Initial conditions of interest derived using method in CVAal20 for three different regions of the UK.
Initial Condition (%NN) England South East North West
E0E_{0} 0.891% 0.710% 1.061%
U0U_{0} 0.025% 0.018% 0.025%
I0I_{0} 0.041% 0.001% 0.002%
\botrule

3 Intervention measures based on hospital occupancy

For the remainder of this study we will use the parameters identified in Tables 1 but we will fix the initial conditions to represent the beginning of the epidemic. Namely, in (2.1) and (2.2) we set

S0=0.999N,andE0=0.001N,\displaystyle S_{0}=\left\lceil 0.999N\right\rceil,\quad\text{and}\quad E_{0}=\left\lfloor 0.001N\right\rfloor, (3.1)

and for the remaining equations (2.3)–(2.9) we set the initial conditions equal to 0, where x\left\lceil x\right\rceil represents the ceiling of xx and x\left\lfloor x\right\rfloor represents the floor of xx.

Using (2.1)–(2.9) we will model an intervention as a social distancing effect, by manipulating the average transmission rate β\beta. Throughout this study we will be measuring the “success” of an intervention by the percentage of individuals who have died throughout the simulation, in the sense that reducing this statistic means a more successful intervention.

The outbreak is regarded to have been contained at a time T>1T>1 such that E(T)+U(T)+I(T)<1E(T)+U(T)+I(T)<1, t<1\mathcal{R}_{t}<1 and there is no ongoing intervention. This implies that herd immunity has been achieved and thus the system has reached its steady state. This description highlights one of the drawbacks of using continuum equations over their stochastic counterparts, namely in the continuum setting there is always “some amount” of the disease leftover in the community (i.e. 0<E(t)+I(t)+U(t)0<E(t)+I(t)+U(t)). This can amount to another outbreak if the parameters are changed appropriately, which we would not expect to happen in reality.

We numerically approximate the solutions to the system (2.1)–(2.9) by using the SciPy implementation of the “lsoda” method, which is a combination of the Adams methods and the Backward Differentiation Formula (BDF) family of methods BG08 ; GH10 ; P83 . Given the multi-step approach of the ODE solver, each time we manipulate the parameters during a simulation, to initiate an intervention, we stop the current solver and start it again using initial conditions as the final values of the last solver. This bypasses difficulties of having a discontinuous ODE system (with respect to the parameters). One notes that this can also be bypassed by using a much simpler solver, like the forward Euler scheme, however this would result in the need for significantly smaller timesteps.

3.1 The “do-nothing” approach

As a reference to compare how the interventions are working, we use this section to demonstrate the “do-nothing” approach, which is simply to let the disease take its course. This will provide us with statistics to compare to the interventions later on to demonstrate their effectiveness. In reality, we are aware that this approach will not be implored in practice and an intervention will occur, as has happened all over the world with national level lockdowns and social distancing measures. Since the parameters presented in Table 1 depict a lockdown scenario, we scale β\beta appropriately to several values to establish an epidemic, i.e. so that 0>1\mathcal{R}_{0}>1. An increase in average transmission rate can simply be interpreted as more individuals meeting each other and spreading the disease. In Table 3 we measure the maximum hospital capacity needed as a percentage of the total population, the day in the simulation that maximum is reached and the percentage of dead individuals at the end of the outbreak, for each region. As intuitively expected, as 0\mathcal{R}_{0} increases the percentage of maximum number of patients in the hospital increases, the day of that peak is sooner and the percentage of dead individuals increases. Interestingly, the day of the peak does not seem to change even though the hospitalisation parameters are all quite varied, we suspect that this is somehow related to the initial conditions and the fact that 0\mathcal{R}_{0} is not effected by the parameters that describe hospitalisations. We see that the percentage of maximum beds occupied is worse in the South East, followed by England and then the North West respectively. This is to be expected since the average hospitalisation periods (for deaths) are longer in the respective order which implies more people in hospital at any one moment. However, the percentage of dead individuals is higher in the North West, followed by England and then the South East respectively. This is also to be expected since the probability of death in hospitals is larger in the respective order whilst the mortality probability outside of hospitals are very similar for each of the regions. In Figure 4 we demonstrate the effective reproduction number t\mathcal{R}_{t} of the simulation using the England parameters. This shows us that, when 0\mathcal{R}_{0} is larger, the actual outbreak is much shorter in length and reaches much smaller values of t\mathcal{R}_{t}. This description follows the notion that the larger the value of 0\mathcal{R}_{0}, the more aggressive the disease is following the exponential growth of those who are infectious, as can be seen in Figure 4 by the steep decline in t\mathcal{R}_{t}. In Figure 5 we demonstrate a comparison of the percentage of beds occupied for each of the regions and for some values of 0\mathcal{R}_{0}. We note that in Figure 5 we truncate the simulation to make the visualisation of the hospitalisations easier.

\sidewaystablefn
Table 3: Measurements using the “do-nothing” approach.
Max beds occupied (%NN) Peak of beds occupied (day) Dead individuals (%NN)
0\mathcal{R}_{0} England South East North West England South East North West England South East North West
1.3 0.145% 0.154% 0.140% 168 170 168 1.122% 1.032% 1.178%
1.4 0.222% 0.236% 0.214% 144 145 144 1.352% 1.244% 1.420%
1.5 0.304% 0.322% 0.293% 126 128 126 1.540% 1.417% 1.618%
1.6 0.386% 0.408% 0.372% 114 115 114 1.696% 1.560% 1.781%
1.7 0.466% 0.492% 0.450% 104 105 104 1.825% 1.679% 1.917%
1.8 0.544% 0.574% 0.525% 96 97 96 1.933% 1.779% 2.031%
1.9 0.619% 0.652% 0.598% 89 90 89 2.025% 1.863% 2.127%
2.0 0.686% 0.725% 0.666% 84 85 84 2.103% 1.935% 2.209%
\botrule
Refer to caption
Figure 4: t\mathcal{R}_{t} using the “do-nothing” approach with the England parameters. Here North West is abbreviated to NW and South East is abbreviated to SE.
Refer to caption
Figure 5: Percentage of beds occupied each day using the “do-nothing” approach for each region. We note that we have truncated the simulation to make visualisation easier. Here North West is abbreviated to NW and South East is abbreviated to SE.

3.2 Notation for an intervention

In the following sections we want to investigate how one can use hospital capacity as a measurement for whether interventions are put into place. We aim to model the situation where an intervention is triggered when hospital capacity is almost full, and then finish the intervention when the hospital capacity has reached an “opening” threshold of significantly lower patients. For ease of demonstration we will simply set a threshold that once breached will trigger the intervention, in some cases this will mean that the capacity is “breached” - we deal with this in subsection 3.5. We denote the state of being in an intervention using the notation \ell, namely if =1\ell=1 then we are in an intervention, otherwise we set =0\ell=0. Using this, we describe the average transmission rate as

β¯(t;):=[=1]β+[=0]C0β,\displaystyle\overline{\beta}(t\mathchar 24635\relax\;\ell):=[\ell=1]\,\beta+[\ell=0]\,C_{\mathcal{R}_{0}}\beta, (3.2)

where β\beta is the average transmission rate associated to the first lockdown deduced from the parameter inference for each region, C0C_{\mathcal{R}_{0}} is a scaling constant to give the initial value of 0\mathcal{R}_{0} wanted, and [][\cdot] are the Iverson brackets K92 defined as

[P]:={1 if P is true,0 otherwise.\displaystyle[P]:=\begin{cases}1&\text{ if }P\text{ is true,}\\ 0&\text{ otherwise.}\end{cases}

We describe the intervention in the following recursive way

:=[=0][H(t)>Hu]+[=1][H(t)>Hl],\displaystyle\ell:=[\ell=0][H(t)>H_{u}]+[\ell=1][H(t)>H_{l}], (3.3)

which is to say that the intervention is triggered at time tt when the number of patients in hospital goes above an upper limit HuH_{u}, and the intervention stays triggered until the number of patients in hospital goes below a lower limit HlH_{l}. Initially \ell is set equal to 0. The values of HuH_{u} and HlH_{l} are regionally dependent since they depend on the maximum capacity of all the hospitals in a region. Given the maximum hospital capacities in Table 3, we fix Hu:=0.0012NH_{u}:=0.0012N, to guarantee at least one lockdown for each 0\mathcal{R}_{0} chosen, and vary the opening up threshold between Hl:=0.025HuH_{l}:=0.025H_{u} and Hl:=0.5HuH_{l}:=0.5H_{u}. We will present the results from all the regions in various tables, however graphically we will restrict the figures to the South East region so the figures are not a visual burden. To this end, we present Figure 6 which is the only comparison of the interventions for each region. The simulations all look reasonably similar and follow a similar trend, the noticeable differences come from when one region goes into an intervention (light grey coloured line) whilst the others don’t. This explains some of the interesting numbers in Tables 45. We go into more detail in the following sections.

There are some caveats to this study we want to highlight. It is somewhat unrealistic that transmission would immediately revert to normal amounts after an intervention MB20 , however we do not consider this here. Another aspect to consider with this study is that we are considering the capacity of all the hospitals in a region, due to modelling constraints and data access. This means that we are assuming hospitals can move patients throughout each region due the physical constraints of each hospital, in response to the bed capacity of each individual hospital.

Refer to caption
Figure 6: Percentage of beds occupied each day using the hospital capacity intervention scenario for each region. We fix Hl:=0.25HuH_{l}:=0.25H_{u}. The grey line represents the time in an intervention. We note that we have truncated the simulation to make visualisation easier. Here North West is abbreviated to NW and South East is abbreviated to SE.

3.3 Changing 0\mathcal{R}_{0}

In this section, we will vary 0\mathcal{R}_{0} whilst fixing Hl:=0.25HuH_{l}:=0.25H_{u} or Hl:=0.5HuH_{l}:=0.5H_{u}. In Table 4 we measure the percentage of dead individuals for each region for varying values of 0\mathcal{R}_{0}. In Figure 7 we demonstrate the percentage of beds occupied in the South East region whilst in Figure 8 we demonstrate the effective reproduction number over the simulation. The light grey colour in Figure 7 represents the time when the regions are in an intervention, the same time period is represented by the gaps in the lines in Figure 8.

Comparing Table 3 to Table 4 we can see that the intervention has dramatically decreased the percentage of total deaths for larger values of 0\mathcal{R}_{0}, as expected. By looking at Figure 7 we can see that as we increase 0\mathcal{R}_{0}, the number of interventions needed increases and also the length of the initial intervention increases. This is due to the number of future patients in the exposed compartment EE and the infectious compartment II. This is emphasised by the fact that the initial intervention is sooner for a larger value of 0\mathcal{R}_{0} due to the increased average transmission. It can also be seen that the time between each intervention decreases as 0\mathcal{R}_{0} increases. We can also see, by comparing Figure 4 with Figure 8, that the epidemic actually lasts significantly longer with interventions included. These observations are realistically expected, however there are some results which are not necessarily expected or intuitive such as the percentage of dead individuals in the South East region decreasing from 0=1.5\mathcal{R}_{0}=1.5 to 0=1.6\mathcal{R}_{0}=1.6 when considering Hl=0.25HuH_{l}=0.25H_{u} but not decreasing for when Hl=0.5HuH_{l}=0.5H_{u}. This is not specific to this value of 0\mathcal{R}_{0}, rather to the circumstance that this simulation finds itself after the final intervention. Namely, at this stage of the simulation for 0=1.6\mathcal{R}_{0}=1.6, herd immunity has almost been reached, i.e. t\mathcal{R}_{t} is only slightly larger than 1. A value of t\mathcal{R}_{t} slightly larger than 1 means that although there is still an increase in the number of infectious individuals, the rate of that increase is much slower comparatively to an t\mathcal{R}_{t} value of, say, 1.5. At this stage, the number of incubating and infectious individuals in the case of 0=1.6\mathcal{R}_{0}=1.6 are small which means that the final bump in the simulation for t=1.5\mathcal{R}_{t}=1.5 is much larger than the respective bump for t=1.6\mathcal{R}_{t}=1.6, as can be seen in Figure 7. One can see a similar behaviour happening between 0=1.4\mathcal{R}_{0}=1.4 and 0=1.5\mathcal{R}_{0}=1.5 when considering Hl=0.5HuH_{l}=0.5H_{u}. This interplay between parameter values, herd immunity and t\mathcal{R}_{t} is difficult to analyse and demonstrates that intuition is not necessarily enough when forecasting.

Refer to caption
Figure 7: Percentage of patients in hospitals in the South East region using the hospital capacity intervention approach, changing 0\mathcal{R}_{0} and fixing Hl:=0.25HuH_{l}:=0.25H_{u} or Hl:=0.5HuH_{l}:=0.5H_{u}. The grey lines represent the times when the South East region is in an intervention. We note that we have truncated the simulation to make visualisation easier.
Refer to caption
Figure 8: t\mathcal{R}_{t} in the South East region using the hospital capacity intervention approach, changing 0\mathcal{R}_{0} and fixing Hl:=0.25HuH_{l}:=0.25H_{u} or Hl:=0.5HuH_{l}:=0.5H_{u}. The gaps in the lines represent the times when the South East region is in an intervention. The black dashed line represents herd immunity t=1\mathcal{R}_{t}=1. We note that we have truncated the simulation to make visualisation easier.
Table 4: Percentage of dead individuals at the end of each simulation for each region. We fix Hl:=0.25HuH_{l}:=0.25H_{u} or Hl:=0.5HuH_{l}:=0.5H_{u} and vary 0\mathcal{R}_{0}.
England South East North West
0\mathcal{R}_{0} HlHu=0.25\frac{H_{l}}{H_{u}}=0.25 HlHu=0.5\frac{H_{l}}{H_{u}}=0.5 HlHu=0.25\frac{H_{l}}{H_{u}}=0.25 HlHu=0.5\frac{H_{l}}{H_{u}}=0.5 HlHu=0.25\frac{H_{l}}{H_{u}}=0.25 HlHu=0.5\frac{H_{l}}{H_{u}}=0.5
1.3 0.797% 0.876% 0.735% 0.802% 0.843% 0.930%
1.4 1.048% 1.106% 0.978% 1.023% 1.073% 1.147%
1.5 1.263% 1.307% 1.176% 1.000% 1.291% 1.349%
1.6 1.139% 1.240% 1.069% 1.152% 1.473% 1.279%
1.7 1.311% 1.394% 1.234% 1.297% 1.315% 1.423%
1.8 1.460% 1.527% 1.369% 1.420% 1.467% 1.557%
1.9 1.584% 1.430% 1.481% 1.317% 1.599% 1.675%
2.0 1.689% 1.519% 1.323% 1.416% 1.712% 1.777%
\botrule

3.4 Changing HlH_{l}

In this section, we will be experimenting by changing HlH_{l}, the lower threshold of patients that signals the ending of the intervention, and fixing 0=1.5\mathcal{R}_{0}=1.5 or 0=1.6\mathcal{R}_{0}=1.6. In Table 5 we measure the number of interventions needed, the length of each intervention (measured in days), the day of the initiation of each intervention and the percentage of the total deaths at the end of the epidemic. We demonstrate a few of the simulations in Figure 9 and we demonstrate the effective reproduction number over each outbreak in Figure 10.

Comparing Table 3 to Table 5 we can see that again the intervention has dramatically decreased the percentage of total deaths compared to the “do-nothing” approach. Interestingly, changing the threshold HlH_{l} mostly does not have that much of an effect on the percentage of total deaths, unlike the difference we saw in Table 4 when changing 0\mathcal{R}_{0}, but it does have a large effect on the length of the outbreak. The effect of t\mathcal{R}_{t} is slightly different in this section, namely the sudden drop in the percentage of deaths is not caused primarily by a well-timed re-opening. Instead, the jump in percentage is due to the fact that an extra intervention has been used. Considering the South East region, and focusing on 0=1.5\mathcal{R}_{0}=1.5, we can see that the percentage of deaths drops from 1.1682% using Hl=0.1875HuH_{l}=0.1875H_{u} to 1.000% using Hl=0.5HuH_{l}=0.5H_{u}, and by Figure 9 or 10 that there is one intervention using Hl=0.1875HuH_{l}=0.1875H_{u} whilst there is two interventions using Hl=0.5HuH_{l}=0.5H_{u}.

The outcomes of our study so far imply that the timing and the lengths of interventions are extremely important. Getting closer to herd immunity when ending an intervention has the potential to save a huge number of lives. However, calculating t\mathcal{R}_{t} in real life is in general very challenging which leaves the process of timing for herd immunity difficult. One also notices that, although the percentage of total deaths decreases with an intervention, the length of most of the interventions is large due to the criteria set. This is mainly due to the fact that the average hospitalisation period is large and that the scenario we are simulating means that intervention will be in place until hospitals go from full capacity to between 2.5% and 75% capacity. Fortunately, we see that as the target capacity percentage increases, the percentage of total deaths does not increase dramatically, and the length of interventions decreases from the best part of 4 months to under 1 month. Similarly, as the outbreak progresses, one would expect the average hospitalisation length to decrease, since awareness of the disease and treatment gets better, as well as an increase in resources and the development of vaccines. This final point is important as it means realistic interventions can be implemented as circuit breakers and still maintain a large decrease in the number of total deaths. However, one aspect of this which is overlooked in this study is the potential for nosocomial outbreaks, whereby the probability of an outbreak increases with a larger number of infectious patients.

Refer to caption
Figure 9: Percentage of patients in hospitals in the South East region using the hospital capacity intervention approach, changing HlH_{l} and fixing 0=1.5\mathcal{R}_{0}=1.5 or 0:=1.6\mathcal{R}_{0}:=1.6. The grey lines represent the times when the South East region is in an intervention. We note that we have truncated the simulation to make visualisation easier.
Refer to caption
Figure 10: t\mathcal{R}_{t} in the South East region using the hospital capacity intervention approach, changing HlH_{l} and fixing 0=1.5\mathcal{R}_{0}=1.5 or 0:=1.6\mathcal{R}_{0}:=1.6. The gaps in the lines represent the times when the South East region is in an intervention. The black dashed line represents herd immunity t=1\mathcal{R}_{t}=1. We note that we have truncated the simulation to make visualisation easier.
Table 5: Percentage of dead individuals at the end of each simulation for each region. We fix 0=1.5\mathcal{R}_{0}=1.5 or 0=1.6\mathcal{R}_{0}=1.6 and vary HlH_{l}.
England South East North West
HlHu\frac{H_{l}}{H_{u}} 0=1.5\mathcal{R}_{0}=1.5 0=1.6\mathcal{R}_{0}=1.6 0=1.5\mathcal{R}_{0}=1.5 0=1.6\mathcal{R}_{0}=1.6 0=1.5\mathcal{R}_{0}=1.5 0=1.6\mathcal{R}_{0}=1.6
0.025 1.221% 1.407% 1.145% 0.962% 1.232% 1.428%
0.0625 1.229% 1.096% 1.151% 0.987% 1.242% 1.435%
0.09375 1.235% 1.087% 1.155% 1.003% 1.251% 1.442%
0.125 1.240% 1.091% 1.159% 1.018% 1.259% 1.448%
0.1875 1.252% 1.113% 1.168% 1.0448% 1.275% 1.461%
0.25 1.263% 1.139% 1.176% 1.069% 1.291% 1.473%
0.5 1.307% 1.240% 1.000% 1.152% 1.349% 1.279%
0.75 1.180% 1.334% 1.082% 1.230% 1.242% 1.382%
\botrule

3.5 Capacity thresholds

In this section we calculate what the highest capacity threshold HuH_{u} is, for different values of 0\mathcal{R}_{0}, so that the hospitals do not go over their maximum capacity HmaxH_{max}. In particular, this approach can be used as an early warning system by outlining when interventions need to be enforced to maintain a manageable capacity. This approach can complement scenario-based forecasting approaches by giving a range of indicators of when to open up further capacity in hospitals or introduce an intervention which can be tracked against with incoming data daily. Ultimately, in practice, healthcare systems will want to utilise their capacity appropriately whilst not impacting the general public with an intervention, and so optimising the difference between the resource capacity and the maximum number of patients per parameter set is important.

In this section, we only consider the situation of finding HuH_{u} using one intervention, that is to say (3.3) becomes

:=[=0][H(t)>Hu],\displaystyle\ell:=[\ell=0][H(t)>H_{u}],

with (3.2) the same. We only need to consider one intervention here because in most situations, once the hospitals have opened back up again, there will be another spike in capacity, however one can reason that this second spike is essentially the same as the first spike, just with slightly different parameters. For a specified value of 0\mathcal{R}_{0}, we look to find the root of the following function

(Hu;H,Hmax):=maxt>0H(t)Hmax,\displaystyle\mathcal{L}(H_{u}\mathchar 24635\relax\;H,H_{max}):=\max_{t>0}H(t)-H_{max},

where we note that HH depends on HuH_{u}. Since HH is continuous, this function is continuous and will always have a root provided 0\mathcal{R}_{0} is chosen high enough such that the maximum of HH without an intervention is at least as large as HmaxH_{max}. Figure 11 depicts the situation where 0\mathcal{R}_{0} is chosen appropriately for the South East region. For this section, we take Hmax:=0.0012NH_{max}:=0.0012N.

We demonstrate the results in Figure 12. This figure should be read in the following way: if we know 0\mathcal{R}_{0}, then we look to determine that an intervention should be initiated when the healthcare system reaches a certain percentage of the maximum resource capacity. We can see that as 0\mathcal{R}_{0} increases, the cap on the occupancy for the intervention decreases, which is to be expected. This measurement is useful, as, depending on the value of 0\mathcal{R}_{0}, one can calculate what the cap is on occupancy and thus can judge when to take steps to initiate an intervention.

Refer to caption
Figure 11: Comparison of HuH_{u} against the maxH\max H for different values of 0\mathcal{R}_{0} in the South East region. The black dashed line represents HmaxH_{max}.
Refer to caption
Figure 12: The percentage of HmaxH_{max} corresponding to the roots of \mathcal{L} for different values of 0\mathcal{R}_{0}, where the roots correspond to values of HuH_{u}. Here North West is abbreviated to NW and South East is abbreviated to SE.

4 Agent-based Approach

In this section we look to explore the use of an agent-based model for hospital occupancy. Similar to standard compartmental models, agent-based models (also known as individual-based models) are versatile in what they can model, being used in areas such as biology, engineering, politics and economics DD19 ; PSR98 ; RG19 ; SJNSS18 . The agent-based approach considers agents and their behaviours, whilst the standard compartmental approach, also referred to as equation-based modelling in the literature, focuses on observables and equations. There are direct parallels between the two approaches, as we will demonstrate, but the manifestation of the approaches can lead to different results. Due to the nature of infectious diseases, having recognisable transmissions between distinct states, agent-based approaches have slowly been making a surge to becoming the standard method for mathematical epidemiological modelling D05 ; LS12 ; PD09 . The agent-based approach can characterise variable contacts between agents in a way which is difficult to describe using the equation-based approach, since the agent-based approach is more flexible in its description due to the use of probability. The famed Imperial College Model FLNal20 , the implementation is known as CovidSim, is an agent-based model, based on previous works conducted on modelling influenza pandemics FCCFRMIB05 ; FCFCCB06 ; HFELCLXFVAGal08 . Other widely used agent-based models, particularly for the modelling of COVID-19, are covasim and OpenABM-Covid19 AHWLPWESRD21 ; HPNKWHLBZS21 ; KSMARHNRCSH21 ; PKSMKVB20 . We look to compare the outcomes of the previous sections, using the same parameters, in the agent-based approach and demonstrate that the agent-based approach offers more flexibility in devising interventions.

4.1 Model formulation and validation

In this section we will present the agent-based version of the equations in (2.1)–(2.9). In particular, we look to draw parallels between the two schematics in Figures 1 and 13. In a similar manner to Figure 1, Figure 13 describes the way an agent moves between states and should be read as a flowchart, where we have labelled the states similarly to demonstrate some of the parallels. In Figure 13 the circles represent the state an individual can be, and the diamonds represent a decision. The arrows from a states to a decisions are an event in the epidemic, and the arrows from a decision to a state or decision are the outcome of the decision. Some arrows have an outcome of a decision with a parameter in parentheses next to them, these parameters are probabilities of traversing a different section of the flowchart and represent the same parameter as when there is a fork in states in Figure 1, such as the probability of not going to hospital pp. Some arrows have an event with a state in brackets next to them, these are to distinguish between the different rate parameters as the states represent similar characteristics, such as the multiple types of being infectious UU and II.

For ease of exposition, we consider the time unit of the simulation to be days. Each event decision is decided by drawing one pseudo-random uniform number from the interval [0,1][0,1], let us call this value xx, and comparing it to the probability of the event on a given day, let us call this value dd. We can describe this process as the following categorical function

f(x;d):=[xd]“Yes”+[x>d]“No”.\displaystyle f(x\mathchar 24635\relax\;d):=[x\leq d]\text{``Yes"}+[x>d]\text{``No"}.

We note that for each event, a new xx is generated, and each dd is different. The events can be split into two categories, transmission and progression. We begin by describing the transmission event, event: contact. For ease of exposition, each day we assume that all agents make a fixed CC random contacts with a probability of a successful transmission aa, where β=Ca\beta=Ca. For a susceptible agent, of those CC contacts, we denote κ\kappa to be the number of contacts they make with an infectious agent, namely an agent in state UU or II. Then, we have that

d=(Xκ)=1(1a)κ,\displaystyle d=\mathbb{P}(X\leq\kappa)=1-(1-a)^{\kappa},

where XGeo(a)X\sim Geo(a) and XX has support on {1,2,}\{1,2,\dots\}. The remaining events are all progression events, whereby the length of time an agent has spent in a state is important. We will describe “event: incubating”, but note that the same description can be used for each of the events using their respective rate parameters. Let ll represent the number of days an agent has spent in the EE state, then we have

d=(YEl)=1eγEl,\displaystyle d=\mathbb{P}(Y_{E}\leq l)=1-e^{-\gamma_{E}l},

where YEExp(γE)Y_{E}\sim Exp(\gamma_{E}). One can see that event: infectious [U][U] gives YUExp(γU)Y_{U}\sim Exp(\gamma_{U}), event: infectious [I][I] gives YIExp(γI)Y_{I}\sim Exp(\gamma_{I}), and event: hospitalisation gives YHExp(γH+μH)Y_{H}\sim Exp(\gamma_{H}+\mu_{H}). We note that this is not the standard methodology for agent-based models, typically the length of stay for each agent is not stochastically generated, but instead each day an agent runs a Bernoulli trial using the probability associated to the event of progression. In our framework, it is obvious to see that compartments are exponentially distributed, and there is an obvious way to change that assumption if needed without having to describe the underlying mathematics differently. For example, if it was known that the incubation event was in fact akin to an Erlang distribution rather than an Exponential, then we can update the agent-based approach quite simply by updating the random variable YEY_{E}, however in the equation-based approach we would need to add more compartments (the additional number depending on the distribution’s shape parameter) CDE18 .

Refer to caption
Figure 13: Schematic representation of the flowchart which results in the agent-based mathematical model.

For the remainder of this study, we will focus only on the outcomes using the South East parameters. In order to compare the agent-based approach to the standard deterministic SEIR-D approach, we run 100 Monte Carlo simulations and take the average number of agents in each state for each day. We note that typically 100 Monte Carlo simulations are not enough but we see very little variation in the average from using as low as 5 iterations. Due to the stochastic and numerical discretisation, we adjust the rate parameters by adding a correction term in the form of

c(γ;Δt)=γ2Δt2γΔt,\displaystyle c(\gamma\mathchar 24635\relax\;\Delta t)=\frac{\gamma^{2}\Delta t}{2-\gamma\Delta t}, (4.1)

where Δt\Delta t denotes the time interval being considered in ratio of days, e.g. Δt=0.5\Delta t=0.5 would be considering half days, see Appendix A for justification. We note that Δt\Delta t needs to be chosen appropriately so that the denominator in cc is non-zero, however this is a reasonable assumption since typically γ<1\gamma<1 (as it is a rate and its reciprocal in normally larger than one) and Δt1\Delta t\leq 1 since we typically consider simulating each day or several simulations per day. Using the parameters defined in Table 1, with the associated fitted initial conditions, taking Δt=0.25\Delta t=0.25 results in Figure 14, whereby we see a clear agreement between the two approaches and the data. We note that this result is also due to the fact that NN is large, and the initial conditions are suitably large, which combined are often called the mean-field assumption (or the thermodynamic limit). From here onwards, we will fix Δt=1\Delta t=1 for speed but we note in Appendix A that using Δt=1\Delta t=1 and the correction term results in outputs sufficiently close to the SEIR-D model.

Refer to caption
Figure 14: Comparison of the agent-based approach to the standard deterministic SEIR-D approach using the South East parameters. Both approaches match the data closely throughout the time period.

4.2 Intervention scenarios

In this section, we look to apply the techniques and parameter changes in Section 3 to the agent-based approach. We know that β\beta is multiplicatively formed of the average probability of transmission aa, and the average number of contacts per day CC. When changing the transmission rate, β\beta, we will consider two different approaches. We look to investigate whether just changing the probability of transmission aa or average number of contacts per day CC have different impacts on the outcome of the epidemic. One can reason that a change in probability could be due to new strains of an infectious disease or a change in public behaviour, such as wearing masks or increased personal hygiene. Changing the number of contacts could be due to a national level intervention, such as lockdowns or closing schools. We note that we whilst it would be scientifically ideal to keep aa fixed, due to the integer nature of contacts aa has to be adjusted to maintain the same t\mathcal{R}_{t} value across the measurements, thus making sure the tables are comparable.

By using the “do-nothing” approach, we see that in fact changing either parameter does not make an overall difference to the outputs, which is to be expected given the mean-field limit of the agent-based approach to the compartmental model. From here onwards, we will fix C=9C=9 contacts per day and only adjust aa. The results in Tables 6 and 7 also match up to the results in Table 3. Applying the agent-based approach to the intervention scenarios described in Section 3.2, we see that the results in Table 8 match with the results in Table 4 as expected, even maintaining the interesting behaviour around the timing of herd-immunity.

\sidewaystablefn
Table 6: Measurements of the “do-nothing” approach using the agent-based approach with fixed CC.
0\mathcal{R}_{0} CC aa Max beds occupied (%NN) Peak of beds occupied (day) Dead individuals (%NN)
1.3 9 0.0300 0.158% 172 1.044%
1.4 9 0.0326 0.241% 149 1.253%
1.5 9 0.0349 0.328% 130 1.427%
1.6 9 0.0373 0.415% 117 1.569%
1.7 9 0.0396 0.502% 108 1.688%
1.8 9 0.0419 0.585% 100 1.784%
1.9 9 0.0443 0.661% 93 1.869%
2.0 9 0.0466 0.739% 89 1.938%
\botrule
\sidewaystablefn
Table 7: Measurements of the “do-nothing” approach using the agent-based approach with varied CC.
0\mathcal{R}_{0} CC aa Max beds occupied (%NN) Peak of beds occupied (day) Dead individuals (%NN)
1.3 9 0.0300 0.158% 172 1.045%
1.4 10 0.0293 0.241% 147 1.256%
1.5 11 0.0286 0.328% 131 1.422%
1.6 12 0.0279 0.415% 117 1.568%
1.7 13 0.0274 0.501% 108 1.689%
1.8 14 0.0269 0.585% 100 1.788%
1.9 15 0.0266 0.664% 94 1.870%
2.0 16 0.0262 0.739% 88 1.940%
\botrule
Table 8: Percentage of dead individuals at the end of each simulation for the South East region using the agent-based approach. We fix Hl:=0.25HuH_{l}:=0.25H_{u} or Hl:=0.5HuH_{l}:=0.5H_{u} and vary 0\mathcal{R}_{0}.
0\mathcal{R}_{0} HlHu=0.25\frac{H_{l}}{H_{u}}=0.25 HlHu=0.5\frac{H_{l}}{H_{u}}=0.5
1.3 0.742% 0.802%
1.4 0.992% 1.023%
1.5 1.185% 1.003%
1.6 1.067% 1.138%
1.7 1.230% 1.288%
1.8 1.356% 1.414%
1.9 1.476% 1.310%
2.0 1.309% 1.386%
\botrule

4.3 Capacity thresholds

Now, using the agent-based approach, we look to explore and compare against the results presented in Section 3.5. It must be noted that in the deterministic setup we calculated the threshold of the maximum number of patients in hospital before an intervention is needed to guarantee that capacity is not breached. In this section, we want to calculate the number of Monte Carlo realisations that still result in a breached capacity using the calculated deterministic threshold and how the number of realisations that breach capacity decreases when that threshold is reduced. In particular, we look to do a more realistic investigation to the thresholds using the in-built stochasticity of the agent-based approach. This investigation can act as some sort of buffer for hospital management to understand how close to the threshold they can get before having to make changes. We note here that the agent-based approach tends to overestimate its deterministic counterpart when Δt=1\Delta t=1, as approaches the deterministic solution from above when reducing Δt\Delta t, as seen in Figure 16 in Appendix A. This implies that the results will be skewed towards a breach in the threshold. In this investigation we measure three metrics, the number of Monte Carlo realisations that go over HmaxH_{max}, the average maximum amount the realisations go above HmaxH_{max} (to give an idea of how severely the threshold is breached), and the average maximum difference of the simulated hospital capacity HH from HmaxH_{max} across all realisations, given a value of HuH_{u} associated to a value of 0\mathcal{R}_{0} from Figure 12. We measure these metrics against the percentage of HuH_{u} which initiates an intervention, ranging from 90% of HuH_{u} to 100% of HuH_{u}. We make several observations from the results in Table 9:

  1. 1.

    The higher the value of 0\mathcal{R}_{0}, the higher the chance the system will be breached;

  2. 2.

    The higher the value of 0\mathcal{R}_{0}, the lower the percentage of the threshold needs to be considered to reduce the chance of a breach;

  3. 3.

    The higher the value of 0\mathcal{R}_{0}, the larger the system breach will be on average.

Intuitively, these observations make sense and should not necessarily come as a surprise. However, what is useful about this study is understanding what percentage of HuH_{u} is needed to reduce the breaches. This result and process can be used by hospital administrations, planners and managers to prepare for an incoming wave and set aside surge capacity in case of a breaching realisation.

\sidewaystablefn
Table 9: Understanding the breaching of the maximum capacity using the agent-based approach by changing the threshold level determined in Section 3.5.
Number of breaching realisations Average maximum amount over HmaxH_{max} (%) Average maximum difference from HmaxH_{max} (%)
%HuH_{u} 0=1.3\mathcal{R}_{0}=1.3 0=2\mathcal{R}_{0}=2 0=1.3\mathcal{R}_{0}=1.3 0=2\mathcal{R}_{0}=2 0=1.3\mathcal{R}_{0}=1.3 0=2\mathcal{R}_{0}=2
90% 0 20 0 1.934% -6.421% -2.900%
95% 0 80 0 2.780% -2.060% 1.017%
97.5% 32 90 0.863% 4.437% -0.108% 3.793%
100% 90 96 2.316% 6.625% 1.019% 5.960%
\botrule

5 Limitations of the modelling approach and their mitigation

In this study, we assumed that the average probability of going to hospital is the same throughout the different regions due to problems with parameter estimation and the initial conditions. We speculate that the issue of estimating the initial conditions without keeping pp fixed is solvable by reformulating the non-linear initial value problem into a non-linear boundary value problem, where the data is used directly in the model rather than an attribute of the fitting process. We have so far demonstrated how the boundary value problem would be conceived for a simple SIR model and prove existence and uniqueness of the problem for fixed parameters, in the future we look at parameter identifiability, parameter estimation and efficient numerical algorithms using this method CWVDM21 . Methods such as the shooting method, nonlinear least squares of the equivalent initial value problem, or numerical continuation are the standard methods of choice AG12 ; B81 ; HMTMW00 ; RHCC07 , however it is not clear which solution method is the most appropriate for the solution to the nonlinear boundary value problems that can be derived from models used in epidemiology.

The current research into vaccines will prove pivotal in the role to reduce COVID-19 resurgences in the future BI20 ; BRKLCGL21 ; LSHH20 ; MHTDK21 ; SWBal20 . Mathematically, the addition of a vaccine into the model has been undertaken in previous works of similar nature GTNT07 ; MHTDK21 , but as with the other aspects we have mentioned, getting reliable data and understanding the appropriate mathematical assumptions remains the challenge. The ideal situation is that the model captures the reality of the vaccination strategy, namely vaccination rates and parameters are based on live information and data. Whilst the number of COVID-19 vaccinations is published at least on a daily basis, the difficulty lies in understanding the vaccination status of those in hospital. Vaccination assumptions and contributions to models often take one of two forms: the sterilizing vaccine and the leaky vaccine approach. The sterilizing approach assumes that a proportion of those receiving vaccines gain (some form of) immunity whilst the others do not. This style of approach lends itself well to agent-based approaches due to the application of a criteria to specific individuals (agents). The leaky vaccine approach considers that all individuals who receive a vaccine receive a proportion of protection, this style of approach lends itself well to equation-based models as it considers the population as an aggregated state.

An oversight of the work here is that the county is not homogeneous with respect to age. Different age-groups have different social structures, responsibilities - such as working, school life, or family, and different responses to a COVID-19 infection. In this work, we have not explored the impact of inhomogeneity within the population on hospital demand and capacity, in principle due to the lack of publicly available data. In general, the intervention we impose on our system is a total lockdown of all ages, similar to the national lockdown during the first wave, however utilising age-groups within the model will allow for dedicated forecasting into the effect some social events like schools opening or returning to offices will have on interventions DBDal20 ; KKCal20 ; NMDal20 .

Sticking to the models presented here, we can take steps forward to consider that maximising capacity and having longer lockdowns might not be more beneficial than small “circuit breaker” lockdowns when one considers the cost of hospital use and the local economy. For example, by associating a cost to hospital usage or to a lockdown in general, we can find the maximum capacity threshold to go into a lockdown such that, for a specific value of 0\mathcal{R}_{0}, we minimise the total costs by using some of the measurements we presented here such as the length of lockdown, number of lockdowns, and the time until the peak from the initiation of a lockdown. A similar study was conducted in EHMal20 where they calculated the cost of capital (e.g. extra hospitals, provision of hand-washing stations) and one-time costs (e.g. hiring consultants to adapt policy, prepare online training courses), the cost of commodities (e.g. extra single use masks, specific increase in drugs) and the cost of human resources (e.g. extra doctors, extra cleaners) and combined it with the the estimated number of cases from the Imperial College model FLNal20 after four weeks and twelve weeks, using an increase and decrease of 50% transmission rate for an interval of costs. Since their model is on a national scale and uses national derived parameters, one can extend their modelling approach to regional and local levels by using our fitted model.

From a practical perspective, the next question to ask is: now we know what levels the hospital can take, what about the recovery procedure? It is well known that recovering from COVID-19 is not as easy as recovering from, say, the common cold CBLal20 ; YWVal20 , some people completely recover but invasive treatment may have caused further complications, whilst some people may continue to show effects of COVID-19, namely suffering from long COVID Al20 ; Sh20 ; SMVGPBPCKAC21 . In this sense, it is natural to extend the model here to include, what the NHS Clinical Commissioning Group label as, the discharge pathways, which describe the nature of the discharge of a patient and what recovery services they will need. Each pathway describes the level of need of a discharged patient, each level having an associated requirement and cost. Hence the following question arises: what will the burden to healthcare across the country be in one year, five years, and so on? Understanding the pressure on discharge pathways due to COVID-19 may give an indication on recovery costs post COVID-19 infection and/or hospitalisation.

6 Conclusion

In this study, we have presented a computational approach for measuring the impact of healthcare demand and capacity due to surges in COVID-19 infections and hospitalisations. We have used the notion of hospital capacity as a measure for exploring intervention scenarios that will allow hospitals to predict and forecast when demand and capacity are close to being breached and therefore allow resource allocations where necessary.

The key findings are:

  • we have demonstrated that interventions will make a significant impact on the percentage of individuals who will die as a result of COVID-19;

  • we have described an easily definable and understandable method of introducing an intervention which does not depend on prior knowledge of when the peak of infections will be;

  • we have described parallels between equation-based modelling and agent-based modelling and demonstrated that the interventions work in both frameworks;

  • we have demonstrated that only changing the number of contacts and only changing the probability of transmission provide the same outputs;

  • we have shown that, although a threshold to hospital capacity to deny a breach is calculated in the deterministic setup, that threshold needs to be decreased in order to account for stochastic perturbation of a realisation of an epidemic to not go over capacity.

Our approaches are built around using a simple SEIR-D model coupled with novel statistical methods for parameter estimation and reframing the system, with the found parameters, in the agent-based approach to allow us to explore various plausible hypothetical scenarios that are of interest to the NHS local healthcare management teams and death management teams in local authorities CVAal20 . The theoretical and computational approach has a strong interplay between data and the model, whereby data drives the optimal parameter estimates and these in turn drive model predictions through dynamic models.

Acknowledgements and software: The authors would like to acknowledge the use of Python, in particular the NumPy, Matplotlib, pandas and SciPy libraries numpy ; matplotlib ; pandas ; scipy . This work was partly supported by Brighton and Hove City Council, East and West Sussex County Councils and the NHS Sussex commissioners (JVY), and partly supported by the Global Challenges Research Fund through the Engineering and Physical Sciences Research Council grant number EP/T00410X/1: UK-Africa Postgraduate Advanced Study Institute in Mathematical Sciences. AMa acknowledges partial support from the Health Foundation (1902431), the NIHR (NIHR133761) and by an individual grant from the Dr Perry James (Jim) Browne Research Centre on Mathematics and its Applications (University of Sussex). AMa is a Royal Society Wolfson Research Merit Award Holder funded generously by the Wolfson Foundation (2016-2021). AMa is a Distinguished Visiting Scholar to the Department of Mathematics, University of Johannesburg, South Africa. All authors acknowledge the continued support and collaboration of Brighton and Hove City Council, East and West Sussex County Councils, Sussex Health and Care Partnership and the NHS Sussex Commissioners, and thank them for the opportunity for past collaborations which lead to the conceiving of this manuscript.

Competing interests: The authors have no relevant financial or non-financial interests to disclose.

Author Contributions: All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by JVY and RI. The draft of the manuscript was written by JVY and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

References

  • \bibcommenthead
  • (1) Davies, N.G., Barnard, R.C., Jarvis, C.I., Russell, T.W., Semple, M.G., Jit, M., Edmunds, W.J., et al.: Association of tiered restrictions and a second lockdown with covid-19 deaths and hospital admissions in england: a modelling study. Lancet Infect. Dis. (2020). https://doi.org/10.1016/S1473-3099(20)30984-1
  • (2) Campillo-Funollet, E., Van Yperen, J., Allman, P., Bell, M., Beresford, W., Clay, J., Dorey, M., Evans, G., Gilchrist, K., Memon, A., et al.: Predicting and forecasting the impact of local outbreaks of covid-19: Use of seir-d quantitative epidemiological modelling for healthcare demand and capacity. Int. J. Epidemiol. 50(4), 1103–1113 (2021). https://doi.org/10.1093/ije/dyab106
  • (3) Dehning, J., Zierenberg, J., Spitzner, F.P., Wibral, M., Neto, J.P., Wilczek, M., Priesemann, V.: Inferring change points in the spread of covid-19 reveals the effectiveness of interventions. Science (2020). https://doi.org/10.1126/science.abb9789
  • (4) Edejer, T.T.-T., Hanssen, O., Mirelman, A., Verboom, P., Lolong, G., Watson, O.J., Boulanger, L.L., Soucat, A.: Projected health-care resource needs for an effective response to covid-19 in 73 low-income and middle-income countries: a modelling study. Lancet Glob. Health (2020). https://doi.org/10.1016/S2214-109X(20)30383-1
  • (5) Ferguson, N.M., Laydon, D., Nedjati-Gilani, G., Imai, N., Ainslie, K., Baguelin, M., Bhatia, S., Boonyasiri, A., Cucunubá, Z., Cuomo-Dannenburg, G., et al.: Impact of non-pharmaceutical interventions (npis) to reduce covid-19 mortality and healthcare demand (2020). https://doi.org/10.25561/77482
  • (6) Ferstad, J.O., Gu, A.J., Lee, R.Y., Thapa, I., Shin, A.Y., Salomon, J.A., Glynn, P., Shah, N.H., Milstein, A., Schulman, K., et al.: A model to forecast regional demand for COVID-19 related hospital beds. Preprint at https://www.medrxiv.org/content/10.1101/2020.03.26.20044842v3 (2020)
  • (7) Gariboldi, M.I., Lin, V., Bland, J., Auplish, M., Cawthorne, A.: Foresight in the time of covid-19. The Lancet Regional Health-Western Pacific 6, 100049 (2020). https://doi.org/10.1016/j.lanwpc.2020.100049
  • (8) Jewell, N.P., Lewnard, J.A., Jewell, B.L.: Predictive mathematical models of the covid-19 pandemic: Underlying principles and value of projections. JAMA 323(19), 1893–1894 (2020). https://doi.org/10.1001/jama.2020.6585
  • (9) Liu, M., Thomadsen, R., Yao, S.: Forecasting the spread of covid-19 under different reopening strategies. Sci. Rep. 10(1), 1–8 (2020). https://doi.org/10.1038/s41598-020-77292-8
  • (10) Liu, Y., Morgenstern, C., Kelly, J., Lowe, R., Jit, M.: The impact of non-pharmaceutical interventions on sars-cov-2 transmission across 130 countries and territories. BMC Med. 19(1), 1–12 (2021). https://doi.org/10.1186/s12916-020-01872-8
  • (11) Mavragani, A., Gkillas, K.: Covid-19 predictability in the united states using google trends time series. Sci. Rep. 10(1), 1–12 (2020). https://doi.org/10.1038/s41598-020-77275-9
  • (12) Arino, J.: Mathematical epidemiology in a data-rich world. Infect. Dis. Model. 5, 161–188 (2020). https://doi.org/10.1016/j.idm.2019.12.008
  • (13) Mercatelli, D., Holding, A.N., Giorgi, F.M.: Web tools to fight pandemics: the covid-19 experience. Brief. Bioinform. 22(2), 690–700 (2021). https://doi.org/10.1093/bib/bbaa261
  • (14) Goldacre, B., Morley, J.: Better, broader, safer: using health data for research and analysis. A Review Commissioned by the Secretary of State for Health and Social Care (2022)
  • (15) Keeling, M.J.: Models of foot-and-mouth disease. Proc. R. Soc. B: Biol. Sci. 272(1569), 1195–1202 (2005). https://doi.org/10.1098/rspb.2004.3046
  • (16) Baguelin, M., Van Hoek, A.J., Jit, M., Flasche, S., White, P.J., Edmunds, W.J.: Vaccination against pandemic influenza a/h1n1v in england: a real-time economic evaluation. Vaccine 28(12), 2370–2384 (2010). https://doi.org/10.1016/j.vaccine.2010.01.002
  • (17) Ferguson, N.M., Cummings, D.A., Fraser, C., Cajka, J.C., Cooley, P.C., Burke, D.S.: Strategies for mitigating an influenza pandemic. Nature 442(7101), 448–452 (2006). https://doi.org/10.1038/nature04795
  • (18) Avram, F., Freddi, L., Goreac, D.: Optimal control of a sir epidemic with icu constraints and target objectives. Appl. Math. Comput. 418, 126816 (2022). https://doi.org/10.1016/j.amc.2021.126816
  • (19) Miclo, L., Spiro, D., Weibull, J.: Optimal epidemic suppression under an icu constraint: An analytical solution. J. Math. Econ., 102669 (2022). https://doi.org/10.1016/j.jmateco.2022.102669
  • (20) Di Lauro, F., Kiss, I.Z., Miller, J.C.: Optimal timing of one-shot interventions for epidemic control. PLoS Comput. Biol. 17(3), 1008763 (2021). https://doi.org/10.1371/journal.pcbi.1008763
  • (21) Nowzari, C., Preciado, V.M., Pappas, G.J.: Analysis and control of epidemics: A survey of spreading processes on complex networks. IEEE Control Syst. 36(1), 26–46 (2016). https://doi.org/10.1109/MCS.2015.2495000
  • (22) Feng, Y., Iyer, G., Li, L.: Scheduling fixed length quarantines to minimize the total number of fatalities during an epidemic. J. Math. Biol. 82(7), 1–17 (2021). https://doi.org/10.1007/s00285-021-01615-0
  • (23) Jana, S., Ghose, D.: Optimal Lockdown Management using Short Term COVID-19 Prediction Model. Preprint at https://arxiv.org/abs/2203.03488 (2022)
  • (24) Sontag, E.D.: An explicit formula for minimizing the infected peak in an sir epidemic model when using a fixed number of complete lockdowns. Int. J. Robust Nonlinear Control. (2021). https://doi.org/10.1002/rnc.5701
  • (25) Diekmann, O., Heesterbeek, J., Roberts, M.G.: The construction of next-generation matrices for compartmental epidemic models. J. R. Soc. Interface 7(47), 873–885 (2010). https://doi.org/10.1098/rsif.2009.0386
  • (26) Campillo-Funollet, E., Wragg, H., Van Yperen, J., Duong, D.-L., Madzvamuse, A.: Reformulating the SIR model in terms of the number of COVID-19 detected cases: well-posedness of the observational model. Accepted in Philosophical Transactions A (Royal Society) (2021)
  • (27) Butcher, J.C.: Numerical Methods for Ordinary Differential Equations. John Wiley & Sons, Ltd, Chichester (2016)
  • (28) Griffiths, D.F., Higham, D.J.: Numerical Methods for Ordinary Differential Equations: Initial Value Problems. Springer, London (2010). https://doi.org/10.1007/978-0-85729-148-6
  • (29) Petzold, L.: Automatic selection of methods for solving stiff and nonstiff systems of ordinary differential equations. SIAM J. Sci. Comput. 4(1), 136–148 (1983). https://doi.org/10.1137/0904010
  • (30) Knuth, D.E.: Two notes on notation. Am. Math. Mon. 99(5), 403–422 (1992). https://doi.org/10.1080/00029890.1992.11995869
  • (31) Maier, B.F., Brockmann, D.: Effective containment explains subexponential growth in recent confirmed covid-19 cases in china. Science 368(6492), 742–746 (2020). https://doi.org/10.1126/science.abb4557
  • (32) DeAngelis, D.L., Diaz, S.G.: Decision-making in agent-based modeling: A current review and future prospectus. Front. Ecol. Evol. 6, 237 (2019). https://doi.org/10.3389/fevo.2018.00237
  • (33) Parunak, H.V.D., Savit, R., Riolo, R.L.: Agent-based modeling vs. equation-based modeling: A case study and users’ guide. In: Sichman, J.S., Conte, R., Gilbert, N. (eds.) International Workshop on Multi-agent Systems and Agent-based Simulation, pp. 10–25. Springer, Berlin (1998). https://doi.org/10.1007/10692956_2
  • (34) Railsback, S.F., Grimm, V.: Agent-based and Individual-based Modeling: a Practical Introduction. Princeton university press, Princeton (2019)
  • (35) Sheikh, M.J., Jandaghi, G., Naeini, A.B., Shafia, M.A., Sabzian, H.: A review of agent-based modeling (abm) concepts and some of its main applications in management science. Iran. J. Manag. Stud. 11(4) (2018). https://doi.org/10.22059/IJMS.2018.261178.673190
  • (36) Dunham, J.B.: An agent-based spatially explicit epidemiological model in mason. J. Artif. Soc. Soc. Simul. 9(1) (2005)
  • (37) Luke, D.A., Stamatakis, K.A.: Systems science methods in public health: dynamics, networks, and agents. Annu. Rev. Public Health 33, 357–376 (2012). https://doi.org/10.1146/annurev-publhealth-031210-101222
  • (38) Perez, L., Dragicevic, S.: An agent-based approach for modeling dynamics of contagious disease spread. Int. J. Health Geogr. 8(1), 1–17 (2009). https://doi.org/10.1186/1476-072X-8-50
  • (39) Ferguson, N.M., Cummings, D.A., Cauchemez, S., Fraser, C., Riley, S., Meeyai, A., Iamsirithaworn, S., Burke, D.S.: Strategies for containing an emerging influenza pandemic in southeast asia. Nature 437(7056), 209–214 (2005). https://doi.org/10.1038/nature04017
  • (40) Halloran, M.E., Ferguson, N.M., Eubank, S., Longini, I.M., Cummings, D.A., Lewis, B., Xu, S., Fraser, C., Vullikanti, A., Germann, T.C., et al.: Modeling targeted layered containment of an influenza pandemic in the united states. PNAS 105(12), 4639–4644 (2008). https://doi.org/10.1073/pnas.0706849105
  • (41) Abueg, M., Hinch, R., Wu, N., Liu, L., Probert, W., Wu, A., Eastham, P., Shafi, Y., Rosencrantz, M., Dikovsky, M., et al.: Modeling the effect of exposure notification and non-pharmaceutical interventions on covid-19 transmission in washington state. NPJ Digit. Med. 4(1), 1–10 (2021). https://doi.org/10.1038/s41746-021-00422-7
  • (42) Hinch, R., Probert, W.J., Nurtay, A., Kendall, M., Wymant, C., Hall, M., Lythgoe, K., Bulas Cruz, A., Zhao, L., Stewart, A., et al.: Openabm-covid19—an agent-based model for non-pharmaceutical interventions against covid-19 including contact tracing. PLoS Comput. Biol. 17(7), 1009146 (2021). https://doi.org/%****␣EJE_submission.bbl␣Line␣725␣****10.1371/journal.pcbi.1009146
  • (43) Kerr, C.C., Stuart, R.M., Mistry, D., Abeysuriya, R.G., Rosenfeld, K., Hart, G.R., Núñez, R.C., Cohen, J.A., Selvaraj, P., Hagedorn, B., et al.: Covasim: an agent-based model of covid-19 dynamics and interventions. PLOS Comput. Biol. 17(7), 1009149 (2021). https://doi.org/10.1371/journal.pcbi.1009149
  • (44) Panovska-Griffiths, J., Kerr, C.C., Stuart, R.M., Mistry, D., Klein, D.J., Viner, R.M., Bonell, C.: Determining the optimal strategy for reopening schools, the impact of test and trace interventions, and the risk of occurrence of a second covid-19 epidemic wave in the uk: a modelling study. Lancet Child Adolesc. Health 4(11), 817–827 (2020). https://doi.org/10.1016/S2352-4642(20)30250-9
  • (45) Champredon, D., Dushoff, J., Earn, D.J.: Equivalence of the erlang-distributed seir epidemic model and the renewal equation. SIAM J. Appl. Math. 78(6), 3258–3278 (2018). https://doi.org/10.1137/18M1186411
  • (46) Allgower, E.L., Georg, K.: Numerical Continuation Methods: an Introduction. Springer, Berlin (2012). https://doi.org/10.1007/978-3-642-61257-2
  • (47) Bock, H.G.: Numerical treatment of inverse problems in chemical reaction kinetics. In: Ebert, K.H., Deufflhard, P., Jäger, W. (eds.) Modelling of Chemical Reaction Systems, pp. 102–125. Springer, Berlin (1981). https://doi.org/10.1007/978-3-642-68220-9
  • (48) Horbelt, W., Müller, T., Timmer, J., Melzer, W., Winkler, K.: Analysis of nonlinear differential equations: parameter estimation and model selection. In: Brause, R.W., Hanisch, E. (eds.) International Symposium on Medical Data Analysis, pp. 152–159. Springer, Berlin (2000). https://doi.org/10.1007/3-540-39949-6_19
  • (49) Ramsay, J.O., Hooker, G., Campbell, D., Cao, J.: Parameter estimation for differential equations: a generalized smoothing approach. J. R. Stat. Soc. Series B Stat. Methodol. 69(5), 741–796 (2007). https://doi.org/10.1111/j.1467-9868.2007.00610.x
  • (50) Bar-Zeev, N., Inglesby, T.: Covid-19 vaccines: early success and remaining challenges. Lancet (2020). https://doi.org/10.1016/S0140-6736(20)31867-5
  • (51) Bubar, K.M., Reinholt, K., Kissler, S.M., Lipsitch, M., Cobey, S., Grad, Y.H., Larremore, D.B.: Model-informed covid-19 vaccine prioritization strategies by age and serostatus. Science 371(6532), 916–921 (2021). https://doi.org/10.1126/science.abe6959
  • (52) Lurie, N., Saville, M., Hatchett, R., Halton, J.: Developing covid-19 vaccines at pandemic speed. N. Engl. J. Med. 382(21), 1969–1973 (2020). https://doi.org/10.1056/NEJMp2005630
  • (53) Moore, S., Hill, E.M., Tildesley, M.J., Dyson, L., Keeling, M.J.: Vaccination and non-pharmaceutical interventions for covid-19: a mathematical modelling study. Lancet Infect. Dis. 21(6), 793–802 (2021). https://doi.org/10.1016/S1473-3099(21)00143-2
  • (54) Saad-Roy, C.M., Wagner, C.E., Baker, R.E., Morris, S.E., Farrar, J., Graham, A.L., Levin, S.A., Mina, M.J., Metcalf, C.J.E., Grenfell, B.T.: Immune life history, vaccination, and the dynamics of sars-cov-2 over the next 5 years. Science 370(6518), 811–818 (2020). https://doi.org/10.1126/science.abd7343
  • (55) Gao, S., Teng, Z., Nieto, J.J., Torres, A.: Analysis of an sir epidemic model with pulse vaccination and distributed time delay. J. Biotechnol. Biomed. (2007). https://doi.org/10.1155/2007/64870
  • (56) Di Lauro, F., Berthouze, L., Dorey, M.D., Miller, J.C., Kiss, I.Z.: The impact of network properties and mixing on control measures and disease-induced herd immunity in epidemic models: a mean-field model perspective. Preprint at https://arxiv.org/abs/2007.06975 (2020)
  • (57) Klepac, P., Kucharski, A.J., Conlan, A.J., Kissler, S., Tang, M., Fry, H., Gog, J.R.: Contacts in context: large-scale setting-specific social mixing matrices from the BBC Pandemic project. Preprint at https://www.medrxiv.org/content/10.1101/2020.02.16.20023754v2 (2020)
  • (58) Nussbaumer-Streit, B., Mayr, V., Dobrescu, A.I., Chapman, A., Persad, E., Klerings, I., Wagner, G., Siebert, U., Christof, C., Zachariah, C., et al.: Quarantine alone or in combination with other public health measures to control covid-19: a rapid review. Cochrane Database Syst. Rev. (4) (2020). https://doi.org/10.1002/14651858.CD013574.pub2
  • (59) Carfì, A., Bernabei, R., Landi, F., et al.: Persistent symptoms in patients after acute covid-19. JAMA 324(6), 603–605 (2020). https://doi.org/10.1001/jama.2020.12603
  • (60) Yelin, D., Wirtheim, E., Vetter, P., Kalil, A.C., Bruchfeld, J., Runold, M., Guaraldi, G., Mussini, C., Gudiol, C., Pujol, M., et al.: Long-term consequences of covid-19: research needs. Lancet Infect. Dis. (2020). https://doi.org/10.1016/S1473-3099(20)30701-5
  • (61) Alwan, N.A.: A negative covid-19 test does not mean recovery. Nature, 170–170 (2020). https://doi.org/10.1038/d41586-020-02335-z
  • (62) Sheehy, L.M.: Considerations for postacute rehabilitation for survivors of covid-19. JMIR public health and surveillance 6(2), 19462 (2020). https://doi.org/10.2196/19462
  • (63) Sudre, C.H., Murray, B., Varsavsky, T., Graham, M.S., Penfold, R.S., Bowyer, R.C., Pujol, J.C., Klaser, K., Antonelli, M., Canas, L.S., et al.: Attributes and predictors of long covid. Nat. Med. 27(4), 626–631 (2021). https://doi.org/10.1038/s41591-021-01292-y
  • (64) Harris, C.R., Millman, K.J., van der Walt, S.J., Gommers, R., Virtanen, P., Cournapeau, D., Wieser, E., Taylor, J., Berg, S., Smith, N.J., Kern, R., Picus, M., Hoyer, S., van Kerkwijk, M.H., Brett, M., Haldane, A., Fernández del Río, J., Wiebe, M., Peterson, P., Gérard-Marchant, P., Sheppard, K., Reddy, T., Weckesser, W., Abbasi, H., Gohlke, C., Oliphant, T.E.: Array programming with NumPy. Nature 585, 357–362 (2020). https://doi.org/10.1038/s41586-020-2649-2
  • (65) Hunter, J.D.: Matplotlib: A 2d graphics environment. Comput. Sci. Eng. 9(03), 90–95 (2007). https://doi.org/10.1109/MCSE.2007.55
  • (66) McKinney, W., et al.: Data structures for statistical computing in python. In: van der Walt, S., Millman, J. (eds.) Proceedings of the 9th Python in Science Conference, vol. 445, pp. 51–56 (2010). https://doi.org/10.25080/Majora-92bf1922-00a
  • (67) Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S.J., Brett, M., Wilson, J., Millman, K.J., Mayorov, N., Nelson, A.R.J., Jones, E., Kern, R., Larson, E., Carey, C.J., Polat, İ., Feng, Y., Moore, E.W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E.A., Harris, C.R., Archibald, A.M., Ribeiro, A.H., Pedregosa, F., van Mulbregt, P., SciPy 1.0 Contributors: SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 17, 261–272 (2020). https://doi.org/10.1038/s41592-019-0686-2

Appendix Appendix A Verification of parameters

In order to verify that the agent-based approach was modelling the exact same problem as the SEIR-D model (and thus the data) without having to do any parameter estimation, we chose to check the following criteria:

  • is the mean of the output of the agent-based approach almost indistinguishable to the output of the SEIR-D model?

  • is the mean of the time spent in the states EE, UU and II close to the inverse of the associated rate parameter?

  • is the proportion of people going into UU, DUD_{U} and DHD_{H} close to the associated probability parameters?

The first criterion was measured by plotting both approaches, the second criterion was calculated by measuring the frequency of the length of stay of each agent in each compartment during the simulation, and the third criterion was calculated by counting the number of agents who went along each decision.

Whilst conducting this investigation we noticed that the agent-based approach overestimated the result, as can be seen in Figure 15, but converges towards the SEIR-D approach as we reduce Δt\Delta t. When checking the rate parameters, we noticed that the observed mean was approximately 0.5Δt0.5\Delta t greater than the expected mean from the fitted parameters, which can be seen in Figure 17 both by value (in the title of each plot) and by the fact that the red line (the expected probability density function) mostly goes through the next histogram column at the corner. We also noticed that this was independent of the number of Monte Carlo realisations. This led us to add on a correction term to the fitted parameters to reduce the observed mean towards the expected mean, namely by solving

1γ+x=1γΔt2,\displaystyle\frac{1}{\gamma+x}=\frac{1}{\gamma}-\frac{\Delta t}{2},

which rearranges to (4.1). We note that c(γ;Δt)0c(\gamma\mathchar 24635\relax\;\Delta t)\rightarrow 0 as Δt0\Delta t\rightarrow 0. Utilising this correction term, we see that the observed means for the parameters are significantly more accurate in Figure 18 and the output of the agent-based approach matches the SEIR-D approach better in Figure 16. It is important to stress that whilst we do see convergence as Δt\Delta t tends to 0, significantly more computational power is needed when reducing Δt\Delta t. By adding the correction term, we can use a larger time step, and thus less computational power, whilst still maintaining accurate results. This is particularly important when one could consider using a agent-based approach to model a complex phenomena, say one which does not have an obvious equation-based approach, and still be able to conduct parameter estimation.

As for the probability parameters, we noticed that as the number of Monte Carlo realisations increased, the closer the probabilities got to the SEIR-D parameter, which can be seen in Table 10. Intuitively, this is to be expected as the more decisions being made, the closer the probability should be approximated. One also notices that you can calculate the probabilities at the end of the simulation (rather than count) by manipulating the SEIR-D approach in the following way. To obtain mUm_{U}, one integrates (2.6) and rearranges to find

γU0TU(s)ds=11mU(RU(T)RU(0)),\displaystyle\gamma_{U}\int_{0}^{T}U(s)\,\text{d}s=\frac{1}{1-m_{U}}(R_{U}(T)-R_{U}(0)),

which, by integrating (2.8), inserting above and rearranging, gives

mU=DU(T)DU(0)RU(T)RU(0)+DU(T)DU(0).\displaystyle m_{U}=\frac{D_{U}(T)-D_{U}(0)}{R_{U}(T)-R_{U}(0)+D_{U}(T)-D_{U}(0)}.

One can apply the same idea to get

mH=DH(T)DH(0)RH(T)RH(0)+DH(T)DH(0),\displaystyle m_{H}=\frac{D_{H}(T)-D_{H}(0)}{R_{H}(T)-R_{H}(0)+D_{H}(T)-D_{H}(0)},

and

p=U(T)U(0)+γU0TU(s)dsU(T)U(0)+γU0TU(s)ds+I(T)U(0)+γI0TI(s)ds.\displaystyle p=\frac{U(T)-U(0)+\gamma_{U}\int_{0}^{T}U(s)\,\text{d}s}{U(T)-U(0)+\gamma_{U}\int_{0}^{T}U(s)\,\text{d}s+I(T)-U(0)+\gamma_{I}\int_{0}^{T}I(s)\,\text{d}s}.
Refer to caption
Figure 15: Comparison of the agent-based approach to the SEIR-D model, reducing Δt\Delta t and considering beds occupied for the South East without using the correction term.
Refer to caption
Figure 16: Comparison of the agent-based approach to the SEIR-D model, reducing Δt\Delta t and considering beds occupied for the South East using the correction term.
Refer to caption
Figure 17: Frequency density of lengths of agents in certain state without the use of the correction term. The titles of each plot depict the observed mean of the frequencies of the lengths, the expected mean given the state and fitted parameter, and the unit of days being considered. The first row corresponds to lengths of time agents spend in the EE state, which should correspond to an exponential distribution with mean γE\gamma_{E}, the second row corresponds to lengths of time agents spend in the UU state, which should correspond to an exponential distribution with mean γU\gamma_{U}, and the final row corresponds to lengths of time agents spend in the II state, which should correspond to an exponential distribution with mean γI\gamma_{I}. The first column uses a time unit of 1 day, the second column uses a time unit half a day and the last column uses a time unit of a quarter of a day.
Refer to caption
Figure 18: Frequency density of lengths of agents in certain state with the use of the correction term. The titles of each plot depict the observed mean of the frequencies of the lengths, the expected mean given the state and fitted parameter, and the unit of days being considered. The first row corresponds to lengths of time agents spend in the EE state, which does correspond to an exponential distribution with mean γE\gamma_{E}, the second row corresponds to lengths of time agents spend in the UU state, which does correspond to an exponential distribution with mean γU\gamma_{U}, and the final row corresponds to lengths of time agents spend in the II state, which does correspond to an exponential distribution with mean γI\gamma_{I}. The first column uses a time unit of 1 day, the second column uses a time unit half a day and the last column uses a time unit of a quarter of a day.
Table 10: Observed values of the probability parameters using a different number of Monte Carlo iterations to compare the observed value against the expected value given the state and fitted parameter, where mHm_{H} is the proportion of individuals who die in hospital (calculated by μH(γH+μH)1\mu_{H}(\gamma_{H}+\mu_{H})^{-1}).
Monte Carlo iterations
Parameter 1 5 10 20 Expected value
pp 0.9440 0.9411 0.9424 0.9405 0.9401
mUm_{U} 0.0019 0.0017 0.0015 0.0014 0.0013
mHm_{H} 0.4480 0.3573 0.3935 0.3815 0.3843
\botrule