Age-Stratified COVID-19 Spread Analysis and Vaccination: A Multitype Random Network Approach
Abstract
The risk for severe illness and mortality from COVID-19 significantly increases with age. As a result, age-stratified modeling for COVID-19 dynamics is the key to study how to reduce hospitalizations and mortality from COVID-19. By taking advantage of network theory, we develop an age-stratified epidemic model for COVID-19 in complex contact networks. Specifically, we present an extension of standard SEIR (susceptible-exposed-infectious-removed) compartmental model, called age-stratified SEAHIR (susceptible-exposed-asymptomatic-hospitalized-infectious-removed) model, to capture the spread of COVID-19 over multitype random networks with general degree distributions. We derive several key epidemiological metrics and then propose an age-stratified vaccination strategy to decrease the mortality and hospitalizations. Through extensive study, we discover that the outcome of vaccination prioritization depends on the reproduction number . Specifically, the elderly should be prioritized only when is relatively high. If ongoing intervention policies, such as universal masking, could suppress at a relatively low level, prioritizing the high-transmission age group (i.e., adults aged 20-39) is most effective to reduce both mortality and hospitalizations. These conclusions provide useful recommendations for age-based vaccination prioritization for COVID-19.
Index Terms:
COVID-19, epidemic modeling, random network, vaccination.I Introduction
Between January 2020 and November 30, 2020, about 1.47 million deaths from the novel coronavirus disease (COVID-19) are reported worldwide[1]. On the one hand, COVID-19 is much more deadly than most strains of flu. On the other hand, many people infected with the coronavirus do not develop symptoms, and hence they can transmit the virus to others without being aware of it[2], which makes the pandemic extremely difficult to contain.
To live with the COVID-19 pandemic, governments and healthcare systems are always struggling to save lives and “flatten the curve”, i.e., reducing the mortality and the peak of hospitalizations. Since severity and mortality rates of COVID-19 greatly vary across age-groups and increase dramatically for the elderly[3, 4], effective intervention policies to achieve these two goals must prevent elderly, who are at high-risk for severe clinical outcomes, from infections. For this reason, age-stratified modeling for COVID-19 dynamics indeed serves as the basis of accurately assessing the effectiveness of control policies in decreasing illness severity and mortality. In this respect, some age-stratified mathematical models have already been proposed to analyze the spread of COVID-19 for different purposes[5, 6, 7, 8]. However, these models are based on an oversimplified assumption that people are fully mixing, i.e., everyone contracting and spreading the virus to every other with equal probability, within each age group, which clearly fail to incorporate enough details in real-life contact networks. In reality, people in the same age group still differ greatly in the way of spreading the disease. As a consequence of this heterogeneity, it is found that epidemic outcomes in complex networks could deviate greatly from the results obtained from fully mixing epidemiological models[9, 10].
Motivated by the aforementioned observations, in this paper, we present a unified yet simple mathematical model for COVID-19 spread analysis by accounting for both the age-specific risk and the heterogeneity in contact patterns within and across age groups. We take advantage of random network theory to analyze the spread of COVID-19 in contact networks with general degree distributions. More specifically, we present an extension of standard SEIR (susceptible-exposed-infectious-removed) compartmental model, called age-stratified SEAHIR (susceptible-exposed-asymptomatic-hospitalized-infectious-removed) model to describe the disease progression for infected individuals, and study the epidemic spreading process in multitype random networks where each type of nodes is treated as an age group. Some key epidemiological metrics, such as time-dependent dynamics, steady-state epidemic size (which will be termed as epidemic size throughout this paper), epidemic probability, and reproduction number, are derived, allowing us to analyze the epidemics and the impact of control policies in a thorough and effective manner. Due to the consideration of stochasticity and network structure, the proposed model is capable of offering some useful epidemic results that the existing fully mixing age-stratified models are unable to provide, like assessing the impact of preferential isolation of nodes (e.g., immunizing essential workers first). Given that many contagious diseases, including influenza, also exhibit distinct characteristics for different groups of people[11], the proposed model can be easily generalized to modeling many other infectious diseases.

While non-pharmaceutical intervention (NPI) policies, such as masking and social distancing, are effective in reducing the transmissions and mitigating the healthcare burden, it has become increasingly clear that vaccination is the only way to eliminate the pandemic worldwide. Unfortunately, vaccine availability will be highly constrained for general population during at least the first several months of the vaccine distribution campaign. Therefore, vaccination prioritization decision will play a pivotal role in reducing the effects of COVID-19 during such a period[13, 14]. Under our proposed framework, we present an age-stratified vaccination strategy for the considered multitype network. In simulations, we focus on answering the following question: with limited doses available, who should be vaccinated first to reduce mortality and hospitalizations as much as possible? Our simulation results show that the answer depends on the value of reproduction number . The reason behind is that, the epidemic size (i.e., the fraction of population eventually getting infected) increases slowly in large region, while increasing steeply in small region. As a result, vaccinating the high-transmission group (adults aged 20-39) is highly effective in blocking COVID-19 transmissions in small region, which thus protects the high-risk group (the elderly) indirectly. In contrast, in large region, even if high-transmission group is prioritized, it will have little impact on epidemic size as long as the vaccine supply is limited. Consequently, directly vaccinating the high-risk group becomes the preferable strategy. We illustrate this phenomenon in Fig. 1, where prioritizing young people aged 20-39 is preferable when , whereas prioritizing the elderly is the better choice only when . Although most studies estimate that for COVID-19 is between 2-3.5 under pre-intervention scenarios[15], it certainly can be pushed to a relatively low level, e.g., below , via NPI policies or even natural immunity (the latter only meaningful for highly infected places[16]). Thus, our finding indicates that vaccination prioritization should be customized for different places by considering the ongoing NPI policies and other effects that could suppress . The key contributions of this paper are summarized as follows.
-
•
We employ multitype random network theory to develop an age-stratified epidemic model for COVID-19. We derive the time-dependent epidemic dynamics, where each individual could belong to one of six compartments, i.e., susceptible, exposed, asymptomatic, hospitalized, infectious and removed.
-
•
To analyze the stochastic property and final state of the epidemic, we derive other critical epidemiological metrics, such as epidemic size, epidemic probability, and reproduction number for the considered networks.
-
•
We present an age-stratified vaccination strategy based on the proposed model. The simulation results indicate that high-risk age group should be vaccinated first to diminish mortality and hospitalizations in large region. Conversely, when is suppressed at a low level, prioritizing the high-transmission age group becomes the most effective strategy.
The reminder of this paper is organized as follows. In Section II, we describe the related work. In Section III, we introduce the network model, and derive the time-dependent epidemic dynamics and other key epidemiological metrics. In Section IV, we devise an vaccination strategy for the considered networks. In Section V, we conduct simulations to compare different age-specific vaccination prioritization strategies. In Section VI, we draw our conclusions.
II Related Work
Some mathematical models for COVID-19 have been presented to account for the age-varying risks for mortality and severe illness. In [5], Singh et al. use an age-stratified SIR (susceptible-infective-removed) model to study the impact of social distancing measures, including workplace non-attendance, school closure, and lockdown, on the course of the COVID-19 pandemic. In [6], Balabdaoui et al. propose an age-stratified discrete compartmental model to describe the day-by-day progression of an infected individual in modern healthcare systems, e.g., in intensive care unit (ICU), with the objective of precisely projecting the occupancy of health care resources. In [7], Tuite et al. develop an age-stratified COVID-19 model to identify intervention strategies that keep the number of projected severe cases lower than the capacity of local health care systems. The aforementioned models make full-mixing assumption within each age group, which hence fail to capture enough details of population heterogeneity. In [17], Chang et al. propose an agent-based model to predict the infected number in Australia by considering the age-dependent effects. While agent-based models incorporate more realistic factors, they demand computationally intensive simulations, and generally offer limited insights into epidemic outcomes.
Random network theory allows us to model epidemics by taking heterogeneous contact network structure into account while bypassing computationally complicated simulations. Epidemic propagation in networks can be exactly interpreted as a bond percolation process, which hence can be analyzed by well-understood physics models, such as percolation[18]. Although several works have applied percolation theory to analyze the spread of COVID-19[10, 19, 20], they have not taken the age-varying effects into consideration. On the other hand, given that an age-stratified population can be characterized as a multitype random network in which each type of vertices correspond to an age group, one possible direction is to directly map the epidemic spread to bond percolation in multitype random graphs[21, 22]. Unfortunately, percolation theory is mostly limited to analysis of final state of networks, and cannot predict time-dependent transient dynamics. In [23], Miller et al. propose an edge-based SIR compartmental model to describe the time-dependent epidemic dynamics in complex networks. Inspired by their approach, we solve the time-dependent dynamics for COVID-19 in multitype random networks, and then derive the expressions for epidemic size, epidemic probability, and reproduction number by performing analysis on the final state of the considered networks.
The design of vaccination prioritization strategies for COVID-19 has also attracted some research attention. Nonetheless, most of works draw the conclusion that vaccinating the older groups first is the robust strategy to minimize mortality or hospitalizations during a vaccine shortage[24, 25, 26]. This perhaps is because they fail to identify the underlying relationship between the priority population and the reproduction number . Our finding coincides with these works only when is great. Recently, Jentsch et al. show that prioritizing the high-transmission group will reduce the death toll from COVID-19 most if vaccines become available late next year for Ontario, because high level of natural immunity may be already achieved in Ontario at that time[8]. Their conclusion essentially shares the same observation with ours as higher natural immunity leads to a lower . Different from their work, by taking advantage of our epidemic model, we also study the impact of vaccination prioritization strategies on hospitalizations, and the effectiveness of immunizing people with high activity, i.e., the essential workers. Furthermore, our simulation results show that vaccinating high-transmission group is highly effective as long as is small, which applies to areas that are either hit hard as in [8] or only have few infections but with relatively strict NPI policies, e.g., masking mandate.
III Epidemic analysis
III-A Network and compartmental model
Let us consider a multitype network which consists of types of nodes, each corresponding to an age group in a population. We use to represent the fraction of the nodes of type . The contact from a type- node to others follows degree distribution , describing the joint probability for type- node to be connected with type- node, type- node, …, and type- node, where . The considered network can be generated by the following procedure: 1) generate stubs for every node following degree distribution , where each stub contains the information about which type of node it reaches. 2) randomly wire two matching stubs together to create an edge and repeat this process until no stubs left.
Susceptible-Infected-Removed (SIR) and Susceptible-Exposed-Infected-Removed (SEIR) compartmental models are widely used for epidemic modeling. In SEIR compartmental model, each individual can be in one of the four states, i.e., Susceptible, Exposed, Infected, or Removed. Here, to capture the salient features of COVID-19, we present a novel compartmental model, i.e., SEAHIR model, which adds two additional compartments, i.e., asymptomatic and hospitalized, to the classic SEIR model. In SEAHIR model, each individual can be in one of the six states: susceptible (S), exposed (E), symptomatic and infectious (I), asymptomatic and infectious (A), hospitalized (H), and removed (R). Both new compartments are paramount to describe the dynamics of COVID-19: the number of people in H state indicates the hospitalizations, which must be kept lower than health care capacity; patients in A state have different level of infectivity compared with symptomatic ones[27]. We assume that individuals in E state is not infectious because of low virus load, and individuals in H state are properly isolated. Individuals in I and A states are assumed to be infectious to others, where the infection rate from a type- source node to a type- node is or given the source node belongs to I or A state. For conciseness, we do not distinguish recovery and death in R state, but assume that an age-dependent fraction of infected people will die. Furthermore, given the fact that the cases of reinfection with COVID-19 are still extremely rare, we do not consider the transition from R state to S state. For type- nodes, the transitions among the compartments are illustrated in Fig. 2, where the symbols on the arrows denote the corresponding transition rates from one to another, which are all dependent on node type to account for the age-dependent effects.
III-B Time-dependent dynamics

Disease propagates from infectious nodes to its neighbors, leading to an epidemic if the epidemic size is comparable to the whole population. We employ the edge-based compartmental method to solve the equations of dynamics governing the epidemic spread over random graph[23]. The core idea of the edge-based method is to shift our attention from an individual node to the status of its neighbor reached by an edge. To study the impact of vaccination or natural immunity, we consider that a fraction of population may be already immune at the beginning of analysis: represents the fraction of type- nodes with degree that are initially susceptible. Besides, we use to denote the probability that a type- neighbor has not transmitted the disease to a type- node by time given that the type- node is susceptible at time . can be interpreted as a state of a type- neighbor of an initially susceptible type- node. It is noted that according to the definition. Based on Fig. 2, we can construct the following equations to characterize the time-dependent epidemic process.
(1) | ||||
(2) | ||||
(3) | ||||
(4) | ||||
(5) | ||||
(6) |
where , , , , , and represent the proportions of type- nodes in the corresponding states at time , respectively. From Markov chain theory, when the network size is sufficiently large, the fraction of nodes in , , , and states can be described well by the differential equations (2)-(5) due to the flow diagram in Fig. 2. Moreover, is obtained from . For the initial conditions, we assume and . Obviously, one can solve the above equations as long as the key probability, i.e., , is derived. To calculate , following the approach in [23], we break it into six parts, i.e., , , , , , and . Specifically, , , , , , or represents the probability that the considered type- neighbor is in , , , , , or state, respectively, and has not transmitted the disease to the initially susceptible type- node by time , satisfying
(7) |
A type- neighbor in state cannot infect the type- node, and hence is simply equal to the probability that the considered type- neighbor is susceptible. The degree distribution of the considered type- neighbor is given by , where is the average degree leaving from type- node to type- node, which normalizes the probability distribution. This quantity is proportional to because type- nodes with more edges incident to type- nodes are more likely to become the neighbors of type- nodes[21]. We can obtain by computing the probability that the type- neighbor is initially susceptible and has not been infected by any of its neighbors, except the considered type- node, by time , i.e.,
(8) |
where is the Kronecker delta operator, with only when , and otherwise.
As mentioned before, two states in our compartmental model, i.e., and states, are infectious. Thus, the decrease in comes from two joint events: 1) the type- neighbor is or state, and 2) it transmits the disease to the type- node with infection rate or , i.e.,
(9) |
One can also interpret (9) in this way: there is a state (which corresponds to that the type- neighbor has transmitted the disease to the initially susceptible type- node by time ) receiving the flows from both and .
If staying in , or state, the type- neighbor transits to R state with rate , or , which means
(10) |
The type- neighbor progresses from I state to H state with rate , leading to
(11) |
States and receive the flows from state . Besides, progresses to state and , while progresses to state , , and , yielding
(12) | ||||
(13) |
In summary, one can solve from the following equations.
(14) |
where , , and . By plugging into (1), we can obtain the fraction of type- nodes in each compartment at given time from (1)-(6). As a result, the desired age-stratified epidemic dynamics can be obtained by solving equations, where is the number of age groups. One can see that (1)-(6) and (14) account for considerably more population structure than a fully mixing model by only introducing marginally more complexity.
III-C Epidemic size and epidemic probability
Epidemic size measures the fraction of people eventually getting infected, and epidemic probability is defined as the likelihood that the first infected patient sparks an epidemic[9]. SEAHIR model contains more compartments than traditional SIR or SEIR models, which complicates the derivations of these two key metrics. Fortunately, both metrics only depend on final state of networks. From Fig. 2, it is intuitive to see that the network progresses to an equilibrium when , where . By using this fact, we use a single compartment to replace all the infected states, i.e., E, A, I, H states in the original SEAHIR compartmental model. Here is our trick: although SR model cannot be used to capture the temporal dynamics of SEAHIR model, it can be calibrated appropriately to share the same final network state, i.e., and , with SEAHIR model. Let denote the probability that an infected type- node ultimately transmits the disease to an initially susceptible adjacent type- node. We assume that the SR model is with infection rate from type- node to type- node and transition rate from state to R state for a type- neighbor of a type- node. Since for the SR model is , the SR model has exactly the same final network state as the SEAHIR model as long as is set to the in the SEAHIR model.
Let us first calculate in the SEAHIR model, and then derive the desired metrics based on the SR model. An infected type- node may enter one of two infectious states, i.e., state or state, with probability or . Given that the type- node in I or A state, suppose that there are two stages that it will progress to, i.e., “infecting the adjacent type- node” and “leaving current state”, with rates (or ) and (or ), respectively. is the probability that the considered node enters the first stage, which equals if the type- node is in state, and equals if it is in state, yielding
(15) |
In SR model, we use , , or to represent the probability that a type- neighbor of an initially susceptible type- node is in , , or state and meanwhile has not infected this type- node by time , and use to denote the probability that the type- neighbor has not infected the initially susceptible type- node by time , satisfying . Notice that only when , because SEAHIR and SR model share the same final network state while having different temporal dynamics. By analogy with the preceding subsection, we can obtain
(16) | ||||
(17) | ||||
(18) | ||||
(19) |
where is the probability that the type- neighbor is initially removed (immune), and . The derivation of (16) and (18) is similar to that of (8) and (9). Analogous to the preceding subsection, probability corresponds to that the type- neighbor has transmitted the disease to the initially susceptible type- node by time . State transits to state with rate while transiting to state with rate , leading to the relationship between and in (17).
Clearly, the epidemic dynamics can be governed by (18) and (19). By taking (19) into (18), we have
(20) |
Now the advantage of using SR compartmental model becomes clearer: we arrive at (20) which is only related to . Since the population is closed, the epidemic will eventually go extinct, implying . Given that and , and recall that , we can go back to the original SEAHIR model and obtain
(21) |
where can be calculated from (15). One can solve from the above equation. In fact, (21) can be explained in an intuitive way: a type- node has not been infected by a type- neighbor since either because it cannot be reached from its neighbor with probability , or it can be reached from its neighbor with probability but the neighbor has not been infected since . Thus, as , the fraction of type- nodes that have been infected since is given by , i.e.,
(22) |
The epidemic size is therefore expressed as
(23) |
It is noted that when the population is fully susceptible, i.e., for all and , solved from (21) and (23) is the size of giant component in multitype networks obtained by percolation theory[21, 22]. We can compute epidemic probability in a similar way. Let denote the probability that a type- node (which is assumed to be infected) does not spark an epidemic via a type- neighbor. Analogous to (21), the following argument is true: a type- node does not ignite an epidemic via a type- node either because it cannot infect its type- neighbor, or it can infect its type- neighbor but the latter cannot spark an epidemic, which means
(24) |
As a result, the probability that an infected type- node ignites an epidemic is given by
(25) |
By randomly choosing a susceptible node to infect, we define the likelihood that it starts an epidemic as the epidemic probability:
(26) |
where is the probability that the randomly chosen susceptible node is of type , which reduces to in a fully susceptible population.
III-D Reproduction number
One of the fundamental parameters for an epidemic is its reproduction number, i.e., the average number of secondary cases caused by an infected individual. Again, since this metric is unrelated to temporal dynamics, we can derive it from SR model to simplify our calculation. According to the seminal work [28], the reproduction number is equivalent to the spectral radius of the next generation matrix , where the element of matrix is the rate of new infections entering infected state caused by infected state , and element of matrix is the rate at which infected state transfers to infected state , assuming that the population remains near the disease-free equilibrium. In our edge-based SR model (18) and (19), can be treated as the infected state. Therefore, we have infected states in total. Recall that there should be , where is calculated from (15). To simplify the derivations below, we further assume that (one can derive the same in (30) without this assumption, because is only related to ). By differentiating (19) and taking (18) into it, we have
(27) |
Then, to obtain the linearized subsystem for infected states about the disease-free equilibrium, we linearize (27) at the origin ( and ):
(28) |
Then, we can construct as an matrix, with element equal to
(29) |
and construct as an identity matrix. Reproduction number is therefore given by
(30) |
where represents spectral radius. From Theorem 2 in [28], marks the epidemic threshold in the sense that the disease-free equilibrium is asymptotically stable if , and is unstable if . In particular, in the case of for all and , becomes the basic reproduction number, i.e., the average number of secondary cases caused by an infected individual in a completely susceptible population. In this special case, the epidemic threshold is also in agreement with the threshold for multitype random network obtained by percolation theory[21, 22].
IV Age-Stratified Vaccination
Under our proposed analytical framework, in this section, we present an age-stratified vaccination scheme and study its impact on epidemic outcomes. Considering a network with nodes of type , we use the following function to characterize the immunization strategy for type- nodes[29, 30]:
(31) |
where is the probability that a susceptible node of type with -degree is chosen to be immunized, and is an exponent quantifying the immunization preference towards nodes with high degree. We use the tilde operator on to represent the total degree of a node. A greater indicates that nodes with higher degree (e.g., essential workers) are more likely to be immunized. In particular, represents a node immunization process in an entire descending order, i.e., from the highest degree to the lowest degree. In contrast, when , we have , implying a uniform immunization strategy for nodes of type . Notice that since the full knowledge of a contact network is generally unavailable, immunizing nodes in a descending order is rather unrealistic. The value of depends on the strategy and knowledge of a vaccine distributor.
Recall that denotes the fraction of type- nodes with degree that are initially susceptible. In our model, studying the impact of the immunization strategy in (31) only requires solving new initial conditions , i.e., the fraction of type- nodes with degree that are still susceptible when only fraction of type- nodes remain susceptible after implementing the immunization strategy, where , with being the fraction of type- nodes immunized by vaccination. Let be the probability that a type- node is with degree , and be the fraction and the number of type- nodes with degree that are still susceptible when only fraction of type- nodes remain susceptible, respectively. Due to the fact that (31) only depends on total node degree , the subsequent development is only related to instead of the vector of node degree . According to the definitions, we have the relationship
(32) |
In the limit of , (33) can be expressed as
(35) |
In the spirit of [29], we define a new function and introduce a new variable in order to solve (36), where in is the fraction of type- nodes with degree that are susceptible before implementing the immunization strategy. One can find that
(37) |
exactly satisfies (36), which hence is the solution to (36). Equivalently, considering the vector of node degree with as the total degree, we have
(38) |
By simply replacing with the new initial conditions , we can obtain the various epidemic outcomes in the preceding section by taking the age-stratified immunization into account.
V Simulations
V-A Parameter settings
We now study the impact of control policies, particularly age-specific vaccination strategies, for the COVID-19 pandemic. We use the estimated social contact data by age groups for the United States to conduct our simulations, where represents the contact matrix by age, with being the average number of contacts that a node of type has with nodes of type [12]. Notice, however, that the conclusions drawn from our simulations are also generalizable to many other countries, because age-stratified contact exhibits similar patterns in most countries[31, 12]. The population is partitioned into age groups, i.e., populations of , , , , and years old.
Following [32], we assume that the susceptibility to infection for adults over 20 years is identical, and the susceptibility to infection for individuals under 20 years old is half of that for adults over 20 years old. Specifically, we set transmission rate for all and , and for all and . We set to account for the fact that asymptomatic people are less infectious than symptomatic ones111It is commonly recognized that symptomatic patients are more infectious than asymptomatic ones because cough and sneeze could help spread the virus.. The course of an epidemic is primarily governed by the basic reproduction number. Being consistent with [15], we assume that the basic reproduction number is , and derive the transmission rate from (30) accordingly. Given that young people develop milder symptoms or no symptoms more frequently than the elderly, the symptomatic probabilities are set to , , , , , and , and the probabilities of needing to be hospitalized for symptomatic cases are set to , , , , , and from young age groups to old age groups[3]. Furthermore, the infection fatality ratios are set to , , , , , and from the young to the elderly, respectively[3]. We set the average time from the exposure to the onset of being infected (i.e., A or I states) to 5 days, the average infection period to days if not admitted to hospital, and the average time stay in hospitals to days[33]. To compare the effectiveness of vaccination prioritization strategies, we consider a completely susceptible population before vaccination. Later, this assumption is relaxed in Fig. 8 to demonstrate the consistency of our conclusion by considering a population with a high level of natural immunity.

V-A1 Modeling of Universal Masking
At the early stage of vaccine distribution campaign, masking and/or social distancing measures are still needed. Thus, assessing the effectiveness of an vaccination prioritization strategy requires the considerations of ongoing NPI policies. We denote the population contact matrix as , where , , and are the age-stratified contact matrices for home, workplace, school, and other locations, respectively[12]. For instance, the -th element in , denoted by , is the average degree from a node of type to nodes of type at home. By considering the NPIs, we modify the population contact matrix as follows.
(39) |
where is the scaling factor accounting for the change in transmission rate per contact due to the presence of NPIs. We assume that widespread face masking in public (i.e., workplace, school, and other locations) are recommended. Since reducing the transmission rate for an edge (contact) by is equivalent to removing this edge with probability for the spread of epidemic[9], we directly scale the contact matrices , , and to reflect the reduction in transmission rates in these places. The value of can be estimated from mask coverage (the fraction of population wearing masks) and mask efficacy (the fraction of effective transmissions blocked by masking)[34]. In what follows, we assume for illustrative purpose. By this scaling, reproduction number is pushed from to . We will compare different vaccination prioritization strategies under no-masking scenario with and masking scenario with . We remark that weaker mask use in conjunction with other NPIs in public places (i.e., social distancing measures) may have a similar effect on . This is because social distancing also reduces the transmission opportunity between two individuals, which has no difference with masking in terms of mathematical modeling.
To demonstrate that our parameter is realistic, we refer the readers to the reference [34]. According to Ref. [34], when the product of mask coverage and mask efficacy is , e.g., mask coverage is and mask efficacy is , the relative transmission rate of COVID-19 reduces to compared with the no-masking case. In the real world, the efficacy of surgical masks is estimated to be about [34]. Therefore, three quarters of population wearing surgical masks in the public places (i.e., workplace, school, and other locations) may lead to .
V-A2 Impact of Population Structure Heterogeneity
Even with the same and contact matrix (containing average contact number across age groups), the epidemic may still spread differently in networks because of the assumption on degree distributions. Fig. 3 sheds light on the effects of structural heterogeneity on epidemic outcome. Based on the same contact matrix, we examine two types of degree distributions: Poisson distribution and power law distribution with the law’s exponent equal to [9]. From the estimates in [12], the contact numbers are assumed to follow Poisson distributions. Compared with Poisson distributions, power law distributions have a quite “heterogeneous” structure: it contains not only many nodes with few contacts, but also a handful of “superspreaders” with very high degree. As illustrated in Fig. 3, power law network shrinks the epidemic size compared with Poisson network, which clearly reveals that the same and matrix of average contact number are still not enough to accurately forecast epidemic dynamics. In fact, the assumption that contact networks follow Poisson distributions is rather ideal, as it fails to capture the superspreader events that may greatly drive the transmissions of COVID-19[16]. An estimate of degree distributions of real-world contact networks is still needed in the future research to improve the precision of projected epidemic results. However, noting that our mission in this section is to seek the effective vaccination prioritization policies rather than providing the exact or even best estimate of epidemic dynamics, we assume that the contact network follows Poisson distributions without loss of generality.
V-B Effectiveness of Vaccination Prioritization Strategies




We intend to compare the effectiveness of age-specfic vaccination prioritization strategies under two scenarios: no-masking scenario and masking scenario. Unless specified otherwise, we consider uniform vaccination within the same age group by setting in (38). Although we conduct simulations by assuming the basic reproduction number , our conclusions below also hold for other reasonable estimated values for COVID-19, such as from to . We assume that the vaccine efficacy for individuals below years old is , and for individuals over years old is according to the Moderna’s clinical trial data[35]. In fact, our findings below also hold for Pfizer’s vaccine efficacy, which is about for all age groups[36]. Since the elderly aged 60+ has the lowest average contact number but the highest mortality and illness severity, while the adults aged have a high average contact rate and low mortality and illness severity, we call the adults aged as the high-transmission group, and the elderly aged 60+ as the high-risk group. We remark that, even though the children aged 5-19 have the highest average contact number among the population[12], they are assumed to be less susceptible to the infection as mentioned before, thus contributing less to the COVID-19 transmissions than the adults aged .
In Fig. 4, we compare the epidemic size under different vaccination prioritization strategies by varying the fraction of the whole population being vaccinated. In the figure, we use “20-39 prioritized” (or other age ranges) to represent that the vaccine doses are all given to the population aged 20-39. In particular, since the children aged only constitute about of the whole population, the remaining vaccines, if all the children aged get vaccinated (i.e., in the case where or population is vaccinated in the figure), are uniformly allocated to other age groups. As shown in the figure, prioritization of the adults aged 20-39 is most effective in blocking the transmissions and reducing the infections under both no-masking and masking scenarios. However, we should notice the difference: the reduction in epidemic size achieved by prioritization of the high-transmission group under masking scenario is much more significant than that of no-masking scenario. As illustrated in Fig. 3, when around , i.e., under no-masking scenario, epidemic size decreases slowly with . As a result, no matter which vaccination strategy is implemented, it will not affect the epidemic size much, as observed from Fig. 4(a). In contrast, under masking scenario with , epidemic size shrinks fast as reduces. For this reason, prioritizing the high-transmission group reduces the epidemic size significantly as shown in Fig. 4(b).
In Fig. 5, we investigate which age-specific vaccination prioritization strategy reduces the mortality (the death toll over the whole population) most. This metric is calculated from the epidemic size and the age-dependent mortality ratios. As shown in Fig. 5(a), prioritizing the elderly achieves the lowest mortality under no-masking scenario. This is due to two facts: 1) no matter which prioritization strategy is implemented, limited vaccine doses will not decrease the epidemic size much when is great. 2) The elderly people have remarkably higher mortality risk than the remaining population. Consequently, protecting the elderly directly is a wise method in such a case. Nonetheless, under masking scenario, inoculating the adults aged becomes the most effective strategy to reduce the mortality as illustrated in Fig. 5(b). This is because prioritization of high-transmission groups substantially blocks COVID-19 transmissions in small region as aforementioned, thereby in turn protecting the elderly even though they have not been vaccinated. There exist some related simulation studies in the literature. It is shown in [8] that vaccinating high-transmission group is the best strategy to minimize the death from COVID-19 when the NPIs (for Ontario) is combined with high level of natural immunity. Our results further illustrate that relatively strong NPI strategies, even without natural immunity, could lead to the same conclusion. In [24], the researchers show that high-transmission group should be prioritized to minimize death only when the vaccination covers a large proportion of the population (e.g., over coverage when the vaccine efficacy is ). This finding may be due to that they have not combined vaccination with NPIs. As a consequence of this difference, our results instead indicate that the high-transmission group should be prioritized under the presence of relatively strong NPIs, even if the vaccine coverage is very limited, say, , as shown in Fig. 5(b).


In Fig. 6, we evaluate how different vaccination prioritization strategies affect hospitalizations. Similar to the results for mortality, vaccinating high-transmission group first is still more effective under masking scenario, as shown in Fig. 6(b). On the other hand, while both severity and mortality risks increase with age, the disparity in severity ratio between the elderly and younger groups is not as significant as the disparity in the mortality ratio. As a result, under no-masking scenario, vaccinating the adults aged or first even slightly outperforms vaccinating the elderly first in terms of reducing hospitalizations from COVID-19 as depicted in Fig. 6(a), because the former age groups have much higher average contact rates than the elderly people.


Fig. 7 evaluates the performance of different age-specific vaccination strategies versus . (38) with corresponds to a vaccine plan targeting at people with high activity level (e.g., essential workers) in that age group. Thus, it is not surprising to see that the vaccination strategies with outperform the corresponding vaccination strategies with in both no-masking and masking scenarios. The effect of changing is more significant in Fig. 7(b) than Fig. 7(a), as the epidemic results in small region are more sensitive to the limited vaccination. In real world, reflects the vaccine allocation plan and the knowledge of the vaccine distributor towards the population structure, which must be taken into account to forecast the effectiveness of certain immunization strategies. This preferential immunization for human networks with general degree distributions, however, cannot be evaluated based on traditional age-stratified homogeneous-mixing models.
Fig. 8 examines the consistency of our conclusions under a hard-hitting scenario with population naturally immune before vaccination. To obtain the initial conditions for this case, we simulate the disease spread according to the time-dependent dynamics in (1)-(6) until when around people get infected. Due to the natural immunity, reduces from to under the no-masking scenario. To sustain above , we set , which corresponds to a looser masking measure, resulting in under the masking scenario. As can be observed from Fig. 8, vaccinating the elderly still reduces the mortality from COVID-19 most in the no-masking case, and vaccinating adults aged still decreases the mortality most in the masking scenario. This phenomenon demonstrates that our conclusion still holds when a high level of natural immunity has already been achieved.


V-C Epidemic Probability

The proposed epidemic model is capable of capturing the stochastic property of epidemics. For countries or areas that have no active cases inside, it is useful to estimate the probability that a new import case, if not quarantined properly, sparks an epidemic. In Fig. 9, we study how universal masking could suppress the epidemic probability under both no-masking and masking scenarios. Recall that the epidemic probability is the likelihood that a zero patient randomly chosen from the population starts an epidemic. Under the no-masking scenario, it is shown that the epidemic probability is about when , implying that most new cases, if not isolated, will give rise to an epidemic. Conversely, face mask wearing effectively suppresses the epidemic probability to about . As reported in China, import cold-chain food contamination is even a source for COVID-19 resurgence, and therefore it is nearly impossible to isolate all new (or import) case properly. Consequently, taking some low-cost control policies, such as universal masking, is still important for disease-free areas to reduce or eliminate the risk of COVID-19 outbreak or resurgence.
VI Conclusion
To combat the COVID-19 pandemic, one of the most important research tasks is to find out how to effectively decrease mortality and severe illness from COVID-19. To achieve this goal, we present a unified analytical framework for COVID-19 by considering both age-dependent risks and heterogeneity in contact networks within and across age groups. Under this framework, we use a novel age-stratified SEAHIR compartmental model to account for the distinct dynamics in a micro-state level, and employ the multitype random network approach to characterize the spread of epidemics. Several critical epidemiological metrics, including time-dependent dynamics, epidemic size, epidemic probability, and reproduction number are rigorously derived to capture essential features to be used to manage the pandemic.
Based on our proposed epidemic model, we have also studied the vaccination problem. It turns out that what is the best vaccination prioritization strategy to decrease mortality and hospitalizations depends on the reproduction number . In other words, the effective strategies may vary across different areas, and heavily depends on the level of local NPI policies, such as masking, that suppress COVID-19 transmissions. We conclude that vaccinating the high-risk group is only effective in reducing mortality when is relatively high, e.g., under the no-masking scenario, whereas vaccinating the high-transmission group turns out to be the wise strategy if intervention policies have already suppressed at a low level. Although there are many social and ethical considerations in vaccination allocation, our results provide the rationale for vaccination prioritization at early stage of vaccination campaign.
There are several promising directions for future research. First, the COVID-19 reinfection can be incorporated into the epidemic model. In this paper, we assume that once a person becomes immune (either via getting infected or vaccinated), the person will never contract the disease. In a relatively short term (e.g., several months), this assumption may be reasonable given the rare reports of reinfection and the current understanding of the vaccination. However, how long the protective antibodies last remains an open problem. To take the possible reinfection into account, we need to break the state into recovery, vaccinated and death states, and consider the transition from the recovery and vaccinated states to state . Under this case, we can still obtain the time-dependent epidemic dynamics by using the proposed approach in Section III-B. Nevertheless, since the final state of the network may not be disease-free due to the existence of reinfection, other fundamental epidemic metrics, such as epidemic size and reproduction number, cannot be calculated via the proposed approach, which is worth studying in the future. Second, it is useful to evaluate many other NPIs based on our proposed model. For instance, what is the impact of limiting some gatherings or events, such as mass gatherings in bars, gyms, and churches, on the epidemic spread? To answer these kinds of questions, one can simulate a realistic contact network (e.g., with households, schools, bars, gyms, and churches), as in [9], to discover the impact of different gatherings or events on the network structure, and then perform the epidemic analysis using our mathematical framework by removing the edges contributed by these gatherings.
References
- [1] World Health Organization (WHO), “WHO Coronavirus Disease (COVID-19) Dashboard,” https://covid19.who.int/, 2020.
- [2] Centers for Disease Control And Prevention (CDC), “Similarities and differences between flu and COVID-19,” https://www.cdc.gov/flu/symptoms/flu-vs-covid19.htm, 2020.
- [3] N. Ferguson, D. Laydon, G. Nedjati Gilani, N. Imai, K. Ainslie, M. Baguelin, S. Bhatia, A. Boonyasiri, Z. Cucunuba Perez, G. Cuomo-Dannenburg et al., “Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand,” https://dsprdpub.cc.ic.ac.uk:8443/handle/10044/1/77482, 2020.
- [4] R. Verity, L. C. Okell, I. Dorigatti, P. Winskill, C. Whittaker, N. Imai, G. Cuomo-Dannenburg, H. Thompson, P. G. Walker, H. Fu et al., “Estimates of the severity of coronavirus disease 2019: a model-based analysis,” The Lancet infectious diseases, June 2020.
- [5] R. Singh and R. Adhikari, “Age-structured impact of social distancing on the COVID-19 epidemic in India,” arXiv preprint arXiv:2003.12055, 2020.
- [6] F. Balabdaoui and D. Mohr, “Age-stratified model of the COVID-19 epidemic to analyze the impact of relaxing lockdown measures: nowcasting and forecasting for Switzerland,” medRxiv, 2020.
- [7] A. R. Tuite, D. N. Fisman, and A. L. Greer, “Mathematical modelling of COVID-19 transmission and mitigation strategies in the population of Ontario, Canada,” CMAJ, vol. 192, no. 19, pp. 497–505, May 2020.
- [8] P. Jentsch, M. Anand, and C. T. Bauch, “Prioritising COVID-19 vaccination in changing social and epidemiological landscapes,” medRxiv, 2020.
- [9] L. A. Meyers, B. Pourbohloul, M. E. Newman, D. M. Skowronski, and R. C. Brunham, “Network theory and SARS: predicting outbreak diversity,” Journal of theoretical biology, vol. 232, no. 1, pp. 71–81, January 2005.
- [10] L. Hébert-Dufresne, B. M. Althouse, S. V. Scarpino, and A. Allard, “Beyond R0: Heterogeneity in secondary infections and probabilistic epidemic forecasting,” medRxiv, 2020.
- [11] J. Medlock and A. P. Galvani, “Optimizing influenza vaccine distribution,” Science, vol. 325, no. 5948, pp. 1705–1708, September 2009.
- [12] K. Prem, A. R. Cook, and M. Jit, “Projecting social contact matrices in 152 countries using contact surveys and demographic data,” PLoS computational biology, vol. 13, no. 9, p. e1005697, September 2017.
- [13] Centers for Disease Control And Prevention (CDC), “How CDC is making COVID-19 vaccine recommendations,” https://www.cdc.gov/coronavirus/2019-ncov/vaccines/recommendations-process.html.
- [14] “WHO unveils global plan to fairly distribute COVID-19 vaccine, but challenges await,” https://www.sciencemag.org/news/2020/09/who-unveils-global-plan-fairly-distribute-covid-19-vaccine-challenges-await, Science.
- [15] T. Britton, F. Ball, and P. Trapman, “A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2,” Science, vol. 369, no. 6505, pp. 846–849, August 2020.
- [16] A. V. Tkachenko, S. Maslov, A. Elbanna, G. N. Wong, Z. J. Weiner, and N. Goldenfeld, “Persistent heterogeneity not short-term overdispersion determines herd immunity to COVID-19,” arXiv preprint arXiv:2008.08142, 2020.
- [17] S. L. Chang, N. Harding, C. Zachreson, O. M. Cliff, and M. Prokopenko, “Modelling transmission and control of the covid-19 pandemic in australia,” Nature communications, vol. 11, no. 1, pp. 1–13, November 2020.
- [18] M. E. Newman, “Spread of epidemic disease on networks,” Physical review E, vol. 66, no. 1, p. 016128, July 2002.
- [19] A. Allard, C. Moore, S. V. Scarpino, B. M. Althouse, and L. Hébert-Dufresne, “The role of directionality, heterogeneity and correlations in epidemic risk and spread,” arXiv preprint arXiv:2005.11283, 2020.
- [20] Y.-C. Chen, P.-E. Lu, C.-S. Chang, and T.-H. Liu, “A Time-dependent SIR model for COVID-19 with undetectable infected persons,” IEEE Transactions on Network Science and Engineering, October-December 2020.
- [21] A. Allard, P.-A. Noël, L. J. Dubé, and B. Pourbohloul, “Heterogeneous bond percolation on multitype networks with an application to epidemic dynamics,” Physical Review E, vol. 79, no. 3, p. 036113, March 2009.
- [22] A. Allard, L. Hébert-Dufresne, J.-G. Young, and L. J. Dubé, “General and exact approach to percolation on random graphs,” Physical Review E, vol. 92, no. 6, p. 062807, December 2015.
- [23] J. C. Miller, A. C. Slim, and E. M. Volz, “Edge-based compartmental modelling for infectious disease spread,” Journal of the Royal Society Interface, vol. 9, no. 70, pp. 890–906, October 2011.
- [24] L. Matrajt, J. Eaton, T. Leung, and E. R. Brown, “Vaccine optimization for COVID-19, who to vaccinate first?” medRxiv, 2020.
- [25] K. M. Bubar, S. M. Kissler, M. Lipsitch, S. Cobey, Y. Grad, and D. B. Larremore, “Model-informed COVID-19 vaccine prioritization strategies by age and serostatus,” medRxiv, 2020.
- [26] J. H. Buckner, G. Chowell, and M. R. Springborn, “Optimal dynamic prioritization of scarce COVID-19 vaccines,” medRxiv, 2020.
- [27] R. Li, S. Pei, B. Chen, Y. Song, T. Zhang, W. Yang, and J. Shaman, “Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2),” Science, vol. 368, no. 6490, pp. 489–493, May 2020.
- [28] P. Van den Driessche and J. Watmough, “Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission,” Mathematical biosciences, vol. 180, no. 1-2, pp. 29–48, November-December 2002.
- [29] J. Shao, S. V. Buldyrev, L. A. Braunstein, S. Havlin, and H. E. Stanley, “Structure of shells in complex networks,” Physical Review E, vol. 80, no. 3, p. 036105, September 2009.
- [30] W. Wang, M. Tang, H.-F. Zhang, H. Gao, Y. Do, and Z.-H. Liu, “Epidemic spreading on complex networks with general degree and weight distributions,” Physical Review E, vol. 90, no. 4, p. 042803, October 2014.
- [31] J. Mossong, N. Hens, M. Jit, P. Beutels, K. Auranen, R. Mikolajczyk, M. Massari, S. Salmaso, G. S. Tomba, J. Wallinga et al., “Social contacts and mixing patterns relevant to the spread of infectious diseases,” PLoS Med, vol. 5, no. 3, p. e74, March 2008.
- [32] N. G. Davies, P. Klepac, Y. Liu, K. Prem, M. Jit, and R. M. Eggo, “Age-dependent effects in the transmission and control of COVID-19 epidemics,” Nature medicine, vol. 26, no. 8, pp. 1205–1211, June 2020.
- [33] D. Wang, B. Hu, C. Hu, F. Zhu, X. Liu, J. Zhang, B. Wang, H. Xiang, Z. Cheng, Y. Xiong et al., “Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China,” Jama, vol. 323, no. 11, pp. 1061–1069, February 2020.
- [34] S. E. Eikenberry, M. Mancuso, E. Iboi, T. Phan, K. Eikenberry, Y. Kuang, E. Kostelich, and A. B. Gumel, “To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic,” Infectious Disease Modelling, April 2020.
- [35] “FDA Briefing Document Moderna COVID-19 Vaccine,” https://www.fda.gov/media/144434/download, December 2020.
- [36] F. P. Polack, S. J. Thomas, N. Kitchin, J. Absalon, A. Gurtman, S. Lockhart, J. L. Perez, G. Pérez Marc, E. D. Moreira, C. Zerbini et al., “Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine,” New England Journal of Medicine, vol. 383, no. 27, pp. 2603–2615, December 2020.