A methodological approach for modeling the spread of disease using geographical discrete-event spatial models

Abstract

The study of infectious disease models has become increasingly important during the COVID-19 pandemic. The forecasting of disease spread using mathematical models has become a common practice by public health authorities, assisting in creating policies to combat the spread of the virus. Common approaches to the modeling of infectious diseases include compartmental differential equations and cellular automata, both of which do not describe the spatial dynamics of disease spread over unique geographical regions. We introduce a new methodology for modeling disease spread within a pandemic using geographical models. We demonstrate how geography-based Cell-Discrete-Event Systems Specification (DEVS) and the Cadmium JavaScript Object Notation (JSON) library can be used to develop geographical cellular models. We exemplify the use of these methodologies by developing different versions of a compartmental model that considers geographical-level transmission dynamics (e.g. movement restriction or population disobedience to public health guidelines), the effect of asymptomatic population, and vaccination stages with a varying immunity rate. Our approach provides an easily adaptable framework that allows rapid prototyping and modifications. In addition, it offers deterministic predictions for any number of regions simulated simultaneously and can be easily adapted to unique geographical areas. While the baseline model has been calibrated using real data from Ontario, we can update and/or add different infection profiles as soon as new information about the spread of the disease become available.

Keywords

Theory and methodology DEVS methodology

1. Introduction

Bacteria, viruses, fungi, and parasites are all infectious agents, capable of infecting many people and causing mass fatalities. In December 2019, a novel coronavirus named SARS-COV-2, which produces a disease known as COVID-19, emerged and became a global pandemic within the span of 1 month. Never in the history of our species have pathogens such as SARS-COV-2 been able to travel the entire globe at a rapid pace. Actions were taken by governments to restrict international travel and encourage the social isolation of citizens, but all precautions proved insufficient to prevent the pandemic. The cumulative global fatality count due to COVID-19 infection exceeds 5.8 million people as of February 2022,¹ resulting also in significant economic impacts.

The COVID-19 pandemic highlights the importance of studying and combatting the spread of pathogens so that their effect on daily life may be controlled. If the mechanisms of disease spread are understood, then efficient interventions may be implemented to contain local pandemics before they spread globally, and in the event, they do become global to prepare for the impacts. In particular, the study of pandemics and pathogens can be used to develop predictive models of how particular infectious agents may spread in specific geographical environments over time, which can assist governments and health authorities in planning and implementing containment policies.

Despite the efforts of health agencies, many countries are still coping with new waves of COVID-19 cases, often associated with variants of concern, are typically reaching higher peaks than the first wave in 2020.^2,3 One difficulty with the control of the pandemic of COVID-19 is due to the asymptomatic carriers—that is, those individual not showing symptoms and being unaware that they are infected. As these individuals do not take precautions, they may spread the disease to at-risk individuals unknowingly.⁴ The proportion of asymptomatic carriers can be as high as 80%.^5–8 In December 2021, a new variant of COVID-19, Omicron was discovered in over 89 countries. Omicron spread throughout the world faster than any other variants of concern. Studies were performed in sub-Saharan Africa to study the new variant, one of these studies found that the Omicron variant had a much higher rate of asymptomatic carriage compared to other variants of concern. The study found many of these asymptomatic carriers had high nasal viral loads, suggesting these asymptomatic carriers may have been a major factor in the rapid spread of Omicron.⁹ Due to these issues, short of regular testing of the population (which did not prove overly effective in the few countries attempting it), modeling and simulation showed to be helpful for estimating the spread of the disease, including the prevalence of asymptomatic carriers.¹⁰

This is not the first time that organizations and governments use mathematical models and simulations to forecast and prevent the spread of infectious diseases. Models have been used to predict the spread of serious pathogens such as Ebola and HIV,^11,12 and health agencies have long relied on predictive models to estimate future trends and to assess the potential effectiveness of various disease control methods.¹³ One of the most widely used models uses compartmental differential equation models, describing how individuals in different states of infection interact and influence one another. These compartmental epidemiological models, introduced by Kermack and McKendrick,¹⁴ classify the population in individuals Susceptible to the disease, Infected (i.e. can transmit the disease), and Recovered (SIR). Modifying the parameters of the model, public health officials can investigate how much a disease might evolve depending on the measures put in place (such as lockdowns, mandatory quarantines, and physical distancing). With the emergence of the SARS-CoV-2 virus, these models have become relevant, being used to track and monitor the potential spread of the disease.

Often missing from the differential equation approach to modeling is the consideration of how populations are distributed over physical space, and how they travel between geographical regions with unique characteristics. Such so-called geographical models allow for geolocated simulations of various phenomena; their resolution typically ranges from continental models to city neighborhoods.¹⁵ For instance, each component in city areas can use its own defined population and characteristics, allowing for an accurate representation of a given geography. Geographical modeling can determine which regions are most impacted by a given infection. These insights can help create improved public health measures.

A popular method to model spatial distribution of the disease is cellular automata (CA), which has been widely used in recent years to investigate the spatial dynamics of disease spread.^16–18 CA divides physical space into a grid representation of cells, with rules of interaction between adjacent or neighboring cells. This division is traditionally a two-dimensional (2D) uniform square grid, where cells are related to one another in a uniform neighborhood pattern. However, the geography in which populations reside is rarely uniform, contradictory to the use of regular neighborhood patterns in traditional CA.

Considering these issues, this research discusses the definition of new methods to represent the spread of COVID-19, with the objective of predicting the spread of COVID-19 in distinct geographical areas using a geography-based model. Our research considers the population of each geographical area in a non-uniform neighborhood topology, where each cell population is divided into age groups, each of which may have different infection characteristics and behaviors. We use groups as an example of a population division where the behavior for transmitting the disease differs. In other scenarios, age groups could represent people with different levels of education, social status, wealth, and so on.

More specifically, the contribution of this paper is introducing a new methodology for modeling disease spread using geographical models, Cell-Discrete-Event Systems Specification (DEVS), and the Cadmium JavaScript Object Notation (JSON) library. We demonstrate how these methodologies and tools can be used by developing different versions of SIR-type compartmental models that consider geographical-level transmission dynamics (e.g. movement restriction or population disobedience to public health guidelines), the effect of asymptomatic population, and vaccination stages with a varying immunity rate.

Our approach provides an easily adaptable framework that allows rapid prototyping and modifications. We show how the model can be quickly revised as new disease information is discovered (for instance, the change of infectivity by new variants like Omicron). Our implementation allows users to run the model in user-specified geographical regions, to visualize how COVID-19 might spread through a city, town, or country. For example, Ralli et al.⁸ show the negative effects of asymptomatic COVID-19 infections within homeless shelters. Our model could be used to geographically locate homeless shelters within specific neighborhoods and simulate the effect of homeless shelters on surrounding neighborhoods.

Section 2 explains the foundations of the SIR model, CA, the Cell-DEVS formalism, and irregular geography-based cell spaces. Using this knowledge, section 3 defines a baseline Cell-DEVS model and gives simulations of the model in the geography of Ontario, Canada. Section 4 gives an extension of the baseline model to accommodate the effect of asymptomatic infection carriers. Section 5 gives a further extension of the baseline model in which a two-dose vaccination schedule is implemented, and finally, this paper concludes in section 6.

2. Background

Mathematical disease models date back to the 17th century, where Bernoulli published the first formal disease spread model which assessed the effect that variolation to smallpox could have on average life expectancy in France.¹⁹ In 1927, Kermack and McKendrick¹⁴ defined the SIR model using differential equations. This model remains the basis of many infectious disease spread models used at present because of its success in describing the spread of disease. In this section, we will discuss the related work used as a basis for our model specifications.

2.1. SIR models

Following Ross²⁰ and Ross and Hudson,²¹ Kermack and McKendrick¹⁴ defined a model that classified a given population into three “compartments”: SIR. They defined how individuals within a population could move from one compartment to another over time. The SIR model divides each member of a population into one of three categories: susceptible (S), infected (I), and recovered (R). Susceptible individuals are vulnerable to the pathogen and will become infected if they have sufficient contact with infected people. Infected people spread the disease to susceptible individuals through their daily contacts. Recovered people have overcome the disease and have permanent immunity (or died) and are no longer infectious.

The amount of people in the states S, I, and R over time, can be expressed as a function of time, and as a fraction of total population (N). For example, s(t) = S(t)/N represent the fraction of susceptible people at time t. At time zero (and therefore, at any other time), the sum of these fractions must be 100% (i.e. s(0) + i(0) + r(0) = 1).

By making some assumptions about the behavior of the population, the model is completed by the set of differential equations (Equation (1)), which describe the rate of change between states over time, typically measured in days:

{\begin{matrix} \frac{d s}{d t} = - λ s (t) i (t) & (1 a) \\ \frac{d i}{d t} = λ s (t) i (t) - γ i (t) & (1 b) \\ \frac{d r}{d t} = γ i (t) & (1 c) \end{matrix}

(1)

Two new parameters are introduced in the set of equations (Equation (1)) to model disease characteristics: λ (the virulence of the disease), which describes the level of infectious contact that Infected people have with Susceptible people, and γ (the recovery rate), which defines the rate at which Infected people become Recovered each day. The change in the Susceptible population is described in Equation (1a) as being proportional to s(t), i(t), and the virulence rate of the disease. The negative sign indicates that this is always a net decrease in the number of people in the Susceptible population, as members of the population transition to Infected. Equation (1b) describes the change in the Infected population as the addition of new infections (λs(t)i(t)) and the subtraction of new recoveries (γi(t)). The daily change in the Recovered population is given in Equation (1c) as the product of the Infected population i(t) and the recovery rate γ. All members of the population are also assumed to have a consistent number of daily contacts with others, implied in λ, no matter their infection state. These assumptions serve to simplify the model, but certainly decrease the accuracy of predictions. In addition, since the sum of s(t), i(t), and r(t) is 100% at time zero and the model assumes there is no birth, death, or migration in the population, that condition will be satisfied at any time.

The typical trajectory of an SIR model pandemic is shown in Figure 1, where the curves shown are the values of s(t), i(t), and r(t) in percentage as defined in Equation (1). The trajectory described by the SIR model is predictable: a population begins as 100% Susceptible before a pathogen spreads, the Infected then rises exponentially as the pathogen spreads and infects the population, and the number of Recovered increases over time as in the graph of Figure 1. The exponential growth of the Infected is slowed by the decreasing number of Susceptible that may become Infected, as the Recovered state is terminal.

Figure 1.

SIR model trajectory.

Kermack and McKendrick’s work defined the framework and mathematics that SIR-type models continue to follow today. The model assumes that every person behaves identically, having a constant number of daily contacts at random, regardless of any person’s infection status. These assumptions make such a simple model impractical for predicting the real world. To increase the accuracy of this model, more infection states can be added. This standard SIR model has evolved over the years to incorporate more advanced disease spread rules and more compartments. The simplest of these evolutions is the SIRD model which incorporates a Deceased (D) compartment which models terminal illness and the permanent removal of individuals from the population and includes death factors and fatality rates.²² SEIRD models add an Exposed (E) state which describe the latency period preceding contagious infection used as a transition from Susceptible to Infected.²³ Over time, these models became significantly more advanced, having different, complex compartments such as quarantined, which describe a period of reduced human contact while in a phase of infection;²⁴ Asymptomatic states (A) which define individuals who are not aware of their infection but are contagious; Hospitalized,²⁵ Diagnosed²³, and among others. The infection states that do vary over time in the real world (Exposed, Infected, Recovered) can also be modeled with sub-states that represent time-varying disease characteristics in sequential days of infection. Real-world data can then be used to profile how the infection behaves on average per infection day, from initial exposure to eventual recovery or fatality. Populations in the Exposed state can transition to the Infected state according to a statistical curve of the pathogen’s incubation time. Each day of the Infected state can have variable contagiousness, probability of recovery, and probability of fatality. A state transition diagram of an SEIRDS model is given in Figure 2. Adding a second S to the end of the model type name indicates that the Recovered population becomes Susceptible again after several days spent in the Recovered state.

Figure 2.

SEIRDS state diagram.

The spread of COVID-19 using non-geographical SIR models has been successfully predicted in individual cities and countries. Caccavo²⁶ describes an SIRD model that correctly estimates the spread of COVID-19 in China and Italy. An SEIIR model presented in Danon et al.²⁷ having two distinct phases of Infected was shown to predict the COVID-19 outbreaks in Wales and England. Similar compartmental differential equation models have accurately predicted the spread of COVID (as discussed in detail in the next section). However, not may offer specific estimations of spatial disease spread dynamics.

2.2. Asymptomatic infection and SIR-type models

In medicine, an asymptomatic patient is one that tests positive for a disease but shows no symptoms.²⁴ Asymptomatic carriers can shed the disease to those around them, but at a slower rate than those that are symptomatic.⁴ The main issue is that asymptomatic carriers do not know they have the disease; thus, they may not follow the same procedures as someone who knows they are infectious. For example, someone who has a cough may cover their mouth to protect those around them; however, if they did not have any noticeable symptoms, they will spread the disease unknowingly.⁸

The asymptomatic effect has caused problems in tracking and planning for many diseases including COVID-19.^5–7 The proportion of asymptomatic infections that make up the COVID-19 pandemic has been widely debated, and ranges between 4% and 80%.^4–8 The problem with these studies is how to collect and validate their data, which is more complex considering the asymptomatic carriers.

Several studies have explored the integration of the asymptomatic state in disease spread modeling. For example, Sen and Sen²⁸ proposed an SIARD (Susceptible, Infected, Asymptomatic, Recovered, Dead) and an SQIARD model (where Q is the Quarantine state). The SIARD model uses a simple transition from the Susceptible state to the Infected or Asymptomatic state using a given infectious rate. The SQIARD model incorporated the Asymptomatic state as a transition from the Quarantine state. The model splits the population that moves from the Quarantine state to the Asymptomatic or Infected state using specific rates. This model provides results that resemble real-world case counts in different countries such as China, India, Italy, and the United States. Giordano et al.²⁵ proposed an advanced SIDARTHE model, which also includes states for Diagnosed (D), Ailing (A), Recognized (R), Threatened (T), Healed (H), and Extinct (E) individuals. Asymptomatic individuals are added using multiple disease subcategories under one state: Asymptomatic is a subcategory of the Infected state. When an individual moves from Susceptible to Infected, they can become Asymptomatic, Infected, or undetected. The Asymptomatic individuals will then move to either Diagnosed, Ailing, or Healing. If an Asymptomatic individual becomes detected, they are considered diagnosed Asymptomatic. Asymptomatic individuals who move to Ailing develop symptoms and become undiagnosed symptomatic, and those who move to Healing will recover from the infection. The proportion of individuals who move to each state is defined by the states specific transition rate, that is, those who move from Asymptomatic to Ailing is denoted by the probability for a host to develop symptoms. The authors also remark the importance of those who are asymptomatic or undetected as they will not be isolating like those who are known infectious.

Barman et al.²³ presented an SEAIRD model that showed a similar transition method as those described previously. They use an Asymptomatic state where an asymptomatic rate α is defined to split the infected population into Infected (I) or Asymptomatic (A). Susceptible individuals can become exposed to the virus (E) or remain in the susceptible (S) state. Exposed individuals can become asymptomatic (A = αE) or infectious (I = α(1 − E)). The model showed how asymptomatic cases can affect the rate and magnitude of new infectious cases in a population. The authors also described a more advanced model called the SEAIRD-Control model where a Quarantine state and a Hospitalization state have been introduced. Although our focus is on the influence of asymptomatic cases, these extra states (Quarantined, Hospitalized, etc.) can be considered as future additions to our model.

None of the models above include the geographical aspect of the disease where the relationship between two neighborhoods impacts how the disease can spread. The model described in this paper addresses this idea and incorporates geographical attributes in disease transmission dynamics.

2.3. CA pandemic models

Traditional SIR models have no consideration of physical space, and therefore, geographical spread of pathogen. CA can be used together with differential equations to create a spatial pandemic model. CA is a mathematical formalism that describes an N-dimensional space of adjacent cells that contain values that can change over time. Any number of dimensions may be represented in CA, but the most practical models are in either two or three dimensions. The value of each cell is computed by the same set of transition rules executed in discrete time steps, which describe how the value of neighboring cells affects each other.

SIR-based discrete time CA models have widely studied, including to study the spatial spread of COVID-19. Medrek and Pastuszak¹⁸ described a CA SEIR model that uses the age group distributions and population counts of regions to stochastically generate heterogeneous cell spaces. The model includes geographical considerations in creating the cell spaces and uses a regular neighborhood pattern. A probabilistic SEIQR CA defined in Ghosh and Bhattacharya¹⁷ use similar methods of cell space generation based on population density and variation to define a cell space representing a country. The authors use a genetic algorithm to find the optimal disease parameters based on what is known from the epidemiological data of near areas.

A limitation of traditional CA is that it may not provide realistic neighborhoods when modeling large and complex geographical areas. Using real geographical data to define cell spaces is preferred to hypothetical scenarios because the model results are verifiable to a degree and offer more useful results to decision-makers. The research in Tobler²⁹ gives a foundation of cellular geography, including the description of geographical regions as cells in both regular and irregular neighborhood patterns. Zhong et al.¹⁶ defined a geographical SIRS CA using irregular geography-based cell spaces, where a cell’s neighborhood is defined by the amount of border length shared with other regions. This neighborhood definition allows cells to have unequal influences on one another, based on the shape of their 2D geographical borders. Cárdenas et al.³⁰ introduced a similar SIR model with unequal border lengths using the Cadmium Discrete-Event System Specification simulation library (the software used to implement the model presented in this research).

The question of how to best model geographical relations is difficult. A simple guiding heuristic principle of geographical relation is every geographical area is related to every other geographical area, but near areas are more related than distant areas.³¹ Using this principle, a correlation factor can be calculated to represent the level of relation or connection populations have between a pair of geographical areas. The simplest possible correlation factor is represented as a binary state, either indicating that no correlation exists (zero) or a correlation does exist (one). More complex correlation factors can be derived based on geographical features and population sizes. For instance, when working with irregular topologies in which each cell represents a geographical region, we can compute a geographical weight $w_{i, j}$ between regions i and j as the length of the shared border between them, divided by the perimeter of the region represented by cell i. Alternatively, if we have enough data about the area of interest, $w_{i, j}$ could depend on the average mobility of people from cell j to cell i. Figure illustrates how geographical data can be processed to generate cellular models that support irregular topologies. For each geographical region in Figure 3(a), a cell with the same identification number is created in Figure 3(b). The arrows between cells represent the cell neighborhoods and their directionality of influence. The color of the arrows interconnecting cells indicates the degree correlation factor.

Figure 3.

Irregular cell-DEVS schematic: (a) Real Map and(b) Irregular Cell-DEVS equivalent.

Cell i is usually a neighbor of itself, and its autocorrelation factor $w_{i, i}$ is usually close to 1 (in Figure 3(b), these autocorrelation arrows are black). Correlation factors can be directional (i.e. $w_{i, j} \neq w_{j, i}$ if $i \neq j$ ). For example, $w_{3, 4}$ is high, as people of region 4 usually move to region 3. However, $w_{4, 3}$ is close to 0 (in the figure, the arrow’s color is light gray). It is also possible that cell i belongs to the neighborhood of cell j, but cell i does not consider cell j as a neighbor (this is the case between cells 2 and 4 in Figure 3(b)).

As SIR-based models track infection of a geographically dispersed population, its correlation factors should represent the flow of population between geographical areas.

A correlation factor c_ij based on shared border between two cells i and j can be calculated as follows:

{\begin{matrix} w_{i j} = \frac{z_{i j}}{l_{j}} & (2 a) \\ w_{i j} = w_{ji} = \frac{\frac{z_{i j}}{l_{i}} + \frac{z_{ji}}{l_{j}}}{2} & (2 b) \\ c_{i j} = w_{i j} & (2 c) \end{matrix}

(2)

Equation (2a) describes a weight w_ij between two cells i and j based on their shared border length z_ij and the total border length l_j. This weight is not identical in both directions of correlation between the two cells. Equation (2b) defines a weighting factor for cells i and j that are identical in both directions of correlation. Equation (2c) states that the correlation factor is equal to the weighting factor in this case, and no additional computation is required to scale w_ij and w_ji to the range of 0–1.

The correlation factor of Equation (2c) can be used to describe how strongly a Susceptible population of one cell interacts with the Infected population of another cell. Cell-DEVS neighborhoods can be defined by Equation (2c) instead of by predetermined neighborhood patterns, where a cell is in another cell’s neighborhood if there exists a correlation factor between them that is greater than 0. A neighborhood defined this way allows cells to have any number of neighbors. We have used this method of neighborhood construction and shared border length correlation factors.

Cárdenas et al.³⁰ define an SEIRDS with more details in the infection profile, built using new simulation methodologies and tools. The model described here builds on this, including vaccinated states, and in turn, immunity rates, which affects the rate of exposures and thus infections.

We define an SEIRD cellular model that predicts the state of infection of a group of cells over time, where the state variables S, E, I, R, and D describe the percentage of total cell population having that state at a given time t. We further specify some of these states with time-varying behavior by giving infection characteristics that vary per day spent in that state. For instance, the length of the Exposed state is T_E days, where each sequential day incurs a higher probability of transitioning to Infected. Infected individuals who have caught the disease are contagious for T_I days, and have time-varying contagiousness. Recovered individuals have overcome the disease and are indefinitely immune. In a model that includes reinfection, the Recovered population would have immunity from the disease for a period of T_R days before becoming Susceptible again. The Deceased state describes the population that dies because of infection but do remain in the original population count.

The cell space of the SEIRD CA model is formed using 2D geography-based neighborhoods, consisting of a set of cells M = (m₁, m₂,…, mi) each with population N_i, where the subscript i is the cell’s index. Each cell i has a unique neighborhood definition V_i that is defined as a set of cell names (neighbors) and their correlational weights, V_i = {(m_i, 1), (m_k, c_ik), …, (m_x, c_ix)}, where a cell’s autocorrelation factor is 1. The Exposed and Infected states of this model include time-varying characteristics per day spent in the state, where each day within a state is referred to as the phase q of the state. The population in a specific phase q of Exposed at time t can be further specified as E_i^t(q), and the population of a specific phase of Infected can be specified as I_i^t(q). Each cell population N_i is divided into age groups, specified by the subscript a, which describes the proportion of total cell population N_i that are members of age group a. Dividing each population into age groups allows the incorporation of the age-specific factors that influence human behavior and disease outcomes. A summary of this notation is given in Table 1, and will be used throughout this paper.

Table 1.

SEIRD CA cell and state notation.

Parameter	Symbol
Cell specifier index	i, j
Time index	t
Neighborhood of cell i	V_i
Age group index	a
Phase index of state	q
Susceptible population of cell i in age group a at time t	$S_{i, a}^{t}$
Exposed population of cell i in age group a at time t, with phase q	$E_{i, a}^{t} (q)$
Infected population of cell i in age group a at time t, with phase q	$I_{i, a}^{t} (q)$
Recovered population of cell i in age group a at time t	$R_{i, a}^{t}$
Deceased population of cell i in age group a at time t	$D_{i, a}^{t}$

SEIRD: Susceptible, Exposed, Infected, Recovered, Deceased; CA: cellular automata.

The probability that an Exposed individual enters the Infected state is controlled by the incubation rate ε(q). The probability that an Infected individual becomes Recovered is controlled by the recovery rate γ(q). The probability that an Infected individual enters the Dead state from each phase of Infected is controlled by the fatality rate δ(q). The contagiousness of individuals in each phase of Infected is given by the virulence rate λ(q), which approximates the epidemiological R_o of the disease over the phases of Infected. The rate at which individuals make potentially infectious contact with others is described by the mobility rate µ(q), which models the difference in movement behavior between age groups. Further definitions of parameter symbology are given in Table 2.

Table 2.

SEIRD CA cell and state notation.

Parameter	Symbol
Cell i’s set of neighbors, where a neighborhood relationship consists of cell name and correlational weight. Index j iterates through i’s neighbors in V.	$V$
Cell i’s set of age group proportions.	$A$
Proportion of cell j’s total population of age group b ([0,1]). Index b iterates through A.	$a_{j, b}$
The incubation rate of age group a in phase q of Exposed.	$ε_{a} (q)$
The virulence rate of age group a in phase q of Infected.	$λ_{b} (q)$
The mobility rate of age group a in phase q of Infected.	$μ_{b} (q)$
The recovery rate of age group a in phase q of Infected.	$γ_{a} (q)$
The fatality rate of age group a in phase q of Infected.	$δ_{a} (q)$
The geographical correlation factor between cell’s i and j.	$c_{i j}$
Number of phases in the Exposed state.	T_e
Number of phases in the Infected state.	T_i

SEIRD: Susceptible, Exposed, Infected, Recovered, Deceased; CA: cellular automata.

The local transition function of the discrete time SEIRD model is defined by Equations (3a)–(3g):

{\begin{matrix} E_{i, a}^{t + 1} (1) = S_{i, a}^{t} • [\sum_{j \in V} (c_{i j}) • [\sum_{\begin{matrix} b \in A \\ q \in {1, 2, \dots, T_{i}} \end{matrix}} (a_{j, b} • μ_{b} (q) • λ_{b} (q) • I_{j, b}^{t} (q))]] & (a) \\ E_{i, a}^{t + 1} (q) = \begin{matrix} (1 - ε_{a} (q - 1)) • (E_{i, a}^{t} (q - 1)) \\ with q \in {2, 3, \dots, T_{e}} \end{matrix} & (b) \\ I_{i, a}^{t + 1} (1) = E_{i, a}^{t} (T_{e}) + [\sum_{q = 1}^{T_{e} - 1} (ε_{a} (q) • E_{i, a}^{t} (q))] & (c) \\ I_{i, a}^{t + 1} (q) = \begin{matrix} (I_{i, a}^{t} (q - 1)) • (1 - γ_{a} (q - 1) - δ_{a} (q - 1)) \\ with q \in {2, 3, \dots, T_{i}} \end{matrix} & (d) \\ R_{i, a}^{t + 1} = R_{i, a}^{t} + (I_{i, a}^{t} (T_{i}) • (1 - δ_{a} (T_{i}))) + [\sum_{q = 1}^{T_{i} - 1} (γ_{a} (q) • I_{i, a}^{t} (q))] & (e) \\ D_{i, a}^{t + 1} = D_{i, a}^{t} + [\sum_{q = 1}^{T_{i}} (δ_{a} (q) • I_{i, a}^{t} (q)) \sum] & (f) \\ S_{i, a}^{t + 1} = 1 - [\sum_{q = 1}^{T_{e}} (E_{i, a}^{t + 1} (q))] - [\sum_{q = 1}^{T_{i}} (I_{i, a}^{t + 1} (q))] - R_{i, a}^{t + 1} - D_{i, a}^{t + 1} & (g) \end{matrix} (3)

(3)

Equation (3a) describes how the Susceptible population of cell i interacts with neighboring Infected populations of all age groups, calculating the new Exposed. The first Exposed phase population (q = 1) is equal to the product of the Susceptible population, the value c_ij ∈ [0,1], and the Infected in each phase q considering the virulence and the mobility of infected individuals in phase q. The geographical correlation factor between regions i and j (c_ij) describes the amount of population interaction between cells i and j. The A symbol represents the set of age groups within cell j. Equation (3b) describes the population in all Exposed phases except the first. The population in all subsequent Exposed phases is the Exposed population in the previous phase multiplied by 1 minus the incubation rate of that phase. Equation (3c) describes the newly infectious population in Infected phase 1. The population in Infected phase 1 is equal to the population leaving the Exposed state, where all individuals who reach the final Exposed phase become Infected on the next time step. Equation (3d) states that the Infected population in each subsequent phase of Infected is equal to the Infected of the previous phase, minus the recoveries and deaths that occur during that phase. Equation (3e) states that the Recovered population is equal to the Recovered population of the previous time step, plus the sum of all new recoveries from each phase of Infected, plus the entirety of all Infected that reach phase T_i and do not die. The Dead Equation (3f) states that the total deaths within a cell are equal to the deaths of the previous time step plus all new deaths occurring due to each phase of Infected. The Susceptible Equation (3g) states that the sum of all infection states within an age group is always equal to one. The sum of all age group proportions in A is also always one.

2.4. Geographical Cell-DEVS models

As discussed earlier, our research focuses on defining advanced geographical models that consider the population of each geographical area and uses a non-uniform topology including non-uniform cells divided into age groups each of which may have different infection characteristics and behaviors. Traditional CA is not adapted to such kinds of representation. Instead, Cell-DEVS^32,52 allows the incorporation of these factors. The Cell-DEVS modeling methodology allows the definition of cell spaces based on the DEVS.³³ Cell-DEVS describes an n-dimensional cell space where each cell represents a DEVS atomic model. The cell space containing the n cells is defined as a DEVS coupled model where each cell is connected to its neighboring cells, as in Figure 4.

Figure 4.

Cell-DEVS model: (a) atomic cell schematics and (b) two-dimensional Cell-DEVS neighborhood.

When a cell receives an input, the local computing function τ is activated, which will compute the next state for the cell. This discrete-event approach only considers and computes active cells using a continuous time base. If there is a change in the cell’s state, the change is transmitted after a time delay d. In Figure 4(b), we can see how a cell (center) will transmit information to the neighboring cells using a von Neumann neighborhood. Cell-DEVS accepts other neighborhoods and irregular topologies as well (as discussed later). Compared to CA models, each cell executes asynchronously only when activity is detected, improving performance. Discrete-event models use a continuous time base, making it simpler to deal with a variety of time scales in the models. The delay functions also allow defining complex time specifications at the cell’s level, without relying in a centralized global clock which can introduce timing management complexities in the model.

Cell-DEVS inherits the modularity and hierarchical modeling ability of DEVS. This allows for models to better interact with other models, tools, data sets, and visualization tools, making it an easy and efficient method to build complex cellular models. The DEVS is a formalism used to model and simulate discrete-event systems. Cell-DEVS can be used to specify and implement cellular models and facilitates their simulation and integration with other models. In this research, we use the Cadmium tool,³⁴ which allows users to define model inputs using JSON, a data format to store and transmit large amounts of human readable data. JSON stores data in key-value pairs allowing for the simple representation of neighborhoods, their attributes, and their relationships. Cadmium allows the user to include complex geographical inputs that load into the model at runtime resulting in a flexible model that allows for efficient rapid prototyping. Rapid prototyping allows modelers to add new disease characteristics to existing models, adapting them to fit their evolving needs. These changes should be efficient and accessible to modelers. This approach also enables one to incorporate additional critical parameters not known when the modeling commenced. The Cadmium JSON library enables the user to load cells from a JSON format file, allowing the user to generate complex neighborhoods and state variables to be run as scenarios of the model. This project generates a Cadmium JSON cell space from geographical data and state information before using it as an input to the model. The 2016 Canadian census offers good data on populations per geographical regions and is used in generating the results of this model. The input JSON is generated by a python script, which parses geographical census data from input files and creates a cell space with neighborhoods corresponding to the geographical data, virus characteristics, and initial pandemic conditions. For each geographical area in the input data, a cell ID is created, a set of cell state variables are initialized, and the neighborhoods of each cell are constructed by a correlation factor.

SIR models have been enhanced through the use of complex topologies based on geographical information. Cárdenas et al.³⁰ described a model for the spread of infectious diseases among geographic regions. They describe how individuals can become mobile and be in contact with individuals in other regions, resulting in the spread of a disease across regions. They show how geographical information can complement the standard SIR model as well as lead to better, more defined results. Cárdenas et al.³⁰ described a geographical Cell-DEVS SIR model. Their model is based on Zhong et al.¹⁶ to simulate the spread of epidemics in a geographical-based 2D cell space. The model has described that at time t, a given cell (i, j) has a given population N_ij. Each cell stores the proportion of individuals in each state. The SIR model described in Zhong et al.¹⁶ and translated to Cell-DEVS in Cárdenas et al.³⁰ uses a geographical correlation factor defined by the shared boundaries as defined in the set of Equation (3). The correlation factor is a method the model uses to link two regions together to allow for interaction between their populations. This is not necessarily the most accurate method to model the flow of populations as it does not take other key factors into account. For example, it does not consider the density of population areas and workplace hubs such as downtown Ottawa. Adding transportation networks³⁵ or human movement and mixing models³⁶ shows how the addition of these factors would give more accurate results. However, the shared border length correlation method has been shown to produce accurate results, and it is easily adaptable for geographical simulations.¹⁶ Being able to quickly adapt a model to receive new data is crucial for rapid prototyping, and this model allows the user to change both the geographical level they are simulating as well as the disease characteristics. The equations for the correlation factor between neighborhoods (i.e. cells) are the ones shown and explained in Equation (2). The model developed in Cárdenas et al.³⁰ also includes parameters that define hospital capacities and lockdowns correction factors.

Cárdenas et al.³⁰ extend the geographical model described in Zhong et al.¹⁶ to incorporate deaths and the ability for a cell’s recovered populations to become re-infected.^37,38 Davidson and Wainer³⁹ further extend the model with the ability for a cell’s population to move from the Susceptible state to an Exposed state before becoming Infected, developing the SEIRD model presented in section 3 that was used as starting point for the SEAIRD model framework we propose in this research. When adding the Exposed state, the rate at which a cell’s population would move from Susceptible to Exposed remained unchanged. But, when the population moved from Susceptible to Exposed, a new value was considered, the incubation rate, ε. The Exposed state is an important addition to the model as it includes the variable onset in infectiousness seen in individuals. The model can simulate realistic incubation period of COVID-19, showing the time it takes for someone to be exposed to when they show symptoms. The problem with this is that not all individuals develop symptoms after exposure, thus the need for an asymptomatic state.

3. Baseline Cell-DEVS model

The model presented here is an evolution of a simple Cell-DEVS model first presented in Cárdenas et al.,³⁰ and is based upon a discrete time SEIRDS CA model.¹⁶

3.1. Methodology, software, and hardware

To define the model, we follow the Cell-DEVS formalism. We first define a Cell-DEVS SEIRD model as an extension of the model in section 2.3. The model is implemented using the Cadmium tool, a Cell-DEVS simulator implemented using C++ programing language. The model can be run in different operating systems including MacOS, Ubuntu, and Windows. The models presented in this paper were run on regular desktop and laptop computers with 16 GB RAM and Intel core i7. The simulation results were available in the order of minutes (between 2 and 10 min depending on the scenario), and the graphs were automatically generated using Python scripts. The Cell-DEVS model definition and implementation presented in this section becomes the foundation of SEAIRD and SEVIRDS models given in later sections.

3.2. Model definition

The SEIRD model includes new features, including a fatality rate modifier when high levels of infection are present in a cell’s population, and movement restriction effects that model public health lockdown policies. Parameters for the model are also chosen based on known characteristics of COVID-19 before using the model to generate simulation results.

Equation (3) was used to define the local computation function of the Cell-DEVS model, with direct substitutions to the fatality rate δ_a(q), and the geographical correlation factor c_ij to account for new functionality. To model the effect of an over-capacity health care system in times of high levels of infection, above a certain level of cell infection the baseline fatality rate is multiplied by a constant factor F (called the fatality rate modifier). The fatality rate modifier is greater than 1, while the product of F and the fatality rate does not exceed 100%. Equation (4) defines this modified fatality rate $F_{a} (q)$ as a function of the baseline fatality rate δ_a(q), F, and the cumulative sum of all Infected $\sum I_{i, t}$ :

F_{a} (q) = {\begin{matrix} δ_{a} (q) & if \sum I_{i, t} < hospital capacity threshold \\ δ_{a} (q) • F & if \sum I_{i, t} \geq hospital capacity threshold \end{matrix}

(4)

Equation (5) shows the cumulative sum of the proportion of population in each of the Infected phases across all age groups at a given time. Levels of infection within a cell that cause model parameters to change are also referred to as infection thresholds, such as the hospital capacity threshold of Equation (4). Equation (6) can be used to check whether a cell exceeds a target infection, always bounded by the range [0,1]:

\sum I_{i, t} = \sum_{\begin{matrix} b ∊ A \\ q ∊ {1, 2, \dots, T_{i}} \end{matrix}} [I_{i, b}^{t} (q)]

(5)

The concept of infection thresholds is also used to define movement restrictions. Instead of using a static geographical correlation factor c_ij as in Equation (2), new factors are introduced which decrease the amount of correlation between cells when the infection thresholds within each cell are exceeded. This serves to model the restriction of mobility both within a cell’s own population and the restriction of mobility between neighboring cells. An infection correction factor k_ij is now multiplied with c_ij to yield an infection dependent geo-correlation factor g_ij as in the following Equation (6):

g_{i j} = c_{i j} • k_{i j}

(6)

For each cell, infection thresholds and correlation modifiers are assigned to model each cell’s individual set of lockdown policies. Each correlation modifier, also known as a mobility restriction factor, is assigned a hysteresis value so that mobility restriction policies remain in place for some time after infections fall below the threshold at which they were triggered. The nth mobility restriction policy of cell i (K_n,i) is defined by the three variables: infection threshold (τ_n), mobility restriction factor (m_n), and hysteresis factor (h_n) in a map relationship as seen in Equation (7). Cell i’s set of lockdown policies can then be defined as a set of n mobility restriction policies, as seen in Equation (8):

K_{n, i} = {τ_{n} : (m_{n}, h_{n})}

(7)

K_{i} = {t_{1} : (m_{1}, h_{1}), t_{2} : (m_{1}, h_{2}), \dots, t_{n} : (m_{n}, h_{n})}

(8)

Each neighbor pair’s combined set of lockdown policies (K_i and K_j) are then used to calculate a k_ij between two neighboring cells. As individual cells always have a single lockdown policy in effect (K_n,i), the policy with the highest infection threshold that has been exceed between the two cells is selected for their interaction. In other words, the more restrictive policy in effect between two cells is always used in their correlation, as demonstrated by the edge case of a cell with closed borders (m_n = 0), where no infection should cross borders in either direction. This selection of a single K_n,i for interaction between cells i and j, demonstrated by Equation (9), yields the mobility restriction factor K_i,j that is used to calculate the infection correction factor k_i,j. Note that a cell’s interaction with itself simply selects the K_n,i with the highest infection threshold in the set K_i that has been exceeded:

K_{i j} = min (K_{n, i}, K_{n, j})

(9)

A public health agency recommending mobility restrictions is, however, not enough to guarantee that all members of the population follow them. The final piece of the infection correction factor is a variable that describes the proportion of population that do not follow any mobility restrictions in effect. The disobedience factor d describes the portion of a cell’s population that are unaffected by any K_i,j selected for cell interactions. The infection correction factor for the interaction between two cells i and j as used in Equation (6) can be calculated using the disobedience factor and the mobility restriction factor K_i,j selected for i and j as in Equation (10). The extended SEIRD Cell-DEVS model simply uses the set of Equation (3) with the substitutions of g_i,j in place of c_i,j, and $F_{a}$ (q) in place of δ _a (q):

k_{i j} = (d + (1 - d) • K_{i j})

(10)

The model parameters were defined using known disease data for COVID-19. The mean incubation rate of COVID-19 used was 5.4 days following a log-normal distribution.⁴⁰ This distribution was used to define both T_E = 14 and the incubation rate ε(q). The mean symptomatic infection length of COVID-19 was considered to be 10 days, and it was used to develop an infection profile of T_I = 12 days, where the recovery rate γ(p) controls the proportion that recover in each phase of Infected.⁴¹ The virulence and fatality rates of the model were varied to match infection case data during experimentation.

3.3. Model implementation

The extended model is implemented in Cadmium, defining atomic cell models of type geographical_cell coupled to form a cell space. Each geographical cell uses a cell ID (C), a state object that contains state variables (S), and a vicinity object that describes the structure of the neighborhood (V). The relationships between the classes involved can be seen in the UML diagram of Figure 5.

Figure 5.

Geographical cell model UML class diagrams.

The infection state variables S, E, I, R, and D computed in the local transition function are discretized per Equations (11a)–(11e). The precision divider P is used to discretize each state variable, and the use of square brackets denotes a round operation:

{\begin{matrix} {D S}_{i, a}^{t} = \frac{[{PS}_{i, a}^{t}]}{P} & (a) \\ {D E}_{i, a}^{t} (q) = \frac{[{PE}_{i, a}^{t} (q)]}{P} & (b) \\ with q \in [1, 2, \dots, T_{e}] \\ {D I}_{i, a}^{t} (q) = \frac{[{PI}_{i, a}^{t} (q)]}{P} & (c) \\ with q \in [1, 2, \dots, T_{i}] \\ {D R}_{i, a}^{t} = \frac{[{PR}_{i, a}^{t}]}{P} & (d) \\ {D D}_{i, a}^{t} = \frac{[{PD}_{i, a}^{t}]}{P} & (e) \end{matrix}

(11)

The local computation function implemented in the geographical_cell class is described by the pseudocode of Table 3. The algorithm begins by allocating a state object new in line 1, which will be used to store the new values of cell state variables. Lines 2 through 8 calculate the new state variables for each age group based on the cell’s current state object s and store the result in new. These six lines implement a cell’s local computation function at a high level (Equations (3a)–(3g)). The resulting state object is returned in line 9 to be written as the new cell state.

Table 3.

Geographical cell local computation function.

Input: none. Output: new state (struct seird)
1: new = seird() 2: for(each age group a in s.age_groups) 3: new.E.at(a) = new_exposed(a,s) 4: new.I.at(a) = new_infected(a,s) 5: new.R.at(a) = new_recovered(a,s) 6: new.D.at(a) = new_dead(a,s) 7: new.S.at(a) = 1 − new.E.at(a) − new.I.at (a) − new.R.at(a) − new.D.at(a) 8: endfor 9: return new

The method new_exposed() (line 3, Table 3) described in the local computation function is responsible for calculating the infective interactions between all neighboring cells. This method corresponds to Equation (3a) of the local computation function with the extensions defined in this section, including the calculation of the infection correction factor k_i,j of Equation (6). The calculation of new exposures for an age group b of a single cell i can be seen in the pseudocode in Table 4. The algorithm begins by calculating the movement correction multiplier of cell i, using the movement correction function which scans the set of movement restriction factors k_i and chooses the most suitable factor to apply. The infection correction factor g_i is calculated in line 2 by considering the proportion of disobedience d that will follow the movement restriction policy and the mobility restriction of the k_i selected. In line 3, the variable infStrength is declared and used to sum how much infective contact is made with cell i from each of its neighbors. We then iterate over all neighbors of cell i, summing the amount of infective contact made with all neighbors. This is done by first calculating the infection correction factor of the neighbor cell j, and then choosing the more restrictive correction factor between i and j to describe their interaction (lines 5–7).

Table 4.

New exposures algorithm.

Input: age segment b, current state si. Output: new exposed
1: µi = movement_correction(Ki, infected_i) 2: gi = d + (1 − d)µi 3: infStrength = 0 4: for(all neighbors j of cell i) 5: µj = movement_correction(Kj, infected_j) 6: gj = d + (1 − d)µj 7: gij = min(gi, gj) 8: for (all age groups a in cell j) 9: iSG = 0; 10: for (q in infected phases) 11: iSG += sj.λ(a,q)*sj.µ(a,q)*sj.I(a,q) 12: endfor 13:gij = kij*cij 14: infStrength+ = (Nj.at(a)/Nj)iSGgij 15: endfor 16: endfor 17: new_exposed = si.S.at(b)*infStrength 18: return new_exposed

Input: age segment b, current state si. Output: new exposed

1: µi = movement_correction(Ki, infected_i)
2: gi = d + (1 − d)*µi
3: infStrength = 0
4: for(all neighbors j of cell i)
5: µj = movement_correction(Kj, infected_j)
6: gj = d + (1 − d)*µj
7: gij = min(gi, gj)
8: for (all age groups a in cell j)
9: iSG = 0;
10: for (q in infected phases)
11: iSG += sj.λ(a,q)*sj.µ(a,q)*sj.I(a,q)
12: endfor
13:gij = kij*cij
14: infStrength+ = (Nj.at(a)/Nj)*iSG*gij
15: endfor
16: endfor
17: new_exposed = si.S.at(b)*infStrength
18: return new_exposed

Next, the algorithm calculates the infective potential of each age group in neighbor cell j by summing over all infection phases the proportion of population with phase q multiplied by their mobility rate (µ) and their infectivity rate (λ) (line 11). The total infective weight of the age group is then calculated in line 14 as a function of the proportion of population in that age group, the infective potential of the age group, the infection correction factor k_ij, and the geo-correlation factor c_ij. The algorithm concludes in lines 17 and 18 by multiplying the Susceptible population in age group b by the sum of infective strength of the neighborhood and returning the result.

The baseline model was validated and calibrated using real-world data of the province of Ontario, Canada, including 34 Public Health Units (PHUs), as shown in the map presented in Figure 6.⁴² The simulation results were compared to the reported case data in these regions for the period of 1 January 2020 to 2 February 2021. The case data used were reported by the Government of Ontario, in the form of confirmed new cases per date and their resulting outcome.⁴³

Figure 6.

PHUs of Ontario.⁴²

The SEIRD model predicts the number of active cases present at any time but does not report the data in the form of new cases per day. To calibrate and validate the model results, the confirmed cases were converted to active infections and cumulative fatalities over time. Each confirmed case was assumed to follow the 10 days mean infection length of COVID-19 when converting case reports to active cases. Fatal outcomes were still considered to produce contagious individuals with an active infection length of 10 days. The geographical boundary used in defining the cell space for the Ontario PHUs was provided in Ontario GeoHub.⁴⁴ The PHUs of Ontario were placed in five groups based on their location: Central East, Central West, Southwest, East, and North. A visualization of the case data over for these groups is shown in Figure 7. The default experimentation parameters used in experimentation as well as the specific structure of the cell state variables can be found at https://github.com/SimulationEverywhere-Models/Geography-Based-SEIRDS. We examined the effect of movement restriction policies in the Toronto PHU in terms of the movement correction factor’s infection thresholds, hysteresis values, and mobility multipliers. Without movement restriction policies to limit infection spread, most of the population in all 34 cells become rapidly infected in a severe trajectory.

Figure 7.

Ontario PHU Case Data from December 2020 to February 2021: (a) Central East; (b) Central West; (c) Southwest; (d) East; (e) North.

Figure 8 shows simulation data compared to the reported active cases. This demonstrates an infection threshold (k_i) that is too high, resulting in movement restrictions that are applied later than what is represented in the case data. The predicted active infections would have been more accurate had the infection threshold of the limiting policy been in effect at half the number of active infections.

Figure 8.

Toronto pandemic, movement restriction factor #1.

Using an infection threshold 50% lower than the simulation of Figure 9, the movement restriction policy comes into effect at a more accurate level of infection, as shown in Figure 9.

Figure 9.

Toronto pandemic, movement restriction factor #2.

However, these restrictions are lifted too soon at day 170 of the simulation, and a second wave begins while the case data indicate a continual fall of cases at this time. This indicates that not enough hysteresis was used in the dominant infection correction factor, and that the hysteresis factor should be increased such that the number of active cases will fall to a lower level before movement restrictions are lifted. A larger hysteresis factor results in a better match of the Toronto PHU infection, shown in Figure 10.

Figure 10.

Toronto pandemic, movement restriction factor #3.

After these preliminary tests, we conducted a series of simulations and compared the model results to the known case data to calibrate the parameters of the model. The virulence rate and infection correction factors were found to be the most influential determinants of model accuracy, which was improved by increasing the virulence rate above the default level.

While the first wave of the COVID-19 pandemic in Ontario was the strongest in Toronto, other regions further away also had significant levels of infection relative to their total population size, namely, the Ottawa and Windsor Essex PHUs. A single infection source beginning in the Toronto cell was insufficient in producing the first wave infection curves far away from Toronto, so a smaller amount of infection was added to the initial conditions of Ottawa and Windsor Essex which also had considerable levels of first wave infection. Figure 11 shows the results in the regions surrounding Toronto, Ottawa, and Windsor Essex.

Figure 11.

Pandemic in (a) Toronto; (b) Peel; (c) York; (d) Durham; (e) Ottawa; (f) Windsor Essex.

The simulations in these regions do track the general shape of the case data, but some predictions significantly overshoot case estimation. Toronto has the highest overall accuracy as the model parameters were initially developed to match Toronto before considering more regions. Every cell is using the same set of infection correction factors, which could be a significant source of error because each region’s population density and human characteristics are region specific by nature. In fact, in the case of regions directly adjacent to Toronto, the level of simulation error is proportional to the population density of each region. The population density of the Peel PHU is 25% that of Toronto. Similarly, the population density of York is 11.74% of Toronto, and Durham is 5.46% of Toronto. Given that Toronto has both the highest population density and the highest model accuracy, and that Durham has both the lowest population density and accuracy, a population density correction should be investigated in the future.

4. Extending the model: asymptomatic individuals

The validated baseline models presented in section 3 were extended to include asymptomatic individuals, and then incorporated into a geographical SEAIRD model that can be used for conducting experiments with different population scenarios as will be shown in this section. Each model shares a defined asymptomatic rate where a given proportion of the Exposed population move to either Infected or Asymptomatic. We use the methodology, software, and hardware explained in section 3.1.

4.1. Model definition

Our proposed geographical SEAIRD model is based on Cárdenas et al.³⁰ and Davidson and Wainer³⁹ by adding an Asymptomatic state (A) as depicted in the diagram in Figure 12, which shows that a cell’s population starts in a Susceptible state before becoming exposed. From there, the Exposed population will move to either Asymptomatic or Infected. If Asymptomatic, they will eventually become Recovered, but if an individual is Infected, they can move to either Recovered or Deceased.

Figure 12.

SEAIRD state diagram.

The dotted line in Figure 12 from Recovered to Susceptible shows how a population can become re-susceptible after recovery. Each transition is based on their defined time behavior and described using the delay function. Exposed, Infected, Asymptomatic, and Recovered states have a defined set of days that a population can be within the state described by T_e, T_i, T_ai, and T_r. Each state has a defined state transition that occurs at each day within the state. The days within each state set of days are described by q = {1, 2, …, T_state}. For example, $A_{i, a}^{t} (q)$ , describes the proportion of asymptomatic cases for the age group a in cell i at Asymptomatic state q = {1, 2, …, T_ai}, at time t.

Our model uses m unique geographical cells. The proportion of a population’s age group a found in each state is described by: $S_{i, a}^{t}, E_{i, a}^{t}, A_{i, a}^{t}, I_{i, a}^{t}, R_{i, a}^{t}, and D_{i, a}^{t}$ , where i is the cell being described at time t. The state transitions are built using the Cell-DEVS transition and delay functions, which implement Equations (11a)–(11l). Parameter definitions that vary from the Cell-DEVS SEIRDS model of section 3 are given in Table 5 to complement the location transition function equations.

{\begin{matrix} E_{i, a}^{t + 1} (1) = S_{i, a}^{t} • [\sum_{j \in V} (c_{i j}) • (\begin{matrix} k_{i j} • [\sum_{\begin{matrix} b \in A \\ q \in {1, 2, \dots, T_{i}} \end{matrix}} (a_{j, b} • μ_{b} (q) • λ_{b} (q) • I_{j, b}^{t} (q))] \\ + [\sum_{\begin{matrix} b \in A \\ q \in {1, 2, \dots, T_{ai}} \end{matrix}} (a_{j, b} • μ_{b} (q) • λ_{b} (q) • {Ai}_{j, b}^{t} (q))] \end{matrix})] & (a) \\ E_{i, a}^{t + 1} (q) = \begin{matrix} (1 - ε_{a} (q - 1)) • (E_{i, a}^{t} (q - 1)) \\ with q \in {2, 3, \dots, T_{e}} \end{matrix} & (b) \\ I_{i, a}^{t + 1} (1) = E_{i, a}^{t} (T_{e}) + [\sum_{q = 1}^{T_{e} - 1} (ε_{a} (q) • E_{i, a}^{t} (q) • (1 - φ))] & (c) \end{matrix}

Table 5.

SEAIRD model new definitions.

Parameter	Symbol
Asymptomatic population of cell i in age group a at time t, with phase q	$A_{i, a}^{t} (q)$
The asymptomatic infection rate	$φ$
The length of the Asymptomatic Infected state	T_ai

SEAIRD: Susceptible, Exposed, Asymptomatic, Infected, Recovered, Deceased.

{\begin{matrix} I_{i, a}^{t + 1} (q) = (I_{i, a}^{t} (q - 1)) • (1 - γ_{a} (q - 1) - F_{a} (q - 1)) & (d) \\ with q \in {2, 3, \dots, T_{i}} \\ A_{i, a}^{t + 1} (1) = E_{i, a}^{t} (T_{e}) + + [\sum_{q = 1}^{T_{e} - 1} (ε_{a} (q) • E_{i, a}^{t} (q) • φ)] & (e) \\ A_{i, a}^{t + 1} (q) = (A_{i, a}^{t} (q - 1)) • (1 - γ_{a} (q - 1)) & (f) \\ with q \in {2, 3, \dots, T_{ai}} \\ R_{i, a}^{t + 1} (1) = (I_{i, a}^{t} (T_{i}) + A_{i, a}^{t} (T_{ai})) + [\sum_{q = 1}^{T_{i} - 1} (γ_{a} (q) • I_{i, a}^{t} (q))] + [\sum_{q = 1}^{T_{ai} - 1} (γ_{a} (q) • A_{i, a}^{t} (q))] & (g) \\ R_{i, a}^{t + 1} (q) = R_{i, a}^{t} (q - 1) & (h) \\ with q \in {2, 3, \dots, T_{r} - 1} \cap Resusceptibility = false \\ R_{i, a}^{t + 1} (q) = R_{i, a}^{t} (q - 1) & (i) \\ with q \in {2, 3, \dots, T_{r}} \cap Resusceptibility = true \\ R_{i, a}^{t + 1} (T_{r}) = R_{i, a}^{t} (T_{r}) + R_{i, a}^{t} (T_{r} - 1) & (j) \\ with Resusceptibility = false \\ D_{i, a}^{t + 1} = D_{i, a}^{t} + [\sum_{q = 1}^{T_{i}} (F_{a} (q) • I_{i, a}^{t} (q))] & (k) \\ S_{i, a}^{t + 1} = 1 - [\sum_{q = 1}^{T_{e}} (E_{i, a}^{t + 1} (q))] - [\sum_{q = 1}^{T_{i}} (I_{i, a}^{t + 1} (q))] - [\sum_{q = 1}^{T_{ai}} (A_{i, a}^{t + 1} (q))] - [\sum_{q = 1}^{T_{r}} (R_{i, a}^{t + 1} (q))] - D_{i, a}^{t + 1} & (l) \end{matrix}

(12)

Equation (12a) is based on Equation (3a), and it is used to calculate the proportion of newly Exposed population. This is a result of the Susceptible population in contact with either the entire Infected population or the Asymptomatic population of neighboring cells j. The first sum in the second part of the equation calculates the proportion of a cell’s Susceptible population exposed to an infectious individual (I) and the second sum, the proportion exposed to an asymptomatic individual (Ai). The symbol A defines the set of age groups in cell j, where each age group is represented by b. Each cell’s population represented by the subscript b, which represents N_j divided into age groups. As before, each cell is related to its neighbor by a geographical correlation factor c_ij that describes the impact each neighboring cell has on a given cell, including virulence and mobility rates a given cell’s population has with its neighbors. Finally, k_ij defines a correction factor between cells i and j, applied to the infectious half of the equation to simulate different behavior for infectious and asymptomatic populations: we consider that asymptomatic individuals to be more carefree; thus, they will expose more individuals. The correction factor k_ij is defined using the models disobedience factor d, where k_ij = min(k_i,k_j). The correction for individual cells i and j is defined as k_cell = d + (1 − d)*m_c. The infection correction factor mc is defined in the model as a function of the infection threshold (I_TH) that triggers a specific mobility correction factor (cm) and a hysteresis level (H).

Equation (12b) describes how the Exposed population transitions to the Infected or Asymptomatic state. The equation defines the Exposed in phase q as equal to the exposed of the previous day multiplied by 1 − ε_a(q − 1), where ε_a (q − 1) defines the incubation rate for an age group a for state q − 1. The incubation rate defines the probability of the population moving to Infected or Asymptomatic. Equation (12c) describes the new Infected population that will occupy day 1. The equation considers the Exposed population from all phases and all age groups. As defined above in Equation (12b), a proportion of the Exposed population moves to Infected or Asymptomatic depending on the incubation rate ε_a. The rate at which the Exposed population becomes either Infected or Asymptomatic is defined by asymptomatic rate φ. Thus, for the case of new Infected population, the rate is defined as (1 − φ). Equation (12d) describes the portion of the Infected population that moves to the next phase. The infectious population for phase q equals the population of infectious in the previous phase, q − 1 minus the population who move to either Recovered or Deceased. The portion of the population that moves to the Recovered or Deceased states is defined by recovery rate γ and fatality rate $f$ , respectively.

Equations (12e)–(12f) define the asymptomatic state behavior following the same rules described in Equations (12c) and (12d). Equation (12e) defines the proportion of the exposed population that moves to the asymptomatic state (here, the asymptomatic population rate remains as φ). Equation (12f) follows the same rules defined when asymptomatic cases either move to the next phase q, recovered, or deceased.

Equation (12g) describes the proportion of infectious or asymptomatic populations that become recovered. The equation defines that the total number of recoveries is equal to the total number of recoveries from the previous day plus the newly recovered population. The current day recoveries are calculated by taking the proportion of infectious and asymptomatic infections that move to the recovered phase using rate γ. Finally, the equation checks for the population that is on the final day of either infectious or asymptomatic, if their population does not move to the deceased state, they are added to the recovered state. (Equations (12h) and 12j) are used only if re-susceptibility is not enabled. Once the Recovered population reaches the final day of recovery, they remain there for the rest of the simulation time. Equation (12i) is an equation only used when re-susceptibility is enabled, that is, patients who are recovered will go through each day of recovery, when they reach the final day of recovery, the population will move back into the susceptible population pool where they can be re-exposed.

Let us consider f_a(q) as the fatality rate of infected phase q for age group a; λ_a(q) their virulence; μ_a(q) their mobility rate; ε_a(q) the incubation rate; γ_a(q) the recovery rate and φ as the asymptomatic infection rate. As defined in section 3, c_ij is the geographical correlation factor between cells i and j; k_ij is the correction factor applied to both cells i and j to model movement restriction and disobedience. Equation (12k) is used to calculate the proportion of Deceased at a time t. The proportion of Deceased next day is the total of current Deceased plus the sum of the Infected population that died the day before. New Deceased is equal to the newly deceased population moving from the infectious state multiplied by the fatality rate. The Deceased transition does not consider asymptomatic infections as they do not lead to deaths. Equation (12l) is a “special equation” needed for the integrity of the model. Since we know that any given population starts in the susceptible state (excluding the starting cell), then the population that are not in any other state should remain susceptible.

4.2. Model implementation

The model is implemented as a coupled Cell-DEVS where the cell space represents a geographical region, and each cell (of irregular topology) is a district in the city/province. It relates to its neighboring cells using an irregular topology. Each cell consists of a cell ID, a set of state variables, a model configuration, and neighboring cell’s correlation factors.

We implemented these equations using the baseline model in section 3 and included the equations in this section. When all the geographical cells are defined, they are placed into a top level coupled cell model called geographical_coupled, with configuration as seen in Figure 13. At runtime, geographical_coupled is initialized using the cell’s data provided from in a JSON input file (using the methods described in the top model class cadmium::celldevs::cells_coupled <T,C,S,V>); Figure 13 shows how this coupled cell model is defined. At the bottom level of Figure 13, the three structures define the inputs to the geographical cells.

Figure 13.

SEAIRD coupled cell diagram.

The SEAIRD structure defined the state variables that will hold the population as well as the infection correction factors and the disobedience factor (Figure 14). The simulation configuration structure defines the attributes used to characterize the disease being modeled including recovery rates, fatality rates, and asymptomatic rates (Figure 15). The vicinity structure holds the information that defines the correlation factors between two cells (geographical_ cell). The three structures are read in at runtime to create the single parameterized model geographical_cell. The collection of geographical cells and their relationships define the geographical coupled model.

Figure 14.

SEAIRD configuration code.

Figure 15.

Simulation configuration.

Each cell contains the relevant information defined in the SEAIRD configuration file. At runtime, each cell has a unique population which is divided into described age groups. Each cell’s population will then be divided into one of the six states. If modeling the beginning of a pandemic, a single cell will hold the initial case(s) and the remaining cells will be 100% susceptible. At t = 0, the proportion of a cell’s population in each state is defined in by the values provided in the SEAIRD structure. Defining the SEAIRD structure at runtime with user-defined inputs allows users to choose the point of time they want to start a model (if they are interested in the middle of a pandemic, they can tailor the input values to hold the number of individuals in each state at that time).

In Figure 15, the simulation configuration is declared; these values are used to change the ways a population transferred from one state to another. These structures define the geographical_cell atomic model presented in Figure 13. A geographical_cell atomic model is defined for each geographical cell in the model, these cells make up a geographical_coupled model where each cell is connected by a correlation factor. Finally, this geographical_coupled model is defined by methods found in class cadmium::celldevs::cells_coupled <T,C,S,V>.

4.3. Simulation results

The SEAIRD results presented in this section are generated using source data from the 34 Ontario PHUs where the population is generated using census data.⁴⁵ A configuration file is built using a geopackage Geopandas⁴⁶ to determine shared boundaries (correlation factor) between each PHU. Figures 16 –19 were created using the graphing tools in Cárdenas et al.,³⁰ R,⁴⁷ and Plotly.⁴⁸ The implementation of the model is available at https://github.com/SimulationEverywhere-Models/Geography-Based-SEAIRDS. The results presented in this section show and compare the effect the asymptomatic state has when added to the simulation. The parameters used in this study are shown in Table 6.

Table 6.

Test case configuration.

Parameter	Value
Population	Varies per cell based on census data³²
Age groups	[0.216, 0.279, 0.268, 0.193, 0.044]³²
Disobedience	[0.29, 0.25, 0.23, 0.21, 0.24]^33,35
Asymptomatic rate	Varies per simulation (see Figures 6 –8)
Virulence	0.1 across all states and age groups
Incubation	14-day profile²⁶
Mobility rates	1.0 across all states and age groups
Recovery rates	0.07 across all states and age groups
Fatality rates	0.005 across all states and age groups
Infection correction factors (lockdown)	0.001:[0.60, 0.0008], 0.005:[0.50, 0.003], 0.01: [0.40, 0.005], 0.03: [0.30, 0.015], 0.08: [0.20, 0.0005], 0.15: [0.1, 0.08], 0.20: [0.01, 0.12]

Figure 16.

SEAIRD model (a) 0% Asymptomatic and (b) 80% asymptomatic.

Figure 17.

Single Cell SEAIRD (a) 0% Asymptomatic and (b) 80% Asymptomatic.

Figure 18.

SEVIRDS model infection state diagram.

Figure 19.

Baseline simulation of SEVIRDS model with vaccination disabled.

We start with the population of each geographical area. We then use a vector of values to represent the proportion of the total population in each age group. Age groups are an abstract set of values defined by the modeler; in our case, individuals 0–12, 1319; 20-44; 45–65, and over 65 years old. Disobedience follows the same format as age groups where the values in the vector represent the proportion of each age group that is disobedient to lockdowns. These values were estimated using data gathered from.^49,50 The asymptomatic rate is the proportion of the exposed population that become asymptomatic (the rest become infectious). The virulence rate represents the rate at which the disease spreads; the value represents the amount of age groups population in contact with the infected population per day of the infection. Incubation rate is the proportion of the exposed population that become infectious or asymptomatic, defined using a 14-day profile where each day a proportion of the exposed population will move to the next state.³⁹ Mobility rates define the freedom of the population to move (1.0 mobility rating means the population can move freely). Mobility rates are defined for each age group for each day of the infection; it is assumed that mobility rates are not restricted at all at the beginning of the pandemic. Recovery rates define the proportion of an age group infected population that will recover each day of the infection. Fatality rates define the proportion of an age group–infected population that will move to the deceased state on each day. Infection correction factors describe the proportion of the population that needs to be infected before a lockdown is put in place, described in Equations (7)–(11) where the mobility restriction factor reduces the mobility of a cell by the given value.

Virulence rates, recovery rates, fatality rates, and infection correction factors were informed by data gathered at Public Health Ontario.³ The values were evaluated and slightly modified; the final values shown in Table 6 provided the most accurate results in testing.

Figure 16(a) shows the simulation results with 0% asymptomatic cases. The results show the steady rise of exposed individuals (orange line), and then 1–14 days after their exposure, they become infected (red line). The results show how the initial wave rises and settles in a little over 350 days with approximately 8% of the population becoming infected. Next, we studied the effect of an 80% asymptomatic rate using the same parameters. Figure 16(b) shows a lower rate of infectious carriers and a higher rate of asymptomatic carriers.

With this higher rate of asymptomatic carriers, the total exposed population reaches a higher peak than it had without asymptomatic cases (approximately 110% more exposures occur). This higher exposed count is due to that the different asymptomatic and infectious carriers have on the susceptible population. Since the asymptomatic carriers travel more than the infectious carriers, more of the population are exposed to them, causing higher overall infections. The initial curve rises and settles in approximately 250 days; this is 100 days less than the model with no asymptomatic infections. This shows that the asymptomatic carriers expose the susceptible population at a much higher rate than the model showing no asymptomatic cases. We can see that approximately 14% of the population become asymptomatic carriers with an additional 4% being infectious carriers. Although in this case, we can see higher overall rates of infections, the asymptomatic infections are less lethal, leading to less deaths and more recoveries. We see this difference when analyzing a single neighborhood cell in Figure 17.

Figure 17(a) shows a single cell and how its population transitions through the states with a 0% asymptomatic rate. It should be noted the cell is still exposed to its neighboring cells. When examining the graph at day 133, we can see a cumulative exposed population of 7.1%, cumulative infectious population of 5.65%, cumulative fatalities of 0.4%, and since we have no asymptomatic infections, an asymptomatic infected population of 0%. If we then compare this to a graph showing the curves with an asymptomatic rate of 80% in Figure 17(b), we can see that at day 133 we have a cumulative exposed population of 27.7%, cumulative infectious population of 4.7%, cumulative fatalities of 0.02%, and an asymptomatic population of 19.4%. We can see that the exposed grown to 27.7% from 7.1%. This 20% increase can be accredited to the asymptomatic carriers spreading the disease to the surrounding neighbors at a higher rate than the infectious population. We can also see that we have fewer deaths, linked to the fewer infectious cases.

We can now simulate the effect an “invisible” population of disease carriers may have on a pandemic combined with the advantage of having a model that includes geographical aspects. If 80% of total COVID-19 cases are asymptomatic and the case counts show 5% of a population are infected, we can expect that 20% more of the population is also infected but not showing symptoms and not aware they are infected. These asymptomatic individuals will be spreading the disease to the healthy population, causing rises in the number of exposed individuals, resulting in more infectious. Although a high asymptomatic rate will lead to more overall cases, these cases are not as deadly; asymptomatic cases do not show symptoms, and in the case of COVID-19, they do not lead to death. If we were to model a disease where asymptomatic cases could lay dormant for years and later lead to death, we would see more interesting fatality results.

5. Rapid prototyping of vaccination effects

The baseline Cell-DEVS model defined in section 3 was also extended to model the effect of population vaccination, providing a mechanism for experimentation. Vaccines decrease the likelihood of infection and death from SARS-CoV-2, but do not provide full immunity from the disease. The addition of the vaccinated state makes this model an SEVIRDS model, where re-susceptibility to disease is specified by the model’s user. We modeled vaccination using a two-dose schedule like the one seen the global distribution of mRNA COVID-19 vaccines from Pfizer or Moderna.⁵¹ Three classes of vaccination status for individuals exist concurrently within this model: unvaccinated, vaccinated with dose 1 only, and vaccinated with two doses. Infections in the unvaccinated class of population are implemented using the Cell-DEVS SEIRD model defined in section 3, while the other two classes of vaccinated population are implemented using a modified version. We used Thomas et al.⁵¹ in which the vaccines do not provide statistically significant immunity to infection with SARS-CoV-2. Vaccinated populations were modeled using a modified Susceptible state, having increased resistance to infection and fatality with each sequential dose of vaccine. The model can be easily extended to include third or fourth doses.

5.1. SEVIRDS model definition

The three classes of vaccination status are modeled using concurrent SERID models with a shared Deceased state, and flow of population from less vaccinated states to more vaccinated states. Individuals are only considered for vaccination if they are not in an Exposed, Infected, or Deceased state. A state diagram for the model’s population is given in Figure 18, where the bold text indicates the name of the state, and the text below each name is the state symbology used in the local computation function’s equations.

All states that end with the letters “NV” specify the non-vaccinated population. Similarly, all states that end with “V1” specify the population that has received only one dose of vaccine, and all states that end with “V2” specify the population that has received two doses of vaccine. Members of the Susceptible_NV and Recovered_NV states are considered eligible for vaccination, and transition to state Suscetible_V1 after vaccination according to the population vaccination rate of dose 1 (v_a,d₁). Members of the Susceptible_V1 and Recovered_V1 states are also considered eligible for vaccination, and transition to state Suscetible_V2 after vaccination according to the population vaccination rate of dose 2 (v_a,d₂(q)). The timing of vaccinations within the states Recovered_NV, Susceptible_V1, and Recovered_V1 is restricted only to the later phases of these states so that a minimum recovered time before vaccination, and a minimum second-dose interval is implemented. Members of the vaccinated states have increased protection from infectious exposure specified by immunity rates (i_a,d₁, i_a,d₂), and have incubation rates, recovery rates, and fatality rates that differ from the non-vaccinated population. When re-susceptibility is enabled, individuals enter the final phase of susceptibility in their vaccination class when they become susceptible again, meaning that in the case of Susceptible_NV and Susceptible_V1, individuals who enter these states as re-susceptible are immediately eligible for vaccination. The parameters that are specific to the vaccinated states and are new to the SEVIRDS model are given in Table 7. The parameters that were defined in the base Cell-DEVS model but now have vaccination status dependency are given in Table 8.

Table 7.

Vaccinated state-specific parameters.

Parameter	Symbol
Number of phases in Vaccinated_D1 state	$T_{d 1}$
Number of phases in Vaccinated_D2 state	$T_{d 2}$
Minimum number of days Vaccinated_D1 waits before eligible for dose 2	$m T_{d 1}$
Minimum number of days Recovered_NV and Recovered_V1 waits before vaccine eligible	$m d_{vr}$
Vaccination rate for dose 1	$v_{a, d 1}$
Vaccination rate for dose 2 per phase	$v_{a, d 2} (q)$
Immunity rate of Vaccinated_D1 per phase	$i_{a, d 1} (q)$
Immunity rate of Vaccinated_D2 per phase	$i_{a, d 2} (q)$

Table 8.

Vaccinated state-specific parameters.

Parameter	Non-vaccinated	Vaccinated dose 1	Vaccinated dose 2
Incubation rate	$ε_{a} (q)$	$ε V 1_{a} (q)$	$ε V 2_{a} (q)$
Recovery rates	$γ_{a} (q - 1)$	$γ V 1_{a} (q - 1)$	$γ V 2_{a} (q - 1)$
Fatality rates	$F_{a} (q)$	$FV 1_{a} (q)$	$FV 2_{a} (q)$

The equations that govern the Vaccinated states of dose 1 and dose 2 are given in the set of Equation (13), which describe these states for a collection of cells M = {m₀, m₁, …, m_i} at time t. Equations (13a)–(13d) describe the phases of the Vaccinated_D1 state, and Equation (13e)–(13g) describe the phases of the Vaccinated_D2 state. Age group-specific variables are indexed by the subscript a character, and the cell for which the equations are applied is specified by the subscript i character as previously defined in sections 2, 3, and 4. The symbol $E_{i}$ represents the neighbor impact factor, defined later in Equation (14), and is a variable that represents the total strength of infectious contact cell i has with all neighbors. $E_{i}$ is used to simplify the representation of the new exposures and performs the same functions seen in Equations (3a) and (11a) but for the SEVIRS model. The vaccinated set of Equation (13) assumes that the length of the Recovered_D1 state is greater than the Vaccinated_D1 state (T_r,V₁ ≥ T_d₁).

Equation (13a) defines the new Vaccinated_D1 (q = 1) with their first dose as the sum of the Susceptible_NV and Recovered_NV populations that receive their first dose. Both the Susceptible_NV and Recovered_NV populations are vaccinated at the age group–specific rate of v_a,d₁, where the Recovered_NV are only vaccinated after having been recovered for a minimum of md_vr days. Equation (13b) defines how the Vaccinated_D1 population advances through the phases of Vaccinated_D1 before they become eligible to receive their second dose at q = mT_d₁. It states that the Vaccinated_D1 population in phases q = 2 to q < mT_d₁ is equal to the Vaccinated_D1 population on the previous day, minus those that become Exposed_D1 according to their immunity rate i_a,V₁(q − 1) and the level of infectious neighbor contact in $E_{i}$ . Once the Vaccinated_D1 populations reach phase q = mT_d₁, they are vaccinated at a rate of v_a,d₂(q − 1), indicated in Equation (13c), where (1 − v_a,d₂(q − 1)) is the proportion of Vaccinated_D1 that do not receive dose 2 and will advance to the next phase of Vaccinated_D1. This equation states that the Vaccinated_D1 populations in all phases q ≥ mT_d₁ are equal to the Vaccinated_D1 population of the previous day, minus those that get vaccinated with dose 2, and those that become exposed. Equation (13d) defines the Vaccinated_D1 population in the last phase of Vaccinated_D1, where individuals may enter this state from either Vaccinated_D1(q = T_d₁ − 1) or from the last recovered phase Recovered_D1(T_r,V₁). Specifically, the Vaccinated_D1(T_d₁) is equal to the Vaccinated_D1 population from the previous phase that do not receive dose 2, plus the newly re-susceptible population from $RV 1_{i, a}^{t} (T_{r, V 1})$ that do not receive dose 2, minus the Vaccinated_D1 from the previous phase that become Exposed_D1. The new Vaccinated_D2 population (q = 1) is defined in Equation (7e) as the sum of all Vaccinated_D1(T_d₁) that do not become exposed, the Vaccinated_D1 population that gets their second dose early (q ≥ mT_D₁), and the Recovered_D1 that gets their second dose (q ≥ md_vr). Note that all population members in the last phase Vaccinated_D1(T_d₁) become Vaccinated_D2(1) or Exposed_D1(1) on the next day. Equation (7f) states that the Vaccinated_D2 population in phases q = 2 to q = T_d₂ − 1 is always equal to the Vaccinated_D2 of the previous phase that do not become Exposed_D2. Finally, Equation (7g) defines the population in the last phase of Vaccinated_D2 (q = T_d₂) as the sum of Vaccinated_D2(T_d₂) and Vaccinated_D2(T_d₂ − 1) from the previous day, minus the new Exposures in the populations from these two phases, plus the re-susceptible entering Vaccinated_D2(T_d₂) from the last phase of Recovered_D2:

{\begin{matrix} {V 1}_{i, a}^{t + 1} (1) = (v_{a, d 1} • S_{i, a}^{t}) + v_{a, d 1} • [\sum_{q = m d_{v r}}^{T_{r}} R_{i, a}^{t} (q)] & (a) \\ {V 1}_{i, a}^{t + 1} (q) = \begin{matrix} {V 1}_{i, a}^{t} (q - 1) - ({V 1}_{i, a}^{t} (q - 1)) • (1 - i_{a, V 1} (q - 1)) • E_{i} \\ with q ∊ {2 \dots T_{d 1} - 1} \cap q < m T_{d 1} \end{matrix} & (b) \\ {V 1}_{i, a}^{t + 1} (q) = \begin{matrix} ({V 1}_{i, a}^{t} (q - 1)) • (1 - v_{a, d 2} (q - 1)) - ({V 1}_{i, a}^{t} (q - 1)) • (1 - i_{a, V 1} (q - 1)) • E_{i} \\ with i ∊ {2 \dots T_{d 1} - 1} \cap q \geq m T_{d 1} \end{matrix} & (c) \\ {V 1}_{i, a}^{t + 1} (T_{d 1}) = \begin{matrix} ({V 1}_{i, a}^{t} (T_{d 1} - 1) + {R V 1}_{i, a}^{t} T_{r, V 1}) • (1 - v_{a, d 2} (T_{d 1} - 1)) \\ - {V 1}_{i, a}^{t} (T_{d 1} - 1) • (1 - i_{a, V 1} (T_{d 1} - 1)) • E_{i} \\ if T_{r, V 1} \geq T_{d 1} \end{matrix} & (d) \\ {V 2}_{i, a}^{t + 1} (1) = \begin{matrix} {V 1}_{i, a}^{t} (T_{d 1}) - ({V 1}_{i, a}^{t} (T_{d 1}) • E_{i}) + [\sum_{q = m T_{d 1}}^{T_{d 1} - 1} (v_{a, d 2} (q) • {V 1}_{i, a}^{t} (q))] \\ + [\sum_{q = m d_{V r}}^{T_{r}, V 1} (v_{a, d 2} (q) • {R V 1}_{i, a}^{t} (q))] \end{matrix} & (e) \\ {V 2}_{i, a}^{t + 1} (q) = \begin{matrix} {V 2}_{i, a}^{t} (q - 1) - ({V 2}_{i, a}^{t} (q - 1)) • (1 - i_{a, V 2} (q - 1)) • E_{i} \\ with q ∊ {2 \dots T_{d 2} - 1} \cap T_{r, V 2} < T_{d 2} \end{matrix} & (f) \\ {V 2}_{i, a}^{t + 1} (T_{d 2}) = \begin{matrix} {V 2}_{i, a}^{t} (T_{d 2}) + {V 2}_{i, a}^{t} (T_{d 2} - 1) + {R V 2}_{i, a}^{t} (T_{r, V 2}) \\ - ({V 2}_{i, a}^{t} (T_{d 2} - 1)) • (1 - i_{a, V 2} (T_{d 2} - 1)) • E_{i} - ({V 2}_{i, a}^{t} (T_{d 2})) • (1 - i_{a, V 2} (T_{d 2})) • E_{i} \\ if T_{r, V 2} \geq T_{d 2} \end{matrix} & (g) \end{matrix}

(13)

The new exposures in this SEVIRDS model are calculated as the sum of exposures resulting from all age groups and classes of vaccination status interacting with the Susceptible populations: Susceptible_NV, Susceptible_V1, and Susceptible_V2. The mobility rate per age group, vaccination status, and the per-cell movement restriction policies make these population interactions non-heterogeneous. The neighbor impact factor $E_{i}$ used in calculating new exposures is shown in Equation (14) to simplify the exposed set of Equation (15), where the symbols used in the calculation of $E_{i}$ are defined in the baseline Cell-DEVS model, with the exception that the mobility rates (µ) and the virulence rate (λ) are now vaccination dependent, having a “V1” or “V2” following these symbols for vaccination states. The calculation of $E_{i}$ sums the amount of infective interaction cell i has with the infected of all age groups in its neighborhood in all three classes. The calculation of the exposed populations in all three classes of vaccination status is given in the set of Equation (15). Equations (15a)–(15c) define the new exposed with phase q = 1 in the three classes. Equations (15d)–(15f) define the exposed states for all phases except the first:

\begin{matrix} E_{i} = \sum_{j \in V} (g_{i j}) • [\sum_{b \in A} (a_{j, b}) • [\begin{matrix} \sum_{q = 1}^{T_{i, V 1}} (μ V 1_{b} (q) • λ_{b} (q) • IV 1_{j, b}^{t} (q)) + \\ \sum_{q = 1}^{T_{i, V 2}} (μ V 2_{b} (q) • λ_{b} (q) • IV 2_{j, b}^{t} (q)) + \\ \sum_{q = 1}^{T_{i}} (μ_{b} (q) • λ_{b} (q) • I_{j, b}^{t} (q)) \end{matrix}]] \end{matrix}

(14)

Equation (15a) states that the new Exposed_NV population in age group a is equal to the product of the Susceptible_NV population and the neighbor impact factor $E_{i}$ . Equations (15b) and (15c) perform a similar calculation for the Exposed_V1 and Exposed_V2 populations, while incorporating phase-specific immunity rates i_a,V₁(q) and i_a,V₂(q) to model increased protection from vaccination. The immunity rates are phase-specific as the protection provided from vaccination increases following inoculation. Equations (15d)–(15f) describe how the disease incubates over time and individuals transition to infected states. The exposed population of any vaccination status in all phases except the first is equal to exposed of the previous day and phase multiplied by the incubation rate of that phase (ε_a(q)). The remainder of people who make it to the end phase of an exposed state enter an infected state on the following day. Equation (15g) defines the total proportion of population that is in an exposed state (E_T) as the sum of all exposed in all phases from the Exposed_NV, Exposed_V1, and Exposed_V2 states. The Infected, Recovered, and Deceased equations are not given, as these states are calculated per the baseline Cell-DEVS model for the three classes of vaccination status, where each class operates as a unique SEIRDS model:

{\begin{matrix} E_{i, a}^{t + 1} (1) = S_{i, a}^{t} • E_{i} & (a) \\ \begin{matrix} {E V 1}_{i, a}^{t + 1} (1) = \sum_{q = 1}^{T_{d 1}} \\ [({V 1}_{i, a}^{t} (q)) • (1 - i_{a, V 1} (q)) • E_{i}] \end{matrix} & (b) \\ \begin{matrix} {E V 2}_{i, a}^{t + 1} (1) = \sum_{q = 1}^{T_{d 2}} \\ [({V 2}_{i, a}^{t} (q)) • (1 - i_{a, V 2} (q)) • E_{i}] \end{matrix} & (c) \\ \begin{matrix} E_{i, a}^{t + 1} (q) = (1 - ε_{a} (q - 1)) \partial E_{i, a}^{t} (q - 1) \\ with q ∊ {2 \dots T_{e}} \end{matrix} & (d) \\ \begin{matrix} {E V 1}_{i, a}^{t + 1} (q) = (1 - {ε V 1}_{a} (q - 1)) \partial {E V 1}_{i, a}^{t} (q - 1) \\ with q ∊ {2 \dots T_{e}} \end{matrix} & (e) \\ \begin{matrix} {E V 2}_{i, a}^{t + 1} (q) = (1 - {ε V 2}_{a} (q - 1)) \partial {E V 2}_{i, a}^{t} (q - 1) \\ with q ∊ {2 \dots T_{e}} \end{matrix} & (f) \\ \begin{matrix} E_{T (i, a)}^{t + 1} = \sum_{q = 1}^{T_{e}} \\ [(E_{i, a}^{t + 1} (q)) + ({E V 1}_{i, a}^{t + 1} (q)) + ({E V 2}_{i, a}^{t + 1} (q))] \end{matrix} & (g) \end{matrix}

(15)

At all times, the Susceptible_NV follows Equation (16), stating that the sum of all population proportions per age group is equal to 1. The total proportion of population in infected states (I_T), and recovered states (R_T), are calculated as a sum across the three vaccination classes, following the form of Equation (15g):

\begin{matrix} S_{i, a}^{t + 1} = 1 - {V 1}_{T (i, a)}^{t + 1} - {V 2}_{T (i, a)}^{t + 1} - E_{T (i, a)}^{t + 1} - I_{T (i, a)}^{t + 1} \\ - R_{T (i, a)}^{t + 1} - D_{T (i, a)}^{t + 1} \end{matrix}

(16)

5.2. SEVIRDS model implementation

The SEVIRDS model is implemented using Cadmium as before, having the same structure as Figure 5, similar state discretization to Equation (11), and a local computation function is based on the one in Table 1. The state object of each geographical cell is now a struct sevirds, containing all infection state variables. A C++ class AgeData was developed to calculate the infection state variables per age group and vaccination status. A single AgeData object is used to hold the infection state variables S, E, I, R, and D for a single age group and vaccination status, where three AgeData objects are used to calculate the struct sevirds state variables for a single age group. The use the of AgeData can be seen in Table 9, where the non-vaccinated states, vaccinated dose 1 states, and vaccinated dose 2 states are stored in AgeData objects (lines 3–5). The variables new_state (line 1) and current_state are of type struct sevird, the first being allocated to hold the result of the local computation function’s call, and the second containing the value of the cell’s state variables before the local computation function is called. The local computation function calculates all struct sevirds state variables one age group at a time (line 2). The set of Equation (13) is calculated in line 6 using all three AgeData objects of the current age group. The exposed, infected, recovered, and deceased state variables are calculated in line 7 using the three AgeData objects with the exception of Susceptible_NV. Equation (11) describing the proportion of Susceptible_NV is calculated in line 8. Lines 9–11 copy the new AgeData information into the struct sevirds new_state. Once the calculation is complete for all age groups, the object new_state is returned to conclude the local computation function.

Table 9.

Geographical cell local computation function for SEVIRDS model.

Input: none. Output: new state (struct sevird)

1: new_state = sevirds()
2: for(each age group a in current_state.age_groups)
3: ACD[0] = current_state.NVAC.at(a)
4: ACD[1] = current_state.VAC1.at(a)
5: ACD[2] = current_state.VAC2.at(a)
6: compute_vaccinated(ACD)
7: compute_EIRD(ACD)
8: ACD[0].S = 1−ACD.[1]S−ACD.[2]S − total_EIRD(ACD)
9: new_state.NVAC.at(a) = ACD[0].new
10: new_state.VAC1.at(a) = ACD.[1]new
11: new_state.VAC2.at(a) = ACD.[2]new
12: endfor
13: return new_state;

5.3. SEVIRDS model simulation results withoutre-susceptibility

In the following, we present a simulation based on Ontario’s population distribution and COVID-19 relevant data (e.g. virulence rates and recovery rates) gathered at Public Health Ontario,³ that show the effect of a two-dose vaccinated population. In Figure 19, we show the results of the first simulation run, without vaccines and without re-susceptibility enabled.

The first graph in Figure 19 shows how over 500 days the population of Ontario moves gradually from Susceptible to Exposed, then Infected, then either Recovered or Deceased. Once in Recovered, they do not return to being Susceptible as re-susceptibility is disabled, and thus, if the simulation ran for longer, we would see the Susceptible curve decline to zero, with the Recovered curve being nearly 100% of the population. The second graph shows the Exposed and Infected curves, while also show the Deceased curve. We can see the percentage of the population being added to each of the respective states on each day of the simulation, that is, new exposures, new infections, and new recoveries. For example, the peak for the Exposed state, which was around day 160, translates to just above 0.3%, meaning roughly that percentage of the population moved to the Exposed state on that day. In other words, whereas Figure 21 show the total population percentages in each of the states, Figure 20 shows the percentage of the population being added to the Exposed, Infected, and Recovered states, respectively. With respect to Equations (7)–(9), this graph shows, for each day of the simulation, the percentages for the different states when $q = 1$ .

Figure 20.

Daily new exposed, infected, and recovered for baseline simulation.

We use these base simulation results to analyze the effect of different types of immunity for vaccines.

5.3.1. Vaccination with linear increase in immunity rate

We first analyze the effect of vaccination configuring vaccines so that the immunity received from each dose increases in linear fashion respective to each dose. We assume that the immunity is uniform across all age groups. As there is no precise data, we show the simulation results obtained using Table 10 (since these are rapid prototyping parameters, they can be adjusted as soon as new data becomes available).

Table 10.

Linear increase immunity rates per week.

First-dose immunity	0.017, 0.036, 0.054, 0.071, 0.089
Second-dose immunity	0.1, 0.12, 0.143, 0.45, 0.161, 0.196, 0.214, 0.232, 0.25

When a portion of the population is vaccinated, they begin with 0.017 immunity (i.e. 1.7%), this increases to 0.036 the following week (i.e. 3.6%), and so on until the end of the first dose. They are then moved to the second dose, with 0.1 immunity (i.e. 10%) and this increases until 0.25 (i.e. 25%), as re-susceptibility is off. These changes to the configuration produce the results shown in Figure 21.

Figure 21.

Vaccination simulation with linearly increasing immunity rates.

As we can see in Figure 21 (top), there are two new curves, Vaccinated_D1 and Vaccinated_D2, which track the two susceptible vaccinated states. Exposed_D1, Exposed_D2, Infected_D1, Infected D2, Recovered_D1, and Recovered_D2 states are aggregated with their non-vaccinated equivalents for brevity. The Susceptible curve represents the Susceptible_NV discussed earlier. This figure shows how the population moves quickly from Susceptible_NV to their first dose, then just as quickly to their second dose, where they mostly remain, although close to 40% eventually contract COVID-19, with most moving to the Recovered_V2 state and around 2.5% moving to the Deceased state. In the bottom part of the figure, the Exposed and Infected curves are half of what the respective curves are in the no-vaccine simulation (see Figure 19), while the Deceased curve rises slower and is also around 1% lower than in the previous simulation, demonstrating the protection provided by vaccination.

Figure 22 shows the new exposures, infections, and recoveries. Compared to the based scenario (Figure 20), the curves have a similar share, but peaks at a smaller percentage. The peak for new daily exposures, in the simulation without vaccines, was nearing 0.35%, and it remained above 0.3% for a few days, whereas the very slim peak in this simulation is only nearing 0.175%. Interestingly, the second “wave” in this simulation appears to rise higher, proportionally to the whole simulation. That may be explained because the number of people infected in the first way was smaller than in the original scenario. Nonetheless, there is an overall decrease in exposures and infections per day with a very basic vaccine configuration.

Figure 22.

Daily new exposed, infected, and recovered in linear immunity rate.

5.3.2. Vaccination with immunity rate decay after linear increase

This scenario is a complex vaccine simulation that by includes a linear decay to the immunity received from the second dose over time. The second-dose state is short to quickly show what happens when immunity decreases over time and to demonstrate the configurability of the model.

This setup works the same as the previous simulation with a two changes. First, first- and second-dose immunity rises quicker as shown in Table 11. Second, halfway through the second-dose state, the immunity begins to decrease at the same rate that it increased during the first half of the state.

Table 11.

Linear increase immunity rates with decay.

First-dose immunity	0.03, 0.05, 0.075, 0.1, 0.125
Second-dose immunity	0.15, 0.175, 0.2, 0.225, 0.25, 0.22, 0.2, 0.175, 0.15

This configuration produces the simulation results as shown in Figures 23 and 24. As shown in Figure 23 (top), with re-susceptibility still disabled, there is no visible change to the Susceptible_NV, Vaccinated_D1, and Vaccinated_D2 curves. The changes are evident in bottom part of the figure where the Exposed and Infected states still hit the same peak but remains near it much longer than in the previous simulation. In addition, the deceased curve rises to 2%, which is still lower than the simulations without vaccines but around 0.5% higher than with no decrease in immunity. In Figure 24, we can see that the daily increases are slightly higher than in the previous simulation, with the peak being wider and just past 0.2% versus 0.175%. However, it is still just over 0.1% lower than without vaccines.

Figure 23.

Vaccination with decaying immunity rate.

Figure 24.

Daily new exposed, infected, and recovered with decaying immunity rate.

5.3.3. Vaccination with immunity rate decay after curved increase

We show an additional scenario without re-susceptibility where the immunity rate increases slower than in the previous scenario but decreases faster. The immunity rate values can be seen in Table 12.

Table 12.

Curved increase immunity rates with decay.

First-dose immunity	0.003, 0.06, 0.08, 0.093, 0.099
Second-dose immunity	0.1, 0.223, 0.249, 0.243, 0.22, 0.186, 0.146, 0.106, 0.09

These configuration parameters produce the results shown in Figures 25 and 26.

Figure 25.

Curved increase immunity rate simulation with decaying immunity rate.

Figure 26.

Daily new exposed, infected, and recovered with curved increase immunity rate and decay.

The difference with the previous scenario (section 5.3.2) is that the final phases of Vaccinated_D2 provide lower immunity rates, resulting in a higher magnitude of exposures, infections, and recoveries. The value of the second-dose immunity rate’s final phase is the most impactful value, as people who reach this remain at the level of immunity rate unless they become Exposed. The peak infected in this scenario reaches 2.2% of Ontario’s total population, versus 2.9% in the case of the base simulation without any vaccinations, and 1.8% in the case of previous simulation. While the immunity rate profiles of the Vaccinated_D2 state of this simulation and the previous simulation (section 5.3.2) are similar, their outcomes differ greatly because of the difference in the lasting immunity rate i_a,V₂(T_d₂) of Vaccinated_D2.

In Figure 26, we can see how the cases per day peak and then drop before raising again but are less in magnitude than the base scenario. Compared to the previous simulation scenario, the curves in this graph are very similar, having a slightly larger magnitude of infections because the final phase Vaccinated_D2 immunity rate is now lower (0.09) than the previous simulation (0.15).

5.4. SEVIRDS model simulation results with re-susceptibility

We have also studied a base scenario with no vaccination and re-susceptibility. The configuration is identical to the base scenario presented in section 5.3, but with re-susceptibility. Re-susceptibility highlights the importance of vaccines.

Figure 27 shows the simulation results of this new scenario. Compared to the base scenario presented in section 5.3 (Figures 19 and 20), the smaller recovery curve is the most evident difference, along with the very aggressive incline on the Deceased curve. As re-susceptibility is enabled, once recovered population has reached the end of the recovered state, they become susceptible again, and this pattern will, in theory, continue like this until all the population are in the Deceased state. The results in this simulation do not seem so severe as around 75%–80% of the population remains either Susceptible or Recovered. However, analyzing the bottom part of Figure 27, the Exposed and Infected curves show no sign of decreasing over time, they remaining is stable at a 3% rate. What is more worrying is that the Deceased curve has an increasing trend and has already passed 5% (around 2% higher than without re-susceptibility).

Figure 27.

Baseline SEVIRDS simulation with re-susceptibility.

In Figure 28, we also observe that the Exposed and Infected curves show little signs of dropping, and it spikes around 0.05% higher than in the base scenario without re-susceptibility. The Recovered curve is very close to the Infected curve, meaning most people recover from COVID-19 in this simulation. The gap between them represents the daily portions of the population that are Deceased, and as this is not zero, eventually, as alluded to earlier, the whole population would end up in the Deceased state unless some measures are taken to control the spread.

Figure 28.

Daily new exposed, infected, and recovered for baseline simulation with re-susceptibility.

6. Conclusion

A Cell-DEVS COVID-19 model was defined and simulated to study the spread of infectious disease in geographical environments. The model defined in this paper uses a method of geographical CA like that used in Zhong et al.¹⁶ in their simulation of the original SARS pandemic but considers more factors in the implementation of movement restriction policies. The model was simulated using the geography of Ontario, Canada, and divided the province into geographical cells by PHU. The methodology of adjusting the model and disease parameters was demonstrated, and the accuracy of the model was improved substantially. Many directions for future improvements exist including the following:

Defining region-specific disease characteristics based on epidemiological data.

The addition of a different class of cell to represent the coupling of each cell to the outside world.

The addition of a population density correction factor.

Increasing disobedience or changing the infection correction factors over time.

We implemented the asymptomatic cases using a basic infectious/asymptomatic ratio value that can be input by the user. The asymptomatic carriers expose more individuals than their non-asymptomatic counterparts. This is intended to represent the effect of not taking adequate precautions due to the absence of awareness. Users can also input a described group of neighborhoods and run the model through the neighborhoods, allowing for visualization of how a disease might spread through a city, town, or country. We use a case study where the model’s neighborhoods are defined as the PHUs of Ontario. Our results show how an asymptomatic set of carriers can lead to sharper increases in case counts resulting in a change to the total numbers of cases that a population would experience. Our model provides a framework to rapidly prototype disease spread in their neighborhood where asymptomatic infections can be considered and incorporated where necessary.

We presented a model that allows users to create rapid simulation prototypes to simulate how much of an impact asymptomatic cases would have on disease case counts. Another important note about the model, although it has been built around the COVID-19 pandemic, it can be used for any other disease. This can be done in a quick, efficient manner. If users have the relevant information for a disease along with the asymptomatic rate at which a disease transmits, they can simply change the parameters and re-run the model. The model also gives the users the ability to efficiently adapt the geographical level that the model is being run on.

Future work will focus on deeper analysis of the models to investigate the influence of different model parameters in the transmission of the disease. For example, we will investigate how is connectedness correlated with transmission or how does the number of spatial units impact the robustness of the model.

Future adaptations plan to incorporate asymptomatic disease transmission rates (modified virulence rates), asymptomatic rates by age group, a more accurate susceptible to exposed transition and new compartments to represent individuals who are vaccinated, and therefore less susceptible to the disease. The model can be easily adapted to simulate variants of concern by tuning the test case configuration data found in Table 1. In future adaptations, variants of concern will be addressed in a formal manner allowing for the simulation of single and multiple variants of concern.

Footnotes

Funding

This research is funded by NSERC Alliance research grants.

ORCID iDs

Cristina Ruiz Martin

Gabriel Wainer

Author biographies

Glenn Davidson is a master’s graduated at the Department of Systems and Computer Engineering at Carleton University.

Aidan Fahlman is a master’s student at the Department of Systems and Computer Engineering at Carleton University. His email address is aidanfahlman@cmail.carleton.ca.

Eric Mereu is an undergraduate student in Computer Science at Carleton University.

Cristina Ruiz-Martín received a PhD in Cotutelle between Universidad de Valladolid and Carleton University (2018). She is an Instructor at Carleton University. Her research interests include modeling and simulation in decision-making. She can be reached at cristinaruizmartin@sce.carleton.ca.

Gabriel Wainer received the PhD degree from Université d’Aix-Marseille III. He is a full professor in Carleton University. His current research interests are related with modeling methodologies and tools, parallel/distributed simulation, and real-time systems. His email is gwainer@sce.carleton.ca.

Peter Dobias received his PhD degree in Physics from University of Alberta. He currently works at the Centre for Operational Research and Analysis, Defence Research and Development Canada. He does research in complex systems, optimization, wargaming, and capability-based analysis. His email is peter.dobias@drdc-rddc.gc.ca.

Mark Rempel received his MSc in Physics from University of Saskatchewan. He currently works at the Centre for Operational Research and Analysis, Defence Research and Development Canada. He does research in complex systems, optimization, wargaming, and capability-based analysis. His email is mark.rempel@drdc-rddc.gc.ca.

References

Johns Hopkins University & Medicine. COVID-19 map, https://coronavirus.jhu.edu/map.html (2022, accessed 25 January 2022).

World Health Organization. WHO Coronavirus (COVID-19) dashboard, https://covid19.who.int/ (2021, accessed 25 January 2022).

Public Health Ontario. Ontario COVID-19 data tool, https://www.publichealthontario.ca/ (2021, accessed 25 January 2022).

Yanes-Lane

Winters

Fregonese

, et al. Proportion of asymptomatic infection among COVID-19 positive persons and their transmission potential: a systematic review and meta-analysis. PLoS ONE 2020; 15: e0241536.

Buitrago-Garcia

Egli-Gany

Counotte

, et al. Occurrence and transmission potential of asymptomatic and presymptomatic SARSCoV-2 infections: a living systematic review and meta-analysis. PLoS Medicine 2020; 17: e1003346.

Byambasuren

Cardona

Bell

, et al. Estimating the extent of asymptomatic COVID-19 and its potential for community transmission: systematic review and meta-analysis. J Assoc Med Microbiol Infect Dis Can 2020; 5: 223–234.

Pollock

Lancaster

Asymptomatic transmission of covid-19. BMJ 2020; 371: 19–20.

Ralli

Morrone

Arcangeli

, et al. Asymptomatic Patients as a Source of Transmission of COVID-19 in Homeless Shelters. Int J Infect Dis 2021; 103: 243–245.

Garrett

Tapley

Andriesen

, et al. High rate of asymptomatic carriage associated with variant strain omicron. medRxiv, 2021, https://doi.org/10.1101/2021.12.20.21268130

10.

Oran

Topol

EJ.

Prevalence of asymptomatic SARS-CoV-2 infection: a narrative review. Ann Intern Med 2020; 173: 362–367.

11.

Rachah

Torres

DFM

. Dynamics and optimal control of Ebola transmission. Math Comput Sci 2016; 10: 331–342.

12.

Djordjevic

Silva

Torres

DFM

. A stochastic SICA epidemic model for HIV transmission. Appl Math Lett 2018; 84: 168–175.

13.

Flaxman

Mishra

Gandy

, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 2020; 584: 257–261.

14.

Kermack

McKendrick

A contribution to the mathematical theory of epidemics. Proc Roy Soc A-Math Phy Eng Sci 1927; 115: 700–721.

15.

Sattenspiel

Dietz

A structured epidemic model incorporating geographic mobility among regions. Math Biosci 1995; 128: 71–91.

16.

Zhong

Huang

Song

Simulation of the spread of infectious diseases in a geographical environment. Sci China Ser D-Earth Sci 2009; 52: 550–561.

17.

Ghosh

Bhattacharya

A data-driven understanding of COVID-19 dynamics using sequential genetic algorithm based probabilistic cellular automata. Appl Soft Comput 2020; 96: 106692.

18.

Medrek

Pastuszak

Numerical simulation of the novel Coronavirus spreading. Expert Syst Appl 2021; 166: 114109.

19.

Dietz

Heesterbeek

Daniel Bernoulli’s epidemiological model revisited. Math Biosci 2002; 180: 1–21.

20.

Ross

An application of the theory of probabilities to the study of a priori pathometry—Part I. Proc Roy Soc A-Math Phy Eng Sci 1916; 92: 204–230.

21.

Ross

Hudson

An application of the theory of probabilities to the study of a priori pathometry—Part II. Proc Roy Soc A-Math Phy Eng Sci 1917; 93: 212–225.

22.

Hethcote

HW.

Mathematics of infectious diseases. SIAM Rev 2020; 42: 599–653.

23.

Barman

Nayak

Yadav

, et al. Modeling control, lockdown & exit strategies for COVID-19 pandemic in India. medRxiv, 2020, https://arxiv.org/abs/2007.07988.

24.

Lin

Chen

Hung

, et al. The data forecast in COVID-19 model with applications to US, South Korea, Brazil, India, Russia and Italy. ArXiv, 2021, https://arxiv.org/abs/2011.04738

25.

Giordano

Blanchini

Bruno

, et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat Med 2020; 26: 855–860.

26.

Caccavo

Chinese and Italian COVID-19 outbreaks can be correctly described by a modified SIRD model. MedRxiv, 2020, https://doi.org/10.1101/2020.03.19.20039388

27.

Danon

Brooks-Pollock

Bailey

, et al. A spatial model of CoVID-19 transmission in England and Wales: early spread, peak timing and the impact of seasonality. Philos Trans R Soc Lond B Biol Sci 2020; 376: 20200272.

28.

Sen

Use of a modified SIRD model to analyze COVID-19 data. Ind Eng Chem Res 2021; 60: 4251–4260.

29.

Tobler

WR.

Cellular geography. In: Olsson

(ed.) Philosophy in Geography. Dordrecht: Springer, 1979, pp. 379–386.

30.

Cárdenas

Henares

Ruiz-Martín

, et al. Cell-DEVS models for the spread of COVID-19. In: ACRI 2020, Łódź, 2–4 December 2020.

31.

Waters

. Tobler’s first law of geography. In: Richardson

Castree

Goodchild

, et al. (eds) International encyclopedia of geography. Hoboken, NJ: Wiley, 2018.

32.

Wainer

Cellular modeling with cell-DEVS: a discrete-event cellular automata formalism. In: International conference on cellular automata for research and industry, Kraków, 22–25 September 2014.

33.

Zeigler

Praehofer

Kim

TG.

Theory of modeling and simulation: integrating discrete event and continuous complex dynamic systems. San Diego, CA: Academic Press, 2000.

34.

Belloli

Vicino

Ruiz-Martin

, et al. Building Devs models with the cadmium tool. In: Winter simulation conference, National Harbor, MD, 8–11 December 2019.

35.

Xiang

Modeling epidemic spread in transportation networks: a review. Int J Transp Eng 2021; 8: 139–152.

36.

Kraemer

MUG

Golding

Bisanzio

, et al. Utilizing general human movement models to predict the spread of emerging infectious diseases. Sci Rep 2019; 9: 5151.

37.

Ota

Will we see protection or reinfection in COVID-19?

Nat Rev Immunol 2020; 20: 351.

38.

Stokel-Walker

What we know about Covid-19 reinfection so far. BMJ 2021; 372: n99.

39.

Davidson

Wainer

Studying Covid-19 spread using a geography based cellular model. In: Winter Simulation Conference, Phoenix, AZ, 12–15 December 2021.

40.

McAloon

Collins

Hunt

, et al. Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ Open 2020; 10: e039652.

41.

CDC. Interim guidance on ending isolation and precautions for adults with COVID-19, https://www.cdc.gov/coronavirus/2019-ncov/hcp/duration-isolation.html (2021, accessed 25 January 2022).

42.

OHESI. Health regions, http://www.ohesi.ca/health-regions (2021, accessed 25 January 2022).

43.

Government of Ontario. Confirmed positive cases of COVID-19 in Ontario. Ontario Open Data Catalog, https://data.ontario.ca/en/dataset (2021, accessed 25 January 2022).

44.

Ontario GeoHub. Ministry of Health Public Health Unit Boundary, https://geohub.lio.gov.on.ca/datasets/ministry-of-health-public-health-unit-boundary (2021, accessed 25 January 2022).

45.

Government of Ontario–Ministry of Health. Public health unit locations, https://www.health.gov.on.ca/en/common/system/services/phu/locations.aspx (2021, accessed 25 January 2022).

46.

GeoPandas Developers. GeoPandas 0.10.2, https://geopandas.org/en/v0.10.2/index.html (2021, accessed 25 January 2022).

47.

Team R Development Core. A language and environment for statistical computing, http://www.r-project.org (2018, accessed 25 January 2022).

48.

Plotly Technologies Inc. Collaborative data science, https://plot.ly (2015, accessed 25 January 2022).

49.

Carlucci

D’Ambrosio

Balsamo

Demographic and attitudinal factors of adherence to quarantine guidelines during Covid-19: the Italian model. Front Psychol 2020; 11: 559288.

50.

Angus Reid Institute. COVID-19 compliance: one-in-five Canadians making little to no effort to stop Coronavirus spread, http://angusreid.org/covid-compliance/ (2020, accessed 25 January 2022).

51.

Thomas

Moreira

Kitchin

, et al. Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine through 6 months. N Engl J Med 2021; 385: 1761–1773.

52.

Wainer

Discrete-event modeling and simulation: a practitioner’s approach. Boca Raton, FL: CRC Press, 2009.