2.1. Presumptions of the Model
The model is based on the following presumptions:
The number of susceptible individuals is proportional to the rate of growth (the relative increment).
The rate of growth is fixed in the first days of the propagation; therefore, the cumulative number of confirmed cases follows an exponential trend.
The trend of time series of growth rate is decreasing. At first, it faces a significant sudden decrease, and then it falls gradually.
Several successively lower growth rates lead to lower growth rates in the following days (due to increasing the steps or stages).
Based on the presumptions, the model is not suitable to represent the propagation of epidemics in which after some severe limitations (quarantine, staying home, social distancing, etc.), the condition of the population comes back to the previous status. To model this sort of epidemics, we need to update the model by changing the step of the model.
2.2. Mathematical Definition of the Model
We present a model to cover all the three stages; CVJR1 (a, R, l, b). This model is a stochastic process (time series model) with four parameters:
a: Representation of the decreasing probability of the rate of spreading corresponding to different items in a specified step.
We assume that the probability of acute outbreaks decreases exponentially, and conversely, the probability of mild and gradual increments increases.
R: The rate of the growth in the initial days (the first b days) of the spreading.
It is worth noting that one of our model’s presumptions is that the number of infected cases rises exponentially in the first b days. Thereafter, R, which depends on parameters a and l, falls. The role of R in our model is analogous to the role of R0 and λ in macro-dynamic and micro-dynamic models, respectively.
l: The number of days of absence of the more intense epidemiological behavior of the disease that ensures us that the level of the spreading of the infection has decreased by one step (The criterion for specifying the retreat of the disease from a more intense step to a less intense step).
b: The number of the first days that the infection faces no resistance or control.
Our model is formulated as follows:
I. The Initial Condition:
and
II. The Evolution Law:
Therefore, we can classify this model as an autoregressive multivariate time series.
Xt: The number of infected individuals up to time t.
Stept: Determining the phase of the outbreak of the disease.
As mentioned before, by this indexing, steps 3, 4, and so on are considered together as the step of retreat or stage 3 of the infection (in this case, COVID-19).
Itemt: The item of the disease in a special step.
Here, the variable item is more detailed and more partial than the variable step. By variable item, we are equipped with a tool to meet the decreasing trend of the rate of transition, reproductive number, or growth rate during a specified step.
U(t + 1): A function whose input is the variable item to determine the intensity of the spread of the disease during a time unit (in this model, the time unit is taken as one day, and it can be replaced with one month, one week, one hour, and so on).
The computations and simulations of the paper were done in MatLab 8.6. Throughout the paper, each curve is obtained through 200 simulations. In all figures, the X and Y axes represent the days and the number of confirmed cases, respectively.
Four scenarios illustrate the flexibility of our model and its capacity to cover a great variety of real datasets (
Figure 1A-
D).
Increasing
a leads to descending the probability of acute growth periods; therefore, the plot becomes smooth earlier and the range of the axis
Y decreases. The lower the parameter
a is, the more the intensity of the outbreak of the disease is. As
R increases, the growth rate of the number of infected cases goes upper. Therefore, the height of the plot rises, and the slope of the plot increases. According to this model, by declining
l, the condition for the change in phases (retreat of the process) gets harder. Therefore, the plot tends to increase by a higher acceleration. Finally,
b is the length of the most intense spread. Then, the more the value of
b is, the more the increase in the number of cases will be. To illustrate the novel method, let (
a,
R,
l,
b) = (0.9, 2.5, 4, 11). Our model suggests the first stage of the spreading of infection (
Table 1).
| Day | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
|---|
| Number of cases | R0 | R1 | R2 | R3 | R4 | R5 | R6 | R7 | R8 | R9 | R10 |
| Rounded number | 1 | 2 | 6 | 16 | 39 | 98 | 244 | 610 | 1526 | 3815 | 9537 |
The continuation of spreading (stage 2) is sometimes like its previous phase and sometimes like its current phase (
Table 2).
| Item | 1 | 2 | 3 | 4 | 5 | 6 |
|---|
| Current number | 9537a | 23842 | 38147 | 61035 | 97656 | 156249 |
| Previous phaseb | 3815 | 9537a | 15259a | 24414a | 39063a | 62500 |
| U | 0 | 1 | 1 | 1 | 1 | |
| Intensity | More | less | less | less | less | |
aThe figures determine the numbers that multiplying them by (1 - R) is equal to the growth of the process.
bPrevious phase is not . It is equal to .
Since the parameter
l is taken equal to 4, four consecutive 1s lead to a movement from step 2 to step 3 (generally, from step
n to step
n + 1). Accordingly, the results show the continuation of the process during stage 3 (
Table 3). The evolution in step
n is similar to the second step, but in this step, 1 and 0 correspond to
n - 1 and
n - 2 steps before, respectively.
| Item | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|
| Current number | 156249 | 193749 | 240249 | 297909 | 476655 | 591052 | 732904 |
| Previous phase | 62500 | 77500 | 96100 | 119164a | 190662 | 236421 | … |
| Two phases ago | 25000a | 31000a | 38440a | 47665 | 76264a | 94568a | … |
| Run | 1 | 2 | 3 | 0 | 1 | 2 | … |
| U | 1 | 1 | 1 | 0 | 1 | 1 | … |
| Intensity | less | less | less | more | less | less | … |
aThe figures indicate the numbers that multiplying them by (1 - R) is equal to the growth of the process.
It is noticeable that the variable run presents the number of consecutive 1s (less acute outbreak). It is observed that happening zero for the variable U or increasing the steps makes this variable equal to zero immediately. Accordingly, the following sequence is obtained as a realization of our model:
1, 2, 6, 16, 39, 98, 244, 610, 1526, 3815, 9537, 23842, 38147, 61035, 97656, 156249, 193749, 240249, 297909, 476655, 591052, 732904, ...
By repeating this procedure, we can obtain 15 realizations (
Figure 2).
Some realizations of the present model with parameters (0.9, 2.5, 4, 11)
Now, we address the flexibility of the introduced model that enables the model to represent a wide range of epidemic data. The validity of the model is assessed in two ways: the ability to model past observations and the validity of predictions of the model. The former is illustrated by
Figure 3, while
Figure 4 implies the latter ability of the model. The average curve of the realizations of our model could fit some known datasets, such as the pattern of the number of users of Facebook (
16), the number of confirmed cases of SARS in China (
17), and the number of infected cases of COVID-19 worldwide (
18) (
Figure 3A-
C).
A, The mean curve of CVJR1 (0.75, 1.35, 3, 1) for modeling the sequence of Facebook users from 2014 to 2019; B, the mean curve of CVJR1 (0.6, 1.23, 2, 2) for modeling the time series of confirmed cases of SARS in China from March 17 to June 17; C, the mean curve of CVJR1 (3.2, 1.44, 3, 13) for modeling the procedure of COVID-19 from January 13 to March 5.
A, Modeling the first time series using our model (CVJR1), the mean curve, 90% upper bound, and 90% lower bound with parameters (0.75, 1.35, 3, 1) for prediction of the sequence of Facebook users in 2018 - 2019 (eight units); B, modeling the second time series using our model (CVJR1), the mean curve, 90% upper bound, and 90% lower bound with parameters (0.6, 1.23, 2, 2) for prediction of time series of confirmed cases of SARS in China, from May 18 to June 17 (30 units); C, modeling the third time-series using our model (CVJR1), the mean curve, 90% upper bound, and 90% lower bound with parameters (3.2, 1.44, 3, 13) for prediction of the procedure of COVID-19 from February 25 to March 5 (10 units).