Modeling COVID-19 in Different Countries as Sequences of SI Waves
Rainer Janssen 1 , Juergen Mimkes 2*
1 Janssenplan Engineering Office, Paderborn, Germany
2 Physics Department, University of Paderborn, Germany
*Corresponding author: Juergen Mimkes, Physics Department, University of Paderborn, Germany.
Received Date: 03 March 2023
Accepted Date: 09 March 2023
Published Date: 13 March 2023
Citation: Janssen R, Mimkes J (2023) Modeling COVID-19 in Different Countries as Sequences of SI Waves. Infect Dis Diag Treat 7: 207. DOI: https://doi.org/10.29011/2577-1515.100207
Abstract
The COVID-19 pandemic has been a huge task worldwide for many institutions, researchers, national health organizations, and the pharmaceutical industry. As natural scientists and engineers, we attempt to contribute by calculating models and analyzing data to keep track of the pandemic.
While a frequent goal is to predict the next pandemic wave by considering all influencing parameters, we examined methods to calculate a model course of the entire pandemic. This is done by reconstructing the course of infections into multiple model waves, that sum up into a pandemic model close to the real course. The model wave parameters are varied by an algorithm such as the Excel solver to minimize the difference between the real and the model course.
By reconstructing the course of infections by the commonly known SIR model, we found the calculated model parameters to be ambiguous and difficult to interpret. In contrast, we found sequenced SI model waves to provide an astonishing precise digital representation of the pandemic course. Until November 2022, we found between six and 16 waves (depending on the country) in each of the 14 countries investigated.
The calculated parameters are easy to interpret, comparable among each other and between countries. They wave parameters may be correlated to virus types and measures in each country by other researchers. New waves are detectable early, as they show a certain deviation from the actual model wave. After the maximum of the last real wave, the model indicates the further procedure of the pandemic course.
Introduction
Modeling epidemic courses started with differential equations and probability calculations by Ross [1] and Kermack and McKendrick [2]. They developed the SI model of infections which considers only infected (I) and susceptible (S) person, and later the SIR model, which also includes recovered (R) persons.
Since January 2020, Johns Hopkins University (JHU, [3]) has been publishing data on global COVID-19 infections. When the first data of the new epidemic in China were available on the Internet, we started to model the John’s Hopkins data according to the SI model of epidemic growth, successfully.
Later on, we have started to use data by the organization Our World in Data (OWID, [4]) as international input data. For Germany, we use data by the Robert-Koch-Institute (RKI, [5]).
We are varying the SI model parameters to minimize the difference between the real and the model course. In the first wave this has been done manually, later on, the high number of parameters lead us to use the Excel solver tool with the GRG nonlinear method.
The SI model
We have started to analyze the data using the simple SI infection model of a population N that contains susceptible S(t) and infected people I(t):
N = S(t) + I(t) (1)
The number of daily new infected people n(t) at time t depends on the infection rate (b),
n(t) = d I(t) /(d t) = b * I(t) * S(t) / N (2)
Eq. (2) may be solved by a closed mathematical term, the sum of all infected people I(t) at time t is
I(t) = N / [1 + (N / I(0) – 1) * exp (- b * t)] (3)
I(0) is the number of all infected people in the beginning, t = 0. The solution (3) contains two unknown parameters: the total number of susceptible people N and the infection rate b. These constants are easily obtained from the logarithmic plot of infection data, see below in figure 1.
China was the first country that was hit by COVID-19 infections. Initially, the fit of the SI model was in good agreement to the COVID-19 data in China, as shown in figure 1. The two unknown constants b and N have been obtained from the slope and the maximum in the logarithmic plot in figure 1.
 
Figure 1: COVID-19 infection I(t) in China and SI model (logarithmic) [6]
In Europe, Italy was the first country that was hit by COVID-19 infections. When we modeled the course of infections in Italy after April, 2020, the simple SI model in figure 2 did not fit to the real course, as the SI model only allows for symmetrical model waves.
 
Figure 2: Asymmetrical wave of daily infected in Italy until the end of June 2020.
SIR model
To improve our modeling of COVID-19 data we started to implement the SIR model [2], which considers additionally the number of resistant, removed or recovered people R(t):
N = I(t) + S(t) + R(t) (4)
A numerical iterative solution results in 3 equations,
Δ S(t) = -b´ * S(t-1) * I(t-1) (5)
Δ I(t) = b´ * S(t-1) * I(t-1) – γ * I(t-1) (6)
Δ R(t) = γ * I(t-1) (7)
where b’ = b/N is the infection rate, and γ (gamma) is the recovery rate.
If gamma equals zero, the model is equivalent to the SI model.
Figure 2 shows the data for new infections in Italy at the end of June 2020. The data have been approximated by the SI and the SIR model. As expected, the SI model does not agree very well with the data. The SIR model leads to an improvement, but still does not fit the data well. Only, when we added two SIR model waves, we obtained a very good approximation to the real course in figure 3, where we scaled the cases to “per million persons” (“pmp”):
 
Figure 3: Daily infected in Italy with two model waves
The two different waves may be caused by
- local outbreaks in certain groups or regions
- different test measures
- different preventive measures - another virus type etc.
Accordingly, we switched over to model with multiple SIR waves in the second wave of COVID-19 for obtaining better approximations.
India as an example of multiple model waves
When we took India as an example for approximating the real course by multiple SIR waves, we also got a very good approximation result (figure 4):
 
Figure 4: Daily infected in India
But when we look to the parameters of the iteratively calculated SIR model waves, the SIR parameters are not plausible (table I):
| wave # | infection rate b‘ | infected persons pmp per wave, N | start day of next wave | gamma | 
| 1 | 6 % | 9 383 | 0 | 0.02 | 
| 2 | 13 % | 22 034 | 345 | 0.20 | 
| 3 | 14 % | 53 063 | 357 | 0.73 | 
| 4 | 60 % | 6 566 | 656 | 0.19 | 
Table I: Modeled SIR waves in India (start day 0 is February 21, 2020).
The start day of a wave has been defined by us by one infected person per one million people.
The 3rd flat wave in India obviously does not have an 8 times higher value of infected persons than the 4th wave. This contradiction may be due to the fact, that SIR parameters are nonlinearly interconnected and cannot be separated unambiguously during the iteration process.
As approximated SIR parameters apparently do not give an insight into a wave, we switched back to SI models.
When we applied SI model waves to India instead of SIR waves, we got reasonable results. The 3rd and the 4th wave now have similar values, and all infection rates appear realistic for each wave (table II).
| wave # | infection rate b‘ | infected persons pmp per wave, N | start day of next wave | gamma | 
| 1 | 3 % | 7 319 | 0 | 0 | 
| 2 | 9 % | 11 686 | 333 | 0 | 
| 3 | 2 % | 5 826 | 34 | 0 | 
| 4 | 15 % | 5 821 | 642 | 0 | 
Table II: Modeled SI waves in India (start day 0 is February 20, 2020)
Spain as an example of multiple SI waves
Also, for more complicated courses, using SI model waves shows a low deviation between course and model (figure 5, figure A12). Figure 6 shows 10 modeled SI waves. The incidence scale in figure 5 is 0.7 times lower than the cases per day pmp scale in figure 6, because the incidence relates to a sum of 7 days and 100 000 people, while the cases per day relate to one day and 1 000 000 people.
 
Figure 5: Daily infected in Spain
 
Figure 6: Daily infected in Spain, broken into SI model waves (cases/d pmp)
A new wave is modeled, if the total course approximation is improved.
Approximation error
Table III shows the relative error of our model for different countries. The mean daily deviation of our model is given in relation to the mean value of daily cases.
| Country | Mean approximation error 2022 | 
| AUT Austria | 3.6 % | 
| BRA Brazil | 8.3 % | 
| ESP Spain | 8.3 % | 
| DEU Germany | 2.9 % | 
| FRA France | 5.9 % | 
| GBR Great Britain | 6.1 % | 
| GRC Greece | 3.9 % | 
| IND India | 6.7 % | 
| ITA Italy | 3.8 % | 
| ISR Israel | 7.6 % | 
| JPN Japan | 3.1 % | 
| NLD Netherlands | 7.4 % | 
| SWE Sweden | 11.0 % | 
| USA | 4.8 % | 
| World | 2.7 % | 
Table III: Approximation error per country.
The mean relative error of the investigated countries in table III is about 5.8 %. During 2022, some countries have begun to report only once a week. Therefore, the mean approximation error will increase.
SI waves and virus types
In Germany, the predominant virus types at each time are published by RKI [6], and we have assigned them to the SI waves. For some virus types apparently more than one wave occurred, indicated by roman numbers in figure 7, figure A04, and in table A04:
| wave no. | day of maximum | main virus type | 
| 1 | 20-04-05 | |
| 2 | 20-12-16 | |
| 3 | 21-04-17 | Alpha | 
| 4 | 21-09-11 | Delta I | 
| 5 | 21-11-30 | Delta II | 
| 6 | 22-02-08 | Omicron BA.1 | 
| 7 | 22-03-22 | Omicron BA.2 I | 
| 8 | 22-04-28 | Omicron BA.2 II | 
| 9 | 22-06-25 | Omicron BA.5 I | 
| 10 | 22-07-19 | Omicron BA.5 II | 
| 11 | 22-08-20 | Omicron BA.5 III | 
| 12 | 22-10-14 | Omicron BA.5 IV | 
 
Figure 7: Daily infected in Germany (Updated. Left axis: real course and model. Right axis: model waves)
Even when there are no virus type data available, it may be possible to assign the virus types by comparison between waves of different countries during the same time interval.
SI waves for prediction
The last model wave of a real course may appear as suited to predict the further progression of infections. This may be possible once the wave in reality has reached its maximum. But at the start of a wave, the slope of an SI function is nearly independent of the later maximum (figure 8). That means that slight variations of the real data result in strong variations of the time point and height of the model’s maximum.
 
Figure 8: SI waves with different peaks but similar positive slope (logarithmic scale)
We have to wait until the slope of the real wave becomes negative to be sure that the maximum has really been reached. At least, the positive slope indicates the minimum number of persons who will be infected until the end of that wave. On the other hand, if the negative slope of the real data deviates strongly from the modeled waves, it indicates a new wave. After June 2020, that happened almost every time in all investigated countries during a negative slope (figure A01 to figure A14).
Using SI parameters
SI waves are comparable or sortable by each parameter, including the start day, the calculated maximum day, or the duration of a wave. Table IV shows an example where the Omicron BA.1 I waves are sorted by the number N of infected persons.
| Country | day of maximum | Infection rate | infected in wave BA.1 I (pmp) | |
| FRA | 22-01-24 | 10,7% | 196.562 | |
| NLD | 22-02-05 | 10,4% | 174.105 | |
| ISR | 22-01-26 | 17,0% | 168.968 | |
| AUT | 22-02-06 | 9,8% | 154.388 | |
| ESP | 22-01-17 | 9,3% | 121.113 | |
| ITA | 22-01-19 | 10,1% | 119.733 | |
| SWE | 22-01-28 | 14,8% | 94.485 | |
| DEU | 22-02-08 | 9,3% | 91.956 | |
| USA | 22-01-17 | 11,7% | 76.324 | |
| GBR | 22-01-05 | 15,8% | 52.371 | |
| GRC | 22-01-03 | 29,2% | 39.021 | |
| BRA | 22-01-30 | 14,8% | 18.238 | |
| JPN | 22-02-06 | 13,9% | 17.257 | |
| IND | 22-01-24 | 15,2% | 5.808 | 
Table IV: Waves of Omicron BA.1 I, sorted by the number N of infected persons.
Special observations
In Great Britain (figure 9, figure A05) and in the world data (figure 10, figure A15), we detected also very long waves, lasting more than 18 months.
 
Figure 9: Daily infected in Great Britain, long wave 6 (blue)
 
Figure 10: Daily infected in the World, long wave 7.
Another interesting effect is that despite of lock downs, vaccinations or other measures, the total course of the pandemic may be modeled very well by simple symmetrical SI waves. This is obviously due to the fact that lock downs and other measures only affect the height of a wave, not the shape.
Publication of the modeling results
Our results are used by a local medical COVID-19 committee in our city of Paderborn/Germany.
Link: praxisnetz-pb.de/aktuelles-2/
We used the Excel solver for iteration of parameters allowing for an easy use for everyone. The Excel sheets also do not contain macros. The complete Excel sheets are available for free download and free use. Each of about 190 countries reported by “Our World in Data” is available by changing the country code within a sheet.
Link: www.janssenplan.de („ activities “)
Conclusions
The complete COVID-19 course of 14 countries has been modeled well by about six to 16 SI waves until November 2022. Asymmetrical waves of cases per day have been modeled by two or more symmetrical waves. While SIR wave parameters are ambiguous and give no insight into a wave, we use SI wave parameters, that are unambiguous and easy to assign. They are well comparable among each other and between countries. The parameters may be correlated to virus types and also to preventive measures if known in each country. New waves are early detectable, as they show an early deviation from the actual model wave. The last wave of the wave sequence gives an indication how the pandemic will proceed.
All wave data and the underlying Excel file including a help sheet are published and are free for public use.
References
- Ross R (1916) An application of the theory of probabilities to the study of a priori pathometry. —Part I. Proceedings of the Royal Society of London. Series A 92:204-230.
- Kermack WO, McKendrick AG (1927) A Contribution to the Mathematical Theory of Epidemics. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 115 :700-721.
- Johns-Hopkins-University (2020).
- Our World in Data OWID.
- Robert-Koch-Institut RKI.
- Mimkes J, Janssen R (2020) On the numbers of infected and deceased in the second Corona wave.
- Robert-Koch-Institut RKI (2022) Wöchentlicher Lagebericht des RKI zur Coronavirus-Krankheit-2019 (COVID-19).
Annex
In each figure A01 to A15 in this annex, the course of infections, the model of the course and the model waves are shown in a common figure. For a better visibility, the waves are shown there with a small distance to the course and the model course. In addition, the wave parameters are given in table A01 to table A15. The start day is omitted if it lies outside of the range the figure.
If input data have been extremely implausible, they have been replaced by the mean of the neighbored values.
 
Figure A01: Austria, real course and models.
| wave no. | day of maximu m | infection rate, b´  | infected per wave pmp, N | start
  day of wave, t0 | infected at wave start  | main virus type | 
| 1 | 20-04-01 | 19.4% |                1 689                            0.9  |  | ||
| 2 | 20-11-20 | 7.0% |              37 828       121              1.0  |  | ||
| 3 | 21-03-29 | 4.8% |              28 667       186              1.0  |  | ||
| 4 | 21-09-23 | 5.4% |              15 973       399              1.0  |  | ||
| 5 | 21-11-22 | 12.0% |              50 265       550              1.0  |  | ||
| 6 | 22-02-06 | 9.8% |            154 388       594              1.0  | Omicron BA.1 | ||
| 7 | 22-03-20 | 11.8% |            163 893       656              1.0  |  | ||
| 8 | 22-05-02 | 8.6% |              27 732       681              1.0  |  | ||
| 9 | 22-06-29 | 13.9% |              21 669       787              1.0  |  | ||
| 10 | 22-07-23 | 12.0% |              39 922       794              1.0  |  | ||
| 11 | 22-08-26 | 12.5% |              15 378       839              1.0  |  | ||
| 12 | 22-10-10 | 11.9% |              52 225       870              1.0  |  | ||
| 13 | 22-11-16 | 13.1% |              11 116       927              1.0  |  | ||
Table A01: Austria, SI parameters.
 
Figure A02: Brazil, real course and models.
 
Table A02: Brazil, SI parameters.
 
Figure A03: France, real course and models.
 
Table A03: France, SI parameters.
 
Figure A04: Germany, real course and models.
 
Table A04: Germany, SI parameters.
Input data are taken from RKI [5], as they are continuously updated also retroactiv. Main virus types are assigned on base of RKI [7].
 
Figure A05: Great Britain, real course and models.
 
Table A05: Great Britain, SI parameters.
 
Figure A06: Greece, real course and models.
 
Table A06: Greece, SI parameters.
 
Figure A07: India, real course and models.
 
Table A07: India, SI parameters.
 
Figure A08: Italy, real course and models.
 
Table A08: Italy, SI parameters
 
Figure A09: Israel, real course and models.
 
Table A09: Israel, SI parameters.
 
Figure A10: Japan, real course and models.
 
Table A10: Japan, SI parameters.
 
Figure A11: Netherlands, real course and models.
 
Table A11: Netherlands, SI parameters.
 
Figure A12: Spain, real course and models.
 
Table A12: Spain, SI parameters.
 
Figure A13: Sweden, real course and models.
 
Table A13: Sweden, SI parameters.
 
Figure A14: USA, real course and models.
 
Table A14: USA, SI parameters.
 
Figure A15: World, real course and models.
 
Table A15: World, SI parameters.
© by the Authors & Gavin Publishers. This is an Open Access Journal Article Published Under Attribution-Share Alike CC BY-SA: Creative Commons Attribution-Share Alike 4.0 International License. Read More About Open Access Policy.
 review article
          review article