Mathematics AI SL's Sample Internal Assessment

Mathematics AI SL's Sample Internal Assessment

To investigate the spread of the pandemic COVID-19 in New Delhi, India by studying the variations of the susceptible, infected and recovered population with respect to time over a period of three months and determination of correlation and regression between them

6/7
6/7
10 mins read
10 mins read
Candidate Name: N/A
Candidate Number: N/A
Session: N/A
Word count: 1,977

Table of content

Rationale

Mathematics is an essential and versatile subject to analytically interpret various real life scenarios. I have always been fascinated about the application of Mathematics in the real world. It has always intrigued me to go beyond the academic curriculum and think about various real life scenarios from a mathematical point of view. Recently, the outbreak of the COVID-19 pandemic has attracted my attention towards it. I got to know about the concept of mathematical modeling of a pandemic, how it spreads and based on that study important measures are taken in controlling the pandemic.

 

Although, the world is suffering, the pandemic offered me an opportunity to stir up my interests. Since, the concept of mathematical modeling of an epidemic has occupied my mind, I thought of doing my own mathematical analysis and constructing one such model to study the spread of this disease COVID-19. I took the data of Susceptible, Infected and Recovered population over a period of three months and three days. In this study, the concepts of maxima, minima, correlation and regression really helped me arrive at some important conclusions.

 

The study would help more enthusiasts like me to get an idea of how mathematics is extensively used in the study of spread of a pandemic. It would also help to get an idea of how mathematical modeling is necessary to make important decisions regarding controlling of the pandemic among the population.

Aim

To investigate the spread of the pandemic COVID-19 in New Delhi, India by studying the variations of the Susceptible, Infected and Recovered population with respect to time over a period of three months and determining the extent of correlation and regression between them.

Background

Susceptible, infected and recovered population

In epidemiology, Susceptible population is the population which is at a risk of becoming infected by the disease. Infected population includes the individuals who are infected by the disease. Recovered population consists of the inviduals who were once infected by the disease but recovered from it.

Methodology

Data collection and processing

YearMonthIndexSusceptible population
2020April31022
2020April61346
2020April91712
2020April122089
2020April152591
2020April182943
2020April213432
2020April243946
2020April274465
2020April305087
2020May335738
2020May366526
2020May397169
2020May427622
2020May458312
2020May488867
2020May519299
2020May549769
2020May5711351
2020May6016512
2020June6317136
2020June6619005
2020June6928601
2020June7236650
2020June7543884
2020June7850283
2020June8155689
2020June8459453
2020June8764887
2020June9068797
2020July9376438

Figure 1 - Table On Variation Of Susceptible Population (Taken Per Three Days) From 1st April, 2020 to 3rd July, 2020

Figure 2 - Variation Of Susceptible Population (Taken Per Three Days) From 1st April, 2020 to 3rd July, 2020

After plotting the values of Susceptible population in the y axis with respect to time (in number of days), taken in the x axis, we get the expression for Susceptible population as a function of time. The equation is given by, y = 15.82x- 761.4x + 9369...(i), where y = S (say) denotes the Susceptible population or the number of susceptible individuals and x  = t (say) denotes the time (in number of days).

 

∴ S(t) = 15.82t- 761.4t + 9369...(i)

Year MonthIndexInfected population
2020April3822
2020April61146
2020April91512
2020April121889
2020April152291
2020April182743
2020April213232
2020April243646
2020April274265
2020April304587
2020May335138
2020May365526
2020May396169
2020May426822
2020May457612
2020May488167
2020May518799
2020May549169
2020May5710351
2020May6012512
2020June6314136
2020June6616005
2020June6922601
2020June7230650
2020June7539884
2020June7847283
2020June8150689
2020June8454453
2020June8761887
2020June9063797
2020July9368438

Figure 3 - Table On Variation Of Infected Population (Taken Per Three Days) From 1st April, 2020 to 3rd July, 2020

Figure 4 - Variation Of Infected Population (Taken Per Three Days) From 1st April, 2020 To 3rd July, 2020

After plotting the values of Infected population in the y axis with respect to time (in number of days), taken in the x axis, we get the expression for Infected population as a function of time. The equation is given by, y = 1001e0.046...(ii), where y = I (say) denotes the Infected population or the number of infected individuals and x = t (say) denotes the time (in number of days).

 

 I(t) = 1001e0.046t     ...(ii)

YearMonthIndexRecovered population
2020April3315
2020April6866
2020April91013
2020April121262
2020April151765
2020April182076
2020April212689
2020April243043
2020April273577
2020April303844
2020May334268
2020May364981
2020May395369
2020May425822
2020May456712
2020May487567
2020May518299
2020May548669
2020May579151
2020May609812
2020June6310436
2020June6611105
2020June6912066
2020June7212742
2020June7513884
2020June7815683
2020June8116989
2020June8418553
2020June8719787
2020June9021297
2020July9324738

Figure 5 - Table On Variation Of Recovered Population (Taken Per Three Days) From 1st April, 2020 To 3rd July, 2020

Figure 6 - Variation Of Recovered Population (Taken Per Three Days) From 1st April, 2020 To 3rd July, 2020:

After plotting the values of Recovered population in the y axis with respect to time (in number of days), taken in the x axis, we get the expression for Recovered population as a function of time. The equation is given by, y = 2.38x2 + 7.737x + 1088...(iii), where y = R (say) denotes the Recovered population or the number of recovered individuals and x = t (say) denotes the time (in number of days).

 

∴ R(t) = 2.38t2 + 7.737t + 1088  ...(iii)

 

Now, considering the total population N to be 100000000 i.e., 108, we define our second set of dependent variables which are,

 

\(s(t) = \frac{s(t)}{N},\) the susceptible fraction of the population

 

\(i(t) = \frac{I(t)}{N},\) the infected fraction of the population

 

\(r(t) = \frac{R(t)}{N},\) the recovered fraction of the population

 

The two sets of dependent variables i.e., S(t), I(t), R(t) and s(t), i(t), r(t) are proportional to each other. So, studying either set of the dependent variables will give us similar information and spread or progress of the pandemic.

 

\(∴s(t) = \frac{15.82t^2-761.4t+9369}{10^8}\)

 

\(i(t) = \frac{1001e^{0.046t}}{10^8}\)

 

and, \(r(t) = \frac{2.38t^2+7.737t+1088}{10^8}\)

Figure 7 - Variation Of Susceptible Fraction Of The Population With Respect To Time

We sketched the graph of the variation of susceptible fraction of population i.e., of the equation

 

\(s(t) = \frac{15.82t^2-761.4t+9369}{10^8}\)

 

Comparing the graphs of the variation of the Susceptible population vs. time and the variation of susceptible fraction of population vs. time, we observe that the increase rate of the Susceptible population is much greater than the increase rate of the susceptible fraction of population. Thus, the growth of the susceptible fraction of population is very slow with respect to time.

Figure 8 - Variation Of Infected Fraction Of The Population With Respect To Time

We sketched the graph of the variation of infected fraction of population i.e., of the equation

 

\(i(t) = \frac{1001e^{0.046t}}{10^8}\)

 

Comparing the graphs of the variation of the Infected population vs. time and the variation of infected fraction of population vs. time, we observe that the increase rate of the Infected population is greater than the increase rate of the infected fraction of the population between time t = 70 to t = 100 and the increase rate of the infected fraction of the population is greater than the Infected population between time t = 200 to t = 400.

Figure 9 - Variation Of Recovered Fraction Of The Population With Respect To Time

We sketched the graph of the variation of recovered fraction of population i.e., of the equation

 

\(r(t) = \frac{2.38t^2+7.737t+1088}{10^8}\)

 

Comparing the graphs of the variation of the Recovered population vs. time and the variation of recovered fraction of population vs. time, we observe that the increase rate of the Recovered population is much greater than the increase rate of the recovered fraction of the population. Thus, the growth of the recovered fraction of population is very slow with respect to time.

 

Now, notice that the Infected population depends on the Susceptible population and the Recovered population depends on the number of Infected individuals. The only way the susceptible number of individuals is affected i.e., an individual leaves the susceptible group only when the individual gets infected. So, we can assume that the time rate of change of Susceptible population S(t) depends on the number of susceptible individuals, number of infected individuals and the amount of contact between these two group of individuals.

 

If we assume that an infected individual has n (a fixed number) of contacts per day with the population. The fraction of these contact made by the infected individual with the susceptible individuals is s(t). Hence, an infected individual generates n×s(t) number of new infected individuals per day.

 

Therefore, the time rate of change of the Susceptible population S(t) is given by,

 

\(\frac{dS}{dt} = - n × s(t) × I(t)\)

 

One infected individual produces n×s(t) number of new infected individuals per day so, the total number of infected individuals I(t) at a given time t produces n×s(tI(t) new infected individuals. In the above differential equation, the negative sign indicates the loss of the infected individuals from the total susceptible population at a time t.

 

Therefore, the time rate of change of the susceptible fraction of the population is given by,

 

\(\frac{ds}{dt} = - n×s(t)×i(t)\)

 

Now, we further assume that a fixed fraction h of the group of infected individuals recovers per day. For example, if the infection lasts for 7 days in an individual then, one-seventh of the Infected population recovers per day.

Conclusion

According to the data taken of the variation of Susceptible population, Infected population and Recovered population and the mathematical analysis made studying it, the following assumption can be made:

  • There exists a very strong correlation between the Susceptible population and the Infected population. This indicates that there is a very strong tendency that almost the entire exposed population, who are at a risk of the infection, is going to get infected.
  • Graphically, neither the Susceptible population nor the Infected population reaches a peak within our considered time frame and it is very less likely that a peak would be reached anytime soon.
  • There exists a moderately strong correlation between the Infected population and the Recovered population. Graphically, there is a clear indication that the rate of infection is higher than the rate of recovery. This indicates that either the death rate is increasing or the infection takes a much longer time to be healed.
  • There is a very weak possibility of a herd immunity to be reached in the near future.

Reflection

Strength

  • The data of the variations of Susceptible population, Infected population and Recovered population has been taken over a period of 3 months and 3 days, from 1st April, 2020 to 3rd July, 2020, following the standard SIR model. The variations and the correlation between them have been studied and important conclusions have been made. This makes the study coherent.
  • A wide range of data has been taken and studied. This makes our study relevant to an extent to arrive at important conclusions.
  • The data has been taken from the official website of WHO (World Health Organization), which makes it authentic, error free and could be readily considered to do a critical scientific analysis.

Weakness

  • The data of the variations of the Susceptible, Infected and Recovered has been taken per three days, over a period of 3 months and studied. The data would have been more accurate if the values were taken per one day instead of per three days.
  • The study would have been more reliable if the number of deaths and the variation of the deaths with respect to time were included.
  • The study would be more scientific and reliable to actually use in real life if higher mathematical and statistical tools like differential equations, Time series analysis, Continuum models, spatial models, Markov chain models etc.