ohcnetwork / analytics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What is the effective reproduction rate of a pandemic, Rt ?

The effective reproduction rate of a pandemic, R0, is defined as is the average number of secondary cases per infectious case in a population made up of both susceptible and non-susceptible hosts. However, in COVID-19 pandemic, a static R0 does not adequately reflect the reality in time and space due to changes in social behaviors and restrictions.

Computation of Rt for COVID-19 enables understanding how effectively a local or state government handles the pandemic and gives the authority helpful information in decision to loosen and tighten measures of social restrictions. As the pandemic spreads with great acceleration, Rt is much more larger than 1. On the contrary, when the pandemic slows down and dies out, Rt is smaller than 1 and approaches 0. This project focuses on computation of Rt for every state on the Kerala based on the number of new cases k reported daily by the state's Department of Health. The value of Rt is related to that of a day before Rt-1, and every previous value of n days before, Rt-n.

Simulating the effective reproduction number Reff

When we derive the expression for the basic reproduction number R0 in the simple SIR model: we get an epidemic only if $\beta$/$\gamma$ > 1, i.e. if the average number of secondary cases caused by a single infected case in a totally susceptible population is greater than 1. As susceptibility declines over the course of the epidemic, the effective reproduction number Reff determines the shape of the epidemic curve as it reflects the amount of immunity in the population at any given time.

In a simple homogenous SIR model, Reff is directly related to the proportion of the population that is susceptible: \begin{align} R_{eff} = R_{0} \frac{S}{N} \end{align}

In this we model an epidemic and study the connection between the behaviour of Reff and the epidemic curve. We are looking at a closed fully susceptible population, into which a single infected person is introduced.

How does Reff vary over the course of the epidemic? What do you notice about the connection between the change in Reff and the epidemic curve over time?

The effective reproduction number is highest when everyone is susceptible: at the beginning, Reff = R0. Over the course of the epidemic, Reff declines in proportion to susceptibility.

The peak of the epidemic happens when Reff goes down to 1. As Reff decreases further below 1, the epidemic prevalence goes into decline.

Algorithms used in this project

Bettencourt & Ribeiro

We will use the same approach outlined by Kevin in his article, which uses the paper [1] as its underlying basis.

Modeling arrivals

The first step is to model the 'arrival' process of infections. A popular choice for the distribution of arrivals amongst statisticians is the Poisson Distribution. Accordingly, if we let λ represent the average rate of infections per day, then the probability that we are likely to see k new cases on a day, is given by

P(k|λ) = λk * e−λ / k!

Given this setup, we can construct the probability distribution of new cases for a set of λs.

Poisson likelihood

Modeling the arrival process as a Poisson distribution allows us to predict the distribution of new cases in a day as a function of the arrival rate λ. However, in reality, we only observe the number of arrivals. So the key question now is how do we go from the observed number of arrivals to get a sense of the distribution of λ. Thankfully, the answer to this question is simple.

L(λ|k) = λk * e−λ / k!

The distribution of λ over k is called the likelihood function, and it has the same mathematical expression as the probability mass function we used earlier. We can visualize the likelihood function by fixing the number of new cases observed (k), and computing the likelihood function over a range of values of λ.

Relating λ and Rt

According to this paper by Bettencourt & Ribeiro, the relationship between arrival rate λ and effective reproduction rate is defined as follows:

λ = kt−1 * eγ(Rt−1)

Note that γ here is the reciprocal of the serial interval (about 4 days for COVID19), and kt−1 is the number of new cases observed in the time interval t−1.

We can use this expression for λ and reformulate the likelihood function in terms of Rt.

L(Rt|k) = λk * e −λ / k!

Dataset

For this project, we use the data from https://keralastats.coronasafe.live/histories.json page where daily counts of new COVID-19 cases are reported for every district in the state of Kerala. The data is then cleaned and wrangled in a proper dataframe containing the daily count of each district. We select Ernakulam to compute the district's effective reproduction rate of the COVID-19 pandemic, Rt. Every district's Rt can be computed at users' choice by modifying the vector districts in the Analytics markdown file.

Computation Steps

The process to compute Rt can be briefly described as follows:

  1. Import the all districts' daily counts to a dataframe
  2. Initialize a value of γ and a set of discrete values of Rt
  3. Select one or more states of interest
  4. Smooth out the daily counts to flatten the choppy data points
  5. Compute the log-likelihood distribution P(k|Rt)
  6. Compute the posterior P(Rt|kt)
  7. Compute the estimate of Rt, including the most-likely, the max, and the min values of Rt
  8. Add Lockdown data to understand its impact
  9. Compute the Growth rate of the covid-19 cases

References

[1] https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0002185

[2] https://www.datacamp.com/community/tutorials/replicating-in-r-covid19

[3] https://github.com/calldrj/COVID19.Effective.Reproduction.Rate

About


Languages

Language:R 100.0%