What is the effective reproduction rate of a pandemic, R_t ?

The effective reproduction rate of a pandemic, R₀, is defined as is the average number of secondary cases per infectious case in a population made up of both susceptible and non-susceptible hosts. However, in COVID-19 pandemic, a static R0 does not adequately reflect the reality in time and space due to changes in social behaviors and restrictions.

Computation of Rt for COVID-19 enables understanding how effectively a local or state government handles the pandemic and gives the authority helpful information in decision to loosen and tighten measures of social restrictions. As the pandemic spreads with great acceleration, R_t is much more larger than 1. On the contrary, when the pandemic slows down and dies out, Rt is smaller than 1 and approaches 0. This project focuses on computation of Rt for every state on the Kerala based on the number of new cases k reported daily by the state's Department of Health. The value of R_t is related to that of a day before R_t-1, and every previous value of n days before, R_t-n.

Simulating the effective reproduction number R_eff

When we derive the expression for the basic reproduction number R₀ in the simple SIR model: we get an epidemic only if $\beta$/$\gamma$ > 1, i.e. if the average number of secondary cases caused by a single infected case in a totally susceptible population is greater than 1. As susceptibility declines over the course of the epidemic, the effective reproduction number R_eff determines the shape of the epidemic curve as it reflects the amount of immunity in the population at any given time.

In a simple homogenous SIR model, R_eff is directly related to the proportion of the population that is susceptible: \begin{align} R_{eff} = R_{0} \frac{S}{N} \end{align}

In this we model an epidemic and study the connection between the behaviour of R_eff and the epidemic curve. We are looking at a closed fully susceptible population, into which a single infected person is introduced.

How does R_eff vary over the course of the epidemic? What do you notice about the connection between the change in R_eff and the epidemic curve over time?

The effective reproduction number is highest when everyone is susceptible: at the beginning, R_eff = R₀. Over the course of the epidemic, R_eff declines in proportion to susceptibility.

The peak of the epidemic happens when R_eff goes down to 1. As R_eff decreases further below 1, the epidemic prevalence goes into decline.

Algorithms used in this project

Bettencourt & Ribeiro

We will use the same approach outlined by Kevin in his article, which uses the paper [1] as its underlying basis.

Modeling arrivals

The first step is to model the 'arrival' process of infections. A popular choice for the distribution of arrivals amongst statisticians is the Poisson Distribution. Accordingly, if we let λ represent the average rate of infections per day, then the probability that we are likely to see k new cases on a day, is given by

P(k|λ) = λ^k * e^−λ / k!

Given this setup, we can construct the probability distribution of new cases for a set of λs.

Poisson likelihood

Modeling the arrival process as a Poisson distribution allows us to predict the distribution of new cases in a day as a function of the arrival rate λ. However, in reality, we only observe the number of arrivals. So the key question now is how do we go from the observed number of arrivals to get a sense of the distribution of λ. Thankfully, the answer to this question is simple.

L(λ|k) = λ^k * e^−λ / k!

The distribution of λ over k is called the likelihood function, and it has the same mathematical expression as the probability mass function we used earlier. We can visualize the likelihood function by fixing the number of new cases observed (k), and computing the likelihood function over a range of values of λ.

Relating λ and Rt

According to this paper by Bettencourt & Ribeiro, the relationship between arrival rate λ and effective reproduction rate is defined as follows:

λ = k_t−1 * e^γ(Rt−1)

Note that γ here is the reciprocal of the serial interval (about 4 days for COVID19), and k_t−1 is the number of new cases observed in the time interval t−1.

We can use this expression for λ and reformulate the likelihood function in terms of R_t.

L(R_t|k) = λ^k * e ^−λ / k!

Dataset

For this project, we use the data from https://keralastats.coronasafe.live/histories.json page where daily counts of new COVID-19 cases are reported for every district in the state of Kerala. The data is then cleaned and wrangled in a proper dataframe containing the daily count of each district. We select Ernakulam to compute the district's effective reproduction rate of the COVID-19 pandemic, R_t. Every district's R_t can be computed at users' choice by modifying the vector districts in the Analytics markdown file.

Computation Steps

The process to compute Rt can be briefly described as follows:

Import the all districts' daily counts to a dataframe
Initialize a value of γ and a set of discrete values of R_t
Select one or more states of interest
Smooth out the daily counts to flatten the choppy data points
Compute the log-likelihood distribution P(k|R_t)
Compute the posterior P(R_t|k_t)
Compute the estimate of R_t, including the most-likely, the max, and the min values of R_t
Add Lockdown data to understand its impact
Compute the Growth rate of the covid-19 cases

References

[1] https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0002185

[2] https://www.datacamp.com/community/tutorials/replicating-in-r-covid19

[3] https://github.com/calldrj/COVID19.Effective.Reproduction.Rate

ohcnetwork / analytics

What is the effective reproduction rate of a pandemic, R_t ?

Simulating the effective reproduction number R_eff

How does R_eff vary over the course of the epidemic? What do you notice about the connection between the change in R_eff and the epidemic curve over time?

Algorithms used in this project

Bettencourt & Ribeiro

Modeling arrivals

Poisson likelihood

Relating λ and Rt

Dataset

Computation Steps

References

About

Languages

What is the effective reproduction rate of a pandemic, Rt ?

Simulating the effective reproduction number Reff

How does Reff vary over the course of the epidemic? What do you notice about the connection between the change in Reff and the epidemic curve over time?

Algorithms used in this project

Bettencourt & Ribeiro

Modeling arrivals

Poisson likelihood

Relating λ and Rt

Dataset

Computation Steps

References

About

Languages

What is the effective reproduction rate of a pandemic, R_t ?

Simulating the effective reproduction number R_eff

How does R_eff vary over the course of the epidemic? What do you notice about the connection between the change in R_eff and the epidemic curve over time?