beala / covid-19-forecast

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

COVID-19 Pandemic Forecast

by Alex Beal and Nick Vanderweit

We are non-experts with an interest in forecasting world events. Our approach is to first determine a prior probability given past events, and update that probability with current evidence. The calculations below use the odds form of Bayes’ theorem.

Definition of pandemic

We define a coronavirus pandemic as an event where > 10% of the world is infected by coronavirus before 2022. In this forecast, we focus on how prevalent we believe the coronavirus outbreak will become, although we use the estimated fatality rate to make some estimates about prevalence.

Prior probability

We estimate the prior probability of a pandemic given an outbreak of this size by looking at historical cases of outbreaks of this size. Below are outbreaks of respiratory viruses causing more than 1,000 confirmed cases.

Not Pandemic Pandemic
MERS (2018) H1N1 (2009)
H1N1 (India, 2015): Since H1N1 had already become seasonal, it’s unclear if this ever had the potential to become a pandemic. London Flu (1972): Thousands of deaths, outbreaks in several countries, but unclear what percentage of the world it infected.
SARS (2003) Hong Kong Flu (1968)
Russian Flu (1977) Asian Flu (1956)
Spanish Flu (1918)

It’s unclear if Russian Flu and the 2015 Indian H1N1 outbreak should be included, as they were both outbreaks of viruses that were either already in circulation, or had been in circulation in the past.

It’s also unclear if London Flu ever went on to infect a significant percentage of the world, or if the outbreaks across several countries were small.

It’s also true that MERS and SARS never got nearly to the size of the current outbreak. We include them because they’re the only large outbreaks to be contained in the 21st century.

Including all instances, our prior is 5:4 odds that an outbreak of this size will become a pandemic.

If we exclude questionable instances (Indian H1N1, Russian Flu, and London Flu), our prior is 2:1.

Some commenters have expressed concern about undersampling contained viruses. However these data would also undersample mild infections that spread widely e.g. known coronaviruses already in circulation that cause colds. So it is unclear to us how to update on this.

There has also been concern about what time range to consider, especially given that the data shows no non-pandemic cases for most of the 1900s. Perhaps advances in gene sequencing and other diagnostic technology give us reasons to give more weight to more recent data. Considering only cases since 2000, (SARS, MERS, and H1N1), we get a 1:2 prior.

Our priors range from 1:2 at the low end to 2:1 at the high end.

Evidence for pandemic

Undetected cases

Air travel data suggests that there are countries with a substantial number of undetected imported cases. Cambodia, for example, has a similar amount of air travel from affected areas as compared to Singapore, but only has 1 confirmed case compared to Singapore’s 22 confirmed importations. Thailand has about 4 times as much air travel from China as Singapore, but has only 23 imported cases, where we would expect around 92. If these undetected importations are causing undetected outbreaks, it may be impossible to contract trace and quarantine the ill and contain the virus.

February 22 Update

There have been 18 confirmed cases and 4 deaths in Iran. WHO estimates the Infection Fatality Ratio to be in the range 0.3%-1%. Applied to Iran, this yields between 400 and 1,300 total cases. If all cases were known, the official numbers would suggest an implausibly high 4/18 ~= 22% IFR. This suggests orders of magnitude more unconfirmed cases than confirmed.

The WHO has claimed that the outbreak in Iran has not been successfully traced back to China. It’s not entirely clear to us what this implies, but it would be consistent with:

  1. Cases spread from China to Iran directly, but many of the infected Iranians were not detected.
  2. Cases spread undetected from an intermediate country.

Given the relatively low number of expected importations from China to Iran, case (1) implies higher numbers of importations to many other countries including Thailand, Japan, South Korea, Taiwan and the United States. For instance, the import rate to the Philippines is roughly an order of magnitude higher than to Iran, so case (1) implies many undetected cases around the world.

Case (2) also implies an outbreak in whatever intermediate country the virus was imported from. For instance, Iran’s largest trading partner for imports is the United Arab Emirates, which has almost 5 times the importation risk from China that Iran does, and has recently detected cases tracing back to China.

The aforementioned WHO situation report shows a sharp increase in cases outside of China, with few new cases originating in China. Two new countries (Lebanon and Israel) are reporting COVID-19 cases.

Relevant studies:

We believe this is strong evidence in favor of an eventual pandemic. This updates our prior 4:1.

Evidence against pandemic

No large outbreaks outside of China

With the exception of the Diamond Princess cruise ship, there have been no large outbreaks outside of China. This is promising, but we believe this is consistent with the possibility that outbreaks are occuring undetected. It has only been 27 days since Singapore’s first detected importation. If undetected importations started occurring at about the same time in other countries, it’s possible they are still small enough to fly under the radar given the incubation period of 7 days and an R0 of 2.5.

That said, this provides some evidence that the virus is being contained. This updates our prior 3:4.

The outbreak in China may be peaking

The official numbers coming out of China suggest that the outbreak inside China may be peaking. If an outbreak inside China can be contained, that’s a signal that China’s tactic of large scale quarantine is effective even with case numbers in the tens of thousands. Other countries may be able to deploy a similar tactic against their outbreaks should they occur.

That said, we have ample reason to be skeptical.

  • Given the political pressures to drive down the number of cases, and the constant changing case definitions, it’s hard to be sure the trend is real. We can’t rule out disinformation given the political climate in China.
  • Even if cases are brought to a very low number, China must at some point end the quarantine which could lead to a second wave. China must eliminate the virus completely for this to be effective.

We don’t see this as a likely way for the outbreak to end, but it may buy the world time to develop a vaccine, treatment, or better surveillance, so we give it a 4:5 update.

Development of an effective vaccine or antiviral

Given the possibility of pandemic, governments and private companies are incentivized to create vaccines and treatments. There are some potentially promising avenues:

While we would expect to see a flurry of research into such avenues regardless of whether they were destined for success, it is not out of the question that advances in antiviral R&D for coronaviruses would produce mitigations for COVID-19. However in the world of pharmaceutical development, there is a huge difference between a promising initial trial and large-scale distribution of an effective treatment to hundreds of thousands or millions of people.

One report estimates that 9.6% of Phase 1 trials will eventually be deemed safe and effective. There are > 120 clinical trials happening for COVID-19, so it is possible that some of the treatments will be effective. However, we have reasons to suspect that the success rate of these trials will be lower than standard Phase 1 base rates. Due to the desperate situation in China, where the vast majority of these trials are happening, it is plausible that many clinical trials are being conducted that would not have advanced to the clinical trial stage in other circumstances. Hence we should expect a lower probability that a given trial will succeed here, assuming say 1%. Additionally, for this to reduce the chance of pandemic, the successful trial would have to be a treatment that slows or stops the spread. The treatment would have to be manufactured on a large scale and distributed to many countries with varying quality of healthcare infrastructure, and the trials, manufacture, and distribution would have to happen before the outbreak of a pandemic in order to prevent one from occurring.

Given that, in the H1N1 case, it was mere months between the first outbreaks outside of Mexico and declarations of pandemic, we think it’s very unlikely that a given treatment will arrive in time to prevent pandemic. Our Bayesian update for this is 99:100.

Posterior Probability

Prior odds 5:4 2:1 1:2
Undetected cases 4:1
No large outbreak outside of China 3:4
China outbreak peaking 4:3
Development of effective vaccine or antiviral 99:100
Combined update 297:125 ~= 2.376
Posterior odds 297:100 ~= 75% 594:125 ~= 83% 297:250 ~= 54%

What would change our minds

There continue to be no large outbreaks outside of China

The longer there are no outbreaks detected outside of China, the less likely it’s true that there are undetected outbreaks. An outbreak can only go undetected for so long. At some point, it becomes more likely that containment is in fact working. If our estimate of the number of undetected cases goes over 1,000 in several countries, it’s likely our estimate is wrong and our main evidence in favor of pandemic is likely false. This would substantially change our outlook.

For instance, assuming there have been around 20 undetected cases in Cambodia as import rates would suggest, with an incubation period of 7 days and R0 of 2.5, there would be at least 1,000 cases after about 30 days (not counting additional imports). If we don’t see outbreaks in Cambodia, Japan, or Thailand after mid-March 2020, it will seem much more likely that the virus is being contained.

Outbreaks outside of China are contained

One assumption of this forecast is that undetected cases can lead to large outbreaks that cannot be stopped once they’re detected. If instead these outbreaks are quickly stopped, that would be a signal countries are significantly better at responding to these outbreaks that we anticipated.

As of February 21, we are starting to see widespread outbreaks in Iran, and a worrying spike in cases in South Korea, possibly both caused by undetected lines of transmission. We’ll be watching to see how quickly these can be contained, if at all. If they are contained that leaves open the possibility of a middle ground scenario where we see new outbreaks regularly, but they are stopped quickly.

One cause for concern is that we expect there to be a high variance in how effective countries are at responding to an outbreak, and it may only require a few countries with a low ability to respond to spread the virus to the rest of the world.

About