quantum9Innovation / ons-coronasurveys

Identifying differences between ONS and Coronasurveys COVID-19 case count estimates

ONS-Coronasurveys

Identifying differences between ONS and Coronasurveys COVID-19 case count estimates

Methodology

Data was gathered from the links above and stored in data/raw/. The Coronasurveys regional datasets for England (ENG), Northern Ireland (NIR), Wales (WLS), and Scotland (SCT) track the total percentage of cases, whereas the ONS data tracks only the percentage of active infections. We also included the Coronasurveys dataset covering all regions, which tracks active infections. Each dataset was then processed by coronasurveys.R or ons.R to remove unnecessary columns and express percentages as numbers out of 100. The processed data can be found in data/processed/. The filter.R script (or filter-gb.R for the Coronasurveys data covering the entire UK) was then used to drop NA values and keep only the days on which both the Coronasurveys and ONS datasets were complete. The resulting data, which is used to generate the plots, is available in data/filtered/.
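The filtering step can be illustrated with a minimal Python sketch. This is not the code in filter.R or filter-gb.R, which are R scripts whose contents are not shown here; the function name filter_complete_days, the in-memory dictionary layout, and the sample values are all assumptions made purely for illustration. It shows only the idea described above: a day is kept when both sources report a non-missing value for it.

```python
from datetime import date
from typing import Dict, Optional, Tuple

# Hypothetical in-memory layout: date -> percentage (0 to 100), None marks an NA.
Series = Dict[date, Optional[float]]


def filter_complete_days(
    coronasurveys: Series, ons: Series
) -> Dict[date, Tuple[float, float]]:
    """Keep only the days on which both series have a non-missing value."""
    paired: Dict[date, Tuple[float, float]] = {}
    # Only days present in both datasets can be complete in both.
    for day in sorted(coronasurveys.keys() & ons.keys()):
        cs_value, ons_value = coronasurveys[day], ons[day]
        if cs_value is not None and ons_value is not None:
            paired[day] = (cs_value, ons_value)
    return paired


# Made-up values, not taken from either dataset.
coronasurveys = {
    date(2021, 1, 1): 1.8,
    date(2021, 1, 2): None,  # NA in Coronasurveys, so this day is dropped
    date(2021, 1, 3): 2.1,   # absent from ONS, so this day is dropped
}
ons = {
    date(2021, 1, 1): 1.5,
    date(2021, 1, 2): 1.6,
    date(2021, 1, 4): 1.4,   # absent from Coronasurveys, so this day is dropped
}

filtered = filter_complete_days(coronasurveys, ons)
print(filtered)
```

With these sample inputs only 1 January 2021 survives, because it is the only day on which both series have a value.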

Plots

To see all plots generated from the filtered datasets, install the dependencies and then run plot.py. If you're using Poetry, running poetry install will install everything required. Otherwise, run pip install -r requirements.txt. A sample of the most interesting views can be found in plots/.

About

License: Apache License 2.0


Languages

R 61.8%, Python 38.2%