DerekDLP / data2019nCoV

Data on the 2019 Novel Coronavirus Outbreak (R Package)

data2019nCoV R Package

This is a simple R package to store data about the 2019-2020 outbreak of the novel coronavirus, SARS-CoV-2, causing COVID-19.

There is no guarantee as to the accuracy of the transcribed data or of the graphs in the vignettes. Please file an issue if you notice any errors (discrepancies with the WHO reports).

Currently, this package contains 3 datasets.

1. World Health Organization (WHO)

WHO_SR: The transcribed WHO situation reports up to 2020-03-20 (Situation Reports 1-60).

2. Ontario Ministry of Health Cumulative Cases

ON_cumulative: The source of this data is the Ontario Government's official website. Where possible, data is taken from page updates captured by the Internet Archive. It has been updated up to 2020-03-20.

3. Government of Canada Cumulative Cases

CAN_cumulative: The source of this data is the Government of Canada's official website. Where possible, data is taken from page updates captured by the Internet Archive. It has been updated up to 2020-03-20.

Demo

Plots of the WHO data can be found in the WHO SR vignette.

Plots of the Ontario data can be found in the ON vignette.

Plots of the Canada data can be found in the CAN vignette.

Usage

The package can be installed via devtools::install_github("eebrown/data2019nCoV"). Be sure to reinstall to obtain the latest version as new data is released (including the errata periodically reported by WHO).

Once the package is installed and loaded, the WHO data is available in a data.frame object, WHO_SR, which uses plain-language variable names. The same data is also available in the object sarscov2_who_2019, which uses more systematic variable names. The latter object also appears in the outbreaks package, though it may be slightly less up to date there.
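As a minimal sketch of a first session, assuming the package has already been installed as described above, the objects can be inspected like this. No particular column names are assumed; names() is used to list whatever the installed version provides.

```r
# Assumes data2019nCoV was installed beforehand (see the install command above).
library(data2019nCoV)

# WHO_SR is a data.frame; check its size and variable names
# instead of assuming any particular column exists.
is.data.frame(WHO_SR)
dim(WHO_SR)
names(WHO_SR)

# Show the last few rows (the most recently transcribed reports,
# assuming rows are stored in chronological order).
tail(WHO_SR)

# The same data with the more systematic variable names.
names(sarscov2_who_2019)
```

Comparing the two names() outputs is a quick way to see how the plain-language and systematic naming schemes line up.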

Wiki

If you use the data for a project, please consider adding the project link to the wiki. If you verify the data (confirm it to be equivalent to the WHO Situation Reports), please also consider noting that on the wiki. The purpose is to facilitate data analysis in this rapidly evolving pandemic.

Data Structure

Raw/transcribed data are stored as .csv files in the /data-raw directory, e.g. /data-raw/WHO_SR.csv. The script that converts each raw file into an R object is a corresponding .R file in the same directory (e.g. /data-raw/WHO_SR.R).
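The following is an illustrative sketch of the CSV-to-R-object pattern, not the actual contents of the scripts in /data-raw. The file, the column names (Date, RegionA, RegionB) and the values are hypothetical placeholders, written to a temporary file so the example runs on its own.

```r
# Hypothetical stand-in for a transcribed file such as /data-raw/WHO_SR.csv.
# Column names and values are placeholders, not real case counts.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("Date,RegionA,RegionB",
             "2020-01-01,1,0",
             "2020-01-02,2,1"), csv_path)

# Read the raw CSV and convert the date column to class Date.
example_data <- read.csv(csv_path, stringsAsFactors = FALSE)
example_data$Date <- as.Date(example_data$Date)

# Save the object as an .rda file, the format commonly used for
# datasets shipped in a package's /data directory.
rda_path <- tempfile(fileext = ".rda")
save(example_data, file = rda_path)

# Loading the file restores the object under its original name.
rm(example_data)
load(rda_path)
str(example_data)
```

In a package project, usethis::use_data() is a common alternative to calling save() directly; whether this package's scripts use it is not stated here.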

Licence

This package uses freely available data and is itself freely available under the CC0 licence. Information from WHO may be subject to WHO copyright and terms. The data source and package should be attributed. No warranty is made as to the accuracy of the data or transcription: use at your own risk. Please file an issue if you find any errors.
