bigheiniu / COVID-19-Dataloaders

Automatically COVID-19 Data Loaders

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

COVID-19-Dataloaders

The repository affords researchers an easy tool and uniform tool to load the COVID-19 related dataset listed in here.

This dataloader is still under construction for automatically downloading from the Internet.

About

TODO:

  1. Automatically Upload file into S3.
  2. Download function should download the file from S3 instead of local file systems.

Goal

  • This tool is easy to use. All the datasets are in pandas.Dataframe format which is easy for further data analysis. Researchers can run one python command to load the dataset

      data = TweetIDCorpus.download(url_or_path)
      pandas_df = data.data
    
  • This dataloader can load datasets from different perspectives, i.e Academic, Social Media, News, Case Report, Geo-Spatial, etc. Researchers can find some connections among different kinds of datasets. And we believe this will boost the development in defending COVID-19.

Contributing

  • Although this project is still under construction, you can still take part in the data load construction. I have only implemented two topics: Social media and Academics, and auto-downloading is still on the way. If you like, you can work on other topics except for these two topics. It is better to check the base dataloader class.

  • Please feel free to send me pull requests or email (yichuan1@asu.edu) to add resources.

About

Automatically COVID-19 Data Loaders


Languages

Language:Python 100.0%