sonngai159 / covid_tracker

A crawler tool that collects COVID-19 data for Viet Nam

Home Page: https://sonngai159.github.io/

Viet Nam - Covid Tracker

A crawler tool that automatically collects Viet Nam COVID-19 data and writes it to Google Sheets.

Requires:

  1. Python - 3.9 or newer
  2. Pipenv
  3. Google Drive API
  4. Google Sheets
  5. Azure Virtual Machine
  6. Tableau
  7. Crontab
  8. Basic Linux knowledge

Getting Started:

2 - Pipenv Setup:

Install Pipenv with $ pip install pipenv. Then create the folder you want to use as the project folder and move into it. With the Pipfile in that folder, run $ pipenv install to install all dependencies.

3 - Google Drive API Setup:

  • Go to Google Cloud Platform and create a new project.
  • Go to APIs & Services -> Enable APIs & services, then enable "Google Drive API" and "Google Sheets API".
  • Click IAM & Admin, go to Service Accounts, and create a new service account.
  • Open the Keys tab -> ADD KEY -> Create new key -> JSON type. NOTE: keep this JSON key file; it is needed to connect to Google Sheets.
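
Before wiring the key into the crawler, it can help to confirm that the downloaded file really is a service-account key. The following is a minimal sketch using only the standard library; the function names and the key file path passed to `load_service_account_key` are hypothetical and not taken from the project's source.

```python
import json

# Fields that a Google service-account JSON key normally contains.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email", "token_uri")


def check_service_account_key(key_data: dict) -> str:
    """Return the service account's email after basic sanity checks."""
    missing = [name for name in REQUIRED_FIELDS if not key_data.get(name)]
    if missing:
        raise ValueError(f"key file is missing fields: {', '.join(missing)}")
    if key_data["type"] != "service_account":
        raise ValueError(f"unexpected key type: {key_data['type']!r}")
    return key_data["client_email"]


def load_service_account_key(path: str) -> str:
    """Read the JSON key file at `path` (hypothetical location) and check it."""
    with open(path, encoding="utf-8") as handle:
        return check_service_account_key(json.load(handle))
```

The returned email is the address to share the spreadsheet with, since a service account can only open sheets it has been given access to.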

4 - Google Sheets Setup:

  • Create a new spreadsheet and name it "csv-to-gg-sheet" (you can choose any name, but then change it in the source code too).
  • Add one worksheet named "covid_cases" to store daily COVID cases and one worksheet named "covid_death" to store daily COVID deaths. Note: if you rename a worksheet, change its name in the source code too.
  • Publish both worksheets to the web as CSV, and save the share links to use in the script.
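
A worksheet published as CSV can be read back with the standard library alone. The sketch below is one possible way to do it, not the project's actual code: the function names, the `date,cases` column layout, and the sample values are all made up for illustration.

```python
import csv
import io
import urllib.request


def parse_sheet_csv(csv_text: str) -> list[dict]:
    """Turn CSV text from a published worksheet into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def fetch_published_sheet(share_link: str) -> list[dict]:
    """Download a worksheet published to the web as CSV and parse it."""
    with urllib.request.urlopen(share_link, timeout=30) as response:
        return parse_sheet_csv(response.read().decode("utf-8"))


# Made-up sample in a hypothetical "date,cases" layout (not real figures).
sample = "date,cases\n2021-08-01,10\n2021-08-02,12\n"
rows = parse_sheet_csv(sample)
print(rows[0]["date"], rows[0]["cases"])
```

Calling `fetch_published_sheet` with the share link saved in the step above would return the same row structure for the live worksheet. Note that every value arrives as a string and needs converting before any arithmetic.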

5 - Azure Virtual Machine:

  • Go to Azure, create a new Virtual Machine, and save its ssh_key.pem.
  • Connect to the Virtual Machine over SSH and install Python.
  • Push the source code, Pipfile, ssh_key.pem file, and JSON key file to the remote machine.
  • Install all dependencies with Pipenv as in step 2.

6 - Tableau Setup:

  • Download Tableau, use the Google Sheet as the data source, and build your visualization.
  • Publish the visualization to the Tableau Public server; it updates automatically (once per day) when your Google Sheet data changes.

7 - Crontab Setup:
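
The original gives no details for this step, so the following is only an illustrative sketch of what a daily schedule could look like. It builds a standard five-field crontab line in Python; the helper name, project path, script name, and run time are all hypothetical, and the use of `pipenv run` is an assumption based on step 2.

```python
def daily_cron_line(hour: int, minute: int, project_dir: str, script: str) -> str:
    """Build a crontab entry that runs `script` once a day via Pipenv."""
    if not (0 <= hour <= 23 and 0 <= minute <= 59):
        raise ValueError("hour must be 0-23 and minute must be 0-59")
    # Five schedule fields: minute, hour, day of month, month, day of week.
    return f"{minute} {hour} * * * cd {project_dir} && pipenv run python {script}"


# Hypothetical path and script name, for illustration only.
print(daily_cron_line(7, 30, "/home/azureuser/covid_tracker", "crawler.py"))
```

A line like this is added on the virtual machine with $ crontab -e. Cron runs jobs with a minimal PATH, so the full path to the pipenv executable may be needed in place of the bare command.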

License: MIT License


Languages

Language: Python 100.0%