g0v1ndN / Covid-SQL-Analytics

Covid SQL Analytics is a robust data project leveraging PostgreSQL for in-depth global and India-specific Covid-19 analysis. Offering curated queries and valuable insights.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Covid-SQL-Analytics

About:

This repository serves as a resource for SQL-based data analysis using PostgreSQL, focusing on Covid-19 data. It includes the code, curated data, and observations derived from detailed query analysis, making it a valuable reference for those interested in exploring this dataset.

Usage Instructions:

Prerequisites:

  • SQL and PostgreSQL Knowledge: A fundamental understanding of SQL concepts and experience with PostgreSQL will be beneficial as you navigate through the queries and analysis presented here.
  • Development Environment: Set up a development environment with PostgreSQL installed. You can download PostgreSQL from its official website and follow the installation instructions provided.
  • Basic Data Analysis Skills: Familiarity with data analysis concepts will help you better understand the insights derived from the queries and observations shared in this repository.

Downloading or Cloning the Repository:

  1. Click on the "Code" button on the top right corner of this repository page.
  2. Click on "Download ZIP" to download a compressed zip file of the repository.
  3. Extract the contents of the zip file to a folder on your computer.

OR

  1. Open your terminal or command prompt and navigate to the directory where you want to clone the repository.
  2. Run the command git clone https://github.com/g0v1ndN/Covid-SQL-Analytics to clone the repository to your local machine.

Note

You need to have Git installed on your computer. If you don't have it installed, please download and install Git from the official website.

Exploring the Code:

If you want to dive deep into the what this repository brings to the table, familiarize yourself with the main analysis code located in the analyze_covid.sql file. This code is split into three key parts:

  1. Initial Setup Queries: These queries establish the foundation by creating the necessary database, tables, and importing the Covid-19 data.
  2. Data Analysis Queries for India: This section contains SQL queries tailored to analyzing Covid-19 data specific to India.
  3. Global COVID-19 Data Analysis Queries: This segment comprises SQL queries aimed at dissecting and analyzing the global COVID-19 data landscape. It provides insight to cross-country comparisons and trends in the pandemic's data.

Observations & Findings:

Important

The data used in this project covers the period from January 1, 2020 to August 16, 2023. Please note that the source website's data might have been updated since the start of this project.

Understanding the COVID-19 Situation in India

  • COVID-19 was first reported in India on January 30, 2020.
  • As of the latest data, the infection percentage in India is 3.18%, indicating a relatively moderate spread of COVID-19 within the population.
  • The peak of new COVID-19 cases was recorded on May 2021, with a notable count of 9,284,558 new cases.
  • Currently, the mortality rate in India stands at 1.18%, indicating a comparatively low fatality rate among confirmed cases.
  • 72.50% of the population in India has received at least one dose of the Covid-19 vaccine, and 67.17% are fully vaccinated.
  • India has reported a significant number of COVID-19 deaths, totaling 531,922, which corresponds to 0.04% of its population.

Global COVID-19 Landscape

COVID-19 Outbreak and Global Data:

  • The COVID-19 outbreak, which originated in China, was officially reported on January 4, 2020.
  • The latest data reveals a COVID-19 infection rate of approximately 9.65%.
  • It also discloses a mortality rate of 0.90%, reflecting a relatively low fatality rate among confirmed cases.

Infection Statistics:

  • As of the current data, the top 5 hotspots with the highest infection percentages are Cyprus (73.76%), San Marino (72.37%), Brunei (69.09%), Austria (68.03%), and South Korea (65.42%).
  • In contrast, North Korea and Turkmenistan have zero cases, while Yemen, Niger, and Chad share the lowest infection rate of 0.04%.
  • The highest number of new COVID-19 cases globally occurred on January 2022, with a total of 91,806,015 new cases reported.

Vaccination Statistics:

  • Notably, vaccinations have been administered to more than 5.6 billion people, accounting for 70.48% of the population.
  • Gibraltar, Tokelau, United Arab Emirates, Qatar, Nauru, Brunei, and Pitcairn have successfully attained vaccination percentages exceeding 99%.
  • Vaccination rates vary across income categories, with 32.54% of low-income, 79.86% of high-income, 66.28% of lower-middle-income, and 83.42% of upper-middle-income individuals having been vaccinated at least once.

Regional Deaths and COVID-19 Toll:

  • Peru, Bulgaria, Bosnia and Herzegovina, Hungary, and North Macedonia exhibit higher death percentages, with rates ranging from 0.47% to 0.65%.
  • North America leads with the highest death count (1,127,152), followed by South America (704,659), Asia (531,922), Europe (399,938), Africa (102,595) and Oceania (22,618) on a continental basis.
  • The countries with the highest COVID-19 death toll are as follows: United States (1,127,152 deaths), Brazil (704,659 deaths), India (531,922 deaths), Russia (399,938 deaths), and Mexico (334,336 deaths).

External Resources:

Data Inclusion

  • Data Source: Our World in Data
  • Citation: Edouard Mathieu, Hannah Ritchie, Lucas Rodés-Guirao, Cameron Appel, Charlie Giattino, Joe Hasell, Bobbie Macdonald, Saloni Dattani, Diana Beltekian, Esteban Ortiz-Ospina, and Max Roser (2020) - "Coronavirus Pandemic (COVID-19)". Published online at OurWorldInData.org.
  • License: CC-BY-4.0 License

The Creative Commons Attribution (CC-BY) License is an open-source license that permits usage, modification, and distribution of the content, provided you attribute the original creator.

License:

This project is licensed under the terms & conditions (T&Cs) of the MIT license. If you're interested in reading more about it, please take a look at the LICENSE document.

About

Covid SQL Analytics is a robust data project leveraging PostgreSQL for in-depth global and India-specific Covid-19 analysis. Offering curated queries and valuable insights.

License:MIT License