ermiasgelaye / Google-Health-Search-Project

The goal of this project is to visualize the top searches for common health issues in the United States, from Cancer to Diabetes, and compare them with the actual location of occurrences for those same health conditions to understand how search data reflects life for millions of Americans.

Home Page: google-health-search-project.vercel.app

Changes of Online Health Search Trends

🌱🌱🌱 Check the main project out here!

Project Goal

How has the online search interest for top health issues or diseases changed over time? And how does the online search interest compare with the real-life leading causes of death?

To investigate this question, our team used online health search data for the US from 2004 to 2017, provided by Google Trends, together with data on the real-life leading causes of death, provided by the US Centers for Disease Control and Prevention (CDC). We chose the United States as the representative population. Google Trends data in particular allow us to see what people are searching for at a very local level. Given the rapid technological advancement and people's growing reliance on the Internet, we build visualizations and draw insights from our data to understand how online search patterns over the past two decades reflect real-life health conditions for millions of Americans.

Overall, online health searching has been increasing and will presumably keep increasing with the growth of telehealth and related technologies, in the United States and probably elsewhere as well. This growing trend is more conspicuous in the more technologically advanced and more metropolitan regions.

Research Question

How have the diseases most searched for online changed over the past two decades in the United States?

Data sources

Architectural Diagram

🔭 ETL Process

Extract

Data were sourced from Google Trends (specifically Google Health Search, 2004-2017) and from US CDC data covering the same period.

Transform

Data were cleaned and transformed in a Python Jupyter Notebook: Health_Analysis.ipynb
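As an illustration of what such a cleaning step can look like, here is a minimal sketch that reshapes a wide table (one column per year and condition) into long records and drops missing values. The column layout (`geo`, `2004+cancer`, ...), the sample values, and the `tidy_rows` helper are assumptions made for this example; they are not taken from Health_Analysis.ipynb.

```python
import csv
import io

# Hypothetical raw extract: one row per location, one column per "year+condition".
# The values are made up for illustration, not project data.
RAW_CSV = """geo,2004+cancer,2004+diabetes,2005+cancer,2005+diabetes
Alabama,44,48,43,
Alaska,36,40,38,41
"""

def tidy_rows(raw_text):
    """Reshape wide 'year+condition' columns into long records, skipping blanks."""
    records = []
    for row in csv.DictReader(io.StringIO(raw_text)):
        geo = row["geo"].strip()
        for column, value in row.items():
            if column == "geo" or value is None or value.strip() == "":
                continue  # skip the key column and missing measurements
            year, condition = column.split("+", 1)
            records.append({
                "geo": geo,
                "year": int(year),
                "condition": condition.strip().lower(),
                "volume": int(value),
            })
    return records

rows = tidy_rows(RAW_CSV)
```

A long format like this (one record per location, year, and condition) tends to be easier to load into a relational table and to filter by condition later.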

Load

  • This project used a Python Jupyter Notebook to load the transformed data into a PostgreSQL database: loadData.ipynb

  • A Python Flask-powered RESTful API was used to serve the data on the web, and API endpoint links were created. The endpoints return our cleaned and transformed data in JSON format and are publicly accessible to visitors of our website.
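The load-and-serve flow described above can be sketched roughly as follows. To keep the example runnable without a database server, an in-memory `sqlite3` database stands in for PostgreSQL; the table name `search_volume`, its columns, the sample rows, and the `volume_by_condition` helper are all hypothetical and not taken from loadData.ipynb or the project's Flask app.

```python
import json
import sqlite3

# Assumption: sqlite3 (in-memory) stands in for PostgreSQL so the sketch runs anywhere.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE search_volume "
    "(geo TEXT, year INTEGER, condition TEXT, volume INTEGER)"
)

# Hypothetical cleaned records, shaped as a transform step might produce them.
cleaned = [
    ("Alabama", 2004, "cancer", 44),
    ("Alabama", 2005, "cancer", 43),
    ("Alaska", 2004, "cancer", 36),
    ("Alaska", 2004, "diabetes", 40),
]
conn.executemany("INSERT INTO search_volume VALUES (?, ?, ?, ?)", cleaned)
conn.commit()

def volume_by_condition(connection, condition):
    """Return the JSON text an API endpoint could serve for one condition."""
    cursor = connection.execute(
        "SELECT geo, year, volume FROM search_volume "
        "WHERE condition = ? ORDER BY geo, year",
        (condition,),  # parameterized query, never string formatting
    )
    payload = [
        {"geo": geo, "year": year, "volume": volume}
        for geo, year, volume in cursor
    ]
    return json.dumps(payload)

body = volume_by_condition(conn, "cancer")
```

In a Flask application, a view function would typically build the same list of dictionaries and return it with `flask.jsonify`; that wiring is omitted here so the sketch depends only on the standard library.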

Deployment

The app is deployed on Heroku. To access the page, follow the Eagle Dashboard link and explore the whole project.

You can find our presentation slides here

Data Analysis and Visualization

Objectives

  • How has the overall online health search volume changed over the years? Can we confirm that people are increasingly reliant on telehealth? (Visualized with a Single Line Chart)
  • How has the online health search volume changed over the years for specific health conditions or diseases? (Visualized with a Multiple Line Chart, a Radar Chart, and a Boxplot)
  • How have the patterns of change in online health search volume varied geographically? Which states/cities have relied more on searching for health issues online? (The main observation is visualized with a Choropleth map and a Bar Chart in the main dashboard, and in more detail in the Comparison dashboard with a Stadium Track Chart, a Bar Chart, a Choropleth map, and Scatterplots)
  • How do the searches for different health conditions correlate with each other? (Visualized with a Correlation Matrix)
  • What are the real-life leading diseases? How do the real-life leading causes of death coincide with people's online health search trends? (Visualized with a Multiple Line Chart)
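For the correlation objective, one common way to build such a matrix is the pairwise Pearson coefficient between the yearly series of each condition. The sketch below assumes that approach; the source does not state which correlation measure the project used, and the `series` values and the `pearson` helper are invented for illustration.

```python
from math import sqrt

# Hypothetical yearly search volumes per condition (made-up values, not project data).
series = {
    "cancer": [40, 42, 45, 47, 50],
    "diabetes": [30, 33, 35, 38, 41],
    "depression": [50, 48, 47, 45, 44],
}

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Nested dict keyed by condition pairs: matrix[a][b] is the correlation of a with b.
matrix = {
    a: {b: pearson(series[a], series[b]) for b in series}
    for a in series
}
```

Each diagonal entry is 1 (up to floating-point rounding) and the matrix is symmetric, so a heatmap of `matrix` only needs one triangle to convey the full picture.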

The following visualizations are made:

Health Search Volume by State and Region (Choropleth map)

Interactive Charts With Dropdown Selection "City"

Health Search Volume by Year (Single Line Chart)

Health Search Volume by Year and Condition (Multiple Line Chart)

Health Search Volume by State

Correlation Between Searches of Health Conditions

Boxplot of Google Health Searches, 2004-2017

Radar Plot on All Time Total Volume of Health Searches

Radar Plot on the Sum Total Volume of 10 Leading Causes of Death Per 100,000 Population from 2004-2017

Team members (Team Eagle)

  • Adedamola Atekoja (‘Damola)
  • Amanda Qianyue Ma
  • Amos Johnson
  • Ermias Gaga
  • Maria Lorena

Languages

  • HTML 66.8%
  • JavaScript 26.0%
  • Python 3.7%
  • CSS 2.7%
  • Jupyter Notebook 0.9%
  • Procfile 0.0%