OsamaHamdyOsman / Data_Wrangling_Project_DAND

Gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, then clean it. Then, showcase wrangling efforts reflected on analyses and visualization.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Wrangling Project

As a curriculum component of Udacity's Data Analyst Nanodegree, this project was completed as a requirment to for the data wrangling and anlysis course.

Introduction

The dataset used for this project was provided by Udacity and belongs to WeRateDogs twitter account. Following the recommended process flow for data wrangling (gathering, assessing, cleaning), followed by analysing the dataset to glean insights into the twitter accounts

The project entailed gathering data from a variety of file types and via numerous gathering techniques (manual download, programmatic download, api access), assessing the data for quality and tidiness, cleaning the data using a define, code, test methodology, and completing analysis and visualzations of the cleaned datasets.

Tools Used

  • Python
  • Libraries: pandas, numpy, matplotlib, seaborn, json, os, requests, tweepy
  • Jupyter notebook.

Outputs

Two reports in pdf formates:

  1. Wrangling report that lists the aformentioned data wrangling workflow in detail.
  2. Act report that summarizes the findings of the analysis.

About

Gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, then clean it. Then, showcase wrangling efforts reflected on analyses and visualization.


Languages

Language:Jupyter Notebook 100.0%