nishchaychawla / Data-Wrangling-of-WeRateDogs

Data Wrangling using tweet id of WeRateDogs tweets

Data-Wrangling-of-WeRateDogs

Overview

Data wrangling comprises gathering, assessing, and cleaning data. The data for this project was gathered from the Twitter API using the tweet IDs provided by Udacity and the tweepy package in Python. The JSON data gathered with tweepy was read into a pandas DataFrame, and visual and programmatic assessments were then performed to identify quality and tidiness issues. A subset of these issues (8 quality and 3 tidiness) was addressed using pandas, and finally a few insights were discussed.
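The gather, assess, and clean steps described above can be sketched roughly as follows. This is a minimal illustration and not the code in wrangle_act.ipynb: the sample tweets, the helper names (load_tweets, assess, clean), and the choice of fields are assumptions made for the example, and the tweepy download step is replaced by an in-memory stand-in with one JSON object per line.

```python
import io
import json

import pandas as pd

# Hypothetical stand-in for the JSON gathered via tweepy:
# one tweet object per line. The ids and counts are made up.
SAMPLE = io.StringIO(
    '{"id": 1001, "retweet_count": 12, "favorite_count": 40}\n'
    '{"id": 1002, "retweet_count": 3, "favorite_count": null}\n'
    '{"id": 1002, "retweet_count": 3, "favorite_count": null}\n'
)


def load_tweets(handle):
    """Gather: parse line-delimited tweet JSON into a DataFrame."""
    rows = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        tweet = json.loads(line)
        rows.append({
            "tweet_id": tweet["id"],
            "retweet_count": tweet.get("retweet_count"),
            "favorite_count": tweet.get("favorite_count"),
        })
    return pd.DataFrame(rows)


def assess(df):
    """Assess: count missing values per column and duplicated tweet ids."""
    return {
        "missing": df.isnull().sum().to_dict(),
        "duplicate_ids": int(df["tweet_id"].duplicated().sum()),
    }


def clean(df):
    """Clean: drop duplicated tweets (quality) and store ids as strings."""
    df = df.drop_duplicates(subset="tweet_id").copy()
    df["tweet_id"] = df["tweet_id"].astype(str)
    return df.reset_index(drop=True)


tweets = load_tweets(SAMPLE)
report = assess(tweets)
cleaned = clean(tweets)
```

Storing tweet_id as a string is one common choice, since the ids are identifiers and not quantities to do arithmetic on; the notebook may handle this differently.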

Files:

act_report.pdf: Briefly discusses the insights drawn from the cleaned data set.

wrangle_act.ipynb: Code for the whole data wrangling and analysis project.

wrangle_report.pdf: Briefly discusses the wrangling process and future work.


Languages

Jupyter Notebook 100.0%