This project was completed as a requirement of the data wrangling and analysis course in Udacity's Data Analyst Nanodegree.
The dataset used for this project was provided by Udacity and comes from the WeRateDogs Twitter account. The project follows the recommended data wrangling workflow (gathering, assessing, cleaning) and then analyzes the dataset to glean insights into the Twitter account.
The project entailed gathering data from a variety of file types using several gathering techniques (manual download, programmatic download, API access), assessing the data for quality and tidiness, cleaning the data using a define, code, test methodology, and completing analysis and visualizations of the cleaned datasets.
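The define, code, test methodology mentioned above can be sketched roughly as follows. This is a minimal illustration rather than the project's actual cleaning code: the sample rows, the column names (`tweet_id`, `timestamp`, `name`), and the placeholder values `"a"` and `"None"` are assumptions made for the example.

```python
import pandas as pd

# Hypothetical sample standing in for the tweet archive (assumed columns).
archive = pd.DataFrame({
    "tweet_id": [1, 2, 3],
    "timestamp": [
        "2017-08-01 16:23:56 +0000",
        "2017-08-01 00:17:27 +0000",
        "2017-07-31 00:18:03 +0000",
    ],
    "name": ["Phineas", "a", "None"],
})

# Define:
#   1. `timestamp` is stored as text; convert it to a datetime type.
#   2. `name` contains placeholder strings ("a", "None"); treat them as missing.

# Code: always clean a copy so the original data stays available.
clean = archive.copy()
clean["timestamp"] = pd.to_datetime(clean["timestamp"], utc=True)
placeholders = ["a", "None"]  # assumed placeholder values
clean["name"] = clean["name"].mask(clean["name"].isin(placeholders))

# Test: confirm each defined issue is resolved.
assert pd.api.types.is_datetime64_any_dtype(clean["timestamp"])
assert not clean["name"].isin(placeholders).any()
```

Working on a copy keeps the assessed data untouched, so each test can be compared against the original if a cleaning step misbehaves.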
- Python
- Libraries: pandas, numpy, matplotlib, seaborn, json, os, requests, tweepy
- Jupyter Notebook
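As a rough illustration of how the `json` and `pandas` libraries listed above might be combined, the sketch below loads tweet data stored as one JSON object per line into a DataFrame. The storage format, the in-memory sample, and the selected fields (`id`, `retweet_count`, `favorite_count`) are assumptions for the example, not a description of the project's actual files.

```python
import io
import json

import pandas as pd

# Hypothetical stand-in for a text file holding one tweet JSON object per line.
raw = io.StringIO(
    '{"id": 1, "retweet_count": 10, "favorite_count": 25}\n'
    '{"id": 2, "retweet_count": 3, "favorite_count": 8}\n'
)

rows = []
for line in raw:
    tweet = json.loads(line)  # parse one tweet per line
    rows.append({
        "tweet_id": tweet["id"],
        "retweets": tweet["retweet_count"],
        "favorites": tweet["favorite_count"],
    })

# Build a tidy table with one row per tweet.
tweets = pd.DataFrame(rows)
print(tweets)
```

With a real file, `io.StringIO(...)` would be replaced by `open(path, encoding="utf-8")`, where `path` is whatever location the tweet data was saved to.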
Two reports in PDF format:
- Wrangling report that describes the aforementioned data wrangling workflow in detail.
- Act report that summarizes the findings of the analysis.