pks9862728888 / US_bikeshare_data_analysis

Analyzing the bike-share data of US from Motivate, for three popular cities Washington, New York, and Chicago and showing statistics for different users, stations, and times of travel. Also filters the data sets according to the user's choice and shows statistics on the filtered data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Project 1: Analysis Of US Bikeshare Data From Motivate

by Pran Kumar Sarkar

US Bikeshare Data analysis project, part of the Udacity Python Foundation Nanodegree.

Overview:

This project analyzes the bike-share data of US from Motivate for popular cities Washington, New York, and Chicago; and then shows various statistics according to filter chosen by user. It also shows visual statistics and individual travel data if the user want to see it.

Moreover after seeing statistics of a particular station, the user can restart the program to see the statistics of another station or same station with different filters.

Available Filters

Day: This filters the data by day.
Month: This filters the data by month.
Both: This filters the data by both month and year.
None: This does not filter data at all.

If filter is day, month, or both, then statistics of popular day, month or both are shown respectively before filter is applied.

Statistics Shown

These statistics are shown on filtered data:~

Time statistics: This shows statistics of most popular day of travel, most popular month of travel, and most popular hour of travel. However if data is filtered by day, month or both, then statistics on most popular day, month or both is shown on whole data set instead of showing it on filtered data-set.

Station Statistics: This shows statistics of the most common Start Station, most common End Station, and most popular trip based on combination of Start and End Stations.

Trip Duration Statistics: This shows statistics of the total trip duration and the average trip duration.

User Statistics: This shows statistics on the types of bike users, gender of bike users, most recent, most common, and most earliest birth year of the bike users.

Moreover Time statistics, Station statistics, and User statistics are also shown in graphs.

Required Libraries and Dependencies:

Python 3.x is required to run this project. The Python executable should be in your default path, in which the Python installer should have set.

Additional required packages:

  • pandas : pip install pandas==1.1.4
  • numpy : pip install numpy==1.19.4
  • seaborn : pip install seaborn==0.11.0
  • matplotlib : pip install matplotlib==3.3.2

To install the above packages(if not installed) open terminal or command prompt and type in the commands provided above.

This project is done with pandas version 1.1.4, numpy version 1.19.4, seaborn version 0.11.0 and matplotlib version 3.3.2. Installing other versions may subject to errors.

Project contents:

This project consists for the following files:

  • bikeshare.py - main Python script to run.
  • chicago.csv - contains bikeshare data for Chicago city.
  • new_york_city.csv - contains bikeshare data for New York city.
  • washington.csv - contains bikeshare data for Washington city.
  • README.md - Readme file containing detailed instructions for program execution and project overview.
  • _config.yml - Configuration file for github theme.

Download:

This project can be downloaded by clicking here

How to Run Project:

Download the project zip file to your computer and unzip the file. Or clone this repository to your desktop by typing the following code in your terminal(for Linux) or command prompt(for windows):

git clone https://github.com/pks9862728888/Movie_trailer_website_generator.git

Navigate to the project directory and type in the following command:

python Bikeshare.py

Extra Credit Description:

The following features were implemented to gain an extra credit from Udacity:

  • Added functionality to show popular day and popular month of trip on whole data if data is to be filtered by day or month or both.
  • Shows the number and percentage of data which are missing.
  • Added functionality to visualize the individual statistics in graph.

References:

  1. Python Foundation Nanodegree Udacity
  2. Markdown formatting for README.md
  3. Writing Readme Free Course Udacity
  4. Sample Readme by tsega
  5. Clearing the screen: Stack Overflow Link
  6. Counting grouped occurences in dataframe: Stack Overflow Link
  7. Python for Data Analysis by Wes McKinney
  8. Mentor(available in Slack): Sir Vardaan Sharma

Thank you.

About

Analyzing the bike-share data of US from Motivate, for three popular cities Washington, New York, and Chicago and showing statistics for different users, stations, and times of travel. Also filters the data sets according to the user's choice and shows statistics on the filtered data.


Languages

Language:Python 100.0%