david-legend / programming-for-data-science-nanodegree

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

programming-for-data-science

David Cobbina

This repository contains projects for Udacity Programming For Data Science.

Project 1

Overview

This project consisted in the exploration of a provided database movie rentals in which the student had to run SQL queries and build visualizations to showcase the output of the student's queries.

For the presentation component, the student had to create four slides, and each should have (a) a question of interest, (b) a supporting SQL query needed to answer the question, (c) a supporting visualization created using the final data of the SQL queries that answer the questions of interest, and (d) a small summary on each slide.

Project Submission

Project Submission. You can find the queries here.

Question Sets

A set of questions have also been provided by Udacity so that we were free to consider and include them in our Project Submission. These were solely provided for our convenience, and we were able to choose between any of the questions in these sets or none at all in your project submission.Despite having not used any of the questions provided by Udacity, I still chose to query each question.

Project 2

Overview

In this project, the student had to make use of Python to explore data related to bike share systems for three major cities in the United States — Chicago, New York City, and Washington. The student had to write code to (a) import the data and answer interesting questions about it by computing descriptive statistics, and (b) write a script that takes in raw input to create an interactive experience in the terminal to present these statistics.

Project Submission

The developed CLI program allows the user to explore an US bikeshare system database and retrieve statistics information from the database. The user is able filter the information by city, month and weekday, in order to visualize statistics information related to a specific subset of data. The user is also able to chose to view raw data and to sort this data by columns, in ascending or descending order.

Project Submission.

Files Used

The required files for running this program are:

  • washington.csv
  • new_york_city.csv
  • chicago.csv

You can download the files here.

Requirements

This program was written in Python (version 3.7.1) and relies on the following libraries:

pandas==0.23.4 numpy==1.15.4 Click==7.0

Project 3

This project consisted in using Git and GitHub with a simulated workflow to refactor Project 2.

Project Submission

The Project Submission consisted in the filling of a template with all the Git commands used to perform the requested tasks.

Project Submission.

Imgur

About


Languages

Language:Python 100.0%