SophiaY404 / Datacamp-projects

Python projects from DataCamp, including Jupyter notebooks and datasets

DataCamp Python Projects

This repository contains projects completed on and downloaded from DataCamp. Each project includes a Jupyter notebook and a dataset folder whose data has been checked for validity. Some projects also include an HTML file exported from the notebook to preserve the visualizations and graphs. The tables below summarize the projects, grouped by core skill:

Data Manipulation & Visualization

| Project | Core Skill(s) | Description |
| --- | --- | --- |
| Bad Passwords and the NIST Guidelines | Data manipulation | Check what passwords fail to conform to the National Institute of Standards and Technology's password guidelines. |
| Exploring the History of Lego | Data manipulation | Explore a database of every LEGO set ever built. |
| Exploring the Evolution of Linux | Data manipulation | Find out about the development of the Linux operating system by exploring its Git repository history. |
| Generating Keywords for Google Ads | Data manipulation | Automatically generate keywords for a search engine marketing campaign using Python. |
| A Visual History of Nobel Prize Winners | Data manipulation & visualization | Explore a dataset from Kaggle containing a century's worth of Nobel Laureates. Who won? Who got snubbed? |
| Analyze your Runkeeper Fitness Data | Data manipulation & visualization | Import, clean, and analyze 7 years worth of training data tracked on the Runkeeper app. |
| Dr. Semmelweis and the Discovery of Handwashing | Data manipulation & visualization | Reanalyze the data behind one of the most important discoveries of modern medicine: handwashing. |
| Exploring the Bitcoin Cryptocurrency Market | Data manipulation & visualization | Explore the market capitalization of Bitcoin and other cryptocurrencies. |
| Name Game: Gender Prediction using Sound | Data manipulation & visualization | Analyze the gender distribution of children's book writers and use sound to match names to gender. |
| Real-time Insights from Social Media Data | Data manipulation & visualization | Analyze Twitter data and do a deep dive into a hot trend. |
| Recreating John Snow's Ghost Map | Data manipulation & visualization | Recreate John Snow's famous map of the 1854 cholera outbreak in London. |
| The Android App Market on Google Play | Data manipulation & visualization | Load, clean, and visualize scraped Google Play Store data to understand the Android app market. |
| The GitHub History of the Scala Language | Data manipulation & visualization | Find the true Scala experts by exploring its development history in Git and GitHub. |
| TV, Halftime Shows, and the Big Game | Data manipulation & visualization | Load, clean, and explore Super Bowl data in the age of soaring ad costs and flashy halftime shows. |
| Up and Down with the Kardashians | Data manipulation & visualization | Plot Google Trends data to find the most famous Kardashian/Jenner sister. Is it Kim? Kendall? Kylie? |
| Who is Drunk and When in Ames, Iowa? | Data manipulation & visualization | Flex your pandas muscles on breath alcohol test data from Ames, Iowa, USA. |
| Which Debts are Worth the Bank's Effort? | Data manipulation & visualization; Statistics | Play bank data scientist and use regression discontinuity to see which debts are worth collecting. |
| Disney Movies and Box Office Success | Data manipulation & visualization; Statistics (Linear regression) | Explore Disney movie data, then build a linear regression model to predict box office success. |
| Risk and Returns: The Sharpe Ratio | Data manipulation & visualization; Statistics | Use pandas to calculate and compare profitability and risk of different investments using the Sharpe Ratio. |
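
As a rough illustration of the calculation behind the Sharpe Ratio project, the sketch below computes an annualized Sharpe ratio from two return series. It is not taken from the project notebook: the function name `sharpe_ratio`, the `periods_per_year` parameter, and the sample figures are assumptions made for this example, and it uses the standard library's `statistics` module instead of pandas so that it runs without extra dependencies.

```python
from statistics import mean, stdev


def sharpe_ratio(asset_returns, benchmark_returns, periods_per_year=252):
    """Annualized Sharpe ratio of an asset measured against a benchmark.

    Hypothetical helper for illustration; 252 assumes daily returns
    over a typical trading year.
    """
    if len(asset_returns) != len(benchmark_returns):
        raise ValueError("return series must have the same length")
    # Excess return per period: asset return minus benchmark return.
    excess = [a - b for a, b in zip(asset_returns, benchmark_returns)]
    # Mean excess return divided by its sample standard deviation,
    # scaled by the square root of the number of periods per year.
    return mean(excess) / stdev(excess) * periods_per_year ** 0.5


# Made-up daily returns, for demonstration only.
asset = [0.02, 0.01, 0.03]
benchmark = [0.01, 0.01, 0.01]
print(sharpe_ratio(asset, benchmark))
```

A higher value suggests more excess return per unit of volatility; comparing the ratio across investments is the kind of analysis the project description refers to.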

Machine & Deep Learning

| Project | Core Skill(s) | Description |
| --- | --- | --- |
| Predicting Credit Card Approvals | Machine learning | Build a machine learning model to predict if a credit card application will get approved. |
| Who's Tweeting? Trump or Trudeau? | Machine learning | Build a machine learning classifier that knows whether President Trump or Prime Minister Trudeau is tweeting! |
| Reducing Traffic Mortality in the USA | Machine learning (Clustering); Data manipulation & visualization | How can we find a good strategy for reducing traffic-related deaths? |
| Classify Song Genres from Audio Data | Machine learning (Decision tree); Statistics | Rock or rap? Apply machine learning methods in Python to classify songs into genres. |
| Naïve Bees: Predict Species from Images | Machine learning (SVM) | Build a model that can automatically detect honey bees and bumble bees in images and determine the accuracy of the model. |
| Give Life: Predict Blood Donations | Machine learning (TPOT); Statistics | Build a binary classifier to predict if a blood donor is likely to donate again. |
| Compare Cosmetics by Ingredients | Machine learning (t-SNE); Data visualization (Bokeh) | Process ingredient lists for cosmetics on Sephora then visualize similarity using t-SNE and Bokeh. |
| ASL Recognition with Deep Learning | Deep learning (CNN) | Build a convolutional neural network to classify images of letters from American Sign Language. |
| Naïve Bees: Deep Learning with Images | Deep learning (Keras) | Build a deep learning model that can automatically detect honey bees and bumble bees in images and determine the loss and accuracy metrics. |

Other Skill Categories

| Project | Core Skill(s) | Description |
| --- | --- | --- |
| Mobile Games A/B Testing with Cookie Cats | A/B testing | Analyze an A/B test from the popular mobile puzzle game, Cookie Cats. |
| Naïve Bees: Image Loading and Processing | Image processing | Load, transform, and understand images of honey bees and bumble bees in Python. |
| A Network Analysis of Game of Thrones | Network analysis | Analyze the network of characters in Game of Thrones and how it changes over the course of the books. |
| Book Recommendations from Charles Darwin | NLP | Build a book recommendation system using NLP and the text of books like "On the Origin of Species." |
| Word Frequency in Moby Dick | NLP; Web scraping | Use web scraping and NLP to find the most frequent words in Herman Melville's novel, Moby Dick. |
| Do Left-handed People Really Die Young? | Statistics (Bayes) | Use pandas and Bayesian statistics to see if left-handed people actually die earlier than righties. |
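
To give a flavor of the word-frequency idea in the Moby Dick project, here is a minimal sketch that counts the most common words in a string. It is an assumption-laden stand-in, not the notebook's code: the helper `top_words`, the tiny `STOPWORDS` set, and the sample sentence are invented for this example, and it relies only on the standard library, so the web scraping step and any NLP library the project may use are left out.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption for this sketch);
# a real analysis would use a much fuller list.
STOPWORDS = {"the", "and", "of", "a", "to", "in", "is", "it"}


def top_words(text, n=3):
    """Return the n most frequent non-stop-words as (word, count) pairs."""
    # Lowercase the text and pull out runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return counts.most_common(n)


# Made-up sample text, for demonstration only.
sample = "The whale, the whale! Call me Ishmael. The sea and the whale."
print(top_words(sample, 1))  # [('whale', 3)]
```

Filtering stop words before counting keeps very common function words from crowding out the terms that characterize the text.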
