Robert Smith's repositories
statistical-distributions-in-python
Python package for statistical distributions
sparkify-customer-retention
Customer Retention Modeling using PySpark and Shap
pymc-resources
PyMC educational resources
introtodeeplearning
Lab Materials for MIT 6.S191: Introduction to Deep Learning
disaster-response-pipeline
A NLP pipeline project for disaster response messages.
census-clustering
Leverage H2O and PySpark to cluster Census data.
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
tsanalysis
Module to perform basic time series exploration in Python.
deploying-machine-learning-models
Example Repo for the Udemy Course "Deployment of Machine Learning Models"
ud120-projects
Starter project code for students taking Udacity ud120
Breast-Cancer-Diagnostics
Machine Learning on UCI's Breast Cancer Diagnostic Data Set
Introduction-to-Statistical-Learning-Solutions
Solutions to select problems from an Introduction to Statistical Learning
Getting-CleaningData
Files related to the JHU Getting & Cleaning Data Class Project
RepData_PeerAssessment1
Peer Assessment 1 for Reproducible Research
MachineLearningFinalProject
Repository for Coursera's Machine Learning Final Project
ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
datasharing
The Leek group guide to data sharing
dsci-benchmark
R scripts for benchmarking next word prediction algorithms developed for the Coursera Data Science Capstone Project.