SHANE MCCALLUM's repositories
Data-Society-Netflix-Capstone
The capstone I completed for Booz Allen Hamilton's Tech Excellence in Data Science program. It covers a variety of methods for predicting a movie's IMDb score from the features in the data.
4Lk0HtJ8UlUUGOtc
ACME is one of the fastest-growing startups in the logistics and delivery domain. We work with several partners and make on-demand deliveries to our customers. During the COVID-19 pandemic, we are facing several different challenges, and every day we are trying to address them. At ACME we strive to make our customers happy. As a growing startup with a global expansion strategy, we know that we need to make our customers happy, and the only way to do that is to measure how happy each customer is. If we can predict what makes our customers happy or unhappy, we can then take the necessary actions. Getting feedback from customers is not easy either, but we do our best to get constant feedback from them. This is a crucial function for improving our operations across all levels. We recently surveyed a select customer cohort. You are presented with a subset of this data. We will be using the remaining data as a private test set.
Diversity-and-Violence--Is-there-a-connection-
In this capstone, I hope to see whether there exists a strong relationship between the level of diversity in a US county and the rate of violent crime, as reported to the FBI. If there is, I will attempt to develop a model that predicts the level of violent crime within a county given its level of diversity.
Ultimate-Challenge-take-home-assessment
An example of a take-home test
PySpark-SQL-tutorials
PySpark tutorials
E-Commerce-RFM-Classification-Case-Study
A case study demonstrating how to develop a useful RFM classification model and predict customer value from it.
ARIMAX-Gold-and-S-P500-Time-Series
The purpose of this repository is to test the hypothesis that the S&P 500 index has an exogenous relationship to the price of gold; specifically, that as the S&P 500 index falls, the price of gold increases.
PCA-Clustering-Case-Study
PCA and various clustering methods
Case-Study-London-Housing
My Tier 3 case study for Springboard section 4.3
featuretools
Adapted exercise from here: https://github.com/Featuretools/predict-customer-churn/blob/master/churn/3.%20Feature%20Engineering.ipynb
featuretools-1
An open-source Python library for automated feature engineering
CosineSimilarityCaseStudy
Short Cosine Case Study
sklearn_pycon2014
Repository containing files for my PyCon 2014 scikit-learn tutorial.
XGB_Pima_Indians_Diabetes_Example
Example of XGBoost for ensemble methods
Logistic-Regression-Test
Springboard Log Test
Random-Forest-COVID-19
Springboard Random Forest work
BayesianOptimization
A Python implementation of global optimization with Gaussian processes.
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
Decision-Tree-Test--RR-Coffee
From Springboard course
Statistical-inference-Python
Short notebook on computing margin of error (MoE), confidence intervals (CI), and other inferences.
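
For readers unfamiliar with the terms in the last entry, here is a minimal sketch of a margin of error and confidence interval for a sample mean. It is not taken from the notebook itself: the function name `mean_confidence_interval` and the sample values are made up for illustration, and it assumes a normal (z) approximation, which is only reasonable for fairly large samples (a t-distribution is the usual choice for small ones).

```python
import math
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(sample, confidence=0.95):
    """Return (mean, margin_of_error, (low, high)) using a normal (z) approximation.

    Hypothetical helper for illustration only.
    """
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations")
    m = mean(sample)
    # Standard error of the mean, using the sample standard deviation.
    se = stdev(sample) / math.sqrt(n)
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    moe = z * se
    return m, moe, (m - moe, m + moe)


# Made-up data, purely for demonstration.
sample = [4.1, 3.9, 4.3, 4.0, 4.2, 3.8, 4.4, 4.1]
m, moe, (low, high) = mean_confidence_interval(sample)
print(f"mean={m:.3f} MoE={moe:.3f} CI=({low:.3f}, {high:.3f})")
```

Raising `confidence` (say, to 0.99) widens the interval, since the critical value grows while the standard error stays the same.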