Jesse Fredrickson's repositories
DEND_Capstone
ETL of large OpenDotA dataset (S3, Spark EMR, RedShift)
data-scientist-nanodegree
Solutions to exercises and projects for Udacity's Data Scientist Nanodegree
jf_twitch_streams
Live and historical twitch streaming and viewing metrics
A3C_Doom
VizDoom played by an A3C Network on Tensorflow 2.0
Analysis-Bikeshare-data
bikeshare inline analysis
Analysis-movie-ratings
imdb top 100 rating and budget analysis
augmented-volumetric-image-generator
Customised Keras' ImageDataGenerator for 3D volumetric medical image
DAND_t1_p4
A/B Testing
DAND_t2_p2
White Wine EDA
DAND_t2_p3
WeLoveDogs tweet analysis (do you like Samoyeds?)
DAND_t2_p4
Tableau dashboard of airport flight delays
DEND_Airflow
Airflow managed ETL from S3 to Redshift
DEND_AWS
Stand up redshift with boto3, connect to and use postgresql for staging
DEND_DataLake
Using AWS EMR to run a spark job which performs ETL on an S3 json dataset
DEND_t1_p1
Postgres Data modeling
DEND_t1_p2
Data Modeling with NoSQL
DSND_ML_Pipeline
NLP ML Pipeline for processing tweet data
DSND_t1_p1_charityml
Machine learning to estimate population income based on other features
DSND_t1_p2_image_classifier
Training a deep neural net to predict the species of a flower in an image
DSND_t1_p3_identify_customer_segments
Customer segments in a new market; PCA, Clustering
DSND_t2_p1_cardata
Data Analysis of 1990-2019 car MPG
DSND_t2_p3
IBM recommendation strategies
DSND_Term1
Contains files related to content and project of DSND
fashion_MNIST
using a few NNs to achieve high accuracy on the fashion MNIST data set
pipeline_template
Forkable starter code which contains a flexible sklearn-based pipeline implementation, improved feature importances, and logging
rateme
scraping of the r/rateme subreddit
stackoverflow
Findings from Stackoverflow 2017