L Greenford's repositories
etsy-dataviz
Etsy Data Visualization App
data-scientists-guide-apache-spark
Best practices of using Spark for practicing data scientists in the context of a data scientist’s standard workflow.
dataweek-workshop
Machine learning workshop using Python, pandas, and scikit-learn. The first half of the day covered supervised classification using Logistic Regression and how to use cross validation to evaluate your models . The second half of the day covered unsupervised clustering with Kmeans as well as an overview of the data science process.
deep-photo-styletransfer
Code and data for paper "Deep Photo Style Transfer": https://arxiv.org/abs/1703.07511
GalvanizeCapstone
This is a repository to use for my Galvanize Capstone Project
LearnDataScience
Open Content for self-directed learning in data science
Lessons-Learned-Data-Science-Interviews
Lessons learned the hard way through over 30+ data science interviews
MSongsDB
Code for the Million Song Dataset, the dataset contains metadata and audio analysis for a million tracks, a collaboration between The Echo Nest and LabROSA. See website for details.
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
self-study-resources
DSI Self Study Resources
spark-install
Installation guide for Apache Spark + Hadoop on Mac/Linux
statlearning-notebooks
Python notebooks for exercises covered in Stanford statlearning class (where exercises were in R).
tensorflow
Computation using data flow graphs for scalable machine learning
ZA-Final-Project
Zipfian Academy Final Project - Twitter Community Detection