Dave Gerson's starred repositories
patient2vec
Embedding Complexity In the Data Representation Instead of In the Model (arXiv:1802.04233)
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
open-source-cs-degree
The Open Source Computer Science Degree
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
plascalding
Perceptron Learning Algorithm using Scalding Typed API
stats-testing-in-python
Collection of IPython notebooks for the "How to Analyse an Online Experiment in Python" tutorial
template-scala-parallel-complementarypurchase
PredictionIO Complementary Purchase Engine Template (Scala-based parallelized engine)
predictionio-template-ecom-recommender
PredictionIO E-Commerce Recommendation Engine Template (Scala-based parallelized engine)
SparkR-pkg
R frontend for Spark
java-libpst
A library to read PST files with Java, without the need for external libraries.
awesome-artificial-intelligence
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
HadoopInternals
Diagrams describing Apache Hadoop internals (2.3.0 or later).
data_hacks
Command line utilities for data analysis
free-data-science-books
Free resources for learning data science