Dave Gerson's starred repositories

Language: R | License: GPL-3.0 | Stargazers: 10 | Issues: 0

patient2vec

Embedding Complexity In the Data Representation Instead of In the Model (arXiv:1802.04233)

Language: Jupyter Notebook | Stargazers: 24 | Issues: 0
Language: HTML | License: NOASSERTION | Stargazers: 122 | Issues: 0

forecast

Forecasting Functions for Time Series and Linear Models

Language: R | Stargazers: 1105 | Issues: 0

Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Language: Jupyter Notebook | License: MIT | Stargazers: 26517 | Issues: 0

lifetimes

Lifetime value in Python

Language: Python | License: MIT | Stargazers: 1444 | Issues: 0

lifelines

Survival analysis in Python

Language: Python | License: MIT | Stargazers: 2321 | Issues: 0

go

The Open Source Data Science Masters

License: Unlicense | Stargazers: 24644 | Issues: 0

open-source-cs-degree

The Open Source Computer Science Degree

Stargazers: 3524 | Issues: 0
Language: HTML | License: CC-BY-4.0 | Stargazers: 407 | Issues: 0

data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Language: Python | License: NOASSERTION | Stargazers: 26878 | Issues: 0

plascalding

Perceptron Learning Algorithm using Scalding Typed API

Language: Scala | Stargazers: 1 | Issues: 0

pyLDAvis

Python library for interactive topic model visualization. Port of the R LDAvis package.

Language: JavaScript | License: BSD-3-Clause | Stargazers: 1 | Issues: 0

aas

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Language: Scala | License: NOASSERTION | Stargazers: 1514 | Issues: 0

awesome-R

A curated list of awesome R frameworks, packages and software.

Language: R | Stargazers: 1 | Issues: 0

stats-testing-in-python

Collection of IPython notebooks for the "How to Analyse an Online Experiment in Python" tutorial

Language: Jupyter Notebook | Stargazers: 60 | Issues: 0

azrael

Physics Simulation For Engineers

Language: Python | Stargazers: 42 | Issues: 0

template-scala-parallel-complementarypurchase

PredictionIO Complementary Purchase Engine Template (Scala-based parallelized engine)

Language: Scala | Stargazers: 16 | Issues: 0

predictionio-template-ecom-recommender

PredictionIO E-Commerce Recommendation Engine Template (Scala-based parallelized engine)

Language: Scala | License: Apache-2.0 | Stargazers: 74 | Issues: 0

joinery

Data frames for Java

Language: Java | License: GPL-3.0 | Stargazers: 692 | Issues: 0

SparkR-pkg

R frontend for Spark

Language: R | License: Apache-2.0 | Stargazers: 642 | Issues: 0

java-libpst

A library to read PST files with Java, without the need for external libraries.

Language: Java | Stargazers: 247 | Issues: 0

awesome-artificial-intelligence

A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.

Stargazers: 10247 | Issues: 0

HadoopInternals

Diagrams describing Apache Hadoop internals (2.3.0 or later).

Language: HTML | Stargazers: 429 | Issues: 0
Language: C++ | License: LGPL-3.0 | Stargazers: 3217 | Issues: 0

jetpack

Get up and running w/ machine learning using Docker

Language: Shell | License: AGPL-3.0 | Stargazers: 1 | Issues: 0

RHadoop

RHadoop

Stargazers: 763 | Issues: 0

framework

Machine learning, computer vision, statistics and general scientific computing for .NET

Language: C# | License: LGPL-2.1 | Stargazers: 4467 | Issues: 0

data_hacks

Command line utilities for data analysis

Language: Python | Stargazers: 1937 | Issues: 0

free-data-science-books

Free resources for learning data science

License: Unlicense | Stargazers: 2868 | Issues: 0