Cyrus Dioun's repositories
DLAB-Text-Working-Group
Berkeley DLAB Python & Text Working Group. Students contribute files by working through tutorials (Neal Caren/NLTK) and by developing scripts that build upon, concatenate, and/or adapt open source code.
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
causalTree
Working repository for Causal Tree and extensions
data-police-shootings
The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty in 2015 and 2016.
ggplot2
An implementation of the Grammar of Graphics in R
git-fundamentals
A starting point for discovering the wonderful world of Git, GitHub, and Git Annex (Assistant)
go
The Open Source Data Science Masters
h2o-training
Training material
h2oEnsemble-benchmarks
Benchmarks of the H2O Ensemble R interface.
hadoop
Scripts for MapReduce processing of Twitter data.
pandas-cookbook
Recipes for using Python's pandas library
parallel_ml_tutorial
Tutorial on scikit-learn and IPython for parallel machine learning
r_useful_dlab
Materials for Useful Stuff in R Workshop, D-Lab, UC Berkeley
reddit-analysis
A Python script that parses post titles, self-texts, and comments on Reddit and makes word clouds out of the word frequencies
scikit-learn
scikit-learn: machine learning in Python
scikit-learn-tutorial
Applied Machine Learning in Python with scikit-learn
skimage-tutorials
Scikit-image tutorials
training-scripts
Scripts to launch cluster used for Strata
Twitter-LDA
Latent Dirichlet Allocation (LDA) model for Microblogs (Twitter, Weibo, etc.)