cutter's repositories
vim
vim配置文件和插件
papers
论文放在这里比较保险
neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
parameter_server
A distributed machine learning framework.
display-advertising-challenge
Criteo/Kaggle Competition of CTR prediction
spark-training
Spark training material
practice_assignment
Practice assignment for the R programming class on Coursera
DataScienceSpecialization.github.io
http://DataScienceSpecialization.github.io
kaggle-ctr-prediction
Repo for code of kaggle competition on data by Criteolabs
bash-style-guide
A style guide for writing safe, predictable, and portable bash scripts (not sh!)
SparkInternals
Notes talking about the design and implementation of Apache Spark
h2o
h2o = fast statistical, machine learning & math runtime for bigdata
BuildingMachineLearningSystemsWithPython
Source Code for the book Building Machine Learning Systems with Python
NewStart20140807
The ReadMe File of All Programm
h2o-sparkling
H2O and Spark interoperability
word_cloud
A little word cloud generator in Python
java-deeplearning
Distributed Deep Learning Platform for Java, Clojure,Scala
hia-examples
Hadoop In Action Examples
Programming-Collective-Intelligence
Examples from Programming Collective Intelligence
oryx
Simple real-time large-scale machine learning infrastructure.
OpenRTB
Documentation and issue tracking for the OpenRTB Project
mahout
Mirror of Apache Mahout
hadoop-tutorials-examples
Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.
Predicting_CTR
Predicting Click-Through Rate for New Users and Advertisers
scalacheck
Property-based testing for Scala
Mining-the-Social-Web
The official online compendium for Mining the Social Web (O'Reilly, 2011)