Don's repositories
python-patterns
A collection of design patterns implemented (by other people) in Python
data-science-toolbox
A collection of command-line tools that facilitate the obtaining, scrubbing, and exploring of data.
Creating-maps-in-R
Introductory tutorial on graphical display of geographical information in R, to contribute to teaching material
data-tools
File format conversion tools
linux-2.6
Old mirror of Linus Torvalds's kernel tree; see the "linux" repo for the current version
csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
slidify
Generate reproducible HTML5 slides from R Markdown
seasonal
R interface to X-13ARIMA-SEATS
RecordStream
Command-line tools for slicing and dicing JSON records.
ansj_seg
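ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.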
pizn.github.com
A personal blog built with Jekyll, for quickly recording the small moments of work, study, and life. More sharing, more exchange, more progress.
joshualeung.github.com
Joshua's personal blog
klib
A standalone and lightweight C library
knitr-examples
A collection of knitr examples
cracking-the-coding-interview
Solutions for the book: Cracking the coding interview V4. Written in C++.
json2csv
Command-line tool to convert JSON to CSV
ML_for_Hackers
Code accompanying the book "Machine Learning for Hackers"
r-ninja
Ninja secrets of the R language
akela
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
geoip-api-python
GeoIP Python API
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming in data analysis with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
scala-k-means
k-means
douban-client
Python client library for Douban APIs (OAuth 2.0)
ggthemes
ggplot themes and scales
sina_reptile
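Fetches basic profile information for 10 million Sina Weibo users, plus the 50 most recent posts of each crawled user. Written in Python, crawls with multiple processes, and stores the data in MongoDB.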
dataanalysis
The lecture slides for Coursera's Data Analysis class
RcppNaiveBayes
A Naive RcppNaiveBayes for simple text classification