zhengyihzw's repositories
extraction
A Python library for extracting titles, images, descriptions and canonical urls from HTML.
Ghost.py
Webkit based scriptable web browser for python.
Language:JavaScript000
opinion-mining
Mining people opinions around new products
Language:Python000
Snabler
Parallel Algorithms in Python for Hadoop/Mapreduce
Language:Python000
the-craft-of-selfteaching
One has no future if he couldn't teach himself.
TwitterCommunityDetection
Community Detection for Twitter follower network of 40 million users using mapreduce