Faridani's starred repositories
Mturk-Tracker
Software for gathering historical data from the Amazon Mechanical Turk service
CCA-Crawler
A web crawler that helps us collect data for CCA
ML_for_Hackers
Code accompanying the book "Machine Learning for Hackers"
Behavioral-Data-Mining
Behavioral Data Mining
reverse-stemmer
Takes a corpus and returns all the original words for each stem
PySentiment
Sentiment Analysis in Python
Rare-Words-Finder
Find rare words in a corpus
WikiLeaks_Analysis
Scripts and analysis in support of a statistical study of the WikiLeaks Afghanistan data
R-Programs
A Variety of R Programs
yourworldoftext
Your World of Text is an infinite grid of text editable by any visitor.
flume
WE HAVE MOVED to Apache Incubator (https://cwiki.apache.org/FLUME/). Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault-tolerant, with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple, extensible data model that allows for online analytic applications.
CleverAlgorithms
Clever Algorithms: Nature-Inspired Programming Recipes