Gianmarco De Francisci Morales's repositories
partial-key-grouping
An implementation and example of Partial Key Grouping for Apache Storm. Partial Key Grouping is a load balancing strategy for distributed stream processing systems.
similarity-self-join
Hadoop code for "Document Similarity Self-Join With Mapreduce" (ICDM'10)
twitter-crawler
Crawler for the social network of Twitter
gdfm.github.io
My GitHub user page
incubator-samoa
Mirror of Apache Samoa (Incubating)
kafka
Mirror of Apache Kafka
keystone
Simplifying robust end-to-end machine learning on Apache Spark.
okapi
Large-scale ML & graph analytics on Giraph
samoa
SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.
shobai-dogu
Tools of the trade
storm
Mirror of Apache Storm
vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm