Gianmarco De Francisci Morales's repositories
partial-key-grouping
An implementation and example of Partial Key Grouping for Apache Storm. Partial Key Grouping is a load balancing strategy for distributed stream processing systems.
similarity-self-join
Hadoop code for "Document Similarity Self-Join With Mapreduce" (ICDM'10)
twitter-crawler
Crawler for the social network of Twitter
gdfm.github.io
My GitHub user page
incubator-samoa
Mirror of Apache Samoa (Incubating)
Language:JavaApache-2.0000
kafka
Mirror of Apache Kafka
Language:ScalaApache-2.0000
keystone
Simplifying robust end-to-end machine learning on Apache Spark.
Language:ScalaApache-2.0000
samoa
SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.
Language:JavaApache-2.0000
shobai-dogu
Tools of the trade
Language:JavaApache-2.0000
storm
Mirror of Apache Storm
Language:JavaApache-2.0000
vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm