Marco Didonna (noiano)

Company: Prometeia

Location: Milan

Home Page: twitter.com/noiano

Twitter: @noiano

Marco Didonna's starred repositories

storm

Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more

Language: Java | License: Apache-2.0 | Stargazers: 8836 | Issues: 0

wikihadoop

Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop

Language: Java | Stargazers: 86 | Issues: 0

HadoopPerceptron

http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf

Language: Java | Stargazers: 14 | Issues: 0

twitter_nlp

Twitter NLP Tools

Language: HTML | License: GPL-3.0 | Stargazers: 882 | Issues: 0

ark-tweet-nlp

CMU ARK Twitter Part-of-Speech Tagger

Language: Java | License: NOASSERTION | Stargazers: 575 | Issues: 0

curator

ZooKeeper client wrapper and rich ZooKeeper framework

Language: Java | License: NOASSERTION | Stargazers: 2156 | Issues: 0

MongoReduce

Hadoop Input and Output formats for MongoDB

Language: Java | Stargazers: 29 | Issues: 0

Yahoo_LDA

Yahoo!'s topic modelling framework using Latent Dirichlet Allocation

Language: C++ | License: Apache-2.0 | Stargazers: 338 | Issues: 0

cascading.solr

Cascading scheme for Solr

Language: Java | Stargazers: 27 | Issues: 0

kundera

A JPA 2.1 compliant Polyglot Object-Datastore Mapping Library for NoSQL Datastores. Please subscribe to:

Language: Java | License: Apache-2.0 | Stargazers: 903 | Issues: 0

python-snappy

Python bindings for Google's Snappy library

Language: Python | License: NOASSERTION | Stargazers: 476 | Issues: 0

commons

Twitter common libraries for Python and the JVM (deprecated)

Language: Java | License: NOASSERTION | Stargazers: 2099 | Issues: 0

Pig-scripting-examples

Examples of using Pig's scripting-language capabilities

Language: Python | Stargazers: 39 | Issues: 0

grouperfish

Text clustering service for the web

Language: Java | License: NOASSERTION | Stargazers: 25 | Issues: 0

gh4a

GitHub client for Android

Language: Java | License: Apache-2.0 | Stargazers: 1691 | Issues: 0

elephantdb

Distributed database specialized in exporting key/value data from Hadoop

Language: Java | License: BSD-3-Clause | Stargazers: 558 | Issues: 0

goldenorb

GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework

Language: Java | License: Apache-2.0 | Stargazers: 293 | Issues: 0

dotfiles

~grb. Things in here are often interdependent. A lot of stuff relies on scripts in bin/.

Language: Vim Script | Stargazers: 1900 | Issues: 0

wonderdog

Bulk loading for Elasticsearch

Language: Java | License: Apache-2.0 | Stargazers: 186 | Issues: 0

firesheep

A Firefox extension that demonstrates HTTP session hijacking attacks.

Language: C++ | License: GPL-3.0 | Stargazers: 8 | Issues: 0

Ivory

A Hadoop toolkit for web-scale information retrieval research

Language: Java | Stargazers: 79 | Issues: 0

Cloud9

Cloud9 is a Hadoop toolkit for working with big data

Language: Java | Stargazers: 237 | Issues: 0

behemoth

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

Language: Java | License: NOASSERTION | Stargazers: 281 | Issues: 0

flockdb

A distributed, fault-tolerant graph database

Language: Scala | License: NOASSERTION | Stargazers: 3337 | Issues: 0

cascalog

Data processing on Hadoop without the hassle.

Language: Clojure | License: NOASSERTION | Stargazers: 1376 | Issues: 0

elephant-bird

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

Language: Java | License: Apache-2.0 | Stargazers: 1139 | Issues: 0

mitmproxy

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.

Language: Python | License: MIT | Stargazers: 36115 | Issues: 0

hector

A high-level client for Cassandra

Language: Java | License: MIT | Stargazers: 644 | Issues: 0

hbasene

HBase as the backing store for the TF-IDF representations for Lucene

Language: Java | License: Apache-2.0 | Stargazers: 108 | Issues: 0

FileSetInputFormat

A Hadoop input format for sending lists of files as keys to a mapper. Set the list of files, and an input split will be created per file. Each map task gets only one input key: the filename for its split.

Language: Java | Stargazers: 16 | Issues: 0