gnocchi's starred repositories

lodash

A modern JavaScript utility library delivering modularity, performance, & extras.

Language:JavaScriptLicense:NOASSERTIONStargazers:59328Issues:842Issues:4258

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:38988Issues:2029Issues:0

gulp

A toolkit to automate & enhance your workflow

Language:JavaScriptLicense:MITStargazers:32950Issues:1026Issues:1926

bluebird

:bird: :zap: Bluebird is a full featured promise library with unmatched performance.

Language:JavaScriptLicense:MITStargazers:20448Issues:345Issues:1149

RxJS

The Reactive Extensions for JavaScript

Language:JavaScriptLicense:NOASSERTIONStargazers:19492Issues:548Issues:874

ipython

Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:16204Issues:745Issues:7328

word_cloud

A little word cloud generator in Python

Language:PythonLicense:MITStargazers:10024Issues:217Issues:518

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Language:JavaLicense:GPL-3.0Stargazers:9581Issues:487Issues:1115

csvkit

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

Language:PythonLicense:MITStargazers:5913Issues:129Issues:912

objective-c-style-guide

The Objective-C Style Guide used by The New York Times

luvit

Lua + libUV + jIT = pure awesomesauce

Language:LuaLicense:Apache-2.0Stargazers:3784Issues:174Issues:428

stripe-node

Node.js library for the Stripe API.

Language:TypeScriptLicense:MITStargazers:3741Issues:83Issues:862

mrjob

Run MapReduce jobs on Hadoop or Amazon Web Services

Language:PythonLicense:NOASSERTIONStargazers:2613Issues:111Issues:1299

skyline

It'll detect your anomalies! Part of the Kale stack.

Language:PythonLicense:NOASSERTIONStargazers:2134Issues:174Issues:52

disco

a Map/Reduce framework for distributed computing

Language:ErlangLicense:BSD-3-ClauseStargazers:1631Issues:85Issues:418

stripe-python

Python library for the Stripe API.

Language:PythonLicense:MITStargazers:1612Issues:42Issues:369

spark

Lightning-fast cluster computing in Java, Scala and Python.

Language:ScalaStargazers:1427Issues:200Issues:0

dstk

A collection of the best open data sets and open-source tools for data science

dablooms

scaling, counting, bloom filter library

bloomfilter.js

JavaScript bloom filter using FNV for fast hashing

Language:JavaScriptLicense:BSD-3-ClauseStargazers:757Issues:22Issues:18

cubesviewer

Explore and visualize analytical datasets

Language:JavaScriptLicense:NOASSERTIONStargazers:440Issues:41Issues:78

brushfire

Distributed decision tree ensemble learning in Scala

Language:ScalaLicense:NOASSERTIONStargazers:391Issues:94Issues:30

Elements-of-Statistical-Learning

Contains LaTeX, SciPy and R code providing solutions to exercises in Elements of Statistical Learning (Hastie, Tibshirani & Friedman)

Language:RStargazers:291Issues:25Issues:0

botomatic

easily create twitter bots in python

Language:PythonLicense:BSD-2-ClauseStargazers:291Issues:24Issues:5

rosetta

Tools, wrappers, etc... for data science with a concentration on text processing

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:206Issues:22Issues:21

GitPad

Notepad.exe as Git commit editor

Language:C#License:MITStargazers:185Issues:284Issues:18

pydata2014nyc

Materials for my pandas tutorial at PyData 2014, NYC

Language:PerlStargazers:110Issues:10Issues:0

pydata-nyc-advanced-sklearn

Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.

License:CC0-1.0Stargazers:69Issues:14Issues:0

probably

Probabilistic Data Structures in Python (originally presented at PyData 2013)

Language:PythonLicense:MITStargazers:55Issues:32Issues:2

luvit-redis

fast luvit redis client