gnocchi's starred repositories

lodash

A modern JavaScript utility library delivering modularity, performance, & extras.

Language: JavaScript | License: NOASSERTION | Stargazers: 59237 | Issues: 842 | Issues: 4251

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 38839 | Issues: 2034 | Issues: 0

gulp

A toolkit to automate & enhance your workflow

Language: JavaScript | License: MIT | Stargazers: 32953 | Issues: 1028 | Issues: 1926

bluebird

🐦 ⚡ Bluebird is a full featured promise library with unmatched performance.

Language: JavaScript | License: MIT | Stargazers: 20440 | Issues: 345 | Issues: 1148

RxJS

The Reactive Extensions for JavaScript

Language: JavaScript | License: NOASSERTION | Stargazers: 19498 | Issues: 549 | Issues: 874

ipython

Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

Language: Python | License: BSD-3-Clause | Stargazers: 16186 | Issues: 745 | Issues: 7324

word_cloud

A little word cloud generator in Python

Language: Python | License: MIT | Stargazers: 10012 | Issues: 217 | Issues: 518

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Language: Java | License: GPL-3.0 | Stargazers: 9553 | Issues: 487 | Issues: 1112

csvkit

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

Language: Python | License: MIT | Stargazers: 5898 | Issues: 128 | Issues: 910

objective-c-style-guide

The Objective-C Style Guide used by The New York Times

luvit

Lua + libUV + jIT = pure awesomesauce

Language: Lua | License: Apache-2.0 | Stargazers: 3783 | Issues: 174 | Issues: 427

stripe-node

Node.js library for the Stripe API.

Language: TypeScript | License: MIT | Stargazers: 3730 | Issues: 83 | Issues: 859

mrjob

Run MapReduce jobs on Hadoop or Amazon Web Services

Language: Python | License: NOASSERTION | Stargazers: 2614 | Issues: 111 | Issues: 1299

skyline

It'll detect your anomalies! Part of the Kale stack.

Language: Python | License: NOASSERTION | Stargazers: 2133 | Issues: 174 | Issues: 52

disco

a Map/Reduce framework for distributed computing

Language: Erlang | License: BSD-3-Clause | Stargazers: 1631 | Issues: 85 | Issues: 418

stripe-python

Python library for the Stripe API.

Language: Python | License: MIT | Stargazers: 1610 | Issues: 41 | Issues: 367

spark

Lightning-fast cluster computing in Java, Scala and Python.

Language: Scala | Stargazers: 1427 | Issues: 200 | Issues: 0

dstk

A collection of the best open data sets and open-source tools for data science

dablooms

scaling, counting, bloom filter library

bloomfilter.js

JavaScript bloom filter using FNV for fast hashing

Language: JavaScript | License: BSD-3-Clause | Stargazers: 757 | Issues: 22 | Issues: 18

cubesviewer

Explore and visualize analytical datasets

Language: JavaScript | License: NOASSERTION | Stargazers: 440 | Issues: 41 | Issues: 78

brushfire

Distributed decision tree ensemble learning in Scala

Language: Scala | License: NOASSERTION | Stargazers: 393 | Issues: 95 | Issues: 30

Elements-of-Statistical-Learning

Contains LaTeX, SciPy and R code providing solutions to exercises in Elements of Statistical Learning (Hastie, Tibshirani & Friedman)

Language: R | Stargazers: 291 | Issues: 25 | Issues: 0

botomatic

easily create twitter bots in python

Language: Python | License: BSD-2-Clause | Stargazers: 291 | Issues: 24 | Issues: 5

rosetta

Tools, wrappers, etc... for data science with a concentration on text processing

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 206 | Issues: 22 | Issues: 21

GitPad

Notepad.exe as Git commit editor

Language: C# | License: MIT | Stargazers: 185 | Issues: 284 | Issues: 18

pydata2014nyc

Materials for my pandas tutorial at PyData 2014, NYC

Language: Perl | Stargazers: 110 | Issues: 10 | Issues: 0

pydata-nyc-advanced-sklearn

Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.

License: CC0-1.0 | Stargazers: 69 | Issues: 14 | Issues: 0

probably

Probabilistic Data Structures in Python (originally presented at PyData 2013)

Language: Python | License: MIT | Stargazers: 55 | Issues: 32 | Issues: 2

luvit-redis

fast luvit redis client