John Dennison's repositories
t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
elasticsearch-py
Official Python low-level client for Elasticsearch.
confluent-kafka-python
Confluent's Apache Kafka Python client
librdkafka
The Apache Kafka C/C++ library
aws-es-kibana
AWS Elasticsearch Kibana Proxy
arrow
Apache Arrow is a columnar in-memory analytics layer designed to accelerate big data. It houses a set of canonical in-memory representations of flat and hierarchical data along with multiple language-bindings for structure manipulation. It also provides IPC and common algorithm implementations.
flask-caching
Continuation of the Flask-Cache Extension.
hive-json-schema
Tool to generate a Hive schema from a JSON example doc
presto-udfs
Plugin for Presto to allow addition of user functions easily
Hive-JSON-Serde
Read/Write JSON SerDe for Apache Hive.
pipelinedb
The Streaming SQL Database
presto
Distributed SQL query engine for big data
marathon-python
Python client library for Mesos Marathon's REST API
pykafka
Kafka client for Python
rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
pyrocksdb
Python bindings for RocksDB
pytest-docker
py.test helpers to test with Docker containers
datadogpy
The Datadog Python library
presto-kinesis
Presto connector to Amazon Kinesis service.
dotfiles
mi dots!
redis-py
Redis Python Client
assertpy
Dead simple assertion framework for unit testing in Python with a fluent API
streamparse
streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.
wabbit_wappa
Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.
mongo-python-driver
PyMongo - the Python driver for MongoDB
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
django-cumulus
An interface to python-swiftclient and the Rackspace Cloud Files API from Django.