John Dennison's repositories
pytest-docker
py.test helpers to test with docker containers
arrow
Apache Arrow is a columnar in-memory analytics layer designed to accelerate big data. It houses a set of canonical in-memory representations of flat and hierarchical data along with multiple language bindings for structure manipulation. It also provides IPC and common algorithm implementations.
aws-es-kibana
AWS ElasticSearch Kibana Proxy
confluent-kafka-python
Confluent's Apache Kafka Python client
django-cumulus
An interface to python-swiftclient and rackspace cloudfiles API from Django.
elasticsearch-py
Official Python low-level client for Elasticsearch.
flask-caching
Continuation of the Flask-Cache Extension.
hive-json-schema
Tool to generate a Hive schema from an example JSON document
Hive-JSON-Serde
Read/write JSON SerDe for Apache Hive.
librdkafka
The Apache Kafka C/C++ library
marathon-python
Python client library for Mesos Marathon's REST API
mongo-python-driver
PyMongo - the Python driver for MongoDB
pipelinedb
The Streaming SQL Database
presto
Distributed SQL query engine for big data
presto-kinesis
Presto connector to Amazon Kinesis service.
presto-udfs
Plugin for Presto to allow addition of user functions easily
pykafka
Kafka client for Python
streamparse
streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.
t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
wabbit_wappa
Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.