emaxerrno / arroyo


Arroyo

Arroyo is a Python library for working with streaming data.

Most of the code here has been extracted from Snuba so that it can be reused in Sentry and other services.

Arroyo provides:

  • Consumer and producer interfaces. The primary use case is working with Apache Kafka streams, but custom backends are also supported, and local (memory- or file-based) consumer and producer implementations are included.
  • A consumer strategy interface for building the processing strategy that determines how consumed raw messages are filtered, transformed, batched, and flushed as required.
  • A synchronized consumer implementation that follows another consumer group by consuming only messages that the other group has already consumed and committed.
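
The ideas in the list above can be illustrated with a small, self-contained sketch. It does not use Arroyo's actual API: `Message`, `BatchingStrategy`, `SynchronizedConsumer`, and `drain` are hypothetical names invented for this illustration, and the "broker" is just an in-memory list plus a dictionary of committed offsets. It assumes the Kafka convention that a committed offset is the next offset the group will read.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Message:
    offset: int
    payload: str


class BatchingStrategy:
    """Hypothetical strategy: filter, transform, batch, then flush."""

    def __init__(
        self,
        keep: Callable[[Message], bool],
        transform: Callable[[Message], str],
        flush: Callable[[List[str]], None],
        max_batch_size: int,
    ) -> None:
        self._keep = keep
        self._transform = transform
        self._flush = flush
        self._max_batch_size = max_batch_size
        self._batch: List[str] = []

    def submit(self, message: Message) -> None:
        if not self._keep(message):
            return  # filtered out
        self._batch.append(self._transform(message))
        if len(self._batch) >= self._max_batch_size:
            self._flush_batch()

    def close(self) -> None:
        # Flush whatever is left in a partial batch.
        if self._batch:
            self._flush_batch()

    def _flush_batch(self) -> None:
        self._flush(self._batch)
        self._batch = []  # start a fresh list; the flushed one is handed off


class SynchronizedConsumer:
    """Hypothetical follower: only reads offsets the leader group has committed."""

    def __init__(
        self, log: List[Message], committed: Dict[str, int], leader_group: str
    ) -> None:
        self._log = log
        self._committed = committed
        self._leader_group = leader_group
        self._position = 0

    def poll(self) -> Optional[Message]:
        # Assumption: the committed offset is the leader's next offset to read,
        # so everything strictly below it is safe to consume.
        limit = self._committed.get(self._leader_group, 0)
        if self._position >= min(limit, len(self._log)):
            return None
        message = self._log[self._position]
        self._position += 1
        return message


def drain(consumer: SynchronizedConsumer, strategy: BatchingStrategy) -> None:
    """Feed every currently available message into the strategy."""
    while (message := consumer.poll()) is not None:
        strategy.submit(message)


if __name__ == "__main__":
    log = [Message(i, p) for i, p in enumerate(["a", "", "b", "c", "d", "e"])]
    committed = {"leader": 3}  # leader has committed offsets 0..2
    flushed: List[List[str]] = []
    strategy = BatchingStrategy(
        keep=lambda m: bool(m.payload),
        transform=lambda m: m.payload.upper(),
        flush=flushed.append,
        max_batch_size=2,
    )
    follower = SynchronizedConsumer(log, committed, "leader")
    drain(follower, strategy)   # follower stops at the leader's committed offset
    committed["leader"] = 6     # leader makes progress
    drain(follower, strategy)
    strategy.close()
    print(flushed)
```

In this sketch the follower never reads past the leader's committed offset, and the strategy flushes either when a batch is full or when it is closed. Real deployments would replace the in-memory list with a Kafka backend and the `flush` callback with a write to a downstream system.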

About

License: Apache License 2.0


Languages

Python 99.8%, Shell 0.1%, Makefile 0.1%