scrapinghub / streamparse

streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

logo

Build Status

streamparse lets you run Python code against real-time streams of data. It also integrates Python smoothly with Apache Storm.

It can be viewed as a more robust alternative to Python worker-and-queue systems, as might be built atop frameworks like Celery and RQ. It offers a way to do "real-time map/reduce style computation" against live streams of data. It can also be a powerful way to scale long-running, highly parallel Python processes in production.

Demo

Documentation

http://streamparse.readthedocs.org/en/latest/

User Group

Follow the project's progress, get involved, submit ideas and ask for help via our Google Group, streamparse@googlegroups.com.

Contributors

Alphabetical, by last name:

Roadmap

See the Roadmap.

About

streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.

License:Apache License 2.0


Languages

Language:Python 81.4%Language:Ruby 12.2%Language:HTML 3.8%Language:Clojure 2.0%Language:Shell 0.6%Language:Java 0.1%