elmiko / grafzahl

Graf Zahl

Count words on an Apache Kafka topic and present them to web clients
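
As a rough illustration of what "counting words on a topic" means, the sketch below keeps a running tally in plain Python. It is not the app's Spark code: the helper names `update_counts` and `top_words` and the sample messages are hypothetical, chosen only to show the idea of folding each incoming message into a per-word count.

```python
from collections import Counter


def update_counts(counts, message):
    """Fold the whitespace-separated words of one message into the tally."""
    counts.update(message.split())
    return counts


def top_words(counts, n=3):
    """Return the n most frequent words as (word, count) pairs."""
    return counts.most_common(n)


# Hypothetical messages standing in for records read from the topic.
counts = Counter()
for message in ["graf zahl counts", "zahl counts words", "counts"]:
    update_counts(counts, message)
```

Calling `top_words(counts, 2)` on the tally above returns the two most frequent words with their counts, which is the kind of summary a web client could display.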

Quick start

  1. Install app requirements

    pip install -r requirements.txt
  2. Get a copy of Apache Spark (requires 2.1.0+)

    mkdir spark
    curl https://www.apache.org/dist/spark/spark-2.1.0/spark-2.1.0-bin-hadoop2.7.tgz | tar zx -C spark --strip-components=1
  3. Set up Apache Kafka

  4. Run the app

    spark/bin/spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.11:2.1.0 app.py
  5. Connect to the app

  6. Publish some words to the word-fountain topic
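
One possible way to publish words for the last step is a small Python producer. This is a minimal sketch, not part of the repository: it assumes the `kafka-python` package is installed and that a broker is reachable at `localhost:9092`; the helper name `publish_words` and the sample sentence are hypothetical.

```python
def publish_words(producer, text, topic="word-fountain"):
    """Send each whitespace-separated word in text as its own message.

    `producer` is any object with send(topic, value) and flush(),
    such as kafka-python's KafkaProducer. Returns the number of words sent.
    """
    words = text.split()
    for word in words:
        # Kafka message values are bytes, so encode each word.
        producer.send(topic, word.encode("utf-8"))
    # Block until buffered messages have been handed to the broker.
    producer.flush()
    return len(words)


def main():
    # Assumption: kafka-python is installed and a broker listens on
    # localhost:9092. The import lives here so publish_words itself
    # carries no dependency on the library.
    from kafka import KafkaProducer

    producer = KafkaProducer(bootstrap_servers="localhost:9092")
    sent = publish_words(producer, "one two three two three three")
    print("published", sent, "words")
```

With Kafka running, calling `main()` should push six messages onto `word-fountain`. Adjust the broker address to match your own setup.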
