Count words on an Apache Kafka topic and present them to web clients
-
Install app requirements
pip install -r requirement.txt
-
Get a copy of Apache Spark (requires 2.1.0+)
mkdir spark curl https://www.apache.org/dist/spark/spark-2.1.0/spark-2.1.0-bin-hadoop2.7.tgz | tar zx -C spark --strip-components=1
-
Run the app
spark/bin/spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.11:2.1.0 app.py
-
Publish some words to topic
word-fountain