darshitmvora / spark-streaming

This repository contains the steps to install Apache Spark, and run an application that consumes the twitter's real-time stream, performs transformations on the data and displays them on a real-time dashboard in Jupyter Notebook.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spark-Streaming

drawing drawing drawing

This repository contains the steps to install Apache Spark, and run an application that consumes the twitter's real-time stream, performs transformations on the data and displays them on a real-time dashboard in Jupyter Notebook.

If you have a windows system, you need to create a linux virtual machine. Instructions are given in SetupLinuxVM.md.

Refer SetupSparkEnvironment.md to install and setup Spark.

RunningTwitterApplication.md explains how to run the application.

About

This repository contains the steps to install Apache Spark, and run an application that consumes the twitter's real-time stream, performs transformations on the data and displays them on a real-time dashboard in Jupyter Notebook.


Languages

Language:Jupyter Notebook 91.6%Language:Python 8.4%