vambati / pipeline

Real-time, End-to-End, Advanced Analytics and Machine Learning Recommendation Pipeline

Home Page: http://advancedspark.com

## PANCAKE STACK

End-to-End Streaming Advanced Analytics and Machine Learning Recommendation Pipeline

Follow the wiki to set up the Docker-based environment.

Architecture Overview

Screenshots

Apache Zeppelin Notebooks

Stanford CoreNLP Sentiment Analysis

Jupyter/iPython Notebooks

SparkR Notebooks

TensorFlow Notebooks

Apache NiFi Data Flows

AirFlow Workflows

Presto Queries

Tableau Integration

Beeline Command-line Hive Client

Log Visualization with Kibana & Logstash

Spark, Spark Streaming, and Spark SQL Admin UIs

Ganglia System and JVM Metrics Monitoring UIs

Tools Overview

Apache Spark, Redis, Apache Cassandra, Apache Kafka, NiFi, ElasticSearch, Logstash, Kibana, Apache Zeppelin, Ganglia, Hadoop HDFS, iPython Notebook, Docker, Tachyon

License: Other


Languages

Jupyter Notebook 97.6%, Scala 0.6%, Python 0.6%, Shell 0.4%, C++ 0.4%, Java 0.1%, Vim Script 0.1%, JavaScript 0.0%, HTML 0.0%, ApacheConf 0.0%, C 0.0%, XSLT 0.0%, Batchfile 0.0%, Makefile 0.0%