toannhu96 / hadoop-hive-spark-docker

Hadoop-Hive-Spark cluster + Jupyter on Docker

Software

Quick Start

To deploy the cluster, run:

make
docker-compose up

Once the containers are up, access the interfaces at the following URLs:

Hadoop

ResourceManager: http://localhost:8088

NameNode: http://localhost:9870

HistoryServer: http://localhost:19888

Datanode1: http://localhost:9864

Datanode2: http://localhost:9865

NodeManager1: http://localhost:8042

NodeManager2: http://localhost:8043

Spark

master: http://localhost:8080

worker1: http://localhost:8081

worker2: http://localhost:8082

history: http://localhost:18080
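
The Hadoop and Spark web UIs listed above can take a while to come up after `docker-compose up`. The following is a minimal sketch, not part of this repository, for probing them from the host. The port numbers are copied from the list above; the names `WEB_UIS`, `ui_url`, and `is_up` are hypothetical helpers introduced only for this illustration, and the script assumes the ports are published on `localhost` as shown.

```python
"""Probe the Hadoop and Spark web UIs and report which ones respond."""
from http.client import HTTPException
from urllib.error import HTTPError
from urllib.request import urlopen

# Host ports copied from the URL list in this README.
WEB_UIS = {
    "ResourceManager": 8088,
    "NameNode": 9870,
    "HistoryServer": 19888,
    "Datanode1": 9864,
    "Datanode2": 9865,
    "NodeManager1": 8042,
    "NodeManager2": 8043,
    "Spark master": 8080,
    "Spark worker1": 8081,
    "Spark worker2": 8082,
    "Spark history": 18080,
}


def ui_url(port, host="localhost"):
    """Build the web UI URL for a published port (host is an assumption)."""
    return f"http://{host}:{port}"


def is_up(url, timeout=3.0):
    """Return True if the URL answers with any HTTP response."""
    try:
        with urlopen(url, timeout=timeout):
            return True
    except HTTPError:
        # The server answered, just with an error status code.
        return True
    except (OSError, HTTPException):
        # Connection refused, timeout, DNS failure, or a malformed reply.
        return False


if __name__ == "__main__":
    for name, port in WEB_UIS.items():
        url = ui_url(port)
        status = "up" if is_up(url) else "DOWN"
        print(f"{name:16} {url:28} {status}")
```

Run it a few times after starting the cluster; services that report DOWN at first may simply still be starting.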

Hive

URI: jdbc:hive2://localhost:10000
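
Non-JDBC clients usually need the host, port, and database separately rather than the JDBC URI. The sketch below, which is an illustration and not part of this repository, splits the URI above into those parts and checks that the port accepts TCP connections. `parse_hive_jdbc_uri` and `port_is_open` are hypothetical helper names; falling back to the `default` database and port 10000 when they are omitted is an assumption based on HiveServer2's usual defaults.

```python
"""Split a HiveServer2 JDBC URI and check that its port is reachable."""
import socket
from urllib.parse import urlsplit


def parse_hive_jdbc_uri(uri):
    """Return (host, port, database) for a jdbc:hive2:// URI."""
    prefix = "jdbc:"
    if not uri.startswith(prefix + "hive2://"):
        raise ValueError(f"not a HiveServer2 JDBC URI: {uri!r}")
    parts = urlsplit(uri[len(prefix):])
    # Anything after ';' in the path is session configuration, not the database.
    database = parts.path.lstrip("/").split(";")[0] or "default"
    return parts.hostname, parts.port or 10000, database


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    host, port, database = parse_hive_jdbc_uri("jdbc:hive2://localhost:10000")
    state = "open" if port_is_open(host, port) else "closed"
    print(f"HiveServer2 at {host}:{port} (database {database}): port {state}")
```

An open port only shows that something is listening; it does not confirm that HiveServer2 has finished initializing or that authentication will succeed.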

Jupyter Notebook

URL: http://localhost:8888

example: jupyter/notebook/pyspark.ipynb

Languages

Dockerfile 46.8%, Jupyter Notebook 41.0%, Shell 10.5%, Makefile 1.7%