ZakariaMahmoud / Docker-BigData-Tools

This is Hadoop Cluster that contains the necessary tools that can be used in BigData domain, It’s collection of docker containers that you can use directly to have the maximum of tools like : Hive, Hue, Zookeeper, Kafka, Hbase, Mongo, Metabase...

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Docker Big Data Tools

ℹ️ This docker-compose file is configured to run multiple nodes.

This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly to have the maximum of tools like :

  • Hive
  • Hue
  • MySql
  • Zookeeper
  • Kafka
  • Hbase
  • Mongo
  • Metabase
  • Streamsets
  • Sqoop
  • Storm

Docker Images Used


Instalations

git clone https://gitlab.com/ZakariaMahmoud/docker-bigdata-tools.git

cd docker-bigdata-tools

sudo docker-compose up -d

⚠️ It takes some time for launch and configure all the images

Screenshots

Namenode

👁️ You can see here 3 Live Nodes**

Datanode 1

Datanode 2

Datanode 3

Hue

Username : admin Password : admin

After click in Sign In

Now you can use Hive

  • Simple Query for test
CREATE TABLE IF NOT EXISTS users(id INT, name VARCHAR(45), website VARCHAR(45));

INSERT INTO users VALUES(1,"mahmoud zakaria","www.mahmoud.ma");

  • After insert data you can execute select query.
SELECT *FROM users;

  • Hue Dashboard

kafka Manager

Cluster Overview

Hbase

Storm UI

Jupyter

Mongo Express

StreamSets

Username : admin Password : admin

Created by

About

This is Hadoop Cluster that contains the necessary tools that can be used in BigData domain, It’s collection of docker containers that you can use directly to have the maximum of tools like : Hive, Hue, Zookeeper, Kafka, Hbase, Mongo, Metabase...


Languages

Language:VBA 89.0%Language:Jupyter Notebook 9.4%Language:Shell 1.4%Language:PLSQL 0.2%