Xelawk / hadoop-hive-spark-docker

Base Docker image with just essentials: Hadoop, Hive and Spark.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Big data playground: Hadoop + Hive + Spark

Docker Build Status Docker Pulls Docker Stars

Base Docker image with just essentials: Hadoop, Hive and Spark.

Software

Usage

Take a look at this repo to see how I use it as a part of a Docker Compose cluster.

Hive JDBC port is exposed to host:

  • URI: jdbc:hive2://localhost:10000
  • Driver: org.apache.hive.jdbc.HiveDriver (org.apache.hive:hive-jdbc:3.1.2)
  • User and password: unused.

Version compatibility notes

Maintaining

TODO

  • Upgrade spark to 3.0
  • When upgraded, enable Spark-Hive integration.

About

Base Docker image with just essentials: Hadoop, Hive and Spark.

License:MIT License


Languages

Language:Dockerfile 47.2%Language:C 33.8%Language:Shell 19.0%