datafibers / Apache_Hive_Essentials

The source code for Apache Hive Essentials 2nd Edition

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Apache Hive Essentials

Overview

This is source code for Apache Hive Essentials 2nd Edition

  • The code for the 1st edition is here
  • The code is tested in hive 1.2.1 in HDP 2.6.4
  • The new features are tested against Hive 2.3.2

Quick Start

  1. git clone https://github.com/datafibers/Apache_Hive_Essentials.git
  2. cd Apache_Hive_Essentials
  3. Make a directory in HDFS, hdfs dfs -mkdir -p /tmp/hivedemo
  4. Upload the data to the HDFS folder, hdfs dfs -put data /tmp/hivedemo/

Folder Structure

  • ./data: All sample data used for the demo code
  • ./scripts: Demo code for each chapters

Environment

An optimized sandbox is also provided here to run the code in script folder.

About

The source code for Apache Hive Essentials 2nd Edition


Languages

Language:Java 97.7%Language:Python 2.3%