tertiarycourses / HiveTraining

Exercise files for Big Data Analysis with Apache Hive

Home Page:https://www.tertiarycourses.com.sg/big-data-analysis-apache-hive.html

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Big Data Analysis with Apache Hive

These are the exercise files used for Big Data Analysis with Apache Hive course.

The course outline can be found in

https://www.tertiarycourses.com.sg/big-data-analysis-apache-hive.html

https://www.tertiarycourses.com.my/big-data-analysis-with-apache-hive-malaysia.html

Module 1: Get Started on Apache Hive

  • What is Hive?
  • How Hive Works with Hadoop
  • Install CDH on VirtualBox
  • Hue 4 UI Overview

Module 2: Basic Hive Operations

  • Create and Drop Database
  • Create and Drop Table
  • Create Table from CSV File
  • Alter Table
  • Fix CSV File with Serde
  • Load Data to Empty Table
  • Partition Tables

Module 3: HiveQL

  • Retrieve Data with SELECT
  • SELECT Options
  • Operators and Built In Functions
  • Filter data with WHERE

Module 4: Aggregating Data

  • Hive Aggregations
  • Having
  • Grouping Sets
  • Cube & Rollup

Module 5: Joining Tables

  • Combining Tables with JOIN
  • EXIST & SEMI JOIN
  • Joining Multiple Tables

Module 6: Data Analysis with Apache Hive

  • Math Functions
  • String Functions
  • Date Functions
  • Conditional Statements