monkeydata918 / TopReviewedWebsite

Hadoop project to read xml file using MapReduce, Pig, Hive, Sqoop, MySQL

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This project Top_Reviewed_Website read an XML file about www.reddit.com activities.

It uses Hadoop Apache components like HDFS, MapReduce, Pig, Hive, Sqoop

Data is processed and final results are exported via Sqoop to MySQL database.

The use case and the procedure is described in the document "Project overview.doc".

Each subdirectory has one or several text files with all the commands and their 
explanation.

About

Hadoop project to read xml file using MapReduce, Pig, Hive, Sqoop, MySQL


Languages

Language:Java 100.0%