This project Top_Reviewed_Website read an XML file about www.reddit.com activities. It uses Hadoop Apache components like HDFS, MapReduce, Pig, Hive, Sqoop Data is processed and final results are exported via Sqoop to MySQL database. The use case and the procedure is described in the document "Project overview.doc". Each subdirectory has one or several text files with all the commands and their explanation.