ndhuanhuan / Hadoop-Playground

Applying Big Data and Map Reduce concepts on a sample data set.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Hadoop-Playground

Applying Big Data and Map Reduce concepts on sample data sets.

Introduction

This repository contains source code for the assignments of Udacity's course, Introduction to Hadoop and MapReduce, which was unveiled on 15th November, 2013.
This is a short course by Cloudera guys in association with Udacity. Instructors for this course are Sarah Sproehnle and Ian Wrigley, both from Cloudera and Gundega Dekena, Course Developer is from Udacity.

Course does not mandate any programming language for writing Hadoop MapReduce jobs; but they have mainly used / taught Hadoop MapReduce jobs using Python [i.e. with Hadoop Streaming approach for running jobs] during the course.

About

Applying Big Data and Map Reduce concepts on a sample data set.


Languages

Language:Python 97.1%Language:Shell 2.9%