yahwang / Learn-Big-Data-Essentials-Yandex

Coursera의 Big Data Essentials: HDFS, MapReduce and Spark RDD Course 공부한 자료 정리

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Big-Data-Essentials-Yandex

https://www.coursera.org/learn/big-data-essentials

dockerhub : https://hub.docker.com/u/bigdatateam

HDFS architecture

MapReduce Basic / Hadoop Streaming(with Python) / MapReduce Optimization(Combiner, Partitioner, Comparator)

Week3 : Solving Problems with MapReduce (practice week)

Spark architecture

Week5 : Introduction to Apache Spark (practice week)

Map(Reduce)-Side Join / Job Chaining / Data Salting

About

Coursera의 Big Data Essentials: HDFS, MapReduce and Spark RDD Course 공부한 자료 정리


Languages

Language:Jupyter Notebook 98.0%Language:Python 2.0%