pigaov10 / big-data-mapreduce-course

Big Data, MapReduce, Spark, PySpark, Java @ Santa Clara University, SPRING 2017

Course Information

Exam Dates

  • Midterm Exam: May 2017 (date to be determined), 5:45pm to 7:00pm PST
  • Final Exam: Thursday, June 15, 2017, 5:45pm to 7:45pm PST

Course Description

This class covers the following topics:

  • Concepts of Big Data
  • Distributed File Systems
  • Distributed Computing
  • Distributed and Parallel Algorithms
  • MapReduce Paradigm
  • Scale-out Architectures (using Hadoop, Spark, PySpark)
  • Apache Spark (http://spark.apache.org/)
  • MapReduce and distributed computing, taught with Spark, PySpark, Hadoop, and Java
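
To give a feel for the MapReduce paradigm listed above, here is a minimal single-machine sketch of a word count in plain Python. It is an illustration only, not taken from the course materials: the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical, and a real Hadoop or Spark job would run the same three phases across many machines.

```python
from collections import defaultdict
from functools import reduce


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, reduce(lambda a, b: a + b, values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Spark"]))
```

The map and reduce functions here are independent of each other and of the input size, which is the property that lets a framework distribute them; the shuffle step is the part a cluster framework normally handles for you.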

My latest book:

Data Algorithms: Recipes for Scaling up with Hadoop and Spark

Languages

  • HTML: 85.7%
  • Shell: 5.9%
  • Java: 5.8%
  • Batchfile: 1.6%
  • Python: 0.6%
  • XSLT: 0.3%
  • TeX: 0.2%