maheshsv / big-data-mapreduce-course

Big Data, MapReduce, Spark, PySpark, Java @ Santa Clara University, Fall 2016

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Course Information

Midterm Exam: TBDL, 2016 from 5:45pm to 7:00pm PST

Final Exam: Monday, December 5, 2016 from 5:45pm-7:45pm PST

Description

The main focus of this class is to cover the following concepts:

  • Concepts of Big Data
  • Distributed File Systems
  • Distributed Computing
  • Distributed and Parallel Algorithms
  • MapReduce Paradigm
  • Scale-out Architectures (using Hadoop, Spark, PySpark)
  • Apache Spark: http://spark.apache.org/
  • Use Spark, Py-Spark, Hadoop, and Java to teach MapReduce and distributed computing

My latest book:

Data Algorithms: Recipes for Scaling up with Hadoop and Spark

Data Algorithms Book

About

Big Data, MapReduce, Spark, PySpark, Java @ Santa Clara University, Fall 2016


Languages

Language:Shell 38.2%Language:Java 25.7%Language:HTML 18.9%Language:Batchfile 10.3%Language:Python 3.6%Language:XSLT 2.0%Language:TeX 1.3%