OndrejKucera / knowledge_spark

Notes about Apache Spark (early stage)

Knowledge - Spark

Note: These notes are not an exhaustive reference on Apache Spark.

  1. Introduction
  2. Architecture
  3. RDD
  4. Spark SQL - Structured API
  5. Join
  6. Key/Value data
  7. Testing
  8. Spark Streaming

... still in progress!

The main sources for these notes are Spark: The Definitive Guide, High Performance Spark, and the Coursera course Big Data Analysis with Scala and Spark.

License: MIT License