Distributed Machine Learning with Apache Spark
University of California, Berkeley (2016) (edX)
* Ameet Talwalkar, University of California, Los Angeles
* Jon Bates, University of California, Berkeley
-
Week #1: Intro to Machine Learning and Spark RDDs
-
Week #2: Linear Regression and Distributed Machine Learning Principles
- Lab 2: Linear Regression Lab
- Dataset: Subset of the Million Song Dataset
-
Week #3: Logistic Regression and Click-through Rate Prediction
- Lab 3: Click-Through Rate Prediction Lab
- Dataset: Display Advertising Challenge (Kaggle)
-
Week #4: Principal Component Analysis and Neuroimaging