scotthaleen / spark-saturday

Workshop for Spark and Databricks

Spark Saturday July 27.

Jumpstart on Apache Spark 2.2 with Databricks

Agenda

Morning

  • Get to know Databricks
  • Overview of Spark Fundamentals & Architecture
  • What’s New in Spark 2.x
  • Break
  • Unified APIs: SparkSessions, SQL, DataFrames, Datasets…
  • Workshop Notebook 1: SparkSession
  • Lunch

Afternoon

Instructions to Register for Free Databricks Community Edition

  • Go to http://databricks.com/try-databricks

  • Click Start Today for Community Edition.

  • Make sure you use an email address whose inbox you can access.

  • Go to GitHub: https://github.com/dmatrix/spark-saturday

  • Download DBC file: MeetupWorkshops.dbc

  • Go to your Databricks->Workspace->Users->your_account@your-email.com->Import

    • Click File option
    • Click on "Drop file here to upload or click to select."
    • Import MeetupWorkshops.dbc

You should now have a folder by that name containing all the notebooks.
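
If the import fails or the folder does not appear, one way to sanity-check the downloaded file before retrying is to list its contents locally. The sketch below assumes that a .dbc archive is a zip-compatible container (Databricks has described it as a JAR file with extra metadata); the helper name `list_dbc_entries` is hypothetical and not part of any Databricks tooling.

```python
import zipfile
from pathlib import Path


def list_dbc_entries(dbc_path):
    """Return the entry names inside a .dbc file.

    Assumption: the .dbc archive is zip-compatible. If that does not hold
    for your file, this check will reject it even when the download is fine.
    """
    path = Path(dbc_path)
    if not path.is_file():
        raise FileNotFoundError(f"No such file: {path}")
    if not zipfile.is_zipfile(path):
        # A truncated or HTML-error download typically fails this check.
        raise ValueError(f"{path} is not a zip-compatible archive")
    with zipfile.ZipFile(path) as archive:
        return archive.namelist()
```

For example, calling `list_dbc_entries("MeetupWorkshops.dbc")` from the directory where the file was saved should return a list of entry names; a `ValueError` suggests downloading the file again.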

Notebook URLs for the Labs:

Resources and APIs
