Session 22

Introduction to Big Data

Class Objective:

The goal here is to provide an overview of how data processes can be scaled with Spark.

Readings (To be done before class):

Create a DataBricks Community Edition Account
Gentle Introduction To Spark - Download ebook Review the Hadoop Ecosystem

In Class Exercises

On the DataBricks Platform you should execute both the Introduction to Apache Spark on Databricks and the Databricks for Data Scientists.

Concepts from these will be included in the final.