Loading Events

« All Events

  • This event has passed.

Spark Saturday – Jumpstart with Apache Spark 2.x on Databricks

July 29, 2017 @ 10:00 am - 4:00 pm

Jumpstart with Apache Spark 2.x on Databricks

Apache Spark 2.0 and subsequent releases of Spark 2.1 and 2.2  have laid the foundation for many new features and functionality. Its main three themes—easier, faster, and smarter—are pervasive in its unified and simplified high-level APIs for Structured data.

In this introductory part lecture and part hands-on workshop, you’ll learn how to apply some of these new APIs using Databricks Community Edition. In particular, we will cover the following areas:

Agenda:

Overview of Spark Fundamentals & Architecture

What’s new in Spark 2.x

Unified APIs: SparkSessions, SQL, DataFrames, Datasets

Introduction to DataFrames, Datasets and Spark SQL

Introduction to Structured Streaming Concepts

Four Hands On Labs

 

You will use Databricks Community Edition, which will give you unlimited free access to a ~6 GB Spark 2.x local mode cluster. And in the process, you will learn how to create a cluster, navigate in Databricks, explore a couple of datasets, perform transformations and ETL, save your data as tables and parquet files, read from these sources, and analyze datasets using DataFrames/Datasets API and Spark SQL.

Level: Beginner to intermediate,  not for advanced Spark users.

Prerequisite: You will need a laptop with Chrome or Firefox browser installed with at least 8 GB. Introductory or basic knowledge Scala or Python is required, since the Notebooks will be in Scala; Python is optional.

Bio:

Jules S. Damji is an Apache Spark Community Evangelist with Databricks. He is a hands-on developer with over 15 years of experience and has worked at leading companies, such as Sun Microsystems, Netscape, LoudCloud/Opsware, VeriSign, Scalix, and ProQuest, building large-scale distributed systems. Before joining Databricks, he was a Developer Advocate at Hortonworks.

Spark Saturday – Jumpstart with Apache Spark 2.x on Databricks

Details

Date:
July 29, 2017
Time:
10:00 am - 4:00 pm
Event Categories:
,
Event Tags:
Website:
https://www.meetup.com/datariders/events/237836039/

Venue

Hacker Dojo
3350 Thomas Road
Santa Clara, CA United States
+ Google Map