SparkHub Apache® Spark™ Developer Resources

Below are Apache Spark developer resources, including training, publications, and third-party packages.

Massive Online Courses

Visit the Databricks training page for a list of available courses.

Introduction to Apache Spark

Learn the fundamentals and architecture of Apache Spark, the leading cluster-computing framework among professionals.

Starts on June 15, 2016

Distributed Machine Learning with Apache Spark

Learn the underlying principles required to develop scalable machine learning pipelines and gain hands-on experience using Apache Spark.

Starts on July 6, 2016

Apache Spark Publications

Learning Spark

Pick up a copy of Learning Spark for a comprehensive introduction to the Apache Spark ecosystem directly from the project founders.

Introduction to Apache Spark

An introduction to Apache Spark packaged as a video plus coding exercises: the essentials to get started running Spark apps.

Advanced Analytics with Spark

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark.

Fast Data Processing with Spark (Second Edition)

Perform real-time analytics using Spark in a fast, distributed, and scalable way.

Machine Learning with Spark

Create machine learning systems that can scale to tackle even the largest data sets with ease and get real insights for your business with Apache Spark.

Apache Spark Reference Applications

Reference applications demonstrating Apache Spark, brought to you by Databricks.

Spark GraphX in Action

Spark GraphX in Action starts out with an overview of Apache Spark and the GraphX graph processing API. This example-based tutorial then teaches you how to configure GraphX and use GraphX interactively.

Spark in Action

Spark in Action teaches you to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem followed by a taste of Spark's command line interface.

Introducción a Apache Spark

The first Spanish book that presents a comprehensive introduction to the Apache Spark ecosystem written by researchers working with Spark in Barcelona.

Apache Spark Graph Processing

Build, process and analyze large-scale graph data effectively with Spark.

New Apache Spark Packages

Third-party packages that integrate with Apache Spark

Approximate nearest neighbors search using Hierarchical Navigable Small World graphs
An implementation of the Bayesian additive regression trees (BART) model atop Apache Spark
A robot-powered training repository
Rumble: JSONiq for Apache Spark
Relational query engine that unites SparkSQL and GORpipe into a single declarative query framework.
