Building a Production Data Lake in the Cloud using Apache Spark

Summary (Starts 6:30PM)  Apache Spark is a distributed processing engine that enterprises can use for large-scale data migration.  Key concepts will be discussed from a recent production-level ETL effort (extract, transform, load), which used Java Spark connectors to convert millions

Basics of Apache Spark with Scala

For September meetup we will cover the basics of Apache Spark with Scala. Senthil will discuss following • Introduction to scala • Scala syntax and functions • RDD transformations and actions • Dataframes and sql operations This meetup is going to be explicitly

Spark Saturday #2 in DC

Capital One Conference Center, McLean, VA *** REGISTRATION FOR THIS EVENT MUST BE COMPLETED HERE AND NOT VIA MEETUP: https://goo.gl/9r663d *** Join us for the second annual Spark Saturday Event hosted by Capital One, MetiStream, and Databricks in partnership with the Washington DC Apache Spark Interactive and Bay

Powering Qualtrics data platform with Apache Spark, Kafka, and AWS

Speaker: Peter Brown-Hayes Experience Management (XM) extends beyond just knowing your customers. It embodies the idea that you know your customers so well that you can create, deliver, and tailor personalized experiences that push the boundaries of customer loyalty. We

EXPLORING WIKIPEDIA WITH APACHE SPARK in Istanbul

Merhaba, Uzun zamandır ertelediğimiz bir etkinliği yapmanın vakti geldi. Scala ve Sql üzerine olacaktır. Kendi makinelerimiz yerine Databricks in community versiyonunu kullanacağız. Bu sebeple gelmek isteyen kişiler bir iki gün önceden kayıt olurlarsa çok iyi olur. Aşağıdaki linkten Databricks e

Bay Area Apache Spark Meetup @ HPE/Aruba Networks in Santa Clara

oin us for an evening of Bay Area Apache Spark Meetup featuring tech-talks about using Apache Spark at scale from Hewlett Packard Enterprise (HPE)/Aruba Networks and Databricks. Thanks to HPE/Aruba Networks for hosting and sponsoring this meetup. Agenda: 6:00 – 6:30 pm  Mingling &

Apache Spark Structured Streaming : Introduction and Internals

Structured Streaming is a new stream processing engine built on Spark SQL, which enables developers to express queries using powerful high-level APIs including DataFrames, Dataset and SQL. In this meetup, we’ll walk through the basics of Structured Streaming, its programming