Top Banner
Making big Data Simple with Spark Ion Stoica and Ali Ghodsi June 15, 2015
14

Spark Summit 2015 keynote: Making Big Data Simple with Spark

Aug 06, 2015

Download

Software

Databricks
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Spark Summit 2015 keynote: Making Big Data Simple with Spark

Making big Data Simple with Spark

Ion Stoica and Ali Ghodsi June 15, 2015

Page 2: Spark Summit 2015 keynote: Making Big Data Simple with Spark

More than 5,000 people trained over past year

2

Alleviating Data Scientist Scarcity Challenge

“Intro to Big Data with Apache Spark” •  Anthony Joseph, UC Berkeley •  Started June 1st

“Scalable Machine Learning”

•  Ameet Talwalkar, UCLA •  To start July 5th

Page 3: Spark Summit 2015 keynote: Making Big Data Simple with Spark

More than 5,000 people trained over past year

3

Alleviating Data Scientist Scarcity Challenge

“Intro to Big Data with Apache Spark” •  Anthony Joseph, UC Berkeley •  Started June 1st, over 64K registered students

“Scalable Machine Learning”

•  Ameet Talwalkar, UCLA •  To start July 5th, over 26K registered students

Page 4: Spark Summit 2015 keynote: Making Big Data Simple with Spark

4

…  

Spark Core Python, Java, Scala, R

Spark Streaming real-time

Spark SQL interactive

MLlib machine learning

GraphX graph

a  

Fast • Expressive • General

Spark Significantly Simplifies Big Data Processing

Page 5: Spark Summit 2015 keynote: Making Big Data Simple with Spark

5

Still need to set up and manage your own Spark cluster

Still more complex to operate than existing single node tools (R, Python)

But Big Data Processing Remains Complex...

Page 6: Spark Summit 2015 keynote: Making Big Data Simple with Spark

Databricks Truly Makes Big Data Simple A hosted end-to-end platform from ingest to production

6

Cluster Manager

Jobs Notebooks Third-Party Apps Dashboards

Page 7: Spark Summit 2015 keynote: Making Big Data Simple with Spark

June 2014: Unveiling •  Over 3,500 sign ups

November 2014: Limited Availability

Today •  Over 150 organizations using Databricks

Databricks: The Journey Thus Far

7

Page 8: Spark Summit 2015 keynote: Making Big Data Simple with Spark

Better products Update customers’ databases weekly instead of monthly

What can Databricks and Spark do for organizations?

8

Faster time to market Create new products in 3 weeks rather than 2 months

Democratize data access within enterprises Increase number of data analysts by 4x and number of data projects by 6x

Page 9: Spark Summit 2015 keynote: Making Big Data Simple with Spark

9

General Availability starting today!

www.databricks.com

Page 10: Spark Summit 2015 keynote: Making Big Data Simple with Spark

Ease of use Increase user productivity

10

Key Areas of Focus

1

2

Integration with existing (small and big) data tools Make non-Spark experts instantly productive

3

Security Enable mission-critical applications

Page 11: Spark Summit 2015 keynote: Making Big Data Simple with Spark

11

Cluster manager with multiple Spark versions

From notebooks to dashboards and jobs with just a few clicks

Lunch and monitor jobs, including streaming

Ease of Use

Notebooks

Dashboards

Jobs

Page 12: Spark Summit 2015 keynote: Making Big Data Simple with Spark

12

Best-of-breed apps Versioning R Notebooks

Integration

+

Page 13: Spark Summit 2015 keynote: Making Big Data Simple with Spark

13

Run in your own Amazon account

Access Control Lists

Security

Encryption at rest

Page 14: Spark Summit 2015 keynote: Making Big Data Simple with Spark

14

Demo