Top Banner
Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 • San Francisco, CA From Spark to Ignition:
33
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Fueling Your Business on Real-Time Analytics

Eric Frenkiel, MemSQL CEO June 29, 2015 • San Francisco, CA

From Spark to Ignition:

Page 2: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

What’s in Store For This Presentation?

1.  MemSQL: A real-time database for transactions and analytics

2.  Spark Use Cases

3.  Example: Geospatial Enhancements

Page 3: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

The real-time database for transactions and analytics

MemSQL Story

Page 4: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemSQL at a Glance !  Experienced leadership from Facebook,

SQL Server, Oracle, Fusion-io

!  In-Memory, distributed, relational database

!  Solving the Enterprise Architecture Gap

!  Horizontal scale-out with modern database innovation

!  $50 million in funding

Page 5: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Four Ways Your DBMS is Holding You Back

! ETL (Extract, Transform, Load)

! Analytic Latency

! Synchronization

! Copies of data

Source: Gartner Hybrid/Transactional/Analytical Processing Will Foster Opportunities for Dramatic Business Innovation

Page 6: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

6

The Real-Time Database for Transactions and Analytics

In-Memory Distributed Relational

Data Center Software Cloud

Page 7: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

The Real-Time Database for Transactions and Analytics

Highest Value Hot Data

High Value Cold Data

Analytics

Tran

sact

ions

Data Loading and Queries

Aggregator Nodes

Availability Group 1

Availability Group 2

Cluster

Page 8: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Gartner Identifies Emerging Category: HTAP (Hybrid Transactional/Analytical Processing)

“HTAP will enable business leaders to perform…much more advanced and sophisticated real-time analysis of their business data than with traditional architectures.”

Download at: memsql.com/gartner

Page 9: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Simple

!  Standard SQL !  Transactions and analytics in one database !  Behind the firewall or on the cloud !  Flexible integrations (Hadoop, Spark, SQL)

Page 10: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Fast

!  Extremely low-latency queries !  Massive parallel transaction capacity !  Lock-free, shared-nothing architecture

Page 11: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Scalable

!  Scales out on cloud and commodity hardware !  Deploys to thousands of machines !  True linear scaling

Page 12: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemSQL Product Ecosystem In-Memory Applications Transactions and Analytics

Dashboards

Hadoop Amazon S3

ODBC, JDBC, .NET Connectors MemSQL Loader

Advanced Analytics

Wire-protocol compatibility

Databases and Data Warehouses

Streaming

Page 13: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Spark Use Cases

Page 14: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Spark Data Processing Framework

Intuitive, concise, and expressive operations needed for analytics

Spark SQL

Spark Streaming

Mllib (machine learning)

GraphX (graph)

Apache Spark

Page 15: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Cluster-wide Parallelization | Bi-Directional

Understanding MemSQL and Spark

Page 16: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Spark with MemSQL

MemSQL Spark Connector enables the real-time trinity

Message Queue Transformation Data Serving

Programming libraries Persistence Application platform

End-to-End Data Pipeline Under One Second

Page 17: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemSQL and Spark Use Cases

!  Operationalize models built in Spark

!  Stream and event processing

!  Live dashboards and automated reports

!  Extend MemSQL analytics

Page 18: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Operationalize Models Built in Spark

!  Process in Spark, persist to MemSQL !  Go to production and iterate faster

Enterprise Consumption Data into Spark

Model Creation Model Persistence

Page 19: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Stream and Event Processing

!  Structure event data on the fly !  Pass to MemSQL for persistent, queryable format

Enterprise Consumption

Real-time Streaming Data

Data Transformation Persistent, Queryable Format

Page 20: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Real-Time Analytics at Pinterest

!  Higher performance event logging !  Reliable log transport and storage !  Faster query execution on real-time data

Enterprise Consumption

Message Queue

Kafka App

Singer Secor

Spark, MemSQL, Application

Transform RT Analytics, Data Serve

50,000 pins/sec

Page 21: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition
Page 22: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Live Dashboards and Automated Reports

!  Serve live dashboards from MemSQL !  Run custom reports on live data with Spark

Live Dashboards

Custom Reporting

Access to Live Production Data

SQL Transactions and Analytics

Page 23: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Extend MemSQL Analytics

!  The freshest data for analysis in Spark !  Load from MemSQL to Spark and write results on return

Access to Live Production Data Real-time Replica

Applications, Data Streams

Interactive Analytics,

Machine Learning

Page 24: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemCity !  Capturing energy consumption data from 1.4 million households !  8 devices per household !  186,000 events per minute !  AWS hardware costs at $2.35 per hour

Page 25: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition
Page 26: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Geospatial Enhancements

Page 27: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Geospatial Challenge

! Commercial applications now geo-enabled ! Location is everywhere ! Lots of insight possible ! Traditionally geo is processed separately ! Real need for integrated geospatial at scale

Page 28: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemSQL Geospatial

! Points, Lines, and Polygons

! Topological filters

! Measurement functions

Page 29: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

MemSQL Geospatial

! BILLIONS of objects

! Sub-second latency

! Geo data is first-class citizen

! Geo + Simplicity + Speed + Scale

Page 30: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Real-Time Geospatial Location Intelligence

!  Sample from 170 million taxi trips !  Real-time ingest !  Concurrent queries in fractions of a second !  Unlimited number of geographic views !  Simple queries while simultaneously ingesting data

Page 31: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition
Page 32: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

A database so scalable that everyone can use it.

UNLIMITED scale and capacity Free FOREVER

MemSQL 4 Community Edition

Page 33: IMCSummit 2015 - Day 1 IT Business Track - From Spark to Ignition

Thank You!

Visit the MemSQL Booth #4

MemCity Showcase Giveaways Games

*