Top Banner
Bleeding Edge Databases @LynnLangit
38

0bbleedingedge long-140614012258-phpapp02 lynn-langit

Dec 04, 2014

Download

Technology

Big Data Camp LA 2014, Bleeding Edge Databases - Aerospike, Algebraix & Google Big Query by Lynn Langit - Keynote & Session
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Bleeding Edge Databases

@LynnLang i t

Page 2: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Unstructured Data

Page 3: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 4: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Live Tweets on a Building

Page 5: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 6: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

What is Aerospike?

Real-time NoSQL• Flash Optimized• In-memory• Exponentially

Scalable

Super Fast• 1M TPS on one

server (reads)• 40K TPS on one

server (writes)

More• ACID

compliance• Tunable

Consistency

Page 7: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Benchmark Results• 200,000 tps (read-write) & 300,000 tps (read-heavy)• 10X Faster for R/W loads on SSDs

Page 8: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

DEMO

Page 9: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

More Benchmark Results

Config• 10G network• Aerospike 3• Same hardware• 4-node CentOS

Data• 500GB• 50M recordsEach Record • 100 bytes• 23 byte key• 10 fields

Page 10: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Aerospike Architecture

Page 11: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Example Architecture

Page 12: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 13: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 14: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

How to try it out

• Bare metal or pick a Cloud, set up a VM• Get the free community edition• Go…

Page 15: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Linked Open Data Cloud

Page 16: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 17: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

What is Algebraix Data?

IoT – Semantic Web

Super Powerful 1 Billion

Triples on 1 Node

Native Mathematical

Engine

Triple store RDF (Graph)

Page 18: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

SPARQL Server™

W3C & OGC compliant RDF / SPARQL Semantic Database

Natively built with proprietary Math• Algebraix technology (and patents)

Runs on commodity hardware• In the cloud (or on premise)• Scales Up and Down

Significantly better benchmark performance• over leading RDF databases

Page 19: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Benchmark Results

• SP2Bench SPARQL Performance Benchmark

Page 20: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

SP^2 Benchmark Visualized

Page 21: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

DEMO

Page 22: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

It’s the Math…

Page 23: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Patents

Page 24: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Runs on common hardware

• Any Cloud or• On Rremises

High Performance & Capacity

• Needs no indexes

• Works particularly well w/sparse data

Self-tuning

• Retains results & intermediate sets

• Supports point-in-time queries

SPARQL Server™

Page 25: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Algebraix Solution Stack

Data Algebra

DatabaseNoSQL Relational

RDF Semantic

ApplicationsMeaning

Organization

Optimization& Execution

Conceptual

Data Loaders Query Translators

• Modern abstract algebra• Zermelo-Fraenkel set theory

• Mathematics-based data management platform• Universal data language• Collection of I.P.

• SPARQL Server – RDF• A2DB - Relational

• Search• Analytics• Business Intelligence• Data Integration

Algebraix Platform

Page 26: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

How to try it out

• Sign up on their website• Try out when notified (this July)

Page 27: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 28: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 29: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

What is Google Big Query?

QaaS – interactive

RESTful web service

SQL-like language

Queries data stored in Google

cloud

Wide Column Tables

Uses OAuth for

access control

Very Fast 750M

Rows in <10 secs

Page 30: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Easy & Fast

• Text or Json• Up to 100k inserts/sec (streaming)

Load it

• Supports core SQL query concepts• SELECT, FROM, JOIN, WHERE, ORDER BY, GROUP BY • Windowing functions (OVER / PARTITION)• Common Aggregates (SUM, COUNT, MAX)

• Includes ‘analytic’ SQL• STDDEV, VARIANCE, CORRELATION• REGEXP_MATCH

Query it

• Query is $ 5 per TB processed• Storage is around $30 TB per month

Pay (for) it

Page 31: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Benchmark Results

• TCP-H Benchmark

Page 32: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

DEMO

Page 33: 0bbleedingedge long-140614012258-phpapp02 lynn-langit
Page 34: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Partners and BigQuery

Google Sheets Tableau QlikView

Bime Excel

Page 35: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

How to try it out

• Set up a Google Cloud account• Upload or stream data• Query

Page 36: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Google Cloud Starter Pack

Use code“gde-in”

Page 37: 0bbleedingedge long-140614012258-phpapp02 lynn-langit

Next steps

Try them out

@LynnLang i t

Page 38: 0bbleedingedge long-140614012258-phpapp02 lynn-langit