Netflix's Big Leap from Oracle to Cassandra


Transcript

NETFLIX’S BIG LEAP FROM ORACLE TO C*

ROOPA TANGIRALA, Engineering Manager

Netflix

WHO AM I?

Engineering Manager @ Netflix
Twitter - @roopatangirala
Email - roopa.tangirala@gmail.com
LinkedIn - https://www.linkedin.com/pub/roopa-tangirala/3/960/2b

OVERVIEW

• Brief History

• Set up

• Migration Strategy

• Migration Challenges

• Examples of real use cases

• Lessons learnt

1997

DATACENTER

BACKEND

ORACLE DATA MODEL LIMITATIONS

NO HORIZONTAL SCALING

LICENSE COST

EVERY TWO WEEKS!!

2008

MOVE TO CLOUD

REQUIREMENTS

• HIGHLY AVAILABLE

• MULTI DATACENTER SUPPORT

• PREDICTABLE PERFORMANCE AT SCALE

WHY C* ?

• Massively scalable architecture

• Multi-datacenter, multi-directional replication

• Linear scale performance

• Transparent fault detection and recovery

• Flexible, dynamic schema data

• Guaranteed data safety

• Tunable data consistency

BACKEND NOW

MICRO SERVICES

• Horizontal, Homogeneous, Commoditized

DOWNTIME

ALMOST DAILY PUSHES

ACTIVE ACTIVE

GLOBAL PRESENCE

MIGRATION STRATEGY

BABY STEPS

NEW FEATURES FIRST TO CLOUD

DATA MODEL REVIEW

ORACLE → CASSANDRA

• DB/Schema → keyspace

• Table → column family

• Row column (name, value) → Row column (name, value, timestamp)
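To make the mapping concrete, here is a minimal sketch using the Python cassandra-driver (the catalog keyspace, customer table, and replication settings are illustrative assumptions, not Netflix's actual schema):

    from cassandra.cluster import Cluster

    session = Cluster(['127.0.0.1']).connect()

    # Keyspace ~ Oracle DB/schema; it carries the replication settings
    session.execute("""
        CREATE KEYSPACE IF NOT EXISTS catalog
        WITH replication = {'class': 'NetworkTopologyStrategy', 'us-east': 3}
    """)

    # Column family (table) ~ Oracle table; every cell written also gets
    # a timestamp, which Cassandra uses for conflict resolution
    session.execute("""
        CREATE TABLE IF NOT EXISTS catalog.customer (
            id   uuid PRIMARY KEY,
            name text,
            zip  text,
            sex  text
        )
    """)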

SCHEMALESS DESIGN

• Row-oriented

• The number of columns and the column names can differ from row to row

Row xyz: name = Paul, zip = 95123

Row abc: name = Adam, zip = 94538, sex = Male

Row cde: name = XYZ
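A short sketch of the same idea with the Python driver (reusing the hypothetical catalog.customer table from above): each INSERT names only the columns that particular row actually has, so rows in one column family can carry different columns.

    from uuid import uuid4

    # Row with name and zip only
    session.execute(
        "INSERT INTO catalog.customer (id, name, zip) VALUES (%s, %s, %s)",
        (uuid4(), 'Paul', '95123'))

    # Row with name, zip and sex
    session.execute(
        "INSERT INTO catalog.customer (id, name, zip, sex) VALUES (%s, %s, %s, %s)",
        (uuid4(), 'Adam', '94538', 'Male'))

    # Row with only a name
    session.execute(
        "INSERT INTO catalog.customer (id, name) VALUES (%s, %s)",
        (uuid4(), 'XYZ'))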

UNDERSTAND WRITE PATH

Write path: client → commit log (disk) + memtable (memory); memtables are flushed to SSTables on disk.

UNDERSTAND READ PATH

Read path: client → memtable + SSTables, with the row cache / key cache consulted to avoid disk reads.

LOGIC IN APPLICATION

• Stored procedures

• Functions

• Triggers

• Referential integrity constraints
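Because C* offers none of these, the equivalent logic has to live in the service code. A hedged sketch (the catalog.rating table and this helper are hypothetical) of a referential-integrity check done at the application layer:

    def add_rating(session, customer_id, movie_id, stars):
        # Application-level "foreign key" check: Cassandra will not enforce it
        row = session.execute(
            "SELECT id FROM catalog.customer WHERE id = %s", (customer_id,)).one()
        if row is None:
            raise ValueError("unknown customer: %s" % customer_id)

        # Any trigger-like side effects also happen here, in code
        session.execute(
            "INSERT INTO catalog.rating (customer_id, movie_id, stars) "
            "VALUES (%s, %s, %s)",
            (customer_id, movie_id, stars))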

(Diagram: migration setup. The app performs dual writes, and reads, against Oracle in the datacenter and C* in AWS; a forklift job copies the existing dataset into C*, and a consistency checker validates that the two stores agree.)

MAIN APPROACH

• DUAL WRITES

• FORKLIFT OLD DATASET

• CONSISTENCY CHECKER

• MIGRATE READS TO C* FIRST

• SWITCH OFF WRITES TO DC
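A hedged sketch of the dual-write / shadow-read pattern listed above (oracle_dao, cassandra_dao and log_mismatch are hypothetical stand-ins for the real data-access layers; Oracle remains the source of truth until the reads are switched over):

    def write(key, value):
        # Dual writes: every mutation goes to both stores
        oracle_dao.put(key, value)
        cassandra_dao.put(key, value)

    def read(key):
        primary = oracle_dao.get(key)
        # Shadow read against C*; mismatches are logged by the
        # consistency checker, never served to the caller
        shadow = cassandra_dao.get(key)
        if shadow != primary:
            log_mismatch(key, primary, shadow)
        return primary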

Relationships – Better in App Layer

ORACLE SEQUENCE

• USE UUID for Unique keys

• Distributed Sequence Generator for Ordering

• C* counters – not so much
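A small sketch of what replaces an Oracle sequence in practice: uuid4 gives globally unique keys with no coordination, and a time-based uuid1 (CQL's timeuuid) gives keys that also sort roughly by creation time.

    import uuid

    surrogate_key = uuid.uuid4()   # unique, no ordering, no central sequence needed
    ordered_key   = uuid.uuid1()   # time-based UUID, maps to CQL's timeuuid type

    session.execute(
        "INSERT INTO catalog.customer (id, name) VALUES (%s, %s)",
        (surrogate_key, 'Paul'))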

Heavy transactional use cases: keep them in an RDBMS

CHALLENGES

SECURITY

DENORMALIZE, DENORMALIZE, DENORMALIZE

Roman Riding

Model Around Queries

Engineering Effort

Know limitations

SOURCE OF TRUTH

REAL EXAMPLES

API

• High concurrency

• Range scans

• ~1MB of data

• Caused heap issues at Cassandra level

• Very high read latency

Range scan replaced with inverted index

(Diagram: a scripts column family keyed by endpoint path, e.g. /odp/{ui}/pathEvaluator, /xbox/{ui}/pathEvaluator, /tvui/{ui}/pathEvaluator, each row holding active script text data. A separate inverted-index row, active_Scripts_idx, lists the row keys of the currently active scripts; the client reads the index first (step 1), then fetches only the matching script rows (step 2).)

Inverted Index considerations

• Column name can be used as a row key placeholder

• Hotspots!!

• Sharding
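A hedged sketch of a sharded inverted index (the catalog.script_index table and the shard count are assumptions): salting the index row key across a fixed number of shards avoids a single hot partition, and a lookup fans out over the shards before fetching the matching script rows.

    import zlib

    NUM_SHARDS = 8  # assumption: spread the index over 8 partitions

    def index_key(script_key):
        # Stable hash so a given script always lands in the same shard
        return "active_scripts_idx_%d" % (zlib.crc32(script_key.encode()) % NUM_SHARDS)

    def mark_active(script_key):
        session.execute(
            "INSERT INTO catalog.script_index (idx, script_key) VALUES (%s, %s)",
            (index_key(script_key), script_key))

    def active_script_keys():
        # Step 1: read the (sharded) index rows
        keys = []
        for shard in range(NUM_SHARDS):
            rows = session.execute(
                "SELECT script_key FROM catalog.script_index WHERE idx = %s",
                ("active_scripts_idx_%d" % shard,))
            keys.extend(r.script_key for r in rows)
        # Step 2: callers then fetch only these rows from the scripts table
        return keys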

VIEWING HISTORY

Growth of Viewing History


Problem

Growth Pattern: “Hockey-stick”

Retention Policy: Retain forever

Access Pattern: Retrieve all for a customer

Scaling and performance challenges as the data grows


Goals

Small Storage Footprint

Consistent Read/Write Performance

Infinite Scalability


Old Data Model


Old Data Model - Pros

Simple Wide Rows

No additional processing

Fast Write

Fast Read for Majority


Old Data Model - Cons

Read latency increases as the number of viewing records grows

Cassandra internals:

• Number of SSTables

• Compaction

• Read Repair

Lesson learned: Wide Row Limitation


New Data Model

Split viewing history into two column families:

1. Recent

• Small records with frequent updates

• Cassandra tuning: compaction, read repair, etc.

2. Compressed

• Large historical records with rare updates

• Rollup

• Compression

• Cassandra: rare compaction, no read repair
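A hedged sketch of the rollup step (the history.recent and history.compressed tables and the one-blob-per-customer layout are assumptions, not the exact Netflix schema): older records are read from the recent column family, compressed into a single blob in the compressed column family, and then removed from recent.

    import json
    import zlib

    def rollup(session, customer_id):
        rows = session.execute(
            "SELECT movie_id, watched_at FROM history.recent WHERE customer_id = %s",
            (customer_id,))
        records = [{'movie_id': r.movie_id, 'watched_at': str(r.watched_at)}
                   for r in rows]
        if not records:
            return

        # Roll up and compress the history into one blob value
        # (a real job would merge with any blob already stored)
        blob = zlib.compress(json.dumps(records).encode('utf-8'))
        session.execute(
            "INSERT INTO history.compressed (customer_id, data) VALUES (%s, %s)",
            (customer_id, blob))

        # The recent column family now holds only small, frequently updated rows
        session.execute(
            "DELETE FROM history.recent WHERE customer_id = %s", (customer_id,))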


New Data Model cont’d


RESULTS


Think Data Archival

Data stores grow exponentially

Have a process in place to archive data

• Moving to a separate column family

• Moving to a separate cluster (non SSD)

• Setting the right expectations w.r.t. latencies for historical data

Cassandra TTLs
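A small sketch of archiving with a TTL (the table and the 90-day retention are illustrative assumptions): rows written USING TTL expire on their own, so detailed records never need an explicit purge job.

    # Prepared once; the '?' markers and the TTL are bound per write
    record_view = session.prepare(
        "INSERT INTO history.recent (customer_id, movie_id, watched_at) "
        "VALUES (?, ?, toTimestamp(now())) USING TTL ?")

    NINETY_DAYS = 90 * 24 * 3600
    customer_id, movie_id = 42, 12345   # example values
    session.execute(record_view, (customer_id, movie_id, NINETY_DAYS))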

Cinematch Rating Service

• First Model

• Second Model

(Diagram: the two models side by side. One stores all ratings for a movie_id, e.g. 12345 or 4355, as a single BLOB; the other stores one column per customer_id holding that customer's rating.)

WHAT WORKED?

(Diagram: example rows for movie_ids 12345 and 4355 comparing individual name/value rating columns with BLOB storage.)

BLOB vs COLUMN/VALUE

Try the column/value approach first and check whether it satisfies your average/95th/99th-percentile latency targets.

Column/value pros:

• Write payload can be smaller

• Query by specific columns

• Read path does not require reading the entire row

BLOB considerations:

• Read percentage is very high (90s)

• Read latencies are very important

• All the data is read most of the time
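A hedged sketch of the two layouts for the rating data (keyspace and table names are illustrative only), to make the trade-off concrete:

    # Column/value layout: one entry per customer rating
    # (a column per rating under the hood). Smaller write payloads,
    # and reads can target specific customers.
    session.execute("""
        CREATE TABLE IF NOT EXISTS ratings.by_movie (
            movie_id    bigint,
            customer_id bigint,
            rating      int,
            PRIMARY KEY (movie_id, customer_id)
        )
    """)

    # BLOB layout: all ratings for a movie serialized into one value.
    # Wins when reads dominate and the whole row is read every time.
    session.execute("""
        CREATE TABLE IF NOT EXISTS ratings.by_movie_blob (
            movie_id bigint PRIMARY KEY,
            ratings  blob
        )
    """)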

LESSONS LEARNT

• Get the model right

• Baby Steps

• Huge Engineering effort

• Think Compression

• Performance test

• DB just for Data

WE ARE HIRING !! – CHECK OUT JOBS.NETFLIX.COM
