Velox: Models in Action

VELOX:MODELS IN ACTION

Presented by Dan Crankshaw [email protected]

Henry Milner, Joseph Gonzalez, Peter Bailis, Haoyuan Li, Tomer Kaftan,Zhao Zhang, Ali Ghodsi, Michael Franklin, Michael Jordan, and Ion Stoica

https://amplab.cs.berkeley.edu/projects/velox/


Data

ModelPredictionsPredict

Train

Observe

Well Studied

MODELS AT REST

Data

ModelPredictionsServing

TrainingFeedb

ack

OpenChallenges

Data


TrainingFeedb

ack

OpenChallenges

Velox Model Management System

Catify: Music for Cats

Node.js App Server

Apache Web Server

MongoDB


MODELING TASK

Rating

Songs

MODELING TASK

Ratings

Songs

Prediction

Data


TrainingFeedb

ack


Tachyon + HDFS

Pipeline

CatID Song Score

1 16 2.1

1 14 3.7

3 273 4.2

4 14 1.9


Tachyon + HDFS

Pipeline

CatID Song Score

1 16 2.1

1 14 3.7

3 273 4.2

4 14 1.9


Tachyon + HDFS

Pipeline

CatID Song Score

1 16 2.1

1 14 3.7

3 273 4.2

4 14 1.9

Pipeline

Tachyon + HDFS

Node.js App Server

Apache Web Server

MongoDB


Data


TrainingFeedb

ack

Pipeline

Tachyon + HDFS

Node.js App Server

Apache Web Server

MongoDB


Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Materialize all predictions

Pipeline



SongsO(users + songs)

Users

Songs

Users

O(users * songs)


Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB


Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Training Data


Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Training Data

New Model


What’s wrong?

1. Built from scratch for each application

What’s wrong?


2. Different systems

What’s wrong?


2. Different systems3. Space inefficient

What’s wrong?


2. Different systems3. Space inefficient4. Stale predictions

What’s wrong?


2. Different systems3. Space inefficient4. Stale predictions5. The T-Swift effect Sample Bias

What’s wrong?

Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Training Data

New Model


Pipeline

Tachyon + HDFS

Web Application Velox

The Missing Piece

Data


TrainingFeedb

ack

Tachyon + HDFS

Velox

The Missing Piece

Prediction Service

Model Manager

Web Application

Pipeline

BENEFITS

BENEFITS1. Low-latency and scalable

predictions as a service


predictions as a service2. Integrated approach leads to

fresher, better predictions



fresher, better predictions3. Easy translation to production

predictions



fresher, better predictions3. Easy translation to production

predictions4. Eases operational pain

PERSONALIZED MODELING


wu · f(x; ✓)Rating =


Shared BasisFeature Models




PersonalizedUser Model





wu · f(x; ✓)

Change slowly

Rating =




wu · f(x; ✓)

Change slowlyHighly dynamic

Rating =


Data


TrainingFeedb

ack

VELOX

Pipeline

Tachyon + HDFS

VeloxPrediction Service

Model Manager

Web Application

Predictions as a service

VELOX

Pipeline

Tachyon + HDFS

VeloxPrediction Service

Model Manager

Web Application

Predictions as a service

PREDICTION API

GET /velox/catify/predict_top_k?userid=22&k=100

GET /velox/catify/predict?userid=22&song=27632

PREDICTION API



PREDICTION API



PREDICTIONS

def predict( u: UUID, x: Context )

wu · f(x; ✓)

Look up user weight

PREDICTIONS


wu · f(x; ✓)

Compute Features

Look up user weight

PREDICTIONS


wu · f(x; ✓)

LOW-LATENCY PREDICTIONS

Velox

Tachyon

Partition 0

Velox

Tachyon

Partition 1

Velox

Tachyon

Partition 2

Partition users

Compute Features

Look up user weight

PREDICTIONS


wu · f(x; ✓)


Velox

Tachyon

Feature Cache


Velox

Tachyon

Feature Cache

Features shared between users

Data


TrainingFeedb

ack

Data


TrainingFeedb

ack

Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB


Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Training Data


SIMPLE EXPLORATION

Rating

Songs

Prediction

SIMPLE EXPLORATION

Rating

Songs

Prediction

Epsilon-greedy

SIMPLE EXPLORATION

Rating

Songs

Prediction

Epsilon-greedy

ACTIVE LEARNING

Rating

Songs

Prediction

ACTIVE LEARNING: LinUCB

Rating

Songs

Prediction

Uncertainty

Li, L., Chu, W., Langford, J., & Schapire, R. E. (2010). A contextual-bandit approach to personalized news article recommendation. WWW '10: Proceedings of the 19th international conference on World wide web, New York, New York, USA: ACM. doi:10.1145/1772690.1772758


Rating

Songs

Prediction

Look at upper confidence bound

Uncertainty



Rating

Songs

Prediction

Look at upper confidence bound

Uncertainty


Data


TrainingFeedb

ack

Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Velox


Prediction Service

Model Manager

Data


TrainingFeedb

ackMgmt.

Data


TrainingFeedb

ackMgmt.

RealtimeLearning

Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Training Data

New Model




USER-FACING API



USER-FACING API

POST /velox/catify/observe?userid=22&song=27632?score=3.7

ONLINE UPDATES

def observe(u: UUID, x: Context, y: Score)

wu · f(x; ✓)

Update wu with new training point

ONLINE UPDATES


wu · f(x; ✓)

Basis functions stay fixed

Update wu with new training point

ONLINE UPDATES


wu · f(x; ✓)

Data


TrainingFeedb

ackMgmt.

RealtimeLearning

Data


TrainingFeedb

ackMgmt.

RealtimeLearning + Offline Retraining

Pipeline

Tachyon + HDFS

Node.js App Server

NGINX

MongoDB

Velox


Prediction Service

Model Manager

Data


Feedb

ack

Velox Model Management System

Spark

The future of research in scalable learning systems will be in the integration of the learning lifecycle:

Data


TrainingFeedb

ack

SUMMARY

•Model training and predictions rely on ad-hoc, manual processes spread across multiple systems

SUMMARY



•The Velox system automatically maintains multiple models while providing low latency, scalable, and personalized predictions

SUMMARY




•Velox is part of BDAS, is coming soon…

SUMMARY




•Velox is part of BDAS, is coming soon…•https://amplab.cs.berkeley.edu/projects/velox/

SUMMARY


BACKUP MATERIAL

RETRAIN OFFLINEdef retrainOffline(sc: SparkContext,

trainingData: RDD)

wu · f(x; ✓)

Retrain feature functions

RETRAIN OFFLINEdef retrainOffline(sc: SparkContext,

trainingData: RDD)

wu · f(x; ✓)

Use Spark for batch retrain

Velox: Models in Action

Software

app servernginxmongodbcatify

scalable predictions

different systemswhats

catswhats wrong

applicationwhats wrong

different systems3

space inefficientwhats

integrated approach