Top Banner
Introducing Apache PredictionIO (incubating) http://predictionio.incubator.apache.org Donald Szeto Tech Lead @ Salesforce First Git Commit @ PredictionIO
14

Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Apr 16, 2017

Download

Software

Databricks
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

IntroducingApache PredictionIO (incubating)http://predictionio.incubator.apache.org

Donald Szeto

Tech Lead @ SalesforceFirst Git Commit @ PredictionIO

Page 2: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Agenda

Future {

println(roadmap)

}

?

(Source: benbrandt22, https://redd.it/383edw)

Page 3: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Why PredictionIO?

ML problem 1 Data collection Modeling Serving/scoring

Evaluation

ML problem 2 Data collection Modeling Serving/scoring

Evaluation

Is there a common factor?(Not his actual words :))

Page 4: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

What is PredictionIO?

A machine learning server for developers and ML engineers

PredictionIO API

Engines EnginesEngines

Page 5: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)
Page 6: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Quick Demo

Page 7: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Digging Deeper

What is DASE?Data, Algorithm, Serving, Evaluation

What are engine instances?

What are engine variants?

Page 8: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Engine Instances

Engine(Scala/Java code)

Data

Engine Parameters

Algorithm Hyperparameters

Environment

+ Engine Instance=

Page 9: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Digging Deeper

What is DASE?Data, Algorithm, Serving, Evaluation

What are engine instances?

What are engine variants?

Page 10: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Engine Variants

Engine(Scala/Java code)

Data

Engine Parameters

Algorithm Hyperparameters

Environment

+ Engine Instance=

Data

Engine Parameters

Algorithm Hyperparameters

Environment

Engine Instance

Page 11: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Current Development ( <= 0.10.0 )

Migrating to ASF infrastructure

Merging forksSliding window event data source

Installation fixes

Engine templates and SDKs migration

Docker-based Integration Test Infrastructure

Page 12: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Travis CI Worker

Travis CI Worker

Travis CI Worker

Travis CI Worker

Docker-based Integration Test Infrastructure

Running many test engines in different environments, in parallel

Page 13: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Future Roadmap ( > 0.10.0 )

Cross-building with Spark 1.x and Spark 2.x

Better native support of Spark ML Pipeline and DataSet

Multi-engine serving

Admin API w/ CLI Refactoring

Testing infrastructure for community engine templates

Further separation-of-concernsAllow engines to use different parts of PredictionIO independently

Page 14: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

WE NEED YOUR HELP!!!Please subscribe to [email protected]

For usage questions please subscribe to [email protected]

donald (at) apache.org

Thank you!