Top Banner
1 Artificial Intelligence At The Speed Of Excellence FUT-1230 Roadmap & Vision Alessandro Festa Sr. Product Manager Artificial Intelligence [email protected] @bringyourownai
30

FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

Sep 04, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

1

Artificial Intelligence At The Speed Of Excellence

FUT-1230

Roadmap & Vision

Alessandro Festa

Sr. Product Manager Artificial Intelligence

[email protected]

@bringyourownai

Page 2: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

2

Agenda

1. Artificial Intelligence Landscape Challenges

2. Building a solution from SUSE perspective

3. Roadmap Vision

4. Roadmap Strategy

Page 3: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

3

What is the real

challenge of

Artificial Intelligence

for Enterprises?

The Commodity Dilemma

Page 4: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

4

ai

This is where you want to be.

You are here.

…Things to be done…

Page 5: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

5

Building a

solution from

SUSE’s

perspective

Page 6: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

6

SUSE AI Playground

I need a simple

way to..

Develop my AI models without

spending time to configure my

environment

Deploy an AI platform for my

Data Scientist without waste

time in learning how to do so..

Monitor what Data Scientist

deployed on the AI platform

without waste time in configure

all I need…

Data Scientist

Ai Operator

OR

Page 7: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

7

Risky

Outcomes

More Production

DelaysFalse promises

Unclear outcome,

uncertainty of

infrastructure stability

Production DelaysHigh risk of

errors“customer” walk away

and complain

Ai operator need

to deploy a new Ai

platform Release Ai

platforms

Now what? Am I

sure everything

works as

expected?

Setup/configure

monitoring

How do I ensure everything

works as expected? What I

need to monitor? What I

need to check?

Deploy Ai Platform

How do I install the Ai

Platform? What I need

(storage/containers)?

Define Software

“basics”

What we need as basics?

K8’s/HPC/Single

Server/Cloud?

Define HW resources

What we need?

GPU’s/CPU’s what if we

add/change/remove later?

Collect Data

Scientist

requirements

Data Scientist are not

infrastructure experts.

DevOps are not Ai

experts.

Challenges

Artificial Intelligence Value Stories

“Traditional processes are not good enough any

longer” - An example: Ai Operator journey

Ai playground solve this Ai Stack solve this

Page 8: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

8

Ai operator need

to deploy a new Ai

platform

Release Ai

platforms

Now what? Am I

sure everything

works as

expected?

Setup/configure

monitoring

How do I ensure everything

works as expected? What I

need to monitor? What I

need to check?

Deploy Ai Platform

How do I install the Ai

Platform? What I need

(storage/containers)?

Define Software

“basics”

What we need as basics?

K8’s/HPC/Single

Server/Cloud?

Define HW resources

What we need?

GPU’s/CPU’s what if we

add/change/remove later?

Collect Data

Scientist

requirements

Data Scientist are not

infrastructure experts.

DevOps are not Ai

experts.

Challenges

Use a “ready-to-go”

solution like Ai

Playground allow the

AiOps to select from

pre-configured set of

templates

The pre-built

automation workflow

remove complexity

and give freedom to

change the

infrastructure even

at later time

Templates are

organized by

“target”: on-prem,

on-cloud, dev-

sandboxes and by Ai

platform

We drive the AiOps

through the entire

deployment

experience. Just

launch the auto-pilot.

We offer a tested and

maintained solution.

We take care also of

the “sides” not just

of the “main course”

Platform is ready.

We give you the

basic instructions to

pass to your users.

Job is done.

Mitigation

Artificial Intelligence Value Stories

Real

Outcomes

No delays makes

people happyWe may offer a

solution

We do reduce frictions

and accelerate projects

We stick with deliver

times

Deploy what

you need and

nothing more..

“customer” is happy

and ask for more

Page 9: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

9

Roadmap Vision

Page 10: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

10

Building an AI

Playground

Avoiding Complexity for Data Scientist

Page 11: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

11

GPU/CPU

SLES/openSUSE

MacOS

Container Runtime

SUSE DSVM

AI Playground

Jupyter MS VsCode Nteract Atom

OS Selection

Alternative Component

AI Editor/Ai

Template of choice

HW Selection

Windows WSL2

PolyNote R-StudioML/DL

FrameworksAI Remote

(Pipelines)

SUSE Artificial Intelligence Playground- Reference Architecture

Page 12: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

12

Version 1.0 Version 2.0 Version 3.0 Version 4.0

NVIDIA - INTEL

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD-HUAWEI

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

NVIDIA-INTEL-AMD-

HUAWEI-…

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CAP

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

x86 X86 - ARM X86 - ARM X86 - ARM

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

Ai Containers

Ai Platforms

Ai Application

Orchestrators

Ai Infrastructure

Orchestrators

Ai Storage

Ai OS

HW Accelerators

Platforms

3 months 3 months 3 months

SUSE Artificial Intelligence Stack- Reference Architecture

Page 13: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

13

SUSE Artificial Intelligence PlaygroundReference Architecture

GPU/CPU

SLES/openSUSE

MacOS

Container Runtime

SUSE DSVM

AI Playground (Electron)

Jupyter MS VsCode Nteract Atom

OS Selection

Alternative Component

AI Editor of choice

HW Selection

Windows WSL2

PolyNote R-StudioML/DL

FrameworksAI Remote

(Pipelines)

Page 14: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

14

SUSE Artificial Intelligence Playground

OS

Selection

Alternative

Component

AI Editor

of choice

HW

Selection GPU/CPU

SLES/openSUSE

Container Runtime

AI Playground (Stratos)

Version 1

Single Container CPU Single Container GPU build

at runtime

Single VM CPU (when DSVM will be ready)

Jupyter NotebookJupyter LabAtomMicrosoft VS CodeNteract

Ai Packages

Page 15: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

15

SUSE Artificial Intelligence Playground

OS

Selection

Alternative

Component

AI Editor

of choice

HW

Selection GPU/CPU

SLES/openSUSE

Container Runtime

AI Playground (Stratos)

Version 1

Single Container CPU Single Container GPU build

at runtime

Single VM CPU (when DSVM will be ready)

TerrafornSkubaKfctlKubectl

Helm

Infra Deployment

Ai Platform Deployment

Ai Infra Monitoring

Page 16: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

16

SUSE Artificial Intelligence Playground

OS

Selection

Alternative

Component

AI Editor

of choice

HW

Selection GPU/CPU

SLES/openSUSE

Container Runtime

AI Playground (Stratos)

GPU/CPU

SLES/openSUSE

Container Runtime

SUSE DSVM

AI Playground (Stratos)

Windows WSL2

Version 1 Version 2

Page 17: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

17

SUSE Artificial Intelligence Playground

OS

Selection

Alternative

Component

AI Editor

of choice

HW

Selection GPU/CPU

SLES/openSUSE

Container Runtime

AI Playground (Stratos)

GPU/CPU

SLES/openSUSE

Container Runtime

SUSE DSVM

AI Playground (Stratos)

Windows WSL2

GPU/CPU

SLES/openSUSE

MacOS

Container Runtime

SUSE DSVM

AI Playground (Stratos)

Windows WSL2

Version 1 Version 2 Version 3

Page 18: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

18

AI Software

Stacks

Infrastructure Ready To Go For Your AI Journey

Page 19: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

19

SUSE Artificial Intelligence Stack- Reference Architecture

GPU/CPU

HOST (for K8’s/HPC)

CAASP or K8’s (including Cloud Providers)

HPC (on prem-on cloud)

CAP

AI Platform

CONTAINER CONTAINER VM VM

Optional Component

Alternative Component

AI Platform of choice

GPU/CPU enabled containers

(including third-party registries)

OS optimized for HW

SES or CephStorage

SLE(15 or above)

Page 20: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

20

SUSE Artificial Intelligence StackVersion 1.0 Version 2.0 Version 3.0 Version 4.0

NVIDIA - INTEL

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD-HUAWEI

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

NVIDIA-INTEL-AMD-

HUAWEI-…

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CAP

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

x86 X86 - ARM X86 - ARM X86 - ARM

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

Ai Containers

Ai Platforms

Ai Application

Orchestrators

Ai Infrastructure

Orchestrators

Ai Storage

Ai OS

HW Accelerators

Platforms

AWS “How-to” Documentation (NVIDIA)

SuperMicro Reference Architecture (NVIDIA/Intel)

Intel “Stacks” as “SUSE-Intel Stacks”

AMD Reference Architecture

ARM Reference Architecture (not CaaSP, include NVIDIA)

Huawei Reference Architecture (Huawei GPU’s)

Dell Reference Architecture (NVIDIA/Intel/AMD)

Reference Architectures and Joint “go-to-market”

Page 21: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

21

SUSE Artificial Intelligence StackVersion 1.0 Version 2.0 Version 3.0 Version 4.0

NVIDIA - INTEL

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD-HUAWEI

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

NVIDIA-INTEL-AMD-

HUAWEI-…

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CAP

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

x86 X86 - ARM X86 - ARM X86 - ARM

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

Ai Containers

Ai Platforms

Ai Application

Orchestrators

Ai Infrastructure

Orchestrators

Ai Storage

Ai OS

HW Accelerators

Platforms

Manifests

Skuba

Terraform

Reference Architectures Roadmap Vision

AWS

BareMetal

Kustomize

SUSE Custom Manifest

AWS

Page 22: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

22

SUSE Artificial Intelligence Stack- Reference ArchitectureVersion 1.0 Version 2.0 Version 3.0 Version 4.0

NVIDIA - INTEL

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

KUBEFLOW

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

NVIDIA-INTEL-AMD-HUAWEI

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

NVIDIA-INTEL-AMD-

HUAWEI-…

SLE(15 or above)

CAASP or K8’s (including

Cloud Providers)

HPC (on prem-on cloud)

CAP

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

CON

TAIN

ER

SES or Ceph

x86 X86 - ARM X86 - ARM X86 - ARM

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

KUBEFLOW

AIRFLOW

JUPYTER

HUB

Ai Containers

Ai Platforms

Ai Application

Orchestrators

Ai Infrastructure

Orchestrators

Ai Storage

Ai OS

HW Accelerators

Platforms

3 months 3 months 3 months

Page 23: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

23

Roadmap Strategy

Page 24: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

24

Vision

From zero to the top

The need of the «less (effort) is better»

Be different

Page 25: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

25

AI Packages

Page 26: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

26

AI Virtual Machines For Data Scientist

openSUSE Leap 15.2

https://download.opensuse.org/repositories/Virtualization:/Appliances:/Images:/openSUSE-Leap-15.2/images/

Containers may be created out of every

single package.

Page 27: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

27

What Are AI “Packs”- AI Playground

Any OS :

As application on Data Scientist environment:

• SUSE Linux

• Windows (WSL)

• Web (Jupyter)

Any Tool

As «known» interface to model locally:

• Jupyter Lab

• Jupyter Notebook

• Jupyter Nteract

• Polynote

• MS VSCode

• Atom

• …

Any Framework

As container to be deployed everywhere and to be rapidly accessible locally or remote:

• Tensorflow

• PyTorch

• OpenCV

• …

Any Deployment

Once model is ready is deployable as pipeline in a remote enviroment:

• SUSE CaaSP

• AKS (Azure Arc)

• EKS

• GKS

• …

Page 28: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

28

April 2020 October 2020 June 2021 2022

SUSE Artificial Intelligence1.0

1.1

2.0

Ai Playground• Editors support:

• Jupyter

• JupyterLab

• MS VsCode

• Nteract

• Templates:

• Kubeflow

• Monitoring:

• Kubeflow

AI “Stacks”• Kubeflow on CaaSP

• Intel-SUSE Stacks

• Supermicro Reference Architecture

• Dell Reference Architecture

• ARM Reference Architecture

AI packages• Tensorflow 1.15

• Tensorflow 2.1

• Pytorch

• Caffe

• OpenCV

• ONNX

• ArmNN

• R-Studio

1.0

Ai Playground• Editors support:

• Atom

• Polynote

• R-Studio

• Apache Zeppelin

• Templates:

• AirFlow

• Monitoring:

• Airflow

• Deployment

• Kubeflow

AI “Stacks”• AMD-SUSE Stacks

• NVIDIA Reference Architecture

AI packages• Apache Beam

• Apache Spark

• Tensorflow TFX

Ai Playground• Templates:

• Apache Spark

AI “Plugins”

• IdM/PAM plugins

• Authentication/Authorization

plugins

Ai Playground• Templates:

• AirFlow

• MLRun

• Apache Spark

• Monitoring:

• Apache Spark

• Deployment

• Airflow

• MLRun

• Apache Spark

1.1 2.0 2.1

* Information is forward looking and subject to change at any time.

Jan 2021

2.1

** Items are tech preview

Page 29: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL

29

General Disclaimer

This document is not to be construed as a promise by any participating company to

develop, deliver, or market a product. It is not a commitment to deliver any material,

code, or functionality, and should not be relied upon in making purchasing

decisions. SUSE makes no representations or warranties with respect to the contents of

this document, and specifically disclaims any express or implied warranties of

merchantability or fitness for any particular purpose. The development, release, and

timing of features or functionality described for SUSE products remains at the sole

discretion of SUSE. Further, SUSE reserves the right to revise this document and to

make changes to its content, at any time, without obligation to notify any person or entity

of such revisions or changes. All SUSE marks referenced in this presentation are

trademarks or registered trademarks of SUSE, LLC, Inc. in the United States and other

countries. All third-party trademarks are the property of their respective owners.

Page 30: FUT-1230 - Artificial intelligence at the speed of excellence · 2020. 6. 4. · 21 SUSE Artificial Intelligence Stack Version 1.0 Version 2.0 Version 3.0 Version 4.0 NVIDIA - INTEL