Top Banner
RTX: BRINGING AI & ADVANCED GRAPHICS TO VISUAL COMPUTING RAJ MIRPURI – VP PROFESSIONAL VISUALIZATION
31

RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

Jul 25, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX: BRINGING AI & ADVANCED GRAPHICS TO VISUAL COMPUTINGRAJ MIRPURI – VP PROFESSIONAL VISUALIZATION

Page 2: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

2

NVIDIA ACCELERATES ALL WORKLOADS

SCIENTIFIC

RESEARCH

DEEP

LEARNING

MACHINE

LEARNING

RENDERING

& VIZ

CUDA

GPU

Page 3: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

3

Rendering

VISUAL COMPUTING CHALLENGES TODAY

Increased Complexity

& Quality

Increased Demand for

Data Analysis

Massive Data,

Real-time Insight

Need for Mixed Visualization & Compute

Page 4: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

4

NVIDIA POWERS THE NEXT ERA OF COMPUTING

Tensor Cores

RT Cores

Turing SM

NVLINK

Video

Encode/Decode

Display

Massive Memory

Turing RTX Architecture RTX Platform

Accelerated offline rendering

Data Science platform

AI at the Edge

Powerful virtual workstations

Page 5: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

5

RTX SERVER ENABLES AI FOR VISUAL COMPUTING

Virtualization and Mixed Workloads

Accelerate offline batch

rendering with the power of

multi-GPU acceleration

Enable multiple GPU configurations to

develop on larger data sizes to find

business impacting and changing results

Unlock benefits as bandwidth,

latency and availability to resources

are not ubiquitous and plentiful

Empower designers and

artists anywhere in the

world with the high end

RTX GPUs

Rendering Data Science AI at the Edge

Page 6: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX AI FOR RENDERING

Page 7: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

7

AN EXPLOSION OF RICH CONTENT

The shot had over 8 million instanced lights, which initially took 1,000 hours to render each frame

By the end of production, it still took 50 hours to render each fully optimized frame

$0B

$5B

$10B

$15B

2017 2018 2019 2020 2021 2022

Annual Spend on Original Content

Amazon Netflix

1993: Jurassic Park had a total of 63 VFX shots taking a year to create

2018: Marvel’s Avengers: End Game had more than 3,000 shots

Page 8: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

8

CREATE MORE, WAIT LESS WITH QUADRO

Note: CPU = i9-7900X; GPU = NVIDIA RTX GPU; video playback at 2x speed

Page 9: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

9

EXPONENTIAL POWER AT 1/4 THE COST

4 RTX 8-GPU Servers

13 kW

$500,000

1/4 the Cost 1/10 the Space 1/11 the Power

240 Dual 12-core Skylake CPU Servers

144 kW

$2M Render Farm

Interactive BatchRTX-Accelerated Rendering

Interactive Content Creation Batch Rendering

Page 10: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

10

25X ACCELERATED RENDERING FOR NETFLIX

NETFLIX Lost In Space scene: renders in a fraction

of the time using RTX Server

• 6x faster for a single frame

• 25x faster for the entire shot

CPU Node

(Dual Skylake)

RTX Server

(4 x RTX 8000)Improvement

Render time

(1 frame)38 min 6 min 6x

Total render time

(120 frames)76 hours 3 hours 25x

# of nodes 25 1 25x

Power (kW) 13.2 1.9 7x

Acquisition cost $188k $28k 7x

Cost of power (5

yrs)$68k $10k 7x

Total cost $256k $38k 7x

Page 11: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX FOR DESIGN AND RENDERINGAccelerated Workflows with RTX

DESIGN RENDERING

DESIGNWORKS

OpenGL DirectX Vulkan OptiX

MDL vMaterials GVDB

IndeX VXGI

NvPro

VRWorks

Video Codec SDK

PhysX

DESKTOP | MOBILE WORKSTATION DATACENTER CLOUD

RTX SERVER

Page 12: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX AI FOR DATA SCIENCE

Page 13: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

13

3M DATA SCIENTISTS AT WORK TODAY

Ad Personalization

Click Through Rate Optimization

Churn Reduction

CONSUMER INTERNET

Claim Fraud

Customer Service Chatbots/Routing

Risk Evaluation

FINANCIAL SERVICES

Remaining Useful Life Estimation

Failure Prediction

Demand Forecasting

MANUFACTURING

Detect Network/Security Anomalies

Forecasting Network Performance

Network Resource Optimization (SON)

TELECOM

Supply Chain & Inventory Mgmt

Price Mgmt / Markdown Optimization

Promotion Prioritization And Ad Targeting

RETAIL

Intelligent Customer Interactions

Connected Vehicle Predictive Maintenance

Forecasting, Demand, & Capacity Planning

AUTOMOTIVE

Sensor Data Tag Mapping

Anomaly Detection

Robust Fault Prediction

OIL & GAS

Improve Clinical Care

Drive Operational Efficiency

Speed Up Drug Discovery

HEALTHCARE

Page 14: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

14

RTX STREAMLINES DATA SCIENCE WORKFLOWS

12

6

39

GPU-POWEREDWORKFLOW

Train Model

Validate

Test Model

Experiment with Optimizations and Repeat

Go Home on Time

DatasetDownloadsOvernight

Start GET A COFFEE

Stay Late

Restart Data Prep Workflow Again

Find Unexpected Null Values Stored as String…

Switch to Decaf

12

6

39

CPU-POWEREDWORKFLOW

Restart Data Prep Workflow

@*#! Forgot to Add a Feature

ANOTHER…

GET A COFFEE

Start Data PrepWorkflow

GET A COFFEE

Configure Data PrepWorkflow

DatasetDownloadsOvernight

Dataset Collection Analysis Data Prep Train Inference

Same Number of Iterationsin Much Less Time

Page 15: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

15

10X FASTER PERFORMANCE WITH RTX

End-to-end time = Data Prep + Conversion + Training + Validation CPU: dual Gold [email protected] 3.7GHz Turbo (Skylake), ETL with Dask + Pandas Dataset: Mortgage Data 2015-2016

0

50

100

150

200

250

300

CPU 1x RTX8000 2x RTX8000

Data Prep

0

20

40

60

80

100

120

140

160

180

200

CPU 1x RTX8000 2x RTX8000

Training w/XGBoost

0

100

200

300

400

500

600

CPU 1x RTX8000 2x RTX8000

End-to-End

Seconds (lower is better)

~ 30X Faster

than

CPU

~ 8X Faster

than

CPU

~ 10X Faster

than

CPU

Page 16: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

16

20X SPEEDUP FOR ARUP ANALYTICS

Opportunity: 200 Data Scientist

Use Case: Distributed infrastructure assets,

precipitation risk assessment

Data Preparation: 50,000 CSV files to extract

data and then aggregate in multiple formats

Model Training: Open source python for

analytics, pandas, GeoPandas, Shapely, NumPy

“The NVIDIA-powered Data Science Workstation[’s]… combination of well-designed software and highly

performant hardware provides a 20x and higher speed-ups in our analytics work and our team found its

ease of use liberating.”

Steve Walker, Associate Director, Arup, Advanced Digital Engineering

Page 17: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

17

RTX FOR DATA SCIENCE & AIFrom Data Science to NVIDIA Accelerated Data Science with CUDA-X AI

Page 18: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX AI AT THE EDGE

Page 19: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

19

SMART AND SAFE CITIES NEED AI

0M

200M

400M

600M

800M

1,000M

2016 2020

1 billion installed security cameras WW (2020)

30 billion frames per day

Challenging real world conditions

Traditional video analytics not trustworthy

74%

97%

2010 2011 2012 2013 2014 2015 2016

Accura

cy

Image Classification

Human

Hand-coded CV

Deep Learning

AI achieves super human results

AI driven intelligent video analytics

Page 20: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

20

RTX ENABLES AI AT THE EDGE

EXTRACTING NEW VALUE SAFETY COMPLIANCE RETAIL STORE VIDEO SECURITY & PUBLIC SAFETY

Page 21: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

21

Page 22: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

22

REAL-TIME INSIGHTS TO FIND RESOLUTION

Reducing peak trafficcongestion by 15%

Finding lost peoplein 30 minutes

Capturing a criminalbefore he strikes again

Adding 100 virtualsecurity guards

Improving traffic safety

Page 23: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

RTX AI FOR MIXED WORKLOADS

Page 24: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

24

WORKFLOWS DEMAND MIXED WORKLOADS

VISUALIZATION COMPUTE

Windows 10 2D EDA 3D Apps Deep Learning HPC

Page 25: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

Media & Entertainment

Architecture Medical Imaging EnergyManufacturing

MIXED WORKLOADS ARE EVERYWHEREProfessional workflows are getting more demanding

Automotive Finance Defense

Page 26: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

26

CAD

Pre-Processing (CAE)

HPC Solver (CAE)

Post-Processing (CAE)

Addressing the Bottlenecks of FEA SimulationSource: Tech-Clarity 2017

COMPUTER AIDED ENGINEERING WORKFLOWBottlenecks Impact Design Iterations and Time to Market

Page 27: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

27

SIMULATION WITHIN DESIGN POWERED BY RTX

Page 28: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

28

NVIDIA Quadro Virtual Data Center

Workstation Software

• Now supports RTX 6000 and RTX 8000

• Brings world’s most powerful virtual

workstation to RTX Server platform

• Flexibly provision virtual workstations

or a combination of virtual workstations

and render nodes from a single RTX

Server

• Extends power of RTX platform to

designers, on any device, anywhere

QUADRO VIRTUAL WORKSTATION STREAMS FROM RTX SERVER

To Any Device, Anywhere

Page 29: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

29

QUADRO RTX VIRTUAL WORKSTATIONS

NEW TO VDI

Light Users

NVIDIA T4 or P6Quadro Virtual Data Center Workstation

16 GB

Medium Users

NVIDIA T4 or P6Quadro vDWS

16 GB

Type of User

RecommendedSolution

GPU Memory

Multiple Quadro P1000 Up to Quadro P4000Equivalent

Performance

K2, M60, P4, M6Replaces K2, M60, P4, M6

Heavy Users

NVIDIA Quadro RTX 8000, RTX 6000, P40 or V100 with Quadro vDWS

Up to Quadro RTX 8000

N/A

48 GB/32 GB/24 GB

Page 30: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU:

30

RTX POWERS AI FOR VISUAL COMPUTING

Accelerate offline batch rendering with the power of

multi-GPU acceleration

Enable multiple GPU configurations to develop on

larger data sizes to find business impacting and

changing results

Unlock benefits as bandwidth, latency and availability

to resources are not ubiquitous and plentiful

Empower designers and artists anywhere in the world

with the high end RTX GPUs

Page 31: RTX: BRINGING AI & ADVANCED GRAPHICS TO …...10X FASTER PERFORMANCE WITH RTX End-to-end time = Data Prep + Conversion + Training + Validation Dataset: Mortgage Data 2015-2016 CPU: