Top Banner
Salil Raje Executive Vice President & GM Xilinx Data Center Business FPGAs: The Key to Accelerating High-Speed Storage Systems
29

FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Aug 07, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Salil Raje

Executive Vice President & GM

Xilinx Data Center Business

FPGAs: The Key to Accelerating

High-Speed Storage Systems

Page 2: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

SSDs Have Been a Game Changer for Storage

1

10

100

1000

10000

1 2 3

MIC

RO

SE

CO

ND

S

LATENCY

>> 2

Page 3: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

VIDEOANALYTICS

MACHINE

LEARNING

FINANCIALLIFE

SCIENCES

DATABASE Apps

NetworkStorage

Compute

Explosion of UnstructuredData

Data Filtering

EncryptionDecompression

Compression

>> 3

Page 4: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Continuously Evolving Standards

Data Filtering

EncryptionDecompression

CompressionHadoop

Spark

Aerospike

RocksDB

Cassandra

Foundation DB

GZip

zSTD

Huffman

LZ

Zipline

Brotli

LZ

Brotli

Zipline

DES

AES-XST

SHA1-256

Block chain

>> 4

Page 5: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Controller

Bottlenecks Remain for Data Intensive Applications

PCIe

DRAMDRAM

Excessive data transfers

High latency

Limited BW

CPU not optimized for

these tasks

Processor-centric architecture

Flash

Compute

CPU

>> 5

Page 6: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Controller

Emergence of Computational Storage as the Solution

PCIe

Computational storage architecture

CPU

Compute

acceleration

close to storage

Reduces required bandwidth

Reduces latency

More available

CPU cyclesComputeDRAMDRAM

Flash

>> 6

Page 7: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Growing Industry Momentum for Computational Storage

>> 7

Page 8: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

How FPGAs Address the

Computational Storage Problem

Page 9: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Flash controllers

˃ Storage SystemsCache-offload

Storage System & Switching connectivity

Data Reduction

FPGAs in Storage Today

>> 9

Page 10: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Flexible, fully customizable architecture adapts to specific applications

Massive parallelism, I/O and customizable data path

˃ Performance, power and latency of dedicated HW + reconfigurability of SW

˃ More economical than ASIC/ASSP for many applications

FPGA Advantages for Computational Storage

FPGA FPGA FPGA

Encryption Accelerator Decryption Accelerator Analytics Accelerator

>> 10

Page 11: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Architecture easily adapts to latest compression algorithms

FPGA Advantages for Changing Standards

FPGA FPGA FPGA

Gzip Accelerator Brotli Accelerator Zipline Accelerator

>> 11

Page 12: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Example of Analytics Acceleration

Airline traffic in the USA from 1970 to Present

Flight Data — 1.2B EntriesAirport Data — 500M EntriesPlanes Data — 700M Entries

Q1: “Which cities originate the most flights with >10min delays? Q2: “Which airport in the Bay Area has the worst record?

Computational

Storage Drives

Scan, filter,

Hash-Agg

SSDCtrl

1x

4x

7x

13x

0

2

4

6

8

10

12

14

1

Re

lati

ve

P

erf

orm

an

ce

# FPGA Accelerators

QUERY PERFORMANCE

None 1 2 41

4FPGA

>> 12

Page 13: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Example of Line Rate Hadoop Compression Acceleration

The challenge: Ingest real-timeretail sales data during peak shopping season

FPGA

1x

20x

0 5 10 15 20

Intel Skylake-SP 6152 @2.10GHz CPU (Ubuntu 16.04),

GB/s compression per CPU core = .0229. Alveo U50 =

10GB/s

CPU

FPGA

CPU

vs.

CPU can’t keep up with line-rate data ingestionmaking compression impractical

>> 13

Page 14: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

FPGA-based Data Compression Enables Server Consolidation

With 192TB (uncompressed)

2x Dual CPU Servers

Without Compression

Acceleration

50% Reduction in Nodes

40% Lower Cost

2x Accelerators, 96 TB (compressed)

Single Socket Server

Intel Skylake-SP 6152 @2.10GHz CPU (Ubuntu 16.04), GB/s compression per CPU core = .0229. Alveo U50 = 10GB/s, Assume 2:1

compression

With FPGA Compression

Acceleration

+

>> 14

Page 15: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Computational Storage

Deployment Options

Page 16: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Integrated Accelerator and Flash

˃Benefits:

Easy to implement- plug & play

Adding capacity adds accelerators + performance

Ability to optimize BW between accelerator and flash

Ability to customize FTL for specific workloads

˃Vendors at FMS:

Samsung

Scaleflux

Computational Storage Drive (CSD)

PCIe

DRAMDRAM

CPU

Controller

Flash

FPGA

Controller

Flash

FPGA

Controller

Flash

FPGAFPGA FPGA FPGA

>> 16

Page 17: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Accelerator and Storage on same PCIe

subsystem

˃ Benefits:

SSD vendor independence

Plugs into standard slot

PCIe Peer-to-peer transfers for high bandwidth and low latency

˃ Vendors at FMS:

Bittware

Eideticom

Xilinx

Computational Storage Processor (CSP)

Peer-to-Peer Acceleration

PCIe

DRAMDRAM

CPU

FPGA

FPGA

>> 17

Page 18: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Accelerator in-line with storage

˃ Benefits:

SSD vendor independence

Independently scale accelerators and SSDs

Ability to optimize BW between accelerator and SSDs

˃ Vendors at FMS:

Bittware

Computational Storage Array (CSA)

PCIe

DRAMDRAM

CPU

FPGA

>> 18

Page 19: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Future Directions

Page 20: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Current Data Center Architecture: Fixed Resources, Sub-optimal Utilization

Ethernet

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

Accel

Accel

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

Accel

Accel

Accel

Accel

Accel

Accel

Accel

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

>> 20

Page 21: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Ethernet

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Accel

Future Data Center : Disaggregated and Composable

Workload 1

Challenge: Reduced Bandwidth and Increased Latency

Workload 2

Workload 3

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

>> 21

Page 22: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃ Enables composability without

significant performance penalty

˃ Benefits

Performance and latency benefits of computational storage

Scale compute / storage independently

Higher density per rack

Lowest TCO

˃ Vendors at FMS:

Xilinx

Introducing Composable Storage Acceleration

NVMe over Fabrics

PCIe

>> 22

Page 23: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

SSD

SSD

Ethernet

Storage

Accel

Storage

Accel

Storage

Accel

Storage

Accel ˃ Moves some compute next to the data

˃ Network traffic reduced

˃ Latency improved

˃ Higher utilization with composable

infrastructure

Future DC: Composable + Adaptable Computational Storage

Reduced network traffic

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

Storage

Accel

FPGA

>> 23

Page 24: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

Ethernet

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

˃ Enables low latency high bandwidths acceleration of

network interface workloads.

˃ Enables significantly higher packets per second

˃ Offloads network functions from the CPU

Future DC: Composable + Adaptable Network Acceleration

FPGA

>> 24

Page 25: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Ethernet

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

˃ Customizable acceleration up to

100x faster than CPUs for:

Video transcoding

ML inferencing

Financial modeling

Future DC: Composable + Adaptive Compute Acceleration

FPGA

>> 25

Page 26: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

CPU

SSD

SSD

Ethernet

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Smart

NIC

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Compute

Accel

Storage

Accel

Storage

Accel

Storage

Accel

Storage

Accel

˃ Composable accelerated

storage, networking and

compute

˃ Optimized for each workload

˃ Optimal infrastructure

utilization

Future DC: Composable + Distributed Adaptive Acceleration

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

SSD

Storage

Accel

>> 26

Page 27: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

FPGAs are Key to Accelerating High-Speed Storage Systems

Computational storage addresses a broad range of application bottlenecks

Offers data center operators >5x performance boost and up to 2x reduction of TCO

Xilinx is leading the way in distributed adaptive acceleration

5x

2x

>> 27

Page 28: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

˃Visit Xilinx in booth 313

˃Visit our partners

Alpha Data, Bittware, Burlywood, Codelucida, GigaIO, Echo Streams, Eideticom, Everspin Technologies, IP-Maker, Mobiveil, Pliops, PLDA, Scaleflux, Smart IOPS, Samsung, SMART Modular, Toshiba Memory America, Western Digital

˃Visit our Computational Storage microsite

www.xilinx.com/computational-storage

˃Join SNIA working group for Computational Storage

Computational Storage in Action

>> 28

Page 29: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data

Adaptable.

Intelligent.