Lustre Beyond HPC: Toward Novel Use Cases for Lustre?
Presented at LUG 2014
Robert Triendl, DataDirect Networks, Inc., 2014/03/31
This is a vendor presentation!
Lustre Today
• The undisputed file-system-of-choice for HPC
  – large-scale parallel I/O for very large HPC clusters (several thousand nodes or larger)
  – applications that generate very large datasets but only a relatively limited amount of metadata traffic
• Alternative solutions
  – are typically more expensive (since proprietary)
  – and not nearly as scalable as Lustre
Lustre Market Overview: Data for the Japan Market
[Chart: Japan storage market share (%) in 2008, 2013, and 2018, by segment: Cloud & Content, Business Data Analysis, HPC "Work" Corporate, Research Data Analysis, HPC Archive, HPC "Work"]
What Happened? Example: Storage Market Japan
• Lustre market expansion
  – From a relatively small player to the dominant HPC file system
  – From tens of OSSs to hundreds of OSSs (and, when including the "K" system, literally thousands of OSSs)
  – But Lustre has also become synonymous with large-scale HPC
  – Experiments to use Lustre in other markets and for other applications have decreased, rather than increased
Overall Storage Market Evolution
• "Software-defined storage" is replacing traditional storage architectures in large cloud deployments
• Many Lustre "alternatives" are now available
  – Ceph, OpenStack Swift, SwiftStack, Gluster, etc.
  – Various Hadoop distributions
  – Commercial object storage (DDN WOS, Scality, Cleversafe, etc.)
• Lustre remains "exotic" and is rarely even considered as an option
[Pie chart: Japan storage market by segment: BA 6%, HPC "Work" Corporate 15%, Research Data Analysis 35%, HPC Archive 23%, HPC "Work" 24%]
[Feature map: Project Quota, Small File I/O, SSD Acceleration, Fine-Grained Monitoring, NFS/CIFS Access, Management, Connectors, Object/Cloud Links, Data Management, Backup/Replication, HSM, Client Performance, Cluster Integration, Large I/O, I/O Caching, RAS Features]
[Repeat of the market-segment chart and feature map from the previous slide]
Nagoya University Acceptance Benchmark
• Large cluster
  – FPP (file-per-process) I/O from each core in the cluster (see the sketch below)
  – Most efficient configuration with 3 TB/4 TB drives
  – 350-400 threads per Lustre OST
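As a rough illustration of the file-per-process write pattern used in such acceptance benchmarks, here is a minimal Python sketch. This is an assumption for illustration only, not the actual Nagoya benchmark (which would normally be run with a parallel I/O tool such as IOR); the directory path, worker count, and sizes are placeholders. Each worker writes its own file with large sequential writes and the aggregate throughput is reported.

```python
# Minimal file-per-process (FPP) write sketch. Hypothetical paths and sizes;
# not the actual acceptance benchmark.
import os
import time
from multiprocessing import Pool

TARGET_DIR = "/lustre/scratch/fpp_test"  # placeholder Lustre directory
NUM_PROCS = 64                           # workers sharing the file system
BLOCK = 1024 * 1024                      # 1 MiB per write call
BLOCKS_PER_PROC = 1024                   # 1 GiB written by each worker

def write_one(rank):
    # Each worker writes its own file: the "file per process" pattern.
    path = os.path.join(TARGET_DIR, f"fpp.{rank}")
    buf = b"\0" * BLOCK
    with open(path, "wb") as f:
        for _ in range(BLOCKS_PER_PROC):
            f.write(buf)
        f.flush()
        os.fsync(f.fileno())             # ensure data has reached the servers
    return BLOCK * BLOCKS_PER_PROC

if __name__ == "__main__":
    os.makedirs(TARGET_DIR, exist_ok=True)
    start = time.time()
    with Pool(NUM_PROCS) as pool:
        written = sum(pool.map(write_one, range(NUM_PROCS)))
    elapsed = time.time() - start
    print(f"{written / 2**30:.1f} GiB in {elapsed:.1f} s, "
          f"{written / 2**30 / elapsed:.2f} GiB/s aggregate")
```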
Nagoya University Initial Data: FPP with a Large Number of Threads
[Chart: write performance of a single OST (GB/sec) vs. processes per OST]
Large I/O Patches
[Chart: Write: Lustre backend performance degradation, per-OST performance as % of peak vs. number of processes, for 7.2K SAS (1 MB RPC), 7.2K SAS (4 MB RPC), and SSD (1 MB RPC)]
[Chart: Read: Lustre backend performance degradation, per-OST performance as % of peak vs. number of processes, for 7.2K SAS (1 MB RPC), 7.2K SAS (4 MB RPC), and SSD (1 MB RPC)]
Raw Device Performance: Write
[Chart: sgpdd-survey write throughput (MB/sec) vs. total number of threads, Seagate ES3 7.2K NL-SAS, RAID6, crg = 1, 2, 4, 8, 16, 32, 64, 128, 256]
[Chart: sgpdd-survey write throughput (MB/sec) vs. total number of threads, Hitachi 7.2K NL-SAS, RAID6, crg = 1, 2, 4, 8, 16, 32, 64, 128, 256]
Raw Device Performance: Read
[Chart: sgpdd-survey read throughput (MB/sec) vs. total number of threads, Seagate ES3 7.2K NL-SAS, RAID6, crg = 1, 2, 4, 8, 16, 32, 64, 128, 256]
[Chart: sgpdd-survey read throughput (MB/sec) vs. total number of threads, Hitachi 7.2K NL-SAS, RAID6, crg = 1, 2, 4, 8, 16, 32, 64, 128, 256]
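For orientation, the kind of measurement sgpdd-survey automates (many concurrent large sequential transfers directly against the block devices behind the OSTs) can be approximated with a small Python sketch like the one below. This is an assumption for illustration only: sgpdd-survey itself drives sgp_dd with the crg/thread parameters shown in the charts, whereas this sketch simply issues 1 MiB reads from several threads against a placeholder device, and the page cache would need to be bypassed or dropped for realistic numbers.

```python
# Rough multi-threaded sequential-read sketch against a raw block device.
# Placeholder device path; read-only, but requires permission on the device.
import os
import time
import threading

DEVICE = "/dev/sdX"        # placeholder block device (or a large file)
IO_SIZE = 1024 * 1024      # 1 MiB per read
READS_PER_THREAD = 1024    # 1 GiB read per thread
NUM_THREADS = 8            # loosely analogous to sgpdd-survey's thread count

def reader(tid, results):
    # Each thread streams through its own contiguous region of the device,
    # loosely analogous to one concurrent region (crg) in sgpdd-survey.
    fd = os.open(DEVICE, os.O_RDONLY)
    try:
        base = tid * READS_PER_THREAD * IO_SIZE
        total = 0
        for i in range(READS_PER_THREAD):
            total += len(os.pread(fd, IO_SIZE, base + i * IO_SIZE))
        results[tid] = total
    finally:
        os.close(fd)

if __name__ == "__main__":
    results = [0] * NUM_THREADS
    threads = [threading.Thread(target=reader, args=(i, results))
               for i in range(NUM_THREADS)]
    start = time.time()
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    elapsed = time.time() - start
    mib = sum(results) / 2**20
    print(f"{mib:.0f} MiB in {elapsed:.1f} s, {mib / elapsed:.0f} MiB/s")
```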
[Repeat of the market-segment chart and feature map slide]
Output Data from a Simulation
• Requirements
  – Random reads against multiple large files
  – 2 million 4K random read IOPS
• Solution (a sketch of the read pattern follows below)
  – Lustre file system with 16 OSS servers
  – Two SFA12K (or one SFA12KXi)
  – 40 SSDs as Object Storage Devices
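To make the workload concrete, here is a minimal single-process Python sketch of the access pattern described above: 4 KiB random reads spread across several large files, reporting IOPS. The file paths are placeholders and this is an illustration only, not DDN's benchmark; a real measurement would run many such processes across many clients, and without direct I/O or a cold cache the client page cache will inflate the numbers.

```python
# 4 KiB random-read IOPS sketch against a set of large files.
# Placeholder paths; illustration of the access pattern only.
import os
import random
import time

FILES = ["/lustre/sim/output.0", "/lustre/sim/output.1"]  # placeholder files
IO_SIZE = 4096
NUM_READS = 100_000

def random_read_iops(paths, n_reads):
    fds = [os.open(p, os.O_RDONLY) for p in paths]
    sizes = [os.fstat(fd).st_size for fd in fds]
    start = time.time()
    for _ in range(n_reads):
        i = random.randrange(len(fds))
        # Pick a 4 KiB-aligned offset somewhere inside file i.
        offset = random.randrange(sizes[i] // IO_SIZE) * IO_SIZE
        os.pread(fds[i], IO_SIZE, offset)
    elapsed = time.time() - start
    for fd in fds:
        os.close(fd)
    return n_reads / elapsed

if __name__ == "__main__":
    print(f"{random_read_iops(FILES, NUM_READS):,.0f} IOPS (single process)")
```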
Lustre 4K Random Read IOPS
Configuration: 4 OSSs, 10 SSDs, 16 clients
[Chart: Lustre 4K random read IOPS (FPP and SSF) for 1n32p through 16n512p, with series FPP (POSIX), FPP (MPIIO), SSF (POSIX), SSF (MPIIO)]
Genomics Workflows
• Mixed workflows
  – Ingest
  – Sequencing pipelines: large-file I/O
  – Analytics workflows: mixed I/O
• Various I/O issues
  – Random reads for reference data
  – Small-file random reads
Random Reads with SSDs
Data Analysis “Workflows”
• Scientific data analysis
  – Genomics workflows
  – Seismic data analysis
  – Various types of accelerators
  – Large scientific instruments in astronomy
  – Remote sensing and environmental monitoring
  – Microscopy
Data Analysis “Workflows”
• Additional topics
  – Data ingest
  – Data management and data retention
  – Data distribution and data sharing
Hyperscale Storage: HPC, Cloud, Data Analysis

High Performance Computing
– Mostly (very) large files (GBs)
– Mostly write I/O performance
– Mostly streaming performance
– 10s of petabytes of data
– Scratch data
– 100,000s of cores
– Mostly InfiniBand
– Single location
– Very limited replication factor
– High efficiency

Cloud Computing
– Small and medium-size files (MBs)
– Mostly read I/O performance
– Mostly transactional performance
– 10s of billions of files
– WORM & WORN
– 10s of millions of cores
– Almost exclusively Ethernet
– Highly distributed data
– High replication factor
– Low efficiency

Data Analysis Workflows
[Repeat of the market-segment chart and feature map slide]
A (DDN) Vision for Lustre
• Maximum sequential and transactional performance per storage sub-system CPU
• Caching at various layers within the data path
• Increased single node streaming and small file performance
• Millions of metadata operations in a single FS
• Millions of random (read) IOPS within a single FS
A (DDN) Vision for Lustre cont.
• Data management features, including (cloud) tiering, fast and efficient data backup, and data lifecycle management
• Novel usability features such as cluster integration, QoS, directory-level quota, etc.
• Extremely high backend reliability for small and mid-sized systems
Futures for Lustre?
• Work Closely with Users
  – User problems are the best source for future direction
  – Translate user problems into roadmap priorities
• Work Closely with the Lustre Community
  – Work very closely with OpenSFS and Intel HPDD on Lustre roadmap priorities and various other topics
Futures for Lustre?