Top Banner
15

Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

May 20, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier
Page 2: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier
Page 3: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lustre* & Intel® Omni-Path Fabric

Keith Mannthey, Lustre Solutions Architect

Intel High Performance Data Division

Page 4: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Agenda

• Lnet Introduction

• Intel® Omni-Path Architecture (OPA)

• Lnet Intel® OPA Considerations

• Lnet Intel® OPA Integration into Existing Fabrics

4

Page 5: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet Introduction

5

Page 6: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Intel® Manager for Lustre*

Lnet Introduction

6

ManagementNetwork

High Performance Data Network(InfiniBand*, 10GbE)

MetadataServers(1-10s)

Object Storage Servers(10s-1000s)

Lustre Clients (1 – 100,000+)

Object StorageTargets (OSTs)

Object StorageTargets (OSTs)MetadataTarget (MDT)

ManagementTarget (MGT)

HSM

Native Lustre* Client for Intel® Xeon Phi™ processorIntel® Digital Security Surveillance? (hinting)

Intel® Omni-Path SupportRobin Hood

Open ZFS, RAIDzHadoop* Adapters

Page 7: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet (continued)

• Networks are given unique names – o2ib0, tcp0, tcp1

• Lustre Network Identifier (NID) defines interfaces– 10.1.145.16@o2ib0

• Includes native support for multiple networks– Accomplished via the Lustre Network Driver (LND)

• Infiniband via o2ib verbs interface, with RDMA support

• Ethernet via TCP/IP interface

• Lustre -> Network RPC API -> LNet -> LND -> Linux Driver

7

Page 8: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet

• Designed to meet the needs of large-scale computing clusters– Optimized for very large node counts, high throughput

• Works with most networks, supports RDMA– Omni-Path, Ethernet, Infiniband, XC/XT/XE, ELAN, Myrinet*, etc.

• LNet is independent of the Lustre file system– Abstracts network details from Lustre

– Implemented as a set of kernel modules

8

Page 9: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Intel® Omni-Path Architecture Building on the industry’s best technologies Highly leverage existing Aries and Intel®

True Scale fabric

Adds innovative new features and capabilities to improve performance, reliability, and QoS

Re-use of existing OpenFabricsAlliance* software

94

Robust product offerings and ecosystem End-to-end Intel product line

Strong ecosystem with 70+ Fabric Builders members

Software

Open SourceHost Software and

Fabric Manager

HFI Adapters

Single portx8 and x16

x8 Adapter(58 Gb/s)

x16 Adapter

(100 Gb/s)

Edge Switches

1U Form Factor24 and 48 port

24-portEdge Switch

48-portEdge Switch

Director Switches

QSFP-based192 and 768 port

192-portDirector Switch

(7U chassis)

768-portDirector Switch

(20U chassis)

Cables

Third Party VendorsPassive Copper Active Optical

Silicon

OEM custom designsHFI and Switch ASICs

Switch siliconup to 48 ports

(1200 GB/stotal b/w

HFI siliconUp to 2 ports

(50 GB/s total b/w)

Page 10: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet Intel® OPA Considerations

• Base OS Support – Red Hat Enterprise Linux 7

– SUSE Linux Enterprise Server 12

• Lustre 2.7+ Required for OS Support Server Side

• Intel Fabric Suite (IFS) delivers OPA UPDATES – IFS updates Base OS OFED components only as required

– Enables the use of other in Kernel drivers concurent with OPA

10

Page 11: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet Intel® OPA Considerations (continued)

• Intel® OPA gen 1 Supports RDMA Verbs in OFED – Lnet uses the existing LND Infiniband Driver

• Only LND and Driver TUNING required for operation – Automated LND settings at Lnet install time

11

Page 12: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Integration into Existing Fabrics

Challenge : Intel® OPA is not directly compatible with Infinband. Intel® OPA cannot plug into IB Switch

Solution: Lnet Router

12

OPA Lustre Components

LNetRouters

IB Lustre ComponetsOPA Inifiband

Page 13: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Lnet Routers Overview

• Lnet Routers Lustre Software + Standard HW– Use off the shelf Hardware

– Software is apart of standard Lnet/Lustre

– Clustered Deployment Recommended

• Supported Configurations with Redhat* 7.2 and Lustre 2.7+– Intel® OPA -> Ethernet

– Intel® OPA -> FDR (use in kernel drivers for IB)

– Intel® OPA -> EDR (use in kernel drivers for IB)

– See Intel® Enterprise Edition for Lustre* software Configuration For LNet Routers

13

Page 14: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier

Wrap up

• Lustre Intel® OPA is in production Today

• Lustre Routers provided Flexible Deployment Options

• Learn More– www.intel.com/Lustre

14

Page 15: Lustre* & Intel® Omni-Path Fabric...Robin Hood Open ZFS, RAIDz Hadoop* Adapters Lnet (continued) • Networks are given unique names – o2ib0, tcp0, tcp1 • Lustre Network Identifier