Top Banner
Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA Is Exascale the End of the Line for Commodity Networks? Scott Pakin Applied Computer Science Group Los Alamos National Laboratory 27 September 2011
8
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Is Exascale the End of the Line for Commodity Networks?

Scott Pakin Applied Computer Science Group Los Alamos National Laboratory

27 September 2011

Page 2: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Short Answer: No

…or, for the most part, me.

Disclaimer: The opinions expressed herein do not necessarily reflect the positions of Los Alamos National Laboratory, the National Nuclear Security Administration, or the United States Department of Energy

Presenter
Presentation Notes
Scott Pakin does not represent or endorse the accuracy or reliability of any advice, opinion, statement or other information provided by this presentation. Reliance upon any such advice, opinion, statement, or other information shall also be at the audience’s own risk. Neither Scott Pakin, nor his employer, nor any of their respective agents, employees, information providers, or content providers, shall be liable to any audience member or anyone else for any inaccuracy, error, omission, interruption, deletion, defect, alteration of or use of any content herein, or for its timeliness or completeness, nor shall they be liable for any failure of performance or communication line failure, regardless of cause, or for any damages resulting therefrom.
Page 3: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Argument #1: Commodity Networks Worked at Petascale

Roadrunner @ Los Alamos • First sustained petaflop/s • 3,060 nodes of InifiniBand • First Top 1 supercomputer ever

to use a commodity network

No multi-PB/s optical data vortex with cryogenic light sources • Sounded like a good idea for

petascale back in 1999

Exascale possibility #1 • Custom networks within a compute unit (e.g., a rack) • Commodity network interconnects the compute units • Not all that different from ASCI Blue Mountain, ca. 1999 (SGI NUMAlink intra-

node, commodity HiPPI inter-node) • For concreteness, consider, e.g., a Blue Gene-like system of 326 IB-connected

racks, 1,024 sockets per rack, and 3 Tflop/s GPUs instead of low-end CPUs

Presenter
Presentation Notes
Whatever happened to the HTMT design, anyway?
Page 4: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Argument #2: Cost

Underlies almost every argument for using almost any commodity

Get something “good” for significantly less money than “perfect” • I could buy a custom-tailored suit

that fits perfectly and looks exactly the way I want

• The clothes I’m wearing now fit fine, look okay, and cost significantly less

Leaves more money to spend on other parts of the system

Page 5: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Argument #3: Misguided Performance Metrics

Does “exascale” mean “1018 flop/s on LINPACK”? • Metric for sorting the Top500 list • People who pay for really big

supercomputers like to see them in the #1 slot

LINPACK transmits only O(N2) data for O(N3) computation

Moral • Buy a relatively cheap network • Put the money saved into more and

faster processors

(Oh, you actually wanted to run applications at exascale?)

Presenter
Presentation Notes
Bandwidth is also a more common metric than latency or feature set, and commodity networks are generally fairly competitive with bandwidth. Or just stick some extra NICs in each node.
Page 6: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Argument #4: Fewer Unknown Unknowns

Commodity networks have all sorts of problems from an HPC standpoint • Per-connection resource requirements • Pre-pinning of communication buffers • Bulky routing tables to handle arbitrary

topologies • Many cycles needed to trigger

communication

Point is that we know what the problems are • Academia figures out how to work

around most network shortcomings • Industry eventually produces great

implementations of awful standards • Why is my one-off network sometimes

slow? Who knows? (Limited experience and few tools)

“[T]here are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns—the ones we don't know we don't know.”

Presenter
Presentation Notes
Think of CISC winning out over RISC: better implementations won out over the better concept.
Page 7: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Argument #5: Stupid, Meddling Bureaucrats

U.S. regulations prohibit granting supercomputer access to a non-U.S. person without acquiring an export license

What’s a supercomputer?

If the system uses a proprietary network, then

If the system uses a commodity network, then

References • U.S. Dept. of Commerce. A

Practitioner’s Guide to Adjusted Peak Performance. Dec. 2006

• U.S. Export Administration Regulations, Part 774: Commerce Control List, Category 4 (Computers), Supplement No. 1

� 𝑊𝑊𝑖𝑖 �𝐹𝐹𝐹𝐹𝐹𝐹𝑖𝑖 𝑡𝑡𝑖𝑖

��������

Node performancein “weighted Tflop /s”

(an idiotic metric )

> 1.5 WT𝑛𝑛

𝑖𝑖=1

max1≤𝑖𝑖≤𝑛𝑛

𝑊𝑊𝑖𝑖 �𝐹𝐹𝐹𝐹𝐹𝐹𝑖𝑖𝑡𝑡𝑖𝑖

� > 1.5 WT

Page 8: Is Exascale the End of the Line for Commodity Networks?

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Conclusion

Let’s go build some exascale supercomputers with commodity networks!

It won’t be a horrible mess…really!