Top Banner
STORIES ABOUT SPARK, HPC & BARCELONA Jordi Torres Barcelona Supercomputing Center UPC Barcelona Tech www.JordiTorres.eu - @JordiTorresBCN
63

Stories About Spark, HPC and Barcelona by Jordi Torres

Jan 11, 2017

Download

Data & Analytics

Spark Summit
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Stories About Spark, HPC and Barcelona by Jordi Torres

STORIES ABOUT SPARK, HPC & BARCELONAJordi TorresBarcelona Supercomputing CenterUPC Barcelona Techwww.JordiTorres.eu - @JordiTorresBCN

Page 2: Stories About Spark, HPC and Barcelona by Jordi Torres

Why HPC?

Page 3: Stories About Spark, HPC and Barcelona by Jordi Torres

Scientists always needed the best instruments which technology of the time allowed to build

Microscope (Santiago Ramon y Cajal) Large Hadron Collider (CERN)

Page 4: Stories About Spark, HPC and Barcelona by Jordi Torres

And supercomputers today can be considered as the ultimate scientific

instrument that enables progress in science

Page 5: Stories About Spark, HPC and Barcelona by Jordi Torres

The Evolution of The Research Paradigm

High Performance Computing means Numerical Simulation and Big Data Analysis that allows

Reduce expense Avoid dangerous experiments Help to build knowledge where experiments are impossible or not affordable

Page 6: Stories About Spark, HPC and Barcelona by Jordi Torres

HPC is an enabler for all scientific fields

Life Sciences & Medicine

Earth Sciences

Astro, High Energy & Plasma Physics

Materials, Chemistry & Nanoscience

Engineering Neuroscience

Page 7: Stories About Spark, HPC and Barcelona by Jordi Torres

Emergent focus on big data requires a transition of computing facilities into a data-centric paradigm too

However, traditional HPC systems are designed according to the compute-centric paradigm

Page 8: Stories About Spark, HPC and Barcelona by Jordi Torres

We have experimented with this in our HPC facility in Barcelona.

And this is what I’m going to talk about today!

How can traditional HPC existing infrastructure evolve to meet the new demands?

Page 9: Stories About Spark, HPC and Barcelona by Jordi Torres

What is HPC in Barcelona like?

Page 10: Stories About Spark, HPC and Barcelona by Jordi Torres

In Barcelona HPC is without doubt …A team of 425 people(from 40 countries)

Page 11: Stories About Spark, HPC and Barcelona by Jordi Torres

BSC scientific departmentsEARTH SCIENCES

LIFE SCIENCES

ENGINEERING SCIENCE

COMPUTER SCIENCES

Page 12: Stories About Spark, HPC and Barcelona by Jordi Torres

Joint Research Centres with IT Companies

BSC-Microsoft Research Centre

BSC-IBM Technology Center for Supercomputing

Intel-BSC Exascale Lab

BSC-NVIDIA CUDA Center of Excellence

Page 13: Stories About Spark, HPC and Barcelona by Jordi Torres

Our Supercomputer in BarcelonaMarenostrum

Supercomputer

Page 14: Stories About Spark, HPC and Barcelona by Jordi Torres

Born inside a deconsecrated chapel

Page 15: Stories About Spark, HPC and Barcelona by Jordi Torres

The Marenostrum 3 SupercomputerOver 1015 Floating Points Operations per second (Petaflop)

– Nearly 50,000 cores

– 100.8 TB of memory

– 2000 TB disk storage

Page 16: Stories About Spark, HPC and Barcelona by Jordi Torres

The third of three brothers• 2004: MareNostrum 1

– Nearly 5x1013 Floating Points per second

– Nearly 5.000 cores– 236 TB disk storage

• 2006: MareNostrum 2– Nearly 1014 Floating

Points per second– Over 10.000 cores– 460 disk storage

• 2012: MareNostrum 3

Page 17: Stories About Spark, HPC and Barcelona by Jordi Torres

Marenostrum ancestors in the chapelA parallel system inside the same chapel:

Grandparent:Processing capacity: Over 1000 operations-beats per minuteParallel system with 8 parallel typewriter units.

Grandmother:Storage capacity: over 100MbParallel Storage System with 14 drawer devices.

Page 18: Stories About Spark, HPC and Barcelona by Jordi Torres

How could BSC meet new Big Data demands?

Page 19: Stories About Spark, HPC and Barcelona by Jordi Torres

Until now, the habitual MN3 workloads have been numerical applications

• MN3 Basic software Stack:– OpenMP– MPI– Threads– …

Page 20: Stories About Spark, HPC and Barcelona by Jordi Torres

How can MN3 evolve to meet new Big Data Analytics demand?

New module developed at BSC

MarenostrumSupercomputer+

Page 21: Stories About Spark, HPC and Barcelona by Jordi Torres

SPARK4MN module• framework to enable Spark workloads over

IBM LSF Platform workload manager on MN3

Page 22: Stories About Spark, HPC and Barcelona by Jordi Torres

Spark4MN in action

Lets go!

Page 23: Stories About Spark, HPC and Barcelona by Jordi Torres

Spark4MN in action• We performed a System level Performance

Evaluation & Tuning to MN3• Example of some results:

– Speed-up– Scale-up– Parallelism

Page 24: Stories About Spark, HPC and Barcelona by Jordi Torres

Example 1: Kmeans Speed-upMore dimensions smaller speed-up because of increased shuffling (same number of centroids to shuffle but bigger)

• Times for running k-means for 10 iterations.

• Problem size constant = 100GBs (10M1000D = 10M vectors of 1000 dimensions)

Page 25: Stories About Spark, HPC and Barcelona by Jordi Torres

Example 2: Kmeans Scale-up

• modify both the number of records and the number of machines.

• Ideally, all the plots should be horizontal

our system behaves closely to that.

Page 26: Stories About Spark, HPC and Barcelona by Jordi Torres

Example 3: Configuring task parallelismVarying the number of tasks over the same amount of cores

for k-means, the best-performing configuration is to have as many partitions as cores = 1 task per core is better!

• Median times for running k-means for 10 iterations with different number of partitions

• In our benchmarks the number of tasks is equal to the number of RDD partitions.

Page 27: Stories About Spark, HPC and Barcelona by Jordi Torres

Example 3: Configuring task parallelism• Using Sort-by-key: a more intensive shuffling-intensive scenario

– We sort 1 billion records using 64 nodes & different partition sizes

– Contrary to the previous case, we observe speed-ups when there are 2 partitions per core

Page 28: Stories About Spark, HPC and Barcelona by Jordi Torres

Exemple 4: sort-by-key• How many concurrent tasks an executor

can supervise?Having 2 8-core executors instead of 8 2-core ones, improves on the running time by a factor of 2.79 leaving all the other parameters the same.

Page 29: Stories About Spark, HPC and Barcelona by Jordi Torres

More results on Friday at the Santa Clara conference!

2015 IEEE International Conference on Big Data October 29-November 1, Santa Clara, CA, USA

Page 30: Stories About Spark, HPC and Barcelona by Jordi Torres

Spark and node level performance?

Page 31: Stories About Spark, HPC and Barcelona by Jordi Torres

New Architecture Support for Big Data Analytics

Exponential increase in core countNever promising technologies (Hybrid Memory Cubes, NVRAM, etc)

Page 32: Stories About Spark, HPC and Barcelona by Jordi Torres

Our Research Goal

Improve the node level performance of

state-of-the-art scale-out data

processing framework

+

Page 33: Stories About Spark, HPC and Barcelona by Jordi Torres

Speed-up vs Executor threads

(*) Processor Intel Xeon E5-2697 (24 cores) & Spark 1.3

Page 34: Stories About Spark, HPC and Barcelona by Jordi Torres

Data Processing Capacity scaling at large input dataset

The performance of Spark workloads degrades with large volumes of data due to substantial increase in garbage collection and file I/O time.

Spark workloads do not saturate the available bandwidth and hence their performance is bound on DRAM latency

Page 35: Stories About Spark, HPC and Barcelona by Jordi Torres

More results on

• A. J. Awan, M. Brorsson, V. Vlassov and E. Ayguade, "Performance Characterization of In-Memory Data Analytics on a Modern Cloud Server", in 5th IEEE International Conference on Big Data and Cloud Computing (BDCloud), Aug 2015, Dalina, China (Best Paper Award)

• A. J. Awan, M. Brorsson, V. Vlassov and E. Ayguade, "How Data Volume Affects Spark Based Data Analytics on a Scale-up Server", in 6th International Workshop on Big Data Benchmarks, Performance Optimization and Emerging Hardware (BpoE), held in conjunction with 41st International Conference on Very Large Data Bases, Sep 2015, Hawaii, USA.

Page 36: Stories About Spark, HPC and Barcelona by Jordi Torres

Next generation of HPC programming models and Spark?

Page 37: Stories About Spark, HPC and Barcelona by Jordi Torres

BSC programming model COMPSs

– Sequential programming model

– Abstracts the application from the underlying distributed infrastructure

– Exploit the inherent parallelism at runtime

Page 38: Stories About Spark, HPC and Barcelona by Jordi Torres

We are studying the comparison and interaction between these two programming

models in platforms like marenostrum 3

MarenostrumSupercomputer

MarenostrumSupercomputer

Page 39: Stories About Spark, HPC and Barcelona by Jordi Torres

Profiling Spark with BSC’s HPC tools

• Relying on over 20 years HPC experience & tools for profiling

• Preliminary work: Developed the Hadoop Instrumentation Toolkit

CPU

Memory

Page Faults

processes and communication

Page 40: Stories About Spark, HPC and Barcelona by Jordi Torres

Project ALOJA: Benchmarking Spark

• Open initiative to Explore and produce a systematic study of Hadoop/Spark efficiency on different SW and HW

• Online repository that allows compare, side by side all execution parameters ( 50,000+ runs over 100+ HW config.)

Page 41: Stories About Spark, HPC and Barcelona by Jordi Torres

Big Data Analytics workloads at BSC?

(with Spark)

Page 42: Stories About Spark, HPC and Barcelona by Jordi Torres

Preliminary work• Multimedia Big Data Computing: Work with three kinds of data at the same time

social network

relationships

audiovisualcontent metadata

Page 43: Stories About Spark, HPC and Barcelona by Jordi Torres

Preliminary case study Multimodal Data Analytics systems

E.g. Latent User Attribute Inference to Predicting Desigual Followers

Page 44: Stories About Spark, HPC and Barcelona by Jordi Torres

44

Example of tools created: VectorizationNecessary for visual similarity search, visual clustering, classification, etc.

Page 45: Stories About Spark, HPC and Barcelona by Jordi Torres

45

Available in our github: bsc.spark.image scala> import bsc.spark.image.ImageUtils…scala> images = ImageUtils.seqFile("hdfs://...", sc);

scala> dictionary = ImageUtils.BoWDictionary(images);

scala> vectors = dictionary.getBags(images); … scala> val splits = vectors.randomSplit(Array(0.6, 0.4), seed = 11L)

scala> training = splits(0)

scala> test = splits(1)

scala> model = NaiveBayes.train(training, lambda = 1.0)…

Page 46: Stories About Spark, HPC and Barcelona by Jordi Torres

Applications: Locality Sensitive Hashinge.g. near-replica detection (visual spam detection, copyright infringement)

PATCH 1

PATCH 2

PATCH 3

PATCH 4

KP1

KP2

KP3

KP4

feature detection

feature description

0000 0100 1100

0010 0110 1110

0011 0111 1111

features are sketched, embedded into a Hamming space

Similar features are hashed into similar buckets in a hash table

SIFT, SURF, ORB, etc.

0 1 1 0

Page 47: Stories About Spark, HPC and Barcelona by Jordi Torres

Current work: Computer Vision

• Makes very productive use of (convolutional) neural networks • SIFT features became unnecessary (used for decades)

Page 48: Stories About Spark, HPC and Barcelona by Jordi Torres

What next at BSC?

Page 49: Stories About Spark, HPC and Barcelona by Jordi Torres

BSC vision:Giving computers a greater

ability to understand information, and to learn, to

reason, and act upon it

Page 50: Stories About Spark, HPC and Barcelona by Jordi Torres

Old wine in a new bottle?

• the term itself dates from the 1950s. • periods of hype and high

expectations alternating with periods of setback and disappointment.

Artificial Intelligence

plays an important

role

Page 51: Stories About Spark, HPC and Barcelona by Jordi Torres

Why Now?1. Along the explosion of

data …

now algorithms can be “trained” by exposing them to

large data sets that were previously unavailable.

2. And the computing power necessary to

implement these algorithms are now available

Page 52: Stories About Spark, HPC and Barcelona by Jordi Torres

Evolution of computing powerFLOP/second

1988Cray Y-MP (8 processadors)

1998Cray T3E (1024 processadors)

2008Cray XT5 (15000 processadors)

~2019? (1x107 processadors

Page 53: Stories About Spark, HPC and Barcelona by Jordi Torres

This new type of computing requires

DATA

SupercomputersResearch

Big DataTechnologies

Advanced Analytic

Algorithms

1. the continuous development of supercomputing systems

2. enabling the convergence of advanced analytic algorithms

3. and big data technologies

Page 54: Stories About Spark, HPC and Barcelona by Jordi Torres

Today technologies & focus at BSC

COMPUTERVISION

Advanced Analytics

Algorithms

Page 55: Stories About Spark, HPC and Barcelona by Jordi Torres

Cognitive Computing requires a transition of computing facilities into a new paradigm too

Name? … We use Cognitive Computing

Yesterday Today Tomorrow

Page 56: Stories About Spark, HPC and Barcelona by Jordi Torres

And to finish… Welcome to Barcelona!

Page 57: Stories About Spark, HPC and Barcelona by Jordi Torres

Welcome to our wonderful city

57

Page 58: Stories About Spark, HPC and Barcelona by Jordi Torres

Welcome to our university

22 schools - 4K employees - 35K students

Page 59: Stories About Spark, HPC and Barcelona by Jordi Torres

Welcome to our research center

Page 60: Stories About Spark, HPC and Barcelona by Jordi Torres

Welcome to our everyday life

60

Page 61: Stories About Spark, HPC and Barcelona by Jordi Torres

Welcome to our academic activities• Teaching Spark @ Master courses• Using Spark @ Final Master Thesis• Using Spark @ Research activity

• NEW Spark Book in Spanish• Editorial UOC • Presentation November 3, 2015

61

Foreword by Matei Zaharia

Page 62: Stories About Spark, HPC and Barcelona by Jordi Torres

1000+ members

62

Welcome to our Spark Community

Page 63: Stories About Spark, HPC and Barcelona by Jordi Torres

1000+ members

63

Thank you for your attention!Jordi Torres @JordiTorresBCN www.JordiTorres.eu

Welcome to our Spark Community