Top Banner
27
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Distro compute
Page 2: Distro compute
Page 3: Distro compute

Introduce

Page 4: Distro compute

WorkloadsDistributed compute in a nutshell (where many nuts > few nuts)

Page 5: Distro compute

Workloads

Page 6: Distro compute

Distributed Compute Developer

Big Compute Big Penguin Big Data

Page 7: Distro compute

BIG ANYTHINGHello All Worlds

Page 8: Distro compute

Big Co$t?

That’d be like Microsoft, right?

Page 9: Distro compute

Microsoft Research Genomics

Page 10: Distro compute

InspectDistributed compute in a nutshell (with many little nuts)

Page 11: Distro compute

Inspect

Page 12: Distro compute

HPCHead Node

Broker Nodes

Compute Nodes

Allows on-premises

And hybrid option

Compare Architectures

Big DataName Node

Data Nodes

Allows cloud or on-premises no hybrid option

Page 13: Distro compute

Hadoop

HPC

Page 14: Distro compute

All distributed compute works on the basis of taking a large JOB and breaking it to many smaller TASKS which are then parallelised

Page 15: Distro compute

Develop

Page 16: Distro compute

Deploy

Page 17: Distro compute

ExamplesHow do you take your compute?

Page 18: Distro compute

OUCHLessons learned from getting too close to the coalface

Page 19: Distro compute
Page 20: Distro compute

A broken cluster is no place to be diagnosing

Page 21: Distro compute
Page 22: Distro compute

Scalability < Elasticity

Page 23: Distro compute
Page 24: Distro compute

Hybrid HPC is next to useless

Page 25: Distro compute
Page 26: Distro compute

95% through a petabyte is a bad place to find a bug

Page 27: Distro compute

Thank you!