
Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Feb 19, 2017

Transcript
Page 1: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016
Page 2: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

02/06/2016 OpenStack Israel 2016 2

Frontiers of Science and Clouds

Tim Bell – [email protected] - @noggin143

Page 3: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

About Tim


Page 4: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


CERN: founded in 1954: 12 European States
“Science for Peace”
Today: 21 Member States

Member States: Austria, Belgium, Bulgaria, Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, Israel, Italy, Netherlands, Norway, Poland, Portugal, Slovak Republic, Spain, Sweden, Switzerland and United Kingdom
Associate Member States: Pakistan, Turkey
States in accession to Membership: Romania, Serbia
Applications for Membership or Associate Membership: Azerbaijan, Brazil, Croatia, Cyprus, India, Russia, Slovenia, Ukraine
Observers to Council: India, Japan, Russia, United States of America; European Union, JINR and UNESCO

~ 2300 staff
~ 1400 other paid personnel
~ 12500 scientific users
Budget (2016): ~1000 MCHF

Page 5: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Largest Machine on Earth

27 km circumference, 100 m underground

CMS, ALICE, ATLAS, LHCb

Page 6: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Most Powerful Magnets

9,600 superconducting magnets direct the beam

Page 7: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Highest Vacuum

Pressure is 10 times less than on the moon

Page 8: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Coldest Temperature

120 tonnes of liquid helium for cooling to -271°C

Page 9: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

30M USD/year Electricity Bill

Up to 200 MW at peak utilisation

Page 10: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

ATLAS Experiment


Page 11: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

ALICE Experiment


Page 12: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

CMS Experiment


Page 13: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

LHCb Experiment


Page 14: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

COLLISIONS


Collisions Produce 1PB/s

Page 15: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Innovations

Page 16: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Worldwide LHC Computing Grid

• TIER-0 (CERN): data recording, reconstruction and distribution
• TIER-1: permanent storage, re-processing, analysis
• TIER-2: simulation, end-user analysis

• > 2 million jobs/day
• ~350’000 cores
• 500 PB of storage
• nearly 170 sites, 40 countries
• 10-100 Gb links

Page 17: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Compute Growth Outlook

[Chart: compute requirements across Run 1, Run 2, Run 3 and Run 4, with series GRID, ATLAS, CMS, LHCb and ALICE; vertical axis from 0 to 160]

Compute: Growth > x50
Moore’s law only x16
What we can afford

… and 400 PB/year by 2023
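The gap between the two factors on this slide can be made concrete with a little arithmetic. The sketch below is illustrative only: it uses the two figures quoted above (x50 needed, x16 from Moore's law) and derives the number of capacity doublings each implies and the shortfall that must be found elsewhere. The variable names are ours and do not come from the talk.

```python
import math

# Factors quoted on the slide.
required_growth = 50     # compute growth needed ("> x50")
moores_law_growth = 16   # growth expected from Moore's law alone

# Number of capacity doublings each factor corresponds to.
required_doublings = math.log2(required_growth)
moore_doublings = math.log2(moores_law_growth)

# Factor that hardware improvements alone do not cover.
shortfall_factor = required_growth / moores_law_growth

print(f"Required growth:  {required_doublings:.2f} doublings")
print(f"Moore's law:      {moore_doublings:.2f} doublings")
print(f"Shortfall factor: x{shortfall_factor:.3f}")
```

Under these figures, roughly a factor of three has to come from something other than hardware price/performance, such as software efficiency or additional capacity.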

Page 18: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Page 19: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Expanded Facilities in Hungary

Page 20: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

O’Reilly Consideration


Page 21: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Job Trends Consideration


From Indeed.com

Page 22: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

CERN Tool Chain


Page 23: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

CERN OpenStack Project

(*) Pilot

Upstream releases and components:
• ESSEX (5 April 2012): Nova (*), Swift, Glance (*), Horizon (*), Keystone (*)
• FOLSOM (27 September 2012): Nova (*), Swift, Glance (*), Horizon (*), Keystone (*), Quantum, Cinder
• GRIZZLY (4 April 2013): Nova, Swift, Glance, Horizon, Keystone, Quantum, Cinder, Ceilometer (*)
• HAVANA (17 October 2013): Nova, Swift, Glance, Horizon, Keystone, Neutron, Cinder, Ceilometer (*), Heat
• ICEHOUSE (17 April 2014): Nova, Swift, Glance, Horizon, Keystone, Neutron, Cinder, Ceilometer, Heat, Ironic, Trove
• JUNO (16 October 2014): Nova, Swift, Glance, Horizon, Keystone, Neutron, Cinder, Ceilometer, Heat (*), Rally (*)
• KILO (30 April 2015): Nova, Swift, Glance, Horizon, Keystone, Neutron, Cinder, Ceilometer, Heat, Rally, Manila
• LIBERTY (15 October 2015): Nova, Swift, Glance, Horizon, Keystone, Neutron (*), Cinder, Ceilometer, Heat, Rally, Magnum (*), Barbican (*), Ironic

CERN deployments:
• July 2013: CERN OpenStack Production Service
• February 2014: CERN OpenStack Havana Release
• October 2014: CERN OpenStack Icehouse Release
• March 2015: CERN OpenStack Juno Release
• September 2015: CERN OpenStack Kilo Release
• May 2016: CERN OpenStack ongoing Liberty
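The timeline on this slide gives both the upstream release dates and the months in which CERN moved to each release, so the lag between the two can be estimated. The sketch below is a rough illustration, not part of the talk: CERN dates are given only to the month, so the day is assumed to be the 1st, and the lag is counted in whole calendar months. The helper name `months_between` is ours.

```python
from datetime import date

# (upstream release date, CERN deployment month) as listed on the slide.
# The day of the CERN date is an assumption: the slide gives only the month.
timeline = {
    "Havana":   (date(2013, 10, 17), date(2014, 2, 1)),
    "Icehouse": (date(2014, 4, 17),  date(2014, 10, 1)),
    "Juno":     (date(2014, 10, 16), date(2015, 3, 1)),
    "Kilo":     (date(2015, 4, 30),  date(2015, 9, 1)),
}


def months_between(start: date, end: date) -> int:
    """Calendar months from start's month to end's month."""
    return (end.year - start.year) * 12 + (end.month - start.month)


lags = {name: months_between(up, cern) for name, (up, cern) in timeline.items()}

for name, lag in lags.items():
    print(f"{name}: about {lag} months after the upstream release")
```

Liberty is left out because the slide marks it as still ongoing in May 2016.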

Page 24: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Not Just The Software

Upstream OpenStack on its own does not give you a cloud service, e.g.
• 200 people arrive at and leave CERN each month
• User skill levels vary widely
• Applications range from server consolidation and dev/test to production compute

Cloud is a service!

• Monitoring and alerting
• Metering and chargeback
• Autoscaling
• Remediation
• Resource Lifecycle
• Capacity planning
• Upgrades
• SLA
• Customer support
• User experience
• Incident resolution
• Alerting
• Cloud monitoring
• Metrics
• Log processing
• High availability
• Config management
• Infra onboarding
• CI
• Builds
• Net/info sec
• Network design
• OpenStack APIs

Subbu Allamaraju @ eBay

Page 25: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

OpenStack@CERN Status

In production:
• 4 clouds
• >200K cores
• >8,000 hypervisors

~60,000 additional cores being installed in Q2 2016

90% of CERN’s compute resources are now delivered on top of OpenStack

Page 26: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Cultural Change Impact

• Move to Agile technologies brings great benefits
  • Reduced deployment time
  • Continuous integration validation
  • Flexibility
• Don’t forget Hooke’s Law (adapted)
  • Under load, an organisation can extend proportional to external force
  • Too much load leads to permanent deformation
• Ensure the tail is moving fast as well as the head
  • Application support
  • Cultural challenges
  • Process change

Page 27: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Communities Encourage Change

• Open source collaboration sets a model for in-house teams
• External recognition by the community is highly rewarding for contributors
• Reviewing and being reviewed is a constant learning experience
• Operator sharing is much more than just OpenStack

Keystone 2013

Paris 2014

Page 28: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Deployment Models

• Skills
• Culture
• Need for Speed
• Risk Appetite

Page 29: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Keeping Up with Releases


Page 30: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Users and Technology


Page 31: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

When it’s not on the menu?


Page 32: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Nova Cells

Top level cell
• Runs API service
• Top cell scheduler

Child cells run
• Compute nodes
• Nova network
• Scheduler
• Conductor

Version 2 coming
• Default for all
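The two-level layout described on this slide (a top cell that receives requests and chooses a child cell, and a scheduler inside each child cell that chooses a compute node) can be pictured with a toy model. The sketch below is not Nova code and does not use any Nova API: the class names `TopCell`, `ChildCell` and `ComputeNode`, and the "most free cores" selection policy, are assumptions made purely for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class ComputeNode:
    name: str
    free_cores: int


@dataclass
class ChildCell:
    """Hypothetical child cell: owns compute nodes and its own scheduler."""
    name: str
    nodes: List[ComputeNode] = field(default_factory=list)

    def free_cores(self) -> int:
        return sum(node.free_cores for node in self.nodes)

    def can_fit(self, cores: int) -> bool:
        return any(node.free_cores >= cores for node in self.nodes)

    def schedule(self, cores: int) -> ComputeNode:
        # Cell-level scheduler: pick the fitting node with the most free cores.
        candidates = [n for n in self.nodes if n.free_cores >= cores]
        if not candidates:
            raise RuntimeError(f"no capacity in cell {self.name}")
        node = max(candidates, key=lambda n: n.free_cores)
        node.free_cores -= cores
        return node


@dataclass
class TopCell:
    """Hypothetical top-level cell: takes requests and picks a child cell."""
    children: List[ChildCell] = field(default_factory=list)

    def boot(self, cores: int) -> Tuple[str, str]:
        # Top cell scheduler: among cells that can fit the request,
        # choose the one with the most free cores overall.
        eligible = [c for c in self.children if c.can_fit(cores)]
        if not eligible:
            raise RuntimeError("no child cell can fit the request")
        cell = max(eligible, key=lambda c: c.free_cores())
        node = cell.schedule(cores)
        return cell.name, node.name


top = TopCell([
    ChildCell("cell-a", [ComputeNode("a1", 8), ComputeNode("a2", 4)]),
    ChildCell("cell-b", [ComputeNode("b1", 16)]),
])
print(top.boot(4))
```

The point of the split is that the top level only needs coarse knowledge of each cell, while node-level placement decisions stay local to the child cell.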

Page 33: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Onwards Federated Clouds

• CERN Private Cloud: 160K cores
• ATLAS Trigger: 28K cores
• ALICE Trigger: 9K cores
• CMS Trigger: 13K cores
• Public Cloud such as Rackspace or IBM
• INFN, Italy
• Brookhaven National Labs
• NecTAR, Australia
• Many Others on Their Way

Available in standard OpenStack since Kilo

Page 34: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Containers on Clouds

For the user
• Interactive
• Dynamic
• Choice

For IT
• Timely!
• Secure
• Managed
• Integrated

CERN now runs Magnum on the production cloud

Page 35: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Outlook for next 12 months

• Scale out
  • Another ~100K cores end 2016
  • Further significant retirements and migrations
• Consolidate
  • EC2 API project
  • Nova network to Neutron
• Enhance
  • Kubernetes, Mesos, Swarm containers for all
  • Bare metal (Ironic) and Workflow (Mistral) to pilot
  • Accounting / Fleet Management
• Investigate further capacity options
  • Public clouds

Page 36: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Summary

• OpenStack at CERN has been in production for 3 years
• Major cultural and technology changes have been successfully addressed
• Contributing back upstream has led to sustainable tools and effective technology transfer

This transformation would not have been possible without community contribution

Page 37: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Technical details at http://openstack-in-production.blogspot.fr


Thanks to all of you for contributing!

Thanks to the CERN IT team and Francois Briard from CERN communications

Page 38: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Some history of scale…

Date          Collaboration sizes   Data volume, archive technology
Late 1950’s   2-3                   Kilobits, notebooks
1960’s        10-15                 kB, punchcards
1970’s        ~35                   MB, tape
1980’s        ~100                  GB, tape, disk
1990’s        ~750                  TB, tape, disk
2010’s        ~3000                 PB, tape, disk

For comparison:
1990’s: Total LEP data set ~few TB. Would fit on 1 tape today.
Today: 1 year of LHC data ~27 PB

Page 39: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

CERN Archive: >100 PB

CERN New data: 15 PB, 23 PB, 27 PB

Page 40: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Page 41: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

THE CERN MEYRIN DATA CENTRE

http://goo.gl/maps/K5SoG


Page 42: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Public Procurement Cycle

Step                                 Time (Days)                            Elapsed (Days)
User expresses requirement                                                  0
Market Survey prepared               15                                     15
Market Survey for possible vendors   30                                     45
Specifications prepared              15                                     60
Vendor responses                     30                                     90
Test systems evaluated               30                                     120
Offers adjudicated                   10                                     130
Finance committee                    30                                     160
Hardware delivered                   90                                     250
Burn in and acceptance               30 days typical with 380 worst case    280

Total: 280+ Days
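The "Elapsed" column in this table is a running total of the step durations, which is easy to check. The sketch below recomputes it from the per-step times, using the typical 30 days for burn-in, and also derives the total if burn-in hits the 380-day worst case quoted in the table. The variable names are ours; the worst-case total is our own arithmetic rather than a figure from the slide.

```python
from itertools import accumulate

# (step, duration in days) from the table; burn-in uses the typical 30 days.
steps = [
    ("User expresses requirement", 0),
    ("Market Survey prepared", 15),
    ("Market Survey for possible vendors", 30),
    ("Specifications prepared", 15),
    ("Vendor responses", 30),
    ("Test systems evaluated", 30),
    ("Offers adjudicated", 10),
    ("Finance committee", 30),
    ("Hardware delivered", 90),
    ("Burn in and acceptance", 30),
]

# Running total of the durations reproduces the "Elapsed (Days)" column.
elapsed = list(accumulate(days for _, days in steps))

for (name, days), total in zip(steps, elapsed):
    print(f"{name:<36}{days:>4}{total:>6}")

# Worst case: replace the typical 30-day burn-in with the quoted 380 days.
worst_case_total = elapsed[-2] + 380
print(f"Worst-case total: {worst_case_total} days")
```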

Page 43: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Good News, Bad News

• Additional data centre in Budapest now online
• Increasing use of facilities as data rates increase

But…
• Staff numbers are fixed, no more people
• Materials budget decreasing, no more money
• Legacy tools are high maintenance and brittle
• User expectations are for fast self-service

Page 44: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Innovation Dilemma

• How can we avoid the sustainability trap?
  • Define requirements
  • No solution available that meets those requirements
  • Develop our own new solution
  • Accumulate technical debt
• How can we learn from others and share?
  • Find compatible open source communities
  • Contribute back where there is missing functionality
  • Stay mainstream

Are CERN computing needs really special?

Page 45: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Page 46: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

The largest detectors


Page 47: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016


Page 48: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Strategic Plan

• Establish multi-tenant, multi-provider cloud infrastructure
• Identify and adopt policies for trust, security and privacy
• Create governance structure
• Define funding schemes

• To support the computing capacity needs for the ATLAS experiment
• Setting up a new service to simplify analysis of large genomes, for a deeper insight into evolution and biodiversity
• To create an Earth Observation platform, focusing on earthquake and volcano research
• To improve the speed and quality of research for finding surrogate biomarkers based on brain images

Adopters

Suppliers

Additional Users:

Page 49: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Investigations

Support CERN’s scientific computing programme

HN - Helix Nebula
• Partnership between research organization and European commercial cloud providers

* EC co-funded joint Pre-Commercial Procurement (PCP) project: https://indico.cern.ch/event/319753
** Other work has been conducted outside CERN, such as the Amazon Pilot project at BNL for ATLAS

Page 50: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Tier 0

LHCC; 25 May 2016 Ian Bird; CERN 50

Page 51: Frontiers of Science and Clouds - Tim Bell - Openstack Day Israel 2016

Run 2 has only just started

• Hint of an excess with diphoton mass of 750 GeV
• Seen by ATLAS and CMS – coincidence or a new signal?

[Figure: ATLAS Preliminary, Spin-0 Selection, √s = 13 TeV, 3.2 fb⁻¹. Events / 20 GeV versus m_γγ [GeV], showing data and background-only fit, with a lower panel of data minus fitted background.]

[Figure: CMS Preliminary, 2.7 fb⁻¹ (13 TeV, 3.8T), EBEB. Events / (20 GeV) versus m_γγ (GeV), showing data and fit model with ±1σ and ±2σ bands, with a lower panel of (data-fit)/σ_stat.]