Top Banner
ISC 2014 Dr Paul Calleja Director Cambridge HPC Service SKA The worlds largest Radio Telescope streaming data processor
43

SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

Aug 04, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Dr Paul Calleja

Director Cambridge HPC Service

SKA

The worlds largest Radio Telescope

streaming data processor

Page 2: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Introduction to Cambridge HPCS

• Overview of the SKA project and the SKA SDP consortium

• Two SDP work pages

• SDP Open Architecture Lab

• SDP operations

Overview

Page 3: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Cambridge University

• The University of Cambridge is a world leading teaching & research institution, consistently ranked within the top 3 Universities world wide

• Annual income of £1200M - 40% is research related - one of the largest R&D budgets within the UK HE sector

• 17000 students, 9,000 staff

• Cambridge is a major technology centre – 1535 technology companies in surrounding science parks – £12B annual revenue – 53000 staff

• The HPCS has a mandate to provide HPC services to both the University and wider technology company community

Page 4: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Four domains of activity

Dell HPC

Solution Centre

Industrial HPC

Service

Cambridge

HPC Service

Commodity

HPC Centre of

Excellence

Promoting

uptake of HPC

by UK Industry

Driving

Discovery

Advancing

development and

application of HPC HPC

R& D

Page 5: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Over 1000 registered users from 35 departments

• 856 Dell Servers - 450 TF sustained DP performance

• 128 node Westmere (1536 cores) (16 TF)

• 600 node (9600 core) full non blocking Mellanox FDR IB 2,6 GHz sandy bridge (200 TF) one of the fastest Intel clusters in he UK

• SKA GPU test bed -128 node 256 card NVIDIA K20 GPU

• Fastest GPU system in UK 250 TF

• Designed for maximum I/O throughput and message rate

• Full non blocking Dual rail Mellanox FDR Connect IB

• Full GPUDirect functionality

• Design for maximum energy efficiency

• 2 in Green500 (Nov 2013)

• Most efficient air cooled supercomputer in the world

• 4 PB storage – Lustre parallel file system 50GB/s

Cambridge HPC vital statistics

Page 6: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

CORE – Industrial HPC service & consultancy

Page 7: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Dell | Cambridge HPC Solution Centre

• The Solution Centre is a Dell Cambridge joint funded HPC centre of excellence, provide leading edge commodity open source HPC solutions.

Page 8: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• HPCS in partnership with STFC

• STFC funded 50% of Cambridge system

• Run as national service for STFC funded astronomy and particle physics

communities

• UK QCD consortium major science user

• Testing standard model through

numerical simulation of Quantum

Chromodynamics

• Group has performed 1 years simulation

output in 1st month on new system

DiRAC national HPC service

Page 9: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Cambridge were the first European NVIDIA CUDA COE

• Cambridge has had first large scale GPU cluster in UK for the last four years

• Key technology demonstrator for SKA

• Strong CUDA development skill within HPCS

• New large GPU system– largest GPU system in UK one of the most energy

efficient supercomputers in the world – built to push parallel scalability of

GPU clusters by deploying the best network possible and combining with

GPUDirect

NVIDIA CCOE

Page 10: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• The HPCS is providing all the data storage and computational recourse for a

major new genomics study

• The study involves the gene sequencing of 20,000 decease patients from

the UK

• This is a major data throughput workload with high data security issues

• It requires building designing an efficient data throughput pipeline in terms of

hardware and software.

20K genome project

Page 11: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• 5 year research project with JLR

• Drive capability in simulation & data mining

• HPC design, implementation and operation best practice

JLR R&D

Page 12: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SA CHPC collaboration

• HPCS has a long term strategic

partnership with CHPC

• HPCS has been working closely

with CHPC for last 6 years

• Technology strategy, system design

procurement

• HPC system stack development

• SKA platform development

Page 13: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Next generation radio telescope

• 100 x more sensitive

• 1000000 X faster

• 5 square km of dish over 3000 km

• The next big science project

• Currently the worlds most ambitious IT

Project

• First real exascale ready application

• Largest global big-data challenge

Square Kilometre Array - SKA

Page 14: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA location

• Needs a radio-quiet site

• Very low population density

• Large amount of space

• Two sites:

• Western Australia

• Karoo Desert RSA

A Continental sized Radio

Telescope

Page 15: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA – Key scientific drivers

Cradle of life

Cosmic Magnetism

Evolution of galaxies

Pulsar survey

gravity waves

Exploring the

dark ages

Page 16: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

But……

Most importantly the SKA will

investigate phenomena

we have not even imagined yet

Page 17: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA timeline

2022 Operations SKA1 10% 2025: Operations SKA2 100% 2023-2027 Construction of Full SKA, SKA2 €2 B 2017-2022 10% SKA construction, SKA1 €650M 2012 Site selection 2012 - 2016 Pre-Construction: 1 yr Detailed design €90M PEP 3 yr Production Readiness 2008 - 2012 System design and refinement of specification 2000 - 2007 Initial concepts stage 1995 - 2000 Preliminary ideas and R&D

Page 18: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA project structure

SKA Board

Director General

Work Package

Consortium 1

Work Package

Consortium n

Advisory Committees

(Science, Engineering,

Finance, Funding …)

Project Office

(OSKAO)

Locally funded

Page 19: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Work package breakdown

UK (lead), AU (CSIRO…), NL (ASTRON…)

South Africa SKA, Industry (Intel, IBM…)

1. System

2. Science

3. Maintenance and support /Operations Plan

4. Site preparation

5. Dishes

6. Aperture arrays

7. Signal transport

8. Data networks

9. Signal processing

10. Science Data Processor

11. Monitor and Control

12. Power

SPO

Page 20: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SDP = Streaming data processor challenge

• The SDP consortium led by Paul Alexander University of Cambridge

• 3 year design phase has now started (as of November 2013)

• To deliver SKA ICT infrastructure need a strong multi-disciplinary team

• Radio astronomy expertise

• HPC expertise (scalable software implementations; management)

• HPC hardware (heterogeneous processors; interconnects; storage)

• Delivery of data to users (cloud; UI …)

• Building a broad global consortium:

• 11 countries: UK, USA, AUS, NZ, Canada, NL, Germany, China, France,

Spain, South Korea

• Radio astronomy observatories; HPC centres; Multi-national ICT companies;

sub-contractors

Page 21: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SDP consortium members Management Groupings Workshare (%)

University of Cambridge (Astrophysics & HPFCS) 9.15 Netherlands Institute for Radio Astronomy 9.25 International Centre for Radio Astronomy Research 8.35 SKA South Africa / CHPC 8.15 STFC Laboratories 4.05 Non-Imaging Processing Team 6.95 University of Manchester

Max-Planck-Institut für Radioastronomie

University of Oxford (Physics)

University of Oxford (OeRC) 4.85 Chinese Universities Collaboration 5.85 New Zealand Universities Collaboration 3.55 Canadian Collaboration 13.65 Forschungszentrum Jülich 2.95 Centre for High Performance Computing South Africa 3.95 iVEC Australia (Pawsey) 1.85 Centro Nacional de Supercomputación 2.25 Fundación Centro de Supercomputación de Castilla y León 1.85 Instituto de Telecomunicações 3.95 University of Southampton 2.35 University College London 2.35 University of Melbourne 1.85 French Universities Collaboration 1.85 Universidad de Chile 1.85

Page 22: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SDP –strong industrial partnership

• Discussions under way with

• DelI, NVIDIA, Intel, HP IBM, SGI, l, ARM, Microsoft Research

• Xyratex, Mellanox, Cray, DDN

• NAG, Cambridge Consultants, Parallel Scientific

• Amazon, Bull, AMD, Altera, Solar flare, Geomerics, Samsung, CISCO

• Apologies to those I’ve forgotten to list

Page 23: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA data rates

..

Sparse AA

Dense AA

..

Central Processing Facility - CPF

User interfacevia Internet

...

To 250 AA Stations

DSP

...

DSP

To 1200 Dishes

...15m Dishes

16 Tb/s

10 Gb/s

Data

Time

Control

70-450 MHz

Wide FoV

0.4-1.4 GHz

Wide FoV

1.2-10 GHz

WB-Single

Pixel feeds

Tile &Station

Processing

OpticalData links

... AA slice

... AA slice

... AA slice

...D

ish &

AA

+Dish

Co

rrelation

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

ProcessorBuffer

Data sw

itch ......Data

Archive

Science

Pro

cessors

Tb/s Gb/s Gb/s

...

...

TimeStandard

Imagin

g P

rocessors

Control Processors & User interface

Pb/s

Correlator UV Processors Image formation Archive

Aperture Array Station

16 Tb/s 4 Pb/s

24 Tb/s

20 Gb/s

20 Gb/s

1000Tb/s

Page 24: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA conceptual data flow

Tiered Data Delivery

Astronomer

Regional Centre

Cloud

Sub-set of

Archive

Data

routing

Regional Centre

Sub-set of

Archive

Regional Centre

Sub-set of

Archive

Clo

ud a

ccess

SDP Core Facility

South Africa

SDP Core Facility

Australia

Page 25: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Science data processor pipeline

10 Pflop

1 Eflop

100 Pflop

Software

complexity

10 Tb/s 200 Pflop

10 Eflop

Incoming

Data from

collectors

Switch

Bu

ffer store

Switch

Bu

ffer store

Bu

lk Store

Co

rrelator

Beam

form

er

UV

P

rocesso

r

Imaging:

Non-Imaging:

Corner Turning

Course Delays

Fine F-step/ Correlation

Visibility Steering

Observation Buffer

Gridding Visibilities Imaging

Image Storage

Corner Turning

Course Delays

Beamforming/ De-dispersion

Beam Steering

Observation Buffer

Time-series Searching

Search analysis

Object/timing Storage

HP

C scie

nce

p

roce

ssing

Image

Pro

cessor

1000Tb/s 1 Eflop 10 EB/y SKA 2

SKA 1 1 EB/y 50 PB

2.5 EB

Page 26: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SDP processing rack – feasibility model

Host processor

Multi-core X86

M-C

ore - >1

0TFLO

P/s

M-C

ore

- >10

TFLOP

/s

To rack

switches

Disk 1 ≥1TB

56Gb/s

PCI Bus

Disk 2 ≥1TB

Disk 3 ≥1TB

Disk 4 ≥1TB

Processing blade 1

Processing blade 2

Processing blade 3

Processing blade 4

Processing blade 5

Processing blade 6

Processing blade 7

Processing blade 8

Processing blade 9

Processing blade 10

Processing blade 11

Processing blade 12

Processing blade 13

Processing blade 14

Processing blade 15

Processing blade 16

Processing blade 17

Processing blade 18

Processing blade 19

Processing blade 20

Leaf Switch-1 56Gb/s

Leaf Switch-2 56Gb/s

42U Rack

Processing Blade:

GGPU,

MIC,…?

• 20 TFlop

• 2x56 Gb/s comms

• 4 TB storage

• <1kW power

• Capable host (dual

Xeon)

• Programmable

• Significant RAM

Blade Specification

Page 27: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA feasibility model

AA-low

Data 1

1

280

AA-low

Data 2

1

280

Dishes

Data 4

1 2 16

1 3 N

HPC

Bulk Store

2

Switch

Correlator/

UV processor

Further UV

processors

Imaging

Processor

Corner

Turner

switches

56Gb/s each

AA-low

Data 3

1

280

1

250

Page 28: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

SKA conceptual software stack

SKA subsystems and service components

SKA Common Software Application FrameworkUIF Toolkit

Access ControlMonitoring

Archiver

Live Data

Access

Logging

SystemAlarm Service

Configuration

Management

Scheduling

Block Service

Communication

MiddlewareDatabase Support

Third-party tools and

librariesDevelopment tools

Operating System

High-level APIs

and Tools

Core Services

Base Tools

Page 29: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Create a critical mass of HPC & astronomy knowledge combined with HPC

equipment and lab staff to produce a shared resource to drive SKA system

development studies and SDP prototyping – Strong coordination of all

activities with COMP

• Provide a coordinated engagement mechanism with Industry to drive SKA

platform development studies Dedicated OAL project manager

• Benchmarking • Perform standardised benchmarking across range different vendor solutions

• Undertake a consistent analysis of benchmark systems and report into COMP

• Manage a number of industry contracts driving system studies • Low level software RAID / Lustre performance testing

• Large scale archives

• Software defined networking

• Openstack in HPC environment

• SLUM as a telescope scheduler

Open Architecture Lab function

Page 30: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Build prototype systems under direction from COMP

• Undertaken system studies directed from COMP to investigate particular

system aspects Dedicated HPC engineer being hired to drive studies

• Act as managed Lab for COMP and wider SDP work packages., build

systems, perform technical studies, make systems accessible. Dedicated Lab engineer to service LAB

Open Architecture Lab function

Page 31: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Emphasis on testing scalable components in terms of hardware and

software

• Key considerations in architectural studies will be:-

• Energy efficient architectures

• Scalable cost effective storage

• Interconnects

• Scalable system software

• Operations

Open Architecture Lab function

Page 32: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Act as interface with industry providing coordinated engagement with SDP

• Benchmark study papers

• Roadmap papers

• Discussion digests

• Industrial system study contracts

• Design papers

• Benchmark papers

• Build target test platform

• Design papers

• benchmark papers

• Managed lab providing a service to SDP consortium

• Service function is output

Open Architecture Lab – Outputs

Page 33: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Coordinated by Cambridge jointly run HPCS (Cambridge) and CHPC (SA)

– Dedicated PM

– Dedicated HPC engineer

– Dedicated lab engineer

• Collaborate and coordinate with distributed labs

• Cambridge

• CHPC

• Astron

• Julich

• Oxford

Open Architecture Lab – Organisation

Page 34: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Large scale systems

• 600 node (9600 core) full non blocking Mellanox FDR IB 2,6 GHz sandy bridge (200 TF) one of the fastest Intel clusters in he UK

• SKA GPU test bed -128 node 256 card NVIDIA K20 GPU

• GPU test bed built extensive testing underway – good understanding of GPU –GPU RDMA functionality - GPU focused engineer in place

• Good understanding of current best practise in energy efficient computing and data build and design

• 4 PB storage – Lustre parallel file system 50GB/s

Cambridge test beds

Page 35: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Small scale CPU test beds

• Phi – Installed last week – larger cluster being designed

• Arm – evaluating solutions

• Atom – evaluating solutions

• Agreed Intel Radio Astronomy IPPC 2 head count to be put in place looking at PHI

• Storage test beds

• Lustre on commodity hardware – H/W RAID - Installed

• Lustre on commodity hardware – SW RAID – under test

• Lustre on proprietary hardware – discussions with vendors

• Lustre on ZFS – under test

• Ceph – under test

• Archive test bed –discussions with vendors

• Distributed file system flash accelerated - in design

• Strong storage test programme underway – head count of 4 in place driving the programme

Cambridge test beds

Page 36: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Networking

• Use current production system as large scale test bed

• Dedicated equipment to be ordered

• IB

• Ethernet

• Software defined networks

• RDMA data transfers co-p to co-p

• Networking SOW being constructed – 1FTE to be hired

• Slurm test bed at Cambridge & CHPC

• Headcount at Cambridge CHPC

• Openstack test bed under construction –

• Openstack development in CHPC/HPCS

• SDP operations – OAL to feed into operation WP

• Data centre issues

Cambridge test beds

Page 37: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• The SKA SDP compute facility will be at the time of deployment one of the

largest HPC systems in existence

• Operational management of large HPC systems is challenging at the best of

times - When HPC systems are housed in well established research centres

with good IT logistics and experienced Linux HPC staff

• The SKA SDP could be housed in a desert location with little surrounding IT

infrastructure, with poor IT logistics and little prior HPC history at the site

• Potential SKA SDP exascale systems are likely to consist of 100,000 nodes

occupy 800 cabinets and consume 20 MW. This is very large – around 5

times the physical size of Titan Cray at Oakridge national labs.

• The SKA SDP HPC operations will be very challenging but tractable

SKA Exascale computing in the desert

Page 38: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• We can describe the operational aspects by functional element

Machine room requirements **

SDP data connectivity requirements

SDP workflow requirements

System service level requirements

System management software requirements**

Commissioning & acceptance test procedures

System administration procedure

User access procedures

Security procedure

Maintenance & logistical procedures **

Refresh procedure

System staffing & training procedures **

SKA HPC operations – functional elements

Page 39: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Machine room infrastructure for exascale HPC facilities is challenging

• 800 racks, 1600M squared

• 30MW IT load

• ~40 Kw of heat per rack

• Cooling efficiency and heat density management is vital

• Machine infrastructure at this scale is both costly and time consuming

• The power cost alone at todays cost is 10’s of millions (£) per year

• Desert location presents particular problems for data centre

• Hot ambient temperature - difficult for compressor less cooling

• Lack of water - difficult for compressor less cooling

• Very dry air - difficult for humidification

• Remote location - difficult for DC maintenance

Machine room requirements

Page 40: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• System management software is the vital element in HPC operations

• System management software today does not scale to exascale

• Worldwide coordinated effort to develop system management software for

exascale in HPC community

• We are very interested in leveraging Openstack technologies from non HPC

communities

System management software

Page 41: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Current HPC technology MBTF for hardware and system software result in

failure rates of ~ 2 nodes per week on a cluster a ~600 nodes.

• It is expected that SKA exascale systems could contain ~100,000 nodes

• Thus expected failure rates of 300 nodes per week could be realistic

• During system commissioning this will be 3 or 4 X

• Fixing nodes quickly is vital otherwise the system will soon degrade into a

non functional state

• The manual engineering processes for fault detection and diagnosis on 600

will not scale to 100,000 nodes. This needs to be automated by the system

software layer

• Vendor hardware replacement logistics need to cope with high turn around

rates

Maintenance logistics

Page 42: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

• Providing functional staffing levels and experience at remote desert location

will be challenging

• Its hard enough finding good HPC staff to run small scale HPC systems in

Cambridge – finding orders of magnitude more staff to run much more

complicated systems in a remote desert location will be very Challenging

• Operational procedures using a combination of remote system

administration staff and DC smart hands will be needed.

• HPC training programmes need to be implemented to skill up way in

advance

Staffing levels and training

Page 43: SKA The worlds largest Radio Telescope streaming data ......•The HPCS is providing all the data storage and computational recourse for a major new genomics study •The study involves

ISC 2014

Early Cambridge SKA solution - EDSAC 1

Maurice Wilkes