Top Banner
A PPARC funded project Using AstroGrid CEA to access compute grids Guy Rixon Grid Workshop Strasbourg, June 2005
35

Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Aug 21, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

A PPARC funded project

Using AstroGrid CEA to access compute grids

Guy Rixon

Grid Workshop

Strasbourg, June 2005

Page 2: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 2

Outline

What the IVO movement has done that’s grid-like.

What CEA adds to IVO operations

Why “The Grid” is a hard sell to astronomers

CEA linked to compute grids

Page 3: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 3

Is the IVO a grid?

IVO = International Virtual Observatory

� The interoperable intersection of regional VOs

� Global; singular

� Internal standards managed by IVO Alliance (IVOA)

IVO is a network of (web) services

A service network might also work as a grid

A grid makes available a commodity:

� CPU cycles (compute grid)

� Storage (data grid)

� ??? (value-added grid)

What’s the commodity in IVO?

Page 4: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 4

IVO does data retrieval

Commodity: data in standard, self-describing formats

FITS, VOTable

Archive

Page 5: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 5

IVO does resource mirroring

Commodity: standard description of archives

Commodity: archives mirroring same data

FITS, VOTable

Archives

Registry

Archive content, coverage

Page 6: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 6

IVO does data selection

Commodity: data-selection applicationsavailable on all archives with standard interfaces.

NB: a query is a kind of application

VOTable

Archives

ADQL query/Cone search/SIAP/SSAP

Page 7: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 7

IVO does data storage

Commodity: storage

VOTable etc. Archives

ADQL query/Cone search/SIAP/SSAP

VOStore

Page 8: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 8

What the IVO movement has done that’s grid-like

What CEA adds to IVO operations

Why “The Grid” is a hard sell to astronomers

CEA linked to compute grids

Page 9: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 9

Some limitations of current IVO

Standard applications, but one hard-won IVOA standard per interface type

� ⇒ Few applications supported

No standard apps independent of data archives

� ⇒ no way to register applications with data inputs

Registry describes services, not applications

� ⇒ hard to find exact mirrors of resources.

Most standard apps require synchronous calls

� ⇒ doesn’t scale to long-running jobs

VOStore forthcoming but not yet part of standard apps

� ⇒ no data grid for most operations

Page 10: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 10

CEA: goals

To create a uniform interface and model for an application and its parameters.

To provide extensions with the VO Resource schema that can describe a general application.

To provide asynchronous operation of an application.

To allow for the data flow to not necessarily have to follow the call tree.

Abstracted from Harrison, IVOA Note “A Proposal for a Common Execution Architecture”

Page 11: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 11

CEA: filter-like processes

VOStore

Application

VOStore

reads from writes to

0..* 1..*

Common Execution Connector

Workflow

↓commands1..*

1..*

CommodityCommodity

ConsumerConsumer

Legacy codeLegacy code

SOAP wrapperSOAP wrapper

Page 12: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 12

CEA: new resource types for registry

CeaApplication

� Describes an application

� Defines application parameters

� Groups parameters into interfaces

� Basis for standardization/mirroring of apps

CeaService

� Lists registered applications that service can run

� States service endpoint

Page 13: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 13

CEA: JDL and WSDL contract

All CEA services are SOAP services

All have the same WSDL contract

The standard contract uses a Job-Description Language.

⇒ call to application is defined by an XML document passed to CEA service.

Page 14: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 14

CEA is a grid?

CEA makes the IVO more grid-like

� Can run “any” application

� Applications as proper commodities

� Data grid reconnected to applications

� Asynchronous execution of long jobs

� Offers processing power without exposing details

CEA is a grid…but it is not part of “The Grid”.

“Why aren’t AstroGrid using “Why aren’t AstroGrid using The GridThe Grid ?”?”

Page 15: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 15

What the IVO movement has done that’s grid-like

What CEA adds to IVO operations

Why “The Grid” is a hard sell to astronomers

CEA linked to compute grids

Page 16: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 16

“Why don’t astronomers use The Grid?”

Observational astronomers don’t make much use of existing, external grids: why?

EGEE: ~20,000 CPUs. Little interest.

UK National Grid Service: ~2000 CPUs. Little/no interest.

Grid acceptance at IoA Cambridge (observers):

CamGrid: ~650 CPUs. Little/no interest.

Departmental grid: ~50 CPUs. Saturated with jobs

Page 17: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 17

What’s the use case?

Rephrase the question: why should astronomers use an external compute grid?

� Run calculations that can’t otherwise run at all.

� Higher throughput of small calculations.

Refine the question: should the IVO be involved in astronomers’ use of grids?

� Or is the IVO a complementary alternative to “the Grid”?

IVO Grid

Page 18: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 18

Nature of obs calculations

Observational astronomy has few “big” calculations� Atomic calculations usually run on one CPU

� C.f. theoretical astronomy

Observational astronomy has many repeated calculations� E.g. image processing: embarrassingly parallel

� Few low-level operations per pixel; not CPU limited.

Observational astronomy uses RDBMS for many large-data calculations� Significant processing inside DB (e.g. SDSS)

Page 19: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 19

Win condition for use of grid

Consider processing of n similar data-setsWhat is duration t for whole experiment?� tc – time to process one data-set� td – time to transfer one data-set to processor� ti – time to understand/install/gridify application s/w

Local computation:� tl = ti + n(tc + td)� ti → 0 in typical case� td → 0 iff data local

Grid computation (n-way parallel)� tg = ti + tc + ntd

Grid advantage requires:� ti + ntd << (n-1)tc (if data are local)� ti << (n-1)tc (if data are remote)

Page 20: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 20

Astronomical win condition for use of grid

As n → ∞:

� Gridworthyness depends on tc/td for local data…

� …but grid wins automatically for remote data…

� …and all data are remote in IVO.

Typical image-processing computation:

� tc ~ 5 seconds

� td ~ 3 seconds

� 1 day < ti < 1 week for external grid?

� ⇒ ti dominates until n ~ 10,000

� Most astro experiments aren’t that big!

Page 21: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 21

What the IVO movement has done that’s grid-like

What CEA adds to IVO operations

Why “The Grid” is a hard sell to astronomers

CEA linked to compute grids

Page 22: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 22

Making ti go away

Astronomers won’t use The Grid until we make it easy.

Need a library of prepared applications� Provided as services

� Different services go to different execution environments (i.e. different grids) …

� …but external interface is the same for given app.

CEA already does this� Well-known, registered apps

� Run as sub-processes of SOAP service

� Local configuration supported but hidden from users.

Page 23: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 23

CEA does grid

A CEA service can drive “any” CL application

� ⇒ it can work client end of grid toolkit

� ⇒ it can delegate work to external grid

Page 24: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 24

CEA needs grid

Data volume looms

CEA concentrates demand on few sites

Service host can’t handle all processing� ⇒ need at least local

grid, e.g. on cluster

Volunteer sites donate a little h/w

� Demand can exceed donated resources

� Need to get more computers.

� Grid projects have all the h/w funding!

� ⇒ need external grids

Page 25: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 25

CEA + local condor pool

Pro:� No access restrictions� No contention� Use NFS as internal data-grid

Con:� Astronomers have to provide h/w

Under test at IoA Cambridge (5 CPUs; ad hoc cluster).Possible targets at JBO (~200 CPUs), Portsmouth (~15), Leicester (9)

CondorExecution

CondorExecution

CEC

Condorsubmission

Astronomy dept.

CondorExecution

NFS

VOSpace

Page 26: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 26

CondorExecution

CondorExecution

CEA + condor flock campus grid

Pro:� More CPUs

Con:� Typically no NFS� Access policy

Experiment planned with CamGrid (~650 CPUs)

CEC

Condorsubmission

Astronomy dept.

CondorExecution

University

Other depts.

VOSpace

Page 27: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 27

CEA + Condor-G + Globus grid

Pro:� Vastly more CPUs available

Con:� Latency� Credential complications� Still no data-grid below CEC.

Astronomer-friendly way in to NGS, EGEE?

CondorExecution

CondorExecution

CEC

Condorsubmission

Astronomy dept.

GlobusExecution

University

Other HEIs

VOSpace

Page 28: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 28

CEC ⇔ Condor: current experiment

CEC

Perlscript

Condormodule

Submit file

Condorpool

Log file

↓ Runs synchronously

↓ Writes

→ Calls → Calls

↑ Writes

↓ Reads

↑ Reads

App.→ Nominates ← Runs asynchronously

Existing Existing

CECCEC

Page 29: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 29

CEC ⇔ Condor: planned

CEC

Submit file

Condorpool

Log file

→ Invokes asynchronously

↓ Writes ↓ Reads

↑ Writes

→ Reads

App.→ Nominates ← Runs asynchronously

New kind New kind

of CECof CEC

Page 30: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 30

Deployments, current

Local tests at IoA Cambridge

� Dedicated Condor pool, ~5 CPUs

� Working

Tests at Leicester Physics & Astronomy

� Dedicated Condor pool, 9 CPUs; test only

� Under construction

Tests on CamGrid

� Shared Condor flock, Campus grid, ~600 CPUs

� Planned

Production grid, Portsmouth

� Dedicated Condor pool, ~15 CPUs?

� Possible?

Page 31: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 31

Which applications work on the grid?

Good:

� C programme, statically linked

� Few configuration files

� No dependency on OS

� Can be relinked for standard Condor universe

Tricky:

� Interpreted/uses VM (e.g. Python, Java)

� Dynamic linking

� Interdependent programmes (e.g. IRAF)

� Binary-only release

Page 32: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 32

Apps from AstroGrid prototypes (1)

AstroGrid has prepared these apps to run on non-grid CECs:� eSDO MSSL database (NG: local DB)

� Super COSMOS catalogues (NG: local DB)

� Solar FITS querier (NG: local DB)

� 6dF catalogues (NG: local DB)

� 2MASS catalogues (NG: local DB)

� USNOB catalogues (NG: local DB)

� INT-WFS images (NG: local DB)

� FIRST object catalogue (NG: local DB)

� INT-WFS observation catalogue (NG: local DB)

� INT-WFS object catalogue (NG: local DB)

Page 33: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 33

Apps from AstroGrid prototypes (2)

� BPZ (Difficult: Python)

� HyperZ (Difficult: library of template SEDs)

� Galaxev (tricky: library of templates?)

� Pegase (tricky: library of templates?)

� Generic movie maker (possible?)

� ANNz (possible?)

�� Sextractor (OK)Sextractor (OK)

Page 34: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 34

Challenges

More “easy” apps for Condor to run

� An IVO(A) library of prepared apps?

“Drive-by installation” for complex apps

Enable Java, Python apps

� Either Condor’s “Java universe” or drive-by installations

Enable/exploit checkpointing, migration

� Needs Condor’s “standard universe”

Submission to restricted parts of grid

� Needs working security framwork

Distinguish restricted services

� Needs registry support

Use Sun Grid Engine as well as Condor?

� Needs a use case to justify this.

Use Condor-G for wider grid

� Needs prayer?

Page 35: Using AstroGrid CEA to access compute gridscdsweb.u-strasbg.fr/meeting5/rixon-cea-and-grid.pdfService host can’t handle all processing ⇒need at least local grid, e.g. on cluster

Grid workshop, Strasbourg, June 2005 AstroGrid CEA and the Grid Slide 35

Summary

The IVO has a number of grid-like features

AstroGrid CEA enhances, strengthens these

CEA is now becoming a gateway to external compute grids

Lots more development needed to get full benefits…

…but some of it works now!

More information:� http://www.astrogrid.org/

� http://software.astrogrid.org/

� Guy Rixon: [email protected]