Robin Middleton RAL-PPD/EGEE/GridPP Grid Computing A high-level look at Grid Computing in the world of Particle Physics and at LHC in particular. I am indebted to the EGEE, LCG and GridPP rojects and to colleagues therein for much of the material presented here.
70
Embed
Robin Middleton RAL-PPD/EGEE/GridPP Grid Computing A high-level look at Grid Computing in the world of Particle Physics and at LHC in particular. I am.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Robin Middleton
RAL-PPD/EGEE/GridPP
Grid Computing
A high-level look at Grid Computing in theworld of Particle Physics and at LHC in particular.
I am indebted to the EGEE, LCG and GridPPprojects and to colleagues therein for much
of the material presented here.
2
Overview
• e-Science and The Grid• Grids in Particle Physics
– EGEE LCG GridPP
– Virtual Organisations
• Computing Model (very high level !)• Components of the EGEE/LCG/GridPP Grid
– security
– information service
– resource brokering
– data management
• Monitoring & User Support• Other Projects / Sciences• Sustainability & EGI• Further information
– Links
3
What is e-Science ?What is the Grid ?
e-Science
• …also : e-Infrastructure, cyberinfrastructure, e-Research, …• Includes
– grid computing (e.g. WLCG, EGEE, EGI, OSG, TeraGrid, NGS…)• computationally and/or data intensive; highly distributed over wide area
– digital curation
– digital libraries
– collaborative tools (e.g. Access Grid)
– …many other areas
• Most UK Research Councils active in e-Science– BBSRC
– NERC (e.g. climate studies, NERC DataGrid)
– ESRC (e.g. NCeSS
– AHRC (e.g. studies in collaborative performing arts)
– EPSRC (e.g. eMinerals, MyGrid, …)
– STFC (formerly PPARC and CCLRC) (e.g. GridPP, AstroGrid) 4
5
e-Science in 2000
• Dr John Taylor (former Director General of Research Councils,Office of Science and Technology)– ‘e-Science is about global collaboration in key areas of science, and the next
generation of infrastructure that will enable it.’
– ‘e-Science will change the dynamic of the way science is undertaken.’
• An independent panel of international experts has judged the UK'se-Science Programme as "world-leading", citing that "investments are already empowering significant contributions to wellbeing in the UK and the world beyond".
“The panel found the e-Science Programme to have had a positive economic impact, especially in the important areas of life sciences and medicine, materials, and energy and sustainability. Attractive to industry from its inception, the programme has drawn in around £30 million from industrial collaborations, both in cash and in-kind. Additionally it has already contributed to 138 stakeholder collaborations, 30 licenses or patents, 14 spin-off companies and 103 key results taken up by industry and early indications show there are still more to come.”
Grids• Collaborative environment• Distributed resources (political/sociological)• Commodity hardware (also supercomputers)• (HEP) data management• Complex interfaces (bug not feature)
Supercomputers• Expensive• Low latency interconnects• Applications peer reviewed• Parallel/coupled applications• Traditional interfaces (login)• Also SC grids (DEISA, Teragrid)
Clouds• Proprietary (implementation)• Economies of scale in management• Commodity hardware• Virtualisation for service provision and encapsulating application environment• Details of physical resources hidden• Simple interfaces (too simple?)
Volunteer computing• Simple mechanism to access millions CPUs• Difficult if (much) data involved• Control of environment check • Community building – people involved in Science• Potential for huge amounts of real work
Many different problems:Amenable to different solutions
No right answer
Many different problems:Amenable to different solutions
No right answer
Consider ALL as a combined e-Infrastructure ecosystemAim for interoperability and combine the resources into a consistent whole
Grids and Clouds• GridPP4 will take a
closer look at clouds• Issues
– relative costs
– I/O bandwidth : getting the data into the cloud in the first place !
– data security : entrusting our data to external body
• “Grid computing [is] distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high-performance orientation...we review the "Grid problem", which we define as flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources - what we refer to as virtual organizations."– From "The Anatomy of the Grid: Enabling Scalable Virtual Organizations" by
Foster, Kesselman and Tuecke
• “The Web on Steroids” !
10
What is the Grid ?
• The Grid : Blueprint for a New Computing Infrastructure(Ian Foster & Carl Kesselman)
– “A computational grid is a hardware and softwareinfrastructure that provides dependable, consistent,pervasive, and inexpensive access to high-endcomputational capabilities.”
• Multiple standards• Interoperability• Distributed resource sharing• No common management• Concurrent access• Flexible
Analogies continue…• Far from perfect• Much room for improvement• A “trip” hazard !
Acknowledgement: J.Gordon
12
Grids in Particle Physics
(with the LHC as an example)
13
EGEELCGGridPP
LCG LHC Computing GridDistributed Production Environment for Physics Data Processing
In 2007 : 100,000 CPU, 15PB/Yr, 5000 physicist, 500 institutes
EGEE Enabling Grids for E-sciencEStarts from LCG infrastructure
Production Grid in 27 countries
HEP, BioMed, CompChem,
Earth Science, …
GridPPGrid computing for HEP in UK
Major contributor to LCG & EGEE
19 Institutes
14
EGEE-III• 140 partners in 33 countries• ~32M€ (EU); 2 years• ~250 sites in 45 countries; • >75,000 CPU-cores• >270 VOs• ~210k jobs/day (peak 230k)• ~76 million jobs run in year
to Aug08
EGEE Federations UK/Ireland
CPU time delivered (CPU months)
x 2
231K jobs/dayNumber of jobs per month
15
EGEE-III
• 140 partners in 33 countries• ~32M€ (EU); 2 years• ~250 sites in 45 countries; • >75,000 CPU-cores• >270 VOs• ~210k jobs/day (peak 230k)• ~76 million jobs run in year
to Aug08• “Other VOs” 30k jobs/day
EGEE “reach” in 2008
16
LCG – LHC Computing Grid
• Worldwide LHC ComputingGrid
• Framework to deliverdistributed computing forLHC experiments
Typical LHC particle physics Typical LHC particle physics experiment One year of acquisition experiment One year of acquisition and analysis of dataand analysis of data
24
Principle Components
25
The Middleware Stack
Computing cluster Network resources Data storage
Operating system Local schedulerFile system
User access SecurityData transferInformation schema
Job Management Data managementApp monitoring system
VO credential is used by the resource broker to pre-select available CEs.
28
Security – Running a Job
MyProxyserver
CECE
cert(long term)
host cert
proxy
authz
VO
WMS
1. cert download
LCAS/LCMAPS
authentication & authorization info
2. job start
LCAS: authorization based on (multiple) VO/group/role attributes
LCMAPS: mapping to user pool and to (multiple) groups
default VO = default UNIX group
other VO/group/role = other UNIX group(s)
voms-proxy-init
VO credential for authorization and mapping on the CE.
29
Information System
• At the core of making the Grid function• Hierarchy of distributed BDII/LDAP servers • Information organised using the GLUE Schema
30
Information System - LDAP
• Lightweight Directory Access Protocol: – structures data as a tree
– DIT = Directory Information Tree
• Following a path from the node back to
the root of the DIT, a unique name isbuilt (the DN):
“id=ano,ou=PPD,or=STFC,st=Chilton, \c=UK,o=grid”
o = grid (root of the DIT)
c= US c=UK c=Spain
st = Chilton
or = STFC
ou = PPD ou = ESC
objectClass:personcn: A.N.Other
phone: 5555666office: R1-3.10
31
WMS – Workload Management System
• WMS is composed of the following parts:1. User Interface (UI) : access point for the user to the WMS
2. Resource Broker (RB) : the broker of GRID resources, responsible to find the “best” resources where to submit jobs
3. Job Submission Service (JSS) : provides a reliable submission system
4. Information Index (BDII) : a server (based on LDAP) which collects information about Grid resources – used by the Resource Broker to rank and select resources
5. Logging and Bookkeeping services (LB) : store Job Info available for users to query
• (However, this is evolving with the moves to the gLite RB and the gLite CE !
– To stimulate and sustain the development of e-Sciencein the UK, to contribute significantly to its internationaldevelopment and to ensure that its techniques are rapidlypropagated to commerce and industry.
– To identify and support e-Science projects within andbetween institutions in Scotland, and to provide theappropriate technical infrastructure and support in orderto ensure rapid uptake of e-Science techniques byScottish scientists.
– To encourage the interaction and bi-directional flow ofideas between computing science research ande-Science applications
– To develop advances in scientific data curation andanalysis and to be a primary source of top qualitysystems and repositories that enable management,sharing and best use of research data.
58
Digital Curation
59
• Digital Curation Centre– Edinburgh, NeSC, HATII, UKOLN, STFC
– Objectives• Provide strategic leadership in digital
curation and preservation for the UKresearch community, with particularemphasis on science data
• Influence and inform national and international policy• Provide advocacy and expert advice and guidance to practitioners and funding
bodies• Create, manage and develop an outstanding suite of resources and tools• Raise the level of awareness and expertise amongst data creators and curators, and
other individuals with a curation role• Strengthen community curation networks and collaborative partnerships• Continue our strong association with our research programme
• Particle Physics– Study group / workshops (DESY & SLAC) in 2009 -> intermediate report
• Governed by NGIs• Initial co-funding from EU• For all disciplines
– sciences, humanities, …
62
e-IRG Recommendation, 12/2005:
“The e-IRG recognizes that the current project-based financing model of grids (e.g., EGEE, DEISA) presents continuity and interoperability problems,
and that new financing and governance models need to be explored – taking into account the role of national grid initiatives as recommended in the Luxembourg e-IRG meeting.”
e-IRG Recommendation, 12/2005:
“The e-IRG recognizes that the current project-based financing model of grids (e.g., EGEE, DEISA) presents continuity and interoperability problems,
and that new financing and governance models need to be explored – taking into account the role of national grid initiatives as recommended in the Luxembourg e-IRG meeting.”
• Specialised Support Centres- for VOs / disciplines (e.g. HEP)
- externally funded
EGI - Management Structure
63
EGI - Tasks
• Accounting, Security, User Support, Problem Tracking,Middleware testing, Deployment, VO Registration, Monitoring,Grid Information Systems, etc.
64
EGI - Transition
65
• EGI-DS Project– establish Blueprint for EGI
– establish EGI.org
• EGEE– begin transition ~Spring 2009
• EGI– operational Spring 2010
• Continuity of service is KEY– not only for LHC…
EGI - Status
• EU bids– Proposals submitted – Nov’09
– Significant (for HEP) SSCs not invited to hearing !
– EGI-Inspire & EMI hearing yesterday -> anticipate infrastructure & middleware development will be funded
• New legal entity, EGI.eu, created last week in Amsterdam– …and soon recruiting