EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Steven Newhouse Technical Director, EGEE-III Director, EGI-InSPIRE Interim Director EGI.eu European Grid Infrastructure: Enabling the Global Research Community

Mar 27, 2015

Page 1: EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE  EGEE and gLite are registered trademarks Steven Newhouse Technical Director, EGEE-III.

EGEE-III INFSO-RI-222667

Enabling Grids for E-sciencE

www.eu-egee.org

EGEE and gLite are registered trademarks

Steven Newhouse

Technical Director, EGEE-III

Director, EGI-InSPIRE

Interim Director EGI.eu

European Grid Infrastructure: Enabling the Global Research Community

Page 2:

What is e-Infrastructure?

• Resources linked by high speed networks
  – Compute, Storage, Instruments, ...

• Controlled access to shared resources
  – Authentication, Authorisation, Accounting, ...

• Dependable services for others to use
  – Driven by availability and reliability metrics

• Services that are there for the long-term
  – Supporting experiments lasting decades
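
The slide names availability and reliability metrics without defining them. The sketch below assumes one common convention: availability is measured against all known time, while reliability also excludes scheduled downtime. The function names and the sample hours are illustrative assumptions, not figures from the talk.

```python
def availability(up_hours, total_hours, unknown_hours=0.0):
    """Fraction of the known time in the period that the service was up."""
    known = total_hours - unknown_hours
    if known <= 0:
        raise ValueError("no known time in the period")
    return up_hours / known


def reliability(up_hours, total_hours, scheduled_down_hours=0.0, unknown_hours=0.0):
    """Like availability, but scheduled downtime does not count against the site."""
    expected = total_hours - scheduled_down_hours - unknown_hours
    if expected <= 0:
        raise ValueError("no expected uptime in the period")
    return up_hours / expected


# Illustrative 30-day month (720 h): 684 h up, 24 h scheduled maintenance,
# 12 h unplanned outage. All numbers are made up for the example.
a = availability(684, 720)
r = reliability(684, 720, scheduled_down_hours=24)
print(f"availability={a:.3f} reliability={r:.3f}")
```

Under this convention a site is not penalised in the reliability figure for maintenance it announced in advance, which is why reliability is never lower than availability.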

European Grid Infrastructure (OGF 28) 2

Page 3:

European Grid Infrastructure

• European Data Grid (EDG)
  – Explore concepts in a testbed

• Enabling Grids for E-sciencE (EGEE)
  – Moving from prototype to production

• European Grid Infrastructure (EGI)
  – Routine usage of a sustainable e-infrastructure

Page 4:

EGEE has achieved a lot!

• 17,000 users

• 139,000 LCPUs (cores)

• 25 PB disk

• 39 PB tape

• 12 million jobs/month (+45% in a year)

• 268 sites (+5% in a year)

• 48 countries (+10% in a year)

• 162 Virtual Organisations (+29% in a year)

• Over 20 active communities in 112 VOs

Page 5:

Lessons Learned

• Supporting diverse communities is hard
  – One middleware distribution (gLite) means compromises
  – Focusing on a single operating model creates tensions

• Supporting a large operational infrastructure is costly
  – Communication and coordination across 260+ sites
  – Running hardware: compute, storage, networking, ...
  – Running software: site, domain specific, ...

• A production infrastructure does yield results
  – Recent reconstruction events from the first LHC run
  – In silico drug discovery searches
  – Fusion simulations
  – ...

Page 6:

EGEE to EGI... what does it mean?

• An opportunity!
  – Draw a line under the experimentation in EDG & EGEE
  – Scope activities and structures so they are sustainable

• A challenge!
  – The technology landscape changes and we must change with it
  – Increasing diversity of application models and resources
    • Data Intensive Science is getting ever more intensive
    • Expand beyond core EGEE high throughput grids
      – Integrate desktop and high performance grids
    • Expand technologies in response to end-user & operational needs
      – How do virtualisation and cloud computing change things?

• A business model!
  – Add value where you can in providing a generic infrastructure
  – Provide an open extensible infrastructure for all

Page 7:

What will EGI initially focus on?

• Provide a secure reliable generic infrastructure
  – Integrate resources based on gLite, UNICORE, ARC, Globus, ...
  – Leverage new technologies to provide more flexibility to users

• Support the user communities using the infrastructure
  – Assist and support the current EGEE communities
  – Engage with ESFRI projects to support their requirements

• Improve the efficiency of the infrastructure
  – The number of jobs, users & data continue to increase
  – Utilisation and effectiveness of the resources needs to match

Explore new technologies to make middleware selection and operation a domain-specific decision

Page 8:

EGI means Innovation

• Deploy Technology Innovation
  – Distributed Computing continues to evolve
    • Grids, Desktops, Virtualisation, Clouds, ?

• Enable Software Innovation
  – Provide reliable persistent technology platform
    • Community tools built on the deployed technology

• Support Research Innovation
  – Infrastructure for data intensive science
    • Support for international research (e.g. ESFRI)

Page 9:

Technology Innovation

• Will come from outside EGI
  – Moving research technologies into production

• Partnership with technology projects
  – EMI (European Middleware Infrastructure)
  – IGE (Initiative for Globus in Europe)
  – EDGI (European Desktop Grid Initiative)
  – StratusLab
  – VenusC

Page 10:

Software Innovation

• Will also come from outside EGI
  – EGI is a neutral platform for applications

• EGI cannot support all services in its core
  – Every community needs something different

• Foster innovation within different ‘sectors’
  – High Throughput Computing
    • gLite, ARC, ...
  – High Performance Computing
    • UNICORE, ...
  – Digital Libraries
    • gCube from D4Science

Page 11:

Research Innovation

• An infrastructure to support European Researchers
  – Within the EU27
  – Geographical Europe
  – Interoperability worldwide for collaboration

• Work with Virtual Research Communities
  – Groupings of aligned Virtual Organisations
  – Enable their community specific support activity:
    • Support, training, consultancy, requirements etc.

Page 12:

European Strategy Forum on Research Infrastructures

• Roadmap updated in 2008

• Preparatory phase funding for most projects

• Big push in FP8 (2013 and beyond)?

• 44 projects covering:
  – Social Sciences and Humanities
  – Environmental Sciences
  – Energy
  – Biological and Medical Sciences
  – Materials and Analytical Facilities
  – Physical Sciences and Engineering
  – e-Infrastructures

Common characteristics:

• Data Intensive Science

• National commitments in European context

• Global collaboration and shared access

• Long lifetime (10-20+ years)

Page 13:

[Figure: EGI collaboration diagram with nodes labelled EGI.eu, NGI, EIRO and Research Community]

NGI: National Grid Initiative

EIRO: European International Research Organisation

Page 14:

The EGI.eu Organisation

• Coordination for European DCI resources
  – Roadmap to integrate HTC, HPC, Data, Instruments, ...
  – Policy & services needed to run a production infrastructure

• EGI.eu governed and owned by its stakeholders
  – EGI Council votes proportional to national income
  – EGI Council fees proportional to votes
  – Builds on resources from within its stakeholders

• Located in the Amsterdam Science Park
  – Distributed staff (~45) with a core (~50%) in Amsterdam
    • Human coordination in Amsterdam
    • Technical coordination with a few partners across Europe
  – Costs: €4.75M/year (Revenue: Fees, NGI Effort & EC)
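
The governance bullets state two proportionalities: votes follow national income, and fees follow votes. A minimal sketch of that arithmetic, assuming a strictly linear allocation; the actual EGI Council scheme is not given in the slides and may use bands or caps. The member names, income figures, vote total and fee total are all hypothetical.

```python
def council_shares(national_income, total_votes, total_fees):
    """Allocate votes in proportion to income, then fees in proportion to votes."""
    income_sum = sum(national_income.values())
    if income_sum <= 0:
        raise ValueError("total national income must be positive")
    votes = {
        member: total_votes * income / income_sum
        for member, income in national_income.items()
    }
    fees = {member: total_fees * v / total_votes for member, v in votes.items()}
    return votes, fees


# Hypothetical members and income figures, for illustration only.
income = {"NGI-A": 500.0, "NGI-B": 300.0, "NGI-C": 200.0}
votes, fees = council_shares(income, total_votes=100, total_fees=1_000_000)
for member in income:
    print(member, votes[member], fees[member])
```

Because fees are derived from votes rather than directly from income, any later adjustment to the vote allocation carries through to the fees automatically.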

Page 15:

EGI.eu’s Services

• Integrated Infrastructure
  – Coordinates (not owns) the compute & storage resources
  – Resources owned by individual organisations
    • They manage access for their user communities

• Deploying Innovative Technology into Production
  – Software for secure authorised access to resources
    • Liaison with external (to EGI) software providers
    • Integrated into the Unified Middleware Distribution (UMD)
  – EGI defined and verified interfaces
    • Compatible software must be deployed
  – Interoperation within your country and internationally

Page 16:

EGI.eu’s Services

• User Community Support
  – From a single VO to a Virtual Research Community
  – Provide a federated Helpdesk linking:
    • Discipline specific support (e.g. Bio Apps)
    • National infrastructure support (e.g. NGS)
    • Generic services within NGIs or VRCs (e.g. Training)
  – Provide core services to support users
    • Manage VOs, Application DB, Training DB
  – Support for Heavy User Communities

• Dissemination
  – With NGIs, VRCs, and other projects
  – Two Annual meetings: Users & Technology
    • EGI Technical Forum 14-17th September 2010 in Amsterdam

Page 17:

The EGI-InSPIRE Project

Integrated Sustainable Pan-European Infrastructure for Researchers in Europe

• A 4 year project with €25M EC contribution
  – Project cost €69M
  – Total Effort ~€330M
  – Staff ~170 FTE

Project Partners (41)

• EGI.eu, 37 NGIs, 2 EIROs

• APGI (11 partners, 8 countries)

[Figure legend: Funded, Un-Funded]

Page 18:

Be a Neutral Infrastructure

• Consider IP network providers
  – Open to any traffic from many different communities
    • Restrictions to protect other users
  – Customised solutions within a generic framework
    • Light paths on demand
  – Standards drive integrated deployment
    • Hardware and fibre from many different providers

• And for sustainable E-Infrastructures?
  – Any application domain or middleware technology
  – A platform for domain specific innovation and use
  – Integration of any compliant compatible resources

Page 19:

Can we learn from others?

• Grids have benefited from commoditisation
  – Hardware: HTC & HPC affordable to all
  – Networking: GBs can be moved over WAN
  – Software: Open source software comes of age

• How will commodity virtualisation impact us?
  – For transactional models
    • Cloud Computing: A model based on compute not data
  – For large distributed data-oriented models
    • The emergence of true ‘function shipping’?

Page 20:

Data Intensive Science

[Figure: diagram with repeated boxes labelled Storage, VMM, Clusters and Staff; middleware labels gLite and ARC; a GEANT label; and a band reading "Coordination by EGI.eu: Technology assessment, Integrated Operations & User Support"]

VMM: Virtual Machine Managers

How generic should a generic infrastructure be?

Page 21:

Supporting Multiple Communities

[Figure: layered diagram with labels Physical Resources; Core Site Services; HTC Services, HPC Services, Digital Library Services, Volunteer Desktop Services, ...; HEP Apps, LS Apps, CCMST Apps, F Apps; VRCs and VOs; plus the labels UMD, EMI and IGE]

How to structure end-user service provision (i.e. Cloud)?

Page 22:

Supporting Multiple Communities

[Figure: layered diagram with labels Physical Resources; Core Site Infrastructure Services (AAAA & Data Fabric); Managed Virtual Machine Environment; HTC Services, HPC Services, Digital Library Services, Volunteer Desktop Services, ...; HEP Apps, LS Apps, CCMST Apps, F Apps; VRCs and VOs; overall label "European Science Cloud Infrastructure"]

Page 23:

European Science Cloud Infrastructure

• Underpinned by an interoperable cloud infrastructure
  – Sites deploy the VM management technology they want
  – Securely integrated into a reliable infrastructure
  – Accessible to authenticated, authorized & accounted for users

• Provide a Data-Oriented Infrastructure as a Service
  – Leverage existing high performance data storage & transfers
  – Application domains (VOs) source and run their own services
  – VO Managers deploy & run these services on the infrastructure

• Bring new research innovations into production
  – Federated cloud environments (i.e. VMs @ each site)
  – Experimenting with virtualised worker nodes in EGEE:
    • e.g. INFN, BiG Grid, CERN, NGS, ...
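
One way to picture the model on this slide (each site picks its own VM manager, while access stays authorised and accounted for across the federation) is a shared interface in front of site-specific back ends. The sketch below is purely illustrative: `VMManager`, `InMemoryVMM` and `Federation` are hypothetical names invented here, and no real VM management product or EGI API is modelled.

```python
from typing import Dict, List, Protocol, Set, Tuple


class VMManager(Protocol):
    """Hypothetical minimal interface that every site's VM manager would expose."""

    def start(self, image: str, owner: str) -> str: ...

    def stop(self, vm_id: str) -> None: ...


class InMemoryVMM:
    """Stand-in for whatever VM management technology a site chooses to deploy."""

    def __init__(self, site: str) -> None:
        self.site = site
        self.running: Dict[str, Tuple[str, str]] = {}
        self._count = 0

    def start(self, image: str, owner: str) -> str:
        self._count += 1
        vm_id = f"{self.site}-vm{self._count}"
        self.running[vm_id] = (image, owner)
        return vm_id

    def stop(self, vm_id: str) -> None:
        del self.running[vm_id]


class Federation:
    """Checks authorisation and records usage before delegating to a site."""

    def __init__(self, sites: Dict[str, VMManager], authorised: Set[str]) -> None:
        self.sites = sites
        self.authorised = authorised
        self.accounting: List[Tuple[str, str, str]] = []

    def deploy(self, user: str, site: str, image: str) -> str:
        if user not in self.authorised:
            raise PermissionError(f"{user} is not authorised")
        vm_id = self.sites[site].start(image, user)
        self.accounting.append((user, site, vm_id))  # accounted-for usage
        return vm_id


# Hypothetical VO manager deploying a community service image at one site.
fed = Federation({"site-a": InMemoryVMM("site-a")}, authorised={"vo-manager"})
print(fed.deploy("vo-manager", "site-a", "community-service-image"))
```

The federation layer only depends on the two-method interface, so a site could swap its back end without the VO manager noticing.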

Page 24:

What does this evolution mean?

• VOs now manage their own infrastructure
  – Decide what services are deployed where
  – Flexibility (& responsibility) to meet their own needs

• EGI coordinates the core infrastructure
  – Assessing & certifying technology for deployment
  – Operate & manage domain specific environments
    • If required by that domain!

• Unbundle and open the infrastructure
  – Defined interfaces between components
  – ‘Burst out’ to other resource providers

Page 25:

A long-term need for Standards

• Data Layer
  – Secure reliable data movement
  – Standardised access to data resources

• Virtualisation Layer
  – VMM across trust domains within agreed policies
  – Monitoring as important as lifecycle control

• Service Layer
  – The services that go into the virtual machine
  – Avoid domain specific silos & promote reuse

• Openness

• Consensus

• Balance

• Transparency

Page 26:

Sustainability

• Reduce barriers for collaborative data intensive science
  – Integration with GEANT provides unique offering
  – Support to ESFRI projects and new communities
    • Flexibility to run the services and software they need

• Open global collaboration of e-infrastructure providers
  – Domain driven collaboration with other infrastructures
  – Open standardised interfaces for integration & to avoid lock-in
  – Add value where we can and outsource where we can’t

‘Europe as a hub for sustainable e-science and continuous service innovation’

Page 27:

Summary

• EGEE:
  – Demonstrated a production e-infrastructure

• EGI:
  – Provide a sustainable production e-infrastructure

• EGI.eu is now a legal entity based in Amsterdam
  – Supported transition for 4 years through EGI-InSPIRE

• Contact: [email protected]

EGI Technical Forum

14-17th September 2010 in Amsterdam