1 Wolfgang Gentzsch, D-Grid October 17, 2006 Cracow Grid Workshop, October 2006 D-Grid in International Context Wolfgang Gentzsch with support from Tony Hey et al, Satoshi Matsuoka, Kazushige Saga, Hai Jin, Bob Jones, Charlie Catlett, Dane Skow, and the Renaissance Computing Institute at UNC Chapel Hill, North Carolina
52
Embed
Cracow Grid Workshop, October 2006 D-Grid in International Context Wolfgang Gentzsch
Cracow Grid Workshop, October 2006 D-Grid in International Context Wolfgang Gentzsch with support from Tony Hey et al, Satoshi Matsuoka, Kazushige Saga, Hai Jin, Bob Jones, Charlie Catlett, Dane Skow, and the Renaissance Computing Institute at UNC Chapel Hill, North Carolina. - PowerPoint PPT Presentation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1Wolfgang Gentzsch, D-GridOctober 17, 2006
Cracow Grid Workshop, October 2006
D-Grid in International Context
Wolfgang Gentzsch
with support from
Tony Hey et al, Satoshi Matsuoka, Kazushige Saga, Hai Jin, Bob Jones, Charlie Catlett, Dane Skow,
and the Renaissance Computing Institute atUNC Chapel Hill, North Carolina
2Wolfgang Gentzsch, D-GridOctober 17, 2006
Grid InitiativesInitiative Time Funding People *) Users
UK e-Science: To enable the next generation of multi-disciplinary collaborative science and engineering, to enable faster, better or different research. EGEE:To provide a seamless Grid infrastructure for e-Science that is available for scientists 24 hours-a-day.ChinaGrid: To provide a research and education platform by using grid technology for the faculties and students among the major universities in China.NAREGI:To do research, development and deployment of science grid middleware.TeraGrid:Create a unified Cyberinfrastructure supporting a broad array of US science activities using the suite of NSF HPC facilitiesD-Grid: Build and operate a sustainable grid service infrastructure for German research (D-Grid1) and research and industry (D-Grid2)
storage, software, data, . . .- Researchers share ditto and large experiments, instruments,
sensor networks, etc.
• Collaboration: - Enterprise departments with its suppliers and peers (e.g. design)- Research teams distributed around the world (HEP, Astro, Climate)
• Doing things which have not been possible before:- Grand Challenges needing huge amount of computing and data- Combining distributed datasets into on virtual data pool (Genome)- “Mass Grids” for the people (distributed digital libraries; digital school laboratories; etc)
FZJ/ZAM IBM Supercomputer with 8,5 TFlopsSTK data robot system with 2,8 PByte
32 CPUs300 TByte
FZK/IWR 8 nodes Opteron 2x2.2 GHz 8 processors of a system NEC SX-5 1 p630 with 4 processors1 SX-6i to do tests2 nodes Opteron 2x2.2 GHz to do tests
100%50%50%50%50%
LRZ SGI high performance system with 20 TFlop/sIntel IA32 and IA 64 Cluster, IBM p690, SunFire 80
5%5% 5%
MPI/RZG IBM supercomputer with 4,5 TFlops, PC cluster with 2 TFlopsData robot system with 8 PByte
32 CPUs400 TByte
PC² Cluster of 400 Xeon 64 Bit processors, high performance visualization and FPGAs
10%
RWTH/RZ 2 SunFire 6900 with 24 UltraSPARC IV each 100%
TU-Dresden/ZIH
SGI O2K(56 proc)/O3K(192 proc.) : T3E (64 proc):PC cluster with 30 processors,end off 2005: new system with 1000 proc.
10%20%20%2%
Uni-H/RRZN PC-Cluster mit 64 CPUs assoc.
Uni-KA PC-Pool assoc.
FHG/ITWM assoc.
10Wolfgang Gentzsch, D-GridSeptember, 2006
The German D-Grid Initiative *)
D-Grid-1Services for Scientists
*) funded by the German Ministry for Education and Research
11Wolfgang Gentzsch, D-GridSeptember, 2006
German e-Science Initiative, Key Objectives
Building a Grid Infrastructure in Germany Combine the existing German grid activities for infrastructure, middleware, and applications Integration of the middleware components developed in the Community Grids
Development of e-science services for the research community Science Service Grid
Important: Continuing sustainable production grid infrastructure after the end of the funding period Integration of new grid communities (2. generation) Business models for grid services
12Wolfgang Gentzsch, D-GridSeptember, 2006
D-Grid Projects
Generic Grid Middleware and Grid Services
Integration Project
As
tro
-Gri
d
C3
-Gri
d
HE
P-G
rid
IN-G
rid
Me
diG
rid
ON
TO
VE
RS
E
WIK
ING
ER
WIS
EN
T
Te
xtg
rid
VIOLA eSciDoc
D-Grid Knowledge Management
. . .
Im W
iss
en
sne
tz
13Wolfgang Gentzsch, D-GridSeptember, 2006
Community GridsCommunity Grids
Generic Grid Middleware and Grid Services
Information and Knowledge Management
Grid specificDevelopments
Application
CGMiddle-
ware
Grid specificDevelopment
Application
CGMiddle-
ware
D-Grid Structure
Courtesy Dr. Krahl PT/BMBF
Integration ProjectIntegration Project
Wolfgang Gentzsch, D-GridSeptember, 2006
DGI Infrastructure Project
WP 1: D-Grid basic software components, sharing resources, large
storage, data interfaces, virtual organizations, management
WP 2: Develop, operate and support robust core grid infrastructure, resource description, monitoring, accounting, and billing
WP 4: Business platform and sustainability, project management, communication and coordination
Scalable, extensible, generic grid platform for future
Longterm, sustainable grid operation, SLAs based
Wolfgang Gentzsch, D-GridSeptember, 2006
D-Grid Middleware
Nutzer
ApplicationDevelopment
and User Access
GAT API
Data/Software
Resourcesin D-Grid
High-levelGrid
Services
Basic Grid Services
DistributedData Archive
User
NetworkInfrastructur
LCG/gLite
Globus 4.0.1
AccountingBilling
User/VO-Mngt
SchedulingWorkflow Management
Data management
Security
Plug-In
UNICORE
DistributedCompute Resources
GridSphere
Monitoring
16Wolfgang Gentzsch, D-GridSeptember, 2006
DGI Services, Available Dec 2006
• Sustainable grid operation environment with a set of core D-Grid middleware services for all grid communities
• Central registration and information management for all resources
• Packaged middleware components for gLite, Globus and Unicore and for data management systems SRB, dCache and OGSA-DAI
• D-Grid support infrastructure for new communities with installation and integration of new grid resources into D-Grid Help-Desk, Monitoring System and central Information Portal
17Wolfgang Gentzsch, D-GridSeptember, 2006
DGI Services, Dec 2006, cont.
• Tools for managing VOs based on VOMS and Shibboleth
• Test implementation for Monitoring & Accounting for Grid resources, and first concept for a billing system
• Network and security support for Communities (firewalls in grids, alternative network protocols,...)
• DGI operates „Registration Authorities“, with internationally accepted Grid certificates of DFN & GridKa Karlsruhe
• Partners support new D-Grid members with building their own „Registration Authorities“
18Wolfgang Gentzsch, D-GridSeptember, 2006
• DGI will offer resources to other Communities, with access via gLite, Globus Toolkit 4, and UNICORE
• Portal-Framework Gridsphere can be used by future users as a graphical user interface
• For administration and management of large scientific datasets, DGI will offer dCache for testing
• New users can use the D-Grid resources of the core grid infrastructure upon request
DGI Services, Dec 2006, cont.
19Wolfgang Gentzsch, D-GridSeptember, 2006
AstroGrid
20Wolfgang Gentzsch, D-GridSeptember, 2006
Climate research moves towards new levels of complexity:
Stepping from Climate (=Atmosphere+Ocean) to Earth System Modelling
MediGrid: Mapping of Characteristics, Features, Raw Data, etc
24Wolfgang Gentzsch, D-GridSeptember, 2006
D-Grid-2 Call (review of proposals: Sept 19)
‘Horizontal’ Service Grids: professional Service Providers for heterogeneous user groups in research and industry
‘Vertical’ Community Service Grids using existing D-Grid infrastructure and services, supported by Service Providers
D-Grid extensions, based on a D-Grid 1 gap analysis - Tools for operating a professional grid service - Adding business layer on top of D-Grid infrastructure - Pilot service phase with service providers and ‘customers’
Grid Middleware Stack, major modulesUK e-Science: Phase 1: Globus 2.4.3, Condor, SRB. Phase 2: Globus 3.9.5 und 4.0.1, OGSA-DAI, Web services.
EGEE: gLite distribution: elements of Condor, Globus 2.4.3 (via VDT distribution).
ChinaGrid: ChinaGrid Supporting Platform (CGSP) 1.0 is based on Globus 3.9.1, and CGSP 2.0 is implemented based on Globus 4.0.
NAREGI: NAREGI middleware and Globus 4.0.1 GSI and WS-GRAM
TeraGrid: GT 2.4. and 4.0.1: Globus GRAM, MDS for information, GridFTP & TGCP file transfer, RLS for data replication support, MyProxy for credential mgmnt
D-Grid: Globus 2.4.3 (in gLite) and 4.0.2, Unicore 5, dCache, SRB, OGSA-DAI, GridSphere, GAT, VOMS and Shibboleth
27
The Architecture of Science Gateway ServicesThe Users Desktop
SecuritySecurity Data ManagementService
Data ManagementService
AccountingService
AccountingService
Notification ServiceNotification Service
PolicyPolicy Administration& Monitoring
Administration& Monitoring
Grid OrchestrationGrid OrchestrationResource
Allocation
ResourceAllocation
Reservations And Scheduling
Reservations And Scheduling
TeraGrid Gateway Services
Web Services Resource Framework – Web Services Notification
(( WSRF (GT4+Fujitsu WP1) + GT4 and other services)WSRF (GT4+Fujitsu WP1) + GT4 and other services)
SuperSINET
Grid-Enabled Nano-Applications (WP6)
Grid PSEGrid Programming (WP2)
-Grid RPC -Grid MPI
Grid Visualization
Grid VM (WP1)
Packag
ing
DistributedInformation Service
(CIM)
Grid Workflow (WFML (Unicore+ WF))
Super Scheduler
Grid Security and High-Performance Grid Networking (WP5)
Data (W
P4)
31
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Workload ManagementData Management
SecurityInformation & Monitoring
Access
gLite Grid Middleware Services
API
ComputingElement
WorkloadManagement
MetadataCatalog
StorageElement
DataMovement
File & ReplicaCatalog
Authorization
Authentication
Information &Monitoring
Application
MonitoringAuditing
JobProvenance
PackageManager
CLI
Accounting
Site Proxy
Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf
32Wolfgang Gentzsch, D-GridSeptember, 2006
Nutzer
ApplicationDevelopment
and User Access
GAT API
Data/Software
Resourcesin D-Grid
High-levelGrid
Services
Basic Grid Services
DistributedData Archive
User
NetworkInfrastructur
LCG/gLite
Globus 4.0.1
AccountingBilling
User/VO-Mngt
SchedulingWorkflow Management
Data management
Security
Plug-In
UNICORE
DistributedCompute Resources
GridSphere
Monitoring
D-Grid Middleware
33Wolfgang Gentzsch, D-GridOctober 17, 2006
Major Challenges with Implementing Globus
UK e-S, EGEE: GT 2.4 not a robust product. In the early days it took months to install, and numerous workarounds by EDG, LCG and the Condor team.
UK e-S: The move from GT 2.4 to OGSA-based GT 3 to WS-based GT 4during many of the UK grid projects was a disruption.
TeraGrid:GT is a large suite of modules, most of which need to be specially built for HPC environments. The tooling on which it is based is largely unfamiliar to system administrators and requires a training/familiarization process.
D-Grid: The code is very complex and difficult to install on the many different systems in heterogeneous grid environment.
Efforts for SustainabilityUK e-Science: National Grid Service (NGS), Grid Operations Support Center (GOSC), National e-Science Center (NeSC), Regional e-Science Centers, Open Middleware Infrastructure Institute (OMII), Digital Curation Center (DCC)EGEE:Plans to establish a European Grid Initiative (EGI) to provide persistent grid service federating national grid programmes starting in 2008ChinaGrid: Increasing numbers of grid applications using CGSP grid middleware packagesNAREGI:Software will be managed and maintained by Cyber Science Infrastructure Center of National Institute of InformaticsTeraGrid:NSF Cyberinfrastructure Office: 5 year Coop. Agreement. Partnerships with peer grid efforts and commercial web services activities in order to integrate broadly D-Grid: DGI WP 4: sustainability, services strategies, and business models
37
The Open Middleware Infrastructure Institute (OMII)
OMII is based at the University of Southampton, School of Electronics & Computer Science.
Vision: to become the source for reliable, interoperable and open-source grid middleware, ensuring continued success of grid-enabled e-Science in the UK.
OMII intends to:• Create a one-stop portal and software repository for open-source grid
middleware, including comprehensive information about its function, reliability and usability;
• Provide quality-assured software engineering, testing, packaging and maintenance of software in the OMII repository, ensuring it is reliable and easy to both install and use;
• Lead the evolution of grid middleware at international level, through a managed program of research and wide-reaching collaboration with industry.
38
The Digital Curation Center (DCC)
The DCC is based at the University of Edinburgh.DCC supports UK institutions with the problems involved in
storing, managing and preserving vast amount of digital data to ensure its enhancement and continuing long-term use.
The purpose of DCC is to provide a national focus for research into curation issues and to promote expertise and good practice, both nationally and internationally, for the management of all research outputs in digital format.
TeraGrid Science Gateways Initiative:Community Interface to Grids
• Common Web Portal or application interfaces (database access, computation, workflow, etc), standards (primarily web services)• “Back-End” use of grid services such as computation, information management, visualization, etc.• Standard approaches so that science gateways may readily access resources in any cooperating Grid without technical modification
• Win support of users– Application and experimental deployment essential– R&D for production quality (free) software– Nano-science (and now Bio) involvement, large testbed
46
List of NAREGI “Standards”(beta 1 and beyond)
• GGF Standards and Pseudo-standard Activities set/employed by NAREGI
• Need to prepare for permanent Grid infrastructure– Maintain Europe’s leading position in global science Grids– Ensure a reliable and adaptive support for all sciences– Independent of short project funding cycles– Modelled on success of GÉANT
Infrastructure managed in collaboration with national grid initiatives
48
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
European National Grid Projects
• Austria – AustrianGrid• Belgium – BEGrid• Bulgaria – BgGrid• Croatia – CRO-GRID• Cyprus – CyGrid• Czech Republic- METACentre• Denmark ?• Estonia – Estonian Grid• Finland• France – planned (ICAR)• Germany – D-GRID• Greece - HellasGrid• Hungary• Ireland - Grid-Ireland• Israel – Israel Academic Grid• Italy - planned
• Latvia – Latvian Grid• Lithuania - LitGrid• Netherlands – DutchGrid• Norway – NorGrid• Poland - Pioneer• Portugal – launched April’06• Romania – RoGrid• Serbia – AEGIS• Slovakia• Slovenia - SiGNET• Spain – planned• Sweden – SweGrid• Switzerland - SwissGrid• Turkey – TR-Grid• Ukraine - UGrid• United Kingdom - eScience
49Wolfgang Gentzsch, D-GridSeptember, 2006
D-Grid: Towards a Sustainable Infrastructure for Science and Industry
Govt is changing policies for resource acquisition (HBFG ! ) to enable a service model
2nd Call: Focus on Service Provisioning for Sciences & Industry
Application and user-driven, not infrastructure-driven
Focus on implementation and production, not grid research, in a multi-technology environment (Globus, Unicore, gLite, etc)
D-Grid is the Core of the German e-Science Initiative
50Wolfgang Gentzsch, D-GridSeptember, 2006
• Sensitive data, sensitive applications (medical patient records)• Different organizations get different benefits• Accounting, who pays for what (sharing!)• Security policies: consistent and enforced across the grid !• Lack of standards prevent interoperability of components• Current IT culture is not predisposed to sharing resources• Not all applications are grid-ready or grid-enabled• Open source is not equal open source (read the small print)• SLAs based on open source (liability?)• “Static” licensing model don’t embrace grid• Protection of intellectual property • Legal issues (FDA, HIPAA, multi-country grids)
Summary:Challenges for Research and Industry
51Wolfgang Gentzsch, D-GridSeptember, 2006
Summary: Lessons Learned and Recommendations
– Continuity: Grid infrastructure should be modified and improved in large cycles only: applications depend on infrastructure !
– Sustainability: Funding should be available after end of project, to guarantee services, support and continuous improvement.
– Interoperability: Use open-source software and standards especially in the infrastructure and application middleware layer.
– Collaboration: between infrastructure developers and the applications, to best utilize grid services and to avoid application silos.
– User-Friendliness: for easy adoption for new communities. Infrastructure group should offer installation, operation and support services.
– Grid Services: Centers of Excellence should specialize on specific services, e.g. integration of new communities, grid operation, utility services, training, support, etc.
– Participation of Industry: has to be industry-driven. Push from outside, even with goverment funding, is not promising. Success comes only from real needs e.g. through existing collaborations between research and industry