Top Banner
RSV: OSG Grid Fabric RSV: OSG Grid Fabric Monitoring and Monitoring and Interoperation with WLCG Interoperation with WLCG Monitoring Systems Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and Nuclear Physics Location: Prague, Czech Republic Date: March 26, 2009 3/26/09
20

RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Jan 06, 2018

Download

Documents

Kory Parks

Initial Goals of RSV Put the monitoring into the hands of the local resource administrator Make simple and flexible probe structure Provide independent schedule and collection infrastructure (decoupled from the probe) Provide data to WLCG for Availability and Reliability calculations 3/26/09
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

RSV: OSG Grid Fabric RSV: OSG Grid Fabric Monitoring and Monitoring and

Interoperation with WLCG Interoperation with WLCG Monitoring SystemsMonitoring Systems

Rob Quick, Arvind Gopu, and Soichi HayashiComputing in High Energy and Nuclear PhysicsLocation: Prague, Czech RepublicDate: March 26, 2009

3/26/09

Page 2: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

What we’ll be covering…What we’ll be covering…

Goals of the RSV Project

Local Structure and Initial Deployment

Central Collection and WLCG SAM Interoperability

Data Presentation

Next Steps

3/26/09

Page 3: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Initial Goals of RSVInitial Goals of RSV

Put the monitoring into the hands of the local resource administrator

Make simple and flexible probe structureProvide independent schedule and

collection infrastructure (decoupled from the probe)

Provide data to WLCG for Availability and Reliability calculations

3/26/09

Page 4: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Goals as RSV MaturedGoals as RSV Matured

Interact with local fabric monitoringRecruit ‘experts’ to create probesMake a flexible central display of collected

dataImprove WLCG transport reliability

3/26/09

Page 5: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

RSV Client RSV Client 3/26/09

Page 6: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

DeploymentDeployment

Quick adoption by ATLAS and CMS◦Due to WLCG Availability and Reliability

General OSG adoption outside of LCG related resources is still slow

Currently 106 of 131 Services Reporting RSV Status to Central Collector

Initial version had some reliability issues and was difficult to configure◦These have been addressed in RSV V2 or are

being addressed in RSV V3

3/26/09

Page 7: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Central CollectionCentral Collection

Uses Gratia for transport and collection of probe results◦Mechanisms that holds records until they can

be transmitted protecting from outages on either side

◦Collection DatabaseOSG Information Management DB

◦Determines which records are from valid OSG resources

◦Determines which OSG sites should publish to WLCG (Changes left to the admin)

3/26/09

Page 8: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

WLCG SAM Interoperability WLCG SAM Interoperability

Probe output based on specification set forth by Grid Monitoring Working Group◦Joint project by EGEE and OSG

Uses Nagios Critical/Warning/Unknown/OK◦Allows use in existing fabric monitoring

Transmitted via ActiveMQ to WLCG

3/26/09

Pic: James Casey

Page 9: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

RSV Status in SAM RSV Status in SAM 3/26/09

Page 10: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

OSG Status to GridViewOSG Status to GridView3/26/09

Page 11: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Data PresentationData Presentation

Everybody gets so much information all day long that they lose their common sense.

--Gertrude Stein (1874 – 1946)

Now that we have all this useful information, it would be nice to do something with it. (Actually, it can be emotionally fulfilling just to get the information. This is usually only true, however, if you have the social life of a kumquat.)

--Unix Programmer's Manual

3/26/09

Page 12: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Goals of MyOSG PresentationGoals of MyOSG Presentation

Consolidate data sources in OSGReplace VORS monitoringProvide data is ways that are useful to the

usersDo not make another “dashboard”Allow users to integrate the information

into their normal daily workflow

3/26/09

Page 13: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

MyOSG Status HistoryMyOSG Status History3/26/09

Page 14: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Drilldown on IssueDrilldown on Issue3/26/09

Page 15: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

MyOSG Availability GraphsMyOSG Availability Graphs3/26/09

Page 16: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

MyOSG UWA Used with iGoogleMyOSG UWA Used with iGoogle3/26/09

Page 17: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

MyOSG UWA used with NetvibesMyOSG UWA used with Netvibes3/26/09

Page 18: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

MyOSG - Universal Widget APIMyOSG - Universal Widget API

Allows creation of your own view of OSG Status data and integrate it with your other web/desktop/dashboard mechanisms

Netvibes, Google Personalized Homepage, Windows Vista, Apple Dashboard, Opera, iPhone (Other mobile devices)

If you don’t use one of the above a simple XML format is available also

3/26/09

Page 19: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

RSV Phase IIIRSV Phase III

More probes / re-write some probes◦Security Probes◦Infrastructure Probes (VOMS, GUMS, BDII)

Complete VORS replacement Improve stabilityConfiguration / restartingUnified Management ConsoleRobot certificates

Project Plan

3/26/09

Page 20: RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.

Questions?Questions?3/26/09