CERN - IT Department, CH-1211 Genève 23, Switzerland, www.cern.ch/it. Oracle and Streams Diagnostics and Monitoring. Eva Dafonte Pérez, Florbela Tique Aires Viegas.

Dec 22, 2015


Cecilia Lucas
Page 1:

CERN - IT Department
CH-1211 Genève 23
Switzerland
www.cern.ch/it

Oracle and Streams Diagnostics and Monitoring

Eva Dafonte Pérez
Florbela Tique Aires Viegas

Page 2:

Agenda

• Oracle Enterprise Manager
• Streams monitoring tool
• Local monitoring tools
• Network diagnostic tools
• Open questions

Page 3:

Oracle Enterprise Manager

Overview

• Set of centralized management tools
  – administration
  – configuration management
  – end-to-end monitoring
  – security capabilities
• Proactive monitoring and alerting
• Monitoring service performance and usage
• Automation, job scheduling, patch management

Page 4:

Oracle Enterprise Manager

Page 5:

Oracle Enterprise Manager

Page 6:

Oracle Enterprise Manager

Page 7:

Oracle Enterprise Manager

Open questions

• Thresholds configuration
• Metrics for the servers’ load
• Run some advisors to try and pinpoint performance or configuration issues
• Can Tier-1 sites use CERN OEM to monitor their databases?

Page 8:

Streams monitoring tool

Overview

• Objectives:
  – Replication topology
  – Status of streams connections
  – Error notifications
  – Monitor streams performance (latency, throughput, …)
  – Monitor resources related to the streams performance (Streams Pool memory, redo generation)
• Architecture:
  – ‘Strmmon’ daemon written in Python
    • collects streams and instances info + repository
    • errors and warnings
  – End-user web application

http://oms3d.cern.ch:4889/streams/main.php
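
The two performance objectives named on this slide, latency and throughput, can be made concrete with a small sketch. This is not the Strmmon code, which the slides do not show. The `Sample` record and its field names are hypothetical, and the sketch assumes the monitor polls each streams process for a cumulative LCR counter and for the creation time of the newest change it has handled.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Sample:
    """One poll of a streams process (hypothetical record, not the Strmmon schema)."""
    taken_at: datetime             # when the monitor polled the process
    last_change_created: datetime  # creation time of the newest change it handled
    total_lcrs: int                # cumulative count of LCRs handled so far


def latency_seconds(sample: Sample) -> float:
    """Age of the newest handled change at polling time."""
    return (sample.taken_at - sample.last_change_created).total_seconds()


def throughput_lcrs_per_second(prev: Sample, curr: Sample) -> float:
    """Average LCR rate between two consecutive polls."""
    elapsed = (curr.taken_at - prev.taken_at).total_seconds()
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    delta = curr.total_lcrs - prev.total_lcrs
    if delta < 0:
        # Assumption: a counter that went down means the process restarted,
        # so only the changes counted since the restart are known.
        delta = curr.total_lcrs
    return delta / elapsed
```

Computing throughput from the difference of two cumulative counters, rather than from a single reading, is what makes the figure meaningful for plotting over time.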

Page 9:

Streams monitoring tool

Monitor view

Connection view

Page 10:

Streams monitoring tool

Database list

Page 11:

Streams monitoring tool

Connection dashboard view

Detailed Streams view

Page 12:

Streams monitoring tool

Graph generator

Page 13:

Streams monitoring tool

• New features
  – Error tab (web application)
    • List of errors that have been reported by streams processes
  – Availability tab (web application)
    • Percentage availability of each instance, with availability plots
  – New metrics (monitor)
    • CPU consumption
    • Physical bytes
      – Read
      – Written
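
The slides do not say how the availability percentage is derived. One plausible reading, sketched here under that assumption, is the share of periodic checks in which an instance responded. The function name and the input layout are hypothetical.

```python
def availability_percent(checks: dict[str, list[bool]]) -> dict[str, float]:
    """Percentage of successful checks per instance.

    `checks` maps an instance name to the outcomes of its periodic checks
    (True = instance responded). This layout is an assumption for illustration.
    """
    result = {}
    for instance, outcomes in checks.items():
        if not outcomes:
            continue  # no checks recorded: availability is unknown, not 0%
        result[instance] = 100.0 * sum(outcomes) / len(outcomes)
    return result
```

Instances with no recorded checks are left out of the result so that a monitoring gap is not reported as downtime.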

Page 14:

Streams monitoring tool

Errors List

CPU consumption

Page 15:

Streams monitoring tool

Availability

Page 16:

Streams monitoring tool

Open questions

• Proposed future features
  – Weekly reports (number of transactions applied, number of LCRs streamed, etc.)?
  – More notifications via mail (high latency, high CPU utilization, etc.)?
  – Some automation in streams administration?
    • Detecting common failures (e.g. propagation hangs)
    • Carrying out the procedure to resolve the failures
• Streams errors report:
  – Any action necessary at Tier-1 sites?
  – Who is testing what?
• Email alerts
  – RAL still does not receive notifications
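
The proposal for more mail notifications (high latency, high CPU utilization) amounts to comparing metrics against thresholds and composing a message when one is exceeded. The sketch below illustrates that idea only. The threshold values, metric names, and function names are hypothetical, and the message is built with Python's standard `email.message.EmailMessage` but deliberately not sent.

```python
from email.message import EmailMessage

# Hypothetical limits; the slides do not give actual threshold values.
THRESHOLDS = {"latency_seconds": 300.0, "cpu_percent": 90.0}


def breached(metrics, thresholds=THRESHOLDS):
    """Return one description per metric that is above its limit."""
    return [
        f"{name}={metrics[name]} exceeds {limit}"
        for name, limit in thresholds.items()
        if name in metrics and metrics[name] > limit
    ]


def build_alert(site, metrics, sender, recipient, thresholds=THRESHOLDS):
    """Compose an alert mail, or return None when nothing is over its limit."""
    problems = breached(metrics, thresholds)
    if not problems:
        return None
    msg = EmailMessage()
    msg["Subject"] = f"[streams] {site}: {len(problems)} threshold(s) exceeded"
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content("\n".join(problems))
    return msg
```

Returning the message instead of sending it keeps the threshold logic testable. Delivery, for example through `smtplib`, would be a separate step.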

Page 17:

Local and Network monitoring

Open questions

• Is OEM sufficient?
• Which other tools?
• Which metrics should we pay attention to?
• “Homemade” tools for backup monitoring:
  – RAL, …
• Local monitoring with Nagios
  – Is this reasonable?
  – Any experience?
• TRIUMF (slides)
• BNL (slides)
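
To give the Nagios question something concrete to react to, here is a minimal sketch of what a local check could look like. It follows the standard Nagios plugin convention of exit codes 0 (OK), 1 (WARNING), 2 (CRITICAL) and 3 (UNKNOWN), with one status line and optional performance data after a `|`. The choice of streams latency as the checked metric, the function names, and the argument order are assumptions for illustration. How the latency value is obtained is left out.

```python
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
_NAMES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def check_latency(latency_s, warn_s, crit_s):
    """Map a latency reading to a Nagios exit code and status line."""
    if latency_s is None:
        return UNKNOWN, "UNKNOWN - streams latency not available"
    if latency_s >= crit_s:
        code = CRITICAL
    elif latency_s >= warn_s:
        code = WARNING
    else:
        code = OK
    # Performance data: label=value[unit];warn;crit
    perfdata = f"latency={latency_s:g}s;{warn_s:g};{crit_s:g}"
    return code, f"{_NAMES[code]} - streams latency {latency_s:g}s | {perfdata}"


def main(argv):
    """Expects three arguments: latency, warning limit, critical limit (seconds)."""
    latency_s, warn_s, crit_s = (float(arg) for arg in argv)
    code, line = check_latency(latency_s, warn_s, crit_s)
    print(line)
    return code
```

A wrapper script would end with `sys.exit(main(sys.argv[1:]))` so that Nagios sees the return value as the plugin's exit status.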

Page 18:

Overall

• What, specifically, should Tier-1 sites monitor on their own databases?
• What does CERN want to know about the sites?
• What do Tier-1 sites need to know about the CERN databases?
• What do Tier-1 sites need to know about other Tier-1 sites?