Building Technology for Storage Systems Monitoring Intermountain HealthCare Thomas Gwyn Dunbar [email protected]
Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Systems Monitoring

Dec 19, 2014

Thomas Dunbar's presentation on Building Technology for Storage Systems Monitoring.
The presentation was given during the Nagios World Conference North America held Sept 20-Oct 2nd, 2013 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna
Transcript
Page 1: Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Systems Monitoring

Building Technology for Storage Systems Monitoring

Intermountain HealthCare

Thomas Gwyn Dunbar

[email protected]

Page 2

References & Introduction

* http://content.healthaffairs.org/content/30/6/1185.full.html

* nagios.org, etc

* Nagios: Building Enterprise-Grade Monitoring Infrastructures for Systems and Networks, 2nd ed., David Josephsen

* The Unix Programming Environment, Kernighan & Pike

* After Virtue, 3rd ed., Alasdair MacIntyre

* Purgatorio, Dante - since Nagios ain’t gonna insist on sainthood

Page 3

IHC and IT

Intermountain Healthcare is an internationally recognized, nonprofit system of 22 hospitals, a Medical Group with more than 185 physician clinics, and an affiliated health insurance company, SelectHealth. Our 33,000 employees serve patients and plan members in Utah and southeastern Idaho. IHC has an annual budget of around 5 billion dollars.

Datacenters in Plano, TX; Salt Lake City, UT; and Ogden, UT provide high-availability systems with over 5 petabytes of storage (over 12,000 spindles), using IBM DS8000 for tier 1 and NetApp for other storage. In-house developed applications run on top of multiple Oracle databases over 15TB in size.

CA Service Desk/CA Spectrum/xMatters; Nagios

Page 4

Monitoring Trend at IHC

Page 5

Storage’s Nagios Servers

While the SA team is moving away from Nagios, Storage is moving to it:

Using Nagios 3.5, with check_mk and pnp4nagios

DNX, if need be

Our own servers for business reasons

Integration with CA Spectrum/Service Desk, etc

Page 6

Storage Hardware

Brocade switches, IBM DS SAN, SVC & NetApp

Page 7

This Talk’s Perspective

Comprehensive monitoring is a major, site-specific application.

Major applications become very difficult to replace (e.g. air traffic control, IHC systems)

Hence, let’s consider fundamentals

Page 8

Worldviews

* What we look through, not what we look at

* Tempts us to think it is the only way to see

* Scientific: what can we know, and how

* Technological: what can we build, and how

* Context

Page 9

Strategies

* Building and Growth

* Inputs and Feedback

* Planning

* Personality

Page 10

Building Technology

Coherence

Clarity

Continuity

Page 11

Spectrum of Traps

EventMessage: Thu 05 Sep, 2013 - 14:47:23 - Device ********** of type NetAppONTAPDev is no longer responding to primary management requests (e.g. SNMP)

CA Spectrum and Nagios
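
One way a Spectrum event like the one above could be handed to Nagios is as a passive check result. The following is only a minimal sketch, not the setup described in the talk: the regular expression, the function names, the service description "Spectrum Event", and the choice of CRITICAL (2) as the state are all assumptions made for illustration. The output line follows the format of Nagios's PROCESS_SERVICE_CHECK_RESULT external command.

```python
import re
from datetime import datetime

# Pattern for the event text shown on the slide.
# The group names (stamp, device, model, text) are hypothetical labels.
EVENT_RE = re.compile(
    r"EventMessage:\s+"
    r"(?P<stamp>\w{3} \d{2} \w{3}, \d{4} - \d{2}:\d{2}:\d{2})"
    r"\s+-\s+Device (?P<device>\S+) of type (?P<model>\S+) (?P<text>.+)"
)


def parse_event(line):
    """Split one event line into its fields, or return None if it does not match."""
    m = EVENT_RE.match(line)
    if m is None:
        return None
    when = datetime.strptime(m.group("stamp"), "%a %d %b, %Y - %H:%M:%S")
    return {
        "when": when,
        "device": m.group("device"),
        "model": m.group("model"),
        "text": m.group("text"),
    }


def to_passive_result(event, now, service="Spectrum Event", state=2):
    """Format a passive service check result for the Nagios command file.

    `now` is the submission time in epoch seconds; `service` and `state`
    are illustrative defaults (2 = CRITICAL), not values from the talk.
    """
    output = "{model}: {text}".format(**event)
    return "[{0}] PROCESS_SERVICE_CHECK_RESULT;{1};{2};{3};{4}".format(
        now, event["device"], service, state, output
    )
```

In practice the resulting line would be written to the Nagios external command file, and the host and service named in it would need to exist in the Nagios configuration with passive checks enabled.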

Page 12

Graphing: Time Series

Down the road…correlation
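
If "correlation" here is read as statistical correlation between two graphed time series (the slide does not say which kind is meant), a first experiment could be a plain Pearson coefficient over two equally sampled metrics. This is a sketch under that assumption; the function name and the sample values in the usage note are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long numeric series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have the same length, at least 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A constant series has no defined correlation; report 0.0 by convention.
        return 0.0
    return cov / math.sqrt(var_x * var_y)
```

For example, `pearson(latency_samples, queue_depth_samples)` on two made-up lists sampled at the same timestamps gives a value between -1 and 1; values near either end suggest the two metrics move together and may be worth graphing side by side.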