Top Banner
When Downtime Is Not An Option Joe Felisky Global Director – Technology Solutions SUSE
23
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

When Downtime Is Not An Option

Joe Felisky Global Director – Technology Solutions SUSE

Page 2: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

2

Survey Results: More and More Systems Are Considered Critical

Source: Forrester Research, Inc.

Page 3: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

3

Workloads that can least tolerate downtime …

•  ERP applications, databases, transactional workloads, and more

• Workloads that may impact a large # of users • Workloads that directly impact revenue

Mission/Business Critical

Workloads

•  As Data Centers are consolidated onto fewer servers, highly virtualized hosts require additional protections

•  Downtime of the guests and hosts can critically impact your IT services

High Density Virtualization

Page 4: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

4

Downtime is More Than Just Lost Revenue

Page 5: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

5

Planned Downtime

•  What is/causes planned downtime? –  Scheduled software patching and updates that require system

reboot

–  Scheduled hardware maintenance

–  Data migration

•  What can IT do to mitigate it? –  Schedule service window to minimize business impact (getting

harder in globalization and mobile era)

–  Optimize the process

Scheduled Downtime

Page 6: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

6

Unplanned Downtime

•  What is/causes unplanned downtime: –  It’s a surprise – no/little warning

–  Hardware failure, software bug, malicious attack or operational mistake

–  An environmental failure such as natural disaster

•  What can IT do to mitigate it? –  Reliable systems, High Availability clustering, Geo clustering,

proactive patch management

–  Improve process by best practices and training

Page 7: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

7

Unplanned Downtime: Top Causes

Source: Forrester Research, Inc.

Page 8: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

Exploring Open Source Solutions That Minimize Downtime

Page 9: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

9

How To Choose The Right Technologies?

Page 10: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

10

1. Prevent

Hardware Downtime

3. Minimize Human Mistakes

2. Maximize Service

Availability

SUSE’s Three Steps Toward Zero Downtime:

Page 11: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

11

Step 1: Prevent Hardware Downtime

•  Choose the right platform to fit your IT needs.

•  By working with hardware partners, SUSE is bringing reliability to the next level. Hardware Architecuture

Hardware Benefits SUSE value-add

IBM System z

•  99.999% availability

•  Specially designed tools and technologies

•  HA/Geo included

INTEL64

•  Cost-effective •  Widely accepted

for mission-critical workloads

•  Cooperate with hardware to fully exploit RAS (i.e. MCE)

Page 12: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

12

Baldor

•  Baldor Electric is a global manufacturing company with headquarters in Arkansas, USA. It has 26 plants in the US, Canada, Mexico and China, selling industrial electric products to more than 70 countries.

•  Baldor couldn’t ensure 24x7 availability to provide quality service to customers and employees and eliminate the risk of business disruption. At the time, outages occurred five-to-eight times a year, costing hundreds of thousands of dollars.

•  By moving workloads to virtual machines on IBM System z running SUSE Linux Enterprise Server for System z, Baldor Electric Company reduced IT costs from two percent of sales to less than one percent—while improving response time, up-time and productivity.

•  “The platform goes beyond five 9s,” says Eric Breuer, Manager, Large Systems, Baldor. And the new environment results in considerable time savings for planned downtime.

●  30% lower hardware and software costs.

●  90% consolidation of servers (resulting in lower space, cooling and power costs).

●  90% improvement in up-time (reducing disaster recovery losses).

●  “We believe SUSE has the best Linux distribution with the best support. The platform goes beyond “five 9s,” and the new environment results in considerable time savings for planned downtime.” Eric Breuer, Manager Large Systems Baldor

Read Full Case Study

Page 13: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

13

Step 2: Maximize Service Availability

•  SUSE Linux Enterprise High Availability Extension –  Industry-leading open source HA clustering solution

–  Set up clusters among physical hosts or virtual guests

–  Easy-to-use set-up and management tools

–  Resource agents for third-party apps like SAP, PostgreSQL, Oracle, JBoss, Tomcat, Websphere and more.

•  Metro and Geo Clustering –  Combat regional disasters such as power outage or flood

–  Ensure business continuity by metro clustering (up to 25km) or geo clustering (any distance in the world) *Founder of the Linux Foundation HA Working Group

Page 14: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

14

Page 15: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

15

Maximize service in your cloud or with your storage … • SUSE Cloud

–  Leverage High Availability of the Control nodes in your Openstack (SUSE Cloud) deployment

–  Utilize Openstack Heat to scale applications on demand for increased service availability

• SUSE Storage (beta) –  Software Defined Storage based on Ceph

–  Scalable, Reliable and Highly Available through redundancy

–  Automated management and repair of underlying storage

Page 16: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

16

Step 2: Maximize Service Availability (Cont.) •  SUSE Linux Enterprise Live Patching New! (codenamed “kGraft”)

–  Live-kernel patching without reboot

–  Apply urgent security patches before next service window, reducing the need for planned downtime

•  Unique advantages –  It integrates smoothly into existing package and patch

management solutions, as it uses the Enterprise Linux RPM package standard

–  While patching, there is no need to hold the Linux kernel for a short time, as is necessary in other technologies

Page 17: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

17

Page 18: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

18

Step 3: Minimize Human Mistakes •  Full System rollback

–  Built on copy-on-write btrfs and efficient Snapper tool

–  Now capable of full-system rollback, including kernel files

–  Minimizes risks of operational mistakes

•  YaST and autoYaST

–  The most efficient single-system management framework, consistent UI

•  SUSE Manager

–  Open source one-to-many system management

–  Reduce errors by proactive and automated patching

–  Full Service Pack upgrade automation – with no re-install of the Server OS.

Page 19: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

19

Page 20: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

20

1. Prevent

Hardware Downtime

3. Minimize Human Mistakes

2. Maximize Service

Availability

Three Steps For Customers To Take Towards Zero Downtime

Page 21: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

21

Thank you.

For more information: www.suse.com/zerodowntime

Page 22: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

22

+49 911 740 53 0 (Worldwide) www.suse.com

Corporate Headquarters Maxfeldstrasse 5 90409 Nuremberg Germany

Join us on: www.opensuse.org

Page 23: Gartner Data Center Conference 2014 - When Downtime is Not an Option.

Unpublished Work of SUSE. All Rights Reserved. This work is an unpublished work and contains confidential, proprietary, and trade secret information of SUSE. Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability. General Disclaimer This document is not to be construed as a promise by any participating company to develop, deliver, or market a product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The development, release, and timing of features or functionality described for SUSE products remains at the sole discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.

257-000020-001