Pets vs. Cattle: The Elastic Cloud Story

CCA - NoDerivs 3.0 Unported License - Usage OK, no modifications, full attribution*!* All unlicensed or borrowed works retain their original licenses

Pets vs. Cattle:!The Elastic Cloud Story!DevOps Chicago Meetup!February 26, 2014

@randybias

A Tale of Two Clouds

Enterprise Computing Approach

GUI Driven!Ticket-Based!Hand-Crafted!

Reserved !Scale-up!

Smart Hardware!Proprietary!

Traditional Dev!…

Cloud Computing Approach

API Driven!Self-Service!Automated!On-demand!Scale-out!

Smart Apps!Open Source!Agile DevOps!

Elastic Cloud Shifts Uptime Responsibility

Enterprise Model Cloud Model

99.9%!Applications!

(8h46m down)

99.999%!Infrastructure!

($$$$)

99.999% Applications!(5m down)

99% Infrastructure!

Elastic Cloud Origins

Elastic !Private Cloud

Enterprise Virtualization!Private Cloud

Elastic & Virtualization

2.0 Clouds are very different.!

!Different

workloads.!!

Different !architectures.!

!Different !

skills.!!

Different economics.

Virtual Infrastructure

Standardization, Automation,!

Chargeback, Self-Service!

Designed for Server Consolidation !IT Admins manage Infrastructure!Ticket-based manual provisioning!Improves virtualization value

Elastic Public Cloud

On-premise Deployment!

Designed for Agility!Cloud Admins manage Services!

Self-service automated provisioning!Delivers cloud value on-premise

What Companies Care About?

Cloud Computing!

Agile Development!

Business !Agility!

Operational Discipline!

ACCELERATING!TIME TO VALUE!Continuous

Integration

Continuous Testing & Delivery

Agile Methodologies

IaaS / PaaS !!

Public / Private / Hybrid !!

Big Data / Analytics

Public APIs

Continuous Deployment

DevOps Data Center & App Automation

Line of Business

Enablement

New App Initiatives

(Mobile, SaaS, etc.)

Data Center Modernization

Elastic Cloud is a Mindset Change

Attribution: Bill Baker, Distinguished Engineer, Microsoft

bowzer.company.com!(scale-up)

web001.company.com!(scale-out)

(Virtual) Servers *are* cattle

Pets vs. Cattle Takes Off

MicrosoftCloudscaling

ScalrRackspaceRed Hat

Scale-out, not UP in Cloud

(Some) Elastic Cloud Patterns!

What follows are *some* Elastic Cloud Patterns!There are many more, but these are mine!Input, ideas, & other thoughts welcome via twitter / email

Big Failure Domains !Make Big Craters

Anti-Pattern

Smaller Failure Domains

Would you rather have the whole cloud down !or just a small bit of it for a short time?

Loose Coupling

Synchronous, blocking calls mean cascading

failures.

Async, non-block calls mean failure in

isolation.

Open Source Software

Excessive software taxation is the past.

Black boxes create lock-in.

You can !always fork.

Uptime in Software Self-management

Hardware fails.!Software fails.!

People fail.

Only software can measure itself &

respond to failure in near real-time.

Applications designed for 99.999% uptime can

run anywhere

Scale Out vs Scale up

Vertical Scaling Make boxes bigger (usually an HA pair)

Horizontal ScalingMake more boxes

➔➔

B ...A B C N

Circuit Breaker Pattern

Fallback mechanisms (e.g. cached data)

ensure uninterrupted service while giving service time to

recover

When failing service detected, stop calling that

API and serve fallback responses

Buy from ODMs

ODMs operate their businesses on 3-10%

margins.

AMZN, GOOG, and Facebook buy direct without a middleman.

Only a few enterprise vendors are pivoting to

compete.

Less Enterprise “Value” in x86 Servers

Generic servers rule. Full stop. Nothing is better because nothing else is

*generic*.

“... a data center full of vanity free servers ... more efficient ... less expensive to build

and run ... “ - OCP

Fully Routed (L3) Networking

The largest cloud operators all run layer-3 routed,

networks with no VLANs.

Cloud-ready apps don’t need or want VLANs.

Enterprise apps can be supported on elastic clouds

using Software-defined Networking (SDN)

Software-defined Networking (SDN)

• x86 server is the new Linecard!• network switch is the new ASIC!• VXLAN (or NVGRE) is the new Chassis!• SDN Controller is the new SUP Engine

“Network Virtualization”

Flat Networking + SDNs

Flat + SDN co-exist & thrive together

Standard SecurityGroup

Availability Zone

Virtual L2 Network

Virtual Private Cloud

Networking

VPC SecurityGroup

Internet

VPC Gateway

Physical Node

RAIS instead of HA Pairs/ClustersRedundant arrays of inexpensive services (RAIS)!

Load balanced with no state sharing!Active … active … active … active … !On failure, connections are lost, but failures are rare!Rolling upgrades are easier, because each server is an island!Think: scale-out + fault isolation (sharding)!

Ridiculously simple & scalable!

Hardware failures are infrequent & impact subset of traffic!(N-F)/N, where N = total, F = failed!10 RAIS servers - 1 failure == 90% capacity!Most things retry anyway!

Cascade failures are unlikely and failure domains are small

Service Array (RAIS) Example

Backbone Routers

Cloud Access Switches

AZ (Spine) Switches

RAIS (NAT, LB, VPN)

OSPF Route Announcements

Return Traffic (default or source NAT)

Public IP Blocks

Cloud Control Plane

Lots of Inexpensive 1RU Switches

1RU: 6K-30K VMs / AZ

Simple spine-and-leaf flat routed network

Rack 1 Rack 2 Rack 3

Modular: 40K-200K VMs / AZ

Rack 1Rack 2

MultipleRacks

Rack 1Rack 2

MultipleRacks

Rack 1Rack 2

MultipleRacks

Direct-attached Storage (DAS)

Cloud-ready apps manage their own data replication.

DAS is the smallest failure domain possible with

reasonable storage I/O.

SAN == massive failure domain.

SSDs will be the great equalizer.

Elastic Block Device Services

EBS/EBD is a crutch

Bigger failure domains (AWS outage anyone?), complex,

sets high expectations

Sometimes you need a crutch. When you do, overbuild the network, and make sure

you have a smart scheduler.

AWS EBS Outage!http://aws.amazon.com/message/65648/

More Servers == More Storage I/O

>1M writes/second, triple-redundancy w/ Cassandra on AWS

Linear scale-out == linear costs for performance

Hypervisors are a Commodity

Cloud end-users want OS of choice, not HVs.

Level up! Managing iron is for mainframe operators.!… hypervisors are bare metal APIs

Hypervisor of the future is open source, easily modifiable, &

extensible.

The Hypervisor of the Future May Be NO Hypervisor

ironic

Bare Metal Cloud

Quiz Time

Pets CattleLACP?

Quiz Time

Pets CattleLACP ➔

Quiz Time

Pets CattleLACP

Managing a Server at a Time?

Quiz Time

Pets CattleLACP

Managing a Serverat a Time ➔

Quiz Time

Pets CattleLACP

Managing Server at a Time

Auto-scaling?

Quiz Time

Pets CattleLACP

Auto-scaling➔

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure?

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure➔

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals?

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals ➔

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

HA pairs for redundancy?

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

HA pairs for redundancy ➔

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

HA pairs for redundancy

Shared Nothing Architecture?

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

Shared Nothing Architecture➔

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

Shared Nothing Architecture

Persistent Block Storage?

Quiz Time

Pets CattleLACP

Auto-scaling

Design-for-Failure

100% Uptime Goals

Shared Nothing Architecture

Persistent Block Storage ➔

Randy Bias!Founder & CEO, Cloudscaling!Director, OpenStack Foundation!@randybias

Pets vs. Cattle: The Elastic Cloud Story

cloud admins

elastic cloud patterns

elastic cloud origins

elastic cloud story

cloud value onpremise

cloud computing approach

enterprise model cloud

largest cloud operators

Technology

Click pets

Animals & Pets

Pets, Pets, Pets. - RSPCA Victoria Pets Pets Vic... · When...

Containers Change Everything Anne Currie€¦ ·...

The History of Pets vs. Cattle ... And Using It Properly

Pets introduction

Pets pets pets - RSPCA Vic · Pets, Pets, Pets is a program...

Revista Pets

Turning Pets into Cattle: A Demonstration to Provoke...

Pampered Pets - Pets December 2011

Immutable Windows: from pets to cattle

Multi-Cloud Infrastructure Monitoring with Elastic Stack ·...

From Pets to Cattle to Bacteria OSCON BOF

enn with - I T.A.K.E. (Un)...

Historical Live Cattle/Feeder Cattle Report

Nicaraguan Cattle Industry 2016€¦ · Nicaraguan Cattle.....