Page 1
Managing Fires
leadership’s role in incident management
@CrayZeigh
Page 2
Aaron Aldrich @CrayZeigh
DevOps Consultant
Cage Data, Inc.
cagedata.com/devops/
Page 3
use your tools for little fires
Page 4
context
Page 5
runbooks
Page 6
automation engines
Page 7
escalation processes
Page 8
crisis requires a team and leadership
Page 9
bestow leadership before crisis
Page 10
crisis leaders should be authorized to make important business decisions
Page 11
delegate authority as far as possible
Page 12
Troubleshooting 101
Page 13
“Never let a serious crisis go to waste. What I mean by that is it’s an opportunity to do things you think you could not do before.”
-Rahm Emanuel
Page 14
take it incrementally
Page 15
what will make it work?
Page 16
what will make it whole?
Page 17
“Brevity is the soul of wit. Specificity is the soul of narrative [getting work done]”
Page 18
make work visible to all
Page 20
leaders handle human problems
Page 21
More on the plate, less on the mind
Page 22
manage your bench
Page 23
make sure team members are available to make substitutions
Page 24
tell people to stop working and get rest
Page 26
public communication
Page 27
communicate quickly
Page 28
update your customers often
Page 29
be honest
Page 30
empower your team, remove barriers