1© 2001, Cisco Systems, Inc.
IOS Update for SwiNOG 4th
IOS Update for SwiNOG 4th
Chris Martin
Systems Engineer
Cisco Switzerland
Chris Martin
Systems Engineer
Cisco Switzerland
17th April 200217th April 2002
2© 2001, Cisco Systems, Inc.
AgendaAgenda
• Cust.Sat Survey / Quality Initiatives
• High Availability
• Cust.Sat Survey / Quality Initiatives
• High Availability
4© 2001, Cisco Systems, Inc.
Five Cisco IOS SW Quality GoalsEmbraced throughout Cisco
Five Cisco IOS SW Quality GoalsEmbraced throughout Cisco
Goal 1 - Reduce regression defects
Goal 2 - Reduce customer-found defects
Goal 3- Reduce total outstanding defects (backlog) in a timely manner
Goal 4 - Increase software release clarity and feature consistency
Goal 5 - Provide feature and maintenance releases with predictable schedules and quality
5© 2001, Cisco Systems, Inc.
Goals of IOS RepackagingGoals of IOS Repackaging
• Simplify software selection
process
• Eliminate massive feature
set confusion
• Reduce internal cost
• Simplify software selection
process
• Eliminate massive feature
set confusion
• Reduce internal cost
6© 2001, Cisco Systems, Inc.
The Legacy - circa 1996 The Legacy - circa 1996
B e g a n s i m p l y ...
IP
Enterprise (Includes
Desktop and IP)
Desktop (includes IP)
F u
n c
t i
o n
a l
i t
y
• 37 Feature sets and 2500 images
SNA
VOICE
A
T
M
CRYPTO
FW
PLUS
7© 2001, Cisco Systems, Inc.
IOS Revenue by Feature SetsIOS Revenue by Feature SetsIOS Revenue by Feature Sets(Based on # of systems shipped
0%
10%
20%
30%
40%
50%
60%
70%
80%
Less than 10%of Feature sets count for
90%of revenue (platforms: C800, C1600, C1700, C2500, C2600, C3600, C5x00, & C7x00)
IP
IP+
9© 2001, Cisco Systems, Inc.
IOS Technology Packaging
3 Programs
IOS Technology Packaging
3 Programs
• “Jenny Craig” - streamline IOS code by deprecating older legacy protocols no longer in use
• “IOS Reformation” – Realign IOS to today’s market needs & simplify image selection process
• “IOS Inquisition” - End of life older images which are business justified – about 60%
10© 2001, Cisco Systems, Inc.
AgendaAgenda
• Cust.Sat Survey / Quality Initiatives
• High Availability / Resilient IP
• Cust.Sat Survey / Quality Initiatives
• High Availability / Resilient IP
11© 2001, Cisco Systems, Inc.
The High Costs of Downtime
The High Costs of Downtime
• The average downtime costs incurred in the past 12 months: $21.6 Million
Ranges from $500,000 to $298M
Equates to an average of $2,169 per minute
• % having experienced downtime costs in the past 12 months: 98%
• The average downtime costs incurred in the past 12 months: $21.6 Million
Ranges from $500,000 to $298M
Equates to an average of $2,169 per minute
• % having experienced downtime costs in the past 12 months: 98%
Source: Sage Research, Aug. 2001
12© 2001, Cisco Systems, Inc.
High Availability means five 9’s or more
High Availability means five 9’s or more
AvailabilityAvailability Downtime Per Year Downtime Per Year
99.900%99.900%99.950%99.950%99.990%99.990%99.999%99.999%
99.9999%99.9999%
8 Hours8 Hours4 Hours4 Hours
53 Minutes53 Minutes5 Minutes5 Minutes
30 Seconds30 Seconds
46 Minutes46 Minutes23 Minutes23 Minutes
DPMDPM
10 10 100 100
10001000
1 1
500 500
High Availability means an average end user will experience less than five minutes down time per year
High Availability means an average end user will experience less than five minutes down time per year
What Is High Availability?
Carrier Class Means High Availability
13© 2001, Cisco Systems, Inc.
How is Availability Calculated?
How is Availability Calculated?
• Availability (%) is calculated by
tabulating end user outage time, typically
on a monthly basis.
• Some customers prefer to use
DPM (Defects Per Million) to represent
network availability.
• Availability (%) is calculated by
tabulating end user outage time, typically
on a monthly basis.
• Some customers prefer to use
DPM (Defects Per Million) to represent
network availability.
14© 2001, Cisco Systems, Inc.
Unscheduled DowntimeUnscheduled Downtime
2,6
5,5
7,6
8,3
8,5
11
11,3
12
12,1
26
0 5 10 15 20 25 30
Malicious damage
Acts of nature
Physical environment failures
Congestion/overload
Network software failures
Network operations failures
Unknown
Network hardware failures
Customer premises equipment failure
Physical link failures
hours
15© 2001, Cisco Systems, Inc.
Scheduled DowntimeScheduled Downtime
0 5 10 15 20 25 30
8.4
15
15.1
16.7
17.4
23.7
23.9
Other
Device maintenance
Device replacement
New device installation
Site relocation
Parts replacement
Software upgrade
hours
16© 2001, Cisco Systems, Inc.
The Edge is the Most Vulnerable
The Edge is the Most Vulnerable
• The Core is redundant enough
to disguise failures.
• The Edge is a Single Point of
Failure.
• The Edge is what the customer
sees.
17© 2001, Cisco Systems, Inc.
The Edge is the Most Vulnerable
The Edge is the Most Vulnerable
Failures here may affect thousands of customers
Failures here may affect thousands of customers
To Customers
18© 2001, Cisco Systems, Inc.
Phase 1 Target
Phase 2 Target
Components of DowntimeDetect failure
Switchover to redundant RP or Relaod RPReload image, parse config, identify LC in router
Final Initialization, take control of bus
Reload LC image
Converge route table and inform LC of new forwarding information
Restored
Restore connectivity (I.e. Frame Relay, PPP, etc)
Relative Time
COMPONENTS
19© 2001, Cisco Systems, Inc.
Delivering HA Features in Phases
Phase 1• c7500 SLCR• Reduce RP failover time (RPR/RPR+)
• Fast S/W Upgrade• Faster FR recovery
Phase 1• c7500 SLCR• Reduce RP failover time (RPR/RPR+)
• Fast S/W Upgrade• Faster FR recovery
Phase 2•Non StopForwarding
•(BGP, OSPF, ISIS) Stateful Switchover
•(cHDLC, PPP, ATM, FR)
Phase 2•Non StopForwarding
•(BGP, OSPF, ISIS) Stateful Switchover
•(cHDLC, PPP, ATM, FR)
Single Line Card Reload (SLCR)
Route Processor Redundancy (RPR)
Reduce MTTR Maintain Sessions Planned Outages
Phase 4•In Service Software Upgrades
Phase 4•In Service Software Upgrades
Phase 3
•Additional protocol support (EIGRP, MLPPP, MPLS, IPv6,TBD)
•Additional platform support (c6500/C7600)
Phase 3
•Additional protocol support (EIGRP, MLPPP, MPLS, IPv6,TBD)
•Additional platform support (c6500/C7600)
DeliveredEFT
20© 2001, Cisco Systems, Inc.
Initial Supported PlatformsInitial Supported Platforms
• Phases 1 & 2
Cisco 12000
Cisco 10000 ESR
Cisco 7500
• Phase 3
Cisco 6500/7600
• Future
C7300, AS5850, MGX8850, C10000ubr
• Phases 1 & 2
Cisco 12000
Cisco 10000 ESR
Cisco 7500
• Phase 3
Cisco 6500/7600
• Future
C7300, AS5850, MGX8850, C10000ubr
21© 2001, Cisco Systems, Inc.
RPR+ EvolutionRPR+ Evolution
• High System Availability (HSA): Two RPs – if Active RP fails the system reboots and theStandby becomes active
• RPR: Two RPs, Standby becomes active very quickly. However, line cards are reloaded.
• High System Availability (HSA): Two RPs – if Active RP fails the system reboots and theStandby becomes active
• RPR: Two RPs, Standby becomes active very quickly. However, line cards are reloaded.
22© 2001, Cisco Systems, Inc.
RPR+ EvolutionRPR+ Evolution
• RPR+: Two RPs, Standby
becomes active very quickly
and without reloading line
cards.
• RPR+ is a stepping stone for
SSO and NSF
• RPR+: Two RPs, Standby
becomes active very quickly
and without reloading line
cards.
• RPR+ is a stepping stone for
SSO and NSF
23© 2001, Cisco Systems, Inc.
Stateful Switchover (SSO)
• RPR+ Maintains link state
Session state (I.e. Frame Relay,
PPP, ATM, MPLS) is lost during
RP switchover. Resulting in
“dropped calls” and time to
re-establish connections.
24© 2001, Cisco Systems, Inc.
Stateful Switchover (SSO)
• Stateful Switchover passes
state information from the
Active RP to the Standby RP.
Resulting in maintaining
sessions during a RP
switchover.
25© 2001, Cisco Systems, Inc.
SSO Protocol SupportSSO Protocol Support
• InitialPPP, cHDLC, ATM, Frame Relay
• Now being developedMLPPPMPLS VPN and TE
• PlannedMulticastLooking for input
• InitialPPP, cHDLC, ATM, Frame Relay
• Now being developedMLPPPMPLS VPN and TE
• PlannedMulticastLooking for input
26© 2001, Cisco Systems, Inc.
NSF Protocols SupportNSF Protocols Support
• Initially OSPF and BGP
• Immediately after IS-IS
• Then EIGRP (for initial C6500
support but will support
appropriate router platforms
as well)
• Initially OSPF and BGP
• Immediately after IS-IS
• Then EIGRP (for initial C6500
support but will support
appropriate router platforms
as well)
27© 2001, Cisco Systems, Inc.
StandardsStandards
• All work has been submittedto the IETF
• ISIS - draft-shand-isis-restart-00.txt
• BGP - draft-ietf-idr-restart-01.txt
• All work has been submittedto the IETF
• ISIS - draft-shand-isis-restart-00.txt
• BGP - draft-ietf-idr-restart-01.txt
28© 2001, Cisco Systems, Inc.
Find it on the WebLearn More About HA
Find it on the WebLearn More About HA
High System Availability (HAS @ C7500):http://www.cisco.com/warp/public/cc/pd/rt/7500/prodlit/haibd_ov.htm
http://www.cisco.com/univercd/cc/td/doc/product/software/ios120/12cgcr/fun_c/fcprt3/fc_hsa.htm
http://www.cisco.com/warp/partner/synchronicd/cc/pd/iosw/iore/iore111/prodlit/hsa1_in.htm
Whitepaper on High Availability on Cat6k:http://www.cisco.com/warp/partner/synchronicd/cc/pd/si/casi/ca6000/tech/hafc6_wp.htm
High Availability @ the Edge (C10000):http://www.cisco.com/warp/partner/synchronicd/cc/pd/rt/10000/prodlit/c1hae_wp.htm
Route Processor Redundancy Plus (C12000):http://www.cisco.com/univercd/cc/td/doc/product/software/ios120/120newft/120limit/120st/120st17/rpr_plus.htm