Top Banner
Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie RIPE 51 – Amsterdam
25

Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

Sep 20, 2018

Download

Documents

phamnhan
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

Hot Potatoes Heat Up BGP Routing

Renata TeixeiraLaboratoire d’Informatique de Paris 6

Université Pierre et Marie Curie

RIPE 51 – Amsterdam

Page 2: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

2RIPE 51

Internet Routing Architecture

UCSDSprint

AT&T Verio

AOL

interdomain routing (BGP)

intradomain routing (IGP)Most common: OSPF,IS-IS

User

Web Server

Changes in one AS may impact traffic

and routing in other ASes

Page 3: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

3RIPE 51

Interaction between IGP and BGP

San Francisco

Dallas

New York

ISP network

9 10

destination prefixmultiple egress points

Hot-potato routing = select closest egress point when there is more than one route to destination

Page 4: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

4RIPE 51

Impact of Internal Routing Changes

San Francisco

Dallas

New York

ISP network

destination prefix

9 10- failure- planned maintenance- traffic engineering

11

Routes to thousands of prefixes switch

egress points!!!Consequences:

Transient forwarding instabilityTraffic shiftInterdomain routing changes

11

Page 5: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

5RIPE 51

Outline

Measurement methodologyCollection of OSPF and BGP data of AT&TIdentification of hot-potato routing changes

BGP impact Traffic impactMinimizing hot-potato disruptions

Page 6: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

6RIPE 51

Collecting Input Data

AT&T backbone

Replay routing decisions from vantage point A and B to identify hot-potato changes

OSPF Monitor

OSPF messages

Monitor de flooding oflink-state advertisements

BGP monitor

BGP updates

AB

Monitor updates from nine vantage points

Page 7: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

7RIPE 51

Algorithm for Correlating Routing Changes

Compute distance changesGroup OSPF messages close in timeCompute distance changes from each vantage point

Compute distance changesGroup OSPF messages close in timeCompute distance changes from each vantage point

Classify BGP changes by possible OSPF cause Group updates close in timeCompare old and new route according to decision process

Compute distance changesGroup OSPF messages close in timeCompute distance changes from each vantage point

Classify BGP changes by possible OSPF cause Group updates close in timeCompare old and new route according to decision process

Determine causal relationshipConsistent BGP and OSPF changes Close in time

SF

Dallas

NY

9 10

BGP update: SF → NY

?

Page 8: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

8RIPE 51

Outline

Measurement methodologyBGP impact

How often do hot-potato changes happen? Which fraction of prefixes do they affect?

Traffic impactMinimizing hot-potato disruptions

Page 9: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

9RIPE 51

BGP Impact of an OSPF Change

router Arouter B

Vast majority of OSPF changeshave no impact on these routers

… but few havea very big impact

Page 10: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

10RIPE 51

Variation across Routers

NY

109SF

A

dst

NY

10001SF

dst

B

Small changes will make router Aswitch egress points to dst

More robust to intradomainrouting changes

Significance of hot-potato routing depends on network design and router location.

Page 11: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

11RIPE 51

Outline

Measurement methodologyBGP impact Traffic impact

How long are convergence delays?What is the impact in the traffic matrix?

Minimizing hot-potato disruptions

Page 12: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

12RIPE 51

Delay for BGP Routing Change

Steps between OSPF change and BGP updateOSPF message flooded through the network (t0)OSPF updates distance informationBGP decision process rerun (timer driven)BGP update sent to another router (t)• First BGP update sent (t1) BGP monitor

OSPF monitor

time for BGP to revisit its decision

t0 t1 t time

Metricstime to updateother prefixes

Page 13: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

13RIPE 51

BGP Reaction Time

uniform 5 – 80 sec

Transfer delay

First BGP updateAll BGP updates

Worst case scenario:0 – 80 sec to revisit BGP decision50 – 110 sec to send multiple updates

Last prefix may take 3 minutes to converge!

Page 14: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

14RIPE 51

Transient Data Disruptions

R1 R2

dst

10

100 10111

E1 E2

Disastrous for interactive applications (VoIP, gaming, web)

2 – R2 starts using E1 to reach dst1 – BGP decision process runs in R2

R1 R2

dst

10

100 10111

E1 E2

3 – R1’s BGP decision can take up to 60 seconds to run

Packets to dst may be caught in a loop

for 60 seconds!

2 – R2 starts using E1 to reach dst1 – BGP decision process runs in R2

Page 15: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

15RIPE 51

Challenges for Active Measurements

R210

R1 R2

dst

10

100 10111

E1 E2

Problem: Single-homed probe machinesProbes do not experience the loopProbes do not illustrate the customer experience

P1 P2

customer traffic in loop

Operator probes

Page 16: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

16RIPE 51

Traffic ShiftsM

B p

er s

econ

d

minutes

i

e1e2

p

load variation (∆L) decreasein traffic

routing shift (∆R)

∆TM(i,e1,t) = TM(i,e1,t) - TM (i,e1,t-1) ∆TM(i,e1,t) = ∆L(i,e1,t) + ∆R(i,e1,t) ∆L(i,e1,t) : variation of traffic that still uses e1∆R(i,e1,t) : traffic that moved to e1 – moved out of e1

Page 17: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

17RIPE 51

Large Shifts Caused by Routing Changes

∆R

rela

tive

to n

orm

al v

aria

tions

∆TM relative to normal variations

∆TM caused by load

∆TM caused by routing

Vast majority (99.88%)of ∆TM ∈ [-4,4]

routing shift 70 times normal variations

Page 18: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

18RIPE 51

Hot-potato vs. External BGP Routing Changes

∆TM relative to normal variations

Hot-potatoeBGP

CD

F Hot-potato changes havebiggest impact on TM

Page 19: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

19RIPE 51

Summary of Measurement Analysis

Convergence can take minutesForwarding loops, leads to packet loss and delayFixes: event-driven implementations or tunnels

Frequency of hot-potato changes depends on locationOnce a week on average for more affected routers

Internal events can have big impactSome events affect over half of a BGP tableResponsible for largest traffic variations

ImplicationsEnd users: Transient disruptions and new

end-to-end path characteristicsNetwork administrators: Instability in the traffic matrix

Page 20: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

20RIPE 51

Outline

Measurement methodologyBGP impact Traffic impactMinimizing hot-potato disruptions

What can operators do today?

Page 21: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

21RIPE 51

What can operators do today?

Network designDesign networks that minimize hot-potato changesImplement a fixed ranking of egress points (e.g., MPLS tunnels injected in IGP)

MaintenancePlan maintenance activities considering the impact of changes on BGP routes

MonitoringDeploy measurement infrastructure that captures disruptions caused by hot-potato routing

Page 22: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

22RIPE 51

Comparison of Network Designs

NY

10SF

Dallas

NY

1000SF

dstdst

Dallas

Small changes will make Dallas switch egress points to dst More robust to intradomain

routing changes

NY

109

SF

Dallas

dst

910

19

Page 23: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

23RIPE 51

Careful Cost in/out Links

dstNY

SF

Dallas

10

1010

5

5100

4

Page 24: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

24RIPE 51

Conclusion

Hot-potato routing is too disruptiveSmall changes inside an AS can lead to big disruptions on

BGP and transit traffic

In addition, hot potato is…Too restrictive: Egress selection mechanism dictates a policyToo convoluted: IGP metrics determine BGP egress selection

Introduce more flexible egress selection mechanismTIE: Tunable Interdomain Egress selection

Page 25: Hot Potatoes Heat Up BGP Routing - meetings.ripe.net · Hot Potatoes Heat Up BGP Routing Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie

25RIPE 51

More Infohttp://rp.lip6.fr/~teixeira

BGP impactR. Teixeira, A. Shaikh, T. Griffin, and J. Rexford, “Dynamics of Hot-Potato Routing in IP networks”, in proceedings of ACM SIGMETRICS, June 2004.

Traffic impactR. Teixeira, N. Duffield, J. Rexford, and M. Roughan, “Traffic Matrix Reloaded: Impact of Routing Changes”, in proceedings of PAM, March 2005.

Model of network sensitivity to IGP changesR. Teixeira, T. Griffin, A. Shaikh, and G.M. Voelker, “Network Sensitivity to Hot-Potato Disruptions”, in proceedings of ACM SIGCOMM, August 2004.

New egress selection mechanismR. Teixeira, T. Griffin, M. Resende, and J. Rexford, “TIE Breaking: Tunable Interdomain Egress Selection”, in proceedings of CoNext, October 2005.