Top Banner
Semi-Oblivious Trac Engineering: The Road Not Taken Praveen Kumar (Cornell) Yang Yuan (Cornell) Chris Yu (CMU) Nate Foster (Cornell) Robert Kleinberg (Cornell) Petr Lapukhov (Facebook) Chiun Lin Lim (Facebook) Robert Soule (USI Lugano)
45

Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Jul 18, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Semi-Oblivious Traffic Engineering: The Road Not Taken

Praveen Kumar (Cornell) Yang Yuan (Cornell)

Chris Yu (CMU) Nate Foster (Cornell)

Robert Kleinberg (Cornell) Petr Lapukhov (Facebook) Chiun Lin Lim (Facebook) Robert Soule (USI Lugano)

Page 2: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

WAN Traffic Engineering

Page 3: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

WAN Traffic EngineeringObjectives Challenges

Gbps

Performance Robustness

Latency Operational simplicity

Page 4: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

WAN Traffic EngineeringObjectives Challenges

Gbps

Performance Robustness

Latency Operational simplicity

Unstructured topology

Unexpected failures

Misprediction & Traffic Bursts

Heterogeneous capacity

Update overheads

Device limitations

Page 5: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

TE ApproachesTraditional Distributed

SDN-Based Centralized

1

1

100

1

1

1

1

1

1

1

Page 6: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

TE ApproachesTraditional Distributed

SDN-Based Centralized

1

1

100

1

1

1

1

1

1

1100

Page 7: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

TE ApproachesTraditional Distributed

SDN-Based Centralized

Optimal TE?(MCF)

1

1

100

1

1

1

1

1

1

1100

Page 8: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Operational Cost of OptimalitySolver Time

Page 9: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Operational Cost of OptimalityPath Churn

Page 10: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Towards a Practical ModelTopology

(+ demands)

Path Selection

Rate Adaptation

Paths

Splitting Ratio

Demands

Page 11: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Towards a Practical ModelTopology

(+ demands)

Path Selection

Rate Adaptation

Paths

Splitting Ratio

Demands

Computing and updating

paths is typically expensive and

slow.

But updating splitting ratios is cheap and fast!

Page 12: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Towards a Practical ModelTopology

(+ demands)

Path Selection

Rate Adaptation

Paths

Splitting Ratio

Demands

Computing and updating

paths is typically expensive and

slow.

But updating splitting ratios is cheap and fast!

Static

Dynamic

Page 13: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection Challenges

• Selecting a good set of paths is tricky!

• Route the demands (ideally, with competitive latency)

• React to changes in demands (diurnal changes, traffic bursts, etc.)

• Be robust under mis-prediction of demands

• Have sufficient extra capacity to route demands in presence of failures

• …

Page 14: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

ApproachA static set of cleverly-constructed paths can

provide near-optimal performance and robustness!

Desired path properties:

• Low stretch for minimizing latency

• High diversity for ensuring robustness

• Good load balancing for performance • Capacity aware

• Globally optimized{

Page 15: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Capacity Aware

• Traditional approaches to routing based on shortest paths (e.g., ECMP, KSP) are generally not capacity aware

C

B

A

G E

F

D

100 Gbps10 Gbps

Page 16: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Capacity Aware

• Traditional approaches to routing based on shortest paths (e.g., ECMP, KSP) are generally not capacity aware

C

B

A

G E

F

DA

C

B

100 Gbps10 Gbps

Page 17: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Globally OptimalOther approaches based on greedy algorithms are

capacity aware, but are still not globally optimal

C

B

A

G E

F

D

Globally optimalCSPF

Page 18: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Globally OptimalOther approaches based on greedy algorithms are

capacity aware, but are still not globally optimal

C

B

A

G E

F

DA

Globally optimalCSPF

Page 19: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Globally OptimalOther approaches based on greedy algorithms are

capacity aware, but are still not globally optimal

C

B

A

G E

F

DA

B

Globally optimalCSPF

Page 20: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Globally OptimalOther approaches based on greedy algorithms are

capacity aware, but are still not globally optimal

C

B

A

G E

F

DA

C

B

Globally optimalCSPF

Page 21: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Properties: Globally OptimalOther approaches based on greedy algorithms are

capacity aware, but are still not globally optimal

C

B

A

G E

F

DA

C

B

C

B

A

G E

F

DA

C

B

Globally optimalCSPF

Page 22: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection

AlgorithmLoad balanced

Diverse Low-stretchCapacity aware

Globally Optimized

SPF / ECMP ❌ ❌ ❌ ✔

CSPF ✔ ❌ ❌ ✔

k-shortest paths ❌ ❌ ? ✔

Edge-disjoint KSP ❌ ❌ ✔ ✔

MCF ✔ ✔ ❌ ❌

VLB ❌ ❌ ✔ ❌

B4 ✔ ✔ ❌ ?

Page 23: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection

AlgorithmLoad balanced

Diverse Low-stretchCapacity aware

Globally Optimized

SPF / ECMP ❌ ❌ ❌ ✔

CSPF ✔ ❌ ❌ ✔

k-shortest paths ❌ ❌ ? ✔

Edge-disjoint KSP ❌ ❌ ✔ ✔

MCF ✔ ✔ ❌ ❌

VLB ❌ ❌ ✔ ❌

B4 ✔ ✔ ❌ ?

Page 24: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection

AlgorithmLoad balanced

Diverse Low-stretchCapacity aware

Globally Optimized

SPF / ECMP ❌ ❌ ❌ ✔

CSPF ✔ ❌ ❌ ✔

k-shortest paths ❌ ❌ ? ✔

Edge-disjoint KSP ❌ ❌ ✔ ✔

MCF ✔ ✔ ❌ ❌

VLB ❌ ❌ ✔ ❌

B4 ✔ ✔ ❌ ?

Page 25: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection

AlgorithmLoad balanced

Diverse Low-stretchCapacity aware

Globally Optimized

SPF / ECMP ❌ ❌ ❌ ✔

CSPF ✔ ❌ ❌ ✔

k-shortest paths ❌ ❌ ? ✔

Edge-disjoint KSP ❌ ❌ ✔ ✔

MCF ✔ ✔ ❌ ❌

VLB ❌ ❌ ✔ ❌

B4 ✔ ✔ ❌ ?

Page 26: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Oblivious Routing

Page 27: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

VLB

• Route through random intermediate node

• Works well for mesh topologies

• WANs are not mesh-like

• Good resilience

• Poor performance & latency

Mesh

3

21

N

4

Page 28: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

VLB

• Route through random intermediate node

• Works well for mesh topologies

• WANs are not mesh-like

• Good resilience

• Poor performance & latency

Mesh

3

21

N

4

Page 29: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Not Mesh

VLB

• Route through random intermediate node

• Works well for mesh topologies

• WANs are not mesh-like

• Good resilience

• Poor performance & latency

Page 30: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Not Mesh

VLB

• Route through random intermediate node

• Works well for mesh topologies

• WANs are not mesh-like

• Good resilience

• Poor performance & latency

Page 31: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Oblivious [Räcke ‘08]

• Generalizes VLB to non-mesh

• Distribution over routing trees

• Approximation algorithm for low-stretch trees [FRT ’04]

• Penalize links based on usage

• O(log n) competitive

Not Mesh

Low-stretch routing trees

Page 32: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Oblivious [Räcke ‘08]

• Generalizes VLB to non-mesh

• Distribution over routing trees

• Approximation algorithm for low-stretch trees [FRT ’04]

• Penalize links based on usage

• O(log n) competitive

Not Mesh

Low-stretch routing trees

Page 33: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Path Selection

AlgorithmLoad balanced

Diverse Low-stretchCapacity aware

Globally Optimized

SPF / ECMP ❌ ❌ ❌ ✔

CSPF ✔ ❌ ❌ ✔

k-shortest paths ❌ ❌ ? ✔

Edge-disjoint KSP ❌ ❌ ✔ ✔

MCF ✔ ✔ ❌ ❌

VLB ❌ ❌ ✔ ❌

B4 ✔ ✔ ❌ ?

SMORE / Oblivious ✔ ✔ ✔ ✔

Page 34: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

SMORE: Semi-Oblivious Routing

Oblivious Routing computes a set of paths which are low-stretch, robust and have good load balancing properties

LP Optimizer balances load by dynamically adjusting splitting ratios used to map incoming traffic flows to paths

Path Selection

Rate Adaptation

Page 35: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Semi-Oblivious Routing in Practice?

• ▼ Previous work [Hajiaghayi et al.] established a worst-case competitive ratio that is not much better than oblivious routing: Ω(log(n)/log (log(n)))

• But the real-world does not typically exhibit worst-case scenarios

• e.g., there is an correlation between demands and link capacities as network designs evolve

• Question: How well does semi-oblivious routing perform in practice?

Page 36: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Evaluation

Page 37: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Facebook’s WAN• Overview

• Common network design for content providers

• Several large data centers (DCs) and points-of-presence (PoPs)

• Mix of latency-sensitive customer traffic + background elastic traffic

• Method

• Collected accurate snapshot of network state - topology, TMs, etc.

• Simulations to study performance characteristics

Page 38: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

TE Systems - Comparison

• OSPF

• ECMP

• CSPF

• MCF

• Omniscient MCF (“Optimal”)

• …

• Oblivious [STOC ’08]

• VLB [INFOCOM ‘08]

• Robust MCF [SIGMETRICS ‘11]

• KSP + MCF [SIGCOMM ’13]

• FFC* [SIGCOMM ’15]

• …

Traditional Contemporary

Open-source implementations at http://github.com/cornell-netlab/yates

Page 39: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Performance

Page 40: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Robustness

Path budget = 4

Page 41: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Operational Constraints - Path Budget

4-8xOptimal

SMOREMCF KSP+MCFR-MCF

Page 42: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Large Scale Simulations

• Conducted larger set of simulations on Internet Topology Zoo

• 30 topologies from ISPs and content providers

• Multiple traffic matrices (gravity model), failure models and operational conditions

Page 43: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Do these results generalize?Yes*

Probability of achieving SLA

Throughput

Page 44: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Takeaways• Path selection plays an outsized role in the performance of TE systems

• Semi-oblivious TE meets the competing objectives of performance and robustness in modern networks

• Oblivious routing for path selection + Dynamic load-balancing

• Ongoing and future-work:

• Apply to other networks (e.g. non-Clos DC topologies)

• SR-based implementations and deployments

Page 45: Semi-Oblivious Traffic Engineering - Cornell Universitypraveenk/talks/2018-04-SMORE... · 2018-04-10 · Semi-Oblivious Traffic Engineering: The Road Not Taken Praveen Kumar (Cornell)

Thank You!

Bobby Kleinberg Cornell

Robert Soule Lugano

Nate Foster Cornell

Petr Lapukhov Facebook

Chiun Lin Lim Facebook

Chris Yu CMU

Yang Yuan Cornell

https://github.com/cornell-netlab/yates

SMORE: Oblivious routing + Dynamic rate adaptation