Top Banner
Department of Computer Science Empirical Evaluation of Latency-Sensitive Application Performance in the Cloud Sean Barker and Prashant Shenoy University of Massachusetts Amherst
26

Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

Jun 18, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

Department of Computer Science

Empirical Evaluation ofLatency-Sensitive Application

Performance in the Cloud

Sean Barker and Prashant ShenoyUniversity of Massachusetts Amherst

Page 2: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Cloud Computing

! Cloud platforms built with data centers: large-scale, concentrated servers clusters• Machines rented out to

companies or individuals• Hosting for arbitrary applications• May supplement local resources

! Cheap enough to rent machines by the hour

2

Type CPUs Memory Disk Cost/hr

Small 1 1.7 GB 160 GB $0.085

Large 4 7.5 GB 850 GB $0.34

XL 8 15 GB 1690 GB $0.68

Current prices on Amazon Elastic Compute Cloud (EC2)

Page 3: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Multimedia Cloud Computing Scenarios

! Clouds designed primarily for web & e-commerce apps, but may also be used for multimedia

! Rent game server for an evening• No firewall or bandwidth issues, only a few dollars

! Rent high-CPU machines for HD video transcoding• Home PC may take several hours to transcode one video,

cloud can transcode many in a fraction of this time

! Rent servers for webcast of live event• Large, inexpensive temporary bandwidth allocation

3

Page 4: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

! Data center servers are typically well-equipped• Providers share individual

machines machines among multiple users

! Example: one user runs game server, another runs high-performance database on same machine

! Multimedia has unique performance requirements• Low latency games, low jitter & high bandwidth streaming

! Are cloud platforms designed for conventional web applications suitable for multimedia?

University of Massachusetts Amherst - Department of Computer Science

Resource Sharing in the Cloud

4

8 GB RAM

Core 1

Core 2

Core 3

Core 4

1000 GB Disk

1000 GB Disk

4 GB RAM

Core 1

Core 2

Core 3

Core 4

1000 GB Disk

1000 GB Disk

4 GB RAM

Page 5: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Outline

! Motivation

! Virtualized clouds

! Amazon EC2 study

! Laboratory cloud study

! Real world multimedia case studies

! Related work & conclusions

5

Page 6: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Virtualized Clouds

! Cloud platforms are virtualized data centers! Virtualization facilitates machine distribution

among multiple users with virtual machines (VMs)

6

VM

Hardware

VM VM

Game Server

Web Server

Media Server

Customer A

Users

Customer C

Customer B

Page 7: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

! Each VM is assigned slice of physical resources! VM access to hardware managed by hypervisor• Enforces limits and isolates VMs from each other

! Are these resource sharing mechanisms suitable for the timeliness constraints of multimedia?

VM VM VM

AppA

App C

Users

App B

Hardware

Hypervisor

University of Massachusetts Amherst - Department of Computer Science

Virtual Machine Isolation

8

resourcestarvation

Hypervisor

VM VM VM

App A

Users

Hardware

App B App C

Page 8: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Outline

! Motivation

! Virtualized clouds

! Amazon EC2 study

! Laboratory cloud study

! Real world multimedia case studies

! Related work & conclusions

9

Page 9: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

EC2 Study – Overview

! Amazon Elastic Compute Cloud (EC2)• Popular virtualized cloud platform

! Unknown applications coexisting on machine• No control over VM placement

! Goal: evaluate performance with unknown background server load

! Methodology: measured CPU, disk, and network consistency over period of days

10

Page 10: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

EC2 CPU Performance

0

200

400

600

800

1000

1200

1400CPU time (ms)

Time (5 minute intervals)

EC2Local

11

• Volatility on EC2 vs stability on dedicated server

2.5x average outliers:

1.5-2x avg

no competing VMs: no outliers

Page 11: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

EC2 Disk Performance

0

10000

20000

30000

40000

50000

60000

70000

80000

90000Long write time (ms)

Time (5 minute intervals)

EC2Local

12

• Similarly: inconsistent EC2 disk performance

widely fluctuatingdisk performance

Page 12: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

EC2 Network Latency (LAN)

0

50

100

150

200

250First three hops latency (ms)

Time (5 minute intervals)

13

• Latency variations in EC2 LAN

Page 13: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

EC2 Study – Summary

! Performance variations observed on EC2• Not observed on local server running a single VM

! Can only speculate on causes without access to the hypervisor

! Need to experiment on a controlled platform similar to Amazon’s

14

Page 14: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Laboratory Cloud Study – Overview

! Local cloud running the Xen hypervisor• Same virtualization technology used by EC2• Advantage: local cloud gives us control of interference

! Built-in mechanisms for sharing hardware between VMs• CPU credit scheduler• Round-robin disk servicing• Linux-level tool tc for network sharing

! How well do these tools isolate background work?

! Methodology: evaluated performance impact of competing VM

15

Page 15: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

CPU Performance with Background Load

0

50

100

150

200CPU time (ms)

Time (5 second intervals)

16

• Default 1 to 1 sharing with variable background load

No background work: VM gets 100% CPU

Max background work: VM gets 50% CPU

Page 16: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Disk Performance with Background Load

0

20

40

60

80

100

1 2 3 4 8

Performance Impact (%)

Disk Thread Pairs on Collocated VM

Fair ShareSmall Read

Small WriteRead Throughput

Write Throughput

17

• Degraded by half over ‘fair’, but stable with increasing load

‘unfair’ impact

Page 17: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Laboratory Cloud Study – Summary

! Significant interference possible from background VMs

! Xen configuration can guarantee share of CPU• Default settings allow fluctuation in shared CPU

! Disk sharing less fair and harder to control• Consistent with observed EC2 behavior

! Network sharing effects evaluated in case studies on laboratory cloud (next)

18

Page 18: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Case Study 1 – Doom 3 Game Server

! Multiplayer Doom 3 game server

! Introduced controlled interference as before

! Measured map load times and server latency

! Network sharing configuration via tc:• Idle: No bandwidth usage by resource-hog VM• Off (default): No rate-limiting, network free-for-all• Shared: 50% (min) to 100% (max) of bandwidth per VM• Dedicated: 50% (max) of bandwidth per VM

19

Page 19: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Game Server Map Load

0

1000

2000

3000

4000

5000

Idle Disk CPU Disk + CPU

Average Server Load Time (ms)

Collocated VM Activity

20

• Interference produces up to 50% degradation

Page 20: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Game Server Latency

21

! Server crippled without bandwidth controls (tc off)

! Dedicated vs shared bandwidth:• Dedicated: lower latency, higher jitter• Sharing: higher latency, lower jitter

Configuration Avg. Latency (ms)

Std. Deviation (jitter) Timeouts

No interference 8.1 10.2 0%

tc off (free-for-all) N/A N/A 100%

tc, sharing b/w 33.9 16.9 2%

tc, dedicated b/w 23.6 29.6 7%

Page 21: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Case Study 2 – Darwin Streaming Server

! Streaming video to multiple clients

! Introduced controlled interference as before

! Measured sustained streaming bandwidth and stream jitter (latency variation)

! Varied tc settings and number of clients• Max video stream rate of 1 Mbps per client

22

Page 22: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Streaming Server Bandwidth

0

200

400

600

800

1000

idle (fair) off shared dedicated

average bitrate per stream (kbps)

tc sharing type

4 streams8 streams

23

• both tc configurations recovered bandwidth

decreased stream quality

Page 23: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Streaming Server Jitter

0

2

4

6

8

10

12

14

16

idle (fair) off shared dedicated

average stream jitter (ms)

tc sharing type

4 streams8 streams

24

• Jitter improved by shared, but worsened by dedicated

Page 24: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Real World Case Studies – Summary

! Real applications show substantial impacts from background interference

! Network is particularly vulnerable without administrative controls

! Proper configuration is important• CPU and network isolation tools fairly well-developed• Disk isolation needs better mechanisms

25

Page 25: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Related Work

! Fair-share schedulers and quality-of-service• Nieh and Lam (SOSP ‘97) for multimedia• Sundaram et al. (ACM MM ‘00) for QoS-aware OS

! Virtualization and hypervisors• Xen, VMware ESX Server

! Improving performance isolation• Gupta et al. (Middleware ‘06) for Xen mechanisms

! We focus on evaluation of existing mechanisms with specific attention to multimedia

26

Page 26: Empirical Evaluation of Latency-Sensitive Application ...lass.cs.umass.edu/slides/2010/mmsys10.pdf · Game Server Latency 21! Server crippled without bandwidth controls (tc off)!

University of Massachusetts Amherst - Department of Computer Science

Conclusions

! Clouds exhibit performance variations• Applications with timeliness requirements are

particularly sensitive

! Appropriate hypervisor configuration can help• In some cases, prevents resource starvation• Some resource sharing mechanisms need improvement

! Future work: evaluation of non-Xen platforms

! Questions?

27