Top Banner
Distribution – Part Distribution – Part II II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:
42

Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

Dec 30, 2015

Download

Documents

Rosalyn Fields
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

Distribution – Part IIDistribution – Part II

13/10 – 2003

INF5070 – Media Storage and Distribution Systems:

Page 2: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Type IV – Distribution Systems Combine

Types I, II or III Network of servers

Server hierarchy Autonomous servers Cooperative servers Coordinated servers

“Proxy caches” Not accurate … Cache servers

Keep copies on behalf of a remote server

Proxy servers Perform actions on behalf

of their clients

Page 3: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Type IV – Distribution Systems Combine

Types I, II or III Hierarchically organized

servers Server hierarchy

Autonomous servers Cooperative servers Coordinated servers

“Proxy caches” Not accurate … Cache servers

Keep copies on behalf of a remote server

Proxy servers Perform actions on behalf

of their clients

Page 4: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Type IV – Distribution Systems Combine

Types I, II or III Hierarchically organized

servers Server hierarchy

Autonomous servers Cooperative servers Coordinated servers

“Proxy caches” Not accurate … Cache servers

Keep copies on behalf of a remote server

Proxy servers Perform actions on behalf

of their clients

Page 5: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Type IV – Distribution Systems

Variations Gleaning

Autonomous, coordinated possible In komssys

Proxy prefix caching Coordinated, autonomous possible In Blue Coat (which was formerly Cacheflow, which was formerly Entera)

Period multicasting with pre-storage Coordinated The theoretical optimum

Page 6: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Gleaning Webster’s Dictionary: from Late Latin glennare, of Celtic origin

1. to gather grain or other produce left by reapers2. to gather information or material bit by bit

Combine patching with caching ideas Non-conflicting benefits of caching and patching

Caching reduce number of end-to-end transmissions distribute service access points no single point of failure true on-demand capabilities

Patching shorten average streaming time per client true on-demand capabilities

Page 7: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Gleaning Combines

Patching & Caching ideas Wide-area scalable Reduced server load Reduced network load Can support standard

clients

multicast

Unicast patch stream

Central server

1st client 2nd client

Join !

cyclicbuffer

Unicast Unicast

Proxy cacheProxy cache

Page 8: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Proxy prefix Caching Split movie

Prefix Suffix

Operation Store prefix in prefix cache

Coordination necessary! On demand

Delivery prefix immediately Prefetch suffic from central

server

Goal Reduce startup latency Hide bandwidth limitations,

delay and/or jitter in backbone

Reduce load in backboneClient

Unicast

Unicast

Central server

Prefix cache

Page 9: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

MCache One of several Prefix

Caching variations Combines Batching and

Prefix Caching Can be optimized per movie

server bandwidth network bandwidth cache space

Uses multicast Needs non-standard clients

Central server

1st client 2nd client

Unicast Unicast

Prefix cachePrefix cache

Batch(multicast)

Page 10: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Proxy prefix Caching Basic version

Practical No multicast Not optimized Aimed at large ISPs Wide-area scalable Reduced server load Reduced network load Can support standard

clients Can partially hide jitter

Optimized versions Theoretical Multicast Optimized Optimum is constantly

unstable jitter and loss is

experienced for each client !

Page 11: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Periodic Multicasting with Pre-Storage

Optimize storage and network Wide-area scalable Minimal server load

achievable Reduced network load Can support standard

clients

Specials Can optimize network load

per subtree

Negative Bad error behaviour

1st client 2nd client

Central server

Assumed startof the show

Page 12: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Periodic Multicasting with Pre-Storage

Optimize storage and network Wide-area scalable Minimal server load

achievable Reduced network load Can support standard

clients

Specials Can optimize network load

per subtree

Negative Bad error behaviour

1st client 2nd client

Central server

Page 13: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Type IV – Distribution Systems

Autonomous servers Requires decision making on each proxy Some content must be discarded Caching strategies

Coordinated servers Requires central decision making Global optimization of the system

Cooperative servers No quantitative research yet

Page 14: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

Autonomous servers

Page 15: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Simulation Binary tree model allows

Allows analytical comparison of

Caching Patching Gleaning

Considering optimal cache placement

per movie basic server cost per-stream costs of storage,

interface card, network link movie popularity according

to Zipf distribution

central server

optional

network link

cache server

0

0.08

0.16

0 20 40 80 10060rela

tiv

e p

rob

ab

ilit

y/x

/1

Page 16: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Simulation Example

500 different movies 220 active users basic server: $25000 interface cost: $100/stream network link cost: $350/stream storage cost: $1000/stream

Analytical comparison demonstrates potential of the approach very simplified

CachingCachingUnicast transmission 4664 Mio $

PatchingNo cachingMulticastClient side buffer

375 Mio $

GleaningCachingMulticast 276 Mio $

Page 17: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Simulation Modeling

User behaviour Movie popularity development Limited resources Hierarchical topology

Individual user’s Intention

depends on user’s time (model randomly) Selection

depends on movies’ popularity Popularity development

Po

pu

lari

ty

Movie title age Ob

serv

ed

Hits

Movie title age

Page 18: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Caching Strategies Strategies

FIFO First-in-first-out

Remove the oldest object in the cache in favor of new objects

LRU Least recently used strategy

Maintain a list of objects Move to head of the list whenever accessed Remove the tail of the list in favor of new objects

IRG-k Inter-reference gap

Log number of requests Maintain a list of objects Sort by number average of distance between k last requests Remove object with largest number of intermediate requests

in favor of new objects

Page 19: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Caching Strategies Considerations

conditional overwrite strategies can be highly efficient

limited uplink bandwidth quickly exhausted performance degrades immediately when working set is too

large for storage space

IRGForget object statistics when removedCache all requested objects

Log requests between hits

ECTRemember object statistics foreverCompare requested object andreplacement candidateLog times between hits

ECT Eternal, Conditional, Temporal

Page 20: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Effects of caching strategies on throughput

Movies 1.5 MBit/s, 5400 sec, size ~7.9 GB

Uplink usage profits greatly from small cache increases ... ... if there is a strategy

Conditional overwrite reduces uplink usage

0

50

100

150

0 5000 10000 20000 30000 40000 50000

Th

rou

gh

pu

t

Users

155 MBit/s uplink usage for single server, 64 GB cache

0

50

100

150

0 5000 10000 20000 30000 40000 50000

Th

rou

gh

pu

t

Users

155 MBit/s uplink usage for single server, 96 GB cache

ECTFIFO

LRU

ECTFIFO

LRU

better

Page 21: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Effects of caching strategies on user hit rates

Hit ratio Dumb strategies do not profit from cache size increases Intelligent strategies profit hugely from cache size increases Conditional overwrite outperforms other strategies massively

0.5

0.75

1

0 5000 10000 20000 30000 40000 50000

Hit

Ra

tio

Users

Cache Hit Ratio for single server, 64 GB cache

0.5

0.75

1

0 5000 10000 20000 30000 40000 50000

Hit

Ra

tio

Users

Cache Hit Ratio for single server, 96 GB cache

ECTFIFO

LRU

ECTFIFO

LRU

better

Page 22: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Effects of number of movies on uplink usage

In spite of 99% hit rates Increasing the number of user will congest the uplink Note

scheduling techniques provide no savings on low-popularity movies identical to unicast scenario with minimally larger caches

ECT Cache uplink usage with 64 GB, 155 MBit/s link

0

25

50

75

100

1000 2000 3000 4000 5000 6000 7000

Th

rou

gh

pu

t %

Movies in system

0

25

50

75

100

1000 2000 3000 4000 5000 6000 7000

Th

rou

gh

pu

t %

5000 users

10000 users

ECT Cache uplink usage with 64 GB, 622 MBit/s link

5000 users

10000 users

better

Page 23: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Effects of number of movies on hit ratio

Limited uplink bandwidth Prevents the exchange of titles with medium popularity Unproportional drop of efficiency for more users Strategy can not recognize medium popularity titles

ECT Cache hit ratio with 64 GB, 155 MBit/s link ECT Cache hit ratio with 64 GB, 622 MBit/s link

0.5

0.6

0.7

0.8

0.9

1

1000 2000 3000 4000 5000 6000 7000

Hit

Rat

io

Movies in system

0.5

0.6

0.7

0.8

0.9

1

1000 2000 3000 4000 5000 6000 7000

Hit

Rat

io

Movies in system

5000 users

10000 users

5000 users

10000 users

better

Page 24: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Effects of user numbers on refusal probabilities

Uplink-bound scenario Shows that low-popularity are accessed like unicast by all

techniques Patching techniques with infinite window can exploit multicast Collecting requests does not work

Cache size Is not very relevant for patching techniques Is very relevant for full-title techniques

0

0.005

0.01

0.015

0.02

0 10000 20000 30000 40000 50000

Re

fus

al

Pro

ba

bil

ity

Users

Refusal probability, 64 GB cache, 622 Mbit/s uplink

0

0.005

0.01

0.015

0.02

0 10000 20000 30000 40000 50000

Re

fus

al

Pro

ba

bil

ity

Users

Refusal probability, 96 GB cache, 155 Mbit/s uplink

batching

gleaning

unicast

batching gleaning

unicast

better

Caching strategy: ECT Caching strategy: ECT

Page 25: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Bandwidth effect of daytime variations

Change popularity according to time-of-day Two tests

Popularity peaks and valleys uniformly distributed Complete exchange of all titles Spread over the whole day

Popularity peaks and valleys either at 10:00 or at 20:00 Complete exchange of all titles Within a short time-frame around peak-time

Astonishing results For ECT with all mechanisms Hardly any influence on

hit rate uplink congestion

Traffic is hidden by delivery of low-popularity titles

Page 26: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Hint-based Caching

Idea Caches consider requests to neighbour caches in their removal decisions

Conclusion Instability due to uplink congestion can not be prevented Advantage exists and is logarithmic as expected

Larger hint numbers maintain the advantage to the point of instability Intensity of instability is due to ECT problem

ECT inherits IRG drawback of fixed-size histograms

0.6

0.7

0.8

0.9

10 100 1000

Hit

Ra

tio

Users

10 h ints

100 h ints

1000 h ints

10000 hints

0.6

0.7

0.8

0.9

10 100 1000

Hit

Ra

tio

Users

10 h ints

100 h ints

1000 h ints

10000 hints

better

Hit ratio development, increasing #hints, ECT history 8 Hit ratio development, increasing #hints, ECT history 64

Page 27: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Simulation High relevance of population sizes

complex strategies require large customer bases Efficiency of small caches

90:10 rule-of-thumb reasonable unlike web caching

Efficiency of distribution mechanisms considerable bandwidth savings for uncached titles

Effects of removal strategies relevance of conditional overwrite unlike web caching, paging, swapping, ...

Irrelevance of popularity changes on short timescales few cache updates

compared to many direct deliveries

Page 28: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

Coordinated servers

Page 29: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Combined optimization Scheduling algorithm Proxy placement and dimensioning

client

1st level cache

2nd level cache

d-2nd level cache

d-1st level cache

origin server

Page 30: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Combined optimization Scheduling algorithm Proxy placement and dimensioning

No problems with simple scheduling mechanisms

Examples Caching with unicast communication Caching with greedy patching

Patching window in greedy patching is the movie length

Page 31: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

0100200

Movie (0-300 of 500)

0.5

1Linkclient

1

2

3

4

5

origin

Cac

he

Le

vel

Cost

Caching

0100200

Movie (0-300 of 500)

0.5

1 Linkclient

1

2

3

4

5

origin

Ca

che

Lev

el

Cost

Caching and Greedy Patching

Movies moveAway from clients

top movieDecreasing popularity

Network for free

Increasing network costs

Page 32: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Combined optimization Scheduling algorithm Proxy placement and dimensioning

Problems with complex scheduling mechanisms Examples

Caching with -patching Patching window is optimized for minimal server load

Caching with gleaning A 1st level proxy cache maintains the ”client buffer” for

several clients

Caching with MPatch The initial portion of the movie is cached in a 1st level proxy

cache

Page 33: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

-Patching

time

posi

tion

in m

ovie

(of

fset

)

Num

ber

of c

oncu

rren

t st

ream

s

UM F 2

multicast

Unicast patch stream

Central server

1st client 2nd client

cyclicbuffer

Page 34: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Placement for -patching

Popular movies are further away from the client

Page 35: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Failure of the optimization Implicitly assumes perfect delivery Has no notion of quality User satisfaction is ignored

Disadvantage Popular movies further away from clients

Longer distance Higher startup latency Higher loss rate More jitter

Popular movies are requested more frequently Average delivery quality is lower

Page 36: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Placement for gleaning Combines

Caching of the full movie Optimized patching Mandatory proxy cache

2 degrees of freedom Caching level Patch length

Page 37: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Placement for gleaning

Page 38: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Placement for MPatch Combines

Caching of the full movie Partial caching in proxy servers Multicast in access networks Patching from the full copy

3 degrees of freedom Caching level Patch length Prefix length

Page 39: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Placement for MPatch

Page 40: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Approaches Consider quality

Penalize distance in optimality calculation Sort

Penalty approach Low penalties

Doesn’t achieve order because actual cost is higher High penalties

Doesn’t achieve order because optimizer gets confused

Sorting Trivial Very low resource waste

Page 41: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

Distribution Architectures

Combined optimization Scheduling algorithm Proxy placement and dimensioning Impossible to achieve optimum with autonomous

caching

Solution for complex scheduling mechanisms

A simple solution exists: Enforce order according to priorities

(simple sorting)

Increase in resource use is marginal

Page 42: Distribution – Part II 13/10 – 2003 INF5070 – Media Storage and Distribution Systems:

2003 Carsten Griwodz & Pål Halvorsen

INF5070 – media storage and distribution systems

References1. S.-H. Gary Chan and Fourad A. Tobagi: "Distributed Server Architectures for Networked

Video Services", IEEE/ACM Transactions on Networking 9(2), Apr 2001, pp. 125-1362. Subhabrata Sen and Jennifer Rexford and Don Towsley: "Proxy Prefix Caxching for

Multimedia Streams", Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), New York, NY, USA, Mar 1999, pp. 1310-1319

3. Sridhar Ramesh and Injong Rhee and Katherine Guo: "Multicast with cache (mcache): An adaptive zero-delay video-on-demand service", Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), Anchorage, Alaska, USA, Apr 2001

4. Michael Bradshaw and Bing Wang and Subhabrata Sen and Lixin Gao and Jim Kurose and Prashant J. Shenoy and Don Towsley: "Periodic Broadcast and Patching Services - Implementation, Measurement, and Analysis in an Internet Streaming Video Testbed", ACM Multimedia Conference (ACM MM), Ottawa, Canada, Sep 2001, pp. 280-290

5. Bing Wang and Subhabrata Sen and Micah Adler and Don Towsley: "Proxy-based Distribution of Streaming Video over Unicast/Multicast Connections", Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), New York, NY, USA, Jun 2002

6. Carsten Griwodz and Michael Zink and Michael Liepert and Giwon On and Ralf Steinmetz, "Multicast for Savings in Cache-based Video Distribution", Multimedia Computing and Networking (MMCN), San Jose, CA, USA, Jan 2000

7. Carsten Griwodz and Michael Bär and Lars C. Wolf: "Long-term Movie Popularity in Video-on-Demand Systems", ACM Multimedia Conference (ACM MM), Seattle, WA, USA, Nov 1997, pp. 340-357

8. Carsten Griwodz: "Wide-area True Video-on-Demand by a Decentralized Cache-based Distribution Infrastructure", PhD thesis, Darmstadt University of Technology, Darmstadt, Germany, Apr 2000