Top Banner
Multicache-Based Content Man agement for Web Caching Kai Cheng and Yahiko Kambayashi Graduate School of Informatics, Kyo to University Kyoto JAPAN
24

Multicache-Based Content Management for Web Caching

Jan 30, 2016

Download

Documents

gaenor

Multicache-Based Content Management for Web Caching. Kai Cheng and Yahiko Kambayashi Graduate School of Informatics, Kyoto University Kyoto JAPAN. Outline of the Presentation. Introduction Localizing Web Contents Why Content Management Contributions of Our Work - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multicache-Based Content Management  for Web Caching

Multicache-Based Content Management

for Web Caching

Kai Cheng and Yahiko Kambayashi

Graduate School of Informatics, Kyoto University

Kyoto JAPAN

Page 2: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 2

Outline of the Presentation

• Introduction– Localizing Web Contents– Why Content Management– Contributions of Our Work

• Multicache-Based Content Management• Content Management Scheme for LRU-SP• Experimental Evaluation• Concluding Remarks

Page 3: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 3

Web Caching For Localizing Web Contents

19%

40%

75%

0%10%20%30%40%50%60%70%80%

1994 1996 1997

Internet Traffic for WWW• World Wide Content

Access/Delivery– Bandwidth Constraints– “Hot-Spot” Servers– Inherent Latency

(200300ms)

• Web Caching For Localizing Web Contents – Reduce Network Traffic– Distribute Server Load– Reduce Response Times– Can We Expect More ?

Page 4: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 4

Characteristics and Implications

Traditional Caching

Web Caching Implications

Process OrientedHuman-User

OrientedUser Preferences

System-Level Application-Level Semantic Information

Data Block Based Document-Based Varying Sizes, Types

Memory-Based Disk-BasedPersistent Storage,

Large Size,

Page 5: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 5

Limitations of Current Caching Schemes

• Document Managed As Physical Unit, Not Semantic Unit.

• Only Physical Properties Being Used

• Less Organized, Less Structured

• Only Support Simple Control Logic

Beyond Simple Priority Queues, Towards Sophisticated Content Management

Page 6: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 6

Content Management

• Basic Features– Larger Cache Space– Sophisticated Control Logic

• More Challenging – Sophisticated Replacement Policies With

• User-Oriented Performance Metrics

• Document Managed as Semantic Unit

Page 7: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 7

Contributions of This Work

• A Multicache Architecture for Implementing Sophisticated Content Management

• A Study of Content Management for LRU-SP

• Simulations to Compare LRU-SP Against Others

Page 8: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 8

Previous Work• Classifications (Cache Data )

– LRV, LNC-W3-U, etc.

• Segmentation (Cache Space)– Segmented FIFO, FBR, 2Q etc.

• Features– Differentiating Data With Different Properties

• Shortages: – No Sophisticated Category

– No Semantic-Based Classification

Page 9: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 9

Managing LFU Contents in Multiple Priority Queues

2

1

>2 B(8) C(6) D(3)

A(10) E(2) F(2)

F(1) G(1) H(1)

Hit

Hit

Outs

Outs

First In First Out Order

Ref

eren

ces

A(10) B(8) C(6) D(3) E(2) F(2) F(1) G(1) H(1)

Page 10: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 10

Basics of Cache

• Space– Limit Storage Space

• Contents– Objects Selected for Caching

• Policies– Replacement Policies

• Constraints– Special Conditions

Space

Contents Policies

Constraints

SpaceSpace

Page 11: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 11

Constraints for Cache

• Admission Constraints– Define Conditions for Objects Eligible For Caching

e.g. (size < 2MB) && !(Source = local)

• Freshness Constraints– Define Conditions for Objects Fresh Enough For Re-Use

e.g. (Type = news) && (Last-Modified < 1week)

• Miscellaneous Constraints e.g. (Time= end-of-day) (Total-Size< 95%*Cache-Size)

Page 12: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 12

Multicache Architecture

SUBCACHE SUBCACHE SUBCACHE SUBCACHE SUBCACHESUBCACHE

CENTRAL

ROUTER

CENTRAL

ROUTER

Client WWW

Web Cache With Multiple Subcaches

JUDGE

CONSTRAINTSCONSTRAINTS

CKBCKB

IN-CACHEIN-CACHE

Request/Response

Cache Knowledge

Base

Page 13: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 13

Components of the Architecture

• Central Router – Control and Mediate the Cache

• Cache Knowledge Base (CKB)– A Set of Rule Based To Allocate ObjectsR1. Allocate(X, 1):-url(X, U), match(U, *.jp),content(X, baseball)

• Subcaches– Keep Objects With Special Characteristics

• Cache Judge – Make Final Decisions From A Set of Eviction Candidat

es

Page 14: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 14

Central Router services each request. Suppose current request is for

document p; 1. Locating p by In-cache Index

2. If p is not in cache, download p; i. Validate Constraints, if false, loop;ii. Fire rules in CKB, let subcache ID = K;

iii. While no enough space in subcache K for p– Subcache K selects an eviction ;– If space sharing, other subcaches do same;– Judge assesses the eviction candidates;

– Purge the victim; iv. Cache p in subcache K

3. If p is in subcache , do i) - iv) re-cache p.

The Procedural Description

Page 15: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 15

Content Management for LRU-SP

• LRU (Least Recently Used)– Primarily Designed for Equal Sized Objects, an

d Only Recency of Reference In Use

• Extended LRUs– Size-Adjusted LRU (SzLRU)– Segmented LRU (SgLRU)

• LRU-SP(Size-Adjusted and Popularity-Aware LRU)– Make SzLRU Aware of Popularity Degree

Page 16: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 16

Probability of Re-ReferenceAs a Function of Current Reference Times

00.10.2

0.30.4

0.50.6

0.70.8

0.9

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Next Reference Next K References After First

Page 17: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 17

Cost -To-Size Ratio Model

• An Object A In Cache Saves Cost nref * (1/atime)– nref is the frequency of reference

– atime is the time since last access, (1/atime) is the dynamic frequency of A

• When Put In Cache, It Takes Up Space size– Cost-to-size ratio = nref /(size*atime)

• The Object With Least Ratio Is Least Beneficial One

Page 18: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 18

Content Management of LRU-SP

• CKB Rule:– Allocate(X, log(size/nref)):-Size(X, size), Freq(X, nref)

• Subcaches– Least Recently Used (LRU)

• Judge– Find the One With Largest (size*atime)/nref

– The Larger and Older and Colder, the Fast An Object Will Be Purged

Page 19: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 19

Multicache Architecture for LRU-SP

LRU Subcache ①

LRU Subcache ②

LRU Subcache ③

CKB

Hits A, B

A

B

C

Judge

a

b

c

Ca

Computational Complexity O(1)

Page 20: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 20

Predicted Results

• A higher Hit Rate is expectable for LRU-SP, because it utilizes three indicators to document popularity.

• However, higher Hit Rates are usually at the cost of lower Byte Hit Rates, given a similar popularity, because smaller documents contribute less to bytes of hit data.

Page 21: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 21

Experiment Results Better Than Expected

0

0.05

0.1

0.15

0.2

0.25

0.15 0.3 0.5 0.8 1.5 2 3 4 5 6 7 8

LRU-SP SzLRU SgLRU LRV

0

0.05

0.1

0.15

0.2

0.25

0.3RU-SP SzLRU SgLRU LRV

* *

Page 22: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 22

Results & Explanations

• LRU-SP really obtained a much higher Hit Rate than SzLRU, SgLRU and LRV.

• LRU-SP also obtained a high Byte Hit Rate, especially when cache space exceeds 3% of total required space.

• Really Popular Objects Are Saved, So Both Hit Rate and Byte Hit Rate are Improved.

• LRU-SP only incurs O(1) time complexity in content management.

Page 23: Multicache-Based Content Management  for Web Caching

WISE'2000 (C)[email protected] 23

Concluding Remarks

• Multicahe-Based Architecture Has Proved Well-Performed In Balancing High Performance and Low Overhead

• Possible To Incorporate Semantic Information as Well as User Preference In Caching

• It Can Work With General Database Systems to Support Web Information Integration. (Future Work)

Page 24: Multicache-Based Content Management  for Web Caching

Thank You !And Welcome To

http://www.isse.kuis.kyoto-u.ac.jp