Top Banner
ACM/SPEC ICPE 2016 - March14 th , 2016 Weikun Wang (Imperial College London, UK) Giuliano Casale (Imperial College London, UK) Ajay Kattepur (TCS Innovation Labs, India) Manoj Nambiar (TCS Innovation Labs, India) Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data
25

Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

Jan 14, 2017

Download

Technology

Weikun Wang
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

ACM/SPEC ICPE 2016 - March14th, 2016

Weikun Wang (Imperial College London, UK)

Giuliano Casale (Imperial College London, UK)

Ajay Kattepur (TCS Innovation Labs, India)

Manoj Nambiar (TCS Innovation Labs, India)

Maximum Likelihood Estimation of Closed Queueing Network

Demands from Queue Length Data

Page 2: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

2

SPE & Model-Driven Engineering

Platform-Indep. Model

ArchitectureModel

Platform-Specific Model

Performance& Reliability Models

Page 3: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

3

CQN Characterization

CPU-1

Time

Service

Demand

X X X

CPU-1

Mix 1 Mix 2 Mix 3

Service demand of a request

CPU time, bandwidth consumed, …

Multi-threaded software

e.g., web servers

Page 4: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

4

Drawback of Existing methods

Utilization based approaches

Regression based on utilization and throughput

Issues: collinearities, load-dependence, outliers, utilization unreliable/unavailable, …

Page 5: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

5

State observations

Dataset ( points):

CQN State:

Queue length samples

Page 6: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

6

State observations

Dataset ( points):

CQN State:

Queue length samples

Page 7: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

7

Assume product-form state probabilities

Computationally challenging to evaluate

Maximum likelihood estimation?

Infer demands with the probability

Queue length samples

Service demand

Normalizing constant

Queue length

Page 8: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

8

Maximum likelihood estimation (MLE)

Problem with direct computation

Evaluation of for each observation

Slow due to the need for computing

Very small probabilities when L is large

Any other solution?

Maximum Likelihood Estimation

parameter spaceLikelihood

Page 9: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

9

A necessary condition for a point inside to be a MLE is that

How to find the MLE?

Change the value of , until the mean queue length predicted with MVA match

Fixed point iteration or an optimization program

Maximum Likelihood Estimation

observed mean queue length

theoretical mean queue length

Only meanqueue length is required!

Page 10: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

10

Confidence Intervals

Assume the MLE to be asymptotically normal

Confidence intervals for the MLE demands

is the Fisher Information matrix

is the Hessian matrix

works with mean queue length only!

Obtained by using standard MVA, no probabilities!

Page 11: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

11

QMLE Approximation

Exact MLE can be found by direct search

Fixed-point iteration tends to be effective

A simple approximation of the MLE:

Consider the demand vector where

Then it must be

observed mean queue length

Page 12: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

12

Validation- Existing methods

CI: Complete Information[J.F. Perez et al., IEEE Trans. Sw. Eng.’15]

Full knowledge of sample path

Baseline approach

ERPS: Extended Regression for Processor Sharing

[J.F. Perez et al., IEEE Trans. Sw. Eng.’15]

Based on mean response time and arrival queue

GQL: Gibbs Sampling for Queue Lengths

[W. Wang et al., Accepted to appear in ACM TOMACS]

Gibbs sampling based on queue length samples

Many iterations until convergence

Page 13: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

13

Validation

≈20000 random models

Randomized number of stations, classes, jobs

Focus on QMLE instead of exact analysis

Results

All the algorithms: below 10%

QMLE has less than 4% error

Confidence interval validated

Number of observations

Page 14: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

14

1E-04

1E-03

1E-02

1E-01

1E+00

1E+01

1E+02

1E+03

1E+04

CI ERPS GQL QMLE

Execution timeE

xe

cu

tio

n tim

e (

s)

Page 15: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

15

Mean demand varies under different load

Real world system behavior

e.g. multi-core servers

Load-dependent (LD) extension

Page 16: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

16

A scaling factor function

load-independent :

Product-form still holds

MLE

Load-dependent (LD) extension

new term

Page 17: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

17

Directly computation is infeasible

A necessary condition for a point inside

to be a MLE is that

and

Works with marginal probability only!

MLE characterization

Empirical marginal queue length probability

Theoretical marginal queue length probability

Page 18: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

18

How to find the MLE?

Solve by optimization program

Confidence intervals

Hessian matrix can still be derived

Computation requires marginal probabilities and mean queue length only

Drawback

Computationally expensive because of LD-MVA

MLE characterization

Page 19: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

19

Validation

Random models validation

2 stations, 2 classes, 8 jobs, different think time

MATLAB fmincon solver

Compare the estimated against exact ones

Considered scaling factors

: resembles multi-core feature

– number of CPUs in queueing station i.

Page 20: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

20

0

5

10

15

20

25

30

35

40

5000 10000 50000 100000

u

Estimation error on γ (scaling factors)E

rro

r o

n γ

(%)

Number of observed samples (L)

Page 21: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

21

0.0E+00

5.0E+03

1.0E+04

1.5E+04

2.0E+04

2.5E+04

3.0E+04

5000 10000 50000 10000

Execution timeE

xe

cu

tio

n tim

e (

s)

Number of observed samples (L)

Progress: We found a new approximation method for efficiently evaluating marginal probabilities, which reduces the execution time to < 20s on average!

Page 22: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

22

Case Study (MyBatis JPetStore)

3-tier commercial application

Transactions grouped in R=1 class

5 GB user data

User 1Worker Database

Web/Application server

Workload

GeneratorDispatcher

Database server

Page 23: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

23

Observed performance matching

Exact demand unknown

Estimated demands using QMLE

Validate observed throughput with estimated demands

0

5

10

15

20

25

0.1 0.5 1 5 All

ERPS

QMLE

Err

or

on

th

rou

gh

pu

t (%

)

Think time (s)

Page 24: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

24

Conclusion

Demand estimation from queue length

Efficient

Confidence interval characterization

Load-dependent extension

Ongoing work

Accelerate the load-dependent estimation

More experimental evaluations

Funded by FP7 MODAClouds, H2020 DICE, EPSRC OptiMAM

Page 25: Maximum Likelihood Estimation of Closed Queueing Network Demands from Queue Length Data

25

[email protected]

Thanks!