Top Banner
Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000 http://www.xiwt.org/documents/IPERF-paper2 .pdf
17

Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Dec 25, 2015

Download

Documents

Jeremy Johnson
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Reading Report 14Yin Chen

14 Apr 2004

Reference:Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000

http://www.xiwt.org/documents/IPERF-paper2.pdf

Page 2: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Overview This work focuses on 3 primary internet performance issues:

Establishing baselines Will the application work? -- Whether the infrastructure can support new

applications and services. Detecting anomalies whether the existing infrastructure is currently meeting the performance and

reliability requirements. Identifying trends Will the application continue to work? – Predict the future performance of the

infrastructure.

Metrics of interest Roundtrip delay Packet loss Reachability Availability Refer to Reading Report 7

Page 3: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Related Works Visual Networks(1999) and Keynote(2000)

Offer products and services for assessing the performance of applications and offer QoS in dial-up network.

NOT offer the detailed data analysis and statistics generation capabilities.

The PingER project Led by Stanford Linear Accelerator Center (SLAC) An effort to monitor and understand the parts of the Internet used in high energy

nuclear and particle physics research. Involved 71countries on six continents. The monitoring site sends pings to a remote site, gather the packet loss and

roundtrip time reported by ping from each of the 20 monitoring sites, and write to a database at Fermilab.

The Cooperative Association for Internet Data Analysis Developed a series of measurement and analysis tools that can be used to better

understand Internet traffic Most of the tools can be download from http://caida.org/tools/

Page 4: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

MethodologyExperiment Setting Up Use PingER software, to measure roundtrip delay, packet loss and availability between

pairs of hosts. About dozen measurement hosts are included. Every 30 min, each host pings every other host to detect anomalies. A set of 11 pings of 100 bytes each is send first, the first ping is uses to eliminate

possible effects, i.e., priming of caches A set of 10 pings of 1,000 bytes is followed. Also sent a traceroute command to each remote host . It provides information about the

nodes a packet encounters along the path from the source to the destination, and the times the packet reaches those nodes.

Once a day, and archive host retrieves ping and traceroute data from each of the measurement hosts and stores the data in a database.

Each ping packet received by a source host contains a value for the roundtrip delay between that host and the destination host.

Packet lost -- If one ping is not returned within timeout time Unreachable -- If none of the pings returned

By retaining data on all the pings, can calculate a variety of statistics, i.e., mean, median, minimum, maximum, quartile, and can perform this calculation over any aggregated set of data, i.e., aggregation over all hosts or over a particular period of time.

Page 5: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

MethodologyAdvantages & Disadvantages

Advantages Simple Availability of the ping tool on all machines

Disadvantages Ping uses the Internet Control Message Protocol (ICMP), does not necessarily

have the same performance as TCP, UDP, or other IP protocols. i.e., ICMP packet can be given lower priority on some routers, or they can be

clocked by firewalls.

Page 6: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Examples A set of ping samples are collected every 30 min between each source and destination

pair. i.e., (default timeout for ping is 20 sec.){78ms, 85ms, 72ms, ∞, 64ms, 53ms, 81ms, 93ms, 101ms, 67ms}

Over the course of a day, 48 of these samples sets are collected :Time Sample Sets

1 12:00 am {78ms, 85ms, 72ms, ∞, 64ms, 53ms, 81ms, 93ms, 101ms, 67ms}

2 12.30 am {42ms, 77ms, 68ms, ∞, ∞, ∞, 95ms, 43ms, 41ms}

3 1.00 am {…}

4 1.30 am {…}

….

47 11.00 pm {…}

48 11.30 pm {…}

2 techniques for reducing large volume :statistics generation and data aggregation The median : 53 64 67 72 |78| 81 85 93 101 The Mean : (53+64+67+72+78+85+93+101) / 9 = 77.1ms The maximum : 101ms The minimum : 53ms

Page 7: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Example (Cont.)Loss

Loss = unsuccessful pings / the total number of pings, i.e., for 12:00am set L = 1/10 = 10%

Aggregate the data collected between a single source and multiple destinations, (or reverse)

Destination No 1 2 3 4 5 6 7 8 9 10

Loss 0% 0% 10% 30% 0% 100% 40% 0% 10% 0%

The media 0% 0% 0% 0% |0%| 10% 10% 30% 40% The mean (0+0+0+0+0+10+10+30+40) / 9 = 10% The maximum 40% The minimum 0%

Page 8: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines

Page 9: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)

Page 10: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)

Page 11: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)Visualizing the Aggregation

FIGURE 25 Conceptualization of a Large Set of Loss Samples (All Source-Destination Pairs)

Page 12: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)Time of Day Baselines

Page 13: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)Daily Baselines

Page 14: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Baselines (Cont.)Weekday Baselines

Page 15: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Data visualizationSingle-dimensional

Page 16: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Data VisualizationTwo-dimensional 2 ways

Direct plots of the two metrics Plots of some function of the two metrics

i.e., for availability, Availability is the fraction of time when the delay and loss rate of pings sent to a destination are

within selected thresholds: Good Delay < 100ms and loss < 5% Unavailable Delay > 400ms or loss > 20% Poor Otherwise

Plot ping data from a host pair during 10 weeks, Each point on the graph is a {loss, delay} pair i.e., for a set of delay measurements: {78ms, 85ms, 72ms, ∞, 64ms, 53ms, 81ms, 93ms, 101ms, 67ms}

Produce 10 {loss, delay} pairs :{10%, 78ms} {10%, 85ms}{10%, 72ms} {10%, ∞}{10%, 64ms} {10%, 53ms}{10%, 81ms} {10%, 93ms}{10%, 101ms} {10%, 67ms}

Page 17: Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.

Data VisualizationExamples of Baselines