Top Banner
Theius: A Streaming Visualization Suite for Hadoop Clusters Jon Tedesco IC2E 2013, San Francisco, CA, USA Jon Tedesco, Roman Dudko, Abhishek Sharma, Reza Farivar, Roy Campbell
26

Theius: A Streaming Visualization Suite for Hadoop Clusters

Jun 22, 2015

Download

Technology

jtedesco5

Slides for presentation of "Theius: A Streaming Visualization Suite for Hadoop Clusters", given at IC2E 2013 in San Francisco, California.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Theius: A Streaming Visualization Suite for Hadoop Clusters

Theius: A Streaming Visualization Suite for

Hadoop ClustersJon Tedesco

IC2E 2013, San Francisco, CA, USAJon Tedesco, Roman Dudko, Abhishek Sharma, Reza Farivar, Roy Campbell

Page 2: Theius: A Streaming Visualization Suite for Hadoop Clusters

Problem

◦ System administrators

Bottleneck for detecting & responding to failures

Communicate state of system quickly

Monitoring

◦ Streaming, real-time data

◦ Ganglia

Widely used, scalable, and flexible

Prediction

◦ Online prediction algorithms (real-time)

Visualization Problem

◦ Ganglia

Static, time-based graphs

Motivation

Mon

itoring

Visualization

Prediction

Page 3: Theius: A Streaming Visualization Suite for Hadoop Clusters

3

Related Work - Ganglia

Page 4: Theius: A Streaming Visualization Suite for Hadoop Clusters

4

Related Work - Ganglia

Page 5: Theius: A Streaming Visualization Suite for Hadoop Clusters

5

Interactive

◦ Responsive and controllable

Real-time

◦ Streaming, real-time, automatic

Informative

◦ Direct attention to potential problems and artifacts

Intuitive

◦ Demand skill, not experience

Scalable

◦ Visualize large clusters without sacrificing usability

Design Principles

Page 6: Theius: A Streaming Visualization Suite for Hadoop Clusters

6

Objectives

◦ Streaming data

◦ Configurable and interactive

◦ Informative

Use cases

◦ Heterogeneous cluster

◦ Rack failure

◦ Node failure

◦ Uneven load distribution

Design Principles

Page 7: Theius: A Streaming Visualization Suite for Hadoop Clusters

7

Architecture

◦ Simulator

Generates simulated cluster data

Streams data to clients

◦ Webpage

Asynchronous & interactive

Implementation

◦ JavaScript

d3.js

jQuery

◦ Python

◦ AJAX

Visualization - Architecture

Page 8: Theius: A Streaming Visualization Suite for Hadoop Clusters

8

Data

◦ Methodology

Data types from previous work

Heuristic values

◦ Examples

CPU, memory, context switch rate

Log events

MapReduce tasks and jobs

Failure or event prediction

Visualization - Architecture

Page 9: Theius: A Streaming Visualization Suite for Hadoop Clusters

9

Visualization - Overview

Page 10: Theius: A Streaming Visualization Suite for Hadoop Clusters

10

Visualization – Main Visualization

Main Visualization

Customizable using control panel

Aggregate view

◦ Summarize and drill down

Draws attention to anomalies

Page 11: Theius: A Streaming Visualization Suite for Hadoop Clusters

11

Visualization – Navigation

Switch between main visualizations

Seamless transitions

◦ Uninterrupted data stream

Page 12: Theius: A Streaming Visualization Suite for Hadoop Clusters

12

Visualization - Tree

Hierarchy of nodes, organized by rack

Color and size configurable

Scalable using summarization and drill-

down

Identify abnormal rack or nodes

Page 13: Theius: A Streaming Visualization Suite for Hadoop Clusters

13

Visualization – Large Tree

Hierarchy of nodes, organized by rack

Color and size configurable

Scalable using summarization and drill-down

Identify abnormal rack or nodes

Page 14: Theius: A Streaming Visualization Suite for Hadoop Clusters

14

Visualization - MapReduce

Grouped by job

Color and size configurable

◦ Example uses role for color, time remaining

for size

Identify abnormal jobs or tasks

Page 15: Theius: A Streaming Visualization Suite for Hadoop Clusters

15

Visualization - TreeMap

Grouped by rack

Color and size configurable

◦ Example uses CPU usage and rack color coding

Identify abnormal nodes or racks

Page 16: Theius: A Streaming Visualization Suite for Hadoop Clusters

16

Visualization - Scatterplot

Identify trends with nodes and racks

Color, size, and plots configurable

Identify correlations between metrics

Page 17: Theius: A Streaming Visualization Suite for Hadoop Clusters

17

Visualization – Individual Node

Detailed data for individual node

Traditional visualizations for single

node

Page 18: Theius: A Streaming Visualization Suite for Hadoop Clusters

18

Visualization - Controls

Controls

Configure metrics for visualizations

Pause and resume data stream

Legend for main visualization

Page 19: Theius: A Streaming Visualization Suite for Hadoop Clusters

19

Visualization – Aggregate Data

Aggregate Data

Aggregate data for the cluster

◦ Log events stream

◦ Global node data

◦ Summarization data

Page 20: Theius: A Streaming Visualization Suite for Hadoop Clusters

20

Visualization - Timeline

History Controls

Snapshots of historical data

◦ See main visualization and sidebar data at

certain time

Visualize metric across time

Page 21: Theius: A Streaming Visualization Suite for Hadoop Clusters

21

Scalable

◦ Drill-down and summarization

◦ Efficient web-based framework

Intuitive, informative

◦ Topological visualization

◦ Draw attention to abnormalities

Interactive, real-time

◦ Designed for streaming data

◦ Configurable visualization

◦ Pause, rewind, resume

Visualizations Summary

Page 22: Theius: A Streaming Visualization Suite for Hadoop Clusters

22

Experimental Setup

◦ Compare Theius to Ganglia

◦ 5 graduate students at UIUC

No prior experience with Ganglia or Theius

◦ 4 comparative tasks

Both Ganglia & Theius

◦ 6 scenarios for trends and correlations

Theius only

◦ Timings & subjective feedback

User Study

Page 23: Theius: A Streaming Visualization Suite for Hadoop Clusters

23

Tasks

◦ Scenario 1

CPU usage in single node

◦ Scenario 2

Node with highest CPU

◦ Scenario 3

High memory usage nodes

◦ Scenario 4

Aggregate cluster use

User Study – Comparative Tasks

Scen

ario

1

Scen

ario

2

Scen

ario

3

Scen

ario

40

10

20

30

40

50

60

Theius

Ganglia

Seco

nd

s

Page 24: Theius: A Streaming Visualization Suite for Hadoop Clusters

24

Task 1

◦ Identify abnormal rack in heterogeneous cluster

Task 2

◦ Identify rack with abnormal CPU usage

Task 3

◦ Identify machine that logged the last fatal error

Task 4

◦ Identify machine with high CPU, memory usage, or context switch

rate

Task 5

◦ Identify rack with high CPU, memory usage, or context switch rate

Task 6

◦ Identify correlation between context switch rate and CPU usage

User Study – Theius Tasks

2.2 s

6.2 s

10.0 s

67.4 s

1.2 s

7.8 s

Page 25: Theius: A Streaming Visualization Suite for Hadoop Clusters

25

Conclusion & Future Work

Source Code

◦ https://github.com/jtedesco/Theius

Future Work

◦ User study

System administrators

Larger group

Timing as appropriate metric

◦ MapReduce-specific visualizations

◦ Scalability experiments

Page 26: Theius: A Streaming Visualization Suite for Hadoop Clusters

Questions?Jon Tedesco

IC2E 2013, San Francisco, CA, USAJon Tedesco, Roman Dudko, Abhishek Sharma, Reza Farivar, Roy Campbell