Top Banner
Multidimensional Data Analysis IS 247 Information Visualization and Presentation 22 February 2002 James Reffell Moryma Aydelott Jean-Anne Fitzpatrick
43

Multidimensional Data Analysis

Jan 16, 2016

Download

Documents

neci

Multidimensional Data Analysis. IS 247 Information Visualization and Presentation 22 February 2002 James Reffell Moryma Aydelott Jean-Anne Fitzpatrick. Problem Statement. How to effectively present more than 3 dimensions of information in a visual display with 2 (to 3) dimensions? - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multidimensional Data Analysis

Multidimensional Data Analysis

IS 247 Information Visualization and Presentation

22 February 2002

James ReffellMoryma Aydelott

Jean-Anne Fitzpatrick

Page 2: Multidimensional Data Analysis

Problem Statement

• How to effectively present more than 3 dimensions of information in a visual display with 2 (to 3) dimensions?

• How to effectively visualize “inherently abstract” data?

• How to effectively visualize very large, often complex data sets?

• How to effectively display results – when you don’t know what those results will be?

Page 3: Multidimensional Data Analysis

Key Goals

• More than 3 dimensions of data simultaneously

• Support “fuzzyness” (similarity queries, vector space, tolerance ranges)

• Support exploratory, opportunistic, “what-if” queries

• Allow identification of “interesting data properties” through pattern recognition

• Explore various dimensions without losing overview

Page 4: Multidimensional Data Analysis

Another Statement of Goals

Visualization of multidimensional data• Without loss of information• With:

– Minimal complexity– Any number of dimensions– Variables treated uniformly– Objects remain recognizable across

transformations– Easy / intuitive conveyance of information– Mathematically / algorithmically rigorous

(Adapted from Inselberg)

Page 5: Multidimensional Data Analysis

Purposes / Uses

• Find clusters of similar data• Find “hot spots” (exceptional items in

otherwise homogeneous regions)• Show relationships between multiple

variables• Similarity retrieval rather than boolean

matching, show near misses

“Searching for patterns in the big picture and fluidly investigating interesting details without losing framing context” (Rao & Card)

Page 6: Multidimensional Data Analysis

Characteristics• “Data-dense displays” (large number of

dimensions and/or values)– Often combine color with position / proximity

representing relevance “distance” – Often provide multiple views

• Build on concepts from previous weeks:– Retinal properties of marks– Gestalt concepts, e.g., grouping– Direct manipulation / interactive queries– Incremental construction of queries– Dynamic feedback

• Some require specialized input devices or unique gesture vocabulary

Page 7: Multidimensional Data Analysis

ExamplesWarning: These visualizations

are not easy to grasp at “first

glance”!

DON’T PANIC

Page 8: Multidimensional Data Analysis

Influence Explorer / Prosection Matrix

(Tweedie et. al.)

• We saw the video• Abstract one-way mathematical

models: multiple parameters, multiple variables

• Data through sampling• Colour coding, esp. near misses• Task: Make the red bit as big as

possible!

Page 9: Multidimensional Data Analysis

Influence Explorer / Prosection Matrix

(Tweedie et. al.)

• Selecting performance limits

Page 10: Multidimensional Data Analysis

Influence Explorer / Prosection Matrix

(Tweedie et. al.)

• The colours go in two directions!

Page 11: Multidimensional Data Analysis

Influence Explorer / Prosection Matrix

(Tweedie et. al.)

• Fitting tolerance region (yellow box) to acceptability (red region) gives high yield for minimum cost

Page 12: Multidimensional Data Analysis

The Table Lens (Rao and Card)

• - Tools: zoom, adjust, slide

• - Cell contents coded by color (nominal) or bar length (interval)

• - Special mouse gesture vocabulary

• Search / browse (spotlighting)

Page 13: Multidimensional Data Analysis

The Table Lens (Rao and Card)

Page 14: Multidimensional Data Analysis

The Table Lens (Rao and Card)

http://www.tablelens.com

Page 15: Multidimensional Data Analysis

Parallel Coordinates(Inselberg)

• Transformation of multiple graphs by using parallel axes in a 2D representation.

• Users attempt to recognize patterns between the axes - adding or removing parts of the data to see general patterns or more closely examine particular interactions.

• Article offers suggestions on how to most effectively use this system.

Page 17: Multidimensional Data Analysis

Parallel Coordinates(Inselberg)

Strengths – • Works for any N• Clearly displays data characteristics of the data (without

needing beaucoup explanations)• Easy to adjust or focus displays/ queries • Testing showed that it showed problems missed using other

forms of process control• Can be used in decision support when used as a visual

modeling tool (to see how adjusting one parameter effects others).

Weaknesses – • Formation of complex queries can be tricky (if you want to

get results that are useful and easy to interpret).

Page 18: Multidimensional Data Analysis

Polaris(Stolte and Hanharan)

• Extends pivot tables to generate graphic displays

• Multiple graphs on one screen• Designed to “combine

statistical analysis and visualization”

(a pivot table)

(polaris)

Page 19: Multidimensional Data Analysis

Polaris(Stolte and Hanharan)

• Table algebra automatically generated via drag and drop.

• Suitable graphic types are system selected based on query/result criteria. Include tables, bar charts, dot plot, gantt charts, matrices of scatterplots, maps.

• Users can select marks (marks differ by shape, size, orientation and color).

Page 20: Multidimensional Data Analysis

Polaris(Stolte and Hanharan)

Strengths – • Can be used with existing DB systems• Direct manipulation - drag and drop • Users can play with appearance of

display• Linking and Brushing supported

Weaknesses – • User only sees aggregated (not original)

data

• System performs a number of functions automatically (conversion of variables, aggregation) - user may not know or not be able to control how their data is changed.

Page 21: Multidimensional Data Analysis

Worlds Within Worlds(Fiener and Beshers)

• Basic approach: graph 3 dimensions, while holding “extra” dimensions constant

• Visually represent “extra” dimensions as space within which graph(s) are placed– Position of “inner world” graph axis zero point

equals set of constant values in “outer world”

• Tools:– Dipstick– Waterline– Magnifying box

The following images from: http://www-courses.cs.uiuc.edu/~cs419/multidim.ppt

Page 22: Multidimensional Data Analysis
Page 23: Multidimensional Data Analysis
Page 24: Multidimensional Data Analysis
Page 25: Multidimensional Data Analysis
Page 26: Multidimensional Data Analysis

Worlds Within Worlds

• Constraints:– Uses special input device (“Data Glove”)

and output device (liquid crystal stereo glasses); use without these special devices less than optimal

• Technical details:– Suspend calculation of “child” details during

movement– Algorithm for prioritizing overlapping objects– Need to “turn off” gesture recognition to

allow normal use of hand

Page 27: Multidimensional Data Analysis

Worlds Within Worlds I/O Devices

Page 28: Multidimensional Data Analysis

Techniques for plotting multivariate functions

(Mihalisin et al)

• Multiples showing component dimensions, color codes for dimensions applied across multiples

• Or, for categorical data, select mth category from nth dimension

• Or, plot nested boxes, step values of independent variables and color-coding dependent variable

Page 29: Multidimensional Data Analysis

Techniques for plotting multivariate functions

(Mihalisin et al)

• Tools:– General zoom: look at smaller range

of data in same amount of space– Subspace zoom: select view of

particular dimension’s input to function

– Decimate tool: sample fewer values within range

Page 30: Multidimensional Data Analysis

from http://www.cs.umd.edu/class/spring2001/cmsc838b/presentations/Zhijian_Pan/mdmv.ppt

Page 31: Multidimensional Data Analysis

from http://www.cs.umd.edu/class/spring2001/cmsc838b/presentations/Zhijian_Pan/mdmv.ppt

Page 32: Multidimensional Data Analysis

VisDB(Keim & Kriegel)

• Mapping entries from relational database to pixels on the screen

• Include “approximate” answers, with placement and color-coding based on relevance

• Data points laid out in:– Rectangular spiral– Or, with axes representing positive/negative

values for two selected dimensions– Or, group dimensions together (easier to

interpret than very large number of dimensions)

Page 33: Multidimensional Data Analysis

from http://infovis.cs.vt.edu/cs5984/students/VisDB.ppt

Page 34: Multidimensional Data Analysis

VisDB - Relevance

• Relevance calculation based on “distance” of each variable from query specification

• Distance calculation depends on data type– Numeric: mathematical – String: character/substring matching, lexical,

phonetic?, syntactic?– Nominal: predefined distance matrix– Possibly other “domain-specific” distance

metrics

Page 35: Multidimensional Data Analysis

VisDB – Screen Resolution

• Stated screen resolution seems reasonable by today’s standards:19 inch display, 1024x1280 pixels= 1.3 million data points

• However, controls take up a lot of space!

Page 36: Multidimensional Data Analysis

from http://www1.ics.uci.edu/~kobsa/courses/ICS280/notes/presentations/Keim-VisDB.ppt

Page 37: Multidimensional Data Analysis

VisDB – Implementation

• Requires features not available in commercial databases:– Partial query results– Incremental changes to queries– Speed? (1994 vs today)

Page 38: Multidimensional Data Analysis

Limitations and Issues

• (intro to following slides and/or Tweedie’s words of wisdom?)

Page 39: Multidimensional Data Analysis

Complexity

• Simplest approach to representing N dimensions is N controls, N one-dimensional outputs – but this fails to represent complex relationships

• Middle ground achieved by some?

Page 40: Multidimensional Data Analysis

Abstract data

• These visualizations are oriented toward abstract data

• For “naturally” two or three-dimensional data (things that vary over time or space, e.g., geographic data) visualizations which exploit those properties may exist and be more effective

Page 41: Multidimensional Data Analysis

User Testing?

• Many of these systems seem only appropriate for expert use

Page 42: Multidimensional Data Analysis

Future Work

• Save query parameters for reference / sharing results

• Automated query generation or filtering – Intelligent agents?

Page 43: Multidimensional Data Analysis

Words of wisdomfrom Tweedie et al

• Trade-off between amount of information, simplicity, and accuracy

• “It is often hard to judge what users will find intuitive and how [a visualization] will support a particular task”