Top Banner
Information Visualization Crash Course Chad Stolper Google (graduated from Georgia Tech CS PhD) 1 (AKA Information Visualization 101) http://poloclub.gatech.edu/cse6242 CSE6242: Data & Visual Analytics
145

Information Visualization Crash Course

Nov 01, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Information Visualization Crash Course

Information Visualization

Crash Course

Chad Stolper

Google(graduated from Georgia Tech CS PhD)

1

(AKA Information Visualization 101)

http://poloclub.gatech.edu/cse6242

CSE6242: Data & Visual Analytics

Page 2: Information Visualization Crash Course

What is Infovis?

Why is it Important?

Human Perception

Chart Basics(If Time, Some Color Theory)

The Shneiderman Mantra

Where to Learn More

2

Page 3: Information Visualization Crash Course

What is Information Visualization?

3

Page 4: Information Visualization Crash Course

Information Visualization

“The use of computer-supported, interactive,

visual representations of abstract data to

amplify cognition.”

Card, Mackinlay, and Shneiderman 1999

4

Page 5: Information Visualization Crash Course

Communication

Exploratory Data Analysis (EDA)

5

Page 6: Information Visualization Crash Course

Communication

6

(gone wrong)

Page 7: Information Visualization Crash Course

7

Page 8: Information Visualization Crash Course

8 X

Edward Tufte

An American statistician

and professor emeritus of

political science, statistics,

and computer science at

Yale University.

He is noted for his writings

on information design and

as a pioneer in the field of

data visualization.

-Wikipedia

Page 9: Information Visualization Crash Course

Space Shuttle ChallengerJanuary 28, 1986

9

Morning Temperature: 31°F

Page 10: Information Visualization Crash Course

10

Page 11: Information Visualization Crash Course

11

Tufte, E. R. (2012). Visual explanations: images and quantities,

evidence and narrative. Cheshire, CT: Graphics Press.

Page 12: Information Visualization Crash Course

13

Video originally from: http://www.FeynmanPhysicsLectures.com

Most Watched Science Experiment

Richard Feynman, Physics

Nobel laureate explained how

rubber became rigid in cold

temperate

YouTube video:

https://youtu.be/6Rwcbsn19c0

Page 13: Information Visualization Crash Course

How did this happen?

14

Page 14: Information Visualization Crash Course

15

Tufte, E. R. (2012). Visual explanations: images and quantities, evidence and narrative. Cheshire, CT: Graphics Press.

Engineers at Morton Thiokol, the rocket

maker, presented on the day before and

recommended not to launch.

Page 15: Information Visualization Crash Course

19

Page 16: Information Visualization Crash Course

24

Page 17: Information Visualization Crash Course

25

Page 18: Information Visualization Crash Course

26

Page 19: Information Visualization Crash Course

27

Page 20: Information Visualization Crash Course

28

Page 21: Information Visualization Crash Course

So, communication is

extremely important.

Visualization can help with that –

communicate ideas and insights.

29

Page 22: Information Visualization Crash Course

30http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html

Page 23: Information Visualization Crash Course

Visualization can also help with

Exploratory Data Analysis (EDA)

But why do you need to explore

data at all???

31

Page 24: Information Visualization Crash Course

“There are three kinds of lies:

lies, damned lies, and statistics.”

33

https://en.wikipedia.org/wiki/Lies,_damned_lies,_and_statistics

Page 25: Information Visualization Crash Course

Mystery Data Set

34

Page 26: Information Visualization Crash Course

Mystery Data Set

Property Value

mean( x ) 9

variance ( x ) 11

mean( y ) 7.5

variance ( y ) 4.122

correlation ( x,y ) 0.816

Linear Regression Line y = 3 + 0.5x

35

Page 27: Information Visualization Crash Course

36

Page 28: Information Visualization Crash Course

37

Page 29: Information Visualization Crash Course

38

Page 30: Information Visualization Crash Course

39

Page 31: Information Visualization Crash Course

Anscombe’s Quartet

40https://en.wikipedia.org/wiki/Anscombe%27s_quartet

Page 32: Information Visualization Crash Course

Anscombe’s Quartet

Sanity Checking Models

Outlier Detection

41

Page 33: Information Visualization Crash Course

Data visualization leverages

human perception

43

Page 34: Information Visualization Crash Course

Name the five senses.

44

Page 35: Information Visualization Crash Course

45

Sense Bandwidth (bits/sec)

Sight 10,000,000

Touch 1,000,000

Hearing 100,000

Smell 100,000

Taste 1,000

http://www.britannica.com/EBchecked/topic/287907/information-theory/214958/Physiology

Page 36: Information Visualization Crash Course

A (Simple) Model

of Human Visual Perception

46

Page 37: Information Visualization Crash Course

A (Simple) Model of Human Perception

47

Parallel detection of

basic features into

an iconic store

Serial processing of

object identification and

spatial layout

Stage 1 Stage 2

Page 38: Information Visualization Crash Course

Stage 1: Pre-Attentive Processing

Rapid

Parallel

Automatic(Fleeting = lasting for a short time)

48

Page 39: Information Visualization Crash Course

Stage 2: Serial Processing

Relatively Slow

(Incorporates Memory)

Manual

49

Page 40: Information Visualization Crash Course

Stage 1: Pre-Attentive Processing

The eye moves every 200ms

(so this processing occurs every

200ms-250ms)

50

Page 41: Information Visualization Crash Course

Example

1281768756138976546984506985604982826762

9809858458224509856458945098450980943585

9091030209905959595772564675050678904567

8845789809821677654876364908560912949686

51

Page 42: Information Visualization Crash Course

Example

1281768756138976546984506985604982826762

9809858458224509856458945098450980943585

9091030209905959595772564675050678904567

8845789809821677654876364908560912949686

52

Page 43: Information Visualization Crash Course

A few more examples from

Prof. Chris Healy at NC State

53

Page 44: Information Visualization Crash Course

54

Left Side Right Side

Page 45: Information Visualization Crash Course

Raise your hand if a RED DOT

is present…

(On the left or on the right?)

55

Page 46: Information Visualization Crash Course

56

Page 47: Information Visualization Crash Course

57

Page 48: Information Visualization Crash Course

Color (hue) is pre-attentively

processed.

58

Page 49: Information Visualization Crash Course

Raise your hand if a RED DOT

is present…

59

Page 50: Information Visualization Crash Course

60

Page 51: Information Visualization Crash Course

61

Page 52: Information Visualization Crash Course

Shape is pre-attentively

processed.

62

Page 53: Information Visualization Crash Course

Determine if a RED DOT is

present…

63

Page 54: Information Visualization Crash Course

64

Page 55: Information Visualization Crash Course

65

Page 56: Information Visualization Crash Course

Hue and shape together are NOT

pre-attentively processed.

66

Page 57: Information Visualization Crash Course

Pre-Attentive Processing

• length

• width

• size

• curvature

• number

• terminators

• intersection

• closure

• hue

• lightness

• flicker

• direction of motion

• binocular lustre

• stereoscopic depth

• 3-D depth cues

• lighting direction

67

Page 58: Information Visualization Crash Course

Stephen Few

“Now You See It”

pg. 3968

Page 59: Information Visualization Crash Course

Pre-Attentive → Cognitive

69

Page 60: Information Visualization Crash Course

Gestalt Psychology

Berlin, Early 1900s

70

Page 61: Information Visualization Crash Course

Gestalt Psychology

Goal was to understand

pattern perception

Gestalt (German) = “seeing the whole picture all at once”

instead of a collection of parts

Identified 8 “Laws of Grouping”

71

http://study.com/academy/lesson/gestalt-psychology-definition-principles-quiz.html

Page 62: Information Visualization Crash Course

Gestalt Psychology

1. Proximity

2. Similarity

3. Closure

4. Symmetry

5. Common Fate

6. Continuity

7. Good Gestalt

8. Past Experience

72

Page 63: Information Visualization Crash Course

How many groups are there?

73

Page 64: Information Visualization Crash Course

74

Page 65: Information Visualization Crash Course

Proximity

75

Page 66: Information Visualization Crash Course

How many groups are there?

76

Page 67: Information Visualization Crash Course

77

Page 68: Information Visualization Crash Course

Similarity

78

Page 69: Information Visualization Crash Course

How many shapes are there?

79

Page 70: Information Visualization Crash Course

80

Page 71: Information Visualization Crash Course

Closure

81

Page 72: Information Visualization Crash Course

How many items are there?

82

Page 73: Information Visualization Crash Course

( ) { } [ ]

83

Page 74: Information Visualization Crash Course

( ) { } [ ]

Symmetry

84

Page 75: Information Visualization Crash Course

How many sets are there?

85

Page 76: Information Visualization Crash Course

86

Page 77: Information Visualization Crash Course

87

Common Fate

Page 78: Information Visualization Crash Course

How many objects are there?

88

Page 79: Information Visualization Crash Course

89

Page 80: Information Visualization Crash Course

Continuity

90

Page 81: Information Visualization Crash Course

How many objects are there?

91

Page 82: Information Visualization Crash Course

92

Page 83: Information Visualization Crash Course

Good Gestalt

93

Page 84: Information Visualization Crash Course

What is this word?

94

Page 85: Information Visualization Crash Course

95

CLIP

Page 86: Information Visualization Crash Course

Past Experience

96

CLIP

Page 87: Information Visualization Crash Course

Pre-Attentive Processing

Gestalt Laws

101

Page 88: Information Visualization Crash Course

Detect Quickly

102

Page 89: Information Visualization Crash Course

Detect quickly does NOT mean

detect accurately

Ideally you want both.

103

Page 90: Information Visualization Crash Course

104Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer

and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.

Page 91: Information Visualization Crash Course

105Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer

and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.

1.0

1.5

2.0

2.5

3.0

10 20 30 40 50 60 70 80 90

T1T2T3T4T5T6T7T8T9

True Proportional Difference (%)

Lo

g E

rro

r

Figure 3: Midmeans of log absolute errors againsttrue percentages for each proportional judgment type;superimposed are curves computed with lowess.

the results for the position-angle experiment to those for theposition-length experiment. By designing judgment types 6and 7 to adhere to the same format as the others, the resultsshould be more apt for comparison. Indeed, the new resultsmatch expectations: psychophysical theory [7, 34] predictsareato perform worsethan angle, and both to besignificantlyworse than position. Theory also suggests that angle shouldperform worsethan length, but theresultsdo not support this.Cleveland & McGill also did not find angle to perform worsethan length, but as stated their position-angle results are notdirectly comparable to their position-length results.

EXPERIMENT 1B: RECTANGULAR AREA JUDGMENTS

After successfully replicating Cleveland & McGill’s results,we further extended the experiment to more judgment types.We sought to compare our circular area judgment (T7) re-sults with rectangular area judgments arising in visualiza-tions such as cartograms [9] and treemaps [26]. We hypoth-esized that, on average, subjects would perform similarly tothecircular case, but that performance would be impacted byvarying the aspect ratios of the compared shapes. Based onprior results [19, 34], we were confident that extreme varia-tions in aspect ratio would hamper area judgments. “Squar-ified” treemap algorithms [3, 35] address this issue by at-tempting to minimize deviance from a1:1 aspect ratio, but itisunclear that this approach isperceptually optimal. Wealsowanted to assess if other differences, such as the presence ofadditional distracting elements, might bias estimation.

Method

We again used Cleveland & McGill’s proportional judgmenttask: subjects were asked to identify which of two rectangles(marked A or B) was the smaller and then estimate the per-centage the smaller was of the larger by making a “quickvisual judgment.” We used a 2 (display) ⇥ 9 (aspect ra-tios) factorial design with 6 replications for a total of 108unique trials (HITs). In the first display condition (T8) we

Cleveland & McGill's Results

1.0 1.5 2.0 2.5 3.0 3.5

T1

T2

T3

T4

T5

Log Error

Crowdsourced Results

1.0 1.5 2.0 2.5 3.0 3.5

T1

T2

T3

T4

T5

T6

T7

T8

T9

Log Error

Figure 4: Proportional judgment results (Exp. 1A & B).Top: Cleveland & McGill’s [7] lab study. Bottom: MTurkstudies. Error bars indicate 95% confidence intervals.

1.0 1.5 2.0 2.5 3.0 3.5

2/3 : 2/3

1 : 1

3/2 : 3/2

2/3 : 1

2/3 : 3/2

1 : 3/2

Log Error

Aspe

ct

Ra

tio

s

Figure 5: Rectangular area judgments by aspect ratios(1B). Error bars indicate 95% confidence intervals.

showed two rectangles with horizontally aligned centers; inthe second display condition (T9) we used 600⇥400 pixeltreemaps depicting 24 values. Aspect ratios weredeterminedby the cross-product of the set { 2

3, 1, 3

2} with itself, roughly

matching the mean and spread of aspect ratios produced byasquarified treemap layout (we generated 1,000 treemaps of24 uniformly-distributed random values using Bruls et al.’slayout [3]: theaverageaspect ratio was1.04, thestandard de-viation was0.28). Wesystematically varied area and propor-tional difference across replications. Wemodified the squar-ified treemap layout to ensure that the size and aspect ratioof marked rectangles matched exactly across display condi-tions; other rectangle areas were determined randomly.

As a qualification task, we used multiple-choice versions oftwo trial stimuli, one for each display condition. For eachtrial (HIT), we requested N=24 assignments. We also re-duced the reward per HIT to $0.02. We chose this numberin an attempt to match the U.S. national minimum wage (as-suming a response time of 10 seconds per trial).

CHI 2010: Visualization April 10–15, 2010, Atlanta, GA, USA

206

Page 92: Information Visualization Crash Course

Mackinlay, 1986106

Page 93: Information Visualization Crash Course

Stephen Few

“Now You See It”

pg. 41 107

Page 94: Information Visualization Crash Course

What does this tell us?

108

Page 95: Information Visualization Crash Course

Barcharts, scatterplots, and line

charts are really effective

for quantitative data

0 20 40

0

20

40

0 20 40

0

20

40

0 20 40109

Page 96: Information Visualization Crash Course

(and for statistical distributions)

Tukey Box Plots

110

Page 97: Information Visualization Crash Course

111

Page 98: Information Visualization Crash Course

Median

Outliers

Largest < Q3 + 1.5 IQR

Smallest > Q1 - 1.5 IQR

Largest < Q3

Smallest > Q1

112

Page 99: Information Visualization Crash Course

Tufte’s Chart Principles

113

Edward Tufte

Page 100: Information Visualization Crash Course

Tufte’s Chart Principles

DO NOT LIE!Maximize Data-Ink Ratio

Minimize Chart Junk

116

Page 101: Information Visualization Crash Course

Tufte’s Chart Principles

DO NOT LIE!Maximize Data-Ink Ratio

Minimize Chart Junk

117

Page 102: Information Visualization Crash Course

118

Page 103: Information Visualization Crash Course

119

“Cumulative”

Page 104: Information Visualization Crash Course

http://www.perceptualedge.com/blog/?p=790120

Page 105: Information Visualization Crash Course

121

http://xkcd.com/1138/

Page 106: Information Visualization Crash Course

Tufte’s Chart Principles

DO NOT LIE!Maximize Data-Ink Ratio

Minimize Chart Junk

123

Page 107: Information Visualization Crash Course

http://skilfulminds.com/2011/04/05/exploring-the-usefulness-of-chartjunk-at-stl-ux-2011/

124

Page 108: Information Visualization Crash Course

125

Chartjunk. (2017, October 05). Retrieved December 01, 2017, from https://en.wikipedia.org/wiki/Chartjunk

Page 109: Information Visualization Crash Course

Please…

127

Page 110: Information Visualization Crash Course

No pie charts.

No 2.5D charts.

128

Page 111: Information Visualization Crash Course

129

Page 112: Information Visualization Crash Course

37

36

24

2 1

130

Page 113: Information Visualization Crash Course

0

5

10

15

20

25

30

35

40

131

Page 114: Information Visualization Crash Course

132

Page 115: Information Visualization Crash Course

PLEASE DON’T

EVER DO THIS!

133

Page 116: Information Visualization Crash Course

0 10 20 30 40

134

Page 117: Information Visualization Crash Course

But otherwise…

138

Page 118: Information Visualization Crash Course

Barcharts, scatterplots, and line

charts are really effectivefor quantitative data

0 20 40

0

20

40

0 20 40

0

20

40

0 20 40139

Page 119: Information Visualization Crash Course

Anyone else bored

by my color choices?

140

Page 120: Information Visualization Crash Course

In fact, grayscale can be risky…

141

Page 121: Information Visualization Crash Course

In fact, grayscale can be risky…

142

Page 122: Information Visualization Crash Course

Color is Powerful

143

Page 123: Information Visualization Crash Course

Call attention to information

Increase appeal

Increase memorability

Another dimension to work with

Color

144

Page 124: Information Visualization Crash Course

145

Have you heard of RGB?

RGB color model. (2017, November 20). Retrieved December 01, 2017, from https://en.wikipedia.org/wiki/RGB_color_model

Additive color model: colors create by mixing

red, green, blue light

Page 125: Information Visualization Crash Course

We see in RGB,

but we don’t interpret in RGB…

146

Page 126: Information Visualization Crash Course

147

Hue

Lightness

Saturation

Source: color picker in Affinity Designer

HSV Color Model

Page 127: Information Visualization Crash Course

Hue

Post & Greene, 1986148

Page 128: Information Visualization Crash Course

Hue and Colorblindness

10% of males and 1% of females

are Red-Green Colorblind

150

Page 129: Information Visualization Crash Course

151

Page 130: Information Visualization Crash Course

152http://viz.wtf/post/98981561686/ht-matthewbgilmore-noaas-new-weather-modelling

Page 131: Information Visualization Crash Course

Color and Quantitative Data

Can you order these (low→hi)?

154

Page 132: Information Visualization Crash Course

155http://www.personal.psu.edu/faculty/c/a/cab38/ColorSch/Schemes.html via Munzner

Page 133: Information Visualization Crash Course

156

Color Brewer for Picking Color Scales

COLORBREWER 2.0. (n.d.). Retrieved December 01, 2017, from http://colorbrewer2.org/

Page 134: Information Visualization Crash Course

Overview

Zoom+Filter

Details on Demand

Shneiderman Mantra

(Information-Seeking Mantra)

157

https://www.mat.ucsb.edu/g.legrady/academic/courses/11w259/schneiderman.pdf

Page 135: Information Visualization Crash Course

158

Page 136: Information Visualization Crash Course

http://visual.ly/every-single-death-game-thrones-series159

Page 137: Information Visualization Crash Course

160http://www.babynamewizard.com/voyager

Page 138: Information Visualization Crash Course

Where to learn more?

169

Page 139: Information Visualization Crash Course

CS 7450

Information Visualization

Every Fall

170

Page 140: Information Visualization Crash Course

Visualization @GeorgiaTech

vis.gatech.edu

171

Page 141: Information Visualization Crash Course

How to Make Good Charts

• Edward Tufte’s One-Day Workshop– http://www.edwardtufte.com/tufte/courses

• Edward Tufte, Visual Display of Quantitative Information– http://www.edwardtufte.com/tufte/books_vdqi

• Stephen Few, Show Me the Numbers: Designing Tables and Graphs to Enlighten– http://www.amazon.com/Show-Me-Numbers-

Designing-Enlighten/dp/0970601972/ref=la_B001H6IQ5M_1_2?s=books&ie=UTF8&qid=1385050724&sr=1-2

172

Page 142: Information Visualization Crash Course

Visualization Theory “Books”• Tamara Munzner VIS Tutorial and Book

– http://www.cs.ubc.ca/~tmm/talks.html

– http://www.cs.ubc.ca/~tmm/vadbook/

• Colin Ware, Information Visualization: Perception for Design– http://www.amazon.com/Information-Visualization-Perception-Interactive-

Technologies/dp/1558605118

• Stephen Few, Now You See It– http://www.amazon.com/Now-You-See-Visualization-

Quantitative/dp/0970601980/ref=pd_bxgy_b_img_z

• Edward Tufte, Envisioning Information– http://www.edwardtufte.com/tufte/books_ei

• Edward Tufte, Visual Explanations– http://www.edwardtufte.com/tufte/books_visex

• Edward Tufte, Beautiful Evidence– http://www.edwardtufte.com/tufte/books_be

• Tamara Munzner, Visualization Analysis & Design– http://www.amazon.com/Visualization-Analysis-Design-AK-

Peters/dp/1466508914

173

Page 143: Information Visualization Crash Course

Perception and Color Websites

• Chris Healy, NC State– http://www.csc.ncsu.edu/faculty/healey/PP/index.ht

ml

• Color Brewer– http://colorbrewer2.org/

• Maureen C. Stone (Color Links, Blog, Workshops)– http://www.stonesc.com/color/index.htm

• Subtleties of Color by Robert Simmon of NASA– http://blog.visual.ly/subtleties-of-color/

174

Page 144: Information Visualization Crash Course

Visualization Blogs

• Flowing Data by Nathan Yau– http://flowingdata.com/

• Information Aesthetics by Andrew Vande Moere– http://infosthetics.com/

• Information is Beautiful by David McCandless– http://www.informationisbeautiful.net/

• Visual.ly Blog– http://blog.visual.ly/

• Indexed Comic by Jessica Hagy– http://thisisindexed.com/

175

Page 145: Information Visualization Crash Course

Infographics

Visual.ly/view(wtfviz.net)

176