Top Banner
Voyagers and Voyeurs Supporting Social Data Analysis Jeffrey Heer Computer Science Department Stanford University CIDR 2009 Monterey, CA 5 January 2009
68

Voyagers and Voyeurs

Jan 04, 2017

Download

Documents

ledieu
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Voyagers and Voyeurs

Voyagers and VoyeursSupporting Social Data Analysis

Jeffrey HeerComputer Science DepartmentStanford University

CIDR 2009 – Monterey, CA5 January 2009

Page 2: Voyagers and Voyeurs

A Tale of Two Visualizations

Page 3: Voyagers and Voyeurs

vizster

Page 4: Voyagers and Voyeurs

Observations

Groups spent more time in front of the visualization than individuals.

Friends encouraged each other to unearth relationships, probe community boundaries, and challenge reported information.

Social play resulted in informal analysis, often driven by story-telling of group histories.

Page 5: Voyagers and Voyeurs

NameVoyagerThe Baby Name Voyager

Page 6: Voyagers and Voyeurs
Page 7: Voyagers and Voyeurs
Page 8: Voyagers and Voyeurs
Page 9: Voyagers and Voyeurs
Page 10: Voyagers and Voyeurs

Social Data Analysis

Visual sensemaking can be social as well as cognitive.

Analysis of data coupled with social interpretation and deliberation.

How can user interfaces catalyze and support collaborative visual analysis?

Page 11: Voyagers and Voyeurs

sense.usA Web Application for Collaborative Visualization of Demographic Data

Page 12: Voyagers and Voyeurs
Page 13: Voyagers and Voyeurs

Voyagers and Voyeurs

Complementary faces of analysis

Voyager – focus on visualized data

Active engagement with the data

Serendipitous comment discovery

Voyeur – focus on comment listings

Investigate others’ explorations

Find people and topics of interest

Catalyze new explorations

Page 14: Voyagers and Voyeurs

Out of the Lab,Into the Wild

Page 15: Voyagers and Voyeurs
Page 16: Voyagers and Voyeurs
Page 17: Voyagers and Voyeurs

Wikimapia.org

Page 18: Voyagers and Voyeurs

DecisionSite posters

Spotfire Decision Site Posters

Page 19: Voyagers and Voyeurs

Tableau Server

Page 20: Voyagers and Voyeurs
Page 21: Voyagers and Voyeurs

Many-Eyes

Page 22: Voyagers and Voyeurs

Social Data Analysis In Action

1. Discussion and Debate

2. Text is Data, Too

3. Data Integrity and Cleaning

4. Integrating Data in Context

5. Pointing and Naming

For each, some thoughts on future directions.

I asked my colleagues: if you could give database researchers a wish list, what would it be?

Page 23: Voyagers and Voyeurs

Discussion and Debate

Page 24: Voyagers and Voyeurs
Page 25: Voyagers and Voyeurs
Page 26: Voyagers and Voyeurs
Page 27: Voyagers and Voyeurs

Tableau X-Box / Quest Diag?

“Valley of Death”

Page 28: Voyagers and Voyeurs
Page 29: Voyagers and Voyeurs
Page 30: Voyagers and Voyeurs
Page 31: Voyagers and Voyeurs

Content Analysis of Comments

Feature prevalence from content analysis (min Cohen’s = .74)High co-occurrence of Observations, Questions, and Hypotheses

ServiceSense.us Many-Eyes

0 20 40 60 80

Percentage

0 20 40 60 80

Percentage

ObservationQuestion

HypothesisData Integrity

LinkingSocializing

System DesignTesting

TipsTo-Do

Affirmation

Page 32: Voyagers and Voyeurs

Reduce the cost of synthesizing contributions

WANTED: Structured Conversation

Wikipedia: Shared Revisions NASA ClickWorkers: Statistics

Page 33: Voyagers and Voyeurs

Reduce the cost of synthesizing contributions

Can we represent data, visualizations, and social activity in a unified data model?

WANTED: Structured Conversation

Page 34: Voyagers and Voyeurs

Text is Data, Too

Page 35: Voyagers and Voyeurs

Visualization Popularity

Over 1/3 of Many-Eyes visualizations use free text

ServiceMany-Eyes Swivel

0.0 0.1 0.2 0.3 0.4 0.5

Percentage

0.0 0.1 0.2 0.3 0.4 0.5

Percentage

Tag CloudBubble Graph

Word TreeBar Chart

MapsNetwork Diagram

TreemapMatrix Chart

Line GraphScatterplot

Stacked GraphPie Chart

Histogram

Page 36: Voyagers and Voyeurs
Page 37: Voyagers and Voyeurs

Alberto Gonzales

Page 38: Voyagers and Voyeurs

WANTED: Better Tools for Text

Statistical Analysis of text (with ties to source!)

Entity Extraction

Aggregation and Comparison of texts

Get a “global” view of documents

We can do better than Tag Clouds (!?)

Use text analysis tools to enable analysis of structured conversation by the community.

Page 39: Voyagers and Voyeurs

Data Integrity and Cleaning

Page 40: Voyagers and Voyeurs

No cooks in 1910? … There may have been cooks then. But maybe not.

Page 41: Voyagers and Voyeurs

The great postmaster scourge of 1910?

Or just a bugin the data?

Page 42: Voyagers and Voyeurs
Page 43: Voyagers and Voyeurs
Page 44: Voyagers and Voyeurs

Content Analysis of Comments

16% of sense.us comments and 10% of Many-Eyes comments reference data quality or integrity.

ServiceSense.us Many-Eyes

0 20 40 60 80

Percentage

0 20 40 60 80

Percentage

ObservationQuestion

HypothesisData Integrity

LinkingSocializing

System DesignTesting

TipsTo-Do

Affirmation

Page 45: Voyagers and Voyeurs

WANTED: Data Cleaning Tools

Reshape data, reformat rows & columns

Handle missing data: label, repair, interpolate

Entity resolution and de-duplication

Group related values into aggregates

Assist table lookups & data transforms

Provide tools in situ to leverage collective

Transparency requires provenance

Page 46: Voyagers and Voyeurs

Integrating Data in Context

Page 47: Voyagers and Voyeurs
Page 48: Voyagers and Voyeurs
Page 49: Voyagers and Voyeurs

College Drug Use

Page 50: Voyagers and Voyeurs

College Drug Use

Page 51: Voyagers and Voyeurs

Harry Potter is Freaking Popular

Page 52: Voyagers and Voyeurs
Page 53: Voyagers and Voyeurs

WANTED: In-Situ Data Integration

Search for and suggest related data or views

User input for types, schema matching, or data

Apply in context of the current task

But record mappings for future use

Record provenance: chain of data sources

Examples: Google Web Tables, Pay-As-You-Go, Stanford Vispedia, Utah VisTrails

Page 54: Voyagers and Voyeurs

Pointing and Naming

Page 55: Voyagers and Voyeurs

“Look at that spike.”

Page 56: Voyagers and Voyeurs

“Look at the spike for Turkey.”

Page 57: Voyagers and Voyeurs

“Look at the spike in the middle.”

Page 58: Voyagers and Voyeurs

Free-form Data-aware

Page 59: Voyagers and Voyeurs

Visual Queries

Model selections as declarative queries over interface elements or underlying data

(-118.371≤ lon AND lon≤ -118.164)AND(33.915≤ lat AND lat≤ 34.089)

Page 60: Voyagers and Voyeurs

Visual Queries

Model selections as declarative queries over interface elements or underlying data

Applicable to dynamic, time-varying data

Retarget selection across visual encodings

Support social navigation and data mining

Page 61: Voyagers and Voyeurs

WANTED: Data-Aware Annotation

Meta-queries linking annotations to views

Visually specifying notification triggers

Annotating data aggregates (use lineage?)

Unified model (again!) to facilitate reference

How to make it work at scale?

How else to use machine-readable annotations?

Can annotations be used to steer data mining?

Page 62: Voyagers and Voyeurs

Conclusion

Page 63: Voyagers and Voyeurs

Social Data Analysis

Collective analysis of data supported by social interaction.

1. Discussion and Debate

2. Text is Data, Too

3. Data Integrity and Cleaning

4. Integrating Data in Context

5. Pointing and Naming

Page 64: Voyagers and Voyeurs

Summary

As visualization becomes common on the web, opportunities for collaborative analysis abound.

Weave visualizations into the web: data access, visualization creation, view sharing and pointing.

Support discovery, discussion, and integrationof contributions to leverage the collective.

Improve both processes and technologies for communication and dissemination.

Page 65: Voyagers and Voyeurs

Parting Thoughts

Visualizations may have a catalytic effecton social interaction around data.

Encourage participation by minimizing or offsetting interaction costs.

Provide incentives by fostering the personal relevance of the data.

Page 66: Voyagers and Voyeurs

Acknowledgements

@ Berkeley: Maneesh Agrawala, Wes Willett, danah boyd, Marti Hearst, Joe Hellerstein

@ IBM: Martin Wattenberg, Fernanda Viégas

@ PARC: Stu Card

@ Tableau: Jock Mackinlay, Chris Stolte, Christian Chabot

Page 67: Voyagers and Voyeurs

Jeffrey Heer Stanford University

[email protected]://jheer.org

Voyagers and VoyeursSupporting Social Data Analysis

Page 68: Voyagers and Voyeurs

With a collaborative spirit, with a collaborative platformwhere people can upload data, explore data, compare solutions, discuss the results, build consensus, we can engage passionate people, local communities, media and this will raise - incredibly - the amount of people who can understand what is going on.

And this would have fantastic outcomes: the engagement of people, especially new generations; it would increase knowledge, unlock statistics, improve transparency and accountability of public policies, change culture, increase numeracy, and in the end, improve democracy and welfare.

Enrico Giovannini, Chief Statistician, OECD. June 2007.