Top Banner
The sum of all human knowledge in the age of machines A new research agenda for Wikimedia Dario Taraborelli • Wikimedia Foundation ICWSM 2015 Workshop, 26 May 2015
35

The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Aug 08, 2015

Download

Internet

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

The sum of all human knowledge in the age of machines

A new research agenda for Wikimedia

Dario Taraborelli • Wikimedia FoundationICWSM 2015 Workshop, 26 May 2015

Page 2: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

A conversation

Page 3: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Impact of Wikipedia research

contributor motivation

rise and decline of the editor population

gender gap

asymmetries in content and provenance of contributions

socio-technical systems governing quality control.

Page 4: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Human curated knowledge in the age of machines

Page 5: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

the long-form encyclopedia

Page 6: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Outline

1. sourcing information

2. consuming information

3. distributing content

A new research agenda

Wikimedia as a platform for researchers

Page 7: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

1. Sourcing information

Page 8: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Goats

Page 10: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)
Page 11: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)
Page 14: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

1. Sourcing information

● What role does information sourced by humans play when answers to most questions are readily available from search engines?

● Should Wikipedia start integrating algorithmically extracted sources in its contents?

● Should Wikipedia further invest in supporting human generated citations?

Page 15: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

2. Consuming information

Page 16: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

O. Keyes (2015) The Mobile Singularity is already here. Wikipedia and the Mobile Web, Part 1.

Page 17: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Bite size consumption

Page 18: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Structured contributions

Page 19: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Manipulating fragments

Page 20: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

media

structured data

referencesmedia

long-form text

fragments

references geocoordinatesstructured

data

decoupled article

Decoupling the article

long-form article

Page 21: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

2. Consuming information

● How to transform Wikimedia contents to make them suitable to bite size consumption?

● How to accelerate extraction and coverage of structured data in Wikidata?

● How to design effective lightweight contribution funnels around structured data and fragments?

● How to support programmatic manipulation of content fragments?

Page 22: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

3. Distributing content

Page 23: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

The paradox of reuse

Page 24: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Routing attention

Women in Science

Wikipedia needs your help

The English Wikipedia article Women in Science needs contributors from a more global perspective. Help expand it!

Page 25: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Routing attention

Page 26: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Routing attention

Page 27: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

3. Distributing content

● How can we design content distribution systems that do not intermediate Wikipedia?

● How do we leverage content syndication to route (expert) attention to the source?

Page 28: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

A new research agenda

Designing and evaluating systems to:

1. preserve and increase transparent sourcing of information

2. break down long-form articles into their constituents

3. optimize content fruition, as a function of access

4. enable lightweight contribution/manipulation of structured data / fragments

5. leverage content distributed / syndicated by 3rd parties

6. prioritize work and route contributors to the site, as a function of demand

Page 29: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Wikimedia Research as a platform

Page 30: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Wikimedia Research as a platform

Wikimedia Research & Data team

Edit quality classifiers

Automated link recommendations

Article translation recommendations

Fundraiser optimization

Page 31: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Scaling Wikimedia research

1:100,000,000Approximate ratio of full-time researchers at WMF by monthly unique visitors

Page 32: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Formal collaborations

Fellows and NDA’ed collaborators

Stanford University

GroupLens, University of Minnesota

Oxford Internet Institute (2015)

Los Alamos National Laboratory (2015)

Page 33: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Conclusions

Page 34: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Questions?

[email protected]

@readermeter@wikiresearch

Page 35: The sum of all human knowledge in the age of machines: A new research agenda for Wikimedia (ICWSM '15)

Image creditsElection Night Crowd, Wellington, 1931https://www.flickr.com/photos/nationallibrarynz_commons/3326203787CC0

King Billy of Dalkey Islandhttps://www.flickr.com/photos/paulodonnell/5937678226CC BY

Secretary at typewriter, 1912https://www.flickr.com/photos/muohio_digital_collections/3192197470CC0

"Getting em up" at U.S.Naval Training Camp, Seattle, Washington. ca. 1917 - ca. 1918https://www.flickr.com/photos/usnationalarchives/5505933145CC0