Data journalism - setting the stage Anders Pedersen @anpe @SchoolOfData

An introduction to Data Journalism

Aug 22, 2014


Anders Pedersen

Presentation given on May 14th at a School of Data training for journalists, held with the Open Data PH Taskforce in the Philippines.
Transcript
Page 1: An introduction to Data Journalism

Data journalism - setting the stage

Anders Pedersen @anpe @SchoolOfData

Page 2: An introduction to Data Journalism

Open Knowledge

Open Knowledge is a worldwide non-profit network of people passionate about openness, using advocacy, technology and training to unlock information and enable people to work with it to create and share knowledge.

Page 3: An introduction to Data Journalism

Evidence is power

School of Data works to empower civil society organizations, journalists and citizens with the skills they need to use data effectively – in their effort to create better societies.

Page 4: An introduction to Data Journalism

Target audience

We work mostly with change makers: NGOs and journalists.

We empower them to use data effectively to advance their cause and mission through a combination of training and long-term support.

Page 5: An introduction to Data Journalism

Why School of Data

School of Data is a critical component of the open data ecosystem:

● provides tools and training to empower people to use open data for good - especially to people new to open data;

● supports outreach and engagement by creating a supportive community of learners and mentors - working with Open Knowledge Foundation Local Groups;

● creates opportunities for people and communities to use open data to make an impact;

● works with both governments, to open up data, and data users such as journalists and NGOs.

Page 6: An introduction to Data Journalism

● Data expeditions - short online and offline gatherings where a group of people with different backgrounds tackle a data-related problem

● Data clinics - hands-on support working directly with people’s data

● Mentoring - local mentors working with local communities

● Online content - tutorials and walkthroughs

● Offline resources, e.g. the Data Journalism Handbook

Page 7: An introduction to Data Journalism

● We work globally, with a focus on the following regions: Latin America, Sub-Saharan Africa, the Middle East and Europe

● School of Data is translated into Spanish and Portuguese

● Future: French, Greek and Italian

● Over 10 fellows working in countries like Egypt, Lebanon, Uganda, Mexico, Costa Rica, Brazil, etc.

Page 8: An introduction to Data Journalism

Data Journalism:

Setting the stage

Page 9: An introduction to Data Journalism

Where do gun owners live?

Complex stories can now be told

Page 10: An introduction to Data Journalism

Budget information that readers can understand

But be aware of complexity!

Page 11: An introduction to Data Journalism

How quickly will the ambulance arrive?

Source: http://visualoop.com/media/2012/11/How-fast-is-LAFD-where-you-live-750x298.jpg

Enables you to focus locally

Page 12: An introduction to Data Journalism

And how about the fire truck?

Fire fighter response times in London

Page 13: An introduction to Data Journalism

Granularity is king

Tip: the story is almost always buried in granular data

Source: Mapumental
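
To make the tip concrete, here is a minimal sketch of how an overall average can hide the local story. The district names and response times are invented for illustration and do not come from the examples shown in the slides.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical response times in minutes, invented for illustration only.
calls = [
    ("North", 5.0), ("North", 6.0),
    ("Centre", 4.0), ("Centre", 5.0),
    ("South", 14.0), ("South", 16.0),
]

def average_by_district(rows):
    """Group (district, minutes) pairs and average each district separately."""
    grouped = defaultdict(list)
    for district, minutes in rows:
        grouped[district].append(minutes)
    return {district: mean(values) for district, values in grouped.items()}

# One city-wide number versus one number per district.
overall = mean(minutes for _, minutes in calls)
by_district = average_by_district(calls)

print(f"City-wide average: {overall:.1f} minutes")
for district, minutes in by_district.items():
    print(f"{district}: {minutes:.1f} minutes")
```

In this made-up data the city-wide figure looks unremarkable, while the per-district breakdown shows one area waiting roughly three times as long as the others.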

Page 14: An introduction to Data Journalism

Granularity is king

Who benefits from government subsidies?

Page 15: An introduction to Data Journalism

Who is benefiting from government contracts?

Source: http://usual-suppliers.pudo.org/

Page 16: An introduction to Data Journalism

Data journalism is also text mining

● U.K. MP expenses – 700,000 documents in PDF-format

● Wikileaks Iraq war data – 391,832 structured records, each including a text description

● Wikileaks diplomatic cables – 251,287 cables, each a few pages long

● NSA files leaked by Snowden – 50,000 to 200,000 documents, according to the NSA

A text document also contains data

Source: Jonathan Stray, Overview project
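
As a rough illustration of treating text as data, the sketch below pulls monetary amounts out of free text with a regular expression. The sample sentence, the "GBP" prefix convention and the function name `extract_amounts` are assumptions made for this example; real document sets are far messier and usually need OCR and manual checking first.

```python
import re

# Matches amounts written like "GBP 1,250.00" or "GBP 89.50".
# The "GBP" prefix is an assumption about how amounts appear in the text.
AMOUNT_PATTERN = re.compile(r"GBP\s?(\d+(?:,\d{3})*(?:\.\d{2})?)")

def extract_amounts(text):
    """Return every matched amount in the text as a float."""
    return [float(match.replace(",", "")) for match in AMOUNT_PATTERN.findall(text)]

# Hypothetical sentence, invented for illustration only.
sample = "Claimed GBP 1,250.00 for office rent and GBP 89.50 for stationery."
amounts = extract_amounts(sample)

print(amounts)
print(f"Total claimed: {sum(amounts):.2f}")
```

Run over thousands of documents, the same idea turns a pile of text into a table of figures that can be sorted, summed and compared.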

Page 17: An introduction to Data Journalism

Telling clear stories

Where do companies live?

Page 18: An introduction to Data Journalism

Company ownership networks

Page 19: An introduction to Data Journalism

Where do people live?

Source: Where nobody lives, http://mapsbynik.tumblr.com/post/82791188950/nobody-lives-here-the-nearly-5-million-census

Demographics: Where nobody lives

Page 20: An introduction to Data Journalism

Using statistics can help you find stories

Stories in statistics: regression analysis and outliers → test fraud cases
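
A minimal sketch of the regression-and-outliers idea: fit a straight line through the data, then flag the records that sit far from it. The scores, the two-standard-deviation threshold and the function names are assumptions for illustration, not a method taken from the slides. A flagged record is a lead to investigate, not proof of anything.

```python
from statistics import mean, pstdev

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def flag_outliers(xs, ys, threshold=2.0):
    """Return indexes of points whose residual exceeds threshold * std dev."""
    slope, intercept = fit_line(xs, ys)
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    spread = pstdev(residuals)
    if spread == 0:
        return []  # every point lies exactly on the line
    return [i for i, r in enumerate(residuals) if abs(r) > threshold * spread]

# Hypothetical average scores per school, invented for illustration only.
last_year = [50, 55, 60, 65, 70, 75, 80, 85]
this_year = [52, 56, 61, 66, 95, 76, 81, 86]

suspicious = flag_outliers(last_year, this_year)
print("Schools worth a closer look (by index):", suspicious)
```

Note that a single extreme value also pulls the fitted line towards itself, so in practice a more robust method or a larger dataset would be preferable.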

Page 21: An introduction to Data Journalism

Condition: Machine readable data

Nothing beats a good CSV file
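
To show why machine-readable data matters, here is a small sketch that reads a CSV and totals one column by another using only the Python standard library. The column names and figures are hypothetical; with a real file you would pass an opened file object instead of the in-memory sample.

```python
import csv
import io
from collections import defaultdict

# Hypothetical spending data, invented for illustration only.
SAMPLE_CSV = """agency,supplier,amount
Health,Acme Supplies,1200.50
Education,Bright Books,800.00
Health,Acme Supplies,300.25
"""

def total_by_column(csv_file, key, value):
    """Sum the numeric `value` column for each distinct entry in `key`."""
    totals = defaultdict(float)
    for row in csv.DictReader(csv_file):
        totals[row[key]] += float(row[value])
    return dict(totals)

totals = total_by_column(io.StringIO(SAMPLE_CSV), "agency", "amount")

for agency, amount in totals.items():
    print(f"{agency}: {amount:.2f}")
```

The same question asked of a scanned PDF or a stack of paper would first require extraction and cleaning, which is where most of the effort tends to go.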

Page 22: An introduction to Data Journalism

Good data is rarely available

Page 23: An introduction to Data Journalism

How we often get important data

Government official: “Please receive our annual audit reports in this stack of papers.”

Hard copies = hard work!

Page 24: An introduction to Data Journalism

Crowd cleaning of data

When data is messy: readers can help extract and clean the data

Page 25: An introduction to Data Journalism

Crowd cleaning of data

Readers can annotate documents

Page 26: An introduction to Data Journalism

Mapping people, power and money

Source: “Who is in charge” created by CIVIO (Spain), http://quienmanda.es/

Mapping relationships

Page 27: An introduction to Data Journalism

Who is friending whom?

What is in a picture? Matching faces to names

Source: vg.no mapping the royal family network in Norway (left), Dirty Energy Money (right)

Page 28: An introduction to Data Journalism

Connected China

Source: “Connected China” created by Reuters

Data on relationships

Page 29: An introduction to Data Journalism

Crowd collection of data

Readers can help collect data

Page 30: An introduction to Data Journalism

A clear bar chart is often all you need

Page 31: An introduction to Data Journalism

Spending: make readers understand

Page 32: An introduction to Data Journalism

Where to find the data?

Page 33: An introduction to Data Journalism

The data journalism tool box

● Extraction and scraping
○ Tabula
○ ScraperWiki
○ Online OCR

● Data cleaning
○ OpenRefine
○ Spreadsheets - yes, you cannot live without them

● Visualisation
○ DataWrapper - http://datawrapper.de/
○ D3.js - http://d3js.org/

The Data Journalism Handbook

School of Data

The tools you need

Page 35: An introduction to Data Journalism

Mailing lists

Page 36: An introduction to Data Journalism

Thank you!

Stay in touch:

[email protected] | [email protected]

@anpe | @SchoolOfData