Top Banner
Content created by The Open Data Institute Using Open Data Dr David Tarrant | @davetaz | The Open Data Institute
38

Using Open Data - David Tarrant

Jan 22, 2018

Download

Technology

godanSec
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Using Open Data - David Tarrant

Content created by The Open Data Institute

Using Open DataDr David Tarrant | @davetaz | The Open Data Institute

Page 2: Using Open Data - David Tarrant

Content created by The Open Data Institute

Agenda

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 3: Using Open Data - David Tarrant

Content created by The Open Data Institute

Agenda

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 4: Using Open Data - David Tarrant

Content created by The Open Data Institute

Page 5: Using Open Data - David Tarrant

Content created by The Open Data Institute

data.gov.XX

Page 6: Using Open Data - David Tarrant

Content created by The Open Data Institute

Google advanced

site: Get results only from certain sites or domains

link: Find pages that link to a certain page

related: Find sites similar to one you already know

filetype: Find certain file types only

Page 7: Using Open Data - David Tarrant

Content created by The Open Data Institute

Aggregators and portalsCollect together data from across the web into one place.

FAO World Bank

Page 8: Using Open Data - David Tarrant

Content created by The Open Data Institute

ScrapingIf you can’t obtain usable data (csv, xls) then you may have to

resort to scraping.

pdftables.com magic.import.io

Page 9: Using Open Data - David Tarrant

Content created by The Open Data Institute

Page 10: Using Open Data - David Tarrant

Content created by The Open Data Institute

Page 11: Using Open Data - David Tarrant

Content created by The Open Data Institute

Page 12: Using Open Data - David Tarrant

Content created by The Open Data Institute

Page 13: Using Open Data - David Tarrant

Content created by The Open Data Institute

Agenda

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 14: Using Open Data - David Tarrant

Content created by The Open Data Institute

Guidelines

5 - S t a r s★★★★★

Page 15: Using Open Data - David Tarrant

Content created by The Open Data Institute

Open Data Certificate

http://certificates.theodi.org

Page 16: Using Open Data - David Tarrant

Content created by The Open Data Institute

Establishing trust in dataWho

Collected it?Owns it?

Publishes it?Is the Audience?

What

Is it (title/description)?Type of data is it?Type of objects?

WhenCollected?Published?Updated?

Due next update?

WhereWas it collected?

Is it used?Is it described?

Is it located?

Page 17: Using Open Data - David Tarrant

Content created by The Open Data Institute

http://5stardata.info/

★★★★★

5-S ta r s

Page 18: Using Open Data - David Tarrant

Content created by The Open Data Institute

Open Refine

http://openrefine.org

A free power tool for cleaning messy data

Page 19: Using Open Data - David Tarrant

Content created by The Open Data Institute

Agenda

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 20: Using Open Data - David Tarrant

Content created by The Open Data Institute

Data analysis

Quantitative Qualitative

Page 21: Using Open Data - David Tarrant

Content created by The Open Data Institute

Remember

• Not all data is structured

• Not all numeric data is structured

• Some text data is structured

Page 22: Using Open Data - David Tarrant

Content created by The Open Data Institute

Analysing quantitative data

Page 23: Using Open Data - David Tarrant

Content created by The Open Data Institute

Beware!• Targets

• Fluctuation

• Chance

• Correlation != Causation

https://xkcd.com/925/

Page 24: Using Open Data - David Tarrant

Content created by The Open Data Institute

Analysing qualitative dataEntity recognition can help with coding and thematic network analysis.

Try Open CalaisSearch: open calais

Page 25: Using Open Data - David Tarrant

Content created by The Open Data Institute

VisualisaionNot all data visualisations are good!

Page 26: Using Open Data - David Tarrant

Content created by The Open Data Institute

Picking the right visulisation

1) Audience• Who are your audience and what do they expect?

2) Purpose• What story are you trying to tell.

3) Data• What types of visulisation suit the data

Page 27: Using Open Data - David Tarrant

Content created by The Open Data Institute

Keep it simple!Which country achieved the greatest crop yield in 2014?

Page 28: Using Open Data - David Tarrant

Content created by The Open Data Institute

Nothing wrong with a bar chart

Observe how you don’t need unnecessary clutter like axis and labels you can’t read

Page 29: Using Open Data - David Tarrant

Content created by The Open Data Institute

Simple lines and interactivity

https://www.nytimes.com/interactive/2017/01/15/us/politics/you-draw-obama-legacy.html?_r=0

Page 30: Using Open Data - David Tarrant

Content created by The Open Data Institute

Agenda

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 31: Using Open Data - David Tarrant

Content created by The Open Data Institute

The policy cycle

Open data helps at every stage of the policy cycle!

Page 32: Using Open Data - David Tarrant

Content created by The Open Data Institute

Example policy

Agenda: To publish more open data from Universities on Agriculture.

Why? To increase the benefit from this data to improve agriculture

worldwide.

But what is the benefit to those who already hold the data?

Page 33: Using Open Data - David Tarrant

Content created by The Open Data Institute

Understanding researchers

Universities are ranked on the quality of their research which is

linked to publication.

Therefor if data publication can hold the same value and benefit then we should see more data.

Page 34: Using Open Data - David Tarrant

Content created by The Open Data Institute

How research creates impact

1) The journal of publication

2) The number of citations the paper has

Page 35: Using Open Data - David Tarrant

Content created by The Open Data Institute

Doing the same for research data

1) Create reputable places to share data

2) Create a way to link/reference the data, including an index

3) Mandate the publication of research data

Page 36: Using Open Data - David Tarrant

Content created by The Open Data Institutehttps://blog.datacite.org/general-assembly-2016/

Page 37: Using Open Data - David Tarrant

Content created by The Open Data Institute

Recap

Discovering open data

Quality and provenance

Data analysis and visualisation

Open data in policy cycles

Referencing data

Page 38: Using Open Data - David Tarrant

Content created by The Open Data Institute

Thank-youDr David Tarrant | @davetaz | The Open Data Institute

https://xkcd.com/552/