Top Banner
The Web of Data and its Five Stars Richard Cyganiak, DERI, NUI Galway @cygri 6 June 2012 Realising and Exploiting the EU data cloud European Data Forum, Copenhagen, Denmark
39

EDF2012: The Web of Data and its Five Stars

Jan 14, 2015

Download

Technology

 
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: EDF2012: The Web of Data and its Five Stars

The Web of Data and its Five Stars

Richard Cyganiak, DERI, NUI Galway���@cygri

6 June 2012

Realising and Exploiting the EU data cloud

European Data Forum, Copenhagen, Denmark

Page 2: EDF2012: The Web of Data and its Five Stars

Generating insight from data

•  Today, data is abundant

•  New middlemen find new ways of getting data to the end user

•  Supply and demand for data higher than ever

•  Analyst's problem is no longer a lack of relevant data, but:

•  Understanding data

•  Assessing applicability

•  Getting it into the right form for use

•  Similar problems inside and outside of the firewall

Page 3: EDF2012: The Web of Data and its Five Stars

From the Web ���to the Web of Data

Page 4: EDF2012: The Web of Data and its Five Stars

Tim Berners-Lee’s 5-star plan for an open web of data

★ Make data available on the Web under an open license

★★ Make it available as structured data

★★★ Use a non-proprietary format

★★★★ Use URIs to identify things

★★★★★ Link your data to other people’s data ���to provide context

Page 5: EDF2012: The Web of Data and its Five Stars

The 0th star

• Data catalog with good metadata

• Make your data findable

Page 6: EDF2012: The Web of Data and its Five Stars
Page 7: EDF2012: The Web of Data and its Five Stars

Data on the Web, Open License

Page 8: EDF2012: The Web of Data and its Five Stars

Open Data

Page 9: EDF2012: The Web of Data and its Five Stars

Government data catalogs

Page 10: EDF2012: The Web of Data and its Five Stars

Open vs. Closed Data used to be closed by default.���

In the future, it will be open by default.

Page 11: EDF2012: The Web of Data and its Five Stars

Is open data just for governments?

Page 12: EDF2012: The Web of Data and its Five Stars
Page 13: EDF2012: The Web of Data and its Five Stars
Page 14: EDF2012: The Web of Data and its Five Stars
Page 15: EDF2012: The Web of Data and its Five Stars

Good reasons against opening data

•  Privacy

•  Competitive advantage

•  Producing data and charging for it as business model

•  Can't get license from upstream

Page 16: EDF2012: The Web of Data and its Five Stars

Business models

Scott Brinker, http://www.chiefmartec.com/2010/01/7-business-models-for-linked-data.html

Page 17: EDF2012: The Web of Data and its Five Stars

Data licenses

http://opendefinition.org/licenses/

Page 18: EDF2012: The Web of Data and its Five Stars

Structured Data

★★

Page 19: EDF2012: The Web of Data and its Five Stars

Enabling re-use

•  Delivering data to end users in different forms

•  Combining data with other data

•  3rd party analysis of data

Page 20: EDF2012: The Web of Data and its Five Stars

Formats in government data

•  Good for re-use: MS Excel, CSV, XML, JSON, Microdata

•  Not so good for re-use: Pure websites, MS Word

•  Bad for re-use: PDF

•  Really bad for re-use: Only charts/maps without numbers

Page 21: EDF2012: The Web of Data and its Five Stars

Symptom: Screenscraping

Page 22: EDF2012: The Web of Data and its Five Stars

Non-Proprietary Formats

★★★

Page 23: EDF2012: The Web of Data and its Five Stars

Specialist formats

•  Specialist tools often have specialist formats

•  Few people have the tools

•  Expensive

•  Difficult to re-use

•  (Geospatial tools, statistics packages, etc.)

Page 24: EDF2012: The Web of Data and its Five Stars
Page 25: EDF2012: The Web of Data and its Five Stars

Non-proprietary formats, open standards

•  CSV (dead simple)

•  XML

•  JSON

•  RDF (good for 4+5 stars)

•  OGC web services

•  OAI-ORE web services

Page 26: EDF2012: The Web of Data and its Five Stars

Use URIs as Identifiers

★★★★

Page 27: EDF2012: The Web of Data and its Five Stars

http://www.bbc.co.uk/music/artists/79239441-bfd5-4981-a70c-55c3f15c1287

Page 28: EDF2012: The Web of Data and its Five Stars

http://data.ordnancesurvey.co.uk/id/postcodeunit/HA99HD

Page 29: EDF2012: The Web of Data and its Five Stars

http://opencorporates.com/companies/us_vt/F013910

Page 30: EDF2012: The Web of Data and its Five Stars
Page 31: EDF2012: The Web of Data and its Five Stars

Turning local identifiers into URIs–Why?

• Make them globally unique

• Clarify authority

• Make them resolvable

• Make them linkable

http://data.ordnancesurvey.co.uk/id/7000000000017765

Page 32: EDF2012: The Web of Data and its Five Stars

The schema level

By using URIs, connections that existed only in people's minds can be put explicitly into the data model.

Page 33: EDF2012: The Web of Data and its Five Stars

Include Links to Other Data

★★★★★

Page 34: EDF2012: The Web of Data and its Five Stars

Hyperlinks are the soul of the Web.���

The Web of Data is no different.

Page 35: EDF2012: The Web of Data and its Five Stars

Central Contractor Registration (CCR)

Geonames

Data links

Page 36: EDF2012: The Web of Data and its Five Stars

Linked Data Principles

1.  Use URIs to name things (not only documents, but also people, locations, concepts, etc.)

2.  To enable agents (human users and machine agents alike) to look up those names, use HTTP URIs

3.  When someone looks up a URI, provide useful information (structured data in RDF, SPARQL).

4.  Include links to other URIs allowing agents to discover more things

http://www.w3.org/DesignIssues/LinkedData.html

Page 37: EDF2012: The Web of Data and its Five Stars
Page 38: EDF2012: The Web of Data and its Five Stars

Summary

•  In the future, data will be open by default, unless good reason not to

•  Emergence of a web of data

•  “Five-star plan” for getting there, dataset by dataset

•  2 stars: re-usable data!

•  3 stars: open standards!

•  4+5 stars: connect the silos!

Page 39: EDF2012: The Web of Data and its Five Stars

Thank You!

[email protected]

@cygri