Sharing Data on the Web 5-Mar-2013 Linked Data Overview for US EPA Office of Pollution Prevention & Toxics By Bernadette Hyland & Luke Ruth Tuesday, March 5, 13
Sharing Data on the Web

Sep 12, 2014

A presentation on Linked Data to the US EPA Office of Pollution Prevention and Toxics
Transcript
Page 1: Sharing Data on the Web

Sharing Data on the Web

5-Mar-2013

Linked Data Overview for US EPA Office of Pollution Prevention & Toxics

By Bernadette Hyland & Luke Ruth

Tuesday, March 5, 13

Page 2: Sharing Data on the Web

Agenda

• Intros ...

• Trends in data management

• Government data publication

• Update on EPA Linked Data Service

• EPA OPPT sharing data on the Web

• Review Next steps ...

Page 3: Sharing Data on the Web

3 Round Stones produces the leading platform for the publication of reusable data on the Web. Our commercially supported Open Source platform is used by the Fortune 2000 and US Government agencies to collect, publish and reuse data, both on the public Internet and behind institutional firewalls.

Page 4: Sharing Data on the Web

US EPA Linked Data

• Cloud-based Linked Data provision of 3 core programs:

• 2.9M Facilities

• 100K substances

• 25 years of toxic pollution reports

• FISMA compliant

• 16 Callimachus templates

• Official launch April 2013

Page 5: Sharing Data on the Web

Page 6: Sharing Data on the Web

Guidance for developers

Page 7: Sharing Data on the Web

US GPO

• Cloud-based Linked Data provision of persistent URLs for US Government documents:

• 100k+ documents

• Used by 1,240 Federal Depository Libraries and public

• In 3rd year of operation

• Deemed an “Essential service” supporting US Congress

Page 9: Sharing Data on the Web

Page 10: Sharing Data on the Web

Trends in government data management

Page 11: Sharing Data on the Web

Page 12: Sharing Data on the Web

Open Government Data

Page 13: Sharing Data on the Web

“We’re moving from managing documents to managing discrete pieces of open data and content which can be tagged, shared, secured, mashed up and presented in the way that is most useful for the consumer of that information.”

-- Report on Digital Government: Building a 21st Century Platform to Better Serve the American People

Growing chorus ...

Page 14: Sharing Data on the Web

Page 15: Sharing Data on the Web

Photo credit: http://www.flickr.com/photos/glennharper/4452247708/

Page 16: Sharing Data on the Web

Big Data

Simple data

Complex data

Legacy data

Page 17: Sharing Data on the Web

Governments

Goals: Governmental transparency and/or improved internal efficiencies (data warehouses)

Page 18: Sharing Data on the Web

Page 19: Sharing Data on the Web

Page 20: Sharing Data on the Web

Page 21: Sharing Data on the Web

HELPING DEFINE THE PROCESS

Identify, Model, Name, Describe, Convert, Publish, Maintain

Page 22: Sharing Data on the Web

Path to Success

• Start easy

• Well curated datasets with relevant data

• Reach out to developers

• Get others involved early

• Ensure internal benefit

• Integrate related datasets

• Address data quality ...

• Multiple approaches including crowdsourcing

Page 23: Sharing Data on the Web

Put it on the Web

• Upload & share it

• Document what is available

• Document how to use it

• Solve a customer need

• Encourage feedback

• Continuous improvement

Page 24: Sharing Data on the Web

Use a non-proprietary format

• Open Web data exchange formats that improve access and re-use

• RDF instead of CSV

• Benefits

• Accessibility & Interoperability

• Reduce risk of

• Confidential info

• Software viruses
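
To make "RDF instead of CSV" concrete, here is a minimal sketch, using only the Python standard library, of turning CSV rows into N-Triples statements. The base URI, the column names and the sample row are hypothetical and chosen only for illustration; they are not taken from any EPA dataset, and a production conversion would reuse published vocabularies and typed literals.

```python
import csv
import io

# Hypothetical base URI, used only for illustration.
BASE = "http://example.org/"


def csv_to_ntriples(csv_text, id_column):
    """Turn each CSV row into N-Triples lines: one triple per non-key cell."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # The key column becomes part of the subject URI.
        subject = f"<{BASE}facility/{row[id_column]}>"
        for column, value in row.items():
            if column == id_column or not value:
                continue  # skip the key column and empty cells
            predicate = f"<{BASE}vocab/{column}>"
            # Escape backslashes and quotes so the literal stays valid.
            literal = value.replace("\\", "\\\\").replace('"', '\\"')
            lines.append(f'{subject} {predicate} "{literal}" .')
    return lines


# Made-up sample data with hypothetical column names.
SAMPLE = "id,name,state\n42,Example Plant,CA\n"

for triple in csv_to_ntriples(SAMPLE, "id"):
    print(triple)
```

Each row becomes a set of statements about one URI, so other datasets can refer to the same facility by that URI instead of by a row position in a file.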

Page 25: Sharing Data on the Web

Open data + open standards + open platforms

Highly scalable computing & hosting via the

Cloud

International Data Exchange Standards

5 Star Data (Linked Data)

Leverage Open Source tools

Page 26: Sharing Data on the Web

It’s the Web of Data

• Universal unidirectional links using URLs

• “Cooperation without coordination”

• It’s simple ... nodes and links

Page 27: Sharing Data on the Web

Universal Identifiers

• It’s the foundation of the Web

• Others can reference things

• Two references with the same URI are the same thing

• Quick, easy and scalable

• People keep coming back for more!!
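
The point that two references with the same URI are the same thing can be sketched in a few lines of Python. The URIs, property names and the two small datasets below are hypothetical, invented for illustration; the idea is only that statements published independently can be combined without prior coordination, because they share an identifier.

```python
from collections import defaultdict

# Hypothetical URIs and property names; real datasets define their own.
MERCURY = "http://example.org/substance/mercury"
LEAD = "http://example.org/substance/lead"

# Two independently published datasets that happen to use the same URI.
dataset_a = [(MERCURY, "label", "Mercury")]
dataset_b = [(MERCURY, "symbol", "Hg"), (LEAD, "symbol", "Pb")]


def merge_by_uri(*datasets):
    """Group (subject, property, value) statements from all datasets by subject URI."""
    merged = defaultdict(dict)
    for dataset in datasets:
        for subject, prop, value in dataset:
            merged[subject][prop] = value
    return dict(merged)


combined = merge_by_uri(dataset_a, dataset_b)
print(combined[MERCURY])  # {'label': 'Mercury', 'symbol': 'Hg'}
```

No join key had to be negotiated between the two publishers: the shared URI is the join key.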

Page 28: Sharing Data on the Web

Social Responsibility

• Responsibility to maintain published data

• Publish frequency of data updates

• Have a persistence strategy

• Ensure data is as accurate as possible

• Respond to reports of problematic data

Page 29: Sharing Data on the Web

Clinical Trials + enterprise linked data

US Legislation + enterprise data

DBpedia + enterprise datasets

Data driven Web apps using Callimachus

Page 30: Sharing Data on the Web

Page 31: Sharing Data on the Web

User

NOAA

US EPA AirNow

DBpedia

National Library of Medicine

US EPA SunWise

Page 32: Sharing Data on the Web

Page 33: Sharing Data on the Web

Page 34: Sharing Data on the Web

From Wikipedia

From EPA

Open Street Map

Page 35: Sharing Data on the Web

Page 36: Sharing Data on the Web

We’ve Seen This Before

Page 37: Sharing Data on the Web

HOW IT IS DONE TODAY ...

Page 38: Sharing Data on the Web

Audience for EPA Data

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

Page 39: Sharing Data on the Web

How much mercury did Hanson Permanente Cement release in 2004?

Page 40: Sharing Data on the Web

Page 41: Sharing Data on the Web

Web Portals

Page 42: Sharing Data on the Web

Page 43: Sharing Data on the Web

Page 44: Sharing Data on the Web

Page 45: Sharing Data on the Web

Page 46: Sharing Data on the Web

Finding Hanson Permanente

Page 47: Sharing Data on the Web

Finding Mercury Released in 2004

Page 48: Sharing Data on the Web

Compliance Report

Page 49: Sharing Data on the Web

Potential Audience

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

XX

X

Page 50: Sharing Data on the Web

Linked Data Approach

Page 51: Sharing Data on the Web

Finding Hanson Permanente

Page 52: Sharing Data on the Web

Finding Mercury Released in 2004

Page 53: Sharing Data on the Web

TRI Report

Page 54: Sharing Data on the Web

Data Reuse

Page 55: Sharing Data on the Web

Potential Audience

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

Page 56: Sharing Data on the Web

Page 57: Sharing Data on the Web

Page 58: Sharing Data on the Web

Credits

David Newman, Gartner: “Innovation Insight: Linked Data Drives Innovation Through Information-Sharing Network Effects”, published 15 December 2011

David Wood, ed. Linking Government Data, Springer (2011) http://3roundstones.com/linking-government-data/

US Executive Branch, Digital Government Strategy: Building a 21st Century Platform to Better Serve the American People, http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital-government.html

W3C Linked Data Cookbook http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook

All other photos and images © 2010-2012 3 Round Stones, Inc. and released under a CC-by-sa license

Page 59: Sharing Data on the Web

This work is Copyright © 2011-2012 3 Round Stones Inc.

It is licensed under the Creative Commons Attribution 3.0 Unported License.

Full details at: http://creativecommons.org/licenses/by/3.0/

You are free:

to Share — to copy, distribute and transmit the work

to Remix — to adapt the work

Under the following conditions:

Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).

Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.
