Linked Open Data as an Enabler for Team Science Deborah L. McGuinness Tetherless World Senior Constellation Chair Professor of Computer and Cognitive Science Rensselaer Polytechnic Institute, Troy, NY & CEO McGuinness Associates, Latham, NY Science of Team Science; LOD and Team Science April 19, 2012
This talk introduces Linked Data and Semantic Web by using two examples - population sciences grid and semantAqua - a semantically enabled environmental monitoring. It shows a few tools and the semantic methodology and opens discussion for LOD and team science
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Linked Open Data as an Enabler for
Team Science
Deborah L. McGuinness Tetherless World Senior Constellation Chair
Professor of Computer and Cognitive Science
Rensselaer Polytechnic Institute, Troy, NY
& CEO McGuinness Associates, Latham, NY
Science of Team Science; LOD and Team Science April 19, 2012
Background
– Semantic Technologies – technological support for
encoding meaning in a form computers can
understand and manipulate – are maturing and
increasing in usage
– Computational encodings of meaning can be used
to help integrate, link, validate, filter,…. Essentially
to make smarter, more context-aware applications
– Semantic Technologies enable linking data … and
linked data provides a way of connecting and
traversing information, nodes, graphs, webs, …
Linked Data
• Linked Data is quite simple and follows principles set
out by Berners-Lee in
http://www.w3.org/DesignIssues/LinkedData.html
– Use URIs as names for things
– Use HTTP URIs so that people can look up those names.
– When someone looks up a URI, provide useful information,
using the standards (RDF*, SPARQL)
– Include links to other URIs. so that they can discover more
• Claim: all of this is being done now – but not at
scale 14
Updates and Motivations from a
Computer Science Perspective
Old:
• Raw conversions
• Per-dataset vocabularies
• Custom queries
• Custom data
management code
• Limited use because of
Google Visualization
licenses
• State-level data
New:
• Enhanced conversions
• Vocabulary reuse
• Generic queries
• Re-usable data
management code
• Unlimited use of new
open source visualization
toolkit
• State and county-level
data 15
RDF Data Cube
Vocabulary
• For publishing multi-dimensional data, such as statistics, on the web in such a way that it can be linked to related data sets and concepts using RDF.
• Compatible with the cube model that underlies SDMX (Statistical Data and Metadata eXchange).
• Also compatible with: – SKOS, SCOVO, VoiD,
FOAF, Dublin Core Terms
• Integrated with the LOGD
data conversion
infrastructure
• Integrated with other tooling
like Stats2RDF
16
County
average life
expectancy (Summary Measures of Health)
SemantEco/SemantAqua
• Enable/Empower citizens &
scientists to explore pollution
sites, facilities, regulations, and
health impacts along with
provenance.
• Demonstrates semantic
monitoring possibilities.
• Map presentation of analysis
• Explanations and Provenance
available
1
2 3
http://was.tw.rpi.edu/swqp/map.html and
http://aquarius.tw.rpi.edu/projects/semantaqua
4 5
1. Map view of analyzed results
2. Explanation of pollution
3. Possible health effect of contaminant (from EPA)
4. Filtering by facet to select type of data
5. Link for reporting problems
6. Now joint with USGS resource managers ; expanded to
endangered species; now more virtual observatory style
• Semantic Technologies and Linked Data are powering a wide array of application – many in Big Science, Team Science, at least interdisciplinary science
• Labeled graphs as powered by structured data can be a nice corpus for evaluation
• Tools and methodologies are ready for use
• We love to partner in these areas
• What do you need or want from linked data?
Questions? - dlm @ cs . rpi . edu
Extra
Directions
23
• Incorporation of TWC data Quality Facts label (Zednik et al)
• Use of DataFAQs automated data quality framework (Lebo et al)