Using Darwin Core as a Model: An Ontologically Minimalist Approach to Publishing Occurrence Data in RDF Joel Sachs Formal Models track of the Semantics for Biodiversity Symposium TDWG 2013

Jan 14, 2016

Page 1:

Using Darwin Core as a Model: An Ontologically Minimalist Approach to Publishing Occurrence Data in RDF

Joel Sachs

Formal Models track of the Semantics for Biodiversity Symposium, TDWG 2013

Page 2:

The first thing I want to communicate:

Semantics != Ontologies

Page 3:

Semantics = Ontologies ?

• Semantics
  – Semiotics
  – Linguistics
  – Psychology

• Ontology
  – Philosophy
  – Computer Science

Page 4:

Ontologies as a vehicle for semantics

• Ontologies were the first choice for putting the “semantic” in semantic web.

• But ontologies aren’t the only way to supply semantics.

• Furthermore, ontologies can be a barrier to shared semantics, in a number of ways.

Page 5:

What’s green?

• Def 1:

Page 6:

What’s green?

• Def 2: Green is the portion of the electromagnetic spectrum with a wavelength between 520 and 570 nm.

What’s electromagnetic?
What’s a spectrum?
What’s a wavelength?
What’s a nanometer?

Page 7:

Occurrence
  – Occurrence_ID
  – Location_ID: URI
  – DateTime: DateTime
  – IndividualOrganism_ID: URI

Location
  – Location_ID: URI
  – Latitude: float
  – Longitude: float
  – Datum: URI

Identification
  – Identification_ID
  – Individual_ID: URI
  – Taxon: URI
  – Identified_by: URI

Taxon
  – Taxon_ID
  – Scientific Name
  – Vernacular Name
  – Authorship
  – Year
  – etc.

Occurrence
  – Occurrence_ID
  – Latitude
  – Longitude
  – Scientific Name
  – Vernacular Name
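One way to read this slide is that the single-table Occurrence is a flattened view of the linked Occurrence, Location, Identification, and Taxon records. The Python sketch below illustrates that reading only. The class layout, the `flatten` helper, the rule for joining an identification to an occurrence through the individual organism, and all identifiers and values are assumptions made for the example; none of them come from the talk.

```python
from dataclasses import dataclass


@dataclass
class Location:
    location_id: str
    latitude: float
    longitude: float
    datum: str


@dataclass
class Taxon:
    taxon_id: str
    scientific_name: str
    vernacular_name: str


@dataclass
class Identification:
    identification_id: str
    individual_id: str
    taxon: str  # Taxon_ID of the identified taxon
    identified_by: str


@dataclass
class Occurrence:
    occurrence_id: str
    location_id: str
    date_time: str
    individual_organism_id: str


def flatten(occ, locations, identifications, taxa):
    """Join the linked records into the single-table view of an occurrence."""
    loc = locations[occ.location_id]
    # Assumption: an identification applies to an occurrence when both
    # refer to the same individual organism.
    ident = next(i for i in identifications
                 if i.individual_id == occ.individual_organism_id)
    taxon = taxa[ident.taxon]
    return {
        "Occurrence_ID": occ.occurrence_id,
        "Latitude": loc.latitude,
        "Longitude": loc.longitude,
        "Scientific Name": taxon.scientific_name,
        "Vernacular Name": taxon.vernacular_name,
    }


# All identifiers and values below are invented for illustration.
locations = {"loc:1": Location("loc:1", 39.25, -76.71, "datum:WGS84")}
taxa = {"taxon:1": Taxon("taxon:1", "Danaus plexippus", "Monarch")}
identifications = [Identification("id:1", "org:1", "taxon:1", "person:1")]
occ = Occurrence("occ:1", "loc:1", "2013-10-28T10:00:00", "org:1")

flat = flatten(occ, locations, identifications, taxa)
print(flat)
```

Both shapes carry the same facts about the occurrence; they differ in how many kinds of things a consumer has to model before using the data.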

Page 8:

There are many ways to think about biodiversity data.

Page 9:

Thing #2 that I want to communicate

Darwin Core (as it is) can be used as a lightweight “ontology”.
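As a rough illustration of what using Darwin Core directly could look like, the sketch below serializes one flat occurrence record as N-Triples, with Darwin Core terms from the `http://rs.tdwg.org/dwc/terms/` namespace as the only vocabulary. The `occurrence_triples` helper, the example.org subject URI, and the field values are assumptions made for the example; typing every value as a plain literal is a simplification, not a recommendation from the talk.

```python
DWC = "http://rs.tdwg.org/dwc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def occurrence_triples(subject, record):
    """Return N-Triples lines that type `subject` as dwc:Occurrence and
    attach each flat Darwin Core field as a plain literal."""
    lines = [f"<{subject}> <{RDF_TYPE}> <{DWC}Occurrence> ."]
    for term, value in record.items():
        # Escape backslashes and double quotes for the N-Triples literal.
        escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{subject}> <{DWC}{term}> "{escaped}" .')
    return lines


# Hypothetical record: the subject URI and the values are invented.
record = {
    "scientificName": "Danaus plexippus",
    "vernacularName": "Monarch",
    "decimalLatitude": 39.25,
    "decimalLongitude": -76.71,
}
triples = occurrence_triples("http://example.org/occurrence/1", record)
print("\n".join(triples))
```

No class hierarchy or upper ontology is needed to publish or query data in this shape; the term definitions in Darwin Core itself carry the meaning.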

Page 15:

Don’t try this at home

Page 16:

Thing #3

How to minimize the amount of ontology in the Core.

Page 17:

Example: Material Sample

dwctype:MaterialSample (roughly?) corresponds to OBI:Specimen.

Page 18:

<owl:Class rdf:about="http://purl.obolibrary.org/obo/OBI_0100051">
  <owl:equivalentClass>
    <owl:Class>
      <owl:intersectionOf rdf:parseType="Collection">
        <rdf:Description rdf:about="http://purl.obolibrary.org/obo/BFO_0000040"/>
        <owl:Restriction>
          <owl:onProperty rdf:resource="http://purl.obolibrary.org/obo/BFO_0000087"/>
          <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/obo/OBI_0000112"/>
        </owl:Restriction>
      </owl:intersectionOf>
    </owl:Class>
  </owl:equivalentClass>
  <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/BFO_0000040"/>

Page 21:

<owl:disjointWith rdf:resource="http://purl.obolibrary.org/obo/BFO_0000141"/>
<n0pred:IAO_0000602>(forall (x) (if (MaterialEntity x) (IndependentContinuant x))) // axiom label in BFO2 CLIF: [019-002] </n0pred:IAO_0000602>
<n0pred:BFO_0000179>material</n0pred:BFO_0000179>
<n0pred:BFO_0000180>MaterialEntity</n0pred:BFO_0000180>
<n0pred:IAO_0000602>(forall (x) (if (and (Entity x) (exists (y t) (and (MaterialEntity y) (continuantPartOfAt x y t)))) (MaterialEntity x))) // axiom label in BFO2 CLIF: [021-002] </n0pred:IAO_0000602>
<n0pred:IAO_0000602>(forall (x) (if (and (Entity x) (exists (y t) (and (MaterialEntity y) (continuantPartOfAt y x t)))) (MaterialEntity x))) // axiom label in BFO2 CLIF: [020-002] </n0pred:IAO_0000602>

Page 23:

curl -L -H "Accept: application/rdf+xml" http://rs.tdwg.org/dwc/dwctype/MaterialSample | grep OBI

<rdf:Description rdf:about="http://rs.tdwg.org/dwc/dwctype/MaterialSample">
  <rdfs:label xml:lang="en-US">MaterialSample</rdfs:label>
  <rdfs:comment xml:lang="en-US">A resource describing the physical results of a sampling (or subsampling) event. In biological collections, the material sample is typically collected, and either preserved or destructively processed.</rdfs:comment>
  <rdfs:isDefinedBy rdf:resource="http://rs.tdwg.org/dwc/dwctype/"/>
  <dcterms:issued>2013-03-28</dcterms:issued>
  <dcterms:modified>2013-09-26</dcterms:modified>
  <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/>
  <dcterms:hasVersion rdf:resource="http://rs.tdwg.org/dwc/dwctype/history/#MaterialSample-2013-06-24"/>
  <dcam:memberOf rdf:resource="http://rs.tdwg.org/dwc/terms/DwCType"/>
  <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/OBI_0100051"/>
  <dwcattributes:status>recommended</dwcattributes:status>
  <dwcattributes:decision rdf:resource="http://rs.tdwg.org/dwc/terms/history/decisions/Decision_2013-10-09_12"/>
  <dwcattributes:abcdEquivalence>DataSets/DataSet/Units/Unit</dwcattributes:abcdEquivalence>
</rdf:Description>

Page 24:

curl -L -H "Accept: application/rdf+xml" http://rs.tdwg.org/dwc/dwctype/MaterialSample | grep OBI

<rdf:Description rdf:about="http://rs.tdwg.org/dwc/dwctype/MaterialSample">

<rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/OBI_0100051"/>

</rdf:Description>
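The same check can be done offline, as a small Python sketch using only the standard library. It parses the excerpt above and lists the `rdfs:subClassOf` targets asserted for each described resource. The `rdf:RDF` wrapper element is added here so the excerpt parses on its own, and the `superclasses` helper is a hypothetical name chosen for the example.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"

# The excerpt from the slide, wrapped in an rdf:RDF root so it is well-formed.
SNIPPET = f"""
<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}">
  <rdf:Description rdf:about="http://rs.tdwg.org/dwc/dwctype/MaterialSample">
    <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/OBI_0100051"/>
  </rdf:Description>
</rdf:RDF>
"""


def superclasses(rdf_xml):
    """Map each described resource to the rdfs:subClassOf targets asserted for it."""
    found = {}
    root = ET.fromstring(rdf_xml)
    for desc in root.iter(f"{{{RDF}}}Description"):
        about = desc.get(f"{{{RDF}}}about")
        for sub in desc.findall(f"{{{RDFS}}}subClassOf"):
            found.setdefault(about, []).append(sub.get(f"{{{RDF}}}resource"))
    return found


links = superclasses(SNIPPET)
print(links)
```

This only handles the `rdf:Description` form shown on the slide; a general RDF/XML document would need a real RDF parser.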

Page 25:

On the one hand

• Nobody forces a consuming application to ingest the OBI and BFO ontologies when it ingests Darwin Core.

• So what’s the big deal?

Page 26:

On the other hand

• Many semantic web clients automatically fetch and load referenced documents.
  – Especially if the documents are referenced with important properties like rdfs:subClassOf

• It’s bad form (and slightly dangerous) to clutter a semantic web document with terms from unnecessary namespaces.
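To make the first concern concrete, here is a toy Python sketch of a client that follows `rdfs:subClassOf` links. The `WEB` dictionary stands in for dereferencing URIs over HTTP, the `follow_your_nose` function is a hypothetical name, and the prefixed names abbreviate the URIs on the earlier slides. The OBI_0100051 to BFO_0000040 link and the BFO_0000141 disjointness are taken from those slides, on the assumption that the page 21 excerpt describes BFO_0000040. No real client is being modeled.

```python
from collections import deque

SUBCLASS = "rdfs:subClassOf"
DISJOINT = "owl:disjointWith"

# Toy stand-in for the web: URI -> triples returned when it is dereferenced.
WEB = {
    "dwctype:MaterialSample": [("dwctype:MaterialSample", SUBCLASS, "obo:OBI_0100051")],
    "obo:OBI_0100051": [("obo:OBI_0100051", SUBCLASS, "obo:BFO_0000040")],
    "obo:BFO_0000040": [("obo:BFO_0000040", DISJOINT, "obo:BFO_0000141")],
}


def follow_your_nose(start, follow=(SUBCLASS,)):
    """Breadth-first load: dereference `start`, then every object that is
    reached through one of the followed properties."""
    loaded, queue, seen = [], deque([start]), {start}
    while queue:
        uri = queue.popleft()
        for s, p, o in WEB.get(uri, []):
            loaded.append((s, p, o))
            if p in follow and o not in seen:
                seen.add(o)
                queue.append(o)
    return loaded


graph = follow_your_nose("dwctype:MaterialSample")
print(len(graph), "triples loaded")
```

In this toy setup a client that asked only for the Darwin Core term ends up holding a BFO disjointness axiom, which is the kind of unrequested content the slide warns about.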

Page 27:

My suggestion?

• Assertions that tie Core terms to upper ontologies should be asserted in a separate document.

E.g.

<rdf:Description rdf:about="http://rs.tdwg.org/dwc/dwctype/MaterialSample">
  <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/OBI_0100051"/>
</rdf:Description>

should be asserted in obi.owl, or dwc_obi.owl

• That way, those doing integration that depends on OBI axioms can ingest the appropriate descriptions.

• Those that don’t need the OBI axioms don’t have to worry about incorrect inference.
  – Keep in mind: There is no preferred upper ontology for science on the semantic web.
    • BFO, DOLCE, SUMO, UMBEL, NULO, etc.
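The suggestion above can be sketched in Python by treating each document as a set of triples. The core document carries no reference to any upper ontology, the mapping lives in its own document, and each consumer chooses what to merge. The `load` and `terms_from` helpers are hypothetical names, and the single `rdfs:label` triple is a stand-in for the rest of the core description.

```python
SUBCLASS = "http://www.w3.org/2000/01/rdf-schema#subClassOf"
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
MATERIAL_SAMPLE = "http://rs.tdwg.org/dwc/dwctype/MaterialSample"
OBI_SPECIMEN = "http://purl.obolibrary.org/obo/OBI_0100051"
OBO = "http://purl.obolibrary.org/obo/"

# Core vocabulary document: no reference to any upper ontology.
core = {(MATERIAL_SAMPLE, LABEL, "MaterialSample")}

# Separate mapping document (the slide suggests a name such as dwc_obi.owl).
dwc_obi = {(MATERIAL_SAMPLE, SUBCLASS, OBI_SPECIMEN)}


def load(*documents):
    """Merge the chosen documents into one set of triples."""
    merged = set()
    for doc in documents:
        merged |= doc
    return merged


def terms_from(graph, prefix):
    """Return every URI in `graph` that starts with `prefix`."""
    return {term for triple in graph for term in triple if term.startswith(prefix)}


minimal = load(core)              # consumer that does not want OBI
integrated = load(core, dwc_obi)  # consumer that opts in to the OBI mapping

print(terms_from(minimal, OBO))
print(terms_from(integrated, OBO))
```

A mapping to a different upper ontology would be a third document alongside `dwc_obi`, loaded only by the consumers who want it.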

Page 28:

Thank you for paying attention!

Questions, comments, and criticism to

@[email protected]