Conclusions
LSIDs suck (sadly)
“suck” is a technical term
DOIs suck ($ € £)
Handles suck less
Metadata matters
RDF rocks
XML schemas suck
What we need:
Unique identifiers
Resolvable
Have metadata
Taxonomic names aren’t enough
Names have too much information
Cherie Booth = Cherie Blair
Names can change when circumstances change
Jonathan Roughgarden = Joan Roughgarden
Names carry meaning
Semantically opaque
identifier has no meaning
trouble with meaning
Zbtb7
POK erythroid myeloid ontogenic factor
Pokemon gene
Pokemon causes cancer
Funny!
Not funny
Zbtb7
LSID parts
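The parts of an LSID can be pulled apart mechanically. The sketch below assumes the usual layout, urn:lsid:authority:namespace:object with an optional trailing revision. The `LSID` type, the `parse_lsid` function, and the example.org identifier are hypothetical names used only for illustration.

```python
from typing import NamedTuple, Optional


class LSID(NamedTuple):
    authority: str           # usually a DNS name owned by the issuer
    namespace: str           # collection within that authority
    object_id: str           # identifier of the object itself
    revision: Optional[str]  # optional version component


def parse_lsid(text: str) -> LSID:
    """Split an LSID of the form urn:lsid:authority:namespace:object[:revision]."""
    parts = text.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"not an LSID: {text!r}")
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID: {text!r}")
    authority, namespace, object_id = parts[2:5]
    if not (authority and namespace and object_id):
        raise ValueError(f"LSID has an empty component: {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return LSID(authority, namespace, object_id, revision)


# Hypothetical identifier, not a registered LSID.
example = parse_lsid("urn:lsid:example.org:names:12345")
```

Note that the authority component is normally a domain name, which is where the dependence on DNS comes from.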
Opaque is a myth
Credit card
LSIDs are nice
Explicit metadata and data access
LSIDs suck
Have to fuss with DNS
Reliant on Internet address
What about DOIs?
Resolve this:
doi:10.1080/10635150490264996
What do you get?
Have subscription?
What, no subscription?
Metadata?
Human-readable documents
Can’t predict what you get
Handles might be useful
hdl:2254/20971
Handle to HTML
HTML is XML
GUID resolving to metadata
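As a rough illustration of what "resolving" a DOI or a Handle involves, the sketch below turns the two identifiers quoted above into URLs on the public proxy servers. The slides do not say which resolver was used; doi.org and hdl.handle.net are assumed here, and `resolver_url` is a hypothetical helper. The code only builds the URL and does not fetch it, because what comes back (a paywall, an HTML page, or metadata) depends on the publisher.

```python
from urllib.parse import quote

# Assumed public proxy resolvers for each identifier scheme.
RESOLVERS = {
    "doi": "https://doi.org/",
    "hdl": "https://hdl.handle.net/",
}


def resolver_url(identifier: str) -> str:
    """Map 'doi:...' or 'hdl:...' to a URL on the matching proxy server."""
    scheme, sep, rest = identifier.partition(":")
    if not sep or not rest:
        raise ValueError(f"expected scheme:value, got {identifier!r}")
    base = RESOLVERS.get(scheme.lower())
    if base is None:
        raise ValueError(f"no resolver known for scheme {scheme!r}")
    # Percent-encode unusual characters but keep the prefix/suffix slash.
    return base + quote(rest, safe="/")


doi_url = resolver_url("doi:10.1080/10635150490264996")
hdl_url = resolver_url("hdl:2254/20971")
```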
Metadata matters
RDF
Resource Description Framework
Simple format (e.g., XML)
Everything is a resource…
…or a literal
supports inference
underpins Semantic Web
“triple”: subject → property → object
http://www.w3.org → dc:publisher → World Wide Web Consortium
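The example triple can be written down as data. This is a minimal sketch that serialises it in N-Triples style without any RDF library. The `Triple` type and `to_ntriples` function are hypothetical, and the `dc:` prefix is assumed to expand to the Dublin Core elements namespace, as it conventionally does.

```python
from typing import NamedTuple

# Conventional expansion of the dc: prefix (Dublin Core elements 1.1).
DC = "http://purl.org/dc/elements/1.1/"


class Triple(NamedTuple):
    subject: str                  # always a resource (URI)
    prop: str                     # always a resource (URI)
    obj: str                      # a resource or a literal
    obj_is_literal: bool = False


def to_ntriples(t: Triple) -> str:
    """Render one triple as a single N-Triples style line."""
    if t.obj_is_literal:
        escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    else:
        obj = f"<{t.obj}>"
    return f"<{t.subject}> <{t.prop}> {obj} ."


w3c = Triple("http://www.w3.org", DC + "publisher",
             "World Wide Web Consortium", obj_is_literal=True)
line = to_ntriples(w3c)
```

Here the subject and property are resources and the object is a literal, which matches the "everything is a resource or a literal" rule.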
RDF is everywhere
Existing vocabularies
Basic metadata (Dublin Core)
Geography (WGS 84)
Publications (PRISM)
People (FOAF)
Rights (Creative Commons)
Requires you to have URIs for objects
URIs include:
URL
URN
DOI
LSID
RDF documents can be independent
Can be as small as one triple
Aggregate triples from different sources
Store in a triple store
Make new inferences
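Aggregating triples and inferring something new can be sketched with a plain Python set standing in for a triple store. Everything under example.org, the `creatorName` property, and the single hand-written rule are hypothetical; a real triple store would use a query language and a reasoner rather than this loop.

```python
DC_CREATOR = "http://purl.org/dc/elements/1.1/creator"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"
EX_CREATOR_NAME = "http://example.org/terms/creatorName"  # hypothetical property

# Two independent RDF documents, each as small as one triple.
source_a = {("http://example.org/paper/1", DC_CREATOR, "http://example.org/person/7")}
source_b = {("http://example.org/person/7", FOAF_NAME, "A. N. Author")}


def match(store, s=None, p=None, o=None):
    """Return triples matching the given pattern; None acts as a wildcard."""
    return [t for t in store
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]


def infer_creator_names(store):
    """Rule: paper dc:creator X and X foaf:name N implies paper creatorName N."""
    inferred = set()
    for paper, _, person in match(store, p=DC_CREATOR):
        for _, _, name in match(store, s=person, p=FOAF_NAME):
            inferred.add((paper, EX_CREATOR_NAME, name))
    return inferred


store = source_a | source_b          # aggregation is just set union
new_triples = infer_creator_names(store)
```

Neither source alone links the paper to a name; the link only appears once both sets of triples share the same URI for the person, which is why URIs for objects are required.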
There are known knowns, things we know that we know
There are known unknowns, things we now know we don’t know
But there are also unknown unknowns, things we do not