The Future of Informatics in Digital Literature – or Literature and its (Digital) Future. Donat Agosti and Terrance Catapano, Plazi. TDWG, Woods Hole, September 28, 2010

Jan 04, 2016

Transcript
Page 1:

The Future of Informatics in Digital Literature – or Literature and its (Digital) Future

Donat Agosti and Terrance Catapano

Plazi

TDWG, Woods Hole, September 28, 2010

Page 2:

Literature, the tool to formalize our knowledge and make it part of the global knowledgebase.

Page 3:

< 15% of taxonomists opt for Open Access

Source: Zootaxa, publisher of ca 15% of all new taxonomic names

Page 4:

“The current scholarly communication system is nothing but a scanned copy of the paper based system.”

Van de Sompel & Lagoze, 2009, The Fourth Paradigm.

Page 5:

E.g. BHL's emphasis on scanning and images of text…

Page 6:

E.g. BHL's emphasis on scanning and images of text…

… and little effort (by third parties) to provide better access

Page 7:

“An articulated semantic structure facilitates simpler algorithms acting on World Wide Web text and data and is more feasible in the near term than building a layer of complex artificial intelligence to interpret free-form human ideas using some probabilistic approach.”

Ginsparg, 2009, The Fourth Paradigm.

Page 8:

Quantity vs precision

Page 9:

Howard Ratner, Nature: Nature on Mobile: http://river-valley.tv/conferences/stm-innovations-seminar-2009

Page 10:

A semantically enhanced, linked XML document based on clean OCR

Page 11:

TaxonX

Page 12:

Text XML document

<tax:treatment>
  <tax:nomenclature>
    <tax:name>
      <tax:xid source="HNS" identifier="193329"/>
      <tax:xmldata>
        <dc:Genus>Mystrium</dc:Genus>
        <dc:Species>leonie</dc:Species>
      </tax:xmldata>
      Mystrium leonie
    </tax:name>
    <tax:status>n. sp.</tax:status>
    Fig 1 D - F
  </tax:nomenclature>
  <tax:div type="description">
    <tax:p>HOLOTYPE WORKER: TL 3.95, HL 1.02, HW 0.95, CI 93, SL 1.30, SI 137, PW 0.73, ML 0.38. Mandible outer margin strongly curving to a sharp apical tooth, the apex parallel to the anterior clypeal margin. (Holotype with material in mandibles, so mandibles and anterior clypeus described below from paratypes.) Median clypeus....</tax:p>
  </tax:div>
</tax:treatment>
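
As a rough illustration of why this kind of mark-up matters, the sketch below reads the name, status, and external identifier out of a treatment like the one on this slide using only the Python standard library. The slide excerpt omits its namespace declarations, so the two namespace URIs used here are placeholders invented for the example, and `read_name` is a hypothetical helper, not part of any Plazi tool.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (assumption): the slide does not show the real
# xmlns declarations, so these exist only to make the sample parseable.
NS = {"tax": "urn:example:taxonx", "dc": "urn:example:darwincore"}

# Shortened copy of the nomenclature part of the slide's treatment.
SAMPLE = """
<tax:treatment xmlns:tax="urn:example:taxonx" xmlns:dc="urn:example:darwincore">
  <tax:nomenclature>
    <tax:name>
      <tax:xid source="HNS" identifier="193329"/>
      <tax:xmldata>
        <dc:Genus>Mystrium</dc:Genus>
        <dc:Species>leonie</dc:Species>
      </tax:xmldata>
      Mystrium leonie
    </tax:name>
    <tax:status>n. sp.</tax:status>
  </tax:nomenclature>
</tax:treatment>
"""


def read_name(treatment_xml: str) -> dict:
    """Return the atomized name, status, and external id of one treatment."""
    root = ET.fromstring(treatment_xml.strip())
    name = root.find("tax:nomenclature/tax:name", NS)
    xid = name.find("tax:xid", NS)
    return {
        "genus": name.findtext("tax:xmldata/dc:Genus", namespaces=NS),
        "species": name.findtext("tax:xmldata/dc:Species", namespaces=NS),
        "status": root.findtext("tax:nomenclature/tax:status", namespaces=NS),
        "xid_source": xid.get("source"),
        "xid": xid.get("identifier"),
    }


if __name__ == "__main__":
    print(read_name(SAMPLE))
```

Because genus and species are tagged separately from the display string, a program can look them up directly and does not have to guess them from free text.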

Page 13:

Treatment

Page 14:

Treatment ≠ ©

Page 15:

Plazi workflow: GoldenGate mark up as an example

- Get LSID from Hymenoptera Name Server (HNS) for names; ZooBank?

- Add new names

- Get bibliographic metadata from HNS (MODS)

- Get bibliographic GUIDs from bioguid (or EDIT?)

- Get geographic long/lat from geonames.org

- Get GUIDs for: CBOL, NCBI, specimen, images, .....

Page 16:

Plazi Search and Retrieval Server: Access to data

[Diagram labels: TAPIR, SPM; You; human; machine]

Page 17:

Materials examined from literature in GBIF

Page 18:

Facebook tool to mark up legacy publications

Page 19:

Mark-up comes at an (exorbitant) cost…

Page 20:

Mark-up comes at an (exorbitant) cost, if done at the wrong time

Page 21:

Shift from legacy to prospective publishing

Page 22:

Taxpub NLM DTD

Page 23:

Taxpub NLM DTD: a collaboration between

National Library of Medicine

Zookeys

Plazi

Page 24:

Taxpub NLM DTD: taxonomic domain-specific extension of the NLM Publishing and Archiving DTD

NLM DTD

Taxpub DTD

Page 25:

Taxpub/NLM DTD + production workflow

Zookeys

Page 26:

Taxpub/NLM DTD + production workflow

Zookeys

XML, Print, PDF, HTML, Other Sites

External resources

Page 27:

Treatment + external links

Page 28:

Treatment + external links: GUID / LSID

Page 29:

Now that we will have LSIDs in your content in PMC, I was looking for an LSID resolver so that we can build links to all of this content.

But, the only place that I was able to resolve your LSIDs was on your zoobank.org/?lsid= service. I could not resolve them on lsid.tdwg.org or bioguid.info/lsid.php. Perhaps I don’t understand how LSIDs are supposed to work, but I thought that any LSID resolver should be able to resolve them. If only your local resolver resolves them, then are they really LSIDs or are they just zoobank IDs dressed up like LSIDs?

Email from Jeff Beck, NCBI
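
For readers unfamiliar with the identifier syntax behind this question: an LSID is a URN of the form `urn:lsid:authority:namespace:object[:revision]`, and the authority part is what a generic resolver is meant to use to locate the service that can answer for the identifier. The sketch below only splits an LSID into those parts and performs no resolution. The `parse_lsid` helper and the all-zero example identifier are illustrative assumptions, not a real ZooBank record or an existing library API.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    authority: str           # e.g. the domain of the issuing organisation
    namespace: str           # identifier class defined by that authority
    object_id: str           # identifier of the object within the namespace
    revision: Optional[str]  # optional version component


def parse_lsid(text: str) -> Lsid:
    """Split 'urn:lsid:authority:namespace:object[:revision]' into parts."""
    parts = text.strip().split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"not an LSID (wrong number of components): {text!r}")
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID (missing 'urn:lsid:' prefix): {text!r}")
    authority, namespace, object_id = parts[2:5]
    if not (authority and namespace and object_id):
        raise ValueError(f"not an LSID (empty component): {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return Lsid(authority, namespace, object_id, revision)


if __name__ == "__main__":
    # Placeholder identifier (assumption), shaped like a ZooBank-style LSID.
    example = "urn:lsid:zoobank.org:act:00000000-0000-0000-0000-000000000000"
    print(parse_lsid(example))
```

An identifier can therefore be well-formed and still not resolve. That happens when no resolution service can be discovered for its authority, which may be the situation the email describes.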

Page 30:

Why do we do all that?

Page 31:

Technically, we are far beyond the doable

Page 32:

Technically, we are far beyond the doable, we need your input: Why do you want to have a (taxonomic) publication?

Page 33:

Why do you want to have a (taxonomic) publication?

External links? Materials Citations?

Descriptions? Credit?

Page 34:

Where is your data so that it can be linked? How will it be standardized?

Page 35:

http://plazi.org

Thank you very much!

Donat Agosti and Terrance Catapano
