DataCite – Bridging the gap and helping to find, access and reuse data Herbert Gruttemeier OpenAIREplus workshop February 8th, 2013 Braga

DataCite – Bridging the gap and helping to find, access and reuse … · 2016. 1. 14. · DOIs in Use: DataCite CrossRef has registered more than 51 million DOIs on behalf of scholarly

Mar 03, 2021


DataCite – Bridging the gap and helping to find, access and reuse data

Herbert Gruttemeier, OpenAIREplus workshop, February 8th, 2013, Braga


Publishers’ data policies


H. GRUTTEMEIER

Publishers’ data policies

extract from

Nature Publishing Group, Editorial Policies, Availability of data and materials


http://www.doi.org


http://www.handle.net

At the infrastructure level, DOI names are handles.
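Because DOI names are handles, the same prefix/suffix string can be handed to either proxy. The sketch below is only an illustration of that idea: the helper names `parse_doi` and `proxy_urls` are hypothetical, and `http://hdl.handle.net/` is assumed to be the public Handle proxy (the slides name only www.handle.net and www.doi.org).

```python
from typing import Dict, NamedTuple


class DoiName(NamedTuple):
    prefix: str  # registrant prefix, e.g. "10.1594"
    suffix: str  # everything after the first "/"


def parse_doi(name: str) -> DoiName:
    """Split a DOI name (with or without a leading "doi:") into prefix and suffix."""
    name = name.strip()
    if name.lower().startswith("doi:"):
        name = name[4:]
    # Only the first "/" separates prefix and suffix; the suffix may contain more.
    prefix, sep, suffix = name.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI name: {name!r}")
    return DoiName(prefix, suffix)


def proxy_urls(name: str) -> Dict[str, str]:
    """Build resolver URLs for the DOI proxy and the (assumed) Handle proxy."""
    doi = parse_doi(name)
    handle = f"{doi.prefix}/{doi.suffix}"
    return {
        "doi": "http://dx.doi.org/" + handle,
        "handle": "http://hdl.handle.net/" + handle,
    }


print(proxy_urls("doi:10.1594/PANGAEA.757741"))
```

Splitting on the first slash only matters for suffixes that themselves contain slashes, such as 10.1594/eaacinet2007/CR/5-270407.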


From KE workshop presentation, The Hague, June 2011 (L. Lannom)


From KE workshop presentation, The Hague, June 2011 (L. Lannom)


From KE workshop presentation, The Hague, June 2011 (N. Paskin)


Rather: digital identifier of an object (« identifiant numérique d’objet »)

« The objects identified by DOI names may be of any form - digital, physical, or abstract - as all these forms may be necessary parts of a content management system. The DOI system is an abstract framework which does not specify a particular context of its application, but is designed with the aim of working over the Internet. »

Norman Paskin, « Digital Object Identifier (DOI®) System »


DataCite

• Global consortium carried by local institutions
• Focused on improving the scholarly infrastructure around datasets and other non-textual information
• Focused on working with data centres and organisations that hold data
• Providing standards, workflows and best practice
• Initially, but not exclusively, based on the DOI system

• Memorandum of Understanding, Paris, February 2009
• Officially founded December 1st 2009 in London


DataCite Members

• Technische Informationsbibliothek (TIB), Germany
• Canada Institute for Scientific and Technical Information (CISTI)
• California Digital Library, USA
• Purdue University, USA
• Office of Scientific and Technical Information (OSTI), USA
• The British Library
• Technical Information Center of Denmark (DTU)
• Library of TU Delft, The Netherlands
• ZBMed, Germany
• ZBW, Germany
• GESIS, Germany
• Library of ETH Zürich, Switzerland
• Institut de l’Information Scientifique et Technique (INIST-CNRS), France
• Swedish National Data Service (SND)
• Australian National Data Service (ANDS)
• Conferenza dei Rettori delle Università Italiane (CRUI)
• National Research Council of Thailand (NRCT)

Affiliated members:

• Digital Curation Centre, UK
• Microsoft Research
• Inter-university Consortium for Political and Social Research (ICPSR), USA
• Korea Institute of Science and Technology Information (KISTI)
• Beijing Genomics Institute (BGI)


DataCite

The DataCite registration agency
– Maintains the resolution infrastructure
– Maintains a searchable database of metadata
– Manages the identifiers over the long term
– Establishes and shares best practice

Publishing agents (data centres, research institutes, data publishers) are responsible for
– Quality assurance
– Content storage and access
– Creating the identifiers
– Creating and updating metadata


[Figure: sediment core profiles for PS1389-3, PS1390-3, PS1431-1, PS1640-1 and PS1648-1. Each panel plots IRD (grav/10 cm3), Sand (%), CaCO3 (%), TOC (%), Radio (%/sand) and Smect (%/clay) against age (kyr), from 0 to a maximum of 233.55 kyr.]

[Figure: map covering 54° 0' to 55°30' latitude and 11° to 15° longitude. Legend: World vector shore line; Grain size class KOLP A; Grain size class KOLP B; Grain size class KOLP DIN; Grain size class KOEHN; Grain size class KOEHN2; Geochemistry. Scale: 1:2695194 at Latitude 0°. Source: Baltic Sea Research Institute, Warnemünde.]

• Earthquake events => doi:10.1594/GFZ.GEOFON.gfz2009kciu
• Climate models => doi:10.1594/WDCC/dphase_mpeps
• Sea bed photos => doi:10.1594/PANGAEA.757741
• Distributed samples => doi:10.1594/PANGAEA.51749
• Medical case studies => doi:10.1594/eaacinet2007/CR/5-270407
• Computational model => doi:10.4225/02/4E9F69C011BC8
• Audio record => doi:10.1594/PANGAEA.339110
• Grey literature => doi:10.2314/GBV:489185967
• Videos => doi:10.3207/2959859860

What type of data are we talking about?

Anything that is the foundation of further research is research data

Data is evidence


DataCite Structure

[Diagram, top to bottom: International DOI Foundation; DataCite, with Managing Agent (TIB); Member Institutions, each with several Data Centres. Other labels in the diagram: "Works with", "Member", "Associate Stakeholder".]


Bridging the gap

Publishers Data centres

DOIs in Use: DataCite

CrossRef has registered more than 51 million DOIs on behalf of scholarly publishers. But CrossRef DOIs are not the only DOIs available in the scholarly community. DOIs for datasets associated with scholarly research are being registered by institutions in the DataCite network. DataCite and CrossRef have committed to the interoperability of their DOIs. Ideally, scholarly content like journals will cite related data by the appropriate DataCite DOI, and in return, the data record will cite the relevant article’s CrossRef DOI. (from CrossRef Quarterly, January 2012)


Bridging the gap


Connecting article and underlying data via DOI:

The dataset:
Storz, D et al. (2009): Planktic foraminiferal flux and faunal composition of sediment trap L1_K276 in the northeastern Atlantic. http://dx.doi.org/10.1594/PANGAEA.724325

Is supplement to the article:
Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull, Detlef; Kucera, Michal (2009): Seasonal and interannual variability of the planktic foraminiferal flux in the vicinity of the Azores Current. Deep-Sea Research Part I-Oceanographic Research Papers, 56(1), 107-124, http://dx.doi.org/10.1016/j.dsr.2008.08.009

Data citation
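One way the "is supplement to" link between this dataset and its article might be recorded in a metadata record is sketched below. The element and attribute names (`relatedIdentifier`, `relatedIdentifierType`, `relationType="IsSupplementTo"`) follow the DataCite metadata schema as I understand it. The fragment is deliberately incomplete: the XML namespace and the mandatory properties (creators, titles, publisher, publication year) are omitted, and the function name `supplement_link` is hypothetical.

```python
import xml.etree.ElementTree as ET


def supplement_link(dataset_doi: str, article_doi: str) -> ET.Element:
    """Build a partial, DataCite-style record linking a dataset to its article."""
    resource = ET.Element("resource")
    # The dataset's own identifier.
    ET.SubElement(resource, "identifier", identifierType="DOI").text = dataset_doi
    # The article the dataset supplements, expressed as a related identifier.
    related = ET.SubElement(resource, "relatedIdentifiers")
    link = ET.SubElement(
        related,
        "relatedIdentifier",
        relatedIdentifierType="DOI",
        relationType="IsSupplementTo",
    )
    link.text = article_doi
    return resource


record = supplement_link("10.1594/PANGAEA.724325", "10.1016/j.dsr.2008.08.009")
print(ET.tostring(record, encoding="unicode"))
```

The reverse direction, from the article to the dataset, would be recorded on the publisher's side with the article's CrossRef DOI.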


Bridging the gap

• DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence

• DataCite supports data centres by providing workflows and standards for data publication

• DataCite supports publishers by enabling linking from articles to the underlying data

http://www.datacite.org
http://schema.datacite.org
https://mds.datacite.org
http://search.datacite.org
http://oai.datacite.org
http://data.datacite.org
http://stats.datacite.org


Working Groups

• Business Practices
• Criteria for Data Centers
• Identifier Syntax
• Metadata
• Services
• Special Datasets
• Technical Infrastructure


MDS: Central portal allowing access to the metadata from all registered objects (OAI)
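Harvesting that metadata over OAI would typically mean issuing OAI-PMH requests. The sketch below only builds a `ListRecords` URL and sends nothing. The base URL path `/oai` on oai.datacite.org is an assumption, as is the helper name `list_records_url`; the `verb`, `metadataPrefix`, `set` and `from` arguments come from the OAI-PMH protocol itself.

```python
from typing import Optional
from urllib.parse import urlencode

# Assumed endpoint: the slides list http://oai.datacite.org, the "/oai" path is a guess.
OAI_BASE_URL = "http://oai.datacite.org/oai"


def list_records_url(
    metadata_prefix: str = "oai_dc",
    set_spec: Optional[str] = None,
    from_date: Optional[str] = None,
) -> str:
    """Build an OAI-PMH ListRecords request URL (no network access here)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec  # restrict harvesting to one set
    if from_date:
        params["from"] = from_date  # selective harvesting by datestamp
    return OAI_BASE_URL + "?" + urlencode(params)


print(list_records_url(from_date="2013-01-01"))
```

`oai_dc` is used as the default because every OAI-PMH repository is required to support it; richer formats depend on the repository.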


DataCite Metadata 2.2 XML Schema


• Service for displaying DataCite metadata in different formats (BibTeX, RIS, RDF, etc.)

• Content negotiation (through MIME type): a particular representation of the metadata can be requested through the DOI proxy by sending the DOI as a URL (the "http://dx.doi.org" form) together with the desired MIME type

– First implemented by CNRI and CrossRef

• Documentation: http://www.crosscite.org/cn/
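A minimal client-side sketch of such a request, using only the Python standard library. The MIME types in the table are the ones I understand the service to accept and should be checked against the documentation linked above. The helper names `build_request` and `fetch_metadata` are hypothetical.

```python
import urllib.request

DOI_PROXY = "http://dx.doi.org/"  # proxy named on the slide

# Assumed MIME types; the crosscite documentation has the authoritative list.
MIME_TYPES = {
    "bibtex": "application/x-bibtex",
    "ris": "application/x-research-info-systems",
    "rdf": "application/rdf+xml",
}


def build_request(doi: str, fmt: str) -> urllib.request.Request:
    """Prepare a request that asks the DOI proxy for one metadata representation."""
    return urllib.request.Request(
        DOI_PROXY + doi,
        headers={"Accept": MIME_TYPES[fmt]},
    )


def fetch_metadata(doi: str, fmt: str, timeout: float = 10.0) -> str:
    """Send the request and return the response body (needs network access)."""
    with urllib.request.urlopen(build_request(doi, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch_metadata("10.1594/PANGAEA.724325", "bibtex")` would be expected to follow the proxy's redirect and return a BibTeX entry, provided the registration agency supports that type. The URL is the same as for a browser; only the `Accept` header differs.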


Resolution - Current Status

[Diagram elements:]
Client (web browser) requesting PID
Persistent Identifier (DOI, URN, …)
Resolver (DataCite, …), with mapping table PID - URL
Landing page with catalog metadata (human-readable)
Data
Details on data (rich metadata, human-readable)
Details on data (rich structured metadata, machine-actionable)

Problem: not machine-actionable


Content Negotiation - Based on the Solution of CrossRef/DataCite

[Diagram elements:]
Client requesting PID
Persistent Identifier (DOI, URN, …)
Resolver (DataCite, …), with mapping table PID - URL
Web page on data with catalog metadata (human-readable)
Data
Details on data (rich metadata, human-readable)
Details on data (rich structured metadata, machine-actionable)

Different Accept headers, in addition to the URL, requesting different representations of the PID
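The resolver side of this flow can be pictured with a toy in-memory mapping table. This is only a sketch of the idea, not how the DataCite or CrossRef resolvers are implemented: the PID, the URLs and the function name `resolve` are all made up for illustration.

```python
from typing import Dict

# Hypothetical mapping table: PID -> {media type -> target URL}.
MAPPING_TABLE: Dict[str, Dict[str, str]] = {
    "10.1234/example": {
        "text/html": "https://repository.example.org/landing/example",
        "application/rdf+xml": "https://repository.example.org/metadata/example.rdf",
    }
}


def resolve(pid: str, accept: str = "text/html") -> str:
    """Return the redirect target for a PID, taking the Accept header into account."""
    representations = MAPPING_TABLE[pid]
    # Unknown media types fall back to the human-readable landing page.
    return representations.get(accept, representations["text/html"])


print(resolve("10.1234/example"))
print(resolve("10.1234/example", accept="application/rdf+xml"))
```

A browser that sends no special header still lands on the human-readable page, while a machine client asking for structured metadata gets a different target for the same PID.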


List of repositories for research data


Some recent related developments

• Thomson Reuters Data Citation Index
• ORCID official launch
• ODIN European project
• CODATA/ICSTI Working Group on Data Citation
• Creation of the Research Data Alliance


ORCID and DataCite Interoperability Network

« ODIN will build on the ORCID and DataCite initiatives to uniquely identify scientists and data sets and connect this information across multiple services and infrastructures for scholarly communication.

It will address some of the critical open questions in the area: Referencing a data object; Tracking of use and re-use; Links between a data object, subsets, articles, rights statements and every person involved in its life-cycle. »


http://www.codata.org/taskgroups/TGdatacitation/index.html


Thank you