Top Banner
#OAER12 22/10/2012 Afternoon session Opening up Science
42

Opendatasessions

Jun 27, 2015

Download

Documents

At OAER12, Jan Haspeslagh from VLIZ and Henk Harmsen from DANS talk about their experiences with opening up data
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Opendatasessions

#OAER1222/10/2012

Afternoon sessionOpening up Science

Page 2: Opendatasessions

Open, if it’s possibleand

closed, if it has to be

Archiving research data and making it available @DANS

Henk Harmsen

Open Access to Excellence in ResearchOctober 22, 2012 - 9.15 – 17.00 u

KVAB, Brussels

Page 3: Opendatasessions

Contents

• Data is hot!• About DANS• Storing & Sharing• Linked resources• Modes of access

Page 4: Opendatasessions
Page 5: Opendatasessions

NiederlandeRenommierter Psychologe gesteht Fälschungen

Page 6: Opendatasessions

Data is hot!

• Article on “trends for 2012”: “Keeping your research data secret until they are finally printed in a scientific journal is so 2011”

• Neelie Kroes (Vice-President of the European Commission responsible for the Digital Agenda): “Data is the new gold”

Page 7: Opendatasessions

What is DANS?

• Institute of Dutch Academy and Research Funding

Organisation (KNAW & NWO) since 2005

• First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989

• Mission: promote and provide permanent access to digital research information (started with data archives in the humanities and social sciences)

Page 8: Opendatasessions

Our main activities and services

• Encourage researchers to self-archive and reuse data by means of our Electronic Archiving SYstem EASY

• Our largest digital collections are in archaeology, social sciences and history (moving into other domains as well)

• Provide access, through Narcis.nl, to thousands of scientific datasets, e-publications and other research information in the Netherlands

• Data projects in collaboration with research communities and partner organisations

• Advice, training and support (Data Seal of Approval, Persistent Identifier Infrastructure)

• R&D into archiving of and access to digital information

Page 9: Opendatasessions

Collaboration DANS – University Libraries

• Starting with Delft, Leiden, Wageningen…• UL: front offices - DANS: back office• Roles:

– DANS: long-term archiving of research data (like KB e-depot for publications), providing expertise, training, standards

– UL: data lab services (VRE, repository) for local researchers• Possibility to archive data from University repositories:

– Challenges explored in Podium Plus project (SURF Share)– Auto-ingest from Dataverses– Stumbling blocks not technical, but copyright– IPR issues can be solved if university, researchers and funders agree

Page 10: Opendatasessions

Modes of access

• Open (after registration)

• Restricted (depositor is the access authority)

• Other (DANS as security backup)

Archiving system EASY facilitates

- Access management easy and fast

- Embargo for limited time period

- Data reviews

- See who used “your” data

Data @ DANS is not “up for grabs”!

Page 11: Opendatasessions

Why is digital preservation of data important?

• Storage of data makes research more transparent

• Checks on claims made in publications• Replication research is possible• However, data re-use for comparative studies

is much more important

Page 12: Opendatasessions

How does it work?• NWO investments.

Before grant is awarded there is a agreements on access– At DANS– Or other repository with

Data Seal of Approval• Archeological research

deposit obligation

Page 13: Opendatasessions

Cultures of data sharing differ over disciplines, but also change over time

Page 14: Opendatasessions

Six reasons not to share your data

1. No one else can understand the complexity of my data.

2. If someone analyzes my data he/she might come to other conclusions.

3. Someone else might even discover new findings.4. I am not yet ready with the analysis of my data.5. I’ve worked hard to collect the data. They’re mine!6. I cannot trust data that has been produced

somewhere else.

Page 15: Opendatasessions

Connecting content & community

Page 16: Opendatasessions

NARCIS.nl: Access to Research Information, e-Publications, Data Sets and more

New!!

Page 17: Opendatasessions

Example of an

“enhanced publication”

Page 18: Opendatasessions
Page 19: Opendatasessions

Data reviews

• Pilot• 92% recommends re-used dataset

• Average rating is about 4 (scale 1-5)• 70% states that specific dataset helps to answer

questions

Page 20: Opendatasessions

5 Criteria16 Guidelines

The research data:• can be found on the

Internet• are accessible (clear

rights and licenses)• are in a usable format• are reliable• can be referred to

(persistent identifier)

13-04-2023

Data Seal of Approval

www.datasealofapproval.org

Page 21: Opendatasessions

Thank you for your attentionand visit us at:

www.dans.knaw.nlwww.narcis.nl

[email protected]

Page 22: Opendatasessions

Open Data: how we cope with them…

“OPEN ACCESS TO EXCELLENCE IN RESEARCH”

October 22, 2012 Brussel

Jan HaspeslaghLibrarian

Heike LustInformation manager

Page 23: Opendatasessions

Overview

The process

• Archiving• Documenting• QC & Integration• Publishing• Redistribition

The data policy

Page 24: Opendatasessions

Archiving

Documenting

QCPublishing

Integration

The process

Page 25: Opendatasessions

Archiving http://mda.vliz.be

Page 26: Opendatasessions

Archiving

Documenting

Metadata discovery:

• Responsibles• Access rights• Parameters• Coverage: time, geography, taxonomy, …• Relations to other datasets• Publications

Goal: Maximum searchabilityand retrieval

Page 27: Opendatasessions

Archiving

Documenting

Technical:

• Storage software• Checksum & size• ‘Material & methods’• Hierarchy • Units• Formula’s, calculations• …

Goal: Correct interpretation &future usability

Page 28: Opendatasessions

Archiving

Documenting

Page 29: Opendatasessions

Archiving

Documenting

QC

Integration

QC: all elements available for correct reading, use and analysis of data?

Integration: Combining data from different sources and providing users with a unified view of these data

Page 30: Opendatasessions

Publishing

Integrated Marine Information SystemIMIS

→ Module Datasets: ISO 19115 discovery metadata→ Module Literature: ISBD & ASFIS metadata standards

Page 31: Opendatasessions

Publishing

→ Module Datasets

Crossreferenced!→ Module Literature

Redistribution

Open Marine Archive&

Open Data

Page 32: Opendatasessions
Page 33: Opendatasessions
Page 34: Opendatasessions

Archived original dataset

Integrated datasets publication

Page 35: Opendatasessions

Integration of datasets into biodiversity database

Page 36: Opendatasessions
Page 37: Opendatasessions

(Elements of) published dataset linked to other end-user products

Page 38: Opendatasessions

www.icsu-wds.org

www.iode.org/

Data policy at VLIZ

Page 39: Opendatasessions

WDS Data Policy   There will be full and open exchange of data, metadata and products shared within WDS, … All shared data, metadata and products being free of charge or no more than cost of reproduction will be encouraged for research and education.

IOC Oceanographic Data Exchange Policy

Member States shall provide timely, free and unrestricted access to all data, associated metadata and products generated under the auspices of IOC programmes. Member States are encouraged to provide timely, free and unrestricted access to relevant data and associated metadata from non-IOC programmes …. for non-commercial use by the research and education communities, provided that any products or results of such use shall be published in the open literature without delay or restriction.

Page 40: Opendatasessions

Data policy at VLIZ

(under development)

VLIZ advocates free data exchange and supports the IOC Oceanographic Data Exchange Policy. Wherever possible and relevant, the data from the databases will be made available online through the Internet. Naturally, restrictions may apply, as a result of which we cannot offer unlimited access. This is for example the case for data of which VLIZ is not the primary source: in this case the data exchange policy of the originator of the data will apply.

Page 41: Opendatasessions

Data policy at VLIZ – practical

MDA & OMA • Permanent archive for data and publications• Fully documented• Easy online archival & information tool

Main challenges: Convincing scientists to openly share their data no mandates, all is voluntary! Effort involved in properly describing data so it

can be re-used by others → need for dedicated data centers!

Page 42: Opendatasessions

[email protected]@vliz.be