Open Research Data Pilot
Open research data and data management for Horizon 2020 projects
OpenAIRE Belgium
Emilie Hermans, Project Assistant OpenAIRE, UGent
Can be reused under the CC BY license
Why data management/open data?
1. Prevents data loss
2. Data management to maximize usefulness: organize, make understandable and reusable
3. Fosters creativity, participation of citizens and increases transparency
4. Get credit: (much!) longer shelf life than interpretation
Sources:
• e.g. Piwowar HA, Vision TJ (2013) Data reuse and the open data citation advantage. PeerJ 1:e175. https://doi.org/10.7717/peerj.175; Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
• Cartoon: "recycle" | Foster by Patrick Hochstenbach, 2015
The Open Research Data Pilot
Horizon 2020: limited and flexible pilot
• Avoid duplication of research and loss of resources
• Foster Open Science: transparency, effectiveness and greater impact
Open Access to research data
Data Management Planning
Which areas are participating?
Projects in other areas can participate on a voluntary basis
• Check Article 29.3 of the Model Grant Agreement
• Costs eligible (Article 6.2.D.3 of the Model Grant Agreement)
• Future and Emerging Technologies
• Research infrastructures (including e-Infrastructures)
• Leadership in enabling and industrial technologies – Information and Communication Technologies
• Nanotechnologies, Advanced Materials, Advanced Manufacturing and Processing, and Biotechnology: ‘nanosafety’ and ‘modelling’ topics
• Societal Challenge: Food security, sustainable agriculture and forestry, marine and maritime and inland water research and the bioeconomy - selected topics in the calls H2020-SFS-2016/2017, H2020-BG-2016/2017, H2020-RUR-2016/2017 and H2020-BB-2016/2017, as specified in the work programme
• Societal Challenge: Climate Action, Environment, Resource Efficiency and Raw materials – except raw materials
• Societal Challenge: Europe in a changing world – inclusive, innovative and reflective Societies
• Science with and for Society
• Cross-cutting activities - focus areas – part Smart and Sustainable Cities.
Requirements of the Data Pilot
1. Develop a Data Management Plan (DMP)
2. Deposit data in a research data repository
3. Open data: freely used, modified, and shared by anyone for any purpose
4. Provide information, tools and instruments needed to validate results
REASONS FOR OPTING-OUT
• Exploitation of results
• Confidentiality
• Protection of personal data
• Would jeopardize the main aim of the action
• No data generated
• Any other legitimate reason
• Complete opt-out via project amendment
• Complete or partial opt-out: describe issues in project DMP
• As open as possible, as closed as necessary
Projects can opt out at any stage.
Develop a DMP
Living document: revise and update
Updated at minimum at:
• Initial DMP: within first 6 months of the project
• Mid-term review
• Final project review
Data management plan (DMP):
• Well managed in present and prepared for preservation in the future
• Handling of data during and after project
Content of a DMP
Annex I and II of EC guidelines
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
Annex 1 (by month 6)
The DMP should address the points below on a dataset by dataset basis:
• Data set reference and name
• Data set description
• Standards and metadata
• Data sharing
• Archiving and preservation (including storage and backup)
Annex 2 (mid-term & final review)
Scientific research data should be easily:
• Discoverable
• Accessible
• Assessable and intelligible
• Useable beyond the original purpose for which it was collected
• Interoperable to specific quality standards
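As a rough illustration of the dataset-by-dataset approach, the five Annex 1 points can be kept as one small record per dataset and checked for gaps before the month 6 deliverable. This is only a sketch: the class name, field names, helper function and example values are assumptions made for illustration, not part of the EC guidelines or of any DMP tool.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class DatasetEntry:
    """One dataset in a DMP; the fields mirror the five Annex 1 points."""
    reference_and_name: str = ""
    description: str = ""
    standards_and_metadata: str = ""
    data_sharing: str = ""
    archiving_and_preservation: str = ""


def missing_sections(entry: DatasetEntry) -> List[str]:
    """Return the names of the points that have not been filled in yet."""
    return [f.name for f in fields(entry) if not getattr(entry, f.name).strip()]


# Hypothetical dataset, with only three of the five points drafted so far.
entry = DatasetEntry(
    reference_and_name="PROJ-WP2-survey-2016",
    description="Anonymized survey responses",
    standards_and_metadata="CSV files described with Dublin Core",
)
print(missing_sections(entry))
```

Since the DMP is a living document, a check like this could be rerun at the mid-term and final reviews to see which points still need attention.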
How to write a DMP
Online data management tool: dmponline.dcc.ac.uk/
Create and confirm plan
Plan Details
Final DMP: DOI
Versions
Share with partners
Guidance
Guidance based on EC guidelines
Guidance and links from DCC
Export to various formats
Content of a DMP
• Handling of data
• Collecting and processing
• Methodology and standards
• Open access
• Curation and preservation
Annex I and II of EC guidelines
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
Handling of data
• Storage and backup
• Additional measures?
• During and after the project
Collecting and analysing data
• Be clear what data you use
• Provide links to data sets you used
• Be clear what methods you use
• E.g. lab notebook, end-to-end code/scripts for statistics
• Software can help: R, MATLAB, Python…
Data files: standard formats
Use data formats that are:
• Open standard
• In an easily re-usable format
• Commonly used by research community
Examples of preferred format choices:
Text: .odt, .txt, .xml, .html, .rtf
Tabular data: .csv (comma separated values), .xml, .rdf, .SPSS portable
Images: .tif, .jpeg2000, .png, .svg
Structured data: .xml, .rdf
Any standard used in your field
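To make the tabular case concrete, here is a minimal sketch of exporting records to .csv with the Python standard library. The column names and values are invented for illustration; real data would come from your own analysis.

```python
import csv
import io

# Hypothetical measurements; the column names are illustrative only.
rows = [
    {"sample_id": "S1", "temperature_c": 21.5},
    {"sample_id": "S2", "temperature_c": 19.0},
]


def to_csv(records):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(records[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


csv_text = to_csv(rows)
print(csv_text)
```

The resulting text can be written to a file opened with `newline=""` and `encoding="utf-8"`, which keeps it readable in spreadsheet software, R, and most other tools.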
Create searchable data
Using metadata
• Data about data
• Machine readable
• Consists of a set of attributes
• Helps prevent inappropriate use
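As a small sketch of what "a set of attributes" in machine-readable form can look like, the record below uses a handful of Dublin Core element names (title, creator, date, format, identifier, rights) as keys and serializes them to JSON. All values are placeholders, including the identifier, and the choice of JSON is an assumption; repositories typically define their own schema and input form.

```python
import json

# Keys are Dublin Core element names; all values are hypothetical placeholders.
record = {
    "title": "Survey responses on data sharing (example)",
    "creator": ["Doe, Jane"],
    "date": "2016-05-01",
    "format": "text/csv",
    "identifier": "doi:10.1234/example",  # placeholder, not a real DOI
    "rights": "CC BY 4.0",
}


def to_json(metadata):
    """Serialize the metadata record to indented, key-sorted JSON text."""
    return json.dumps(metadata, indent=2, sort_keys=True)


metadata_json = to_json(record)
restored = json.loads(metadata_json)  # another program can parse it back
print(metadata_json)
```

An explicit `rights` attribute is one way metadata helps prevent inappropriate use: the terms of reuse travel with the data.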
Example
DON’T TRY THIS AT HOME
METADATA STANDARDS
Use standards of your domain
Digital Curation Centre
General
• Dublin Core (DC)
• DataCite metadata schema
• Metadata Object Description Schema (MODS)
Humanities
• Text Encoding Initiative (TEI)
• Visual Resources Association Core (VRA)
Archives/Repositories
• DataStaR minimum metadata
• DSpace metadata
Social Science
• Data Documentation Initiative (DDI)
Life Sciences
• Darwin Core
• Integrated Taxonomic Information System (ITIS)
Earth Science
• Directory Interchange Format (DIF)
• Standard for the Exchange of Earthquake Data (SEED)
Ecology
• Ecological Metadata Language (EML)
Geographic/Geospatial
• Federal Geographic Data Committee (FGDC)
• ISO 19115
• Geospatial Interoperability Framework (GIF)
Where to deposit data?
Research data repository
• Disciplinary data repository
• Institutional data repository
• Zenodo
• Matches data needs
• Directory of data repositories: www.Re3data.org
Re3data.org
Trustworthy Digital repository
• Persistent identifier
• Licenses
• Access
What to deposit?
Open Access to research data
Everything needed to validate results presented in scientific publications
• Data
• Metadata
• Other data described in Data Management Plan
• Tools: documentation, scripts, software, info about statistical analyses…
• Understandable? Add a readme text file
What to deposit?
Select
• Confidentiality/anonymization
• Regenerating data cheaper than archiving?
• Version control
• Potentially useful to others
Open data
Apply an open license:
• Keep it simple
• What intellectual property rights exist in the data?
• Apply a suitable ‘open’ license, e.g. Creative Commons
• Data repositories can provide licenses
• Re3data.org
Example
Understandable for humans
Machine readable metadata
Tools
Open Data
Open license
Support and information?
OpenAIRE - An Open Knowledge & Research Information Infrastructure
• www.OpenAIRE.eu offers infrastructure, tools, information and helpdesk system
FACILITATING THE OPEN ACCESS POLICY OF THE EUROPEAN COMMISSION
Zenodo
For all content types
With GitHub integration
Create communities
Upload, describe, publish
OpenAIRE
www.openaire.eu/search
Link your data to publications or projects
OpenAIRE
Training and support material
Information on:
• Open research data pilot
• Creating a data management plan
• Selecting a data repository
Support material: briefing papers, factsheets, webinars, workshops, FAQs, helpdesk
www.openaire.eu/opendatapilot
STEP 1: WRITE A DMP
dmponline.dcc.ac.uk
Deliverable at:
• 6 months
• Mid-term review
• Final review
STEP 2: FIND REPOSITORY
Matches data needs
• discipline/institutional
• www.re3data.org
• Zenodo
STEP 3: DEPOSIT DATA
(Open) Data, Metadata, Other tools
Data Repositories
• Standard File Formats
• Standard metadata schema
• Open Licences
SUPPORT: Supporting infrastructure and information
• EC guidelines
• OpenAIRE.eu
• www.dcc.ac.uk
Designed by Freepik
Questions!
www.openaire.eu
@openaire_eu
Facebook.com/groups/openaire
https://www.linkedin.com/groups/OpenAIRE3893548
[email protected]@openaccess.be
can be reused under the CC BY license