Top Banner
Research Data Management: Why is it important? Stuart Macdonald Associate Data Librarian EDINA & Data Library [email protected]
29

Research Data Management: Why is it important?

Apr 08, 2017

Download

Education

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Research Data Management: Why is it  important?

Research Data Management: Why is it important?

Stuart MacdonaldAssociate Data LibrarianEDINA & Data [email protected]

Page 2: Research Data Management: Why is it  important?

Outline

• Research data• Research Data Management (RDM)• Funding bodies’ requirements• University of Edinburgh RDM Policy• RDM Services & support at the

University

Page 3: Research Data Management: Why is it  important?

Defining research data and Research Data Management (RDM)

Page 4: Research Data Management: Why is it  important?
Page 5: Research Data Management: Why is it  important?

Research Data Management (RDM) • Data management is a general term covering how you

organise, structure, store, and care for the data used or generated during the lifetime of a research project.

• It includes:– How you deal with data on a day-to-day basis over

the lifetime of a project,– What happens to data after the project concludes.

• RDM is considered an essential part of good research practice.

• Good research needs good data!

Page 6: Research Data Management: Why is it  important?

Activities involved in RDMType, format, volume of data, chosen software for long-term access, secondary data, file naming, versioning, quality assurance process.

Information needed for the data to be read and interpreted in future, metadata standards, methodology, definition of variables, format & file type of data.

Access restrictions, data security risks, appropriate methods to transfer / share data, encryption.

Secure & sufficient storage for active data, regular backups, disaster recovery

Make data publicly available (where possible) at the end of a project, license data, any restrictions on sharing, access controls?

Select data to keep, decide how long data will be kept, in which repository, costs involved in long-term storage? Data Management Planning

Day

-to-

day

man

agem

ent

of d

ata

Page 7: Research Data Management: Why is it  important?

Why manage your data?• So you can find and understand it when

needed.• To avoid unnecessary duplication.• To validate results if required.• So your research is visible and has impact.• To get credit when others cite your work.• To avoid data loss!

Page 8: Research Data Management: Why is it  important?

Drivers of RDM

“Publicly funded research data are a public good, produced in the public interest, which should be made openly

available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual

property.”RCUK Common Principles on Data Policyhttp://www.rcuk.ac.uk/research/datapolicy/

Page 9: Research Data Management: Why is it  important?

Funder requirements • Funders are increasingly requiring researchers to

meet certain data management criteria.

• When applying for funding, you need to submit a technical or data management plan.

• You are expected to make your data publicly available (where appropriate) at the end of your project.

Page 10: Research Data Management: Why is it  important?

What do Funders want?

http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies

Page 11: Research Data Management: Why is it  important?

EPSRCExpects that:• published research papers should include a short

statement, describing how and on what terms any supporting research data may be accessed,

• metadata on the research data they hold will be published by institutions within 12 months of data generation,

• data will be securely preserved for a minimum of 10 years from the date of last 3rd party access.

• https://www.epsrc.ac.uk/about/standards/researchdata/expectations/ https://www.epsrc.ac.uk/files/aboutus/standards/clarificationsofexpectationsresearchdatamanagement/

Page 13: Research Data Management: Why is it  important?

ESRC Research Data Policy• The ESRC updated its Research Data Policy* in

March 2015. The updated policy is underpinned by nine new principles, which are aligned with the RCUK Common Principles on Data Policy.

• ESRC applicants who plan to generate data from their research must submit a data management and sharing plan as part of their application.

• The new ESRC Research Data Policy is not compulsory for postgraduate students**. However, ESRC-funded students are strongly encouraged to offer the UK Data Services copies of data created or repurposed during their PhD.

* http://www.esrc.ac.uk/about-esrc/information/data-policy.aspx** http://www.esrc.ac.uk/funding-and-guidance/postgraduates/esrc-students/index.aspx

Page 14: Research Data Management: Why is it  important?

RCUK ConcordatResearch Councils UK (RCUK) published a draft Concordat on Open Research Data (17 August 2015):

• Sets out expectations of good practice in publishing research data openly

• Lists 10 principles on working with research data.

• Applies to all fields of research.

• Emphasises responsibilities and accountabilities (institution, researcher, funder)

• Recognises the autonomy of researchers.

• Complements existing frameworks.

http://www.rcuk.ac.uk/RCUK-prod/assets/documents/documents/ConcordatOpenResearchData.pdf

Page 15: Research Data Management: Why is it  important?

University of Edinburgh’s requirement1. Research data will be managed to the

highest standards throughout the research data lifecycle as part of the University’s commitment to research excellence.

2. All new research proposals must include research data management plans…

7. Research data management plans must ensure that research data are available for access and re-use where appropriate…

http://www.ed.ac.uk/schools-departments/information-services/about/policies-and-regulations/research-data-policy

Page 16: Research Data Management: Why is it  important?

RDM Services & support at the University of Edinburgh

Page 17: Research Data Management: Why is it  important?

DMPonlineFree and open web-based tool to help researchers write plans: https://dmponline.dcc.ac.uk/

• Templates based on different requirements

• Tailored guidance (disciplinary, funder etc.)

• Customised exports to a variety of formats

• Ability to share DMPs with others Edinburgh has started the process of customising DMPonline for its researchers.DMPonline screencast:http://www.screenr.com/PJHN

Page 18: Research Data Management: Why is it  important?

Supporting researchers with DMPs

Various types of support we will provide:• Guidelines and templates on what to include in

plans.• A library of successful DMPs to reuse.• Training courses and guidance websites.• Tailored consultancy services.• Online tools (e.g. customised DMPonline).• Contact: [email protected]

Page 19: Research Data Management: Why is it  important?

DataStore The facility to store data that are actively

used in current research activities. 0.5 TB (500GB) per researcher, PGR upwards Up to 0.25TB of each allocation can be used

to create “shared” group storage. Cost of extra storage: £200 per TB per year=

1TB primary storage, 10 days online file history, 60 days backup, DR copy.

Page 20: Research Data Management: Why is it  important?

Accessing DataStore• Allocation will be provided as a mapped drive

(M: U: etc.) on staff desktops• Connect via “Run” or “Explorer” on Windows, or

• “connect to server” on Mac/Linux*• Off-site access – VPN first, or use “SFTP”• NFS available for fixed-location Linux desktops

Documentation links:http://www.ed.ac.uk/schools-departments/information-services/computing/desktop-personal/network-shares/accessing-datastore-net-shares-win http://www.ed.ac.uk/schools-departments/information-services/computing/desktop-personal/network-shares/accessing-datastore-net-shares-mac

Page 21: Research Data Management: Why is it  important?

• 'Dropbox-like’ file-hosting service for non-sensitive data: ww.ed.ac.uk/is/datasync

• Allows sharing and synchronisation of data.• Share using local clients or web URL with

colleagues anywhere.• 20GB free storage or map to personal / group

data on DataStore as required.• Using the ownCloud open source application.

Page 22: Research Data Management: Why is it  important?

Data VaultSafe, private, store of data that is only accessible by the data creator or their representative.

Secure storage: • File security • Storage security • Additional security • EncryptionBeing developed as a joint project with the University of Manchester and partly funded by JISC.

Full version will be in place in 2016.

http://datablog.is.ed.ac.uk/2013/12/20/thinking-about-a-data-vault

Page 23: Research Data Management: Why is it  important?

PURE: Describing your data

• You can describe your datasets (creating metadata) in PURE (datasets field): http://edin.ac/1OF8Auq

• Doing this will help your datasets to be discovered, accessed, and reused as appropriate.

• Ready to use.

Page 24: Research Data Management: Why is it  important?

Edinburgh DataShare• Edinburgh DataShare is the

University’s OA multi-disciplinary data repository hosted by the Data Library : http://datashare.is.ed.ac.uk

• Assists researchers who want to:

• share their data, • get credit for data publication• preserve their data for the long-

term (DOI, licence, citation)

• It can help researchers comply with funder requirements to preserve and share their data and complies with Edinburgh’s RDM Policy

http://datashare.is.ed.ac.uk

Page 26: Research Data Management: Why is it  important?

MANTRA• MANTRA is an internationally

recognized self-paced online training course developed by the Data Library Team for PGR’s and early career researchers in data management issues.

• Anyone doing a research project will benefit from at least some part of the training (and you can pick and choose).

• Data handling exercises with open datasets in 4 analytical packages: R, SPSS, NVivo, ArcGIS.

http://datalib.edina.ac.uk/mantra

Page 27: Research Data Management: Why is it  important?

http://datashare.is.ed.ac.uk/

www.ed.ac.uk/is/data-management http://datablog.is.ed.ac.uk/ http://datalib.edina.ac.uk/mantra/

DataStore

https://dmponline.dcc.ac.uk/

RESEARCH DATA MANAGEMENT AT THE

UNIVERSITY OF EDINBURGH

http://edin.ac/1OF8Auq

www.ed.ac.uk/is/datasync

Ready by mid-2016

http://www.ed.ac.uk/is/research-data-policy

Data catalogue in PURE

Page 28: Research Data Management: Why is it  important?

Useful links• RDM website

http://www.ed.ac.uk/is/data-management

• Research Code of Practice and related guidelineshttp://www.ed.ac.uk/schools-departments/institute-academic-development/research-roles/research-only-staff/advice/codes/research-code

• DCC (Digital Curation Centre). Data management planshttp://www.dcc.ac.uk/resources/data-management-plans

• UK Data Archive: Data management costing toolhttp://www.data-archive.ac.uk/media/247429/costingtool.pdf

• UK Data Archive: Ethical/Legalhttp://www.data-archive.ac.uk/create-manage/consent-ethics/legal

• UK Data Archive: Formatting your datahttp://www.data-archive.ac.uk/create-manage/format

Page 29: Research Data Management: Why is it  important?

Thank You

[email protected]