Top Banner
Hot Topics: DuraSpace Community Webinar Series Hot Topics: The DuraSpace Community Webinar Series Series Fifteen: DSpace for Data Curated by Claire Knowles, Library Digital Development Manager, The University of Edinburgh.
28

3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Mar 19, 2017

Download

Technology

DuraSpace
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Hot Topics: DuraSpace Community Webinar Series

Hot Topics: The DuraSpace Community Webinar Series

Series Fifteen: DSpace for Data

Curated by Claire Knowles, Library Digital Development Manager,

The University of Edinburgh.

Page 2: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Hot Topics: DuraSpace Community Webinar Series

Webinar 2:

DSpace for Data: issues, solutions and challenges

Presented by:

Claire Knowles, The University of Edinburgh

Ryan Scherle, Dryad Digital Repository

Pauline Ward, The University of Edinburgh

Page 3: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Today’s Speakers

Ryan Scherle Dryad Digital Repository datadryad.org

Pauline Ward Edinburgh DataShare, University of Edinburgh datashare.is.ed.ac.uk

Page 4: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Ryan Scherle Dryad Digital Repository

Page 5: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

What is Dryad?

A data repository, working closely with scientific journals. •data tightly connected to articles •broad disciplinary scope •broad interpretation of “data” •nonprofit, with Data Publication Charges

Page 6: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Sample content in Dryad

Page 7: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Why does Dryad use DSpace?

For the robust metadata model? For the extremely clean architecture? Just one reason… workflow

Page 8: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
Page 9: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Issues to consider

File sizes File types Structured objects Versioning Timing of data release Additional metadata Sensitive data

Page 10: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

File sizes

Allow submission of large files Provide curators ways to inspect large files Be aware of time required for automated processes

Page 11: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

File types

DSpace doesn’t care, but the users do. Steer submitters to preferred types. Give curators tools to read varied types. Develop methods to look for common issues in a variety of types.

Page 12: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Structured objects

Changing the data model affects all parts of DSpace

•Submission •Identifiers •Curation •Item display •Search results •APIs

Page 13: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Articles are relatively static, but data is often reused, revised, and expanded!

Determine what constitutes a version, and how to cite it.

Versioning

http

s://f

lic.k

r/p/a

6Hpr

9

Page 14: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Timing of data release

Are data independent of the publication or synced with it?

Develop embargo policies for both metadata and bitstreams.

http

s://f

lic.k

r/p/e

bZd3

d

Page 15: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Additional metadata

Data in a repository may require additional metadata for:

•Discovery •Maintaining item structure •Support of workflow •Usage tracking

Page 16: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Sensitive data

Copyrights Endangered species Human subjects

http

s://f

lic.k

r/p/8

3Rki

t

http

s://f

lic.k

r/p/3

bpAk

c

Page 17: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Technical challenges in DSpace

The most important technical issues to address when adding data to DSpace are:

•Data model •Submission/curation workflow •Processes for large files •Embargo and access control

Page 18: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Pauline Ward The University of Edinburgh

https://wiki.duraspace.org/display/[email protected]/The+DSpace+Curator%27s+Handbook

https://wiki.duraspace.org/display/[email protected]/The+DSpace+Curator%27s+Handbook

Page 19: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

What is Edinburgh DataShare?

● Institutional research data repository

●DSpace 5.2, with the XMLUI Mirage interface

●First deposit was accessioned in 2008

●Now contains 1,912 data items

●Very broad disciplinary spread

Page 20: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Metadata

●We use Dublin Core

●We mint DataCite DOIs

Page 21: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Big Files

Our researchers wanted to deposit files over 1 GB, which was difficult to do via the web submission form. So our developer ported the HTML5 upload facility from JSPUI to XMLUI. Now, users can upload up to 20 GB via their browser. EDINA’s code is available: https://github.com/edina/DSpace/tree/xml-html5-upload

Page 22: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Request-a-copy

Issues:

●Spam

●When the depositor leaves the

institution

Page 23: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

File-level embargo

Issues:

●Policy clash

●Item embargo date ambiguous

Page 24: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Tombstoning

When withdrawn item

●ds.withdrawn.tombstone

Page 25: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

File Format Registry

●343 file formats

●Scope for improvement

Page 26: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

The Missing Curator’s Handbook

Looking for help: ●https://wiki.duraspace.org/display/[email protected]/The+DSpace+Curator%27s+Handbook

Page 27: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

How to contribute

Claim a ticket and/or join a meeting https://wiki.duraspace.org/display/DSPACE/DSpace+7+UI+Working+Group Join us on Slack / ask questions https://goo.gl/forms/s70dh26zY2cSqn2K3 DSpace 7 Outreach Group https://wiki.duraspace.org/display/DSPACE/DSpace+7+UI+Outreach+Group

Page 28: 3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Hot Topics: DuraSpace Community Webinar Series

Hot Topics: The DuraSpace Community Webinar Series

Join us for our 3rd webinar:

How to contribute to DSpace –

be a part of the team!

March 15, 2017 at 11:00a.m. ET