Data Management at CSU: Starting a Campus Conversation
Big Data Forum, April 18, 2013
Beth Oehlerts, Digital Management Librarian
Nancy Hunter, Coordinator of Acquisitions and Metadata Services
Dec 11, 2015
Experience organizing, describing, preserving information
Growing expertise in the area of data management, which we can bring to this conversation
The CSU Institutional Repository (I.R.)
Libraries and Data Management
Compliance with grant-funder mandates
Copyright/Access rights information
Data organization (data lifecycle planning, metadata, data access structures, etc.)
Data storage (what, where, for how long?)
Determining Needs and Outcomes
Open resource for published research results and data sets
Discoverable: exposed to Google and other harvesters
Accessible: professionally managed access; part of Digital Collections of Colorado, shared with 8 libraries
Organized: metadata, structural organization
Persistent: preservation
What the CSU I.R. Is
Access-controlled via login (but we are able to embargo, and we are able to limit access to the CSU IP range)
Able to hold structured data, e.g. web pages pointing to databases
Able to accept self-deposit of publications and data sets
What the CSU I.R. Is Not (Yet)
Store your publications together, preferably in the CSU I.R.
Your publications should point to your referenced data sets
Depending on the size, data sets associated with your publications should be stored with the publication in the CSU I.R.
‘Small’ data sets (up to 200 GB) and some ‘Medium’ data sets (200 GB to 10 TB) can be stored with the publication
‘Large’ data sets (>10 TB) should probably stay where they were generated or be stored in a disciplinary data archive
Our Current Thinking
Struggling with ????
Do you need help describing data for use by others?
What skills would you like to develop?
How may the Libraries help?
How to Start the Conversation
Collaborate - Identify potential partnerships
Create campus-wide groups to work with data management needs
Any other ideas?
Next Steps
NCAR Libraries (June-November 2012)
Even experienced data managers are challenged by:
Data Citation / Attribution
Granularity
Formats
Data Life Cycle
Data Policies
You’re Not Alone!
CSU Libraries: http://lib.colostate.edu/repository/ and http://lib.colostate.edu/repository/nsf
University of Minnesota: https://www.lib.umn.edu/datamanagement
California Digital Library: http://www.cdlib.org/services/uc3/datamanagement/
Databib (list of international data repositories): http://databib.org/
Resources