Carly Strasser @carlystrasser California Digital Library March 2013 UC Merced From Flickr by dipster1 DMPTool The Data Management Planning Tool
Carly Strasser @carlystrasser California Digital Library
March 2013 UC Merced
From
Flickr by dipster1
DMPTool The Data Management Planning Tool
Digital data From
Flickr by Flickm
or
From
Flickr by US Arm
y En
vironm
ental C
omman
d
From
Flickr by DW08
25
C. Strasser
Courtesey of W
HOI
www.woodrow.org
From
Flickr by deltaMike
Digital data +
Complex workflows
UGLY TRUTH
are not taught data management
don’t know what metadata are
can’t name data centers or repositories
don’t share data publicly or store it in an archive
aren’t convinced they should share data
joyfulmom
ma.com
Many researchers…
From Flickr by Gavinzac
?
From Flickr by Thomas Hawk
A document that describes what you will do with your data both
during your research and after you complete your project
What is a data management plan?
From Flickr by spanaut
DMPs for Funders
A short plan submitted alongside grant applications
An outline of – what will be created/collected – methods – Standards – Metadata – sharing/access – long-‐term storage
Includes how and why
But they all have different requirements and express them in
different ways
Federal Funding Accountability and Transparency Act 2006
Evolution
2010 2010 –present
DMP supplement may include: 1. the types of data, samples, physical collections, software, curriculum
materials, and other materials to be produced in the course of the project
2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies)
3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements
4. policies and provisions for re-‐use, re-‐distribution, and the production of derivatives
5. plans for archiving data, samples, and other research products, and for preservation of access to them
NSF DMP Requirements
From Grant Proposal Guidelines:
• Types of data produced
• Relationship to existing data
• How/when/where will the data be captured or created?
• How will the data be processed?
• Quality assurance & quality control measures
• Security: version control, backing up
• Who will be responsible for data management during/after project?
1. Types of data & other information
biology.kenyon.edu
C. Strasser
From Flickr by Lazurite
Wired.com
• What metadata are needed to make the data meaningful? • How will you create or capture these metadata? • Why have you chosen particular standards and approaches
for metadata?
2. Data & metadata standards
• Are you under any obligation to share data?
• How, when, & where will you make the data available?
• What is the process for gaining access to the data?
• Who owns the copyright and/or intellectual property?
• Will you retain rights before opening data to wider use? How long? • Are permission restrictions necessary? • Embargo periods for political/commercial/patent reasons? • Ethical and privacy issues? • Who are the foreseeable data users? • How should your data be cited?
3. Policies for access & sharing 4. Policies for re-‐use & re-‐distribution
• What data will be preserved for the long term? For how long?
• Where will data be preserved?
• What data transformations need to occur before preservation?
5. Plans for archiving & preservation
From Flickr by theManWhoSurfedTooMuch
• What metadata will be submitted alongside the datasets?
• Who will be responsible for preparing data for preservation? Who will be the main contact person for the archived data?
DMPs and their evaluation will grow & change over time (similar to broader impacts)
Peer review will determine next steps
Community-‐driven guidelines – Discipline-‐specific – Flexibility at the directorate and division levels – Tailor implementation
Evaluation will vary with directorate, division, & program officer
*Unofficially
NSF’s Vision*
From Flickr by thewmatt
Step-by-step wizard for generating DMP
Create | edit | re-use | share | save | generate
Open to community
DMPonline: dmponline.dcc.ac.uk
dmptool.org
From
Flickr by Serfs U
p!
DMPTool Project
• Started working in January 2011 • Developed requirements, divided work among partners
• Self-‐funded / In-‐kind
DMPTool Participants CDL/UC3 Trisha Cruse Perry Willett Marisa Strong Tracy Seneca Scott Fisher Stephen Abrams Mark Reyes Margaret Low Carly Strasser DataONE Amber Budden
Smithsonian Günter Waibel UCLA Todd Grappone Gary Thompson Sharon Farbe Darrow Cole UCSD Brad Westbrook
University of Illinois Michael Grady Howard Ding Sarah Shreeves University of Virginia Andrew Sallans Sherry Lake Carla Lee Digital Curation Centre Martin Donnelly
• Free • Guides through creating a DMP • Helps meet funder requirements
• Supplies questions • Includes explanation/context provided by
the agency • Provides links to the agency website
dmptool.org
Step-by-step wizard for generating DMP
Create | edit | re-use | share | save | generate
Open to community
Wait!
Data management planning is complex & requires dialog
Range of support & understanding
Our focus: • simplify & scale the common parts • develop community • provide incremental improvement in
functionality
From
Flickr by Ch
risGoldN
Y
Access & Customization
• DMPTool can be added to campus single sign-‐on service
• Researchers use campus login for tool
American University Arizona State University Cal Poly State University Cal State Chico Cal State Fresno Cal State Los Angeles Cal State Office of the Chancellor Clemson University George Mason University Georgia Tech Humboldt State University (CSU) Indiana University Iowa State University James Madison University
Johns Hopkins University Michigan State University Moss Landing Marine Laboratories (CSU) North Carolina State University Northwestern University Ohio State Old Dominion University Penn State Purdue Rice University Smithsonian Institution Texas A&M Texas State University San Marcos Tulane University University of Arizona UC Los Angeles UC Berkeley UC Davis UC Irvine
UC Merced UC Office of the President UC San Diego UC San Francisco University of Chicago University of Illinois at Chicago University of Illinois at Urbana-‐Champaign University of Iowa University of Miami University of Michigan University of Nebraska-‐Lincoln University of North Carolina-‐Chapel Hill University of Notre Dame University of Texas at Austin University of Virginia University of Wisconsin-‐Madison Yale University
Organizations with Shibboleth log-‐in set up
Increasing Participation
Possible customization: • Help text • Links to resources and services • Suggested answers Can provide specific info at different levels • All DMPs • All DMPs for a particular funding agency • A question within a data management plan
Institution-‐specific resources
0
100
200
300
400
500
600
0
500
1000
1500
2000
2500
3000
3500
Oct-‐11 Dec-‐11 Feb-‐12 Apr-‐12 Jun-‐12 Aug-‐12
Num
ber o
f Ins-tu-
ons
Num
ber o
f Plans & Uniqu
e Users
Unique Users Plans InsEtuEons
DMPTool Uptake
Improvement via A.P. Sloan Grant
Data Management Planning Tool 2: Responding to the Community
1. Build community of researchers, institutions, funders, & libraries
2. Expand functionality of the current DMPTool for users & administrators
3. Release the DMPTool2 and provide training/documentation
4. Create an open-‐source community of DMPTool contributors
New Areas of Functionality in 2013
Granular modeling of plan templates
Granular modeling of institutions
Role-‐based user authorization
DMP life cycle management
Organizational planning activities
Enhanced search and browse
Institutional branding
Search and reporting for business
intelligence
Advanced administrative
interface
Collaborative plan creation Open API
IMLS Grant
Improving Data Stewardship with the DMPTool
Provide librarians with the tools and resources
to claim the data management education space
Materials to be Developed
Talking points Slide decks
Promotional materials Environmental scan kit
Webinar series Case studies
Customization help Online Commons
Libguide
blogs.library.ucla.edu/dmptool
My website Email me Tweet me My slides CDL Blog
carlystrasser.net [email protected] @carlystrasser slideshare.net/carlystrasser datapub.cdlib.org