Top Banner
Educating Scientists about the Data Life Cycle Bill Michener Professor and DataONE Project Director University of New Mexico 9 October 2012 2012 eScience Workshop
32

Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

Apr 11, 2018

Download

Documents

lyphuc
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

Educating Scientists about the Data Life Cycle

Bill Michener

Professor and DataONE Project Director

University of New Mexico

9 October 2012

2012 eScience Workshop

Page 2: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

2

Page 3: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

3

Three major components for a flexible, scalable, sustainable network

Member Nodes • diverse institutions • serve local community • provide resources for

managing their data • retain copies of data

Coordinating Nodes

• retain complete metadata catalog

• indexing for search

• network-wide services

• ensure content availability (preservation)

• replication services

Investigator Toolkit

DataONE

Page 4: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

4

The Data Life Cycle

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

4

Page 5: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

5

Year 1 Year 2 Year 3 Year 4 Year 5

Scientists: BL

User Assessments

Scientists: FU

Librarians: BL Librarians: FU

Policy Makers: BL Policy Makers: FU

Educators: BL Educators: FU

Library Policies: BL Library Policies: FU

Page 6: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

6

• Best Practices

• Software Tools Catalog

• In-depth Training

Education

Page 7: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

7

Best Practices

Page 8: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

8

Best Practices

Page 9: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

9

Best Practices Primer

Page 10: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

10

Best Practices

Page 11: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

11

Best Practices

Page 12: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

12

Page 13: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

13

Page 14: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

14

Page 15: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

15

Software Tools Catalog

Page 16: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

16

Software Tools Catalog

Page 17: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

17

Page 18: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

18

Page 19: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

19

Page 20: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

20

In-depth Training

Page 21: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

21

In-depth Training

Page 22: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

22

Tutorials on Data Management

Lesson 10: Analysis and Workflows

CC

imag

e b

y w

lef7

0 o

n F

lickr

Credits: Heather Henkel, Viv Hutchison, Carly Strasser, Stacy Rebich Hespanha, Kristin Vanderbilt, and Linda Wayne

Page 23: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

23

1. Review of typical data analyses

2. Reproducibility & provenance

3. Workflows in general

4. Informal workflows

5. Formal workflows

Lesson Topics

CC

imag

e b

y jw

alsh

on

Flic

kr

Page 24: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

24

After completing this lesson, the participant will be able to:

oUnderstand a subset of typical analyses used

oDefine a workflow

oUnderstand the concepts informal and formal workflows

oDiscuss the benefits of workflows

Learning Objectives

CC

imag

e b

y cy

bra

rian

77

on

Flic

kr

Page 25: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

25

The Analysis Education Module

Page 26: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

26

1. Use concrete or ‘real-world’ examples and stories to illustrate important points

2. Include information about (and links to) tools and resources

3. Use text sparingly on slides 4. Define jargon 5. Take data management experience levels into

account 6. Include information about best practices 7. For a workshop format remove redundant

information

*May 23-24, 2012 – 2 day training and content evaluation workshop; Credits: Heather Henkel, Viv Hutchison, Carly Strasser, Stacy Rebich Hespanha, Kristin Vanderbilt, and Linda Wayne

7 Lessons from Evaluation of Modules*

Page 27: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

27

June 3-21, 2013

University of New Mexico

Page 28: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

28

Walter E. Dean Environmental Information Management Institute

• 6 graduate credits

• 3 weeks

• Intensive, hands-on training

• DMP Tool

• Excel, Powerpoint

• R

• MySQL

• ArcGIS

• Kepler

• Web design and Drupal

Page 29: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

29

Kepler

DMP-Tool

In-depth Training

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 30: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

30

DataONE.org

Page 31: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

31

Credits (Best Practices, Software Tools, Education Modules, EIM Summer Institute) Best Practices and Software Tools:

Bob Cook, William Michener, Rebecca Koskela, Amber Budden, Carly Strasser, Karl Benedict, Corinna Gries, Christine Laney, Ken Masarie, Mary McCloud, Inigo San Gil, Mark Servilla, Wade Sheldon, Will Shuart, Kristin Vanderbilt, Chris Jones, Cindy Parr, Damien Gessler, Emory Boose, Eric Lind, Faerthen Felix, Jeff Brown, Jeff Horsburgh, Jim Regetz, John Porter, Juliana Freire, Kevin Comerford, Margaret O’Brien, Rebecca Lubas, Robert Olendorf, Robert Stevenson, Ruth Duerr, Steve Tessler, Ted Haberman, Theresa Valentine, Thomas Burley, Trisha Cruse, Todd Grappone, Thorny Staples, Sherry Lake, Sharon Farb, Perry Willett, Michael Grady, Martin Donnelly, Gunter Waibel, Beth Sandore, Andrew Sallans, Marissa Strong, Viv Hutchison

(1) Education Modules and (2) EIM Summer Institute:

① Heather Henkel, Viv Hutchison, Carly Strasser, Stacy Rebich Hespanha, Kristin Vanderbilt, and Linda Wayne

② Laura Arguelles, Karl Benedict, Robert Cook, Rebecca Koskela, William Michener, Bob Olendorf, John Porter, Jim Regetz, Will Shuart, and Kristin Vanderbilt

Page 32: Educating Scientists about the Data Life Cycle file• Dave Vieglais •Paul Allen, Rick Bonney, Steve Kelling •Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew

32

DataONE Team and Sponsors

•Bertram Ludaescher

•Deborah McGuinness

• Jeff Horsburgh

•Robert Sandusky

• Peter Honeyman

• Carole Goble

• Cliff Duke

•Donald Hobern

• Ewa Deelman •Amber Budden, Roger Dahl, Rebecca Koskela, Bill Michener, Robert Nahf, Skye Roseboom, Mark Servilla

• Patricia Cruse, John Kunze

• Dave Vieglais

• Paul Allen, Rick Bonney, Steve Kelling

• Stephanie Hampton, Chris Jones, Matt Jones, Ben Leinfelder, Andrew Pippin

• Suzie Allard, Nick Dexter, Kimberly Douglass, Carol Tenopir, Robert Waltz, Bruce Wilson

• John Cobb, Bob Cook, Ranjeet Devarakonda, Giri Palanismy, Line Pouchard

• Sky Bristol, Mike Frame, Richard Huffine, Viv Hutchison, Jeff Morisette, Jake Weltzin, Lisa Zolly

•David DeRoure

•Ryan Scherle, Todd Vision

LEON LEVY

FOUNDATION

•Randy Butler