A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Acting as Advocate? Seven steps for libraries in the data decade Dr Liz Lyon, Director, UKOLN, University of Bath, UK Associate Director, UK Digital Curation Centre IATUL Conference, Purdue University, June 2010. This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0

Acting as Advocate? Seven steps for libraries in the data decade

Jan 27, 2015



LizLyon

Presentation given at the IATUL Conference, Purdue University in June 2010.
Transcript
Page 1: Acting as Advocate? Seven steps for libraries in the data decade

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk

UKOLN is supported by:

Acting as Advocate? Seven steps for libraries in the data decade

Dr Liz Lyon, Director, UKOLN, University of Bath, UK

Associate Director, UK Digital Curation Centre

IATUL Conference, Purdue University, June 2010

This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0

Page 2: Acting as Advocate? Seven steps for libraries in the data decade

1. Scale, Complexity, Predictive Potential

2. Continuum of Openness

3. Citizen Science

4. Credentials, Incentives, Rewards

5. Institutional Readiness & Response

6. Data Informatics Capacity & Capability

http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/publications.html#november-2009

•Open Science at Web-Scale

•Consultation:

•Write-To-Reply

•Keynote Presentations:

•eResearch Australasia Nov 2009

•CNI, Baltimore April 2010

•http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/presentations.html

Page 3: Acting as Advocate? Seven steps for libraries in the data decade

Data scale

Human Genome printed http://www.flickr.com/photos/johnjobby/2252981353/sizes/l/

Page 4: Acting as Advocate? Seven steps for libraries in the data decade

“Data sets are becoming the new instruments of science”

Page 5: Acting as Advocate? Seven steps for libraries in the data decade

$1000 genome in <15 minutes ....by 2013?

Page 6: Acting as Advocate? Seven steps for libraries in the data decade

...data logistic challenges....

• Large-scale data storage that is:

– Cost-effective (rent on-demand)

– Secure (privacy and IPR)

– Robust and resilient

– Low entry barrier / ease-of-use

– Has data-handling / transfer / analysis capability

• Move sequencing out of genome centres

• “....analyse an entire human genome in a single day sitting with a laptop at your local Starbucks.”

...cloud services

Page 7: Acting as Advocate? Seven steps for libraries in the data decade

Clients in the cloud

Page 8: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

1. Provide Briefings on Cloud Data Services

(in partnership with local IT Services?)

Page 9: Acting as Advocate? Seven steps for libraries in the data decade

Workflows, Models, Tools

Sage Bionetworks genomics Workflow

Page 10: Acting as Advocate? Seven steps for libraries in the data decade

An Idealised Scientific Research Data Lifecycle Model

[Diagram. Key: Research Activity; Research Admin Activity; Publication Activity; Archive Activity; Information Flow]

Activities shown: Start Project; Research Concept and/or Experiment Design; Write Proposal (include DMP); Peer-review Proposal; Acquire Sample; Conduct Experiment; Generate, Create & Collect Raw Data; Process Raw Data into Derived Data; Analyse Derived Data; Interpret & Analyse Results Data; Prepare Manuscript; Prepare Supplementary Data; Peer Review Research; Publish Research; Write Usage Reports; Discover & Access; Validate, Reuse & Repurpose Data; Appraisal & Quality Control; Archive, Preservation & Curation; IPR, Embargo & Access Control; Documentation, Metadata & Storage (Reference, Provenance, Context, Calibration etc.); Programs (generate customised software); Reference Linking

Data and outputs shown: Raw, Correction & Calibration Data; Processed Data; Derived Data; Results Data; Research Outputs; Papers, articles, presentations, reports; Publication Database; Scholarly Knowledge; User registration data; Instrument allocation data etc.; Comments, annotations, ratings etc.; Risk assessment data; other sample data

Page 11: Acting as Advocate? Seven steps for libraries in the data decade

State-of-the-Art Report : Models & Tools (Alex Ball, June 2010)

• Data Lifecycles

• Data Policies (UK) incl DMP

• Standards & tools

• Data Asset Framework (DAF)

• DANS Seal of Approval

• Preservation metadata

• Archive management tools

• Cost / benefit tools

Page 12: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

1. Provide Briefings on Cloud Data Services

(in partnership with local IT Services?)

2. Build usable Data Management Tools working in partnership with researchers

Page 13: Acting as Advocate? Seven steps for libraries in the data decade

Data Sustainability….

Page 14: Acting as Advocate? Seven steps for libraries in the data decade

Benefits Taxonomy: Summary

Dimension 1: Direct / Indirect (costs avoided)

Dimension 2: Near-term / Long-term

Dimension 3: Private / Public

Keeping Research Data Safe 2 Report: April 2010

Page 15: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

1. Provide Briefings on Cloud Data Services

(in partnership with local IT Services?)

2. Build usable Data Management Tools working in partnership with researchers

3. Develop Data Sustainability Strategies and articulate the cost-benefits

Page 16: Acting as Advocate? Seven steps for libraries in the data decade

Ethics, Privacy, Culture

“You have zero privacy anyway. Get over it”

Scott McNealy, CEO Sun Microsystems, 1999

Page 17: Acting as Advocate? Seven steps for libraries in the data decade

Post-genome decade

Human genomes: >24 published & almost 200 unpublished

Page 18: Acting as Advocate? Seven steps for libraries in the data decade

“P4 medicine : Predictive, Personalised, Preventive, Participatory.”

Leroy Hood – Institute for Systems Biology

Image from Scientific American

...“medicine is going to become an information science”...

Page 19: Acting as Advocate? Seven steps for libraries in the data decade

P4 medicine

• Each patient’s genome sequenced

• Your genome is basis of your medical record

• New method to anonymise medical records for genomics research at Vanderbilt Univ (April ‘10)

• New predictive models of health and disease

• Personalised treatments focus on preventative therapies

Genome scale network biology

Genomic data as a commodity

Page 20: Acting as Advocate? Seven steps for libraries in the data decade

They have shared their data….

Page 21: Acting as Advocate? Seven steps for libraries in the data decade

Share my data?

Page 22: Acting as Advocate? Seven steps for libraries in the data decade

“While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice. ... using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines.”

INCREMENTAL Project

Page 23: Acting as Advocate? Seven steps for libraries in the data decade

• Sage Bionetworks : Integrative genomics

• Open data in the Sage Commons repository

• Human and mouse: clinical and genetics data

• Develop predictive models of disease: liver / breast / colon cancer, diabetes, obesity

• Crowd-sourced effort : global scope

Stephen Friend

Page 24: Acting as Advocate? Seven steps for libraries in the data decade

Participatory medicine : share data & empower the patient...

Sage Congress San Francisco April 2010

Page 25: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

1. Provide Briefings on Cloud Data Services

(in partnership with local IT Services?)

2. Build usable Data Management Tools working in partnership with researchers

3. Develop Data Sustainability Strategies and articulate the cost-benefits

4. Publish Case Studies on Open Science to show benefits of universal data sharing

Page 26: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

1. Provide Briefings on Cloud Data Services

(in partnership with local IT Services?)

2. Build usable Data Management Tools working in partnership with researchers

3. Develop Data Sustainability Strategies and articulate the cost-benefits

4. Publish Case Studies on Open Science to show benefits of universal data sharing

5. Present at University Ethics Committee to highlight open data issues for faculty

Page 27: Acting as Advocate? Seven steps for libraries in the data decade

Professional Scientists: Training; Standards and ethics; Peer-review; Organisational support

Enthusiastic amateurs: Citizen scientist; Local : natural history, environ.; Global : astronomy; Self-supporting

Page 28: Acting as Advocate? Seven steps for libraries in the data decade
Page 29: Acting as Advocate? Seven steps for libraries in the data decade

Citizen Science : validated in the professional press

Page 30: Acting as Advocate? Seven steps for libraries in the data decade

Working with science professionals

Page 31: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

6. Raise awareness of Citizen Science opportunities & guidelines for good practice

Page 32: Acting as Advocate? Seven steps for libraries in the data decade

Data Publication and Attribution

http://www.flickr.com/photos/digitalfemme57/3271063366/

Page 33: Acting as Advocate? Seven steps for libraries in the data decade

Calls for action, new metrics

Page 34: Acting as Advocate? Seven steps for libraries in the data decade

What are we citing?

Attribution granularity (Macro to Micro / Nano):

• Journal

• Article

• Workflow

• Visualisation

• Model

• Data

• Annotation

• Concept

Page 35: Acting as Advocate? Seven steps for libraries in the data decade

How to cite large-scale predictive network models?

• Multiple data sources

• Linked data approach

• Visualise : Cytoscape

• Workflow : Taverna

• Provenance issues

Page 36: Acting as Advocate? Seven steps for libraries in the data decade

Library Actions

6. Raise awareness of Citizen Science opportunities & guidelines for good practice

7. Promote Data Citation and Attribution to embed in publication practice and influence funder policy

Page 37: Acting as Advocate? Seven steps for libraries in the data decade

Take homes...

1. Briefings on Cloud Data Services

2. Build usable Data Management Tools

3. Develop Data Sustainability Strategies

4. Publish Case Studies on Open Science

5. Present at University Ethics Committee

6. Raise awareness of Citizen Science

7. Promote Data Citation and Attribution

...Acting as Advocate

Page 38: Acting as Advocate? Seven steps for libraries in the data decade

Chicago Mart Plaza, 6-8 December 2010

Thank you…