UKOLN is supported by: From research data to new knowledge: a lifecycle approach. Dr Liz Lyon, Director UKOLN, University of Bath, UK JISC/SURF/CNI Conference May 2005, Amsterdam. www.bath.ac.uk a centre of expertise in digital information management www.ukoln.ac.uk
UKOLN is supported by:
From research data to new knowledge: a lifecycle approach.
Dr Liz Lyon, Director
UKOLN, University of Bath, UK
JISC/SURF/CNI Conference May 2005, Amsterdam.
www.bath.ac.uk
a centre of expertise in digital information management
www.ukoln.ac.uk
Overview
1. Scholarly communications in flux
2. e-Research and the diversity of data
3. Repositories & meta-functionality
• Realising the link to learning: eBank UK
• Providing value-added services
• Enabling knowledge extraction & post-
at the University of Bath
• Population survey data: UK Biobank
• Highly sensitive, personal data: patient care records
Taxonomy of data collections
• Research collections: jumping robots
• Community collections: Flybase at Indiana (with UC Berkeley)
• Reference collections: Protein Data Bank
Source: NSF Long-Lived Digital Data Collections, Draft report March 2005
Evolution……
Repository evolution:
1971 Research collection
<12 files
2005 Reference collection
>2700 structures deposited in 6 months
1. Issues: research data as content
• Sharing it!
• Data diversity
– Homo- or heterogeneous
– Raw and derived / processed
– Sensitivity
– Fast or slow growth in volume
• Repository evolution:
– Likelihood to scale up (from bytes to petabytes)
– Quality assurance (from the start)
– Community-based standards development (“folksonomies”)
– Build robust services
3. Repositories & meta-functionality
eBank UK: linking research data to learning
• JISC-funded September 2003, Phase 2 February 2005
• UKOLN at the University of Bath (lead), University of Southampton, University of Manchester
• Exemplar: e-Science testbed ‘Combechem’
– Grid-enabled combinatorial chemistry
– Crystallography, laser and surface chemistry examples
– Development of an e-Lab using pervasive computing technology
– National Crystallography Service
• Embedding in e-Learning processes
• Evaluating the pedagogical benefits
– MChem course
– Chemical informatics course
2. Issues: generic data models, metadata schema & terminology
• Validation against other schema
– CCLRC Scientific Data Model Vs 2
• Complex digital objects and packaging options
– METS
– MPEG-21 DIDL
• Terminologies
– Domain: crystallography
– Inter-disciplinary e.g. biomaterials
– Metadata enhancement: subject keyword additions to datasets based on knowledge of keywords in related publications
– Meaningful resource discovery?
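The metadata enhancement bullet above (adding subject keywords to a dataset from the keywords of its related publications) can be illustrated with a minimal sketch. The `Publication` and `Dataset` classes, the `enhance_keywords` function and the sample records are all hypothetical and are not taken from eBank UK; a real service would presumably draw on a controlled vocabulary rather than simple lower-casing.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Publication:
    # Hypothetical record for a published paper or eprint.
    title: str
    keywords: Set[str]


@dataclass
class Dataset:
    # Hypothetical record for a deposited dataset.
    identifier: str
    keywords: Set[str] = field(default_factory=set)
    related_publications: List[Publication] = field(default_factory=list)


def enhance_keywords(dataset: Dataset) -> Set[str]:
    """Return the dataset's own keywords plus those of its related publications.

    Keywords are lower-cased so that 'Crystallography' and 'crystallography'
    are treated as the same term (an assumption made for this sketch).
    """
    enhanced = {k.lower() for k in dataset.keywords}
    for publication in dataset.related_publications:
        enhanced |= {k.lower() for k in publication.keywords}
    return enhanced


# Illustrative data only.
paper = Publication("Hypothetical paper", {"Crystallography", "Biomaterials"})
dataset = Dataset("dataset-001", {"crystallography"}, [paper])
enhanced = enhance_keywords(dataset)
```

Whether the enlarged keyword set actually gives more meaningful resource discovery is the open question the slide raises; the sketch only shows the mechanics.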
3. Issues: linking and identifiers
• Links to individual datasets within an experiment
• Links to all datasets associated with an experiment or a data collection
• Links to derived eprints and published literature
• Context sensitive linking: find me
– Datasets by this author / creator
– Datasets related to this subject
– Learning objects by this author / creator
– Learning objects related to this subject
• Identifiers and persistence
– “generic”
– domain: International Chemical Identifier (InChI code)
• Resource discovery: Google Scholar?
• Provenance: authenticity, authority, integrity?
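The "find me" queries listed above can be sketched as a filter over resource records. This is only an illustration under stated assumptions: the `Resource` fields, the `find` function, and the identifiers and names in the sample data are invented here and do not describe any actual eBank UK interface.

```python
from typing import List, NamedTuple, Optional


class Resource(NamedTuple):
    # Hypothetical minimal description of a repository item.
    identifier: str
    kind: str      # e.g. "dataset" or "learning object"
    creator: str
    subject: str


def find(resources: List[Resource], kind: str,
         creator: Optional[str] = None,
         subject: Optional[str] = None) -> List[str]:
    """Return identifiers of resources of one kind, optionally narrowed
    to a given creator and/or subject."""
    return [
        r.identifier
        for r in resources
        if r.kind == kind
        and (creator is None or r.creator == creator)
        and (subject is None or r.subject == subject)
    ]


# Illustrative records only.
records = [
    Resource("ds-1", "dataset", "A. Author", "crystallography"),
    Resource("ds-2", "dataset", "B. Author", "crystallography"),
    Resource("lo-1", "learning object", "A. Author", "crystallography"),
]

datasets_by_author = find(records, "dataset", creator="A. Author")
datasets_on_subject = find(records, "dataset", subject="crystallography")
learning_objects_by_author = find(records, "learning object", creator="A. Author")
```

In practice the links would be keyed on persistent identifiers rather than on free-text names, which is why the slide pairs linking with identifiers and persistence.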
4. Issues: embedding and workflow
• Into the crystallographic publishing community: International Union of Crystallography
• Into the chemistry research workflow
– SMART TEA Digital Lab Book e-synthesis Lab
– Other analytical techniques and instrumentation
• Into the curriculum and e-Learning workflows
– MChem course
– Undergraduate Chemical Informatics courses
Repositories and digital curation
Data preservation: static. For later use?
Data curation: dynamic. In use now (and the future)?
“maintaining and adding value to a trusted body of digital information for current and future use”
Provide value-added services
Annotation
• e-Lab books (Smart Tea Project in chemistry)
• Gene and protein sequences
Enable “post-processing” and knowledge extraction
The acquisition of newly-derived information and knowledge from repository content