UKOLN is supported by: Realising the scholarly knowledge cycle: The experience of eBank UK Dr Liz Lyon, UKOLN, University of Bath, UK CNI Task Force Meeting Spring 2004 Alexandria, Virginia, www.bath.ac.u k a centre of expertise in digital information management www.ukoln.ac.u k
43
Embed
UKOLN is supported by: Realising the scholarly knowledge cycle: The experience of eBank UK Dr Liz Lyon, UKOLN, University of Bath, UK CNI Task Force Meeting.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
UKOLN is supported by:
Realising the scholarly knowledge cycle:
The experience of eBank UK
Dr Liz Lyon, UKOLN, University of Bath, UK
CNI Task Force Meeting Spring 2004
Alexandria, Virginia,
www.bath.ac.uk
a centre of expertise in digital information management
www.ukoln.ac.uk
CNI Spring 2004 2
Overview
• Setting the scene– e-Research trends– Towards a common infrastructure
• The scholarly knowledge cycle– Data, information and workflows– Provenance
• eBank UK Project– The experience so far– Issues arising
• Challenges for the future
Setting the scene
“The next generation of research breakthroughs will rely upon new ways of handling the immense amounts of data that are being produced by modern research methods and equipment, such as telescopes, particle accelerators, genome sequencers and biological imagers….Similar developments are having an impact in the arts and humanities, and in the social sciences.”
A Vision for Research,
Research Councils UK, December 2003.
CNI Spring 2004 5
Report of the National Science Foundation
Blue-Ribbon Advisory Panel on Cyberinfrastructure
2003
http://www.cise.nsf.gov/sci/reports/toc.cfm
CNI Spring 2004 6
Report of the National Science Foundation
Blue-Ribbon Advisory Panel on Cyberinfrastructure
2003
http://www.cise.nsf.gov/sci/reports/toc.cfm
CNI Spring 2004 7
UK e-Science Programme
“e-Science is about global collaboration in key areas of science and the next generation of
infrastructure that will enable it.”
John Taylor, Director General, Research Councils, UK
CNI Spring 2004 8
CNI Spring 2004 9
Powering the Virtual Universehttp://www.astrogrid.org(Edinburgh, Belfast, Cambridge, Leicester, London, Manchester, RAL)
AstroGrid will provide advanced, Grid based, federation and data mining tools to facilitate better and faster scientific output.
• JISC-funded for 1 year from September 2003• UKOLN (lead), University of Southampton, University of
Manchester• “Building the links between research data, scholarly
communication and learning”• e-Science testbed Combechem
– Grid-enabled combinatorial chemistry– Crystallography, laser and surface chemistry– Development of an e-Lab using pervasive computing technology– National Crystallography Service
• UKOLN• Michael Day• Monica Duke• Rachel Heery• Liz Lyon• +• Andy Powell
• Southampton• Les Carr• Simon Coles• Jeremy Frey• Chris Gutteridge• Mike Hursthouse
• Manchester• John Blunden-Ellis
CNI Spring 2004 29
Key Deliverables
1. Requirements specification
2. Pilot service
3. Two supporting studies:– Provenance: review of current research– Feasibility report on dataset description and
schema
4. Consultative evaluation workshop and report
5. Recommendations for future work
CNI Spring 2004 30
Diagram by Andy Powell, UKOLN
Pilot service – technical architecture
Comb-e-Chem Project
X-Raye-Lab
Analysis
Properties
Propertiese-Lab
SimulationVideo
Diff
ract
omet
er
Grid Middleware
StructuresDatabase
CNI Spring 2004 32
Crystallography workflow
• Initialisation: mount new sample on diffractometer & set up data collection
• Collection: collect data• Processing: process and correct images• Solution: solve structures• Refinement: refine structure• CIF: produce CIF• Report: generate Crystal Structure Report
CNI Spring 2004 33
CNI Spring 2004 34
First steps: establishing common ground…
• Understand the data creation process • Terminology and definitions
– Data– Metadata– Datafile– Dataset– Data holding
• Different views– Digital library researchers, computer scientists, chemists– Generic vs specific– Modeller vs practitioner
• Aim for a common ontology• Modelling the domain• Creating a metadata schema
CNI Spring 2004 35
ebank_dc record (XML)
Crystal structure (data holding)
Crystal structure report (HTML)
Dataset
Dataset
Institutional repository
eBank UK aggregator service
ePrint UK aggregator service
Subject service
DepositHarvesting OAI-PMH
ebank_dc
Harvesting OAI-PMH oai_dc
Harvesting OAI-PMH oai_dc
Searching, linking and embedding
Searching, linking and embedding
Searching, linking and embedding
Dataset
dc:identifier
dcterms:references
Linking
dc:type=“CrystalStructure” and/or “Collection”
Model input Andy Powell, UKOLN.
PSIgate portal
Eprint oai_dc record (XML)
dcterms:isReferencedBy
dc:type=“Eprint” and/or ”Text”
CNI Spring 2004 36
Where are we now?
• Version 1.0 eBank metadata schema• Pilot eBank repository for harvesting• Exports records as ebank_dc and oai_dc• Validation of schema
– Against harvesting and searching– Against user requirements– Against other schema
• Concept of a collection and a Collection Level Description
• Implementing the pilot service
Challenges for the future
CNI Spring 2004 38
What next?The metadata schema…some issues• Reduce to its simplest form or reflect the complexity?• ebank_dc versus oai_dc• Compatibility with other schema
– CLRC Scientific Metadata Model vs 1.0 2001 (under revision) http://www-dienst.rl.ac.uk/library/2002/tr/dltr-2002001.pdf
• Expand to include SMART e-Lab metadata e.g. sample preparation
CNI Spring 2004 39
…and also….• Investigate identifiers e.g. International Chemical Identifier• Metadata enhancement - subject keyword additions to
datasets based on knowledge of keywords in related publications
• Develop search interface – embedding eBank UK• Testing with PSIgate physical sciences portal• Explore context sensitive linking: find me
– Datasets by this person– Journal articles by this person– Datasets related to this subject– Journal articles on this subject– Learning objects by this person– Learning objects on this subject