Top Banner
ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory [email protected]
13

ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory [email protected].

Mar 31, 2015

Download

Documents

Dale Hackett
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

ICAT + Information Model

Brian Matthews

Scientific Information GroupE-Science Centre

STFC Rutherford Appleton Laboratory

[email protected]

Page 2: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

Facilities Process

Proposal

Approval

SchedulingExperiment

Data storage

Record Publication

Scientist submits application for

beamtime

Facility committee approves application

Facility registers, trains, and schedules

scientist’s visit

Scientists visits, facility run’s experiment

Subsequent publication registered

with facility

Raw data filtered, and stored

Data analysis

Tools for processing made available

Page 3: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

Investigation

Publication KeywordTopic

Sample Sample ParameterDataset

Dataset ParameterDatafile

Datafile Parameter

Investigator

Related DatafileRelated Datafile

Parameter

Authorisation

Core Scientific Metadata Model (CSMD)

The Core Metadata model forms the information model for ICAT.

Designed to describe facilities based experiments in Structural Science.

Page 4: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

RDBMS

Web Services API

ICAT API

Command Line Tools

Glassfish / JBOSS

JavaC++Fortran

Data Storage/ Delivery System

Single Sign On

User Database System

Proposal System

Proposal System

Publication System

Publication System

e-Science Services

e-Science Services

Software RepositorySoftware

Repository

ICAT Deployment

Page 5: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.
Page 6: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

I2S2: MethodologyMapping across organisational infrastructures

Proposals

Once awarded beamtime at ISIS, an entry will be created in ICAT that describes your proposed experiment.

Experiment

Data collected from your experiment will be indexed by ICAT (with additional experimental conditions) and made available to your experimental team

Analysed Data

You will have the capability to upload any desired analysed data and associate it with your experiments.

Publication

Using ICAT you will also be able to associate publications to your experiment and even reference data from your publications.

B-lactoglobulin protein interfacial structureE

xam

ple

IS

IS P

rop

osa

lGEM – High intensity, high

resolution neutron diffractometer

H2-(zeolite) vibrational frequencies vs polarising

potential of cations

Home Institution Central Facility

Page 7: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

Earth Sciences: typical workflow

Martin Dove & Erica Yang

Page 8: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

CSMD: an established starting point

Investigation

Publication KeywordTopic

Sample Sample ParameterDataset

Dataset ParameterDatafile

Datafile Parameter

Investigator

Related DatafileRelated Datafile

Parameter

Authorisation

• CSMD: Core Scientific MetaData model

• Designed to describe facilities based experiments in Structural Science

• Forms the information model for ICAT, a production data management infrastructure employed by STFC

• Forms the basis for extensions:- To derived data- To laboratory based science- To secondary analysis data- To preservation information- To publication data

Page 9: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

CSMD- Core• A Core CSMD

– Taking out a lot of the facility specific stuff– A simple model of datasets

Page 10: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.
Page 11: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

I2S2-IM : Core Layer

These are entities which are in the CSMD• extended with the software execution to accept relationships between data sets• Working with ORE-CHEM

Page 12: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

Software execution

Page 13: ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk.

Model with Data Derivation• Extension to the model to add an alternative Investigation

activity type– Very straightforward natural extension to the model

• ICAT can be used almost without modification to record data derivation– Just another data generation activity