EUDAT: A cross-disciplinary data infrastructure in Horizon 2020. Damien Lecarpentier, EUDAT Project Manager, CSC – IT Center for Science Ltd
Eudat presentation nov2013

Dec 19, 2014

EUDAT general presentation, November 2013
Transcript
Page 1: Eudat presentation nov2013

EUDAT: A cross-disciplinary data infrastructure in Horizon 2020

Damien Lecarpentier

EUDAT Project Manager

CSC – IT Center for Science Ltd

Page 2: Eudat presentation nov2013

Data “Deluge”

Increasing complexity and variety

Exponential growth: Gigabytes, Terabytes, Petabytes, Exabytes, Zettabytes

• Where to store it?

• How to find it?

• How to make the most of it?

Page 3: Eudat presentation nov2013

Synergies

If there are hundreds of Research Infrastructures, how many different data management systems can we sustain?

Page 4: Eudat presentation nov2013

Riding the Wave: Collaborative Data Infrastructure

- A framework for the future? -

[Diagram labels: Users; Data Generators; Community Support Services; Common Data Services; Data Curation; Trust]

Page 5: Eudat presentation nov2013

Page 6: Eudat presentation nov2013

Consortium

Page 7: Eudat presentation nov2013

Seven Research Communities on Board

• EPOS: European Plate Observing System

• CLARIN: Common Language Resources and Technology Infrastructure

• ENES: Service for Climate Modelling in Europe

• LifeWatch: Biodiversity Data and Observatories

• VPH: The Virtual Physiological Human

• INCF: International Neuroinformatics Coordinating Facility

• DRIHM: Distributed Research Infrastructure for Hydrometeorology

Page 8: Eudat presentation nov2013

User Forums + 25 communities

1st User Forum, 7-8 March 2012, Barcelona

Page 10: Eudat presentation nov2013

Selected Services

• Data Staging: Dynamic replication to HPC workspace for processing

• Safe Replication: Data curation and access optimization

• Simple Store: Researcher data store (simple upload, share and access)

• Metadata Catalogue: Aggregated EUDAT metadata domain. Data inventory

• AAI: Network of trust among authentication and authorization actors

• PID: Identity, Integrity, Authenticity, Locations

• EUDAT Box: dropbox-like service, easy sharing, local synching

• Semantic Anno: checking & referencing

• Dynamic Data: immediate handling

• New services to come

Page 11: Eudat presentation nov2013

Safe Replication Service

• Robust, safe and highly available data replication service for small- and medium-sized repositories

– To guard against data loss in long-term archiving and preservation

– To optimize access for users from different regions

– To bring data closer to powerful computers for compute-intensive analysis

[Diagram labels: EUDAT CDI Domain of registered data; PIDs; Policy rules]

http://eudat.eu/safe-replication | [email protected]
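The slides name PIDs, integrity, and locations but do not describe how replication is implemented. As a rough illustration only, the idea of replicating a registered object and checking each copy against a recorded checksum could be sketched as follows. Everything here is a hypothetical stand-in and not EUDAT's actual software or API: the `PidRecord` class, the `replicate` and `verify_replicas` functions, the site names, and the in-memory dictionaries that stand in for storage sites.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 checksum of a byte string as hex."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class PidRecord:
    """Hypothetical record kept per persistent identifier (PID)."""
    pid: str
    checksum: str                                       # integrity reference
    locations: List[str] = field(default_factory=list)  # sites holding a copy


def replicate(record: PidRecord, data: bytes,
              sites: Dict[str, Dict[str, bytes]], target: str) -> None:
    """Copy data to a target site and register the new location.

    The copy is only registered if it matches the recorded checksum.
    """
    if sha256_hex(data) != record.checksum:
        raise ValueError("source data does not match the registered checksum")
    sites[target][record.pid] = data
    if sha256_hex(sites[target][record.pid]) != record.checksum:
        raise ValueError("replica failed verification at " + target)
    if target not in record.locations:
        record.locations.append(target)


def verify_replicas(record: PidRecord,
                    sites: Dict[str, Dict[str, bytes]]) -> Dict[str, bool]:
    """Check every registered location against the recorded checksum."""
    return {
        site: record.pid in sites[site]
        and sha256_hex(sites[site][record.pid]) == record.checksum
        for site in record.locations
    }


if __name__ == "__main__":
    # Two hypothetical sites, modelled as in-memory stores.
    sites = {"site-a": {}, "site-b": {}}
    payload = b"example research data"
    rec = PidRecord(pid="hypothetical/0001", checksum=sha256_hex(payload))
    replicate(rec, payload, sites, "site-a")
    replicate(rec, payload, sites, "site-b")
    print(verify_replicas(rec, sites))
```

Keeping the checksum and the list of locations on the PID record means any copy can be audited later without consulting the original source, which is one plausible reading of the "Integrity" and "Locations" labels on the services slide.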

Page 12: Eudat presentation nov2013

Data Staging Service

• Support researchers in transferring large data collections from EUDAT storage to HPC facilities

• Reliable, efficient, and easy-to-use tools to manage data transfers

• Provide the means to re-ingest computational results back into the EUDAT infrastructure

[Diagram labels: EUDAT CDI Domain of registered data; PRACE HPC; HPC]

http://eudat.eu/datastaging | [email protected]

Page 13: Eudat presentation nov2013

Simple Store Service

• Allow registered users to upload “long tail” data into the EUDAT store

• Enable sharing objects and collections with other researchers

• Utilise other EUDAT services to provide reliability and data retention

[Diagram labels: EUDAT CDI Domain of registered data; Simple upload; Simple metadata; PID registration]

http://eudat.eu/simplestore | [email protected]

Page 14: Eudat presentation nov2013

Page 15: Eudat presentation nov2013

Page 16: Eudat presentation nov2013

Metadata Service

• Easily find collections of scientific data – generated either by various communities or via EUDAT services

• Access those data collections through the given references in the metadata to the relevant data stores

• Europeana of scientific data

[Diagram label: EUDAT CDI Domain of registered data]

http://eudat.eu/metadata | [email protected]

Page 17: Eudat presentation nov2013

Page 18: Eudat presentation nov2013

Towards Horizon 2020

Synergy

Sustainability

User driven services

Global collaboration

Trust

Joint e-infrastructure roadmaps

Page 19: Eudat presentation nov2013

A Network of Trusted Centers

• Strong and sustainable generic data centers with existing trusted relationships

• Each having a specific relationship with research communities

• EUDAT is about providing solutions in a federated environment

Generic datacentres

Community data sites

Page 20: Eudat presentation nov2013

Path to Sustainability

• Strong requirement from researchers and funders

Bridging National and European solutions

Page 21: Eudat presentation nov2013
Page 22: Eudat presentation nov2013

EUDAT Priorities in H2020

• Consolidation of Core Services

– Increased performance, new functionalities, AAI, etc.

– Develop tools and policies to facilitate usage: data management plans, licensing, training, etc.

– Development of new services

• Financial Sustainability

– Cost and funding models

– Framework and mechanisms for sharing resources across sites and across communities (juste retour, etc.)

• Interoperability

– E-Infrastructures: a joint roadmap?

– National initiatives: service portfolios

– RDA: EUDAT as a driver and implementer