Top Banner
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing www.chain-project.eu proj-office@chain- project.eu Grant Agreement n. 306819 A CHAIN-REDS Perspective about Data Access and Metadata Management Rafael Mayo-García, CIEMAT Tunis / 12-13 Dec 2013
35

A CHAIN-REDS Perspective about Data Access and Metadata Management

Feb 23, 2016

Download

Documents

bary

A CHAIN-REDS Perspective about Data Access and Metadata Management. Rafael Mayo-García, CIEMAT. Tunis / 12-13 Dec 2013. A CHAIN-REDS Perspective about Data Access and Metadata Management - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A CHAIN-REDS Perspective about Data Access and Metadata Management

Co-ordination & Harmonisation of Advanced e-Infrastructuresfor Research and Education Data Sharing

[email protected] Agreement n. 306819

A CHAIN-REDS Perspective about Data Access and Metadata

ManagementRafael Mayo-García, CIEMAT

Tunis / 12-13 Dec 2013

Page 2: A CHAIN-REDS Perspective about Data Access and Metadata Management

A CHAIN-REDS Perspective about Data Access and Metadata Management

Roberto Barberaa,b, Carla Carrubbab, Giuseppina Inserrab, Christos Kanellopoulosc, Kostas Koumantarosc, Rafael Mayo-Garcíad, Ognjen

Prnjatc, Rita Riccerib, Manuel Rodriguez Pascuald, Antonio Rubio-Monterod, Federico Ruggierie

a University of Cataniab INFN-Catania

c GRNETd CIEMAT

e GARR & INFN-Roma Tre

Page 3: A CHAIN-REDS Perspective about Data Access and Metadata Management

Coordination &

Harmonisation of Advanced

eINfrastructuresCHAIN

CHAIN-REDS: A legacy from CHAIN

Page 4: A CHAIN-REDS Perspective about Data Access and Metadata Management

CHAIN-REDS is an EC (306819) funded project ~ 2.1 M€ 1 December 2012 – 30 months

Structured in WP 1 Project Management WP 2 Dissemination, Training and Outreach WP 3 Interoperation and coordination of e-

Infrastructures WP 4 Data Infrastructures WP 5 Support to small groups and emerging

communities

WP4 in CHAIN-REDS

Page 5: A CHAIN-REDS Perspective about Data Access and Metadata Management

CHAIN-REDS is an EC (306819) funded project ~ 2.1 M€ 1 December 2012 – 30 months

Structured in WP 1 Project Management WP 2 Dissemination, Training and Outreach WP 3 Interoperation and coordination of e-

Infrastructures WP 4 Data Infrastructures WP 5 Support to small groups and emerging

communities

WP4 in CHAIN-REDS

Page 6: A CHAIN-REDS Perspective about Data Access and Metadata Management

Partners INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

WP4 ‘Data infrastructures’

Page 7: A CHAIN-REDS Perspective about Data Access and Metadata Management

Partners INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

Europe

Europe

WP4 ‘Data infrastructures’

Page 8: A CHAIN-REDS Perspective about Data Access and Metadata Management

INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

Europe

Africa

WP4 ‘Data infrastructures’

Europe

Page 9: A CHAIN-REDS Perspective about Data Access and Metadata Management

INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

AfricaLatin America

WP4 ‘Data infrastructures’

Page 10: A CHAIN-REDS Perspective about Data Access and Metadata Management

INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

Latin AmericaAsia

WP4 ‘Data infrastructures’

Asia

Page 11: A CHAIN-REDS Perspective about Data Access and Metadata Management

INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

Asia

WP4 ‘Data infrastructures’

Middle East

Asia

Page 12: A CHAIN-REDS Perspective about Data Access and Metadata Management

INFN CIEMAT GRNET CESNET UBUNTUNET CLARA IHEP ASREN SIGMA ORIONIS C-DAC

WP4 ‘Data infrastructures’

Middle East

Page 13: A CHAIN-REDS Perspective about Data Access and Metadata Management

Public outreach and dissemination is focused on reporting on Trans-continental Data Infrastructures and Data repositories and on several Use Cases

D4.1 Trans-continental Data Infrastructures and Data repositories

D4.2 Analysis of Data Infrastructures and Data repositories (coming soon)

Available at http://www.chain-project.eu/deliverables

WP4 ‘Data infrastructures’

Page 14: A CHAIN-REDS Perspective about Data Access and Metadata Management

CHAIN-REDS has established official collaborations (MoUs) with other VRC-related communities

AgINFRA DCH-RP EarthServer EIFL ENGAGE

WP4 ‘Data infrastructures’

Page 15: A CHAIN-REDS Perspective about Data Access and Metadata Management

Conversations are being held with EUDAT, H3Africa, iMENTORS, IVOA, SAEON, SKA Africa, Univ. Cape Town

WP4 ‘Data infrastructures’

Page 16: A CHAIN-REDS Perspective about Data Access and Metadata Management

Extend the CHAIN-REDS Knowledge Base (BS) with Data capabilities http://www.chain-project.eu/knowledge-base

Knowledge Base: Infrastructure

RREN(s) NREN NGI CA(s) Ident.

Fed(s) ROC(s) Grid site(s) Application(

s)

Page 17: A CHAIN-REDS Perspective about Data Access and Metadata Management

An investigation on the available (Open Access) Data and Document Repositories has been performed

Information has been collected in Africa, Asia, Europe, Latin America and the Middle East

New ones have been incorporated into the Knowledge Base

These new repositories range from databases owned by a single group to huge continental collaborations

Knowledge Base:Document & Data repositories

Page 18: A CHAIN-REDS Perspective about Data Access and Metadata Management

Knowledge Base:Document & Data repositories

• 3,200 repos• >33 M docs

Page 19: A CHAIN-REDS Perspective about Data Access and Metadata Management

Knowledge Base:Document & Data repositories

Page 20: A CHAIN-REDS Perspective about Data Access and Metadata Management

About Open Access Data Repositories, standards are being promoted

OAI-PMH for metadata retrieval Dublin Core as metadata schema SPARQL for semantic web search VOTable (XML) as potential standard for the interchange

of data represented as a set of tables Persistent Identifiers (PID)

Standards

Page 21: A CHAIN-REDS Perspective about Data Access and Metadata Management

The adopted standards have been implemented in the CHAIN-REDS KB

Developments on (Open Access) Document and Data Repositories

A semantic web enrichment A semantic search engine

OADRs and DRs

Page 22: A CHAIN-REDS Perspective about Data Access and Metadata Management

25

Semantic enrichment

Page 23: A CHAIN-REDS Perspective about Data Access and Metadata Management

OAD

Rs

Dat

a Re

pos.OAI-PMH OAI-PMH

Harvester(running on grid/cloud)

Linked-data search engine

Semantic-web enrichment

End-points

Harvester(running on grid/cloud)

Semantic search engine architecture

Page 24: A CHAIN-REDS Perspective about Data Access and Metadata Management

The semantic search engine on CHAIN-REDS linked data is available

Allows searching among the semantically-enriched metadata coming from the OADRs and DRs included in the KB

OADRs and DRs

cell

Page 25: A CHAIN-REDS Perspective about Data Access and Metadata Management

OADRs and DRs

Page 26: A CHAIN-REDS Perspective about Data Access and Metadata Management

OADRs and DRs

New knowledge discovery!

Page 27: A CHAIN-REDS Perspective about Data Access and Metadata Management

Single and Parallel semantic search are available Single: the usual semantic search service described before Parallel: the new parallel semantic search service that allow

users to search in parallel across the millions of resources contained in the CHAIN-REDS Knowledge Base and in the ENGAGE Platform

Parallel semantic search engines have been made available also in others Science Gateways agINFRA (CHAIN-REDS Knowledge Base & OpenAgris

repository) DCH-RP (CHAIN-REDS Knowledge Base & Europeana, Cultura

Italia and Isidore repositories)

Semantic Search Engine

Page 28: A CHAIN-REDS Perspective about Data Access and Metadata Management

Performs sequential and parallel searches ENGAGE

agINFRA DCH-RP

Semantic Search Engine

Page 29: A CHAIN-REDS Perspective about Data Access and Metadata Management

Semantic Search Engine

Page 30: A CHAIN-REDS Perspective about Data Access and Metadata Management

A programmable use of the CHAIN-REDS Semantic Search Engine is also possible by means of a RESTful API

http://www.chain-project.eu/semantic-search-api CHAIN-REDS webpage Semantic Search Web

Example http://www.chain-project.eu/virtuoso/api/resources?

keyword=<KEYWORD>&limit=<NUMBER_OF_RESOURCES >

Semantic Search Engine

Page 31: A CHAIN-REDS Perspective about Data Access and Metadata Management

Future developments on A tool for extracting the data associated to OADRs The execution of distributed jobs in the Science

Gateway

Data Accessibility, Reproducibility and Trustworthiness (DART)

Based on the interoperability demo performed by CHAIN-REDS at EGI TF 2013

Aiming at seamlessly perform the cycle Access to a document Extraction of associated raw data

Execution of a code taking those data as input Generation of new results Upload of the new results and article

Coming actions

Page 32: A CHAIN-REDS Perspective about Data Access and Metadata Management

CHAIN-REDS has identified in a first phase several fields with interests in the different regions

Agriculture Cultural Heritage e-Government Earth Science Astronomy and Astrophysics

Potential collaborations with initiatives and projects working on these areas are being carried out

Conclusions

Page 33: A CHAIN-REDS Perspective about Data Access and Metadata Management

Other fields and groups are also of interest OADRs’ and DRs’ managers/owners are welcome to

contact the project to share their data within the CHAIN Knowledge Base (both in Africa and Latin America this is already happening)

CHAIN-REDS is also looking forward to receiving feedbacks from all interested organizations on the Knowledge Base and the semantic search service

Conclusions

Page 34: A CHAIN-REDS Perspective about Data Access and Metadata Management

Data developments have been carried out in the Regions of interest to CHAIN-REDS

A special action in the Middle East is now a priority for CHAIN-REDS

Semantic engine and web-enrichment are powerful tools to link data and retrieve information DART

Conclusions

Page 35: A CHAIN-REDS Perspective about Data Access and Metadata Management

Co-ordination & Harmonisation of Advanced e-Infrastructuresfor Research and Education Data Sharing

[email protected] Agreement n. 306819

Thank you !

[email protected][email protected]