Workshop on the Future of Big Data management, London, UK – 27-28 June 2013 EUDAT Towards a pan-European Collaborative Data Infrastructure Mark van de Sanden SURFsara Dutch National HPC center, The Netherlands Workshop on the Future of Big Data Management Imperial College, London, UK 27-28 June 2013
EUDAT. Towards a pan- European Collaborative Data Infrastructure. Mark van de Sanden SURFsara Dutch National HPC center, The Netherlands Workshop on the Future of Big Data Management Imperial College, London, UK 27-28 June 2013. Outline. Setting the Scene - PowerPoint PPT Presentation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013
EUDATTowards a pan-European
Collaborative Data InfrastructureMark van de Sanden
SURFsaraDutch National HPC center, The Netherlands
Workshop on the Future of Big Data ManagementImperial College, London, UK
27-28 June 2013
Workshop on the Future of Big Data management, London, UK – 27-28 June 20132
Outline• Setting the Scene• Collaborative Data Infrastructure• EUDAT project• CDI Building Blocks
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013
Rep
osito
ry V
olui
me
Rep
osito
ries
EB/yearPB/day
2016-2020
200PB~25PB/year
80TB~TB/year 20TB
~TB/year
~10PB/year
1
100
10k
PB
TB
EB
30 Repositories 5 Repositories
7 Repositories
Setting the Scene
Long tail of small dataLarge volume
~2-3PB/year
Use
rs
#M
1,3M Researchers15M Students500M Citizens
Varie
ty
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013
Doing some MathEU
Institutes on higher education(research institutes not included)
4000
Average repositories per institute 10Average size repository 5TBResearchers 1,3MStudents 15M
Community Support ServicesData discovery & navigation, workflow generation, annotation, interpretability
Collaborative Data Infrastructure -A framework for the future? -
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013
• EPOS: European Plate Observatory System• CLARIN: Common Language Resources and Technology Infrastructure• ENES: Service for Climate Modelling in Europe• LifeWatch: Biodiversity Data and Observatories• VPH: The Virtual Physiological Human
• INCF: International Neuroinformatics
• All share common challenges:– Reference models and architectures– Persistent data identifiers– Metadata management– Distributed data sources– Data interoperability
Six research communities on Board
Workshop on the Future of Big Data management, London, UK – 27-28 June 2013