M. Diepenbroek (MARUM), M. Lautenschlager (MPI-M), E. Paliouras (DLR), H. Grobe (AWI)
CODATA General Assembly, Berlin 10.11.2004
World Data Center Cluster "Earth System Research": an approach for a common data infrastructure in geosciences
WDC-MARE, WDC-RSAT, WDC-TERRA, WDC-Climate (Candidate)
The World Data Center (WDC) System of the International Council for Science (ICSU)
• founded during the International Geophysical Year (IGY) 1957-58
• long-term funding and maintenance by the host countries on behalf of the international science community
• WDC status is peer reviewed by international research institutes, programmes, and funding organisations
• accepts data from national and international scientific or monitoring programs as resources permit
• all data held in WDCs are generally available to science
• scope of data collected: solar, geophysical, environmental, and human dimensions data, especially for monitoring changes in the geosphere and biosphere
• at present 52 centers in 12 countries
Long-term archiving facilities
• Clear commission as data libraries
• Data management infrastructure, expertise, and manpower
• Long-term commitment and funding
Peer review for scientific data
• Completeness of data set descriptions (metadata)
• Validity of methods used
• Data values (precision, sequence, and ranges)
• Data publication based on citable data entities having persistent identifiers (DOI)
User-friendly and reliable systems for data retrieval and distribution
• General unrestricted online access
• Offline products (e.g. data collections, DVD)
Fostering common standards and protocols
Clear commitment to the rules for good scientific practice!
WDC infrastructure
• Metadata profile (ISO 19115, subset compatible with Dublin Core and ISO 690)
• Metadata catalogues based on common protocols (ISO, W3C, OGC)
• Common internet portal (search engine)
• Cost models to support long-term archiving at universities and in scientific projects
Data publication
• Migration of metadata into library catalogues and direct access to WDC archives
• Common search of scientific data and literature
• Peer review for scientific data
• Acceptance as citable publication through ISI
• "Citation Index": Scientific efficiency is "measured" by publications.
• Extra work for data publication is currently not acknowledged.
– Data processing, context documentation, quality assurance.
• Recommendation: Data publications should be included in the standard scientific "Citation Index".
– Motivation of the individual scientist.
– Connection between person and primary dataset.
• Citable data publications
– support the rules of good scientific practice.
– encourage inter-disciplinary data utilisation.
– make data searchable in library catalogues together with articles.
– close the gap between scientific literature and related data sources.
Data Publication: Metadata for primary data 2
15. description: These data represent results from the ECHAM4/OPYC climate model running the SRES-B2 scenario. The data base tables contain monthly mean time series of ……
16. publicationPlace: Hamburg
17. size: 614190228 Bytes
18. format: GRIB
19. edition: 1
20. relatedDOIs: (none)
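As an illustration only, the record fragment above could be held in a small typed structure. This is a minimal sketch, not the project's schema: the class name `PrimaryDataMetadata`, the Python field names, and the `size_mib` helper are assumptions. Only the six fields shown on the slide (15 to 20) are modelled; fields 1 to 14 are not shown in this transcript and are left out.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)  # frozen: a published record is not meant to be edited
class PrimaryDataMetadata:
    """Hypothetical container for metadata fields 15-20 of the slide."""
    description: str               # field 15
    publication_place: str         # field 16
    size_bytes: int                # field 17
    data_format: str               # field 18
    edition: int                   # field 19
    related_dois: Tuple[str, ...] = ()  # field 20, empty when "(none)"

    def size_mib(self) -> float:
        """Return the size in mebibytes (1 MiB = 1024 * 1024 bytes)."""
        return self.size_bytes / (1024 * 1024)


# Values copied from the slide; the description is truncated there as well.
record = PrimaryDataMetadata(
    description=(
        "These data represent results from the ECHAM4/OPYC climate model "
        "running the SRES-B2 scenario. ..."
    ),
    publication_place="Hamburg",
    size_bytes=614190228,
    data_format="GRIB",
    edition=1,
)

if __name__ == "__main__":
    print(f"{record.data_format}, edition {record.edition}, "
          f"{record.size_mib():.1f} MiB")
```

Making the dataclass frozen is a design choice in this sketch: any attempt to assign to a field after construction raises an error, which mirrors the idea that published primary data stay fixed.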
Data Publication: Criteria for Persistent Identifier Allocation
• Critical points are securing data quality and a stable connection between identifier and data entity
– Allocation is restricted to syntax control and completeness, i.e. expert data description and long-term archiving
– Scientific quality assurance is expected from the author and will be reviewed during the allocation process.
– Like published articles, published primary data cannot be changed.
– A stable connection between identifier reference and data entity, as well as long-term availability of the primary data, is essential and must be ensured (e.g. ICSU WDCs)
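The "syntax control and completeness" step above can be sketched as a small check. Everything here is an assumption made for illustration: the function name `allocation_problems`, the list of required fields (taken from the metadata fields visible in this transcript), the purely numeric registrant code in the pattern, and the example DOI, which is made up. The only fixed fact used is that DOI names begin with the prefix "10." followed by a registrant code, a slash, and a suffix.

```python
import re
from typing import Dict, List

# Hypothetical completeness rule: the fields that must be filled in.
REQUIRED_FIELDS = ("description", "publicationPlace", "size", "format", "edition")

# Simplified DOI syntax: "10.", a numeric registrant code (assumed), "/",
# and a non-empty suffix without whitespace.
DOI_PATTERN = re.compile(r"^10\.\d+(\.\d+)*/\S+$")


def allocation_problems(doi: str, metadata: Dict[str, str]) -> List[str]:
    """Return a list of problems; an empty list means the checks passed."""
    problems: List[str] = []
    if not DOI_PATTERN.match(doi):
        problems.append(f"malformed DOI: {doi!r}")
    for field in REQUIRED_FIELDS:
        # A field counts as missing when absent or blank.
        if not str(metadata.get(field, "")).strip():
            problems.append(f"missing metadata field: {field}")
    return problems


if __name__ == "__main__":
    example_metadata = {
        "description": "Monthly mean time series ...",
        "publicationPlace": "Hamburg",
        "size": "614190228 Bytes",
        "format": "GRIB",
        "edition": "1",
    }
    # "10.1234/example.dataset.1" is a made-up identifier for this sketch.
    print(allocation_problems("10.1234/example.dataset.1", example_metadata))
    print(allocation_problems("example.dataset.1", {"format": "GRIB"}))
```

The sketch covers only the formal part of the criteria. Scientific quality, stability of the identifier, and long-term availability depend on the archive and cannot be checked this way.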
DFG Project "Publication and Citation of Scientific Primary Data"
[Diagram labels: GFZ (geophysics); M&D/MPIM (climate models); Marum/AWI (observations); data storage and long-term archiving (in WDC); TIB Hannover (registration agency); International DOI Foundation; Global Handle System; DDB URN node; TIB-ORDER library catalogue]
Data Publication: Further information
• Project webpage: http://www.std-doi.de
• TIB Handle Server: http://doi.tib-hannover.de:8000
• DOI Foundation: http://www.doi.org
• URN registration of the DDB: http://www.persistent-identifier.de
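For readers who want to follow a data DOI to its landing page, a citable link can be built by appending the DOI name to the DOI Foundation's public proxy at https://doi.org/. The snippet below is a minimal sketch of that step; the function name `doi_to_url` and the example DOI are made up for illustration and do not refer to a real dataset.

```python
from urllib.parse import quote

# Public DOI proxy operated by the DOI Foundation.
DOI_PROXY = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Build a resolvable URL, percent-encoding characters unsafe in URLs."""
    # Keep "/" unescaped so the prefix/suffix separator stays readable.
    return DOI_PROXY + quote(doi, safe="/")


if __name__ == "__main__":
    # Hypothetical DOI used only as an example.
    print(doi_to_url("10.1234/example.dataset.1"))
```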