Publication of Research Publication of Research Data Data Marianne van der Heijden Marianne van der Heijden November 3, 2010 November 3, 2010
Publication of Research Data Publication of Research Data
Marianne van der HeijdenMarianne van der Heijden
November 3, 2010November 3, 2010
► dddfdddf““Scientists need to ensure Scientists need to ensure
that their results will be that their results will be managed for the long haul. managed for the long haul.
Maintaining data Maintaining data takes big takes big
organization”. organization”. Clifford Lynch,Clifford Lynch,
NatureNature Special on Big Data Special on Big Data (3 Sept. 2008)(3 Sept. 2008)
Stelling 1:Stelling 1: Voornaamste problemen rondom Voornaamste problemen rondom data publicatie zijn technische data publicatie zijn technische aspectenaspecten
Stelling 2:Stelling 2:Data management is ‘common Data management is ‘common practice’ bij practice’ bij onderzoeksinstitutenonderzoeksinstituten
Stelling 3:Stelling 3:Alle belangrijke onderzoeksdata Alle belangrijke onderzoeksdata wordt in artikelen gepubliceerd. wordt in artikelen gepubliceerd. Het apart publiceren van data is Het apart publiceren van data is dan ook overbodig.dan ook overbodig.
Who are involved?Who are involved?
► Funders Funders ► Researchers as data Researchers as data producers producers ►Data archivesData archives ► JournalsJournals► Researchers as data Researchers as data consumers consumers
Building a frameworkBuilding a framework
► Together with researchersTogether with researchers► Appoint contact persons per departmentAppoint contact persons per department
►Hosting(outsourcing) & support of the ICT Hosting(outsourcing) & support of the ICT environmentenvironment
►Discovery and archiving systemDiscovery and archiving system
Research dataResearch data
► long term access/availability (repository)long term access/availability (repository)► usable data formatusable data format► Reliable / Peer review / QualityReliable / Peer review / Quality► Metadata Metadata
TechniqueTechnique
Trusted RepositoryTrusted Repository
Data Seal of ApprovalData Seal of Approval
MetadataMetadata
► descriptive metadatadescriptive metadata to find, cite and understand data to find, cite and understand data
► structural metadatastructural metadata how to process the data and the relations how to process the data and the relations
between filesbetween files► administrative metadataadministrative metadata
intellectual property, conditions for use intellectual property, conditions for use and accessand access
Step-by-Step ApproachStep-by-Step Approach
► Inventarisation of dataInventarisation of data► Structuring of the data filesStructuring of the data files
Training and checklists to start as early as Training and checklists to start as early as possible with structured data possible with structured data
► Creation of datasetsCreation of datasets Uniformity of filesUniformity of files
► Archiving of the data setsArchiving of the data sets► Publishing of the metadata for discoveryPublishing of the metadata for discovery
Motivation and AwarenessMotivation and Awareness
►Benefits of data archiving►Recognition from data publishing►Demands from publishers and financiers
►NIOODATA wiki►Data day as a joint data archiving exercise
Benefits of Data PublicationBenefits of Data Publication► Enhance the visibility of your workEnhance the visibility of your work► Avoid data loss and corruptionAvoid data loss and corruption
Work more efficientlyWork more efficiently► Facilitate Data ExchangeFacilitate Data Exchange► Moral obligationMoral obligation
Costs of data collection (instruments, Costs of data collection (instruments, equipment,workforce)equipment,workforce)
Uniqeness of field observations (You cannot measure a Uniqeness of field observations (You cannot measure a value from 2003 in 2010 )value from 2003 in 2010 )
► Obtain scientific recognition for your work:Obtain scientific recognition for your work: Publishing a data paper (Example Ecological Archives)Publishing a data paper (Example Ecological Archives) Make your dataset ‘citable’ (datacentrum 3TU gives DOI)Make your dataset ‘citable’ (datacentrum 3TU gives DOI) Data citation from NIOO Data PortalData citation from NIOO Data Portal
Data Data RepositoryRepository
DANSDANS
3 TUD3 TUD
Other RepositoriesOther Repositories
►DistributedDistributed IPY DataIPY Data
► InstitutionalInstitutional NIOO / VLIZNIOO / VLIZ
Data Management Process:
Results pilot studyResults pilot study
► 21.393 files, 3.421 archived in 91 datasets21.393 files, 3.421 archived in 91 datasets► Some files integratedSome files integrated► Idea buildingIdea building
Procedures and agreementsProcedures and agreements Different formats request different systemsDifferent formats request different systems Data “rescue” actionData “rescue” action Integration necessity Integration necessity
VLIZ takes care for NIOOVLIZ takes care for NIOO
► Guidelines and procedures for Data Guidelines and procedures for Data Management and Data StoringManagement and Data Storing
► Two 3-day workshopsTwo 3-day workshops► Data “rescue” actionData “rescue” action► Help with archivingHelp with archiving► Elaborate formatsElaborate formats► Enhance GuidelinesEnhance Guidelines
in cooperation with researchersin cooperation with researchers
UserfriendlinessUserfriendliness
► Integration of archiving with workflow stepsIntegration of archiving with workflow steps► Selection criteriaSelection criteria► Interface Design with default suggestionsInterface Design with default suggestions
example Morphoexample Morpho
Morpho: datamanagement for ecologistsMorpho: datamanagement for ecologists
Data Management PlanData Management Plan
► Start working timely and efficiently with data managementStart working timely and efficiently with data management In US+ UK stellen sommige financiers dit al verplichtIn US+ UK stellen sommige financiers dit al verplicht
Data life cycleData life cycle
Data PublicationData Publication
► IOCD workshop on Data IOCD workshop on Data PublishingPublishing
► Ecological Archives Ecological Archives
Data citationData citation 3 TU – DOI3 TU – DOI
NIOO datacitationNIOO datacitation
Digital PreservationDigital Preservation
Take home thoughts
►Technology is facilitating, not prescribing►Motivation is essential►Resolutions in close contact with researcher►Start as early as possible in data life cycle
process►Start up rather expensive in investing in
infrastructure and in coaching people
Cooperate !Cooperate !
ThanksThanks
► Links to resources:Links to resources:
►http://www.delicious.com/mheyden/Datamanagement
► Links to presentation:Links to presentation:►http://www.slideshare.net/mheyden/Datapu
blication
Delicious
► Data Archiving and Networked Services | Over DANS ► Data en information management NIOO ALGBAC, voorbeeld data citatie ► Digital Preservation Tutorial: Table of Contents ► DMP ► DMP_checklist.pdf (application/pdf-object) ► Ecological Archives Preparing Data Papers Data publicatie ► Info | Data Seal of Approval ► International Polar Year Portal ► IODE Workshop on Data Publishing ► KNB Data :: Morpho Data management for ecologists ► Manage Your Data: Data Management: Subject Guides: MIT Libraries ► Open Data Commons » Blog Archive » Draft of an Open Data Commons Attribution
License ► Panton Principles ► VLIZ : VMDC Nieuws ► Wordle - Create