This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Plagued by selective reporting of data and methods
Plagued by selective reporting of data and methods
Why? For example:
• Researchers still lack of or insufficient motivations
o Focus on big discovery and impact; because they “have to”
• Hypothesis-confirming results get prioritized
o Difficulties with reviews of other results
• Agreements, disagreements and timing
o Unclear or lack of data sharing agreements and timing of disclosure
• Loose requirements and monitoring by journals and funders
o Publish and release just enough; keep the rest, move to next grant
Are open data and methods understandable, reusable?
Are open data and methods understandable, reusable?
Not always…
• Outputs are multi-dimensional, diverse, not always well cited / stored
• Software, codes, workflows etc.; hard(er) to get hold of
• Data often distributed and fragmented to fit (siloed) databases
o Not contain enough information for others to understand it
• Uneven level of details and annotation across different databases
o Specialized, generalist, public and institutional
• Data curation activities are perceived as time consuming
o Collection and harmonization of detailed methods and experimental
steps is done/rushed at publication stage
Worldwide movement for FAIR data
Role of data papers / data journals
• Incentive, credit for sharing!• Data-focused peer review!• Value of data vs. analysis, results!• Support of the FAIR concept!
market research (2011)
• What do researchers want from a data publications? o 96% - increased visibility and discovery o 95% - increased usability of their research data o 93% - credit mechanism for deposit of data o 80% - peer review of content/datasets
Respondent characteristics 387 respondents (329 active researchers Physics (24%) Earth and environmental science (21%) Biology (20%) Chemistry (19%) Others (16%)
Because of importance of formal publications in the academic !
incentive structure!
Publishers occupy a leverage point
Role of publishers as “agents of change”
"!!
Helping you publish, discover and reuse research data
Credit for sharing your data
Focused on reuse and reproducibility
Peer reviewed, curated
Promoting community data and code repositories
Open Access
• Currently covering life, natural and environmental sciences!
• Big and small data!o power of small data are in their aggregation and
integration with other datasets!
• New and previously published individual datasets, curated collections and citizen science!
o a fuller, more in-depth look at the data processing steps, additional data files, codes etc!
o tutorial-like information for scientists interested in reusing or integrating the data with their own!
Methods and technical analyses supporting the quality of the measurements:"What did I do to generate the data?"How was the data processed?"Where is the data?"Who did what when"How can the data be used or reused?"
Introducing a new content type: Data Descriptor
Designed to make data more FAIR Focused mainly on: • Methods • Technical Validation • Data Records • Usage Notes
Methods and technical analyses supporting the quality of the measurements:"What did I do to generate the data?"How was the data processed?"Where is the data?"Who did what when"How can the data be used or reused?"
Relation with traditional article - content
AFTER: expand on your research articles, adding further information for reuse of the data
AT THE SAME TIME: publish your Data Descriptor(s) alongside research article(s)
OR BEFORE
Relation with traditional article - time
Publish Data!
"""""""""
Code in GitHub
"""""""""Data in OpenfMRI
Share your data, get credited and cited
Evaluation is not be based on the perceived impact !or novelty of the findings or size of the data!
!
• Experimental rigour and technical data quality!o Methodologically sound!o Technical validation experiments and statistical analyses!o Depth, coverage, size, and/or completeness of data sufficient for the types
of applications!• Completeness of the description!
o Sufficient details to allow others to reproduce the results, reuse or integrate it with other data!
o Compliance with relevant minimum information or reporting standards!• Integrity of the data files and repository record!
o Data files match the descriptions in the Data Descriptor!o Deposited in the most appropriate available databases!
Focus on data reuse"Detailed descriptions of the methods and technical analyses supporting the quality of the measurements.!Does not contain tests of new scientific hypotheses!
Joint Declaration of Data Citation Principles by the Data Citation Synthesis Group
Data Descriptor: narrative
In-house editorial curator assists authors via !• Excel spreadsheet
templates"• internal authoring tool!
to create the structured component, also performing value-added semantic annotation
analysis !method! script!
Data file or !record in a database!
Data Descriptor: structure (CC0)
Because we do not want cryptic experimental info, e.g.: