Data Publishing & ManagementLearning Objectives: 1. Introduce the advantages of publishing your data, the steps involved and
how to publish to increase your research profile.2. To understand best practice in “data management” and how it underpins
efficient data publication.Sections:
• 1410-1415 A new era of data publishing, what it means for you and the role of data management• 1415-1425 Data publishing – your options and why does it matter• 1425-1435 Data publishing exercise• 1435-1450 Data management• 1450-1500 Data management exercise
Data publishing
Learning ObjectiveTo understand advantages of publishing your data, the steps involved and how to publish to increase your research profile.• Why publish?• What’s in it for you?• What’s the effort? - data publishing resources• TERN’s SHaRED – an example for ecologists
Survey on research data management 2012:• 63% aware of Australian Code of Conduct• 70% understand their data management responsibilities• 70% don’t do data management plans• 70% don’t keep a registry of research data collections
• 82% agree data should be available to other researchers• 81% would re-use another’s data• 29% supported public access to their data
Data Management & Publishing
An emerging international consensus
International Council for Science (ICSU)2 September 2014
Free of financial barriers• for any researcher to contribute to• for any user to access immediately
on publication
Made available without restriction on reuse for any purpose
• subject to proper attribution
Quality-assured and published in a timely manner
Archived and made available in perpetuity
Journals already mandate data publication in open data repositories
Ecology Letters says: “On submission of a paper, authors must confirm whether any of the data or content is already in the public domain (e.g., in a publicly accessible pre-print repository or report), and all relevant sources must be cited.”
Royal Society’s Open Biology says: “All manuscripts which report primary data (usually research articles) should include Data Accessibility section which states where the article's supporting data can be accessed…… Please read our data sharing policies carefully before submission.”
Granting bodies moving towards open access data publication
• NHMRC Open Access policy came into effect from 1 July 2012
http://www.nhmrc.gov.au/grants/policy/dissemination-research-findings
• ARC Open Access policy came into effect from 1 January 2013.
http://www.arc.gov.au/applicants/open_access.htm
“A11.5.2. Researchers and institutions have an obligation to care for and maintain research data in accordance with the Australian Code for the Responsible Conduct of Research (2007). The ARC considers data management planning an important part of the responsible conduct of research and strongly encourages the depositing of data arising from a Project in an appropriate publically accessible subject and/or institutional repository. “
Publishing data can raise your research profile
Individualcitations
reciprocityrecognition
crowdsourcingmore publishable units
altruisticethical science
error identificationmultiple perspectives
training new researchersmeeting grant conditions
increase research efficiencies
Publishing ecological datasets is more common
Log 10
Num
ber
*
Open Access Data Repositories
Organisations
Individuals
• Technical Data Storage & Access Systems• Data Professionals • BOM, CSIRO, ALA, TERN, State Atlases
• Dryad/KNB/ Nature’s Scientific Data, FigsShare, Genebank, Pangaea
• BioPlatforms Australia, TERN SHaRED
DatasetSubmission
DatasetsSelf-service Dataset
Submission
PortalRepository
Dataset file
Supplementarymaterials
Site Maps
CitationsDOIs
Images
Metadata
Dataset DownloadData Submission
Data In Data Published Data Out Data UsedWho
Researchers,Governments,Supersites,LTERN, ATN
Data Publishing Services
Datasets
Integrated Species/Site Data
Self-service Dataset
Submission
Assisted DataSubmission
PortalRepository
Services
RIF-CS
Dataset file
Supplementarymaterials
Site Maps
CitationsDOIs
Images
Metadata
Dataset Download
Integrated Data
Site Maps
Citations
Method
Diagram
Observation Diagram
PostgresMySQL
MS Access
Descriptions
Integrated Site Data Download
Data Submission
Data In Data Published Data Out Data UsedWho
Researchers,Governments,Supersites,LTERN, ATN
Universities,GovernmentDatabasesAusPlots. CSIRO, AusCover
Individual/research team data creators
Organisational data creators
TERN Eco-informatics SHaRED
* Australian Ecological Knowledge and Observation System
www.shared.org.au
Published in AEKOS
Prober S, TERN Australian SuperSite Network (2014). Vegetation Demographic Cover and Floristic Data, Great Western Woodlands SuperSite, 2012-2013, Version 13. 10.4227/05/53B49B4C5E115. Obtained from Australian Ecological Knowledge and Observation System Data Portal (ÆKOS, http://www.portal.aekos.org.au/), made available by James Cook University, Commonwealth Scientific and Industrial Research Organisation. Accessed 21 September 2014.
(www.aekos.org.au)
Warning - understand data licences
US Creative Commons 3.0 vs Creative Commons Australia 3.0Creative Commons International 4.0
When not to share/publish data
• Patent application• Confidential human patient details• Commercial sponsorship arrangements• Sensitive species declared by governments• Postgraduate - embargoed the data
Data Publishing - Exercise1. Guiding principals for publishing reusable datasets.A plenary discussion to understand the qualities of a ‘publication ready’ dataset(s) in terms of: - ‘garbage in, garbage out’ - reusability and effort- attributes required for re-use - which attributes are essential for reuse
OR
2. Demo of SHaREDA walk-through features of SHaRED data publishing - Metadata and keyword search terms by questions- A real example www.shared.org.au
SHaRED Questionnaire
Questions
Are you ready to publish?
Thank you
Email: [email protected] Website: www.ecoinformatics.org.au Portal: www.aekos.org.au Data submission (DOI): www.shared.org.au Twitter: @tern_aekos