Data Publication: Discover, Explore, Visualise. Alejandra Gonzalez-Beltran, PhD, Research Lecturer, Oxford e-Research Centre, University of Oxford. Data Visualisation and the Future of Academic Publishing, University of Oxford and Oxford University Press, June 10th 2016. @alegonbel

Data publication: Discover, Explore, Visualise

Apr 14, 2017

Transcript
Page 1: Data publication: Discover, Explore, Visualise

Data Publication: Discover, Explore, Visualise

Alejandra Gonzalez-Beltran, PhD
Research Lecturer

Oxford e-Research Centre
University of Oxford

Data Visualisation and the Future of Academic Publishing
University of Oxford and Oxford University Press

June 10th 2016
@alegonbel

Page 2: Data publication: Discover, Explore, Visualise

Philippe Rocca-Serra, PhD, Senior Research Lecturer
Alejandra Gonzalez-Beltran, PhD, Research Lecturer
Milo Thurston, DPhil, Research Software Engineer
Massimiliano Izzo, PhD, Research Software Engineer
Peter McQuilton, PhD, Knowledge Engineer
Allyson Lister, PhD, Knowledge Engineer
Eamonn Maguire, DPhil, Software Engineer contractor
David Johnson, PhD, Research Software Engineer
Susanna-Assunta Sansone, PhD, Principal Investigator, Associate Director

Our main areas of research and activity:

• Enabling reproducible research through…
• Data collection, curation, representation etc.
• Data publication
• Data provenance
• Development of software, infrastructure
• Open, community ontologies and standards
• Semantic web / linked data
• Training

Communities we work with/for:

Page 3: Data publication: Discover, Explore, Visualise

Outline

• Challenges associated with scholarly data
• Importance of all research outputs / metadata
• Reproducibility crisis
• Experiments description
• Data availability
• Data publication
• Springer Nature Scientific Data
• Discover, Explore, Visualise Scholarly Data
• Scientific Data ISA-explorer

Page 4: Data publication: Discover, Explore, Visualise

Credit to: https://www.digital-science.com/blog/news/five-top-reasons-to-protect-your-data-and-practise-safe-science/

Challenges related to scholarly data

Page 5: Data publication: Discover, Explore, Visualise

• Outputs are multi-dimensional, diverse, not always well cited / stored
  o Software, codes, workflows etc.; hard(er) to get hold of
• Data often distributed and fragmented to fit (siloed) databases
  o Without enough information for others to understand it
• Uneven level of details and annotation across different databases
  o Specialized, generalist, public and institutional
• Data curation activities are perceived as time consuming
  o Collection and harmonization of detailed methods and experimental steps is done/rushed at publication stage

But… shared data is not always understandable, reusable

Page 6: Data publication: Discover, Explore, Visualise

Importance of
- avoid selective reporting
- experimental design
- statistical power
- statistical analysis
- code/methods availability
- data availability

Page 7: Data publication: Discover, Explore, Visualise
Page 8: Data publication: Discover, Explore, Visualise

• Incentive, credit for sharing
  o Big and small data
  o Unpublished data
  o Long tail of data
  o Curated aggregation
• Peer review of data
• Value of data vs. analysis
• Discoverability and reusability
  o Complementing community databases

Growing number of data papers and data journals

Page 9: Data publication: Discover, Explore, Visualise

nature.com/scientificdata

Honorary Academic Editor: Susanna-Assunta Sansone, PhD
Managing Editor: Andrew L Hufton, PhD
Editorial Curator: Varsha Khodiyar
Publisher: Iain Hrynaszkiewicz

A new open-access, online-only publication for descriptions of scientifically valuable datasets

Supported by

Page 10: Data publication: Discover, Explore, Visualise

Page 11: Data publication: Discover, Explore, Visualise

Research papers

Data records

Data Descriptors

Value added: complement between traditional articles & repositories

Page 12: Data publication: Discover, Explore, Visualise

Scientific hypotheses:
• Synthesis
• Analysis
• Conclusions

Methods and technical analyses supporting the quality of the measurements:
• What did I do to generate the data?
• How was the data processed?
• Where is the data?
• Who did what when?

Relation with traditional articles – content

Page 13: Data publication: Discover, Explore, Visualise

Citation of and links to data files and databases

Page 14: Data publication: Discover, Explore, Visualise

Citation of and links to data files and databases

Credit for data producers

Page 15: Data publication: Discover, Explore, Visualise

A new article type

A new category of publication that provides detailed descriptors of scientifically valuable datasets

Mandates open data, without unnecessary restrictions, as a condition of submission

Page 16: Data publication: Discover, Explore, Visualise
Page 17: Data publication: Discover, Explore, Visualise
Page 18: Data publication: Discover, Explore, Visualise

Summary Table

Page 19: Data publication: Discover, Explore, Visualise

Web app to discover, explore, visualise data descriptors

Page 20: Data publication: Discover, Explore, Visualise

http://scientificdata.isa-explorer.org/

Page 21: Data publication: Discover, Explore, Visualise

Browse

Page 22: Data publication: Discover, Explore, Visualise

Keyword search

Page 23: Data publication: Discover, Explore, Visualise

Filter

Page 24: Data publication: Discover, Explore, Visualise

Filtering options

Summary Table

Page 25: Data publication: Discover, Explore, Visualise

Filtering options

See annotations and number of associated data descriptors

Page 26: Data publication: Discover, Explore, Visualise

Filtering options

Page 27: Data publication: Discover, Explore, Visualise

Combination of filters
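
One way to picture the filtering and filter combination on these slides is as faceted selection over descriptor metadata. The following is only a minimal sketch under stated assumptions: the records, facet names (`organism`, `technology`) and the functions `apply_filters` and `facet_counts` are hypothetical illustrations, not the ISA-explorer's actual data model or code. It also assumes that values within one facet are alternatives (OR) while different facets must all match (AND), which the slides do not confirm.

```python
from typing import Dict, List, Set

# Hypothetical metadata: each data descriptor has a set of values per facet.
Descriptor = Dict[str, Set[str]]

DESCRIPTORS: Dict[str, Descriptor] = {
    "descriptor-A": {"organism": {"Homo sapiens"}, "technology": {"MRI"}},
    "descriptor-B": {"organism": {"Mus musculus"}, "technology": {"MRI"}},
    "descriptor-C": {"organism": {"Homo sapiens"}, "technology": {"sequencing"}},
}


def apply_filters(
    descriptors: Dict[str, Descriptor], selected: Dict[str, Set[str]]
) -> List[str]:
    """Return the sorted ids of descriptors that match every selected facet.

    Assumption: within a facet any selected value may match (OR);
    across facets every facet must match (AND).
    """
    matches = []
    for descriptor_id, facets in descriptors.items():
        if all(facets.get(facet, set()) & values for facet, values in selected.items()):
            matches.append(descriptor_id)
    return sorted(matches)


def facet_counts(descriptors: Dict[str, Descriptor], facet: str) -> Dict[str, int]:
    """Count how many descriptors are annotated with each value of one facet."""
    counts: Dict[str, int] = {}
    for facets in descriptors.values():
        for value in facets.get(facet, set()):
            counts[value] = counts.get(value, 0) + 1
    return counts
```

Under these assumptions, `apply_filters(DESCRIPTORS, {"organism": {"Homo sapiens"}, "technology": {"MRI"}})` narrows the collection to the descriptors satisfying both filters, and `facet_counts(DESCRIPTORS, "technology")` gives the per-annotation counts of associated data descriptors.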

Page 28: Data publication: Discover, Explore, Visualise
Page 29: Data publication: Discover, Explore, Visualise
Page 30: Data publication: Discover, Explore, Visualise

Visualise the samples’ characteristics

Page 31: Data publication: Discover, Explore, Visualise

Publication date

Page 32: Data publication: Discover, Explore, Visualise

Open associated data descriptor

Page 33: Data publication: Discover, Explore, Visualise

Download Metadata

Licensing

Page 34: Data publication: Discover, Explore, Visualise

Links to data repositories to access the data

Assays details

Page 35: Data publication: Discover, Explore, Visualise

Summary

• Challenges associated with scholarly data
• Importance of all research outputs / metadata
• Reproducibility crisis
• Experiments description
• Data availability
• Data publication
• Springer Nature Scientific Data
• Discover, Explore, Visualise Scholarly Data
• Scientific Data ISA-explorer

Page 36: Data publication: Discover, Explore, Visualise

Philippe Rocca-Serra, PhD, Senior Research Lecturer
Alejandra Gonzalez-Beltran, PhD, Research Lecturer
Milo Thurston, DPhil, Research Software Engineer
Massimiliano Izzo, PhD, Research Software Engineer
Peter McQuilton, PhD, Knowledge Engineer
Allyson Lister, PhD, Knowledge Engineer
Eamonn Maguire, DPhil, Software Engineer contractor
David Johnson, PhD, Research Software Engineer
Susanna-Assunta Sansone, PhD, Principal Investigator, Associate Director

Communities we work with/for:

Our main areas of research and activity:

• Enabling reproducible research through…
• Data collection, curation, representation etc.
• Data publication
• Data provenance
• Development of software, infrastructure
• Open, community ontologies and standards
• Semantic web / linked data
• Training