
BioSharing for the NIH BD2K community

Apr 15, 2017

Transcript
Page 1: BioSharing for the NIH BD2K community

@biosharing

dx.doi.org/10.6084/m9.figshare.4055496.v1

Page 2: BioSharing for the NIH BD2K community

de jure standard organizations

de facto grass-roots groups

Nanotechnology Working Group

Formats, Terminologies, Guidelines

Community-driven efforts: just a few examples

Content standards: domain-level descriptors essential for interpretation, verification and reproducibility of datasets

Page 3: BioSharing for the NIH BD2K community

Formats, Terminologies, Guidelines

Counts shown on the slide (each with a source): 212, 113, 500+

Guidelines: MIAME, MIRIAM, MIQAS, MIX, MIGEN, ARRIVE, MIAPE, MIASE, MIQE, MISFISHIE…, REMARK, CONSORT

Formats: SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, SEDML…, GELML, ISA, CML, MITAB

Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, BTO, IDO…, TEDDY, PRO, XAO, DO, VO

Content standards – in numbers

Page 4: BioSharing for the NIH BD2K community

Data policies by funders, journals and other organizations

Databases, tools and services

Content standards

Complex and evolving landscape

Formats Terminologies Guidelines

Page 5: BioSharing for the NIH BD2K community

A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate the user community

Page 6: BioSharing for the NIH BD2K community

Helping users make the right decision

Page 7: BioSharing for the NIH BD2K community
Page 8: BioSharing for the NIH BD2K community
Page 9: BioSharing for the NIH BD2K community
Page 10: BioSharing for the NIH BD2K community

Ready for use, implementation, or recommendation

In development

Status uncertain

Deprecated as subsumed or superseded

Manually curated and verified by the community behind each resource

Indicators to describe ‘status’
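
The four status indicators on this slide could be modelled as a small enumeration. This is a minimal sketch only: the `Status` class, the `recommendable` helper and the record names are hypothetical and are not taken from the BioSharing portal itself.

```python
from enum import Enum


class Status(Enum):
    """The four status indicators listed on the slide."""
    READY = "Ready for use, implementation, or recommendation"
    IN_DEVELOPMENT = "In development"
    UNCERTAIN = "Status uncertain"
    DEPRECATED = "Deprecated as subsumed or superseded"


def recommendable(records):
    """Return the names of records whose status is READY, sorted by name."""
    return sorted(name for name, status in records.items()
                  if status is Status.READY)


# Hypothetical records, used only to illustrate the filter.
example_records = {
    "standard-a": Status.READY,
    "standard-b": Status.IN_DEVELOPMENT,
    "standard-c": Status.DEPRECATED,
    "standard-d": Status.UNCERTAIN,
}

print(recommendable(example_records))
```

Keeping the slide's wording as the enum values means the same labels can be shown to users without a separate lookup table.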

Page 11: BioSharing for the NIH BD2K community

Tracking evolution

Page 12: BioSharing for the NIH BD2K community
Page 13: BioSharing for the NIH BD2K community
Page 14: BioSharing for the NIH BD2K community
Page 15: BioSharing for the NIH BD2K community
Page 16: BioSharing for the NIH BD2K community

§ We have curated 586 of the 674 standards records (87%)

• 155 are claimed by the relevant [community], and so confirmed to be ‘active’ (23%); 64 standards are deprecated, 8 in development, 19 uncertain

• 55 standards were generated via a worldwide consortium/organisation/governing body; 341 involved the USA either alone or in partnership with another country; 138 the UK; 53 EU; 8 China

• the most adopted standard is FASTA, implemented in 247 databases

• 309 (46%) are implemented in 1 or more databases
  o 151 (22%) implemented in 1 database; 64 (9%) implemented in 2 databases; 157 (23%) implemented in 2 or more databases

Preliminary analysis – work in progress
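
The percentages quoted on this slide can be reproduced from the raw counts. The sketch below assumes that every percentage is taken relative to the 674 total records, which the slide does not state explicitly; the dictionary labels and the helper name are illustrative.

```python
TOTAL_RECORDS = 674  # total standards records quoted on the slide

# Counts copied from the slide; the labels are paraphrased.
counts = {
    "curated": 586,
    "claimed and confirmed active": 155,
    "implemented in 1 or more databases": 309,
    "implemented in 1 database": 151,
    "implemented in 2 databases": 64,
    "implemented in 2 or more databases": 157,
}


def percent_of_total(count, total=TOTAL_RECORDS):
    """Percentage of the total, rounded to the nearest whole number."""
    return round(100 * count / total)


percentages = {label: percent_of_total(n) for label, n in counts.items()}

for label, pct in percentages.items():
    print(f"{label}: {counts[label]} of {TOTAL_RECORDS} ({pct}%)")
```

Under that assumption the rounded values match the figures on the slide (87%, 23%, 46%, 22%, 9%, 23%).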

Page 17: BioSharing for the NIH BD2K community

Standard developing groups:

Journal publishers:

Registry

Cross-links / data exchange:

Societies and organisations:

Institutional RDM services:

Projects/programmes:

Some user communities and collaborators

Page 18: BioSharing for the NIH BD2K community

bsg-000174

biosharing:ReportingGuideline

bsg-000161

MINSEQE

MIMARKS

sample information

sample identifier

taxonomy identifier

sequence read

geolocation

High-level information about the metadata standards

Representations of the standards elements

Template elements for

el-000001

el-000002

el-000003

provenance: MINSEQE

provenance: MINSEQE and MIMARKS

provenance: MIMARKS

• Serve machine-readable content metadata standards, providing provenance for their elements

• Inform the creation of metadata templates, rendering standards invisible to the researchers

From standards to CDEs-like, to annotation templates
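
One way to picture elements that carry their provenance is a small record type listing the standards each element comes from. This is a sketch under assumptions: the pairing of `el-000001` to `el-000003` with the three provenance labels follows the order in which they appear on the slide, and the `Element` class and both helper functions are hypothetical, not BioSharing's actual data model or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    """A standard element and the standards it originates from."""
    identifier: str
    provenance: frozenset


# Pairing of identifiers and provenance is assumed from slide order.
ELEMENTS = [
    Element("el-000001", frozenset({"MINSEQE"})),
    Element("el-000002", frozenset({"MINSEQE", "MIMARKS"})),
    Element("el-000003", frozenset({"MIMARKS"})),
]


def template_elements(standards, elements=ELEMENTS):
    """Identifiers of elements drawn from any of the requested standards."""
    wanted = set(standards)
    return [e.identifier for e in elements if e.provenance & wanted]


def shared_elements(elements=ELEMENTS):
    """Identifiers of elements that appear in more than one standard."""
    return [e.identifier for e in elements if len(e.provenance) > 1]


print(template_elements(["MINSEQE"]))
print(shared_elements())
```

A template built this way would show researchers only the elements, while the provenance field keeps the link back to the originating standards.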