Top Banner
BioSharing mapping the metadata standards in the life sciences BioSharing mapping the metadata standards in the life sciences Peter McQuilton, PhD BioSharing Content Lead
22

BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Apr 13, 2017

Download

Science

Peter McQuilton
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

BioSharing – mapping the metadata standards in the life sciences

BioSharing – mapping the metadata standards in the life sciences

Peter McQuilton, PhD

BioSharing Content Lead

Page 2: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Outline

1. The sea of standards in the life sciences

1. What is BioSharing?

2. Work in progress

3. BioSharing WG and potential collaborations

Page 3: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

miameMIAPA

MIRIAMMIQASMIX

MIGEN

ARRIVEMIAPE

MIASE

MIQE

MISFISHIE….

REMARK

CONSORT

MAGE-TabGCDML

SRAxmlSOFT FASTA

DICOM

MzMLSBRML

SEDML…

GELML

ISA-Tab

CML

MITAB

AAOCHEBI

OBIPATO ENVO

MOD

BTOIDO…

TEDDY

PROXAO

DO

VO

There are more than 600 standards in the life sciencesThere are more than 600 standards in the life sciences

Databases and tools implementing

Standards; also training material on and around

standards

Page 4: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Mapping Standards and Databases in the Life Sciences

“Which standard should I use for this data, considering I’d like to publish in X journal?

“Are we using the most up-to-date version of this standard?”

“My data is in X format, which databases take that format?

Page 5: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

5

(2008)

(2009)

(2011)

(2015)

• registry development,• content collection,• community engagement,• funds raising• creation of a WG in RDA and

Force11; Advisory Board• links with ELIXIR and NIH

BD2K

Page 6: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

A$web$based,$curated and$searchable.registry.ensuring$that$biologicalstandards and$databases are$registered,$informative and$discoverable5$also$monitoring$the$development$and$evolution$of$standards,$their$use$in$

databases$and$the$adoption$of$both$in$data$policies.$

Page 7: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Researchers,$developers and$curators lack$support$and$guidance$on$how$to$best$navigate$and$select$content$standards,$understand$their$maturity,$or$find$databases$that$implement$them5$

Funders,$journals and$librarians do$not$have$enough$ information$to$make$informed$decisions$on$which$content$standards$or$database$to$recommended$ in$policies,$or$fund$or$implement

Our mission: To help people make the right choice

Page 8: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Three interlinked registries

Page 9: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

STANDARD DATABASE

Standards and databases cross-linked

Page 10: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Terminologies linked to NCBO BioPortal

Page 11: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

The$ International$ Conference$ on$Systems$Biology$ (ICSB),$22J28$ August,$2008$ $$$$$$Susanna$Assunta.Sansone www.ebi.ac.uk/netJproject

Tracking evolution, e.g. deprecations and substitutions

Page 12: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

The$ International$ Conference$ on$Systems$Biology$ (ICSB),$22J28$ August,$2008$ $$$$$$Susanna$Assunta.Sansone www.ebi.ac.uk/netJproject

Tracking evolution, e.g. deprecations and substitutions

Page 13: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Core.functionalities:• search$and$filtering,$e.g.$by$funder,$domain,$type$of$standard

• Refine$by$publication,$maintainer$etc.

• add$new$records,$edit$existing$records

• “claim”$records• person’s$profile$(as$maintainer$of$records)$associated$to$the$ORCID$profile$(for$credit)

• visualization$and$views$of$content$and$linking

Annotation.Sources:• 4$axis:$(material,$process,$quality,$information)

• NIF,OBI,CL,GO,IAO,EDAM

Search filter, refineSearch filter, refine

Page 14: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session
Page 15: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Create your own CollectionCreate your own Collection

Page 16: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session
Page 17: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session
Page 18: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

1..Maximize$categorization$to$improve$searching,$continue$curation,$add$records,$increase$maintainers/vetting$of$records

2..Schema$views:$progressively$visualize$formats$and$reporting$guidelines$as$tested$for$terminologies$(via$BioPortal)$

3..Export$of$metadata$elements:$as$part$of$

WIP – surveys, advisory board & WGsWIP – surveys, advisory board & WGs

Page 19: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

The relationship amongpopular standard formatsfor pathway information.Demir, et al., The BioPAXcommunity standard forpathway data sharing,NatBiotech.2010.

4.$Create$relation$or$“usage$maps$and$guides”,$e.g.:$

5.$Metrics$of$maturity,$usability$and$popularity,$kickJstarted$by$Tenenbaum,$Sansone,$Haendel.$$A$sea$of$standards$for$omics$data:$sink$or$swim?$J Am7Med7Inform7Assoc (2014).6.$Embed$in$the$ecosystem$of$complementary$registries

WIP – surveys, advisory board & WGsWIP – surveys, advisory board & WGs

Page 20: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Green.=..0.$ 6.monthsWG7is7just7starting

The products are

• principles for linking information about databases, contentstandards and journal / funder policies in the life sciences

• and a curated registry to access and cross-search the information, onwhich a variety of stakeholders can base their decisions

This is a joint WG

The work will benefit from the existing also part of

Page 21: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Key points:

• We all provide information to those seek guidance to use, do or recommend something (databases, tools, standards, training material etc.)

• Designed with a slightly different focus and user needs; focus on curating/cataloguing certain information

• Have different entry points and UIs to fit the specific scope/user needs is good, so federation is key, e.g. cross-referencing

• Pool resources where appropriate

Collaboration and communication

Page 22: BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG Joint session

Proponents, core WG members and adoptersOperational Team – based at the

Philippe Rocca-Serra, PhD

AlejandraGonzalez-Beltran, PhD

Milo Thurston, PhD

Peter McQuilton, PhD

Allyson Lister, PhD

EamonnMaguire, DPhil / Contractor

David Johnson, PhD

SusannaSansone, PhD

WED - Session 1• RDA/WDS Publishing Data Workflows

THUR - Session 6• Joint meeting of IG Metadata, WG Metadata Standards

Catalog & WG BioSharingRegistry: Connecting data policies, standards & databases in life sciences"

FRI - Sessions 8 and 9• Joint meeting of IG ELIXIR Bridging Force & WG

BioSharingRegistry: Life Sciences and Sensitive Data”• Joint meeting of IG Domain Repositories, IG Agriculture

Data Interest Group (IGAD), WG BioSharingRegistry, IG RDA/CODATA Materials Data, Infrastructure & Interoperability & WG Wheat Data Interoperability"

Where we are and present