Connecting standards, databases and data policies. Susanna-Assunta Sansone, Associate Director, Oxford e-Research Centre, University of Oxford
BioSharing - EUDAT semantic workshop

Apr 13, 2017

Transcript
Page 1: BioSharing - EUDAT semantic workshop

connecting standards, databases and data policies

Susanna-Assunta Sansone

Associate Director

Oxford e-Research Centre, University of Oxford

Page 2: BioSharing - EUDAT semantic workshop

• Domain-level descriptors that are essential for interpretation, verification, reproducibility and reusability of datasets

• The depth and breadth of descriptors vary according to the domain broadly covering the what, who, when, how and why

Content standards

Page 3: BioSharing - EUDAT semantic workshop

Formats Terminologies Guidelines

Content standards: three categories

Page 4: BioSharing - EUDAT semantic workshop

Content standards: three categories

Guidelines: minimum information reporting requirements, checklists

o Report the same core, essential information

o e.g. MIAME guidelines

Terminologies: controlled vocabularies, taxonomies, thesauri, ontologies etc.

o Unambiguous identification and definition of concepts

o e.g. Gene Ontology

Formats: conceptual model, schema, exchange formats etc.

o Define the structure and interrelation of information, and the transmission format

o e.g. FASTA

Page 5: BioSharing - EUDAT semantic workshop

Community-driven initiatives

Formats Terminologies Guidelines

de jure: standard organizations

de facto: grass-roots groups

Nanotechnology Working Group

Page 6: BioSharing - EUDAT semantic workshop

Content standards in numbers

Figures shown on the slide: 883 -> ~1000; 220+; 115+; 548

Guidelines: MIAME, MIRIAM, MIQAS, MIX, MIGEN, ARRIVE, MIAPE, MIASE, MIQE, MISFISHIE, REMARK, CONSORT, MIAPPE, …

Formats: SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, SEDML, GELML, ISA, CML, MITAB, Sample-Tab, …

Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, BTO, IDO, TEDDY, PRO, XAO, DO, VO, …

Page 7: BioSharing - EUDAT semantic workshop

Content standards

Data policies by funders, journals and other organizations

Databases, tools and services

Formats Terminologies Guidelines

Mapping this evolving landscape

Page 8: BioSharing - EUDAT semantic workshop

Content standards

Data policies by funders, journals and other organizations

Databases, tools and services

Formats Terminologies Guidelines

a resource of the ELIXIR Interoperability Platform

• A web-based, curated and searchable portal that monitors their development and evolution to inform and educate

Page 9: BioSharing - EUDAT semantic workshop
Page 10: BioSharing - EUDAT semantic workshop
Page 11: BioSharing - EUDAT semantic workshop
Page 12: BioSharing - EUDAT semantic workshop

Not just quantity but quality: rich, curated and community-vetted descriptions

Page 13: BioSharing - EUDAT semantic workshop

Indicators to describe the status of standards and databases

Ready for use, implementation, or recommendation

In development

Status uncertain

Deprecated as subsumed or superseded

Manually curated and verified by the community behind each resource
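
The four status indicators above could be modelled as a small enumeration. This is only a sketch: the class name `Status`, the record layout and the helper `ready_for_use` are assumptions made for illustration and are not taken from the portal itself. The record names are placeholders, not real standards.

```python
from enum import Enum


class Status(Enum):
    """Status indicators as worded on the slide (hypothetical model)."""
    READY = "Ready for use, implementation, or recommendation"
    IN_DEVELOPMENT = "In development"
    UNCERTAIN = "Status uncertain"
    DEPRECATED = "Deprecated as subsumed or superseded"


def ready_for_use(records):
    """Return the names of records whose status is READY."""
    return [r["name"] for r in records if r["status"] is Status.READY]


# Placeholder records; names and statuses are invented for the example.
records = [
    {"name": "standard-a", "status": Status.READY},
    {"name": "standard-b", "status": Status.IN_DEVELOPMENT},
    {"name": "database-c", "status": Status.DEPRECATED},
    {"name": "database-d", "status": Status.READY},
]

print(ready_for_use(records))
```

A filter like this is one way a policy maker might restrict a recommendation list to resources marked ready for use.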

Page 14: BioSharing - EUDAT semantic workshop

Tracking evolution, e.g.:

Page 15: BioSharing - EUDAT semantic workshop

Visualizing relations, e.g.:

Data Policy: List of their recommended databases and standards

Page 16: BioSharing - EUDAT semantic workshop

…to inform and educate on existing and new resources

Data Policy

Page 17: BioSharing - EUDAT semantic workshop

Working with/for the community and our ‘adopters’, e.g.:

Standard developing groups:

Journals, publishers:

Cross-links, data exchange:

Societies and organisations:

Institutional RDM services:

Projects, programmes:

533 responders

Page 18: BioSharing - EUDAT semantic workshop

Progressively cross-linking with other ELIXIR resources

Cross-links, data exchange:

Societies and organisations:

Standard developing groups:

Journals, publishers:

Institutional RDM services:

Projects, programmes:

Page 19: BioSharing - EUDAT semantic workshop

• Increase discoverability (e.g. by search engines), aggregation (e.g. by indices) and analysis of content in different websites and services

• use of schema.org structured semantic markup (for web pages’ content) by Google, Bing, Yahoo, Yandex

• coordinate its extension, where needed, in the life science area
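
As an illustration of the schema.org markup mentioned above, the sketch below builds a JSON-LD description using the schema.org `Dataset` type and wraps it in the script tag that search engines read from a web page. The helper names (`dataset_jsonld`, `as_script_tag`), the example record and the example.org URL are assumptions for this example, and the slide does not say which schema.org types the project actually uses.

```python
import json


def dataset_jsonld(name, description, url, keywords):
    """Build a schema.org Dataset description as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        "keywords": list(keywords),
    }


def as_script_tag(record):
    """Serialize the record into an embeddable JSON-LD script element."""
    payload = json.dumps(record, indent=2)
    return '<script type="application/ld+json">\n' + payload + "\n</script>"


# Hypothetical record; every value here is invented for illustration.
record = dataset_jsonld(
    name="Example expression dataset",
    description="Placeholder description of a life science dataset.",
    url="https://example.org/datasets/1",
    keywords=["transcriptomics", "example"],
)

print(as_script_tag(record))
```

Embedding the resulting element in a page's HTML is what lets an indexer aggregate the same fields across different websites.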

Gaining traction and support by:

Page 20: BioSharing - EUDAT semantic workshop
Page 21: BioSharing - EUDAT semantic workshop
Page 22: BioSharing - EUDAT semantic workshop

Acknowledgements