This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Promoting Coherent Minimum Reporting Guidelines for Biological & Biomedical Investigations: The MIBBI Project
MIBBI [www.mibbi.org] HUPO Proteomics Standards Initiative [psidev.sf.net] Research Information Network [www.rin.ac.uk]
On standards bodies
What defines a standards-generating body?— A beer and an airline (Zappa)— Formats, reporting guidelines, controlled vocabularies— Regular open attendance meetings, discussion lists,
Hugely dependent on their respective communities— Requirements gathering (What are we doing and why?)— Development (By the people, for the people)— Testing (No it isn’t finished, but yes I’d like you to use
— The user community (capture, store, search, analyse)
Technologically-delineated views of the world A: transcriptomics B: proteomics C: metabolomics …and…
Biologically-delineated views of the world A: plant biology B: epidemiology C: microbiology …and…
Generic features (‘common core’) — Description of source biomaterial — Experimental design components
Arrays
Scanning Arrays &Scanning
Columns
GelsMS MS
FTIR
NMR
Columns
Modelling the biosciences
Modelling the biosciences (slightly differently)
Assay: Omics and miscellaneous techniques
Investigation:
Medical syndrome, environmental effect, etc.Study: Toxicology, environmental science, etc.
Multiple all that by three (kinds of standard)
What biologists need
Diverse community-specific extensions
Generic Features (origin of biomaterial)
Generic Features (experimental design)
Diverse community-specific extensions
Generic Features (origin of biomaterial)
Generic Features (experimental design)
Well-oiled cogs meshing perfectly (would be nice)
How well are things working?—Cue the Tower of Babel analogy…—Situation is improving with respect to standards—But few tools, fewer carrots (though some
sticks)
Why do we care about that..?—Data exchange—Comprehensibility of work—Scope for reuse (parallel or orthogonal)
“Publicly-funded research data are a public good, produced in the public interest”
“Publicly-funded research data should be openly available to the maximum extent possible.”
Investigation / Study / Assay (ISA) Infrastructurehttp://isatab.sourceforge.net/
Ontology of Biomedical Investigations (OBI)http://obi.sourceforge.net/
Comparison of MIBBI-registered projects [21] ● Release
Granularity Coarse Medium Fine
Maturity ● Planned ● Drafting
The MIBBI Project (www.mibbi.org)
The MIBBI Project (www.mibbi.org)
Interaction graph for projects (line thickness & colour saturation show similarity)
The MIBBI Project (www.mibbi.org)
MICheckout: Supporting Users
Why should I dedicate resources to providing data to others?
—Pro bono arguments have no impact—‘Sticks’ from funders and publishers get the bare minimum
This is just a ‘make work’ scheme for bioinformaticians—Bioinformaticians get a buzz out of having big databases—Bioinformaticians benefitting from others’ work
I don’t trust anyone else’s data — I’d rather repeat work—Problems of quality, which are justified to an extent—But what of people lacking resource for this, or people who
want to refer to proteomics data but don’t do proteomics
How on earth am I supposed to do this anyway..?—Perception that there is no money to pay for this—No mature free tools — Excel sheets are no good for HT—Worries about vendor support, legacy systems (business
models)
The objections to fuller reporting
Data sharing is more or less a given now, and tools are emerging
—Lots of sticks, but they only get the bare minimum—How to get the best out of data generators?—Only meaningful credit will work
Need central registries of data sets that can record reuse—Well-presented, detailed papers get cited more frequently—The same principle should apply to data sets—So, OpenIDs for people, DOIs for data?
Side-benefits, challenges—Would also clear up problems around paper authorship—Would enable other kinds of credit (training, curation, etc.)—May have to be self-policing — researchers ‘own’ their
credit portfolio (though an enforcement body would also be useful)