Top Banner
BioUML – open source integrated platform for collaborative and reproducible research in systems biology www.biouml.or Fedor Kolpakov, Institute of Systems Biology Ltd. Alexander Kel, geneXplain GmbH
66

BioUML – open source integrated platform for collaborative and reproducible research in systems biology Fedor Kolpakov, Institute of Systems.

Dec 25, 2015

Download

Documents

Hubert Wheeler
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioUML – open source integrated platform for

collaborative and reproducible research in systems biology

www.biouml.org

Fedor Kolpakov, Institute of Systems Biology Ltd.Alexander Kel, geneXplain GmbH

Page 2: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioUML platform• BioUML is an open source integrated platform for systems biology

that spans the comprehensive range of capabilities including access to databases with experimental data, tools for formalized description, visual modeling and analyses of complex biological systems.

• Due to scripts (R, JavaScript) and workflow support it provides powerful possibilities for analyses of high-throughput data.

• Plug-in based architecture (Eclipse run time from IBM is used) allows to add new functionality using plug-ins.

BioUML platform consists from 3 parts: • BioUML server – provides access to biological databases;• BioUML workbench – standalone application. • BioUML web edition – web interface based on AJAX technology

Page 3: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 4: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

• supported standards: SBML, SBGN, BioPAX, SED-ML, SBO, MIRIAM, CellML– some examples, CellDesigner extension support– state concept– SED-ML as workflow

• Modular modelling: composite models, agent based models• systems biology – reproducible highthroughput data analyses:

• analyses: algorithms, scripts, workflows• integration with R/Bioconductor, Galaxy• data: microarrays, NGS, ChIP-SEQ• visualization: genome browser

• BioUML – as platform for collaborative research– Amazon EC2 servers– data repository - groups, projects, import/export, FTP upload– chat, history

• current works: Biostore, LIMS ( laboratory information management system)

Main topics

Page 5: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Systems Biology

Analysis

Computational modeling

Experimental data

Experimentalhypothesisvalidation

Page 6: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Analysis

Computational modeling

Experimental data

Experimentalhypothesisvalidation

COMBINE

Systems Biology

Page 7: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Standard supports:

SBML, SBGN, BioPAX, SED-ML, SBO, MIRIAM, CellML

Page 8: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

• import/export - level 1, 2, 3 (core)• passed all tests from SBML test suite version 2.0.0 beta 1

(2010, April 2)• extensions:

– CellDesigner– SBGN-PD (own XML format)

• Biomodels– full text search– SBGN -PD (parse RDF annotation do determine specie types)– layout algorithms– on-line simulation

• Panther DB– full text search– reads CellDesigner extensions– SBGN-PD

SBML support

Page 9: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 10: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 11: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 12: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 13: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

CellDesigner support – comparison for 165 images from Panther DB and generated by BioUML

Page 14: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

• diagram types: PD – beta; ER, AF – alpha• defined as XML graphic notation for BioUML• graphic notation can be edited using BioUML graphic notation editor• special extension for SBML• Reactome - SBGN-PD

– full text search– SQL version is used– read diagram layout from Reactome

• BioPAX – SBGN-PD– level 2 (beta), level 3 (alpha)– auto layout

• TRANSPATH – SBGN-PD– full text search– auto-layout

SBGN support

Page 15: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 16: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Reactome

Page 17: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioPAX – import dialog

Page 18: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioPAX example (BioCyc)

Page 19: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

TRANSPATH example

Page 20: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

• SED-ML import – only SBML models are supported now– stored in user’s project

• SED-ML changes presented as BioUML states• SED-ML presented as workflow• automated hierarchic layout• on-line simulation

SED-ML support

Page 21: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

SED-ML – import dialog

Page 22: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

SED-ML workflow

Page 23: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Simulation

Page 24: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Parameters fitting

Page 25: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Main features• Experimental data – time courses or steady states

expressed as exact or relative values of substance concentrations

• Different optimization methods for analysis• Multi-experimentsfitting• Constraint optimization• Local/global parameters• Parameters optimization using java script

Page 26: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Parameters fitting – user interface

Page 27: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Bentele M, 2004

Neumann L, 2010

CD95L module and results of fitting its dynamics to experimental data

Page 28: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Comparison with COPASI (10,000 simulations)

Method BioUML(4 cores)

BioUML(1 core)

COPASI(1 core)

Evolutionary Programming – –

1 min 58,2sec1 min 31,3 sec1 min 16,6 sec

Particle swarm 7,1 sec7,7 sec6,9 sec

22,4 sec15,3 sec22,5 sec

1 min 32 sec1 min 26,4 sec1 min 07,1 sec

Stochastic Ranking Evolution Strategy

7,5 sec7,47 sec 6,9 sec

23,4 sec23,5 sec22,2 sec

1 min 25,0 sec1 min 5,6 sec1 min 8,8 sec

Cellular genetic algorithm

7,7 sec7,5 sec7,2 sec

25,5 sec22,1 sec20,8 sec

Page 29: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Modular modelling

Composite modelof the Apoptosis Machinery

Agent based model of arterial blood pressure regulation

Page 30: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Yuri Lazebnik “Can a biologist fix a radio?—Or, what I learned while studying apoptosis“. Cancer Cell, 2002, 2(3): 179-182

Page 31: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Biologist view

Engineer’s view

Page 32: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

2002

However, I hope that it is only a question of time before a user-friendly and flexible formal language will be taught to biology students, as it is taught to engineers, as a basic requirement for their future studies. My advice to experimental biologists is to be prepared.

Y. Lazebnik, 2002

2010Standards in systems biology

SBML – Systems Biology Markup Language

SBGN – Systems Biology Graphics Notation

Page 33: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

TNF-α module (SBGN)

Page 34: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Biologist view

Engineer’s view

Page 35: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Biologist view

Engineer’s view

Last centaury: 50-70-th

Page 36: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Modular design

Page 37: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Modules: clear specification of interfaces

input/output

contacts

Page 38: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Modular model of apoptosis• 13 modules• 286 species• 684 reactions• 719 parameters

Page 39: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

EGF module(BMOND ID: Int_EGF_module)

Schoeberl B, et al: Nature Biotechnology 2002

Borisov N, et al: Molecular Systems Biology 2009Additions:Reactions of protein syntheses and degradations

Page 40: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Mitochondron module(BMOND ID: Int_Mitoch_module)

Bagci EZ, et al, Biophysical J 2006Albeck JG, et al, PLoS Biol 2008Additions:Activation of CREB and deactivation of BAD by Akt-PP and ERK-PPUpregulation of Bcl-2 by CREBBcl-2 suppression by p53

Page 41: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

top-down

Modular model allows us to combine both up-down and bottom-up approaches

bottom-up

Page 42: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Computational modeling

Experimental data

Experimentalhypothesisvalidation

COMBINE

Systems Biology

DataAnalysis

Page 43: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Systems biology:

reproducible highthroughput data analyses

– analyses: algorithms, scripts, workflows– integration with R/Bioconductor, Galaxy– data: microarrays, NGS, ChIP-SEQ– visualization: genome browser

Page 44: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 45: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

R world Java/BioUML world

JavaScript host objects allowsto merge R/Bioconductor and

Java/BioUML worlds

Page 46: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 47: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

ChIP-seq processing pipeline

Page 48: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Galaxy – analyses methods

Page 49: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Galaxy - workflow

Page 50: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Genome browser

Page 51: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

• uses AJAX and HTML5 <canvas> technologies• interactive - dragging, semantic zoom• tracks support• Ensembl• DAS-servers• user-loaded BED/GFF/Wiggle files

Genome browser: main features

Page 52: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 53: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 54: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Computational modeling

Experimental data

Experimentalhypothesisvalidation

COMBINE

Systems Biology

DataAnalysis

Data repositoryfor collaborative work

Page 55: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioUML:platform for collaborative research

– Amazon EC2 servers– data repository

– groups – projects – import/export, FTP upload

– chat– history (in process)

Page 56: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 57: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.
Page 58: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Main current works:

– LIMS (Laboratory Information Management System)– Biostore

Page 59: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Computational modeling

Experimental data

Experimentalhypothesisvalidation

COMBINE

Systems Biology

DataAnalysis

Data repositoryfor collaborative work

LIMS

Page 60: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

BioUML – LIMS (Laboratory Information Management System)

– is being developed by Data Integrated Solutions Inc. (Pittsburg, USA) start-up organized by Maxim Mikheev and Fedor Kolpakov– experiment is represented as workflow– automated management by laboratory equipment– automated generation of tasks for laboratory staff– automated upload of experimental data into repository– automated row data analysis

Page 61: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Android market

Android

AppStore

MacOS,iPOD, iPhone

Market

Platform

Page 62: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Android market

Android

AppStore

MacOS,iPOD, iPhone

Market

Platform

Biostore

BioUML

Page 63: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Biostore

BioUML platform

Developers- plug-ins: methods, visualization, etc.- databases

Users- subscriptions- collaborative & reproducible research

Experts-services for data analysis- on-line consultations

BioUML ecosystem

provide toolsand databases

use provide services

Page 64: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Main platforms for bioinformaticsand BioUML

Tavernastandalone applicationpowerful workflows

Galaxy

workflows, web interface, collaborative research,

genome browser

scripts, statistics, plots

R/Bioconductor

BioUML platform

standalone applicationpowerful workflows

web interface,collaborative research

genome browser

scripts, statistics, plots

BioClipse

Eclipse plug-in based architecture,chemoinformatics

Eclipse plug-in based architecture,chemoinformatics

Page 65: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Main platforms for bioinformaticsand BioUML

Tavernastandalone applicationpowerful workflows

Galaxy

workflows, web interface, collaborative research,

genome browser

scripts, statistics, plots

R/Bioconductor

BioUML platform

standalone applicationpowerful workflows

web interface,collaborative research

genome browser

scripts, statistics, plots

+ systems biology• visual modelling• simulation• parameters fitting• …

+ chat for on-line consultations

BioClipse

Eclipse plug-in based architecture,chemoinformatics

Eclipse plug-in based architecture,chemoinformatics

Page 66: BioUML – open source integrated platform for collaborative and reproducible research in systems biology  Fedor Kolpakov, Institute of Systems.

Acknowledgements

Part of this work was partially supported by the grant: European Committee grant №037590 “Net2Drug”

European Committee grant №202272 “LipidomicNet”Integration and interdisciplinary grants №16, 91 of SB RAS.

BioUML team Software developers BiologistsNikita Tolstyh Ilya Kiselev Ruslan Sharipov Tagir Valeev Elena Kutumova Ivan YevshinAnna Ryabova Alexey Shadrin