Top Banner
the European Nutrigenomics Organisation Nu Nu GO G O Nu Nu GO G O Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht
41

The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

Dec 19, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Using pathway information to understand omics data

Chris EveloNuGO WP7

BiGCaT Bioinformatics Maastricht

Page 2: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOUn Oslo

Un Munich

Un Florence

Un Balearic Illes

Un Cork

Trinity

Un. Ulster

Rowett

Un Newcastle

Un Reading

IFR DiFE

Un Krakow

Inserm Marseille

TNO

Un Wageningen

Un Maastricht

EBI

NuNuGOGO

Un Lund

RikiltRivm

Page 3: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Understanding Array data

Typical procedure

1. Annotate the reporters with something useful (UniProt!)

2. Sort based on fold change

3. Search for your favorite genes/proteins

4. Throw away 95% of the array

Page 4: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Page 5: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Understanding Array data

Typical procedure

1. Annotate the reporters with something useful (UniProt!)

2. Sort based on fold change

3. Search for your favorite genes/proteins

4. Throw away 95% of the array

Page 6: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Understanding Array data

“Advanced” procedures

o Gene clustering or principal component analysis

o Get groups of genes with parallel expression patterns

o Useful for diagnosis

o Not adding much to understanding (unless combined)

Page 7: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Mapping

Annotation/coupling

Page 8: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOBest known: GenMAPP

Free, academic initiative with editable mapps,

collaborates with NuGO

Page 9: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Best known: GenMAPP

• Full content of GO database

• Textbook like local mapps

• Geneboxes with active backpages, coupled to online databases

• Visualize anything numerical(fold changes on arrays, p-values, present calls, proteomics results)

• Update mapps yourself

Page 10: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO GenMAPP: Full GO content

Page 11: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

GenMAPP:Textbook like maps

Extensive backpages

present with links to online

databases

Page 12: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO2D gels of 3T3-L1 (pre)-adipocytes

Enlarged sections

gels derived from:

A: 3T3-L1 pre-adipocytes,

B: 3T3-L1 adipocytes,

C: 3T3-L1 adipocytes with caloric restriction

D: 3T3-L1 adipocytes with caloric restriction and

TNF-a.

Page 13: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

GenMAPP: visualize anything numerical

Example

Proteomics results (2D gels with GC-MS identification).

Fasting/feeding study shows regulation of glycolysis (data from Johan Renes, UM).

Other useful things:

- p-values, present calls- presence in clusters- presence in QTLs

Page 14: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Update mapps yourself

You can do anything.E.g. add genes, annotation, backpage information, graphics

Next page shows a combination of metabolic mapps.

“The Nutrigenomics Masterpiece”

created by Milka Sokolović (AMC Amsterdam)

Page 15: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.
Page 16: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO MAPPfinder

• Ranks mapps where relatively many changes occur

• Useful to find unexpected pathways

• Statistics hardly developed

Page 17: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO MAPPfinder z-score

Number of genes/proteins changed on this

mapp

Expected number of changes

Standard deviation of

observed number

many dependencies to overcome

Page 18: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO MAPPfinder

• Next example from heart failure study(Schroen et al. Circ Res; 2004 95: 506-514)

Page 19: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO GenMAPP: Full GO content

Page 20: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Scientist know GenMapp

Advantages: • Free,

• Runs on (high end) MS Windows,

• Relatively easy to use,

• Reasonable visualization,

• Some pathway statistics,

• Interesting content (Including GO, KEGG),

• Content editable,

• Adopting standards (e.g. BioPax),

• Soon to become open source.

Page 21: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Scientist know GenMapp

Disadvantages: • Small academic initiative, uncertain lifespan

• No info on reactions, metabolites, location

• No change (e.g. time course) visualization

• Hard to cope with ambiguous reporters(we are working on that)

• Content could be better!

Page 22: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOMetacore example

GeneGo, IncGeneGo, Inc

• Systems ReconstructionTM Technology

www.genego.com

Page 23: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Agilent Affymetrix Proteomic SAGE

Concurrent visualization of different data types

Page 24: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

GeneGo: primitive view of multiple conditions

Can you really see what happens?

Page 25: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOBuild new networkusing MetacoreTM from GeneGO

• Around p53 protein• Making us of biological DB• Filtered to reduce complexity:

– for ‘rat ortholog’– for ‘transcriptional regulation’– for ‘liver’

Page 26: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Page 27: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Filtering needed to reduce complexity

Page 28: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

Page 29: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 1

GenMAPP local MAPPs:

Largely created by a single postdoc (Dr.Kam Dahlquist).

Page 30: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 2

KEGG:

Older pathway database (Kyoto Japan), on enzyme code (EC) level.

Example… The Homo Sapiens Urea cycle Mapp

A converted KEGG Mapp

Note that not all EC’s were converted and that they don’t have backpages.

Page 31: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 2

KEGG Conversion:

= How would you convert EC codes to Swissprot codes?

1) Go to Swissprot, look for EC code

2) Add all proteins with that EC code to GenMapp backpage

Example: Superoxide dismutase function reaction would have:Cu/Zn-SOD, Mn-SOD and Ex-SOD in backpage… (and that is not what we usually want.

Note that many other tools use KEGG converted pathways (e.g. Spotfire Decissionsite, GeneGo, Ingenuity)

Page 32: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 2

KEGG:

Another example: Apoptosis KEGG Mapp

A contributed Mapp

Somebody manually converted this Mapp!

Great work… But, there are only four of these

Page 33: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 3

Gene Ontology Database:

Simple tree structure database with a of lot biological content (biologist know and like it).

Automatic annotation possible even for EST’s

See structure in MappFinder (1) (or use Go browser)

Page 34: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Datasources 4

Alternative programs like GeneGo:

Based on expert knowledge (20 Russian biochemists).

Page 35: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

NuGO data pathway data collection workflow

Combine and forwardexisting maps

to limited group of experts

Text miningfrom key genes/metabolites

Forward improved mapsto limited group of experts

Collect back page info

Forward new draft to alarger group of experts

within NuGO

Develop storage format plus tools

Think of best way to storepathway information

Develop/adapt entry toolsplus converters

Test resulting maps

Make maps available

Page 36: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO

BioPAX Plus/GMML 2

Working with Reactome, GenMapp and BioPax

Expert data

Reactome

BioPAX

GMML

Current GenMapp

GenMapp 2

NUGO/EBI

EBIMDP4/GenMapp

With Philippe Rocca and Imre

Vastrik (EBI/Reactome) we will define a way to

get Reactome views and export

them to GenMapp2

BiGCaT students created GenMapp 2 – GMML converters

with help from Lynn Ferrante (GenMapp.org)

Rachel van Haaften (BiGCaT/NuGO) and

Marjan van Erk (TNO/NuGO) visited EBI early 2005 to learn doing this

This step has not been taken care off as of yet…

Rachel van Haaften (BiGCaT/NuGO) and

Marjan van Erk (TNO/NuGO) will

test this and give user feedback

GMML (GenMapp Markup Language) is a superset of BioPAX

1. BioPAX could contain graphical views. (GMML 2 =

BioPAX2).

But, how doe we make that happen?

Page 37: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO Can it help you?

Seeing the errors

and getting useful information

A NuGO example

Red Wine Polyphenols (Dr Cristina Luceri)

Page 38: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOClusters in control grouprepresenting pathways

Hierarchical Clustering

7.12E3 0

1 1706 2 2

rat 5 rat 12 rat 14 rat 13 rat 4 rat 3 rat 2 rat 1 rat 15 rat 11

Scatter Plot

column for spotfire0 10 20 30 40 50 60 70 80

-80

-70

-60

-50

-40

-30

-20

-10

0

Caused by bad

technology and bad design

Page 39: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGOAfter adapted normalization:

Page 40: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO The bioinformatics

BiGCaT Bioinformatics• Chris Evelo• Rachel van Haaften• Arie van Erk• Stan Gaj• Magali Jaillard• Kitty ter Stege• Thomasz Kelder• Gijs Huisman

TNO Zeist• Rob Stierum• Marjan van Erk

EBI Hinxton• Susanna Sansone• Philippe Rocca• Imre Vastrik

University Firenze• Duccio Cavalieri

GenMAPP.org• Bruce Conklin• Lynn Ferrante

Page 41: The European Nutrigenomics Organisation Using pathway information to understand omics data Chris Evelo NuGO WP7 BiGCaT Bioinformatics Maastricht.

the European Nutrigenomics Organisation

NuNuGOGONuNuGOGO The Biology

Proteomics• Johan Renes (UM)

• Chris Evelo (BiGCaT)

The masterpiece• Milka Sokolović (AMC)

• Wout Lamers (AMC)

• Magali Jaillard (BiGCaT)

Heart Failure• Blanche Schroen (UM)

• Yigal Pinto (UM)

• Arie van Erk (BiGCaT)

Red Wine Polyphenols• Cristina Luceri (Firenze)

• At BiGCaT!

RhoA Stolen from• Rob Stierum (TNO)

Financial contributions: UM, TUe, Senter IOP, WCFS/ICN, Dutch Heart Foundation, NuGO