Top Banner
Exploring proteins, chemicals and their interactions with STRING and STITCH Michael Kuhn EMBL Heidelberg
81

Exploring proteins, chemicals and their interactions with STRING and STITCH

Jan 29, 2018

Download

Technology

biocs
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Exploring proteins, chemicals and their interactions with STRING and STITCH

Exploring proteins, chemicals and their interactions

with STRING and STITCH

Michael KuhnEMBL Heidelberg

Page 2: Exploring proteins, chemicals and their interactions with STRING and STITCH

my usual diet work:drugs, proteins, side effects

Page 3: Exploring proteins, chemicals and their interactions with STRING and STITCH

today

intro: interactionssmall examples

the databases: STRING and STITCHlarger example: NetworKIN

Page 4: Exploring proteins, chemicals and their interactions with STRING and STITCH

STRING: version 7interactions of proteins

STITCH: version 1interactions of

proteins and chemicals

Page 5: Exploring proteins, chemicals and their interactions with STRING and STITCH

interactions of proteins and chemicals

Page 6: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 7: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 8: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 9: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 10: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 11: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 12: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 13: Exploring proteins, chemicals and their interactions with STRING and STITCH

example

Tryptophan synthase beta chainE. Coli K12

Page 14: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 15: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 16: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 17: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 18: Exploring proteins, chemicals and their interactions with STRING and STITCH

example

aspirinHomo sapiens

Page 19: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 20: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 21: Exploring proteins, chemicals and their interactions with STRING and STITCH

content

(STRING 7)

Page 22: Exploring proteins, chemicals and their interactions with STRING and STITCH

373 genomes

(only completely sequenced genomes)

Page 23: Exploring proteins, chemicals and their interactions with STRING and STITCH

1.5 million genes

(not proteins)

Page 24: Exploring proteins, chemicals and their interactions with STRING and STITCH

68,000 chemicals

(including 2200 drugs)

Page 25: Exploring proteins, chemicals and their interactions with STRING and STITCH

many sources of interactions

Page 26: Exploring proteins, chemicals and their interactions with STRING and STITCH

genomic context

Page 27: Exploring proteins, chemicals and their interactions with STRING and STITCH

gene neighborhood

Page 28: Exploring proteins, chemicals and their interactions with STRING and STITCH

gene fusion

Page 29: Exploring proteins, chemicals and their interactions with STRING and STITCH

phylogenetic profiles

Page 30: Exploring proteins, chemicals and their interactions with STRING and STITCH

curated knowledge

Page 31: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 32: Exploring proteins, chemicals and their interactions with STRING and STITCH

Texperimental evidence

Page 33: Exploring proteins, chemicals and their interactions with STRING and STITCH

GEO: Gene Expression Omnibus

co-expression

Page 34: Exploring proteins, chemicals and their interactions with STRING and STITCH

experimental databases

Page 35: Exploring proteins, chemicals and their interactions with STRING and STITCH

literature

Page 36: Exploring proteins, chemicals and their interactions with STRING and STITCH

variable quality

different “raw scores”

Page 37: Exploring proteins, chemicals and their interactions with STRING and STITCH

benchmarking

calibrate against “gold standard”(KEGG)

Page 38: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 39: Exploring proteins, chemicals and their interactions with STRING and STITCH

probabilistic scores

e.g. “70% chance for an assocation”

Page 40: Exploring proteins, chemicals and their interactions with STRING and STITCH

combine all evidence

Page 41: Exploring proteins, chemicals and their interactions with STRING and STITCH

Bayesian scoring scheme

Page 42: Exploring proteins, chemicals and their interactions with STRING and STITCH

e.g.: two scores of 0.7combined probability: ?

Page 43: Exploring proteins, chemicals and their interactions with STRING and STITCH

e.g.: two scores of 0.7combined probability: 0.91

1 - (1-0.7)2 = 0.91

Page 44: Exploring proteins, chemicals and their interactions with STRING and STITCH

evidence spread over many species

Page 45: Exploring proteins, chemicals and their interactions with STRING and STITCH

evidence transfer

Page 46: Exploring proteins, chemicals and their interactions with STRING and STITCH

transfer by orthology

(or “fuzzy orthology”)

Page 47: Exploring proteins, chemicals and their interactions with STRING and STITCH

von Mering et al., Nucleic Acids Research, 2005

Page 48: Exploring proteins, chemicals and their interactions with STRING and STITCH

von Mering et al., Nucleic Acids Research, 2005

Page 49: Exploring proteins, chemicals and their interactions with STRING and STITCH

two modes

Page 50: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 51: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 52: Exploring proteins, chemicals and their interactions with STRING and STITCH

proteins mode

Page 53: Exploring proteins, chemicals and their interactions with STRING and STITCH

von Mering et al., Nucleic Acids Research, 2005

Page 54: Exploring proteins, chemicals and their interactions with STRING and STITCH

maximum specificitylower coverage

information will be relevant for selected species

Page 55: Exploring proteins, chemicals and their interactions with STRING and STITCH

COG mode

“clusters of orthologous groups”

Page 56: Exploring proteins, chemicals and their interactions with STRING and STITCH

von Mering et al., Nucleic Acids Research, 2005

Page 57: Exploring proteins, chemicals and their interactions with STRING and STITCH

higher coveragelower specificity

includes all available evidence

some orthologous groups are too large to be meaningful

Page 58: Exploring proteins, chemicals and their interactions with STRING and STITCH

a real application

Resource

Systematic Discovery ofIn Vivo Phosphorylation NetworksRune Linding,1,2,7,* Lars Juhl Jensen,3,7 Gerard J. Ostheimer,2,4,7 Marcel A.T.M. van Vugt,2,5 Claus Jørgensen,1

Ioana M. Miron,1 Francesca Diella,3 Karen Colwill,1 Lorne Taylor,1 Kelly Elder,1 Pavel Metalnikov,1

Vivian Nguyen,1 Adrian Pasculescu,1 Jing Jin,1 Jin Gyoon Park,1 Leona D. Samson,4 James R. Woodgett,1

Robert B. Russell,3 Peer Bork,3,6,* Michael B. Yaffe,2,* and Tony Pawson1,*1Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada2Center for Cancer Research, Massachusetts Institute of Technology, Cambridge, USA3European Molecular Biology Laboratory, Heidelberg, Germany4Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, USA5Department of Cell Biology and Genetics, Erasmus University, Rotterdam, The Netherlands6Max-Delbruck-Centre for Molecular Medicine, Berlin, Germany7These authors contributed equally to this work.*Correspondence: [email protected] (R.L.), [email protected] (P.B.), [email protected] (M.B.Y.), [email protected] (T.P.)DOI 10.1016/j.cell.2007.05.052

SUMMARY

Protein kinases control cellular decision pro-cesses by phosphorylating specific substrates.Thousands of in vivo phosphorylation siteshave been identified, mostly by proteome-wide mapping. However, systematically match-ing these sites to specific kinases is presentlyinfeasible, due to limited specificity of consen-sus motifs, and the influence of contextualfactors, such as protein scaffolds, localization,and expression, on cellular substrate specific-ity. We have developed an approach (Networ-KIN) that augments motif-based predictionswith the network context of kinases andphosphoproteins. The latter provides 60%–80% of the computational capability to assignin vivo substrate specificity. NetworKIN pin-points kinases responsible for specific phos-phorylations and yields a 2.5-fold improvementin the accuracy with which phosphorylationnetworks can be constructed. Applying thisapproach to DNA damage signaling, we showthat 53BP1 and Rad50 are phosphorylated byCDK1 and ATM, respectively. We describea scalable strategy to evaluate predictions,which suggests that BCLAF1 is a GSK-3substrate.

INTRODUCTION

The dynamic behavior and decision processes of eukary-otic cells are controlled by posttranslational modificationssuch as protein phosphorylation. These, in turn, can mod-ify protein function by inducing conformational changes or

by creating binding sites for protein interaction domains(for example, SH2 or BRCT) that selectively recognizephosphorylated linear motifs (Seet et al., 2006).

Decades of targeted biochemical studies and recentexperiments employing mass spectrometry (MS) tech-niques have identified thousands of in vivo phosphoryla-tion sites (Aebersold and Mann, 2003). These arecollected in the Phospho.ELM database, which currentlycontains 7207 phosphorylation sites in 2540 humanproteins (Diella et al., 2004). However, which of theapproximately 518 human protein kinases (Manninget al., 2002) is responsible for each of these phosphoryla-tion events is only known for just over a third of sitesidentified thus far (35% [Diella et al., 2004]), and this frac-tion is decreasing in thewake of additional proteome-widestudies. As a consequence, there is an ever-widening gapin our understanding of in vivo phosphorylation networks,which is difficult to close in a systematic way by currentexperimental methods, despite advances in high-through-put in vitro assays (Ptacek et al., 2005) and selectivekinase inhibitors (Bain et al., 2003). Our understanding ofphosphorylation-dependent signaling networks is there-fore still fragmentary.

The desire to map phosphorylation networks hasmotivated the development of computational methodsto predict the substrate specificities of protein kinases,based on experimental identification of the consensussequence motifs recognized by the active site of kinasecatalytic domains (Hjerrild et al., 2004; Obenauer et al.,2003; Puntervoll et al., 2003). However, these motifs oftenlack sufficient information to uniquely identify the phys-iological substrates of specific kinases. For example,the sites phosphorylated by different kinases from theCDK or Src families cannot be distinguished by theirsequences, although consensus motifs of these kinaseshave been determined by in vitro experiments (Mankeet al., 2005). Thus, the recognition properties of the activesite alone are typically insufficient to reproduce the

Cell 129, 1415–1426, June 29, 2007 ª2007 Elsevier Inc. 1415

Page 59: Exploring proteins, chemicals and their interactions with STRING and STITCH

phosphoproteomics

Page 60: Exploring proteins, chemicals and their interactions with STRING and STITCH

in vivo phosphorylation sites

Page 61: Exploring proteins, chemicals and their interactions with STRING and STITCH

kinases are unknown

Page 62: Exploring proteins, chemicals and their interactions with STRING and STITCH

computational methods

Page 63: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 64: Exploring proteins, chemicals and their interactions with STRING and STITCH

overprediction

Page 65: Exploring proteins, chemicals and their interactions with STRING and STITCH

context

Page 66: Exploring proteins, chemicals and their interactions with STRING and STITCH

scaffolders

Alberts, Molecular Biology of the Cell

Page 67: Exploring proteins, chemicals and their interactions with STRING and STITCH

interaction networks

Page 68: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 69: Exploring proteins, chemicals and their interactions with STRING and STITCH

NetworKIN

Page 70: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 71: Exploring proteins, chemicals and their interactions with STRING and STITCH

benchmarking

Page 72: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 73: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 74: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 75: Exploring proteins, chemicals and their interactions with STRING and STITCH

DNA damage response

Page 76: Exploring proteins, chemicals and their interactions with STRING and STITCH
Page 77: Exploring proteins, chemicals and their interactions with STRING and STITCH

experimental validation

Page 78: Exploring proteins, chemicals and their interactions with STRING and STITCH

take home message

STRING and STITCH integrate information and predict interactions

you can always go to the sources

it’s useful!

Page 79: Exploring proteins, chemicals and their interactions with STRING and STITCH

Acknowledgements

The STRING/STITCH teamLars Juhl Jensen

Peer BorkChristian von Mering & group in Zurich

NetworKINLars Juhl Jensen

Rune Linding(and many other people)

Page 80: Exploring proteins, chemicals and their interactions with STRING and STITCH

Thank you for your attention

Page 81: Exploring proteins, chemicals and their interactions with STRING and STITCH

string.embl.devon Mering et al., NAR Database Issue 2007

stitch.embl.deKuhn et al., NAR Database Issue 2008