An Exploratory Method to Reconstruct Pathways Cory Tobin

Jan 21, 2016


Page 1: An Exploratory Method to Reconstruct Pathways

An Exploratory Method to Reconstruct Pathways

Cory Tobin

Page 2: An Exploratory Method to Reconstruct Pathways

Collaborators

Dr. Matteo Pellegrini

Shawn Cokus

@ UCLA

Page 3: An Exploratory Method to Reconstruct Pathways

Outline

• Purpose

• Methods

• Sample Data

• Possible Uses

• Final Remarks

Page 4: An Exploratory Method to Reconstruct Pathways

Purpose

Reconstruct signal transduction pathways & protein complexes using protein-protein interactions reported on the web

Page 5: An Exploratory Method to Reconstruct Pathways

Materials

• Python

• Yahoo! Search API

• PostgreSQL

• Django Web Framework

Page 6: An Exploratory Method to Reconstruct Pathways

Methods

• Construct high likelihood / low noise queries

• Ex: “Jak2 phosphorylates Stat5”

• Query Yahoo! for every permutation of 2 proteins in a given species

• Use high likelihood joining words…

Page 7: An Exploratory Method to Reconstruct Pathways

Joining Words

• Phosphorylates

• Methylates

• Acetylates

• Activates

• Deactivates

• Binds to

• Inhibits

• Dephosphorylates

• Glycosylates

• Ubiquitinates

• Interacts with

Page 8: An Exploratory Method to Reconstruct Pathways

Full Query

“Jak2 acetylates OR phosphorylates OR methylates OR binds to OR interacts with Stat5”
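The query construction described on the last three slides can be sketched roughly as follows. This is only an illustration: the function names `build_query` and `all_pair_queries` are hypothetical, the string layout simply mirrors the example above, and how the Yahoo! Search API actually parsed the OR operators is not stated in the slides.

```python
from itertools import permutations

# Joining words copied from the "Joining Words" slide.
JOINING_WORDS = [
    "phosphorylates", "methylates", "acetylates", "activates",
    "deactivates", "binds to", "inhibits", "dephosphorylates",
    "glycosylates", "ubiquitinates", "interacts with",
]


def build_query(first, second, joining_words=JOINING_WORDS):
    """Return '<first> w1 OR w2 OR ... <second>' (hypothetical helper)."""
    alternatives = " OR ".join(joining_words)
    return f"{first} {alternatives} {second}"


def all_pair_queries(symbols, joining_words=JOINING_WORDS):
    """Yield one query per ordered pair of distinct symbols."""
    for first, second in permutations(symbols, 2):
        yield build_query(first, second, joining_words)
```

Ordered pairs are used because "A phosphorylates B" and "B phosphorylates A" are different claims.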

Page 9: An Exploratory Method to Reconstruct Pathways

Hindrance

• Doing pair-wise queries for all N proteins in an organism requires N*N queries

• E. coli has >4000 genes (16,000,000 queries)

• Yahoo! allows 5,000 queries / day / computer
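The arithmetic behind this slide can be checked with a short sketch. The function name `query_budget` is hypothetical; the N*N count and the 5,000 queries per day limit are the figures quoted above.

```python
def query_budget(n_proteins, queries_per_day=5000, computers=1):
    """Return (total pairwise queries, whole days needed to run them)."""
    total = n_proteins * n_proteins          # N*N, as on the slide
    per_day = queries_per_day * computers    # combined daily allowance
    days = -(-total // per_day)              # ceiling division
    return total, days
```

For 4,000 proteins this gives 16,000,000 queries, which is 3,200 days on a single computer at 5,000 queries per day.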

Page 10: An Exploratory Method to Reconstruct Pathways

Possible Solutions

Recruit 4k computers and finish in a day

OR

Find a better method

Page 11: An Exploratory Method to Reconstruct Pathways

Better Method

• Only specify the first symbol

• Iterate through the results and keep only those in which the word following the joining word corresponds to a valid symbol
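A minimal sketch of the filtering step, assuming the search results are available as plain-text snippets. The function name `extract_partners`, the regular expression, and the tuple layout are illustrative assumptions, not the code used in this project.

```python
import re

# Joining words copied from the "Joining Words" slide.
JOINING_WORDS = [
    "phosphorylates", "methylates", "acetylates", "activates",
    "deactivates", "binds to", "inhibits", "dephosphorylates",
    "glycosylates", "ubiquitinates", "interacts with",
]


def extract_partners(first, texts, valid_symbols, joining_words=JOINING_WORDS):
    """Return (first, joining word, partner) for each hit whose word after
    the joining word is a known symbol (compared case-insensitively)."""
    # Map lower-cased symbol -> canonical spelling.
    canonical = {s.lower(): s for s in valid_symbols}
    words = "|".join(re.escape(w) for w in joining_words)
    pattern = re.compile(
        rf"\b{re.escape(first)}\s+({words})\s+([\w-]+)", re.IGNORECASE
    )
    hits = []
    for text in texts:
        for match in pattern.finditer(text):
            candidate = match.group(2).lower()
            if candidate in canonical:
                hits.append(
                    (first, match.group(1).lower(), canonical[candidate])
                )
    return hits
```

With this approach one query per protein replaces one query per pair, so the number of queries grows with N rather than N*N.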

Page 12: An Exploratory Method to Reconstruct Pathways

Full Query

“Jak2 acetylates OR phosphorylates OR methylates OR binds to OR interacts with”

Page 13: An Exploratory Method to Reconstruct Pathways

Another Hindrance

• The symbol “thE” (and others like it)

• Searches need to be case-insensitive to account for “p53” and “P53”

• As a result, the word “the” is recognized as the protein “thE”

Page 14: An Exploratory Method to Reconstruct Pathways

Solution

• Use a list of stop words

• Very common, non-interesting words

• If the name appears in that list of stop words, just forget about that protein altogether

http://www.dcs.gla.ac.uk/idom/ir_resources/linguistic_utils/stop_words
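The stop-word filter might look something like this. The function name `filter_symbols` is hypothetical, and the stop-word list passed in is assumed to have been loaded already (for example from the list linked above); only the case-insensitive comparison is taken from the slides.

```python
def filter_symbols(symbols, stop_words):
    """Drop every protein symbol that matches a stop word, ignoring case."""
    stop = {word.lower() for word in stop_words}
    return [symbol for symbol in symbols if symbol.lower() not in stop]
```

For example, with the stop words "the" and "and", the symbol "thE" is dropped while "p53" and "Jak2" are kept.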

Page 15: An Exploratory Method to Reconstruct Pathways

Methods (cont.)

• After we have this data in a database...

• Create a web interface to the data so others can search for protein interactions (Shwe)
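One possible way to store and look up the extracted interactions is sketched below. The slides list PostgreSQL and Django but do not show the schema, so everything here is an assumption: the table layout, the function names, and the use of Python's built-in `sqlite3` module as a stand-in so the example runs without a database server.

```python
import sqlite3


def create_store(path=":memory:"):
    """Open a database and create the (hypothetical) interaction table."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS interaction (
               source TEXT NOT NULL,
               joining_word TEXT NOT NULL,
               target TEXT NOT NULL,
               hits INTEGER NOT NULL DEFAULT 1,
               PRIMARY KEY (source, joining_word, target))"""
    )
    return conn


def record(conn, source, joining_word, target):
    """Count one more search hit for 'source joining_word target'."""
    key = (source, joining_word, target)
    cur = conn.execute(
        "UPDATE interaction SET hits = hits + 1 "
        "WHERE source = ? AND joining_word = ? AND target = ?", key
    )
    if cur.rowcount == 0:  # first time this interaction is seen
        conn.execute(
            "INSERT INTO interaction (source, joining_word, target) "
            "VALUES (?, ?, ?)", key
        )


def partners_of(conn, symbol):
    """Return all stored interactions involving symbol, most hits first."""
    rows = conn.execute(
        "SELECT source, joining_word, target, hits FROM interaction "
        "WHERE lower(source) = lower(?) OR lower(target) = lower(?) "
        "ORDER BY hits DESC, source, target", (symbol, symbol)
    )
    return rows.fetchall()
```

Keeping a hit count per interaction is a design choice of this sketch; it would let a search page rank interactions by how often they were reported.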

Page 16: An Exploratory Method to Reconstruct Pathways

Data

KEGG - Yeast MAPK

Our Data

http://www.genome.jp/dbget-bin/show_pathway?sce04010+YGR040W

Page 17: An Exploratory Method to Reconstruct Pathways

Data (cont.)

KEGG - Yeast Cell Cycle

http://www.genome.jp/dbget-bin/get_pathway?org_name=sce&mapno=04110

Our Data

Page 18: An Exploratory Method to Reconstruct Pathways

Data (cont.)

KEGG - Yeast 26S Proteasome

Our Data

http://www.genome.jp/dbget-bin/show_pathway?sce03050+YER012W

Page 19: An Exploratory Method to Reconstruct Pathways

Possible Uses

• General reference for protein interactions

• Curate other databases

Page 20: An Exploratory Method to Reconstruct Pathways

Final Remarks

• Only works well for detecting signal transduction pathways and protein complexes

• Not metabolic pathways

• It is possible to get high quality, interesting data without much noise or complex text analysis algorithms

Page 21: An Exploratory Method to Reconstruct Pathways

References

• Kyoto Encyclopedia of Genes and Genomes http://www.genome.jp/kegg/

• Cytoscape Network Visualization http://www.cytoscape.org/

• Yahoo! Developer Network http://developer.yahoo.com/

Page 22: An Exploratory Method to Reconstruct Pathways

Acknowledgements

• Dr. Matteo Pellegrini

• Everyone in the lab

• SoCalBSI

• NIH / NSF