Top Banner
The Wold Lab BioHub Cory Tobin
17

The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Dec 20, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

The Wold Lab

BioHub

Cory Tobin

Page 2: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Collaborators

Brandon King

Joe Roden

Diane Trout

Dr. Barbara

Page 3: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Goal

• Standardize the relationship between biological data

• Integrate all of the data seamlessly

• Provide novel methods to search for and analyze data

Page 4: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Adapted from http://woldlab.caltech.edu/biohub/

Page 5: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

My Contribution

Implement a database for homology data

Page 6: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Background

Species A

Species B

Paralogs

Orthologs

The more general term is “homology”

Gene Gene

Gene

Page 7: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Requirements

• Be more accurate and flexible than HomoloGene

• Work in real time

• Make sense of HomoloGene’s misleading data

Page 8: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Rationale

Gene

Gene

Gene

Gene

Gene

They are similar

Gene

Gene Gene

GeneGene

HomoloGene BioHub

They are related like this

Page 9: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Rationale Continued

Human Genome

Mouse Genome

Seq A Seq B

HomoloGene would BLAST seq A against mouse and determine that seq C is an ortholog of seq A.

Seq C

HomoloGene would also BLAST seq B against mouse and detrmine that seq C is an ortholog of seq B.

BioHub will BLAST seq A against mouse, find seq C, then BLAST C back against human to see if there are any better matches. It will find seq B to be better.

Page 10: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Methods

• Design data relationships that make sense biologically

• Generate the low-level database interaction code

• Parse and load HomoloGene’s data into our database

• Write biologically useful functions

• Create a web-based interface for easy use

Page 11: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Materials

• ArgoUML – Design Aid

• Pymerase – Design Implementation

• PostgreSQL – Database

• HomoloGene – Data Source

• Python – Programming Language

Page 12: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Current State

• Design data relationships that make sense biologically

• Generate the low-level database interaction code

• Parse and load HomoloGene’s data into our database

• Write biologically useful functions

• Create a web-based interface for easy use

Page 13: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Example Usage

Sequence of Interest

…GGATACAAAATTCCTC…

Are there any known genes in this sequence?

acetyl - coenzyme A

dehydrogenase ( Human )

(cont.)

Page 14: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

acetyl - coenzyme A

dehydrogenase ( Human )

Are there any homologs?

Mouse

Rat

Mosquito

Fruit fly

Nematode

(cont.)

Page 15: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

How are those genes related?

Page 16: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Where do you

want to go?

Page 17: The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

More Info

BioHub woldlab.caltech.edu / biohub

HomoloGene www.ncbi.nlm.nih.gov

Python python.org

Pymerase pymerase.sf.net

PostgreSQL postgresql.org