Bioinformatics for your classroom

Feb 09, 2016

Page 1: Bioinformatics for your classroom

Bioinformatics for your classroom

Seth Bordenstein

Department of Biological Sciences

Vanderbilt University

NCBI

BLAST

Page 2: Bioinformatics for your classroom

Advantages

1. No programming skills needed

2. Requires only familiarity with a personal computer and internet browser

3. Customizable and free

Page 3: Bioinformatics for your classroom

Bioinformatics is like using ‘Google’ for DNA sequences

Page 4: Bioinformatics for your classroom

National Center for Biotechnology Information (NCBI)

http://www.ncbi.nlm.nih.gov

Page 5: Bioinformatics for your classroom

[Figure: Growth of NCBI - GenBank, ’83 to ’06. Axes: sequence records (millions) and total base pairs (billions).]

Release 148: 45.2 million records, 49.4 billion nucleotides

Average doubling time ≈ 14 months
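
The doubling time on this slide can be turned into a yearly growth factor. The following is a minimal Python sketch of that arithmetic, assuming idealized, steady exponential growth. The function names are illustrative only, and the numbers it prints are projections from the 14-month figure, not GenBank statistics.

```python
import math

DOUBLING_TIME_MONTHS = 14  # "Average doubling time ≈ 14 months" from the slide


def growth_factor(months, doubling_time=DOUBLING_TIME_MONTHS):
    """Factor by which the database grows over `months`, assuming steady doubling."""
    return 2 ** (months / doubling_time)


def months_to_grow(factor, doubling_time=DOUBLING_TIME_MONTHS):
    """Months needed to grow by `factor` under the same assumption."""
    return doubling_time * math.log2(factor)


if __name__ == "__main__":
    print(f"Growth per year: x{growth_factor(12):.2f}")
    print(f"Months to grow tenfold: {months_to_grow(10):.1f}")
```

Under this assumption the database grows by a bit less than a factor of two each year, because one doubling takes slightly longer than 12 months.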

Page 6: Bioinformatics for your classroom

[Diagram labels: DNA, RNA, protein, phenotype; DNA sequences and genomes; cDNA and ESTs; protein sequence databases]

Bioinformatics is NOT just information technology. It can teach the central dogma of molecular biology.

Page 7: Bioinformatics for your classroom

Target database: Adjustable using the pull-down menu

Page 8: Bioinformatics for your classroom
Page 9: Bioinformatics for your classroom
Page 10: Bioinformatics for your classroom
Page 11: Bioinformatics for your classroom

A Traditional GenBank Record

LOCUS       AY182241                1931 bp    mRNA    linear   PLN 04-MAY-2004
DEFINITION  Malus x domestica (E,E)-alpha-farnesene synthase (AFS1) mRNA,
            complete cds.
ACCESSION   AY182241
VERSION     AY182241.2  GI:32265057
KEYWORDS    .
SOURCE      Malus x domestica (cultivated apple)
  ORGANISM  Malus x domestica
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            rosids; eurosids I; Rosales; Rosaceae; Maloideae; Malus.
REFERENCE   1  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Cloning and functional expression of an (E,E)-alpha-farnesene
            synthase cDNA from peel tissue of apple fruit
  JOURNAL   Planta 219, 84-94 (2004)
REFERENCE   2  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-NOV-2002) PSI-Produce Quality and Safety Lab,
            USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD
            20705, USA
REFERENCE   3  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2003) PSI-Produce Quality and Safety Lab,
            USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD
            20705, USA
  REMARK    Sequence update by submitter
COMMENT     On Jun 26, 2003 this sequence version replaced gi:27804758.
FEATURES             Location/Qualifiers
     source          1..1931
                     /organism="Malus x domestica"
                     /mol_type="mRNA"
                     /cultivar="'Law Rome'"
                     /db_xref="taxon:3750"
                     /tissue_type="peel"
     gene            1..1931
                     /gene="AFS1"
     CDS             54..1784
                     /gene="AFS1"
                     /note="terpene synthase"
                     /codon_start=1
                     /product="(E,E)-alpha-farnesene synthase"
                     /protein_id="AAO22848.2"
                     /db_xref="GI:32265058"
                     /translation="MEFRVHLQADNEQKIFQNQMKPEPEASYLINQRRSANYKPNIWK
                     NDFLDQSLISKYDGDEYRKLSEKLIEEVKIYISAETMDLVAKLELIDSVRKLGLANLF
                     EKEIKEALDSIAAIESDNLGTRDDLYGTALHFKILRQHGYKVSQDIFGRFMDEKGTLE
                     DFLHKNEDLLYNISLIVRLNNDLGTSAAEQERGDSPSSIVCYMREVNASEETARKNIK
                     GMIDNAWKKVNGKCFTTNQVPFLSSFMNNATNMARVAHSLYKDGDGFGDQEKGPRTHI
                     LSLLFQPLVN"
ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
      121 aacctgaagc ctcttacttg attaatcaaa gacggtctgc aaattacaag ccaaatattt
      181 ggaagaacga tttcctagat caatctctta tcagcaaata cgatggagat gagtatcgga
      241 agctgtctga gaagttaata gaagaagtta agatttatat atctgctgaa acaatggatt
//

The Flatfile Format

• Header
• Feature Table
• Sequence
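
The three parts of the flatfile can be separated with a few lines of code. This is a minimal Python sketch, assuming that the feature table starts at the FEATURES line, the sequence starts at the ORIGIN line, and the record ends at `//`, as in the record on this slide. The function name `split_flatfile` and the shortened `EXCERPT` are illustrative and are not part of any NCBI tool.

```python
def split_flatfile(record_text):
    """Split a GenBank flatfile record into header, feature table, and sequence."""
    sections = {"header": [], "features": [], "sequence": []}
    current = "header"  # everything before FEATURES belongs to the header
    for line in record_text.splitlines():
        if line.startswith("FEATURES"):
            current = "features"
        elif line.startswith("ORIGIN"):
            current = "sequence"
        elif line.startswith("//"):
            break  # end-of-record marker
        sections[current].append(line)
    return {name: "\n".join(lines) for name, lines in sections.items()}


# Shortened excerpt of the AY182241 record shown on the slide.
EXCERPT = """\
LOCUS       AY182241                1931 bp    mRNA    linear   PLN 04-MAY-2004
ACCESSION   AY182241
VERSION     AY182241.2  GI:32265057
FEATURES             Location/Qualifiers
     CDS             54..1784
                     /gene="AFS1"
ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
//
"""

if __name__ == "__main__":
    for name, text in split_flatfile(EXCERPT).items():
        print(f"--- {name} ---")
        print(text)
```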

Page 12: Bioinformatics for your classroom

The Header

LOCUS       AY182241                1931 bp    mRNA    linear   PLN 04-MAY-2004
DEFINITION  Malus x domestica (E,E)-alpha-farnesene synthase (AFS1) mRNA,
            complete cds.
ACCESSION   AY182241
VERSION     AY182241.2  GI:32265057
KEYWORDS    .
SOURCE      Malus x domestica (cultivated apple)
  ORGANISM  Malus x domestica
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            rosids; eurosids I; Rosales; Rosaceae; Maloideae; Malus.
REFERENCE   1  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Cloning and functional expression of an (E,E)-alpha-farnesene
            synthase cDNA from peel tissue of apple fruit
  JOURNAL   Planta 219, 84-94 (2004)
REFERENCE   2  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-NOV-2002) PSI-Produce Quality and Safety Lab,
            USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD
            20705, USA
REFERENCE   3  (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2003) PSI-Produce Quality and Safety Lab,
            USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD
            20705, USA
  REMARK    Sequence update by submitter
COMMENT     On Jun 26, 2003 this sequence version replaced gi:27804758.

Page 13: Bioinformatics for your classroom

Header: Locus Line

LOCUS AY182241 1931 bp mRNA linear PLN 04-MAY-2004

Locus name: AY182241

Length: 1931 bp

Molecule type: mRNA

Division: PLN

Modification date: 04-MAY-2004
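
The fields of the LOCUS line can be pulled apart by splitting on whitespace. This is a minimal Python sketch that assumes the line has exactly the eight whitespace-separated fields of the example on this slide; LOCUS lines in other records may be laid out differently. The function name and the dictionary keys are illustrative choices, and the "topology" key names the "linear" field, which the slide does not label.

```python
def parse_locus_line(line):
    """Parse a LOCUS line laid out like the AY182241 example on the slide."""
    fields = line.split()
    if len(fields) != 8 or fields[0] != "LOCUS":
        raise ValueError("unexpected LOCUS line layout")
    return {
        "locus_name": fields[1],
        "length_bp": int(fields[2]),      # fields[3] is the unit, "bp"
        "molecule_type": fields[4],
        "topology": fields[5],            # "linear"; not labeled on the slide
        "division": fields[6],
        "modification_date": fields[7],
    }


if __name__ == "__main__":
    locus = parse_locus_line("LOCUS AY182241 1931 bp mRNA linear PLN 04-MAY-2004")
    for key, value in locus.items():
        print(f"{key}: {value}")
```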

Page 14: Bioinformatics for your classroom

Header: Database Identifiers

ACCESSION AY182241

VERSION AY182241.2 GI:32265057

Accession
• Stable
• Reportable
• Universal
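
The VERSION line combines the stable accession with a version suffix and a GI number; the COMMENT line of this record notes that the current sequence version replaced an earlier one. This is a minimal Python sketch for separating those parts, assuming the "ACCESSION.VERSION GI:number" layout shown on the slide. The function name and returned keys are illustrative.

```python
def parse_version_line(line):
    """Split a VERSION line into accession, version number, and GI number."""
    fields = line.split()
    if len(fields) < 2 or fields[0] != "VERSION":
        raise ValueError("not a VERSION line")
    accession, _, version = fields[1].partition(".")
    gi = None
    for field in fields[2:]:
        if field.startswith("GI:"):
            gi = int(field[3:])
    return {
        "accession": accession,                      # stays the same across updates
        "version": int(version) if version else None,
        "gi": gi,
    }


if __name__ == "__main__":
    print(parse_version_line("VERSION AY182241.2 GI:32265057"))
```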

Page 15: Bioinformatics for your classroom

Header: Organism

SOURCE      Malus x domestica (cultivated apple)
  ORGANISM  Malus x domestica
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            rosids; eurosids I; Rosales; Rosaceae; Maloideae; Malus.

NCBI-controlled taxonomy

Page 16: Bioinformatics for your classroom

The Feature Table

FEATURES             Location/Qualifiers
     source          1..1931
                     /organism="Malus x domestica"
                     /mol_type="mRNA"
                     /cultivar="'Law Rome'"
                     /db_xref="taxon:3750"
                     /tissue_type="peel"
     gene            1..1931
                     /gene="AFS1"
     CDS             54..1784
                     /gene="AFS1"
                     /note="terpene synthase"
                     /codon_start=1
                     /product="(E,E)-alpha-farnesene synthase"
                     /protein_id="AAO22848.2"
                     /db_xref="GI:32265058"
                     /translation="MEFRVHLQADNEQKIFQNQMKPEPEASYLINQRRSANYKPNIWK
                     NDFLDQSLISKYDGDEYRKLSEKLIEEVKIYISAETMDLVAKLELIDSVRKLGLANLF
                     EKEIKEALDSIAAIESDNLGTRDDLYGTALHFKILRQHGYKVSQDIFGRFMDEKGTLE
                     NHHFAHLKGMLELFEASNLGFEGEDILDEAKASLTLALRDSGHICYPDSNLSRDVVHS
                     LELPSHRRVQWFDVKWQINAYEKDICRVNATLLELAKLNFNVVQAQLQKNLREASRWW
                     ANLGIADNLKFARDRLVECFACAVGVAFEPEHSSFRICLTKVINLVLIIDDVYDIYGS
                     EEELKHFTNAVDRWDSRETEQLPECMKMCFQVLYNTTCEIAREIEEENGWNQVLPQLT
                     KVWADFCKALLVEAEWYNKSHIPTLEEYLRNGCISSSVSVLLVHSFFSITHEGTKEMA
                     DFLHKNEDLLYNISLIVRLNNDLGTSAAEQERGDSPSSIVCYMREVNASEETARKNIK
                     GMIDNAWKKVNGKCFTTNQVPFLSSFMNNATNMARVAHSLYKDGDGFGDQEKGPRTHI
                     LSLLFQPLVN"

Coding sequence: start (atg), stop (tag)
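
The CDS coordinates and the /translation qualifier can be checked against the sequence itself. This is a minimal Python sketch that takes bases 1 to 120 of the record, starts reading at position 54, and translates codon by codon with the standard genetic code. The names `CODON_TABLE`, `translate`, and `FIRST_120_BASES` are illustrative, and only the first 22 codons are covered because the sketch uses just the first two sequence lines.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Bases 1..120 of AY182241, copied from the sequence lines of the record.
FIRST_120_BASES = (
    "ttcttgtatcccaaacatctcgagcttcttgtacaccaaattaggtattcactatggaat"
    "tcagagttcacttgcaagctgataatgagcagaaaatttttcaaaaccagatgaaacccg"
)
CDS_START = 54  # 1-based start position, from "CDS 54..1784"


def translate(dna):
    """Translate complete codons from the start of `dna`, stopping at a stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Convert the 1-based GenBank coordinate to a 0-based Python index.
cds_fragment = FIRST_120_BASES[CDS_START - 1:]

if __name__ == "__main__":
    print("Start codon:", cds_fragment[:3])
    print("Translation:", translate(cds_fragment))
```

The printed peptide should match the beginning of the /translation qualifier in the feature table.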

Page 17: Bioinformatics for your classroom

The Sequence: What do you do with it?

ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
      121 aacctgaagc ctcttacttg attaatcaaa gacggtctgc aaattacaag ccaaatattt
      181 ggaagaacga tttcctagat caatctctta tcagcaaata cgatggagat gagtatcgga

     1741 ggacccacat cctgtcttta ctattccaac ctcttgtaaa ctagtactca tatagtttga
     1801 aataaatagc agcaaaagtt tgcggttcag ttcgtcatgg ataaattaat ctttacagtt
     1861 tgtaacgttg ttgccaaaga ttatgaataa aaagttgtag tttgtcgttt aaaaaaaaaa
     1921 aaaaaaaaaa a
//
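
One thing to do with the sequence is to strip the position numbers and spaces so it can be pasted into a search form as FASTA text. This is a minimal Python sketch using the first four sequence lines of the record; the function names and the FASTA identifier are illustrative assumptions.

```python
ORIGIN_EXCERPT = """\
ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
      121 aacctgaagc ctcttacttg attaatcaaa gacggtctgc aaattacaag ccaaatattt
      181 ggaagaacga tttcctagat caatctctta tcagcaaata cgatggagat gagtatcgga
"""


def origin_to_sequence(origin_text):
    """Join the base blocks of ORIGIN lines, dropping position numbers and spaces."""
    blocks = []
    for line in origin_text.splitlines():
        line = line.strip()
        if not line or line.startswith("ORIGIN") or line.startswith("//"):
            continue
        tokens = line.split()
        blocks.extend(tokens[1:])  # tokens[0] is the position number
    return "".join(blocks)


def to_fasta(identifier, sequence, width=60):
    """Format a sequence as FASTA text with `width` bases per line."""
    lines = [f">{identifier}"]
    lines += [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join(lines)


if __name__ == "__main__":
    sequence = origin_to_sequence(ORIGIN_EXCERPT)
    # The identifier below is an illustrative label for this partial sequence.
    print(to_fasta("AY182241.2_bases_1_to_240", sequence))
```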

Page 18: Bioinformatics for your classroom

BLAST:

• Compare new genes to old ones
• Compare genes from different species or hosts
• Investigate the transcriptome (cDNAs)
• Identify possible functions based on similarities to known sequences

Query a database for sequences similar to an input sequence.

[Figure: a query sequence aligned against a database sequence]
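
The idea of querying a database for similar sequences can be illustrated with a toy example. The following Python sketch ranks database entries by how many short words (k-mers) they share with the query. It is a deliberately simplified illustration and not the BLAST algorithm, which also extends and scores alignments statistically. The database entries other than the apple fragment are made up for the example, and all names are hypothetical.

```python
def kmers(sequence, k):
    """Return the set of all words of length k in a sequence."""
    sequence = sequence.upper()
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def rank_by_shared_words(query, database, k=8):
    """Rank database entries by the number of k-mers shared with the query."""
    query_words = kmers(query, k)
    scores = [
        (name, len(query_words & kmers(sequence, k)))
        for name, sequence in database.items()
    ]
    return sorted(scores, key=lambda item: item[1], reverse=True)


# Query: the first 30 bases of the AFS1 coding sequence (from position 54).
QUERY = "atggaattcagagttcacttgcaagctgat"

DATABASE = {
    # Bases 1..120 of AY182241, which contain the query exactly.
    "apple_AFS1_fragment": (
        "ttcttgtatcccaaacatctcgagcttcttgtacaccaaattaggtattcactatggaat"
        "tcagagttcacttgcaagctgataatgagcagaaaatttttcaaaaccagatgaaacccg"
    ),
    # Made-up entry: the query with a single base changed.
    "made_up_one_mismatch": QUERY[:15] + "g" + QUERY[16:],
    # Made-up entry with no similarity to the query.
    "made_up_unrelated": "gggggcccccaaaaatttttgggggcccccaaaaattttt",
}

if __name__ == "__main__":
    for name, shared in rank_by_shared_words(QUERY, DATABASE):
        print(f"{name}: {shared} shared words")
```

The exact match ranks first, the single-mismatch entry second, and the unrelated entry last, which mirrors how a similarity search orders its hits from best to worst.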

Page 19: Bioinformatics for your classroom

What are the broad goals of this lab?

• To provide an introduction to bioinformatics with a focus on NCBI

• To introduce you to searching for articles, sequences, and scientists (perhaps yourself ;))

• To use the most powerful and reliable method to determine evolutionary relationships between genes

• To combine your Wolbachia research with computational biology

Page 20: Bioinformatics for your classroom

What are the specific goals of this lab?

• To look for brand-new Wolbachia strains

• To make a phylogenetic tree of Wolbachia

• To ultimately compare the Wolbachia tree to an insect phylogeny to infer lateral vs. vertical transmission of your Wolbachia strains

• To contribute to a national sequence database on the genetic diversity of the Wolbachia 16S rRNA gene

Page 21: Bioinformatics for your classroom

Outcomes: A New Wolbachia Species?

Page 22: Bioinformatics for your classroom

[Figure: Insect Phylogeny alongside the Top 5 Wolbachia BLAST matches; each of the five matches is labeled 100%]

Page 23: Bioinformatics for your classroom

Let’s Begin Our Bioinformatic Exercise: Lab 5