Top Banner
DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress Department of Botany National Museum of Natural History Smithsonian Institution University of Sao Paulo December 2009
45

DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Oct 06, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

DNA Barcoding in Plants: Biodiversity Identification and

Discovery

W. John Kress

Department of Botany

National Museum of Natural History

Smithsonian Institution

University of Sao Paulo

December2009

Page 2: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

New Technologiesfor Taxonomy

DNA Barcodes

Page 3: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

UNITED STATES NATIONAL HERBARIUM4.7 Million Specimens

Page 4: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

NATIONAL MUSEUM OF NATURAL HISTORY124 Million Specimens

Page 5: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

DNA Barcodes

A short universal gene sequence taken from a standardized portion of the genome used to identify species

Page 6: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Uses of DNA Barcodes

1. Research tool for taxonomists: To aid identification of species To expand species diagnoses to all life history

stages, including fruits, seeds, dimorphic sexes, damaged specimens, gut contents, scats

To test consistency of species definitions with a DNA measure of variability

2. Applied tool for users of taxonomy:

To identify regulated species, including invasives

To test purity and identity of biological products

To assist ecologists in field studies of poorly known organisms

3. Discovery tool:

To flag potentially new species, especially undescribed and cryptic species

Page 7: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

The Barcoding Process - 2 parts

1. Populate the barcode “library” with known species

• Collect tissue from voucher specimen• Extract DNA• PCR/Amplify/cycle sequence gene(s)• Sequence• Database

2. “BLAST” an unidentified specimen against the barcode library

• Sequence comparison• New searching technologies• Ultimately - handheld device ?

3. Put barcode sequences to work to answer compelling scientific questions

• Ecological forensics• Community ecology and

phylogenetics

Page 8: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Smithsonian‟s National Museum of

Natural History

Caribbean Sponges

Page 9: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

DNA Barcode Pipeline

PCR

DNAExtraction

Data Editing

RoboticSequencing

Select plant material

Finished „Barcode‟

Library

Page 10: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

The Primary Choice for Barcoding in Animals: the Mitochondrial Genome

Cyt b

D-Loop

ND5

H-strand

ND4

ND4L

ND3

COIII

COICOIL-strand

ND6

COI

ND2

ND1

COII

Small ribosomal RNA

Large

ribosomal RNA

ATPase subunit 8

ATPase subunit 6

Page 11: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Why were plants behind?

• Finding the right gene regions

• Mobilizing a consensus in the botanical community

What about Plants?

Finally….

• Consensus on gene regions

• Moving ahead

Page 12: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To
Page 13: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Criteria for DNA Barcoding

• Contains sufficient variation to discriminate between species

• Conserved flanks for universalprimers

All land plants

• Short, 300-800 bp

Limited by current sequencing technology,

cost consideration (= 1 read length), and ability to use degraded samples

• Sequence Quality

Page 14: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Three Genomes of Plant Cells for Barcode Candidates

Chloroplast

Mitochondrial

Nuclear

*High copy number*Conserved structure*Diversity of substitution

rates across genes, introns, andintergenic spacers

*Locus of choice for animal barcoding is mitochondrial COI

*Limitations with plants

-Low divergence

-Rapid genome rearrange-ments

*Contain the most variable loci

*Problems with multi-gene families

*Single-copy genes often technically difficult

Page 15: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Schmitz-Linneweber et al. 2002

Atropa vs. Nicotiana Chloroplast Genomes

Complete

Page 16: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

1% divergence

Atropa vs. Nicotiana Chloroplast Genomes

Page 17: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

2% difference2% divergence

trnH-psbA

trnK-rps16

trnC-ycf6

ycf6-psbMpsbM-trnD

trnL-F

trnV-atpE

atpB-rbcL

rpl36-rps8

Atropa vs. Nicotiana Chloroplast Genomes

Page 18: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Top Plant Barcode Candidate:Intergenic Spacer trnH-psbA

CRITERIA FOR BARCODING

• Short, 300-800 bp trnH-psbA = 450 bp

• Conserved flanks for universal primers

trnH-psbA = 93-100% success

• Contains sufficient variation to discriminate between species

trnH-psbA = 1.17%

Page 19: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

A SINGLE-LOCUS PLANT BARCODE

Many Other Regions Proposed:

accD, matK, ndhJ, rbcL, rpoC1, rpoB2, trnL, YCF5, UPA, ITS, CO1

Option #1: Best Candidate Plastid Non-Coding

trnH-psbA

Page 20: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

SAMPLING AND PCR SUCCESS:

39 Orders of Land Plants

Page 21: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

A SINGLE-LOCUS PLANT BARCODE: Comparative Results

Page 22: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

A TWO-LOCUS PLANT BARCODE

rbcL = the “Anchor”(Plastid Coding

Gene)+

trnH-psbA= the

“Identifier”(Plastid Non-

coding Spacer)

Hierarchical and Complementary

Page 23: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

INTERGENIC SPACERS – Indels, Alignment, and Repeats:

Problems or Assets?

•Spacers for Identification (and local-scale phylogenetics)

•Indels as added characters for ID

•Partial sequences are useful

•New Informatics Tools for Searching the Reference Database

•New technologies for solving problems

Indel variation in segment of trnH-psbAspacer among 57 species

Do we need a coding gene??

Page 24: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

An Alternative Two-Locus Plant Barcode

CBOL Plant Working Group - 2009

156 Cryptogams81 Gymnosperms170 Angiosperms

Conclusion:

rbcL + matK

withtrnH-psbA &

other spacers as alternative

barcodes

Universality

Discrimination

Page 25: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

A THREE-LOCUS PLANT BARCODE

rbcL= the “Anchor”(Plastid Coding

Gene)+

trnH-psbA= the

“Identifier”(Plastid Non-

coding Spacer)+

matK(Plastid Coding

Gene)

Hierarchical and Complementary

matK

Page 26: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

What is a medicinal plant?

We used a consensus of four sources that list medicinal plants, primarily:

World Economic Plants - A

Standard Reference

Major Medicinal Plants of the World: An Applied Test of DNA Barcoding

Page 27: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

• How we assembled our set:– Selected ~1150 species– Requested

•USDA germplasm•USBG living collection• Local gardens•NMNH herbarium

– What we have:•768 species•>168 Genera•113 Plant Families•4 accessions per species

Major Medicinal Plants of the World: An Applied Test of DNA Barcoding

Page 28: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

rbcL Anchor

trnH-psbAIdentifier

Lamiales: Mentha

Two-locus approach: create

backbone of tree with rbcLas the Anchor; then separate individuals species in smaller groups with trnH-psbA as the Identifier

Major Medicinal Plants of the World: An Applied Test of DNA Barcoding

Results: >94%

success with rbcL/

trnH-psbA

Page 29: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Vital statistics of BCI

• Island in Panama Canal

– Premier Ecological Plot

• 296 tree species

– 1035 specimens (~3 accession/species)

• 180 Genera

• 49 Families

• ~50% of genera have one species = easy test of barcoding

50-ha Forest Dynamics Plot on Barro Colorado Island, Panama

Why DNA Barcoding on BCI?

Species identification*forensic/ecological

Phylogenetic applications*species/community

phylogenies*functional trait mapping

Smithsonian Institution Global Earth Observatories

(SIGEO)

A global program of long-term forest research: monitoring

the impact of climate change

Smithsonian Tropical Research Institute

Center for Tropical Forest Science

Page 30: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Collection Data Tab

50-ha Forest Dynamics Plots Field Information Management

System

Geographic Data Tab

Tissue Data Tab

Page 31: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

50-ha Forest Dynamics Plot on Barro Colorado Island, Panama

Smithsonian Institution Global Earth Observatories

(SIGEO)

A global program of long-term forest research: monitoring

the impact of climate change

Smithsonian Tropical Research Institute

Center for Tropical Forest Science

Barcode Success

*Note: ~8% of sequences are partial

trnH-psbA* matK rbcLa

pcr seq pcr seq pcr seq

98% 95% 85% 69% 94% 94%

ID Freq ID Freq ID Freq

95% 99% 75%

Page 32: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Species Identification = BLAST(Basic Local Alignment Search Tool)

• Designed to search for similarity among sequences• Can quantify rates of resolution• Use 281 barcode sequences as both library and query

50-ha Forest Dynamics Plot on Barro Colorado Island, Panama

RESULTS

• rbcLa + trnH-psbA + matK:

– 98% of all samples could be assigned to correct Species

– All ambiguity was in 4 genera: Psychotria, Ficus, Inga, Piper

– 100% of sequences were assigned to correct Genus

– Partial sequences were assigned correctly

Page 33: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Barcode

Barcodes and Forensic Ecology

Page 34: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Swenson 2009

The Components of Biodiversity

Barcodes and Community Ecology

Page 35: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Building a CommunityPhylogeny with Phylomatic

Phylogeneticallyclustered = High Plateau, Low Plateau and Young Habitats

Phylogenetically Over-dispersed = Swamp and Slope Habitats

Phylogenetically Random = Stream and Mixed Habitats

Page 36: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Building a Community Phylogeny with Barcodes: A Supermatrix of rbcL, matK, and

trnH-psbArbcLa

*aligns unambiguously

matK

*aligned with backtranslation (AA)

trnH-psbA

*aligned within ORDERS (Muscle), thenorders placed within rbcLa alignment with “missing data” coded for other Orders (MacClade)

Trees*constructed with Parsimony (PAUP) and ML (Garli: GTR+I+Ѓ)

Page 37: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

50-ha Forest Dynamics Plot

on BCI, Panama (281 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA/matK

Page 38: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

A Comparison of Ordinal and Family Relationships on BCI

50-ha Forest Dynamics Plot

on BCI, Panama (282 species):

Community Phylogeny of 23 Orders using a Supermatrix

Approach with rbcL/trnH-psbA

50-ha Forest Dynamics Plot

on BCI, Panama (281 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA/matK

Asterids

Page 39: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Barcodes vs. Phylomatic

50-ha Forest Dynamics Plot

on BCI, Panama (282 species):

Community Phylogeny of 23 Orders using a Supermatrix

Approach with rbcL/trnH-psbA

50-ha Forest Dynamics Plot

on BCI, Panama (281 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA/matK

vs.

RubiaceaeOverall Tree:

< 50% resolution vs >97% resolution

Page 40: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

50-ha Forest Dynamics Plot

on BCI, Panama (282 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA

50-ha Forest Dynamics Plot

on BCI, Panama (281 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA/matK

Barcodes vs. Phylomatic

Net Relatedness Index (NRI)

Barcode Phylogeny:

Phylogeneticallyclustered = Low Plateau and Slope Habitats

Phylogenetically Over-dispersed = High Plateau, Mixed and Young Habitats =

PhylogeneticallyRandom = Stream and Swamp Habitats

Phylomatic Phylogeny:

Phylogeneticallyclustered = High Plateau, Low Plateau and Young Habitats

Phylogenetically Over-dispersed = Swamp and Slope Habitats

Phylogenetically Random = Stream and Mixed Habitats

Page 41: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

50-ha Forest Dynamics Plot

on BCI, Panama (281 species):

Community Phylogeny

using a Supermatrix

Approach with rbcL/trnH-psbA/matK

Functional Trait Analysis

Page 42: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Swenson 2009

Community Assembly,

Productivity, Stability,

Functional Trait Evolution

Phylogenies and Community Ecology

Page 43: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Expanding the network!

Smithsonian Institution Global Earth Observatories

(SIGEO)

A global program of long-term forest research: monitoring

the impact of climate change

Smithsonian Tropical Research Institute

Center for Tropical Forest Science

22 Established Sites (Black) 12 Candidate Sites (Blue)

Center for Tropical Forest ScienceSmithsonian Institution Global Earth

Observatories (SIGEO)

**

**

*

**

**

*

*

Purpose:

*Forest Dynamics*Climate Change*Conservation

*

*

**

Barcoding Initiated (Red)

Page 44: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

Smithsonian Institution Global Earth Observatories (SIGEO)

Center for Tropical Forest ScienceSmithsonian Institution Global Earth

Observatories (SIGEO)

Page 45: DNA Barcoding in Plants: Biodiversity Identification and Discovery … · 2009. 12. 3. · DNA Barcoding in Plants: Biodiversity Identification and Discovery W. John Kress ... To

DNA Barcoding in Plants: Biodiversity Identification and

Discovery

W. John Kress

Department of Botany

National Museum of Natural History

Smithsonian Institution

University of Sao Paulo

December2009

Dave EricksonKen WurdackLiz ZimmerDan JanzenLee WeigtLing ZhangNate SwensonAndy JonesOris SanjurJamie WhitakerIda LopezStuart DaviesJoe WrightBiff BerminghamScott Miller