Top Banner
Step 3. Construction of the phylogenetic tree Distance methods Character methods Maximum parsimony Maximum likelihood
34

Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Apr 03, 2018

Download

Documents

trinhdang
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Step 3. Construction of the phylogenetic tree

Distance methods

Character methodsMaximum parsimonyMaximum likelihood

Page 2: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Distance methods

Simplest distance measure:

Consider every pair of sequences in the multiple alignment and count the number of differences.

Degree of divergence = Hamming distance (D)

D = n/N where N = alignment length n = number of sites with differences

Example: AGGCTTTTCAAGCCTTCTCA

D = 2/10 = 0.2

Page 3: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Problem with distance measure:

As the distance between two sequences increases, the the probability increases that more than one mutation has occured at any one site.

time point 0 1 2 scenario 1 A --> A --> A scenario 2 A --> A --> G scenario 3 A --> G --> Cscenario 4 A --> G --> A

Therefore, methods have been developed tocompensate for this.

Page 4: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Corrected distances

1. Jukes and Cantor

2. Kimura two parameter model

rate of transitions is different fromrate of transversions

P = the fraction of sequence positions differing by a transition Q = the fraction of sequence positions differing

by a transversion.

Page 5: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

In general, transitions more common than transversions

Page 6: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Distance methods

UPGMA (unweighted pair group method with arithmetic mean)

Neighbor-joining

Page 7: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

UPGMA and the effect of unequal rates of evolution

Page 8: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Errors in tree topology may be remedied

* Transformed distance matrix.

This is used in neighbor joining.

Page 9: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Neighbor joining is also different from UPGMA in that it uses the star decomposition method

Page 10: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Procedure of neighbor joining

Page 11: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods
Page 12: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Neighbor joining creates an unrooted tree

Page 13: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Character-based methods

Maximum parsimonyMaximum likelihood

Page 14: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Maximum parsimony

parsimony - principle in science where the simplest answeris the preferred.

In phylogeny: The preferred phylogenetic tree is the one that requires the fewest evolutionary steps.

Page 15: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Maximum parsimony

1. Identify all informative sites in the multiple alignment

2. For each possible tree, calculate thenumber of changes at each informative site.

3. Sum the number of changes for each possible tree.

4. Tree with the smallest number of changesis selected as the most likely tree.

Page 16: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Site 1 2 3 4 5 6 7 8 9Sequence -------------------------

1 A A G A G T G C A2 A G C C G T G C G3 A G A T A T C C A4 A G A G A T C C G * * *

Maximum parsimony Identify informative sites

Page 17: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Site 3 - non informative

Page 18: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Site 5 - informative

Page 19: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Summing changes:

site 5 site 7 site 9 SumTree I 1 1 2 4

Tree II 2 2 1 5

Tree III 2 2 2 6

=> Tree I most likely.

Page 20: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Step 4. Evaluate the tree - Bootstrapping (from www.icp.ucl.ac.be/~opperd/private/bootstrap.html)

Bootstrapping is a way of testing the reliability of the dataset andthe tree, allows you to assess whether the distribution of charactershas been influenced by stochastic effects.

Bootstrapping in practice

Take a dataset consisting of in total n sequences with m sites each.A number of resampled datasets of the same size (n x m) as theoriginal dataset is produced. However, each site is sampled atrandom and no more sites are sampled than there were originalsites.

Page 21: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods
Page 22: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Consensus tree. The number of times each branch point or node occurred

(bootstrap proportion) is indicated at each node.

Page 23: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Bootstrapping typically involves 100-1000 datasets.Bootstrap values > 70% are generally considered to providesupport for the clade designation.

Page 24: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Software for phylogenetic analysis

PHYLIP (Phylogenetic Inference Package)Joe Felsensteinhttp://evolution.genetics.washington.edu/phylip.html

Command lineWebPhylip

Examples in practicalDNADIST = create a distance matrixNEIGHBOR = neighbor joining / UPGMADNAPARS = maximum parsimony

PAUP (Phylogenetic Analysis Using Parsimony)

Page 25: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Exercises in molecular phylogeny

* What animal is most closely related to the extinct quagga?

Clustalw alignment. Neighbor joining is used in the clustalw progressive alignment method.

Page 26: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Exercises in molecular phylogeny

Is the south american opossum

evolutionary related to the australian ‘marsupial wolf’ ?

Page 27: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

philander

phalanger

trichosuru

dasyurus

sarcophilu

thylacinus

echymipera

bos

Page 28: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

DNA analysis of Neanderthal individuals

Page 29: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

neighbor joining

maximum parsimony

Page 30: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

AIDS epidemic and the evolution of HIV

SIVcpz --> HIV-1 --> HIV-1 M group

SIVsm --> HIV-2

Page 31: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Phylogenetic relationships betweenHIV and SIV viruses. Sharp et al , Phil. Trans. R. Soc. Lond (2001)356, 867-876

Page 32: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

From : Worobey M, et al

Origin of AIDS: contaminated

polio vaccine theory refuted.

Nature. 2004 428:820.

Phylogenetic tree reconstructionusing maximum likelihood

Page 33: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

From: Stephens RS, et al Genome sequence of an obligate

intracellular pathogen of humans: Chlamydia trachomatis.Science. 1998 Oct 23;282(5389):754-9.

Phylogeny of chlamydial enoyl-acyl carrier protein reductase as an example of horizontal transfer.

Phylogenetic analysis may be used to identifyhorisontal gene transfer. Some Chlamydia (eubacterium) proteins cluster with plant homologs

Page 34: Distance methods Character methods Maximum parsimony ...bio.lundberg.gu.se/courses/ht05/bio1/molphylo2.pdf · Construction of the phylogenetic tree Distance methods Character methods

Phylogenetic analysis may be used to identifyhorisontal gene transfer.Aquifex aeolicus (of Eubacteria) has a large number of genes that seem to originate from Archeae

Aravind L, Tatusov RL, Wolf YI, Walker DR, Koonin EV.

Trends Genet. 1998 Nov;14(11):442-4.