Top Banner
Likelihood methods Trees - “What is the probability that a proposed model of sequence evolution and a particular tree would give rise to the observed data?” “What tree and model would maximize the probability of observing the observed data?
26

Likelihood methods

Jan 14, 2016

Download

Documents

Likelihood methods. Trees - “What is the probability that a proposed model of sequence evolution and a particular tree would give rise to the observed data?” “What tree and model would maximize the probability of observing the observed data?. P (data) :: tree, model. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Likelihood methods

Likelihood methods

Trees - “What is the probability that a

proposed model of sequence evolution and a particular tree would give rise to the observed data?” “What tree and model would maximize the probability of observing the observed data?

Page 2: Likelihood methods

In practice, the data are “given,” the tree is a hypothesis, and the model of the evol’n process is usually unknown, but w/ parameters either “given” based on external knowledge or estimated from the data set.

Therefore, we search for the hypothesis (tree) that gives the best probability of getting the observed data.

P (data) :: tree, model

Page 3: Likelihood methods

Potential Benefits of Likelihood

• Improved compensation for superimposed changes using explicit models

• Method is consistent• Usually minimizes variance of model

parameters• Often robust to violations of assumptions• Estimation and testing of evolutionary

models and hypotheses is a natural outcome

Page 4: Likelihood methods

Likelihood of a tree

Page 5: Likelihood methods

Likelihood of a tree IIFixed

Tree-dependent

4 bases x 4 bases = 16 possibles. Some much more probable.

Page 6: Likelihood methods

Likelihood of a tree IIIIf we can assume that nucleotide sites evolve independently, the Likelihood of full tree is product of likelihood at each site -- because these are vanishingly small., usu. Would log transform, so log likelihood of the tree is sum of log likelihoods of each site

Page 7: Likelihood methods

eg,

if L(tree1) = .0000002, ln L = -15.4

if L(tree2) = .0000004, ln L = -14.7

If L(tree3) = .0000008, ln L = -14.0

Page 8: Likelihood methods

Likelihood of a tree IV

0. Prior probability of an “A”

1. X P ( retaining A)

2. X P ( A to C)

3. X P ( A to C)

4. X P ( retaining A)

5. X P ( A to G)

Probabilities are a function of:Substitution model, base frequencies, branch lengths

Page 9: Likelihood methods

Calculation of probability of substitution or retention

Probabilities are a function of:Substitution model, base frequencies, branch lengths

* See example in Mount, p. 277* Formal analysis takes uses the model (JC, HKY, etc.) to generate explicit probabilities

Page 10: Likelihood methods
Page 11: Likelihood methods
Page 12: Likelihood methods

eg., Probability of a substitution:

Under Jukes-Cantor

PC = (1 + 3 e )/4

PnotC = 3/4 * (1 - e )

aC fdb

c

e

-4 t

-4 t

Page 13: Likelihood methods

Likelihood of state i at position j in A

Branch length

Ie., Conditional likelihood that A has state i is the product of the likelihoods that the i could have given rise to the outcomes in B and C

Prob of state i changing to state k

Likelihood that B has state k

*Likelihood that i could give rise to state in B

Similar for going to outcome in C

Page 14: Likelihood methods
Page 15: Likelihood methods

= max[L(null hypothesis data)] max[L(alternative hypothesis data)] Huelsenbeck et al (1997) Science. 276:227

Likelihood Ratio test

Page 16: Likelihood methods
Page 17: Likelihood methods
Page 18: Likelihood methods
Page 19: Likelihood methods
Page 20: Likelihood methods

Potential Benefits of Likelihood

• Improved compensation for superimposed changes using explicit models

• Method is consistent• Usually minimizes variance of model

parameters• Often robust to violations of assumptions• Estimation and testing of evolutionary

models and hypotheses is a natural outcome

**** effective Likelihood analysis requires a lg. Dataset, and full ML analysis is comput. intensive

Page 21: Likelihood methods

Likelihood of a tree - review

Page 22: Likelihood methods

= max[L(null hypothesis data)] max[L(alternative hypothesis data)] Huelsenbeck et al (1997) Science. 276:227

Likelihood Ratio test

Page 23: Likelihood methods
Page 24: Likelihood methods
Page 25: Likelihood methods
Page 26: Likelihood methods

Potential Benefits of Likelihood

• Improved compensation for superimposed changes using explicit models

• Method is consistent• Usually minimizes variance of model

parameters• Often robust to violations of assumptions• Estimation and testing of evolutionary

models and hypotheses is a natural outcome

**** effective Likelihood analysis requires a lg. Dataset, and full ML analysis is comput. intensive