Top Banner
Quantitative trait evolution with mutations of large effect Joshua G. Schraiber and Michael J. Landis May 1, 2014 Joshua G. Schraiber and Michael J. Landis Mutations of large effect
37

Quantitative trait evolution with mutations of large effect

May 29, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Quantitative trait evolution with mutations of large effect

Quantitative trait evolution with mutations oflarge effect

Joshua G. Schraiber and Michael J. Landis

May 1, 2014

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 2: Quantitative trait evolution with mutations of large effect

Quantitative traits

· Traits that vary continuously inpopulations

- Mass- Height- Bristle number (approx)

· Adaption

- Low oxygen tolerance

· Disease

- Obesity

· Agriculture

- Fruit yield

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 3: Quantitative trait evolution with mutations of large effect

Fisher’s quantitative trait model

· n biallelic loci impacting trait

· Allele j at locus i has effect aij· Phenotype is Xk =

∑i ,j aijzijk

- zijk is number of copies of allele jat locus i in individual k

· E(X̄ ) = 2nE(p)E(a)

· E(Var(X )) = 2nE(p(1− p))E(a2)

· Environmental variation inducesextra variability

1 locus

Trait

Frequency

-1.5 -1.0 -0.5 0.0

0200

600

5 loci

Trait

Frequency

-1 0 1 2 3

050

150

100 loci

Trait

Frequency

-35 -30 -25 -20 -15 -10 -50

50100

150

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 4: Quantitative trait evolution with mutations of large effect

Neutral genetic variance

· Equilibrium genetic variance

- E(VA(∞)) = 2NVm

· Vm is the additive variance of newmutations

- Vm ≈ 2µσ2

- µ: is trait-wide mutation rate- σ2: variance of mutant effect size

distribution

· Var(VA(∞)) ≈ 4Nµσ4 (Lynch andHill 1986)

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 5: Quantitative trait evolution with mutations of large effect

Lande’s Brownian motion model

· Approximation to full model

- Don’t keep track of individual loci- Assume genetic variance constant

in time- Mutations have small effect sizes

· Can incorporate selection viaOrnstein-Uhlenbeck model

· Extremely influential incomparative biology

- c.f. Felsenstein: independentcontrasts

0.0 0.2 0.4 0.6 0.8 1.0

-1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

timeX

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 6: Quantitative trait evolution with mutations of large effect

Evidence for non-Brownian evolution

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 7: Quantitative trait evolution with mutations of large effect

A coalescent model

· Haploid population of size N

· Trait governed by n loci

· Each locus has an independent coalescent tree

· Each locus has mutation rate θ2 (coalescent time units)

· When a mutation happens, effect Y is drawn fromdistribution with density p(y).

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 8: Quantitative trait evolution with mutations of large effect

Example

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 9: Quantitative trait evolution with mutations of large effect

Mutational effects

· Common assumption: many loci of very small effect

- Infinitesimal model

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 10: Quantitative trait evolution with mutations of large effect

Large effect mutations

· In some cases, large effect mutations may occur

- Transcription factor binding sites- Null mutants upstream in pathways

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 11: Quantitative trait evolution with mutations of large effect

Characteristic functions

Characteristic function

For a random variable X drawn from the probability measure µ(·),the function

φX (k) = E(e ikX )

=

∫e ikxdµ(x)

is called the characteristic function

Sums of random variables

If X1, . . . ,Xn are independent random variables, and X =∑

i Xi

thenφX (k) =

∏i

φXi(k)

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 12: Quantitative trait evolution with mutations of large effect

Compound Poisson process

Definition

Let N(t) be a rate λ Poisson process, and (Yi , i ≥ 1) a sequenceof independent and identically distributed random variables. Then

X (t) =

N(t)∑i=1

Yi

is called a compound Poisson process.

0 1 2 3 4 5

-5-3

-11

t

X(t)

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 13: Quantitative trait evolution with mutations of large effect

Characteristic function of a compound Poisson process

CF of a CP process

If X (t) is a rate λ Compound Poisson process, and ψ(k) is thecharacteristic function of the jump distribution, then thecharacteristic function of X is

φt(k) = eλt(ψ(k)−1)

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 14: Quantitative trait evolution with mutations of large effect

One locus, sample of size 2

· Condition on T2, thecoalescence time

- X1 and X2 areindependent CP(θ/2)processes run for time T2

- Joint distribution dependson root value

· Consider Z = X2 − X1

- Doesn’t depend on root

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 15: Quantitative trait evolution with mutations of large effect

One locus, sample of size 2 (CF)

φZ (k) =

∫ ∞0

E(e ikZ )e−tdt

=

∫ ∞0

E(e ik(X2−X1))e−tdt

=

∫ ∞0

φt(k)φt(−k)e−tdt

=

∫ ∞0

eθ2t(ψ(k)+ψ(−k)−2)e−tdt

=1

1− θ2 (ψ(k) + ψ(−k)− 2)

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 16: Quantitative trait evolution with mutations of large effect

Phenotype at the root

· The root of the tree must be specified

· A new individual could coalesce more anciently than thecurrent root

- For each n, there is a fixed root- Ensuring consistency could be hard

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 17: Quantitative trait evolution with mutations of large effect

The root problem

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 18: Quantitative trait evolution with mutations of large effect

Getting rid of the root

Normalization

If X = (X0, . . . ,Xn−1) are the trait values in the sample, then therandom vector

Z = (Z1, . . . ,Zn−1)

= (X1 − X0, . . . ,Xn−1 − X0)

does not depend on the root value

· ”Normalizing” by an arbitrary individual makes n unimportant

- Intuition: now every sample has to follow the path through thetree to individual 0, so the root doesn’t matter

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 19: Quantitative trait evolution with mutations of large effect

One locus, sample of size 3

Characteristic function for n = 3, and symmetric mutational effects

φZ1,Z2(k1, k2) =

11+θ(1−ψ(k1)) + 1

1+θ(1−ψ(k2)) + 11+θ(1−ψ(k1+k2))

3− θ2 (ψ(k1) + ψ(k2) + ψ(k1 + k2)− 3)

· Sketch of derivation:

- Condition on topology (each of 3 topologies equally likely)- Compute characteristic function for each topology while

conditioning on coalescence times- Integrate over coalescence times- Take weighted sum of characteristic functions for each

topology

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 20: Quantitative trait evolution with mutations of large effect

Sending the number of loci off to infinity

· We would like some nice limit as the number of loci increases

- Trivial result: “uniform on R” or δ(x)

· Need to decrease the effect size or mutation rate of each locus

· Three nontrivial limits

- Mutation rate per locus decreases but effect sizes remainconstaint

- Mutation rate per locus remains constant but effect sizesdecrease and effect size distribution does not have fat tails

- Mutation rate per locus remains constant but effect sizedecreases and effect size distribution has fat tail

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 21: Quantitative trait evolution with mutations of large effect

Decreasing mutation rate

Correlated CPP limit

As n ↑ ∞ and θ ↓ 0 such that nθ → Θ,

φZ1,Z2(k1, k2)→ eΘ2

(ψ(k1)+ψ(k2)+ψ(k1+k2)−3)

which is the characteristic function of two correlated compoundPoisson processes.

· Perhaps not very biologically relevant

· Will ignore for rest of talk

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 22: Quantitative trait evolution with mutations of large effect

Small effect mutations

Bivariate Gaussian limit

As n ↑ ∞ and the second moment of the effect distrbution, τ2, ↓ 0such that nτ2 → σ2,

φZ1,Z2(k1, k2)→ e−θ2σ2(k2

1 +k1k2+k22 )

which is the characteristic function of a Bivariate Gaussiandistribution with mean vector (0, 0) and variance-covariance matrix

Σ = θσ2

[1 1/2

1/2 1

]

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 23: Quantitative trait evolution with mutations of large effect

Large effect mutations

Bivariate stable distribution

Assume that p(y) ∼ κ|y |−(α+1) and sett = κπ (sin(απ/2)Γ(α)α)−1. As n ↑ ∞ and t ↓ 0 such thatnt → c ,

φZ1,Z2(k1, k2)→ e−12θc(|k1|α+|k2|α+|k1+k2|α)

where c = c̃ πsin(απ

2 ), which is the characteristic function of a

bivariate α-stable distribution

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 24: Quantitative trait evolution with mutations of large effect

de Finetti’s theorem

Theorem

If (X1,X2, . . .) is an infinitely exchangeable random vector, thenthe probability density of (X1 = x1, . . .Xn = xn) is a mixture overi.i.d. probability densities. That is,

p(x1, . . . , xn) =

∫ ∏i

pθ(xi )ν(dθ)

· Suggests that we can find a de Finetti measure such that allour normalized samples are i.i.d.

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 25: Quantitative trait evolution with mutations of large effect

A guess in the normal case

· Given the mean, the trait is distributed N (0, θ2σ2)

· The mean is itself distributed N (0, θ2σ2)

· Law of total variance gives Var(X ) = 2Nµσ2

- Same as classical derivation

φ(k1, k2) =

∫ (∫ 2∏l=1

e iklxl1√πθσ2

e−(m−xl )

2

θσ2 dxl

)1√πθσ2

e−m2

θσ2 dm

= e−θ4σ2(k2

1 +k22 )∫

e im(k1+k2) 1√πθσ2

e−m2

θσ2 dm

= e−θ4σ2(k2

1 +k22 )e−

θ4σ2(k1+k2)2

= e−θ2σ2(k2

1 +k1k2+k22 )

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 26: Quantitative trait evolution with mutations of large effect

Tempting interpretation and conjecture

· Conditioning on the amount of evolution “exclusive to sample0”

- Integrate over whether sample 1 or sample 2 coalesces withsample 0 first

· Suggests that this can be extended to arbitrary sample sizes

Conjecture about larger samples (Gaussian limit)

When the mutation kernel has only small effect mutations, thelimit distribution is normal with variance θ

2σ2 and random mean.

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 27: Quantitative trait evolution with mutations of large effect

Testing the Gaussian conjecture

· Simulate quantitative traits according to the coalescent model· Use KS test to assess convergence in the limit

22

23

24

25

26

27

28

21 22 23 24 25 26 27 28# loci

# sa

mpl

es

same

diffvalue

Normal, KS D

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 28: Quantitative trait evolution with mutations of large effect

Conjecture for the stable limit

· Bivariate case

- Independently α-stable with random median- Analogous to Gaussian limit

Conjecture about larger samples (Stable limit)

When the mutation kernel has large mutational effects, the limit

distribution is α-stable with scale(θ2c)1/α

and random median.

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 29: Quantitative trait evolution with mutations of large effect

Testing the stable conjecture

· Simulate quantitative traits according to the coalescent model· Use KS test to assess convergence in the limit

22

23

24

25

26

27

28

21 22 23 24 25 26 27 28# loci

# sa

mpl

es

same

diffvalue

Stable, KS D

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 30: Quantitative trait evolution with mutations of large effect

More randomness than expected

-1.0 -0.5 0.0 0.5 1.0

02

46

8

Trait value

Density

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 31: Quantitative trait evolution with mutations of large effect

Stable limit for sample of size 4

· Tedious computation

- Need to average over 18 trees.

· Conjecture would require

φZ1,Z2,Z3(k1, k2, k3)→ e−θ2c(|k1|α+|k2|α+|k3|α+|k1+k2+k3|α)

Trivariate characteristic function

For a sample of size three, under the same conditions as thebivariate limit,

φZ1,Z2,Z3 (k1, k2, k3)→ exp{−θ3c(|k1|α + |k2|α + |k3|α

+1

2|k1 + k2|α +

1

2|k1 + k3|α +

1

2|k2 + k3|α

+ |k1 + k2 + k3|α)}

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 32: Quantitative trait evolution with mutations of large effect

Extremely weak conjecture for large effect mutations

Conjecture (Stable limit)

When the mutation kernel has large mutational effects, the limitdistribution is α-stable with random parameters

· Not even sure that this is true

- A mixture of stable distributions?

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 33: Quantitative trait evolution with mutations of large effect

Empirical data

· Neurospora crassa mRNA-seq

- Ellison et al (2011), PNAS- Restrict to genes with FPKM > 1

· For each gene fit normal distribution, α-stable distribution

· Bootstrap likelihood ratio test (H0 : α = 2 vs. H1 : α < 2)

· Preliminary!

- Only analyzed 346 genes

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 34: Quantitative trait evolution with mutations of large effect

Distribution of p-values

p−value

Fre

quen

cy

0.0 0.2 0.4 0.6 0.8 1.0

050

100

150

200

· 54 genes significant at FDR of 32%

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 35: Quantitative trait evolution with mutations of large effect

Distribution of α in significant genes

α̂

Fre

quen

cy

1.4 1.5 1.6 1.7 1.8 1.9

02

46

8

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 36: Quantitative trait evolution with mutations of large effect

Cherry-picked examples

NCU09477

log(FPKM)

Den

sity

−1.0 −0.5 0.0 0.5

0.0

0.5

1.0

1.5

NCU00484

log(FPKM)

Den

sity

−0.2 0.0 0.2 0.4

01

23

NCU02435

log(FPKM)

Den

sity

−0.6 −0.4 −0.2 0.0 0.2 0.4

01

23

4

NCU07659

log(FPKM)

Den

sity

−1.0 −0.5 0.0 0.5

0.0

1.0

2.0

Joshua G. Schraiber and Michael J. Landis Mutations of large effect

Page 37: Quantitative trait evolution with mutations of large effect

Conclusions

· Introduced a coalescent model of quantitative trait evolution

- Provided exact formulas for the characteristic functions forsamples of size 2, 3, 4

- Found limiting distributions as the number of loci becamelarge in each case

- Conjectured about extending these limits to larger samples

· Extensions

- Samples not contemporaneous- Population structure- Diploids

· Why a neutral model?

- Analytically tractable calculations- Intuition for what to expect with weak selection- Null model

Joshua G. Schraiber and Michael J. Landis Mutations of large effect