203.343 Advanced Genetics and Genomics Lecture 3 July 20th 2015 Olin Silander
Page 1: Lecture3_Jul20

203.343 Advanced Genetics and Genomics

Lecture 3

July 20th 2015

Olin Silander

Page 2: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Page 3: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Describe the coalescent and the neutral model of evolution

Page 4: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Page 5: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles:

a 40%, A 60%

AA ?, Aa ?, aa ?

Page 6: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles:

a 40%, A 60%

a: 40%

Page 7: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles:

a 40%, A 60%

a (40%) x a (40%) = aa 16%

Page 8: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles:

a 40%, A 60%

a (40%) x a (40%) = aa 16%

2 x A (60%) x a (40%) = Aa 48%

Page 9: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles:

a 40%, A 60%

a (40%) x a (40%) = aa 16%

2 x A (60%) x a (40%) = Aa 48%

A (60%) x A (60%) = AA 36%
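
The genotype percentages on this slide follow from multiplying allele frequencies. A minimal sketch in Python, assuming the HWE conditions hold; the function name `hardy_weinberg` is hypothetical and not from the lecture:

```python
def hardy_weinberg(p_A):
    """Expected genotype frequencies for a two-allele locus under HWE.

    p_A is the frequency of allele A; allele a has frequency 1 - p_A.
    """
    q_a = 1.0 - p_A
    return {
        "AA": p_A * p_A,      # both alleles drawn are A
        "Aa": 2 * p_A * q_a,  # A then a, or a then A
        "aa": q_a * q_a,      # both alleles drawn are a
    }


freqs = hardy_weinberg(0.6)
for genotype, f in freqs.items():
    print(f"{genotype}: {f:.0%}")
```

With A at 60% this gives AA 36%, Aa 48%, and aa 16%, matching the slide.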

Page 10: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

What are the assumptions of HWE?

(1) diploid (2) non-overlapping generations (3) sexual reproduction (4) random mating (5) infinite population size (6) no selection

Page 11: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

What are the assumptions of HWE?

infinite population size

aa 16%

Aa 48%

AA 36%

Page 12: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

What are the assumptions of HWE?

infinite population size: aa 16%, Aa 48%, AA 36%

finite population size (drift): 30%, 40%, 30%
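
One way to see why a finite population departs from the HWE proportions is to sample genotypes at random. The sketch below is an illustration only: the function name, the sample sizes, and the seed are assumptions, and each individual is assumed to receive two independently drawn alleles.

```python
import random


def sample_genotype_frequencies(p_A, n_individuals, seed=None):
    """Draw n_individuals genotypes by sampling two alleles each."""
    rng = random.Random(seed)
    counts = {"AA": 0, "Aa": 0, "aa": 0}
    for _ in range(n_individuals):
        # Count how many of the two sampled alleles are A (0, 1, or 2).
        n_A = sum(rng.random() < p_A for _ in range(2))
        counts[("aa", "Aa", "AA")[n_A]] += 1
    return {g: c / n_individuals for g, c in counts.items()}


# A small sample can deviate noticeably from 36% / 48% / 16%;
# a very large one stays close to those expectations.
small = sample_genotype_frequencies(0.6, 10, seed=1)
large = sample_genotype_frequencies(0.6, 100_000, seed=1)
print(small)
print(large)
```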

Page 13: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

What are the assumptions of HWE?

aa 16%, Aa 48%, AA 36%

finite population size (drift): 30%, 40%, 30%

selection: 13%, 10%, 77%

Page 14: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles

Page 15: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles

Linkage

Two alleles, two loci

Page 16: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Hardy Weinberg equilibrium

Two alleles

Linkage

Two alleles, two loci

Haplotypes: AB, Ab, ab, aB

Page 17: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

Haplotypes: AB, Ab, ab, aB

A 60%, a 40%; B 80%, b 20%

Page 18: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

AB 48%, Ab 12%, ab 8%, aB 32%

Page 19: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

AB 48%, Ab 12%, ab 8%, aB 32%

Linkage equilibrium
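
At linkage equilibrium each haplotype frequency is the product of its two allele frequencies. A short sketch of that calculation; the function name `equilibrium_haplotypes` is hypothetical:

```python
def equilibrium_haplotypes(f_A, f_B):
    """Haplotype frequencies expected when the two loci are independent."""
    f_a, f_b = 1.0 - f_A, 1.0 - f_B
    return {
        "AB": f_A * f_B,
        "Ab": f_A * f_b,
        "ab": f_a * f_b,
        "aB": f_a * f_B,
    }


expected = equilibrium_haplotypes(0.6, 0.8)
for haplotype, f in expected.items():
    print(f"{haplotype}: {f:.0%}")
```

For A at 60% and B at 80% this reproduces the 48%, 12%, 8%, and 32% shown on the slide.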

Page 20: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

AB 60%, Ab 0%, ab 20%, aB 20%

Linkage disequilibrium

Page 21: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

Page 22: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Linkage equilibrium: fAB = 0.48, fAb = 0.12, faB = 0.32, fab = 0.08

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

Page 23: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Linkage equilibrium: fAB = 0.48, fAb = 0.12, faB = 0.32, fab = 0.08

fAB - fAfB = 0

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

Page 24: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Linkage equilibrium: fAB = 0.48, fAb = 0.12, faB = 0.32, fab = 0.08

fAB - fAfB = 0

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

fAB - fAfB = ?

Page 25: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

Quantify LD: fAB - fAfB = D

Page 26: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Linkage

Two alleles, two loci

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

Page 27: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

Linkage equilibrium: fAB - fAfB = D = 0

Page 28: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

Page 29: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

fAB - fAfB = D = 0.12
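
The value D = 0.12 can be checked directly from the four haplotype frequencies. A minimal sketch, assuming the frequencies are supplied as a dictionary keyed by haplotype; the function name `linkage_d` is hypothetical:

```python
def linkage_d(hap):
    """D = fAB - fA * fB, with allele frequencies summed from haplotypes."""
    f_A = hap["AB"] + hap["Ab"]
    f_B = hap["AB"] + hap["aB"]
    return hap["AB"] - f_A * f_B


disequilibrium = {"AB": 0.60, "Ab": 0.0, "aB": 0.20, "ab": 0.20}
equilibrium = {"AB": 0.48, "Ab": 0.12, "aB": 0.32, "ab": 0.08}

print(round(linkage_d(disequilibrium), 3))  # 0.60 - 0.6 x 0.8
print(round(linkage_d(equilibrium), 3))     # 0.48 - 0.6 x 0.8
```

Both populations have the same allele frequencies (A 60%, B 80%); only the way alleles are paired into haplotypes differs.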

Page 30: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

fAB - fAfB = D = 0.12

What is the maximum value of D?

Page 31: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

D’: scaled to be between 0 and 1

Page 32: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

D’: scaled to be between 0 and 1

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

Page 33: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

D’: scaled to be between 0 and 1

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

Linkage disequilibrium: fAB = 0.60, fAb = 0, faB = 0.20, fab = 0.20

fAB - fAfB = D = 0.12

Page 34: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

D’: scaled to be between 0 and 1

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

Linkage disequilibrium: fAB = 0.55, fAb = 0.05, faB = 0.25, fab = 0.15

fAB - fAfB = D = ?

Page 35: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Quantify LD: fAB - fAfB = D

D’: scaled to be between 0 and 1

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

Linkage disequilibrium: fAB = 0.55, fAb = 0.05, faB = 0.25, fab = 0.15

fAB - fAfB = D = 0.07
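
The two-case rule for D’ translates into a short function. This is a sketch that follows the slide's convention, so the result is non-negative; the name `d_prime` and the dictionary input are assumptions made for illustration:

```python
def d_prime(hap):
    """D' following the slide: D divided by its bound for the given sign."""
    f_A = hap["AB"] + hap["Ab"]
    f_B = hap["AB"] + hap["aB"]
    f_a, f_b = 1.0 - f_A, 1.0 - f_B
    D = hap["AB"] - f_A * f_B
    if D > 0:
        return D / min(f_A * f_b, f_a * f_B)
    if D < 0:
        return D / max(-f_A * f_B, -f_a * f_b)
    return 0.0  # no disequilibrium


partial = {"AB": 0.55, "Ab": 0.05, "aB": 0.25, "ab": 0.15}
complete = {"AB": 0.60, "Ab": 0.0, "aB": 0.20, "ab": 0.20}

print(round(d_prime(partial), 2))   # 0.07 / 0.12
print(round(d_prime(complete), 2))  # 0.12 / 0.12
```

The second example addresses the earlier question about the maximum of D: with A at 60% and B at 80%, D cannot exceed min(fAfb, fafB) = 0.12, so the haplotype set with fAb = 0 has D’ = 1.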

Page 36: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

        population 2    population 1
Ind.    locA  locB      locA  locB
1       A     A         A     C
2       A     A         A     A
3       A     A         A     C
4       A     A         A     C
5       A     A         A     C
6       A     A         A     A
7       A     C         A     A
8       A     A         A     A
9       A     A         A     C
10      A     A         A     A
11      T     C         T     C
12      T     C         T     C
13      T     C         T     C
14      T     A         T     A
15      T     C         T     A
16      T     C         T     A
17      T     C         T     A
18      T     C         T     A
19      T     C         T     C
20      T     C         T     C

Page 37: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

fAB - fAfB = D

Page 38: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

fAB - fAfB = D

Page 39: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

Page 40: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

population 1: 0.25 - 0.5 x 0.5 = 0

Page 41: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

population 1: 0.25 - 0.5 x 0.5 = 0

population 2: 0.05 - 0.5 x 0.5 = -0.2

Page 42: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

population 1: fAB - fAfB = D: 0.25 - 0.5 x 0.5 = 0

population 2: fAB - fAfB = D: 0.05 - 0.5 x 0.5 = -0.2

population 2: fAb - fAfb = D: 0.45 - 0.5 x 0.5 = 0.2

Page 43: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

population 1: fAB - fAfB = D: 0.25 - 0.5 x 0.5 = 0

population 2: fAB - fAfB = D: 0.05 - 0.5 x 0.5 = -0.2

population 2: fAb - fAfb = D: 0.45 - 0.5 x 0.5 = 0.2

population 2: fab - fafb = D: 0.05 - 0.5 x 0.5 = -0.2

population 2: faB - fafB = D: 0.45 - 0.5 x 0.5 = 0.2

Page 44: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

population 1: 0.25 - 0.5 x 0.5 = 0

population 2: 0.05 - 0.5 x 0.5 = -0.2

The choice of alleles does not matter: the absolute value of D is always the same.

Page 45: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

population 1: 0.25 - 0.5 x 0.5 = 0

population 2: 0.05 - 0.5 x 0.5 = -0.2

Page 46: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

locus A: A = A, a = T

locus B: B = C, b = A

fAB - fAfB = D

population 1: 0.25 - 0.5 x 0.5 = 0

population 2: 0.05 - 0.5 x 0.5 = -0.2

if D < 0: D’ = D / max(-fAfB, -fafb)

population 2: D’ = -0.2 / -0.25 = 0.8
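
The D and D’ values for the two populations can be recomputed from the genotype tables. The sketch below transcribes the tables as strings, one character per individual in order 1 to 20; the function name `ld_from_samples` and this string encoding are assumptions made for illustration. As on the slide, A at locus A is taken as "A" and C at locus B is taken as "B".

```python
def ld_from_samples(loc_a, loc_b, allele_A="A", allele_B="C"):
    """Return (D, D') from paired allele observations at two loci."""
    n = len(loc_a)
    f_A = loc_a.count(allele_A) / n
    f_B = loc_b.count(allele_B) / n
    # Fraction of individuals carrying both chosen alleles together.
    f_AB = sum(x == allele_A and y == allele_B
               for x, y in zip(loc_a, loc_b)) / n
    D = f_AB - f_A * f_B
    f_a, f_b = 1.0 - f_A, 1.0 - f_B
    if D > 0:
        return D, D / min(f_A * f_b, f_a * f_B)
    if D < 0:
        return D, D / max(-f_A * f_B, -f_a * f_b)
    return D, 0.0


loc_a = "A" * 10 + "T" * 10             # same in both populations
pop1_loc_b = "CACCCAAACA" + "CCCAAAAACC"
pop2_loc_b = "AAAAAACAAA" + "CCCACCCCCC"

d1, dp1 = ld_from_samples(loc_a, pop1_loc_b)
d2, dp2 = ld_from_samples(loc_a, pop2_loc_b)
print(round(d1, 2), round(dp1, 2))  # population 1
print(round(d2, 2), round(dp2, 2))  # population 2
```

Population 1 gives D = 0, and population 2 gives D = -0.2 with D’ = 0.8, in agreement with the slide.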

Page 47: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

population 2: if we see an A

Page 48: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

population 2: if we see an A at locus A, there is a 90% chance there is an A at locus 2 (locB)

Page 49: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

population 2: if we see an A at locus A, there is a 90% chance there is an A at locus 2 (locB)

r

Page 50: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Genotype tables for population 2 and population 1, repeated from Page 36]

population 2: if we see an A at locus A, there is a 90% chance there is an A at locus 2 (locB)

r = D / √(fAfBfafb)

Page 51: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Linkage disequilibrium: fAB = 0.55, fAb = 0.05, faB = 0.25, fab = 0.15

D = fAB - fAfB = ?

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

r = D / √(fAfBfafb)

D’ = ?

r = ?

Page 52: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

A 60%, a 40%; B 80%, b 20%

Linkage disequilibrium: fAB = 0.55, fAb = 0.05, faB = 0.25, fab = 0.15

D = fAB - fAfB = 0.07

if D > 0: D’ = D / min(fAfb, fafB)

if D < 0: D’ = D / max(-fAfB, -fafb)

r = D / √(fAfBfafb)

D’ = 0.07 / 0.12 ≈ 0.58

r = 0.07 / 0.196 ≈ 0.36
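
The correlation r for this example can be computed in the same way as D and D’. A minimal sketch using only the standard library; the function name `ld_r` is hypothetical:

```python
import math


def ld_r(hap):
    """r = D / sqrt(fA * fa * fB * fb) for a two-locus, two-allele system."""
    f_A = hap["AB"] + hap["Ab"]
    f_B = hap["AB"] + hap["aB"]
    f_a, f_b = 1.0 - f_A, 1.0 - f_B
    D = hap["AB"] - f_A * f_B
    return D / math.sqrt(f_A * f_a * f_B * f_b)


example = {"AB": 0.55, "Ab": 0.05, "aB": 0.25, "ab": 0.15}
r = ld_r(example)
print(round(r, 2))  # 0.07 / 0.196
```

The denominator is √(0.6 x 0.4 x 0.8 x 0.2) ≈ 0.196, which is the value used on the slide.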

Page 53: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1]

Page 54: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1, each labeled with allele A or T]

Page 55: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1 (alleles A or T) and at locus 2 (alleles C or T)]

Page 56: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1 (alleles A or T) and at locus 2 (alleles C or T)]

Both loci have the same demographic history

Complete linkage disequilibrium

Page 57: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1 (alleles A or T) and at locus 2 (alleles C or T)]

Both loci have the same demographic history

Complete linkage disequilibrium

Page 58: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

[Figure: individuals 1-20 at locus 1 (alleles A or T) and at locus 2 (alleles C or T)]

Both loci have different demographic histories

Partial linkage disequilibrium

Page 59: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Quantifying LD

D: non-random associations between A and B alleles

D’: D normalized by allele frequencies

r: correlation (D normalized by allele frequencies)

Page 60: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Why do different loci have different demographic histories?

Page 61: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Why do different loci have different demographic histories?

Recombination

Page 63: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Why do different loci have the same demographic histories?

Page 64: Lecture3_Jul20

Gene Linkage and Linkage Disequilibrium

Why do different loci have the same demographic histories?

selection

population size (bottleneck)

(lack of) recombination

population admixture (migration)
