Top Banner
QTL Studies- Past, Present and Future David Evans
45

QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

May 02, 2018

Download

Documents

nguyenthien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

QTL Studies- Past, Present and Future

David Evans

Page 2: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Genetic studies of complex diseases have not met anticipated

success

Glazier et al, Science (2002) 298:2345-2349

Page 3: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Korstanje & Pagan (2002) Nat Genet

Korstanje & Pagan (2002) Nat Genet

Page 4: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

BUT…Not much success in mapping complex diseases / traits !

Reasons for Failure?

Complex Phenotype

Commonenvironment

Marker Gene1

Individualenvironment

Polygenicbackground

Gene2

Gene3

Linkage

Linkagedisequilibrium

Mode ofinheritanceLinkage

Association

Weiss & Terwilliger (2000) Nat Genet

Page 5: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Type 1 diabetes and Insulin VNTR

Bennett & Todd, Ann Rev Genet, 1996

Alzheimers and ApoE4

Roses, Nature 2000

LD Patterns and Allelic Association

Pattern of LD unpredictable

Extent of common genetic variation unknown

Page 6: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Genome-wide Association?

Risch & Merikangas, Science 1996

Page 7: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Multiple Rare Variant Hypothesis?GWA assumes that common variants underlie common diseases

Page 8: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Enabling Genome-wide Association Studies

HAPlotype MAP

High throughput genotyping

Large cohorts“ALSPAC”

Page 9: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Wellcome Trust Case Control Consortium

Page 10: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

CASES1. Type 1 Diabetes2. Type 2 Diabetes3. Crohn’s Disease4. Coronary Heart Disease5. Hypertension6. Bipolar Disorder7. Rheumatoid Arthritis8. Malaria9. Tuberculosis

10.Ankylosing Spondylitis11.Grave’s Disease12.Breast Cancer13.Multiple Sclerosis

CONTROLS1. UK Controls A (1,500 - 1958 BC)2. UK Controls B (1,500 - NBS)3. Gambian controls (2000)

Wellcome Trust Case-Control ConsortiumGenome-Wide Association Across Major Human Diseases

DESIGNCollaboration amongst 26 UK diseaseinvestigators2000 cases each from 9 diseases1000 cases from 4 diseases

GENOTYPINGAffymetrix 500k SNPsIllumina Human NS_12 SNP chip

Page 11: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription
Page 12: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Ankylosing Spondylitis

Auto-immune arthritis resulting in fusion of vertebrae

Often associated with psoriasis, IBD and uveitis

Prevalence of 0.4% in Caucasians. More common in men.

Ed Sullivan, Mike Atherton

Page 13: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Ankylosing Spondylitis GWAS

WTCCC (2007) Nat Genet

IL23-R ARTS-1

Page 14: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Bre

ast c

a

Pros

tate

ca

Col

orec

tal c

a

Cro

hn’s

Dis

IBD

T1D

T2D

Obe

sity

Trig

lyce

rides

CA

D

Coe

liac

Dis

Rhe

umat

oid

A

Ank

ylos

ing

S

Alz

heim

er D

is

Gla

ucom

a

Ast

hma

FGFR2 SMAD7

TNRC9

MAPKI3

LSP1

2q35

8q24 8q24

NOD2 IL23R INS 9p21 FTO GCKR 9p21 IL2 PTPN22 ARTS GALP LOXL1 ORMDL

IL21 IL23R6q251

2q36

12q24

18p11

IL2RA

FTO

CDKAL1

IGF2BP2

KIAAA035D HHEX

PPARG

SLC30A8

TCF7L2

KCNJ11

5p13

ATG16L1

BSN

IRGM

PHOX2B

NCF4

NKX2-3

FAM92B

PTPN2

MSMB

KLK3

SLC22A3

NUDT11

3des.

11q13

LMTK2

HNF1B

CTBP2

JAZF1

8q24

WFS1

TCF2

PTPN22

CD25

ERBB3

CTLA4

IFIH1

10q21 Common genes = common etiology?

Some in gene deserts

Large relative risks does not = success

Drug targets

Successes…

Page 15: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

What About Quantitative Traits?

Quantitative genetics theory suggests that quantitative traits are the result of many variants of small effect

The corollary is that very large sample sizes will be needed to detect these variants in UNSELECTED samples

0

1

2

3

1 Gene 3 Genotypes 3 Phenotypes

0

1

2

3

2 Genes 9 Genotypes 5 Phenotypes

01234567

3 Genes27 Genotypes 7 Phenotypes

05

1015

20

4 Genes81 Genotypes 9 Phenotypes

Central Limit Theorem → Normal Distribution

Unselected samples

Page 16: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription
Page 17: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

FTSO

FTO

FTO produces a moderate signal in WTCCC T2D scan

But, no signal in an American T2D scan…?

WTCCC T2D Scan

-American cases and controls matched on BMI

Page 18: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

FTO

Replication is critical !!!

Page 19: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Twin, family and adoption studies suggest that, within a population, 90% of variation in height is due to genetic variation

Dizygotic Twins

Twins separated at birth

Monozygotic Twins

Borjeson, Acta Paed, 1976

Height- The Archetypal Polygenic Trait

Page 20: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GWA of Height

Weedon et al. (in press) Nat Genet

Large numbers are needed to detect QTLs !!!

Collaboration is the name of the game !!!

A- 1914 Cases (WTCCC T2D)B- 4892 Cases (DGI)C- 6788 Cases (WTCCC HT)D- 8668 Cases (WTCCC CAD)E- 12228 Cases (EPIC)F- 13665 Cases (WTCCC UKBS)

Significant results

Other loci?

Page 21: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

A: 1,900

C: 7,200

E: 12,600 F: 14,000

D: 9,100

B: 5,000

Weedon et al. (in press) Nat Genet

Some real hits sit in the bottom of the distribution

Some hits initially look interesting but then go away

Page 22: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Hedgehog signaling, cell cycle, and extra-cellular matrix genes over-represented

Not known-YesDYM

Extra-cellular matrixYesYesACAN

Cell cycle--ANAPC13

Hedgehog signalingYesYesIHH

Extra-cellular matrix--ADAMTSL3

Polycomb proteinYes -SCMH1

Not known--DLEU7

Transcription factor--ZNF678

Hedgehog signaling--HHIP

Regulates cytokine signal transductionYes-SOCS2

Not known--SPAG17

Hedgehog signallingYesYesPTCH1

Not known--C6orf106

Extra-cellular matrix-YesEFEMP1

Not known--LOC387103

May act as transcription activator--LCORL

Involved in bone formationYesYesGDF5

Chromatin architectural factorsYesYesHMGA2

Involved in the control of the cell cycle.Yes-CDK6

Transcription factor.--ZBTB38

Details*Knockout mouseMonogenicCandidate gene

Weedon et al. (in press) Nat Genet

Page 23: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

The combined impact of the 20 SNPS with a P < 5 x 10-7

• The 20 SNPs explain only ~3% of the variation of height

• Lots more genes to find – but extremely large numbers needed

Weedon et al. (in press) Nat Genet

Page 24: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Perola et al, Plos Genetics, 2007; data available at http://www.genomeutwin.org; Weedon et al.; unpublished data

Height Linkage Regions

Page 25: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Perola et al, Plos Genetics, 2007; data available at http://www.genomeutwin.org; Weedon et al.; unpublished data

Page 26: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

What’s Going On?

BUT…what if linkage analysis and association analysis identify different types of loci?

Areas identified by linkage don’t have significant assocation hits over them

Linkage analysis lacks power?

Loci identified by GWAs don’t have linkage peaks over them

Type I error?

Power?

Page 27: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

What next?

InitialGenome

Wide Scans

Functional Studies

Other ethnicgroups

Fine mapping

More genes

CNVs

Transcriptomics

Animal models

Mendelian Randomisation

Genomic ProfilingEpigenetics

Genome-wide Sequencing

Page 28: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Distribution of MAFs in HapMap

Genome-wide panels and HapMap biased towards common variants

Common variants don’t tag rare variants well

Page 29: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

FTO

IL23R

HLA-B27

TCF7L2ARTS1

TCF2

PPARG

WFS1

???

???

??????

???

???

SEMA5A “Rotten Fruit”

“Low hanging fruit”

“High hanging fruit”

Complex Disease Tree

Page 30: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Methods of gene huntingE

ffec

t Siz

e

Frequency

rare, monogenic (linkage)

common, complex (association)?

Page 31: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Evans et al. (2008) EJHG

Genome-wide SequencingSequence individuals’ genomes

Will identify rare variants

But will we have enough power?

Page 32: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Genomic Profiling

BUT…if we consider several predisposing genetic and environmental factors, can we predict disease?

Predictive testing in the case of monogenic diseases has been used for years (1300+ tests available) (e.g. Phenylketonuria)

The idea of using genetic information to inform diagnosis

Not possible in complex diseases as effects of an individual variant is so small

Page 33: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

(from Yang et al. 2003 AJHG)

(from Janssens et al. 2004 AJHG)

Genomic Profiling

=> Give up and go home?

Page 34: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

PRIOR DISEASE PROBABILITY

POST

ERIO

R P

RO

BA

BIL

ITY

OF

DIS

EASE

D+/B27+D+/B27-D+/B27+,ARTS1+,IL23+D+/B27-/ARTS1-/IL23R-

Prevalence of B27+, ARTS1+,IL23R+ is 2.4%

Prevalence of B27-, ARTS1-, IL23R- is 19%

Ankylosing Spondylitis

(Brown & Evans, in prep)

Page 35: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Using Genetics to Inform Classical Epidemiology

Page 36: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Observational Studies

Obesity

25050Control100200CHDNoYes Odds of obesity in cases: 200/100 = 2

Odds of obesity in controls: 50/250 = 0.2

Odds Ratio: 2/0.2 = 10

Fanciful claims often made from observational studies

In a case-control study, a group of diseased individuals are recruited (Cases); A group of individuals without disease are gathered (Controls); Both groups are then measured retrospectively on an exposure of interest; A test of association is performed

Example: Obesity (Exposure) and Coronary Heart Disease (Outcome)

Page 37: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Classic limitations to “observational” science

• Confounding

• Reverse Causation

• Bias

Page 38: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Randomized Control TrialsIndividuals free fromexposure and disease

TreatmentGroup

ControlGroup

Randomization

MeasureOutcome

MeasureOutcome

Randomization controls for confounding

Reverse causation impossible

Gold standard for assessing causality

Page 39: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Mendelian Randomization

Fortunately nature has provided us with a natural randomized control trial !

RCTs not always ethical or possible

Mendel’s law of independent assortment states that inheritance of a trait is independent (randomized) with respect to other traits

Assessing the relationship between genotype, environmental risk factor and disease informs us on causality

Therefore individuals are randomly assigned to three groups based on their genotype (AA, Aa, aa) independent of outcome

Page 40: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GeneticVariant

ModifiableEnvironmental

ExposureDisease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If obesity causes CHD then the relationship between FTO and CHD should be estimated by the product of βFTO-Obesity and βObesity-CHD

βFTO-Obesity

βObesity-CHD

Page 41: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GeneticVariant

ModifiableEnvironmental

ExposureDisease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If CHD causes obesity then βFTO-CHD should be zero.

βFTO-Obesity

βObesity-CHD

Page 42: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GeneticVariant

ModifiableEnvironmental

ExposureDisease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If the relationship between Obesity and CHD is purely correlational (i.e. due to confounding) then βFTO-CHD should be 0

βFTO-Obesity

Page 43: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GeneticVariant

ModifiableEnvironmental

ExposureDisease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

Genotype is NOT associated with confounders

Genotype is associated with the environmental exposure of interest

Genotype is only related to its outcome via its association withthe modifiable environmental exposure

Page 44: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

Mendelian RandomizationMendelian Randomization is a way of using a genetic variant(s) to make causal inferences about (modifiable) environmental risk factors for disease and health related outcomes

Environmental exposures (e.g. Obesity) can be modified ! Genetic factors cannot (at least for the moment…)

Still a relatively new approach that has problems (i.e. finding genetic proxies for environmental exposures- multiple instruments?)

…but a LOT of scope for development…

Page 45: QTL Studies- Past, Present and Future - Institute for …ibg ·  · 2008-03-18QTL Studies- Past, Present and Future David Evans. ... Multiple Rare Variant Hypothesis? ... - Transcription

GeneticVariant

ModifiableEnvironmental

ExposureDisease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Could SEM be used to enhance MR?

GeneticVariant (MC4R) Genetic

Variant (?)