Top Banner
Testing Multiple Hypotheses and False Discovery Rate Models, Inference, and Algorithms Primer Manuel A. Rivas Broad Institute
77

Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Nov 20, 2018

Download

Documents

doantuong
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Testing Multiple Hypotheses and

False Discovery RateModels, Inference, and Algorithms

Primer

Manuel A. Rivas Broad Institute

Page 2: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Let’s assume that we wish to examine the association between a response and

m different covariates

Page 3: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

When m tests are performed, the aim is to decide which of the nulls should be rejected.

Page 4: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0

H1

Page 5: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0

H1

K

This table shows the possibilities when m tests are performed and K are flagged as

requiring further attention.

Page 6: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0 m0

H1

K

m0 is the number of true nulls

Page 7: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0 B m0

H1

K

B is the number of type I errors

Page 8: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0 B m0

H1 C

K

C is the number of type II errors

Page 9: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0 A B m0

H1 C D m1

m - K K m

m1 is the number of true alternatives

Page 10: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Not flagged Flagged

H0 A B m0

H1 C D m1

m - K K m

Each of these quantities is unknown. The aim is to select a rule on the basis of some

criterion and this in turn will determine K.

Page 11: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

To illustrate the multiple testing problem we focus on GWAS as an example where we

typically test the null hypothesis

H0 : � = 0

i.e. the effect of the genetic variant is 0.

Page 12: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a single test situation the historical emphasis has been on the control of the

type I error rate (false positives).

Page 13: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a multiple testing situation there are a variety of criteria that may be considered.

Page 14: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a multiple testing situation there are a variety of criteria that may be considered:

1. Bonferroni method 2. Sidák correction 3. Benjamini and Hochberg (FDR) 4. Storey (FDR)

Frequentist analysis

Page 15: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a multiple testing situation there are a variety of criteria that may be considered:

1. Bonferroni method 2. Sidák correction 3. Benjamini and Hochberg (FDR) 4. Storey (FDR)

Frequentist analysis

Bayesian analysis1. Bayesian Bonferroni-type correction 2. Mixture models 3. Matthew Stephens’ FDR approach

Page 16: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a multiple testing situation there are a variety of criteria that may be considered:

1. Bonferroni method2. Sidák correction3. Benjamini and Hochberg (FDR) 4. Storey (FDR)

Frequentist analysis

Bayesian analysis 1. Bayesian Bonferroni-type correction 2. Mixture models 3. Matthew Stephens’ FDR approach

Page 17: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Family-wise error rate (FWER): the probability of making at least one type I error

Frequentist analysis

Page 18: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Family-wise error rate (FWER): the probability of making at least one type I error

Frequentist analysis

P (B � 1|H1 = 0, . . . , Hm = 0)

Page 19: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bonferroni method Frequentist analysis

Let be the event that the ith null is incorrectly rejected,so that, B, the random variable representing the number of incorrectly rejected nulls, corresponds to the union of all incorrectly rejected nulls, i.e.

Bi

[mi=1Bi

Page 20: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bonferroni method Frequentist analysis

With a common level for each test ↵⇤

the family-wise error rate (FWER) is

↵F = P (B � 1|H1 = 0, . . . , Hm = 0) = P ([mi=1Bi|H1 = 0, . . . , Hm = 0)

mX

i=1

P (Bi|H1 = 0, . . . , Hm = 0)

= m↵⇤

Page 21: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bonferroni method Frequentist analysis

↵F = P (B � 1|H1 = 0, . . . , Hm = 0) = P ([mi=1Bi|H1 = 0, . . . , Hm = 0)

mX

i=1

P (Bi|H1 = 0, . . . , Hm = 0)

= m↵⇤

The Bonferroni method takes ↵⇤ = ↵F /m

to give FWER ↵F .

Page 22: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bonferroni method Frequentist analysis

Preferred approach for GWAS where to control the FWER at a level of alpha = 0.05 with m = 1,000,000 tests, we would take

↵⇤ = .05/1, 000, 000 = 5⇥ 10�8.

Page 23: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Sidák correction Frequentist analysis

Overcomes conservatism introduced by inequality If test statistics are independent,

P (B � 1) = 1� P (B = 0)

= 1� P⇣\mi=1B

0

i

= 1�mY

i=1

P⇣B

0

i

= 1� P (1� ↵⇤)m

Page 24: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Sidák correction Frequentist analysis

Overcomes conservatism introduced by inequality If test statistics are independent,

↵⇤ = 1� (1� ↵F )1/m .

In GWAS, assuming 1,000,000 tests were independent this would change it slightly to 5.13e-8 as a p-value threshold.

Page 25: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

A simple way to overcome the conservative nature of the control of FWER is to increase ↵F .

Page 26: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

A simple way to overcome the conservative nature of the control of FWER is to increase ↵F .

One measure to calibrate a procedure is via the expected number of false discoveries:

EFD = m0 ⇥ ↵⇤

m⇥ ↵⇤

Page 27: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

A simple way to overcome the conservative nature of the control of FWER is to increase ↵F .

One measure to calibrate a procedure is via the expected number of false discoveries:

EFD = m0 ⇥ ↵⇤

m⇥ ↵⇤Recall m0 is the number of true nulls.

Page 28: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

For example, we could specify ↵⇤ such that EFD <= 1

We choose: ↵⇤ = 1/m.

↵⇤ = 1⇥ 10�6 for GWAS with 1,000,000 markers.

Page 29: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

We introduce the false discovery proportion (FDP) as the proportion of incorrect rejections:

FDP =B

K.

B is the number of type I errors

Page 30: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

We introduce the false discovery proportion (FDP) as the proportion of incorrect rejections:

FDP =B

K.

K is the number flagged for additional attention

Page 31: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Frequentist analysis

False Discovery Rate (FDR), the expected proportion of rejected nulls that are actually true:

FDR = E [FDP] = E [B/K|K > 0]P (K > 0)

Page 32: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

For independent p-values, each of which is uniform under the null.

P(1) < · · · < P(m)1. Let denote the ordered p-values.

Page 33: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

For independent p-values, each of which is uniform under the null.

P(1) < · · · < P(m)1. Let denote the ordered p-values.

2. Assume we would like FDR control at ↵ = 0.05

Page 34: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysisP(1) < · · · < P(m)1. Let denote the ordered p-values.

2. Assume we would like FDR control at ↵ = 0.05

li = i↵/m R = max

�i : P(i) < li

Let and

3. We use p-value threshold at P(R).

Page 35: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

GWAS example

1. Assume m = 1,000,000 independent tests.2. Assume P(10) = 4.5e-7 and P(11) = 5.7e-7

10*.05/1,000,000 = 5e-7P(10) < 5e-7

Page 36: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

GWAS example

1. Assume m = 1,000,000 independent tests.2. Assume P(10) = 4.5e-7 and P(11) = 5.7e-7

10*.05/1,000,000 = 5e-7P(10) < 5e-7

11*.05/1,000,000 = 5.5e-7P(11) 5.5e-7⌅

Page 37: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

GWAS example

1. Assume m = 1,000,000 independent tests.2. Assume P(10) = 4.5e-7 and P(11) = 5.7e-7

10*.05/1,000,000 = 5e-7P(10) < 5e-7

11*.05/1,000,000 = 5.5e-7P(11) 5.5e-7⌅

3. Use p-value threshold at P(10).

Page 38: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

FDR is controlled at

If procedure is applied, then regardless of how many nulls are true (m0) and regardless of the distribution of the p-values when the null is false

FDR m0

m↵ < ↵.

↵.

Page 39: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

FDR is controlled at ↵.

Bonferroni = 5%

Page 40: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

FDR is controlled at ↵.

FDR = 5%

Page 41: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Benjamini and Hochberg (1995) procedure

Frequentist analysis

FDR is controlled at ↵.

EFD = 1

Page 42: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

False Discovery Rate (FDR) Storey (2002)

Frequentist analysis

Introduced the q-value

For each observed statistic we can obtain an associated q-value, which tells us the proportion of false positives incurred at a thresholded statistic.

q (t) = P (H = 0|T > t)

Page 43: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

In a multiple testing situation there are a variety of criteria that may be considered:

1. Bonferroni method 2. Sidák correction 3. Benjamini and Hochberg (FDR) 4. Storey (FDR)

Frequentist analysis

Bayesian analysis1. Bayesian Bonferroni-type correction2. Mixture models3. Matthew Stephens’ FDR approach

Page 44: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayes Factors

Bayesian analysis

Defined as ratios of marginal likelihoods of the data under two models.

Model 0 can be the null model.

Bayes Factori = P (Data|M0) /P (Data|M1)

Page 45: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayes Factors

Bayesian analysis

We apply the same procedure m times (can be genetic variants for instance).

Page 46: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayes Factors

Bayesian analysis

Combine with prior probabilities

Posterior Oddsi = Bayes Factori ⇥ Prior Oddsi

where

Prior Oddsi = ⇡0i/ (1� ⇡0i) .

Page 47: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayes Factors

Bayesian analysis

Combine with prior probabilities

Posterior Oddsi = Bayes Factori ⇥ Prior Oddsi

where

Prior Oddsi = ⇡0i/ (1� ⇡0i) .

Page 48: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayesian Bonferroni-type correction

Bayesian analysis

If the prior probabilities of each of the nulls are independent with ⇡0i = ⇡0 for i = 1, …, m.

Then prior probability that all nulls are true is

⇧0 = P (H1 = 0, . . . , Hm = 0) = ⇡m0

Page 49: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Bayesian Bonferroni-type correction

Bayesian analysis

If the prior probabilities of each of the nulls are independent with ⇡0i = ⇡0 for i = 1, …, m.

Suppose that we wish to fix the prior probability that all of the nulls are true at . We can fix ⇧0 ⇡0i = ⇧1/m

0

Page 50: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Mixture model

Bayesian analysis

Estimate common parameters like the proportion of null tests

Gibbs sampler.

Page 51: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Page 52: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Open source R package

http://github.com/stephens999/ashr

ashr

Page 53: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

1. Assumes distribution of effects is unimodal, with a mode at 0.

Page 54: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

1. Assumes distribution of effects is unimodal, with a mode at 0.

2. Takes as input two numbers: i) effect size estimate and ii) corresponding standard error.

Page 55: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outlineData corresponds to effect size estimates and corresponding (estimated) standard errors, i.e.

� =⇣�1, . . . , �m

s = (s1, . . . , sm) .

Page 56: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outline

Goal is to compute a posterior distribution

p⇣�|�, s

⌘/ p (�|s) p

⇣�|�, s

⌘.

LikelihoodPrior

Page 57: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outline

Goal is to compute a posterior distribution

p⇣�|�, s

⌘/ p (�|s) p

⇣�|�, s

⌘.

Key: For assumption is that the betas are independent from a unimodal distribution.

p (�|s)

“Unimodal assumption”

Page 58: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outlineAssume that it is a mixture of point mass at 0 and a mixture of zero-mean normal distributions:

“Unimodal assumption”

p (�|s,⇡) = ⇡0�0 (·) +KX

k=1

⇡kN�·; 0,�2

k

Estimate mixture proportions mixture component standard deviations is a grid

Page 59: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outline

For the likelihood p⇣�|�, s

p⇣�|�, s

⌘=

mY

j=1

N⇣�j ;�j , s

2j

Page 60: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outline

Goal is to compute a posterior distribution

p⇣�|�, s

⌘/ p (�|s) p

⇣�|�, s

⌘.

“Unimodal assumption”

Page 61: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Model outline

Goal is to compute a posterior distribution

p⇣�|�, s

⌘/ p (�|s) p

⇣�|�, s

⌘.

Measurement precision in likelihood

Page 62: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

1. Assumes distribution of effects is unimodal, with a mode at 0.

2. Takes as input two numbers: i) effect size estimate and ii) corresponding standard error.

3. local false sign rate - probability of getting sign of effect wrong.

Page 63: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

3. local false sign rate - probability of getting sign of effect wrong.

lfdrj := P⇣�j = 0|�, s, ⇡

⌘.

local false discovery rate (“local FDR”)

probability, given the observed data, that effect j would be a false discovery, if we were to declare it a discovery.

Page 64: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

lfdrj := P⇣�j = 0|�, s, ⇡

⌘.

some statisticians argue that it is inappropriate because that null hypothesis is often implausible.

Page 65: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

lfdrj := P⇣�j = 0|�, s, ⇡

⌘.

Can obtain an estimate of the average error rate over subsets of observations, for example if you declared all tests in as significant.

[FDR (�) := (1/ |�|)X

j2�

lfdrj

Page 66: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

Tukey stated:

All we know about the world teaches us that the effects of A and B are always different - in some

decimal place - for any A and B.

Page 67: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas

3. local false sign rate - probability of getting sign of effect wrong.

Tukey suggested:

Is the evidence strong enough to support a belief that the observed difference has the correct sign?

Page 68: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas3. local false sign rate - probability of getting sign of effect wrong.

lfsrj := minhp⇣�j � 0|�, s

⌘, p

⇣�j 0|⇡, �, s

⌘i.

Page 69: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Three key ideas3. local false sign rate - probability of getting sign of effect wrong.

lfsrj := minhp⇣�j � 0|�, s

⌘, p

⇣�j 0|⇡, �, s

⌘i.

Gelman proposed focusing on “type S errors”, errors in sign, rather than traditional type I errors.

Page 70: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Matthew Stephens’ FDR approach

Bayesian analysis

Other results/observations covered, but not in this primer

1. Computation/Implementation details 2. Comparisons to other approaches

Page 71: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

How to correct?

Page 72: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Bonferroni?

Page 73: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Bonferroni? Too conservative.

Page 74: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Minimum p-value?

Page 75: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Minimum p-value? not valid p-value since it is not uniform under the null. (can modify null though)

Page 76: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Minimum p-value? not valid p-value since it is not uniform under the null.

Permutation.

Page 77: Testing Multiple Hypotheses and False Discovery Rate · Testing Multiple Hypotheses and False Discovery Rate Models, ... H0 H1 K This table shows the ... H0 : =0 i.e. the effect of

Multiple tests of the same null hypothesis

Bayesian analysis

In genetics we may be interested in applying: 1) additive model, 2) dominant, and 3) recessive model.

Bayesian model averaging.

Hoeting, Madigan, Raftery and Volinksy Statisticial Science 1999