Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk Axel Gandy Department of Mathematics Imperial College London [email protected] useR! 2009, Rennes July 8-10, 2009

Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk

Axel Gandy

Department of Mathematics, Imperial College London
[email protected]

useR! 2009, Rennes, July 8-10, 2009


Introduction

- Test statistic $T$; reject for large values.
- Observation: $t$.
- p-value:

  $$p = P(T \ge t)$$

  Often not available in closed form.
- Monte Carlo test:

  $$\hat{p}_{\text{naive}} = \frac{1}{n} \sum_{i=1}^{n} \underbrace{I(T_i \ge t)}_{=: X_i \sim B(1,p)},$$

  where $T, T_1, \ldots, T_n$ are i.i.d.
- Examples:
  - Bootstrap,
  - Permutation tests.
- Goal: estimate $p$ using few $X_i$.

Mainly interested in deciding whether $p \le \alpha$ for some $\alpha$.
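As a minimal sketch of the naive estimator above, the following R code averages the indicators $X_i = I(T_i \ge t)$. The function name `naive_pvalue`, the argument `rstat` (a user-supplied sampler for one $T_i$ under the null) and the standard normal toy setting are assumptions made for illustration; they do not come from the talk.

```r
# Naive Monte Carlo p-value: the fraction of simulated statistics T_i >= t.
# `rstat` is a hypothetical user-supplied function that draws one T_i under H0.
naive_pvalue <- function(t_obs, rstat, n = 1000) {
  x <- vapply(seq_len(n), function(i) rstat() >= t_obs, logical(1))  # X_i = I(T_i >= t)
  mean(x)                                                            # (1/n) * sum of X_i
}

set.seed(1)
# Toy setting (assumption): T is standard normal and the observed value is t = 1.5,
# so the exact p-value is available for comparison.
p_hat  <- naive_pvalue(t_obs = 1.5, rstat = function() rnorm(1), n = 10000)
p_true <- pnorm(1.5, lower.tail = FALSE)
```

With a fixed $n$ the estimate fluctuates around `p_true`, which is what makes the decision "$p \le \alpha$" unreliable when $p$ is close to $\alpha$.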


Sequential approaches based on $S_n = \sum_{i=1}^{n} X_i$

[Figure: $S_n$ plotted against $n$; both axes run from 0 to 10.]

- Stop once $S_n \ge U_n$ or $S_n \le L_n$.
- $\tau$: hitting time.
- Compute $\hat{p}$ based on $S_\tau$ and $\tau$.
- Hit $B_U$: decide $p > \alpha$.
- Hit $B_L$: decide $p \le \alpha$.
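The stopping rule can be sketched as a simple loop. The function name `run_sequential` and its argument `draw_x` (a function returning one $X_i$ as 0 or 1) are hypothetical. The boundary vectors are the $U_n$ and $L_n$ values for $n = 1, \ldots, 8$ from the talk's worked example with $\alpha = 0.2$, and the truncation at the end of the supplied boundaries is an added assumption.

```r
# Run the sequential procedure for given boundaries U[n], L[n], n = 1, 2, ...
# `draw_x` is a hypothetical function returning one X_i (0 or 1).
run_sequential <- function(draw_x, U, L) {
  s <- 0
  for (n in seq_along(U)) {
    s <- s + draw_x()                       # S_n = S_{n-1} + X_n
    if (s >= U[n]) return(list(tau = n, S = s, decision = "p > alpha"))
    if (s <= L[n]) return(list(tau = n, S = s, decision = "p <= alpha"))
  }
  list(tau = Inf, S = s, decision = "none") # no boundary hit within length(U) steps
}

U <- c(2, 2, 2, 3, 3, 3, 3, 3)
L <- c(-1, -1, -1, -1, -1, -1, 0, 0)
up  <- run_sequential(function() 1, U, L)   # every X_i = 1: upper boundary is hit
low <- run_sequential(function() 0, U, L)   # every X_i = 0: lower boundary is hit
```

The two deterministic runs only exercise the mechanics; in practice `draw_x` would wrap one resampling step.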


Previous Approaches

- Besag & Clifford (1991):

  [Figure: stopping boundary in the $(n, S_n)$ plane; $n$ axis marked 0 and $m$, $S_n$ axis marked 0 and $h$.]

- (Truncated) Sequential Probability Ratio Test, Fay et al. (2007):

  [Figure: stopping boundary in the $(n, S_n)$ plane; $n$ axis marked 0 and $m$, $S_n$ axis marked 0 and $h$.]

- R-package MChtest.


What do we really want?

Is $p \le \alpha$?

Two individuals using the same statistical method on the same data should arrive at the same conclusion.
First law of applied statistics, Gleser (1996)

Consider the resampling risk

$$RR_p(\hat{p}) \equiv \begin{cases} P_p(\hat{p} > \alpha) & \text{if } p \le \alpha, \\ P_p(\hat{p} \le \alpha) & \text{if } p > \alpha. \end{cases}$$

Want:

$$\sup_{p \in [0,1]} RR_p(\hat{p}) \le \epsilon$$

for some (small) $\epsilon > 0$.

For Besag & Clifford (1991) and the SPRT: $\sup_p RR_p \ge 0.5$.
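To make the definition concrete, the sketch below evaluates the resampling risk exactly for the naive fixed-$n$ estimator $\hat{p}_{\text{naive}} = S_n / n$, using the binomial distribution of $S_n$. Applying the definition to the naive estimator is an illustration added here, not a computation from the talk, and the function name `rr_naive` is hypothetical.

```r
# Resampling risk of the naive estimator S_n / n, where S_n ~ Binomial(n, p).
rr_naive <- function(p, alpha, n) {
  k <- floor(n * alpha + 1e-9)        # largest value of S_n with S_n / n <= alpha
  if (p <= alpha) {
    1 - pbinom(k, n, p)               # P_p(p_hat > alpha)
  } else {
    pbinom(k, n, p)                   # P_p(p_hat <= alpha)
  }
}

rr_at_alpha    <- rr_naive(0.05,   alpha = 0.05, n = 1000)  # p exactly at the threshold
rr_above_alpha <- rr_naive(0.0501, alpha = 0.05, n = 1000)  # p just above the threshold
```

Under these assumptions the risk near $p = \alpha$ is close to one half on either side of the threshold, so no fixed $n$ can keep the supremum small.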


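Recursive Definition of the Boundaries

Want:

$$\sup_p RR_p(\hat{p}) \le \epsilon$$

Suffices to ensure

$$P_\alpha(\text{hit } B_U) \le \epsilon, \qquad P_\alpha(\text{hit } B_L) \le \epsilon.$$

Recursive definition: given $U_1, \ldots, U_{n-1}$ and $L_1, \ldots, L_{n-1}$, define

- $U_n$ as the minimal value such that

  $$P_\alpha(\text{hit } B_U \text{ until } n) \le \epsilon_n$$

- and $L_n$ as the maximal value such that

  $$P_\alpha(\text{hit } B_L \text{ until } n) \le \epsilon_n$$

where $\epsilon_n \ge 0$ with $\epsilon_n \nearrow \epsilon$ (spending sequence).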
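A short sketch of why the two conditions at $p = \alpha$ suffice. It assumes the properties stated elsewhere in the talk (hitting $B_U$ gives $\hat{p} > \alpha$, hitting $B_L$ gives $\hat{p} < \alpha$, $\hat{p} = \alpha$ if $\tau = \infty$, and $P_p(\tau = \infty) = 0$ for $p \ne \alpha$). The coupling $X_i = I(V_i \le p)$ with i.i.d. uniform $V_i$ is an assumption introduced for the argument; it makes every $S_n$ non-decreasing in $p$, so $P_p(\text{hit } B_U)$ is non-decreasing and $P_p(\text{hit } B_L)$ is non-increasing in $p$.

```latex
\begin{align*}
p \le \alpha:\quad RR_p(\hat{p}) &= P_p(\hat{p} > \alpha) = P_p(\text{hit } B_U)
  \le P_\alpha(\text{hit } B_U) \le \epsilon, \\
p > \alpha:\quad RR_p(\hat{p}) &= P_p(\hat{p} \le \alpha)
  = P_p(\text{hit } B_L) + P_p(\tau = \infty)
  \le P_\alpha(\text{hit } B_L) + 0 \le \epsilon .
\end{align*}
```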


Recursive Definition - Example

- $\alpha = 0.2$, $\epsilon_n = 0.4 \frac{n}{5+n}$.
- $U_n$ = the minimal value such that $P_\alpha(\text{hit } B_U \text{ until } n) \le \epsilon_n$.
- $L_n$ = the maximal value such that $P_\alpha(\text{hit } B_L \text{ until } n) \le \epsilon_n$.

$P_\alpha(S_n = k, \tau \ge n)$:

n =      0     1     2     3     4     5     6     7     8
k = 3                                  .02   .03   .04   .05
k = 2                .04   .06   .08   .14   .20   .24   .26
k = 1          .2    .32   .38   .41   .41   .39   .37   .29
k = 0    1     .8    .64   .51   .41   .33   .26   .21

ε_n      0     .07   .11   .15   .18   .20   .22   .23   .25
U_n      1     2     2     2     3     3     3     3     3
L_n      -1    -1    -1    -1    -1    -1    -1    0     0


Sequential Decision Procedure - Example

$\alpha = 0.2$, $\epsilon_n = 0.4 \frac{n}{5+n}$.

[Figure: plot against $n$ (0 to 100); vertical axis from 0 to 20.]


Influence of ε on the stopping rule

$\epsilon = 0.1, 0.001, 10^{-5}, 10^{-7}$; $\epsilon_n = \epsilon \frac{n}{1000+n}$

[Figure: plot against $n$ (0 to 5000); vertical axis from 0 to 350.]


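Sequential Estimation based on the MLE

$$\hat{p} = \begin{cases} \dfrac{S_\tau}{\tau}, & \tau < \infty, \\ \alpha, & \tau = \infty. \end{cases}$$

- One can show:
  - hitting the upper boundary implies $\hat{p} > \alpha$,
  - hitting the lower boundary implies $\hat{p} < \alpha$.

  Hence,

  $$\sup_p RR_p(\hat{p}) \le \epsilon$$

- Furthermore, there exists a random interval $I_n$ s.t.
  - $I_n$ only depends on $X_1, \ldots, X_n$,
  - $\hat{p} \in I_n$.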
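A small sketch of the estimator, together with a numerical check of the two boundary properties on the first eight boundary values of the talk's example with $\alpha = 0.2$. The function name `p_hat_mle` is hypothetical, and the check covers only these eight steps, not the general claim.

```r
# MLE-based estimate: S_tau / tau if the procedure stopped, alpha otherwise.
p_hat_mle <- function(S_tau, tau, alpha) {
  if (is.finite(tau)) S_tau / tau else alpha
}

alpha <- 0.2
U <- c(2, 2, 2, 3, 3, 3, 3, 3)          # U_n for n = 1..8 (example values)
L <- c(-1, -1, -1, -1, -1, -1, 0, 0)    # L_n for n = 1..8 (example values)
n <- seq_along(U)

# Stopping at the upper boundary means S_tau >= U_tau, so p_hat >= U_tau / tau.
upper_ok <- all(U / n > alpha)
# Stopping at the lower boundary means S_tau <= L_tau; only L_n >= 0 is reachable.
lower_ok <- all((L / n)[L >= 0] < alpha)
```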


Example - Two-way sparse contingency table

1 2 2 1 1 0 1
2 0 0 2 3 0 0
0 1 1 1 2 7 3
1 1 2 0 0 0 1
0 1 1 1 1 0 0

- $H_0$: variables are independent.
- Reject for large values of the likelihood ratio test statistic $T$.
- $T \xrightarrow{d} \chi^2_{(7-1)(5-1)}$ under $H_0$. Based on this: $p = 0.031$.
- Matrix sparse - approximation poor?
- Use parametric bootstrap based on row and column sums.
- Naive test statistic $\hat{p}_{\text{naive}}$ with $n = 1{,}000$ replicates: $p = 0.041 < 0.05$. Probability of reporting $p > 0.05$: roughly 0.08.
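The asymptotic p-value can be recomputed with a few lines of R. This is a sketch under the assumption that empty cells contribute zero to the likelihood ratio statistic; the variable names are illustrative, and the result should land close to the 0.031 quoted above.

```r
# The 5 x 7 table from the example.
dat <- matrix(c(1,2,2,1,1,0,1,
                2,0,0,2,3,0,0,
                0,1,1,1,2,7,3,
                1,1,2,0,0,0,1,
                0,1,1,1,1,0,0), nrow = 5, ncol = 7, byrow = TRUE)

# Expected counts under independence: (row sum * column sum) / total.
expected <- outer(rowSums(dat), colSums(dat)) / sum(dat)

# Likelihood ratio statistic; cells with a zero count contribute 0.
lr_stat <- 2 * sum(ifelse(dat > 0, dat * log(dat / expected), 0))

# Chi-squared approximation with (7 - 1)(5 - 1) degrees of freedom.
df     <- (ncol(dat) - 1) * (nrow(dat) - 1)
p_asym <- pchisq(lr_stat, df = df, lower.tail = FALSE)
```

With 39 observations spread over 35 cells, most expected counts are well below the usual rule-of-thumb thresholds for the chi-squared approximation, which motivates the bootstrap.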


Example - Bootstrap and Sequential Algorithm

> dat <- matrix(c(1,2,2,1,1,0,1, 2,0,0,2,3,0,0, 0,1,1,1,2,7,3, 1,1,2,0,0,0,1,
+               0,1,1,1,1,0,0), nrow=5, ncol=7, byrow=TRUE)
> loglikrat <- function(data){
+   cs <- colSums(data); rs <- rowSums(data); mu <- outer(rs,cs)/sum(rs)
+   2*sum(ifelse(data<=0.5, 0, data*log(data/mu)))
+ }
> resample <- function(data){
+   cs <- colSums(data); rs <- rowSums(data); n <- sum(rs)
+   mu <- outer(rs,cs)/n/n
+   matrix(rmultinom(1,n,c(mu)), nrow=dim(data)[1], ncol=dim(data)[2])
+ }
> t <- loglikrat(dat);
> library(simctest)
> res <- simctest(function(){loglikrat(resample(dat))>=t}, maxsteps=1000)
> res
No decision reached.
Final estimate will be in [ 0.02859135 , 0.07965451 ]
Current estimate of the p.value: 0.041
Number of samples: 1000
> cont(res, steps=10000)
p.value: 0.04035456
Number of samples: 8574


Further Uses of the Algorithm

- Simulation study to evaluate whether a test is liberal/conservative.
- Determining the sample size to achieve a certain power.
- Iterated use:
  - Determining the power of a bootstrap test.
  - Simulation study to evaluate whether a bootstrap test is liberal/conservative.
  - Double bootstrap test.


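Expected Hitting Time

Result: $E_p(\tau) < \infty$ for all $p \ne \alpha$.

Example with $\alpha = 0.05$, $\epsilon_n = \epsilon \frac{n}{1000+n}$:

[Figure, top panel: $E_p(\tau)$ against $p$ for $\epsilon = 0.001$, $10^{-5}$ and $10^{-7}$; vertical axis from 0 to 800.]
[Figure, bottom panel: $E_p(\tau)$ relative to $\mu_p$ against $p$ (0.0 to 1.0); vertical axis from 1.0 to 1.4.]

$\mu_p$ = theoretical lower bound on $E_p(\tau)$.

- Note: $\int_0^1 \mu_p \, dp = \infty$;
- for iterated use: need to limit the number of steps.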


Summary

- Sequential implementation of Monte Carlo tests and computation of p-values.
- Useful when implementing tests in packages.
- After a finite number of steps:
  - $\hat{p}$, or
  - an interval $[\hat{p}^L_n, \hat{p}^U_n]$ in which $\hat{p}$ will lie.
- Guarantee (up to a very small error probability): $\hat{p}$ is on the "correct side" of $\alpha$.
- R-package simctest available on CRAN (efficient implementation with C-code).
- For details see Gandy (2009).


References

Besag, J. & Clifford, P. (1991). Sequential Monte Carlo p-values. Biometrika 78, 301–304.

Davison, A. & Hinkley, D. (1997). Bootstrap methods and their application. Cambridge University Press.

Fay, M. P., Kim, H.-J. & Hachey, M. (2007). On using truncated sequential probability ratio test boundaries for Monte Carlo implementation of hypothesis tests. Journal of Computational & Graphical Statistics 16, 946–967.

Gandy, A. (2009). Sequential implementation of Monte Carlo tests with uniformly bounded resampling risk. Accepted for publication in JASA.

Gleser, L. J. (1996). Comment on Bootstrap Confidence Intervals by T. J. DiCiccio and B. Efron. Statistical Science 11, 219–221.
