What size of trial do I need? Peter T. Donnan Professor of Epidemiology and Biostatistics Co-Director of TCTU Statistics for Health Research.

What size of trial do I need?

Peter T. Donnan

Professor of Epidemiology and Biostatistics

Co-Director of TCTU

Statistics for Health Statistics for Health ResearchResearch

What size of study do I need?

10 10,000

or

AnswerAnswer•As large as possible!As large as possible!

•Data is information, so Data is information, so the more data the sounder the more data the sounder the conclusionsthe conclusions

•In real world, data limited In real world, data limited by resources: access to by resources: access to patients, money, time, etc.. patients, money, time, etc..

What size of study do I What size of study do I need?need?

Expand the question:Expand the question:

What size of study do I need to What size of study do I need to answer the question posedanswer the question posed, , given the size of my practice / given the size of my practice / clinic, or no. of samples, given clinic, or no. of samples, given the amount of resources (time the amount of resources (time and money) I have to collect and money) I have to collect the information?the information?

What is the question?What is the question?

•Trial is Comparative: new Trial is Comparative: new drugs (CTIMP), drugs (CTIMP), management of patients, management of patients, etc… etc…

•EfficacyEfficacy

•EquivalenceEquivalence

•Non-inferiorityNon-inferiority

Why bother?Why bother?

1.1.You will not get your You will not get your study past ethics!study past ethics!

2.2.You will not get your You will not get your proposal past a statistical proposal past a statistical review by funders!review by funders!

3.3.It will be difficult to It will be difficult to publish your results! publish your results!

Why bother?Why bother?

•Is the study feasible?Is the study feasible?

•Is likely sample size Is likely sample size enough to show enough to show meaningfulmeaningful differences differences with statistical with statistical significance?significance?

•Does number planned give Does number planned give enough power or need enough power or need larger number? larger number?

OBJECTIVES OBJECTIVES

•Understand issues Understand issues involved in estimating involved in estimating sample sizesample size

•Sample size is dependent Sample size is dependent on on designdesign and and type of type of analysis analysis

•Parameters needed for Parameters needed for sample size estimation sample size estimation

OBJECTIVESOBJECTIVES

•Understand what is Understand what is necessary to carry out necessary to carry out some simple sample size some simple sample size calculationscalculations

•Carry out these Carry out these calculations with softwarecalculations with software

•Note SPSS does not yet Note SPSS does not yet have a sample size have a sample size calculator calculator

What is the measure of What is the measure of outcome?outcome?

•Difference in Change in :Difference in Change in :

•Scores, physiological Scores, physiological measures (BP, Chol), measures (BP, Chol), QOL, hospitalisations, QOL, hospitalisations,

mortality, etc…. mortality, etc….

•Choose PRIMARY OUTCOMEChoose PRIMARY OUTCOME

•A number of secondary but A number of secondary but not too many!not too many!

Intervention – Intervention – Randomised Controlled Randomised Controlled

TrialTrial1) Randomisation by 1) Randomisation by patient-patient-

RCTRCT

Crossover trialCrossover trial

2) Randomisation by 2) Randomisation by practice practice (Cluster (Cluster randomisation) randomisation)

RANDOMISED CONTROLLED RANDOMISED CONTROLLED TRIAL (RCT)TRIAL (RCT)

Gold standard Gold standard method to method to assess assess EfficacyEfficacy of treatmentof treatment

RANDOMISED CONTROLLED RANDOMISED CONTROLLED TRIAL (RCT)TRIAL (RCT)

Random allocation to Random allocation to intervention or control so intervention or control so likely balance of all likely balance of all factors affecting outcomefactors affecting outcome

Hence any difference in Hence any difference in outcome ‘caused’ by the outcome ‘caused’ by the interventionintervention

Randomised Controlled TrialRandomised Controlled Trial

RANDOMISED

Eligible subjects

Intervention

Control

INTERVENTION:INTERVENTION:

•new drug/therapy new drug/therapy

•patient educationpatient education

•Health professional Health professional educationeducation

•organisational changeorganisational change

To improve patient care To improve patient care and/or efficiency of care and/or efficiency of care deliverydelivery

ExampleExample

RCT of new statin vs. oldRCT of new statin vs. old

Evaluate cost-Evaluate cost-effectiveness of new effectiveness of new statinstatin

Randomise eligible Randomise eligible individuals to either individuals to either receive new statin or old receive new statin or old statinstatin

Eligible subjectsEligible subjects

Evaluate cost-Evaluate cost-effectiveness of new effectiveness of new statin on:statin on:

Men ? Aged over 50?Men ? Aged over 50?

Cardiovascular disease?Cardiovascular disease?

Previous MI?Previous MI?

Requires precise INCLUSION Requires precise INCLUSION and EXCLUSION criteria in and EXCLUSION criteria in protocolprotocol

WHAT IS THE OUTCOME?WHAT IS THE OUTCOME?

•Improvement in patients healthImprovement in patients health

•Reduction in CV hospitalisationsReduction in CV hospitalisations

•More explicitly a greater More explicitly a greater reduction in mean lipid levels reduction in mean lipid levels in those receiving the new statin in those receiving the new statin compared with the old statincompared with the old statin

•Reduction in costsReduction in costs

Effect size?Effect size?

Sounds a bit chicken and Sounds a bit chicken and egg!egg!

Likely size of effect: Likely size of effect:

What is the minimum What is the minimum effect size you will accept effect size you will accept as being as being clinicallyclinically or or scientificallyscientifically meaningful? meaningful?

Effect size?Effect size?

Change in Percentage with Change in Percentage with Total Cholesterol < 5 Total Cholesterol < 5 mmol/lmmol/l

• NewNew OldOldDifference Difference

• 40%40% 20%20% 20%20%

• 30%30% 20%20% 10%10%

• 25%25% 20%20% 5%5%

Variability of effect?Variability of effect?

Variability of size of effect: Variability of size of effect:

Obtained from previous Obtained from previous published studies published studies and/orand/or

Obtained from pilot work Obtained from pilot work prior to main studyprior to main study

Variability of effect?Variability of effect?

For a comparison of two For a comparison of two proportions the variability proportions the variability of size of effect is of size of effect is dependent on:dependent on:

1) the size of the study and 1) the size of the study and

2) the size of the 2) the size of the proportions or percentagesproportions or percentages

How many subjects?How many subjects?

•1) Likely size of effect 1) Likely size of effect

•2) Variability of effect 2) Variability of effect

•3) Statistical significance 3) Statistical significance level level

•4) Power4) Power

•5) 1 or 2-sided tests5) 1 or 2-sided tests

Statistical significance or Statistical significance or type I errortype I error

Type I error – rejecting null Type I error – rejecting null hypothesis when it is true: False hypothesis when it is true: False positive (Prob=positive (Prob= ) )

Generally use 5% level (Generally use 5% level ( = 0.05) = 0.05) i.e. accept evidence that null i.e. accept evidence that null hypothesis unlikely with p< 0.05hypothesis unlikely with p< 0.05

May decrease this for multiple May decrease this for multiple testing e.g. with 10 tests accept p testing e.g. with 10 tests accept p < 0.005< 0.005

1 or 2-sided?1 or 2-sided?

Generally use 2-sided Generally use 2-sided significance tests unless significance tests unless VERYVERY strong belief one strong belief one treatment could not be worse treatment could not be worse than the otherthan the other

e.g. Weakest NSAID e.g. Weakest NSAID compared with new Cox-2 compared with new Cox-2 NSAIDNSAID





•4) Power4) Power

•5) 1 or 2-sided tests 5) 1 or 2-sided tests

Power and type II errorPower and type II error

Type II error (False Type II error (False negative):negative):

Not rejecting the null Not rejecting the null hypothesis (non-hypothesis (non-significance) when it is significance) when it is falsefalse

Probability of type II error - Probability of type II error -

Power = 1 - Power = 1 - , typically , typically 80%80%

Type I and Type II errorsType I and Type II errors

ErrorError Prob.Prob. Screening Screening

Type I (Type I ()) False False 1-1-specificity specificity positivepositive

Type IIType II (()) FalseFalse 1-1-sensitivitysensitivity negativenegative

Analogy with sensitivity and Analogy with sensitivity and specificityspecificity

PowerPower

Acceptable power 70% - 99%Acceptable power 70% - 99%

If sample size is not a If sample size is not a problem go for 90% or 95% problem go for 90% or 95% power; power;

If sample size could be If sample size could be problematic go for lower problematic go for lower power but still sufficient power but still sufficient e.g. 80%e.g. 80%

PowerPower

In some studies finite limit In some studies finite limit on the possible size of the on the possible size of the study then question is study then question is rephrased:rephrased:

What likely effect size will I What likely effect size will I be able to detect given a be able to detect given a fixed sample size?fixed sample size?





•4) Power 4) Power


Sample size for difference Sample size for difference in two proportions in two proportions

Number needed for Number needed for comparison depends on comparison depends on statistical test used statistical test used For comparison of two For comparison of two proportions or proportions or percentages use Chi-percentages use Chi-Squared (Squared (22) test) test

Comparison of two Comparison of two proportionsproportions

Number in each arm = Number in each arm =

221

22112

2

pp

p100pp100pzzn

Where pWhere p11 and p and p22 are the are the percentages in group 1 and group percentages in group 1 and group 2 respectively2 respectively

Assume 90% power and 5% Assume 90% power and 5% statistical significance (2-statistical significance (2-

sided)sided)Number in each arm = Number in each arm =

221

2211

ppp100pp100p507.10

n

zz = 1.96 (5% significance level, 2- = 1.96 (5% significance level, 2-sided) zsided) z22ββ = 1.28 ( 90% power) are = 1.28 ( 90% power) are obtained from Normal distributionobtained from Normal distribution

Assume 40% reach lipid target Assume 40% reach lipid target on new statin and 20% on old on new statin and 20% on old

drugdrug

Number in each arm = Number in each arm = 105105

Total = 210Total = 210

22040

80206040507.10n

Comparison of two Comparison of two proportionsproportions

Repeat for different effects Repeat for different effects

•NewNew OldOld DifferenceDifference nn Total Total

•40%40% 20%20% 20%20% 105105 210 210

•30%30% 20%20%10%10%•25%25% 20%20%

5% 5% 19641964 3928 3928

472472 944 944

n.b. Halving effect size increases size by factor 4!n.b. Halving effect size increases size by factor 4!

Increase in sample size with decrease in difference

Group 2 proportion, ÒÂ

0.10 0.12 0.14 0.16 0.18 0.20 0.22 0.24 0.26 0.28 0.30 0.32 0.34

n pe

r gro

up

0

200

400

600

800

1000

1200

1400

1600

1800

Two group ×¿ test of equal proportions (odds ratio = 1) (equal n's)Æ = 0.050 ( 2) ÒÁ= 0.400 Pow= 90

Sample Size per Group

100 200 300 400 500

Power

50

60

70

80

90

100

Two group ×¿ test of equal proportions (odds ratio = 1) (equal n's)Æ = 0.050 ( 2) ÒÁ= 0.400 ÒÂ= 0.300

Increase in power with sample size

Comparison of two means Comparison of two means has a similar formulahas a similar formula

Number in each arm = Number in each arm =

( )( )221

22β2α

x-x

σz+z2=n

Where and are the means in Where and are the means in group 1 and group 2 respectively group 1 and group 2 respectively and and is the assumed standard is the assumed standard deviationdeviation

1x 2x

Allowing for loss to follow-Allowing for loss to follow-up / non-compliance up / non-compliance

The number estimated for The number estimated for statistical purposes may need statistical purposes may need to be inflated if likely that a to be inflated if likely that a proportion will be lost to proportion will be lost to follow-upfollow-upFor example if you know For example if you know approx. 20% will drop-out approx. 20% will drop-out inflate sample size by 1/(1-inflate sample size by 1/(1-0.2) = 1.25 0.2) = 1.25

Software and other sample Software and other sample size estimation size estimation

The formula depends on the The formula depends on the nature of the outcome and nature of the outcome and likely statistical testlikely statistical test

Numerous texts with sample Numerous texts with sample size tables and formulasize tables and formula

Software – nQuery Software – nQuery Advisor®Advisor®

SUMMARYSUMMARY

In planning consider:In planning consider:

design, type of design, type of intervention, outcomes, intervention, outcomes, sample size, power, and sample size, power, and ethics together at the ethics together at the design stagedesign stage

SUMMARYSUMMARY

Invaluable information is Invaluable information is gained from pilot work and gained from pilot work and also more likely to be also more likely to be funded (CSO)funded (CSO)

Sample size follows from Sample size follows from type of analysis which type of analysis which follows from designfollows from design

SUMMARYSUMMARY

Pilot also gives information Pilot also gives information on recruitment rateon recruitment rate

You may need to inflate You may need to inflate sample size due to:sample size due to:

Loss of follow-up/ drop-Loss of follow-up/ drop-outout

Low complianceLow compliance

Remember the checklistRemember the checklist




•4) Power 4) Power


SUMMARYSUMMARY

Remember in Scientific Remember in Scientific Research:Research:

Size Size MattersMatters

What size of trial do I need? Peter T. Donnan Professor of Epidemiology and Biostatistics Co-Director of TCTU Statistics for Health Research.

Documents

size of trial

size of study

sample size estimation

sample size calculator

likely sample size

intervention slide

type of analysissample

efficacy of treatment