Page 1
Measuring covariate data_Presentation (November 14, 2007) 1
Measuring covariate data in Measuring covariate data in subsets of study populations: subsets of study populations:
Design optionsDesign options
Jean-François Boivin, MD, ScD
McGill University
19 August 2007
Page 3
3
16th International Conference on Pharmacoepidemiology
Barcelona 2000
Page 4
4
What about missing covariate data?
Page 5
5
Do not research that topic
Option #1
Page 6
6
• Conduct study without covariates
• Scientifically reasonable for certain questions
• Example: Sharpe et al. 2000
Option #2
Page 7
7
British Journal of Cancer 2002The effects of tricyclic antidepressants
on breast cancer risk
• Genotoxicity in Drosophila
• Comparison of antidepressants:– 6 genotoxic vs 4 nongenotoxic
• Confounding unlikely
Page 8
8
Option #3
“Confounding by other determinants was studied in analyses with data obtained by interviewing samples of subjects…”
Page 9
9
List 4 - 6 different sampling strategies:
“Confounding by other determinants was studied in analyses with data obtained by interviewing samples of subjects…”
a) ?
b) ?
c) ?
d) ?
Page 12
12
Two-stage sampling
Page 13
13
Entire population (=truth)
OR=0.5
OR=0.5
OR=2.5
Obese
Not obese
All
E+ E-
D+
D+
D+
D-
D-
D-
12,000 140
10,200 10,400
22,200 10,540 32,740
2,000 4010,000 100
200 40010,000 10,000
2,200 44020,000 10,100
Page 14
14
Obese
Not obese
All
E+ E-
D+
D+
D-
D-
22,200 10,540
not available
computerized databases
2,200 44020,000 10,100
D+D-
Page 15
15
Two-stage sampling
Page 16
16
Obese
Not obese
All
E+ E-
250/ 250/250/ 250/ 2,200 440
20,000 10,100
32,740
227 23125 2
23 227125 248
D+D-
D+D-
D+D-
Two-stage sampling
OR1 biased
OR2 biased
250 x 250 250 x 250 = 1
Page 17
17
White. AJE 1982
Walker. Biometrics 1982
Cain, Breslow. AJE 1988
Weinberg, Wacholder. Biometrics 1990
Weinberg, Sandler. AJE 1991
Statistical analysis; further design issues
Page 19
19
Option 1:
Option 2:
Option 3:
Option 4:
No study
No covariate measurement
2-stage sampling
Case only measurement
Page 20
20
Ray et al.Archives of Internal Medicine 1991
Page 21
21
Cyclic antidepressants and the risk of hip fracture
Page 22
22
E+ E-
All
RR=0.5
RR=0.5
RR=
D+D-
D+D-
D+D-
All
Not obese
Obese
RR=0.5
N1=? N2=?
RR=0.5
N3=? N4=?
RR=
RR=0.5
N1=1,000
N2=1,000
RR=0.5
N3=1,000 N4=1,000
RR=0.5
RR=0.5
N1=1,000
N2=1,000 cross-product ratio =1
RR=0.5
N3=1,000 N4=1,000
RR=
RR=0.5
N1=1,000
N2=1,000
RR=0.5
N3=1,000 N4=1,000
RR=
Confounding: Quick review
Page 23
23
Obese
Not obese
All
D+
D+
D+
D-
D-
D-
OR=0.5
OR=0.5
OR=
E+ E-
OR=0.5500 1,500
OR=0.51,000 3,000
OR=
OR=0.5
OR=0.5
OR=0.5
OR=0.5
cross-product ratio =1
OR=0.5
OR=
Case-control study
Page 24
24
Cyclic antidepressants and the risk of hip fracture
Page 25
25
E+ E-
D+Obese
Not obese
All
D-
D+D-
D+D-
2,200 440 computerized database20,000 10,100
22,200 10,540
medical record review
2,200 440 computerized database20,000 10,100
22,200 10,540
2,000 400
? ?
200 40? ?
2,200 44020,000 10,10022,200 10,540
Covariate data on cases only
Page 26
26
E+ E-
D+Obese
Not obese
All
D-
D+D-
D+D-
2,000 400
? ?
200 40? ?
2,200 44020,000 10,10022,200 10,540
OR1
OR2
•assume OR1 = OR2
•then: cross-product ratio =1 implies no confounding
Covariate data on cases only
Page 27
27
What if confounding seems to be present?
Extensions
Page 29
29
Option 1: No study
Option 2: No covariate measurement
Option 3: 2-stage sampling
Option 4: Case only measurements
Suissa, Edwardes. 1997
Page 30
30
Confounder data on cases only
Obese
Not obese
E+ E-
D+D-
2,000 220? ?
200 220? ?
Cross-product ratio =10
Confounding plausible
D+D-
Page 31
31
Epidemiology 1997
• Extensions of Ray’s method to presence of confounding
• Requires additional data from external sources
Page 32
32
Smoker
Nonsmoker
All
E+ E-
D+
D+
D+
D-
D-
D-
Theophylline
17 13 30956 3,154 4,080
14 5 19
3 8 11
14 5 19
24% of 4,080
3 8 11
76% of 4,080
14 5 19
24% of 4,080
obtained from population survey
3 8 11
76% of 4,080
Confounding; no interaction
Page 33
33
• Extensions of Ray’s method to presence of interaction
• Requires further additional data from external sources
Suissa, Edwardes. 1997
Page 34
34
No interaction
OR=0.5
OR=0.5
Obese
Not obese
E+ E-
D+
D+
D-
D-
12,000 140
10,200 10,400
2,000 4010,000 100
200 40010,000 10,000
Page 35
35
Option 1: No study
Option 2: No covariate measurement
Option 3: 2-stage sampling
Option 4: Case only measurements
Suissa, Edwardes. 1997
Multi-stage sampling
Partial questionnaires
Propensity score adjustments
Others:
Page 38
38
Monotone missingness
Page 39
39
Wacholder S, et al.
Page 40
40
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5
6
7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3 4
5
6
7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4 5
6
7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1 2 3
4
5
6
7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5 6
7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5
6 7
8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5
6
7 8
9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5
6
7
8 9
10
…
n
Cov 1 2 3 4 5 6 7 8
Subject 1
2
3
4
5
6
7
8
9 10 …
n
Page 41
41
Wacholder S, et al.
Restricted to a small number of discrete covariates
Page 42
42
Methodologic research
Stürmer et al. AJE 2005, 2007
Propensity score calibration
Page 43
43
• Summarizes information about several covariates into a single number
• Used for matching, stratification, regression
Propensity score
Page 44
44
• Main cohort: selected covariates-“error-prone” scores estimated - regression coefficients estimated
• Sample: additional covariates-gold standard scores-regression calibration
• Advantage: multivariable technique
Stürmer et al. 2005
Page 45
45
“Until the validity and limitation of… [propensity score calibration] have been assessed in different settings, the method should be seen as a sensitivity analysis.”
Stürmer et al. 2005
Page 48
48
Stage 1: 278 cases in 4561 pregnancies
Stage 2: 244 cases + 728 non cases
Page 50
50
“Relatively few examples of two-and three-phase sampling designs for case-control studies have appeared to date in the epidemiologic literature.This is unfortunate, because the stratified designs are easy to implement and can result in substantial savings.”
NE Breslow (2000)
Page 51
51
Consent for second-stage interviews:• Cases: 49%• Controls: 39%