Challenges & Opportunities in Clinical Prediction Modeling Where are We? Challenges Key Measures Diagnostic Risk Modeling Case Study Bibliography Challenges & Opportunities in Clinical Prediction Modeling Frank E Harrell Jr Department of Biostatistics Vanderbilt University School of Medicine Nashville, Tennessee Office of Biostatistics US FDA Center for Drug Evaluation and Research MEMTAB 2018 Utrecht NL 2018-07-02
24
Embed
Challenges & Opportunities in Clinical Prediction Modelinghbiostat.org/talks/memtab18.pdf · 2018-07-01 · Regression Modeling Strategies, with Applications to Linear Models, Logistic
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Challenges & Opportunities in ClinicalPrediction Modeling
Frank E Harrell Jr
Department of BiostatisticsVanderbilt University School of Medicine
Nashville, Tennessee
Office of BiostatisticsUS FDA Center for Drug Evaluation and Research
MEMTAB 2018 Utrecht NL 2018-07-02
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
How Did We Get Here?
Statistical foundations: maximum likelihood (Fisher), andBayes
Long tradition of methodology development in statisticsand clinical epidemiology
Thousands of methodologists
Statistical computing platforms
Resampling methods for model validation
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Where are We?
Flexible statistical models
Assume smoothness, not linearity (splines, etc.)Penalized maximum likelihood estimation (shrinkage)Bayesian model, penalizing through prior distributionsSemiparametric models for continuous ordinal Y
Overall modeling strategies
Handling complexityData reductionMissing data, e.g. multiple imputation
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Where are We? continued
Validation methods
Bootstrap and other resampling methodsLess need for external validationValidation of predictive discrimination and absoluteaccuracy (calibration)
Machine leaning, if black box OK
Huge number of methods for assessing added value ofbiomarkers
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Challenges
Role of machine learning, and dealing with hype
Interpreting complex models (. . . and machine learningalgorithms)
Frequentist statistical inference if using penalization
Move more to Bayesian models
No point estimate of risk but a per-subject risk distribution(pointed if N large) taking all uncertainties into accountNo overfitting, just disagreements about priors forregression coefficientsHandling of missing data much less ad hoc
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Challenges: Interactions
Exploratory analysis of interaction largely fails
Interactions are frequently nonlinear and co-linear
Curse of dimensionality and difficulty in pre-specification
Need new approaches; focus on “interaction datareduction” and Bayes
Skeptical priors for interactions effectsStop making dichotomous decisionsInteractions can be “half in” the model
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Challenges, continued
Methodologists keep inventing ad hoc approaches toquantifying and testing added predictive value
Many are statistically inefficient
Many use arbitrary categorization/binning
Many are unnecessary
Many indexes have problems
Suitable only for retrospective sampling (sensitivity,specificity, ROC curves)Arbitrary and statistically insensitiveImproper probability accuracy scoring rules are epidemic
Simpler, traditional methods handle greater complexity!
Interaction between a biomarker and a baseline clinicalvariable
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Key Measures (Frequentist Versions)
Log-likelihood; gives rise to
Logarithmic proper accuracy scoreOverall LR model χ2 (denote by LR)Pseudo R2: 1− exp(−LR/n)
Explained variation
Linear model: SSR / SST or var(X β̂) / var(Y )Extended by Kent and O’Quigley 1988: SST or var(Y ) isdistribution–specificSchemper 2003: excellent paper advocating for measuresbased on absolute rather than squared differences
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Relative Explained Variation
Base model A, added predictors B
LR is the gold standard frequentist method for establishingevidence for some added value
LR is an optimum, general information measure
LR = −n log(1− R2) (for linear models)
For small R2, this is approx. nR2
Adequacy index (Harrell 2015): LRA / LRAB
Proportion of explainable log likelihood that is explainedby AProportion of predictive information
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Relative Explained Variation, continued
Relative R2:SSRA / SSRAB = R2
A / R2AB
SSRj = var(X j β̂j)SSRA / SSRAB : adequacy of A1 - this : proportion of explainable variation explained by B
Can use other measures than var(X β̂)
mean absolute deviation from mean X β̂g -index: Gini’s mean difference for X β̂probability scale, for any of the measures
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Diagnostic Risk Modeling
Assuming (Atypical) Binary Disease Status
Y 1:diseased, 0:normalX vector of subject characteristics (e.g., demographics, risk factors, symptoms)
T vector of test (biomarker, . . . ) outputsα interceptβ vector of coefficients of Xγ vector of coefficients of T
Some Summary Measures for Pre– and Post–testProbabilities
quantile regression (Koenker and Bassett 1978) curves as afunction of pre
overall mean |post – pre|quantiles of post – pre
du50: distribution of post when pre = 0.5diagnostic utility at maximum pre-test uncertainty
Choose X so that pre = 0.5Examine distribution of post at this preSummarize with quantiles, Gini’s mean difference on prob.scaleSpecial case where test is binary (atypical): compute postfor T+ and for T−
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Case Study
Patients undergoing cardiac catheterization at DukeUniversity, for chest pain; n = 2258
Diagnosis of significant coronary artery disease
See BBR Diagnosis Chapter: fharrell.com/links
Base model: age, sex; age and age × sex interactionsnonlinear using splines
Diagnostic Utility of Cholesterol vs. Age, LogitScale; No Cholesterol × Age Interaction
●●●● ● ●●●● ●●●● ● ●●● ●● ● ●● ● ● ●
● ● ●●● ●●● ●
● ● ●● ● ● ●●●●
●● ●
●● ●
●●
−0.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
1520253035404550556065707580
Age
Pos
t−P
re X
β̂Frequency
●
●
●
●
●
●
●
1
2
3
4
5
[6, 9)
[9,33]
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Diagnostic Utility vs. Age, Logit ScaleCholesterol × Age Interaction Included
●● ●●● ● ●●
● ●●●●● ● ●●●●●
●● ●● ● ●● ● ●●● ● ●●●
● ●●●●● ● ● ●● ● ● ●
●● ● ● ●●
● ●●
● ●●
−1.0
−0.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
1520253035404550556065707580
Age
Pos
t−P
re X
β̂Frequency
●
●
●
●
●
●
●
1
2
3
4
[5, 7)
[7, 9)
[9,45]
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Diagnostic Utility vs. Age, Probability ScaleInteraction Included
● ●●● ●●● ●
● ● ●●●●●●●●●●●● ●●●●●● ● ● ●●●● ● ●● ●
● ● ● ●●●
●● ●● ●
−0.2
−0.1
0.0
0.1
0.2
0.3
0.4
0.5
0.6
15 20 25 30 35 40 45 50 55 60 65 70 75 80
Age
Pos
t−P
re P̂
Frequency●
●
●
●
●
●
●
1
2
3
4
[ 5, 7)
[ 7,10)
[10,40]
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Explained Variation vs. Age, Probability Scale
Absolute difference between P̂ and P in post-test model
●● ●● ●
●● ●
● ●●
●● ● ●
●●
●●●● ●
●● ●●●● ●●●●
● ● ●●● ●●●● ● ●●
● ● ● ● ● ●● ● ● ●●
●
●
●●
0.0
0.2
0.4
0.6
15 20 25 30 35 40 45 50 55 60 65 70 75 80
Age
post
−po
st
Frequency●
●
●
●
●
●
1
2
3
4
[5, 8)
[8,28]
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Explained Variation vs. Age and Sex, ProbabilityScale
0.0
0.2
0.4
30 40 50 60 70
Age, Year
post
−po
st sex
male
female
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
Summary
There are many remaining challenges in clinical predictionmodel development
Need general approaches for reliable interaction modelingfor precision medicine/HTE
Bayesian modeling opens vast possibilities
Need to unlearn a lot of ad hoc methods for assessingadded value of biomarkers
Simple regression and likelihood approaches are
more powerfulmore preciseless arbitrary (no binning)more insightfulmore flexible
Need to spend effort translating likelihood and explainedvariation measures for clinicians
Challenges &Opportunities
in ClinicalPredictionModeling
Where areWe?
Challenges
Key Measures
DiagnosticRisk Modeling
Case Study
Bibliography
References
Harrell, F. E. (2015). Regression Modeling Strategies, withApplications to Linear Models, Logistic and Ordinal Regression,and Survival Analysis. Second edition. New York: Springer. isbn:978-3-319-19424-0 (cit. on p. 10).
Kent, J. T. and J. O’Quigley (1988). “Measures of dependence forcensored survival data”. In: Biometrika 75, pp. 525–534 (cit. onp. 9).
Koenker, R. and G. Bassett (1978). “Regression quantiles”. In:Econometrica 46, pp. 33–50 (cit. on p. 13).
Schemper, M. (2003). “Predictive accuracy and explained variation”.In: Stat Med 22, pp. 2299–2308 (cit. on p. 9).
value of R2 with binary response data;measures of average absolute predictionerrors with continuous response.