Top Banner
Modelling Relative Survival: Flexible Parametric Models and the Estimation of Net and Crude Mortality. Paul C Lambert 1,2 , Paul W Dickman 2 , Christopher P Nelson 1 , Patrick Royston 3 1 Department of Health Sciences, University of Leicester, UK 2 Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden 3 MRC Clinical Trials Unit, London, UK Seminar 18/3/2009 London School of Hygiene and Tropical Medicine Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 1 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating Net and Crude Mortality 5 Conclusion Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 2
30

Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Apr 27, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Modelling Relative Survival: Flexible Parametric Modelsand the Estimation of Net and Crude Mortality.

Paul C Lambert1,2,Paul W Dickman2, Christopher P Nelson1, Patrick Royston3

1Department of Health Sciences, University of Leicester, UK

2Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden

3MRC Clinical Trials Unit, London, UK

Seminar 18/3/2009London School of Hygiene and Tropical Medicine

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 1

1 Relative Survival

2 Flexible Parametric Survival Models

3 Modelling Relative Survival

4 Estimating Net and Crude Mortality

5 Conclusion

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 2

Page 2: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

1 Relative Survival

2 Flexible Parametric Survival Models

3 Modelling Relative Survival

4 Estimating Net and Crude Mortality

5 Conclusion

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 3

What is Relative Survival

Relative Survival =Observed Survival

Expected SurvivalR(t) = S(t)/S∗(t)

Expected survival obtained from national population life tablesstratified by age, sex, calendar year, other covariates.

Estimate of mortality associated with a disease without requiringinformation on cause of death.

Traditionally estimated in life tables.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 4

Page 3: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Interval Specific Survival - Colon Cancer - Finland - 75+

0.5

0.6

0.7

0.8

0.9

1.0

Inte

rval

Spe

cific

Sur

viva

l

0 2 4 6 8 10Years from Diagnosis

All Cause SurvivalExpected SurivalRelative Survival

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 5

Cumulative Survival - Colon Cancer - Finland - 75+

0.0

0.2

0.4

0.6

0.8

1.0

Sur

viva

l

0 2 4 6 8 10Years from Diagnosis

All Cause SurvivalExpected SurivalRelative Survival

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 6

Page 4: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Excess Mortality

Relative Survival =Observed Survival

Expected SurvivalR(t) = S(t)/S∗(t)

Transforming to the hazard scale gives

h(t) = h∗(t) + λ(t)

ObservedMortality Rate

=Expected

Mortality Rate+

ExcessMortality Rate

We are usually interested in modelling on the log excess hazard scale.

Assume that competing risks of other causes are acting independently.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 7

Why do we use Relative Survival

Estimate of mortality associated with a diagnosis of a particularcancer without the need for cause of death information.

If we had perfect cause-of-death information then treat those that diefrom another cause as censored at their time of death.

The quality of cause-of-death information varies over time, betweentypes of cancer and between regions/countries.

Many cancer registries do not record cause of death.

Cause of death is rarely a simple dichotomy.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 8

Page 5: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Net Survival

Relative survival and cause-specific survival are both estimates of netsurvival.

Net survival is the probability of survival in the hypothetical situation wherethe cancer of interest is the only possible cause of death, i.e in the absenceof other causes.

This is useful for national and international comparisons, changes in survivalover calendar time etc.

However, a patient and the treating clinician are also interested in theprobability of death in the presence of other causes.

I will return to this later.

Assume that the estimated expected survival is appropriate for the group inquestion and that non-cancer mortality is independent of cancer mortality.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 9

1 Relative Survival

2 Flexible Parametric Survival Models

3 Modelling Relative Survival

4 Estimating Net and Crude Mortality

5 Conclusion

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 10

Page 6: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Flexible Parametric Survival Models

Parametric estimate of the survival and hazard functions.

Useful for ‘standard’ and relative survival models.

First introduced by Royston and Parmer[10].

Smooth estimates of the hazard and survival functions.

The hazard function is of particular interest.

Also useful for modelling complex time-dependent functions.

In their simple form they assume proportional hazards.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 11

Flexible Parametric Models: Basic Idea

Consider a Weibull survival curve.

S(t) = exp (−λtγ)

If we transform to the log cumulative hazard scale.

ln [H(t)] = ln[− ln(S(t))]

ln [H(t)] = ln(λ) + γ ln(t)

This is a linear function of ln(t)

Introducing covariates gives

ln [H(t|xi )] = ln(λ) + γ ln(t) + xiβ

Rather than assuming linearity with ln(t) flexible parametric modelsuse restricted cubic splines for ln(t).

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 12

Page 7: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Why model on the log cumulative hazard scale? I

We are used to modeling on the log hazard scale, so why model onthe log cumulative hazard scale?

Under the proportional hazards assumption covariate effects can stillbe interpreted as hazard ratios.

hi (t|xi ) = h0(t) exp (xiβ) Hi (t|xi ) = H0(t) exp (xiβ)

It is easy to transform to the survival and hazard functions.

S(t) = exp [−H(t)] h(t) =d

dtH(t)

The log cumulative hazard as a function of log time is a generallystable function, e.g. in all Weibull models it is a straight line. It iseasier to capture the shape of simple functions.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 13

Cubic Splines

Flexible mathematical functions defined by piecewise polynomials.

The points at which the polynomials join are called knots.

Constraints ensure the function is smooth.

The most common splines used in practice are cubic splines.

Function is forced to have continuous 0th, 1st and 2nd derivatives.

Regression splines can be incorporated into any regression model witha linear predictor.

Restricted cubic splines are forced to be linear beyond the first andlast knots[4].

Restricted cubic splines are used in the models described here.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 14

Page 8: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Flexible Parametric Models: Incorporating Splines

We thus model on the log cumulative hazard scale.

ln[H(t|xi )] = ln [H0(t)] + xiβ

This is a proportional hazards model.Restricted cubic splines with knots, k0, are used to model the logbaseline cumulative hazard.

ln[H(t|xi )] = ηi = s (ln(t)|γ, k0) + xiβ

For example, with 4 knots we can write

ln [H(t|xi )] = ηi = γ0 + γ1z1i + γ2z2i + γ3z3i︸ ︷︷ ︸log baseline

cumulative hazard

+ xiβ︸︷︷︸log hazard

ratios

We are fitting a linear predictor on the log cumulative hazard scale.Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 15

Survival and Hazard Functions

We can transform to the survival scale

S(t|xi ) = exp(− exp(ηi ))

The hazard function is a bit more complex.

h(t|xi ) =ds (ln(t)|γ, k0)

dtexp(ηi )

This involves the derivatives of the restricted cubic splines functions.

However, these are easy to calculate.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 16

Page 9: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Likelihood

The general log-likelihood for a survival model can be written

ln Li = di ln [h(ti )] + ln [S(ti )]

Thus

ln Li = di

(ln[s ′(ln(t)|γ, k0)

]+ ηi

)− exp(ηi )

The likelihood can be maximized (using a few tricks) using Stata’soptimizer, ml.

This is implemented in stpm2. A description of this command willshortly appear in The Stata Journal [8].

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 17

Breast Cancer Example

Data were obtained from the public-use data set of all England andWales cancer registrations between 1 January 1971 and 31 December1990 with follow-up to 31 December 1995[1].

As an example I will investigate the effect of deprivation (basedCarstairs score[1]) on all-cause mortality in women who werediagnosed with breast cancer under the age of 50 years.

There are five deprivation groups ranging from the least deprived(affluent) to the most deprived quintile in the population.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 18

Page 10: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Fitting a Proportional Hazards Model I

Proportional hazards models

. stcox dep2-dep5,

. stpm2 dep2-dep5, df(5) scale(hazard) eform

The df(5) option implies using 4 internal knots and 2 boundaryknots at their default locations.

The scale(hazard) requests the model to be fitted on the logcumulative hazard scale.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 19

Fitting a Proportional Hazards Model II

Proportional hazards models.

. stcox dep2-dep5, noshow nologNo. of subjects = 24889 Number of obs = 24889

LR chi2(4) = 62.19Log likelihood = -73302.997 Prob > chi2 = 0.0000

_t Haz. Ratio Std. Err. z P>|z| [95% Conf. Interval]

dep2 1.048716 .0353999 1.41 0.159 .9815786 1.120445dep3 1.10618 .0383344 2.91 0.004 1.03354 1.183924dep4 1.212892 .0437501 5.35 0.000 1.130104 1.301744dep5 1.309478 .0513313 6.88 0.000 1.212638 1.414051

. stpm2 dep2-dep5, df(5) scale(hazard) eform nologNumber of obs = 24889Wald chi2(4) = 63.32

Log likelihood = -22502.633 Prob > chi2 = 0.0000

exp(b) Std. Err. z P>|z| [95% Conf. Interval]

xbdep2 1.048752 .0354011 1.41 0.158 .9816125 1.120483dep3 1.10615 .0383334 2.91 0.004 1.033513 1.183893dep4 1.212872 .0437493 5.35 0.000 1.130085 1.301722dep5 1.309479 .0513313 6.88 0.000 1.212639 1.414052

_rcs1 2.126897 .0203615 78.83 0.000 2.087361 2.167182_rcs2 .9812977 .0074041 -2.50 0.012 .9668927 .9959173_rcs3 1.057255 .0043746 13.46 0.000 1.048715 1.065863_rcs4 1.005372 .0020877 2.58 0.010 1.001288 1.009472_rcs5 1.002216 .0010203 2.17 0.030 1.000218 1.004218

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 20

Page 11: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Proportional hazards models

The estimated hazard ratios and their 95% confidence intervals arevery similar.

I have yet to find an example of a proportional hazards model, wherethere is a large difference in the estimated hazard ratios.

If you are just interested in hazard ratios in a proportional hazardsmodel, then you can get away with poor modelling of the baselinehazard.

One important exception is when the follow-up time differs betweengroups.

It is of course better to model the baseline hazard well!

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 21

Log Cumulative Hazard

−8

−6

−4

−2

0

Pre

dict

ed lo

g cu

mul

ativ

e ha

zard

0 1 2 3 4 5Time from Diagnosis (years)

Least Deprived234Most Deprived

Deprivation Group

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 22

Page 12: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Log Cumulative Hazard vs log(time)

−8

−6

−4

−2

0

Pre

dict

ed lo

g cu

mul

ativ

e ha

zard

1 2 3 4 5Time from Diagnosis (years)

Least Deprived234Most Deprived

Deprivation Group

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 23

Survival Function

.6

.7

.8

.9

1

Pro

port

ion

Aliv

e

0 1 2 3 4 5Time from Diagnosis (years)

Least Deprived234Most Deprived

Deprivation Group

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 24

Page 13: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Hazard Function ×1000

0

25

50

75

100

125

150

Pre

dict

ed M

orta

lity

Rat

e (p

er 1

000

py)

0 1 2 3 4 5Time from Diagnosis (years)

Least Deprived234Most Deprived

Deprivation Group

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 25

Location of the Knots

The df() option specifies that a certain number of knots are to beused using the default locations.

If df(1) is specified then the log cumulative hazard function isassumed to be a linear function of ln(t), i.e. a Weibull model.

Knots df Centiles1 2 502 3 33, 673 4 25, 50, 754 5 20, 40, 60, 805 6 17, 33, 50, 67, 836 7 14, 29, 43, 57, 71, 867 8 12.5, 25, 37.5, 50, 62.5, 75, 87.58 9 11.1, 22.2, 33.3, 44.4, 55.6, 66.7, 77.8, 88.99 10 10, 20, 30, 40, 50, 60, 70, 80, 90

Default positions of internal knots for modelling the baseline distribution function

and time-dependent effects in Royston-Parmar models. Knots are positions on the

distribution of uncensored log event-timesPaul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 26

Page 14: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Example of different knots for baseline hazard

0

25

50

75

100

Pre

dict

ed M

orta

lity

Rat

e (p

er 1

000

py)

0 1 2 3 4 5Time from Diagnosis (years)

1 df: AIC = 53746.92, BIC = 53788.35

2 df: AIC = 53723.60, BIC = 53771.93

3 df: AIC = 53521.06, BIC = 53576.29

4 df: AIC = 53510.33, BIC = 53572.47

5 df: AIC = 53507.78, BIC = 53576.83

6 df: AIC = 53511.59, BIC = 53587.54

7 df: AIC = 53510.06, BIC = 53592.91

8 df: AIC = 53510.78, BIC = 53600.54

9 df: AIC = 53509.62, BIC = 53606.28

10 df: AIC = 53512.35, BIC = 53615.92

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 27

Effect of number of knots on hazard ratios

13579

1 1.1 1.2 1.3 1.4

Deprivation Group 2

13579

1 1.1 1.2 1.3 1.4

Deprivation Group 3

13579

1 1.1 1.2 1.3 1.4

Deprivation Group 4

13579

1 1.1 1.2 1.3 1.4

Deprivation Group 5

df fo

r S

plin

es

Hazard Ratio

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 28

Page 15: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Effect of location of knots on baseline hazard - randomknots

0

25

50

75

100P

redi

cted

Mor

talit

y R

ate

(per

100

0 py

)

0 1 2 3 4 5Time from Diagnosis (years)

13.7 55.8 60.5 64.3

6.1 10.9 61.8 68.4

4.5 25.5 55.5 87.1

42.4 52.2 84.1 89.8

21.1 26.5 56.4 94.8

11.8 27.7 40.8 72.2

42.2 46.1 87.2 89.4

5.8 67.6 69.9 71.5

9.8 23.2 35.3 59.5

10.2 10.9 57.7 80.7

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 29

Effect of number of knots on baseline survival

.7

.8

.9

1

Pre

dict

ed S

urvi

val

0 1 2 3 4 5Time from Diagnosis (years)

13.7 55.8 60.5 64.3

6.1 10.9 61.8 68.4

4.5 25.5 55.5 87.1

42.4 52.2 84.1 89.8

21.1 26.5 56.4 94.8

11.8 27.7 40.8 72.2

42.2 46.1 87.2 89.4

5.8 67.6 69.9 71.5

9.8 23.2 35.3 59.5

10.2 10.9 57.7 80.7

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 30

Page 16: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Time-Dependent Effects I

A proportional cumulative hazards model can be written

ln [Hi (t|xi)] = ηi = s (ln(t)|γ, k0) + xiβ

There is a new set of spline variables for each time-dependent effect.

If there are D time-dependent effects then

ln [Hi (t|xi )] = s (ln(t)|γ, k0) +D∑

j=1

s (ln(t)|δk , kj)xij + xiβ

The number of spline variables for a particular time-dependent effectwill depend on the number of knots, kj

For any time-dependent effect there is an interaction between thecovariate and the spline variables.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 31

Example

I will look at the time-dependent effect of deprivation. For simplicity Iwill initially consider a model comparing the most deprived with theleast deprived group.

Time-dependent effects are fitted using the tvc() and dftvc()options.

The dftvc() option controls the number of knots in the same wayasfor the baseline hazard. Note that dftvc(1) means that thetime-dependent effect is modelled as a function of log time.

Non-proportional hazards models.

stpm2 dep5, scale(hazard) df(5) tvc(dep5) dftvc(3)

Differences quantified by (time-dependent) hazard ratios, hazarddifferences and survival differences.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 32

Page 17: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Estimated Hazard Functions

.04

.06

.08

.1

.12

haza

rd r

ate

0 1 2 3 4 5Time from Diagnosis (years)

Least deprivedMost deprived

Thinner lines are predictions from proportional hazards model

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 33

Estimated Hazard Ratio

1

1.5

2

2.5

3

3.5

haza

rd r

atio

0 1 2 3 4 5Time from Diagnosis (years)

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 34

Page 18: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Estimated Hazard Difference

0

50

100

150

200

Diff

eren

ce in

haz

ard

rate

(pe

r 10

00 p

erso

n ye

ars)

0 1 2 3 4 5Time from Diagnosis (years)

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 35

Estimated Survival curves

.4

.6

.8

1

Diff

eren

ce in

Sur

viva

l Cur

ves

0 1 2 3 4 5Time from Diagnosis (years)

Least deprivedMost deprived

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 36

Page 19: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Estimated Survival Difference

−0.10

−0.08

−0.06

−0.04

−0.02

0.00

0.02

Diff

eren

ce in

Sur

viva

l Cur

ves

0 1 2 3 4 5Time from Diagnosis (years)

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 37

Knots for Time-dependent effects

1

1.5

2

2.5

3

haza

rd r

atio

0 1 2 3 4 5Time from Diagnosis (years)

1 df: AIC = 20721.61, BIC = 20769.252 df: AIC = 20722.76, BIC = 20776.343 df: AIC = 20724.76, BIC = 20784.304 df: AIC = 20726.75, BIC = 20792.245 df: AIC = 20727.83, BIC = 20799.28

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 38

Page 20: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Flexible Parametric Models: Extensions

Modelling on other scales (probit, cumulative odds).

Age as the time-scale.

Multiple time-scales.

Multiple events.

Time varying covariates (with time-dependent effects).

Adjusted survival curves.

Relative Survival

Estimating crude and net mortality.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 39

1 Relative Survival

2 Flexible Parametric Survival Models

3 Modelling Relative Survival

4 Estimating Net and Crude Mortality

5 Conclusion

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 40

Page 21: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Relative Survival Models

Usually model on the log excess hazard (mortality) scale[3].

h(t) = h∗(t) + exp(xβ)

Parameters are log excess hazard ratios.

Models have proportional excess hazards as a special case, but oftennon-proportional excess hazards are observed.

Non-proportionality modelled piecewise[3], using fractionalpolynomials[6], or splines[5].

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 41

Modelling on the Log Cumulative Excess Hazard Scale

Nelson et al.[9] extended the flexible parametric modelling approachto model on the log cumulative excess hazard scale.

Hi (t) = H∗i (t) + Λi (t)

ln (− ln R(t|xi )) = ln (Λ(t)) = ln (Λ0(t)) + xβ

Relative Survival Models

ln Li = di ln(h∗(ti ) + λ(ti )) + ln(S∗(ti )) + ln(R(ti ))

S∗(ti ) does not depend on the model parameters and can be excludedfrom the likelihood.

Merge in expected mortality rate at time of death, h∗(ti ).

This is important as many of other models for relative survival involvefine splitting of the time-scale and/or numerical integration. Withlarge datasets this can be computationally intensive.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 42

Page 22: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

1 Relative Survival

2 Flexible Parametric Survival Models

3 Modelling Relative Survival

4 Estimating Net and Crude Mortality

5 Conclusion

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 43

Mortality in the absence/presence of other causes

Relative Survival is a measure of survival in the absence of othercauses, i.e. net probability.

1 - R(t) is an estimate of the net probability of death due to cancer.

Net Probabilityof Death

due to Cancer=

Probability of Death in ahypothetical world where the

cancer under study is the onlypossible cause of death

Crude Probabilityof Death

due to Cancer=

Probability of Death in thereal world where you may die

of other causes before thecancer kills you

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 44

Page 23: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Estimation of Crude Mortality

Cronin and Feuer[2] showed how this can be calculated from lifetables.

Calculated separately in age groups.

Mortality may increase dramatically between the lower and upperboundaries of these age groups.

Time-scale split into large (yearly) time intervals.

Flexible parametric approach allows individual level covariates to bemodelled

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 45

Brief Mathematical Details

h(t) - all-cause mortality rate S∗(t) - Expected Survivalh∗(t) - expected mortality rate R(t) - Relative Survivalλ(t) - excess mortality rate S(t) - Overall Survival

h(t) = h∗(t) + λ(t) S(t) = S∗(t)λ(t)

Net Prob of Death = 1− R(t) = 1− exp

(−∫ t

0λ(t)

)

Crude Prob of Death (cancer) =

∫ t

0S∗(t)R(t)λ(t)

Crude Prob of Death (other causes) =

∫ t

0S∗(t)R(t)h∗(t)

Integration performed numerically.

Delta-method to obtain variance.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 46

Page 24: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Example

28,943 men diagnosed with prostate cancer aged 40-90 in Englandand Wales between 1986-1988 inclusive and followed up to 1995[1].

A model is fitted on the log cumulative hazard scale using restrictedcubic splines are used to model the baseline excess hazard (6 knots).

Restricted cubic splines are also used to model the effect of age (4knots).

The effects of age is also allowed to vary over time by incorporatinginteractions between the restricted cubic spline terms for age atdiagnosis and a further set of restricted cubic splines for ln(t) (4knots).

Background mortality is incorporated and so this is a relative survival(excess mortality) model.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 47

Estimated Relative Survival

.2.4

.6.8

1R

elat

ive

Sur

viva

l

0 2 4 6 8 10Years from Diagnosis

45 years55 years65 years75 years85 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 48

Page 25: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Excess Hazard Ratios

0

1

2

3

4E

xces

s H

azar

d R

atio

0 2 4 6 8 10Years from Diagnosis

Age 45 vs Age 65

0

1

2

3

4

Exc

ess

Haz

ard

Rat

io

0 2 4 6 8 10Years from Diagnosis

Age 55 vs Age 65

0

1

2

3

4

Exc

ess

Haz

ard

Rat

io

0 2 4 6 8 10Years from Diagnosis

Age 75 vs Age 65

0

1

2

3

4

Exc

ess

Haz

ard

Rat

io

0 2 4 6 8 10Years from Diagnosis

Age 85 vs Age 65

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 49

Net and Crude Probability of Death due to Cancer

0.0

0.2

0.4

0.6

0.8

1.0

Net

Pro

babi

lity

of D

eath

(1

− R

elat

ive

Sur

viva

l)

0 2 4 6 8 10Years from Diagnosis

(a)

0.0

0.2

0.4

0.6

0.8

1.0

Cru

de P

roba

bilit

y of

Dea

th

0 2 4 6 8 10Years from Diagnosis

(b)

45 years 55 years 65 years75 years 85 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 50

Page 26: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Crude Probability of Death - Age 45

0.0

0.2

0.4

0.6

0.8

1.0P

roba

bilit

y of

Dea

th

0 2 4 6 8 10Years from Diagnosis

Cancer Other All Cause

Age 45 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 51

Crude Probability of Death - Age 75

0.0

0.2

0.4

0.6

0.8

1.0

Pro

babi

lity

of D

eath

0 2 4 6 8 10Years from Diagnosis

Cancer Other All Cause

Age 75 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 52

Page 27: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Crude Probability of Death - Age 85

0.0

0.2

0.4

0.6

0.8

1.0P

roba

bilit

y of

Dea

th

0 2 4 6 8 10Years from Diagnosis

Cancer Other All Cause

Age 85 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 53

Natural Frequencies

55 year old man

Out of 100 people like you - by 5 years

55 will die of cancer2 will die of other causes43 will be alive

85 year old man

Out of 100 people like you - by 5 years

46 will die of cancer32 will die of other causes22 will be alive

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 54

Page 28: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Sensitivity to the number of knots

A potential criticism of these models is the subjectivity in the numberand the location of the knots.

A small sensitivity analysis was carried out where the following modelswere fitted.

Model Baseline Time-dependent age No. of AIC BICdfb dft dfa Parameters

Model (a) 5 3 3 18 97250.11 97399.02Model (b) 8 5 5 39 97059.30 97381.95Model (c) 5 5 3 24 97235.68 97434.23Model (d) 3 3 3 16 97447.35 97579.72Model (e) 8 8 8 81 97105.8 97775.92

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 55

Knot sensitivity analysis

0

.2

.4

.6

.8

1

Cru

de P

roba

bilit

y

0 2 4 6 8 10Years from Diagnosis

Age 45

0

.2

.4

.6

.8

1

Cru

de P

roba

bilit

y

0 2 4 6 8 10Years from Diagnosis

Age 55

0

.2

.4

.6

.8

1

Cru

de P

roba

bilit

y

0 2 4 6 8 10Years from Diagnosis

Age 65

0

.2

.4

.6

.8

1

Cru

de P

roba

bilit

y

0 2 4 6 8 10Years from Diagnosis

Age 75

0

.2

.4

.6

.8

1

Cru

de P

roba

bilit

y

0 2 4 6 8 10Years from Diagnosis

Age 85

Model (a) Model (b) Model (c) Model (d) Model (e)

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 56

Page 29: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

Variance Estimates

0.0

0.2

0.4

0.6

Pro

babi

lity

of D

eath

0 2 4 6 8 10Years from Diagnosis

Age 45 years

0.0

0.2

0.4

0.6

Pro

babi

lity

of D

eath

0 2 4 6 8 10Years from Diagnosis

Age 55 years

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 57

Conclusions

Measuring the cumulative cause-specific mortality in the presence ofother causes is a useful measure.

It is a complement to relative survival.

At present this is estimated separately for subgroups of interest inlife-tables.

Modelling has the advantage of estimating in continuous time, smallerstandard errors, and making predictions for individual patients.

The flexible parametric approach is a useful framework to estimatethese quanities.

It is also possible to obtain these estimates from other relativesurvival models, for example, cure models[7].

Aim now is to move to more recent data to obtain individual levelpredictions using stage, risk factors, biomarkers etc.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 58

Page 30: Modelling Relative Survival: Flexible Parametric Models and the … · 2010. 7. 26. · 1 Relative Survival 2 Flexible Parametric Survival Models 3 Modelling Relative Survival 4 Estimating

References I

[1] M.P. Coleman, P. Babb, P. Damiecki, P. Grosclaude, S. Honjo, J. Jones,G. Knerer, A. Pitard, Quinn.M.J., A. Sloggett, and B. De Stavola. Cancer survivaltrends in England and Wales, 1971-1995: deprivation and NHS Region. Office forNational Statistics, London, 1999.

[2] K. A. Cronin and E. J. Feuer. Cumulative cause-specific mortality for cancerpatients in the presence of other causes: a crude analogue of relative survival.Statistics in Medicine, 19(13):1729–1740, Jul 2000.

[3] Paul W Dickman, Andy Sloggett, Michael Hills, and Timo Hakulinen. Regressionmodels for relative survival. Statistics in Medicine, 23(1):51–64, Jan 2004.

[4] S. Durrleman and R. Simon. Flexible regression models with cubic splines.Statistics in Medicine, 8(5):551–561, May 1989.

[5] Roch Giorgi, Michal Abrahamowicz, Catherine Quantin, Philippe Bolard, JacquesEsteve, Joanny Gouvernet, and Jean Faivre. A relative survival regression modelusing b-spline functions to model non-proportional hazards. Statistics in Medicine,22(17):2767–2784, Sep 2003.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 59

References II

[6] Paul C Lambert, Lucy K Smith, David R Jones, and Johannes L Botha. Additiveand multiplicative covariate regression models for relative survival incorporatingfractional polynomials for time-dependent effects. Statistics in Medicine,24(24):3871–3885, Dec 2005.

[7] Paul C Lambert, John R Thompson, Claire L Weston, and Paul W Dickman.Estimating and modeling the cure fraction in population-based cancer survivalanalysis. Biostatistics, 8(3):576–594, Jul 2007.

[8] P.C. Lambert and P. Royston. Further development of flexible parametric modelsfor survival analysis. The Stata Journa (in press), 2009.

[9] Christopher P Nelson, Paul C Lambert, Iain B Squire, and David R Jones. Flexibleparametric models for relative survival, with application in coronary heart disease.Statistics in Medicine, 26(30):5486–5498, Dec 2007.

[10] Patrick Royston and Mahesh K B Parmar. Flexible parametric proportional-hazardsand proportional-odds models for censored survival data, with application toprognostic modelling and estimation of treatment effects. Statistics in Medicine,21(15):2175–2197, Aug 2002.

Paul C Lambert Relative Survival & Crude Mortality LSHTM 18/3/2009 60