Top Banner
German Breast Cancer Survival Analysis David Schuler, Ankur Verma, Udyot Kumar, Uma Lalitha- Chockalingham
25

Survival Analysis Project

Feb 09, 2017

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Survival Analysis Project

German Breast Cancer Survival

AnalysisDavid Schuler, Ankur Verma, Udyot Kumar,

Uma Lalitha-Chockalingham

Page 2: Survival Analysis Project

Objectives

●Understand the length of time between breast cancer diagnoses and specific events

●Understand what factors play a role in determining these lengths

Page 3: Survival Analysis Project

About Survival Analysis

●Predicting the time until an event of interest occurs

●Applications in Medicine, Manufacturing, Sociology, Sports, and many more

●Right Censored Data – an observation where the event has not yet occurred

● Survival Function - probability that at a given time, t, an event of interest has not yet occurred

Page 4: Survival Analysis Project

Kaplan-Meier Estimator

●Non-parametric estimator of the survival function

●Time on the X-axis

●Percentage surviving on Y axis

●Tick marks represent right-censored observations

Page 5: Survival Analysis Project

Cox Proportional Hazard Regression●Used to look at the relationship between the survival of a patient

and various explanatory variables

●Each explanatory variable is given a coefficient

○ HR = 1 : No effect

○ HR < 1 : Reduction in hazard ( Death)

○ HR > 1 : Increase in Hazard ( Death)

Page 6: Survival Analysis Project

German Breast Cancer Data●Retrieved from UMass Amherst’s Statistics website

●Data collected from clinical trials performed by the German Breast Cancer Study Group

●Total of 686 observations conducted between July 1984 and December 1989● 16 variables, including censoring and time-length fields for death and cancer

recurrence

Page 7: Survival Analysis Project
Page 8: Survival Analysis Project

Exploratory Data Analysis

Page 9: Survival Analysis Project
Page 10: Survival Analysis Project

Correlation

● menopause and age

● Estrogen and Progesterone

Page 11: Survival Analysis Project

Preliminary Survival Analysis - KM Curves

Page 12: Survival Analysis Project

Preliminary Survival Analysis - KM Curves

Page 13: Survival Analysis Project
Page 14: Survival Analysis Project
Page 15: Survival Analysis Project

Probability density function f(t)

Survival function S(t) = P(T>=t)

Hazard function h(t) = f(t) / S(t)

A way to compare two hazard functions:Hazard ratio : HR(t) = h0 (t) / h1(t)

Proportional hazard assumption : The hazard ratio does not vary with time, i.e. HR(t) = HR

This is an important assumption for Cox PH model

Page 16: Survival Analysis Project

Preliminary Survival Analysis - Cox PH

○ HR = 1 : No effect

○ HR < 1 : Reduction in hazard ( Death)

○ HR > 1 : Increase in Hazard ( Death)

● Age1 = [21,30]● Age2 = (30,50]● Age3 =(50,80]

Page 17: Survival Analysis Project

COX Regression Modelling with phreg in SAS

Page 18: Survival Analysis Project

Survival Curve Estimate ( Test Data - 3 rows)

Page 19: Survival Analysis Project

Comparing R, Python & SAS Code

Page 20: Survival Analysis Project

Comparing R, Python, & SAS: Plots

Page 21: Survival Analysis Project

SAS Plot

Page 22: Survival Analysis Project

Comparing Regression Models: Tumor Grade

Page 23: Survival Analysis Project

Comparing Regression Models: Hormone

Page 24: Survival Analysis Project

Comparing Regression Models: Menopause

Page 25: Survival Analysis Project

References

●http://www.medicine.ox.ac.uk/bandolier/painres/download/whatis/cox_model.pdf

●https://media.readthedocs.org/pdf/lifelines/latest/lifelines.pdf

●https://www.cscu.cornell.edu/news/statnews/stnews78.pdf