ProSanos Corporation Confidential and Proprietary ProSanos Corporation Confidential and Proprietary Modeling and clustering Modeling and clustering disease progression for disease progression for correlation with correlation with genetic and demographic genetic and demographic factors factors Robert Kingan Robert Kingan
24
Embed
ProSanos Corporation Confidential and Proprietary Modeling and clustering disease progression for correlation with genetic and demographic factors Robert.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
Modeling and clustering Modeling and clustering disease progression for disease progression for
correlation with genetic and correlation with genetic and demographic factorsdemographic factors
Robert KinganRobert Kingan
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
What is SSIFT?
“To address […] common diseases, which include schizophrenia, depression, and breast cancer, it is essential to incorporate observations of the clinical progression of the disease to refine the definition of phenotype.” – Michael N. Liebman, U. Penn.
Yes, but what is SSIFT?– SSIFT = Stratification and Synchronization Inference
Technology
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
What is SSIFT?
Stratification: Dividing a patient population into groups which are meaningful for diagnosis, prognosis, treatment selection, or genotype-phenotype correlation.
Synchronization: Recognizing a pattern of disease progression, regardless of disease stage for a particular patient.
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT overview
Assumptions—what is SSIFT-ableOther constraints on data selectionOutline of technique
– Identifying variables– Modeling disease progression– Parameterizing different models– Clustering patients by progression patterns– Interpreting the results
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
Pattern of disease progression
Time
Dis
ease
mar
ker
initial value
final value
period of change
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT workflow
Survey the data
Select useful variables
Fit disease progression models
Construct feature vectors
Assign feature weights
Cluster weighted feature vectors
Evaluate the clustering results
Complete?No
Yes
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT workflow
Patient 1
0
0.5
1
1.5
2
2.5
3
3.5
4
1 2 3 4 5 6 7 8 9 10
Patient 2
0
1
2
3
4
5
6
1 2 3 4 5 6 7 8 9 10
Patient 3
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6 7 8 9 10
Patient 4
0
1
2
3
4
5
6
1 2 3 4 5 6 7 8 9 10
Patient 6
0
0.5
1
1.5
2
2.5
3
3.5
1 2 3 4 5 6 7 8 9 10
Patient 7
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6 7 8 9 10
Patient 8
0
0.5
1
1.5
2
2.5
3
1 2 3 4 5 6 7 8 9 10
Patient 9
0
1
2
3
4
5
6
1 2 3 4 5 6 7 8 9 10
SSIFTPatient 5
0
0.5
1
1.5
2
2.5
3
3.5
1 2 3 4 5 6 7 8 9 10
Patient 10
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6 7 8 9 10
Group A: Patients 1,3,7,10
0
1
2
3
4
5
1 4 7
10 13 16 19 22 25 28 31 34 37 40 43 46 49
Time (years)
Mar
ker L
evel
Group B: Patients 2,4,5,6,8,9
0
1
2
3
4
5
6
1 4 7
10 13 16 19 22 25 28 31 34 37 40 43 46 49
Time (years)
Mar
ker L
evel
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT curve types
)(
)(
1)()(ˆ
t
t
e
eabaty
cty )(ˆ
mtyty 0)(ˆ
)1ln()(ˆ )(22
00
0
ttety
)1ln()(ˆ )(22
00
0
ttety
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
Converting parameters
Logistic
Constant
Linear
Early stable
Late stable
)*)2/)(
,,,(),,,( 4)(
m
ybabamba ab
),,,(),,,( NULLNULLccmba
)*
,),(ˆ),(ˆ(),,,( 01 m
yymtytymba n
)*
,),(ˆ,(),,,( 0
y
ttymba n
)*
,,),(ˆ(),,,( 01
y
ttymba
y* = population mean, t1=first time point, tn=last time point
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
Modified Mahalanobis distance
Tqpqp vvvvqpd )()(),( 1
Tqpqpqp vvvvqpd ))()((),( 11
21
Tqp
Tqpqp vvQggQvvqpd )())()(()(),( 2
1
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT workflow
Survey the data
Select useful variables
Fit disease progression models
Construct feature vectors
Assign feature weights
Cluster weighted feature vectors
Evaluate the clustering results
Complete?No
Yes
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
SSIFT workflow
Survey the data
Select useful variables
Fit disease progression models
Construct feature vectors
Assign feature weights
Cluster weighted feature vectors
Evaluate the clustering results
Complete?No
Yes
Correlate results with:•demographic data•genetic data
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary
Application of SSIFT to NIDDK
About NIDDKSSIFT and transplant dataVariable selectionModelingResults
ProSanos Corporation Confidential and ProprietaryProSanos Corporation Confidential and Proprietary