Top Banner
SideFinder: Predicting Drug Side Effects Aqeel Ahmed Insight Data Science Fellow Prof. Heather A. Carlson Group College of Pharmacy University of Michigan
14

09 demo aqeel

Jan 15, 2017

Download

Healthcare

phulady
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 09 demo aqeel

SideFinder: Predicting Drug Side Effects

Aqeel AhmedInsight Data Science

FellowProf. Heather A. Carlson GroupCollege of PharmacyUniversity of Michigan

Page 2: 09 demo aqeel

Motivations: Drug Side Effects Prediction

Developing a new drug is challengingRequires billions of dollarsOften more than 13 years of effortsSide effects are one of the main causes of

Drug failure Drug withdrawal

Predictions can save time/money Major public health concern

Estimated: 100,000 deaths per yearSelecting better drug candidates

Page 3: 09 demo aqeel

DEMO: www.SideFinder.info

Page 4: 09 demo aqeel

SIDER: http://sideeffects.embl.deDSigDB: http://tanlab.ucdenver.edu/DSigDB

Data Resources

DSigDB Drug Protein Interactions From: ChEMBL and PubChem

SIDER Drug side effects From: public documents and package inserts

Page 5: 09 demo aqeel

Data Representation Each drug is represented by ~700 dimensional interaction

vector Each element encodes for the interactions (1: interaction)

Each drug is also associated with output side effect profile Aim is to predict side effect profile for a new drug

1 2 3 . . . . . . 705

1 0 1 0 0 0 1 0 1 00 0 1 0 0 1 0 0 1 10 0 1 0 1 0 0 1 1 11 0 1 0 1 1 0 1 1 01 0 1 0 1 0 0 0 1 11 0 1 0 0 0 1 0 1 0

Protein interactions

Drug

s

1 2 3 . . . . . . . . . . 1000

0 0 1 0 0 0 1 0 0 0 1 0 1 01 0 0 1 1 0 0 1 1 0 0 1 1 00 0 0 1 1 0 1 0 0 0 1 0 0 01 0 1 0 0 0 0 1 1 0 0 0 1 00 0 0 1 1 0 1 0 0 0 1 0 1 11 0 0 1 1 0 1 0 0 0 1 0 1 0

Side effects

Page 6: 09 demo aqeel

Machine learning

Built 1000 models (one for each side effect) Random forest classifier Logistic regression classifier

Validation 5-fold cross validation Distributions of ROC AUC, Accuracy

Page 7: 09 demo aqeel

Area Under the Curve (AUC)

Page 8: 09 demo aqeel
Page 9: 09 demo aqeel

Drug associations

Page 10: 09 demo aqeel

Side effects

Protein interactionsFeature vectors for predictions

Page 11: 09 demo aqeel

Accuracy

Page 12: 09 demo aqeel

Roc (AUC)

Page 13: 09 demo aqeel

Literature support for the predicted side effectMefenamic Acid: Treats pain, including menstrual pain

Ref: Drug.com

Lovastatin: Lowering cholesterol

Ref: Indian J Endocrinol Metab. Safety of statins 2013; 17(4): 636–646

Page 14: 09 demo aqeel

Summary Predicting side effect in drug development process

Challenging Very important

Can save time and money Improve health care

Developed a machine learning approach Used drug-protein interactions Can predict many side effects

Future directions Incorporate other biological data

Drug structure features Gene expression profiles

Integrate models in a consensus based approach Group side effects into classes on similarity and severity levels