Top Banner
Data-driven visualization of drug interactions
12

Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Jan 15, 2016

Download

Documents

Ira Evans
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Data-driven visualization of drug interactions

Page 2: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Adverse Drug Events

• Almost 1 million deaths/injuries each year in the US[1]

• Some fraction of ADEs are caused by previously unknown drug-drug interactions• Clinical trials aren’t large enough to detect many potential

interactions

• FDA, WHO, pharmaceutical companies maintain databases of reported[2] ADEs• You can download a sample of the FDA data from the Adverse

Event Reporting System website[3]

• We can analyze the reported data to identify suspicious drug interactions

Copyright 2011 Cloudera Inc. All rights reserved

Page 3: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Challenges in Analyzing Adverse Drug Events

• Biased Sample• Adverse event reporting is voluntary• We don’t see events from patients who took the drugs and

nothing happened

• Correlation != Causation• No controlled trials, some correlations are coincidences

• Requires Advanced Statistical Modeling Skills• Multi-item Gamma Poisson Shrinkage Estimator is used to

score the significance of a drug interactions• The model is too complex to solve directly, we use Expectation

Maximization (EM) to estimate its parameters

Copyright 2011 Cloudera Inc. All rights reserved

Page 4: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

The Hard Problem: Counting

• It is a “small” data problem…• 250,000+ events reported to the FDA annually

• …that explodes when we consider:• Multi-drug, multi-symptom interactions• Analyzed by strata (e.g., month of report, patient age, patient

gender, etc.)• ~1 million reports => ~360 million buckets

• Analysts typically filter the data to only consider a few adverse reactions at a time…

• …but that is not the way of the data scientist

Copyright 2011 Cloudera Inc. All rights reserved

Page 5: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Solving the Hard Problem

• MapReduce on Hadoop• 20 MapReduce jobs• Filter, aggregate, join, aggregate again• Model the resulting data in R• Use MapReduce to apply the model parameters to the data,

score each drug-drug interaction, and then filter the data to obtain the highest scoring interactions

• Visualizing the Results• Even applying a restrictive filter on the scores, we end up with

20,000+ statistically significant drug-drug-reaction triples

Copyright 2011 Cloudera Inc. All rights reserved

Page 6: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

The Drug-Drug Interaction Graph

Copyright 2011 Cloudera Inc. All rights reserved

Page 7: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

HIV Medications

Copyright 2011 Cloudera Inc. All rights reserved

Page 8: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Cancer Medications

Copyright 2011 Cloudera Inc. All rights reserved

Page 9: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Exploring the Graph

Copyright 2011 Cloudera Inc. All rights reserved

Page 10: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Bridges Between Dense Clusters

Copyright 2011 Cloudera Inc. All rights reserved

Page 11: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Copyright 2011 Cloudera Inc. All rights reserved

Page 12: Data-driven visualization of drug interactions. Adverse Drug Events Almost 1 million deaths/injuries each year in the US [1] Some fraction of ADEs are.

Acknowledgments and References

• Thanks to Josh Wills, Director of Data Science at Cloudera, for the data collection and analysis shown here.

• References:• [1] ADE instances/year:

http://www.ahrq.gov/qual/aderia/aderia.htm• [2] AERS reporting site:

http://www.ahrq.gov/qual/aderia/aderia.htm• [3] Download ADE instance data:

http://www.fda.gov/Drugs/GuidanceComplianceRegulatoryInformation/Surveillance/AdverseDrugEffects/ucm082193.htm

• Other resources:• http://www.cloudera.com/blog• http://wiki.cloudera.com/

Copyright 2011 Cloudera Inc. All rights reserved 12