Top Banner
Python에서 EMR데이터 (생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM
33

EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Jun 05, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Python에서EMR데이터 (생존)분석

따라하기

Soo-Heang Eo, Lead Data ScientistHuToM

Page 2: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Background

Page 3: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

EMR vs. EHR?

https://doctors.practo.com/emr-vs-ehr-whats-difference/ https://sooyongshin.wordpress.com/2017/05/14/healthcare-data-data-data-2-clinical-data-data-in-emr/

Page 4: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

EMR (Electric Medical Records)

https://en.wikipedia.org/wiki/Electronic_health_record

Systematized collection of patient and population electronically-stored health information in a digital format including• Demographics• medical history• medication and allergies• immunization status• laboratory test results• radiology images• vital signs• personal statistics like age and weight • billing information

Page 5: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Publicly Available EMR Dataset

https://github.com/beamandrew/medical-data

Page 6: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Publicly Available EMR Dataset

https://www.nature.com/articles/sdata201635.pdf

MIMIC Critical Care Data I2B2 Clinical Notes (NLP) Data

https://www.i2b2.org/NLP/DataSets/Main.php

Page 7: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Publicly Available EMR Dataset

https://www.nature.com/articles/sdata201635.pdf

MIMIC Critical Care Data I2B2 Clinical Notes Data

https://www.i2b2.org/NLP/DataSets/Main.php

Page 8: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

MIMIC

Data Analysis

Page 9: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

MIMIC III Dataset

Medical Information Mart for Intensive Care• Single Centre: Beth Israel Deaconess

Medical Centre• U.S. Based (Boston, MA)• Has MICU, SICU, CCU, CSRU,

TSICU, ...• Detailed in-ICU information derived from:

electronic medical records, critical care information systems, lab system,...

• Limited out-of-ICU information (social security death masterfile)

• >60,000 ICU stays, >40,000 patients (2002-2012)

HOSPITAL

BEDSIDE MONITORING

• Vital Signs

• Trends

• Alarms

CHART

• Fluids

• Medications

• Progress

Notes

ICUMICU SICU CCU CVICU NICU

TESTS

• Laboratory• Microbiology

ORDERS

• Provider Order Entry (POE)

BILLING

• ICD9

• DRG• Procedures (CPT)

DEMOGRAPHICS

• Admission / Discharge Dates

• Date of Birth / Date of Death

• Religion / Ethnicity / Marital Status

NOTES AND REPORTS

• Discharge Summaries

• Radiology (X-Ray, CT, MRI,

Ultrasound)

• Cardiology (ECHO, ECG)

EXTERNAL

• Social Security Death Index

DE-IDENTIFICATION

DATE SHIFTING

FORMAT CONVERSION

USER FEEDBACK

& CORRECTIONS

DATA ARCHIVE MIMIC-III DATABASE

https://github.com/MIT-LCP/mimic-workshop/tree/master/intro_to_mimic

Page 10: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

ACESS MIMIC Dataset

https://mimic.physionet.org/gettingstarted/access/

Page 11: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

ACESS MIMIC Dataset

https://mimic.physionet.org/gettingstarted/access/

Page 12: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

DOWNLOAD MIMIC dataset

https://github.com/SpiroGanas/MIMIC3py/tree/master/MIMIC3py

Page 13: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

ACESS MIMIC Dataset (EASY WAY)

https://physionet.org/works/MIMICIIIClinicalDatabaseDemo/

Page 14: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Tools for MIMIC

https://github.com/MIT-LCP/mimic-workshop

Page 15: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC

Page 16: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC

Page 17: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC

Page 18: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC

https://github.com/MIT-LCP/mimic-code/blob/master/notebooks/tableone-demo.ipynb

Page 19: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC

Page 20: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC (T-SNE)

Page 21: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC (T-SNE)

Page 22: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC (T-SNE)

Page 23: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Navigating MIMIC (T-SNE)

Page 24: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM
Page 25: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Clinical NLP using MIMIC

https://towardsdatascience.com/introduction-to-clinical-natural-language-processing-predicting-hospital-readmission-with-1736d52bc709

•ADMISSIONS— a table containing

admission and discharge dates (has a

unique identifier HADM_ID for each

admission)

•NOTEEVENTS— contains all notes for each

hospitalization (links with HADM_ID)

Page 26: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Clinical NLP using MIMIC

Page 27: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Clinical NLP using MIMIC

https://arxiv.org/pdf/1904.03323.pdf

Page 28: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Clinical NLP using MIMIC

https://github.com/EmilyAlsentzer/clinicalBERT

Page 29: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Discussion

Page 30: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

NLP

Page 31: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

DE-Identification

Page 32: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Advanced Analysis (Survival Analysis)

https://github.com/sebp/scikit-survival

https://github.com/CamDavidsonPilon/lifelines

Tanigawa and Pfhol (2017)

Page 33: EMR Analysis Python 190515 - GitHub Pages · Python에서 EMR데이터(생존)분석 따라하기 Soo-Heang Eo, Lead Data Scientist HuToM

Surgical Data Science

https://www.nature.com/articles/s41551-017-0132-7

The age of computer integrated surgery (CIS) with patient specific data

Vedula and Hager (2017, Innov Surg Sci)