Supplementary material
Table of Contents
Appendix 1 - ICD9 Mapping
Appendix 2 - Italian stop words used in the ICD9-CM TM analysis
Appendix 3 - English stop words used in the Remapped Procedures TM analysis
Appendix 4 - TM on CCS remapped procedures
Appendix 5 - Complete list of the extracted careflows and remapping into clusters
Appendix 6 - Grid search evaluation of CFM parameters
Appendix 7 - CFM only based histories (Admission and SPU events)
Appendix 1 - ICD9 Mapping

Mapped procedure                                       Count of ICD9 codes  Count of observations
Lumpectomy                                             10                   4102
Plastic Reconstruction On The Breast                   26                   3413
Injection Infusion Of Chemotherapeutic Substances      2                    2436
Operations on the hemic and lymphatic system           8                    1995
Cardiovascular System Procedures                       7                    1257
Mastectomy                                             12                   1134
Microscopic examination                                11                   1104
Operations on axillary lymph nodes                     3                    1043
Injection Of Therapeutic Substance And Transfusion     22                   839
Other                                                  20                   672
Biopsy                                                 5                    585
Diagnostic Radiology                                   16                   476
Diagnostic Ultrasound - Metastases                     1                    364
Rehabilitation                                         18                   345
General Surgery                                        54                   289
CAT MRI - Metastases                                   9                    282
Radioisotope Scan And Function Study - Metastases      5                    224
Plastic Surgery                                        13                   165
Diagnostic Ultrasound                                  5                    156
Operations On Respiratory System                       11                   123
Therapeutic Radiology - Metastases                     4                    109
Operations on the nervous system                       14                   103
Operations On The Cardiovascular system                11                   95
Operation On Musculoskeletal System                    10                   59
Endoscopic Procedure On Digestive System               11                   55
Diagnostic Radiology - Metastases                      12                   50
Biopsy - Metastases                                    14                   47
Procedures Related To The Psyche                       5                    41
Nervous System Procedures                              6                    34
Hyperthermia for Cancer Treatment                      2                    19
Operation On The Female Genital Organs                 8                    17
Ophthalmologic And Otologic Procedures                 4                    15
Operation On The Urinary System                        6                    9
Endoscopic Procedure On Digestive System - Metastases  2                    5
Respiratory intubation and mechanical ventilation      1                    2

Endoscopic Procedure On Digestive System - Metastases: 33.22, 33.23
General Surgery: 00.62, 00.64, 04.37, 05.44, 06.89, 08.64, 40.41, 42.92, 43.41, 43.49, 43.99, 44.39, 44.99, 45.73, 45.75, 45.76, 45.93, 46.21, 46.51, 47.01, 47.19, 48.63, 48.74, 49.01, 49.11, 49.29, 49.39, 49.49, 50.22, 50.29, 51.10, 51.22, 51.23, 51.98, 53.03, 53.04, 53.14, 53.17, 53.21, 53.41, 53.49, 53.51, 53.59, 53.61, 54.59, 54.91, 54.93, 55.03, 70.52
Hyperthermia For Cancer Treatment: 93.34, 99.85
Injection Infusion Of Chemotherapeutic Substances: 99.25, 99.28
Injection Of Therapeutic Substance And Transfusion: 09.66, 39.95, 41.01, 41.04, 50.94, 83.98, 99.00, 99.01, 99.03, 99.04, 99.05, 99.14, 99.15, 99.17, 99.18, 99.19, 99.21, 99.23, 99.24, 99.26, 99.29, 99.52
Nervous System Procedures: 03.31, 89.13, 89.14, 89.15, 89.17, 89.19
Operation On Musculoskeletal System: 77.60, 80.59, 81.91, 81.92, 83.32, 83.91, 88.31, 93.08, 93.16, 97.87
Operation On The Female Genital Organs: 65.31, 65.49, 65.53, 65.62, 65.63, 68.29, 69.19, 89.26
Operation On The Urinary System: 05.54, 05.98, 55.51, 56.31, 56.81, 96.49
Operations on axillary lymph nodes: 40.22, 40.23, 40.51
Operations On Respiratory System: 34.09, 34.91, 34.93, 89.37, 89.38, 89.66, 93.18, 93.90, 93.91, 93.96, 93.99
Operations On The Cardiovascular system: 03.70, 38.59, 38.91, 38.93, 38.95, 38.99, 39.27, 39.50, 39.90, 86.07, 99.61
Operations on the hemic and lymphatic system: 04.03, 04.09, 40.19, 40.24, 40.29, 40.52, 40.53, 40.59
Operations on the nervous system: 00.36, 00.42, 03.53, 03.90, 03.91, 03.92, 03.93, 03.94, 03.96, 04.80, 04.81, 04.99, 05.31, 05.39
Ophthalmologic And Otologic Procedures: 21.86, 40.21, 95.02, 95.03
Other: 08.23, 08.61, 08.63, 08.86, 08.87, 08.97, 57.94, 89.01, 89.02, 89.03, 89.05, 89.06, 89.07, 89.08, 89.09, 89.61, 89.65, 93.07, 99.79, 99.84
Plastic Reconstruction On The Breast: 85.82, 08.56, 83.43, 85.31, 85.32, 85.50, 85.51, 85.53, 85.54, 85.85, 85.87, 85.89, 85.93, 85.94, 85.95, 85.96, 86.02, 86.63, 86.69, 86.70, 86.72, 86.75, 86.84, 86.93, 88.94, 88.97
Plastic Surgery: 86.60, 78.49, 86.01, 86.04, 86.09, 86.19, 86.22, 86.23, 86.28, 86.59, 86.89, 93.57, 96.59
Procedures Related To The Psyche: 94.09, 94.12, 94.37, 94.38, 94.42
Radioisotope Scan And Function Study - Metastases: 92.01, 92.09, 92.14, 92.15, 92.19

Appendix 4 - TM on CCS remapped procedures
To choose the number of topics K, we followed the same procedure described in the Methods section for the ICD9-CM case: we performed a grid search on K ranging from 2 to 13 and evaluated consistency, redundancy, importance, and perplexity. Consistency does not change with K and is always greater than 0.75. Redundancy is 0 up to K=6 and never exceeds 0.25. Importance decreases slowly between K=3 and K=7 and more steeply from K=7 onward. Perplexity increases rapidly for K>7. In general, given the small number of words in the dictionary, models with a higher number of topics over-describe the document space.
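The text does not state how perplexity was computed. As a minimal sketch, assuming the common definition (the exponential of the negative average per-token log-likelihood on the evaluated documents), it could be obtained as follows. The function name and its inputs are illustrative assumptions, not part of the original analysis.

```python
import math

def perplexity(doc_log_likelihoods, doc_token_counts):
    """Perplexity under the usual definition (an assumption here):
    exp(-(sum of document log-likelihoods) / (total number of tokens))."""
    total_tokens = sum(doc_token_counts)
    if total_tokens == 0:
        raise ValueError("no tokens to evaluate")
    return math.exp(-sum(doc_log_likelihoods) / total_tokens)

# Sanity check on a hypothetical model that assigns every token the same
# probability 1/8: its perplexity equals the vocabulary size (about 8).
uniform_pp = perplexity(
    [5 * math.log(1 / 8), 3 * math.log(1 / 8)],  # two documents
    [5, 3],                                      # their token counts
)
```

Lower values indicate a better fit, so a fast increase for K>7 argues against the larger models.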
Appendix 6 - Grid search evaluation of CFM parameters
The CFM algorithm parameters were selected following the grid-search approach presented in Section 2.3. In particular, we varied min_support (minimum support) in the range 2-50 and max_length (maximum history length) in the range 3-10. The Figure below shows heatmaps of four indicators for each pair of values of min_support and max_length: the number of careflows, the average number of patients per careflow, the average number of missed events, and the true match rate.

Red values should be avoided, and min_support and max_length chosen accordingly. As the red boxes on the heatmaps show, some indicators are to be maximized (true match rate and mean number of patients per careflow) and others minimized (number of mined careflows and mean number of missed events).

The last heatmap reports a normalized score that combines all four indicators. Each indicator is normalized on a scale from 0 to 1, where 0 corresponds to the red box values and 1 to the lighter values, and the normalized values are then averaged. In this last heatmap, the best score is reached for min_support between 5 and 10 events and max_length larger than 5 events. Choosing 10 as the minimum support yields fewer than one hundred histories.
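The combined score described above can be sketched as follows. This is an illustration under stated assumptions: min-max scaling is assumed as the normalization (the text only says each indicator is mapped to a 0-1 scale with 1 as the preferred end), the indicator names are hypothetical, and the grid values are made up and are not results from the study.

```python
from statistics import mean

# Direction of each indicator as stated in the text:
# True -> to be maximized, False -> to be minimized.
HIGHER_IS_BETTER = {
    "true_match_rate": True,
    "mean_patients_per_careflow": True,
    "n_careflows": False,
    "mean_missed_events": False,
}

def normalize(values, higher_is_better):
    """Min-max scale to [0, 1] so that 1 is always the preferred end."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0 for _ in values]
    scaled = [(v - lo) / (hi - lo) for v in values]
    return scaled if higher_is_better else [1.0 - s for s in scaled]

def combined_scores(grid):
    """grid maps (min_support, max_length) to a dict of the four indicators.
    Returns the mean of the normalized indicators for each parameter pair."""
    keys = list(grid)
    per_indicator = {
        name: normalize([grid[k][name] for k in keys], better)
        for name, better in HIGHER_IS_BETTER.items()
    }
    return {
        k: mean([per_indicator[name][i] for name in HIGHER_IS_BETTER])
        for i, k in enumerate(keys)
    }

# Made-up toy values, for illustration only.
toy_grid = {
    (2, 3): {"true_match_rate": 0.9, "mean_patients_per_careflow": 5,
             "n_careflows": 400, "mean_missed_events": 1.0},
    (10, 6): {"true_match_rate": 0.8, "mean_patients_per_careflow": 40,
              "n_careflows": 90, "mean_missed_events": 1.5},
    (50, 10): {"true_match_rate": 0.5, "mean_patients_per_careflow": 120,
               "n_careflows": 10, "mean_missed_events": 4.0},
}
scores = combined_scores(toy_grid)
best = max(scores, key=scores.get)  # parameter pair with the highest score
```

Averaging gives each indicator equal weight, so a parameter pair that is extreme on two indicators and poor on the other two scores lower than a balanced one.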
Appendix 7 - CFM only based histories (Admission and SPU events)

History    Regrouping