Department of Applied AI and Data Science, City of Hope National Medical Center, Duarte, CA.
Department of Clinical Informatics, City of Hope National Medical Center, Duarte, CA.
JCO Clin Cancer Inform. 2021 Feb;5:155-167. doi: 10.1200/CCI.20.00127.
Thirty-day unplanned readmission is one of the key components in measuring quality in patient care. Risk of readmission in oncology patients may be associated with a wide variety of specific factors including laboratory results and diagnoses, and it is hard to include all such features using traditional approaches such as one-hot encoding in predictive models.
We used clinical embeddings to represent complex medical concepts in lower dimensional spaces. For predictive modeling, we used gradient-boosted trees and adopted the shapley additive explanation framework to offer consistent individualized predictions. We used retrospective inpatient data between 2013 and 2018 with temporal split for training and testing.
Our best performing model predicting readmission at discharge using clinical embeddings showed a testing area under receiver operating characteristic curve of 0.78 (95% CI, 0.77 to 0.80). Use of clinical embeddings led to up to 23.1% gain in area under precision-recall curve and 6% in area under receiver operating characteristic curve. Hematology models had more performance gain over surgery and medical oncology. Our study was the first to develop (1) explainable predictive models for the hematology population and (2) dynamic models to keep track of readmission risk throughout the duration of patient visit.
To our knowledge, our study was the first to develop (1) explainable predictive models for the hematology population and (2) dynamic models to keep track of readmission risk throughout the duration of patient visit.
30 天非计划性再入院是衡量患者护理质量的关键指标之一。肿瘤患者的再入院风险可能与多种特定因素相关,包括实验室结果和诊断,使用传统方法(如独热编码)很难将所有这些特征纳入预测模型。
我们使用临床嵌入来表示低维空间中的复杂医疗概念。对于预测建模,我们使用梯度提升树,并采用 Shapley 可加性解释框架提供一致的个性化预测。我们使用 2013 年至 2018 年的回顾性住院数据进行时间分割训练和测试。
我们使用临床嵌入预测出院时再入院的最佳表现模型在测试中的受试者工作特征曲线下面积为 0.78(95%置信区间,0.77 至 0.80)。使用临床嵌入可使精确召回曲线下面积提高 23.1%,使受试者工作特征曲线下面积提高 6%。血液学模型比手术和肿瘤内科模型的性能增益更大。我们的研究是第一个开发(1)针对血液学人群的可解释预测模型,(2)能够跟踪患者就诊期间再入院风险的动态模型。
据我们所知,我们的研究是第一个开发(1)针对血液学人群的可解释预测模型,(2)能够跟踪患者就诊期间再入院风险的动态模型。