Yoo Junsang, Kim Si-Ho, Hur Sujeong, Ha Juhyung, Huh Kyungmin, Cha Won Chul
Department of Nursing, College of Nursing, Sahmyook University, Seoul, Republic of Korea.
Division of Infectious Disease, Samsung Changwon Hospital, Sungkyunkwan University School of Medicine, Changwon, Republic of Korea.
JMIR Med Inform. 2021 Jul 26;9(7):e24651. doi: 10.2196/24651.
Appropriate empirical treatment for candidemia is associated with reduced mortality; however, the timely diagnosis of candidemia in patients with sepsis remains poor.
We aimed to use machine learning algorithms to develop and validate a candidemia prediction model for patients with cancer.
We conducted a single-center retrospective study using the cancer registry of a tertiary academic hospital. Adult patients diagnosed with malignancies between January 2010 and December 2018 were included. Our study outcome was the prediction of candidemia events. A stratified undersampling method was used to extract control data for algorithm learning. Multiple models were developed-a combination of 4 variable groups and 5 algorithms (auto-machine learning, deep neural network, gradient boosting, logistic regression, and random forest). The model with the largest area under the receiver operating characteristic curve (AUROC) was selected as the Candida detection (CanDETEC) model after comparing its performance indexes with those of the Candida Score Model.
From a total of 273,380 blood cultures from 186,404 registered patients with cancer, we extracted 501 records of candidemia events and 2000 records as control data. Performance among the different models varied (AUROC 0.771- 0.889), with all models demonstrating superior performance to that of the Candida Score (AUROC 0.677). The random forest model performed the best (AUROC 0.889, 95% CI 0.888-0.889); therefore, it was selected as the CanDETEC model.
The CanDETEC model predicted candidemia in patients with cancer with high discriminative power. This algorithm could be used for the timely diagnosis and appropriate empirical treatment of candidemia.
念珠菌血症的恰当经验性治疗与死亡率降低相关;然而,脓毒症患者中念珠菌血症的及时诊断情况仍然较差。
我们旨在使用机器学习算法开发并验证一种针对癌症患者的念珠菌血症预测模型。
我们利用一家三级学术医院的癌症登记处进行了一项单中心回顾性研究。纳入2010年1月至2018年12月期间诊断为恶性肿瘤的成年患者。我们的研究结果是对念珠菌血症事件的预测。采用分层欠采样方法提取用于算法学习的对照数据。开发了多个模型——4个变量组和5种算法(自动机器学习、深度神经网络、梯度提升、逻辑回归和随机森林)的组合。在将其性能指标与念珠菌评分模型的指标进行比较后,选择受试者工作特征曲线下面积(AUROC)最大的模型作为念珠菌检测(CanDETEC)模型。
从186,404名登记的癌症患者的总共273,380份血培养样本中,我们提取了501条念珠菌血症事件记录和2000条记录作为对照数据。不同模型的性能有所不同(AUROC为0.771 - 0.889),所有模型的表现均优于念珠菌评分(AUROC为0.677)。随机森林模型表现最佳(AUROC为0.889,95%CI为0.888 - 0.889);因此,它被选为CanDETEC模型。
CanDETEC模型对癌症患者的念珠菌血症具有较高的判别能力。该算法可用于念珠菌血症的及时诊断和恰当的经验性治疗。