Duke-NUS Medical School, National University of Singapore, Singapore.
Duke-NUS Medical School, National University of Singapore, Singapore; Department of Emergency Medicine, Singapore General Hospital, Singapore; Health Services Research Centre, Singapore Health Services, Singapore.
J Biomed Inform. 2022 May;129:104072. doi: 10.1016/j.jbi.2022.104072. Epub 2022 Apr 11.
Medical decision-making impacts both individual and public health. Clinical scores are commonly used among various decision-making models to determine the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. However, its current framework still leaves room for improvement when addressing unbalanced data of rare events.
Using machine intelligence approaches, we developed AutoScore-Imbalance, which comprises three components: training dataset optimization, sample weight optimization, and adjusted AutoScore. Baseline techniques for performance comparison included the original AutoScore, full logistic regression, stepwise logistic regression, least absolute shrinkage and selection operator (LASSO), full random forest, and random forest with a reduced number of variables. These models were evaluated based on their area under the curve (AUC) in the receiver operating characteristic analysis and balanced accuracy (i.e., mean value of sensitivity and specificity). By utilizing a publicly accessible dataset from Beth Israel Deaconess Medical Center, we assessed the proposed model and baseline approaches to predict inpatient mortality.
AutoScore-Imbalance outperformed baselines in terms of AUC and balanced accuracy. The nine-variable AutoScore-Imbalance sub-model achieved the highest AUC of 0.786 (0.732-0.839), while the eleven-variable original AutoScore obtained an AUC of 0.723 (0.663-0.783), and the logistic regression with 21 variables obtained an AUC of 0.743 (0.685-0.801). The AutoScore-Imbalance sub-model (using a down-sampling algorithm) yielded an AUC of 0.771 (0.718-0.823) with only five variables, demonstrating a good balance between performance and variable sparsity. Furthermore, AutoScore-Imbalance obtained the highest balanced accuracy of 0.757 (0.702-0.805), compared to 0.698 (0.643-0.753) by the original AutoScore and the maximum of 0.720 (0.664-0.769) by other baseline models.
We have developed an interpretable tool to handle clinical data imbalance, presented its structure, and demonstrated its superiority over baselines. The AutoScore-Imbalance tool can be applied to highly unbalanced datasets to gain further insight into rare medical events and facilitate real-world clinical decision-making.
医学决策会影响个人和公共健康。临床评分常用于各种决策模型,以确定床边疾病恶化的程度。AutoScore 是一种基于机器学习和广义线性模型的有用的临床评分生成器。然而,当涉及稀有事件的不平衡数据时,其当前框架仍有改进的空间。
我们使用机器智能方法开发了 AutoScore-Imbalance,它由三个组件组成:训练数据集优化、样本权重优化和调整后的 AutoScore。用于性能比较的基线技术包括原始 AutoScore、全逻辑回归、逐步逻辑回归、最小绝对值收缩和选择算子 (LASSO)、全随机森林和变量数量减少的随机森林。这些模型根据接收器操作特征分析中的曲线下面积 (AUC) 和平衡准确性(即灵敏度和特异性的平均值)进行评估。通过利用 Beth Israel Deaconess Medical Center 的公开数据集,我们评估了所提出的模型和基线方法来预测住院患者死亡率。
AutoScore-Imbalance 在 AUC 和平衡准确性方面优于基线。九变量 AutoScore-Imbalance 子模型获得了最高的 AUC 值 0.786(0.732-0.839),而十一变量原始 AutoScore 的 AUC 值为 0.723(0.663-0.783),具有 21 个变量的逻辑回归获得的 AUC 值为 0.743(0.685-0.801)。使用下采样算法的 AutoScore-Imbalance 子模型的 AUC 值为 0.771(0.718-0.823),仅使用五个变量,在性能和变量稀疏性之间取得了良好的平衡。此外,AutoScore-Imbalance 获得了最高的平衡准确性 0.757(0.702-0.805),而原始 AutoScore 为 0.698(0.643-0.753),其他基线模型的最大值为 0.720(0.664-0.769)。
我们开发了一种可解释的工具来处理临床数据不平衡问题,介绍了其结构,并证明了其优于基线的性能。AutoScore-Imbalance 工具可应用于高度不平衡的数据集,以深入了解罕见的医疗事件并促进现实世界中的临床决策。