Valderrama Camilo E, Niven Daniel J, Stelfox Henry T, Lee Joon
Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
JMIR Med Inform. 2022 Jun 3;10(6):e35250. doi: 10.2196/35250.
Redundancy in laboratory blood tests is common in intensive care units (ICUs), affecting patients' health and increasing health care expenses. Medical communities have made recommendations to order laboratory tests more judiciously. Wise selection can rely on modern data-driven approaches that have been shown to help identify low-yield laboratory blood tests in ICUs. However, although conditional entropy and conditional probability distribution have shown the potential to measure the uncertainty of yielding an abnormal test, no previous studies have adapted these techniques to include them in machine learning models for predicting abnormal laboratory test results.
This study aimed to address the limitations of previous reports by adapting conditional entropy and conditional probability to extract features for predicting abnormal laboratory blood test results.
We used an ICU data set collected across Alberta, Canada, which included 55,689 ICU admissions from 48,672 patients. We investigated the features of conditional entropy and conditional probability by comparing the performances of 2 machine learning approaches for predicting normal and abnormal results for 18 blood laboratory tests. Approach 1 used patients' vitals, age, sex, and admission diagnosis as features. Approach 2 used the same features plus the new conditional entropy-based and conditional probability-based features. Both approaches used 4 different machine learning models (fuzzy model, logistic regression, random forest, and gradient boosting trees) and 10 metrics (sensitivity, specificity, accuracy, precision, negative predictive value [NPV], F score, area under the curve [AUC], precision-recall AUC, mean G, and index balanced accuracy) to assess the performance of the approaches.
Approach 1 achieved an average AUC of 0.86 for all 18 laboratory tests across the 4 models (sensitivity 78%, specificity 84%, precision 82%, NPV 75%, F score 79%, and mean G 81%), whereas approach 2 achieved an average AUC of 0.89 (sensitivity 84%, specificity 84%, precision 83%, NPV 81%, F score 83%, and mean G 84%). We found that the inclusion of the new features resulted in significant differences for most of the metrics in favor of approach 2. Sensitivity significantly improved for 8 and 15 laboratory tests across the different classifiers (minimum P<.001 and maximum P=.04). Mean G and index balanced accuracy, which are balanced performance metrics, also improved significantly across the classifiers for 6 to 10 and 6 to 11 laboratory tests. The most relevant feature was the pretest probability feature, which is the probability that a test result was normal when a certain number of consecutive prior tests was already normal.
The findings suggest that conditional entropy-based features and pretest probability improve the capacity to discriminate between normal and abnormal laboratory test results. Detecting the next laboratory test result is an intermediate step toward developing guidelines for reducing overtesting in the ICU.
重症监护病房(ICU)中实验室血液检查的冗余现象很常见,这会影响患者健康并增加医疗费用。医学界已建议更明智地安排实验室检查。明智的选择可以依赖现代数据驱动方法,这些方法已被证明有助于识别ICU中低收益的实验室血液检查。然而,尽管条件熵和条件概率分布已显示出测量产生异常检查结果不确定性的潜力,但以前没有研究将这些技术应用于机器学习模型中以预测异常实验室检查结果。
本研究旨在通过采用条件熵和条件概率来提取特征以预测异常实验室血液检查结果,从而解决先前报告的局限性。
我们使用了在加拿大艾伯塔省收集的ICU数据集,其中包括来自48,672名患者的55,689例ICU入院病例。我们通过比较两种机器学习方法预测18项血液实验室检查正常和异常结果的性能,研究了条件熵和条件概率的特征。方法1使用患者的生命体征、年龄、性别和入院诊断作为特征。方法2使用相同的特征加上基于条件熵和条件概率的新特征。两种方法都使用4种不同的机器学习模型(模糊模型、逻辑回归、随机森林和梯度提升树)和10个指标(灵敏度、特异性、准确性、精确率、阴性预测值[NPV]、F分数、曲线下面积[AUC]、精确率-召回率AUC、平均G和指数平衡准确性)来评估方法的性能。
方法1在4个模型中对所有18项实验室检查的平均AUC为0.86(灵敏度78%,特异性84%,精确率82%,NPV 75%,F分数79%,平均G 81%),而方法2的平均AUC为0.89(灵敏度84%,特异性84%,精确率83%,NPV 81%,F分数83%,平均G 84%)。我们发现纳入新特征后,大多数指标都有显著差异,有利于方法2。在不同分类器中,8项和15项实验室检查的灵敏度显著提高(最小P<0.001,最大P = 0.04)。作为平衡性能指标的平均G和指数平衡准确性,在不同分类器中,6至10项和6至11项实验室检查也显著提高。最相关的特征是预测试概率特征,即当一定数量的连续先前检查已经正常时,测试结果正常的概率。
研究结果表明,基于条件熵的特征和预测试概率提高了区分正常和异常实验室检查结果的能力。检测下一个实验室检查结果是制定减少ICU过度检查指南的中间步骤。