Dept. of Medical Informatics, School of Medicine, Keimyung University, Daegu, Republic of Korea.
J Biomed Inform. 2012 Oct;45(5):999-1008. doi: 10.1016/j.jbi.2012.04.013. Epub 2012 May 4.
The accurate diagnosis of heart failure in emergency room patients is quite important, but can also be quite difficult due to our insufficient understanding of the characteristics of heart failure. The purpose of this study is to design a decision-making model that provides critical factors and knowledge associated with congestive heart failure (CHF) using an approach that makes use of rough sets (RSs) and decision trees. Among 72 laboratory findings, it was determined that two subsets (RBC, EOS, Protein, O2SAT, Pro BNP) in an RS-based model, and one subset (Gender, MCHC, Direct bilirubin, and Pro BNP) in a logistic regression (LR)-based model were indispensable factors for differentiating CHF patients from those with dyspnea, and the risk factor Pro BNP was particularly so. To demonstrate the usefulness of the proposed model, we compared the discriminatory power of decision-making models that utilize RS- and LR-based decision models by conducting 10-fold cross-validation. The experimental results showed that the RS-based decision-making model (accuracy: 97.5%, sensitivity: 97.2%, specificity: 97.7%, positive predictive value: 97.2%, negative predictive value: 97.7%, and area under ROC curve: 97.5%) consistently outperformed the LR-based decision-making model (accuracy: 88.7%, sensitivity: 90.1%, specificity: 87.5%, positive predictive value: 85.3%, negative predictive value: 91.7%, and area under ROC curve: 88.8%). In addition, a pairwise comparison of the ROC curves of the two models showed a statistically significant difference (p<0.01; 95% CI: 2.63-14.6).
在急诊科患者中准确诊断心力衰竭非常重要,但由于我们对心力衰竭特征的了解不足,这也可能非常困难。本研究旨在设计一个决策模型,该模型使用粗糙集(RS)和决策树方法提供与充血性心力衰竭(CHF)相关的关键因素和知识。在 72 项实验室发现中,确定了基于 RS 的模型中的两个子集(RBC、EOS、蛋白质、O2SAT、Pro BNP)和基于逻辑回归(LR)的模型中的一个子集(性别、MCHC、直接胆红素和 Pro BNP)是区分心力衰竭患者和呼吸困难患者的必不可少的因素,而风险因素 Pro BNP 则尤为重要。为了证明所提出模型的有用性,我们通过进行 10 倍交叉验证比较了利用基于 RS 和 LR 的决策模型的决策模型的辨别能力。实验结果表明,基于 RS 的决策模型(准确率:97.5%,灵敏度:97.2%,特异性:97.7%,阳性预测值:97.2%,阴性预测值:97.7%,ROC 曲线下面积:97.5%)始终优于基于 LR 的决策模型(准确率:88.7%,灵敏度:90.1%,特异性:87.5%,阳性预测值:85.3%,阴性预测值:91.7%,ROC 曲线下面积:88.8%)。此外,对两个模型的 ROC 曲线进行成对比较显示存在统计学差异(p<0.01;95%置信区间:2.63-14.6)。