Doig G S, Inman K J, Sibbald W J, Martin C M, Robertson J M
Department of Epidemiology and Biostatistics, University of Western Ontario, Canada.
Proc Annu Symp Comput Appl Med Care. 1993:361-5.
The objective of this study was to compare and contrast two techniques of modeling mortality in a 30 bed multi-disciplinary ICU; neural networks and logistic regression. Fifteen physiological variables were recorded on day 3 for 422 consecutive patients whose duration of stay was over 72 hours. Two separate models were built using each technique. First, logistic and neural network models were constructed on the complete 422 patient dataset and discrimination was compared. Second, the database was randomly divided into a 284 patient developmental dataset and a 138 patient validation dataset. The developmental dataset was used to construct logistic and neural net models and the predictive power of these models was verified on the validation dataset. On the complete dataset, the neural network clearly outperformed the logistic model (sensitivity and specificity of 1 and .997 vs. .525 and .966, area under ROC curve .9993 vs. .9259), while both performed equally well on the validation dataset (area under ROC of .82). The excellent performance of the neural net on the complete dataset reveals that the problem is classifiable. Since our dataset only contained 40 mortality events, it is highly likely that the validation dataset was not representative of the developmental dataset, which led to a decreased predictive performance by both the neural net and the logistic regression models. Theoretically, given an extensive dataset, the neural network should be able to perform mortality prediction with a sensitivity and a specificity approaching 95%. Clinically, this would be an extremely important achievement.(ABSTRACT TRUNCATED AT 250 WORDS)
本研究的目的是比较和对比在一个拥有30张床位的多学科重症监护病房(ICU)中模拟死亡率的两种技术:神经网络和逻辑回归。对422名连续入住且住院时间超过72小时的患者在第3天记录了15个生理变量。使用每种技术构建了两个独立的模型。首先,在完整的422例患者数据集上构建逻辑模型和神经网络模型,并比较二者的判别能力。其次,将数据库随机分为一个包含284例患者的开发数据集和一个包含138例患者的验证数据集。利用开发数据集构建逻辑模型和神经网络模型,并在验证数据集上验证这些模型的预测能力。在完整数据集上,神经网络明显优于逻辑模型(灵敏度和特异性分别为1和0.997,而逻辑模型为0.525和0.966;ROC曲线下面积分别为0.9993和0.9259),而在验证数据集上二者表现相当(ROC曲线下面积为0.82)。神经网络在完整数据集上的出色表现表明该问题是可分类的。由于我们的数据集仅包含40例死亡事件,很有可能验证数据集不能代表开发数据集,这导致神经网络和逻辑回归模型的预测性能均下降。从理论上讲,给定一个广泛的数据集,神经网络应该能够以接近95%的灵敏度和特异性进行死亡率预测。在临床上,这将是一项极其重要的成就。(摘要截断于250字)