Harrison Robert F, Kennedy R Lee
Department of Automatic Control and Systems Engineering, The University of Sheffield, Sheffield, United Kingdom.
Ann Emerg Med. 2005 Nov;46(5):431-9. doi: 10.1016/j.annemergmed.2004.09.012.
Clinical and ECG data from presentation are highly discriminatory for diagnosis of acute coronary syndromes, whereas definitive diagnosis from serial ECG and cardiac marker protein measurements is usually not available for several hours. Artificial neural networks are computer programs adept at pattern recognition tasks and have been used to analyze data from chest pain patients with a view to developing diagnostic algorithms that might improve triage practices in the emergency department. The aim of this study is to develop and optimize artificial neural network models for diagnosis of acute coronary syndrome, to test these models on data collected prospectively from different centers, and to establish whether the performance of these models was superior to that of models derived using a standard statistical technique, logistic regression.
The study used data from 3,147 patients presenting to 3 hospitals with acute chest pain. Data from hospital 1 were used to train the models, which were then tested on independent data from the other 2 hospitals. From 40 potential factors, variables were selected according to the logarithm of their likelihood ratios to produce models using 8, 13, 20, and 40 factors. Identical data were used for logistic regression and artificial neural network models. Calibration and performance were assessed, the latter using receiver operating characteristic (ROC) curve analysis.
Although the performance of artificial neural network models generally increased with increasing numbers of factors, this was insignificant. The 13-factor model was therefore used for the rest of the study owing to its marginally improved calibration over the smallest model. Area under the ROC curve (with standard error) was 0.97 (0.006). The overall sensitivity and specificity of this model for acute coronary syndrome diagnosis using the training data was 0.93. ROC curves for logistic regression and artificial neural network models applied to data from the 3 hospitals were identical. For the 13-factor artificial neural network model tested on data from hospitals 2 and 3, area under the ROC curves (standard error) were 0.93 (0.006) and 0.95 (0.009), respectively. Investigation of the performance of the artificial neural network models throughout the range of predicted probabilities showed that they were well calibrated.
This study confirms that artificial neural networks can offer a useful approach for developing diagnostic algorithms for chest pain patients; however, the exceptional performance and simplicity of the logistic model militates in favor of logistic regression for the present task. Our artificial neural network models were well calibrated and performed well on unseen data from different centers. These issues have not been addressed in previous studies. However, and unlike in previous studies, we did not find the performance of artificial neural network models to be significantly different from that of suitably optimized logistic regression models.
就诊时的临床和心电图数据对急性冠脉综合征的诊断具有高度鉴别性,而通过连续心电图和心脏标志物蛋白测量进行的明确诊断通常需要数小时才能获得。人工神经网络是擅长模式识别任务的计算机程序,已被用于分析胸痛患者的数据,旨在开发可能改善急诊科分诊流程的诊断算法。本研究的目的是开发并优化用于诊断急性冠脉综合征的人工神经网络模型,在从不同中心前瞻性收集的数据上测试这些模型,并确定这些模型的性能是否优于使用标准统计技术逻辑回归得出的模型。
该研究使用了3家医院3147例因急性胸痛就诊患者的数据。医院1的数据用于训练模型,然后在另外2家医院的独立数据上进行测试。从40个潜在因素中,根据似然比的对数选择变量,以生成使用8、13、20和40个因素的模型。相同的数据用于逻辑回归模型和人工神经网络模型。评估校准和性能,后者使用受试者工作特征(ROC)曲线分析。
尽管人工神经网络模型的性能通常随着因素数量的增加而提高,但这种提高并不显著。因此,在本研究的其余部分使用13因素模型,因为其校准比最小模型略有改善。ROC曲线下面积(标准误)为0.97(0.006)。使用训练数据时,该模型对急性冠脉综合征诊断的总体敏感性和特异性为0.93。应用于3家医院数据的逻辑回归模型和人工神经网络模型的ROC曲线相同。对于在医院2和医院3的数据上测试的13因素人工神经网络模型,ROC曲线下面积(标准误)分别为0.93(0.006)和0.95(0.009)。对人工神经网络模型在整个预测概率范围内的性能进行研究表明,它们的校准良好。
本研究证实,人工神经网络可为胸痛患者开发诊断算法提供一种有用的方法;然而,逻辑模型的卓越性能和简单性有利于在当前任务中使用逻辑回归。我们的人工神经网络模型校准良好,在来自不同中心的未见过的数据上表现良好。这些问题在以前的研究中尚未得到解决。然而,与以前的研究不同,我们没有发现人工神经网络模型的性能与经过适当优化的逻辑回归模型有显著差异。