Suppr超能文献

意大利急诊科队列中机器学习模型对晕厥短期结局预测的验证

Validation of syncope short-term outcomes prediction by machine learning models in an Italian emergency department cohort.

作者信息

Levra Alessandro Giaj, Gatti Mauro, Mene Roberto, Shiffer Dana, Costantino Giorgio, Solbiati Monica, Furlan Raffaello, Dipaola Franca

机构信息

Department of Cardiovascular Medicine, Humanitas Research Hospital, IRCCS, Rozzano, Milan, Italy.

Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy.

出版信息

Intern Emerg Med. 2025 Jul 16. doi: 10.1007/s11739-025-04034-x.

Abstract

Machine learning (ML) algorithms have the potential to enhance the prediction of adverse outcomes in patients with syncope. Recently, gradient boosting (GB) and logistic regression (LR) models have been applied to predict these outcomes following a syncope episode, using the Canadian Syncope Risk Score (CSRS) predictors. This study aims to externally validate these models and compare their performance with novel models. We included all consecutive non-low-risk patients evaluated in the emergency department for syncope between 2015 and 2017 at six Italian hospitals. The GB and LR models were trained and tested using previously validated CSRS predictors. Additionally, recently developed deep learning (TabPFN) and large language models (TabLLM) were validated on the same cohort. The area under the curve (AUC), Matthews correlation coefficient (MCC), and Brier score (BS) were compared for each model. A total of 257 patients were enrolled, with a median age of 71 years. Thirteen percent had adverse outcomes at 30 days. The GB model achieved the best performance, with an AUC of 0.78, an MCC of 0.36, and a BS of 0.42. Significant performance differences were observed compared with the TabPFN model (p < 0.01) and the TabLLM model (p = 0.01). The GB model performed only slightly better than the LR model. The predictive capability of the GB and LR models using CSRS variables was reduced when validated in an external syncope cohort characterized by a higher event rate.

摘要

机器学习(ML)算法有潜力提高对晕厥患者不良结局的预测。最近,梯度提升(GB)和逻辑回归(LR)模型已被应用于使用加拿大晕厥风险评分(CSRS)预测指标来预测晕厥发作后的这些结局。本研究旨在对这些模型进行外部验证,并将它们的性能与新模型进行比较。我们纳入了2015年至2017年期间在意大利六家医院急诊科接受评估的所有连续非低风险晕厥患者。使用先前验证的CSRS预测指标对GB和LR模型进行训练和测试。此外,最近开发的深度学习(TabPFN)和大语言模型(TabLLM)也在同一队列中进行了验证。比较了每个模型的曲线下面积(AUC)、马修斯相关系数(MCC)和布里尔评分(BS)。共纳入257例患者,中位年龄为71岁。13%的患者在30天时出现不良结局。GB模型表现最佳,AUC为0.78,MCC为0.36,BS为0.42。与TabPFN模型(p < 0.01)和TabLLM模型(p = 0.01)相比,观察到显著的性能差异。GB模型的表现仅略优于LR模型。当在以较高事件发生率为特征的外部晕厥队列中进行验证时,使用CSRS变量的GB和LR模型的预测能力有所降低。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验