Suppr超能文献

人工智能在上肢创伤X线片解读中的应用:一项系统综述和荟萃分析。

Artificial intelligence in the interpretation of upper extremity trauma radiographs: a systematic review and meta-analysis.

作者信息

Mellon Matthew, Dworsky-Fried Joshua, Rathod Preksha, Shah Darshil, Khan Moin, Yan James

机构信息

Michael G. DeGroote School of Medicine, McMaster University, Hamilton, ON, Canada.

Division of Orthopedic Surgery, Department of Surgery, McMaster University, Hamilton, ON, Canada.

出版信息

JSES Rev Rep Tech. 2025 Mar 8;5(3):435-443. doi: 10.1016/j.xrrt.2025.02.004. eCollection 2025 Aug.

Abstract

BACKGROUND

Upper extremity fractures represent a significant reason for emergency room visits; however, nonexpert readings commonly lead to diagnostic errors, particularly missed fractures. Artificial intelligence (AI) has emerged as a promising tool to aid in fracture detection, but it has been shown to be comparable to physicians at best, so it remains unclear whether there is value in its increasing implementation. This review aims to analyze the existing literature on AI in the identification and interpretation of upper extremity fractures on x-ray and to assess the diagnostic performance of such AI models.

METHODS

Three databases were searched (MEDLINE, Embase, and CENTRAL) for studies involving AI and imaging in upper extremity orthopedics. The review was conducted in adherence to the Preferred Reporting Items for Systematic reviews and Meta-Analyses guidelines. Inclusion criteria were papers that (1) investigated fractures of the upper extremity, (2) included the use of AI models to identify or augment the identification of fractures on imaging, identify characteristic of images, or classify images, and (3) assessed X-ray, computed tomography, or magnetic resonance imaging identification of fractures. Exclusion criteria were papers that (1) were not published in English, (2) were case reports, conference abstracts, editorials, or review articles, (3) related to hand and wrist orthopedics, and (4) reported upper extremity data integrated with nonupper extremity data. Data on fracture detection accuracy, area under the curve, sensitivity, and specificity were recorded. The Quality Assessment of Diagnostic Accuracy Studies score was used to conduct a quality assessment of all included studies. A meta-analysis was conducted on the sensitivity, specificity, and AI-reader differences in sensitivity and specificity on relevant studies.

RESULTS

A total of 16 studies were included in this review. The mean accuracy of AI models across 5 studies was 89.9%. The mean area under the curve across 8 studies was 0.932. Across 10 studies, the pooled sensitivity and specificity for fracture detection of the AI models were 97.65% (95% confidence interval [CI]: 97.16%-98.13%, = 98.15%) and 91.38% (95% CI: 90.87%-91.89%, = 96.91%), respectively. The pooled AI-reader difference in sensitivity and specificity across the same 10 studies were 8.43% (95% CI: 7.97%-8.89%, = 79.45%) and 1.98% (95% CI: 1.47%-2.49%, = 90.50%), respectively.

DISCUSSION

The AI models show promising diagnostic accuracy in the detection of upper extremity fractures, but there is significant variability in the results. Future studies should investigate whether factors such as model type or anatomic location influence accuracy in order to guide physicians on where such models will meet a minimum standard of accuracy.

摘要

背景

上肢骨折是急诊室就诊的一个重要原因;然而,非专业人员的读片通常会导致诊断错误,尤其是漏诊骨折。人工智能(AI)已成为一种有前景的辅助骨折检测工具,但研究表明其表现充其量与医生相当,因此目前尚不清楚其越来越广泛的应用是否具有价值。本综述旨在分析关于人工智能在X线片上识别和解读上肢骨折的现有文献,并评估此类人工智能模型的诊断性能。

方法

检索了三个数据库(MEDLINE、Embase和CENTRAL),以查找涉及上肢骨科人工智能与影像学的研究。本综述按照系统评价和Meta分析的首选报告项目指南进行。纳入标准为满足以下条件的论文:(1)研究上肢骨折;(2)包括使用人工智能模型识别或辅助识别影像学上的骨折、识别图像特征或对图像进行分类;(3)评估X线、计算机断层扫描或磁共振成像对骨折的识别。排除标准为满足以下条件的论文:(1)非英文发表;(2)为病例报告、会议摘要、社论或综述文章;(3)与手和腕部骨科相关;(4)报告的上肢数据与非上肢数据整合。记录骨折检测准确性、曲线下面积、敏感性和特异性的数据。使用诊断准确性研究的质量评估评分对所有纳入研究进行质量评估。对相关研究中人工智能模型的敏感性、特异性以及人工智能读片与人类读片在敏感性和特异性上的差异进行Meta分析。

结果

本综述共纳入16项研究。5项研究中人工智能模型的平均准确率为89.9%。8项研究中曲线下面积的平均值为0.932。10项研究中,人工智能模型检测骨折的合并敏感性和特异性分别为97.65%(95%置信区间[CI]:97.16%-98.13%,I² = 98.15%)和91.38%(95%CI:90.87%-91.89%,I² = 96.91%)。在相同的10项研究中,人工智能读片与人类读片在敏感性和特异性上的合并差异分别为8.43%(95%CI:7.97%-8.89%,I² = 79.45%)和1.98%(95%CI:1.47%-2.49%,I² = 90.50%)。

讨论

人工智能模型在上肢骨折检测中显示出有前景的诊断准确性,但结果存在显著差异。未来的研究应调查模型类型或解剖位置等因素是否会影响准确性,以便指导医生了解此类模型在何处能达到最低准确性标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc1a/12277729/3076aecbe38c/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验