Validation and algorithmic audit of a deep learning system for the detection of proximal femoral fractures in patients in the emergency department: a diagnostic accuracy study.

Authors

Oakden-Rayner Lauren, Gale William, Bonham Thomas A, Lungren Matthew P, Carneiro Gustavo, Bradley Andrew P, Palmer Lyle J

Affiliations

School of Public Health, University of Adelaide, Adelaide, SA, Australia; Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia.

Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia; School of Computer Science, University of Adelaide, Adelaide, SA, Australia.

Publication information

Lancet Digit Health. 2022 May;4(5):e351-e358. doi: 10.1016/S2589-7500(22)00004-8. Epub 2022 Apr 5.

DOI:10.1016/S2589-7500(22)00004-8
PMID:35396184
Abstract

BACKGROUND

Proximal femoral fractures are an important clinical and public health issue associated with substantial morbidity and early mortality. Artificial intelligence might offer improved diagnostic accuracy for these fractures, but typical approaches to testing of artificial intelligence models can underestimate the risks of artificial intelligence-based diagnostic systems.

METHODS

We present a preclinical evaluation of a deep learning model intended to detect proximal femoral fractures in frontal x-ray films in emergency department patients, trained on films from the Royal Adelaide Hospital (Adelaide, SA, Australia). This evaluation included a reader study comparing the performance of the model against five radiologists (three musculoskeletal specialists and two general radiologists) on a dataset of 200 fracture cases and 200 non-fractures (also from the Royal Adelaide Hospital), an external validation study using a dataset obtained from Stanford University Medical Center, CA, USA, and an algorithmic audit to detect any unusual or unexpected model behaviour.

FINDINGS

In the reader study, the area under the receiver operating characteristic curve (AUC) for the performance of the deep learning model was 0·994 (95% CI 0·988-0·999) compared with an AUC of 0·969 (0·960-0·978) for the five radiologists. This strong model performance was maintained on external validation, with an AUC of 0·980 (0·931-1·000). However, the preclinical evaluation identified barriers to safe deployment, including a substantial shift in the model operating point on external validation and an increased error rate on cases with abnormal bones (eg, Paget's disease).
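To make the reported metric concrete, the sketch below computes an AUC and a percentile-bootstrap 95% confidence interval. It is illustrative only: the scores are synthetic, the names `auc` and `bootstrap_ci` are hypothetical, and the abstract does not say how the study derived its intervals, so the bootstrap here is an assumption rather than the authors' method.

```python
import random


def auc(scores_pos, scores_neg):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as half)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def bootstrap_ci(scores_pos, scores_neg, n_boot=200, alpha=0.05, seed=0):
    """Percentile bootstrap interval, resampling positives and negatives
    separately with replacement (an assumed, commonly used scheme)."""
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        boot_pos = [rng.choice(scores_pos) for _ in scores_pos]
        boot_neg = [rng.choice(scores_neg) for _ in scores_neg]
        stats.append(auc(boot_pos, boot_neg))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Synthetic model scores, NOT data from the study.
rng = random.Random(42)
pos = [rng.gauss(2.0, 1.0) for _ in range(50)]  # fracture cases
neg = [rng.gauss(0.0, 1.0) for _ in range(50)]  # non-fracture cases

point = auc(pos, neg)
lo, hi = bootstrap_ci(pos, neg)
print(f"AUC {point:.3f} (95% CI {lo:.3f}-{hi:.3f})")
```

Because the AUC depends only on how cases are ranked, it says nothing about which decision threshold is used, which matters for the operating-point finding above.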

INTERPRETATION

The model outperformed the radiologists tested and maintained performance on external validation, but showed several unexpected limitations during further testing. Thorough preclinical evaluation of artificial intelligence models, including algorithmic auditing, can reveal unexpected and potentially harmful behaviour even in high-performance artificial intelligence systems, which can inform future clinical testing and deployment decisions.
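One way to see how a model can keep its AUC on external data while its operating point shifts is the toy sketch below. All scores are invented, the helper names are hypothetical, and the threshold of 0.5 is an assumption for illustration. The external scores keep the same ranking as the internal ones but are compressed toward zero, so a threshold tuned internally no longer gives the same sensitivity.

```python
def auc(scores_pos, scores_neg):
    """Probability that a positive case outscores a negative case."""
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in scores_pos
        for n in scores_neg
    )
    return wins / (len(scores_pos) * len(scores_neg))


def sens_spec(scores_pos, scores_neg, threshold):
    """Sensitivity and specificity when scores >= threshold are called positive."""
    sens = sum(s >= threshold for s in scores_pos) / len(scores_pos)
    spec = sum(s < threshold for s in scores_neg) / len(scores_neg)
    return sens, spec


# Invented scores, NOT data from the study.
internal_pos = [0.70, 0.80, 0.90, 0.95]
internal_neg = [0.05, 0.10, 0.20, 0.30]
# Hypothetical external site: same ordering, scores compressed toward zero.
external_pos = [0.42, 0.48, 0.54, 0.57]
external_neg = [0.03, 0.06, 0.12, 0.18]

threshold = 0.5  # assumed operating point chosen on the internal data

print("internal AUC", auc(internal_pos, internal_neg),
      "sens/spec", sens_spec(internal_pos, internal_neg, threshold))
print("external AUC", auc(external_pos, external_neg),
      "sens/spec", sens_spec(external_pos, external_neg, threshold))
```

In this toy case the AUC is identical at both sites, yet half of the external fractures fall below the fixed threshold, which is the kind of behaviour that AUC alone would not reveal.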

FUNDING

None.
