External Validation of Deep Learning Algorithms for Radiologic Diagnosis: A Systematic Review.

Authors

Yu Alice C, Mohajer Bahram, Eng John

Affiliation

Russell H. Morgan Department of Radiology and Radiological Science, Johns Hopkins University School of Medicine, 1800 Orleans St, Baltimore, MD 21287.

Publication information

Radiol Artif Intell. 2022 May 4;4(3):e210064. doi: 10.1148/ryai.210064. eCollection 2022 May.

DOI:10.1148/ryai.210064
PMID:35652114
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9152694/
Abstract

PURPOSE

To assess generalizability of published deep learning (DL) algorithms for radiologic diagnosis.

MATERIALS AND METHODS

In this systematic review, the PubMed database was searched for peer-reviewed studies of DL algorithms for image-based radiologic diagnosis that included external validation, published from January 1, 2015, through April 1, 2021. Studies using nonimaging features or incorporating non-DL methods for feature extraction or classification were excluded. Two reviewers independently evaluated studies for inclusion, and any discrepancies were resolved by consensus. Internal and external performance measures and pertinent study characteristics were extracted, and relationships among these data were examined using nonparametric statistics.
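
The abstract does not name the specific nonparametric tests that were used. As one common choice for relating a study characteristic to a performance difference, the sketch below computes a Spearman rank correlation in plain Python. The function names and the sample values are hypothetical and serve only as illustration; they are not data from the review.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j share this rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: the Pearson correlation of the ranks.

    Assumes x and y have equal length and that neither is constant.
    """
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    # Hypothetical, made-up values: external dataset size per study and
    # the corresponding internal-minus-external performance difference.
    external_dataset_size = [120, 450, 300, 900, 80]
    performance_difference = [0.08, 0.03, 0.05, 0.01, 0.12]
    print(spearman_rho(external_dataset_size, performance_difference))
```

A rank-based measure like this makes no assumption about the distribution of the performance differences, which is the usual reason for choosing nonparametric statistics.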

RESULTS

Eighty-three studies reporting 86 algorithms were included. The vast majority (70 of 86, 81%) reported at least some decrease in external performance compared with internal performance, with nearly half (42 of 86, 49%) reporting at least a modest decrease (≥0.05 on the unit scale) and nearly a quarter (21 of 86, 24%) reporting a substantial decrease (≥0.10 on the unit scale). No study characteristics were found to be associated with the difference between internal and external performance.
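
The reported percentages and thresholds can be checked with a short sketch. The counts come from the abstract; the function names, the "slight" label, and the mutually exclusive tiers are assumptions made for illustration. The abstract's own categories are cumulative, so "at least modest" also includes the substantial decreases.

```python
# Counts reported in the abstract (86 algorithms in total).
TOTAL_ALGORITHMS = 86
REPORTED = {
    "any decrease": 70,
    "at least modest decrease (>= 0.05)": 42,
    "substantial decrease (>= 0.10)": 21,
}


def percent(count, total):
    """Return count/total as a whole-number percentage."""
    return round(100 * count / total)


def classify_decrease(internal, external, modest=0.05, substantial=0.10):
    """Label the drop from internal to external performance (unit scale).

    The tiers here are exclusive; the "slight" label is an assumption of
    this sketch and does not appear in the abstract.
    """
    drop = round(internal - external, 6)  # rounding guards against float noise
    if drop >= substantial:
        return "substantial"
    if drop >= modest:
        return "modest"
    if drop > 0:
        return "slight"
    return "none"


if __name__ == "__main__":
    for label, count in REPORTED.items():
        print(f"{label}: {count}/{TOTAL_ALGORITHMS} = "
              f"{percent(count, TOTAL_ALGORITHMS)}%")
    # Hypothetical example: internal AUC 0.95, external AUC 0.85.
    print(classify_decrease(0.95, 0.85))
```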

CONCLUSION

Among published external validation studies of DL algorithms for image-based radiologic diagnosis, the vast majority demonstrated diminished algorithm performance on the external dataset, with some reporting a substantial performance decrease.

Keywords: Meta-Analysis, Computer Applications-Detection/Diagnosis, Neural Networks, Computer Applications-General (Informatics), Epidemiology, Technology Assessment, Diagnosis, Informatics

© RSNA, 2022
