• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用计算压力测试评估深度学习骨龄算法对临床图像变化的稳健性。

Evaluating the Robustness of a Deep Learning Bone Age Algorithm to Clinical Image Variation Using Computational Stress Testing.

机构信息

From the Drexel University College of Medicine, Philadelphia, Pa (S.M.S.); University of Maryland Medical Intelligent Imaging (UM2ii) Center, Department of Diagnostic Radiology and Nuclear Medicine, University of Maryland School of Medicine, 670 W Baltimore St, 1st Fl, Room 1172, Baltimore, MD 21201 (S.M.S., K.P., E.B., V.S.P., P.H.Y.); and Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Md (P.H.Y.).

出版信息

Radiol Artif Intell. 2024 May;6(3):e230240. doi: 10.1148/ryai.230240.

DOI:10.1148/ryai.230240
PMID:38477660
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11140516/
Abstract

Purpose To evaluate the robustness of an award-winning bone age deep learning (DL) model to extensive variations in image appearance. Materials and Methods In December 2021, the DL bone age model that won the 2017 RSNA Pediatric Bone Age Challenge was retrospectively evaluated using the RSNA validation set (1425 pediatric hand radiographs; internal test set in this study) and the Digital Hand Atlas (DHA) (1202 pediatric hand radiographs; external test set). Each test image underwent seven types of transformations (rotations, flips, brightness, contrast, inversion, laterality marker, and resolution) to represent a range of image appearances, many of which simulate real-world variations. Computational "stress tests" were performed by comparing the model's predictions on baseline and transformed images. Mean absolute differences (MADs) of predicted bone ages compared with radiologist-determined ground truth on baseline versus transformed images were compared using Wilcoxon signed rank tests. The proportion of clinically significant errors (CSEs) was compared using McNemar tests. Results There was no evidence of a difference in MAD of the model on the two baseline test sets (RSNA = 6.8 months, DHA = 6.9 months; = .05), indicating good model generalization to external data. Except for the RSNA dataset images with an appended radiologic laterality marker ( = .86), there were significant differences in MAD for both the DHA and RSNA datasets among other transformation groups (rotations, flips, brightness, contrast, inversion, and resolution). There were significant differences in proportion of CSEs for 57% of the image transformations (19 of 33) performed on the DHA dataset. Conclusion Although an award-winning pediatric bone age DL model generalized well to curated external images, it had inconsistent predictions on images that had undergone simple transformations reflective of several real-world variations in image appearance. Pediatrics, Hand, Convolutional Neural Network, Radiography © RSNA, 2024 See also commentary by Faghani and Erickson in this issue.

摘要

目的 评估一个屡获殊荣的骨龄深度学习(DL)模型对图像外观广泛变化的稳健性。

材料与方法 2021 年 12 月,回顾性评估了在 RSNA 验证集(1425 例儿科手部 X 线片;本研究中的内部测试集)和 Digital Hand Atlas(DHA)(1202 例儿科手部 X 线片;外部测试集)上获得 2017 年 RSNA 儿科骨龄挑战赛奖项的 DL 骨龄模型。每个测试图像都经历了七种类型的变换(旋转、翻转、亮度、对比度、反转、侧标和分辨率),以代表一系列图像外观,其中许多模拟了真实世界的变化。通过比较模型在基线和变换图像上的预测,进行计算“压力测试”。比较基线和变换图像上模型预测骨龄的平均绝对差异(MAD),使用 Wilcoxon 符号秩检验。使用 McNemar 检验比较临床上显著错误(CSE)的比例。

结果 两种基线测试集(RSNA = 6.8 个月,DHA = 6.9 个月; =.05)上模型 MAD 无差异,表明模型对外部数据具有良好的泛化能力。除了在 RSNA 数据集图像上附加放射侧标( =.86)外,DHA 和 RSNA 数据集的其他变换组(旋转、翻转、亮度、对比度、反转和分辨率)之间的 MAD 存在显著差异。在 DHA 数据集上进行的 57%(33 项中的 19 项)图像变换中,有 19 项存在 CSE 比例差异。

结论 尽管一个屡获殊荣的儿科骨龄 DL 模型对经过精心挑选的外部图像具有很好的泛化能力,但它对反映图像外观的几种真实世界变化的简单变换后的图像的预测结果不一致。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e6f/11140516/6eefdbe8b5fb/ryai.230240.VA.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e6f/11140516/6eefdbe8b5fb/ryai.230240.VA.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e6f/11140516/6eefdbe8b5fb/ryai.230240.VA.jpg

相似文献

1
Evaluating the Robustness of a Deep Learning Bone Age Algorithm to Clinical Image Variation Using Computational Stress Testing.使用计算压力测试评估深度学习骨龄算法对临床图像变化的稳健性。
Radiol Artif Intell. 2024 May;6(3):e230240. doi: 10.1148/ryai.230240.
2
Generalizability and Bias in a Deep Learning Pediatric Bone Age Prediction Model Using Hand Radiographs.深度学习利用手 X 光片预测儿童骨龄的泛化性和偏差。
Radiology. 2023 Feb;306(2):e220505. doi: 10.1148/radiol.220505. Epub 2022 Sep 27.
3
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
4
Bone age determination using only the index finger: a novel approach using a convolutional neural network compared with human radiologists.仅使用食指进行骨龄测定:一种使用卷积神经网络与人类放射科医生进行比较的新方法。
Pediatr Radiol. 2020 Apr;50(4):516-523. doi: 10.1007/s00247-019-04587-y. Epub 2019 Dec 20.
5
The RSNA Pediatric Bone Age Machine Learning Challenge.RSNA 儿科骨龄机器学习挑战赛。
Radiology. 2019 Feb;290(2):498-503. doi: 10.1148/radiol.2018180736. Epub 2018 Nov 27.
6
Performance of a Deep-Learning Neural Network Model in Assessing Skeletal Maturity on Pediatric Hand Radiographs.深度学习神经网络模型在评估小儿手部 X 光片骨骼成熟度中的性能。
Radiology. 2018 Apr;287(1):313-322. doi: 10.1148/radiol.2017170236. Epub 2017 Nov 2.
7
Deep learning-based automated bone age estimation for Saudi patients on hand radiograph images: a retrospective study.基于深度学习的沙特手部 X 光图像患者骨龄自动估算:一项回顾性研究。
BMC Med Imaging. 2024 Aug 1;24(1):199. doi: 10.1186/s12880-024-01378-2.
8
Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs.人工智能能否区分骨 X 光片与花卉或汽车的照片?对不适当数据输入的骨龄深度学习模型的评估。
Skeletal Radiol. 2022 Feb;51(2):401-406. doi: 10.1007/s00256-021-03880-y. Epub 2021 Aug 5.
9
Enhancement of Non-Linear Deep Learning Model by Adjusting Confounding Variables for Bone Age Estimation in Pediatric Hand X-rays.调整混杂变量增强非线性深度学习模型在小儿手部 X 光片中骨龄估算的应用。
J Digit Imaging. 2023 Oct;36(5):2003-2014. doi: 10.1007/s10278-023-00849-2. Epub 2023 Jun 2.
10
Using a Dual-Input Convolutional Neural Network for Automated Detection of Pediatric Supracondylar Fracture on Conventional Radiography.利用双输入卷积神经网络自动检测常规 X 光片中的小儿髁上骨折。
Invest Radiol. 2020 Feb;55(2):101-110. doi: 10.1097/RLI.0000000000000615.

引用本文的文献

1
Determination of Skeletal Age From Hand Radiographs Using Deep Learning.利用深度学习从手部X光片确定骨骼年龄
Am J Sports Med. 2025 Sep;53(11):2715-2725. doi: 10.1177/03635465251359618. Epub 2025 Aug 15.
2
AI as teacher: effectiveness of an AI-based training module to improve trainee pediatric fracture detection.人工智能作为教师:基于人工智能的培训模块对提高实习儿科医生骨折检测能力的有效性
Skeletal Radiol. 2025 Apr 14. doi: 10.1007/s00256-025-04927-0.
3
Bone Age Assessment Using Various Medical Imaging Techniques Enhanced by Artificial Intelligence.

本文引用的文献

1
Visual Transformers and Convolutional Neural Networks for Disease Classification on Radiographs: A Comparison of Performance, Sample Efficiency, and Hidden Stratification.用于X光片疾病分类的视觉Transformer和卷积神经网络:性能、样本效率及隐藏分层的比较
Radiol Artif Intell. 2022 Sep 21;4(6):e220012. doi: 10.1148/ryai.220012. eCollection 2022 Nov.
2
Generalizability and Bias in a Deep Learning Pediatric Bone Age Prediction Model Using Hand Radiographs.深度学习利用手 X 光片预测儿童骨龄的泛化性和偏差。
Radiology. 2023 Feb;306(2):e220505. doi: 10.1148/radiol.220505. Epub 2022 Sep 27.
3
Toward Generalizability in the Deployment of Artificial Intelligence in Radiology: Role of Computation Stress Testing to Overcome Underspecification.
利用人工智能增强的各种医学成像技术进行骨龄评估。
Diagnostics (Basel). 2025 Jan 23;15(3):257. doi: 10.3390/diagnostics15030257.
4
Predictive analytics in bronchopulmonary dysplasia: past, present, and future.支气管肺发育不良的预测分析:过去、现在与未来。
Front Pediatr. 2024 Nov 20;12:1483940. doi: 10.3389/fped.2024.1483940. eCollection 2024.
5
Applications of Artificial Intelligence and Machine Learning in Spine MRI.人工智能与机器学习在脊柱磁共振成像中的应用
Bioengineering (Basel). 2024 Sep 5;11(9):894. doi: 10.3390/bioengineering11090894.
6
Bone Age Prediction under Stress.压力状态下的骨龄预测
Radiol Artif Intell. 2024 May;6(3):e240137. doi: 10.1148/ryai.240137.
迈向人工智能在放射学应用中的可推广性:计算压力测试在克服规格不足方面的作用
Radiol Artif Intell. 2021 Oct 27;3(6):e210097. doi: 10.1148/ryai.2021210097. eCollection 2021 Nov.
4
Deep Learning Algorithms for Interpretation of Upper Extremity Radiographs: Laterality and Technologist Initial Labels as Confounding Factors.深度学习算法在解读上肢 X 光片中的应用:侧别和技术员初始标签作为混杂因素。
AJR Am J Roentgenol. 2022 Apr;218(4):714-715. doi: 10.2214/AJR.21.26882. Epub 2021 Nov 10.
5
Your mileage may vary: impact of data input method for a deep learning bone age app's predictions.因人而异:深度学习骨龄应用程序的数据输入方式对预测结果的影响。
Skeletal Radiol. 2022 Feb;51(2):423-429. doi: 10.1007/s00256-021-03897-3. Epub 2021 Sep 3.
6
Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs.人工智能能否区分骨 X 光片与花卉或汽车的照片?对不适当数据输入的骨龄深度学习模型的评估。
Skeletal Radiol. 2022 Feb;51(2):401-406. doi: 10.1007/s00256-021-03880-y. Epub 2021 Aug 5.
7
Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models.压力测试揭示了基于图像的诊断人工智能模型在临床准备方面的差距。
NPJ Digit Med. 2021 Jan 21;4(1):10. doi: 10.1038/s41746-020-00380-6.
8
Human-computer collaboration for skin cancer recognition.人机协作进行皮肤癌识别。
Nat Med. 2020 Aug;26(8):1229-1234. doi: 10.1038/s41591-020-0942-0. Epub 2020 Jun 22.
9
The RSNA Pediatric Bone Age Machine Learning Challenge.RSNA 儿科骨龄机器学习挑战赛。
Radiology. 2019 Feb;290(2):498-503. doi: 10.1148/radiol.2018180736. Epub 2018 Nov 27.
10
Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.深度学习在胸片诊断中的应用:CheXNeXt 算法与临床放射科医生的回顾性比较。
PLoS Med. 2018 Nov 20;15(11):e1002686. doi: 10.1371/journal.pmed.1002686. eCollection 2018 Nov.