Assessing the Trustworthiness of Saliency Maps for Localizing Abnormalities in Medical Imaging.

Authors

Arun Nishanth, Gaw Nathan, Singh Praveer, Chang Ken, Aggarwal Mehak, Chen Bryan, Hoebel Katharina, Gupta Sharut, Patel Jay, Gidwani Mishka, Adebayo Julius, Li Matthew D, Kalpathy-Cramer Jayashree

Affiliations

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, 149 13th St, Boston, MA 02129 (N.A., P.S., K.C., M.A., B.C., K.H., S.G., J.P., M.G., M.D.L., J.K.C.); Department of Computer Science, Shiv Nadar University, Greater Noida, India (N.A.); Department of Operational Sciences, Graduate School of Engineering and Management, Air Force Institute of Technology, Wright-Patterson AFB, Dayton, Ohio (N.G.); and Massachusetts Institute of Technology, Cambridge, Mass (K.C., B.C., K.H., J.P., J.A.).

Publication information

Radiol Artif Intell. 2021 Oct 6;3(6):e200267. doi: 10.1148/ryai.2021200267. eCollection 2021 Nov.

DOI:10.1148/ryai.2021200267
PMID:34870212
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8637231/
Abstract

PURPOSE

To evaluate the trustworthiness of saliency maps for abnormality localization in medical imaging.

MATERIALS AND METHODS

Using two large publicly available radiology datasets (Society for Imaging Informatics in Medicine-American College of Radiology Pneumothorax Segmentation dataset and Radiological Society of North America Pneumonia Detection Challenge dataset), the performance of eight commonly used saliency map techniques was quantified with regard to localization utility (segmentation and detection), sensitivity to model weight randomization, repeatability, and reproducibility. Their performance was compared with that of baseline methods and localization network architectures, using area under the precision-recall curve (AUPRC) and structural similarity index measure (SSIM) as metrics.
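To illustrate how a saliency map can be scored for localization utility with AUPRC, the following is a minimal sketch that treats every pixel as a ranked prediction against a binary ground-truth mask. It is not the authors' evaluation code: the function names, the step-wise (average precision) estimator, and the 2x2 toy arrays are assumptions made purely for illustration.

```python
def average_precision(scores, labels):
    """Step-wise area under the precision-recall curve (average precision).

    scores: per-pixel saliency values; labels: 1 inside the annotated
    abnormality, 0 elsewhere. Tied scores are treated as one threshold.
    """
    total_pos = sum(labels)
    if total_pos == 0:
        raise ValueError("mask must contain at least one positive pixel")
    # Rank pixels from most to least salient.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    tp = 0
    ap = 0.0
    prev_recall = 0.0
    i = 0
    while i < len(ranked):
        j = i
        # Consume every pixel tied at the current score.
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            tp += ranked[j][1]
            j += 1
        precision = tp / j          # j pixels are predicted positive so far
        recall = tp / total_pos
        ap += (recall - prev_recall) * precision
        prev_recall = recall
        i = j
    return ap


def flatten(grid):
    """Turn a 2D list (image) into a flat list of pixels."""
    return [value for row in grid for value in row]


# Hypothetical toy data: a 2x2 saliency map and its ground-truth mask.
saliency = [[0.9, 0.8],
            [0.1, 0.2]]
mask = [[1, 0],
        [1, 0]]

auprc = average_precision(flatten(saliency), flatten(mask))
print(f"{auprc:.3f}")  # 0.750
```

In this toy case one abnormal pixel is ranked first and the other last, so the score falls between chance and a perfect ranking (which would give 1.0).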

RESULTS

All eight saliency map techniques failed at least one of the criteria and were inferior in performance compared with localization networks. For pneumothorax segmentation, the AUPRC ranged from 0.024 to 0.224, while a U-Net achieved a significantly superior AUPRC of 0.404 (P < .005). For pneumonia detection, the AUPRC ranged from 0.160 to 0.519, while a RetinaNet achieved a significantly superior AUPRC of 0.596 (P < .005). Five and two saliency methods (of eight) failed the model randomization test on the segmentation and detection datasets, respectively, suggesting that these methods are not sensitive to changes in model parameters. The repeatability and reproducibility of the majority of the saliency methods were worse than localization networks for both the segmentation and detection datasets.
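The idea behind the model randomization test can be sketched as follows: if a saliency map barely changes after the model's weights are randomized, the similarity between the two maps stays high, which indicates the method is insensitive to model parameters. The sketch below uses a simplified global (single-window) SSIM rather than the usual sliding-window version. The function name, the toy maps, and the conventional constants 0.01 and 0.03 are assumptions; the abstract does not state the pass/fail threshold, so none is applied here.

```python
def global_ssim(x, y, data_range=1.0):
    """Single-window SSIM between two flattened maps of equal length.

    Simplification: statistics are computed over the whole map instead of
    over local sliding windows.
    """
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("maps must be non-empty and the same size")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Small stabilizing constants, using the conventional 0.01 and 0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov + c2)
    denominator = (mean_x ** 2 + mean_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical flattened saliency maps with values in [0, 1].
trained_map = [0.0, 0.1, 0.9, 1.0]         # from the trained model
insensitive_map = [0.0, 0.2, 0.8, 1.0]     # after randomization, nearly unchanged
sensitive_map = [1.0, 0.9, 0.1, 0.0]       # after randomization, structure destroyed

ssim_insensitive = global_ssim(trained_map, insensitive_map)
ssim_sensitive = global_ssim(trained_map, sensitive_map)
print(ssim_insensitive > ssim_sensitive)
```

A method whose maps behave like `insensitive_map` would be the concerning case in this framing, since its explanation does not depend on what the model has learned.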

CONCLUSION

The use of saliency maps in the high-risk domain of medical imaging warrants additional scrutiny; we recommend that detection or segmentation models be used if localization is the desired output of the network.

Keywords: Technology Assessment, Technical Aspects, Feature Detection, Convolutional Neural Network (CNN)

Supplemental material is available for this article. © RSNA, 2021.
