• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类与机器在医学诊断中的感知差异。

Differences between human and machine perception in medical diagnosis.

机构信息

Center for Data Science, New York University, New York, NY, USA.

Department of Radiology, NYU Langone Health, New York, NY, USA.

出版信息

Sci Rep. 2022 Apr 27;12(1):6877. doi: 10.1038/s41598-022-10526-z.

DOI:10.1038/s41598-022-10526-z
PMID:35477730
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9046399/
Abstract

Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson's paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison.

摘要

深度神经网络(DNN)在基于图像的医学诊断中显示出了潜力,但由于它们可能会因为与潜在病理学无关的原因而失败,所以不能完全信任。人类不太可能犯这样肤浅的错误,因为他们使用的特征是基于医学科学的。因此,了解 DNN 是否使用与人类不同的特征非常重要。为此,我们提出了一个用于比较医学诊断中人类和机器感知的框架。我们从扰动鲁棒性的角度来比较,并通过进行亚组分析来减轻辛普森悖论。该框架通过乳腺癌筛查的案例研究得到了验证,我们分别分析了微钙化和软组织病变。虽然还不能确定人类和 DNN 是否使用不同的特征来检测微钙化,但我们发现,对于软组织病变,DNN 依赖于放射科医生忽略的高频成分。此外,这些特征位于放射科医生认为最可疑的图像区域之外。只有通过亚组分析才能看到人类和机器之间的这种差异,这凸显了将医学领域知识纳入比较的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/11ab8c7bc820/41598_2022_10526_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/83d92189aa25/41598_2022_10526_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/b658063854c4/41598_2022_10526_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/3cb6829278c9/41598_2022_10526_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/8f98d556e371/41598_2022_10526_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/7066cd7042d6/41598_2022_10526_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/11ab8c7bc820/41598_2022_10526_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/83d92189aa25/41598_2022_10526_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/b658063854c4/41598_2022_10526_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/3cb6829278c9/41598_2022_10526_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/8f98d556e371/41598_2022_10526_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/7066cd7042d6/41598_2022_10526_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d38/9046399/11ab8c7bc820/41598_2022_10526_Fig6_HTML.jpg

相似文献

1
Differences between human and machine perception in medical diagnosis.人类与机器在医学诊断中的感知差异。
Sci Rep. 2022 Apr 27;12(1):6877. doi: 10.1038/s41598-022-10526-z.
2
Investigation of psychophysical similarity measures for selection of similar images in the diagnosis of clustered microcalcifications on mammograms.乳腺钼靶片上簇状微钙化诊断中用于选择相似图像的心理物理相似性度量研究。
Med Phys. 2008 Dec;35(12):5695-702. doi: 10.1118/1.3020760.
3
Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections.无预设认知的感知:在识别组织切片方面,人与机器学习者的比较。
Sci Rep. 2022 Sep 30;12(1):16420. doi: 10.1038/s41598-022-20012-1.
4
Evaluating geodesic active contours in microcalcifications segmentation on mammograms.评估乳腺钼靶片中微钙化分割的测地主动轮廓。
Comput Methods Programs Biomed. 2015 Dec;122(3):304-15. doi: 10.1016/j.cmpb.2015.08.016. Epub 2015 Aug 29.
5
Malignant and benign clustered microcalcifications: automated feature analysis and classification.恶性和良性簇状微钙化:自动特征分析与分类
Radiology. 1996 Mar;198(3):671-8. doi: 10.1148/radiology.198.3.8628853.
6
Can a Machine Learn from Radiologists' Visual Search Behaviour and Their Interpretation of Mammograms-a Deep-Learning Study.机器能否从放射科医生的视觉搜索行为及其对乳房 X 光照片的解释中学习——一项深度学习研究。
J Digit Imaging. 2019 Oct;32(5):746-760. doi: 10.1007/s10278-018-00174-z.
7
Deep learning performance for detection and classification of microcalcifications on mammography.深度学习在乳腺 X 线摄影微钙化检测和分类中的性能。
Eur Radiol Exp. 2023 Nov 7;7(1):69. doi: 10.1186/s41747-023-00384-3.
8
Convolutional Neural Networks for the Segmentation of Microcalcification in Mammography Imaging.卷积神经网络在乳腺 X 线摄影成像中微钙化分割的应用。
J Healthc Eng. 2019 Apr 9;2019:9360941. doi: 10.1155/2019/9360941. eCollection 2019.
9
Improved automated early detection of breast cancer based on high resolution 3D micro-CT microcalcification images.基于高分辨率 3D 微 CT 微钙化图像的乳腺癌自动早期检测方法的改进。
BMC Cancer. 2022 Feb 11;22(1):162. doi: 10.1186/s12885-021-09133-4.
10
A deep learning model integrating mammography and clinical factors facilitates the malignancy prediction of BI-RADS 4 microcalcifications in breast cancer screening.深度学习模型整合乳腺 X 线摄影和临床因素有助于预测乳腺癌筛查中 BI-RADS 4 微钙化的恶性程度。
Eur Radiol. 2021 Aug;31(8):5902-5912. doi: 10.1007/s00330-020-07659-y. Epub 2021 Jan 26.

引用本文的文献

1
A comparative analysis of sagittal, coronal, and axial magnetic resonance imaging planes in diagnosing anterior cruciate ligament and meniscal tears via a deep learning model: emphasizing the unexpected importance of the axial plane.通过深度学习模型对矢状面、冠状面和轴位磁共振成像平面在诊断前交叉韧带和半月板撕裂中的比较分析:强调轴位平面的意外重要性
Quant Imaging Med Surg. 2025 Jun 6;15(6):5811-5824. doi: 10.21037/qims-24-1808. Epub 2025 Jun 3.
2
SEMTWIST Quantification of Biofilm Infection in Human Chronic Wound Using Scanning Electron Microscopy and Machine Learning.使用扫描电子显微镜和机器学习对人类慢性伤口生物膜感染进行SEMTWIST量化
Adv Wound Care (New Rochelle). 2025 Aug;14(8):393-408. doi: 10.1089/wound.2024.0291. Epub 2025 May 13.
3

本文引用的文献

1
Stan: A Probabilistic Programming Language.斯坦:一种概率编程语言。
J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01. Epub 2017 Jan 11.
2
Reducing False-Positive Biopsies using Deep Neural Networks that Utilize both Local and Global Image Context of Screening Mammograms.利用深度学习神经网络,利用筛查性乳房 X 光照片的局部和全局图像上下文减少假阳性活检。
J Digit Imaging. 2021 Dec;34(6):1414-1423. doi: 10.1007/s10278-021-00530-6. Epub 2021 Nov 3.
3
Improving the Ability of Deep Neural Networks to Use Information from Multiple Views in Breast Cancer Screening.
Unlocking the potential of digital pathology: Novel baselines for compression.释放数字病理学的潜力:压缩的新基线
J Pathol Inform. 2025 Jan 28;17:100421. doi: 10.1016/j.jpi.2025.100421. eCollection 2025 Apr.
4
Image biomarkers and explainable AI: handcrafted features versus deep learned features.影像生物标志物和可解释人工智能:手工特征与深度学习特征。
Eur Radiol Exp. 2024 Nov 19;8(1):130. doi: 10.1186/s41747-024-00529-y.
5
The State of Cancer Care in the United Arab Emirates in 2022.2022年阿拉伯联合酋长国的癌症护理状况
Clin Pract. 2022 Nov 23;12(6):955-985. doi: 10.3390/clinpract12060101.
提高深度神经网络在乳腺癌筛查中利用多视图信息的能力。
Proc Mach Learn Res. 2020 Jul;121:827-842.
4
Five points to check when comparing visual perception in humans and machines.比较人与机器的视觉感知时需要检查的五个要点。
J Vis. 2021 Mar 1;21(3):16. doi: 10.1167/jov.21.3.16.
5
An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization.一种利用弱监督定位的高分辨率乳腺癌筛查图像可解释分类器。
Med Image Anal. 2021 Feb;68:101908. doi: 10.1016/j.media.2020.101908. Epub 2020 Dec 16.
6
Changes in cancer detection and false-positive recall in mammography using artificial intelligence: a retrospective, multireader study.人工智能在乳腺癌检测和假阳性召回中的变化:一项回顾性、多读者研究。
Lancet Digit Health. 2020 Mar;2(3):e138-e148. doi: 10.1016/S2589-7500(20)30003-0. Epub 2020 Feb 6.
7
A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.深度学习在医学影像疾病检测方面的性能与医疗保健专业人员的比较:系统评价和荟萃分析。
Lancet Digit Health. 2019 Oct;1(6):e271-e297. doi: 10.1016/S2589-7500(19)30123-2. Epub 2019 Sep 25.
8
Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging.隐藏分层导致医学成像机器学习中具有临床意义的失败。
Proc ACM Conf Health Inference Learn (2020). 2020 Apr;2020:151-159. doi: 10.1145/3368555.3384468.
9
Performance vs. competence in human-machine comparisons.人机比较中的表现与能力。
Proc Natl Acad Sci U S A. 2020 Oct 27;117(43):26562-26571. doi: 10.1073/pnas.1905334117. Epub 2020 Oct 13.
10
A deep learning system for differential diagnosis of skin diseases.深度学习系统用于皮肤病的鉴别诊断。
Nat Med. 2020 Jun;26(6):900-908. doi: 10.1038/s41591-020-0842-3. Epub 2020 May 18.