• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习注意力机制在临床研究中的局限性:基于韩国糖尿病疾病环境的实证案例研究。

Limitations of Deep Learning Attention Mechanisms in Clinical Research: Empirical Case Study Based on the Korean Diabetic Disease Setting.

机构信息

Graduate School of Cancer Science and Policy, National Cancer Center, Goyang-si, Gyeonggi-do, Republic of Korea.

Cancer Data Center, National Cancer Control Institute, National Cancer Center, Goyang-si, Gyeonggi-do, Republic of Korea.

出版信息

J Med Internet Res. 2020 Dec 16;22(12):e18418. doi: 10.2196/18418.

DOI:10.2196/18418
PMID:33325832
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7773508/
Abstract

BACKGROUND

Despite excellent prediction performance, noninterpretability has undermined the value of applying deep-learning algorithms in clinical practice. To overcome this limitation, attention mechanism has been introduced to clinical research as an explanatory modeling method. However, potential limitations of using this attractive method have not been clarified to clinical researchers. Furthermore, there has been a lack of introductory information explaining attention mechanisms to clinical researchers.

OBJECTIVE

The aim of this study was to introduce the basic concepts and design approaches of attention mechanisms. In addition, we aimed to empirically assess the potential limitations of current attention mechanisms in terms of prediction and interpretability performance.

METHODS

First, the basic concepts and several key considerations regarding attention mechanisms were identified. Second, four approaches to attention mechanisms were suggested according to a two-dimensional framework based on the degrees of freedom and uncertainty awareness. Third, the prediction performance, probability reliability, concentration of variable importance, consistency of attention results, and generalizability of attention results to conventional statistics were assessed in the diabetic classification modeling setting. Fourth, the potential limitations of attention mechanisms were considered.

RESULTS

Prediction performance was very high for all models. Probability reliability was high in models with uncertainty awareness. Variable importance was concentrated in several variables when uncertainty awareness was not considered. The consistency of attention results was high when uncertainty awareness was considered. The generalizability of attention results to conventional statistics was poor regardless of the modeling approach.

CONCLUSIONS

The attention mechanism is an attractive technique with potential to be very promising in the future. However, it may not yet be desirable to rely on this method to assess variable importance in clinical settings. Therefore, along with theoretical studies enhancing attention mechanisms, more empirical studies investigating potential limitations should be encouraged.

摘要

背景

尽管深度学习算法在预测性能方面表现出色,但由于其不可解释性,限制了其在临床实践中的应用价值。为了克服这一局限性,注意力机制作为一种解释性建模方法被引入临床研究。然而,临床研究人员尚未明确认识到使用这种有吸引力的方法的潜在局限性。此外,缺乏向临床研究人员介绍注意力机制的入门信息。

目的

本研究旨在介绍注意力机制的基本概念和设计方法。此外,我们旨在根据基于自由度和不确定性意识的二维框架,从预测和可解释性性能方面实证评估当前注意力机制的潜在局限性。

方法

首先,确定了注意力机制的基本概念和几个关键注意事项。其次,根据基于自由度和不确定性意识的二维框架,提出了四种注意力机制方法。第三,在糖尿病分类建模设置中评估了预测性能、概率可靠性、变量重要性的集中程度、注意力结果的一致性以及注意力结果对常规统计的可推广性。第四,考虑了注意力机制的潜在局限性。

结果

所有模型的预测性能都非常高。具有不确定性意识的模型的概率可靠性较高。当不考虑不确定性意识时,变量重要性集中在几个变量上。当考虑不确定性意识时,注意力结果的一致性较高。无论采用何种建模方法,注意力结果对常规统计的可推广性都较差。

结论

注意力机制是一种很有前途的技术,具有很大的潜力。然而,在临床环境中,依靠这种方法来评估变量的重要性可能还不太理想。因此,除了进行理论研究以增强注意力机制外,还应鼓励开展更多的实证研究来调查其潜在局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/0db21170b72e/jmir_v22i12e18418_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/0889ef40f925/jmir_v22i12e18418_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/141a0da8460c/jmir_v22i12e18418_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/d5d160aa3419/jmir_v22i12e18418_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/2f37f6d38a96/jmir_v22i12e18418_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/07fa4186c6c5/jmir_v22i12e18418_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/787b14e863f2/jmir_v22i12e18418_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/5247755d9110/jmir_v22i12e18418_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/0db21170b72e/jmir_v22i12e18418_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/0889ef40f925/jmir_v22i12e18418_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/141a0da8460c/jmir_v22i12e18418_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/d5d160aa3419/jmir_v22i12e18418_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/2f37f6d38a96/jmir_v22i12e18418_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/07fa4186c6c5/jmir_v22i12e18418_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/787b14e863f2/jmir_v22i12e18418_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/5247755d9110/jmir_v22i12e18418_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfaa/7773508/0db21170b72e/jmir_v22i12e18418_fig8.jpg

相似文献

1
Limitations of Deep Learning Attention Mechanisms in Clinical Research: Empirical Case Study Based on the Korean Diabetic Disease Setting.深度学习注意力机制在临床研究中的局限性:基于韩国糖尿病疾病环境的实证案例研究。
J Med Internet Res. 2020 Dec 16;22(12):e18418. doi: 10.2196/18418.
2
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
3
Deep learning uncertainty and confidence calibration for the five-class polyp classification from colonoscopy.深度学习不确定性和置信度校准在结肠镜下五分类息肉分类中的应用。
Med Image Anal. 2020 May;62:101653. doi: 10.1016/j.media.2020.101653. Epub 2020 Feb 28.
4
Deep Learning for Diabetes: A Systematic Review.深度学习在糖尿病领域的应用:系统综述。
IEEE J Biomed Health Inform. 2021 Jul;25(7):2744-2757. doi: 10.1109/JBHI.2020.3040225. Epub 2021 Jul 27.
5
Research Techniques Made Simple: Deep Learning for the Classification of Dermatological Images.研究技巧简单化:用于皮肤科图像分类的深度学习。
J Invest Dermatol. 2020 Mar;140(3):507-514.e1. doi: 10.1016/j.jid.2019.12.029.
6
[Artificial intelligence in image analysis-fundamentals and new developments].[图像分析中的人工智能——基础与新进展]
Hautarzt. 2020 Sep;71(9):660-668. doi: 10.1007/s00105-020-04663-7.
7
An interpretable deep learning model for time-series electronic health records: Case study of delirium prediction in critical care.可解释的深度学习模型在电子健康记录时间序列中的应用:重症监护中谵妄预测的案例研究。
Artif Intell Med. 2023 Oct;144:102659. doi: 10.1016/j.artmed.2023.102659. Epub 2023 Sep 14.
8
De-Identification of Facial Features in Magnetic Resonance Images: Software Development Using Deep Learning Technology.磁共振图像中面部特征的去识别:基于深度学习技术的软件开发。
J Med Internet Res. 2020 Dec 10;22(12):e22739. doi: 10.2196/22739.
9
Deep Learning-Based Algorithms in Screening of Diabetic Retinopathy: A Systematic Review of Diagnostic Performance.基于深度学习的算法在糖尿病视网膜病变筛查中的应用:诊断性能的系统评价
Ophthalmol Retina. 2019 Apr;3(4):294-304. doi: 10.1016/j.oret.2018.10.014. Epub 2018 Nov 3.
10
Prediction of systemic biomarkers from retinal photographs: development and validation of deep-learning algorithms.从视网膜照片预测系统性生物标志物:深度学习算法的开发和验证。
Lancet Digit Health. 2020 Oct;2(10):e526-e536. doi: 10.1016/S2589-7500(20)30216-8.

引用本文的文献

1
Enhancing pediatric distal radius fracture detection: optimizing YOLOv8 with advanced AI and machine learning techniques.增强小儿桡骨远端骨折检测:采用先进人工智能和机器学习技术优化YOLOv8
BMC Med Imaging. 2025 Aug 5;25(1):316. doi: 10.1186/s12880-025-01669-2.
2
Deep learning for MRI-based acute and subacute ischaemic stroke lesion segmentation-a systematic review, meta-analysis, and pilot evaluation of key results.基于MRI的急性和亚急性缺血性中风病变分割的深度学习——系统综述、荟萃分析及关键结果的初步评估
Front Med Technol. 2025 Jun 10;7:1491197. doi: 10.3389/fmedt.2025.1491197. eCollection 2025.
3
Multi-modal image classification of COVID-19 cases using computed tomography and X-rays scans.

本文引用的文献

1
BERTMeSH: deep contextual representation learning for large-scale high-performance MeSH indexing with full text.BERTMeSH:基于深度上下文表示学习的大规模高性能 MeSH 索引与全文检索
Bioinformatics. 2021 May 5;37(5):684-692. doi: 10.1093/bioinformatics/btaa837.
2
BEHRT: Transformer for Electronic Health Records.BEHRT:电子健康记录的转换器。
Sci Rep. 2020 Apr 28;10(1):7155. doi: 10.1038/s41598-020-62922-y.
3
An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.
利用计算机断层扫描和X射线扫描对新冠肺炎病例进行多模态图像分类。
Intell Syst Appl. 2023 Feb;17:200160. doi: 10.1016/j.iswa.2022.200160. Epub 2022 Nov 30.
4
AirSeg: Learnable Interconnected Attention Framework for Robust Airway Segmentation.AirSeg:用于稳健气道分割的可学习互联注意力框架
J Imaging Inform Med. 2025 May 22. doi: 10.1007/s10278-025-01545-z.
5
Deep learning-based differential gut flora for prediction of Parkinson's.基于深度学习的肠道菌群差异用于帕金森病预测
PLoS One. 2025 Jan 7;20(1):e0310005. doi: 10.1371/journal.pone.0310005. eCollection 2025.
6
Enhancing signal-to-noise ratio in real-time LED-based photoacoustic imaging: A comparative study of CNN-based deep learning architectures.实时基于LED的光声成像中提高信噪比:基于卷积神经网络的深度学习架构的比较研究
Photoacoustics. 2024 Nov 30;41:100674. doi: 10.1016/j.pacs.2024.100674. eCollection 2025 Feb.
7
Prototype Learning for Medical Time Series Classification via Human-Machine Collaboration.通过人机协作实现医学时间序列分类的原型学习
Sensors (Basel). 2024 Apr 22;24(8):2655. doi: 10.3390/s24082655.
8
A comparison of machine learning models and Cox proportional hazards models regarding their ability to predict the risk of gastrointestinal cancer based on metabolic syndrome and its components.基于代谢综合征及其组成部分,对机器学习模型和Cox比例风险模型预测胃肠道癌症风险的能力进行比较。
Front Oncol. 2023 Mar 2;13:1049787. doi: 10.3389/fonc.2023.1049787. eCollection 2023.
9
Artificial intelligence in cardiology: Hope for the future and power for the present.心脏病学中的人工智能:未来的希望与当下的力量。
Front Cardiovasc Med. 2022 Oct 13;9:945726. doi: 10.3389/fcvm.2022.945726. eCollection 2022.
基于注意力的深度学习模型在中文电子病历临床命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):235. doi: 10.1186/s12911-019-0933-6.
4
Attention-Based Deep Neural Networks for Detection of Cancerous and Precancerous Esophagus Tissue on Histopathological Slides.基于注意力的深度学习神经网络在组织病理学切片上用于癌症和癌前食管组织的检测。
JAMA Netw Open. 2019 Nov 1;2(11):e1914645. doi: 10.1001/jamanetworkopen.2019.14645.
5
Trends and Focus of Machine Learning Applications for Health Research.机器学习在健康研究中的应用趋势与重点。
JAMA Netw Open. 2019 Oct 2;2(10):e1914051. doi: 10.1001/jamanetworkopen.2019.14051.
6
Assessment of Deep Learning Using Nonimaging Information and Sequential Medical Records to Develop a Prediction Model for Nonmelanoma Skin Cancer.利用非成像信息和连续医疗记录评估深度学习以开发非黑色素瘤皮肤癌预测模型。
JAMA Dermatol. 2019 Nov 1;155(11):1277-1283. doi: 10.1001/jamadermatol.2019.2335.
7
Artificial Intelligence Algorithms to Assess Hormonal Status From Tissue Microarrays in Patients With Breast Cancer.人工智能算法评估乳腺癌患者组织微阵列中的激素状态。
JAMA Netw Open. 2019 Jul 3;2(7):e197700. doi: 10.1001/jamanetworkopen.2019.7700.
8
Development and Validation of a Deep Learning-Based Automated Detection Algorithm for Major Thoracic Diseases on Chest Radiographs.基于深度学习的胸部 X 线片主要胸部疾病自动检测算法的开发与验证。
JAMA Netw Open. 2019 Mar 1;2(3):e191095. doi: 10.1001/jamanetworkopen.2019.1095.
9
Assessment of a Deep Learning Model Based on Electronic Health Record Data to Forecast Clinical Outcomes in Patients With Rheumatoid Arthritis.基于电子健康记录数据的深度学习模型评估类风湿关节炎患者临床结局预测
JAMA Netw Open. 2019 Mar 1;2(3):e190606. doi: 10.1001/jamanetworkopen.2019.0606.
10
Attention gated networks: Learning to leverage salient regions in medical images.注意门控网络:学习利用医学图像中的显著区域。
Med Image Anal. 2019 Apr;53:197-207. doi: 10.1016/j.media.2019.01.012. Epub 2019 Feb 5.