• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用TAVAC量化视觉Transformer模型中的解释可重复性。

Quantifying interpretation reproducibility in Vision Transformer models with TAVAC.

作者信息

Zhao Yue, Agyemang Dylan, Liu Yang, Mahoney Matt, Li Sheng

机构信息

The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.

Department of Mathematics and Statistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

出版信息

Sci Adv. 2024 Dec 20;10(51):eabg0264. doi: 10.1126/sciadv.abg0264.

DOI:10.1126/sciadv.abg0264
PMID:39705362
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11661421/
Abstract

Deep learning algorithms can extract meaningful diagnostic features from biomedical images, promising improved patient care in digital pathology. Vision Transformer (ViT) models capture long-range spatial relationships and offer robust prediction power and better interpretability for image classification tasks than convolutional neural network models. However, limited annotated biomedical imaging datasets can cause ViT models to overfit, leading to false predictions due to random noise. To address this, we introduce Training Attention and Validation Attention Consistency (TAVAC), a metric for evaluating ViT model overfitting and quantifying interpretation reproducibility. By comparing high-attention regions between training and testing, we tested TAVAC on four public image classification datasets and two independent breast cancer histological image datasets. Overfitted models showed significantly lower TAVAC scores. TAVAC also distinguishes off-target from on-target attentions and measures interpretation generalization at a fine-grained cellular level. Beyond diagnostics, TAVAC enhances interpretative reproducibility in basic research, revealing critical spatial patterns and cellular structures of biomedical and other general nonbiomedical images.

摘要

深度学习算法可以从生物医学图像中提取有意义的诊断特征,有望改善数字病理学中的患者护理。视觉Transformer(ViT)模型能够捕捉远距离空间关系,并且与卷积神经网络模型相比,在图像分类任务中具有更强的预测能力和更好的可解释性。然而,有限的带注释生物医学成像数据集可能会导致ViT模型过拟合,从而由于随机噪声产生错误预测。为了解决这个问题,我们引入了训练注意力和验证注意力一致性(TAVAC),这是一种用于评估ViT模型过拟合和量化解释可重复性的指标。通过比较训练和测试之间的高注意力区域,我们在四个公共图像分类数据集和两个独立的乳腺癌组织学图像数据集上测试了TAVAC。过拟合模型的TAVAC分数显著更低。TAVAC还能区分偏离目标的注意力和目标注意力,并在细粒度细胞水平上衡量解释的泛化能力。除了诊断之外,TAVAC还提高了基础研究中的解释可重复性,揭示了生物医学和其他一般非生物医学图像的关键空间模式和细胞结构。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/ba580c7e343b/sciadv.abg0264-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/b78a7877c4ef/sciadv.abg0264-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/bdf7aee4f1ae/sciadv.abg0264-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/c8d81fb25258/sciadv.abg0264-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/e42ff45a0492/sciadv.abg0264-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/8a33082df9d2/sciadv.abg0264-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/ba580c7e343b/sciadv.abg0264-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/b78a7877c4ef/sciadv.abg0264-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/bdf7aee4f1ae/sciadv.abg0264-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/c8d81fb25258/sciadv.abg0264-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/e42ff45a0492/sciadv.abg0264-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/8a33082df9d2/sciadv.abg0264-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/196c/11661421/ba580c7e343b/sciadv.abg0264-f6.jpg

相似文献

1
Quantifying interpretation reproducibility in Vision Transformer models with TAVAC.使用TAVAC量化视觉Transformer模型中的解释可重复性。
Sci Adv. 2024 Dec 20;10(51):eabg0264. doi: 10.1126/sciadv.abg0264.
2
Quantifying Interpretation Reproducibility in Vision Transformer Models with TAVAC.使用TAVAC量化视觉Transformer模型中的解释可重复性。
bioRxiv. 2024 Jan 22:2024.01.18.576252. doi: 10.1101/2024.01.18.576252.
3
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
4
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
5
A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.一种用于视网膜疾病检测的新型深度学习框架,利用来自视网膜图像的上下文和局部特征线索。
Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.
6
A hybrid deep learning model EfficientNet with GRU for breast cancer detection from histopathology images.一种用于从组织病理学图像中检测乳腺癌的基于GRU的混合深度学习模型EfficientNet。
Sci Rep. 2025 Jul 9;15(1):24633. doi: 10.1038/s41598-025-00930-6.
7
Exploratory multi-cohort, multi-reader study on the clinical utility of a deep learning model for transforming cryosectioned to formalin-fixed, paraffin-embedded (FFPE) images in breast lesion diagnosis.关于深度学习模型在乳腺病变诊断中用于将冷冻切片图像转换为福尔马林固定石蜡包埋(FFPE)图像的临床效用的探索性多队列、多读者研究。
Breast Cancer Res. 2025 Jun 17;27(1):110. doi: 10.1186/s13058-025-02064-z.
8
Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.半监督学习可利用多中心MRI数据,通过减少脑转移瘤的标注来改进分割。
J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.
9
Artificial intelligence-based prediction of organ involvement in Sjogren's syndrome using labial gland biopsy whole-slide images.基于人工智能利用唇腺活检全切片图像预测干燥综合征的器官受累情况。
Clin Rheumatol. 2025 Jun 5. doi: 10.1007/s10067-025-07518-5.
10
Automatic melanoma detection using an optimized five-stream convolutional neural network.使用优化的五流卷积神经网络进行黑色素瘤自动检测。
Sci Rep. 2025 Jul 1;15(1):22404. doi: 10.1038/s41598-025-05675-w.

引用本文的文献

1
Longitudinal MRI-Driven Multi-Modality Approach for Predicting Pathological Complete Response and B Cell Infiltration in Breast Cancer.基于纵向磁共振成像的多模态方法预测乳腺癌病理完全缓解及B细胞浸润
Adv Sci (Weinh). 2025 Mar;12(12):e2413702. doi: 10.1002/advs.202413702. Epub 2025 Feb 7.

本文引用的文献

1
Computer-aided diagnosis of ischemic stroke using multi-dimensional image features in carotid color Doppler.基于颈动脉彩色多谱勒的多维图像特征的计算机辅助诊断缺血性脑卒中。
Comput Biol Med. 2022 Aug;147:105779. doi: 10.1016/j.compbiomed.2022.105779. Epub 2022 Jun 22.
2
Application of computer-aided detection (CAD) software to automatically detect nodules under SDCT and LDCT scans with different parameters.应用计算机辅助检测(CAD)软件自动检测不同参数下 SDCT 和 LDCT 扫描下的结节。
Comput Biol Med. 2022 Jul;146:105538. doi: 10.1016/j.compbiomed.2022.105538. Epub 2022 Apr 17.
3
GasHisSDB: A new gastric histopathology image dataset for computer aided diagnosis of gastric cancer.
GasHisSDB:一个用于胃癌计算机辅助诊断的新型胃组织病理学图像数据集。
Comput Biol Med. 2022 Mar;142:105207. doi: 10.1016/j.compbiomed.2021.105207. Epub 2022 Jan 6.
4
Spatial deconvolution of HER2-positive breast cancer delineates tumor-associated cell type interactions.空间去卷积 HER2 阳性乳腺癌描绘肿瘤相关细胞类型相互作用。
Nat Commun. 2021 Oct 14;12(1):6012. doi: 10.1038/s41467-021-26271-2.
5
Computer-aided diagnosis of low grade endometrial stromal sarcoma (LGESS).计算机辅助诊断低度子宫内膜间质肉瘤(LGESS)。
Comput Biol Med. 2021 Nov;138:104874. doi: 10.1016/j.compbiomed.2021.104874. Epub 2021 Sep 22.
6
Integrating spatial gene expression and breast tumour morphology via deep learning.通过深度学习整合空间基因表达和乳腺肿瘤形态。
Nat Biomed Eng. 2020 Aug;4(8):827-834. doi: 10.1038/s41551-020-0578-x. Epub 2020 Jun 22.
7
Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning.基于深度学习的非小细胞肺癌组织病理学图像分类和突变预测。
Nat Med. 2018 Oct;24(10):1559-1567. doi: 10.1038/s41591-018-0177-5. Epub 2018 Sep 17.
8
1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset.1399 例乳腺癌患者 H&E 染色前哨淋巴结切片:CAMELYON 数据集。
Gigascience. 2018 Jun 1;7(6). doi: 10.1093/gigascience/giy065.
9
Deep Convolutional Neural Networks Enable Discrimination of Heterogeneous Digital Pathology Images.深度卷积神经网络能够区分异质数字病理学图像。
EBioMedicine. 2018 Jan;27:317-328. doi: 10.1016/j.ebiom.2017.12.026. Epub 2017 Dec 28.
10
The quantitative evaluation of functional neuroimaging experiments: the NPAIRS data analysis framework.功能神经影像学实验的定量评估:NPAIRS数据分析框架。
Neuroimage. 2002 Apr;15(4):747-71. doi: 10.1006/nimg.2001.1034.