From modern CNNs to vision transformers: Assessing the performance, robustness, and classification strategies of deep learning models in histopathology.

Authors

Springenberg Maximilian, Frommholz Annika, Wenzel Markus, Weicken Eva, Ma Jackie, Strodthoff Nils

Affiliation

Fraunhofer Heinrich Hertz Institute, Einsteinufer 37, 10587 Berlin, Germany.

Publication information

Med Image Anal. 2023 Jul;87:102809. doi: 10.1016/j.media.2023.102809. Epub 2023 Apr 28.

DOI:10.1016/j.media.2023.102809

PMID:37201221

Abstract

While machine learning is currently transforming the field of histopathology, the domain lacks a comprehensive evaluation of state-of-the-art models based on essential but complementary quality requirements beyond a mere classification accuracy. In order to fill this gap, we developed a new methodology to extensively evaluate a wide range of classification models, including recent vision transformers, and convolutional neural networks such as: ConvNeXt, ResNet (BiT), Inception, ViT and Swin transformer, with and without supervised or self-supervised pretraining. We thoroughly tested the models on five widely used histopathology datasets containing whole slide images of breast, gastric, and colorectal cancer and developed a novel approach using an image-to-image translation model to assess the robustness of a cancer classification model against stain variations. Further, we extended existing interpretability methods to previously unstudied models and systematically reveal insights of the models' classification strategies that allow for plausibility checks and systematic comparisons. The study resulted in specific model recommendations for practitioners as well as putting forward a general methodology to quantify a model's quality according to complementary requirements that can be transferred to future model architectures.

Similar articles

From modern CNNs to vision transformers: Assessing the performance, robustness, and classification strategies of deep learning models in histopathology.

Med Image Anal. 2023 Jul;87:102809. doi: 10.1016/j.media.2023.102809. Epub 2023 Apr 28.

Do it the transformer way: A comprehensive review of brain and vision transformers for autism spectrum disorder diagnosis and classification.

Comput Biol Med. 2023 Dec;167:107667. doi: 10.1016/j.compbiomed.2023.107667. Epub 2023 Nov 3.

ChampKit: A framework for rapid evaluation of deep neural networks for patch-based histopathology classification.

Comput Methods Programs Biomed. 2023 Sep;239:107631. doi: 10.1016/j.cmpb.2023.107631. Epub 2023 May 30.

Semi-supervised training of deep convolutional neural networks with heterogeneous data and few local annotations: An experiment on prostate histopathology image classification.

Med Image Anal. 2021 Oct;73:102165. doi: 10.1016/j.media.2021.102165. Epub 2021 Jul 14.

Comparison between vision transformers and convolutional neural networks to predict non-small lung cancer recurrence.

Sci Rep. 2023 Nov 23;13(1):20605. doi: 10.1038/s41598-023-48004-9.

Machine learning techniques for mitoses classification.

Comput Med Imaging Graph. 2021 Jan;87:101832. doi: 10.1016/j.compmedimag.2020.101832. Epub 2020 Nov 27.

Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

J Med Syst. 2024 Sep 12;48(1):84. doi: 10.1007/s10916-024-02105-8.

Transformer-based unsupervised contrastive learning for histopathological image classification.

Med Image Anal. 2022 Oct;81:102559. doi: 10.1016/j.media.2022.102559. Epub 2022 Jul 30.

Enhancing Melanoma Diagnosis with Advanced Deep Learning Models Focusing on Vision Transformer, Swin Transformer, and ConvNeXt.

Dermatopathology (Basel). 2024 Aug 15;11(3):239-252. doi: 10.3390/dermatopathology11030026.

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization.

Med Image Anal. 2025 Jan;99:103348. doi: 10.1016/j.media.2024.103348. Epub 2024 Sep 16.

Cited by

Evaluating Vision and Pathology Foundation Models for Computational Pathology: A Comprehensive Benchmark Study.

Res Sq. 2025 Jul 4:rs.3.rs-6823810. doi: 10.21203/rs.3.rs-6823810/v1.

Application of deep learning convolutional neural networks to identify gastric squamous cell carcinoma in mice.

Front Med (Lausanne). 2025 May 13;12:1587417. doi: 10.3389/fmed.2025.1587417. eCollection 2025.

A hybrid explainable federated-based vision transformer framework for breast cancer prediction via risk factors.

Sci Rep. 2025 May 27;15(1):18453. doi: 10.1038/s41598-025-96527-0.

Demystifying the black box: A survey on explainable artificial intelligence (XAI) in bioinformatics.

Comput Struct Biotechnol J. 2025 Jan 10;27:346-359. doi: 10.1016/j.csbj.2024.12.027. eCollection 2025.

The Neural Frontier of Future Medical Imaging: A Review of Deep Learning for Brain Tumor Detection.

J Imaging. 2024 Dec 24;11(1):2. doi: 10.3390/jimaging11010002.

Enhanced Immunohistochemistry Interpretation with a Machine Learning-Based Expert System.

Diagnostics (Basel). 2024 Aug 24;14(17):1853. doi: 10.3390/diagnostics14171853.

Gastric Cancer Image Classification: A Comparative Analysis and Feature Fusion Strategies.

J Imaging. 2024 Aug 10;10(8):195. doi: 10.3390/jimaging10080195.

Benchmarking Deep Learning-Based Image Retrieval of Oral Tumor Histology.

Cureus. 2024 Jun 12;16(6):e62264. doi: 10.7759/cureus.62264. eCollection 2024 Jun.

Current status and prospects of artificial intelligence in breast cancer pathology: convolutional neural networks to prospective Vision Transformers.

Int J Clin Oncol. 2024 Nov;29(11):1648-1668. doi: 10.1007/s10147-024-02513-3. Epub 2024 Apr 15.

Deep Learning Glioma Grading with the Tumor Microenvironment Analysis Protocol for Comprehensive Learning, Discovering, and Quantifying Microenvironmental Features.

J Imaging Inform Med. 2024 Aug;37(4):1711-1727. doi: 10.1007/s10278-024-01008-x. Epub 2024 Feb 27.
