用于提高计算病理学中深度卷积神经网络泛化能力的染色不变特征

Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology.

作者信息

Otálora Sebastian, Atzori Manfredo, Andrearczyk Vincent, Khan Amjad, Müller Henning

机构信息

Institute of Information Systems, HES-SO University of Applied Sciences and Arts Western Switzerland, Sierre, Switzerland.

Computer Science Centre (CUI), University of Geneva, Geneva, Switzerland.

出版信息

Front Bioeng Biotechnol. 2019 Aug 23;7:198. doi: 10.3389/fbioe.2019.00198. eCollection 2019.

DOI:10.3389/fbioe.2019.00198

PMID:31508414

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6716536/

Abstract

One of the main obstacles for the implementation of deep convolutional neural networks (DCNNs) in the clinical pathology workflow is their low capability to overcome variability in slide preparation and scanner configuration, that leads to changes in tissue appearance. Some of these variations may not be not included in the training data, which means that the models have a risk to not generalize well. Addressing such variations and evaluating them in reproducible scenarios allows understanding of when the models generalize better, which is crucial for performance improvements and better DCNN models. Staining normalization techniques (often based on color deconvolution and deep learning) and color augmentation approaches have shown improvements in the generalization of the classification tasks for several tissue types. Domain-invariant training of DCNN's is also a promising technique to address the problem of training a single model for different domains, since it includes the source domain information to guide the training toward domain-invariant features, achieving state-of-the-art results in classification tasks. In this article, deep domain adaptation in convolutional networks (DANN) is applied to computational pathology and compared with widely used staining normalization and color augmentation methods in two challenging classification tasks. The classification tasks rely on two openly accessible datasets, targeting Gleason grading in prostate cancer, and mitosis classification in breast tissue. The benchmark of the different techniques and their combination in two DCNN architectures allows us to assess the generalization abilities and advantages of each method in the considered classification tasks. The code for reproducing our experiments and preprocessing the data is publicly available. Quantitative and qualitative results show that the use of DANN helps model generalization to external datasets. The combination of several techniques to manage color heterogeneity suggests that several methods together, such as color augmentation methods with DANN training, can generalize even further. The results do not show a single best technique among the considered methods, even when combining them. However, color augmentation and DANN training obtain most often the best results (alone or combined with color normalization and color augmentation). The statistical significance of the results and the embeddings visualizations provide useful insights to design DCNN that generalizes to unseen staining appearances. Furthermore, in this work, we release for the first time code for DANN evaluation in open access datasets for computational pathology. This work opens the possibility for further research on using DANN models together with techniques that can overcome the tissue preparation differences across datasets to tackle limited generalization.

摘要

在临床病理工作流程中，深度卷积神经网络（DCNN）应用的主要障碍之一是其克服载玻片制备和扫描仪配置差异的能力较低，这会导致组织外观发生变化。其中一些变化可能未包含在训练数据中，这意味着模型存在泛化能力不佳的风险。在可重复的场景中解决此类变化并对其进行评估，有助于理解模型何时能更好地泛化，这对于性能提升和更好的DCNN模型至关重要。染色归一化技术（通常基于颜色反卷积和深度学习）以及颜色增强方法已在多种组织类型的分类任务泛化方面取得了进展。DCNN的域不变训练也是一种有前景的技术，可解决针对不同域训练单个模型的问题，因为它包含源域信息以引导训练朝着域不变特征进行，在分类任务中取得了最优结果。在本文中，卷积网络中的深度域自适应（DANN）被应用于计算病理学，并在两项具有挑战性的分类任务中与广泛使用的染色归一化和颜色增强方法进行比较。分类任务依赖于两个可公开获取的数据集，分别针对前列腺癌的Gleason分级和乳腺组织的有丝分裂分类。在两种DCNN架构中对不同技术及其组合进行基准测试，使我们能够评估每种方法在考虑的分类任务中的泛化能力和优势。用于重现我们实验和预处理数据的代码是公开可用的。定量和定性结果表明，使用DANN有助于模型泛化到外部数据集。多种管理颜色异质性技术的组合表明，多种方法一起使用，例如颜色增强方法与DANN训练相结合，可以实现更进一步的泛化。即使在组合使用时，结果也未显示在所考虑的方法中有单一的最佳技术。然而，颜色增强和DANN训练最常获得最佳结果（单独使用或与颜色归一化和颜色增强相结合）。结果的统计显著性和嵌入可视化可为设计能泛化到未见染色外观的DCNN提供有用的见解。此外，在这项工作中，我们首次在开放获取的计算病理学数据集中发布了用于DANN评估的代码。这项工作为进一步研究将DANN模型与能够克服跨数据集组织制备差异的技术结合起来以解决有限泛化问题开辟了可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6f4/6716536/3d814e034a0d/fbioe-07-00198-g0001.jpg

相似文献

Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology.

Front Bioeng Biotechnol. 2019 Aug 23;7:198. doi: 10.3389/fbioe.2019.00198. eCollection 2019.

Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology.

Med Image Anal. 2019 Dec;58:101544. doi: 10.1016/j.media.2019.101544. Epub 2019 Aug 21.

The role of unpaired image-to-image translation for stain color normalization in colorectal cancer histology classification.

Comput Methods Programs Biomed. 2023 Jun;234:107511. doi: 10.1016/j.cmpb.2023.107511. Epub 2023 Mar 26.

Semi-supervised training of deep convolutional neural networks with heterogeneous data and few local annotations: An experiment on prostate histopathology image classification.

Med Image Anal. 2021 Oct;73:102165. doi: 10.1016/j.media.2021.102165. Epub 2021 Jul 14.

Learning Domain-Invariant Representations of Histological Images.

Front Med (Lausanne). 2019 Jul 16;6:162. doi: 10.3389/fmed.2019.00162. eCollection 2019.

Generalizing Deep Learning for Medical Image Segmentation to Unseen Domains via Deep Stacked Transformation.

IEEE Trans Med Imaging. 2020 Jul;39(7):2531-2540. doi: 10.1109/TMI.2020.2973595. Epub 2020 Feb 12.

Data-driven color augmentation for H&E stained images in computational pathology.

J Pathol Inform. 2023 Jan 3;14:100183. doi: 10.1016/j.jpi.2022.100183. eCollection 2023.

Learning generalizable AI models for multi-center histopathology image classification.

NPJ Precis Oncol. 2024 Jul 19;8(1):151. doi: 10.1038/s41698-024-00652-4.

Improving domain generalization performance for medical image segmentation via random feature augmentation.

Methods. 2023 Oct;218:149-157. doi: 10.1016/j.ymeth.2023.08.003. Epub 2023 Aug 10.

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.

J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.

引用本文的文献

Learning generalizable AI models for multi-center histopathology image classification.

NPJ Precis Oncol. 2024 Jul 19;8(1):151. doi: 10.1038/s41698-024-00652-4.

Domain generalization for retinal vessel segmentation via Hessian-based vector field.

Med Image Anal. 2024 Jul;95:103164. doi: 10.1016/j.media.2024.103164. Epub 2024 Apr 6.

Computational pathology: A survey review and the way forward.

J Pathol Inform. 2024 Jan 14;15:100357. doi: 10.1016/j.jpi.2023.100357. eCollection 2024 Dec.

Deep Learning Methodologies Applied to Digital Pathology in Prostate Cancer: A Systematic Review.

Diagnostics (Basel). 2023 Aug 14;13(16):2676. doi: 10.3390/diagnostics13162676.

CNN stability training improves robustness to scanner and IHC-based image variability for epithelium segmentation in cervical histology.

Front Med (Lausanne). 2023 Jul 5;10:1173616. doi: 10.3389/fmed.2023.1173616. eCollection 2023.

Histopathology images predict multi-omics aberrations and prognoses in colorectal cancer patients.

Nat Commun. 2023 Apr 13;14(1):2102. doi: 10.1038/s41467-023-37179-4.

Data-driven color augmentation for H&E stained images in computational pathology.

J Pathol Inform. 2023 Jan 3;14:100183. doi: 10.1016/j.jpi.2022.100183. eCollection 2023.

Enhanced Pathology Image Quality with Restore-Generative Adversarial Network.

Am J Pathol. 2023 Apr;193(4):404-416. doi: 10.1016/j.ajpath.2022.12.011. Epub 2023 Jan 18.

Breast Cancer Dataset, Classification and Detection Using Deep Learning.

Healthcare (Basel). 2022 Nov 29;10(12):2395. doi: 10.3390/healthcare10122395.

Impact of scanner variability on lymph node segmentation in computational pathology.

J Pathol Inform. 2022 Jul 25;13:100127. doi: 10.1016/j.jpi.2022.100127. eCollection 2022.

本文引用的文献

Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology.

Med Image Anal. 2019 Dec;58:101544. doi: 10.1016/j.media.2019.101544. Epub 2019 Aug 21.

Development and validation of a deep learning algorithm for improving Gleason scoring of prostate cancer.

NPJ Digit Med. 2019 Jun 7;2:48. doi: 10.1038/s41746-019-0112-2. eCollection 2019.

Predicting breast tumor proliferation from whole-slide images: The TUPAC16 challenge.

Med Image Anal. 2019 May;54:111-121. doi: 10.1016/j.media.2019.02.012. Epub 2019 Feb 27.

From Detection of Individual Metastases to Classification of Lymph Node Status at the Patient Level: The CAMELYON17 Challenge.

IEEE Trans Med Imaging. 2019 Feb;38(2):550-560. doi: 10.1109/TMI.2018.2867350.

Adversarial Domain Adaptation for Classification of Prostate Histopathology Whole-Slide Images.

Med Image Comput Comput Assist Interv. 2018 Sep;11071:201-209. doi: 10.1007/978-3-030-00934-2_23. Epub 2018 Sep 26.

Digital pathology image analysis: opportunities and challenges.

Imaging Med. 2009;1(1):7-10. doi: 10.2217/IIM.09.9.

Automated Gleason grading of prostate cancer tissue microarrays via deep learning.

Sci Rep. 2018 Aug 13;8(1):12054. doi: 10.1038/s41598-018-30535-1.

A study about color normalization methods for histopathology images.

Micron. 2018 Nov;114:42-61. doi: 10.1016/j.micron.2018.07.005. Epub 2018 Aug 1.

Segmentation of glandular epithelium in colorectal tumours to automatically compartmentalise IHC biomarker quantification: A deep learning approach.

Med Image Anal. 2018 Oct;49:35-45. doi: 10.1016/j.media.2018.07.004. Epub 2018 Jul 12.

Whole-Slide Mitosis Detection in H&E Breast Histology Using PHH3 as a Reference to Train Distilled Stain-Invariant Convolutional Networks.

IEEE Trans Med Imaging. 2018 Sep;37(9):2126-2136. doi: 10.1109/TMI.2018.2820199. Epub 2018 Sep 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于提高计算病理学中深度卷积神经网络泛化能力的染色不变特征

Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献