Reina G Anthony, Panchumarthy Ravi, Thakur Siddhesh Pravin, Bastidas Alexei, Bakas Spyridon
Intel Corporation, Santa Clara, CA, United States.
Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA, United States.
Front Neurosci. 2020 Feb 7;14:65. doi: 10.3389/fnins.2020.00065. eCollection 2020.
Convolutional neural network (CNN) models obtain state-of-the-art performance on image classification, localization, and segmentation tasks. Limitations in computer hardware, most notably memory size in deep learning accelerator cards, prevent relatively large images, such as those from medical and satellite imaging, from being processed as a whole in their original resolution. A fully convolutional topology, such as U-Net, is typically trained on down-sampled images and inferred on images of their original size and resolution, by simply dividing the larger image into smaller (typically overlapping) tiles, making predictions on these tiles, and stitching them back together as the prediction for the whole image. In this study, we show that this tiling technique, combined with the fact that CNNs are not truly translationally invariant, causes small but relevant differences during inference that can be detrimental to the performance of the model. Here we quantify these variations in both medical (i.e., BraTS) and non-medical (i.e., satellite) images and show that training a 2D U-Net model on the whole image substantially improves the overall model performance. Finally, we compare 2D and 3D semantic segmentation models to show that providing CNN models with a wider context of the image in all three dimensions leads to more accurate and consistent predictions. Our results suggest that tiling the input to CNN models, while perhaps necessary to overcome the memory limitations in computer hardware, may lead to undesirable and unpredictable errors in the model's output that can only be adequately mitigated by increasing the input of the model to the largest possible tile size.
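The overlapping-tile inference pipeline described above (divide, predict per tile, stitch) can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation; the function names `tile_image` and `stitch` and the average-in-overlap stitching rule are assumptions for demonstration purposes.

```python
import numpy as np

def tile_image(img, tile, stride):
    """Split a 2D image into overlapping square tiles of side `tile`,
    stepping by `stride` (stride < tile produces overlap)."""
    h, w = img.shape
    tiles, coords = [], []
    for y in range(0, h - tile + 1, stride):
        for x in range(0, w - tile + 1, stride):
            tiles.append(img[y:y + tile, x:x + tile])
            coords.append((y, x))
    return tiles, coords

def stitch(pred_tiles, coords, shape, tile):
    """Reassemble per-tile predictions into a full-size map,
    averaging wherever tiles overlap."""
    acc = np.zeros(shape, dtype=float)   # summed predictions
    cnt = np.zeros(shape, dtype=float)   # how many tiles covered each pixel
    for p, (y, x) in zip(pred_tiles, coords):
        acc[y:y + tile, x:x + tile] += p
        cnt[y:y + tile, x:x + tile] += 1
    return acc / cnt

# Example: an 8x8 image split into 4x4 tiles with stride 2 (50% overlap).
# With an identity "model" (prediction == tile), stitching recovers the image.
img = np.arange(64, dtype=float).reshape(8, 8)
tiles, coords = tile_image(img, tile=4, stride=2)
out = stitch(tiles, coords, img.shape, tile=4)
```

If the model were perfectly translationally invariant, the stitched prediction would be identical regardless of tile placement; the paper's point is that real CNN predictions differ near tile borders, so the averaged overlaps only partially hide the discrepancy.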