Department of IT, Uppsala University, Uppsala, Sweden.
BioImage Informatics Facility of SciLifeLab, Uppsala, Sweden.
PLoS One. 2020 Mar 12;15(3):e0229839. doi: 10.1371/journal.pone.0229839. eCollection 2020.
There is a limitation in the size of an image that can be processed using computationally demanding methods such as e.g. Convolutional Neural Networks (CNNs). Furthermore, many networks are designed to work with a pre-determined fixed image size. Some imaging modalities-notably biological and medical-can result in images up to a few gigapixels in size, meaning that they have to be divided into smaller parts, or patches, for processing. However, when performing pixel classification, this may lead to undesirable artefacts, such as edge effects in the final re-combined image. We introduce windowing methods from signal processing to effectively reduce such edge effects. With the assumption that the central part of an image patch often holds richer contextual information than its sides and corners, we reconstruct the prediction by overlapping patches that are being weighted depending on 2-dimensional windows. We compare the results of simple averaging and four different windows: Hann, Bartlett-Hann, Triangular and a recently proposed window by Cui et al., and show that the cosine-based Hann window achieves the best improvement as measured by the Structural Similarity Index (SSIM). We also apply the Dice score to show that classification errors close to patch edges are reduced. The proposed windowing method can be used together with any CNN model for segmentation without any modification and significantly improves network predictions.
使用计算密集型方法(例如卷积神经网络 (CNN))处理图像时,存在图像大小的限制。此外,许多网络被设计为使用预定的固定图像大小。一些成像方式——特别是生物和医学成像——会导致大小达到几个千兆像素的图像,这意味着它们必须分割成较小的部分或补丁,以便进行处理。然而,在进行像素分类时,这可能会导致不理想的伪影,例如最终重新组合的图像中的边缘效应。我们从信号处理中引入了窗口化方法,以有效地减少这种边缘效应。假设图像补丁的中心部分通常比其边缘和角落包含更丰富的上下文信息,我们通过重叠补丁并根据二维窗口对其进行加权来重建预测。我们比较了简单平均值和四种不同窗口(汉宁、巴特莱特-汉宁、三角形和最近由 Cui 等人提出的窗口)的结果,并表明余弦汉宁窗口在结构相似性指数 (SSIM) 测量下实现了最佳的改进。我们还应用了 Dice 得分来表明靠近补丁边缘的分类错误减少了。所提出的窗口化方法可以与任何用于分割的 CNN 模型一起使用,而无需任何修改,并显著提高了网络预测的准确性。