He Xu, Yin Yong
School of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China.
Sensors (Basel). 2021 May 10;21(9):3281. doi: 10.3390/s21093281.
Recently, deep learning-based techniques have shown great power in image inpainting especially dealing with squared holes. However, they fail to generate plausible results inside the missing regions for irregular and large holes as there is a lack of understanding between missing regions and existing counterparts. To overcome this limitation, we combine two non-local mechanisms including a contextual attention module (CAM) and an implicit diversified Markov random fields (ID-MRF) loss with a multi-scale architecture which uses several dense fusion blocks (DFB) based on the dense combination of dilated convolution to guide the generative network to restore discontinuous and continuous large masked areas. To prevent color discrepancies and grid-like artifacts, we apply the ID-MRF loss to improve the visual appearance by comparing similarities of long-distance feature patches. To further capture the long-term relationship of different regions in large missing regions, we introduce the CAM. Although CAM has the ability to create plausible results via reconstructing refined features, it depends on initial predicted results. Hence, we employ the DFB to obtain larger and more effective receptive fields, which benefits to predict more precise and fine-grained information for CAM. Extensive experiments on two widely-used datasets demonstrate that our proposed framework significantly outperforms the state-of-the-art approaches both in quantity and quality.
最近,基于深度学习的技术在图像修复尤其是处理方形孔洞方面展现出了强大的能力。然而,对于不规则的大孔洞,它们在缺失区域内无法生成合理的结果,因为缺失区域和现有对应区域之间缺乏理解。为了克服这一限制,我们将两种非局部机制相结合,包括上下文注意力模块(CAM)和隐式多样化马尔可夫随机场(ID-MRF)损失,并采用了一种多尺度架构,该架构使用了基于扩张卷积的密集组合的几个密集融合块(DFB)来引导生成网络恢复不连续和连续的大掩码区域。为了防止颜色差异和网格状伪影,我们应用ID-MRF损失,通过比较远距离特征补丁的相似性来改善视觉外观。为了进一步捕捉大缺失区域中不同区域的长期关系,我们引入了CAM。尽管CAM有能力通过重建精细特征来创建合理的结果,但它依赖于初始预测结果。因此,我们使用DFB来获得更大、更有效的感受野,这有助于为CAM预测更精确和细粒度的信息。在两个广泛使用的数据集上进行的大量实验表明,我们提出的框架在数量和质量上均显著优于当前的先进方法。