Czerkawski Mikolaj, Upadhyay Priti, Davison Christopher, Atkinson Robert, Michie Craig, Andonovic Ivan, Macdonald Malcolm, Cardona Javier, Tachtatzis Christos
Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK.
Department of Chemical Engineering, University of Strathclyde, Glasgow G1 1XJ, UK.
J Imaging. 2024 Mar 12;10(3):69. doi: 10.3390/jimaging10030069.
There are several image inverse tasks, such as inpainting or super-resolution, which can be solved using deep internal learning, a paradigm that involves employing deep neural networks to find a solution by learning from the sample itself rather than a dataset. For example, Deep Image Prior is a technique based on fitting a convolutional neural network to output the known parts of the image (such as non-inpainted regions or a low-resolution version of the image). However, this approach is not well adjusted for samples composed of multiple modalities. In some domains, such as satellite image processing, accommodating multi-modal representations could be beneficial or even essential. In this work, Multi-Modal Convolutional Parameterisation Network (MCPN) is proposed, where a convolutional neural network approximates shared information between multiple modes by combining a core shared network with modality-specific head networks. The results demonstrate that these approaches can significantly outperform the single-mode adoption of a convolutional parameterisation network on guided image inverse problems of inpainting and super-resolution.
存在几种图像逆任务,例如图像修复或超分辨率,这些任务可以使用深度内部学习来解决,这是一种范式,涉及利用深度神经网络通过从样本本身而非数据集学习来找到解决方案。例如,深度图像先验是一种基于拟合卷积神经网络以输出图像已知部分(如图像的未修复区域或低分辨率版本)的技术。然而,这种方法对于由多种模态组成的样本调整得并不理想。在某些领域,如卫星图像处理中,容纳多模态表示可能是有益的甚至是必不可少的。在这项工作中,提出了多模态卷积参数化网络(MCPN),其中卷积神经网络通过将核心共享网络与特定模态的头部网络相结合来近似多种模态之间的共享信息。结果表明,在图像修复和超分辨率的引导图像逆问题上,这些方法显著优于卷积参数化网络的单模态应用。