Lin Junyu, Huang Guoheng, Huang Jun, Yuan Xiaochen, Zeng Yiwen, Shi Cheng
School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, People's Republic of China.
Guangzhou Red Cross Hospital 510091, People's Republic of China.
Phys Med Biol. 2023 Mar 27;68(7). doi: 10.1088/1361-6560/acc002.
. In the field of endoscopic imaging, Super-Resolution (SR) plays an important role in Manufactured Diagnosis, physicians and machine Automatic Diagnosis. Although many recent studies have been performed, by using deep convolutional neural networks on endoscopic SR, most of the methods have large parameters, which limits their practical application. In addition, almost all of these methods treat each channel equally based on the real-valued domain, without considering the difference among the different channels. Our objective is to design a SR model named Quaternion Attention Multi-scale Widening Network (QAMWN) for endoscopy images to address the above problem.. QAMWN contains a stacked Quaternion Attention Multi-Scale Widening Block, that composed of Multi-scale Feature Widening Aggregation Module (MFWAM) and Quaternion Residual Channel Attention (QRCA). The MFWAM adopts multi-scale architecture with step-wise widening on feature channels for better feature extraction; and in QRCA, quaternion is introduced to construct Residual Channel Attention Mechanism, which obtains adaptively scales features by considering compact cross-channel interactions in the hyper-complex domain.. To verify the efficacy of our method, it is performed on two public endoscopic datasets, CVC ClinicDB and Kvasir dataset. The experimental results show that our proposed method can achieve a better trade-off in model size and performance. More importantly, the proposed QAMWN outperforms previous state-of-the-art methods in both metrics and visualization.. We propose a lightweight SR network for endoscopy and achieves better performance with fewer parameters, which helps in clinical diagnosis of endoscopy.
在内窥镜成像领域,超分辨率(SR)在人工诊断、医生诊断和机器自动诊断中发挥着重要作用。尽管最近进行了许多研究,通过在 endoscopic SR 上使用深度卷积神经网络,但大多数方法参数众多,这限制了它们的实际应用。此外,几乎所有这些方法都基于实值域平等对待每个通道,而没有考虑不同通道之间的差异。我们的目标是设计一种用于内窥镜图像的 SR 模型,名为四元数注意力多尺度加宽网络(QAMWN),以解决上述问题。QAMWN 包含一个堆叠的四元数注意力多尺度加宽块,它由多尺度特征加宽聚合模块(MFWAM)和四元数残差通道注意力(QRCA)组成。MFWAM 采用多尺度架构,在特征通道上逐步加宽以实现更好的特征提取;在 QRCA 中,引入四元数来构建残差通道注意力机制,通过考虑超复数域中的紧凑跨通道交互来自适应地缩放特征。为了验证我们方法的有效性,在两个公共内窥镜数据集 CVC ClinicDB 和 Kvasir 数据集上进行了实验。实验结果表明,我们提出的方法可以在模型大小和性能之间实现更好的权衡。更重要的是,所提出的 QAMWN 在指标和可视化方面均优于先前的最先进方法。我们提出了一种用于内窥镜的轻量级 SR 网络,以更少的参数实现了更好的性能,这有助于内窥镜的临床诊断。