HDConv：基于异构核的扩张卷积。

HDConv: Heterogeneous kernel-based dilated convolutions.

机构信息

College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, 310014, PR China; Key Laboratory of Visual Media Intelligent Processing Technology of Zhejiang Province, Hangzhou 310023, PR China.

School of Mathematical and Computer Science, Zhejiang A & F University, Hangzhou 311300, PR China.

出版信息

Neural Netw. 2024 Nov;179:106568. doi: 10.1016/j.neunet.2024.106568. Epub 2024 Jul 23.

DOI:10.1016/j.neunet.2024.106568

PMID:39089152

Abstract

Dilated convolution has been widely used in various computer vision tasks due to its ability to expand the receptive field while maintaining the resolution of feature maps. However, the critical challenge is the gridding problem caused by the isomorphic structure of the dilated convolution, where the holes filled in the dilated convolution destroy the integrity of the extracted information and cut off the relevance of neighboring pixels. In this work, a novel heterogeneous dilated convolution, called HDConv, is proposed to address this issue by setting independent dilation rates on grouped channels while keeping the general convolution operation. The heterogeneous structure can effectively avoid the gridding problem while introducing multi-scale kernels in the filters. Based on the heterogeneous structure of the proposed HDConv, we also explore the benefit of large receptive fields to feature extraction by comparing different combinations of dilated rates. Finally, a series of experiments are conducted to verify the effectiveness of some computer vision tasks, such as image segmentation and object detection. The results show the proposed HDConv can achieve a competitive performance on ADE20K, Cityscapes, COCO-Stuff10k, COCO, and a medical image dataset UESTC-COVID-19. The proposed module can readily replace conventional convolutions in existing convolutional neural networks (i.e., plug-and-play), and it is promising to further extend dilated convolution to wider scenarios in the field of image segmentation.

摘要

扩张卷积由于能够在保持特征图分辨率的同时扩大感受野，因此在各种计算机视觉任务中得到了广泛应用。然而，其面临的关键挑战是扩张卷积的同构结构引起的网格问题，其中扩张卷积中填充的空洞会破坏提取信息的完整性，并切断相邻像素之间的相关性。在这项工作中，提出了一种新的非均匀扩张卷积（HDConv），通过在分组通道上设置独立的扩张率，同时保持一般卷积操作，可以解决这个问题。这种非均匀结构可以有效地避免网格问题，同时在滤波器中引入多尺度核。基于所提出的 HDConv 的非均匀结构，我们还通过比较不同扩张率的组合，探讨了大感受野对特征提取的益处。最后，进行了一系列实验来验证一些计算机视觉任务的有效性，如图像分割和目标检测。结果表明，所提出的 HDConv 在 ADE20K、Cityscapes、COCO-Stuff10k、COCO 和 UESTC-COVID-19 医疗图像数据集上具有竞争力的性能。所提出的模块可以很容易地替代现有卷积神经网络中的常规卷积（即即插即用），并且有望将扩张卷积进一步扩展到图像分割领域的更广泛场景。

相似文献

HDConv: Heterogeneous kernel-based dilated convolutions.HDConv：基于异构核的扩张卷积。

Neural Netw. 2024 Nov;179:106568. doi: 10.1016/j.neunet.2024.106568. Epub 2024 Jul 23.

Dilated Heterogeneous Convolution for Cell Detection and Segmentation Based on Mask R-CNN.基于掩码区域卷积神经网络的用于细胞检测与分割的扩张异构卷积

Sensors (Basel). 2024 Apr 10;24(8):2424. doi: 10.3390/s24082424.

Stacked dilated convolutions and asymmetric architecture for U-Net-based medical image segmentation.基于 U-Net 的医学图像分割的堆叠扩张卷积和非对称架构。

Comput Biol Med. 2022 Sep;148:105891. doi: 10.1016/j.compbiomed.2022.105891. Epub 2022 Jul 21.

Fully connected network with multi-scale dilation convolution module in evaluating atrial septal defect based on MRI segmentation.基于 MRI 分割的全连接网络与多尺度扩张卷积模块评估房间隔缺损

Comput Methods Programs Biomed. 2022 Mar;215:106608. doi: 10.1016/j.cmpb.2021.106608. Epub 2022 Jan 11.

HDA-Net: A novel dual gated attention network using comprehensive hybrid dilated convolutions for medical image segmentation.HDA-Net：一种使用综合混合扩张卷积的新型双门控注意力网络用于医学图像分割。

Comput Biol Med. 2023 Jan;152:106384. doi: 10.1016/j.compbiomed.2022.106384. Epub 2022 Nov 30.

Deeply supervised 3D fully convolutional networks with group dilated convolution for automatic MRI prostate segmentation.基于深度监督的三维全卷积网络与分组空洞卷积在自动 MRI 前列腺分割中的应用。

Med Phys. 2019 Apr;46(4):1707-1718. doi: 10.1002/mp.13416. Epub 2019 Feb 19.

Compressed sensing MRI via a multi-scale dilated residual convolution network.基于多尺度扩张残差卷积网络的压缩感知磁共振成像。

Magn Reson Imaging. 2019 Nov;63:93-104. doi: 10.1016/j.mri.2019.07.014. Epub 2019 Jul 27.

QGD-Net: A Lightweight Model Utilizing Pixels of Affinity in Feature Layer for Dermoscopic Lesion Segmentation.QGD-Net：一种利用特征层像素关联的轻量级模型进行皮肤镜病变分割。

IEEE J Biomed Health Inform. 2023 Dec;27(12):5982-5993. doi: 10.1109/JBHI.2023.3320953. Epub 2023 Dec 5.

OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.OBELISK-Net：稀疏可变形卷积解决三维多器官分割问题，所需层数更少。

Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.

A Multi-Scale Context Aware Attention Model for Medical Image Segmentation.一种用于医学图像分割的多尺度上下文感知注意力模型。

IEEE J Biomed Health Inform. 2023 Aug;27(8):3731-3739. doi: 10.1109/JBHI.2022.3227540. Epub 2023 Aug 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

HDConv：基于异构核的扩张卷积。

HDConv: Heterogeneous kernel-based dilated convolutions.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献