Cao Jinming, Li Yangyan, Sun Mingchao, Chen Ying, Lischinski Dani, Cohen-Or Daniel, Chen Baoquan, Tu Changhe
IEEE Trans Image Process. 2022;31:3726-3736. doi: 10.1109/TIP.2022.3175432. Epub 2022 May 26.
Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization: it adds learnable parameters, while the resulting linear operation can still be expressed by a single convolutional layer. We refer to this depthwise over-parameterized convolutional layer as DO-Conv, a novel form of over-parameterization. We show with extensive experiments that merely replacing conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation. Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, so the computation is exactly equivalent to that of a convolutional layer without over-parameterization. As DO-Conv introduces performance gains without any increase in computational complexity for inference, we advocate it as an alternative to the conventional convolutional layer. We have open-sourced an implementation of DO-Conv in TensorFlow, PyTorch, and GluonCV at https://github.com/yangyanli/DO-Conv.
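To make the folding mechanism concrete, below is a minimal sketch of the DO-Conv idea in PyTorch. This is not the authors' official code (see the repository above for that); the class name `DOConv2d`, the helper `fold_kernel`, and the choice of the minimal setting D_mul = M×N are illustrative assumptions based on the abstract's description of composing a per-channel depthwise kernel with a conventional kernel and folding the two into one kernel for inference.

```python
# Minimal DO-Conv sketch (illustrative, not the authors' official implementation).
import torch
import torch.nn as nn
import torch.nn.functional as F

class DOConv2d(nn.Module):
    """Conv2d over-parameterized with an extra depthwise kernel D.

    Trainable parameters:
      W: (C_out, C_in, D_mul)  -- conventional kernel, flattened spatially
      D: (C_in, M*N, D_mul)    -- one depthwise kernel per input channel
    The two are folded into a single (C_out, C_in, M, N) kernel, so the
    convolution itself costs exactly the same as an ordinary Conv2d.
    """
    def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0):
        super().__init__()
        M = N = kernel_size
        self.M, self.N = M, N
        self.stride, self.padding = stride, padding
        D_mul = M * N  # minimal over-parameterization (an assumed setting)
        self.W = nn.Parameter(torch.empty(out_channels, in_channels, D_mul))
        nn.init.kaiming_uniform_(self.W, a=5 ** 0.5)
        # Initialize D to the identity so the layer starts as a plain conv.
        eye = torch.eye(M * N).unsqueeze(0).repeat(in_channels, 1, 1)
        self.D = nn.Parameter(eye)  # shape (C_in, M*N, D_mul)

    def fold_kernel(self):
        # W'[o, c, s] = sum_d D[c, s, d] * W[o, c, d]
        W_prime = torch.einsum('csd,ocd->ocs', self.D, self.W)
        return W_prime.reshape(self.W.shape[0], self.W.shape[1], self.M, self.N)

    def forward(self, x):
        # Fold, then run a single ordinary convolution.
        return F.conv2d(x, self.fold_kernel(), stride=self.stride, padding=self.padding)

# Usage: drop-in replacement for a 3x3 conv.
layer = DOConv2d(16, 32, kernel_size=3, padding=1)
y = layer(torch.randn(1, 16, 64, 64))  # -> shape (1, 32, 64, 64)
```

For deployment, `fold_kernel()` would be evaluated once and the result stored as an ordinary convolution weight, so inference incurs no extra parameters or FLOPs relative to a conventional convolutional layer.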