DDCNet: Deep Dilated Convolutional Neural Network for Dense Prediction.

Authors

Salehi Ali, Balasubramanian Madhusudhanan

Affiliation

Department of Electrical and Computer Engineering, The University of Memphis, Memphis TN 38152.

Publication information

Neurocomputing (Amst). 2023 Feb 28;523:116-129. doi: 10.1016/j.neucom.2022.12.024. Epub 2022 Dec 15.

DOI:10.1016/j.neucom.2022.12.024

PMID:37332394

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10275502/

Abstract

Dense pixel matching problems such as optical flow and disparity estimation are among the most challenging tasks in computer vision. Recently, several deep learning methods designed for these problems have been successful. A sufficiently large effective receptive field (ERF) and a high spatial resolution of features within a network are essential for providing higher-resolution dense estimates. In this work, we present a systematic approach to designing network architectures that provide a larger receptive field while maintaining a higher spatial feature resolution. To achieve a larger ERF, we used dilated convolutional layers. By aggressively increasing dilation rates in the deeper layers, we achieved a sufficiently large ERF with significantly fewer trainable parameters. We used the optical flow estimation problem as the primary benchmark to illustrate our network design strategy. The benchmark results (Sintel, KITTI, and Middlebury) indicate that our compact networks achieve performance comparable to that of other networks in their class.
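
The trade-off described in the abstract can be illustrated with a small calculation. The sketch below computes the theoretical receptive field of a stack of stride-1 convolutions, which is an upper bound on the ERF rather than the ERF itself. The kernel size, channel width, layer count, and the doubling dilation schedule are illustrative assumptions and are not taken from the DDCNet architecture.

```python
def receptive_field(kernel_size, dilations):
    """Theoretical receptive field (pixels along one axis) of stacked
    stride-1 convolutions, one layer per entry in `dilations`."""
    rf = 1
    for d in dilations:
        # Each layer widens the field by (kernel_size - 1) * dilation.
        rf += (kernel_size - 1) * d
    return rf


def conv_params(kernel_size, channels, num_layers):
    """Weights plus biases of `num_layers` square convolutions with
    `channels` input and output channels. Dilation adds no parameters."""
    per_layer = kernel_size * kernel_size * channels * channels + channels
    return num_layers * per_layer


def plain_layers_for(target_rf, kernel_size):
    """Number of undilated stride-1 layers needed to reach `target_rf`."""
    return -(-(target_rf - 1) // (kernel_size - 1))  # ceiling division


# Hypothetical settings chosen only for illustration.
KERNEL, CHANNELS = 3, 64
plain = [1] * 7                      # seven ordinary 3x3 layers
dilated = [1, 2, 4, 8, 16, 32, 64]   # dilation doubles with depth

rf_plain = receptive_field(KERNEL, plain)
rf_dilated = receptive_field(KERNEL, dilated)
params = conv_params(KERNEL, CHANNELS, len(dilated))
layers_needed = plain_layers_for(rf_dilated, KERNEL)

print(rf_plain)       # 15
print(rf_dilated)     # 255
print(params)         # 258496, identical for both seven-layer stacks
print(layers_needed)  # 127 plain layers to match the dilated stack
```

Under these assumptions both seven-layer stacks have the same parameter count and keep full spatial resolution, but the dilated stack covers 255 pixels per axis instead of 15. Reaching the same coverage without dilation would take 127 layers, which is the parameter saving the abstract refers to.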
