• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于小波卷积神经网络的自动驾驶三维目标检测 WCNN3D

WCNN3D: Wavelet Convolutional Neural Network-Based 3D Object Detection for Autonomous Driving.

机构信息

Department of Electrical and Computer Engineering, James Worth Bagley College of Engineering, Mississippi State University, Starkville, MS 39762, USA.

出版信息

Sensors (Basel). 2022 Sep 16;22(18):7010. doi: 10.3390/s22187010.

DOI:10.3390/s22187010
PMID:36146359
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9505771/
Abstract

Three-dimensional object detection is crucial for autonomous driving to understand the driving environment. Since the pooling operation causes information loss in the standard CNN, we designed a wavelet-multiresolution-analysis-based 3D object detection network without a pooling operation. Additionally, instead of using a single filter like the standard convolution, we used the lower-frequency and higher-frequency coefficients as a filter. These filters capture more relevant parts than a single filter, enlarging the receptive field. The model comprises a discrete wavelet transform (DWT) and an inverse wavelet transform (IWT) with skip connections to encourage feature reuse for contrasting and expanding layers. The IWT enriches the feature representation by fully recovering the lost details during the downsampling operation. Element-wise summation was used for the skip connections to decrease the computational burden. We trained the model for the Haar and Daubechies (Db4) wavelets. The two-level wavelet decomposition result shows that we can build a lightweight model without losing significant performance. The experimental results on KITTI's BEV and 3D evaluation benchmark show that our model outperforms the PointPillars-based model by up to 14% while reducing the number of trainable parameters.

摘要

三维目标检测对于自动驾驶理解驾驶环境至关重要。由于池化操作会导致标准 CNN 中的信息丢失,因此我们设计了一种基于小波多分辨率分析的无池化操作的 3D 目标检测网络。此外,我们没有像标准卷积那样使用单个滤波器,而是使用低频和高频系数作为滤波器。这些滤波器比单个滤波器捕捉到更多相关部分,扩大了感受野。该模型包含离散小波变换(DWT)和带有跳过连接的逆小波变换(IWT),以鼓励特征复用,用于对比和扩展层。IWT 通过完全恢复下采样操作中丢失的细节来丰富特征表示。跳过连接使用元素级求和来降低计算负担。我们针对 Haar 和 Daubechies(Db4)小波训练了该模型。两级小波分解结果表明,我们可以在不损失显著性能的情况下构建轻量级模型。在 KITTI 的 BEV 和 3D 评估基准上的实验结果表明,我们的模型在减少可训练参数的同时,比基于 PointPillars 的模型性能提高了 14%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/2145677b1004/sensors-22-07010-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/4517a730b512/sensors-22-07010-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/62801dab3cee/sensors-22-07010-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/1ea10705aa0e/sensors-22-07010-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/2145677b1004/sensors-22-07010-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/4517a730b512/sensors-22-07010-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/62801dab3cee/sensors-22-07010-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/1ea10705aa0e/sensors-22-07010-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/368a/9505771/2145677b1004/sensors-22-07010-g004.jpg

相似文献

1
WCNN3D: Wavelet Convolutional Neural Network-Based 3D Object Detection for Autonomous Driving.基于小波卷积神经网络的自动驾驶三维目标检测 WCNN3D
Sensors (Basel). 2022 Sep 16;22(18):7010. doi: 10.3390/s22187010.
2
WaveCNet: Wavelet Integrated CNNs to Suppress Aliasing Effect for Noise-Robust Image Classification.WaveCNet:用于抑制抗噪图像分类中的混叠效应的小波集成 CNNs。
IEEE Trans Image Process. 2021;30:7074-7089. doi: 10.1109/TIP.2021.3101395. Epub 2021 Aug 10.
3
Nested DWT-Based CNN Architecture for Monocular Depth Estimation.基于嵌套 DWT 的单目深度估计卷积神经网络架构。
Sensors (Basel). 2023 Mar 13;23(6):3066. doi: 10.3390/s23063066.
4
An Efficient Methodology for Brain MRI Classification Based on DWT and Convolutional Neural Network.基于 DWT 和卷积神经网络的脑 MRI 分类高效方法。
Sensors (Basel). 2021 Nov 10;21(22):7480. doi: 10.3390/s21227480.
5
Using discrete wavelet transform for optimizing COVID-19 new cases and deaths prediction worldwide with deep neural networks.利用离散小波变换和深度神经网络优化全球 COVID-19 新发病例和死亡预测。
PLoS One. 2023 Apr 6;18(4):e0282621. doi: 10.1371/journal.pone.0282621. eCollection 2023.
6
A novel wavelet decomposition and transformation convolutional neural network with data augmentation for breast cancer detection using digital mammogram.一种基于小波分解和变换的新型卷积神经网络,并结合数据增强技术,用于数字乳腺 X 线摄影的乳腺癌检测。
Sci Rep. 2022 Apr 8;12(1):5913. doi: 10.1038/s41598-022-09905-3.
7
Leg motion classification with artificial neural networks using wavelet-based features of gyroscope signals.基于陀螺仪信号的小波特征的人工神经网络腿部运动分类。
Sensors (Basel). 2011;11(2):1721-43. doi: 10.3390/s110201721. Epub 2011 Jan 28.
8
Electrocardiogram signals de-noising using lifting-based discrete wavelet transform.基于提升小波变换的心电图信号去噪
Comput Biol Med. 2004 Sep;34(6):479-93. doi: 10.1016/S0010-4825(03)00090-8.
9
Analysis of pattern electroretinogram signals of early primary open-angle glaucoma in discrete wavelet transform coefficients domain.离散小波变换系数域中早期原发性开角型青光眼图形视网膜电图信号分析
Int Ophthalmol. 2019 Oct;39(10):2373-2383. doi: 10.1007/s10792-019-01077-w. Epub 2019 Feb 6.
10
LiDAR-Based 3D Temporal Object Detection via Motion-Aware LiDAR Feature Fusion.基于激光雷达的三维目标检测:通过运动感知激光雷达特征融合实现的时间目标检测
Sensors (Basel). 2024 Jul 18;24(14):4667. doi: 10.3390/s24144667.

引用本文的文献

1
A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving.基于深度学习的自动驾驶激光雷达 3D 目标检测研究综述。
Sensors (Basel). 2022 Dec 7;22(24):9577. doi: 10.3390/s22249577.
2
Off-Road Detection Analysis for Autonomous Ground Vehicles: A Review.自主地面车辆的越野检测分析:综述
Sensors (Basel). 2022 Nov 3;22(21):8463. doi: 10.3390/s22218463.
3
Class-Aware Fish Species Recognition Using Deep Learning for an Imbalanced Dataset.使用深度学习对不均衡数据集进行类别感知的鱼类物种识别

本文引用的文献

1
A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving.基于深度学习的自动驾驶激光雷达 3D 目标检测研究综述。
Sensors (Basel). 2022 Dec 7;22(24):9577. doi: 10.3390/s22249577.
2
Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds.体素特征金字塔网络(Voxel-FPN):用于从激光雷达点云进行三维目标检测的多尺度体素特征聚合
Sensors (Basel). 2020 Jan 28;20(3):704. doi: 10.3390/s20030704.
3
SECOND: Sparsely Embedded Convolutional Detection.第二:稀疏嵌入卷积检测。
Sensors (Basel). 2022 Oct 28;22(21):8268. doi: 10.3390/s22218268.
Sensors (Basel). 2018 Oct 6;18(10):3337. doi: 10.3390/s18103337.
4
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.