• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于注意力和多尺度特征融合的三维目标检测。

3D Object Detection Based on Attention and Multi-Scale Feature Fusion.

机构信息

School of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.

出版信息

Sensors (Basel). 2022 May 23;22(10):3935. doi: 10.3390/s22103935.

DOI:10.3390/s22103935
PMID:35632344
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9142975/
Abstract

Three-dimensional object detection in the point cloud can provide more accurate object data for autonomous driving. In this paper, we propose a method named MA-MFFC that uses an attention mechanism and a multi-scale feature fusion network with ConvNeXt module to improve the accuracy of object detection. The multi-attention (MA) module contains point-channel attention and voxel attention, which are used in voxelization and 3D backbone. By considering the point-wise and channel-wise, the attention mechanism enhances the information of key points in voxels, suppresses background point clouds in voxelization, and improves the robustness of the network. The voxel attention module is used in the 3D backbone to obtain more robust and discriminative voxel features. The MFFC module contains the multi-scale feature fusion network and the ConvNeXt module; the multi-scale feature fusion network can extract rich feature information and improve the detection accuracy, and the convolutional layer is replaced with the ConvNeXt module to enhance the feature extraction capability of the network. The experimental results show that the average accuracy is 64.60% for pedestrians and 80.92% for cyclists on the KITTI dataset, which is 1.33% and 2.1% higher, respectively, compared with the baseline network, enabling more accurate detection and localization of more difficult objects.

摘要

点云中的三维目标检测可为自动驾驶提供更准确的目标数据。本文提出了一种名为 MA-MFFC 的方法,它使用注意力机制和具有 ConvNeXt 模块的多尺度特征融合网络来提高目标检测的准确性。多注意力(MA)模块包含点通道注意力和体素注意力,用于体素化和 3D 骨干网络。通过考虑点和通道,可以增强体素中关键点的信息,抑制体素化中的背景点云,提高网络的鲁棒性。体素注意力模块用于 3D 骨干网络中,以获得更稳健和有区别的体素特征。MFFC 模块包含多尺度特征融合网络和 ConvNeXt 模块;多尺度特征融合网络可以提取丰富的特征信息,提高检测精度,并用 ConvNeXt 模块替换卷积层,增强网络的特征提取能力。实验结果表明,在 KITTI 数据集上,行人的平均准确率为 64.60%,自行车的平均准确率为 80.92%,分别比基线网络提高了 1.33%和 2.1%,能够更准确地检测和定位更困难的目标。

相似文献

1
3D Object Detection Based on Attention and Multi-Scale Feature Fusion.基于注意力和多尺度特征融合的三维目标检测。
Sensors (Basel). 2022 May 23;22(10):3935. doi: 10.3390/s22103935.
2
3D Point Cloud Object Detection Method Based on Multi-Scale Dynamic Sparse Voxelization.基于多尺度动态稀疏体素化的三维点云目标检测方法
Sensors (Basel). 2024 Mar 11;24(6):1804. doi: 10.3390/s24061804.
3
AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion.AMFF-Net:一种基于注意力和多尺度特征融合的有效3D目标检测器。
Sensors (Basel). 2023 Nov 22;23(23):9319. doi: 10.3390/s23239319.
4
3D Object Detection under Urban Road Traffic Scenarios Based on Dual-Layer Voxel Features Fusion Augmentation.基于双层体素特征融合增强的城市道路交通场景下的3D目标检测
Sensors (Basel). 2024 May 21;24(11):3267. doi: 10.3390/s24113267.
5
BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera.BAFusion:基于激光雷达和摄像头的用于3D目标检测的双向注意力融合
Sensors (Basel). 2024 Jul 20;24(14):4718. doi: 10.3390/s24144718.
6
PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud.PSANet:用于点云中 3D 目标检测的金字塔分裂与聚合网络。
Sensors (Basel). 2020 Dec 28;21(1):136. doi: 10.3390/s21010136.
7
AEPF: Attention-Enabled Point Fusion for 3D Object Detection.AEPF:用于3D目标检测的注意力增强点融合
Sensors (Basel). 2024 Sep 9;24(17):5841. doi: 10.3390/s24175841.
8
CF-YOLOX: An Autonomous Driving Detection Model for Multi-Scale Object Detection.CF-YOLOX:一种用于多尺度目标检测的自动驾驶检测模型。
Sensors (Basel). 2023 Apr 7;23(8):3794. doi: 10.3390/s23083794.
9
IPD-Net: Infrared Pedestrian Detection Network via Adaptive Feature Extraction and Coordinate Information Fusion.IPD-Net:基于自适应特征提取和坐标信息融合的红外行人检测网络。
Sensors (Basel). 2022 Nov 19;22(22):8966. doi: 10.3390/s22228966.
10
Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection.基于注意力机制的多帧点云特征融合用于3D目标检测
Sensors (Basel). 2022 Oct 2;22(19):7473. doi: 10.3390/s22197473.

引用本文的文献

1
3D Object Detection under Urban Road Traffic Scenarios Based on Dual-Layer Voxel Features Fusion Augmentation.基于双层体素特征融合增强的城市道路交通场景下的3D目标检测
Sensors (Basel). 2024 May 21;24(11):3267. doi: 10.3390/s24113267.
2
Vehicle Detection Algorithms for Autonomous Driving: A Review.用于自动驾驶的车辆检测算法:综述
Sensors (Basel). 2024 May 13;24(10):3088. doi: 10.3390/s24103088.
3
PS5-Net: a medical image segmentation network with multiscale resolution.PS5-Net:一种具有多尺度分辨率的医学图像分割网络。

本文引用的文献

1
Deep Learning for 3D Point Clouds: A Survey.用于三维点云的深度学习:综述
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4338-4364. doi: 10.1109/TPAMI.2020.3005434. Epub 2021 Nov 3.
2
From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network.从点到部件:基于部件感知与部件聚合网络的点云三维目标检测
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2647-2664. doi: 10.1109/TPAMI.2020.2977026. Epub 2021 Jul 1.
3
SECOND: Sparsely Embedded Convolutional Detection.第二:稀疏嵌入卷积检测。
J Med Imaging (Bellingham). 2024 Jan;11(1):014008. doi: 10.1117/1.JMI.11.1.014008. Epub 2024 Feb 19.
4
Optimization of table tennis target detection algorithm guided by multi-scale feature fusion of deep learning.基于深度学习多尺度特征融合的乒乓球目标检测算法优化
Sci Rep. 2024 Jan 16;14(1):1401. doi: 10.1038/s41598-024-51865-3.
5
AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion.AMFF-Net:一种基于注意力和多尺度特征融合的有效3D目标检测器。
Sensors (Basel). 2023 Nov 22;23(23):9319. doi: 10.3390/s23239319.
6
PTA-Det: Point Transformer Associating Point Cloud and Image for 3D Object Detection.PTA-Det:用于 3D 目标检测的点变换关联点云和图像。
Sensors (Basel). 2023 Mar 17;23(6):3229. doi: 10.3390/s23063229.
7
Anti-Noise 3D Object Detection of Multimodal Feature Attention Fusion Based on PV-RCNN.基于PV-RCNN的多模态特征注意力融合抗噪声3D目标检测
Sensors (Basel). 2022 Dec 26;23(1):233. doi: 10.3390/s23010233.
8
A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving.基于深度学习的自动驾驶激光雷达 3D 目标检测研究综述。
Sensors (Basel). 2022 Dec 7;22(24):9577. doi: 10.3390/s22249577.
Sensors (Basel). 2018 Oct 6;18(10):3337. doi: 10.3390/s18103337.