Suppr超能文献

基于激光雷达的三维目标检测:通过运动感知激光雷达特征融合实现的时间目标检测

LiDAR-Based 3D Temporal Object Detection via Motion-Aware LiDAR Feature Fusion.

作者信息

Park Gyuhee, Koh Junho, Kim Jisong, Moon Jun, Choi Jun Won

机构信息

Department of Electrical Engineering, Hanyang University, Seoul 04763, Republic of Korea.

Department of Electrical and Computer Engineering, College of Liberal Studies, Seoul National University, Seoul 08826, Republic of Korea.

出版信息

Sensors (Basel). 2024 Jul 18;24(14):4667. doi: 10.3390/s24144667.

Abstract

Recently, the growing demand for autonomous driving in the industry has led to a lot of interest in 3D object detection, resulting in many excellent 3D object detection algorithms. However, most 3D object detectors focus only on a single set of LiDAR points, ignoring their potential ability to improve performance by leveraging the information provided by the consecutive set of LIDAR points. In this paper, we propose a novel 3D object detection method called temporal motion-aware 3D object detection (TM3DOD), which utilizes temporal LiDAR data. In the proposed TM3DOD method, we aggregate LiDAR voxels over time and the current BEV features by generating motion features using consecutive BEV feature maps. First, we present the temporal voxel encoder (TVE), which generates voxel representations by capturing the temporal relationships among the point sets within a voxel. Next, we design a motion-aware feature aggregation network (MFANet), which aims to enhance the current BEV feature representation by quantifying the temporal variation between two consecutive BEV feature maps. By analyzing the differences and changes in the BEV feature maps over time, MFANet captures motion information and integrates it into the current feature representation, enabling more robust and accurate detection of 3D objects. Experimental evaluations on the nuScenes benchmark dataset demonstrate that the proposed TM3DOD method achieved significant improvements in 3D detection performance compared with the baseline methods. Additionally, our method achieved comparable performance to state-of-the-art approaches.

摘要

近年来,行业内对自动驾驶的需求不断增长,引发了人们对三维目标检测的浓厚兴趣,催生了许多优秀的三维目标检测算法。然而,大多数三维目标检测器仅关注单组激光雷达点,忽略了利用连续激光雷达点集所提供信息来提升性能的潜在能力。在本文中,我们提出了一种名为时态运动感知三维目标检测(TM3DOD)的新型三维目标检测方法,该方法利用了时态激光雷达数据。在所提出的TM3DOD方法中,我们通过使用连续的鸟瞰图(BEV)特征图生成运动特征,对不同时刻的激光雷达体素和当前的BEV特征进行聚合。首先,我们提出了时态体素编码器(TVE),它通过捕捉体素内点集之间的时态关系来生成体素表示。接下来,我们设计了一个运动感知特征聚合网络(MFANet),其目的是通过量化两个连续BEV特征图之间的时态变化来增强当前的BEV特征表示。通过分析BEV特征图随时间的差异和变化,MFANet捕捉运动信息并将其整合到当前特征表示中,从而实现对三维目标更稳健、准确的检测。在nuScenes基准数据集上的实验评估表明,与基线方法相比,所提出的TM3DOD方法在三维检测性能上取得了显著提升。此外,我们的方法与当前最先进的方法性能相当。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/339a/11280692/985fc2e1d30d/sensors-24-04667-g001.jpg

相似文献

1
LiDAR-Based 3D Temporal Object Detection via Motion-Aware LiDAR Feature Fusion.
Sensors (Basel). 2024 Jul 18;24(14):4667. doi: 10.3390/s24144667.
2
LiDAR-Based Intensity-Aware Outdoor 3D Object Detection.
Sensors (Basel). 2024 May 6;24(9):2942. doi: 10.3390/s24092942.
3
PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud.
Sensors (Basel). 2020 Dec 28;21(1):136. doi: 10.3390/s21010136.
4
AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion.
Sensors (Basel). 2023 Nov 22;23(23):9319. doi: 10.3390/s23239319.
6
A New 3D Object Pose Detection Method Using LIDAR Shape Set.
Sensors (Basel). 2018 Mar 16;18(3):882. doi: 10.3390/s18030882.
7
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking With Transformer.
IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5921-5935. doi: 10.1109/TPAMI.2024.3373693. Epub 2024 Aug 6.
8
Fully Sparse Fusion for 3D Object Detection.
IEEE Trans Pattern Anal Mach Intell. 2024 Nov;46(11):7217-7231. doi: 10.1109/TPAMI.2024.3392303. Epub 2024 Oct 3.
9
A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving.
Sensors (Basel). 2022 Dec 7;22(24):9577. doi: 10.3390/s22249577.
10
MS3D: A 3D object detection method using multi-scale semantic feature points to construct 3D feature layer.
Neural Netw. 2024 Nov;179:106623. doi: 10.1016/j.neunet.2024.106623. Epub 2024 Aug 10.

引用本文的文献

1
2
LiDAR-Based Negative Obstacle Detection for Unmanned Ground Vehicles in Orchards.
Sensors (Basel). 2024 Dec 11;24(24):7929. doi: 10.3390/s24247929.

本文引用的文献

1
From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network.
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2647-2664. doi: 10.1109/TPAMI.2020.2977026. Epub 2021 Jul 1.
2
SECOND: Sparsely Embedded Convolutional Detection.
Sensors (Basel). 2018 Oct 6;18(10):3337. doi: 10.3390/s18103337.
3
Long short-term memory.
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验