• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MS3D:一种使用多尺度语义特征点构建 3D 特征层的 3D 目标检测方法。

MS3D: A 3D object detection method using multi-scale semantic feature points to construct 3D feature layer.

机构信息

The School of Mechanical and Electrical Engineering, China Jiliang University, Hanzhou, China.

The School of Mechanical and Electrical Engineering, China Jiliang University, Hanzhou, China.

出版信息

Neural Netw. 2024 Nov;179:106623. doi: 10.1016/j.neunet.2024.106623. Epub 2024 Aug 10.

DOI:10.1016/j.neunet.2024.106623
PMID:39154419
Abstract

LiDAR point clouds can effectively depict the motion and posture of objects in three-dimensional space. Many studies accomplish the 3D object detection by voxelizing point clouds. However, in autonomous driving scenarios, the sparsity and hollowness of point clouds create some difficulties for voxel-based methods. The sparsity of point clouds makes it challenging to describe the geometric features of objects. The hollowness of point clouds poses difficulties for the aggregation of 3D features. We propose a two-stage 3D object detection framework, called MS3D. (1) We propose a method using voxel feature points from multi-branch to construct the 3D feature layer. Using voxel feature points from different branches, we construct a relatively compact 3D feature layer with rich semantic features. Additionally, we propose a distance-weighted sampling method, reducing the loss of foreground points caused by downsampling and allowing the 3D feature layer to retain more foreground points. (2) In response to the hollowness of point clouds, we predict the offsets between deep-level feature points and the object's centroid, making them as close as possible to the object's centroid. This enables the aggregation of these feature points with abundant semantic features. For feature points from shallow-level, we retain them on the object's surface to describe the geometric features of the object. To validate our approach, we evaluated its effectiveness on both the KITTI and ONCE datasets.

摘要

激光雷达点云可以有效地描述三维空间中物体的运动和姿态。许多研究通过体素化点云来完成 3D 目标检测。然而,在自动驾驶场景中,点云的稀疏性和空洞性给基于体素的方法带来了一些困难。点云的稀疏性使得描述物体的几何特征变得困难。点云的空洞性使得 3D 特征的聚合变得困难。我们提出了一个两阶段的 3D 目标检测框架,称为 MS3D。(1)我们提出了一种使用多分支体素特征点构建 3D 特征层的方法。使用来自不同分支的体素特征点,我们构建了一个具有丰富语义特征的相对紧凑的 3D 特征层。此外,我们提出了一种距离加权采样方法,减少了下采样导致的前景点的损失,并使 3D 特征层保留更多的前景点。(2)针对点云的空洞性,我们预测深层特征点与物体质心之间的偏移量,使它们尽可能接近物体质心。这使得这些具有丰富语义特征的特征点能够进行聚合。对于浅层的特征点,我们保留在物体表面,以描述物体的几何特征。为了验证我们的方法,我们在 KITTI 和 ONCE 数据集上评估了它的有效性。

相似文献

1
MS3D: A 3D object detection method using multi-scale semantic feature points to construct 3D feature layer.MS3D:一种使用多尺度语义特征点构建 3D 特征层的 3D 目标检测方法。
Neural Netw. 2024 Nov;179:106623. doi: 10.1016/j.neunet.2024.106623. Epub 2024 Aug 10.
2
CBG-Net: Cross-modality and cross-scale balance network with global semantics for multi-modal 3D object detection.CBG-Net:用于多模态 3D 目标检测的具有全局语义的跨模态和跨尺度平衡网络。
Neural Netw. 2024 Nov;179:106535. doi: 10.1016/j.neunet.2024.106535. Epub 2024 Jul 14.
3
Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds.体素特征金字塔网络(Voxel-FPN):用于从激光雷达点云进行三维目标检测的多尺度体素特征聚合
Sensors (Basel). 2020 Jan 28;20(3):704. doi: 10.3390/s20030704.
4
SyS3DS: Systematic Sampling of Large-Scale LiDAR Point Clouds for Semantic Segmentation in Forestry Robotics.SyS3DS:用于林业机器人语义分割的大规模激光雷达点云系统采样
Sensors (Basel). 2024 Jan 26;24(3):823. doi: 10.3390/s24030823.
5
Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds.点云中的 3D 目标检测的精细化投票和场景特征融合。
Comput Intell Neurosci. 2022 Dec 29;2022:3023934. doi: 10.1155/2022/3023934. eCollection 2022.
6
PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud.PSANet:用于点云中 3D 目标检测的金字塔分裂与聚合网络。
Sensors (Basel). 2020 Dec 28;21(1):136. doi: 10.3390/s21010136.
7
Pre-Segmented Down-Sampling Accelerates Graph Neural Network-Based 3D Object Detection in Autonomous Driving.预分割下采样加速基于图神经网络的自动驾驶3D目标检测
Sensors (Basel). 2024 Feb 23;24(5):1458. doi: 10.3390/s24051458.
8
Enhanced Perception for Autonomous Driving Using Semantic and Geometric Data Fusion.利用语义和几何数据融合实现自动驾驶的增强感知。
Sensors (Basel). 2022 Jul 5;22(13):5061. doi: 10.3390/s22135061.
9
LIO-CSI: LiDAR inertial odometry with loop closure combined with semantic information.LIO-CSI:结合语义信息的具有闭环检测功能的激光雷达惯性里程计。
PLoS One. 2021 Dec 8;16(12):e0261053. doi: 10.1371/journal.pone.0261053. eCollection 2021.
10
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.