• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基于激光雷达感知的圆柱形和非对称3D卷积网络

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception.

作者信息

Zhu Xinge, Zhou Hui, Wang Tai, Hong Fangzhou, Li Wei, Ma Yuexin, Li Hongsheng, Yang Ruigang, Lin Dahua

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6807-6822. doi: 10.1109/TPAMI.2021.3098789. Epub 2022 Sep 14.

DOI:10.1109/TPAMI.2021.3098789
PMID:34310286
Abstract

State-of-the-art methods for driving-scene LiDAR-based perception (including point cloud semantic segmentation, panoptic segmentation and 3D detection, etc.) often project the point clouds to 2D space and then process them via 2D convolution. Although this cooperation shows the competitiveness in the point cloud, it inevitably alters and abandons the 3D topology and geometric relations. A natural remedy is to utilize the 3D voxelization and 3D convolution network. However, we found that in the outdoor point cloud, the improvement obtained in this way is quite limited. An important reason is the property of the outdoor point cloud, namely sparsity and varying density. Motivated by this investigation, we propose a new framework for the outdoor LiDAR segmentation, where cylindrical partition and asymmetrical 3D convolution networks are designed to explore the 3D geometric pattern while maintaining these inherent properties. The proposed model acts as a backbone and the learned features from this model can be used for downstream tasks such as point cloud semantic and panoptic segmentation or 3D detection. In this paper, we benchmark our model on these three tasks. For semantic segmentation, we evaluate the proposed model on several large-scale datasets, i.e., SemanticKITTI, nuScenes and A2D2. Our method achieves the state-of-the-art on the leaderboard of SemanticKITTI (both single-scan and multi-scan challenge), and significantly outperforms existing methods on nuScenes and A2D2 dataset. Furthermore, the proposed 3D framework also shows strong performance and good generalization on LiDAR panoptic segmentation and LiDAR 3D detection.

摘要

用于基于激光雷达的驾驶场景感知的先进方法(包括点云语义分割、全景分割和3D检测等)通常将点云投影到2D空间,然后通过2D卷积对其进行处理。尽管这种协作在点云方面显示出竞争力,但它不可避免地改变并舍弃了3D拓扑结构和几何关系。一种自然的补救方法是利用3D体素化和3D卷积网络。然而,我们发现在户外点云中,通过这种方式获得的改进相当有限。一个重要原因是户外点云的特性,即稀疏性和密度变化。受此研究的启发,我们提出了一种用于户外激光雷达分割的新框架,其中设计了圆柱分区和非对称3D卷积网络,以在保持这些固有特性的同时探索3D几何模式。所提出的模型作为主干,从该模型中学习到的特征可用于下游任务,如点云语义和全景分割或3D检测。在本文中,我们在这三个任务上对我们的模型进行了基准测试。对于语义分割,我们在几个大规模数据集上评估了所提出的模型,即SemanticKITTI、nuScenes和A2D2。我们的方法在SemanticKITTI的排行榜上(单扫描和多扫描挑战)均达到了当前最优水平,并且在nuScenes和A2D2数据集上显著优于现有方法。此外,所提出的3D框架在激光雷达全景分割和激光雷达3D检测方面也表现出强大的性能和良好的通用性。

相似文献

1
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception.用于基于激光雷达感知的圆柱形和非对称3D卷积网络
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6807-6822. doi: 10.1109/TPAMI.2021.3098789. Epub 2022 Sep 14.
2
Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks.通过动态移位网络实现统一的3D和4D全景分割
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3480-3495. doi: 10.1109/TPAMI.2023.3349304. Epub 2024 Apr 3.
3
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
4
SyS3DS: Systematic Sampling of Large-Scale LiDAR Point Clouds for Semantic Segmentation in Forestry Robotics.SyS3DS:用于林业机器人语义分割的大规模激光雷达点云系统采样
Sensors (Basel). 2024 Jan 26;24(3):823. doi: 10.3390/s24030823.
5
An Efficient Ensemble Deep Learning Approach for Semantic Point Cloud Segmentation Based on 3D Geometric Features and Range Images.一种基于3D几何特征和距离图像的高效集成深度学习语义点云分割方法。
Sensors (Basel). 2022 Aug 18;22(16):6210. doi: 10.3390/s22166210.
6
Spatial Aggregation Net: Point Cloud Semantic Segmentation Based on Multi-Directional Convolution.空间聚合网络:基于多方向卷积的点云语义分割。
Sensors (Basel). 2019 Oct 7;19(19):4329. doi: 10.3390/s19194329.
7
FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning.FWNet:使用深度学习对全波形激光雷达数据进行语义分割
Sensors (Basel). 2020 Jun 24;20(12):3568. doi: 10.3390/s20123568.
8
EPMF: Efficient Perception-Aware Multi-Sensor Fusion for 3D Semantic Segmentation.EPMF:用于3D语义分割的高效感知感知多传感器融合
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8258-8273. doi: 10.1109/TPAMI.2024.3402232. Epub 2024 Nov 6.
9
Learning Polynomial-Based Separable Convolution for 3D Point Cloud Analysis.学习基于多项式的可分离卷积用于3D点云分析
Sensors (Basel). 2021 Jun 19;21(12):4211. doi: 10.3390/s21124211.
10
Exploring Adversarial Robustness of LiDAR Semantic Segmentation in Autonomous Driving.探索自动驾驶中激光雷达语义分割的对抗鲁棒性。
Sensors (Basel). 2023 Dec 2;23(23):9579. doi: 10.3390/s23239579.

引用本文的文献

1
Multi-scale sparse convolution and point convolution adaptive fusion point cloud semantic segmentation method.多尺度稀疏卷积与点卷积自适应融合的点云语义分割方法
Sci Rep. 2025 Feb 5;15(1):4372. doi: 10.1038/s41598-025-88905-5.
2
Bilinear Distance Feature Network for Semantic Segmentation in PowerLine Corridor Point Clouds.用于电力线走廊点云语义分割的双线性距离特征网络
Sensors (Basel). 2024 Aug 2;24(15):5021. doi: 10.3390/s24155021.
3
FGCN: Image-Fused Point Cloud Semantic Segmentation with Fusion Graph Convolutional Network.
FGCN:基于融合图卷积网络的图像融合点云语义分割
Sensors (Basel). 2023 Oct 9;23(19):8338. doi: 10.3390/s23198338.
4
Recent Advancements in Learning Algorithms for Point Clouds: An Updated Overview.点云学习算法的最新进展:最新综述。
Sensors (Basel). 2022 Feb 10;22(4):1357. doi: 10.3390/s22041357.