旋转激光传感器的点云实例分割

Point-Cloud Instance Segmentation for Spinning Laser Sensors.

作者信息

Casado-Coscolla Alvaro, Sanchez-Belenguer Carlos, Wolfart Erik, Sequeira Vitor

机构信息

European Commission, Joint Research Centre (JRC), Via Enrico Fermi 2749, 21027 Ispra, Italy.

Instituto Universitario de Automática e Informática Industrial, Universitat Politècnica de València, Camí de Vera, 46022 València, Spain.

出版信息

J Imaging. 2024 Dec 17;10(12):325. doi: 10.3390/jimaging10120325.

DOI:10.3390/jimaging10120325

PMID:39728222

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11728245/

Abstract

In this paper, we face the point-cloud segmentation problem for spinning laser sensors from a deep-learning (DL) perspective. Since the sensors natively provide their measurements in a 2D grid, we directly use state-of-the-art models designed for visual information for the segmentation task and then exploit the range information to ensure 3D accuracy. This allows us to effectively address the main challenges of applying DL techniques to point clouds, i.e., lack of structure and increased dimensionality. To the best of our knowledge, this is the first work that faces the 3D segmentation problem from a 2D perspective without explicitly re-projecting 3D point clouds. Moreover, our approach exploits multiple channels available in modern sensors, i.e., range, reflectivity, and ambient illumination. We also introduce a novel data-mining pipeline that enables the annotation of 3D scans without human intervention. Together with this paper, we present a new public dataset with all the data collected for training and evaluating our approach, where point clouds preserve their native sensor structure and where every single measurement contains range, reflectivity, and ambient information, together with its associated 3D point. As experimental results show, our approach achieves state-of-the-art results both in terms of performance and inference time. Additionally, we provide a novel ablation test that analyses the individual and combined contributions of the different channels provided by modern laser sensors.

摘要

在本文中，我们从深度学习（DL）的角度来处理旋转激光传感器的点云分割问题。由于这些传感器本身是以二维网格形式提供测量数据的，我们直接使用为视觉信息设计的先进模型来进行分割任务，然后利用距离信息来确保三维精度。这使我们能够有效应对将深度学习技术应用于点云时面临的主要挑战，即缺乏结构和维度增加的问题。据我们所知，这是第一项从二维角度处理三维分割问题且无需显式重新投影三维点云的工作。此外，我们的方法利用了现代传感器中可用的多个通道，即距离、反射率和环境光照。我们还引入了一种新颖的数据挖掘管道，能够在无需人工干预的情况下对三维扫描进行标注。与本文一同发布的，是一个新的公共数据集，其中包含为训练和评估我们的方法而收集的所有数据，点云保留了其原生传感器结构，且每个测量值都包含距离、反射率和环境信息以及其相关的三维点。实验结果表明，我们的方法在性能和推理时间方面均取得了领先成果。此外，我们提供了一种新颖的消融测试，分析现代激光传感器提供的不同通道的单独和综合贡献。