Li Shijie, Liu Yun, Gall Juergen
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4079-4090. doi: 10.1109/TNNLS.2021.3132836. Epub 2025 Feb 28.
Many point-based semantic segmentation methods have been designed for indoor scenarios, but they struggle if they are applied to point clouds that are captured by a light detection and ranging (LiDAR) sensor in an outdoor environment. In order to make these methods more efficient and robust such that they can handle LiDAR data, we introduce the general concept of reformulating 3-D point-based operations such that they can operate in the projection space. While we show by means of three point-based methods that the reformulated versions are between 300 and 400 times faster and achieve higher accuracy, we furthermore demonstrate that the concept of reformulating 3-D point-based operations allows to design new architectures that unify the benefits of point-based and image-based methods. As an example, we introduce a network that integrates reformulated 3-D point-based operations into a 2-D encoder-decoder architecture that fuses the information from different 2-D scales. We evaluate the approach on four challenging datasets for semantic LiDAR point cloud segmentation and show that leveraging reformulated 3-D point-based operations with 2-D image-based operations achieves very good results for all four datasets.
许多基于点的语义分割方法是针对室内场景设计的,但如果将它们应用于在室外环境中由激光雷达(LiDAR)传感器捕获的点云数据,这些方法就会遇到困难。为了使这些方法更高效、更稳健,以便能够处理LiDAR数据,我们引入了重新制定三维点基操作的一般概念,使其能够在投影空间中运行。虽然我们通过三种基于点的方法表明,重新制定后的版本速度提高了300到400倍,并且精度更高,但我们进一步证明,重新制定三维点基操作的概念允许设计新的架构,将基于点的方法和基于图像的方法的优点结合起来。例如,我们引入了一个网络,该网络将重新制定后的三维点基操作集成到一个二维编码器-解码器架构中,该架构融合了来自不同二维尺度的信息。我们在四个具有挑战性的语义LiDAR点云分割数据集上评估了该方法,并表明将重新制定后的三维点基操作与基于二维图像的操作相结合,对所有四个数据集都取得了非常好的结果。