Suppr超能文献

通过2D-3D神经校准实现激光雷达3D点云的自监督学习。

Self-Supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration.

作者信息

Zhang Yifan, Hou Junhui, Ren Siyu, Wu Jinjian, Yuan Yixuan, Shi Guangming

出版信息

IEEE Trans Pattern Anal Mach Intell. 2025 Oct;47(10):9201-9216. doi: 10.1109/TPAMI.2025.3584625.

Abstract

This paper introduces a novel self-supervised learning framework for enhancing 3D perception in autonomous driving scenes. Specifically, our approach, namely NCLR, focuses on 2D-3D neural calibration, a novel pretext task that estimates the rigid pose aligning camera and LiDAR coordinate systems. First, we propose the learnable transformation alignment to bridge the domain gap between image and point cloud data, converting features into a unified representation space for effective comparison and matching. Second, we identify the overlapping area between the image and point cloud with the fused features. Third, we establish dense 2D-3D correspondences to estimate the rigid pose. The framework not only learns fine-grained matching from points to pixels but also achieves alignment of the image and point cloud at a holistic level, understanding the LiDAR-to-camera extrinsic parameters. We demonstrate the efficacy of NCLR by applying the pre-trained backbone to downstream tasks, such as LiDAR-based 3D semantic segmentation, object detection, and panoptic segmentation. Comprehensive experiments on various datasets illustrate the superiority of NCLR over existing self-supervised methods. The results confirm that joint learning from different modalities significantly enhances the network's understanding abilities and effectiveness of learned representation.

摘要

本文介绍了一种用于增强自动驾驶场景中3D感知的新型自监督学习框架。具体而言,我们的方法即NCLR,专注于2D-3D神经校准,这是一种估计相机和激光雷达坐标系刚性姿态对齐的新型前置任务。首先,我们提出可学习的变换对齐来弥合图像和点云数据之间的领域差距,将特征转换到统一的表示空间以进行有效比较和匹配。其次,我们利用融合特征识别图像和点云之间的重叠区域。第三,我们建立密集的2D-3D对应关系来估计刚性姿态。该框架不仅从点到像素学习细粒度匹配,还在整体层面实现图像和点云的对齐,理解激光雷达到相机的外部参数。我们通过将预训练的主干应用于下游任务,如基于激光雷达的3D语义分割、目标检测和全景分割,来证明NCLR的有效性。在各种数据集上进行的综合实验表明NCLR优于现有的自监督方法。结果证实,从不同模态进行联合学习显著增强了网络的理解能力和学习表示的有效性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验