Zhao Wenbo, Liu Xianming, Zhai Deming, Jiang Junjun, Ji Xiangyang
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12394-12407. doi: 10.1109/TPAMI.2023.3287628. Epub 2023 Sep 5.
Point clouds upsampling (PCU), which aims to generate dense and uniform point clouds from the captured sparse input of 3D sensor such as LiDAR, is a practical yet challenging task. It has potential applications in many real-world scenarios, such as autonomous driving, robotics, AR/VR, etc. Deep neural network based methods achieve remarkable success in PCU. However, most existing deep PCU methods either take the end-to-end supervised training, where large amounts of pairs of sparse input and dense ground-truth are required to serve as the supervision; or treat up-scaling of different factors as independent tasks, where multiple networks are required for different scaling factors, leading to significantly increased model complexity and training time. In this article, we propose a novel method that achieves self-supervised and magnification-flexible PCU simultaneously. No longer explicitly learning the mapping between sparse and dense point clouds, we formulate PCU as the task of seeking nearest projection points on the implicit surface for seed points. We then define two implicit neural functions to estimate projection direction and distance respectively, which can be trained by the pretext learning tasks. Moreover, the projection rectification strategy is tailored to remove outliers so as to keep the shape of object clear and sharp. Experimental results demonstrate that our self-supervised learning based scheme achieves competitive or even better performance than state-of-the-art supervised methods.
点云上采样(PCU)旨在从诸如激光雷达等3D传感器捕获的稀疏输入中生成密集且均匀的点云,是一项实际但具有挑战性的任务。它在许多现实世界场景中都有潜在应用,如自动驾驶、机器人技术、AR/VR等。基于深度神经网络的方法在PCU中取得了显著成功。然而,大多数现有的深度PCU方法要么采用端到端监督训练,其中需要大量稀疏输入和密集真值对作为监督;要么将不同因素的上采样视为独立任务,对于不同的缩放因子需要多个网络,这导致模型复杂度和训练时间显著增加。在本文中,我们提出了一种新颖的方法,该方法同时实现了自监督和放大灵活的PCU。不再明确学习稀疏点云和密集点云之间的映射,我们将PCU表述为为种子点在隐式曲面上寻找最近投影点的任务。然后我们定义两个隐式神经函数分别估计投影方向和距离,它们可以通过前置学习任务进行训练。此外,定制了投影校正策略以去除异常值,从而保持物体形状清晰锐利。实验结果表明,我们基于自监督学习的方案比现有最先进的监督方法取得了有竞争力甚至更好的性能。