Han Jiawei, Liu Kaiqi, Li Wei, Zhang Feng, Xia Xiang-Gen
IEEE Trans Pattern Anal Mach Intell. 2025 Jul;47(7):5778-5793. doi: 10.1109/TPAMI.2025.3553051.
Point cloud semantic segmentation can enhance the understanding of the production environment and is a crucial component of vision tasks. The efficacy and generalization prowess of deep learning-based segmentation models are inherently contingent upon the quality and nature of the data employed in their training. However, it is often challenging to obtain data with inter-class balance, and training an intelligent segmentation network with the imbalanced data may cause cognitive bias. In this paper, a network framework InvSpaceNet is proposed, which generates an inverse feature space to alleviate the cognitive bias caused by imbalanced data. Specifically, we design a dual-branch training architecture that combines the superior feature representations derived from instance-balanced sampling data with the cognitive corrections introduced by the proposed inverse sampling data. In the inverse feature space of the point cloud generated by the auxiliary branch, the central points aggregated by class are constrained by the contrastive loss. To refine the class cognition in the inverse feature space, features are used to generate point cloud class prototypes through momentum update. These class prototypes from the inverse space are utilized to generate feature maps and structure maps that are aligned with the positive feature space of the main branch segmentation network. The training of the main branch is dynamically guided through gradients back propagated from different losses. Extensive experiments conducted on four large benchmarks (i.e., S3DIS, ScanNet v2, Toronto-3D, and SemanticKITTI) demonstrate that the proposed method can effectively mitigate point cloud imbalance issues and improve segmentation performance.
点云语义分割可以增强对生产环境的理解,是视觉任务的关键组成部分。基于深度学习的分割模型的有效性和泛化能力本质上取决于其训练中所使用数据的质量和性质。然而,获得具有类间平衡的数据往往具有挑战性,并且使用不平衡数据训练智能分割网络可能会导致认知偏差。本文提出了一种网络框架InvSpaceNet,它生成一个逆特征空间来减轻不平衡数据导致的认知偏差。具体而言,我们设计了一种双分支训练架构,将从实例平衡采样数据中获得的优越特征表示与所提出的逆采样数据引入的认知校正相结合。在由辅助分支生成的点云的逆特征空间中,按类聚合的中心点受到对比损失的约束。为了在逆特征空间中细化类认知,通过动量更新使用特征来生成点云类原型。这些来自逆空间的类原型用于生成与主分支分割网络的正特征空间对齐的特征图和结构图。通过从不同损失反向传播的梯度动态指导主分支的训练。在四个大型基准数据集(即S3DIS、ScanNet v2、Toronto-3D和SemanticKITTI)上进行的大量实验表明,所提出的方法可以有效缓解点云不平衡问题并提高分割性能。