Wu Xian, Guo Xueyi, Peng Hang, Su Bin, Ahamod Sabbir, Han Fenglin
School of Mechanical and Electrical Engineering, Central South University, Changsha 410083, China.
Resource Recycling Research Institute, Central South University, Changsha 410083, China.
Sensors (Basel). 2024 Dec 4;24(23):7749. doi: 10.3390/s24237749.
Three-dimensional point cloud recognition is a very fundamental work in fields such as autonomous driving and face recognition. However, in real industrial scenarios, input point cloud data are often accompanied by factors such as occlusion, rotation, and noise. These factors make it challenging to apply existing point cloud classification algorithms in real industrial scenarios. Currently, most studies enhance model robustness from the perspective of neural network structure. However, researchers have found that simply adjusting the neural network structure has proven insufficient in addressing the decline in accuracy caused by data corruption. In this article, we use local feature descriptors as a preprocessing method to extract features from point cloud data and propose a new neural network architecture aligned with these local features, effectively enhancing performance even in extreme cases of data corruption. In addition, we conducted data augmentation to the 10 intentionally selected categories in ModelNet40. Finally, we conducted multiple experiments, including testing the robustness of the model to occlusion and coordinate transformation and then comparing the model with existing SOTA models. Furthermore, in actual scene experiments, we used depth cameras to capture objects and input the obtained data into the established model. The experimental results show that our model outperforms existing popular algorithms when dealing with corrupted point cloud data. Even when the input point cloud data are affected by occlusion or coordinate transformation, our proposed model can maintain high accuracy. This suggests that our method can alleviate the problem of decreased model accuracy caused by the aforementioned factors.
三维点云识别是自动驾驶和人脸识别等领域中一项非常基础的工作。然而,在实际工业场景中,输入的点云数据常常伴随着遮挡、旋转和噪声等因素。这些因素使得在实际工业场景中应用现有的点云分类算法具有挑战性。目前,大多数研究从神经网络结构的角度来增强模型的鲁棒性。然而,研究人员发现,仅仅调整神经网络结构已被证明不足以解决由数据损坏导致的精度下降问题。在本文中,我们使用局部特征描述符作为一种预处理方法,从点云数据中提取特征,并提出一种与这些局部特征对齐的新神经网络架构,即使在数据损坏的极端情况下也能有效提高性能。此外,我们对ModelNet40中故意选择的10个类别进行了数据增强。最后,我们进行了多项实验,包括测试模型对遮挡和坐标变换的鲁棒性,然后将该模型与现有的最优模型进行比较。此外,在实际场景实验中,我们使用深度相机捕获物体,并将获得的数据输入到已建立的模型中。实验结果表明,在处理损坏的点云数据时,我们的模型优于现有的流行算法。即使输入的点云数据受到遮挡或坐标变换的影响,我们提出的模型也能保持较高的精度。这表明我们的方法可以缓解由上述因素导致的模型精度下降问题。