Huang Chang-Qin, Jiang Fan, Huang Qiong-Hao, Wang Xi-Zhe, Han Zhong-Mei, Huang Wei-Yu
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):4813-4825. doi: 10.1109/TNNLS.2022.3162301. Epub 2024 Apr 4.
Three-dimensional point cloud classification is fundamental but still challenging in 3-D vision. Existing graph-based deep learning methods fail to learn both low-level extrinsic and high-level intrinsic features together. These two levels of features are critical to improving classification accuracy. To this end, we propose a dual-graph attention convolution network (DGACN). The idea of DGACN is to use two types of graph attention convolution operations with a feedback graph feature fusion mechanism. Specifically, we exploit graph geometric attention convolution to capture low-level extrinsic features in 3-D space. Furthermore, we apply graph embedding attention convolution to learn multiscale low-level extrinsic and high-level intrinsic fused graph features together. Moreover, the points belonging to different parts in real-world 3-D point cloud objects are distinguished, which results in more robust performance for 3-D point cloud classification tasks than other competitive methods, in practice. Our extensive experimental results show that the proposed network achieves state-of-the-art performance on both the synthetic ModelNet40 and real-world ScanObjectNN datasets.
三维点云分类在三维视觉中是基础但仍具有挑战性的任务。现有的基于图的深度学习方法无法同时学习低级外在特征和高级内在特征。这两个层次的特征对于提高分类准确率至关重要。为此,我们提出了一种双图注意力卷积网络(DGACN)。DGACN的思想是使用两种类型的图注意力卷积操作以及反馈图特征融合机制。具体而言,我们利用图几何注意力卷积来捕获三维空间中的低级外在特征。此外,我们应用图嵌入注意力卷积来共同学习多尺度的低级外在和高级内在融合图特征。而且,在实际的三维点云对象中,属于不同部分的点被区分开来,这使得在三维点云分类任务中比其他竞争方法具有更稳健的性能。我们广泛的实验结果表明,所提出的网络在合成的ModelNet40和真实世界的ScanObjectNN数据集上均取得了领先的性能。