Li Yushi, Baciu George, Chen Rong, Li Chenhui, Wang Hao, Pan Yushan, Ding Weiping
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9195-9209. doi: 10.1109/TNNLS.2024.3439706. Epub 2025 May 2.
Learning 3-D structures from incomplete point clouds with extreme sparsity and random distributions is a challenge since it is difficult to infer topological connectivity and structural details from fragmentary representations. Missing large portions of informative structures further aggravates this problem. To overcome this, a novel graph convolutional network (GCN) called dynamic and structure-aware NETwork (DSANet) is presented in this article. This framework is formulated based on a pyramidic auto-encoder (AE) architecture to address accurate structure reconstruction on the sparse and incomplete point clouds. A PointNet-like neural network is applied as the encoder to efficiently aggregate the global representations of coarse point clouds. On the decoder side, we design a dynamic graph learning module with a structure-aware attention (SAA) to take advantage of the topology relationships maintained in the dynamic latent graph. Relying on gradually unfolding the extracted representation into a sequence of graphs, DSANet is able to reconstruct complicated point clouds with rich and descriptive details. To associate analogous structure awareness with semantic estimation, we further propose a mechanism, called structure similarity assessment (SSA). This method allows our model to surmise semantic homogeneity in an unsupervised manner. Finally, we optimize the proposed model by minimizing a new distortion-aware objective end-to-end. Extensive qualitative and quantitative experiments demonstrate the impressive performance of our model in reconstructing unbroken 3-D shapes from deficient point clouds and preserving semantic relationships among different regional structures.
从具有极端稀疏性和随机分布的不完整点云中学习三维结构是一项挑战,因为很难从碎片化表示中推断拓扑连通性和结构细节。信息结构的大部分缺失进一步加剧了这个问题。为了克服这一点,本文提出了一种名为动态结构感知网络(DSANet)的新型图卷积网络(GCN)。该框架基于金字塔自动编码器(AE)架构构建,以解决稀疏和不完整点云上的精确结构重建问题。一个类似PointNet的神经网络被用作编码器,以有效地聚合粗点云的全局表示。在解码器端,我们设计了一个带有结构感知注意力(SAA)的动态图学习模块,以利用动态潜在图中保持的拓扑关系。依靠将提取的表示逐步展开为一系列图,DSANet能够重建具有丰富和描述性细节的复杂点云。为了将类似的结构感知与语义估计相关联,我们进一步提出了一种称为结构相似性评估(SSA)的机制。这种方法允许我们的模型以无监督的方式推测语义同质性。最后,我们通过端到端最小化一个新的失真感知目标来优化所提出的模型。大量的定性和定量实验证明了我们的模型在从有缺陷的点云重建完整三维形状以及保留不同区域结构之间的语义关系方面的出色性能。