IEEE Trans Image Process. 2017 Aug;26(8):3707-3720. doi: 10.1109/TIP.2017.2704426. Epub 2017 May 16.
Highly discriminative 3D shape representations can be formed by encoding the spatial relationships among virtual words into the Bag of Words (BoW) method. Doing so for 3D shapes requires overcoming several unresolved issues in the encoding procedure: 1) arbitrary mesh resolution; 2) irregular vertex topology; 3) orientation ambiguity on the 3D surface; and 4) invariance to rigid and non-rigid shape transformations. In this paper, a novel spatially enhanced 3D shape representation called bag of spatial context correlations (BoSCC) is proposed to address all of these issues. Adopting a novel local perspective, BoSCC describes a 3D shape by an occurrence frequency histogram of spatial context correlation patterns, which makes it more compact and discriminative than previous methods based on a global perspective. Specifically, the spatial context correlation simultaneously encodes the geometric and spatial information of a local 3D region through the correlation among the spatial contexts of the vertices in that region, which effectively resolves the aforementioned issues. The spatial context of each vertex is modeled by Markov chains in a multi-scale manner, capturing the spatial relationship through the transition probabilities both within a virtual word (intra-virtual-word) and between virtual words (inter-virtual-word). The high discriminability and compactness of BoSCC benefit classification and retrieval, especially when training samples are limited and in partial shape retrieval. Experimental results show that BoSCC outperforms state-of-the-art spatially enhanced BoW methods in three common applications: global shape retrieval, shape classification, and partial shape retrieval.
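The idea of describing spatial relationships by transition probabilities between virtual words can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes each vertex already carries a single virtual word index, treats every mesh edge as a one-step transition in both directions, and uses a single scale. The function name `transition_matrix` and the toy region are hypothetical.

```python
def transition_matrix(edges, labels, num_words):
    """Row-normalized counts of word-to-word transitions along mesh edges.

    Illustrative assumption: one virtual word per vertex, first-order
    transitions only, single scale (the paper uses a multi-scale model).
    """
    counts = [[0.0] * num_words for _ in range(num_words)]
    for u, v in edges:
        # Treat each undirected edge as two directed one-step transitions.
        counts[labels[u]][labels[v]] += 1.0
        counts[labels[v]][labels[u]] += 1.0
    matrix = []
    for row in counts:
        total = sum(row)
        # Leave rows of unseen words as zeros instead of dividing by zero.
        matrix.append([c / total if total > 0 else 0.0 for c in row])
    return matrix


# Hypothetical toy region: four vertices in a cycle, two virtual words.
edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
labels = [0, 0, 1, 1]
P = transition_matrix(edges, labels, num_words=2)
```

In this sketch the diagonal entries of `P` play the role of intra-virtual-word transition probabilities and the off-diagonal entries the inter-virtual-word ones. Because only vertex adjacency and word labels are used, the result does not depend on vertex ordering or on the orientation of the surface.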