Xie Yuchen, Ho Jeffrey, Vemuri Baba
Qualcomm Technologies, Inc., San Diego, CA 92121 USA.
JMLR Workshop Conf Proc. 2013;28(3):1480-1488.
Existing dictionary learning algorithms are based on the assumption that the data are vectors in an Euclidean vector space ℝ , and the dictionary is learned from the training data using the vector space structure of ℝ and its Euclidean -metric. However, in many applications, features and data often originated from a Riemannian manifold that does not support a global linear (vector space) structure. Furthermore, the extrinsic viewpoint of existing dictionary learning algorithms becomes inappropriate for modeling and incorporating the intrinsic geometry of the manifold that is potentially important and critical to the application. This paper proposes a novel framework for sparse coding and dictionary learning for data on a Riemannian manifold, and it shows that the existing sparse coding and dictionary learning methods can be considered as special (Euclidean) cases of the more general framework proposed here. We show that both the dictionary and sparse coding can be effectively computed for several important classes of Riemannian manifolds, and we validate the proposed method using two well-known classification problems in computer vision and medical imaging analysis.
数据是欧几里得向量空间ℝ中的向量,并且字典是利用ℝ的向量空间结构及其欧几里得度量从训练数据中学习得到的。然而,在许多应用中,特征和数据通常源自不支持全局线性(向量空间)结构的黎曼流形。此外,现有字典学习算法的外在观点对于对该流形的内在几何进行建模和整合而言变得不合适,而这种内在几何对于应用可能是重要且关键的。本文提出了一种针对黎曼流形上的数据进行稀疏编码和字典学习的新颖框架,并且表明现有的稀疏编码和字典学习方法可被视为这里所提出的更通用框架的特殊(欧几里得)情形。我们表明,对于几类重要的黎曼流形,字典和稀疏编码都能被有效地计算出来,并且我们使用计算机视觉和医学成像分析中的两个著名分类问题对所提出的方法进行了验证。