Gao Wenxu, Ma Zhengming, Gan Weichao, Liu Shuyu
School of Electronics and Information Technology, Sun Yat-Sen University, Guangzhou 510006, China.
Public Experimental Teaching Center, Sun Yat-sen University, Guangzhou 510006, China.
Entropy (Basel). 2021 Aug 27;23(9):1117. doi: 10.3390/e23091117.
Symmetric positive definite (SPD) data have become a hot topic in machine learning. Instead of a linear Euclidean space, SPD data generally lie on a nonlinear Riemannian manifold. To get over the problems caused by the high data dimensionality, dimensionality reduction (DR) is a key subject for SPD data, where bilinear transformation plays a vital role. Because linear operations are not supported in nonlinear spaces such as Riemannian manifolds, directly performing Euclidean DR methods on SPD matrices is inadequate and difficult in complex models and optimization. An SPD data DR method based on Riemannian manifold tangent spaces and global isometry (RMTSISOM-SPDDR) is proposed in this research. The main contributions are listed: (1) Any Riemannian manifold tangent space is a Hilbert space isomorphic to a Euclidean space. Particularly for SPD manifolds, tangent spaces consist of symmetric matrices, which can greatly preserve the form and attributes of original SPD data. For this reason, RMTSISOM-SPDDR transfers the bilinear transformation from manifolds to tangent spaces. (2) By transformation, original SPD data are mapped to the tangent space at the identity matrix under the affine invariant Riemannian metric (AIRM). In this way, the geodesic distance between original data and the identity matrix is equal to the Euclidean distance between corresponding tangent vector and the origin. (3) The bilinear transformation is further determined by the isometric criterion guaranteeing the geodesic distance on high-dimensional SPD manifold as close as possible to the Euclidean distance in the tangent space of low-dimensional SPD manifold. Then, we use it for the DR of original SPD data. Experiments on five commonly used datasets show that RMTSISOM-SPDDR is superior to five advanced SPD data DR algorithms.
对称正定(SPD)数据已成为机器学习中的一个热门话题。与线性欧几里得空间不同,SPD数据通常位于非线性黎曼流形上。为了克服高数据维度带来的问题,降维(DR)是SPD数据的一个关键课题,其中双线性变换起着至关重要的作用。由于在诸如黎曼流形等非线性空间中不支持线性运算,在复杂模型和优化中直接对SPD矩阵执行欧几里得DR方法是不够的且困难的。本研究提出了一种基于黎曼流形切空间和全局等距的SPD数据DR方法(RMTSISOM - SPDDR)。主要贡献如下:(1)任何黎曼流形切空间都是与欧几里得空间同构的希尔伯特空间。特别是对于SPD流形,切空间由对称矩阵组成,这可以极大地保留原始SPD数据的形式和属性。因此,RMTSISOM - SPDDR将双线性变换从流形转移到切空间。(2)通过变换,原始SPD数据在仿射不变黎曼度量(AIRM)下被映射到单位矩阵处的切空间。这样,原始数据与单位矩阵之间的测地距离等于相应切向量与原点之间的欧几里得距离。(3)双线性变换进一步由等距准则确定,该准则保证高维SPD流形上的测地距离尽可能接近低维SPD流形切空间中的欧几里得距离。然后,我们将其用于原始SPD数据的降维。在五个常用数据集上的实验表明,RMTSISOM - SPDDR优于五种先进的SPD数据DR算法。