School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China; Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence, Jiangnan University, Wuxi 214122, China.
School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China; Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence, Jiangnan University, Wuxi 214122, China.
Neural Netw. 2022 Jul;151:94-110. doi: 10.1016/j.neunet.2022.03.012. Epub 2022 Mar 16.
Performing pattern analysis over the symmetric positive definite (SPD) manifold requires specific mathematical computations, characterizing the non-Euclidian property of the involved data points and learning tasks, such as the image set classification problem. Accompanied with the advanced neural networking techniques, several architectures for processing the SPD matrices have recently been studied to obtain fine-grained structured representations. However, existing approaches are challenged by the diversely changing appearance of the data points, begging the question of how to learn invariant representations for improved performance with supportive theories. Therefore, this paper designs two Riemannian operation modules for SPD manifold neural network. Specifically, a Riemannian batch regularization (RBR) layer is firstly proposed for the purpose of training a discriminative manifold-to-manifold transforming network with a novelly-designed metric learning regularization term. The second module realizes the Riemannian pooling operation with geometric computations on the Riemannian manifolds, notably the Riemannian barycenter, metric learning, and Riemannian optimization. Extensive experiments on five benchmarking datasets show the efficacy of the proposed approach.
对对称正定 (SPD) 流形进行模式分析需要特定的数学计算,这些计算描述了所涉及的数据点和学习任务的非欧几里得性质,例如图像集分类问题。随着先进的神经网络技术的发展,最近已经研究了几种用于处理 SPD 矩阵的架构,以获得精细的结构化表示。然而,现有的方法受到数据点外观的多样化变化的挑战,这就提出了如何学习不变表示以提高性能的问题,并需要有支持性的理论。因此,本文设计了两个用于 SPD 流形神经网络的黎曼运算模块。具体来说,首先提出了一种黎曼批量正则化 (RBR) 层,目的是训练具有新颖度量学习正则化项的判别流形到流形变换网络。第二个模块通过在黎曼流形上进行几何计算来实现黎曼池化操作,特别是黎曼质心、度量学习和黎曼优化。在五个基准数据集上的广泛实验表明了该方法的有效性。