SPD Manifold Deep Metric Learning for Image Set Classification

Authors

Wang Rui, Wu Xiao-Jun, Chen Ziheng, Hu Cong, Kittler Josef

Publication information

IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):8924-8938. doi: 10.1109/TNNLS.2022.3216811. Epub 2024 Jul 8.

DOI: 10.1109/TNNLS.2022.3216811
PMID: 38470600
Abstract

By characterizing each image set as a nonsingular covariance matrix on the symmetric positive definite (SPD) manifold, approaches to visual content classification with image sets have made impressive progress. However, the key challenge of unhelpfully large intraclass variability and interclass similarity of representations remains open to date. Although several recent studies have mitigated these two problems by jointly learning the embedding mapping and the similarity metric on the original SPD manifold, their inherently shallow and linear feature transformation mechanisms are not powerful enough to capture useful geometric features, especially in complex scenarios. To this end, this article explores a novel approach, termed SPD manifold deep metric learning (SMDML), for image set classification. Specifically, SMDML first selects a prevailing SPD manifold neural network (SPDNet) as the backbone (encoder) to derive a nonlinear SPD matrix representation. To counteract the degradation of structural information during multistage feature embedding, we construct a Riemannian decoder at the end of the encoder, trained by a reconstruction error term (RT), to induce the low-dimensional feature manifold generated by the hidden layer to capture the pivotal information about the visual data describing the imaged scene. We demonstrate through theory and experiments that it is feasible to replace the Riemannian metric with the Euclidean distance in RT. Then, the ReCov layer is introduced into the established Riemannian network to regularize the local statistical information within each input feature matrix, which enhances the effectiveness of the learning process. The theoretical analysis of the activation function used in the ReCov layer, in terms of its continuity and the conditions for generating positive definite matrices, is beneficial for network design. Motivated by the fact that the single cross-entropy loss used for training cannot effectively parse the geometric distribution of the deep representations, we finally endow the suggested model with a novel metric learning regularization term. By explicitly incorporating the encoding and processing of the data variations into the network learning process, this term can not only derive a powerful Riemannian representation but also train an effective classifier. The experimental results show the superiority of the proposed approach on three typical visual classification tasks.
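
The building blocks named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it follows the generic SPDNet layer definitions (BiMap and ReEig) and the standard log-Euclidean distance. The function names, the ridge value `eps`, the rectification threshold, and the random data are all assumptions made for illustration. The ReCov layer, the Riemannian decoder, and the metric learning term are not reproduced because the abstract does not specify them.

```python
import numpy as np


def set_covariance(features, eps=1e-3):
    """Represent an image set (n_images x d features) as a d x d SPD matrix."""
    centered = features - features.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / (features.shape[0] - 1)
    # Small ridge keeps the matrix nonsingular; eps is an assumed value.
    return cov + eps * np.eye(cov.shape[0])


def spd_log(s):
    """Matrix logarithm of an SPD matrix via its eigendecomposition."""
    w, v = np.linalg.eigh(s)
    return (v * np.log(w)) @ v.T


def log_euclidean_distance(a, b):
    """A common Riemannian-style distance on the SPD manifold."""
    return np.linalg.norm(spd_log(a) - spd_log(b), ord="fro")


def euclidean_reconstruction_error(x, x_hat):
    """Squared Frobenius error, one plain reading of a Euclidean RT."""
    return float(np.linalg.norm(x - x_hat, ord="fro") ** 2)


def bimap(s, w):
    """SPDNet-style bilinear mapping W^T S W, with W column-orthonormal."""
    return w.T @ s @ w


def reeig(s, threshold=1e-4):
    """SPDNet-style eigenvalue rectification: clamp small eigenvalues."""
    w, v = np.linalg.eigh(s)
    return (v * np.maximum(w, threshold)) @ v.T


# Two synthetic "image sets": 40 images, 8 features each (made-up data).
rng = np.random.default_rng(0)
set_a = rng.normal(size=(40, 8))
set_b = rng.normal(size=(40, 8)) * 2.0
cov_a, cov_b = set_covariance(set_a), set_covariance(set_b)

# One encoder stage: reduce 8 x 8 to 4 x 4, then rectify.
w, _ = np.linalg.qr(rng.normal(size=(8, 4)))
low_a = reeig(bimap(cov_a, w))
```

Because `w` has orthonormal columns and `cov_a` is positive definite, `low_a` stays on the SPD manifold, which is the property that lets such layers be stacked.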

Similar articles

1. SPD Manifold Deep Metric Learning for Image Set Classification.
   IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):8924-8938. doi: 10.1109/TNNLS.2022.3216811. Epub 2024 Jul 8.
2. SymNet: A Simple Symmetric Positive Definite Manifold Deep Learning Method for Image Set Classification.
   IEEE Trans Neural Netw Learn Syst. 2022 May;33(5):2208-2222. doi: 10.1109/TNNLS.2020.3044176. Epub 2022 May 2.
3. U-SPDNet: An SPD manifold learning-based neural network for visual classification.
   Neural Netw. 2023 Apr;161:382-396. doi: 10.1016/j.neunet.2022.11.030. Epub 2022 Dec 14.
4. Learning a discriminative SPD manifold neural network for image set classification.
   Neural Netw. 2022 Jul;151:94-110. doi: 10.1016/j.neunet.2022.03.012. Epub 2022 Mar 16.
5. Dimensionality Reduction of SPD Data Based on Riemannian Manifold Tangent Spaces and Isometry.
   Entropy (Basel). 2021 Aug 27;23(9):1117. doi: 10.3390/e23091117.
6. Spectral-Based SPD Matrix Representation for Signal Detection Using a Deep Neutral Network.
   Entropy (Basel). 2020 May 22;22(5):585. doi: 10.3390/e22050585.
7. Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition With Image Sets.
   IEEE Trans Image Process. 2018;27(1):151-163. doi: 10.1109/TIP.2017.2746993.
8. Generalized Learning Riemannian Space Quantization: A Case Study on Riemannian Manifold of SPD Matrices.
   IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):281-292. doi: 10.1109/TNNLS.2020.2978514. Epub 2021 Jan 4.
9. Generalized Learning Vector Quantization With Log-Euclidean Metric Learning on Symmetric Positive-Definite Manifold.
   IEEE Trans Cybern. 2023 Aug;53(8):5178-5190. doi: 10.1109/TCYB.2022.3178412. Epub 2023 Jul 18.
10. A Robust Distance Measure for Similarity-Based Classification on the SPD Manifold.
    IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3230-3244. doi: 10.1109/TNNLS.2019.2939177. Epub 2019 Sep 27.