• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于黎曼流形上编码技术的全面审视。

A Comprehensive Look at Coding Techniques on Riemannian Manifolds.

作者信息

Faraki Masoud, Harandi Mehrtash T, Porikli Fatih

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5701-5712. doi: 10.1109/TNNLS.2018.2812799. Epub 2018 Mar 27.

DOI:10.1109/TNNLS.2018.2812799
PMID:29994290
Abstract

Core to many learning pipelines is visual recognition such as image and video classification. In such applications, having a compact yet rich and informative representation plays a pivotal role. An underlying assumption in traditional coding schemes [e.g., sparse coding (SC)] is that the data geometrically comply with the Euclidean space. In other words, the data are presented to the algorithm in vector form and Euclidean axioms are fulfilled. This is of course restrictive in machine learning, computer vision, and signal processing, as shown by a large number of recent studies. This paper takes a further step and provides a comprehensive mathematical framework to perform coding in curved and non-Euclidean spaces, i.e., Riemannian manifolds. To this end, we start by the simplest form of coding, namely, bag of words. Then, inspired by the success of vector of locally aggregated descriptors in addressing computer vision problems, we will introduce its Riemannian extensions. Finally, we study Riemannian form of SC, locality-constrained linear coding, and collaborative coding. Through rigorous tests, we demonstrate the superior performance of our Riemannian coding schemes against the state-of-the-art methods on several visual classification tasks, including head pose classification, video-based face recognition, and dynamic scene recognition.

摘要

许多学习流程的核心是视觉识别,如图像和视频分类。在这类应用中,拥有紧凑但丰富且信息量大的表示起着关键作用。传统编码方案(例如稀疏编码(SC))的一个潜在假设是数据在几何上符合欧几里得空间。换句话说,数据以向量形式呈现给算法,并且满足欧几里得公理。正如大量近期研究所表明的,这在机器学习、计算机视觉和信号处理中当然具有局限性。本文更进一步,提供了一个全面的数学框架,用于在弯曲和非欧几里得空间(即黎曼流形)中进行编码。为此,我们从最简单的编码形式——词袋模型开始。然后,受局部聚合描述符向量在解决计算机视觉问题方面取得成功的启发,我们将介绍其黎曼扩展。最后,我们研究稀疏编码、局部约束线性编码和协作编码的黎曼形式。通过严格测试,我们证明了我们的黎曼编码方案在包括头部姿态分类、基于视频的人脸识别和动态场景识别在内的多个视觉分类任务上相对于现有方法具有卓越性能。

相似文献

1
A Comprehensive Look at Coding Techniques on Riemannian Manifolds.关于黎曼流形上编码技术的全面审视。
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5701-5712. doi: 10.1109/TNNLS.2018.2812799. Epub 2018 Mar 27.
2
Cross Euclidean-to-Riemannian Metric Learning with Application to Face Recognition from Video.基于欧式到黎曼度量学习的视频人脸识别方法
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2827-2840. doi: 10.1109/TPAMI.2017.2776154. Epub 2017 Nov 22.
3
Elastic Functional Coding of Riemannian Trajectories.黎曼轨迹的弹性功能编码。
IEEE Trans Pattern Anal Mach Intell. 2017 May;39(5):922-936. doi: 10.1109/TPAMI.2016.2564409. Epub 2016 May 6.
4
Riemannian Dictionary Learning and Sparse Coding for Positive Definite Matrices.黎曼字典学习和正定矩阵的稀疏编码。
IEEE Trans Neural Netw Learn Syst. 2017 Dec;28(12):2859-2871. doi: 10.1109/TNNLS.2016.2601307. Epub 2016 Sep 13.
5
Kernel Methods on Riemannian Manifolds with Gaussian RBF Kernels.基于高斯 RBF 核的黎曼流形上的核方法。
IEEE Trans Pattern Anal Mach Intell. 2015 Dec;37(12):2464-77. doi: 10.1109/TPAMI.2015.2414422.
6
On A Nonlinear Generalization of Sparse Coding and Dictionary Learning.关于稀疏编码与字典学习的非线性推广
JMLR Workshop Conf Proc. 2013;28(3):1480-1488.
7
Generalized Learning Riemannian Space Quantization: A Case Study on Riemannian Manifold of SPD Matrices.广义学习黎曼空间量化:关于对称正定矩阵黎曼流形的一个案例研究
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):281-292. doi: 10.1109/TNNLS.2020.2978514. Epub 2021 Jan 4.
8
Probabilistic learning vector quantization on manifold of symmetric positive definite matrices.流形上的概率学习向量量化的对称正定矩阵。
Neural Netw. 2021 Oct;142:105-118. doi: 10.1016/j.neunet.2021.04.024. Epub 2021 Apr 28.
9
Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition With Image Sets.基于图像集的高斯分布黎曼流形的人脸识别判别分析。
IEEE Trans Image Process. 2018;27(1):151-163. doi: 10.1109/TIP.2017.2746993.
10
Learning to Optimize on Riemannian Manifolds.学习在黎曼流形上进行优化。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5935-5952. doi: 10.1109/TPAMI.2022.3215702. Epub 2023 Apr 3.