Topology dictionary for 3D video understanding.

Affiliation

Department of Intelligence Science and Technology, Matsuyama Laboratory, Graduate School of Informatics, Kyoto University, Kyoto, Japan.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1645-57. doi: 10.1109/TPAMI.2011.258.

DOI: 10.1109/TPAMI.2011.258
PMID: 22745004
Abstract

This paper presents a novel approach that achieves 3D video understanding. 3D video consists of a stream of 3D models of subjects in motion. The acquisition of long sequences requires large storage space (2 GB for 1 min). Moreover, it is tedious to browse data sets and extract meaningful information. We propose the topology dictionary to encode and describe 3D video content. The model consists of a topology-based shape descriptor dictionary which can be generated from either extracted patterns or training sequences. The model relies on 1) topology description and classification using Reeb graphs, and 2) a Markov motion graph to represent topology change states. We show that the use of Reeb graphs as the high-level topology descriptor is relevant. It allows the dictionary to automatically model complex sequences, whereas other strategies would require prior knowledge on the shape and topology of the captured subjects. Our approach serves to encode 3D video sequences, and can be applied for content-based description and summarization of 3D video sequences. Furthermore, topology class labeling during a learning process enables the system to perform content-based event recognition. Experiments were carried out on various 3D videos. We showcase an application for 3D video progressive summarization using the topology dictionary.
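The abstract describes a Markov motion graph over topology change states. As a rough illustration only, and not the authors' implementation, the sketch below assumes each frame has already been assigned a topology class label (in the paper this comes from Reeb graph classification, which is not reproduced here). The label names and the helper functions `build_motion_graph` and `run_length_encode` are hypothetical. It estimates first-order transition probabilities between classes and encodes the sequence compactly as runs of labels.

```python
from collections import Counter, defaultdict


def build_motion_graph(labels):
    """Estimate first-order Markov transition probabilities between classes.

    Returns a dict mapping each source class to a dict of
    {destination class: probability}.
    """
    counts = defaultdict(Counter)
    # Count every consecutive (source, destination) pair in the sequence.
    for src, dst in zip(labels, labels[1:]):
        counts[src][dst] += 1
    graph = {}
    for src, outgoing in counts.items():
        total = sum(outgoing.values())
        graph[src] = {dst: n / total for dst, n in outgoing.items()}
    return graph


def run_length_encode(labels):
    """Encode a label sequence as (label, run length) pairs."""
    encoded = []
    for label in labels:
        if encoded and encoded[-1][0] == label:
            encoded[-1] = (label, encoded[-1][1] + 1)
        else:
            encoded.append((label, 1))
    return encoded


# Hypothetical per-frame topology class labels (assumed, not from the paper).
frames = ["stand", "stand", "arms_up", "arms_up", "arms_up",
          "stand", "crouch", "stand"]

graph = build_motion_graph(frames)      # transition probabilities per class
encoded = run_length_encode(frames)     # compact description of the sequence
dictionary = sorted(set(frames))        # the distinct topology classes seen
```

In this toy setting, the set of distinct labels plays the role of the dictionary, the run-length pairs give a content-based description of the sequence, and the transition table indicates which topology changes are frequent and which are rare.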

Similar articles

1
Topology dictionary for 3D video understanding.
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1645-57. doi: 10.1109/TPAMI.2011.258.
2
Learning sparse representations for human action recognition.
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1576-88. doi: 10.1109/TPAMI.2011.253.
3
A Unified Framework for Event Summarization and Rare Event Detection from Multiple Views.
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1737-50. doi: 10.1109/TPAMI.2014.2385695.
4
Protein topology recognition from secondary structure sequences: application of the hidden Markov models to the alpha class proteins.
J Mol Biol. 1997 Mar 28;267(2):446-63. doi: 10.1006/jmbi.1996.0874.
5
Transferring of speech movements from video to 3D face space.
IEEE Trans Vis Comput Graph. 2007 Jan-Feb;13(1):58-69. doi: 10.1109/TVCG.2007.22.
6
Videography-Based Unconstrained Video Analysis.
IEEE Trans Image Process. 2017 May;26(5):2261-2273. doi: 10.1109/TIP.2017.2678800. Epub 2017 Mar 6.
7
Analysis and synthesis of textured motion: particles and waves.
IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1348-63. doi: 10.1109/TPAMI.2004.76.
8
Alignment of continuous video onto 3D point clouds.
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1305-18. doi: 10.1109/TPAMI.2005.152.
9
Probabilistic space-time video modeling via piecewise GMM.
IEEE Trans Pattern Anal Mach Intell. 2004 Mar;26(3):384-96. doi: 10.1109/TPAMI.2004.1262334.
10
Graph Convolutional Dictionary Selection With L₂ₚ Norm for Video Summarization.
IEEE Trans Image Process. 2022;31:1789-1804. doi: 10.1109/TIP.2022.3146012. Epub 2022 Feb 10.