• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于时间层次字典引导解码的在线手势分割与识别。

Temporal Hierarchical Dictionary Guided Decoding for Online Gesture Segmentation and Recognition.

出版信息

IEEE Trans Image Process. 2020;29:9689-9702. doi: 10.1109/TIP.2020.3028962. Epub 2020 Oct 28.

DOI:10.1109/TIP.2020.3028962
PMID:33052853
Abstract

Online segmentation and recognition of skeleton- based gestures are challenging. Compared with offline cases, the inference of online settings can only rely on the current few frames and always completes before whole temporal movements are performed. However, incompletely performed gestures are ambiguous and their early recognition is easy to fall into local optimum. In this work, we address the problem with a temporal hierarchical dictionary to guide the hidden Markov model (HMM) decoding procedure. The intuition is that, gestures are ambiguous with high uncertainty at early performing phases, and only become discriminate after certain phases. This uncertainty naturally can be measured by entropy. Thus, we propose a measurement called "relative entropy map" (REM) to encode this temporal context to guide HMM decoding. Furthermore, we introduce a progressive learning strategy with which neural networks could learn a robust recognition of HMM states in an iterative manner. The performance of our method is intensively evaluated on three challenging databases and achieves state-of-the-art results. Our method shows the abilities of both extracting the discriminate connotations and reducing large redundancy in the HMM transition process. It is verified that our framework can achieve online recognition of continuous gesture streams even when they are halfway performed.

摘要

基于骨架的手势的在线分割和识别具有挑战性。与离线情况相比,在线设置的推断只能依赖当前的少数几帧,并且必须在整个时间运动完成之前完成。然而,未完全执行的手势是模糊的,它们的早期识别容易陷入局部最优。在这项工作中,我们使用时间层次字典来解决这个问题,以指导隐马尔可夫模型(HMM)解码过程。直觉是,手势在早期执行阶段具有很高的不确定性,并且只有在某些阶段之后才变得有区别。这种不确定性自然可以用熵来衡量。因此,我们提出了一种称为“相对熵图”(REM)的度量方法,以将这种时间上下文编码为指导 HMM 解码。此外,我们引入了一种渐进式学习策略,神经网络可以通过迭代的方式学习到 HMM 状态的鲁棒识别。我们的方法在三个具有挑战性的数据库上进行了密集评估,并取得了最先进的结果。我们的方法展示了在 HMM 转换过程中提取判别内涵和减少大量冗余的能力。验证了即使在手势流执行到一半时,我们的框架也能够实现连续手势流的在线识别。

相似文献

1
Temporal Hierarchical Dictionary Guided Decoding for Online Gesture Segmentation and Recognition.基于时间层次字典引导解码的在线手势分割与识别。
IEEE Trans Image Process. 2020;29:9689-9702. doi: 10.1109/TIP.2020.3028962. Epub 2020 Oct 28.
2
Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition.深度动态神经网络用于多模态手势分割与识别。
IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1583-97. doi: 10.1109/TPAMI.2016.2537340. Epub 2016 Mar 2.
3
3D Skeletal Gesture Recognition via Hidden States Exploration.通过隐藏状态探索实现的3D骨骼手势识别
IEEE Trans Image Process. 2020 Feb 21. doi: 10.1109/TIP.2020.2974061.
4
A unified framework for gesture recognition and spatiotemporal gesture segmentation.用于手势识别和时空手势分割的统一框架。
IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1685-99. doi: 10.1109/TPAMI.2008.203.
5
Finger Gesture Spotting from Long Sequences Based on Multi-Stream Recurrent Neural Networks.基于多流循环神经网络的长序列手指手势识别。
Sensors (Basel). 2020 Jan 18;20(2):528. doi: 10.3390/s20020528.
6
FLGR: Fixed Length Gists Representation Learning for RNN-HMM Hybrid-Based Neuromorphic Continuous Gesture Recognition.FLGR:用于基于RNN-HMM混合模型的神经形态连续手势识别的定长要点表示学习
Front Neurosci. 2019 Feb 12;13:73. doi: 10.3389/fnins.2019.00073. eCollection 2019.
7
A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery.机器人手术中手势分割与识别的数据集及基准
IEEE Trans Biomed Eng. 2017 Sep;64(9):2025-2041. doi: 10.1109/TBME.2016.2647680. Epub 2017 Jan 4.
8
A Novel Phonology- and Radical-Coded Chinese Sign Language Recognition Framework Using Accelerometer and Surface Electromyography Sensors.一种使用加速度计和表面肌电图传感器的新颖的基于音韵和部首编码的中国手语识别框架。
Sensors (Basel). 2015 Sep 15;15(9):23303-24. doi: 10.3390/s150923303.
9
HAGR-D: A Novel Approach for Gesture Recognition with Depth Maps.HAGR-D:一种利用深度图进行手势识别的新方法。
Sensors (Basel). 2015 Nov 12;15(11):28646-64. doi: 10.3390/s151128646.
10
Real-time gesture interface based on event-driven processing from stereo silicon retinas.基于立体硅视网膜事件驱动处理的实时手势界面。
IEEE Trans Neural Netw Learn Syst. 2014 Dec;25(12):2250-63. doi: 10.1109/TNNLS.2014.2308551.