使用嵌套动态规划处理连续手语识别中的运动插入和手部分割歧义。

Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.

机构信息

University of South Florida, Tampa, FL, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):462-77. doi: 10.1109/TPAMI.2009.26.

DOI:10.1109/TPAMI.2009.26

PMID:20075472

Abstract

We consider two crucial problems in continuous sign language recognition from unaided video sequences. At the sentence level, we consider the movement epenthesis (me) problem and at the feature level, we consider the problem of hand segmentation and grouping. We construct a framework that can handle both of these problems based on an enhanced, nested version of the dynamic programming approach. To address movement epenthesis, a dynamic programming (DP) process employs a virtual me option that does not need explicit models. We call this the enhanced level building (eLB) algorithm. This formulation also allows the incorporation of grammar models. Nested within this eLB is another DP that handles the problem of selecting among multiple hand candidates. We demonstrate our ideas on four American Sign Language data sets with simple background, with the signer wearing short sleeves, with complex background, and across signers. We compared the performance with Conditional Random Fields (CRF) and Latent Dynamic-CRF-based approaches. The experiments show more than 40 percent improvement over CRF or LDCRF approaches in terms of the frame labeling rate. We show the flexibility of our approach when handling a changing context. We also find a 70 percent improvement in sign recognition rate over the unenhanced DP matching algorithm that does not accommodate the me effect.

摘要

我们考虑了连续手语识别中两个关键问题。在句子层面，我们考虑运动插入（ME）问题，在特征层面，我们考虑手分割和分组问题。我们构建了一个基于增强嵌套动态规划方法的框架来处理这两个问题。为了解决运动插入问题，动态规划（DP）过程采用了一种不需要显式模型的虚拟 ME 选项。我们称之为增强层构建（eLB）算法。这种公式还允许合并语法模型。嵌套在这个 eLB 中的是另一个 DP，用于在多个手候选者中进行选择。我们在四个美国手语数据集上展示了我们的想法，这些数据集的背景简单，签名者穿着短袖，背景复杂，以及跨签名者。我们将性能与条件随机场（CRF）和基于潜在动态 CRF 的方法进行了比较。实验表明，在帧标记率方面，与 CRF 或 LDCRF 方法相比，我们的方法提高了 40%以上。我们展示了我们的方法在处理变化的上下文时的灵活性。我们还发现，与不适应 ME 效应的未增强 DP 匹配算法相比，签名识别率提高了 70%。

相似文献

Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.

IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):462-77. doi: 10.1109/TPAMI.2009.26.

A unified framework for gesture recognition and spatiotemporal gesture segmentation.

IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1685-99. doi: 10.1109/TPAMI.2008.203.

Automatic sign language analysis: a survey and the future beyond lexical meaning.

IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):873-91. doi: 10.1109/TPAMI.2005.112.

Sign language spotting with a threshold model based on conditional random fields.

IEEE Trans Pattern Anal Mach Intell. 2009 Jul;31(7):1264-77. doi: 10.1109/TPAMI.2008.172.

Sign language recognition using intrinsic-mode sample entropy on sEMG and accelerometer data.

IEEE Trans Biomed Eng. 2009 Dec;56(12):2879-90. doi: 10.1109/TBME.2009.2013200. Epub 2009 Jan 23.

Model-based hand tracking using a hierarchical Bayesian filter.

IEEE Trans Pattern Anal Mach Intell. 2006 Sep;28(9):1372-84. doi: 10.1109/TPAMI.2006.189.

Action recognition using mined hierarchical compound features.

IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):883-97. doi: 10.1109/TPAMI.2010.144.

PADS: a probabilistic activity detection framework for video data.

IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2246-61. doi: 10.1109/TPAMI.2010.33.

Video registration using dynamic textures.

IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):158-71. doi: 10.1109/TPAMI.2010.61.

Sign language recognition with the Kinect sensor based on conditional random fields.

Sensors (Basel). 2014 Dec 24;15(1):135-47. doi: 10.3390/s150100135.

引用本文的文献

Novel Wearable System to Recognize Sign Language in Real Time.

Sensors (Basel). 2024 Jul 16;24(14):4613. doi: 10.3390/s24144613.

Methods, Databases and Recent Advancement of Vision-Based Hand Gesture Recognition for HCI Systems: A Review.

SN Comput Sci. 2021;2(6):436. doi: 10.1007/s42979-021-00827-x. Epub 2021 Aug 29.

A New Spiking Convolutional Recurrent Neural Network (SCRNN) With Applications to Event-Based Hand Gesture Recognition.

Front Neurosci. 2020 Nov 17;14:590164. doi: 10.3389/fnins.2020.590164. eCollection 2020.

A Novel Phonology- and Radical-Coded Chinese Sign Language Recognition Framework Using Accelerometer and Surface Electromyography Sensors.

Sensors (Basel). 2015 Sep 15;15(9):23303-24. doi: 10.3390/s150923303.

Real-time hand gesture recognition using finger segmentation.

ScientificWorldJournal. 2014;2014:267872. doi: 10.1155/2014/267872. Epub 2014 Jun 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用嵌套动态规划处理连续手语识别中的运动插入和手部分割歧义。

Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.

机构信息

University of South Florida, Tampa, FL, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):462-77. doi: 10.1109/TPAMI.2009.26.

DOI:10.1109/TPAMI.2009.26

PMID:20075472

Abstract

摘要

使用嵌套动态规划处理连续手语识别中的运动插入和手部分割歧义。

Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用嵌套动态规划处理连续手语识别中的运动插入和手部分割歧义。

Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.

机构信息

出版信息

相似文献

引用本文的文献