用于手势识别和时空手势分割的统一框架。

A unified framework for gesture recognition and spatiotemporal gesture segmentation.

作者信息

Alon Jonathan, Athitsos Vassilis, Yuan Quan, Sclaroff Stan

机构信息

Computer Science Department, Boston University, Boston, MA 02215, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1685-99. doi: 10.1109/TPAMI.2008.203.

DOI:10.1109/TPAMI.2008.203

PMID:19574627

Abstract

Within the context of hand gesture recognition, spatiotemporal gesture segmentation is the task of determining, in a video sequence, where the gesturing hand is located and when the gesture starts and ends. Existing gesture recognition methods typically assume either known spatial segmentation or known temporal segmentation, or both. This paper introduces a unified framework for simultaneously performing spatial segmentation, temporal segmentation, and recognition. In the proposed framework, information flows both bottom-up and top-down. A gesture can be recognized even when the hand location is highly ambiguous and when information about when the gesture begins and ends is unavailable. Thus, the method can be applied to continuous image streams where gestures are performed in front of moving, cluttered backgrounds. The proposed method consists of three novel contributions: a spatiotemporal matching algorithm that can accommodate multiple candidate hand detections in every frame, a classifier-based pruning framework that enables accurate and early rejection of poor matches to gesture models, and a subgesture reasoning algorithm that learns which gesture models can falsely match parts of other longer gestures. The performance of the approach is evaluated on two challenging applications: recognition of hand-signed digits gestured by users wearing short-sleeved shirts, in front of a cluttered background, and retrieval of occurrences of signs of interest in a video database containing continuous, unsegmented signing in American Sign Language (ASL).

摘要

在手势识别的背景下，时空手势分割是在视频序列中确定做出手势的手的位置以及手势何时开始和结束的任务。现有的手势识别方法通常假设已知空间分割或已知时间分割，或者两者都已知。本文介绍了一个用于同时执行空间分割、时间分割和识别的统一框架。在所提出的框架中，信息自下而上和自上而下流动。即使手的位置高度模糊且手势开始和结束的信息不可用时，也可以识别出手势。因此，该方法可以应用于在移动的、杂乱的背景前执行手势的连续图像流。所提出的方法包括三个新颖的贡献：一种时空匹配算法，该算法可以在每一帧中容纳多个候选手部检测；一个基于分类器的剪枝框架，该框架能够准确且早期拒绝与手势模型的不良匹配；以及一个子手势推理算法，该算法学习哪些手势模型可能错误地匹配其他更长手势的部分。该方法的性能在两个具有挑战性的应用中进行了评估：识别穿着短袖衬衫的用户在杂乱背景前做出的手语数字，以及在包含美国手语（ASL）连续、未分割手语的视频数据库中检索感兴趣的手语出现情况。

相似文献

A unified framework for gesture recognition and spatiotemporal gesture segmentation.

IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1685-99. doi: 10.1109/TPAMI.2008.203.

Model-based hand tracking using a hierarchical Bayesian filter.

IEEE Trans Pattern Anal Mach Intell. 2006 Sep;28(9):1372-84. doi: 10.1109/TPAMI.2006.189.

Automatic sign language analysis: a survey and the future beyond lexical meaning.

IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):873-91. doi: 10.1109/TPAMI.2005.112.

Weakly supervised training of a sign language recognition system using multiple instance learning density matrices.

IEEE Trans Syst Man Cybern B Cybern. 2011 Apr;41(2):526-41. doi: 10.1109/TSMCB.2010.2065802. Epub 2010 Sep 23.

Distribution-based dimensionality reduction applied to articulated motion recognition.

IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):795-810. doi: 10.1109/TPAMI.2008.80.

Analysis of head gesture and prosody patterns for prosody-driven head-gesture animation.

IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1330-45. doi: 10.1109/TPAMI.2007.70797.

Analyzing and capturing articulated hand motion in image sequences.

IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1910-22. doi: 10.1109/TPAMI.2005.233.

Sign language spotting with a threshold model based on conditional random fields.

IEEE Trans Pattern Anal Mach Intell. 2009 Jul;31(7):1264-77. doi: 10.1109/TPAMI.2008.172.

Inferring segmented dense motion layers using 5D tensor voting.

IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1589-602. doi: 10.1109/TPAMI.2007.70802.

Discriminative feature co-occurrence selection for object detection.

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1257-69. doi: 10.1109/TPAMI.2007.70767.

引用本文的文献

Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition.

Innov Syst Softw Eng. 2022 Aug 29:1-14. doi: 10.1007/s11334-022-00477-z.

Context-Aware Automatic Sign Language Video Transcription in Psychiatric Interviews.

Sensors (Basel). 2022 Mar 30;22(7):2656. doi: 10.3390/s22072656.

Human Activity and Motion Pattern Recognition within Indoor Environment Using Convolutional Neural Networks Clustering and Naive Bayes Classification Algorithms.

Sensors (Basel). 2022 Jan 28;22(3):1016. doi: 10.3390/s22031016.

Methods, Databases and Recent Advancement of Vision-Based Hand Gesture Recognition for HCI Systems: A Review.

SN Comput Sci. 2021;2(6):436. doi: 10.1007/s42979-021-00827-x. Epub 2021 Aug 29.

Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping.

KDD. 2012 Aug;2012:262-270. doi: 10.1145/2339530.2339576.

Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping.

ACM Trans Knowl Discov Data. 2013 Sep;7(3).

Hand Gesture Recognition in Automotive Human⁻Machine Interaction Using Depth Cameras.

Sensors (Basel). 2018 Dec 24;19(1):59. doi: 10.3390/s19010059.

An Human-Computer Interactive Augmented Reality System for Coronary Artery Diagnosis Planning and Training.

J Med Syst. 2017 Sep 2;41(10):159. doi: 10.1007/s10916-017-0805-5.

Forecasting Occurrences of Activities.

Pervasive Mob Comput. 2017 Jul;38(Pt 1):77-91. doi: 10.1016/j.pmcj.2016.09.010. Epub 2016 Sep 27.

Activity Learning as a Foundation for Security Monitoring in Smart Homes.

Sensors (Basel). 2017 Mar 31;17(4):737. doi: 10.3390/s17040737.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于手势识别和时空手势分割的统一框架。

A unified framework for gesture recognition and spatiotemporal gesture segmentation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献