Yang Hee-Deok, Sclaroff Stan, Lee Seong-Whan
Department of Computer Science and Engineering, Korea University, Seoul 136-713, Korea.
IEEE Trans Pattern Anal Mach Intell. 2009 Jul;31(7):1264-77. doi: 10.1109/TPAMI.2008.172.
Sign language spotting is the task of detecting and recognizing signs in a signed utterance, in a set vocabulary. The difficulty of sign language spotting is that instances of signs vary in both motion and appearance. Moreover, signs appear within a continuous gesture stream, interspersed with transitional movements between signs in a vocabulary and nonsign patterns (which include out-of-vocabulary signs, epentheses, and other movements that do not correspond to signs). In this paper, a novel method for designing threshold models in a conditional random field (CRF) model is proposed which performs an adaptive threshold for distinguishing between signs in a vocabulary and nonsign patterns. A short-sign detector, a hand appearance-based sign verification method, and a subsign reasoning method are included to further improve sign language spotting accuracy. Experiments demonstrate that our system can spot signs from continuous data with an 87.0 percent spotting rate and can recognize signs from isolated data with a 93.5 percent recognition rate versus 73.5 percent and 85.4 percent, respectively, for CRFs without a threshold model, short-sign detection, subsign reasoning, and hand appearance-based sign verification. Our system can also achieve a 15.0 percent sign error rate (SER) from continuous data and a 6.4 percent SER from isolated data versus 76.2 percent and 14.5 percent, respectively, for conventional CRFs.
手语识别是在特定词汇表中检测和识别手语话语中手势的任务。手语识别的难点在于,手势实例在动作和外观上都存在差异。此外,手势出现在连续的手势流中,夹杂着词汇表中手势之间的过渡动作以及非手势模式(包括词汇表外的手势、插入音以及其他与手势不对应的动作)。本文提出了一种在条件随机场(CRF)模型中设计阈值模型的新方法,该方法执行自适应阈值以区分词汇表中的手势和非手势模式。还包括一个短手势检测器、一种基于手部外观的手势验证方法以及一种子手势推理方法,以进一步提高手语识别的准确性。实验表明,我们的系统能够从连续数据中识别出手势,识别率为87.0%,从孤立数据中识别手势的准确率为93.5%,而没有阈值模型、短手势检测、子手势推理和基于手部外观的手势验证的CRF分别为73.5%和85.4%。我们的系统从连续数据中还能实现15.0%的手势错误率(SER),从孤立数据中实现6.4%的SER,而传统CRF分别为76.2%和14.5%。