Suppr超能文献

支持手势识别的单次标注。

Supporting One-Time Point Annotations for Gesture Recognition.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2270-2283. doi: 10.1109/TPAMI.2016.2637350. Epub 2016 Dec 8.

Abstract

This paper investigates a new annotation technique that reduces significantly the amount of time to annotate training data for gesture recognition. Conventionally, the annotations comprise the start and end times, and the corresponding labels of gestures in sensor recordings. In this work, we propose a one-time point annotation in which labelers do not have to select the start and end time carefully, but just mark a one-time point within the time a gesture is happening. The technique gives more freedom and reduces significantly the burden for labelers. To make the one-time point annotations applicable, we propose a novel BoundarySearch algorithm to find automatically the correct temporal boundaries of gestures by discovering data patterns around their given one-time point annotations. The corrected annotations are then used to train gesture models. We evaluate the method on three applications from wearable gesture recognition with various gesture classes (10-17 classes) recorded with different sensor modalities. The results show that training on the corrected annotations can achieve performances close to a fully supervised training on clean annotations (lower by just up to 5 percent F1-score on average). Furthermore, the BoundarySearch algorithm is also evaluated on the ChaLearn 2014 multi-modal gesture recognition challenge recorded with Kinect sensors from computer vision and achieves similar results.

摘要

本文研究了一种新的注释技术,可大大减少手势识别训练数据注释的时间。传统上,注释包括传感器记录中手势的开始和结束时间以及相应的标签。在这项工作中,我们提出了一种一次性注释,注释者不必仔细选择开始和结束时间,只需在手势发生的时间内标记一个时间点。该技术提供了更大的自由度,并大大减轻了注释者的负担。为了使一次性注释适用,我们提出了一种新的边界搜索算法,通过发现给定一次性注释周围的数据模式来自动找到手势的正确时间边界。然后使用校正后的注释来训练手势模型。我们在三个应用程序中评估了该方法,这些应用程序来自可穿戴手势识别,记录了具有不同传感器模态的各种手势类别(10-17 个类别)。结果表明,在经过校正的注释上进行训练可以达到与在干净注释上进行完全监督训练相当的性能(平均仅低 5 个百分点 F1 得分)。此外,边界搜索算法还在 ChaLearn 2014 多模态手势识别挑战赛中进行了评估,该挑战赛使用来自计算机视觉的 Kinect 传感器记录,结果相似。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验