IEEE Trans Neural Syst Rehabil Eng. 2021;29:1004-1015. doi: 10.1109/TNSRE.2021.3077413. Epub 2021 Jun 8.
This work is motivated by the recent advances in Deep Neural Networks (DNNs) and their widespread applications in human-machine interfaces. DNNs have been recently used for detecting the intended hand gesture through the processing of surface electromyogram (sEMG) signals. Objective: Although DNNs have shown superior accuracy compared to conventional methods when large amounts of data are available for training, their performance substantially decreases when data are limited. Collecting large datasets for training may be feasible in research laboratories, but it is not a practical approach for real-life applications. The main objective of this work is to design a modern DNN-based gesture detection model that relies on minimal training data while providing high accuracy. Methods: We propose the novel Few-Shot learning- Hand Gesture Recognition (FS-HGR) architecture. Few-shot learning is a variant of domain adaptation with the goal of inferring the required output based on just one or a few training observations. The proposed FS-HGR generalizes after seeing very few observations from each class by combining temporal convolutions with attention mechanisms. This allows the meta-learner to aggregate contextual information from experience and to pinpoint specific pieces of information within its available set of inputs. Data Source & Summary of Results: The performance of FS-HGR was tested on the second and fifth Ninapro databases, referred to as the DB2 and DB5, respectively. The DB2 consists of 50 gestures (rest included) from 40 healthy subjects. The Ninapro DB5 contains data from 10 healthy participants performing a total of 53 different gestures (rest included). The proposed approach for the Ninapro DB2 led to 85.94% classification accuracy on new repetitions with few-shot observation (5-way 5-shot), 81.29% accuracy on new subjects with few-shot observation (5-way 5-shot), and 73.36% accuracy on new gestures with few-shot observation (5-way 5-shot). Moreover, the proposed approach for the Ninapro DB5 led to 64.65% classification accuracy on new subjects with few-shot observation (5-way 5-shot).
这项工作的动机是最近在深度神经网络(DNN)方面的进展以及它们在人机接口中的广泛应用。最近,通过处理表面肌电图(sEMG)信号,DNN 已被用于检测预期的手势。
尽管 DNN 在有大量数据可用于训练时表现出优于传统方法的准确性,但当数据有限时,其性能会大大降低。在研究实验室中,收集大量数据集进行训练可能是可行的,但对于实际应用来说并不是一种实用的方法。这项工作的主要目的是设计一种基于现代 DNN 的手势检测模型,该模型依赖于最小的训练数据,同时提供高精度。
我们提出了新颖的 Few-Shot learning-Hand Gesture Recognition (FS-HGR) 架构。Few-Shot learning 是一种域自适应变体,其目标是仅根据一个或几个训练观察值推断所需的输出。通过将时间卷积与注意力机制相结合,所提出的 FS-HGR 在看到每个类别的极少数观察值后进行泛化。这允许元学习者从经验中聚合上下文信息,并在其可用输入集中定位特定的信息。
FS-HGR 的性能在第二和第五个 Ninapro 数据库(分别称为 DB2 和 DB5)上进行了测试。DB2 由 40 位健康受试者的 50 个手势(包括休息)组成。Ninapro DB5 包含来自 10 位健康参与者执行的总共 53 种不同手势(包括休息)的数据。针对 Ninapro DB2 的建议方法在新的重复观察中(5-way 5-shot)获得了 85.94%的分类准确率,在新的少数观察中(5-way 5-shot)获得了 81.29%的新受试者准确率,在新的少数观察中(5-way 5-shot)获得了 73.36%的新手势准确率。此外,针对 Ninapro DB5 的建议方法在新的少数观察受试者(5-way 5-shot)中获得了 64.65%的分类准确率。