Suppr超能文献

基于语音识别的辅助机器人控制的高效自注意力模型。

Efficient Self-Attention Model for Speech Recognition-Based Assistive Robots Control.

机构信息

Université Laval, Quebec City, QC G1V 0A6, Canada.

Centre for Interdisciplinary Research in Rehabilitation and Social Integration, CIUSSS de la Capitale-Nationale, Quebec City, QC G1M 2S8, Canada.

出版信息

Sensors (Basel). 2023 Jun 30;23(13):6056. doi: 10.3390/s23136056.

Abstract

Assistive robots are tools that people living with upper body disabilities can leverage to autonomously perform Activities of Daily Living (ADL). Unfortunately, conventional control methods still rely on low-dimensional, easy-to-implement interfaces such as joysticks that tend to be unintuitive and cumbersome to use. In contrast, vocal commands may represent a viable and intuitive alternative. This work represents an important step toward providing a viable vocal interface for people living with upper limb disabilities by proposing a novel lightweight vocal command recognition system. The proposed model leverages the MobileNet2 architecture, augmenting it with a novel approach to the self-attention mechanism, achieving a new state-of-the-art performance for Keyword Spotting (KWS) on the Google Speech Commands Dataset (GSCD). Moreover, this work presents a new dataset, referred to as the French Speech Commands Dataset (FSCD), comprising 4963 vocal command utterances. Using the GSCD as the source, we used Transfer Learning (TL) to adapt the model to this cross-language task. TL has been shown to significantly improve the model performance on the FSCD. The viability of the proposed approach is further demonstrated through real-life control of a robotic arm by four healthy participants using both the proposed vocal interface and a joystick.

摘要

辅助机器人是上肢残疾人士可以自主执行日常生活活动 (ADL) 的工具。遗憾的是,传统的控制方法仍然依赖于低维、易于实现的接口,如操纵杆,这些接口往往不直观且使用起来很繁琐。相比之下,语音命令可能代表一种可行且直观的替代方法。这项工作通过提出一种新颖的轻量级语音命令识别系统,代表了为上肢残疾人士提供可行语音接口的重要一步。所提出的模型利用了 MobileNet2 架构,并通过一种新颖的方法对自注意力机制进行了增强,从而在 Google Speech Commands Dataset (GSCD) 上的关键字识别 (KWS) 方面实现了新的最先进性能。此外,这项工作还提出了一个新的数据集,称为 French Speech Commands Dataset (FSCD),包含 4963 个语音命令发音。我们使用 GSCD 作为源,使用迁移学习 (TL) 将模型适用于这项跨语言任务。TL 已被证明可以显著提高模型在 FSCD 上的性能。通过四名健康参与者使用所提出的语音接口和操纵杆对机器人手臂进行实时控制,进一步证明了所提出方法的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b81e/10347238/92ab20abe8f1/sensors-23-06056-g001.jpg

相似文献

7
Wireless sEMG-Based Body-Machine Interface for Assistive Technology Devices.用于辅助技术设备的基于无线表面肌电图的人体-机器接口
IEEE J Biomed Health Inform. 2017 Jul;21(4):967-977. doi: 10.1109/JBHI.2016.2642837. Epub 2016 Dec 21.

本文引用的文献

2
Assistive robotic arm: Evaluation of the performance of intelligent algorithms.辅助机器人手臂:智能算法性能评估
Assist Technol. 2021 Mar 4;33(2):95-104. doi: 10.1080/10400435.2019.1601649. Epub 2019 May 9.
3
Long-term use of the JACO robotic arm: a case series.JACO机器人手臂的长期使用:病例系列
Disabil Rehabil Assist Technol. 2019 Apr;14(3):267-275. doi: 10.1080/17483107.2018.1428692. Epub 2018 Jan 31.
6
A review of assistive devices for arm balancing.手臂平衡辅助设备综述。
IEEE Int Conf Rehabil Robot. 2013 Jun;2013:6650485. doi: 10.1109/ICORR.2013.6650485.
10
Dynamic programming.动态规划。
Science. 1966 Jul 1;153(3731):34-7. doi: 10.1126/science.153.3731.34.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验