基于时空神经网络的高密度表面肌电图解码无声语音

Decoding Silent Speech Based on High-Density Surface Electromyogram Using Spatiotemporal Neural Network.

作者信息

Chen Xi, Zhang Xu, Chen Xiang, Chen Xun

出版信息

IEEE Trans Neural Syst Rehabil Eng. 2023;31:2069-2078. doi: 10.1109/TNSRE.2023.3266299. Epub 2023 Apr 26.

DOI:10.1109/TNSRE.2023.3266299

Abstract

Finer-grained decoding at a phoneme or syllable level is a key technology for continuous recognition of silent speech based on surface electromyogram (sEMG). This paper aims at developing a novel syllable-level decoding method for continuous silent speech recognition (SSR) using spatio-temporal end-to-end neural network. In the proposed method, the high-density sEMG (HD-sEMG) was first converted into a series of feature images, and then a spatio-temporal end-to-end neural network was applied to extract discriminative feature representations and to achieve syllable-level decoding. The effectiveness of the proposed method was verified with HD-sEMG data recorded by four pieces of 64-channel electrode arrays placed over facial and laryngeal muscles of fifteen subjects subvocalizing 33 Chinese phrases consisting of 82 syllables. The proposed method outperformed the benchmark methods by achieving the highest phrase classification accuracy (97.17 ± 1.53%, ), and lower character error rate (3.11 ± 1.46%, ). This study provides a promising way of decoding sEMG towards SSR, which has great potential applications in instant communication and remote control.

摘要

在音素或音节层面进行更细粒度的解码是基于表面肌电图（sEMG）的无声语音连续识别的一项关键技术。本文旨在开发一种使用时空端到端神经网络的新型音节级解码方法，用于连续无声语音识别（SSR）。在所提出的方法中，首先将高密度sEMG（HD-sEMG）转换为一系列特征图像，然后应用时空端到端神经网络来提取判别性特征表示并实现音节级解码。通过由放置在15名受试者面部和喉部肌肉上的四套64通道电极阵列记录的HD-sEMG数据，验证了所提出方法的有效性，这些受试者默读了由82个音节组成的33个中文短语。所提出的方法通过实现最高的短语分类准确率（97.17 ± 1.53%）和较低的字符错误率（3.11 ± 1.46%），优于基准方法。本研究为朝着SSR方向解码sEMG提供了一种有前景的方法，其在即时通信和远程控制方面具有巨大的潜在应用。

相似文献

Decoding Silent Speech Based on High-Density Surface Electromyogram Using Spatiotemporal Neural Network.基于时空神经网络的高密度表面肌电图解码无声语音

IEEE Trans Neural Syst Rehabil Eng. 2023;31:2069-2078. doi: 10.1109/TNSRE.2023.3266299. Epub 2023 Apr 26.

Towards optimizing electrode configurations for silent speech recognition based on high-density surface electromyography.针对基于高密度表面肌电图的无声语音识别的电极配置进行优化。

J Neural Eng. 2021 Jan 25;18(1). doi: 10.1088/1741-2552/abca14.

Decoding subtle forearm flexions using fractal features of surface electromyogram from single and multiple sensors.使用来自单个和多个传感器的表面肌电图的分形特征来解码微妙的前臂弯曲。

J Neuroeng Rehabil. 2010 Oct 21;7:53. doi: 10.1186/1743-0003-7-53.

sEMG-based technology for silent voice recognition.基于表面肌电的无声语音识别技术。

Comput Biol Med. 2023 Jan;152:106336. doi: 10.1016/j.compbiomed.2022.106336. Epub 2022 Nov 18.

High-density surface electromyography: A visualization method of laryngeal muscle activity.高密度表面肌电图：一种喉部肌肉活动的可视化方法。

Laryngoscope. 2019 Oct;129(10):2347-2353. doi: 10.1002/lary.27784. Epub 2019 Jan 21.

Improved High-Density Myoelectric Pattern Recognition Control Against Electrode Shift Using Data Augmentation and Dilated Convolutional Neural Network.使用数据增强和扩张卷积神经网络改进高密度肌电模式识别控制以对抗电极移位。

IEEE Trans Neural Syst Rehabil Eng. 2020 Dec;28(12):2637-2646. doi: 10.1109/TNSRE.2020.3030931. Epub 2021 Jan 28.

Development of sEMG sensors and algorithms for silent speech recognition.用于无声语音识别的表面肌电传感器和算法的开发。

J Neural Eng. 2018 Aug;15(4):046031. doi: 10.1088/1741-2552/aac965. Epub 2018 Jun 1.

A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.一种基于并行初始卷积神经网络和梅尔频率谱系数的新型无声语音识别方法。

Front Neurorobot. 2022 Sep 2;16:971446. doi: 10.3389/fnbot.2022.971446. eCollection 2022.

MSFF-Net: Multi-Stream Feature Fusion Network for surface electromyography gesture recognition.MSFF-Net：用于表面肌电信号手势识别的多流特征融合网络。

PLoS One. 2022 Nov 7;17(11):e0276436. doi: 10.1371/journal.pone.0276436. eCollection 2022.

High-Density Surface EMG-Based Gesture Recognition Using a 3D Convolutional Neural Network.基于高密度表面肌电的三维卷积神经网络手势识别

Sensors (Basel). 2020 Feb 21;20(4):1201. doi: 10.3390/s20041201.

引用本文的文献

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis.用于非接触式唇读和声学分析的综合多模态数据集。

Sci Data. 2023 Dec 13;10(1):895. doi: 10.1038/s41597-023-02793-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于时空神经网络的高密度表面肌电图解码无声语音

Decoding Silent Speech Based on High-Density Surface Electromyogram Using Spatiotemporal Neural Network.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献