Lee Ki-Seung
Department of Electronic Engineering, Konkuk University, 1 Hwayang-dong, Gwangjin-gu, Seoul 143-701, Korea.
IEEE Trans Biomed Eng. 2008 Mar;55(3):930-40. doi: 10.1109/TBME.2008.915658.
It is well known that a strong relationship exists between human voices and the movement of articulatory facial muscles. In this paper, we utilize this knowledge to implement an automatic speech recognition scheme that uses solely surface electromyogram (EMG) signals. The sequence of EMG signals for each word is modeled within a hidden Markov model (HMM) framework. The main objective of the work is to build a model for the state observation density when multichannel observation sequences are given. The proposed model reflects the dependencies between the EMG signals, which are described by introducing a global control variable. We also develop an efficient model training method based on a maximum likelihood criterion. In a preliminary study, 60 isolated words were used as recognition variables. EMG signals were acquired from three articulatory facial muscles. The findings indicate that such a system may have the capacity to recognize speech signals with an accuracy of up to 87.07%, which is superior to that of an independent probabilistic model.
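The abstract contrasts a channel-independent observation density with one that couples the EMG channels through a global control variable. A minimal sketch of that idea, under assumed Gaussian channel densities and an illustrative discrete control variable (all names, shapes, and parameters here are hypothetical, not the paper's exact formulation): the per-state density becomes a mixture over control values, with channels conditionally independent only given both the state and the control value.

```python
import numpy as np

def gaussian_pdf(x, mu, var):
    """Univariate Gaussian density, evaluated elementwise."""
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2.0 * np.pi * var)

def multichannel_density(obs, weights, mu, var):
    """Hypothetical state observation density for K EMG channels.

    p(o | s) = sum_c P(c | s) * prod_k N(o_k; mu[c, k], var[c, k])

    obs:     (K,)  one scalar feature per EMG channel
    weights: (C,)  mixing weights P(c | s) over the global control variable
    mu, var: (C, K) per-control-value, per-channel Gaussian parameters
    """
    per_channel = gaussian_pdf(obs[None, :], mu, var)  # (C, K)
    joint_given_c = per_channel.prod(axis=1)           # (C,) product over channels
    return float(weights @ joint_given_c)              # marginalize the control variable

# The channel-independent baseline is the special case C = 1:
# a single control value reduces the mixture to a plain product of
# per-channel densities.
```

With C > 1, a channel's likelihood is no longer evaluated in isolation; the shared control variable ties the channels together, which is the dependency structure the abstract attributes to the proposed model.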