• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于无声语音识别的表面肌电传感器和算法的开发。

Development of sEMG sensors and algorithms for silent speech recognition.

机构信息

VocaliD, Inc. 50 Leonard St, Belmont, MA 02478, United States of America.

出版信息

J Neural Eng. 2018 Aug;15(4):046031. doi: 10.1088/1741-2552/aac965. Epub 2018 Jun 1.

DOI:10.1088/1741-2552/aac965
PMID:29855428
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6168082/
Abstract

OBJECTIVE

Speech is among the most natural forms of human communication, thereby offering an attractive modality for human-machine interaction through automatic speech recognition (ASR). However, the limitations of ASR-including degradation in the presence of ambient noise, limited privacy and poor accessibility for those with significant speech disorders-have motivated the need for alternative non-acoustic modalities of subvocal or silent speech recognition (SSR).

APPROACH

We have developed a new system of face- and neck-worn sensors and signal processing algorithms that are capable of recognizing silently mouthed words and phrases entirely from the surface electromyographic (sEMG) signals recorded from muscles of the face and neck that are involved in the production of speech. The algorithms were strategically developed by evolving speech recognition models: first for recognizing isolated words by extracting speech-related features from sEMG signals, then for recognizing sequences of words from patterns of sEMG signals using grammar models, and finally for recognizing a vocabulary of previously untrained words using phoneme-based models. The final recognition algorithms were integrated with specially designed multi-point, miniaturized sensors that can be arranged in flexible geometries to record high-fidelity sEMG signal measurements from small articulator muscles of the face and neck.

MAIN RESULTS

We tested the system of sensors and algorithms during a series of subvocal speech experiments involving more than 1200 phrases generated from a 2200-word vocabulary and achieved an 8.9%-word error rate (91.1% recognition rate), far surpassing previous attempts in the field.

SIGNIFICANCE

These results demonstrate the viability of our system as an alternative modality of communication for a multitude of applications including: persons with speech impairments following a laryngectomy; military personnel requiring hands-free covert communication; or the consumer in need of privacy while speaking on a mobile phone in public.

摘要

目的

言语是人类最自然的交流形式之一,因此通过自动语音识别(ASR)为人机交互提供了一种有吸引力的方式。然而,ASR 的局限性,包括在环境噪声存在下的降级、对那些有严重言语障碍的人的隐私和可及性有限,促使人们需要替代非声学的亚声或无声语音识别(SSR)模态。

方法

我们开发了一种新的面部和颈部佩戴传感器和信号处理算法系统,能够完全从参与言语产生的面部和颈部肌肉记录的表面肌电图(sEMG)信号中识别无声的口语音和短语。这些算法是通过进化语音识别模型来策略性地开发的:首先通过从 sEMG 信号中提取与语音相关的特征来识别孤立的单词,然后使用语法模型从 sEMG 信号模式中识别单词序列,最后使用基于音素的模型识别以前未训练过的单词词汇。最终的识别算法与专门设计的多点、微型传感器集成在一起,可以以灵活的几何形状排列,从面部和颈部的小发音肌记录高保真 sEMG 信号测量。

主要结果

我们在一系列亚声语音实验中测试了传感器和算法系统,这些实验涉及到来自 2200 个单词词汇的 1200 多个短语,达到了 8.9%-单词错误率(91.1%的识别率),远远超过了该领域以前的尝试。

意义

这些结果表明,我们的系统作为一种替代的通信方式具有多种应用的可行性,包括:喉切除术后有言语障碍的人;需要免提隐蔽通信的军事人员;或在公共场合使用手机时需要隐私的消费者。

相似文献

1
Development of sEMG sensors and algorithms for silent speech recognition.用于无声语音识别的表面肌电传感器和算法的开发。
J Neural Eng. 2018 Aug;15(4):046031. doi: 10.1088/1741-2552/aac965. Epub 2018 Jun 1.
2
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.无声语音识别作为喉切除患者的替代交流设备
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
3
Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech.基于表面肌电图的韵律性默读语音识别、合成与感知
J Speech Lang Hear Res. 2021 Jun 18;64(6S):2134-2153. doi: 10.1044/2021_JSLHR-20-00257. Epub 2021 May 12.
4
Towards optimizing electrode configurations for silent speech recognition based on high-density surface electromyography.针对基于高密度表面肌电图的无声语音识别的电极配置进行优化。
J Neural Eng. 2021 Jan 25;18(1). doi: 10.1088/1741-2552/abca14.
5
Multiexpert automatic speech recognition using acoustic and myoelectric signals.使用声学和肌电信号的多专家自动语音识别
IEEE Trans Biomed Eng. 2006 Apr;53(4):676-85. doi: 10.1109/TBME.2006.870224.
6
sEMG-based technology for silent voice recognition.基于表面肌电的无声语音识别技术。
Comput Biol Med. 2023 Jan;152:106336. doi: 10.1016/j.compbiomed.2022.106336. Epub 2022 Nov 18.
7
Pilot study for a novel and personalized voice restoration device for patients with laryngectomy.一项针对喉切除患者的新型个性化语音恢复装置的试点研究。
Head Neck. 2020 May;42(5):839-845. doi: 10.1002/hed.26057. Epub 2019 Dec 26.
8
Myoelectric signal classification for phoneme-based speech recognition.用于基于音素的语音识别的肌电信号分类
IEEE Trans Biomed Eng. 2007 Apr;54(4):694-9. doi: 10.1109/TBME.2006.889175.
9
Tackling speaking mode varieties in EMG-based speech recognition.应对基于肌电图的语音识别中的语音模式变体。
IEEE Trans Biomed Eng. 2014 Oct;61(10):2515-26. doi: 10.1109/TBME.2014.2319000. Epub 2014 Apr 18.
10
Neck and face surface electromyography for prosthetic voice control after total laryngectomy.全喉切除术后用于假体语音控制的颈部和面部表面肌电图
IEEE Trans Neural Syst Rehabil Eng. 2009 Apr;17(2):146-55. doi: 10.1109/TNSRE.2009.2017805. Epub 2009 Mar 16.

引用本文的文献

1
Automated speech artefact removal from MEG data utilizing facial gestures and mutual information.利用面部手势和互信息从脑磁图数据中自动去除语音伪迹。
Imaging Neurosci (Camb). 2025 Apr 22;3. doi: 10.1162/imag_a_00545. eCollection 2025.
2
A Wearable Silent Text Input System Using EMG and Piezoelectric Sensors.一种使用肌电图和压电传感器的可穿戴静音文本输入系统。
Sensors (Basel). 2025 Apr 21;25(8):2624. doi: 10.3390/s25082624.
3
Electrode Setup for Electromyography-Based Silent Speech Interfaces: A Pilot Study.基于肌电图的无声语音接口的电极设置:一项初步研究。

本文引用的文献

1
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.无声语音识别作为喉切除患者的替代交流设备
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
2
Improved phoneme-based myoelectric speech recognition.基于音素的改进型肌电语音识别。
IEEE Trans Biomed Eng. 2009 Aug;56(8):2016-23. doi: 10.1109/TBME.2009.2024079. Epub 2009 Jun 16.
3
EMG-based speech recognition using hidden markov models with global control variables.
Sensors (Basel). 2025 Jan 28;25(3):781. doi: 10.3390/s25030781.
4
Prosodic Preferences of Surface Electromyography-based Subvocal Speech for People With Laryngectomy.基于表面肌电图的喉切除患者默读语音的韵律偏好
J Voice. 2024 Dec 5. doi: 10.1016/j.jvoice.2024.10.024.
5
A data-efficient and easy-to-use lip language interface based on wearable motion capture and speech movement reconstruction.基于可穿戴运动捕捉和语音运动重建的高效、易用的唇语接口。
Sci Adv. 2024 Jun 28;10(26):eado9576. doi: 10.1126/sciadv.ado9576. Epub 2024 Jun 26.
6
Identification of the Biomechanical Response of the Muscles That Contract the Most during Disfluencies in Stuttered Speech.口吃言语不流畅时收缩最强烈的肌肉的生物力学反应的识别。
Sensors (Basel). 2024 Apr 20;24(8):2629. doi: 10.3390/s24082629.
7
MagTrack: A Wearable Tongue Motion Tracking System for Silent Speech Interfaces.磁追踪:一种用于无声语音接口的可穿戴舌动跟踪系统。
J Speech Lang Hear Res. 2023 Aug 17;66(8S):3206-3221. doi: 10.1044/2023_JSLHR-22-00319. Epub 2023 May 5.
8
Prediction of Voice Fundamental Frequency and Intensity from Surface Electromyographic Signals of the Face and Neck.基于面部和颈部表面肌电信号预测语音基频和强度
Vibration. 2022 Dec;5(4):692-710. doi: 10.3390/vibration5040041. Epub 2022 Oct 13.
9
Ultrathin crystalline-silicon-based strain gauges with deep learning algorithms for silent speech interfaces.基于超薄结晶硅的应变计与深度学习算法用于无声语音接口。
Nat Commun. 2022 Oct 3;13(1):5815. doi: 10.1038/s41467-022-33457-9.
10
A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.一种基于并行初始卷积神经网络和梅尔频率谱系数的新型无声语音识别方法。
Front Neurorobot. 2022 Sep 2;16:971446. doi: 10.3389/fnbot.2022.971446. eCollection 2022.
基于肌电图的语音识别,使用带有全局控制变量的隐马尔可夫模型。
IEEE Trans Biomed Eng. 2008 Mar;55(3):930-40. doi: 10.1109/TBME.2008.915658.
4
Electro-mechanical stability of surface EMG sensors.表面肌电图传感器的机电稳定性。
Med Biol Eng Comput. 2007 May;45(5):447-57. doi: 10.1007/s11517-007-0168-z. Epub 2007 Feb 16.
5
Multi-stream HMM for EMG-based speech recognition.用于基于肌电图的语音识别的多流隐马尔可夫模型
Conf Proc IEEE Eng Med Biol Soc. 2004;2004:4389-92. doi: 10.1109/IEMBS.2004.1404221.
6
Myo-electric signals to augment speech recognition.用于增强语音识别的肌电信号。
Med Biol Eng Comput. 2001 Jul;39(4):500-4. doi: 10.1007/BF02345373.
7
Research summary of a scheme to ascertain the availability of speech information in the myoelectric signals of neck and head muscles using surface electrodes.一项关于使用表面电极确定颈部和头部肌肉肌电信号中语音信息可用性的方案的研究总结。
Comput Biol Med. 1986;16(6):399-410. doi: 10.1016/0010-4825(86)90064-8.