• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于肌电信号的跨被试无声语音识别的简化对抗架构。

A simplified adversarial architecture for cross-subject silent speech recognition using electromyography.

机构信息

Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China.

Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China.

出版信息

J Neural Eng. 2024 Sep 3;21(5). doi: 10.1088/1741-2552/ad7321.

DOI:10.1088/1741-2552/ad7321
PMID:39178906
Abstract

. The decline in the performance of electromyography (EMG)-based silent speech recognition is widely attributed to disparities in speech patterns, articulation habits, and individual physiology among speakers. Feature alignment by learning a discriminative network that resolves domain offsets across speakers is an effective method to address this problem. The prevailing adversarial network with a branching discriminator specializing in domain discrimination renders insufficiently direct contribution to categorical predictions of the classifier.. To this end, we propose a simplified discrepancy-based adversarial network with a streamlined end-to-end structure for EMG-based cross-subject silent speech recognition. Highly aligned features across subjects are obtained by introducing a Nuclear-norm Wasserstein discrepancy metric on the back end of the classification network, which could be utilized for both classification and domain discrimination. Given the low-level and implicitly noisy nature of myoelectric signals, we devise a cascaded adaptive rectification network as the front-end feature extraction network, adaptively reshaping the intermediate feature map with automatically learnable channel-wise thresholds. The resulting features effectively filter out domain-specific information between subjects while retaining domain-invariant features critical for cross-subject recognition.. A series of sentence-level classification experiments with 100 Chinese sentences demonstrate the efficacy of our method, achieving an average accuracy of 89.46% tested on 40 new subjects by training with data from 60 subjects. Especially, our method achieves a remarkable 10.07% improvement compared to the state-of-the-art model when tested on 10 new subjects with 20 subjects employed for training, surpassing its result even with three times training subjects.. Our study demonstrates an improved classification performance of the proposed adversarial architecture using cross-subject myoelectric signals, providing a promising prospect for EMG-based speech interactive application.

摘要

基于肌电图(EMG)的无声语音识别性能的下降,广泛归因于说话者之间的语音模式、发音习惯和个体生理差异。通过学习一个能够解决说话者之间域偏移的判别网络来对齐特征是解决该问题的有效方法。现有的带有专门用于域判别分支判别器的对抗网络,对分类器的类别预测贡献不足。为此,我们提出了一种简化的基于差异的对抗网络,具有简化的端到端结构,用于基于 EMG 的跨主体无声语音识别。通过在分类网络的后端引入核范数 Wasserstein 差异度量,可以在分类和域判别两个方面获得高度对齐的特征。鉴于肌电信号的低水平和隐含噪声性质,我们设计了一个级联自适应整流网络作为前端特征提取网络,自适应地使用自动学习的通道级阈值重新塑造中间特征图。所得到的特征有效地滤除了主体之间的特定于域的信息,同时保留了对跨主体识别至关重要的域不变特征。一系列使用 100 个中文句子的句子级分类实验表明,我们的方法具有有效性,在 60 个主体的训练数据上测试 40 个新主体时,平均准确率为 89.46%。特别是,当在 10 个新主体上测试,而训练主体为 20 个时,我们的方法比最先进的模型提高了 10.07%,甚至超过了其在 3 倍训练主体上的结果。我们的研究表明,所提出的对抗架构使用跨主体肌电信号可以提高分类性能,为基于 EMG 的语音交互应用提供了有前途的前景。

相似文献

1
A simplified adversarial architecture for cross-subject silent speech recognition using electromyography.基于肌电信号的跨被试无声语音识别的简化对抗架构。
J Neural Eng. 2024 Sep 3;21(5). doi: 10.1088/1741-2552/ad7321.
2
Comparison of feature evaluation criteria for speech recognition based on electromyography.基于肌电图的语音识别特征评估标准比较。
Med Biol Eng Comput. 2018 Jun;56(6):1041-1051. doi: 10.1007/s11517-017-1723-x. Epub 2017 Nov 14.
3
Syllable-based speech recognition using EMG.基于肌电图的音节语音识别。
Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:4699-702. doi: 10.1109/IEMBS.2010.5626426.
4
Dual Stream Long Short-Term Memory Feature Fusion Classifier for Surface Electromyography Gesture Recognition.双通道长短时记忆特征融合分类器用于表面肌电手势识别。
Sensors (Basel). 2024 Jun 4;24(11):3631. doi: 10.3390/s24113631.
5
Pattern learning with deep neural networks in EMG-based speech recognition.基于肌电图的语音识别中深度神经网络的模式学习
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:4200-3. doi: 10.1109/EMBC.2014.6944550.
6
Improved phoneme-based myoelectric speech recognition.基于音素的改进型肌电语音识别。
IEEE Trans Biomed Eng. 2009 Aug;56(8):2016-23. doi: 10.1109/TBME.2009.2024079. Epub 2009 Jun 16.
7
Feature dimensionality reduction for myoelectric pattern recognition: a comparison study of feature selection and feature projection methods.用于肌电模式识别的特征降维:特征选择与特征投影方法的比较研究
Med Eng Phys. 2014 Dec;36(12):1716-20. doi: 10.1016/j.medengphy.2014.09.011. Epub 2014 Oct 5.
8
Toward attenuating the impact of arm positions on electromyography pattern-recognition based motion classification in transradial amputees.为了减轻手臂位置对基于肌电图模式识别的桡骨截肢者运动分类的影响。
J Neuroeng Rehabil. 2012 Oct 5;9:74. doi: 10.1186/1743-0003-9-74.
9
Across Sessions and Subjects Domain Adaptation for Building Robust Myoelectric Interface.跨会话和主题领域自适应构建稳健的肌电接口。
IEEE Trans Neural Syst Rehabil Eng. 2024;32:2005-2015. doi: 10.1109/TNSRE.2023.3347540. Epub 2024 May 27.
10
Voiceless Arabic vowels recognition using facial EMG.基于面部肌电的无声阿拉伯语元音识别
Med Biol Eng Comput. 2011 Jul;49(7):811-8. doi: 10.1007/s11517-011-0751-1. Epub 2011 Mar 16.