• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用肌电信号预测声音特征参数。

Prediction of acoustic feature parameters using myoelectric signals.

机构信息

Department of Electronic Engineering, Konkuk University, Seoul 143-701, Korea.

出版信息

IEEE Trans Biomed Eng. 2010 Jul;57(7):1587-95. doi: 10.1109/TBME.2010.2041455. Epub 2010 Feb 17.

DOI:10.1109/TBME.2010.2041455
PMID:20172775
Abstract

It is well-known that a clear relationship exists between human voices and myoelectric signals (MESs) from the area of the speaker's mouth. In this study, we utilized this information to implement a speech synthesis scheme in which MES alone was used to predict the parameters characterizing the vocal-tract transfer function of specific speech signals. Several feature parameters derived from MES were investigated to find the optimal feature for maximization of the mutual information between the acoustic and the MES features. After the optimal feature was determined, an estimation rule for the acoustic parameters was proposed, based on a minimum mean square error (MMSE) criterion. In a preliminary study, 60 isolated words were used for both objective and subjective evaluations. The results showed that the average Euclidean distance between the original and predicted acoustic parameters was reduced by about 30% compared with the average Euclidean distance of the original parameters. The intelligibility of the synthesized speech signals using the predicted features was also evaluated. A word-level identification ratio of 65.5% and a syllable-level identification ratio of 73% were obtained through a listening test.

摘要

众所周知,人类的声音和口腔区域的肌电信号(MESs)之间存在明显的关系。在这项研究中,我们利用这一信息,实现了一种语音合成方案,其中仅使用 MES 来预测特定语音信号的声道传递函数的参数。研究了从 MES 中提取的几个特征参数,以找到最优特征,从而最大化声学分与 MES 特征之间的互信息。确定最优特征后,根据最小均方误差(MMSE)准则,提出了一种用于估计声参量的估计规则。在初步研究中,使用 60 个孤立的单词进行了客观和主观的评估。结果表明,与原始参数的平均欧几里得距离相比,原始和预测的声学参数之间的平均欧几里得距离减少了约 30%。使用预测特征合成的语音信号的可懂度也进行了评估。通过听力测试获得了 65.5%的单词级识别率和 73%的音节级识别率。

相似文献

1
Prediction of acoustic feature parameters using myoelectric signals.利用肌电信号预测声音特征参数。
IEEE Trans Biomed Eng. 2010 Jul;57(7):1587-95. doi: 10.1109/TBME.2010.2041455. Epub 2010 Feb 17.
2
Myoelectric signal classification for phoneme-based speech recognition.用于基于音素的语音识别的肌电信号分类
IEEE Trans Biomed Eng. 2007 Apr;54(4):694-9. doi: 10.1109/TBME.2006.889175.
3
EMG-based speech recognition using hidden markov models with global control variables.基于肌电图的语音识别,使用带有全局控制变量的隐马尔可夫模型。
IEEE Trans Biomed Eng. 2008 Mar;55(3):930-40. doi: 10.1109/TBME.2008.915658.
4
Multiexpert automatic speech recognition using acoustic and myoelectric signals.使用声学和肌电信号的多专家自动语音识别
IEEE Trans Biomed Eng. 2006 Apr;53(4):676-85. doi: 10.1109/TBME.2006.870224.
5
SNR-adaptive stream weighting for audio-MES ASR.用于音频MES自动语音识别的信噪比自适应流加权
IEEE Trans Biomed Eng. 2008 Aug;55(8):2001-10. doi: 10.1109/TBME.2008.921094.
6
Improved phoneme-based myoelectric speech recognition.基于音素的改进型肌电语音识别。
IEEE Trans Biomed Eng. 2009 Aug;56(8):2016-23. doi: 10.1109/TBME.2009.2024079. Epub 2009 Jun 16.
7
Multiresolutional modification of speech signals for listeners with hearing impairment.针对听力受损者的语音信号多分辨率修改
J Rehabil Res Dev. 1999 Jul;36(3):230-6.
8
A linear model of acoustic-to-facial mapping: model parameters, data set size, and generalization across speakers.声学到面部映射的线性模型:模型参数、数据集大小及跨说话者的泛化能力
J Acoust Soc Am. 2008 Nov;124(5):3183-90. doi: 10.1121/1.2982369.
9
Hands-free human computer interaction via an electromyogram-based classification algorithm.通过基于肌电图的分类算法实现免提人机交互。
Biomed Sci Instrum. 2005;41:31-6.
10
Optimal design of minimum mean-square error noise reduction algorithms using the simulated annealing technique.采用模拟退火技术的最小均方误差降噪算法的优化设计
J Acoust Soc Am. 2009 Feb;125(2):934-43. doi: 10.1121/1.3050292.

引用本文的文献

1
A Neuromotor to Acoustical Jaw-Tongue Projection Model With Application in Parkinson's Disease Hypokinetic Dysarthria.一种应用于帕金森病运动减少型构音障碍的神经运动到声学颌-舌投射模型。
Front Hum Neurosci. 2021 Mar 15;15:622825. doi: 10.3389/fnhum.2021.622825. eCollection 2021.