
On the use of hidden Markov modelling for recognition of dysarthric speech.

Authors

Deller J R, Hsu D, Ferrier L J

Affiliation

Michigan State University, Department of Electrical Engineering, East Lansing 48824.

Publication

Comput Methods Programs Biomed. 1991 Jun;35(2):125-39. doi: 10.1016/0169-2607(91)90071-z.

PMID:1914451
Abstract

Recognition of the speech of severely dysarthric individuals requires a technique which is robust to extraordinary conditions of high variability and very little training data. A hidden Markov model approach to isolated word recognition is used in an attempt to automatically model the enormous variability of the speech, while signal preprocessing measures and model modifications are employed to make better use of the existing data. Two findings are contrary to general experience with normal speech recognition. The first is that an ergodic model is found to outperform a standard left-to-right (Bakis) model structure. The second is that automated clipping of transitional acoustics in the speech is found to significantly enhance recognition. Experimental results using utterances of cerebral palsied persons with an array of articulatory abilities are presented.
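The structural difference between the two model topologies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the three-state size, the uniform (untrained) transition probabilities, and the toy emission table are all assumptions chosen for illustration. A left-to-right (Bakis) model only allows a state to repeat or move forward, while an ergodic model allows a transition between any pair of states. The scaled forward algorithm then scores a discrete observation sequence under either model.

```python
import math


def bakis_transitions(n_states, max_jump=2):
    """Left-to-right (Bakis) topology: stay, or move forward by at most max_jump states."""
    rows = []
    for i in range(n_states):
        allowed = [j for j in range(n_states) if i <= j <= i + max_jump]
        p = 1.0 / len(allowed)  # uniform over allowed moves (untrained placeholder)
        rows.append([p if j in allowed else 0.0 for j in range(n_states)])
    return rows


def ergodic_transitions(n_states):
    """Ergodic topology: every state can be reached from every state."""
    p = 1.0 / n_states
    return [[p] * n_states for _ in range(n_states)]


def forward_log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm; returns log P(obs | model) for a discrete-output HMM."""
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence impossible under this model
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_lik


# Hypothetical toy emission table: state k mostly emits symbol k.
EMIT = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]

if __name__ == "__main__":
    bakis = bakis_transitions(3, max_jump=2)
    ergodic = ergodic_transitions(3)
    # A sequence that doubles back on itself, loosely mimicking irregular articulation.
    irregular = [0, 1, 2, 1, 0]
    print("Bakis  :", forward_log_likelihood(irregular, [1.0, 0.0, 0.0], bakis, EMIT))
    print("Ergodic:", forward_log_likelihood(irregular, [1 / 3] * 3, ergodic, EMIT))
```

In this toy setting the Bakis model penalizes a sequence that returns to earlier acoustic states, because backward transitions have zero probability, whereas the ergodic model does not. That is only an intuition for why a less constrained topology might suit highly variable speech; it does not reproduce the paper's experiments or results.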

Similar articles

1
On the use of hidden Markov modelling for recognition of dysarthric speech.
Comput Methods Programs Biomed. 1991 Jun;35(2):125-39. doi: 10.1016/0169-2607(91)90071-z.
2
Effect of high-frequency spectral components in computer recognition of dysarthric speech based on a Mel-cepstral stochastic model.
J Rehabil Res Dev. 2005 May-Jun;42(3):363-71. doi: 10.1682/jrrd.2004.06.0067.
3
Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov Model.
IEEE Trans Neural Syst Rehabil Eng. 2005 Dec;13(4):558-61. doi: 10.1109/TNSRE.2005.856074.
4
Vocal tract representation in the recognition of cerebral palsied speech.
J Speech Lang Hear Res. 2012 Aug;55(4):1190-207. doi: 10.1044/1092-4388(2011/11-0223). Epub 2012 Jan 23.
5
Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals.
Med Eng Phys. 2006 Oct;28(8):741-8. doi: 10.1016/j.medengphy.2005.11.002. Epub 2005 Dec 15.
6
Automatic speech recognition and training for severely dysarthric users of assistive technology: the STARDUST project.
Clin Linguist Phon. 2006 Apr-May;20(2-3):149-56. doi: 10.1080/02699200400026884.
7
Assessment of Dysarthria Using One-Word Speech Recognition with Hidden Markov Models.
J Korean Med Sci. 2019 Apr 8;34(13):e108. doi: 10.3346/jkms.2019.34.e108.
8
Representation Learning Based Speech Assistive System for Persons With Dysarthria.
IEEE Trans Neural Syst Rehabil Eng. 2017 Sep;25(9):1510-1517. doi: 10.1109/TNSRE.2016.2638830. Epub 2016 Dec 13.
9
Estimation of phoneme-specific HMM topologies for the automatic recognition of dysarthric speech.
Comput Math Methods Med. 2013;2013:297860. doi: 10.1155/2013/297860. Epub 2013 Oct 8.
10
A statistical causal model for the assessment of dysarthric speech and the utility of computer-based speech recognition.
IEEE Trans Biomed Eng. 1993 Dec;40(12):1282-98. doi: 10.1109/10.250584.

Cited by

1
Community-Supported Shared Infrastructure in Support of Speech Accessibility.
J Speech Lang Hear Res. 2024 Nov 7;67(11):4162-4175. doi: 10.1044/2024_JSLHR-24-00122. Epub 2024 Sep 26.
2
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
3
Estimation of phoneme-specific HMM topologies for the automatic recognition of dysarthric speech.
Comput Math Methods Med. 2013;2013:297860. doi: 10.1155/2013/297860. Epub 2013 Oct 8.