用于识别低帧率语音的隐马尔可夫模型的自适应。

Adaptation of hidden Markov models for recognizing speech of reduced frame rate.

出版信息

IEEE Trans Cybern. 2013 Dec;43(6):2114-21. doi: 10.1109/TCYB.2013.2240450.

DOI:10.1109/TCYB.2013.2240450

Abstract

The frame rate of the observation sequence in distributed speech recognition applications may be reduced to suit a resource-limited front-end device. In order to use models trained using full-frame-rate data in the recognition of reduced frame-rate (RFR) data, we propose a method for adapting the transition probabilities of hidden Markov models (HMMs) to match the frame rate of the observation. Experiments on the recognition of clean and noisy connected digits are conducted to evaluate the proposed method. Experimental results show that the proposed method can effectively compensate for the frame-rate mismatch between the training and the test data. Using our adapted model to recognize the RFR speech data, one can significantly reduce the computation time and achieve the same level of accuracy as that of a method, which restores the frame rate using data interpolation.

摘要

在分布式语音识别应用中，观察序列的帧率可能会降低，以适应资源有限的前端设备。为了在降低帧率（RFR）数据的识别中使用全帧率数据训练的模型，我们提出了一种适应隐马尔可夫模型（HMM）的转移概率以匹配观察帧率的方法。在干净和嘈杂的连接数字识别的实验中，评估了所提出的方法。实验结果表明，所提出的方法可以有效地补偿训练和测试数据之间的帧率不匹配。使用我们的自适应模型来识别 RFR 语音数据，可以显著减少计算时间，并达到与使用数据插值来恢复帧率的方法相同的准确性。

相似文献

Adaptation of hidden Markov models for recognizing speech of reduced frame rate.

IEEE Trans Cybern. 2013 Dec;43(6):2114-21. doi: 10.1109/TCYB.2013.2240450.

Model adaptation method for recognition of speech with missing frames.

J Acoust Soc Am. 2014 Mar;135(3):EL166-71. doi: 10.1121/1.4865264.

Hybrid simulated annealing and its application to optimization of hidden Markov models for visual speech recognition.

IEEE Trans Syst Man Cybern B Cybern. 2010 Aug;40(4):1188-96. doi: 10.1109/TSMCB.2009.2036753. Epub 2010 Jan 8.

Improved model adaptation approach for recognition of reduced-frame-rate continuous speech.

PLoS One. 2018 Nov 7;13(11):e0206916. doi: 10.1371/journal.pone.0206916. eCollection 2018.

EMG-based speech recognition using hidden markov models with global control variables.

IEEE Trans Biomed Eng. 2008 Mar;55(3):930-40. doi: 10.1109/TBME.2008.915658.

Approximated mutual information training for speech recognition using myoelectric signals.

Conf Proc IEEE Eng Med Biol Soc. 2006;2006:767-70. doi: 10.1109/IEMBS.2006.259992.

Hierarchical singleton-type recurrent neural fuzzy networks for noisy speech recognition.

IEEE Trans Neural Netw. 2007 May;18(3):833-43. doi: 10.1109/TNN.2007.891194.

Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals.

Med Eng Phys. 2006 Oct;28(8):741-8. doi: 10.1016/j.medengphy.2005.11.002. Epub 2005 Dec 15.

GFM-based methods for speaker identification.

IEEE Trans Cybern. 2013 Jun;43(3):1047-58. doi: 10.1109/TSMCB.2012.2223461. Epub 2012 Oct 26.

Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov Model.

IEEE Trans Neural Syst Rehabil Eng. 2005 Dec;13(4):558-61. doi: 10.1109/TNSRE.2005.856074.

引用本文的文献

Improved model adaptation approach for recognition of reduced-frame-rate continuous speech.

PLoS One. 2018 Nov 7;13(11):e0206916. doi: 10.1371/journal.pone.0206916. eCollection 2018.

Stochastic modeling of central apnea events in preterm infants.

Physiol Meas. 2016 Apr;37(4):463-84. doi: 10.1088/0967-3334/37/4/463. Epub 2016 Mar 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于识别低帧率语音的隐马尔可夫模型的自适应。

Adaptation of hidden Markov models for recognizing speech of reduced frame rate.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于识别低帧率语音的隐马尔可夫模型的自适应。

Adaptation of hidden Markov models for recognizing speech of reduced frame rate.

出版信息

相似文献

引用本文的文献