Chen Guo, Parsa Vijay
J Acoust Soc Am. 2007 Feb;121(2):EL77-83. doi: 10.1121/1.2430765.
This work presents a speech quality evaluation method which is based on Moore and Glasberg's loudness model and Bayesian modeling. In the proposed method, the differences between the loudness patterns of the original and processed speech signals are employed as the observed features for representing speech quality, a Bayesian learning model is exploited as the cognitive model which maps the features into quality scores, and Markov chain Monte Carlo methods are used for the Bayesian computation. The performance of the proposed method was demonstrated through comparisons with the state-of-the-art speech quality evaluation standard, ITU-T P.862, using seven ITU subjective quality databases.
这项工作提出了一种基于摩尔和格拉斯伯格响度模型以及贝叶斯建模的语音质量评估方法。在所提出的方法中,原始语音信号和处理后的语音信号的响度模式差异被用作表示语音质量的观测特征,贝叶斯学习模型被用作将这些特征映射为质量分数的认知模型,并且马尔可夫链蒙特卡罗方法用于贝叶斯计算。通过使用七个国际电联主观质量数据库与最先进的语音质量评估标准ITU-T P.862进行比较,证明了所提出方法的性能。