Huda Shamsul, Yearwood John, Togneri Roberto
Center for Informatics and Applied Optimization, University of Ballarat, Ballarat, Vic. 3350, Australia.
IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):182-97. doi: 10.1109/TSMCB.2008.2004051. Epub 2008 Dec 9.
This paper attempts to overcome the tendency of the expectation-maximization (EM) algorithm to locate a local rather than global maximum when applied to estimate the hidden Markov model (HMM) parameters in speech signal modeling. We propose a hybrid algorithm for estimation of the HMM in automatic speech recognition (ASR) using a constraint-based evolutionary algorithm (EA) and EM, the CEL-EM. The novelty of our hybrid algorithm (CEL-EM) is that it is applicable for estimation of the constraint-based models with many constraints and large numbers of parameters (which use EM) like HMM. Two constraint-based versions of the CEL-EM with different fusion strategies have been proposed using a constraint-based EA and the EM for better estimation of HMM in ASR. The first one uses a traditional constraint-handling mechanism of EA. The other version transforms a constrained optimization problem into an unconstrained problem using Lagrange multipliers. Fusion strategies for the CEL-EM use a staged-fusion approach where EM has been plugged with the EA periodically after the execution of EA for a specific period of time to maintain the global sampling capabilities of EA in the hybrid algorithm. A variable initialization approach (VIA) has been proposed using a variable segmentation to provide a better initialization for EA in the CEL-EM. Experimental results on the TIMIT speech corpus show that CEL-EM obtains higher recognition accuracies than the traditional EM algorithm as well as a top-standard EM (VIA-EM, constructed by applying the VIA to EM).
本文试图克服期望最大化(EM)算法在应用于语音信号建模中估计隐马尔可夫模型(HMM)参数时倾向于找到局部而非全局最大值的问题。我们提出了一种用于自动语音识别(ASR)中HMM估计的混合算法,该算法使用基于约束的进化算法(EA)和EM,即CEL-EM。我们的混合算法(CEL-EM)的新颖之处在于,它适用于估计具有许多约束和大量参数的基于约束的模型(如使用EM的HMM)。我们已经提出了CEL-EM的两个基于约束的版本,它们采用不同的融合策略,使用基于约束的EA和EM来更好地估计ASR中的HMM。第一个版本使用EA的传统约束处理机制。另一个版本使用拉格朗日乘数将约束优化问题转化为无约束问题。CEL-EM的融合策略采用分阶段融合方法,即在EA执行特定时间段后,将EM定期插入EA中,以保持混合算法中EA的全局采样能力。我们提出了一种变量初始化方法(VIA),通过变量分割为CEL-EM中的EA提供更好的初始化。在TIMIT语音语料库上的实验结果表明,CEL-EM比传统EM算法以及顶级标准EM(通过将VIA应用于EM构建的VIA-EM)获得更高的识别准确率。