Juang Chia-Feng, Chiou Chyi-Tian, Lai Chun-Lung
Department of Electrical Engineering, National Chung-Hsing University, Taichung 402, Taiwan.
IEEE Trans Neural Netw. 2007 May;18(3):833-43. doi: 10.1109/TNN.2007.891194.
This paper proposes noisy speech recognition using hierarchical singleton-type recurrent neural fuzzy networks (HSRNFNs). The proposed HSRNFN is a hierarchical connection of two singleton-type recurrent neural fuzzy networks (SRNFNs), where one is used for noise filtering and the other for recognition. The SRNFN is constructed by recurrent fuzzy if-then rules with fuzzy singletons in the consequences, and their recurrent properties make them suitable for processing speech patterns with temporal characteristics. In n words recognition, n SRNFNs are created for modeling n words, where each SRNFN receives the current frame feature and predicts the next one of its modeling word. The prediction error of each SRNFN is used as recognition criterion. In filtering, one SRNFN is created, and each SRNFN recognizer is connected to the same SRNFN filter, which filters noisy speech patterns in the feature domain before feeding them to the SRNFN recognizer. Experiments with Mandarin word recognition under different types of noise are performed. Other recognizers, including multilayer perceptron (MLP), time-delay neural networks (TDNNs), and hidden Markov models (HMMs), are also tested and compared. These experiments and comparisons demonstrate good results with HSRNFN for noisy speech recognition tasks.
本文提出了一种使用分层单例型递归神经模糊网络(HSRNFN)的噪声语音识别方法。所提出的HSRNFN是两个单例型递归神经模糊网络(SRNFN)的分层连接,其中一个用于噪声滤波,另一个用于识别。SRNFN由后件中带有模糊单例的递归模糊if-then规则构建而成,其递归特性使其适合处理具有时间特征的语音模式。在n个单词识别中,创建n个SRNFN来对n个单词进行建模,每个SRNFN接收当前帧特征并预测其建模单词的下一个特征。每个SRNFN的预测误差用作识别标准。在滤波过程中,创建一个SRNFN,每个SRNFN识别器都连接到同一个SRNFN滤波器,该滤波器在将噪声语音模式馈送到SRNFN识别器之前,在特征域中对其进行滤波。进行了在不同类型噪声下的汉语单词识别实验。还测试并比较了其他识别器,包括多层感知器(MLP)、时延神经网络(TDNN)和隐马尔可夫模型(HMM)。这些实验和比较表明,HSRNFN在噪声语音识别任务中取得了良好的结果。