Suppr超能文献

一种使用多网络人工神经网络的多视图多学习者方法用于构音障碍语音识别。

A multi-views multi-learners approach towards dysarthric speech recognition using multi-nets artificial neural networks.

作者信息

Shahamiri Seyed Reza, Salim Siti Salwah Binti

出版信息

IEEE Trans Neural Syst Rehabil Eng. 2014 Sep;22(5):1053-63. doi: 10.1109/TNSRE.2014.2309336. Epub 2014 Mar 11.

Abstract

Automatic speech recognition (ASR) can be very helpful for speakers who suffer from dysarthria, a neurological disability that damages the control of motor speech articulators. Although a few attempts have been made to apply ASR technologies to sufferers of dysarthria, previous studies show that such ASR systems have not attained an adequate level of performance. In this study, a dysarthric multi-networks speech recognizer (DM-NSR) model is provided using a realization of multi-views multi-learners approach called multi-nets artificial neural networks, which tolerates variability of dysarthric speech. In particular, the DM-NSR model employs several ANNs (as learners) to approximate the likelihood of ASR vocabulary words and to deal with the complexity of dysarthric speech. The proposed DM-NSR approach was presented as both speaker-dependent and speaker-independent paradigms. In order to highlight the performance of the proposed model over legacy models, multi-views single-learner models of the DM-NSRs were also provided and their efficiencies were compared in detail. Moreover, a comparison among the prominent dysarthric ASR methods and the proposed one is provided. The results show that the DM-NSR recorded improved recognition rate by up to 24.67% and the error rate was reduced by up to 8.63% over the reference model.

摘要

自动语音识别(ASR)对于患有构音障碍的说话者非常有帮助,构音障碍是一种神经障碍,会损害运动性言语发音器官的控制。尽管已经有人尝试将ASR技术应用于构音障碍患者,但先前的研究表明,此类ASR系统尚未达到足够的性能水平。在本研究中,使用一种称为多网络人工神经网络的多视图多学习者方法实现,提供了一种构音障碍多网络语音识别器(DM-NSR)模型,该模型能够容忍构音障碍语音的变异性。具体而言,DM-NSR模型采用多个人工神经网络(作为学习者)来近似ASR词汇的可能性,并处理构音障碍语音的复杂性。所提出的DM-NSR方法以依赖说话者和独立于说话者的范式呈现。为了突出所提出模型相对于传统模型的性能,还提供了DM-NSR的多视图单学习者模型,并详细比较了它们的效率。此外,还对突出的构音障碍ASR方法与所提出的方法进行了比较。结果表明,与参考模型相比,DM-NSR的识别率提高了24.67%,错误率降低了8.63%。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验