Rose R C, Schroeter J, Sondhi M M
AT&T Bell Laboratories, Murray Hill, New Jersey 07974-0636, USA.
J Acoust Soc Am. 1996 Mar;99(3):1699-709. doi: 10.1121/1.414679.
This paper investigates the issues that are associated with applying speech production models to automatic speech recognition (ASR). Here the applicability of articulatory representations to ASR is considered independently of the role of articulatory representations in speech perception. While the question of whether it is necessary or even possible for human listeners to recover the state of the articulators during the process of perceiving speech is an important one, it is not considered here. Hence, the authors refrain from posing completely new paradigms for ASR which more closely parallel the relationship between speech production and human speech understanding. Instead, work aimed at integrating speech production models into existing ASR formalisms is described.
本文研究了将言语产生模型应用于自动语音识别(ASR)所涉及的问题。这里,发音表征对ASR的适用性是独立于发音表征在言语感知中的作用来考虑的。虽然人类听众在言语感知过程中是否有必要甚至有可能恢复发音器官的状态这一问题很重要,但本文不考虑这一点。因此,作者并未提出与言语产生和人类言语理解之间的关系更为紧密并行的全新ASR范式。相反,本文描述了旨在将言语产生模型集成到现有ASR形式体系中的工作。