Atal B S
AT&T Bell Laboratories, Murray Hill, NJ 07974, USA.
Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10046-51. doi: 10.1073/pnas.92.22.10046.
Research in speech recognition and synthesis over the past several decades has brought speech technology to a point where it is being used in "real-world" applications. However, despite the progress, the perception remains that the current technology is not flexible enough to allow easy voice communication with machines. The focus of speech research is now on producing systems that are accurate and robust but that do not impose unnecessary constraints on the user. This chapter takes a critical look at the shortcomings of the current speech recognition and synthesis algorithms, discusses the technical challenges facing research, and examines the new directions that research in speech recognition and synthesis must take in order to form the basis of new solutions suitable for supporting a wide range of applications.