Suppr超能文献

语音处理技术——及其在电信领域的应用。

Voice-processing technologies--their application in telecommunications.

作者信息

Wilpon J G

机构信息

AT&T Bell Laboratories, Murray Hill, NJ 07974, USA.

出版信息

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9991-8. doi: 10.1073/pnas.92.22.9991.

Abstract

As the telecommunications industry evolves over the next decade to provide the products and services that people will desire, several key technologies will become commonplace. Two of these, automatic speech recognition and text-to-speech synthesis, will provide users with more freedom on when, where, and how they access information. While these technologies are currently in their infancy, their capabilities are rapidly increasing and their deployment in today's telephone network is expanding. The economic impact of just one application, the automation of operator services, is well over $100 million per year. Yet there still are many technical challenges that must be resolved before these technologies can be deployed ubiquitously in products and services throughout the worldwide telephone network. These challenges include: (i) High level of accuracy. The technology must be perceived by the user as highly accurate, robust, and reliable. (ii) Easy to use. Speech is only one of several possible input/output modalities for conveying information between a human and a machine, much like a computer terminal or Touch-Tone pad on a telephone. It is not the final product. Therefore, speech technologies must be hidden from the user. That is, the burden of using the technology must be on the technology itself. (iii) Quick prototyping and development of new products and services. The technology must support the creation of new products and services based on speech in an efficient and timely fashion. In this paper I present a vision of the voice-processing industry with a focus on the areas with the broadest base of user penetration: speech recognition, text-to-speech synthesis, natural language processing, and speaker recognition technologies. The current and future applications of these technologies in the telecommunications industry will be examined in terms of their strengths, limitations, and the degree to which user needs have been or have yet to be met. Although noteworthy gains have been made in areas with potentially small user bases and in the more mature speech-coding technologies, these subjects are outside the scope of this paper.

摘要

在未来十年,随着电信行业不断发展以提供人们所需的产品和服务,若干关键技术将变得司空见惯。其中两项技术,即自动语音识别和文本转语音合成,将为用户提供更多关于获取信息的时间、地点和方式的自由。虽然这些技术目前尚处于起步阶段,但其功能正在迅速增强,并且在当今电话网络中的应用也在不断扩大。仅一项应用,即话务员服务自动化,每年产生的经济影响就超过1亿美元。然而,在这些技术能够在全球电话网络的产品和服务中广泛部署之前,仍有许多技术挑战必须加以解决。这些挑战包括:(i)高精度。该技术必须让用户认为高度准确、稳健且可靠。(ii)易于使用。语音只是人与机器之间传递信息的几种可能的输入/输出方式之一,就像计算机终端或电话上的按键式拨号盘一样。它不是最终产品。因此,语音技术必须对用户隐藏起来。也就是说,使用该技术的负担必须由技术本身承担。(iii)快速进行新产品和服务的原型设计与开发。该技术必须支持高效、及时地创建基于语音的新产品和服务。在本文中,我将呈现语音处理行业的愿景,重点关注用户渗透率最高广泛的领域:语音识别、文本转语音合成、自然语言处理和说话人识别技术。将从这些技术的优势、局限性以及满足用户需求的程度方面,考察它们在电信行业的当前和未来应用。尽管在潜在用户群体较小的领域以及更成熟的语音编码技术方面已经取得了显著进展,但这些主题不在本文讨论范围内。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef7/40724/2a9612a14c7a/pnas01500-0092-a.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验