语音处理技术——及其在电信领域的应用。

Voice-processing technologies--their application in telecommunications.

作者信息

Wilpon J G

机构信息

AT&T Bell Laboratories, Murray Hill, NJ 07974, USA.

出版信息

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9991-8. doi: 10.1073/pnas.92.22.9991.

DOI:10.1073/pnas.92.22.9991

PMID:7479815

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC40724/

Abstract

As the telecommunications industry evolves over the next decade to provide the products and services that people will desire, several key technologies will become commonplace. Two of these, automatic speech recognition and text-to-speech synthesis, will provide users with more freedom on when, where, and how they access information. While these technologies are currently in their infancy, their capabilities are rapidly increasing and their deployment in today's telephone network is expanding. The economic impact of just one application, the automation of operator services, is well over $100 million per year. Yet there still are many technical challenges that must be resolved before these technologies can be deployed ubiquitously in products and services throughout the worldwide telephone network. These challenges include: (i) High level of accuracy. The technology must be perceived by the user as highly accurate, robust, and reliable. (ii) Easy to use. Speech is only one of several possible input/output modalities for conveying information between a human and a machine, much like a computer terminal or Touch-Tone pad on a telephone. It is not the final product. Therefore, speech technologies must be hidden from the user. That is, the burden of using the technology must be on the technology itself. (iii) Quick prototyping and development of new products and services. The technology must support the creation of new products and services based on speech in an efficient and timely fashion. In this paper I present a vision of the voice-processing industry with a focus on the areas with the broadest base of user penetration: speech recognition, text-to-speech synthesis, natural language processing, and speaker recognition technologies. The current and future applications of these technologies in the telecommunications industry will be examined in terms of their strengths, limitations, and the degree to which user needs have been or have yet to be met. Although noteworthy gains have been made in areas with potentially small user bases and in the more mature speech-coding technologies, these subjects are outside the scope of this paper.

摘要

在未来十年，随着电信行业不断发展以提供人们所需的产品和服务，若干关键技术将变得司空见惯。其中两项技术，即自动语音识别和文本转语音合成，将为用户提供更多关于获取信息的时间、地点和方式的自由。虽然这些技术目前尚处于起步阶段，但其功能正在迅速增强，并且在当今电话网络中的应用也在不断扩大。仅一项应用，即话务员服务自动化，每年产生的经济影响就超过1亿美元。然而，在这些技术能够在全球电话网络的产品和服务中广泛部署之前，仍有许多技术挑战必须加以解决。这些挑战包括：（i）高精度。该技术必须让用户认为高度准确、稳健且可靠。（ii）易于使用。语音只是人与机器之间传递信息的几种可能的输入/输出方式之一，就像计算机终端或电话上的按键式拨号盘一样。它不是最终产品。因此，语音技术必须对用户隐藏起来。也就是说，使用该技术的负担必须由技术本身承担。（iii）快速进行新产品和服务的原型设计与开发。该技术必须支持高效、及时地创建基于语音的新产品和服务。在本文中，我将呈现语音处理行业的愿景，重点关注用户渗透率最高广泛的领域：语音识别、文本转语音合成、自然语言处理和说话人识别技术。将从这些技术的优势、局限性以及满足用户需求的程度方面，考察它们在电信行业的当前和未来应用。尽管在潜在用户群体较小的领域以及更成熟的语音编码技术方面已经取得了显著进展，但这些主题不在本文讨论范围内。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef7/40724/2a9612a14c7a/pnas01500-0092-a.jpg

相似文献

Voice-processing technologies--their application in telecommunications.语音处理技术——及其在电信领域的应用。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9991-8. doi: 10.1073/pnas.92.22.9991.

Commercial applications of speech interface technology: an industry at the threshold.语音接口技术的商业应用：一个处于起步阶段的行业。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10007-10. doi: 10.1073/pnas.92.22.10007.

A perspective on early commercial applications of voice-processing technology for telecommunications and aids for the handicapped.语音处理技术在电信及残疾人辅助设备方面的早期商业应用展望。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9989-90. doi: 10.1073/pnas.92.22.9989.

Toward the ultimate synthesis/recognition system.迈向终极合成/识别系统。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10040-5. doi: 10.1073/pnas.92.22.10040.

Deployment of human-machine dialogue systems.人机对话系统的部署

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10017-22. doi: 10.1073/pnas.92.22.10017.

Research in speech communication.言语交际研究。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9938-45. doi: 10.1073/pnas.92.22.9938.

State of the art in continuous speech recognition.连续语音识别的技术现状。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9956-63. doi: 10.1073/pnas.92.22.9956.

Scientific bases of human-machine communication by voice.人机语音通信的科学基础。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9914-20. doi: 10.1073/pnas.92.22.9914.

Speech technology in 2001: new research directions.2001年的语音技术：新的研究方向。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10046-51. doi: 10.1073/pnas.92.22.10046.

The role of voice input for human-machine communication.语音输入在人机通信中的作用。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9921-7. doi: 10.1073/pnas.92.22.9921.

引用本文的文献

Models of natural language understanding.自然语言理解模型。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9977-82. doi: 10.1073/pnas.92.22.9977.

Speech recognition technology: a critique.语音识别技术：一篇评论

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9953-5. doi: 10.1073/pnas.92.22.9953.

The role of voice input for human-machine communication.语音输入在人机通信中的作用。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9921-7. doi: 10.1073/pnas.92.22.9921.

Speech technology in 2001: new research directions.2001年的语音技术：新的研究方向。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10046-51. doi: 10.1073/pnas.92.22.10046.

Toward the ultimate synthesis/recognition system.迈向终极合成/识别系统。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10040-5. doi: 10.1073/pnas.92.22.10040.

Deployment of human-machine dialogue systems.人机对话系统的部署

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10017-22. doi: 10.1073/pnas.92.22.10017.

本文引用的文献

User interfaces for voice applications.语音应用的用户界面。

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10031-7. doi: 10.1073/pnas.92.22.10031.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

语音处理技术——及其在电信领域的应用。

Voice-processing technologies--their application in telecommunications.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献