Bates M
BBN Systems and Technologies, Cambridge, MA 02138, USA.
Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):9977-82. doi: 10.1073/pnas.92.22.9977.
This paper surveys some of the fundamental problems in natural language (NL) understanding (syntax, semantics, pragmatics, and discourse) and the current approaches to solving them. Some recent developments in NL processing include increased emphasis on corpus-based rather than example- or intuition-based work, attempts to measure the coverage and effectiveness of NL systems, dealing with discourse and dialogue phenomena, and attempts to use both analytic and stochastic knowledge. Critical areas for the future include grammars that are appropriate to processing large amounts of real language; automatic (or at least semi-automatic) methods for deriving models of syntax, semantics, and pragmatics; self-adapting systems; and integration with speech processing. Of particular importance are techniques that can be tuned to such requirements as full versus partial understanding and spoken language versus text. Portability (the ease with which one can configure an NL system for a particular application) is one of the largest barriers to application of this technology.