New trends in natural language processing: statistical natural language processing.

Author information

Marcus M

Affiliation

Department of Computer and Information Science, University of Pennsylvania, Philadelphia 19104-6389, USA.

Publication information

Proc Natl Acad Sci U S A. 1995 Oct 24;92(22):10052-9. doi: 10.1073/pnas.92.22.10052.

Abstract

The field of natural language processing (NLP) has seen a dramatic shift in both research direction and methodology in the past several years. In the past, most work in computational linguistics tended to focus on purely symbolic methods. Recently, more and more work is shifting toward hybrid methods that combine new empirical corpus-based methods, including the use of probabilistic and information-theoretic techniques, with traditional symbolic methods. This work is made possible by the recent availability of linguistic databases that add rich linguistic annotation to corpora of natural language text. Already, these methods have led to a dramatic improvement in the performance of a variety of NLP systems with similar improvement likely in the coming years. This paper focuses on these trends, surveying in particular three areas of recent progress: part-of-speech tagging, stochastic parsing, and lexical semantics.
