Natural language processing: an introduction.

Affiliation

Yale University School of Medicine, New Haven, Connecticut, USA.

Publication information

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):544-51. doi: 10.1136/amiajnl-2011-000464.

DOI:10.1136/amiajnl-2011-000464

PMID:21846786

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3168328/

Abstract

OBJECTIVES

To provide an overview and tutorial of natural language processing (NLP) and modern NLP-system design.

TARGET AUDIENCE

This tutorial targets the medical informatics generalist who has limited acquaintance with the principles behind NLP and/or limited knowledge of the current state of the art.

SCOPE

We describe the historical evolution of NLP, and summarize common NLP sub-problems in this extensive field. We then provide a synopsis of selected highlights of medical NLP efforts. After providing a brief description of common machine-learning approaches that are being used for diverse NLP sub-problems, we discuss how modern NLP architectures are designed, with a summary of the Apache Foundation's Unstructured Information Management Architecture. We finally consider possible future directions for NLP, and reflect on the possible impact of IBM Watson on the medical field.

Similar articles

Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances.

J Biomed Inform. 2018 Dec;88:11-19. doi: 10.1016/j.jbi.2018.10.005. Epub 2018 Oct 24.

Large language models for biomedicine: foundations, opportunities, challenges, and best practices.

J Am Med Inform Assoc. 2024 Sep 1;31(9):2114-2124. doi: 10.1093/jamia/ocae074.

A comparison of word embeddings for the biomedical natural language processing.

J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Closing the gap between NLP research and clinical practice.

Methods Inf Med. 2010;49(4):317-9.

Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey.

Yearb Med Inform. 2023 Aug;32(1):230-243. doi: 10.1055/s-0043-1768726. Epub 2023 Dec 26.

Medical Information Extraction in the Age of Deep Learning.

Yearb Med Inform. 2020 Aug;29(1):208-220. doi: 10.1055/s-0040-1702001. Epub 2020 Aug 21.

Methodology for creating UMLS content views appropriate for biomedical natural language processing.

AMIA Annu Symp Proc. 2008 Nov 6;2008:21-5.

A scoping review of publicly available language tasks in clinical natural language processing.

J Am Med Inform Assoc. 2022 Sep 12;29(10):1797-1806. doi: 10.1093/jamia/ocac127.

Data for registry and quality review can be retrospectively collected using natural language processing from unstructured charts of arthroplasty patients.

Bone Joint J. 2020 Jul;102-B(7_Supple_B):99-104. doi: 10.1302/0301-620X.102B7.BJJ-2019-1574.R1.

Cited by

Artificial intelligence in interventional cardiology: a review of its role in diagnosis, decision-making, and procedural precision.

Ann Med Surg (Lond). 2025 Jul 18;87(9):5720-5734. doi: 10.1097/MS9.0000000000003602. eCollection 2025 Sep.

Natural Language Processing and Coding for Detecting Bleeding Events in Discharge Summaries: Comparative Cross-Sectional Study.

JMIR Med Inform. 2025 Aug 29;13:e67837. doi: 10.2196/67837.

Large language models for clinical decision support in gastroenterology and hepatology.

Nat Rev Gastroenterol Hepatol. 2025 Aug 22. doi: 10.1038/s41575-025-01108-1.

Identifying Hidden Barriers to PrEP Adherence Among Young Men Who Have Sex with Men: Application of Natural Language Processing.

AIDS Behav. 2025 Aug 19. doi: 10.1007/s10461-025-04863-z.

Artificial Intelligence in Hypertrophic Cardiomyopathy: Advances, Challenges, and Future Directions for Personalized Risk Prediction and Management.

Cureus. 2025 Jul 14;17(7):e87907. doi: 10.7759/cureus.87907. eCollection 2025 Jul.

Assessing the Role of Large Language Models Between ChatGPT and DeepSeek in Asthma Education for Bilingual Individuals: Comparative Study.

JMIR Med Inform. 2025 Aug 13;13:e65365. doi: 10.2196/65365.

Forecasting school violence risk with incomplete interview data: an automated assessment approach.

JAMIA Open. 2025 Jul 31;8(4):ooaf084. doi: 10.1093/jamiaopen/ooaf084. eCollection 2025 Aug.

Quantifying uncert-AI-nty: Testing the accuracy of LLMs' confidence judgments.

Mem Cognit. 2025 Jul 22. doi: 10.3758/s13421-025-01755-4.

The potential utility of CHATGPT4.0 as an AI assistant in the education and management of patients with Barrett's esophagus.

Dis Esophagus. 2025 Jul 3;38(4). doi: 10.1093/dote/doaf050.

AI-Based Drug Discovery and Design for Different Genetic Designs.

Methods Mol Biol. 2025;2952:125-148. doi: 10.1007/978-1-0716-4690-8_8.
