Clinical Note Section Detection Using a Hidden Markov Model of Unified Medical Language System Semantic Types.

Affiliations

Center for Biomedical Informatics, Brown University, Providence RI.

The Warren Alpert Medical School, Brown University, Providence, RI.

Publication information

AMIA Annu Symp Proc. 2022 Feb 21;2021:418-427. eCollection 2021.

PMID:35308919

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8861726/

Abstract

Clinical notes are a rich source of biomedical data for natural language processing (NLP). The identification of note sections represents a first step in creating portable NLP tools. Here, a system that used a heterogeneous hidden Markov model (HMM) was designed to identify seven note sections: (1) Medical History, (2) Medications, (3) Family and Social History, (4) Physical Exam, (5) Labs and Imaging, (6) Assessment and Plan, and (7) Review of Systems. Unified Medical Language System (UMLS) concepts were identified using MetaMap, and UMLS semantic type distributions for each section type were empirically determined. The UMLS semantic type distributions were used to train the HMM for identifying clinical note sections. The system was evaluated relative to a template boundary model using manually annotated notes from the Medical Information Mart for Intensive Care III. The results show promise for an approach to segment clinical notes into sections for subsequent NLP tasks.
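To make the idea concrete, here is a minimal sketch of how per-section semantic type distributions could drive an HMM section labeler. It is an illustration under stated assumptions, not the authors' system: it uses a plain first-order HMM rather than the heterogeneous HMM described above, it covers only three of the seven sections, it does not call MetaMap, and the training lines, the add-alpha smoothing, and the helper names `train` and `viterbi` are all invented for the example. The tags `dsyn`, `sosy`, `phsu`, and `lbpr` are UMLS semantic type abbreviations (Disease or Syndrome, Sign or Symptom, Pharmacologic Substance, Laboratory Procedure).

```python
import math
from collections import Counter, defaultdict

# Toy labeled notes: each line is (section, semantic types found on that line).
# Section names come from the abstract; the lines themselves are invented.
TRAIN = [
    [("Medical History", ["dsyn", "sosy"]),
     ("Medical History", ["dsyn"]),
     ("Medications", ["phsu", "phsu"]),
     ("Medications", ["phsu"]),
     ("Labs and Imaging", ["lbpr", "lbpr"])],
    [("Medical History", ["sosy", "dsyn"]),
     ("Medications", ["phsu"]),
     ("Labs and Imaging", ["lbpr"]),
     ("Labs and Imaging", ["lbpr", "dsyn"])],
]

def train(notes, alpha=1.0):
    """Estimate start, transition, and emission log-probabilities (add-alpha smoothing)."""
    states = sorted({s for note in notes for s, _ in note})
    vocab = sorted({t for note in notes for _, types in note for t in types})
    start, trans, emit = Counter(), defaultdict(Counter), defaultdict(Counter)
    for note in notes:
        start[note[0][0]] += 1
        for (prev, _), (cur, _) in zip(note, note[1:]):
            trans[prev][cur] += 1
        for s, types in note:
            emit[s].update(types)  # semantic type counts per section

    def log_dist(counts, keys):
        total = sum(counts[k] for k in keys) + alpha * len(keys)
        return {k: math.log((counts[k] + alpha) / total) for k in keys}

    return (states,
            log_dist(start, states),
            {s: log_dist(trans[s], states) for s in states},
            {s: log_dist(emit[s], vocab) for s in states})

def viterbi(lines, states, start, trans, emit):
    """Return the most likely section label for each line of a note."""
    def line_score(s, types):
        # Treat the types on a line as independent draws from the section's
        # distribution; types unseen in training are skipped (an assumption).
        return sum(emit[s][t] for t in types if t in emit[s])

    best = {s: start[s] + line_score(s, lines[0]) for s in states}
    back = []
    for types in lines[1:]:
        prev_best, best, pointers = best, {}, {}
        for s in states:
            p = max(states, key=lambda q: prev_best[q] + trans[q][s])
            best[s] = prev_best[p] + trans[p][s] + line_score(s, types)
            pointers[s] = p
        back.append(pointers)
    path = [max(states, key=best.get)]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]

model = train(TRAIN)
note = [["dsyn", "sosy"], ["phsu"], ["phsu", "phsu"], ["lbpr"]]
labels = viterbi(note, *model)
print(labels)
```

On this toy input the decoder labels the first line Medical History, the next two Medications, and the last Labs and Imaging. Summing per-type log-probabilities for each line is one simple emission model among several possible choices; the abstract does not specify how the paper combines types within a line.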
