Department of AI and Informatics, Mayo Clinic, Rochester, MN 55902, United States.
Center for Translational AI Excellence and Applications in Medicine, University of Texas Health Science Center at Houston, Houston, TX 77030, United States.
J Am Med Inform Assoc. 2024 Jun 20;31(7):1493-1502. doi: 10.1093/jamia/ocae101.
Error analysis plays a crucial role in clinical concept extraction, a fundamental subtask within clinical natural language processing (NLP). The process typically involves manually reviewing error types, examining the contextual and linguistic factors that contribute to their occurrence, and identifying underlying causes so that the NLP model can be refined and its performance improved. Conducting error analysis can be complex, requiring a combination of NLP expertise and domain-specific knowledge. Because electronic health record (EHR) settings are highly heterogeneous across institutions, standardizing and reproducing the error analysis process can be challenging.
This study aims to facilitate a collaborative effort to establish common definitions and taxonomies for capturing diverse error types, fostering community consensus on error analysis for clinical concept extraction tasks.
We iteratively developed and evaluated an error taxonomy based on existing literature, standards, real-world data, multisite case evaluations, and community feedback. The finalized taxonomy was released in both .dtd and .owl formats at the Open Health Natural Language Processing Consortium. The taxonomy is compatible with several different open-source annotation tools, including MAE, Brat, and MedTator.
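To illustrate how the released .owl artifact could be consumed programmatically, the following is a minimal sketch using the rdflib Python library; the file name error_taxonomy.owl is a placeholder assumption, not the actual name of the published release.

```python
from rdflib import Graph, RDF, RDFS, OWL

# Load the released taxonomy; "error_taxonomy.owl" is a hypothetical
# file name standing in for the artifact from the OHNLP release.
g = Graph()
g.parse("error_taxonomy.owl", format="xml")  # OWL in RDF/XML serialization

# Enumerate every owl:Class and print its rdfs:label, giving a quick
# inventory of the error classes defined in the taxonomy.
for cls in g.subjects(RDF.type, OWL.Class):
    label = g.value(cls, RDFS.label)
    print(cls, "->", label)
```

A similar traversal over the .dtd release would instead parse the element and attribute declarations that annotation tools such as MAE use to configure their tag sets.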
The resulting error taxonomy comprises 43 distinct error classes, organized into 6 error dimensions and 4 properties, including model type (symbolic and statistical machine learning), evaluation subject (model and human), evaluation level (patient, document, sentence, and concept), and annotation examples. Internal and external evaluations revealed substantial variation in error types across methodological approaches, tasks, and EHR settings. Key points emerged from community feedback, including the need to enhance the taxonomy's clarity, generalizability, and usability, along with strategies for its dissemination.
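The property structure described above maps naturally onto a small data model. Below is a hedged Python sketch of how one annotation record might be represented; the enumerated values are taken from the abstract, while the class and field names (and the example values) are illustrative inventions, not part of the released taxonomy.

```python
from dataclasses import dataclass
from enum import Enum

# Enumerations mirror the four properties named in the abstract;
# the Python identifiers themselves are hypothetical.
class ModelType(Enum):
    SYMBOLIC = "symbolic"
    STATISTICAL_ML = "statistical machine learning"

class EvaluationSubject(Enum):
    MODEL = "model"
    HUMAN = "human"

class EvaluationLevel(Enum):
    PATIENT = "patient"
    DOCUMENT = "document"
    SENTENCE = "sentence"
    CONCEPT = "concept"

@dataclass
class ErrorAnnotation:
    """One annotated error instance, tagged with the taxonomy's properties."""
    error_class: str              # one of the 43 taxonomy classes
    error_dimension: str          # one of the 6 error dimensions
    model_type: ModelType
    evaluation_subject: EvaluationSubject
    evaluation_level: EvaluationLevel
    annotation_example: str       # free-text example from the source note

# Example usage with invented values:
ann = ErrorAnnotation(
    error_class="boundary_error",     # hypothetical class name
    error_dimension="linguistic",     # hypothetical dimension name
    model_type=ModelType.STATISTICAL_ML,
    evaluation_subject=EvaluationSubject.MODEL,
    evaluation_level=EvaluationLevel.CONCEPT,
    annotation_example="chest pain vs. chest pain radiating to left arm",
)
print(ann.error_class, ann.evaluation_level.value)
```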
The proposed taxonomy can accelerate and standardize the error analysis process in multisite settings, thereby improving the provenance, interpretability, and portability of NLP models. Future research could explore automated or semi-automated methods to assist in classifying and standardizing errors during analysis.