用于从电子健康记录中提取日常生活活动信息的自然语言处理系统。一项系统综述。

Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.

作者信息

Wieland-Jorna Yvonne, van Kooten Daan, Verheij Robert A, de Man Yvonne, Francke Anneke L, Oosterveld-Vlug Mariska G

机构信息

Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands.

Tranzo, School of Social Sciences and Behavioural Research, Tilburg University, Tilburg, Postbus 90153, 5000 LE, The Netherlands.

出版信息

JAMIA Open. 2024 May 24;7(2):ooae044. doi: 10.1093/jamiaopen/ooae044. eCollection 2024 Jul.

DOI:10.1093/jamiaopen/ooae044

PMID:38798774

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11126158/

Abstract

OBJECTIVE

Natural language processing (NLP) can enhance research on activities of daily living (ADL) by extracting structured information from unstructured electronic health records (EHRs) notes. This review aims to give insight into the state-of-the-art, usability, and performance of NLP systems to extract information on ADL from EHRs.

MATERIALS AND METHODS

A systematic review was conducted based on searches in Pubmed, Embase, Cinahl, Web of Science, and Scopus. Studies published between 2017 and 2022 were selected based on predefined eligibility criteria.

RESULTS

The review identified 22 studies. Most studies (65%) used NLP for classifying unstructured EHR data on 1 or 2 ADL. Deep learning, combined with a ruled-based method or machine learning, was the approach most commonly used. NLP systems varied widely in terms of the pre-processing and algorithms. Common performance evaluation methods were cross-validation and train/test datasets, with F1, precision, and sensitivity as the most frequently reported evaluation metrics. Most studies reported relativity high overall scores on the evaluation metrics.

DISCUSSION

NLP systems are valuable for the extraction of unstructured EHR data on ADL. However, comparing the performance of NLP systems is difficult due to the diversity of the studies and challenges related to the dataset, including restricted access to EHR data, inadequate documentation, lack of granularity, and small datasets.

CONCLUSION

This systematic review indicates that NLP is promising for deriving information on ADL from unstructured EHR notes. However, what the best-performing NLP system is, depends on characteristics of the dataset, research question, and type of ADL.

摘要

目的

自然语言处理（NLP）可通过从非结构化电子健康记录（EHR）笔记中提取结构化信息，加强对日常生活活动（ADL）的研究。本综述旨在深入了解NLP系统从EHR中提取ADL信息的最新技术水平、可用性和性能。

材料与方法

基于对PubMed、Embase、Cinahl、科学引文索引和Scopus的检索进行系统综述。根据预先确定的纳入标准，选取2017年至2022年间发表的研究。

结果

该综述共纳入22项研究。大多数研究（65%）使用NLP对1项或2项ADL的非结构化EHR数据进行分类。深度学习与基于规则的方法或机器学习相结合是最常用的方法。NLP系统在预处理和算法方面差异很大。常见的性能评估方法是交叉验证和训练/测试数据集，F1值、精确度和灵敏度是最常报告的评估指标。大多数研究报告的评估指标总体得分相对较高。

讨论

NLP系统对于提取关于ADL的非结构化EHR数据很有价值。然而，由于研究的多样性以及与数据集相关的挑战，包括EHR数据访问受限、记录不充分、缺乏粒度和数据集较小等，比较NLP系统的性能很困难。

结论

本系统综述表明，NLP有望从非结构化EHR笔记中获取ADL信息。然而，最佳性能的NLP系统取决于数据集的特征、研究问题和ADL的类型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6f1/11126158/c928e1ab44ad/ooae044f1.jpg

相似文献

Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.用于从电子健康记录中提取日常生活活动信息的自然语言处理系统。一项系统综述。

JAMIA Open. 2024 May 24;7(2):ooae044. doi: 10.1093/jamiaopen/ooae044. eCollection 2024 Jul.

Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review.使用机器学习方法进行自然语言处理，以分析来自电子健康记录的非结构化患者报告结局：系统评价。

Artif Intell Med. 2023 Dec;146:102701. doi: 10.1016/j.artmed.2023.102701. Epub 2023 Nov 1.

Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.电子健康记录中自由文本叙述的症状的自然语言处理：系统评价。

J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173.

Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.慢性病临床记录的自然语言处理：系统综述

JMIR Med Inform. 2019 Apr 27;7(2):e12239. doi: 10.2196/12239.

Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression.应用自然语言处理的增强型人工智能用于电子健康记录，以识别非酒精性脂肪性肝病患者中疾病进展风险较高的患者。

Int J Med Inform. 2019 Sep;129:334-341. doi: 10.1016/j.ijmedinf.2019.06.028. Epub 2019 Jul 6.

Getting More Out of Large Databases and EHRs with Natural Language Processing and Artificial Intelligence: The Future Is Here.借助自然语言处理和人工智能从大型数据库和电子健康记录中获取更多价值：未来已来。

J Bone Joint Surg Am. 2022 Oct 19;104(Suppl 3):51-55. doi: 10.2106/JBJS.22.00567.

A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。

J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Ensembles of natural language processing systems for portable phenotyping solutions.用于便携表型解决方案的自然语言处理系统集合。

J Biomed Inform. 2019 Dec;100:103318. doi: 10.1016/j.jbi.2019.103318. Epub 2019 Oct 23.

The Growing Impact of Natural Language Processing in Healthcare and Public Health.自然语言处理在医疗保健和公共卫生领域的影响日益扩大。

Inquiry. 2024 Jan-Dec;61:469580241290095. doi: 10.1177/00469580241290095.

The Food and Drug Administration Biologics Effectiveness and Safety Initiative Facilitates Detection of Vaccine Administrations From Unstructured Data in Medical Records Through Natural Language Processing.美国食品药品监督管理局生物制品有效性和安全性倡议通过自然语言处理促进从医疗记录中的非结构化数据检测疫苗接种情况。

Front Digit Health. 2021 Dec 22;3:777905. doi: 10.3389/fdgth.2021.777905. eCollection 2021.

引用本文的文献

Applying NLP methods to code functional performance in electronic health records using the international classification of functioning, disability, and health.运用自然语言处理方法，依据国际功能、残疾与健康分类对电子健康记录中的功能表现进行编码。

Disabil Health J. 2025 May 24:101888. doi: 10.1016/j.dhjo.2025.101888.

From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.从手动临床标准到机器学习算法：比较源自不同电子健康记录数据模式的结局终点。

PLOS Digit Health. 2025 May 14;4(5):e0000755. doi: 10.1371/journal.pdig.0000755. eCollection 2025 May.

Global Research Trends, Hotspots, Impacts, and Emergence of Artificial Intelligence and Machine Learning in Health and Medicine: A 25-Year Bibliometric Analysis.全球人工智能和机器学习在健康与医学领域的研究趋势、热点、影响及兴起：一项25年的文献计量分析

Healthcare (Basel). 2025 Apr 13;13(8):892. doi: 10.3390/healthcare13080892.

Qualitative changes in clinical records after implementation of pharmacist-led antimicrobial stewardship program: a text mining analysis.实施由药剂师主导的抗菌药物管理计划后临床记录的定性变化：一项文本挖掘分析

J Pharm Health Care Sci. 2025 Apr 23;11(1):34. doi: 10.1186/s40780-025-00439-0.

A Tutorial and Use Case Example of the eXtreme Gradient Boosting (XGBoost) Artificial Intelligence Algorithm for Drug Development Applications.用于药物开发应用的极限梯度提升（XGBoost）人工智能算法教程及用例示例。

Clin Transl Sci. 2025 Mar;18(3):e70172. doi: 10.1111/cts.70172.

Bibliometric analysis of artificial intelligence in healthcare research: Trends and future directions.医疗保健研究中人工智能的文献计量分析：趋势与未来方向。

Future Healthc J. 2024 Sep 3;11(3):100182. doi: 10.1016/j.fhj.2024.100182. eCollection 2024 Sep.

本文引用的文献

Assessing the efficacy of machine learning algorithms for syncope classification: A systematic review.评估机器学习算法用于晕厥分类的疗效：一项系统综述。

MethodsX. 2023 Dec 6;12:102508. doi: 10.1016/j.mex.2023.102508. eCollection 2024 Jun.

Machine learning models to detect and predict patient safety events using electronic health records: A systematic review.使用电子健康记录的机器学习模型来检测和预测患者安全事件：系统评价。

Int J Med Inform. 2023 Dec;180:105246. doi: 10.1016/j.ijmedinf.2023.105246. Epub 2023 Oct 9.

The added value of text from Dutch general practitioner notes in predictive modeling.荷兰全科医生记录中文本在预测建模中的附加价值。

J Am Med Inform Assoc. 2023 Nov 17;30(12):1973-1984. doi: 10.1093/jamia/ocad160.

Detecting acute respiratory diseases in the pediatric population using cough sound features and machine learning: A systematic review.利用咳嗽声特征和机器学习技术检测儿科急性呼吸道疾病：系统综述。

Int J Med Inform. 2023 Aug;176:105093. doi: 10.1016/j.ijmedinf.2023.105093. Epub 2023 May 18.

Natural Language Processing for Breast Imaging: A Systematic Review.用于乳腺成像的自然语言处理：一项系统综述。

Diagnostics (Basel). 2023 Apr 14;13(8):1420. doi: 10.3390/diagnostics13081420.

Neurologic outcomes of carotid and other emergent interventions for ischemic stroke over 6 years with dataset enhanced by machine learning.6 年以上的颈动脉和其他紧急干预缺血性中风的神经学结果，通过机器学习增强数据集。

J Vasc Surg. 2022 Nov;76(5):1280-1288.e2. doi: 10.1016/j.jvs.2022.06.020. Epub 2022 Jun 25.

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis.一种用于估计多发性硬化症扩展残疾状态量表评分的机器学习方法的验证

Mult Scler J Exp Transl Clin. 2022 Jun 22;8(2):20552173221108635. doi: 10.1177/20552173221108635. eCollection 2022 Apr-Jun.

Linking Free Text Documentation of Functioning and Disability to the ICF With Natural Language Processing.通过自然语言处理将功能与残疾的自由文本记录与《国际功能、残疾和健康分类》相联系。

Front Rehabil Sci. 2021 Nov;2. doi: 10.3389/fresc.2021.742702. Epub 2021 Nov 5.

Gross motor function prediction using natural language processing in cerebral palsy.使用自然语言处理预测脑瘫的粗大运动功能。

Dev Med Child Neurol. 2023 Jan;65(1):100-106. doi: 10.1111/dmcn.15301. Epub 2022 Jun 5.

Availability of information on functional limitations in structured electronic health records data.结构化电子健康记录数据中关于功能受限信息的可获取性。

J Am Geriatr Soc. 2022 Jul;70(7):2161-2163. doi: 10.1111/jgs.17776. Epub 2022 Apr 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于从电子健康记录中提取日常生活活动信息的自然语言处理系统。一项系统综述。

Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料与方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献