利用临床记录识别急性护理高风险肿瘤患者的自然语言处理方法

Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes.

作者信息

Fanconi Claudio, van Buchem Marieke, Hernandez-Boussard Tina

机构信息

Stanford University, Stanford, California, United States.

ETH Zürich, Zürich, Switzerland.

出版信息

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:138-147. eCollection 2023.

PMID:37350895

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10283145/

Abstract

Clinical notes are an essential component of a health record. This paper evaluates how natural language processing (NLP) can be used to identify the risk of acute care use (ACU) in oncology patients, once chemotherapy starts. Risk prediction using structured health data (SHD) is now standard, but predictions using free-text formats are complex. This paper explores the use of free-text notes for the prediction of ACU in leu of SHD. Deep Learning models were compared to manually engineered language features. Results show that SHD models minimally outperform NLP models; an ℓ-penalised logistic regression with SHD achieved a C-statistic of 0.748 (95%-CI: 0.735, 0.762), while the same model with language features achieved 0.730 (95%-CI: 0.717, 0.745) and a transformer-based model achieved 0.702 (95%-CI: 0.688, 0.717). This paper shows how language models can be used in clinical applications and underlines how risk bias is different for diverse patient groups, even using only free-text data.

摘要

临床记录是健康档案的重要组成部分。本文评估了自然语言处理（NLP）如何用于识别肿瘤患者化疗开始后急性护理使用（ACU）的风险。使用结构化健康数据（SHD）进行风险预测现已成为标准做法，但使用自由文本格式进行预测则较为复杂。本文探讨了在没有SHD的情况下使用自由文本记录来预测ACU。将深度学习模型与人工设计的语言特征进行了比较。结果表明，SHD模型略优于NLP模型；使用SHD的ℓ-惩罚逻辑回归模型的C统计量为0.748（95%置信区间：0.735，0.762），而使用语言特征的相同模型的C统计量为0.730（95%置信区间：0.717，0.745），基于Transformer的模型的C统计量为0.702（95%置信区间：0.688，0.717）。本文展示了语言模型如何用于临床应用，并强调了即使仅使用自由文本数据，不同患者群体的风险偏差也有所不同。

相似文献

Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes.

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:138-147. eCollection 2023.

Machine learning and natural language processing (NLP) approach to predict early progression to first-line treatment in real-world hormone receptor-positive (HR+)/HER2-negative advanced breast cancer patients.

Eur J Cancer. 2021 Feb;144:224-231. doi: 10.1016/j.ejca.2020.11.030. Epub 2020 Dec 26.

Natural language processing with deep learning for medical adverse event detection from free-text medical narratives: A case study of detecting total hip replacement dislocation.

Comput Biol Med. 2021 Feb;129:104140. doi: 10.1016/j.compbiomed.2020.104140. Epub 2020 Nov 24.

Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing.

Ophthalmol Sci. 2022 Feb 12;2(2):100127. doi: 10.1016/j.xops.2022.100127. eCollection 2022 Jun.

Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study.

JMIR Med Inform. 2021 Mar 10;9(3):e23456. doi: 10.2196/23456.

Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study.

JMIR Med Inform. 2019 Nov 1;7(4):e12575. doi: 10.2196/12575.

Identification of Preanesthetic History Elements by a Natural Language Processing Engine.

Anesth Analg. 2022 Dec 1;135(6):1162-1171. doi: 10.1213/ANE.0000000000006152. Epub 2022 Jul 15.

Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery.

Spine J. 2021 Oct;21(10):1635-1642. doi: 10.1016/j.spinee.2020.04.001. Epub 2020 Apr 12.

Identifying Goals of Care Conversations in the Electronic Health Record Using Natural Language Processing and Machine Learning.

J Pain Symptom Manage. 2021 Jan;61(1):136-142.e2. doi: 10.1016/j.jpainsymman.2020.08.024. Epub 2020 Aug 25.

Classification of the Disposition of Patients Hospitalized with COVID-19: Reading Discharge Summaries Using Natural Language Processing.

JMIR Med Inform. 2021 Feb 10;9(2):e25457. doi: 10.2196/25457.

引用本文的文献

Ensemble learning to enhance accurate identification of patients with glaucoma using electronic health records.

JAMIA Open. 2025 Aug 10;8(4):ooaf080. doi: 10.1093/jamiaopen/ooaf080. eCollection 2025 Aug.

Data-Driven Defragmentation: Achieving Value-Based Sarcoma and Rare Cancer Care Through Integrated Care Pathway Mapping.

J Pers Med. 2025 May 19;15(5):203. doi: 10.3390/jpm15050203.

Opportunities for Artificial Intelligence in Oncology: From the Lens of Clinicians and Patients.

JCO Oncol Pract. 2025 Mar 13:OP2400797. doi: 10.1200/OP-24-00797.

Large language models in cancer: potentials, risks, and safeguards.

BJR Artif Intell. 2024 Dec 20;2(1):ubae019. doi: 10.1093/bjrai/ubae019. eCollection 2025 Jan.

Applying natural language processing to patient messages to identify depression concerns in cancer patients.

J Am Med Inform Assoc. 2024 Oct 1;31(10):2255-2262. doi: 10.1093/jamia/ocae188.

The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review.

JMIR Med Inform. 2024 May 10;12:e53787. doi: 10.2196/53787.

Automated stratification of trauma injury severity across multiple body regions using multi-modal, multi-class machine learning models.

J Am Med Inform Assoc. 2024 May 20;31(6):1291-1302. doi: 10.1093/jamia/ocae071.

Predicting Depression Risk in Patients With Cancer Using Multimodal Data: Algorithm Development Study.

JMIR Med Inform. 2024 Jan 18;12:e51925. doi: 10.2196/51925.

本文引用的文献

Using deep learning-based natural language processing to identify reasons for statin nonuse in patients with atherosclerotic cardiovascular disease.

Commun Med (Lond). 2022 Jul 15;2:88. doi: 10.1038/s43856-022-00157-w. eCollection 2022.

Launching into clinical space with medspaCy: a new clinical text processing toolkit in Python.

AMIA Annu Symp Proc. 2022 Feb 21;2021:438-447. eCollection 2021.

Machine Learning Applied to Electronic Health Records: Identification of Chemotherapy Patients at High Risk for Preventable Emergency Department Visits and Hospital Admissions.

JCO Clin Cancer Inform. 2021 Oct;5:1106-1126. doi: 10.1200/CCI.21.00116.

Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.

J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.

Deep learning in clinical natural language processing: a methodical review.

J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.

Development and Validation of a Score to Predict Acute Care Use After Initiation of Systemic Therapy for Cancer.

JAMA Netw Open. 2019 Oct 2;2(10):e1912823. doi: 10.1001/jamanetworkopen.2019.12823.

A simple, step-by-step guide to interpreting decision curve analysis.

Diagn Progn Res. 2019 Oct 4;3:18. doi: 10.1186/s41512-019-0064-7. eCollection 2019.

Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data.

JAMA Netw Open. 2018 Dec 7;1(8):e185097. doi: 10.1001/jamanetworkopen.2018.5097.

A calibration hierarchy for risk models was defined: from utopia to empirical data.

J Clin Epidemiol. 2016 Jun;74:167-76. doi: 10.1016/j.jclinepi.2015.12.005. Epub 2016 Jan 6.

A Clinical Prediction Model to Assess Risk for Chemotherapy-Related Hospitalization in Patients Initiating Palliative Chemotherapy.

JAMA Oncol. 2015 Jul;1(4):441-7. doi: 10.1001/jamaoncol.2015.0828.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用临床记录识别急性护理高风险肿瘤患者的自然语言处理方法

Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献