A modular pipeline for natural language processing-screened human abstraction of a pragmatic trial outcome from electronic health records.

Author Information

Lee Robert Y, Li Kevin S, Sibley James, Cohen Trevor, Lober William B, O'Brien Janaki, LeDuc Nicole, Andrews Kasey Mallon, Ungar Anna, Walsh Jessica, Nielsen Elizabeth L, Dotolo Danae G, Kross Erin K

Affiliations

Division of Pulmonary, Critical Care, and Sleep Medicine, University of Washington, Seattle, USA.

Cambia Palliative Care Center of Excellence at UW Medicine, University of Washington, Seattle, USA.

Publication Information

medRxiv. 2025 Jun 24:2025.06.23.25330134. doi: 10.1101/2025.06.23.25330134.

Abstract

BACKGROUND

Natural language processing (NLP) allows efficient extraction of clinical variables and outcomes from electronic health records (EHR). However, measuring pragmatic clinical trial outcomes may demand accuracy that exceeds NLP performance. Combining NLP with human adjudication can address this gap, yet few software solutions support such workflows. We developed a modular, scalable system for NLP-screened human abstraction to measure the primary outcomes of two clinical trials.

METHODS

In two clinical trials of hospitalized patients with serious illness, a deep-learning NLP model screened EHR passages for documented goals-of-care discussions. Screen-positive passages were referred for human adjudication using a REDCap-based system to measure the trial outcomes. Dynamic pooling of passages using structured query language (SQL) within the REDCap database reduced unnecessary abstraction while ensuring data completeness.
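To make the dynamic pooling idea concrete, here is a minimal, hypothetical sketch of the kind of query logic involved. It is not the trial's actual REDCap schema or source code: the passages and adjudications tables, their columns, and the toy rows below are assumptions, used only to show how pooling can skip remaining passages for patients whose first documented goals-of-care discussion has already been confirmed, while still referring every unreviewed screen-positive passage for patients whose outcome is unknown.

# Hypothetical sketch of "dynamic pooling" of screen-positive passages.
# Table and column names are illustrative assumptions, not the trial's REDCap schema.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE passages (
    passage_id  INTEGER PRIMARY KEY,
    patient_id  INTEGER,
    note_date   TEXT,     -- ISO date of the source note
    nlp_score   REAL,     -- NLP model probability of goals-of-care content
    screen_pos  INTEGER   -- 1 if nlp_score exceeded the screening threshold
);
CREATE TABLE adjudications (
    passage_id  INTEGER,
    is_goc      INTEGER   -- human decision: 1 = confirmed goals-of-care discussion
);
""")

conn.executemany(
    "INSERT INTO passages VALUES (?, ?, ?, ?, ?)",
    [
        (1, 101, "2024-01-02", 0.97, 1),  # patient 101: two screen-positive passages
        (2, 101, "2024-01-05", 0.91, 1),
        (3, 202, "2024-01-03", 0.88, 1),  # patient 202: one screen-positive passage
        (4, 202, "2024-01-04", 0.10, 0),  # below threshold, never referred
    ],
)

# Patient 101's earliest passage has already been confirmed as a goals-of-care
# discussion, so no further passages from that patient require abstraction.
conn.execute("INSERT INTO adjudications VALUES (1, 1)")

# Pool the unreviewed screen-positive passages, but only for patients whose
# outcome (first documented discussion) is not yet established.
pool = conn.execute("""
    SELECT p.patient_id, p.passage_id, p.note_date
    FROM passages AS p
    WHERE p.screen_pos = 1
      AND p.passage_id NOT IN (SELECT passage_id FROM adjudications)
      AND p.patient_id NOT IN (
            SELECT p2.patient_id
            FROM adjudications AS a
            JOIN passages AS p2 ON a.passage_id = p2.passage_id
            WHERE a.is_goc = 1
          )
    ORDER BY p.patient_id, p.note_date
""").fetchall()

print(pool)  # [(202, 3, '2024-01-03')] -- only patient 202 still needs review

In this sketch, only patient 202's passage is pooled for review: patient 101 already has a confirmed discussion, so that patient's remaining screen-positive passages no longer need abstraction. This is how pooling can reduce unnecessary abstraction without sacrificing data completeness.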

RESULTS

In the first trial (N=2,512), NLP identified 22,187 screen-positive passages (0.8%) from 2.6 million EHR passages. Human reviewers adjudicated 7,494 passages over 34.3 abstractor-hours to measure the cumulative incidence and time to first documented goals-of-care discussion for all patients with 92.6% patient-level sensitivity. In the second trial (N=617), NLP identified 8,952 screen-positive passages (1.6%) from 559,596 passages at a threshold with near-100% sensitivity. Human reviewers adjudicated 3,509 passages over 27.9 abstractor-hours to measure the same outcome for all patients.
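As a side note on the screening threshold, the sketch below illustrates one simple way a near-100%-sensitivity operating point could be chosen from a labeled validation set. The function, scores, and labels are illustrative assumptions, not the trial's data or tuning procedure.

# Hypothetical sketch: pick the highest score cutoff that still captures the
# desired fraction of human-labeled positive passages in a validation set.
import math

def threshold_for_sensitivity(scores, labels, target=1.0):
    """Largest cutoff c such that calling score >= c screen-positive keeps
    sensitivity >= target on this labeled validation set."""
    pos_scores = sorted(s for s, y in zip(scores, labels) if y == 1)
    misses_allowed = math.floor(len(pos_scores) * (1.0 - target))  # positives we may miss
    return pos_scores[misses_allowed]  # score of the lowest-scoring positive kept

# Toy validation data: model scores with human labels (1 = true goals-of-care passage).
val_scores = [0.99, 0.95, 0.90, 0.40, 0.35, 0.20, 0.05]
val_labels = [1,    1,    1,    1,    0,    0,    0]

cutoff = threshold_for_sensitivity(val_scores, val_labels, target=1.0)
print(cutoff)  # 0.4 -- every labeled positive is screened in at this cutoff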

CONCLUSION

We present the design and source code for a scalable and efficient pipeline for measuring complex EHR-derived outcomes using NLP-screened human abstraction. This implementation is adaptable to diverse research needs, and its modular pipeline represents a practical middle ground between custom software and commercial platforms.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d7ea/12262768/fa99ce7e5e19/nihpp-2025.06.23.25330134v1-f0001.jpg
