• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理技术对患者发起的电子健康记录消息进行分析,以识别 COVID-19 感染患者。

Use of Natural Language Processing of Patient-Initiated Electronic Health Record Messages to Identify Patients With COVID-19 Infection.

机构信息

Currently a medical student at Emory University School of Medicine, Atlanta, Georgia.

School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta.

出版信息

JAMA Netw Open. 2023 Jul 3;6(7):e2322299. doi: 10.1001/jamanetworkopen.2023.22299.

DOI:10.1001/jamanetworkopen.2023.22299
PMID:37418261
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10329205/
Abstract

IMPORTANCE

Natural language processing (NLP) has the potential to enable faster treatment access by reducing clinician response time and improving electronic health record (EHR) efficiency.

OBJECTIVE

To develop an NLP model that can accurately classify patient-initiated EHR messages and triage COVID-19 cases to reduce clinician response time and improve access to antiviral treatment.

DESIGN, SETTING, AND PARTICIPANTS: This retrospective cohort study assessed development of a novel NLP framework to classify patient-initiated EHR messages and subsequently evaluate the model's accuracy. Included patients sent messages via the EHR patient portal from 5 Atlanta, Georgia, hospitals between March 30 and September 1, 2022. Assessment of the model's accuracy consisted of manual review of message contents to confirm the classification label by a team of physicians, nurses, and medical students, followed by retrospective propensity score-matched clinical outcomes analysis.

EXPOSURE

Prescription of antiviral treatment for COVID-19.

MAIN OUTCOMES AND MEASURES

The 2 primary outcomes were (1) physician-validated evaluation of the NLP model's message classification accuracy and (2) analysis of the model's potential clinical effect via increased patient access to treatment. The model classified messages into COVID-19-other (pertaining to COVID-19 but not reporting a positive test), COVID-19-positive (reporting a positive at-home COVID-19 test result), and non-COVID-19 (not pertaining to COVID-19).

RESULTS

Among 10 172 patients whose messages were included in analyses, the mean (SD) age was 58 (17) years; 6509 patients (64.0%) were women and 3663 (36.0%) were men. In terms of race and ethnicity, 2544 patients (25.0%) were African American or Black, 20 (0.2%) were American Indian or Alaska Native, 1508 (14.8%) were Asian, 28 (0.3%) were Native Hawaiian or other Pacific Islander, 5980 (58.8%) were White, 91 (0.9%) were more than 1 race or ethnicity, and 1 (0.01%) chose not to answer. The NLP model had high accuracy and sensitivity, with a macro F1 score of 94% and sensitivity of 85% for COVID-19-other, 96% for COVID-19-positive, and 100% for non-COVID-19 messages. Among the 3048 patient-generated messages reporting positive SARS-CoV-2 test results, 2982 (97.8%) were not documented in structured EHR data. Mean (SD) message response time for COVID-19-positive patients who received treatment (364.10 [784.47] minutes) was faster than for those who did not (490.38 [1132.14] minutes; P = .03). Likelihood of antiviral prescription was inversely correlated with message response time (odds ratio, 0.99 [95% CI, 0.98-1.00]; P = .003).

CONCLUSIONS AND RELEVANCE

In this cohort study of 2982 COVID-19-positive patients, a novel NLP model classified patient-initiated EHR messages reporting positive COVID-19 test results with high sensitivity. Furthermore, when responses to patient messages occurred faster, patients were more likely to receive antiviral medical prescription within the 5-day treatment window. Although additional analysis on the effect on clinical outcomes is needed, these findings represent a possible use case for integration of NLP algorithms into clinical care.

摘要

重要性

自然语言处理(NLP)有可能通过减少临床医生的响应时间和提高电子健康记录(EHR)的效率来加速治疗的获得。

目的

开发一种能够准确分类患者发起的 EHR 消息的 NLP 模型,以筛选 COVID-19 病例,从而减少临床医生的响应时间并改善获得抗病毒治疗的机会。

设计、地点和参与者:这是一项回顾性队列研究,评估了一种新的 NLP 框架的开发,以分类患者发起的 EHR 消息,并随后评估该模型的准确性。纳入了 2022 年 3 月 30 日至 9 月 1 日期间,来自佐治亚州亚特兰大的 5 家医院的患者通过 EHR 患者门户发送的消息。模型准确性的评估包括通过一组医生、护士和医学生对消息内容进行手动审查,以确认分类标签,随后进行回顾性倾向评分匹配的临床结果分析。

暴露

COVID-19 的抗病毒治疗处方。

主要结果和措施

主要结果有两个:(1)医生验证的 NLP 模型消息分类准确性评估;(2)通过增加患者获得治疗的机会,分析模型的潜在临床效果。该模型将消息分类为 COVID-19-其他(与 COVID-19 相关但未报告阳性检测结果)、COVID-19-阳性(报告家庭 COVID-19 检测结果阳性)和非 COVID-19(与 COVID-19 无关)。

结果

在 10772 名其消息被纳入分析的患者中,平均(标准差)年龄为 58(17)岁;6509 名患者(64.0%)为女性,3663 名(36.0%)为男性。在种族和民族方面,2544 名患者(25.0%)为非裔美国人或黑人,20 名(0.2%)为美洲印第安人或阿拉斯加原住民,1508 名(14.8%)为亚洲人,28 名(0.3%)为夏威夷原住民或其他太平洋岛民,5980 名(58.8%)为白人,91 名(0.9%)为多种族或民族,1 名(0.01%)选择不回答。该 NLP 模型具有较高的准确性和敏感性,COVID-19-其他的宏观 F1 得分为 94%,敏感性为 85%,COVID-19-阳性的得分为 96%,非 COVID-19 的得分为 100%。在 3048 名报告 SARS-CoV-2 检测结果阳性的患者生成的消息中,有 2982 条(97.8%)未在结构化 EHR 数据中记录。接受治疗的 COVID-19-阳性患者的平均(标准差)消息响应时间为 364.10(784.47)分钟,比未接受治疗的患者(490.38(1132.14)分钟)更快(P=0.03)。抗病毒药物处方的可能性与消息响应时间呈负相关(比值比,0.99[95%置信区间,0.98-1.00];P=0.003)。

结论和相关性

在这项针对 2982 名 COVID-19 阳性患者的队列研究中,一种新的 NLP 模型以高灵敏度对报告 COVID-19 阳性检测结果的患者发起的 EHR 消息进行分类。此外,当对患者消息的响应更快时,患者在 5 天治疗窗口内获得抗病毒药物处方的可能性更大。尽管需要进一步分析对临床结果的影响,但这些发现代表了将 NLP 算法集成到临床护理中的一种可能应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f902/10329205/cca100d2cbbd/jamanetwopen-e2322299-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f902/10329205/cca100d2cbbd/jamanetwopen-e2322299-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f902/10329205/cca100d2cbbd/jamanetwopen-e2322299-g001.jpg

相似文献

1
Use of Natural Language Processing of Patient-Initiated Electronic Health Record Messages to Identify Patients With COVID-19 Infection.利用自然语言处理技术对患者发起的电子健康记录消息进行分析,以识别 COVID-19 感染患者。
JAMA Netw Open. 2023 Jul 3;6(7):e2322299. doi: 10.1001/jamanetworkopen.2023.22299.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Association of Electronic Health Record Inbasket Message Characteristics With Physician Burnout.电子健康记录收件匣信息特征与医生倦怠的关联。
JAMA Netw Open. 2022 Nov 1;5(11):e2244363. doi: 10.1001/jamanetworkopen.2022.44363.
4
Underserved populations with missing race ethnicity data differ significantly from those with structured race/ethnicity documentation.服务不足的人群中缺失种族民族数据与那些有结构化种族/民族文档记录的人群有显著差异。
J Am Med Inform Assoc. 2019 Aug 1;26(8-9):722-729. doi: 10.1093/jamia/ocz040.
5
Assessment of Natural Language Processing of Electronic Health Records to Measure Goals-of-Care Discussions as a Clinical Trial Outcome.评估电子健康记录中的自然语言处理以衡量作为临床试验结局的照护目标讨论。
JAMA Netw Open. 2023 Mar 1;6(3):e231204. doi: 10.1001/jamanetworkopen.2023.1204.
6
Differences in Care Team Response to Patient Portal Messages by Patient Race and Ethnicity.患者种族和民族差异对患者门户信息的医护团队响应的影响。
JAMA Netw Open. 2024 Mar 4;7(3):e242618. doi: 10.1001/jamanetworkopen.2024.2618.
7
Moving Biosurveillance Beyond Coded Data Using AI for Symptom Detection From Physician Notes: Retrospective Cohort Study.利用人工智能从医生笔记中检测症状,推动生物监测超越编码数据:回顾性队列研究。
J Med Internet Res. 2024 Apr 4;26:e53367. doi: 10.2196/53367.
8
A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study.基于荷兰全科电子健康记录的 COVID-19 检测自然语言处理模型:使用转换器的双向编码器表示进行开发和验证研究。
J Med Internet Res. 2023 Oct 4;25:e49944. doi: 10.2196/49944.
9
Evaluation of Attention Switching and Duration of Electronic Inbox Work Among Primary Care Physicians.评价初级保健医生的注意力转换和电子收件箱工作持续时间。
JAMA Netw Open. 2021 Jan 4;4(1):e2031856. doi: 10.1001/jamanetworkopen.2020.31856.
10
Rates of ICD-10 Code U09.9 Documentation and Clinical Characteristics of VA Patients With Post-COVID-19 Condition.ICD-10 编码 U09.9 的记录率和退伍军人事务部患有新冠后状况患者的临床特征。
JAMA Netw Open. 2023 Dec 1;6(12):e2346783. doi: 10.1001/jamanetworkopen.2023.46783.

引用本文的文献

1
Natural Language Processing Improves Reliable Identification of COVID-19 Compared to Diagnostic Codes Alone.与仅使用诊断代码相比,自然语言处理可提高对新冠肺炎的可靠识别率。
Am J Epidemiol. 2025 Jul 30. doi: 10.1093/aje/kwaf162.
2
Extracting circumstances of Covid-19 transmission from free text with large language models.使用大语言模型从自由文本中提取新冠病毒-19传播情况
Nat Commun. 2025 Jul 1;16(1):5836. doi: 10.1038/s41467-025-60762-w.
3
Explainability Enhanced Machine Learning Model for Classifying Intellectual Disability and Attention-Deficit/Hyperactivity Disorder With Psychological Test Reports.

本文引用的文献

1
Association Between Electronic Health Record Time and Quality of Care Metrics in Primary Care.电子健康记录时间与初级保健护理质量指标的关联。
JAMA Netw Open. 2022 Oct 3;5(10):e2237086. doi: 10.1001/jamanetworkopen.2022.37086.
2
Use of At-Home COVID-19 Tests - United States, August 23, 2021-March 12, 2022.家庭版 COVID-19 检测的使用情况-美国,2021 年 8 月 23 日-2022 年 3 月 12 日。
MMWR Morb Mortal Wkly Rep. 2022 Apr 1;71(13):489-494. doi: 10.15585/mmwr.mm7113e1.
3
Oral Nirmatrelvir for High-Risk, Nonhospitalized Adults with Covid-19.
基于心理测试报告的可解释性增强型机器学习模型用于智力障碍和注意力缺陷多动障碍的分类
J Korean Med Sci. 2025 Mar 24;40(11):e26. doi: 10.3346/jkms.2025.40.e26.
4
Association of delayed asthma diagnosis with asthma exacerbations in children.儿童哮喘延迟诊断与哮喘急性加重的关联。
J Allergy Clin Immunol Glob. 2025 Jan 16;4(2):100409. doi: 10.1016/j.jacig.2025.100409. eCollection 2025 May.
5
A pediatric emergency prediction model using natural language process in the pediatric emergency department.一种在儿科急诊科使用自然语言处理的儿科急诊预测模型。
Sci Rep. 2025 Jan 28;15(1):3574. doi: 10.1038/s41598-025-87161-x.
6
Natural language processing to evaluate texting conversations between patients and healthcare providers during COVID-19 Home-Based Care in Rwanda at scale.在卢旺达大规模开展的新冠疫情居家护理期间,运用自然语言处理技术评估患者与医护人员之间的短信对话。
PLOS Digit Health. 2025 Jan 15;4(1):e0000625. doi: 10.1371/journal.pdig.0000625. eCollection 2025 Jan.
7
Graph theoretic visualization of patient and health worker messaging in the EHR.电子健康记录中患者与医护人员信息传递的图论可视化。
Front Artif Intell. 2024 Dec 3;7:1422208. doi: 10.3389/frai.2024.1422208. eCollection 2024.
8
Using Large Language Models to Extract Core Injury Information From Emergency Department Notes.使用大语言模型从急诊科病历中提取核心损伤信息。
J Korean Med Sci. 2024 Dec 2;39(46):e291. doi: 10.3346/jkms.2024.39.e291.
9
Artificial Intelligence in Medical Affairs: A New Paradigm with Novel Opportunities.人工智能在医疗事务中的应用:一种具有新机遇的新模式。
Pharmaceut Med. 2024 Sep;38(5):331-342. doi: 10.1007/s40290-024-00536-9. Epub 2024 Sep 11.
10
A pseudonymized corpus of occupational health narratives for clinical entity recognition in Spanish.西班牙语临床实体识别用假名化职业健康叙事语料库。
BMC Med Inform Decis Mak. 2024 Jul 24;24(1):204. doi: 10.1186/s12911-024-02609-w.
奈玛特韦片/利托那韦片组合包装口服药用于伴有进展为重症高风险因素的 COVID-19 门诊患者。
N Engl J Med. 2022 Apr 14;386(15):1397-1408. doi: 10.1056/NEJMoa2118542. Epub 2022 Feb 16.
4
Patient Portal Messaging for Asynchronous Virtual Care During the COVID-19 Pandemic: Retrospective Analysis.COVID-19大流行期间用于异步虚拟护理的患者门户消息传递:回顾性分析
JMIR Hum Factors. 2022 May 5;9(2):e35187. doi: 10.2196/35187.
5
Trends in Electronic Health Record Inbox Messaging During the COVID-19 Pandemic in an Ambulatory Practice Network in New England.在新英格兰地区的一个门诊实践网络中,COVID-19 大流行期间电子健康记录收件箱消息的趋势。
JAMA Netw Open. 2021 Oct 1;4(10):e2131490. doi: 10.1001/jamanetworkopen.2021.31490.
6
Effect of Monoclonal Antibody Treatment on Clinical Outcomes in Ambulatory Patients With Coronavirus Disease 2019.单克隆抗体治疗对2019冠状病毒病门诊患者临床结局的影响。
Open Forum Infect Dis. 2021 Jun 12;8(7):ofab315. doi: 10.1093/ofid/ofab315. eCollection 2021 Jul.
7
Using Natural Language Processing to Measure and Improve Quality of Diabetes Care: A Systematic Review.使用自然语言处理技术衡量和改善糖尿病护理质量:系统评价。
J Diabetes Sci Technol. 2021 May;15(3):553-560. doi: 10.1177/19322968211000831. Epub 2021 Mar 19.
8
Real-time prediction of COVID-19 related mortality using electronic health records.利用电子健康记录实时预测 COVID-19 相关死亡率。
Nat Commun. 2021 Feb 16;12(1):1058. doi: 10.1038/s41467-020-20816-7.
9
High Volume Portal Usage Impacts Practice Resources.高流量门户使用影响实践资源。
J Am Board Fam Med. 2020 May-Jun;33(3):452-455. doi: 10.3122/jabfm.2020.03.190401.
10
Detecting Hypoglycemia Incidents Reported in Patients' Secure Messages: Using Cost-Sensitive Learning and Oversampling to Reduce Data Imbalance.检测患者安全信息中报告的低血糖事件:使用成本敏感学习和过采样来减少数据不平衡
J Med Internet Res. 2019 Mar 11;21(3):e11990. doi: 10.2196/11990.