基于大语言模型对美国各地卫生系统中患者自我报告的症状和需求进行分类。

LLM enabled classification of patient self-reported symptoms and needs in health systems across the USA.

作者信息

Naved Bilal A, Ravishankar Shravan, Colbert Georges E, Johnston Andrew, Slott Quintan M, Luo Yuan

机构信息

Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.

Clearstep Health, Chicago, IL, USA.

出版信息

NPJ Digit Med. 2025 Jul 1;8(1):390. doi: 10.1038/s41746-025-01779-9.

DOI:10.1038/s41746-025-01779-9

PMID:40595018

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12215686/

Abstract

US health systems receive up to 200 M monthly website visitors. Connecting patient searches to the appropriate workflow requires accurate classification. A dataset of searches on ~15 US health system websites was annotated, characterized, and used to train and evaluate a multi-label, multi-class, deep neural network. This classifier was deployed to health systems touching patients in all 50 states and compared to an LLM. The training dataset contained 504 unique classes with performance of the model in classifying searches among those classes ranging from ~0.90 to ~0.70 across metrics depending on the number of classes included. GPT-4 performed similarly if given a master list and demonstrated value in providing added coverage to augment the supervised classifier's performance. The collected data revealed characteristics of patient searches in the largest, multi-center, national study of US health systems to date.

摘要

美国医疗系统每月网站访问量高达2亿人次。将患者搜索与适当的工作流程相连接需要准确分类。对约15个美国医疗系统网站上的搜索数据集进行了注释、特征分析，并用于训练和评估一个多标签、多类别深度神经网络。该分类器被部署到覆盖美国所有50个州的医疗系统中，并与一个大型语言模型进行比较。训练数据集包含504个独特类别，模型在这些类别中对搜索进行分类的性能，根据所包含类别的数量，在各项指标上从约0.90到约0.70不等。如果给GPT-4一份主列表，它的表现类似，并在提供额外覆盖范围以增强监督分类器性能方面展现出价值。在这项迄今为止规模最大的关于美国医疗系统的多中心全国性研究中，所收集的数据揭示了患者搜索的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/78d4/12215686/183394e97347/41746_2025_1779_Fig1_HTML.jpg

相似文献

LLM enabled classification of patient self-reported symptoms and needs in health systems across the USA.基于大语言模型对美国各地卫生系统中患者自我报告的症状和需求进行分类。

NPJ Digit Med. 2025 Jul 1;8(1):390. doi: 10.1038/s41746-025-01779-9.

Sertindole for schizophrenia.用于治疗精神分裂症的舍吲哚。

Cochrane Database Syst Rev. 2005 Jul 20;2005(3):CD001715. doi: 10.1002/14651858.CD001715.pub2.

Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.暴力死亡监测——2022年全国暴力死亡报告系统，50个州、哥伦比亚特区和波多黎各

MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1.

Antidepressants for depression in adults with HIV infection.用于感染HIV的成年抑郁症患者的抗抑郁药。

Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD008525. doi: 10.1002/14651858.CD008525.pub3.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Efficacy of nicergoline in dementia and other age associated forms of cognitive impairment.尼麦角林治疗痴呆及其他与年龄相关的认知障碍形式的疗效。

Cochrane Database Syst Rev. 2001;2001(4):CD003159. doi: 10.1002/14651858.CD003159.

Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗：一项系统综述

Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.

Videolaryngoscopy versus direct laryngoscopy for adult patients requiring tracheal intubation.针对需要气管插管的成年患者，视频喉镜检查与直接喉镜检查的比较。

Cochrane Database Syst Rev. 2016 Nov 15;11(11):CD011136. doi: 10.1002/14651858.CD011136.pub2.

Interventions for patients and caregivers to improve knowledge of sickle cell disease and recognition of its related complications.针对患者及护理人员的干预措施，以提高对镰状细胞病的认识及其相关并发症的识别能力。

Cochrane Database Syst Rev. 2016 Oct 6;10(10):CD011175. doi: 10.1002/14651858.CD011175.pub2.

本文引用的文献

Identifying COVID-19 cases and extracting patient reported symptoms from Reddit using natural language processing.利用自然语言处理技术从 Reddit 上识别 COVID-19 病例并提取患者自述症状。

Sci Rep. 2023 Aug 22;13(1):13721. doi: 10.1038/s41598-023-39986-7.

Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache.利用自然语言处理技术自动对偏头痛或丛集性头痛患者的书面自述进行分类。

J Headache Pain. 2022 Sep 30;23(1):129. doi: 10.1186/s10194-022-01490-0.

Accuracy of online symptom checkers and the potential impact on service utilisation.在线症状检查器的准确性及其对服务利用的潜在影响。

PLoS One. 2021 Jul 15;16(7):e0254088. doi: 10.1371/journal.pone.0254088. eCollection 2021.

Use Characteristics and Triage Acuity of a Digital Symptom Checker in a Large Integrated Health System: Population-Based Descriptive Study.大型综合医疗系统中数字症状检查器的使用特征与分诊 acuity：基于人群的描述性研究。（注：这里“acuity”可能是特定语境下的专业术语，直接保留英文以便准确传达原文信息，具体含义需结合专业领域进一步理解。）

J Med Internet Res. 2020 Nov 30;22(11):e20549. doi: 10.2196/20549.

Assessment of the Frequency of Online Searches for Symptoms Before Diagnosis: Analysis of Archival Data.诊断前在线搜索症状的频率评估：档案数据分析

J Med Internet Res. 2020 Mar 6;22(3):e15065. doi: 10.2196/15065.

Waste in the US Health Care System: Estimated Costs and Potential for Savings.美国医疗体系中的浪费：估计成本和节约潜力。

JAMA. 2019 Oct 15;322(15):1501-1509. doi: 10.1001/jama.2019.13978.

A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data.基于电子患者自报告文本数据的症状自然语言处理和文本挖掘的系统评价。

Int J Med Inform. 2019 May;125:37-46. doi: 10.1016/j.ijmedinf.2019.02.008. Epub 2019 Feb 20.

Understanding Health Information Technology Induced Medication Safety Events by Two Conceptual Frameworks.理解两个概念框架下的健康信息技术导致的药物安全事件。

Appl Clin Inform. 2019 Jan;10(1):158-167. doi: 10.1055/s-0039-1678693. Epub 2019 Mar 6.

Safety of patient-facing digital symptom checkers.面向患者的数字症状检查器的安全性。

Lancet. 2018 Nov 24;392(10161):2263-2264. doi: 10.1016/S0140-6736(18)32819-8. Epub 2018 Nov 6.

Health Care Spending in the United States and Other High-Income Countries.美国和其他高收入国家的医疗保健支出。

JAMA. 2018 Mar 13;319(10):1024-1039. doi: 10.1001/jama.2018.1150.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于大语言模型对美国各地卫生系统中患者自我报告的症状和需求进行分类。

LLM enabled classification of patient self-reported symptoms and needs in health systems across the USA.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献