整合宿主转录组生物标志物与大语言模型用于诊断下呼吸道感染。

Integrating a host transcriptomic biomarker with a large language model for diagnosis of lower respiratory tract infection.

作者信息

Van Phan Hoang, Spottiswoode Natasha, Lydon Emily C, Chu Victoria T, Cuesta Adolfo, Kazberouk Alexander D, Richmond Natalie L, Deosthale Padmini, Calfee Carolyn S, Langelier Charles R

机构信息

Department of Medicine, Division of Infectious Diseases, University of California San Francisco.

Department of Pediatrics, Division of Infectious Diseases and Global Health, University of California San Francisco.

出版信息

medRxiv. 2025 Apr 3:2024.08.28.24312732. doi: 10.1101/2024.08.28.24312732.

DOI:10.1101/2024.08.28.24312732

PMID:40236397

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11998817/

Abstract

BACKGROUND

Lower respiratory tract infections (LRTIs) are a leading cause of mortality worldwide and can be difficult to diagnose in critically ill patients, as non-infectious causes of respiratory failure can present with similar clinical features.

METHODS

We developed a LRTI diagnostic method combining the pulmonary transcriptomic biomarker with electronic medical record (EMR) text assessment using the large language model Generative Pre-trained Transformer 4 (GPT-4). We evaluated this approach in a prospective cohort of critically ill adults with acute respiratory failure from whom tracheal aspirate expression was measured by RNA sequencing. Patients with LRTI or non-infectious conditions were identified using retrospective, multi-physician clinical adjudication. We then confirmed our findings by applying this method to an independent validation cohort of 115 adults with acute respiratory failure.

RESULTS

In the derivation cohort, a combined classifier incorporating expression and GPT-4-assisted EMR analysis achieved an AUC of 0.93 (±0.08) and an accuracy of 84%, outperforming expression alone (AUC 0.84 ± 0.11) and GPT-4-based analysis alone (AUC 0.83 ± 0.07). By comparison, the primary medical team's admission diagnosis had an accuracy of 72%. In the validation cohort, the combined classifier yielded an AUC of 0.98 (±0.04) and an accuracy of 96%.

CONCLUSIONS

Integrating a host transcriptional biomarker with EMR text analysis using a large language model may offer a promising new approach to improving the diagnosis of LRTIs in critically ill adults.

摘要

背景

下呼吸道感染（LRTIs）是全球范围内主要的死亡原因之一，在重症患者中可能难以诊断，因为呼吸衰竭的非感染性病因可能表现出相似的临床特征。

方法

我们开发了一种下呼吸道感染诊断方法，该方法将肺部转录组生物标志物与使用大语言模型生成式预训练变换器4（GPT-4）的电子病历（EMR）文本评估相结合。我们在一组患有急性呼吸衰竭的重症成年患者的前瞻性队列中评估了这种方法，通过RNA测序测量了这些患者的气管吸出物表达。使用回顾性、多医生临床判定来识别患有下呼吸道感染或非感染性疾病的患者。然后，我们将此方法应用于115名患有急性呼吸衰竭的成年患者的独立验证队列，以证实我们的发现。

结果

在推导队列中，结合表达和GPT-4辅助EMR分析的联合分类器的曲线下面积（AUC）为0.93（±0.08），准确率为84%，优于单独的表达（AUC 0.84±0.11）和单独的基于GPT-4的分析（AUC 0.83±0.07）。相比之下，初级医疗团队的入院诊断准确率为72%。在验证队列中，联合分类器的AUC为0.98（±0.04），准确率为96%。

结论

将宿主转录生物标志物与使用大语言模型的EMR文本分析相结合，可能为改善重症成年患者下呼吸道感染的诊断提供一种有前景的新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/219b/11998817/75da2ab9f861/nihpp-2024.08.28.24312732v2-f0001.jpg

相似文献

Integrating a host transcriptomic biomarker with a large language model for diagnosis of lower respiratory tract infection.整合宿主转录组生物标志物与大语言模型用于诊断下呼吸道感染。

medRxiv. 2025 Apr 3:2024.08.28.24312732. doi: 10.1101/2024.08.28.24312732.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施：系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Carbon dioxide detection for diagnosis of inadvertent respiratory tract placement of enterogastric tubes in children.用于诊断儿童肠胃管意外置入呼吸道的二氧化碳检测

Cochrane Database Syst Rev. 2025 Feb 19;2(2):CD011196. doi: 10.1002/14651858.CD011196.pub2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

The potential of Generative Pre-trained Transformer 4 (GPT-4) to analyse medical notes in three different languages: a retrospective model-evaluation study.生成式预训练变换器4（GPT-4）分析三种不同语言医学笔记的潜力：一项回顾性模型评估研究。

Lancet Digit Health. 2025 Jan;7(1):e35-e43. doi: 10.1016/S2589-7500(24)00246-2.

Systemic Inflammatory Response Syndrome全身炎症反应综合征

Proteomic profiling of the local and systemic immune response to pediatric respiratory viral infections.儿童呼吸道病毒感染局部和全身免疫反应的蛋白质组学分析

mSystems. 2025 Jan 21;10(1):e0133524. doi: 10.1128/msystems.01335-24. Epub 2024 Nov 29.

Biomarkers as point-of-care tests to guide prescription of antibiotics in people with acute respiratory infections in primary care.生物标志物作为即时检测手段，指导初级保健中急性呼吸道感染患者使用抗生素的处方。

Cochrane Database Syst Rev. 2022 Oct 17;10(10):CD010130. doi: 10.1002/14651858.CD010130.pub3.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

Large Language Model Influence on Diagnostic Reasoning: A Randomized Clinical Trial.大语言模型对诊断推理的影响：一项随机临床试验。

JAMA Netw Open. 2024 Oct 1;7(10):e2440969. doi: 10.1001/jamanetworkopen.2024.40969.

Pulmonary Is an Inverse Biomarker of Pneumonia in Critically Ill Children and Adults.在危重症儿童和成人中，肺部是肺炎的反向生物标志物。

Am J Respir Crit Care Med. 2024 Dec 15;210(12):1480-1483. doi: 10.1164/rccm.202403-0516RL.

Evaluating GPT-V4 (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs.评估 GPT-V4（具有视觉功能的 GPT-4）在检测胸部 X 光片中放射学发现的能力。

Radiology. 2024 May;311(2):e233270. doi: 10.1148/radiol.233270.

Antibiotics Not Associated with Shorter Duration or Reduced Severity of Acute Lower Respiratory Tract Infection.抗生素与急性下呼吸道感染的持续时间缩短或严重程度降低无关。

J Gen Intern Med. 2024 Aug;39(10):1887-1894. doi: 10.1007/s11606-024-08758-y. Epub 2024 Apr 15.

Can Chatbot Artificial Intelligence Replace Infectious Diseases Physicians in the Management of Bloodstream Infections? A Prospective Cohort Study.人工智能聊天机器人能否在血流感染管理中取代传染病医生？一项前瞻性队列研究。

Clin Infect Dis. 2024 Apr 10;78(4):825-832. doi: 10.1093/cid/ciad632.

Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine. Reply.GPT-4作为医学人工智能聊天机器人的益处、局限性和风险。回复。

N Engl J Med. 2023 Jun 22;388(25):2400. doi: 10.1056/NEJMc2305286.

Integrated host/microbe metagenomics enables accurate lower respiratory tract infection diagnosis in critically ill children.整合宿主/微生物宏基因组学可实现重症患儿下呼吸道感染的准确诊断。

J Clin Invest. 2023 Apr 3;133(7):e165904. doi: 10.1172/JCI165904.

Antibiotic resistance associated with the COVID-19 pandemic: a systematic review and meta-analysis.与 COVID-19 大流行相关的抗生素耐药性：系统评价和荟萃分析。

Clin Microbiol Infect. 2023 Mar;29(3):302-309. doi: 10.1016/j.cmi.2022.12.006. Epub 2022 Dec 9.

Machine learning for patient risk stratification: standing on, or looking over, the shoulders of clinicians?用于患者风险分层的机器学习：是站在临床医生的肩膀上，还是俯瞰他们？

NPJ Digit Med. 2021 Mar 30;4(1):62. doi: 10.1038/s41746-021-00426-3.

Integrating host response and unbiased microbe detection for lower respiratory tract infection diagnosis in critically ill adults.整合宿主反应和非偏倚微生物检测以诊断危重症成人下呼吸道感染。

Proc Natl Acad Sci U S A. 2018 Dec 26;115(52):E12353-E12362. doi: 10.1073/pnas.1809700115. Epub 2018 Nov 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

整合宿主转录组生物标志物与大语言模型用于诊断下呼吸道感染。

Integrating a host transcriptomic biomarker with a large language model for diagnosis of lower respiratory tract infection.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献