• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

堆叠集成学习模型利用来自宏转录组学的宿主转录组数据诊断肺部感染。

Stacking ensemble learning models diagnose pulmonary infections using host transcriptome data from metatranscriptomics.

作者信息

Zhang Tian, Deng Ying, Wang Wentao, Zhao Zhe, Wu Yiling, Wang Haoqian, Xia Shutao, Liao Weifang, Liao Weijie

机构信息

Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, People's Republic of China.

Peng Cheng Laboratory, Shenzhen, China.

出版信息

Sci Rep. 2025 Aug 20;15(1):30516. doi: 10.1038/s41598-025-15914-9.

DOI:10.1038/s41598-025-15914-9
PMID:40835893
Abstract

The prompt diagnosis of pulmonary infections with unknown etiology in patients in severe condition remains a challenge due to the lack of rapid and effective diagnostic methods. While metatranscriptomic sequencing offers a powerful approach, its clinical utility is often limited by issues of timeliness. In this study, we conducted metatranscriptomic sequencing on bronchoalveolar lavage fluid (BALF) collected from critically ill, severely ill, and ICU patients. Based on microbial detection results, patients were classified into four types: negative, bacterial infection, viral infection, and fungal infection. To identify host gene expression signatures associated with infection, we screened characteristic genes from human metatranscriptomic data by comparing 70% of patients with confirmed infections vs. non-infections. Leveraging these characteristic genes, we constructed classification sub-models employing 13 types of machine learning algorithms, and we further integrated these sub-models into stacking-based ensemble models with Lasso regression, resulting in diagnostic models that required only a small set of gene expression inputs. The average performance of five-fold cross-validation demonstrated high diagnostic accuracy: distinguishing infection from non-infection (AUC = 0.984), bacterial infection from non-bacterial infection (AUC = 0.98), and viral infection from non- viral infection (AUC = 0.98). Test cohorts' results demonstrated the method's high diagnostic accuracy consistency with metatranscriptomic sequencing in discerning patient infection status (AUC = 0.865) and the type of infection (viral: AUC = 0.934, bacterial: AUC = 0.871). Our study presented a rapid and inexpensive adjunctive diagnostic strategy that achieves diagnostic accuracy comparable to metatranscriptomic sequencing, enabling timely identification of both infection status and type in pulmonary infections.

摘要

由于缺乏快速有效的诊断方法,对重症患者的不明病因肺部感染进行及时诊断仍然是一项挑战。虽然宏转录组测序提供了一种强大的方法,但其临床应用往往受到及时性问题的限制。在本研究中,我们对从危重症、重症和ICU患者收集的支气管肺泡灌洗液(BALF)进行了宏转录组测序。根据微生物检测结果,将患者分为四类:阴性、细菌感染、病毒感染和真菌感染。为了识别与感染相关的宿主基因表达特征,我们通过比较70%确诊感染患者与未感染患者的人类宏转录组数据来筛选特征基因。利用这些特征基因,我们采用13种机器学习算法构建了分类子模型,并进一步将这些子模型整合到基于Lasso回归的堆叠集成模型中,从而得到仅需要少量基因表达输入的诊断模型。五折交叉验证的平均性能显示出高诊断准确性:区分感染与未感染(AUC = 0.984)、细菌感染与非细菌感染(AUC = 0.98)以及病毒感染与非病毒感染(AUC = 0.98)。测试队列的结果表明,该方法在辨别患者感染状态(AUC = 0.865)和感染类型(病毒:AUC = 0.934,细菌:AUC = 0.871)方面与宏转录组测序具有高度一致的诊断准确性。我们的研究提出了一种快速且廉价的辅助诊断策略,其诊断准确性与宏转录组测序相当,能够及时识别肺部感染的感染状态和类型。

相似文献

1
Stacking ensemble learning models diagnose pulmonary infections using host transcriptome data from metatranscriptomics.堆叠集成学习模型利用来自宏转录组学的宿主转录组数据诊断肺部感染。
Sci Rep. 2025 Aug 20;15(1):30516. doi: 10.1038/s41598-025-15914-9.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Performance of broad-spectrum targeted next-generation sequencing in lower respiratory tract infections in ICU patients: a prospective observational study.广谱靶向二代测序在重症监护病房患者下呼吸道感染中的应用:一项前瞻性观察研究。
Crit Care. 2025 Jun 4;29(1):226. doi: 10.1186/s13054-025-05470-z.
5
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
6
Proteomic profiling of the local and systemic immune response to pediatric respiratory viral infections.儿童呼吸道病毒感染局部和全身免疫反应的蛋白质组学分析
mSystems. 2025 Jan 21;10(1):e0133524. doi: 10.1128/msystems.01335-24. Epub 2024 Nov 29.
7
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
8
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
9
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.
10
The clinical utility of bronchoalveolar lavage fluid metagenomic next-generation sequencing in immunocompromised critically ill patients with invasive pulmonary aspergillosis: a multicenter retrospective study.支气管肺泡灌洗液体宏基因组下一代测序在免疫功能低下的侵袭性肺曲霉病重症患者中的临床应用:一项多中心回顾性研究
Microbiol Spectr. 2025 Jun 12:e0058425. doi: 10.1128/spectrum.00584-25.

本文引用的文献

1
KEGG: biological systems database as a model of the real world.京都基因与基因组百科全书(KEGG):作为现实世界模型的生物系统数据库。
Nucleic Acids Res. 2025 Jan 6;53(D1):D672-D677. doi: 10.1093/nar/gkae909.
2
Integrating host transcriptomic signatures for distinguishing autoimmune encephalitis in cerebrospinal fluid by metagenomic sequencing.通过宏基因组测序整合宿主转录组特征以区分脑脊液中的自身免疫性脑炎。
Cell Biosci. 2023 Jun 19;13(1):111. doi: 10.1186/s13578-023-01047-x.
3
Clinical Application and Evaluation of Metagenomic Next-Generation Sequencing in Pulmonary Infection with Pleural Effusion.
宏基因组下一代测序在伴有胸腔积液的肺部感染中的临床应用与评估
Infect Drug Resist. 2022 Jun 1;15:2813-2824. doi: 10.2147/IDR.S365757. eCollection 2022.
4
Metagenomics next-generation sequencing tests take the stage in the diagnosis of lower respiratory tract infections.宏基因组下一代测序检测在诊断下呼吸道感染中崭露头角。
J Adv Res. 2021 Sep 29;38:201-212. doi: 10.1016/j.jare.2021.09.012. eCollection 2022 May.
5
Integrating central nervous system metagenomics and host response for diagnosis of tuberculosis meningitis and its mimics.整合中枢神经系统宏基因组学与宿主反应用于结核性脑膜炎及其类似疾病的诊断。
Nat Commun. 2022 Mar 30;13(1):1675. doi: 10.1038/s41467-022-29353-x.
6
Metagenomic next-generation sequencing to identify pathogens and cancer in lung biopsy tissue.宏基因组下一代测序鉴定肺活检组织中的病原体和癌症。
EBioMedicine. 2021 Nov;73:103639. doi: 10.1016/j.ebiom.2021.103639. Epub 2021 Oct 23.
7
The lung microbiome: progress and promise.肺部微生物组:进展与前景。
J Clin Invest. 2021 Aug 2;131(15). doi: 10.1172/JCI150473.
8
IOBR: Multi-Omics Immuno-Oncology Biological Research to Decode Tumor Microenvironment and Signatures.IOBR:多组学免疫肿瘤生物学研究解码肿瘤微环境和特征。
Front Immunol. 2021 Jul 2;12:687975. doi: 10.3389/fimmu.2021.687975. eCollection 2021.
9
Metagenomics for surveillance of respiratory pathogens.宏基因组学在呼吸道病原体监测中的应用。
Nat Rev Microbiol. 2021 May;19(5):285. doi: 10.1038/s41579-021-00541-8.
10
Metagenomic Next-Generation Sequencing versus Traditional Pathogen Detection in the Diagnosis of Peripheral Pulmonary Infectious Lesions.宏基因组下一代测序与传统病原体检测在周围型肺部感染性病变诊断中的比较
Infect Drug Resist. 2020 Feb 19;13:567-576. doi: 10.2147/IDR.S235182. eCollection 2020.