Li Lanlan, Deng XuZai, Wang Shuge, Huang Tao
Department of Pediatrics, Tianyou Hospital Affiliated to Wuhan University of Science and Technology, Wuhan, China.
Department of Pediatrics, Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
Front Pharmacol. 2025 Apr 14;16:1545392. doi: 10.3389/fphar.2025.1545392. eCollection 2025.
Pediatric inflammatory bowel disease (IBD), especially Crohn's disease, significantly affects gut health and quality of life. Although gut microbiome research has advanced, identifying reliable biomarkers remains difficult due to microbial complexity.
We used RNA-seq-based microbial profiling and machine learning (ML) to find robust biomarkers in pediatric IBD. Microbial taxa were profiled at phylum, genus, and species levels using kraken2 on Crohn's disease and non-IBD ileal biopsies. We performed abundance-based analyses and applied four ML models (Logistic Regression, Random Forest, Support Vector Machine, XGBoost) to detect discriminative taxa. An independent cohort of 36 pediatric stool samples assessed by 16S rRNA sequencing validated top ML results.
Traditional abundance-based methods showed compositional shifts but identified few consistently significant taxa. ML models had better discriminatory performance, with XGBoost outperforming others and pinpointing Orthotospovirus and Vescimonas as key genera. These findings were confirmed in the validation cohort, where only one traditionally noted genus, , maintained significance.
Integrating conventional omics with AI-driven analytics boosts reproducibility and clinical relevance of microbial biomarker discovery, opening new possibilities for targeted therapies and precision medicine in pediatric IBD.
儿童炎症性肠病(IBD),尤其是克罗恩病,会显著影响肠道健康和生活质量。尽管肠道微生物组研究取得了进展,但由于微生物的复杂性,确定可靠的生物标志物仍然很困难。
我们使用基于RNA测序的微生物分析和机器学习(ML)来寻找儿童IBD中的强大生物标志物。使用kraken2在克罗恩病和非IBD回肠活检样本中,在门、属和种水平上对微生物分类群进行分析。我们进行了基于丰度的分析,并应用四种ML模型(逻辑回归、随机森林、支持向量机、XGBoost)来检测具有鉴别性的分类群。通过16S rRNA测序评估的36份儿童粪便样本的独立队列验证了顶级ML结果。
传统的基于丰度的方法显示了组成变化,但确定的一致显著的分类群很少。ML模型具有更好的鉴别性能,XGBoost的表现优于其他模型,并确定正呼肠孤病毒属和维西莫纳斯属为关键属。这些发现在验证队列中得到了证实,在该队列中,只有一个传统上提到的属保持了显著性。
将传统组学与人工智能驱动的分析相结合,提高了微生物生物标志物发现的可重复性和临床相关性,为儿童IBD的靶向治疗和精准医学开辟了新的可能性。