Yu Shicheng, Zhang Mengxian, Ye Zhaofeng, Wang Yalong, Wang Xu, Chen Ye-Guang
Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, 190 Kaiyuan Avenue, Guangzhou Science Park, Luogang District, Guangzhou, 510530, China.
Guangzhou Laboratory, Guangzhou, 510700, China.
Cell Regen. 2023 Jan 5;12(1):8. doi: 10.1186/s13619-022-00143-6.
Inflammatory bowel disease (IBD) is a chronic inflammatory condition caused by multiple genetic and environmental factors. Numerous genes are implicated in the etiology of IBD, but the diagnosis of IBD is challenging. Here, XGBoost, a machine learning prediction model, has been used to distinguish IBD from healthy cases following elaborative feature selection. Using combined unsupervised clustering analysis and the XGBoost feature selection method, we successfully identified a 32-gene signature that can predict IBD occurrence in new cohorts with 0.8651 accuracy. The signature shows enrichment in neutrophil extracellular trap formation and cytokine signaling in the immune system. The probability threshold of the XGBoost-based classification model can be adjusted to fit personalized lifestyle and health status. Therefore, this study reveals potential IBD-related biomarkers that facilitate an effective personalized diagnosis of IBD.
炎症性肠病(IBD)是一种由多种遗传和环境因素引起的慢性炎症性疾病。许多基因与IBD的病因有关,但IBD的诊断具有挑战性。在此,一种机器学习预测模型XGBoost已被用于在精心进行特征选择后将IBD与健康病例区分开来。通过结合无监督聚类分析和XGBoost特征选择方法,我们成功识别出一个32基因特征,该特征能够以0.8651的准确率预测新队列中IBD的发生。该特征在中性粒细胞胞外陷阱形成和免疫系统中的细胞因子信号传导方面表现出富集。基于XGBoost的分类模型的概率阈值可以进行调整,以适应个性化的生活方式和健康状况。因此,本研究揭示了潜在的IBD相关生物标志物,有助于对IBD进行有效的个性化诊断。