*Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel; †Department of Gastroenterology and Liver Diseases, IBD Center, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel; and ‡Genetic Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel.
Inflamm Bowel Dis. 2017 Sep;23(9):1516-1523. doi: 10.1097/MIB.0000000000001222.
BACKGROUND: The inflammatory bowel diseases (IBDs) are chronic inflammatory disorders, associated with genetic, immunologic, and environmental factors. Although hundreds of genes are implicated in IBD etiology, it is likely that additional genes play a role in the disease process. We developed a machine learning-based gene prioritization method to identify novel IBD-risk genes. METHODS: Known IBD genes were collected from genome-wide association studies and annotated with expression and pathway information. Using these genes, a model was trained to identify IBD-risk genes. A comprehensive list of 16,390 genes was then scored and classified. RESULTS: Immune and inflammatory responses, as well as pathways such as cell adhesion, cytokine-cytokine receptor interaction, and sulfur metabolism were identified to be related to IBD. Scores predicted for IBD genes were significantly higher than those for non-IBD genes (P < 10). There was a significant association between the score and having an IBD publication (P < 10). Overall, 347 genes had a high prediction score (>0.8). A literature review of the genes, excluding those used to train the model, identified 67 genes without any publication concerning IBD. These genes represent novel candidate IBD-risk genes, which can be targeted in future studies. CONCLUSIONS: Our method successfully differentiated IBD-risk genes from non-IBD genes by using information from expression data and a multitude of gene annotations. Crucial features were defined, and we were able to detect novel candidate risk genes for IBD. These findings may help detect new IBD-risk genes and improve the understanding of IBD pathogenesis.
背景:炎症性肠病(IBD)是一种慢性炎症性疾病,与遗传、免疫和环境因素有关。尽管有数百个基因与 IBD 的病因有关,但很可能还有其他基因在疾病过程中发挥作用。我们开发了一种基于机器学习的基因优先级排序方法,以识别新的 IBD 风险基因。
方法:从全基因组关联研究中收集已知的 IBD 基因,并注释其表达和途径信息。使用这些基因,我们训练了一个模型来识别 IBD 风险基因。然后对包含 16390 个基因的综合列表进行评分和分类。
结果:免疫和炎症反应,以及细胞黏附、细胞因子-细胞因子受体相互作用和硫代谢等途径被认为与 IBD 有关。IBD 基因的预测分数明显高于非 IBD 基因(P<10)。评分与具有 IBD 文献发表之间存在显著关联(P<10)。总体而言,347 个基因的预测评分较高(>0.8)。对这些基因进行文献综述,排除用于训练模型的基因,发现有 67 个基因没有任何与 IBD 相关的文献报道。这些基因代表新的候选 IBD 风险基因,可作为未来研究的靶点。
结论:我们的方法通过使用表达数据和大量基因注释信息,成功地区分了 IBD 风险基因和非 IBD 基因。确定了关键特征,并能够检测到新的候选 IBD 风险基因。这些发现可能有助于发现新的 IBD 风险基因,并提高对 IBD 发病机制的理解。
Mayo Clin Proc. 2019-1
Curr Opin Gastroenterol. 2015-11
Saudi J Gastroenterol. 2025-7-1
Therap Adv Gastroenterol. 2024-5-10
United European Gastroenterol J. 2024-12
Brief Bioinform. 2024-7-25