Lu Shiheng, Wang Hui, Zhang Jian
Department of Ophthalmology, Shanghai Eye Disease Prevention and Treatment Center, Shanghai Eye Hospital, Shanghai, China.
Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Front Mol Neurosci. 2022 Sep 8;15:1007352. doi: 10.3389/fnmol.2022.1007352. eCollection 2022.
Uveitis is a typical type of eye inflammation affecting the middle layer of eye (i.e., uvea layer) and can lead to blindness in middle-aged and young people. Therefore, a comprehensive study determining the disease susceptibility and the underlying mechanisms for uveitis initiation and progression is urgently needed for the development of effective treatments. In the present study, 108 uveitis-related genes are collected on the basis of literature mining, and 17,560 other human genes are collected from the Ensembl database, which are treated as non-uveitis genes. Uveitis- and non-uveitis-related genes are then encoded by gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment scores based on the genes and their neighbors in STRING, resulting in 20,681 GO term features and 297 KEGG pathway features. Subsequently, we identify functions and biological processes that can distinguish uveitis-related genes from other human genes by using an integrated feature selection method, which incorporate feature filtering method (Boruta) and four feature importance assessment methods (i.e., LASSO, LightGBM, MCFS, and mRMR). Some essential GO terms and KEGG pathways related to uveitis, such as GO:0001841 (neural tube formation), has04612 (antigen processing and presentation in human beings), and GO:0043379 (memory T cell differentiation), are identified. The plausibility of the association of mined functional features with uveitis is verified on the basis of the literature. Overall, several advanced machine learning methods are used in the current study to uncover specific functions of uveitis and provide a theoretical foundation for the clinical treatment of uveitis.
葡萄膜炎是一种典型的眼部炎症,影响眼睛的中层(即葡萄膜层),可导致中青年失明。因此,迫切需要进行一项全面研究,以确定葡萄膜炎的疾病易感性以及发病和进展的潜在机制,从而开发有效的治疗方法。在本研究中,基于文献挖掘收集了108个与葡萄膜炎相关的基因,并从Ensembl数据库中收集了17560个其他人类基因,将其作为非葡萄膜炎基因。然后,基于基因及其在STRING中的邻域,通过基因本体论(GO)和京都基因与基因组百科全书(KEGG)富集分数对葡萄膜炎相关基因和非葡萄膜炎相关基因进行编码,得到20681个GO术语特征和297个KEGG通路特征。随后,我们使用一种集成特征选择方法来识别能够区分葡萄膜炎相关基因与其他人类基因的功能和生物学过程,该方法结合了特征过滤方法(Boruta)和四种特征重要性评估方法(即LASSO、LightGBM、MCFS和mRMR)。确定了一些与葡萄膜炎相关的重要GO术语和KEGG通路,如GO:0001841(神经管形成)、has04612(人类抗原加工和呈递)和GO:0043379(记忆T细胞分化)。基于文献验证了挖掘出的功能特征与葡萄膜炎关联的合理性。总体而言,本研究使用了几种先进的机器学习方法来揭示葡萄膜炎的特定功能,并为葡萄膜炎的临床治疗提供理论基础。