Zheng Hexiang, Gu Changgui, Yang Huijie
Department of Systems Science, Business School, University of Shanghai for Science and Technology, Shanghai, 200093, China.
Heliyon. 2024 Mar 1;10(5):e27070. doi: 10.1016/j.heliyon.2024.e27070. eCollection 2024 Mar 15.
Finding biomarker genes for complex diseases attracts persistent attention due to its application in clinics. In this paper, we propose a network-based method to obtain a set of biomarker genes. The key idea is to construct a gene co-expression network among sensitive genes and cluster the genes into different modules. For each module, we can identify its representative, i.e., the gene with the largest connectivity and the smallest average shortest path length to other genes within the module. We believe these representative genes could serve as a new set of potential biomarkers for diseases. As a typical example, we investigated Alzheimer's disease, obtaining a total of 16 potential representative genes, three of which belong to the non-transcriptome. A total of 11 out of these genes are found in literature from different perspectives and methods. The incipient groups were classified into two different subtypes using machine learning algorithms. We subjected the two subtypes to Gene Ontology analysis and Kyoto Encyclopedia of Genes and Genomes analysis with healthy groups and moderate groups, respectively. The two sub-type groups were involved in two different biological processes, demonstrating the validity of this approach. This method is disease-specific and independent; hence, it can be extended to classify other kinds of complex diseases.
由于生物标志物基因在临床中的应用,寻找复杂疾病的生物标志物基因一直备受关注。在本文中,我们提出了一种基于网络的方法来获取一组生物标志物基因。其核心思想是在敏感基因之间构建一个基因共表达网络,并将基因聚类到不同的模块中。对于每个模块,我们可以确定其代表基因,即与模块内其他基因具有最大连通性和最小平均最短路径长度的基因。我们认为这些代表基因可以作为一组新的潜在疾病生物标志物。作为一个典型例子,我们研究了阿尔茨海默病,共获得16个潜在的代表基因,其中3个属于非转录组。这些基因中共有11个在不同视角和方法的文献中被发现。使用机器学习算法将初始组分为两种不同的亚型。我们分别将这两种亚型与健康组和中度组进行基因本体分析和京都基因与基因组百科全书分析。这两个亚型组参与了两个不同的生物学过程,证明了该方法的有效性。该方法具有疾病特异性和独立性;因此,它可以扩展到对其他类型的复杂疾病进行分类。