Jia Peilin, Zhao Zhongming
Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA.
IEEE Int Workshop Genomic Signal Process Stat. 2011:131-134. doi: 10.1109/GENSiPS.2011.6169462.
The recent success of genome-wide association (GWA) studies has greatly expanded our understanding of many complex diseases by delivering previously unknown loci and genes. A large number of GWAS datasets have already been made available, with more being generated. To explore the underlying moderate and weak signals, we recently developed a network-based dense module search (DMS) method for identification of disease candidate genes from GWAS datasets, leveraging on the joint effect of multiple genes. DMS is designed to dynamically search for the best nodes in a step-wise fashion and, thus, could overcome the limitation of pre-defined gene sets. Here, we propose an improved version of DMS, the topologically-adjusted DMS, to facilitate the analysis of complex diseases. Building on the previous version of DMS, we improved the randomization process by taking into account the topological character, aiming to adjust the bias potentially caused by high-degree nodes in the whole network. We demonstrated the topologically-adjusted DMS algorithm in a GWAS dataset for schizophrenia. We found the improved DMS strategy could effectively identify candidate genes while reducing the burden of high-degree nodes. In our evaluation, we found more candidate genes identified by the topologically-adjusted DMS algorithm have been reported in the previous association studies, suggesting this new algorithm has better performance than the unweighted DMS algorithm. Finally, our functional analysis of the top module genes revealed that they are enriched in immune-related pathways.
全基因组关联(GWA)研究最近取得的成功,通过发现此前未知的基因座和基因,极大地拓展了我们对许多复杂疾病的认识。大量的GWA研究数据集已经可以获取,并且仍在不断产生更多数据。为了探索潜在的中等强度和弱信号,我们最近开发了一种基于网络的密集模块搜索(DMS)方法,用于从GWA研究数据集中识别疾病候选基因,该方法利用了多个基因的联合效应。DMS旨在以逐步的方式动态搜索最佳节点,从而克服预定义基因集的局限性。在此,我们提出了DMS的改进版本——拓扑调整后的DMS,以促进对复杂疾病的分析。在先前版本的DMS基础上,我们通过考虑拓扑特征改进了随机化过程,旨在调整整个网络中高度数节点可能导致的偏差。我们在一个精神分裂症的GWA研究数据集中展示了拓扑调整后的DMS算法。我们发现改进后的DMS策略能够有效识别候选基因,同时减轻高度数节点的负担。在我们的评估中,我们发现拓扑调整后的DMS算法识别出的更多候选基因在先前的关联研究中已有报道,这表明这种新算法比未加权的DMS算法具有更好的性能。最后,我们对顶级模块基因的功能分析表明,它们在免疫相关途径中富集。