Comparison and evaluation of network clustering algorithms applied to genetic interaction networks.

Author information

Hou Lin, Wang Lin, Berg Arthur, Qian Minping, Zhu Yunping, Li Fangting, Deng Minghua

Affiliations

LMAM, School of Mathematical Sciences, Peking University, Beijing 100871, China.

Publication information

Front Biosci (Elite Ed). 2012 Jan 1;4(6):2150-61. doi: 10.2741/e532.

DOI:10.2741/e532

PMID:22202027

Abstract

Network clustering algorithms aim to detect dense clusters in a network and provide a first step towards understanding large-scale biological networks. With numerous recent advances in biotechnology, large-scale genetic interaction data are widely available, but there is limited understanding of which clustering algorithms are most effective. To address this problem, we conducted a systematic study to compare and evaluate six clustering algorithms for analyzing genetic interaction networks, and investigated the factors that influence the choice of algorithm. The algorithms considered in this comparison are hierarchical clustering, topological overlap matrix, bi-clustering, Markov clustering, Bayesian discriminant analysis based community detection, and the variational Bayes approach to modularity. Both experimentally identified and synthetically constructed networks were used in this comparison. The accuracy of the algorithms was measured by the Jaccard index between predicted gene modules and benchmark gene sets. The results suggest that the best choice of algorithm depends on the network topology and the evaluation criteria. Hierarchical clustering proved best at predicting protein complexes; Bayesian discriminant analysis based community detection proved best on epistatic miniarray profile (EMAP) datasets; the variational Bayes approach to modularity was noticeably better than the other algorithms on genome-scale networks.
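As an illustration of the evaluation measure named in the abstract, the following is a minimal Python sketch of the Jaccard index between a predicted gene module and a benchmark gene set. The abstract does not say how predicted modules are paired with benchmark sets, so the best-match scoring in `best_match_scores` is an assumption made here for illustration only, and the gene identifiers are made up.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; taken as 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def best_match_scores(predicted: List[Set[str]],
                      benchmark: List[Set[str]]) -> List[float]:
    """For each benchmark set, the highest Jaccard index over all predicted modules.

    Assumption: best-match pairing is a hypothetical scoring scheme,
    not necessarily the one used in the paper.
    """
    return [max((jaccard(module, ref) for module in predicted), default=0.0)
            for ref in benchmark]


# Toy data with invented gene identifiers.
predicted_modules = [{"g1", "g2", "g3"}, {"g4", "g5"}]
benchmark_sets = [{"g1", "g2"}, {"g4", "g5", "g6", "g7"}]

scores = best_match_scores(predicted_modules, benchmark_sets)
mean_score = sum(scores) / len(scores)
print(scores, mean_score)
```

A score of 1 means a predicted module coincides exactly with a benchmark set, and 0 means they share no genes; averaging the best-match scores gives one possible summary number per algorithm.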


Similar articles

Comparison and evaluation of network clustering algorithms applied to genetic interaction networks.

Front Biosci (Elite Ed). 2012 Jan 1;4(6):2150-61. doi: 10.2741/e532.

Modular analysis of the probabilistic genetic interaction network.

Bioinformatics. 2011 Mar 15;27(6):853-9. doi: 10.1093/bioinformatics/btr031. Epub 2011 Jan 28.

Resolving the structure of interactomes with hierarchical agglomerative clustering.

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2105-12-S1-S44.

MultiSimNeNc: A network representation learning-based module identification method by network embedding and clustering.

Comput Biol Med. 2023 Apr;156:106703. doi: 10.1016/j.compbiomed.2023.106703. Epub 2023 Feb 24.

Imputing missing values for genetic interaction data.

Methods. 2014 Jun 1;67(3):269-77. doi: 10.1016/j.ymeth.2014.03.032. Epub 2014 Apr 6.

Network inference with ensembles of bi-clustering trees.

BMC Bioinformatics. 2019 Oct 28;20(1):525. doi: 10.1186/s12859-019-3104-y.

K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks.

Genes (Basel). 2021 Jan 12;12(1):87. doi: 10.3390/genes12010087.

Clustering approaches for visual knowledge exploration in molecular interaction networks.

BMC Bioinformatics. 2018 Aug 29;19(1):308. doi: 10.1186/s12859-018-2314-z.

CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks.

Int J Mol Sci. 2017 Aug 31;18(9):1880. doi: 10.3390/ijms18091880.

A structural approach for finding functional modules from large biological networks.

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S19. doi: 10.1186/1471-2105-9-S9-S19.

Cited by

Machine Learning Prediction of Adenovirus D8 Conjunctivitis Complications from Viral Whole-Genome Sequence.

Ophthalmol Sci. 2022 May 10;2(4):100166. doi: 10.1016/j.xops.2022.100166. eCollection 2022 Dec.

Network pharmacology: a new approach for Chinese herbal medicine research.

Evid Based Complement Alternat Med. 2013;2013:621423. doi: 10.1155/2013/621423. Epub 2013 May 20.
