Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH.

Author information

Nwadiugwu Martin C

Affiliation

Department of Biomedical Informatics, University of Nebraska Omaha, Omaha, NE, USA.

Publication information

Bioinform Biol Insights. 2020 Apr 1;14:1177932220909851. doi: 10.1177/1177932220909851. eCollection 2020.

Abstract

This study compares 3 clustering algorithms that can be used in gene-based bioinformatics research to understand disease networks, protein-protein interaction networks, and gene expression data. The 3 gene-based clustering algorithms selected were Denclue, Fuzzy-C, and Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH). These algorithms were explored in relation to the subfield of bioinformatics that analyzes omics data, which include but are not limited to genomics, proteomics, metagenomics, transcriptomics, and metabolomics data. The objective was to compare the efficacy of the 3 algorithms and determine their strengths and drawbacks. The results of the review showed that, unlike Denclue and Fuzzy-C, which are more efficient in handling noisy data, BIRCH can handle data sets with outliers and has a better time complexity.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7300/7133071/8935355d99eb/10.1177_1177932220909851-fig1.jpg
