基于基因的聚类算法：DBSCAN密度聚类算法、模糊C均值聚类算法和BIRCH算法的比较

Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH.

作者信息

Nwadiugwu Martin C

机构信息

Department of Biomedical Informatics, University of Nebraska Omaha, Omaha, NE, USA.

出版信息

Bioinform Biol Insights. 2020 Apr 1;14:1177932220909851. doi: 10.1177/1177932220909851. eCollection 2020.

DOI:10.1177/1177932220909851

PMID:32284672

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7133071/

Abstract

The current study seeks to compare 3 clustering algorithms that can be used in gene-based bioinformatics research to understand disease networks, protein-protein interaction networks, and gene expression data. Denclue, Fuzzy-C, and Balanced Iterative and Clustering using Hierarchies (BIRCH) were the 3 gene-based clustering algorithms selected. These algorithms were explored in relation to the subfield of bioinformatics that analyzes omics data, which include but are not limited to genomics, proteomics, metagenomics, transcriptomics, and metabolomics data. The objective was to compare the efficacy of the 3 algorithms and determine their strength and drawbacks. Result of the review showed that unlike Denclue and Fuzzy-C which are more efficient in handling noisy data, BIRCH can handle data set with outliers and have a better time complexity.

摘要

当前的研究旨在比较三种可用于基于基因的生物信息学研究的聚类算法，以了解疾病网络、蛋白质-蛋白质相互作用网络和基因表达数据。所选择的三种基于基因的聚类算法分别是Denclue算法、模糊C均值算法（Fuzzy-C）和平衡迭代分层聚类算法（BIRCH）。这些算法是针对生物信息学中分析组学数据的子领域进行探索的，组学数据包括但不限于基因组学、蛋白质组学、宏基因组学、转录组学和代谢组学数据。目的是比较这三种算法的有效性，并确定它们的优点和缺点。综述结果表明，与在处理噪声数据方面更有效的Denclue算法和模糊C均值算法不同，BIRCH算法可以处理含有异常值的数据集，并且具有更好的时间复杂度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7300/7133071/8935355d99eb/10.1177_1177932220909851-fig1.jpg

相似文献

Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH.基于基因的聚类算法：DBSCAN密度聚类算法、模糊C均值聚类算法和BIRCH算法的比较

Bioinform Biol Insights. 2020 Apr 1;14:1177932220909851. doi: 10.1177/1177932220909851. eCollection 2020.

VSClust: feature-based variance-sensitive clustering of omics data.VSClust：基于特征的组学数据方差敏感聚类。

Bioinformatics. 2018 Sep 1;34(17):2965-2972. doi: 10.1093/bioinformatics/bty224.

FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data.FLAME，一种用于分析DNA微阵列数据的新型模糊聚类方法。

BMC Bioinformatics. 2007 Jan 4;8:3. doi: 10.1186/1471-2105-8-3.

Apache Spark based kernelized fuzzy clustering framework for single nucleotide polymorphism sequence analysis.基于 Apache Spark 的核模糊聚类框架用于单核苷酸多态性序列分析。

Comput Biol Chem. 2021 Jun;92:107454. doi: 10.1016/j.compbiolchem.2021.107454. Epub 2021 Feb 10.

DCT-Yager FNN: a novel Yager-based fuzzy neural network with the discrete clustering technique.DCT-耶格模糊神经网络：一种基于离散聚类技术的新型耶格模糊神经网络。

IEEE Trans Neural Netw. 2008 Apr;19(4):625-44. doi: 10.1109/TNN.2007.911709.

The multisynapse neural network and its application to fuzzy clustering.多突触神经网络及其在模糊聚类中的应用。

IEEE Trans Neural Netw. 2002;13(3):600-18. doi: 10.1109/TNN.2002.1000127.

Rough-fuzzy clustering for grouping functionally similar genes from microarray data.基于粗糙模糊聚类的基因功能相似性分组方法研究

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):286-99. doi: 10.1109/TCBB.2012.103.

Hybrid fuzzy cluster ensemble framework for tumor clustering from biomolecular data.用于从生物分子数据中进行肿瘤聚类的混合模糊聚类集成框架。

IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):657-70. doi: 10.1109/TCBB.2013.59.

Alpha-cut implemented fuzzy clustering algorithms and switching regressions.实现了阿尔法切割的模糊聚类算法和切换回归。

IEEE Trans Syst Man Cybern B Cybern. 2008 Jun;38(3):588-603. doi: 10.1109/TSMCB.2008.915537.

Robust clustering by pruning outliers.通过修剪异常值进行稳健聚类。

IEEE Trans Syst Man Cybern B Cybern. 2003;33(6):983-98. doi: 10.1109/TSMCB.2003.816993.

引用本文的文献

Identifying inflammatory bowel disease subtypes: a comprehensive exploration of transcriptomic data and machine learning-based approaches.识别炎症性肠病亚型：对转录组数据和基于机器学习方法的全面探索

Therap Adv Gastroenterol. 2025 Aug 12;18:17562848251362391. doi: 10.1177/17562848251362391. eCollection 2025.

Image Features of Resting-State Functional Magnetic Resonance Imaging in Evaluating Poor Emotion and Sleep Quality in Patients with Chronic Pain under Artificial Intelligence Algorithm.基于人工智能算法评估慢性疼痛患者不良情绪和睡眠质量的静息态功能磁共振成像的图像特征。

Contrast Media Mol Imaging. 2022 Jan 4;2022:5002754. doi: 10.1155/2022/5002754. eCollection 2022.

本文引用的文献

Fuzzy Clustering Algorithm with Non-Neighborhood Spatial Information for Surface Roughness Measurement Based on the Reflected Aliasing Images.基于反射混叠图像的具有非邻域空间信息的模糊聚类算法用于表面粗糙度测量

Sensors (Basel). 2019 Jul 26;19(15):3285. doi: 10.3390/s19153285.

Prioritization of potential vaccine targets using comparative proteomics and designing of the chimeric multi-epitope vaccine against Pseudomonas aeruginosa.利用比较蛋白质组学对潜在疫苗靶标进行优先级排序，并设计针对铜绿假单胞菌的嵌合多表位疫苗。

Sci Rep. 2019 Mar 27;9(1):5240. doi: 10.1038/s41598-019-41496-4.

Clustering algorithms: A comparative approach.聚类算法：一种比较方法。

PLoS One. 2019 Jan 15;14(1):e0210236. doi: 10.1371/journal.pone.0210236. eCollection 2019.

Clustering multilayer omics data using MuNCut.使用 MuNCut 对多层组学数据进行聚类。

BMC Genomics. 2018 Mar 14;19(1):198. doi: 10.1186/s12864-018-4580-6.

Integrative clustering of multi-level 'omic data based on non-negative matrix factorization algorithm.基于非负矩阵分解算法的多组学数据的整合聚类

PLoS One. 2017 May 1;12(5):e0176278. doi: 10.1371/journal.pone.0176278. eCollection 2017.

Clustering Algorithms: Their Application to Gene Expression Data.聚类算法：它们在基因表达数据中的应用。

Bioinform Biol Insights. 2016 Nov 30;10:237-253. doi: 10.4137/BBI.S38316. eCollection 2016.

What to Do When K-Means Clustering Fails: A Simple yet Principled Alternative Algorithm.当K均值聚类失败时该怎么办：一种简单而有原则的替代算法。

PLoS One. 2016 Sep 26;11(9):e0162259. doi: 10.1371/journal.pone.0162259. eCollection 2016.

A new Growing Neural Gas for clustering data streams.一种用于数据流聚类的新型生长神经网络。

Neural Netw. 2016 Jun;78:36-50. doi: 10.1016/j.neunet.2016.02.003. Epub 2016 Feb 26.

Clustering Acoustic Segments Using Multi-Stage Agglomerative Hierarchical Clustering.使用多阶段凝聚层次聚类对声学片段进行聚类

PLoS One. 2015 Oct 30;10(10):e0141756. doi: 10.1371/journal.pone.0141756. eCollection 2015.

Multiple fuzzy c-means clustering algorithm in medical diagnosis.医学诊断中的多重模糊C均值聚类算法

Technol Health Care. 2015;23 Suppl 2:S519-27. doi: 10.3233/THC-150989.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于基因的聚类算法：DBSCAN密度聚类算法、模糊C均值聚类算法和BIRCH算法的比较

Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献