基于最小生成树邻域图的特征分析对相似基因进行功能分组。

Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph.

作者信息

Jothi R, Mohanty Sraban Kumar, Ojha Aparajita

机构信息

Indian Institute of Information Technology, Design and Manufacturing Jabalpur, Madhya Pradesh, India.

出版信息

Comput Biol Med. 2016 Apr 1;71:135-48. doi: 10.1016/j.compbiomed.2016.02.007. Epub 2016 Feb 21.

DOI:10.1016/j.compbiomed.2016.02.007

PMID:26945461

Abstract

Gene expression data clustering is an important biological process in DNA microarray analysis. Although there have been many clustering algorithms for gene expression analysis, finding a suitable and effective clustering algorithm is always a challenging problem due to the heterogeneous nature of gene profiles. Minimum Spanning Tree (MST) based clustering algorithms have been successfully employed to detect clusters of varying shapes and sizes. This paper proposes a novel clustering algorithm using Eigenanalysis on Minimum Spanning Tree based neighborhood graph (E-MST). As MST of a set of points reflects the similarity of the points with their neighborhood, the proposed algorithm employs a similarity graph obtained from k(') rounds of MST (k(')-MST neighborhood graph). By studying the spectral properties of the similarity matrix obtained from k(')-MST graph, the proposed algorithm achieves improved clustering results. We demonstrate the efficacy of the proposed algorithm on 12 gene expression datasets. Experimental results show that the proposed algorithm performs better than the standard clustering algorithms.

摘要

基因表达数据聚类是DNA微阵列分析中的一个重要生物学过程。尽管已经有许多用于基因表达分析的聚类算法，但由于基因图谱的异质性，找到一种合适且有效的聚类算法始终是一个具有挑战性的问题。基于最小生成树（MST）的聚类算法已成功用于检测各种形状和大小的簇。本文提出了一种基于最小生成树邻域图的特征分析的新型聚类算法（E-MST）。由于一组点的MST反映了这些点与其邻域的相似性，因此该算法采用从k(')轮MST获得的相似性图（k(')-MST邻域图）。通过研究从k(')-MST图获得的相似性矩阵的谱特性，该算法取得了改进的聚类结果。我们在12个基因表达数据集上证明了该算法的有效性。实验结果表明，该算法的性能优于标准聚类算法。

相似文献

Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph.基于最小生成树邻域图的特征分析对相似基因进行功能分组。

Comput Biol Med. 2016 Apr 1;71:135-48. doi: 10.1016/j.compbiomed.2016.02.007. Epub 2016 Feb 21.

Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees.使用图论方法对基因表达数据进行聚类：最小生成树的应用

Bioinformatics. 2002 Apr;18(4):536-45. doi: 10.1093/bioinformatics/18.4.536.

Minimum spanning trees for gene expression data clustering.用于基因表达数据聚类的最小生成树

Genome Inform. 2001;12:24-33.

Clustering high throughput biological data with B-MST, a minimum spanning tree based heuristic.使用基于最小生成树的启发式算法B-MST对高通量生物数据进行聚类。

Comput Biol Med. 2015 Jul;62:94-102. doi: 10.1016/j.compbiomed.2015.03.031. Epub 2015 Apr 14.

Parallel clustering algorithm for large data sets with applications in bioinformatics.用于大数据集的并行聚类算法及其在生物信息学中的应用

IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):344-52. doi: 10.1109/TCBB.2007.70272.

A dynamically growing self-organizing tree (DGSOT) for hierarchical clustering gene expression profiles.一种用于分层聚类基因表达谱的动态生长自组织树（DGSOT）。

Bioinformatics. 2004 Nov 1;20(16):2605-17. doi: 10.1093/bioinformatics/bth292. Epub 2004 May 6.

Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类：性能与相似性分析

BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.

Clustering binary fingerprint vectors with missing values for DNA array data analysis.用于DNA阵列数据分析的带有缺失值的二元指纹向量聚类

Proc IEEE Comput Soc Bioinform Conf. 2003;2:38-47.

Comput Biol Med. 2015 Sep;64:236-45. doi: 10.1016/j.compbiomed.2015.07.008. Epub 2015 Jul 21.

Evaluation of clustering algorithms for gene expression data.基因表达数据聚类算法的评估

BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S17. doi: 10.1186/1471-2105-7-S4-S17.

引用本文的文献

Influence of multi-species data on gene-disease associations in substance use disorder using random walk with restart models.使用带重启的随机游走模型的多物种数据对物质使用障碍中基因-疾病关联的影响

PLoS One. 2025 Jun 16;20(6):e0325201. doi: 10.1371/journal.pone.0325201. eCollection 2025.

A hybrid multi-objective whale optimization algorithm for analyzing microarray data based on Apache Spark.一种基于Apache Spark的用于分析微阵列数据的混合多目标鲸鱼优化算法。

PeerJ Comput Sci. 2021 Mar 25;7:e416. doi: 10.7717/peerj-cs.416. eCollection 2021.

A multi-objective gene clustering algorithm guided by apriori biological knowledge with intensification and diversification strategies.一种由先验生物学知识引导的多目标基因聚类算法，具备强化和多样化策略。

BioData Min. 2018 Aug 7;11:16. doi: 10.1186/s13040-018-0178-4. eCollection 2018.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于最小生成树邻域图的特征分析对相似基因进行功能分组。

Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献