• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于最小生成树邻域图的特征分析对相似基因进行功能分组。

Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph.

作者信息

Jothi R, Mohanty Sraban Kumar, Ojha Aparajita

机构信息

Indian Institute of Information Technology, Design and Manufacturing Jabalpur, Madhya Pradesh, India.

出版信息

Comput Biol Med. 2016 Apr 1;71:135-48. doi: 10.1016/j.compbiomed.2016.02.007. Epub 2016 Feb 21.

DOI:10.1016/j.compbiomed.2016.02.007
PMID:26945461
Abstract

Gene expression data clustering is an important biological process in DNA microarray analysis. Although there have been many clustering algorithms for gene expression analysis, finding a suitable and effective clustering algorithm is always a challenging problem due to the heterogeneous nature of gene profiles. Minimum Spanning Tree (MST) based clustering algorithms have been successfully employed to detect clusters of varying shapes and sizes. This paper proposes a novel clustering algorithm using Eigenanalysis on Minimum Spanning Tree based neighborhood graph (E-MST). As MST of a set of points reflects the similarity of the points with their neighborhood, the proposed algorithm employs a similarity graph obtained from k(') rounds of MST (k(')-MST neighborhood graph). By studying the spectral properties of the similarity matrix obtained from k(')-MST graph, the proposed algorithm achieves improved clustering results. We demonstrate the efficacy of the proposed algorithm on 12 gene expression datasets. Experimental results show that the proposed algorithm performs better than the standard clustering algorithms.

摘要

基因表达数据聚类是DNA微阵列分析中的一个重要生物学过程。尽管已经有许多用于基因表达分析的聚类算法,但由于基因图谱的异质性,找到一种合适且有效的聚类算法始终是一个具有挑战性的问题。基于最小生成树(MST)的聚类算法已成功用于检测各种形状和大小的簇。本文提出了一种基于最小生成树邻域图的特征分析的新型聚类算法(E-MST)。由于一组点的MST反映了这些点与其邻域的相似性,因此该算法采用从k(')轮MST获得的相似性图(k(')-MST邻域图)。通过研究从k(')-MST图获得的相似性矩阵的谱特性,该算法取得了改进的聚类结果。我们在12个基因表达数据集上证明了该算法的有效性。实验结果表明,该算法的性能优于标准聚类算法。

相似文献

1
Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph.基于最小生成树邻域图的特征分析对相似基因进行功能分组。
Comput Biol Med. 2016 Apr 1;71:135-48. doi: 10.1016/j.compbiomed.2016.02.007. Epub 2016 Feb 21.
2
Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees.使用图论方法对基因表达数据进行聚类:最小生成树的应用
Bioinformatics. 2002 Apr;18(4):536-45. doi: 10.1093/bioinformatics/18.4.536.
3
Minimum spanning trees for gene expression data clustering.用于基因表达数据聚类的最小生成树
Genome Inform. 2001;12:24-33.
4
Clustering high throughput biological data with B-MST, a minimum spanning tree based heuristic.使用基于最小生成树的启发式算法B-MST对高通量生物数据进行聚类。
Comput Biol Med. 2015 Jul;62:94-102. doi: 10.1016/j.compbiomed.2015.03.031. Epub 2015 Apr 14.
5
Parallel clustering algorithm for large data sets with applications in bioinformatics.用于大数据集的并行聚类算法及其在生物信息学中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):344-52. doi: 10.1109/TCBB.2007.70272.
6
A dynamically growing self-organizing tree (DGSOT) for hierarchical clustering gene expression profiles.一种用于分层聚类基因表达谱的动态生长自组织树(DGSOT)。
Bioinformatics. 2004 Nov 1;20(16):2605-17. doi: 10.1093/bioinformatics/bth292. Epub 2004 May 6.
7
Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类:性能与相似性分析
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.
8
Clustering binary fingerprint vectors with missing values for DNA array data analysis.用于DNA阵列数据分析的带有缺失值的二元指纹向量聚类
Proc IEEE Comput Soc Bioinform Conf. 2003;2:38-47.
9
Similarity-balanced discriminant neighbor embedding and its application to cancer classification based on gene expression data.基于基因表达数据的相似性平衡判别近邻嵌入及其在癌症分类中的应用。
Comput Biol Med. 2015 Sep;64:236-45. doi: 10.1016/j.compbiomed.2015.07.008. Epub 2015 Jul 21.
10
Evaluation of clustering algorithms for gene expression data.基因表达数据聚类算法的评估
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S17. doi: 10.1186/1471-2105-7-S4-S17.

引用本文的文献

1
Influence of multi-species data on gene-disease associations in substance use disorder using random walk with restart models.使用带重启的随机游走模型的多物种数据对物质使用障碍中基因-疾病关联的影响
PLoS One. 2025 Jun 16;20(6):e0325201. doi: 10.1371/journal.pone.0325201. eCollection 2025.
2
A hybrid multi-objective whale optimization algorithm for analyzing microarray data based on Apache Spark.一种基于Apache Spark的用于分析微阵列数据的混合多目标鲸鱼优化算法。
PeerJ Comput Sci. 2021 Mar 25;7:e416. doi: 10.7717/peerj-cs.416. eCollection 2021.
3
A multi-objective gene clustering algorithm guided by apriori biological knowledge with intensification and diversification strategies.
一种由先验生物学知识引导的多目标基因聚类算法,具备强化和多样化策略。
BioData Min. 2018 Aug 7;11:16. doi: 10.1186/s13040-018-0178-4. eCollection 2018.