• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自适应双自组织映射对基因表达数据进行聚类。

Clustering gene expression data using adaptive double self-organizing map.

作者信息

Ressom Habtom, Wang Dali, Natarajan Padma

机构信息

Intelligent Systems Laboratory, Department of Electrical and Computer Engineering, University of Maine, Orono, Maine 04469, USA.

出版信息

Physiol Genomics. 2003 Jun 24;14(1):35-46. doi: 10.1152/physiolgenomics.00138.2002.

DOI:10.1152/physiolgenomics.00138.2002
PMID:12672901
Abstract

This paper presents a novel clustering technique known as adaptive double self-organizing map (ADSOM). ADSOM has a flexible topology and performs clustering and cluster visualization simultaneously, thereby requiring no a priori knowledge about the number of clusters. ADSOM is developed based on a recently introduced technique known as double self-organizing map (DSOM). DSOM combines features of the popular self-organizing map (SOM) with two-dimensional position vectors, which serve as a visualization tool to decide how many clusters are needed. Although DSOM addresses the problem of identifying unknown number of clusters, its free parameters are difficult to control to guarantee correct results and convergence. ADSOM updates its free parameters during training, and it allows convergence of its position vectors to a fairly consistent number of clusters provided that its initial number of nodes is greater than the expected number of clusters. The number of clusters can be identified by visually counting the clusters formed by the position vectors after training. A novel index is introduced based on hierarchical clustering of the final locations of position vectors. The index allows automated detection of the number of clusters, thereby reducing human error that could be incurred from counting clusters visually. The reliance of ADSOM in identifying the number of clusters is proven by applying it to publicly available gene expression data from multiple biological systems such as yeast, human, and mouse. ADSOM's performance in detecting number of clusters is compared with a model-based clustering method.

摘要

本文提出了一种名为自适应双自组织映射(ADSOM)的新型聚类技术。ADSOM具有灵活的拓扑结构,可同时执行聚类和聚类可视化,因此无需关于聚类数量的先验知识。ADSOM是基于最近引入的一种称为双自组织映射(DSOM)的技术开发的。DSOM将流行的自组织映射(SOM)的特征与二维位置向量相结合,二维位置向量用作一种可视化工具来确定需要多少个聚类。尽管DSOM解决了识别未知聚类数量的问题,但其自由参数难以控制以保证正确的结果和收敛。ADSOM在训练过程中更新其自由参数,并且只要其初始节点数量大于预期的聚类数量,它就能使位置向量收敛到相当一致数量的聚类。聚类数量可以通过在训练后直观地数出由位置向量形成的聚类来确定。基于位置向量最终位置的层次聚类引入了一个新的指标。该指标允许自动检测聚类数量,从而减少因直观计数聚类可能产生的人为误差。通过将ADSOM应用于来自酵母、人类和小鼠等多个生物系统的公开可用基因表达数据,证明了ADSOM在识别聚类数量方面的可靠性。将ADSOM在检测聚类数量方面的性能与基于模型的聚类方法进行了比较。

相似文献

1
Clustering gene expression data using adaptive double self-organizing map.使用自适应双自组织映射对基因表达数据进行聚类。
Physiol Genomics. 2003 Jun 24;14(1):35-46. doi: 10.1152/physiolgenomics.00138.2002.
2
Adaptive double self-organizing maps for clustering gene expression profiles.用于聚类基因表达谱的自适应双自组织映射
Neural Netw. 2003 Jun-Jul;16(5-6):633-40. doi: 10.1016/S0893-6080(03)00102-3.
3
Selection of informative clusters from hierarchical cluster tree with gene classes.从带有基因类别的层次聚类树中选择信息性聚类
BMC Bioinformatics. 2004 Mar 25;5:32. doi: 10.1186/1471-2105-5-32.
4
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
5
Time-dependent changes in ARE-driven gene expression by use of a noise-filtering process for microarray data.通过对微阵列数据使用噪声过滤过程,分析随时间变化的ARE驱动基因表达。
Physiol Genomics. 2002;9(3):137-44. doi: 10.1152/physiolgenomics.00003.2002. Epub 2002 Apr 9.
6
Time-synchronized clustering of gene expression trajectories.基因表达轨迹的时间同步聚类
Biostatistics. 2009 Jan;10(1):32-45. doi: 10.1093/biostatistics/kxn011. Epub 2008 May 22.
7
DNA microarray data and contextual analysis of correlation graphs.DNA微阵列数据与相关图的背景分析
BMC Bioinformatics. 2003 Apr 29;4:15. doi: 10.1186/1471-2105-4-15.
8
Cluster stability scores for microarray data in cancer studies.癌症研究中微阵列数据的聚类稳定性评分。
BMC Bioinformatics. 2003 Sep 6;4:36. doi: 10.1186/1471-2105-4-36.
9
Extracting the hidden features in saline osmotic tolerance in Saccharomyces cerevisiae from DNA microarray data using the self-organizing map: biosynthesis of amino acids.利用自组织映射从DNA微阵列数据中提取酿酒酵母盐渗透耐受性的隐藏特征:氨基酸的生物合成
Appl Microbiol Biotechnol. 2007 May;75(2):415-26. doi: 10.1007/s00253-007-0837-8. Epub 2007 Jan 30.
10
Iterative class discovery and feature selection using Minimal Spanning Trees.使用最小生成树的迭代类发现和特征选择
BMC Bioinformatics. 2004 Sep 8;5:126. doi: 10.1186/1471-2105-5-126.

引用本文的文献

1
Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections.感染期间时空蛋白质组变化的生物信息学分析
Front Genet. 2021 Jul 2;12:667936. doi: 10.3389/fgene.2021.667936. eCollection 2021.
2
Feasibility of whole RNA sequencing from single-cell mRNA amplification.单细胞mRNA扩增进行全RNA测序的可行性。
Genet Res Int. 2013;2013:724124. doi: 10.1155/2013/724124. Epub 2013 Dec 23.
3
Genome signatures, self-organizing maps and higher order phylogenies: a parametric analysis.基因组特征、自组织映射和高阶系统发育:参数分析。
Evol Bioinform Online. 2007 Sep 17;3:211-36.