• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用类信息的动态扩展自组织映射进行基因表达数据分析。

Gene expression data analysis with a dynamically extended self-organized map that exploits class information.

作者信息

Mavroudi Seferina, Papadimitriou Stergios, Bezerianos Anastasios

机构信息

Department of Medical Physics, School of Medicine, University of Patras, 26500 Patras, Greece.

出版信息

Bioinformatics. 2002 Nov;18(11):1446-53. doi: 10.1093/bioinformatics/18.11.1446.

DOI:10.1093/bioinformatics/18.11.1446
PMID:12424115
Abstract

MOTIVATION

Currently the most popular approach to analyze genome-wide expression data is clustering. One of the major drawbacks of most of the existing clustering methods is that the number of clusters has to be specified a priori. Furthermore, by using pure unsupervised algorithms prior biological knowledge is totally ignored Moreover, most current tools lack an effective framework for tight integration of unsupervised and supervised learning for the analysis of high-dimensional expression data and only very few multi-class supervised approaches are designed with the provision for effectively utilizing multiple functional class labeling.

RESULTS

The paper adapts a novel Self-Organizing map called supervised Network Self-Organized Map (sNet-SOM) to the peculiarities of multi-labeled gene expression data. The sNet-SOM determines adaptively the number of clusters with a dynamic extension process. This process is driven by an inhomogeneous measure that tries to balance unsupervised, supervised and model complexity criteria. Nodes within a rectangular grid are grown at the boundary nodes, weights rippled from the internal nodes towards the outer nodes of the grid, and whole columns inserted within the map The appropriate level of expansion is determined automatically. Multiple sNet-SOM models are constructed dynamically each for a different unsupervised/supervised balance and model selection criteria are used to select the one optimum one. The results indicate that sNet-SOM yields competitive performance to other recently proposed approaches for supervised classification at a significantly reduced computational cost and it provides extensive exploratory analysis potentiality within the analysis framework. Furthermore, it explores simple design decisions that are easier to comprehend and computationally efficient.

摘要

动机

目前,分析全基因组表达数据最流行的方法是聚类。大多数现有聚类方法的主要缺点之一是聚类数量必须事先指定。此外,通过使用纯无监督算法,先前的生物学知识被完全忽略。而且,目前大多数工具缺乏一个有效的框架来紧密集成无监督和有监督学习以分析高维表达数据,并且只有极少数多类有监督方法在设计时考虑了有效利用多个功能类标签。

结果

本文将一种名为监督网络自组织映射(sNet - SOM)的新型自组织映射方法应用于多标签基因表达数据的特性分析。sNet - SOM通过动态扩展过程自适应地确定聚类数量。这个过程由一种不均匀度量驱动,该度量试图平衡无监督、有监督和模型复杂度标准。矩形网格内的节点在边界节点处生长,权重从内部节点向网格的外部节点波动,并且在映射图中插入整列。自动确定适当的扩展级别。针对不同的无监督/有监督平衡动态构建多个sNet - SOM模型,并使用模型选择标准来选择最优的一个。结果表明,sNet - SOM在显著降低计算成本的情况下,与其他最近提出的有监督分类方法相比具有竞争力的性能,并且它在分析框架内提供了广泛的探索性分析潜力。此外,它探索了更易于理解和计算高效的简单设计决策。

相似文献

1
Gene expression data analysis with a dynamically extended self-organized map that exploits class information.利用类信息的动态扩展自组织映射进行基因表达数据分析。
Bioinformatics. 2002 Nov;18(11):1446-53. doi: 10.1093/bioinformatics/18.11.1446.
2
Kernel-based self-organized maps trained with supervised bias for gene expression data analysis.
J Bioinform Comput Biol. 2004 Jan;1(4):647-80. doi: 10.1142/s021972000400034x.
3
Genetic algorithms applied to multi-class prediction for the analysis of gene expression data.应用于基因表达数据分析的多类预测的遗传算法。
Bioinformatics. 2003 Jan;19(1):37-44. doi: 10.1093/bioinformatics/19.1.37.
4
CLICK and EXPANDER: a system for clustering and visualizing gene expression data.CLICK和EXPANDER:一种用于基因表达数据聚类和可视化的系统。
Bioinformatics. 2003 Sep 22;19(14):1787-99. doi: 10.1093/bioinformatics/btg232.
5
Ischemia detection with a self-organizing map supplemented by supervised learning.使用自组织映射并辅以监督学习进行缺血检测。
IEEE Trans Neural Netw. 2001;12(3):503-15. doi: 10.1109/72.925554.
6
Reliable classification of two-class cancer data using evolutionary algorithms.使用进化算法对两类癌症数据进行可靠分类。
Biosystems. 2003 Nov;72(1-2):111-29. doi: 10.1016/s0303-2647(03)00138-2.
7
Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类:性能与相似性分析
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.
8
An unsupervised hierarchical dynamic self-organizing approach to cancer class discovery and marker gene identification in microarray data.一种用于微阵列数据中癌症类别发现和标记基因识别的无监督分层动态自组织方法。
Bioinformatics. 2003 Nov 1;19(16):2131-40. doi: 10.1093/bioinformatics/btg296.
9
Prediction of biologically significant components from microarray data: Independently Consistent Expression Discriminator (ICED).从微阵列数据预测具有生物学意义的成分:独立一致表达鉴别器(ICED)。
Bioinformatics. 2003 Jan;19(1):62-70. doi: 10.1093/bioinformatics/19.1.62.
10
Background rareness-based iterative multiple sequence alignment algorithm for regulatory element detection.用于调控元件检测的基于稀有性的迭代多序列比对算法
Bioinformatics. 2003 Oct 12;19(15):1952-63. doi: 10.1093/bioinformatics/btg266.

引用本文的文献

1
Som-based class discovery exploring the ICA-reduced features of microarray expression profiles.基于自组织映射的类发现,探索微阵列表达谱经独立成分分析降维后的特征。
Comp Funct Genomics. 2004;5(8):596-616. doi: 10.1002/cfg.444.