文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Graph-based consensus clustering for class discovery from gene expression data.

作者信息

Yu Zhiwen, Wong Hau-San, Wang Hongqiang

机构信息

Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong.

出版信息

Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.


DOI:10.1093/bioinformatics/btm463
PMID:17872912
Abstract

MOTIVATION: Consensus clustering, also known as cluster ensemble, is one of the important techniques for microarray data analysis, and is particularly useful for class discovery from microarray data. Compared with traditional clustering algorithms, consensus clustering approaches have the ability to integrate multiple partitions from different cluster solutions to improve the robustness, stability, scalability and parallelization of the clustering algorithms. By consensus clustering, one can discover the underlying classes of the samples in gene expression data. RESULTS: In addition to exploring a graph-based consensus clustering (GCC) algorithm to estimate the underlying classes of the samples in microarray data, we also design a new validation index to determine the number of classes in microarray data. To our knowledge, this is the first time in which GCC is applied to class discovery for microarray data. Given a pre specified maximum number of classes (denoted as K(max) in this article), our algorithm can discover the true number of classes for the samples in microarray data according to a new cluster validation index called the Modified Rand Index. Experiments on gene expression data indicate that our new algorithm can (i) outperform most of the existing algorithms, (ii) identify the number of classes correctly in real cancer datasets, and (iii) discover the classes of samples with biological meaning. AVAILABILITY: Matlab source code for the GCC algorithm is available upon request from Zhiwen Yu.

摘要

相似文献

[1]
Graph-based consensus clustering for class discovery from gene expression data.

Bioinformatics. 2007-11-1

[2]
A mixture model with random-effects components for clustering correlated gene-expression profiles.

Bioinformatics. 2006-7-15

[3]
Class discovery from gene expression data based on perturbation and cluster ensemble.

IEEE Trans Nanobioscience. 2009-6

[4]
Clustering of change patterns using Fourier coefficients.

Bioinformatics. 2008-1-15

[5]
Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach.

Bioinformatics. 2007-7-1

[6]
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.

Bioinformatics. 2006-1-1

[7]
A multi-stage approach to clustering and imputation of gene expression profiles.

Bioinformatics. 2007-4-15

[8]
An iterative data mining approach for mining overlapping coexpression patterns in noisy gene expression data.

IEEE Trans Nanobioscience. 2009-7-14

[9]
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles.

Bioinformatics. 2008-6-1

[10]
An improved algorithm for clustering gene expression data.

Bioinformatics. 2007-11-1

引用本文的文献

[1]
Cross-talk of mA methylation modification and the tumor microenvironment composition in esophageal cancer.

Front Immunol. 2025-7-7

[2]
VIASCKDE Index: A Novel Internal Cluster Validity Index for Arbitrary-Shaped Clusters Based on the Kernel Density Estimation.

Comput Intell Neurosci. 2022-6-8

[3]
ClusterMine: A knowledge-integrated clustering approach based on expression profiles of gene sets.

J Bioinform Comput Biol. 2020-6

[4]
Overlapping clustering of gene expression data using penalized weighted normalized cut.

Genet Epidemiol. 2018-12

[5]
Assisted gene expression-based clustering with AWNCut.

Stat Med. 2018-8-9

[6]
Cluster ensemble based on Random Forests for genetic data.

BioData Min. 2017-12-15

[7]
Spectral clustering using Nyström approximation for the accurate identification of cancer molecular subtypes.

Sci Rep. 2017-7-7

[8]
Tradict enables accurate prediction of eukaryotic transcriptional states from 100 marker genes.

Nat Commun. 2017-5-5

[9]
Clustering cancer gene expression data by projective clustering ensemble.

PLoS One. 2017-2-24

[10]
Interpolation based consensus clustering for gene expression time series.

BMC Bioinformatics. 2015-4-16

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索