• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

验证基因表达数据的聚类分析

Validating clustering for gene expression data.

作者信息

Yeung K Y, Haynor D R, Ruzzo W L

机构信息

Computer Science and Engineering, Box 352350, University of Washington, Seattle, WA 98195, USA.

出版信息

Bioinformatics. 2001 Apr;17(4):309-18. doi: 10.1093/bioinformatics/17.4.309.

DOI:10.1093/bioinformatics/17.4.309
PMID:11301299
Abstract

MOTIVATION

Many clustering algorithms have been proposed for the analysis of gene expression data, but little guidance is available to help choose among them. We provide a systematic framework for assessing the results of clustering algorithms. Clustering algorithms attempt to partition the genes into groups exhibiting similar patterns of variation in expression level. Our methodology is to apply a clustering algorithm to the data from all but one experimental condition. The remaining condition is used to assess the predictive power of the resulting clusters-meaningful clusters should exhibit less variation in the remaining condition than clusters formed by chance.

RESULTS

We successfully applied our methodology to compare six clustering algorithms on four gene expression data sets. We found our quantitative measures of cluster quality to be positively correlated with external standards of cluster quality.

摘要

动机

已经提出了许多聚类算法用于基因表达数据分析,但在选择这些算法方面几乎没有可用的指导。我们提供了一个用于评估聚类算法结果的系统框架。聚类算法试图将基因划分为在表达水平上呈现相似变化模式的组。我们的方法是将一种聚类算法应用于除一个实验条件之外的所有数据。其余条件用于评估所得聚类的预测能力——有意义的聚类在其余条件下应比随机形成的聚类表现出更小的变化。

结果

我们成功应用我们的方法在四个基因表达数据集上比较了六种聚类算法。我们发现我们对聚类质量的定量度量与聚类质量的外部标准呈正相关。

相似文献

1
Validating clustering for gene expression data.验证基因表达数据的聚类分析
Bioinformatics. 2001 Apr;17(4):309-18. doi: 10.1093/bioinformatics/17.4.309.
2
Visualization and evaluation of clusters for exploratory analysis of gene expression data.用于基因表达数据探索性分析的聚类可视化与评估
J Biomed Inform. 2002 Feb;35(1):25-36. doi: 10.1016/s1532-0464(02)00001-1.
3
Knowledge-assisted recognition of cluster boundaries in gene expression data.基因表达数据中聚类边界的知识辅助识别。
Artif Intell Med. 2005 Sep-Oct;35(1-2):171-83. doi: 10.1016/j.artmed.2005.02.007.
4
Model-based clustering and data transformations for gene expression data.基于模型的基因表达数据聚类与数据转换
Bioinformatics. 2001 Oct;17(10):977-87. doi: 10.1093/bioinformatics/17.10.977.
5
Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类:性能与相似性分析
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.
6
Principal component analysis for clustering gene expression data.用于聚类基因表达数据的主成分分析。
Bioinformatics. 2001 Sep;17(9):763-74. doi: 10.1093/bioinformatics/17.9.763.
7
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
8
Entropy-based cluster validation and estimation of the number of clusters in gene expression data.基于熵的基因表达数据聚类验证及聚类数量估计
J Bioinform Comput Biol. 2012 Oct;10(5):1250011. doi: 10.1142/S0219720012500114. Epub 2012 Jun 26.
9
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。
Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.
10
Unsupervised ranking of clustering algorithms by INFOMAX.基于 INFOMAX 的无监督聚类算法排序。
PLoS One. 2020 Oct 26;15(10):e0239331. doi: 10.1371/journal.pone.0239331. eCollection 2020.

引用本文的文献

1
An improved elastic net clustering algorithm with dynamic parameter strategy.一种具有动态参数策略的改进弹性网聚类算法。
Sci Rep. 2025 Aug 21;15(1):30747. doi: 10.1038/s41598-025-16319-4.
2
Proteomic Profile Regulated by the Immunomodulatory Jusvinza Drug in Neutrophils Isolated from Rheumatoid Arthritis Patients.类风湿关节炎患者分离出的中性粒细胞中,免疫调节药物Jusvinza调控的蛋白质组学特征
Biomedicines. 2024 Nov 29;12(12):2740. doi: 10.3390/biomedicines12122740.
3
Multi-way overlapping clustering by Bayesian tensor decomposition.基于贝叶斯张量分解的多路重叠聚类
Stat Interface. 2024;17(2):219-230. doi: 10.4310/23-sii790. Epub 2024 Feb 1.
4
miR394 and LCR cooperate with TPL to regulate AM initiation.miR394 和 LCR 与 TPL 合作调节 AM 起始。
Nat Commun. 2024 Nov 23;15(1):10156. doi: 10.1038/s41467-024-54494-6.
5
Bayesian profile regression for clustering analysis involving a longitudinal response and explanatory variables.用于涉及纵向响应和解释变量的聚类分析的贝叶斯轮廓回归
Methodology (Gott). 2024 Mar 11;73(2):314-339. doi: 10.1093/jrsssc/qlad097. Epub 2023 Nov 8.
6
Transcriptome-wide marker gene expression analysis of stress-responsive sulfate-reducing bacteria.基于转录组的应激响应硫酸盐还原菌标记基因表达分析。
Sci Rep. 2023 Sep 27;13(1):16181. doi: 10.1038/s41598-023-43089-8.
7
Integrated omic analysis provides insights into the molecular regulation of stress tolerance by partial root-zone drying in rice.综合组学分析揭示了水稻部分根区干燥胁迫耐受性的分子调控机制。
Front Plant Sci. 2023 Jun 9;14:1156514. doi: 10.3389/fpls.2023.1156514. eCollection 2023.
8
Stability estimation for unsupervised clustering: A review.无监督聚类的稳定性估计:综述
Wiley Interdiscip Rev Comput Stat. 2022 Nov-Dec;14(6):e1575. doi: 10.1002/wics.1575. Epub 2022 Jan 9.
9
Exploring key developmental phases and phase-specific genes across the entirety of anther development in maize.探究玉米花粉发育全过程中的关键发育阶段和阶段特异性基因。
J Integr Plant Biol. 2022 Jul;64(7):1394-1410. doi: 10.1111/jipb.13276.
10
Mapping of the Acoustic Environment at an Urban Park in the City Area of Milan, Italy, Using Very Low-Cost Sensors.使用极低成本传感器对意大利米兰市区城市公园的声环境进行测绘。
Sensors (Basel). 2022 May 6;22(9):3528. doi: 10.3390/s22093528.