• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对具有重复测量值的基因表达数据进行聚类分析。

Clustering gene-expression data with repeated measurements.

作者信息

Yeung Ka Yee, Medvedovic Mario, Bumgarner Roger E

机构信息

Department of Microbiology, University of Washington, Seattle, WA 98195, USA.

出版信息

Genome Biol. 2003;4(5):R34. doi: 10.1186/gb-2003-4-5-r34. Epub 2003 Apr 25.

DOI:10.1186/gb-2003-4-5-r34
PMID:12734014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC156590/
Abstract

Clustering is a common methodology for the analysis of array data, and many research laboratories are generating array data with repeated measurements. We evaluated several clustering algorithms that incorporate repeated measurements, and show that algorithms that take advantage of repeated measurements yield more accurate and more stable clusters. In particular, we show that the infinite mixture model-based approach with a built-in error model produces superior results.

摘要

聚类是分析阵列数据的常用方法,许多研究实验室正在生成带有重复测量的阵列数据。我们评估了几种纳入重复测量的聚类算法,并表明利用重复测量的算法能产生更准确、更稳定的聚类。特别是,我们表明基于无限混合模型且带有内置误差模型的方法能产生更优结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/3af3463b1abb/gb-2003-4-5-r34-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/f6712bd03a8f/gb-2003-4-5-r34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/c3d88e98149d/gb-2003-4-5-r34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/ac5818190f1e/gb-2003-4-5-r34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/3af3463b1abb/gb-2003-4-5-r34-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/f6712bd03a8f/gb-2003-4-5-r34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/c3d88e98149d/gb-2003-4-5-r34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/ac5818190f1e/gb-2003-4-5-r34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a482/156590/3af3463b1abb/gb-2003-4-5-r34-4.jpg

相似文献

1
Clustering gene-expression data with repeated measurements.对具有重复测量值的基因表达数据进行聚类分析。
Genome Biol. 2003;4(5):R34. doi: 10.1186/gb-2003-4-5-r34. Epub 2003 Apr 25.
2
An approach for clustering gene expression data with error information.一种用于对带有误差信息的基因表达数据进行聚类的方法。
BMC Bioinformatics. 2006 Jan 12;7:17. doi: 10.1186/1471-2105-7-17.
3
From co-expression to co-regulation: how many microarray experiments do we need?从共表达到共调控:我们需要多少微阵列实验?
Genome Biol. 2004;5(7):R48. doi: 10.1186/gb-2004-5-7-r48. Epub 2004 Jun 28.
4
Bayesian infinite mixture model based clustering of gene expression profiles.基于贝叶斯无限混合模型的基因表达谱聚类
Bioinformatics. 2002 Sep;18(9):1194-206. doi: 10.1093/bioinformatics/18.9.1194.
5
Interpolation based consensus clustering for gene expression time series.基于插值的基因表达时间序列一致性聚类
BMC Bioinformatics. 2015 Apr 16;16:117. doi: 10.1186/s12859-015-0541-0.
6
Influence of microarrays experiments missing values on the stability of gene groups by hierarchical clustering.微阵列实验缺失值对通过层次聚类的基因组稳定性的影响。
BMC Bioinformatics. 2004 Aug 23;5:114. doi: 10.1186/1471-2105-5-114.
7
Finding gene clusters for a replicated time course study.为一项重复时间进程研究寻找基因簇。
BMC Res Notes. 2014 Jan 24;7:60. doi: 10.1186/1756-0500-7-60.
8
A latent variable model for chemogenomic profiling.一种用于化学基因组学分析的潜在变量模型。
Bioinformatics. 2005 Aug 1;21(15):3286-93. doi: 10.1093/bioinformatics/bti515. Epub 2005 May 26.
9
Decomposing gene expression into cellular processes.将基因表达分解为细胞过程。
Pac Symp Biocomput. 2003:89-100. doi: 10.1142/9789812776303_0009.
10
Visualization and evaluation of clusters for exploratory analysis of gene expression data.用于基因表达数据探索性分析的聚类可视化与评估
J Biomed Inform. 2002 Feb;35(1):25-36. doi: 10.1016/s1532-0464(02)00001-1.

引用本文的文献

1
Spectral divergence prioritizes key classes, genes, and pathways shared between substance use disorders and cardiovascular disease.光谱散度对物质使用障碍和心血管疾病之间共有的关键类别、基因和通路进行了优先排序。
Front Neurosci. 2025 Jul 22;19:1572243. doi: 10.3389/fnins.2025.1572243. eCollection 2025.
2
Cytokine, chemokine, and acute-phase protein profiles in plasma as correlative biomarkers of clinical outcomes for patients with COVID-19.血浆中的细胞因子、趋化因子和急性期蛋白谱作为COVID-19患者临床结局的相关生物标志物
Sci Rep. 2025 May 2;15(1):15397. doi: 10.1038/s41598-025-99248-6.
3
VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data.

本文引用的文献

1
A Study of the Comparability of External Criteria for Hierarchical Cluster Analysis.层次聚类分析外部准则的可比性研究
Multivariate Behav Res. 1986 Oct 1;21(4):441-58. doi: 10.1207/s15327906mbr2104_5.
2
Bayesian infinite mixture model based clustering of gene expression profiles.基于贝叶斯无限混合模型的基因表达谱聚类
Bioinformatics. 2002 Sep;18(9):1194-206. doi: 10.1093/bioinformatics/18.9.1194.
3
Cluster analysis and its applications to gene expression data.
Ernst Schering Res Found Workshop. 2002(38):83-108. doi: 10.1007/978-3-662-04747-7_5.
VICatMix:用于离散生物医学数据的变分贝叶斯聚类和变量选择
Bioinform Adv. 2025 Mar 17;5(1):vbaf055. doi: 10.1093/bioadv/vbaf055. eCollection 2025.
4
On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles.关于异常值检测和单类分类的评估:算法、模型选择和集成的比较研究。
Data Min Knowl Discov. 2023;37(4):1473-1517. doi: 10.1007/s10618-023-00931-x. Epub 2023 May 16.
5
Digital phenotyping of sleep patterns among heterogenous samples of Latinx adults using unsupervised learning.使用无监督学习对拉丁裔成年人的异质样本进行睡眠模式的数字表型分析。
Sleep Med. 2021 Sep;85:211-220. doi: 10.1016/j.sleep.2021.07.023. Epub 2021 Jul 19.
6
A unified view of density-based methods for semi-supervised clustering and classification.基于密度的半监督聚类和分类方法的统一观点。
Data Min Knowl Discov. 2019;33(6):1894-1952. doi: 10.1007/s10618-019-00651-1. Epub 2020 Jul 27.
7
Autophagy and Hsp70 activation alleviate oral epithelial cell death induced by food-derived hypertonicity.自噬和热休克蛋白 70 的激活可减轻食物来源的高渗性诱导的口腔上皮细胞死亡。
Cell Stress Chaperones. 2020 Mar;25(2):253-264. doi: 10.1007/s12192-020-01068-2. Epub 2020 Jan 23.
8
Markov chain Monte Carlo simulation of a Bayesian mixture model for gene network inference.贝叶斯混合模型的马尔可夫链蒙特卡罗模拟在基因网络推断中的应用。
Genes Genomics. 2019 May;41(5):547-555. doi: 10.1007/s13258-019-00789-8. Epub 2019 Feb 11.
9
Cluster analysis of replicated alternative polyadenylation data using canonical correlation analysis.基于典型相关分析的重复可变剪接数据分析的聚类。
BMC Genomics. 2019 Jan 22;20(1):75. doi: 10.1186/s12864-019-5433-7.
10
Accurate and Rapid Differentiation of Acinetobacter baumannii Strains by Raman Spectroscopy: a Comparative Study.通过拉曼光谱对鲍曼不动杆菌菌株进行准确快速的鉴别:一项比较研究
J Clin Microbiol. 2017 Aug;55(8):2480-2490. doi: 10.1128/JCM.01744-16. Epub 2017 Jun 7.
4
A mixture model-based approach to the clustering of microarray expression data.一种基于混合模型的微阵列表达数据聚类方法。
Bioinformatics. 2002 Mar;18(3):413-22. doi: 10.1093/bioinformatics/18.3.413.
5
Gene expression profiling predicts clinical outcome of breast cancer.基因表达谱分析可预测乳腺癌的临床预后。
Nature. 2002 Jan 31;415(6871):530-6. doi: 10.1038/415530a.
6
Model-based clustering and data transformations for gene expression data.基于模型的基因表达数据聚类与数据转换
Bioinformatics. 2001 Oct;17(10):977-87. doi: 10.1093/bioinformatics/17.10.977.
7
Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.基于模型的寡核苷酸阵列分析:模型验证、设计问题及标准误差应用
Genome Biol. 2001;2(8):RESEARCH0032. doi: 10.1186/gb-2001-2-8-research0032. Epub 2001 Aug 3.
8
Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments.自举聚类分析:评估微阵列实验结论的可靠性。
Proc Natl Acad Sci U S A. 2001 Jul 31;98(16):8961-5. doi: 10.1073/pnas.161273698. Epub 2001 Jul 24.
9
Missing value estimation methods for DNA microarrays.DNA微阵列的缺失值估计方法。
Bioinformatics. 2001 Jun;17(6):520-5. doi: 10.1093/bioinformatics/17.6.520.
10
Testing for differentially-expressed genes by maximum-likelihood analysis of microarray data.通过对微阵列数据进行最大似然分析来检测差异表达基因。
J Comput Biol. 2000;7(6):805-17. doi: 10.1089/10665270050514945.