• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

带罚因子分析器混合模型及其在高维微阵列数据分析中的聚类应用。

Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.

机构信息

Division of Biostatistics, School of Public Health and School of Statistics, University of Minnesota, Minneapolis, MN, USA.

出版信息

Bioinformatics. 2010 Feb 15;26(4):501-8. doi: 10.1093/bioinformatics/btp707. Epub 2009 Dec 23.

DOI:10.1093/bioinformatics/btp707
PMID:20031967
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2852217/
Abstract

MOTIVATION

Model-based clustering has been widely used, e.g. in microarray data analysis. Since for high-dimensional data variable selection is necessary, several penalized model-based clustering methods have been proposed tørealize simultaneous variable selection and clustering. However, the existing methods all assume that the variables are independent with the use of diagonal covariance matrices.

RESULTS

To model non-independence of variables (e.g. correlated gene expressions) while alleviating the problem with the large number of unknown parameters associated with a general non-diagonal covariance matrix, we generalize the mixture of factor analyzers to that with penalization, which, among others, can effectively realize variable selection. We use simulated data and real microarray data to illustrate the utility and advantages of the proposed method over several existing ones.

摘要

动机

基于模型的聚类已被广泛应用,例如在微阵列数据分析中。由于对于高维数据,变量选择是必要的,因此已经提出了几种惩罚性基于模型的聚类方法来实现同时进行变量选择和聚类。然而,现有的方法都假设变量是独立的,使用对角协方差矩阵。

结果

为了在缓解与一般非对角协方差矩阵相关的大量未知参数问题的同时对变量的非独立性(例如相关基因表达)进行建模,我们将因子分析的混合推广到具有惩罚的混合,它可以有效地实现变量选择。我们使用模拟数据和真实的微阵列数据来说明所提出的方法相对于其他几种现有方法的实用性和优势。

相似文献

1
Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.带罚因子分析器混合模型及其在高维微阵列数据分析中的聚类应用。
Bioinformatics. 2010 Feb 15;26(4):501-8. doi: 10.1093/bioinformatics/btp707. Epub 2009 Dec 23.
2
Mixtures of common t-factor analyzers for clustering high-dimensional microarray data.常见 t 因子分析器的混合物用于聚类高维微阵列数据。
Bioinformatics. 2011 May 1;27(9):1269-76. doi: 10.1093/bioinformatics/btr112. Epub 2011 Mar 3.
3
Variable selection for model-based high-dimensional clustering and its application to microarray data.基于模型的高维聚类的变量选择及其在微阵列数据中的应用。
Biometrics. 2008 Jun;64(2):440-8. doi: 10.1111/j.1541-0420.2007.00922.x. Epub 2007 Oct 26.
4
Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.具有特定聚类对角协方差矩阵和分组变量的基于惩罚模型的聚类
Electron J Stat. 2008;2:168-212. doi: 10.1214/08-EJS194.
5
A novel approach to the clustering of microarray data via nonparametric density estimation.一种基于非参数密度估计的微阵列数据聚类新方法。
BMC Bioinformatics. 2011 Feb 8;12:49. doi: 10.1186/1471-2105-12-49.
6
Bayesian model-based clustering of temporal gene expression using autoregressive panel data approach.基于自回归面板数据方法的时间基因表达的贝叶斯模型聚类。
Bioinformatics. 2012 Aug 1;28(15):2004-7. doi: 10.1093/bioinformatics/bts322. Epub 2012 Jun 4.
7
Penalized model-based clustering with unconstrained covariance matrices.具有无约束协方差矩阵的基于惩罚模型的聚类
Electron J Stat. 2009 Jan 1;3:1473-1496. doi: 10.1214/09-EJS487.
8
Gene expression data clustering using a multiobjective symmetry based clustering technique.基于多目标对称的基因表达数据聚类技术。
Comput Biol Med. 2013 Nov;43(11):1965-77. doi: 10.1016/j.compbiomed.2013.07.021. Epub 2013 Sep 7.
9
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.
10
Beyond clustering of array expressions.超越阵列表达式的聚类。
Int J Bioinform Res Appl. 2009;5(3):329-48. doi: 10.1504/IJBRA.2009.026423.

引用本文的文献

1
Mixture of regressions with multivariate responses for discovering subtypes in Alzheimer's biomarkers with detection limits.用于发现具有检测限的阿尔茨海默病生物标志物亚型的多元响应回归混合模型
Data Sci Sci. 2024;3(1). doi: 10.1080/26941899.2024.2309403. Epub 2024 Mar 6.
2
Clustering High-Dimensional Landmark-based Two-dimensional Shape Data.基于高维地标点的二维形状数据聚类
J Am Stat Assoc. 2015 Nov 7;110(115):946-961. doi: 10.1080/01621459.2015.1034802. Epub 2015 Apr 16.
3
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.通过一种新的稳健网络聚类算法发现癌症亚型和鉴定生物标志物。
PLoS One. 2013 Jun 17;8(6):e66256. doi: 10.1371/journal.pone.0066256. Print 2013.
4
Penalized model-based clustering with unconstrained covariance matrices.具有无约束协方差矩阵的基于惩罚模型的聚类
Electron J Stat. 2009 Jan 1;3:1473-1496. doi: 10.1214/09-EJS487.

本文引用的文献

1
Mixtures of factor analyzers with common factor loadings: applications to the clustering and visualization of high-dimensional data.具有共同因子负荷的因子分析器混合物:在高维数据的聚类和可视化中的应用。
IEEE Trans Pattern Anal Mach Intell. 2010 Jul;32(7):1298-309. doi: 10.1109/TPAMI.2009.149.
2
Penalized model-based clustering with unconstrained covariance matrices.具有无约束协方差矩阵的基于惩罚模型的聚类
Electron J Stat. 2009 Jan 1;3:1473-1496. doi: 10.1214/09-EJS487.
3
Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.具有特定聚类对角协方差矩阵和分组变量的基于惩罚模型的聚类
Electron J Stat. 2008;2:168-212. doi: 10.1214/08-EJS194.
4
Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.《超高维特征空间中的确定独立性筛选》讨论
J R Stat Soc Series B Stat Methodol. 2008 Nov;70(5):903. doi: 10.1111/j.1467-9868.2008.00674.x.
5
Modeling the manifolds of images of handwritten digits.对手写数字图像的流形进行建模。
IEEE Trans Neural Netw. 1997;8(1):65-74. doi: 10.1109/72.554192.
6
Variable selection in penalized model-based clustering via regularization on grouped parameters.基于分组参数正则化的惩罚模型聚类中的变量选择
Biometrics. 2008 Sep;64(3):921-930. doi: 10.1111/j.1541-0420.2007.00955.x. Epub 2007 Dec 20.
7
Variable selection for model-based high-dimensional clustering and its application to microarray data.基于模型的高维聚类的变量选择及其在微阵列数据中的应用。
Biometrics. 2008 Jun;64(2):440-8. doi: 10.1111/j.1541-0420.2007.00922.x. Epub 2007 Oct 26.
8
Evaluation and comparison of gene clustering methods in microarray analysis.微阵列分析中基因聚类方法的评估与比较
Bioinformatics. 2006 Oct 1;22(19):2405-12. doi: 10.1093/bioinformatics/btl406. Epub 2006 Jul 31.
9
Gene-expression profiles predict survival of patients with lung adenocarcinoma.基因表达谱可预测肺腺癌患者的生存情况。
Nat Med. 2002 Aug;8(8):816-24. doi: 10.1038/nm733. Epub 2002 Jul 15.
10
A mixture model-based approach to the clustering of microarray expression data.一种基于混合模型的微阵列表达数据聚类方法。
Bioinformatics. 2002 Mar;18(3):413-22. doi: 10.1093/bioinformatics/18.3.413.