• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过非负矩阵分解减少微阵列数据以进行可视化和聚类分析。

Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.

作者信息

Liu Weixiang, Yuan Kehong, Ye Datian

机构信息

Research Center of Biomedical Engineering, Life Science Division, Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055, China.

出版信息

J Biomed Inform. 2008 Aug;41(4):602-6. doi: 10.1016/j.jbi.2007.12.003. Epub 2007 Dec 23.

DOI:10.1016/j.jbi.2007.12.003
PMID:18234564
Abstract

In microarray data analysis, each gene expression sample has thousands of genes and reducing such high dimensionality is useful for both visualization and further clustering of samples. Traditional principal component analysis (PCA) is a commonly used method which has problems. Nonnegative Matrix Factorization (NMF) is a new dimension reduction method. In this paper we compare NMF and PCA for dimension reduction. The reduced data is used for visualization, and clustering analysis via k-means on 11 real gene expression datasets. Before the clustering analysis, we apply NMF and PCA for reduction in visualization. The results on one leukemia dataset show that NMF can discover natural clusters and clearly detect one mislabeled sample while PCA cannot. For clustering analysis via k-means, NMF most typically outperforms PCA. Our results demonstrate the superiority of NMF over PCA in reducing microarray data.

摘要

在微阵列数据分析中,每个基因表达样本都包含数千个基因,降低这种高维度对于样本的可视化和进一步聚类都很有用。传统的主成分分析(PCA)是一种常用方法,但存在问题。非负矩阵分解(NMF)是一种新的降维方法。在本文中,我们比较了NMF和PCA用于降维的情况。降维后的数据用于可视化,并通过k均值算法对11个真实基因表达数据集进行聚类分析。在聚类分析之前,我们应用NMF和PCA进行可视化降维。一个白血病数据集的结果表明,NMF可以发现自然聚类并清晰地检测出一个错误标记的样本,而PCA则不能。对于通过k均值算法进行的聚类分析,NMF通常比PCA表现更优。我们的结果证明了NMF在降低微阵列数据维度方面优于PCA。

相似文献

1
Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.通过非负矩阵分解减少微阵列数据以进行可视化和聚类分析。
J Biomed Inform. 2008 Aug;41(4):602-6. doi: 10.1016/j.jbi.2007.12.003. Epub 2007 Dec 23.
2
Tumor clustering using nonnegative matrix factorization with gene selection.使用带基因选择的非负矩阵分解进行肿瘤聚类。
IEEE Trans Inf Technol Biomed. 2009 Jul;13(4):599-607. doi: 10.1109/TITB.2009.2018115. Epub 2009 Apr 14.
3
Non-negative matrix factorization of gene expression profiles: a plug-in for BRB-ArrayTools.基因表达谱的非负矩阵分解:BRB-ArrayTools的一个插件。
Bioinformatics. 2009 Feb 15;25(4):545-7. doi: 10.1093/bioinformatics/btp009. Epub 2009 Jan 8.
4
Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples.基因表达数据的非线性维数降低,用于癌症组织样本的可视化和聚类分析。
Comput Biol Med. 2010 Aug;40(8):723-32. doi: 10.1016/j.compbiomed.2010.06.007. Epub 2010 Jul 16.
5
On alpha-divergence based nonnegative matrix factorization for clustering cancer gene expression data.基于α-散度的非负矩阵分解用于聚类癌症基因表达数据
Artif Intell Med. 2008 Sep;44(1):1-5. doi: 10.1016/j.artmed.2008.05.001. Epub 2008 Jul 3.
6
Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps.使用自组织映射对酿酒酵母的基因表达数据和基因本体术语进行共聚类和可视化。
J Biomed Inform. 2007 Apr;40(2):160-73. doi: 10.1016/j.jbi.2006.05.001. Epub 2006 May 20.
7
A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusterings.一种用于比较和可视化层次化与平面化基因表达数据聚类之间关系的新算法。
Bioinformatics. 2005 Nov 1;21(21):3993-9. doi: 10.1093/bioinformatics/bti644. Epub 2005 Sep 1.
8
A simple and robust algorithm for microarray data clustering based on gene population-variance ratio metric.一种基于基因群体方差比度量的简单且稳健的微阵列数据聚类算法。
Biotechnol J. 2009 Sep;4(9):1357-61. doi: 10.1002/biot.200800219.
9
Unsupervised clustering in mRNA expression profiles.mRNA表达谱中的无监督聚类
Comput Biol Med. 2006 Oct;36(10):1126-42. doi: 10.1016/j.compbiomed.2005.09.003. Epub 2005 Oct 24.
10
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现
Genome Inform. 2008;21:200-11.

引用本文的文献

1
Multi-omics assessment of dilated cardiomyopathy using non-negative matrix factorization.使用非负矩阵分解对扩张型心肌病进行多组学评估。
PLoS One. 2022 Aug 18;17(8):e0272093. doi: 10.1371/journal.pone.0272093. eCollection 2022.
2
An effective short-text topic modelling with neighbourhood assistance-driven NMF in Twitter.一种在推特中基于邻域辅助驱动的非负矩阵分解的有效短文本主题建模方法。
Soc Netw Anal Min. 2022;12(1):89. doi: 10.1007/s13278-022-00898-5. Epub 2022 Jul 24.
3
The importance of genomic predictors for clinical outcome of hematological malignancies.
基因组预测指标对血液系统恶性肿瘤临床结局的重要性。
Blood Sci. 2021 Jul 7;3(3):93-95. doi: 10.1097/BS9.0000000000000075. eCollection 2021 Jul.
4
Unsupervised Algorithms for Microarray Sample Stratification.非监督算法在微阵列样本分层中的应用。
Methods Mol Biol. 2022;2401:121-146. doi: 10.1007/978-1-0716-1839-4_9.
5
Correlation between Alzheimer's disease and type 2 diabetes using non-negative matrix factorization.使用非负矩阵分解研究阿尔茨海默病与 2 型糖尿病的相关性。
Sci Rep. 2021 Jul 27;11(1):15265. doi: 10.1038/s41598-021-94048-0.
6
Multi-omics analysis of genomics, epigenomics and transcriptomics for molecular subtypes and core genes for lung adenocarcinoma.多组学分析基因组学、表观基因组学和转录组学,用于肺腺癌的分子亚型和核心基因。
BMC Cancer. 2021 Mar 9;21(1):257. doi: 10.1186/s12885-021-07888-4.
7
Orthogonal joint sparse NMF for microarray data analysis.用于微阵列数据分析的正交联合稀疏非负矩阵分解
J Math Biol. 2019 Jul;79(1):223-247. doi: 10.1007/s00285-019-01355-2. Epub 2019 Apr 19.
8
Improving knowledge on the activation of bone marrow fibroblasts in MGUS and MM disease through the automatic extraction of genes via a nonnegative matrix factorization approach on gene expression profiles.通过基于基因表达谱的非负矩阵分解方法自动提取基因,提高对 MGUS 和 MM 疾病中骨髓成纤维细胞激活的认识。
J Transl Med. 2018 Aug 3;16(1):217. doi: 10.1186/s12967-018-1589-1.
9
Analysis of biological functional networks during sciatic nerve repair and regeneration.分析坐骨神经修复和再生过程中的生物功能网络。
Mol Cell Biochem. 2018 Feb;439(1-2):141-150. doi: 10.1007/s11010-017-3143-5. Epub 2017 Aug 5.
10
A Bayesian semiparametric factor analysis model for subtype identification.用于亚型识别的贝叶斯半参数因子分析模型
Stat Appl Genet Mol Biol. 2017 Apr 25;16(2):145-158. doi: 10.1515/sagmb-2016-0051.