• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 DNA 微阵列基因表达数据的矩阵分解方法分析的综合评估。

Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data.

机构信息

Seoul National University Biomedical Informatics, Systems Biomedical Informatics Research Center, and Interdisciplinary Program of Medical Informatics Div. of Biomedical Informatics, Seoul National University College of Medicine, Seoul 110799, Korea.

出版信息

BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S8. doi: 10.1186/1471-2105-12-S13-S8. Epub 2011 Nov 30.

DOI:10.1186/1471-2105-12-S13-S8
PMID:22373334
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3278848/
Abstract

BACKGROUND

Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patterns from DNA microarray experiments, as it efficiently reduces the dimension of gene expression data. Although several MF methods have been proposed for clustering gene expression patterns, a systematic evaluation has not been reported yet.

RESULTS

Here we evaluated the clustering performance of orthogonal and non-orthogonal MFs by a total of nine measurements for performance in four gene expression datasets and one well-known dataset for clustering. Specifically, we employed a non-orthogonal MF algorithm, BSNMF (Bi-directional Sparse Non-negative Matrix Factorization), that applies bi-directional sparseness constraints superimposed on non-negative constraints, comprising a few dominantly co-expressed genes and samples together. Non-orthogonal MFs tended to show better clustering-quality and prediction-accuracy indices than orthogonal MFs as well as a traditional method, K-means. Moreover, BSNMF showed improved performance in these measurements. Non-orthogonal MFs including BSNMF showed also good performance in the functional enrichment test using Gene Ontology terms and biological pathways.

CONCLUSIONS

In conclusion, the clustering performance of orthogonal and non-orthogonal MFs was appropriately evaluated for clustering microarray data by comprehensive measurements. This study showed that non-orthogonal MFs have better performance than orthogonal MFs and K-means for clustering microarray data.

摘要

背景

基于聚类的基因表达分析方法在癌症亚型发现等生物医学应用中已经显示出了一定的作用。其中,矩阵分解(MF)在聚类 DNA 微阵列实验中的基因表达模式方面具有优势,因为它可以有效地降低基因表达数据的维度。尽管已经提出了几种用于聚类基因表达模式的 MF 方法,但尚未进行系统评估。

结果

我们通过对四个基因表达数据集和一个用于聚类的知名数据集进行的总共九项测量,评估了正交和非正交 MF 的聚类性能。具体来说,我们采用了一种非正交 MF 算法,BSNMF(双向稀疏非负矩阵分解),它应用了双向稀疏约束和非负约束,包含了少数共同表达的基因和样本。非正交 MF 比正交 MF 以及传统方法 K-means 更倾向于表现出更好的聚类质量和预测准确性指标。此外,BSNMF 在这些测量中表现出了更好的性能。非正交 MF,包括 BSNMF,在使用基因本体论术语和生物学途径进行的功能富集测试中也表现出了良好的性能。

结论

总之,通过综合测量,我们适当地评估了正交和非正交 MF 在聚类微阵列数据方面的聚类性能。这项研究表明,非正交 MF 在聚类微阵列数据方面比正交 MF 和 K-means 具有更好的性能。

相似文献

1
Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data.基于 DNA 微阵列基因表达数据的矩阵分解方法分析的综合评估。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S8. doi: 10.1186/1471-2105-12-S13-S8. Epub 2011 Nov 30.
2
Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.通过非负矩阵分解减少微阵列数据以进行可视化和聚类分析。
J Biomed Inform. 2008 Aug;41(4):602-6. doi: 10.1016/j.jbi.2007.12.003. Epub 2007 Dec 23.
3
Exploring matrix factorization techniques for significant genes identification of Alzheimer's disease microarray gene expression data.探索矩阵分解技术在阿尔茨海默病基因表达数据中显著基因识别中的应用。
BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2105-12-S5-S7. Epub 2011 Jul 27.
4
Improving molecular cancer class discovery through sparse non-negative matrix factorization.通过稀疏非负矩阵分解改进分子癌症类别发现。
Bioinformatics. 2005 Nov 1;21(21):3970-5. doi: 10.1093/bioinformatics/bti653.
5
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
6
Evaluation and comparison of gene clustering methods in microarray analysis.微阵列分析中基因聚类方法的评估与比较
Bioinformatics. 2006 Oct 1;22(19):2405-12. doi: 10.1093/bioinformatics/btl406. Epub 2006 Jul 31.
7
Clustering and re-clustering for pattern discovery in gene expression data.用于基因表达数据中模式发现的聚类和再聚类。
J Bioinform Comput Biol. 2005 Apr;3(2):281-301. doi: 10.1142/s0219720005001053.
8
A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering.一种稳健的流形图正则化非负矩阵分解算法在癌症基因聚类中的应用。
Molecules. 2017 Dec 2;22(12):2131. doi: 10.3390/molecules22122131.
9
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
10
Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments.在短时间进程微阵列实验中基于信息准则的具有顺序受限候选概况的聚类
BMC Bioinformatics. 2009 May 15;10:146. doi: 10.1186/1471-2105-10-146.

引用本文的文献

1
The importance of genomic predictors for clinical outcome of hematological malignancies.基因组预测指标对血液系统恶性肿瘤临床结局的重要性。
Blood Sci. 2021 Jul 7;3(3):93-95. doi: 10.1097/BS9.0000000000000075. eCollection 2021 Jul.
2
Integrative analysis reveals disrupted pathways regulated by microRNAs in cancer.综合分析揭示了 microRNAs 在癌症中调控的失调途径。
Nucleic Acids Res. 2018 Feb 16;46(3):1089-1101. doi: 10.1093/nar/gkx1250.
3
Impact of the Choice of Normalization Method on Molecular Cancer Class Discovery Using Nonnegative Matrix Factorization.

本文引用的文献

1
A cluster separation measure.一种聚类分离度量。
IEEE Trans Pattern Anal Mach Intell. 1979 Feb;1(2):224-7.
2
Principal component analysis based methods in bioinformatics studies.基于主成分分析的生物信息学研究方法。
Brief Bioinform. 2011 Nov;12(6):714-22. doi: 10.1093/bib/bbq090. Epub 2011 Jan 17.
3
Non-negative matrix factorization of gene expression profiles: a plug-in for BRB-ArrayTools.基因表达谱的非负矩阵分解:BRB-ArrayTools的一个插件。
归一化方法的选择对使用非负矩阵分解进行分子癌症类别发现的影响
PLoS One. 2016 Oct 14;11(10):e0164880. doi: 10.1371/journal.pone.0164880. eCollection 2016.
4
Classification of breast cancer patients using somatic mutation profiles and machine learning approaches.利用体细胞突变谱和机器学习方法对乳腺癌患者进行分类。
BMC Syst Biol. 2016 Aug 26;10 Suppl 3(Suppl 3):62. doi: 10.1186/s12918-016-0306-z.
5
Gain-of-function miRNA signature by mutant p53 associates with poor cancer outcome.突变型p53导致的功能获得性miRNA特征与癌症预后不良相关。
Oncotarget. 2016 Mar 8;7(10):11056-66. doi: 10.18632/oncotarget.7090.
6
Archetypal analysis of diverse Pseudomonas aeruginosa transcriptomes reveals adaptation in cystic fibrosis airways.对不同的铜绿假单胞菌转录组进行原型分析揭示了在囊性纤维化气道中的适应性。
BMC Bioinformatics. 2013 Sep 23;14:279. doi: 10.1186/1471-2105-14-279.
7
Non-negative matrix factorization by maximizing correntropy for cancer clustering.基于互信息最大化的非负矩阵分解在癌症聚类中的应用。
BMC Bioinformatics. 2013 Mar 24;14:107. doi: 10.1186/1471-2105-14-107.
8
Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference.展望未来十年的大数据科学:来自 InCoB 十年和第一届 ISCB-Asia 联合会议。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S1. doi: 10.1186/1471-2105-12-S13-S1. Epub 2011 Nov 30.
Bioinformatics. 2009 Feb 15;25(4):545-7. doi: 10.1093/bioinformatics/btp009. Epub 2009 Jan 8.
4
Knowledge-based gene expression classification via matrix factorization.基于知识的矩阵分解基因表达分类
Bioinformatics. 2008 Aug 1;24(15):1688-97. doi: 10.1093/bioinformatics/btn245. Epub 2008 Jun 5.
5
Some new indexes of cluster validity.一些新的聚类有效性指标。
IEEE Trans Syst Man Cybern B Cybern. 1998;28(3):301-15. doi: 10.1109/3477.678624.
6
Improving molecular cancer class discovery through sparse non-negative matrix factorization.通过稀疏非负矩阵分解改进分子癌症类别发现。
Bioinformatics. 2005 Nov 1;21(21):3970-5. doi: 10.1093/bioinformatics/bti653.
7
ArrayXPath II: mapping and visualizing microarray gene-expression data with biomedical ontologies and integrated biological pathway resources using Scalable Vector Graphics.ArrayXPath II:使用可缩放矢量图形,通过生物医学本体和整合的生物途径资源对微阵列基因表达数据进行映射和可视化。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W621-6. doi: 10.1093/nar/gki450.
8
Multi-way clustering of microarray data using probabilistic sparse matrix factorization.使用概率稀疏矩阵分解的微阵列数据多向聚类
Bioinformatics. 2005 Jun;21 Suppl 1:i144-51. doi: 10.1093/bioinformatics/bti1041.
9
Multi-class tumor classification by discriminant partial least squares using microarray gene expression data and assessment of classification models.使用微阵列基因表达数据通过判别偏最小二乘法进行多类别肿瘤分类及分类模型评估
Comput Biol Chem. 2004 Jul;28(3):235-44. doi: 10.1016/j.compbiolchem.2004.05.002.
10
ArrayXPath: mapping and visualizing microarray gene-expression data with integrated biological pathway resources using Scalable Vector Graphics.ArrayXPath:使用可缩放矢量图形,通过整合生物通路资源来映射和可视化微阵列基因表达数据。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W460-4. doi: 10.1093/nar/gkh476.