Integrative clustering of multi-level 'omic data based on non-negative matrix factorization algorithm.

Authors

Chalise Prabhakar, Fridley Brooke L

Affiliations

Department of Biostatistics, University of Kansas Medical Center, Kansas City, Kansas, United States of America.

Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida, United States of America.

Publication information

PLoS One. 2017 May 1;12(5):e0176278. doi: 10.1371/journal.pone.0176278. eCollection 2017.

DOI: 10.1371/journal.pone.0176278
PMID: 28459819
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5411077/
Abstract

Integrative analyses of high-throughput 'omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple 'omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA).
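
The shared-factor idea described in the abstract can be illustrated with a small sketch. The code below is not the authors' intNMF implementation. It assumes one common formulation of joint NMF, in which each non-negative data matrix X_i (samples by features) is approximated as W H_i with a single W shared across data types, and it uses Lee-Seung multiplicative updates as a stand-in solver. The function names, the `weights` argument, the toy data, and the rule of assigning each sample to the column of W with the largest entry are all illustrative assumptions.

```python
import numpy as np


def objective(datasets, W, Hs, weights):
    """Weighted sum of squared Frobenius reconstruction errors."""
    return sum(t * np.linalg.norm(X - W @ H) ** 2
               for t, X, H in zip(weights, datasets, Hs))


def joint_nmf(datasets, k, n_iter=300, weights=None, seed=0, eps=1e-10):
    """Factor every non-negative X_i (n x p_i) as W @ H_i with one shared W (n x k)."""
    rng = np.random.default_rng(seed)
    n = datasets[0].shape[0]
    if weights is None:
        weights = [1.0] * len(datasets)
    W = rng.random((n, k)) + eps
    Hs = [rng.random((k, X.shape[1])) + eps for X in datasets]
    for _ in range(n_iter):
        # Update each H_i with W held fixed (the weight cancels in this step).
        for i, X in enumerate(datasets):
            Hs[i] *= (W.T @ X) / (W.T @ W @ Hs[i] + eps)
        # Update the shared W using information from all data types at once.
        num = sum(t * (X @ H.T) for t, X, H in zip(weights, datasets, Hs))
        den = sum(t * (W @ (H @ H.T)) for t, H in zip(weights, Hs)) + eps
        W *= num / den
    # Assumed clustering rule: each sample joins the factor with the largest loading.
    labels = W.argmax(axis=1)
    return W, Hs, labels


# Toy example: 60 samples in 3 groups, measured on two hypothetical data types.
rng = np.random.default_rng(1)
true_labels = np.repeat([0, 1, 2], 20)
datasets = []
for p in (30, 40):
    centers = rng.random((3, p)) * 5.0          # group-specific feature means
    datasets.append(centers[true_labels] + rng.random((60, p)))  # stays non-negative

W, Hs, labels = joint_nmf(datasets, k=3)
print(labels)
```

Because the updates only require non-negative inputs and make no distributional assumption, real data containing negative values (for example, centered expression values) would need to be shifted to be non-negative before use; how best to do that is a modelling choice not covered by this sketch.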


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875a/5411077/b1e20b7742ca/pone.0176278.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875a/5411077/22a350a8faad/pone.0176278.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875a/5411077/b327ee092943/pone.0176278.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875a/5411077/f07d53200204/pone.0176278.g003.jpg
