• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。

scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.

机构信息

School of Mathematical Sciences, Shenzhen University, 518000, Guangdong, China.

College of Life and Health Sciences, Northeastern University, Shenyang, 110169, China.

出版信息

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.

DOI:10.1093/bib/bbae228
PMID:38754408
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11097994/
Abstract

MOTIVATION

The technology for analyzing single-cell multi-omics data has advanced rapidly and has provided comprehensive and accurate cellular information by exploring cell heterogeneity in genomics, transcriptomics, epigenomics, metabolomics and proteomics data. However, because of the high-dimensional and sparse characteristics of single-cell multi-omics data, as well as the limitations of various analysis algorithms, the clustering performance is generally poor. Matrix factorization is an unsupervised, dimensionality reduction-based method that can cluster individuals and discover related omics variables from different blocks. Here, we present a novel algorithm that performs joint dimensionality reduction learning and cell clustering analysis on single-cell multi-omics data using non-negative matrix factorization that we named scMNMF. We formulate the objective function of joint learning as a constrained optimization problem and derive the corresponding iterative formulas through alternating iterative algorithms. The major advantage of the scMNMF algorithm remains its capability to explore hidden related features among omics data. Additionally, the feature selection for dimensionality reduction and cell clustering mutually influence each other iteratively, leading to a more effective discovery of cell types. We validated the performance of the scMNMF algorithm using two simulated and five real datasets. The results show that scMNMF outperformed seven other state-of-the-art algorithms in various measurements.

AVAILABILITY AND IMPLEMENTATION

scMNMF code can be found at https://github.com/yushanqiu/scMNMF.

摘要

动机

分析单细胞多组学数据的技术发展迅速,通过探索基因组学、转录组学、表观基因组学、代谢组学和蛋白质组学数据中的细胞异质性,提供了全面而准确的细胞信息。然而,由于单细胞多组学数据具有高维性和稀疏性的特点,以及各种分析算法的局限性,聚类性能通常较差。矩阵分解是一种无监督的、基于降维的方法,可以从不同的块中对个体进行聚类,并发现相关的组学变量。在这里,我们提出了一种新的算法,该算法使用非负矩阵分解对单细胞多组学数据进行联合降维和细胞聚类分析,我们将其命名为 scMNMF。我们将联合学习的目标函数表述为一个约束优化问题,并通过交替迭代算法推导出相应的迭代公式。scMNMF 算法的主要优点是能够在组学数据中探索隐藏的相关特征。此外,降维和细胞聚类的特征选择相互迭代影响,从而更有效地发现细胞类型。我们使用两个模拟数据集和五个真实数据集验证了 scMNMF 算法的性能。结果表明,在各种测量中,scMNMF 算法均优于其他七种最先进的算法。

可用性和实现

scMNMF 代码可在 https://github.com/yushanqiu/scMNMF 上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/baa6ad2e3e6b/bbae228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/3eedec38565c/bbae228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/1d3e60bec495/bbae228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/c8d89cd7a6a8/bbae228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/5428babc17ae/bbae228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/848523b44334/bbae228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/4d095936d184/bbae228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/baa6ad2e3e6b/bbae228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/3eedec38565c/bbae228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/1d3e60bec495/bbae228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/c8d89cd7a6a8/bbae228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/5428babc17ae/bbae228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/848523b44334/bbae228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/4d095936d184/bbae228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/11097994/baa6ad2e3e6b/bbae228f7.jpg

相似文献

1
scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.
2
Effective Integration of Single-Cell Multi-Omics Data Using Improved Network-Based Integrative Clustering with Multigraph Regularization.使用改进的基于网络的多图正则化集成聚类实现单细胞多组学数据的有效整合。
J Comput Biol. 2025 Jun;32(6):601-614. doi: 10.1089/cmb.2023.0460. Epub 2025 May 22.
3
Graph neural networks for single-cell omics data: a review of approaches and applications.用于单细胞组学数据的图神经网络:方法与应用综述
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf109.
4
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
5
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
6
PartIES: a disease subtyping framework with Partition-level Integration using diffusion-Enhanced Similarities from multi-omics Data.PARTIES:一种基于分区水平集成的疾病亚型框架,利用多组学数据的扩散增强相似性。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae609.
7
Algorithm-based pain management for people with dementia in nursing homes.基于算法的养老院痴呆患者疼痛管理。
Cochrane Database Syst Rev. 2022 Apr 1;4(4):CD013339. doi: 10.1002/14651858.CD013339.pub2.
8
scGGC: a two-stage strategy for single-cell clustering through cellular gene pathway construction.scGGC:一种通过细胞基因通路构建进行单细胞聚类的两阶段策略。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf368.
9
IGCN: integrative graph convolution networks for patient level insights and biomarker discovery in multi-omics integration.IGCN:用于多组学整合中患者层面洞察和生物标志物发现的整合图卷积网络。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf313.
10
scMGCL: accurate and efficient integration representation of single-cell multi-omics data.scMGCL:单细胞多组学数据的准确高效整合表示
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf392.

引用本文的文献

1
PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering.PLNMFG:用于单细胞多组学数据聚类的具有图约束的伪标签引导非负矩阵分解模型。
PLoS Comput Biol. 2025 Aug 18;21(8):e1013375. doi: 10.1371/journal.pcbi.1013375. eCollection 2025 Aug.
2
RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.RNA序列分析全景:任务类型、数据库、数据集、词嵌入方法及语言模型的全面综述
Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.
3

本文引用的文献

1
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
2
Clustering of single-cell multi-omics data with a multimodal deep learning method.基于多模态深度学习方法的单细胞多组学数据聚类。
Nat Commun. 2022 Dec 13;13(1):7705. doi: 10.1038/s41467-022-35031-9.
3
Clustering single-cell multi-omics data with MoClust.
GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering.
GSTRPCA:用于单细胞多组学数据聚类的不规则张量奇异值分解
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae649.
4
scDRMAE: integrating masked autoencoder with residual attention networks to leverage omics feature dependencies for accurate cell clustering.scDRMAE:集成掩蔽自动编码器和残差注意力网络,利用组学特征依赖性进行准确的细胞聚类。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae599.
使用 MoClust 对单细胞多组学数据进行聚类。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac736.
4
Linear-time cluster ensembles of large-scale single-cell RNA-seq and multimodal data.大规模单细胞 RNA-seq 和多模态数据的线性时间聚类集成。
Genome Res. 2021 Apr;31(4):677-688. doi: 10.1101/gr.267906.120. Epub 2021 Feb 24.
5
EC-PGMGR: Ensemble Clustering Based on Probability Graphical Model With Graph Regularization for Single-Cell RNA-seq Data.EC-PGMGR:基于概率图模型和图正则化的单细胞RNA测序数据集成聚类
Front Genet. 2020 Nov 4;11:572242. doi: 10.3389/fgene.2020.572242. eCollection 2020.
6
Multiple kernel learning for integrative consensus clustering of omic datasets.基于多核学习的组学数据集综合共识聚类分析。
Bioinformatics. 2020 Sep 15;36(18):4789-4796. doi: 10.1093/bioinformatics/btaa593.
7
MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data.MOFA+:一种全面整合多模态单细胞数据的统计框架。
Genome Biol. 2020 May 11;21(1):111. doi: 10.1186/s13059-020-02015-1.
8
BREM-SC: a bayesian random effects mixture model for joint clustering single cell multi-omics data.BREM-SC:一种用于联合聚类单细胞多组学数据的贝叶斯随机效应混合模型。
Nucleic Acids Res. 2020 Jun 19;48(11):5814-5824. doi: 10.1093/nar/gkaa314.
9
CiteFuse enables multi-modal analysis of CITE-seq data.CiteFuse 支持 CITE-seq 数据的多模式分析。
Bioinformatics. 2020 Aug 15;36(14):4137-4143. doi: 10.1093/bioinformatics/btaa282.
10
scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles.scAI:一种用于平行单细胞转录组学和表观基因组学综合分析的无监督方法。
Genome Biol. 2020 Feb 3;21(1):25. doi: 10.1186/s13059-020-1932-8.