• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PLNMFG:用于单细胞多组学数据聚类的具有图约束的伪标签引导非负矩阵分解模型。

PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering.

作者信息

Yuan Hui, Liu Mingzhu, Qiu Yushan, Ching Wai-Ki, Zou Quan

机构信息

School of Mathematical Sciences, Shenzhen University, Shenzhen, China.

Institute for Advanced Study, Shenzhen University, Shenzhen, China.

出版信息

PLoS Comput Biol. 2025 Aug 18;21(8):e1013375. doi: 10.1371/journal.pcbi.1013375. eCollection 2025 Aug.

DOI:10.1371/journal.pcbi.1013375
PMID:40825083
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12416850/
Abstract

The development of single-cell multi-omics sequencing technologies has enabled the simultaneous analysis of multi-omics data within the same cell. Accurate clustering of these cells is crucial for downstream analyses of complex biological functions. Despite significant advances in multi-omics integration approaches, current methodologies exhibit two major limitations. First, they inadequately incorporate prior biological knowledge from various omic layers. Second, these methods often conduct independent dimensionality reduction on individual omic datasets, thereby failing to capture the intrinsic complementary information and potentially overlooking crucial cross-platform interactions. Motivated by these, this study investigates a non-negative matrix factorization model called PLNMFG, which integrates the unified latent representation learning that retains the features between and within omics and the cluster structure learning that retains the intrinsic structure of the data into one joint framework. Specially, PLNMFG performs adaptive imputation to handle dropout events and uses prior pseudo-labels as constraints during the process of collective non-negative matrix factorization, as a result, a more robust latent representation that preserves the double similarity information is obtained. Graph Laplacian constraint is applied during clustering which further preserves structure characteristic of multi-omics data. In addition, the weight of each omic is adaptively learned based on the omic contribution. A series of experiments on 8 benchmark datasets show that our model performs well in terms of clustering accuracy and computational efficiency.

摘要

单细胞多组学测序技术的发展使得在同一细胞内同时分析多组学数据成为可能。对这些细胞进行准确聚类对于复杂生物学功能的下游分析至关重要。尽管多组学整合方法取得了显著进展,但当前方法存在两个主要局限性。首先,它们没有充分整合来自各个组学层面的先验生物学知识。其次,这些方法通常对单个组学数据集进行独立的降维,从而无法捕捉内在的互补信息,并可能忽略关键的跨平台相互作用。受此启发,本研究探讨了一种名为PLNMFG的非负矩阵分解模型,该模型将保留组学之间和组学内部特征的统一潜在表示学习与保留数据内在结构的聚类结构学习整合到一个联合框架中。具体而言,PLNMFG执行自适应插补以处理缺失事件,并在集体非负矩阵分解过程中使用先验伪标签作为约束,结果获得了一个更稳健的潜在表示,该表示保留了双重相似性信息。在聚类过程中应用图拉普拉斯约束,进一步保留了多组学数据的结构特征。此外,基于组学贡献自适应学习每个组学的权重。在8个基准数据集上进行的一系列实验表明,我们的模型在聚类准确性和计算效率方面表现良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/b5c6f02b1b22/pcbi.1013375.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/08af36b113d2/pcbi.1013375.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/6756c00b9c20/pcbi.1013375.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/b48a2566bc93/pcbi.1013375.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/11b11264b20b/pcbi.1013375.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/2298e7ae5f3f/pcbi.1013375.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/eede0b8c0765/pcbi.1013375.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/b5c6f02b1b22/pcbi.1013375.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/08af36b113d2/pcbi.1013375.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/6756c00b9c20/pcbi.1013375.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/b48a2566bc93/pcbi.1013375.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/11b11264b20b/pcbi.1013375.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/2298e7ae5f3f/pcbi.1013375.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/eede0b8c0765/pcbi.1013375.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6818/12416850/b5c6f02b1b22/pcbi.1013375.g007.jpg

相似文献

1
PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering.PLNMFG:用于单细胞多组学数据聚类的具有图约束的伪标签引导非负矩阵分解模型。
PLoS Comput Biol. 2025 Aug 18;21(8):e1013375. doi: 10.1371/journal.pcbi.1013375. eCollection 2025 Aug.
2
scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
MO-GCAN: multi-omics integration based on graph convolutional and attention networks.MO-GCAN:基于图卷积和注意力网络的多组学整合
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf405.
5
MGDMCL: A multi-omics integration framework based on masked graph dynamic learning and multi-granularity feature contrastive learning for biomedical classification.MGDMCL:一种基于掩码图动态学习和多粒度特征对比学习的用于生物医学分类的多组学整合框架。
Comput Methods Programs Biomed. 2025 Nov;271:109024. doi: 10.1016/j.cmpb.2025.109024. Epub 2025 Aug 13.
6
Effective Integration of Single-Cell Multi-Omics Data Using Improved Network-Based Integrative Clustering with Multigraph Regularization.使用改进的基于网络的多图正则化集成聚类实现单细胞多组学数据的有效整合。
J Comput Biol. 2025 Jun;32(6):601-614. doi: 10.1089/cmb.2023.0460. Epub 2025 May 22.
7
Multi-layer matrix factorization for cancer subtyping using full and partial multi-omics dataset.使用完整和部分多组学数据集进行癌症亚型分类的多层矩阵分解
Brief Bioinform. 2025 Aug 31;26(5). doi: 10.1093/bib/bbaf448.
8
sCIN: a contrastive learning framework for single-cell multi-omics data integration.sCIN:用于单细胞多组学数据整合的对比学习框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf411.
9
Short-Term Memory Impairment短期记忆障碍
10
Multi-omics single-cell data alignment and integration with enhanced contrastive learning and differential attention mechanism.基于增强对比学习和差分注意力机制的多组学单细胞数据对齐与整合
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf443.

本文引用的文献

1
scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.
2
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
3
Clustering single-cell multi-omics data with MoClust.
使用 MoClust 对单细胞多组学数据进行聚类。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac736.
4
A deep generative model for multi-view profiling of single-cell RNA-seq and ATAC-seq data.一种用于单细胞 RNA-seq 和 ATAC-seq 数据多视图分析的深度生成模型。
Genome Biol. 2022 Jan 12;23(1):20. doi: 10.1186/s13059-021-02595-6.
5
Pseudo-Label Guided Collective Matrix Factorization for Multiview Clustering.伪标签引导的多视图聚类协同矩阵分解。
IEEE Trans Cybern. 2022 Sep;52(9):8681-8691. doi: 10.1109/TCYB.2021.3051182. Epub 2022 Aug 18.
6
Dimensionality reduction for single cell RNA sequencing data using constrained robust non-negative matrix factorization.使用约束鲁棒非负矩阵分解对单细胞RNA测序数据进行降维
NAR Genom Bioinform. 2020 Aug 28;2(3):lqaa064. doi: 10.1093/nargab/lqaa064. eCollection 2020 Sep.
7
MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data.MOFA+:一种全面整合多模态单细胞数据的统计框架。
Genome Biol. 2020 May 11;21(1):111. doi: 10.1186/s13059-020-02015-1.
8
BREM-SC: a bayesian random effects mixture model for joint clustering single cell multi-omics data.BREM-SC:一种用于联合聚类单细胞多组学数据的贝叶斯随机效应混合模型。
Nucleic Acids Res. 2020 Jun 19;48(11):5814-5824. doi: 10.1093/nar/gkaa314.
9
Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。
Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.
10
Multiview Clustering Based on Non-Negative Matrix Factorization and Pairwise Measurements.基于非负矩阵分解和成对测量的多视图聚类
IEEE Trans Cybern. 2019 Sep;49(9):3333-3346. doi: 10.1109/TCYB.2018.2842052. Epub 2018 Jun 21.