• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NMFLRR:基于非负矩阵分解和低秩表示整合的 scRNA-Seq 数据聚类

NMFLRR: Clustering scRNA-Seq Data by Integrating Nonnegative Matrix Factorization With Low Rank Representation.

出版信息

IEEE J Biomed Health Inform. 2022 Mar;26(3):1394-1405. doi: 10.1109/JBHI.2021.3099127. Epub 2022 Mar 7.

DOI:10.1109/JBHI.2021.3099127
PMID:34310328
Abstract

Fast-developing single-cell technologies create unprecedented opportunities to reveal cell heterogeneity and diversity. Accurate classification of single cells is a critical prerequisite for recovering the mechanisms of heterogeneity. However, the scRNA-seq profiles we obtained at present have high dimensionality, sparsity, and noise, which pose challenges for existing clustering methods in grouping cells that belong to the same subpopulation based on transcriptomic profiles. Although many computational methods have been proposed developing novel and effective computational methods to accurately identify cell types remains a considerable challenge. We present a new computational framework to identify cell types by integrating low-rank representation (LRR) and nonnegative matrix factorization (NMF); this framework is named NMFLRR. The LRR captures the global properties of original data by using nuclear norms, and a locality constrained graph regularization term is introduced to characterize the data's local geometric information. The similarity matrix and low-dimensional features of data can be simultaneously obtained by applying the alternating direction method of multipliers (ADMM) algorithm to handle each variable alternatively in an iterative way. We finally obtained the predicted cell types by using a spectral algorithm based on the optimized similarity matrix. Nine real scRNA-seq datasets were used to test the performance of NMFLRR and fifteen other competitive methods, and the accuracy and robustness of the simulation results suggest the NMFLRR is a promising algorithm for the classification of single cells. The simulation code is freely available at: https://github.com/wzhangwhu/NMFLRR_code.

摘要

单细胞技术的快速发展为揭示细胞异质性和多样性创造了前所未有的机会。准确的单细胞分类是恢复异质性机制的关键前提。然而,我们目前获得的 scRNA-seq 图谱具有高维性、稀疏性和噪声,这对基于转录组图谱对属于同一亚群的细胞进行分组的现有聚类方法提出了挑战。尽管已经提出了许多计算方法来开发新颖有效的计算方法,但准确识别细胞类型仍然是一个相当大的挑战。我们提出了一种新的计算框架,通过整合低秩表示(LRR)和非负矩阵分解(NMF)来识别细胞类型;该框架命名为 NMFLRR。LRR 通过使用核范数来捕捉原始数据的全局特性,并引入局部约束图正则化项来描述数据的局部几何信息。通过交替方向乘子法(ADMM)算法交替处理每个变量,可以同时获得相似性矩阵和数据的低维特征。我们最后通过使用基于优化相似性矩阵的谱算法获得预测的细胞类型。九个真实的 scRNA-seq 数据集被用于测试 NMFLRR 和其他十五种竞争方法的性能,模拟结果的准确性和稳健性表明 NMFLRR 是一种有前途的单细胞分类算法。模拟代码可在以下网址免费获取:https://github.com/wzhangwhu/NMFLRR_code。

相似文献

1
NMFLRR: Clustering scRNA-Seq Data by Integrating Nonnegative Matrix Factorization With Low Rank Representation.NMFLRR:基于非负矩阵分解和低秩表示整合的 scRNA-Seq 数据聚类
IEEE J Biomed Health Inform. 2022 Mar;26(3):1394-1405. doi: 10.1109/JBHI.2021.3099127. Epub 2022 Mar 7.
2
SCCLRR: A Robust Computational Method for Accurate Clustering Single Cell RNA-Seq Data.SCCLRR:一种用于准确聚类单细胞 RNA-Seq 数据的稳健计算方法。
IEEE J Biomed Health Inform. 2021 Jan;25(1):247-256. doi: 10.1109/JBHI.2020.2991172. Epub 2021 Jan 5.
3
JLONMFSC: Clustering scRNA-seq data based on joint learning of non-negative matrix factorization and subspace clustering.JLONMFSC:基于非负矩阵分解和子空间聚类联合学习的 scRNA-seq 数据聚类。
Methods. 2024 Feb;222:1-9. doi: 10.1016/j.ymeth.2023.11.019. Epub 2023 Dec 19.
4
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
5
ARGLRR: A Sparse Low-Rank Representation Single-Cell RNA-Sequencing Data Clustering Method Combined with a New Graph Regularization.ARGLRR:一种基于稀疏低秩表示和新图正则化的单细胞 RNA 测序数据聚类方法。
J Comput Biol. 2023 Aug;30(8):848-860. doi: 10.1089/cmb.2023.0077. Epub 2023 Jul 20.
6
Combining Global-Constrained Concept Factorization and a Regularized Gaussian Graphical Model for Clustering Single-Cell RNA-seq Data.基于全局约束概念分解和正则化高斯图模型的单细胞 RNA-seq 数据聚类分析。
Interdiscip Sci. 2024 Mar;16(1):1-15. doi: 10.1007/s12539-023-00587-7. Epub 2023 Oct 10.
7
Network-Based Structural Learning Nonnegative Matrix Factorization Algorithm for Clustering of scRNA-Seq Data.用于scRNA-Seq数据聚类的基于网络的结构学习非负矩阵分解算法
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):566-575. doi: 10.1109/TCBB.2022.3161131. Epub 2023 Feb 3.
8
NLRRC: A Novel Clustering Method of Jointing Non-Negative LRR and Random Walk Graph Regularized NMF for Single-Cell Type Identification.NLRRC:一种联合非负 LRR 和随机游走图正则化 NMF 的新型聚类方法,用于单细胞类型鉴定。
IEEE J Biomed Health Inform. 2023 Oct;27(10):5199-5209. doi: 10.1109/JBHI.2023.3299748. Epub 2023 Oct 5.
9
Robust Graph Regularized NMF with Dissimilarity and Similarity Constraints for ScRNA-seq Data Clustering.具有差异和相似性约束的鲁棒图正则化非负矩阵分解用于单细胞RNA测序数据聚类
J Chem Inf Model. 2022 Dec 12;62(23):6271-6286. doi: 10.1021/acs.jcim.2c01305. Epub 2022 Dec 2.
10
Unsupervised Cluster Analysis and Gene Marker Extraction of scRNA-seq Data Based On Non-Negative Matrix Factorization.基于非负矩阵分解的 scRNA-seq 数据无监督聚类分析和基因标志物提取。
IEEE J Biomed Health Inform. 2022 Jan;26(1):458-467. doi: 10.1109/JBHI.2021.3091506. Epub 2022 Jan 17.

引用本文的文献

1
scRDEN: single-cell dynamic gene rank differential expression network and robust trajectory inference.scRDEN:单细胞动态基因排名差异表达网络及稳健轨迹推断
Sci Rep. 2025 May 15;15(1):16963. doi: 10.1038/s41598-025-01969-1.
2
Accurate identification of single-cell types via correntropy-based Sparse PCA combining hypergraph and fusion similarity.基于核熵的稀疏主成分分析结合超图和融合相似性对单细胞类型进行准确识别。
J Appl Stat. 2024 Jul 21;52(2):356-380. doi: 10.1080/02664763.2024.2369955. eCollection 2025.
3
Identifying cell types by lasso-constraint regularized Gaussian graphical model based on weighted distance penalty.
基于加权距离惩罚的lasso 约束正则化高斯图形模型识别细胞类型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae572.
4
Cauchy hyper-graph Laplacian nonnegative matrix factorization for single-cell RNA-sequencing data analysis.基于柯西超图拉普拉斯的非负矩阵分解方法在单细胞 RNA-seq 数据分析中的应用。
BMC Bioinformatics. 2024 Apr 29;25(1):169. doi: 10.1186/s12859-024-05797-4.
5
Combining Global-Constrained Concept Factorization and a Regularized Gaussian Graphical Model for Clustering Single-Cell RNA-seq Data.基于全局约束概念分解和正则化高斯图模型的单细胞 RNA-seq 数据聚类分析。
Interdiscip Sci. 2024 Mar;16(1):1-15. doi: 10.1007/s12539-023-00587-7. Epub 2023 Oct 10.
6
Joint Learning of Correlation-Constrained Fuzzy Clustering and Discriminative Non-Negative Representation for Hyperspectral Band Selection.基于相关约束模糊聚类和判别非负表示的联合学习的高光谱波段选择。
Sensors (Basel). 2023 May 17;23(10):4838. doi: 10.3390/s23104838.
7
Data analysis guidelines for single-cell RNA-seq in biomedical studies and clinical applications.生物医学研究和临床应用中单细胞 RNA-seq 的数据分析指南。
Mil Med Res. 2022 Dec 2;9(1):68. doi: 10.1186/s40779-022-00434-8.