• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SC-JNMF:基于联合非负矩阵分解整合多种量化方法的单细胞聚类

SC-JNMF: single-cell clustering integrating multiple quantification methods based on joint non-negative matrix factorization.

作者信息

Shiga Mikio, Seno Shigeto, Onizuka Makoto, Matsuda Hideo

机构信息

Graduate School of Information Science and Technology, Osaka University, Osaka, Japan.

出版信息

PeerJ. 2021 Aug 27;9:e12087. doi: 10.7717/peerj.12087. eCollection 2021.

DOI:10.7717/peerj.12087
PMID:34532161
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8404576/
Abstract

Single-cell RNA-sequencing is a rapidly evolving technology that enables us to understand biological processes at unprecedented resolution. Single-cell expression analysis requires a complex data processing pipeline, and the pipeline is divided into two main parts: The quantification part, which converts the sequence information into gene-cell matrix data; the analysis part, which analyzes the matrix data using statistics and/or machine learning techniques. In the analysis part, unsupervised cell clustering plays an important role in identifying cell types and discovering cell diversity and subpopulations. Identified cell clusters are also used for subsequent analysis, such as finding differentially expressed genes and inferring cell trajectories. However, single-cell clustering using gene expression profiles shows different results depending on the quantification methods. Clustering results are greatly affected by the quantification method used in the upstream process. In other words, even if the original RNA-sequence data is the same, gene expression profiles processed by different quantification methods will produce different clusters. In this article, we propose a robust and highly accurate clustering method based on joint non-negative matrix factorization (joint-NMF) by utilizing the information from multiple gene expression profiles quantified using different methods from the same RNA-sequence data. Our joint-NMF can extract common factors among multiple gene expression profiles by applying each NMF under the constraint that one of the factorized matrices is shared among multiple NMFs. The joint-NMF determines more robust and accurate cell clustering results by leveraging multiple quantification methods compared to conventional clustering methods, which use only a single gene expression profile. Additionally, we showed the usefulness of discovering marker genes with the extracted features using our method.

摘要

单细胞RNA测序是一项快速发展的技术,它使我们能够以前所未有的分辨率理解生物过程。单细胞表达分析需要一个复杂的数据处理流程,该流程主要分为两个部分:量化部分,将序列信息转化为基因-细胞矩阵数据;分析部分,使用统计和/或机器学习技术分析矩阵数据。在分析部分,无监督细胞聚类在识别细胞类型、发现细胞多样性和亚群方面发挥着重要作用。识别出的细胞簇也用于后续分析,如寻找差异表达基因和推断细胞轨迹。然而,使用基因表达谱进行单细胞聚类时,根据量化方法的不同会显示出不同的结果。聚类结果受上游过程中使用的量化方法的影响很大。换句话说,即使原始RNA序列数据相同,不同量化方法处理的基因表达谱也会产生不同的聚类。在本文中,我们通过利用从同一RNA序列数据中使用不同方法量化得到的多个基因表达谱中的信息,提出了一种基于联合非负矩阵分解(joint-NMF)的稳健且高度准确的聚类方法。我们的联合非负矩阵分解可以通过在多个非负矩阵分解中共享其中一个分解矩阵的约束下应用每个非负矩阵分解,来提取多个基因表达谱中的共同因子。与仅使用单个基因表达谱的传统聚类方法相比,联合非负矩阵分解通过利用多种量化方法确定了更稳健、准确的细胞聚类结果。此外,我们展示了使用我们的方法通过提取的特征发现标记基因的实用性。

相似文献

1
SC-JNMF: single-cell clustering integrating multiple quantification methods based on joint non-negative matrix factorization.SC-JNMF:基于联合非负矩阵分解整合多种量化方法的单细胞聚类
PeerJ. 2021 Aug 27;9:e12087. doi: 10.7717/peerj.12087. eCollection 2021.
2
Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization.通过非负矩阵分解检测单细胞RNA测序数据中的异质性。
PeerJ. 2017 Jan 19;5:e2888. doi: 10.7717/peerj.2888. eCollection 2017.
3
A robust semi-supervised NMF model for single cell RNA-seq data.一种用于单细胞RNA测序数据的强大半监督非负矩阵分解模型。
PeerJ. 2020 Oct 16;8:e10091. doi: 10.7717/peerj.10091. eCollection 2020.
4
A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering.一种稳健的流形图正则化非负矩阵分解算法在癌症基因聚类中的应用。
Molecules. 2017 Dec 2;22(12):2131. doi: 10.3390/molecules22122131.
5
NMF Clustering: Accessible NMF-based Clustering Utilizing GPU Acceleration.非负矩阵分解聚类:利用GPU加速的基于非负矩阵分解的可访问聚类。
J Bioinform Syst Biol. 2023;6(4):379-383. doi: 10.26502/jbsb.5107072. Epub 2023 Dec 21.
6
Robust classification of single-cell transcriptome data by nonnegative matrix factorization.通过非负矩阵分解对单细胞转录组数据进行稳健分类。
Bioinformatics. 2017 Jan 15;33(2):235-242. doi: 10.1093/bioinformatics/btw607. Epub 2016 Sep 23.
7
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.基于联合字典学习的非负矩阵分解用于口腔手术后语音转换以提高语音清晰度
IEEE Trans Biomed Eng. 2017 Nov;64(11):2584-2594. doi: 10.1109/TBME.2016.2644258.
8
Hessian regularization based non-negative matrix factorization for gene expression data clustering.基于Hessian正则化的非负矩阵分解用于基因表达数据聚类
Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:4130-3. doi: 10.1109/EMBC.2015.7319303.
9
Joint learning dimension reduction and clustering of single-cell RNA-sequencing data.单细胞 RNA 测序数据的联合降维和聚类学习。
Bioinformatics. 2020 Jun 1;36(12):3825-3832. doi: 10.1093/bioinformatics/btaa231.
10
Regularized Non-Negative Matrix Factorization for Identifying Differentially Expressed Genes and Clustering Samples: A Survey.正则化非负矩阵分解在差异表达基因识别和样本聚类中的应用综述。
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):974-987. doi: 10.1109/TCBB.2017.2665557. Epub 2017 Feb 7.

引用本文的文献

1
Integrating Multi-omics Data for Alzheimer's Disease to Explore Its Biomarkers Via the Hypergraph-Regularized Joint Deep Semi-Non-Negative Matrix Factorization Algorithm.通过超图正则化联合深度半非负矩阵分解算法整合阿尔茨海默病的多组学数据以探索其生物标志物
J Mol Neurosci. 2024 Apr 15;74(2):43. doi: 10.1007/s12031-024-02211-9.
2
GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership.GoM DE:利用允许成员等级的差异表达分析来解释序列计数数据中的结构。
Genome Biol. 2023 Oct 19;24(1):236. doi: 10.1186/s13059-023-03067-9.
3
Clustering single-cell multimodal omics data with jrSiCKLSNMF.

本文引用的文献

1
A robust semi-supervised NMF model for single cell RNA-seq data.一种用于单细胞RNA测序数据的强大半监督非负矩阵分解模型。
PeerJ. 2020 Oct 16;8:e10091. doi: 10.7717/peerj.10091. eCollection 2020.
2
SciPy 1.0: fundamental algorithms for scientific computing in Python.SciPy 1.0:Python 中的科学计算基础算法。
Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.
3
scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles.scAI:一种用于平行单细胞转录组学和表观基因组学综合分析的无监督方法。
使用jrSiCKLSNMF对单细胞多组学数据进行聚类。
Front Genet. 2023 Jun 9;14:1179439. doi: 10.3389/fgene.2023.1179439. eCollection 2023.
4
scINSIGHT for interpreting single-cell gene expression from biologically heterogeneous data.scINSIGHT 用于解释生物异质数据中的单细胞基因表达。
Genome Biol. 2022 Mar 21;23(1):82. doi: 10.1186/s13059-022-02649-3.
Genome Biol. 2020 Feb 3;21(1):25. doi: 10.1186/s13059-020-1932-8.
4
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis.单细胞 RNA-seq 分析中降维方法的准确性、鲁棒性和可扩展性。
Genome Biol. 2019 Dec 10;20(1):269. doi: 10.1186/s13059-019-1898-6.
5
A systematic evaluation of single cell RNA-seq analysis pipelines.单细胞 RNA 测序分析流程的系统评价。
Nat Commun. 2019 Oct 11;10(1):4667. doi: 10.1038/s41467-019-12266-7.
6
Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments.使用混合对照实验对标单细胞 RNA 测序分析流程。
Nat Methods. 2019 Jun;16(6):479-487. doi: 10.1038/s41592-019-0425-8. Epub 2019 May 27.
7
Alevin efficiently estimates accurate gene abundances from dscRNA-seq data.Alevin 能够有效地从 dscRNA-seq 数据中估计准确的基因丰度。
Genome Biol. 2019 Mar 27;20(1):65. doi: 10.1186/s13059-019-1670-y.
8
RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types.RNA-Seq 特征通过 mRNA 丰度标准化可实现人类免疫细胞类型的绝对分解。
Cell Rep. 2019 Feb 5;26(6):1627-1640.e7. doi: 10.1016/j.celrep.2019.01.041.
9
Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。
Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.
10
Comparison of clustering tools in R for medium-sized 10x Genomics single-cell RNA-sequencing data.用于中等规模10x基因组学单细胞RNA测序数据的R语言聚类工具比较
F1000Res. 2018 Aug 15;7:1297. doi: 10.12688/f1000research.15809.2. eCollection 2018.