• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于正则化高斯图模型的单细胞 RNA-Seq 数据聚类。

Clustering Single-Cell RNA-Seq Data with Regularized Gaussian Graphical Model.

机构信息

Department of Public Health Sciences, Pennsylvania State University College of Medicine, 500 University Drive, Hershey, PA 17033, USA.

出版信息

Genes (Basel). 2021 Feb 22;12(2):311. doi: 10.3390/genes12020311.

DOI:10.3390/genes12020311
PMID:33671799
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7927011/
Abstract

Single-cell RNA-seq (scRNA-seq) is a powerful tool to measure the expression patterns of individual cells and discover heterogeneity and functional diversity among cell populations. Due to variability, it is challenging to analyze such data efficiently. Many clustering methods have been developed using at least one free parameter. Different choices for free parameters may lead to substantially different visualizations and clusters. Tuning free parameters is also time consuming. Thus there is need for a simple, robust, and efficient clustering method. In this paper, we propose a new regularized Gaussian graphical clustering (RGGC) method for scRNA-seq data. RGGC is based on high-order (partial) correlations and subspace learning, and is robust over a wide-range of a regularized parameter λ. Therefore, we can simply set λ=2 or λ=log(p) for AIC (Akaike information criterion) or BIC (Bayesian information criterion) without cross-validation. Cell subpopulations are discovered by the Louvain community detection algorithm that determines the number of clusters automatically. There is no free parameter to be tuned with RGGC. When evaluated with simulated and benchmark scRNA-seq data sets against widely used methods, RGGC is computationally efficient and one of the top performers. It can detect inter-sample cell heterogeneity, when applied to glioblastoma scRNA-seq data.

摘要

单细胞 RNA 测序 (scRNA-seq) 是一种强大的工具,可以测量单个细胞的表达模式,并发现细胞群体中的异质性和功能多样性。由于存在可变性,因此高效分析此类数据具有挑战性。已经开发了许多使用至少一个自由参数的聚类方法。自由参数的不同选择可能导致截然不同的可视化和聚类。调整自由参数也很耗时。因此,需要一种简单、稳健且高效的聚类方法。在本文中,我们提出了一种用于 scRNA-seq 数据的新正则化高斯图形聚类 (RGGC) 方法。RGGC 基于高阶(偏)相关和子空间学习,在广泛的正则化参数 λ 范围内具有稳健性。因此,我们可以简单地将 λ=2 或 λ=log(p) 设置为 AIC(赤池信息量准则)或 BIC(贝叶斯信息量准则),而无需进行交叉验证。细胞亚群通过 Louvain 社区检测算法发现,该算法自动确定聚类的数量。RGGC 没有要调整的自由参数。当使用模拟和基准 scRNA-seq 数据集与广泛使用的方法进行评估时,RGGC 计算效率高,是表现最好的方法之一。当应用于胶质母细胞瘤 scRNA-seq 数据时,它可以检测样本间的细胞异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b52/7927011/3a205f49e99a/genes-12-00311-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b52/7927011/3e62c27f3cf7/genes-12-00311-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b52/7927011/3a205f49e99a/genes-12-00311-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b52/7927011/3e62c27f3cf7/genes-12-00311-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b52/7927011/3a205f49e99a/genes-12-00311-g002.jpg

相似文献

1
Clustering Single-Cell RNA-Seq Data with Regularized Gaussian Graphical Model.基于正则化高斯图模型的单细胞 RNA-Seq 数据聚类。
Genes (Basel). 2021 Feb 22;12(2):311. doi: 10.3390/genes12020311.
2
Combining Global-Constrained Concept Factorization and a Regularized Gaussian Graphical Model for Clustering Single-Cell RNA-seq Data.基于全局约束概念分解和正则化高斯图模型的单细胞 RNA-seq 数据聚类分析。
Interdiscip Sci. 2024 Mar;16(1):1-15. doi: 10.1007/s12539-023-00587-7. Epub 2023 Oct 10.
3
JLONMFSC: Clustering scRNA-seq data based on joint learning of non-negative matrix factorization and subspace clustering.JLONMFSC:基于非负矩阵分解和子空间聚类联合学习的 scRNA-seq 数据聚类。
Methods. 2024 Feb;222:1-9. doi: 10.1016/j.ymeth.2023.11.019. Epub 2023 Dec 19.
4
Adaptive Total-Variation Regularized Low-Rank Representation for Analyzing Single-Cell RNA-seq Data.自适应全变差正则化低秩表示用于分析单细胞 RNA-seq 数据。
Interdiscip Sci. 2021 Sep;13(3):476-489. doi: 10.1007/s12539-021-00444-5. Epub 2021 Jun 2.
5
SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data.SDImpute:一种基于单细胞 RNA-seq 数据中细胞水平和基因水平信息的统计分块插补方法。
PLoS Comput Biol. 2021 Jun 17;17(6):e1009118. doi: 10.1371/journal.pcbi.1009118. eCollection 2021 Jun.
6
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
7
Integrating Deep Supervised, Self-Supervised and Unsupervised Learning for Single-Cell RNA-seq Clustering and Annotation.将深度监督学习、自监督学习和无监督学习相结合进行单细胞 RNA-seq 聚类和注释。
Genes (Basel). 2020 Jul 14;11(7):792. doi: 10.3390/genes11070792.
8
Robust Graph Regularized NMF with Dissimilarity and Similarity Constraints for ScRNA-seq Data Clustering.具有差异和相似性约束的鲁棒图正则化非负矩阵分解用于单细胞RNA测序数据聚类
J Chem Inf Model. 2022 Dec 12;62(23):6271-6286. doi: 10.1021/acs.jcim.2c01305. Epub 2022 Dec 2.
9
A parameter-free deep embedded clustering method for single-cell RNA-seq data.一种无参数深度嵌入聚类方法,用于单细胞 RNA-seq 数据。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac172.
10
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.

引用本文的文献

1
Identifying cell types by lasso-constraint regularized Gaussian graphical model based on weighted distance penalty.基于加权距离惩罚的lasso 约束正则化高斯图形模型识别细胞类型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae572.
2
scEM: A New Ensemble Framework for Predicting Cell Type Composition Based on scRNA-Seq Data.scEM:一种基于 scRNA-Seq 数据预测细胞类型组成的新集成框架。
Interdiscip Sci. 2024 Jun;16(2):304-317. doi: 10.1007/s12539-023-00601-y. Epub 2024 Feb 18.
3
Quantum annealing-based clustering of single cell RNA-seq data.

本文引用的文献

1
Review of single-cell RNA-seq data clustering for cell-type identification and characterization.单细胞 RNA-seq 数据聚类用于细胞类型鉴定和特征分析的综述。
RNA. 2023 May;29(5):517-530. doi: 10.1261/rna.078965.121. Epub 2023 Feb 3.
2
Visualizing Single-Cell RNA-seq Data with Semisupervised Principal Component Analysis.基于半监督主成分分析的单细胞 RNA-seq 数据可视化
Int J Mol Sci. 2020 Aug 12;21(16):5797. doi: 10.3390/ijms21165797.
3
SinNLRR: a robust subspace clustering method for cell type detection by non-negative and low-rank representation.
基于量子退火的单细胞 RNA-seq 数据聚类。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad377.
4
Type 2 cytokines sensitize human sensory neurons to itch-associated stimuli.2型细胞因子使人类感觉神经元对瘙痒相关刺激敏感。
Front Mol Neurosci. 2023 Oct 5;16:1258823. doi: 10.3389/fnmol.2023.1258823. eCollection 2023.
5
Combining Global-Constrained Concept Factorization and a Regularized Gaussian Graphical Model for Clustering Single-Cell RNA-seq Data.基于全局约束概念分解和正则化高斯图模型的单细胞 RNA-seq 数据聚类分析。
Interdiscip Sci. 2024 Mar;16(1):1-15. doi: 10.1007/s12539-023-00587-7. Epub 2023 Oct 10.
6
A pulmonologist's guide to perform and analyse cross-species single lung cell transcriptomics.肺科医生进行跨物种单细胞转录组学分析的操作指南。
Eur Respir Rev. 2022 Jul 27;31(165). doi: 10.1183/16000617.0056-2022. Print 2022 Sep 30.
7
Network Approaches for Charting the Transcriptomic and Epigenetic Landscape of the Developmental Origins of Health and Disease.网络方法绘制健康与疾病的发育起源的转录组学和表观遗传学图谱。
Genes (Basel). 2022 Apr 26;13(5):764. doi: 10.3390/genes13050764.
8
Accurate Single-Cell Clustering through Ensemble Similarity Learning.通过集成相似性学习实现准确的单细胞聚类。
Genes (Basel). 2021 Oct 22;12(11):1670. doi: 10.3390/genes12111670.
SinNLRR:一种基于非负低秩表示的稳健子空间聚类方法,用于细胞类型检测。
Bioinformatics. 2019 Oct 1;35(19):3642-3650. doi: 10.1093/bioinformatics/btz139.
4
Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。
Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.
5
VASC: Dimension Reduction and Visualization of Single-cell RNA-seq Data by Deep Variational Autoencoder.VASC:基于深度变分自动编码器的单细胞 RNA-seq 数据降维和可视化。
Genomics Proteomics Bioinformatics. 2018 Oct;16(5):320-331. doi: 10.1016/j.gpb.2018.08.003. Epub 2018 Dec 18.
6
Recovering Gene Interactions from Single-Cell Data Using Data Diffusion.利用数据扩散从单细胞数据中恢复基因相互作用。
Cell. 2018 Jul 26;174(3):716-729.e27. doi: 10.1016/j.cell.2018.05.061. Epub 2018 Jun 28.
7
Interpretable dimensionality reduction of single cell transcriptome data with deep generative models.基于深度生成模型的单细胞转录组数据的可解释维度约简。
Nat Commun. 2018 May 21;9(1):2002. doi: 10.1038/s41467-018-04368-5.
8
SCANPY: large-scale single-cell gene expression data analysis.SCANPY:大规模单细胞基因表达数据分析。
Genome Biol. 2018 Feb 6;19(1):15. doi: 10.1186/s13059-017-1382-0.
9
SAIC: an iterative clustering approach for analysis of single cell RNA-seq data.SAIC:一种用于分析单细胞 RNA-seq 数据的迭代聚类方法。
BMC Genomics. 2017 Oct 3;18(Suppl 6):689. doi: 10.1186/s12864-017-4019-5.
10
Splatter: simulation of single-cell RNA sequencing data.Splatter:单细胞 RNA 测序数据模拟。
Genome Biol. 2017 Sep 12;18(1):174. doi: 10.1186/s13059-017-1305-0.