• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scZAG:基于 ZINB 的自动编码器与自适应数据增强图对比学习在 scRNA-seq 聚类中的整合。

scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering.

机构信息

College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China.

出版信息

Int J Mol Sci. 2024 May 29;25(11):5976. doi: 10.3390/ijms25115976.

DOI:10.3390/ijms25115976
PMID:38892162
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11172799/
Abstract

Single-cell RNA sequencing (scRNA-seq) is widely used to interpret cellular states, detect cell subpopulations, and study disease mechanisms. In scRNA-seq data analysis, cell clustering is a key step that can identify cell types. However, scRNA-seq data are characterized by high dimensionality and significant sparsity, presenting considerable challenges for clustering. In the high-dimensional gene expression space, cells may form complex topological structures. Many conventional scRNA-seq data analysis methods focus on identifying cell subgroups rather than exploring these potential high-dimensional structures in detail. Although some methods have begun to consider the topological structures within the data, many still overlook the continuity and complex topology present in single-cell data. We propose a deep learning framework that begins by employing a zero-inflated negative binomial (ZINB) model to denoise the highly sparse and over-dispersed scRNA-seq data. Next, scZAG uses an adaptive graph contrastive representation learning approach that combines approximate personalized propagation of neural predictions graph convolution (APPNPGCN) with graph contrastive learning methods. By using APPNPGCN as the encoder for graph contrastive learning, we ensure that each cell's representation reflects not only its own features but also its position in the graph and its relationships with other cells. Graph contrastive learning exploits the relationships between nodes to capture the similarity among cells, better representing the data's underlying continuity and complex topology. Finally, the learned low-dimensional latent representations are clustered using Kullback-Leibler divergence. We validated the superior clustering performance of scZAG on 10 common scRNA-seq datasets in comparison to existing state-of-the-art clustering methods.

摘要

单细胞 RNA 测序 (scRNA-seq) 被广泛用于解释细胞状态、检测细胞亚群和研究疾病机制。在 scRNA-seq 数据分析中,细胞聚类是识别细胞类型的关键步骤。然而,scRNA-seq 数据具有高维性和显著的稀疏性,这给聚类带来了相当大的挑战。在高维基因表达空间中,细胞可能形成复杂的拓扑结构。许多传统的 scRNA-seq 数据分析方法侧重于识别细胞亚群,而不是详细探索这些潜在的高维结构。尽管一些方法已经开始考虑数据中的拓扑结构,但许多方法仍然忽略了单细胞数据中的连续性和复杂拓扑结构。我们提出了一个深度学习框架,该框架首先使用零膨胀负二项式 (ZINB) 模型对高度稀疏和过度分散的 scRNA-seq 数据进行去噪。接下来,scZAG 使用自适应图对比表示学习方法,该方法结合了近似个性化传播神经预测图卷积 (APPNPGCN) 和图对比学习方法。通过使用 APPNPGCN 作为图对比学习的编码器,我们确保每个细胞的表示不仅反映了其自身的特征,还反映了其在图中的位置及其与其他细胞的关系。图对比学习利用节点之间的关系来捕获细胞之间的相似性,更好地表示数据的潜在连续性和复杂拓扑结构。最后,使用 Kullback-Leibler 散度对学习到的低维潜在表示进行聚类。我们在 10 个常见的 scRNA-seq 数据集上验证了 scZAG 优于现有最先进的聚类方法的优越聚类性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/f1ab4091d706/ijms-25-05976-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/cdd0488620a4/ijms-25-05976-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/f8e4f042da97/ijms-25-05976-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/b70b89f90450/ijms-25-05976-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/09b40a61ea4f/ijms-25-05976-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/f1ab4091d706/ijms-25-05976-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/cdd0488620a4/ijms-25-05976-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/f8e4f042da97/ijms-25-05976-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/b70b89f90450/ijms-25-05976-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/09b40a61ea4f/ijms-25-05976-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb7/11172799/f1ab4091d706/ijms-25-05976-g005.jpg

相似文献

1
scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering.scZAG:基于 ZINB 的自动编码器与自适应数据增强图对比学习在 scRNA-seq 聚类中的整合。
Int J Mol Sci. 2024 May 29;25(11):5976. doi: 10.3390/ijms25115976.
2
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
3
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.
4
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.基于自动编码器和图神经网络的单细胞 RNA-seq 数据深度结构聚类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac018.
5
nsDCC: dual-level contrastive clustering with nonuniform sampling for scRNA-seq data analysis.nsDCC:基于非均匀采样的双层对比聚类算法,用于 scRNA-seq 数据分析。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae477.
6
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
7
Single-cell RNA sequencing data analysis utilizing multi-type graph neural networks.利用多种类型图神经网络进行单细胞 RNA 测序数据分析。
Comput Biol Med. 2024 Sep;179:108921. doi: 10.1016/j.compbiomed.2024.108921. Epub 2024 Jul 25.
8
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
9
scGAAC: A graph attention autoencoder for clustering single-cell RNA-sequencing data.scGAAC:一种用于单细胞RNA测序数据聚类的图注意力自动编码器。
Methods. 2024 Sep;229:115-124. doi: 10.1016/j.ymeth.2024.06.010. Epub 2024 Jun 29.
10
Attention-based deep clustering method for scRNA-seq cell type identification.基于注意力机制的深度聚类方法在 scRNA-seq 细胞类型鉴定中的应用。
PLoS Comput Biol. 2023 Nov 10;19(11):e1011641. doi: 10.1371/journal.pcbi.1011641. eCollection 2023 Nov.

引用本文的文献

1
spaMGCN: a graph convolutional network with autoencoder for spatial domain identification using multi-scale adaptation.spaMGCN:一种带有自动编码器的图卷积网络,用于通过多尺度自适应进行空间域识别。
Genome Biol. 2025 Jun 10;26(1):159. doi: 10.1186/s13059-025-03637-z.
2
cfDiffusion: diffusion-based efficient generation of high quality scRNA-seq data with classifier-free guidance.cfDiffusion:基于扩散的高质量单细胞RNA测序数据高效生成,无分类器引导。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf071.
3
scDRMAE: integrating masked autoencoder with residual attention networks to leverage omics feature dependencies for accurate cell clustering.

本文引用的文献

1
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
2
A universal deep neural network for in-depth cleaning of single-cell RNA-Seq data.一种用于单细胞 RNA-Seq 数据深度清洗的通用深度神经网络。
Nat Commun. 2022 Apr 7;13(1):1901. doi: 10.1038/s41467-022-29576-y.
3
scNAME: neighborhood contrastive clustering with ancillary mask estimation for scRNA-seq data.scNAME:基于辅助掩模估计的 scRNA-seq 数据邻域对比聚类。
scDRMAE:集成掩蔽自动编码器和残差注意力网络,利用组学特征依赖性进行准确的细胞聚类。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae599.
Bioinformatics. 2022 Mar 4;38(6):1575-1583. doi: 10.1093/bioinformatics/btac011.
4
A topology-preserving dimensionality reduction method for single-cell RNA-seq data using graph autoencoder.基于图自动编码器的单细胞 RNA-seq 数据拓扑保持降维方法。
Sci Rep. 2021 Oct 8;11(1):20028. doi: 10.1038/s41598-021-99003-7.
5
UICPC: Centrality-based clustering for scRNA-seq data analysis without user input.UICPC:无需用户输入的基于中心度的 scRNA-seq 数据分析聚类方法。
Comput Biol Med. 2021 Oct;137:104820. doi: 10.1016/j.compbiomed.2021.104820. Epub 2021 Sep 3.
6
Contrastive self-supervised clustering of scRNA-seq data.单细胞 RNA 测序数据的对比自监督聚类。
BMC Bioinformatics. 2021 May 27;22(1):280. doi: 10.1186/s12859-021-04210-8.
7
scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses.scGNN 是一种用于单细胞 RNA-Seq 分析的新型图神经网络框架。
Nat Commun. 2021 Mar 25;12(1):1882. doi: 10.1038/s41467-021-22197-x.
8
Deep soft -means clustering with self-training for single-cell RNA sequence data.用于单细胞RNA序列数据的基于自训练的深度软均值聚类
NAR Genom Bioinform. 2020 May 25;2(2):lqaa039. doi: 10.1093/nargab/lqaa039. eCollection 2020 Jun.
9
Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。
Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.
10
The adult human testis transcriptional cell atlas.成人睾丸转录组细胞图谱。
Cell Res. 2018 Dec;28(12):1141-1157. doi: 10.1038/s41422-018-0099-2. Epub 2018 Oct 12.