
scSAMAC: saliency-adjusted masking induced attention contrastive learning for single-cell clustering.

Authors

Li Bo, Zhao Yongkang, Hu Jing, Zhang Shihua, Zhang Xiaolong

Affiliations

School of Computer Science and Technology, Wuhan University of Science and Technology, Huangjiahu west road 2#, Wuhan 430065, China.

Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Huangjiahu west road 2#, Wuhan 430065, China.

Publication information

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf128.

DOI:10.1093/bib/bbaf128
PMID:40131310
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11934584/
Abstract

Single-cell sequencing technology has enabled researchers to study cellular heterogeneity at the level of individual cells. To facilitate downstream analysis, clustering single-cell data into subgroups is essential. However, the high dimensionality, sparsity, and dropout events of the data make clustering challenging. Many deep learning methods have been proposed, but they either fail to fully utilize pairwise distance information between similar cells or do not adequately capture their feature correlations. They also cannot effectively handle high-dimensional sparse data. As a result, they are not suitable for high-fidelity clustering, which makes it difficult to resolve the clear cell types required for downstream analysis. The proposed scSAMAC method integrates contrastive learning and a negative binomial loss into a variational autoencoder, extracting features via contrastive unit similarity while preserving the intrinsic characteristics of the data. This enhances robustness and generalization during clustering. For contrastive learning, it constructs a mask module by adopting a negative sample generation method with gene feature saliency adjustment, which selects the features that are more influential in the clustering phase and simulates data missing events. Additionally, it develops a novel loss consisting of a soft k-means loss, a Wasserstein distance, and a contrastive loss, which fully utilizes the data information and improves clustering performance. Furthermore, a multi-head attention module is applied to the latent variables at each layer of the autoencoder to enhance feature correlation, integration, and information repair. Experimental results demonstrate that scSAMAC outperforms several state-of-the-art clustering methods.
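
The abstract mentions a negative binomial loss inside the variational autoencoder but does not give its form. As a minimal sketch, assuming the standard mean/dispersion parameterization that is common for scRNA-seq count models (not necessarily the exact formulation used by scSAMAC), the per-count negative log-likelihood could look like this. The function names `nb_negative_log_likelihood` and `mean_nb_loss` are hypothetical and chosen only for illustration.

```python
import math


def nb_negative_log_likelihood(x, mu, theta, eps=1e-10):
    """Negative log-likelihood of one count under a negative binomial.

    Assumed parameterization (an illustration, not taken from the paper):
    x     -- observed count (non-negative)
    mu    -- predicted mean, e.g. a decoder output
    theta -- dispersion (larger theta approaches a Poisson)
    eps   -- small constant that keeps the logarithms finite when mu is 0
    """
    log_prob = (
        math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1.0)
        + theta * (math.log(theta + eps) - math.log(theta + mu + eps))
        + x * (math.log(mu + eps) - math.log(theta + mu + eps))
    )
    return -log_prob


def mean_nb_loss(counts, means, theta):
    """Average the per-count loss over paired observed counts and predicted means."""
    losses = [nb_negative_log_likelihood(x, m, theta) for x, m in zip(counts, means)]
    return sum(losses) / len(losses)
```

In a real model, `mu` and `theta` would be produced per gene by the decoder and the loss would be computed on tensors with automatic differentiation; the scalar version here only shows the arithmetic.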


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/d210f8a53e48/bbaf128f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/84919bef8193/bbaf128f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/859c0bd2facc/bbaf128f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/94ec1b092d65/bbaf128f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/587e481f4e1c/bbaf128f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/3c4b0a40496c/bbaf128f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/a6a95d01229b/bbaf128f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c029/11934584/00d2e037a51d/bbaf128f8.jpg

