• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scAMZI:用于scRNA序列数据聚类的带零膨胀层的基于注意力的深度自动编码器。

scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data.

作者信息

Yuan Lin, Xu Zhijie, Meng Boyuan, Ye Lan

机构信息

Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), 3501 Daxue Road, Jinan, 250353, China.

Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), 3501 Daxue Road, Jinan, 250353, China.

出版信息

BMC Genomics. 2025 Apr 7;26(1):350. doi: 10.1186/s12864-025-11511-2.

DOI:10.1186/s12864-025-11511-2
PMID:40197174
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11974017/
Abstract

BACKGROUND

Clustering scRNA-seq data plays a vital role in scRNA-seq data analysis and downstream analyses. Many computational methods have been proposed and achieved remarkable results. However, there are several limitations of these methods. First, they do not fully exploit cellular features. Second, they are developed based on gene expression information and lack of flexibility in integrating intercellular relationships. Finally, the performance of these methods is affected by dropout event.

RESULTS

We propose a novel deep learning (DL) model based on attention autoencoder and zero-inflated (ZI) layer, namely scAMZI, to cluster scRNA-seq data. scAMZI is mainly composed of SimAM (a Simple, parameter-free Attention Module), autoencoder, ZINB (Zero-Inflated Negative Binomial) model and ZI layer. Based on ZINB model, we introduce autoencoder and SimAM to reduce dimensionality of data and learn feature representations of cells and relationships between cells. Meanwhile, ZI layer is used to handle zero values in the data. We compare the performance of scAMZI with nine methods (three shallow learning algorithms and six state-of-the-art DL-based methods) on fourteen benchmark scRNA-seq datasets of various sizes (from hundreds to tens of thousands of cells) with known cell types. Experimental results demonstrate that scAMZI outperforms competing methods.

CONCLUSIONS

scAMZI outperforms competing methods and can facilitate downstream analyses such as cell annotation, marker gene discovery, and cell trajectory inference. The package of scAMZI is made freely available at https://doi.org/10.5281/zenodo.13131559 .

摘要

背景

对单细胞RNA测序(scRNA-seq)数据进行聚类在scRNA-seq数据分析及下游分析中起着至关重要的作用。已经提出了许多计算方法并取得了显著成果。然而,这些方法存在一些局限性。首先,它们没有充分利用细胞特征。其次,它们是基于基因表达信息开发的,在整合细胞间关系方面缺乏灵活性。最后,这些方法的性能会受到数据丢失事件的影响。

结果

我们提出了一种基于注意力自动编码器和零膨胀(ZI)层的新型深度学习(DL)模型,即scAMZI,用于对scRNA-seq数据进行聚类。scAMZI主要由SimAM(一种简单的、无参数的注意力模块)、自动编码器、零膨胀负二项分布(ZINB)模型和ZI层组成。基于ZINB模型,我们引入自动编码器和SimAM来降低数据维度,并学习细胞的特征表示以及细胞间的关系。同时,ZI层用于处理数据中的零值。我们在14个具有已知细胞类型的不同大小(从数百个到数万个细胞)的基准scRNA-seq数据集上,将scAMZI的性能与9种方法(3种浅层学习算法和6种基于深度学习的先进方法)进行了比较。实验结果表明,scAMZI优于其他竞争方法。

结论

scAMZI优于其他竞争方法,可促进细胞注释、标记基因发现和细胞轨迹推断等下游分析。scAMZI软件包可在https://doi.org/10.5281/zenodo.13131559上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/2a53b2c51898/12864_2025_11511_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/a567cc42ae6e/12864_2025_11511_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/01a4168c34d5/12864_2025_11511_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/a6292d492ac9/12864_2025_11511_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/3053b878ac57/12864_2025_11511_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/5622b58d216c/12864_2025_11511_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/723876a06c7e/12864_2025_11511_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/befe2664655b/12864_2025_11511_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/2a53b2c51898/12864_2025_11511_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/a567cc42ae6e/12864_2025_11511_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/01a4168c34d5/12864_2025_11511_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/a6292d492ac9/12864_2025_11511_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/3053b878ac57/12864_2025_11511_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/5622b58d216c/12864_2025_11511_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/723876a06c7e/12864_2025_11511_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/befe2664655b/12864_2025_11511_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65f4/11974017/2a53b2c51898/12864_2025_11511_Fig8_HTML.jpg

相似文献

1
scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data.scAMZI:用于scRNA序列数据聚类的带零膨胀层的基于注意力的深度自动编码器。
BMC Genomics. 2025 Apr 7;26(1):350. doi: 10.1186/s12864-025-11511-2.
2
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.基于自动编码器和图神经网络的单细胞 RNA-seq 数据深度结构聚类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac018.
3
scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering.scZAG:基于 ZINB 的自动编码器与自适应数据增强图对比学习在 scRNA-seq 聚类中的整合。
Int J Mol Sci. 2024 May 29;25(11):5976. doi: 10.3390/ijms25115976.
4
scLEGA: an attention-based deep clustering method with a tendency for low expression of genes on single-cell RNA-seq data.scLEGA:一种基于注意力的深度聚类方法,在单细胞 RNA-seq 数据中倾向于低表达基因。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae371.
5
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
6
scTPC: a novel semisupervised deep clustering model for scRNA-seq data.scTPC:一种用于 scRNA-seq 数据的新型半监督深度聚类模型。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae293.
7
Attention-based deep clustering method for scRNA-seq cell type identification.基于注意力机制的深度聚类方法在 scRNA-seq 细胞类型鉴定中的应用。
PLoS Comput Biol. 2023 Nov 10;19(11):e1011641. doi: 10.1371/journal.pcbi.1011641. eCollection 2023 Nov.
8
scGAAC: A graph attention autoencoder for clustering single-cell RNA-sequencing data.scGAAC:一种用于单细胞RNA测序数据聚类的图注意力自动编码器。
Methods. 2024 Sep;229:115-124. doi: 10.1016/j.ymeth.2024.06.010. Epub 2024 Jun 29.
9
scRGCL: a cell type annotation method for single-cell RNA-seq data using residual graph convolutional neural network with contrastive learning.scRGCL:一种使用带有对比学习的残差图卷积神经网络对单细胞RNA测序数据进行细胞类型注释的方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae662.
10
scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning.scSFCL:基于子空间特征置信度学习的单细胞RNA测序数据深度聚类
Comput Biol Chem. 2025 Feb;114:108292. doi: 10.1016/j.compbiolchem.2024.108292. Epub 2024 Nov 22.

引用本文的文献

1
Random forest-driven mortality prediction in critical IBD care: a dual-database model integrating comorbidity patterns and real-time physiometrics.随机森林驱动的重症炎症性肠病护理中的死亡率预测:一种整合共病模式和实时生理指标的双数据库模型
Front Med (Lausanne). 2025 Aug 8;12:1624899. doi: 10.3389/fmed.2025.1624899. eCollection 2025.
2
Interpretable graph Kolmogorov-Arnold networks for multi-cancer classification and biomarker identification using multi-omics data.用于多癌分类和使用多组学数据进行生物标志物识别的可解释图柯尔莫哥洛夫-阿诺德网络
Sci Rep. 2025 Jul 29;15(1):27607. doi: 10.1038/s41598-025-13337-0.

本文引用的文献

1
scMGATGRN: a multiview graph attention network-based method for inferring gene regulatory networks from single-cell transcriptomic data.scMGATGRN:一种基于多视图图注意力网络的方法,用于从单细胞转录组数据推断基因调控网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae526.
2
Transcriptomic diversity of cell types across the adult human brain.成人脑中细胞类型的转录组多样性。
Science. 2023 Oct 13;382(6667):eadd7046. doi: 10.1126/science.add7046.
3
An optimized graph-based structure for single-cell RNA-seq cell-type classification based on non-linear dimension reduction.
基于非线性降维的单细胞 RNA-seq 细胞类型分类的优化图结构。
BMC Genomics. 2023 May 2;24(1):227. doi: 10.1186/s12864-023-09344-y.
4
Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder.基于改进深度变分自动编码器的单细胞 RNA-seq 数据降维和可视化。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad152.
5
High-Resolution Transcriptomic Landscape of the Human Submandibular Gland.人类下颌下腺的高分辨率转录组图谱。
J Dent Res. 2023 May;102(5):525-535. doi: 10.1177/00220345221147908. Epub 2023 Feb 1.
6
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
7
Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy.人类扩张型和肥厚型心肌病的单细胞分析。
Nature. 2022 Aug;608(7921):174-180. doi: 10.1038/s41586-022-04817-8. Epub 2022 Jun 22.
8
High-resolution Slide-seqV2 spatial transcriptomics enables discovery of disease-specific cell neighborhoods and pathways.高分辨率Slide-seqV2空间转录组学有助于发现疾病特异性细胞邻域和通路。
iScience. 2022 Mar 16;25(4):104097. doi: 10.1016/j.isci.2022.104097. eCollection 2022 Apr 15.
9
Benchmarking clustering algorithms on estimating the number of cell types from single-cell RNA-sequencing data.基于单细胞 RNA 测序数据评估细胞类型数量的聚类算法基准测试。
Genome Biol. 2022 Feb 8;23(1):49. doi: 10.1186/s13059-022-02622-0.
10
A topology-preserving dimensionality reduction method for single-cell RNA-seq data using graph autoencoder.基于图自动编码器的单细胞 RNA-seq 数据拓扑保持降维方法。
Sci Rep. 2021 Oct 8;11(1):20028. doi: 10.1038/s41598-021-99003-7.