• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scAIDE:大规模单细胞RNA测序数据的聚类揭示了假定的和罕见的细胞类型。

scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types.

作者信息

Xie Kaikun, Huang Yu, Zeng Feng, Liu Zehua, Chen Ting

机构信息

Institute for Artificial Intelligence, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China.

Department of Automation, Xiamen University, Xiamen 361005, China.

出版信息

NAR Genom Bioinform. 2020 Oct 9;2(4):lqaa082. doi: 10.1093/nargab/lqaa082. eCollection 2020 Dec.

DOI:10.1093/nargab/lqaa082
PMID:33575628
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7671411/
Abstract

Recent advancements in both single-cell RNA-sequencing technology and computational resources facilitate the study of cell types on global populations. Up to millions of cells can now be sequenced in one experiment; thus, accurate and efficient computational methods are needed to provide clustering and post-analysis of assigning putative and rare cell types. Here, we present a novel unsupervised deep learning clustering framework that is robust and highly scalable. To overcome the high level of noise, scAIDE first incorporates an autoencoder-imputation network with a distance-preserved embedding network (AIDE) to learn a good representation of data, and then applies a random projection hashing based -means algorithm to accommodate the detection of rare cell types. We analyzed a 1.3 million neural cell dataset within 30 min, obtaining 64 clusters which were mapped to 19 putative cell types. In particular, we further identified three different neural stem cell developmental trajectories in these clusters. We also classified two subpopulations of malignant cells in a small glioblastoma dataset using scAIDE. We anticipate that scAIDE would provide a more in-depth understanding of cell development and diseases.

摘要

单细胞RNA测序技术和计算资源的最新进展促进了对全球细胞群体中细胞类型的研究。现在,在一个实验中可以对数百万个细胞进行测序;因此,需要准确而高效的计算方法来对假定的和罕见的细胞类型进行聚类和后期分析。在此,我们提出了一种新颖的无监督深度学习聚类框架,该框架稳健且具有高度可扩展性。为了克服高水平的噪声,scAIDE首先将一个自动编码器插补网络与一个距离保持嵌入网络(AIDE)相结合,以学习数据的良好表示,然后应用基于随机投影哈希的均值算法来适应对罕见细胞类型的检测。我们在30分钟内分析了一个包含130万个神经细胞的数据集,获得了64个聚类,这些聚类被映射到19种假定的细胞类型。特别地,我们在这些聚类中进一步确定了三种不同的神经干细胞发育轨迹。我们还使用scAIDE对一个小胶质母细胞瘤数据集中的两个恶性细胞亚群进行了分类。我们预计scAIDE将为细胞发育和疾病提供更深入的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/4b8a0cc5dd4f/lqaa082fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/81f6757f1184/lqaa082fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/9585081966ac/lqaa082fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/8c576d72663b/lqaa082fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/6c9ffea1aa68/lqaa082fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/4b8a0cc5dd4f/lqaa082fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/81f6757f1184/lqaa082fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/9585081966ac/lqaa082fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/8c576d72663b/lqaa082fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/6c9ffea1aa68/lqaa082fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/7671411/4b8a0cc5dd4f/lqaa082fig5.jpg

相似文献

1
scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types.scAIDE:大规模单细胞RNA测序数据的聚类揭示了假定的和罕见的细胞类型。
NAR Genom Bioinform. 2020 Oct 9;2(4):lqaa082. doi: 10.1093/nargab/lqaa082. eCollection 2020 Dec.
2
Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis.基于自动编码器的单细胞 RNA-seq 数据分析聚类集成。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 19):660. doi: 10.1186/s12859-019-3179-5.
3
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.基于自动编码器和图神经网络的单细胞 RNA-seq 数据深度结构聚类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac018.
4
ScCAEs: deep clustering of single-cell RNA-seq via convolutional autoencoder embedding and soft K-means.ScCAEs:基于卷积自动编码器嵌入和软 K-means 的单细胞 RNA-seq 深度聚类。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab321.
5
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
6
Contrastive self-supervised clustering of scRNA-seq data.单细胞 RNA 测序数据的对比自监督聚类。
BMC Bioinformatics. 2021 May 27;22(1):280. doi: 10.1186/s12859-021-04210-8.
7
NISC: Neural Network-Imputation for Single-Cell RNA Sequencing and Cell Type Clustering.NISC:用于单细胞RNA测序和细胞类型聚类的神经网络插补法
Front Genet. 2022 May 3;13:847112. doi: 10.3389/fgene.2022.847112. eCollection 2022.
8
Self-supervised deep clustering of single-cell RNA-seq data to hierarchically detect rare cell populations.基于单细胞 RNA-seq 数据的自监督深度聚类来分层检测稀有细胞群体。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad335.
9
ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering.ScGSLC:一种用于单细胞 RNA-seq 数据聚类的无监督图相似性学习框架。
Comput Biol Chem. 2021 Feb;90:107415. doi: 10.1016/j.compbiolchem.2020.107415. Epub 2020 Nov 18.
10
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.

引用本文的文献

1
Comparative benchmarking of single-cell clustering algorithms for transcriptomic and proteomic data.用于转录组学和蛋白质组学数据的单细胞聚类算法的比较基准测试
Genome Biol. 2025 Sep 3;26(1):265. doi: 10.1186/s13059-025-03719-y.
2
Integration of multi-modal measurements identifies critical mechanisms of tuberculosis drug action.多模态测量的整合确定了结核病药物作用的关键机制。
Cell Syst. 2025 Aug 20;16(8):101348. doi: 10.1016/j.cels.2025.101348. Epub 2025 Jul 29.
3
Robust self-supervised machine learning for single cell embeddings and annotations.

本文引用的文献

1
redPATH: Reconstructing the Pseudo Development Time of Cell Lineages in Single-cell RNA-seq Data and Applications in Cancer.redPATH:单细胞 RNA-seq 数据中细胞谱系伪发育时间的重构及其在癌症中的应用。
Genomics Proteomics Bioinformatics. 2021 Apr;19(2):292-305. doi: 10.1016/j.gpb.2020.06.014. Epub 2021 Feb 17.
2
GiniClust3: a fast and memory-efficient tool for rare cell type identification.GiniClust3:一种快速且节省内存的稀有细胞类型识别工具。
BMC Bioinformatics. 2020 Apr 25;21(1):158. doi: 10.1186/s12859-020-3482-1.
3
Benchmark and Parameter Sensitivity Analysis of Single-Cell RNA Sequencing Clustering Methods.
用于单细胞嵌入和注释的强大自监督机器学习
bioRxiv. 2025 Jun 8:2025.06.05.658097. doi: 10.1101/2025.06.05.658097.
4
Decoupled GNNs based on multi-view contrastive learning for scRNA-seq data clustering.基于多视图对比学习的解耦图神经网络用于单细胞RNA测序数据聚类
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf198.
5
Multi-level multi-view network based on structural contrastive learning for scRNA-seq data clustering.基于结构对比学习的多层次多视图网络用于 scRNA-seq 数据聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae562.
6
ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets.ESCHR:一种针对不同数据集的稳健聚类的超参数随机集成方法。
Genome Biol. 2024 Sep 16;25(1):242. doi: 10.1186/s13059-024-03386-5.
7
scCAD: Cluster decomposition-based anomaly detection for rare cell identification in single-cell expression data.scCAD:基于聚类分解的单细胞表达数据中稀有细胞异常检测方法。
Nat Commun. 2024 Aug 31;15(1):7561. doi: 10.1038/s41467-024-51891-9.
8
scCross: efficient search for rare subpopulations across multiple single-cell samples.scCross:在多个单细胞样本中高效搜索稀有亚群。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae371.
9
Exploring high-quality microbial genomes by assembling short-reads with long-range connectivity.通过组装具有长程连接性的短读长来探索高质量的微生物基因组。
Nat Commun. 2024 May 31;15(1):4631. doi: 10.1038/s41467-024-49060-z.
10
Advancing single-cell RNA-seq data analysis through the fusion of multi-layer perceptron and graph neural network.通过多层感知机和图神经网络的融合来推进单细胞 RNA-seq 数据分析。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad481.
单细胞RNA测序聚类方法的基准测试与参数敏感性分析
Front Genet. 2019 Dec 11;10:1253. doi: 10.3389/fgene.2019.01253. eCollection 2019.
4
The art of using t-SNE for single-cell transcriptomics.使用 t-SNE 进行单细胞转录组学分析的艺术。
Nat Commun. 2019 Nov 28;10(1):5416. doi: 10.1038/s41467-019-13056-x.
5
Meningeal γδ T cell-derived IL-17 controls synaptic plasticity and short-term memory.脑膜 γδ T 细胞衍生的白细胞介素-17 控制着突触可塑性和短期记忆。
Sci Immunol. 2019 Oct 11;4(40). doi: 10.1126/sciimmunol.aay5199.
6
Supervised classification enables rapid annotation of cell atlases.监督分类可实现细胞图谱的快速标注。
Nat Methods. 2019 Oct;16(10):983-986. doi: 10.1038/s41592-019-0535-3. Epub 2019 Sep 9.
7
A comparison of automatic cell identification methods for single-cell RNA sequencing data.单细胞 RNA 测序数据的自动细胞识别方法比较。
Genome Biol. 2019 Sep 9;20(1):194. doi: 10.1186/s13059-019-1795-z.
8
PanoView: An iterative clustering method for single-cell RNA sequencing data.PanoView:一种用于单细胞 RNA 测序数据的迭代聚类方法。
PLoS Comput Biol. 2019 Aug 30;15(8):e1007040. doi: 10.1371/journal.pcbi.1007040. eCollection 2019 Aug.
9
CellSIUS provides sensitive and specific detection of rare cell populations from complex single-cell RNA-seq data.CellSIUS 能够从复杂的单细胞 RNA-seq 数据中灵敏且特异地检测稀有细胞群体。
Genome Biol. 2019 Jul 17;20(1):142. doi: 10.1186/s13059-019-1739-7.
10
Comprehensive Integration of Single-Cell Data.单细胞数据的综合整合。
Cell. 2019 Jun 13;177(7):1888-1902.e21. doi: 10.1016/j.cell.2019.05.031. Epub 2019 Jun 6.