• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于揭示细胞异质性的单细胞染色质可及性测序数据的离散潜在嵌入

Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.

作者信息

Cui Xuejian, Chen Xiaoyang, Li Zhen, Gao Zijing, Chen Shengquan, Jiang Rui

机构信息

Ministry of Education Key Laboratory of Bioinformatics, Bioinformatics Division at the Beijing National Research Center for Information Science and Technology, Center for Synthetic and Systems Biology, Department of Automation, Tsinghua University, Beijing, China.

School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China.

出版信息

Nat Comput Sci. 2024 May;4(5):346-359. doi: 10.1038/s43588-024-00625-4. Epub 2024 May 10.

DOI:10.1038/s43588-024-00625-4
PMID:38730185
Abstract

Single-cell epigenomic data has been growing continuously at an unprecedented pace, but their characteristics such as high dimensionality and sparsity pose substantial challenges to downstream analysis. Although deep learning models-especially variational autoencoders-have been widely used to capture low-dimensional feature embeddings, the prevalent Gaussian assumption somewhat disagrees with real data, and these models tend to struggle to incorporate reference information from abundant cell atlases. Here we propose CASTLE, a deep generative model based on the vector-quantized variational autoencoder framework to extract discrete latent embeddings that interpretably characterize single-cell chromatin accessibility sequencing data. We validate the performance and robustness of CASTLE for accurate cell-type identification and reasonable visualization compared with state-of-the-art methods. We demonstrate the advantages of CASTLE for effective incorporation of existing massive reference datasets in a weakly supervised or supervised manner. We further demonstrate CASTLE's capacity for intuitively distilling cell-type-specific feature spectra that unveil cell heterogeneity and biological implications quantitatively.

摘要

单细胞表观基因组数据一直在以前所未有的速度持续增长,但其诸如高维度和稀疏性等特征给下游分析带来了巨大挑战。尽管深度学习模型——尤其是变分自编码器——已被广泛用于捕获低维特征嵌入,但普遍的高斯假设与真实数据存在一定差异,并且这些模型往往难以整合来自丰富细胞图谱的参考信息。在此,我们提出了CASTLE,这是一种基于向量量化变分自编码器框架的深度生成模型,用于提取离散的潜在嵌入,从而以可解释的方式表征单细胞染色质可及性测序数据。与现有最先进的方法相比,我们验证了CASTLE在准确的细胞类型识别和合理的可视化方面的性能和稳健性。我们展示了CASTLE以弱监督或监督方式有效整合现有大量参考数据集的优势。我们进一步展示了CASTLE直观提炼细胞类型特异性特征谱的能力,这些特征谱定量地揭示了细胞异质性和生物学意义。

相似文献

1
Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.用于揭示细胞异质性的单细胞染色质可及性测序数据的离散潜在嵌入
Nat Comput Sci. 2024 May;4(5):346-359. doi: 10.1038/s43588-024-00625-4. Epub 2024 May 10.
2
scCASE: accurate and interpretable enhancement for single-cell chromatin accessibility sequencing data.scCASE:单细胞染色质可及性测序数据的准确可解释增强
Nat Commun. 2024 Feb 22;15(1):1629. doi: 10.1038/s41467-024-46045-w.
3
Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data.单细胞转录组和开放染色质可及性数据的深度联合学习分析模型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa287.
4
A deep adversarial variational autoencoder model for dimensionality reduction in single-cell RNA sequencing analysis.一种用于单细胞 RNA 测序分析中降维的深度对抗变分自动编码器模型。
BMC Bioinformatics. 2020 Feb 21;21(1):64. doi: 10.1186/s12859-020-3401-5.
5
Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings.基于具有球形潜在嵌入的对偶变分自编码器的深度聚类分析
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6303-6312. doi: 10.1109/TNNLS.2021.3135460. Epub 2023 Sep 1.
6
Ensemble deep learning of embeddings for clustering multimodal single-cell omics data.基于嵌入的集成深度学习用于聚类多模态单细胞组学数据。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad382.
7
Interactive analysis of single-cell epigenomic landscapes with ChromSCape.使用 ChromSCape 进行单细胞表观基因组景观的交互式分析。
Nat Commun. 2020 Nov 11;11(1):5702. doi: 10.1038/s41467-020-19542-x.
8
Joint variational autoencoders for multimodal imputation and embedding.用于多模态插补和嵌入的联合变分自编码器
Nat Mach Intell. 2023 Jun;5(6):631-642. doi: 10.1038/s42256-023-00663-z. Epub 2023 May 29.
9
Attention-based deep clustering method for scRNA-seq cell type identification.基于注意力机制的深度聚类方法在 scRNA-seq 细胞类型鉴定中的应用。
PLoS Comput Biol. 2023 Nov 10;19(11):e1011641. doi: 10.1371/journal.pcbi.1011641. eCollection 2023 Nov.
10
scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model.scVAEBGM:使用深度生成模型对单细胞ATAC序列数据进行聚类分析。
Interdiscip Sci. 2022 Dec;14(4):917-928. doi: 10.1007/s12539-022-00536-w. Epub 2022 Aug 8.

引用本文的文献

1
MINGLE: a mutual information-based interpretable framework for automatic cell type annotation in single-cell chromatin accessibility data.MINGLE:一种基于互信息的可解释框架,用于单细胞染色质可及性数据中的自动细胞类型注释。
Genome Biol. 2025 Jun 11;26(1):162. doi: 10.1186/s13059-025-03603-9.
2
CREATE: cell-type-specific cis-regulatory element identification via discrete embedding.CREATE:通过离散嵌入进行细胞类型特异性顺式调控元件识别
Nat Commun. 2025 May 17;16(1):4607. doi: 10.1038/s41467-025-59780-5.
3
Graph neural networks for single-cell omics data: a review of approaches and applications.
用于单细胞组学数据的图神经网络:方法与应用综述
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf109.
4
INSTINCT: Multi-sample integration of spatial chromatin accessibility sequencing data via stochastic domain translation.INSTINCT:通过随机结构域翻译对空间染色质可及性测序数据进行多样本整合。
Nat Commun. 2025 Feb 1;16(1):1247. doi: 10.1038/s41467-025-56535-0.