• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习解缠的多批次单细胞对比图谱构建

Multi-batch single-cell comparative atlas construction by deep learning disentanglement.

机构信息

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA.

出版信息

Nat Commun. 2023 Jul 12;14(1):4126. doi: 10.1038/s41467-023-39494-2.

DOI:10.1038/s41467-023-39494-2
PMID:37433791
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10336139/
Abstract

Cell state atlases constructed through single-cell RNA-seq and ATAC-seq analysis are powerful tools for analyzing the effects of genetic and drug treatment-induced perturbations on complex cell systems. Comparative analysis of such atlases can yield new insights into cell state and trajectory alterations. Perturbation experiments often require that single-cell assays be carried out in multiple batches, which can introduce technical distortions that confound the comparison of biological quantities between different batches. Here we propose CODAL, a variational autoencoder-based statistical model which uses a mutual information regularization technique to explicitly disentangle factors related to technical and biological effects. We demonstrate CODAL's capacity for batch-confounded cell type discovery when applied to simulated datasets and embryonic development atlases with gene knockouts. CODAL improves the representation of RNA-seq and ATAC-seq modalities, yields interpretable modules of biological variation, and enables the generalization of other count-based generative models to multi-batched data.

摘要

通过单细胞 RNA-seq 和 ATAC-seq 分析构建的细胞状态图谱是分析遗传和药物处理诱导的扰动对复杂细胞系统影响的有力工具。对这些图谱进行比较分析可以深入了解细胞状态和轨迹的改变。扰动实验通常需要在多个批次中进行单细胞分析,这可能会引入技术扭曲,从而混淆不同批次之间生物数量的比较。在这里,我们提出了 CODAL,这是一种基于变分自动编码器的统计模型,它使用互信息正则化技术来显式分离与技术和生物效应相关的因素。当应用于模拟数据集和具有基因敲除的胚胎发育图谱时,我们证明了 CODAL 具有在批次混淆的情况下发现细胞类型的能力。CODAL 改善了 RNA-seq 和 ATAC-seq 模态的表示,产生了可解释的生物学变异模块,并使其他基于计数的生成模型能够推广到多批次数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/54409c2aa094/41467_2023_39494_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/bee3aa0ba9a0/41467_2023_39494_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/d1eeab916e6e/41467_2023_39494_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/e71b0ca7027a/41467_2023_39494_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/cb68db3c848e/41467_2023_39494_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/8b3215f3ff39/41467_2023_39494_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/54409c2aa094/41467_2023_39494_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/bee3aa0ba9a0/41467_2023_39494_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/d1eeab916e6e/41467_2023_39494_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/e71b0ca7027a/41467_2023_39494_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/cb68db3c848e/41467_2023_39494_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/8b3215f3ff39/41467_2023_39494_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a933/10336139/54409c2aa094/41467_2023_39494_Fig6_HTML.jpg

相似文献

1
Multi-batch single-cell comparative atlas construction by deep learning disentanglement.基于深度学习解缠的多批次单细胞对比图谱构建
Nat Commun. 2023 Jul 12;14(1):4126. doi: 10.1038/s41467-023-39494-2.
2
A Unified Deep Learning Framework for Single-Cell ATAC-Seq Analysis Based on ProdDep Transformer Encoder.基于 ProdDep 转换器编码器的单细胞 ATAC-Seq 分析的统一深度学习框架。
Int J Mol Sci. 2023 Mar 1;24(5):4784. doi: 10.3390/ijms24054784.
3
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.
4
scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier.scDREAMER:基于深度生成模型与对抗分类器的单细胞数据集图谱级整合方法。
Nat Commun. 2023 Nov 27;14(1):7781. doi: 10.1038/s41467-023-43590-8.
5
scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model.scVAEBGM:使用深度生成模型对单细胞ATAC序列数据进行聚类分析。
Interdiscip Sci. 2022 Dec;14(4):917-928. doi: 10.1007/s12539-022-00536-w. Epub 2022 Aug 8.
6
Translator: A fer earning Approach to Facilitate Single-Cell AC-Seq Data Analysis frm eference Dataset.翻译:一种基于收益的方法,可从参考数据集促进单细胞 AC-Seq 数据分析。
J Comput Biol. 2022 Jul;29(7):619-633. doi: 10.1089/cmb.2021.0596. Epub 2022 May 17.
7
Integrating single-cell RNA-seq datasets with substantial batch effects.整合具有显著批次效应的单细胞RNA测序数据集。
bioRxiv. 2024 Feb 10:2023.11.03.565463. doi: 10.1101/2023.11.03.565463.
8
SAILER: scalable and accurate invariant representation learning for single-cell ATAC-seq processing and integration.SAILER:用于单细胞 ATAC-seq 处理和整合的可扩展且准确的不变表示学习。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i317-i326. doi: 10.1093/bioinformatics/btab303.
9
deepMNN: Deep Learning-Based Single-Cell RNA Sequencing Data Batch Correction Using Mutual Nearest Neighbors.deepMNN:基于深度学习的使用相互最近邻算法的单细胞RNA测序数据批次校正
Front Genet. 2021 Aug 10;12:708981. doi: 10.3389/fgene.2021.708981. eCollection 2021.
10
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization.ScInfoVAE:使用变分自编码器和扩展互信息正则化对单细胞转录数据进行可解释的降维。
BioData Min. 2023 Jun 10;16(1):17. doi: 10.1186/s13040-023-00333-1.

引用本文的文献

1
Leveraging multiple labeled datasets for the automated annotation of single-cell RNA and ATAC data.利用多个标记数据集对单细胞RNA和ATAC数据进行自动注释。
Comput Struct Biotechnol J. 2025 Jul 1;27:2863-2870. doi: 10.1016/j.csbj.2025.06.043. eCollection 2025.
2
BIOTIC: a Bayesian framework to integrate single-cell multi-omics for transcription factor activity inference and improve identity characterization of cells.BIOTIC:一种用于整合单细胞多组学数据以进行转录因子活性推断并改善细胞身份特征描述的贝叶斯框架。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf013.
3
Biologically relevant integration of transcriptomics profiles from cancer cell lines, patient-derived xenografts, and clinical tumors using deep learning.

本文引用的文献

1
MIRA: joint regulatory modeling of multimodal expression and chromatin accessibility in single cells.MIRA:单细胞中多模态表达和染色质可及性的联合调控建模。
Nat Methods. 2022 Sep;19(9):1097-1108. doi: 10.1038/s41592-022-01595-z. Epub 2022 Sep 6.
2
PeakVI: A deep generative model for single-cell chromatin accessibility analysis.PeakVI:一种用于单细胞染色质可及性分析的深度生成模型。
Cell Rep Methods. 2022 Mar 15;2(3):100182. doi: 10.1016/j.crmeth.2022.100182. eCollection 2022 Mar 28.
3
Benchmarking atlas-level data integration in single-cell genomics.
利用深度学习对癌细胞系、患者来源的异种移植瘤和临床肿瘤的转录组学图谱进行生物学相关整合。
Sci Adv. 2025 Jan 17;11(3):eadn5596. doi: 10.1126/sciadv.adn5596.
4
Machine learning of brain-specific biomarkers from EEG.从脑电图中机器学习脑特异性生物标志物。
EBioMedicine. 2024 Aug;106:105259. doi: 10.1016/j.ebiom.2024.105259. Epub 2024 Aug 5.
5
Beaconet: A Reference-Free Method for Integrating Multiple Batches of Single-Cell Transcriptomic Data in Original Molecular Space.Beaconet:一种在原始分子空间中整合多个批次单细胞转录组数据的无参方法。
Adv Sci (Weinh). 2024 Jul;11(26):e2306770. doi: 10.1002/advs.202306770. Epub 2024 May 6.
单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
4
JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles.JASPAR 2022:转录因子结合谱开放获取数据库的第 9 个版本。
Nucleic Acids Res. 2022 Jan 7;50(D1):D165-D173. doi: 10.1093/nar/gkab1113.
5
Chromatin and gene-regulatory dynamics of the developing human cerebral cortex at single-cell resolution.单细胞分辨率解析人类大脑皮层发育过程中的染色质和基因调控动态。
Cell. 2021 Sep 16;184(19):5053-5069.e23. doi: 10.1016/j.cell.2021.07.039. Epub 2021 Aug 13.
6
Molecular logic of cellular diversification in the mouse cerebral cortex.小鼠大脑皮层细胞多样化的分子逻辑。
Nature. 2021 Jul;595(7868):554-559. doi: 10.1038/s41586-021-03670-5. Epub 2021 Jun 23.
7
BUTTERFLY: addressing the pooled amplification paradox with unique molecular identifiers in single-cell RNA-seq.蝴蝶:单细胞 RNA-seq 中利用独特分子标识符解决汇集扩增悖论。
Genome Biol. 2021 Jun 8;22(1):174. doi: 10.1186/s13059-021-02386-z.
8
Single-cell brain atlas of Parkinson's disease mouse model.帕金森病小鼠模型的单细胞大脑图谱。
J Genet Genomics. 2021 Apr 20;48(4):277-288. doi: 10.1016/j.jgg.2021.01.003. Epub 2021 Feb 18.
9
Deconvolution of cell type-specific drug responses in human tumor tissue with single-cell RNA-seq.单细胞 RNA 测序解析人类肿瘤组织中细胞类型特异性药物反应
Genome Med. 2021 May 11;13(1):82. doi: 10.1186/s13073-021-00894-y.
10
A single-embryo, single-cell time-resolved model for mouse gastrulation.用于小鼠原肠胚形成的单细胞时分辨胚胎模型。
Cell. 2021 May 27;184(11):2825-2842.e22. doi: 10.1016/j.cell.2021.04.004. Epub 2021 Apr 30.