scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier.

Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology Kanpur, Kanpur, India.

Department of Biological Sciences and Bioengineering, Indian Institute of Technology Kanpur, Kanpur, India.

Publication information

Nat Commun. 2023 Nov 27;14(1):7781. doi: 10.1038/s41467-023-43590-8.

DOI:10.1038/s41467-023-43590-8

PMID:38012145

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10682386/

Abstract

Integration of heterogeneous single-cell sequencing datasets generated across multiple tissue locations, time, and conditions is essential for a comprehensive understanding of the cellular states and expression programs underlying complex biological systems. Here, we present scDREAMER ( https://github.com/Zafar-Lab/scDREAMER ), a data-integration framework that employs deep generative models and adversarial training for both unsupervised and supervised (scDREAMER-Sup) integration of multiple batches. Using six real benchmarking datasets, we demonstrate that scDREAMER can overcome critical challenges including skewed cell type distribution among batches, nested batch-effects, large number of batches and conservation of development trajectory across batches. Our experiments also show that scDREAMER and scDREAMER-Sup outperform state-of-the-art unsupervised and supervised integration methods respectively in batch-correction and conservation of biological variation. Using a 1 million cells dataset, we demonstrate that scDREAMER is scalable and can perform atlas-level cross-species (e.g., human and mouse) integration while being faster than other deep-learning-based methods.
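The abstract names adversarial training but does not state the objective. The sketch below is a generic illustration of how an adversarial batch classifier can push an encoder toward batch-free embeddings; it is not the scDREAMER implementation. The function names, the `weight` parameter, and the example numbers are all hypothetical.

```python
import math


def softmax(logits):
    """Convert raw classifier scores into probabilities."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def batch_cross_entropy(batch_logits, true_batch):
    """Cross-entropy of a batch classifier's prediction for one cell."""
    return -math.log(softmax(batch_logits)[true_batch])


def encoder_objective(reconstruction_error, batch_logits, true_batch, weight=1.0):
    """Hypothetical encoder loss: reconstruct well and confuse the batch classifier.

    The classifier would be trained to minimize batch_cross_entropy;
    the encoder subtracts it, which is what makes the setup adversarial.
    """
    return reconstruction_error - weight * batch_cross_entropy(batch_logits, true_batch)


# Same (made-up) reconstruction error, two different embeddings of one cell:
# the classifier cannot tell the batches apart from the first embedding,
confused = encoder_objective(0.5, [0.0, 0.0], true_batch=0)
# but still predicts the batch easily from the second one.
confident = encoder_objective(0.5, [4.0, -4.0], true_batch=0)
```

Under this assumed objective the encoder prefers the first embedding (`confused < confident`), because an embedding from which the batch cannot be predicted earns a lower loss at equal reconstruction quality.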

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ed7/10682386/3a39a92caba8/41467_2023_43590_Fig1_HTML.jpg
