• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scDisInFact:用于多批次多条件单细胞 RNA-seq 数据的集成和预测的解缠学习。

scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.

机构信息

School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, USA.

School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA.

出版信息

Nat Commun. 2024 Jan 30;15(1):912. doi: 10.1038/s41467-024-45227-w.

DOI:10.1038/s41467-024-45227-w
PMID:38291052
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10827746/
Abstract

Single-cell RNA-sequencing (scRNA-seq) has been widely used for disease studies, where sample batches are collected from donors under different conditions including demographic groups, disease stages, and drug treatments. It is worth noting that the differences among sample batches in such a study are a mixture of technical confounders caused by batch effect and biological variations caused by condition effect. However, current batch effect removal methods often eliminate both technical batch effect and meaningful condition effect, while perturbation prediction methods solely focus on condition effect, resulting in inaccurate gene expression predictions due to unaccounted batch effect. Here we introduce scDisInFact, a deep learning framework that models both batch effect and condition effect in scRNA-seq data. scDisInFact learns latent factors that disentangle condition effect from batch effect, enabling it to simultaneously perform three tasks: batch effect removal, condition-associated key gene detection, and perturbation prediction. We evaluate scDisInFact on both simulated and real datasets, and compare its performance with baseline methods for each task. Our results demonstrate that scDisInFact outperforms existing methods that focus on individual tasks, providing a more comprehensive and accurate approach for integrating and predicting multi-batch multi-condition single-cell RNA-sequencing data.

摘要

单细胞 RNA 测序 (scRNA-seq) 已广泛应用于疾病研究,其中样本批次是从不同条件下的供体中收集的,包括人口统计学群体、疾病阶段和药物治疗。值得注意的是,在这种研究中,样本批次之间的差异是由批次效应引起的技术混杂因素和由条件效应引起的生物学变异的混合物。然而,目前的批次效应去除方法通常会同时消除技术批次效应和有意义的条件效应,而扰动预测方法则仅关注条件效应,由于未考虑批次效应,导致基因表达预测不准确。在这里,我们介绍了 scDisInFact,这是一个用于 scRNA-seq 数据中批次效应和条件效应建模的深度学习框架。scDisInFact 学习从批次效应中分离条件效应的潜在因素,使其能够同时执行三个任务:批次效应去除、与条件相关的关键基因检测和扰动预测。我们在模拟和真实数据集上评估了 scDisInFact,并将其性能与每个任务的基线方法进行了比较。我们的结果表明,scDisInFact 优于专注于单个任务的现有方法,为整合和预测多批次多条件单细胞 RNA-seq 数据提供了更全面、更准确的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/d99ad442b8b3/41467_2024_45227_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/65e0825757b7/41467_2024_45227_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/61528342d8f9/41467_2024_45227_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/8a235741e2ef/41467_2024_45227_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/fefa647734ac/41467_2024_45227_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/1cdf3a17a54e/41467_2024_45227_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/d99ad442b8b3/41467_2024_45227_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/65e0825757b7/41467_2024_45227_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/61528342d8f9/41467_2024_45227_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/8a235741e2ef/41467_2024_45227_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/fefa647734ac/41467_2024_45227_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/1cdf3a17a54e/41467_2024_45227_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c277/10827746/d99ad442b8b3/41467_2024_45227_Fig6_HTML.jpg

相似文献

1
scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.scDisInFact:用于多批次多条件单细胞 RNA-seq 数据的集成和预测的解缠学习。
Nat Commun. 2024 Jan 30;15(1):912. doi: 10.1038/s41467-024-45227-w.
2
scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.scDisInFact:用于多批次多条件单细胞RNA测序数据整合与预测的解纠缠学习
bioRxiv. 2023 May 2:2023.05.01.538975. doi: 10.1101/2023.05.01.538975.
3
BERMUDA: a novel deep transfer learning method for single-cell RNA sequencing batch correction reveals hidden high-resolution cellular subtypes.百慕大:一种新型的单细胞 RNA 测序批次校正深度迁移学习方法揭示了隐藏的高分辨率细胞亚型。
Genome Biol. 2019 Aug 12;20(1):165. doi: 10.1186/s13059-019-1764-6.
4
IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks.IMGG:通过连接图和生成对抗网络整合多个单细胞数据集。
Int J Mol Sci. 2022 Feb 14;23(4):2082. doi: 10.3390/ijms23042082.
5
Integration of scRNA-seq data by disentangled representation learning with condition domain adaptation.基于条件域自适应的解缠表示学习整合 scRNA-seq 数据。
BMC Bioinformatics. 2024 Mar 16;25(1):116. doi: 10.1186/s12859-024-05706-9.
6
BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework.BERMAD:基于双通道框架的多层自适应自动编码器去除单细胞 RNA-seq 数据中的批次效应
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae127.
7
CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data.CellMixS:量化和可视化单细胞 RNA-seq 数据中的批次效应。
Life Sci Alliance. 2021 Mar 23;4(6). doi: 10.26508/lsa.202001004. Print 2021 Jun.
8
Latent cellular analysis robustly reveals subtle diversity in large-scale single-cell RNA-seq data.潜伏细胞分析能稳健地揭示大规模单细胞 RNA-seq 数据中的细微多样性。
Nucleic Acids Res. 2019 Dec 16;47(22):e143. doi: 10.1093/nar/gkz826.
9
HDMC: a novel deep learning-based framework for removing batch effects in single-cell RNA-seq data.HDMC:一种用于去除单细胞 RNA-seq 数据中批次效应的新型深度学习框架。
Bioinformatics. 2022 Feb 7;38(5):1295-1303. doi: 10.1093/bioinformatics/btab821.
10
ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.ResPAN:通过残差对抗网络对 scRNA-seq 数据进行强大的批量校正模型。
Bioinformatics. 2022 Aug 10;38(16):3942-3949. doi: 10.1093/bioinformatics/btac427.

引用本文的文献

1
Harmonizing heterogeneous single-cell gene expression data with individual-level covariate information.将异质单细胞基因表达数据与个体水平协变量信息进行整合。
Bioinform Adv. 2025 Aug 9;5(1):vbaf189. doi: 10.1093/bioadv/vbaf189. eCollection 2025.
2
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.多组学数据整合方法的技术综述:从经典统计方法到深度生成方法
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.
3
RGCN-BA: relational graph convolutional network with batch awareness for single-cell RNA sequencing clustering.

本文引用的文献

1
Predicting cellular responses to complex perturbations in high-throughput screens.高通量筛选中预测细胞对复杂扰动的反应。
Mol Syst Biol. 2023 Jun 12;19(6):e11517. doi: 10.15252/msb.202211517. Epub 2023 May 8.
2
Self-supervised contrastive learning for integrative single cell RNA-seq data analysis.基于自监督对比学习的整合单细胞 RNA-seq 数据分析。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac377.
3
scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously.
RGCN-BA:用于单细胞RNA测序聚类的具有批次感知的关系图卷积网络
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf378.
4
Causal disentanglement for single-cell representations and controllable counterfactual generation.用于单细胞表示和可控反事实生成的因果解缠
Nat Commun. 2025 Jul 23;16(1):6775. doi: 10.1038/s41467-025-62008-1.
5
Small data methods in omics: the power of one.组学中的小数据方法:以一当十。
Nat Methods. 2024 Sep;21(9):1597-1602. doi: 10.1038/s41592-024-02390-8. Epub 2024 Aug 22.
6
Integration of scRNA-seq data by disentangled representation learning with condition domain adaptation.基于条件域自适应的解缠表示学习整合 scRNA-seq 数据。
BMC Bioinformatics. 2024 Mar 16;25(1):116. doi: 10.1186/s12859-024-05706-9.
scDART:同时整合未配对的 scRNA-seq 和 scATAC-seq 数据并学习跨模态关系。
Genome Biol. 2022 Jun 27;23(1):139. doi: 10.1186/s13059-022-02706-x.
4
scPreGAN, a deep generative model for predicting the response of single-cell expression to perturbation.scPreGAN,一种用于预测单细胞表达对扰动响应的深度生成模型。
Bioinformatics. 2022 Jun 27;38(13):3377-3384. doi: 10.1093/bioinformatics/btac357.
5
scINSIGHT for interpreting single-cell gene expression from biologically heterogeneous data.scINSIGHT 用于解释生物异质数据中的单细胞基因表达。
Genome Biol. 2022 Mar 21;23(1):82. doi: 10.1186/s13059-022-02649-3.
6
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
7
Deconvolution of cell type-specific drug responses in human tumor tissue with single-cell RNA-seq.单细胞 RNA 测序解析人类肿瘤组织中细胞类型特异性药物反应
Genome Med. 2021 May 11;13(1):82. doi: 10.1186/s13073-021-00894-y.
8
COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas.大规模单细胞转录组图谱揭示的 COVID-19 免疫特征。
Cell. 2021 Apr 1;184(7):1895-1913.e19. doi: 10.1016/j.cell.2021.01.053. Epub 2021 Feb 3.
9
scMC learns biological variation through the alignment of multiple single-cell genomics datasets.scMC 通过多个单细胞基因组学数据集的比对来学习生物学变异。
Genome Biol. 2021 Jan 4;22(1):10. doi: 10.1186/s13059-020-02238-2.
10
Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment.严重的 COVID-19 以髓系细胞失调为特征。
Cell. 2020 Sep 17;182(6):1419-1440.e23. doi: 10.1016/j.cell.2020.08.001. Epub 2020 Aug 5.