• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度生成模型的经验 Bayes 方法在单细胞差异表达分析中的应用。

An empirical Bayes method for differential expression analysis of single cells with deep generative models.

机构信息

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 74720.

Department of Statistics, University of Michigan, Ann Arbor, MI 48109.

出版信息

Proc Natl Acad Sci U S A. 2023 May 23;120(21):e2209124120. doi: 10.1073/pnas.2209124120. Epub 2023 May 16.

DOI:10.1073/pnas.2209124120
PMID:37192164
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10214125/
Abstract

Detecting differentially expressed genes is important for characterizing subpopulations of cells. In scRNA-seq data, however, nuisance variation due to technical factors like sequencing depth and RNA capture efficiency obscures the underlying biological signal. Deep generative models have been extensively applied to scRNA-seq data, with a special focus on embedding cells into a low-dimensional latent space and correcting for batch effects. However, little attention has been paid to the problem of utilizing the uncertainty from the deep generative model for differential expression (DE). Furthermore, the existing approaches do not allow for controlling for effect size or the false discovery rate (FDR). Here, we present lvm-DE, a generic Bayesian approach for performing DE predictions from a fitted deep generative model, while controlling the FDR. We apply the lvm-DE framework to scVI and scSphere, two deep generative models. The resulting approaches outperform state-of-the-art methods at estimating the log fold change in gene expression levels as well as detecting differentially expressed genes between subpopulations of cells.

摘要

检测差异表达基因对于描述细胞亚群特征非常重要。然而,在 scRNA-seq 数据中,由于测序深度和 RNA 捕获效率等技术因素引起的混杂变异掩盖了潜在的生物学信号。深度生成模型已经被广泛应用于 scRNA-seq 数据,特别关注将细胞嵌入低维潜在空间,并校正批次效应。然而,对于利用深度生成模型的不确定性进行差异表达(DE)分析的问题,关注较少。此外,现有的方法不允许控制效应大小或假发现率(FDR)。在这里,我们提出了 lvm-DE,这是一种从拟合的深度生成模型中进行 DE 预测的通用贝叶斯方法,同时控制 FDR。我们将 lvm-DE 框架应用于 scVI 和 scSphere 这两种深度生成模型。结果表明,与最先进的方法相比,这些方法在估计基因表达水平的对数倍数变化以及检测细胞亚群之间的差异表达基因方面表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/b4925edd3c88/pnas.2209124120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/92b66d798bdd/pnas.2209124120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/baa32b849329/pnas.2209124120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/5ee371eaebab/pnas.2209124120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/77dc5c97725a/pnas.2209124120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/b4925edd3c88/pnas.2209124120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/92b66d798bdd/pnas.2209124120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/baa32b849329/pnas.2209124120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/5ee371eaebab/pnas.2209124120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/77dc5c97725a/pnas.2209124120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6e/10214125/b4925edd3c88/pnas.2209124120fig05.jpg

相似文献

1
An empirical Bayes method for differential expression analysis of single cells with deep generative models.基于深度生成模型的经验 Bayes 方法在单细胞差异表达分析中的应用。
Proc Natl Acad Sci U S A. 2023 May 23;120(21):e2209124120. doi: 10.1073/pnas.2209124120. Epub 2023 May 16.
2
Deep generative model embedding of single-cell RNA-Seq profiles on hyperspheres and hyperbolic spaces.基于超球和双曲空间的单细胞 RNA-Seq 图谱的深度生成模型嵌入。
Nat Commun. 2021 May 5;12(1):2554. doi: 10.1038/s41467-021-22851-4.
3
scGMAAE: Gaussian mixture adversarial autoencoders for diversification analysis of scRNA-seq data.scGMAAE:用于单细胞RNA测序数据多样化分析的高斯混合对抗自编码器
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac585.
4
Learning deep features and topological structure of cells for clustering of scRNA-sequencing data.学习 scRNA-seq 数据聚类的细胞深度特征和拓扑结构。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac068.
5
A joint deep learning model enables simultaneous batch effect correction, denoising, and clustering in single-cell transcriptomics.联合深度学习模型可实现单细胞转录组学中批量效应校正、去噪和聚类的同时进行。
Genome Res. 2021 Oct;31(10):1753-1766. doi: 10.1101/gr.271874.120. Epub 2021 May 25.
6
Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data.单细胞RNA测序数据差异表达分析中的技术噪声处理
Nucleic Acids Res. 2017 Nov 2;45(19):10978-10988. doi: 10.1093/nar/gkx754.
7
Bayesian-frequentist hybrid inference framework for single cell RNA-seq analyses.贝叶斯-频率派混合推理框架用于单细胞 RNA-seq 分析。
Hum Genomics. 2024 Jun 20;18(1):69. doi: 10.1186/s40246-024-00638-0.
8
scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network.scGGAN:基于图的生成对抗网络的单细胞RNA测序数据插补
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad040.
9
Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.使用具有相关随机效应的障碍模型检测离散单细胞RNA测序数据中的差异表达基因。
Biometrics. 2019 Dec;75(4):1051-1062. doi: 10.1111/biom.13074. Epub 2019 Jun 17.
10
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.

引用本文的文献

1
Scvi-hub: an actionable repository for model-driven single-cell analysis.Scvi-hub:一个用于模型驱动单细胞分析的可操作资源库。
Nat Methods. 2025 Sep 8. doi: 10.1038/s41592-025-02799-9.
2
Cross-species imputation and comparison of single-cell transcriptomic profiles.跨物种单细胞转录组图谱的推算与比较
Genome Biol. 2025 Feb 26;26(1):40. doi: 10.1186/s13059-025-03493-x.
3
Reduced circulating sphingolipids and activity are linked to T2D risk and impaired insulin secretion.循环中鞘脂水平降低及活性降低与2型糖尿病风险和胰岛素分泌受损有关。

本文引用的文献

1
A COMPOSITIONAL MODEL TO ASSESS EXPRESSION CHANGES FROM SINGLE-CELL RNA-SEQ DATA.一种用于评估单细胞RNA测序数据中表达变化的组成模型。
Ann Appl Stat. 2021 Jun;15(2):880-901. doi: 10.1214/20-aoas1423. Epub 2021 Jul 12.
2
A Python library for probabilistic analysis of single-cell omics data.一个用于单细胞组学数据概率分析的Python库。
Nat Biotechnol. 2022 Feb;40(2):163-166. doi: 10.1038/s41587-021-01206-w.
3
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Sci Adv. 2025 Jan 10;11(2):eadr1725. doi: 10.1126/sciadv.adr1725.
4
Deep profiling deconstructs features associated with memory CD8 T cell tissue residence.深度分析解构了与记忆性CD8 T细胞组织驻留相关的特征。
Immunity. 2025 Jan 14;58(1):162-181.e10. doi: 10.1016/j.immuni.2024.11.007. Epub 2024 Dec 20.
5
Considerations for building and using integrated single-cell atlases.构建和使用整合单细胞图谱的注意事项。
Nat Methods. 2025 Jan;22(1):41-57. doi: 10.1038/s41592-024-02532-y. Epub 2024 Dec 13.
6
VI-VS: calibrated identification of feature dependencies in single-cell multiomics.VI-VS:单细胞多组学中特征依赖关系的校准识别。
Genome Biol. 2024 Nov 15;25(1):294. doi: 10.1186/s13059-024-03419-z.
7
Transcriptional Patterns in Stages of Alzheimer's Disease Are Cell-Type-Specific and Partially Converge with the Effects of Alcohol Use Disorder in Humans.阿尔茨海默病各阶段的转录模式具有细胞类型特异性,且部分与人类酒精使用障碍的影响趋同。
eNeuro. 2024 Oct 16;11(10). doi: 10.1523/ENEURO.0118-24.2024. Print 2024 Oct.
8
Leveraging neighborhood representations of single-cell data to achieve sensitive DE testing with miloDE.利用单细胞数据的邻域表示来实现 miloDE 的灵敏差异表达检测。
Genome Biol. 2024 Jul 18;25(1):189. doi: 10.1186/s13059-024-03334-3.
9
Bayesian estimation of gene constraint from an evolutionary model with gene features.基于具有基因特征的进化模型的基因约束贝叶斯估计。
Nat Genet. 2024 Aug;56(8):1632-1643. doi: 10.1038/s41588-024-01820-9. Epub 2024 Jul 8.
10
Dependency-aware deep generative models for multitasking analysis of spatial omics data.依赖感知的深度生成模型在空间组学数据的多任务分析中的应用。
Nat Methods. 2024 Aug;21(8):1501-1513. doi: 10.1038/s41592-024-02257-y. Epub 2024 May 23.
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
4
Confronting false discoveries in single-cell differential expression.单细胞差异表达中虚假发现的应对策略。
Nat Commun. 2021 Sep 28;12(1):5692. doi: 10.1038/s41467-021-25960-2.
5
Mapping single-cell data to reference atlases by transfer learning.通过迁移学习将单细胞数据映射到参考图谱上。
Nat Biotechnol. 2022 Jan;40(1):121-130. doi: 10.1038/s41587-021-01001-7. Epub 2021 Aug 30.
6
Deep generative model embedding of single-cell RNA-Seq profiles on hyperspheres and hyperbolic spaces.基于超球和双曲空间的单细胞 RNA-Seq 图谱的深度生成模型嵌入。
Nat Commun. 2021 May 5;12(1):2554. doi: 10.1038/s41467-021-22851-4.
7
scMET: Bayesian modeling of DNA methylation heterogeneity at single-cell resolution.scMET:单细胞分辨率下 DNA 甲基化异质性的贝叶斯建模。
Genome Biol. 2021 Apr 20;22(1):114. doi: 10.1186/s13059-021-02329-8.
8
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
9
muscat detects subpopulation-specific state transitions from multi-sample multi-condition single-cell transcriptomics data.Muscat 可从多样本多条件单细胞转录组学数据中检测到亚群特异性状态转变。
Nat Commun. 2020 Nov 30;11(1):6077. doi: 10.1038/s41467-020-19894-4.
10
A curated database reveals trends in single-cell transcriptomics.一个经过精心策划的数据库揭示了单细胞转录组学的发展趋势。
Database (Oxford). 2020 Nov 28;2020. doi: 10.1093/database/baaa073.