• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度生成模型对单细胞转录组学数据进行概率协调和注释。

Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.

机构信息

Center for Computational Biology, University of California, Berkeley, CA, USA.

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA.

出版信息

Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.

DOI:10.15252/msb.20209620
PMID:33491336
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7829634/
Abstract

As the number of single-cell transcriptomics datasets grows, the natural next step is to integrate the accumulating data to achieve a common ontology of cell types and states. However, it is not straightforward to compare gene expression levels across datasets and to automatically assign cell type labels in a new dataset based on existing annotations. In this manuscript, we demonstrate that our previously developed method, scVI, provides an effective and fully probabilistic approach for joint representation and analysis of scRNA-seq data, while accounting for uncertainty caused by biological and measurement noise. We also introduce single-cell ANnotation using Variational Inference (scANVI), a semi-supervised variant of scVI designed to leverage existing cell state annotations. We demonstrate that scVI and scANVI compare favorably to state-of-the-art methods for data integration and cell state annotation in terms of accuracy, scalability, and adaptability to challenging settings. In contrast to existing methods, scVI and scANVI integrate multiple datasets with a single generative model that can be directly used for downstream tasks, such as differential expression. Both methods are easily accessible through scvi-tools.

摘要

随着单细胞转录组学数据集数量的增加,下一步自然是整合积累的数据,以实现细胞类型和状态的通用本体。然而,直接比较数据集之间的基因表达水平,并根据现有注释在新数据集中自动分配细胞类型标签并不容易。在本文中,我们证明了我们之前开发的方法 scVI 为联合表示和分析 scRNA-seq 数据提供了一种有效且完全概率的方法,同时考虑了由生物和测量噪声引起的不确定性。我们还介绍了使用变分推理进行单细胞注释 (scANVI),这是 scVI 的半监督变体,旨在利用现有的细胞状态注释。我们证明,scVI 和 scANVI 在数据集成和细胞状态注释方面的准确性、可扩展性以及对挑战性设置的适应性方面均优于最先进的方法。与现有方法不同,scVI 和 scANVI 可以使用单个生成模型整合多个数据集,该模型可以直接用于下游任务,如差异表达。这两种方法都可以通过 scvi-tools 轻松访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/e6e12fc61839/MSB-17-e9620-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/319945590987/MSB-17-e9620-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/64e3cd7d23c5/MSB-17-e9620-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/2aef060bea62/MSB-17-e9620-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/6bed2c10216d/MSB-17-e9620-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/072755adc123/MSB-17-e9620-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/6213dad2305b/MSB-17-e9620-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/e6e12fc61839/MSB-17-e9620-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/319945590987/MSB-17-e9620-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/64e3cd7d23c5/MSB-17-e9620-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/2aef060bea62/MSB-17-e9620-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/6bed2c10216d/MSB-17-e9620-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/072755adc123/MSB-17-e9620-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/6213dad2305b/MSB-17-e9620-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aa4/7829634/e6e12fc61839/MSB-17-e9620-g008.jpg

相似文献

1
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
2
Deep generative modeling for single-cell transcriptomics.单细胞转录组学的深度生成模型。
Nat Methods. 2018 Dec;15(12):1053-1058. doi: 10.1038/s41592-018-0229-2. Epub 2018 Nov 30.
3
scPLAN: a hierarchical computational framework for single transcriptomics data annotation, integration and cell-type label refinement.scPLAN:一种用于单细胞转录组学数据注释、整合和细胞类型标签细化的分层计算框架。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae305.
4
scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets.scMRA:一种用于用多个参考数据集注释单细胞RNA测序数据的强大深度学习方法。
Bioinformatics. 2022 Jan 12;38(3):738-745. doi: 10.1093/bioinformatics/btab700.
5
Integrating Deep Supervised, Self-Supervised and Unsupervised Learning for Single-Cell RNA-seq Clustering and Annotation.将深度监督学习、自监督学习和无监督学习相结合进行单细胞 RNA-seq 聚类和注释。
Genes (Basel). 2020 Jul 14;11(7):792. doi: 10.3390/genes11070792.
6
Semi-supervised integration of single-cell transcriptomics data.单细胞转录组学数据的半监督整合。
Nat Commun. 2024 Jan 29;15(1):872. doi: 10.1038/s41467-024-45240-z.
7
PredGCN: a Pruning-enabled Gene-Cell Net for automatic cell annotation of single cell transcriptome data.PredGCN:一种支持剪枝的基因-细胞网络,用于单细胞转录组数据的自动细胞注释。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae421.
8
scSemiGAN: a single-cell semi-supervised annotation and dimensionality reduction framework based on generative adversarial network.scSemiGAN:基于生成对抗网络的单细胞半监督注释和降维框架。
Bioinformatics. 2022 Nov 15;38(22):5042-5048. doi: 10.1093/bioinformatics/btac652.
9
Critical downstream analysis steps for single-cell RNA sequencing data.单细胞 RNA 测序数据分析的关键下游步骤。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab105.
10
A comparison of scRNA-seq annotation methods based on experimentally labeled immune cell subtype dataset.基于实验标记免疫细胞亚型数据集的 scRNA-seq 注释方法比较。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae392.

引用本文的文献

1
Integrated single-cell atlas of human atherosclerotic plaques.人类动脉粥样硬化斑块的综合单细胞图谱。
Nat Commun. 2025 Sep 10;16(1):8255. doi: 10.1038/s41467-025-63202-x.
2
A Benchmark of Semi-Supervised scRNA-seq Integration Methods in Real-World Scenarios.真实场景下半监督单细胞RNA测序整合方法的基准测试
bioRxiv. 2025 Aug 27:2025.08.23.671952. doi: 10.1101/2025.08.23.671952.
3
Microbiome data integration via shared dictionary learning.通过共享字典学习进行微生物组数据整合。

本文引用的文献

1
A guide to the BRAIN Initiative Cell Census Network data ecosystem.《脑计划细胞普查网络数据生态系统指南》
PLoS Biol. 2023 Jun 30;21(6):e3002133. doi: 10.1371/journal.pbio.3002133. eCollection 2023 Jun.
2
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
3
scVAE: variational auto-encoders for single-cell gene expression data.scVAE:用于单细胞基因表达数据的变分自动编码器。
Nat Commun. 2025 Sep 1;16(1):8147. doi: 10.1038/s41467-025-63425-y.
4
Single-cell RNA sequencing reveals different cellular states in malignant cells and the tumor microenvironment in primary and metastatic ER-positive breast cancer.单细胞RNA测序揭示了原发性和转移性雌激素受体阳性乳腺癌中恶性细胞及肿瘤微环境的不同细胞状态。
NPJ Breast Cancer. 2025 Aug 26;11(1):95. doi: 10.1038/s41523-025-00808-w.
5
GNODEVAE: a graph-based ODE-VAE enhances clustering for single-cell data.GNODEVAE:一种基于图的常微分方程变分自编码器增强了单细胞数据的聚类效果。
BMC Genomics. 2025 Aug 21;26(1):767. doi: 10.1186/s12864-025-11946-7.
6
HiCat: a semi-supervised approach for cell type annotation.HiCat:一种用于细胞类型注释的半监督方法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf428.
7
Fluctuation structure predicts genome-wide perturbation outcomes.波动结构可预测全基因组扰动结果。
Res Sq. 2025 Aug 12:rs.3.rs-7304871. doi: 10.21203/rs.3.rs-7304871/v1.
8
Lysine-specific demethylase 1 regulates hematopoietic stem cell expansion and myeloid cell differentiation.赖氨酸特异性去甲基化酶1调节造血干细胞扩增和髓细胞分化。
Cell Death Dis. 2025 Aug 15;16(1):619. doi: 10.1038/s41419-025-07951-z.
9
PCLDA: An interpretable cell annotation tool for single-cell RNA-sequencing data based on simple statistical methods.PCLDA:一种基于简单统计方法的用于单细胞RNA测序数据的可解释细胞注释工具。
Comput Struct Biotechnol J. 2025 Jul 23;27:3264-3274. doi: 10.1016/j.csbj.2025.07.019. eCollection 2025.
10
CellFuse Enables Multi-modal Integration of Single-cell and Spatial Proteomics data.CellFuse实现了单细胞和空间蛋白质组学数据的多模态整合。
bioRxiv. 2025 Jul 26:2025.07.23.665976. doi: 10.1101/2025.07.23.665976.
Bioinformatics. 2020 Aug 15;36(16):4415-4422. doi: 10.1093/bioinformatics/btaa293.
4
Lineage tracing on transcriptional landscapes links state to fate during differentiation.在分化过程中,基于转录景观的谱系追踪将状态与命运联系起来。
Science. 2020 Feb 14;367(6479). doi: 10.1126/science.aaw3381. Epub 2020 Jan 23.
5
Integrated single cell analysis of blood and cerebrospinal fluid leukocytes in multiple sclerosis.多发性硬化症患者血液和脑脊液白细胞的综合单细胞分析。
Nat Commun. 2020 Jan 14;11(1):247. doi: 10.1038/s41467-019-14118-w.
6
Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.使用正则化负二项式回归进行单细胞 RNA-seq 数据的归一化和方差稳定化。
Genome Biol. 2019 Dec 23;20(1):296. doi: 10.1186/s13059-019-1874-1.
7
Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.基于多项模型的单细胞 RNA-Seq 特征选择和降维。
Genome Biol. 2019 Dec 23;20(1):295. doi: 10.1186/s13059-019-1861-6.
8
Fast, sensitive and accurate integration of single-cell data with Harmony.利用 Harmony 实现单细胞数据的快速、灵敏和精确整合。
Nat Methods. 2019 Dec;16(12):1289-1296. doi: 10.1038/s41592-019-0619-0. Epub 2019 Nov 18.
9
Exploring single-cell data with deep multitasking neural networks.用深度多任务神经网络探索单细胞数据。
Nat Methods. 2019 Nov;16(11):1139-1145. doi: 10.1038/s41592-019-0576-7. Epub 2019 Oct 7.
10
Functional interpretation of single cell similarity maps.单细胞相似性图谱的功能解释。
Nat Commun. 2019 Sep 26;10(1):4376. doi: 10.1038/s41467-019-12235-0.