• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于联合 VAE 模型构建可学习的单细胞图谱通用坐标系。

Building a learnable universal coordinate system for single-cell atlas with a joint-VAE model.

机构信息

MOE Key Laboratory of Bioinformatics and Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing, China.

School of Life Sciences and School of Medicine, Center for Synthetic and Systems Biology, Tsinghua University, Beijing, China.

出版信息

Commun Biol. 2024 Aug 12;7(1):977. doi: 10.1038/s42003-024-06564-0.

DOI:10.1038/s42003-024-06564-0
PMID:39134617
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11319358/
Abstract

A universal coordinate system that can ensemble the huge number of cells and capture their heterogeneities is of vital importance for constructing large-scale cell atlases as references for molecular and cellular studies. Studies have shown that cells exhibit multifaceted heterogeneities in their transcriptomic features at multiple resolutions. This nature of complexity makes it hard to design a fixed coordinate system through a combination of known features. It is desirable to build a learnable universal coordinate model that can capture major heterogeneities and serve as a controlled generative model for data augmentation. We developed UniCoord, a specially-tuned joint-VAE model to represent single-cell transcriptomic data in a lower-dimensional latent space with high interpretability. Each latent dimension can represent either discrete or continuous feature, and either supervised by prior knowledge or unsupervised. The latent dimensions can be easily reconfigured to generate pseudo transcriptomic profiles with desired properties. UniCoord can also be used as a pre-trained model to analyze new data with unseen cell types and thus can serve as a feasible framework for cell annotation and comparison. UniCoord provides a prototype for a learnable universal coordinate framework to enable better analysis and generation of cells with highly orchestrated functions and heterogeneities.

摘要

建立一个能够整合大量细胞并捕捉其异质性的通用坐标系,对于构建大规模细胞图谱作为分子和细胞研究的参考至关重要。研究表明,细胞在转录组特征的多个分辨率上表现出多方面的异质性。这种复杂性使得通过已知特征的组合来设计固定的坐标系统变得困难。因此,建立一个可学习的通用坐标模型来捕捉主要异质性,并作为数据增强的受控生成模型是很有必要的。我们开发了 UniCoord,这是一个专门调整的联合变分自编码器模型,可以在一个具有高可解释性的低维潜在空间中表示单细胞转录组数据。每个潜在维度都可以表示离散或连续的特征,并且可以由先验知识或无监督学习来监督。潜在维度可以轻松重新配置,以生成具有所需特性的伪转录组谱。UniCoord 还可以用作预训练模型来分析具有未见细胞类型的新数据,因此可以作为细胞注释和比较的可行框架。UniCoord 为可学习的通用坐标框架提供了一个原型,以实现对具有高度协调功能和异质性的细胞的更好分析和生成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/41c09a73ca84/42003_2024_6564_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/3ecd94086559/42003_2024_6564_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/68adb2ac5dca/42003_2024_6564_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/288662ea329c/42003_2024_6564_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/41c09a73ca84/42003_2024_6564_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/3ecd94086559/42003_2024_6564_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/68adb2ac5dca/42003_2024_6564_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/288662ea329c/42003_2024_6564_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4dad/11319358/41c09a73ca84/42003_2024_6564_Fig4_HTML.jpg

相似文献

1
Building a learnable universal coordinate system for single-cell atlas with a joint-VAE model.基于联合 VAE 模型构建可学习的单细胞图谱通用坐标系。
Commun Biol. 2024 Aug 12;7(1):977. doi: 10.1038/s42003-024-06564-0.
2
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
3
Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders.使用变分自编码器从癌症转录组中提取生物学相关的潜在空间。
Pac Symp Biocomput. 2018;23:80-91.
4
Continually adapting pre-trained language model to universal annotation of single-cell RNA-seq data.持续调整预先训练的语言模型,以实现单细胞 RNA-seq 数据的通用注释。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae047.
5
Beaconet: A Reference-Free Method for Integrating Multiple Batches of Single-Cell Transcriptomic Data in Original Molecular Space.Beaconet:一种在原始分子空间中整合多个批次单细胞转录组数据的无参方法。
Adv Sci (Weinh). 2024 Jul;11(26):e2306770. doi: 10.1002/advs.202306770. Epub 2024 May 6.
6
A Universal Method for Crossing Molecular and Atlas Modalities using Simplex-Based Image Varifolds and Quadratic Programming.一种使用基于单纯形的图像变分曲面和二次规划来跨越分子和图谱模态的通用方法。
bioRxiv. 2023 Mar 29:2023.03.28.534622. doi: 10.1101/2023.03.28.534622.
7
Elucidating transcriptomic profiles from single-cell RNA sequencing data using nature-inspired compressed sensing.利用受自然启发的压缩感知技术从单细胞 RNA 测序数据中阐明转录组特征。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab125.
8
A variational autoencoder trained with priors from canonical pathways increases the interpretability of transcriptome data.基于经典通路先验信息训练的变分自编码器提高了转录组数据的可解释性。
PLoS Comput Biol. 2024 Jul 3;20(7):e1011198. doi: 10.1371/journal.pcbi.1011198. eCollection 2024 Jul.
9
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.
10
scPLAN: a hierarchical computational framework for single transcriptomics data annotation, integration and cell-type label refinement.scPLAN:一种用于单细胞转录组学数据注释、整合和细胞类型标签细化的分层计算框架。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae305.

引用本文的文献

1
Variational inference of single cell time series.单细胞时间序列的变分推断
bioRxiv. 2024 Aug 30:2024.08.29.610389. doi: 10.1101/2024.08.29.610389.

本文引用的文献

1
Large-scale foundation model on single-cell transcriptomics.单细胞转录组学的大规模基础模型。
Nat Methods. 2024 Aug;21(8):1481-1491. doi: 10.1038/s41592-024-02305-7. Epub 2024 Jun 6.
2
stDiff: a diffusion model for imputing spatial transcriptomics through single-cell transcriptomics.stDiff:一种通过单细胞转录组学推断空间转录组学的扩散模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae171.
3
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
4
STEM enables mapping of single-cell and spatial transcriptomics data with transfer learning.STEM 可通过迁移学习实现单细胞和空间转录组学数据的映射。
Commun Biol. 2024 Jan 6;7(1):56. doi: 10.1038/s42003-023-05640-1.
5
Transfer learning enables predictions in network biology.迁移学习可实现网络生物学预测。
Nature. 2023 Jun;618(7965):616-624. doi: 10.1038/s41586-023-06139-9. Epub 2023 May 31.
6
Protocol for profiling cell-centric assembled single-cell human transcriptome data in hECA.用于分析 hECA 中基于细胞的组装单细胞人类转录组数据的方案。
STAR Protoc. 2022 Jul 31;3(3):101589. doi: 10.1016/j.xpro.2022.101589. eCollection 2022 Sep 16.
7
What is a cell type and how to define it?什么是细胞类型,如何定义它?
Cell. 2022 Jul 21;185(15):2739-2755. doi: 10.1016/j.cell.2022.06.031.
8
Single-cell analyses define a continuum of cell state and composition changes in the malignant transformation of polyps to colorectal cancer.单细胞分析定义了息肉恶变为结直肠癌过程中细胞状态和组成变化的连续体。
Nat Genet. 2022 Jul;54(7):985-995. doi: 10.1038/s41588-022-01088-x. Epub 2022 Jun 20.
9
hECA: The cell-centric assembly of a cell atlas.人类细胞图谱计划(hECA):以细胞为中心构建细胞图谱。
iScience. 2022 Apr 28;25(5):104318. doi: 10.1016/j.isci.2022.104318. eCollection 2022 May 20.
10
Cross-tissue immune cell analysis reveals tissue-specific features in humans.跨组织免疫细胞分析揭示人类组织特异性特征。
Science. 2022 May 13;376(6594):eabl5197. doi: 10.1126/science.abl5197.