• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞数据整合的最优传输统一计算框架。

A unified computational framework for single-cell data integration with optimal transport.

机构信息

LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.

School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China.

出版信息

Nat Commun. 2022 Dec 1;13(1):7419. doi: 10.1038/s41467-022-35094-8.

DOI:10.1038/s41467-022-35094-8
PMID:36456571
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9715710/
Abstract

Single-cell data integration can provide a comprehensive molecular view of cells. However, how to integrate heterogeneous single-cell multi-omics as well as spatially resolved transcriptomic data remains a major challenge. Here we introduce uniPort, a unified single-cell data integration framework that combines a coupled variational autoencoder (coupled-VAE) and minibatch unbalanced optimal transport (Minibatch-UOT). It leverages both highly variable common and dataset-specific genes for integration to handle the heterogeneity across datasets, and it is scalable to large-scale datasets. uniPort jointly embeds heterogeneous single-cell multi-omics datasets into a shared latent space. It can further construct a reference atlas for gene imputation across datasets. Meanwhile, uniPort provides a flexible label transfer framework to deconvolute heterogeneous spatial transcriptomic data using an optimal transport plan, instead of embedding latent space. We demonstrate the capability of uniPort by applying it to integrate a variety of datasets, including single-cell transcriptomics, chromatin accessibility, and spatially resolved transcriptomic data.

摘要

单细胞数据整合可以提供细胞的全面分子视图。然而,如何整合异质的单细胞多组学以及空间分辨转录组数据仍然是一个主要挑战。在这里,我们介绍了 uniPort,这是一个统一的单细胞数据整合框架,它结合了耦合变分自动编码器(coupled-VAE)和小批量不平衡最优传输(Minibatch-UOT)。它利用高度可变的常见基因和数据集特定基因进行整合,以处理数据集之间的异质性,并且可扩展到大规模数据集。uniPort 将异质的单细胞多组学数据集联合嵌入到一个共享的潜在空间中。它可以进一步构建跨数据集的基因插补参考图谱。同时,uniPort 提供了一个灵活的标签转移框架,使用最优传输方案对异质的空间转录组数据进行去卷积,而不是嵌入潜在空间。我们通过将其应用于整合各种数据集来展示 uniPort 的能力,包括单细胞转录组学、染色质可及性和空间分辨转录组学数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/8adaeba46b68/41467_2022_35094_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/226622a993ca/41467_2022_35094_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/7a8669fd8962/41467_2022_35094_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/36048845bd6a/41467_2022_35094_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/2e534ddf7648/41467_2022_35094_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/97bad04d915f/41467_2022_35094_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/3d8aabff6155/41467_2022_35094_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/8adaeba46b68/41467_2022_35094_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/226622a993ca/41467_2022_35094_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/7a8669fd8962/41467_2022_35094_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/36048845bd6a/41467_2022_35094_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/2e534ddf7648/41467_2022_35094_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/97bad04d915f/41467_2022_35094_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/3d8aabff6155/41467_2022_35094_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dad/9715710/8adaeba46b68/41467_2022_35094_Fig7_HTML.jpg

相似文献

1
A unified computational framework for single-cell data integration with optimal transport.单细胞数据整合的最优传输统一计算框架。
Nat Commun. 2022 Dec 1;13(1):7419. doi: 10.1038/s41467-022-35094-8.
2
Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data.单细胞转录组和开放染色质可及性数据的深度联合学习分析模型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa287.
3
Probabilistic embedding, clustering, and alignment for integrating spatial transcriptomics data with PRECAST.概率嵌入、聚类和对齐,用于将空间转录组学数据与 PRECAST 整合。
Nat Commun. 2023 Jan 18;14(1):296. doi: 10.1038/s41467-023-35947-w.
4
Robust probabilistic modeling for single-cell multimodal mosaic integration and imputation via scVAEIT.通过 scVAEIT 对单细胞多模态镶嵌整合和插补进行稳健的概率建模。
Proc Natl Acad Sci U S A. 2022 Dec 6;119(49):e2214414119. doi: 10.1073/pnas.2214414119. Epub 2022 Dec 2.
5
InClust+: the deep generative framework with mask modules for multimodal data integration, imputation, and cross-modal generation.InClust+:带有掩码模块的深度生成框架,用于多模态数据集成、插补和跨模态生成。
BMC Bioinformatics. 2024 Jan 24;25(1):41. doi: 10.1186/s12859-024-05656-2.
6
Graspot: a graph attention network for spatial transcriptomics data integration with optimal transport.Graspot:一种基于图注意力网络的最优传输算法的空间转录组学数据整合方法。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii137-ii145. doi: 10.1093/bioinformatics/btae394.
7
Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics.用于非靶向代谢组学中缺失值插补的多视图变分自编码器
ArXiv. 2024 Mar 12:arXiv:2310.07990v2.
8
A versatile and scalable single-cell data integration algorithm based on domain-adversarial and variational approximation.基于对抗域和变分逼近的通用可扩展单细胞数据集成算法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab400.
9
Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona.使用 Pamona 对异质单细胞多组学数据进行多样本整合
Bioinformatics. 2021 Dec 22;38(1):211-219. doi: 10.1093/bioinformatics/btab594.
10
SIMBA: single-cell embedding along with features.SIMBA:单细胞特征嵌入。
Nat Methods. 2024 Jun;21(6):1003-1013. doi: 10.1038/s41592-023-01899-8. Epub 2023 May 29.

引用本文的文献

1
Integrative snRNA-seq, molecular docking and dynamics simulations identifies Lasmiditan as drug candidate for Alzheimer's disease.整合性单细胞核RNA测序、分子对接和动力学模拟确定拉米地坦为阿尔茨海默病的候选药物。
Clin Transl Med. 2025 Aug;15(8):e70443. doi: 10.1002/ctm2.70443.
2
Semi-supervised contrastive learning variational autoencoder Integrating single-cell multimodal mosaic datasets.整合单细胞多模态镶嵌数据集的半监督对比学习变分自编码器
BMC Bioinformatics. 2025 Aug 4;26(1):206. doi: 10.1186/s12859-025-06239-5.
3
Applications of Spatial Transcriptomics in Veterinary Medicine: A Scoping Review of Research, Diagnostics, and Treatment Strategies.

本文引用的文献

1
Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space.通过将异质数据集映射到共同的细胞嵌入空间来实现单细胞数据的在线整合。
Nat Commun. 2022 Oct 17;13(1):6118. doi: 10.1038/s41467-022-33758-z.
2
Benchmarking spatial and single-cell transcriptomics integration methods for transcript distribution prediction and cell type deconvolution.用于转录本分布预测和细胞类型反卷积的空间和单细胞转录组学整合方法的基准测试
Nat Methods. 2022 Jun;19(6):662-670. doi: 10.1038/s41592-022-01480-9. Epub 2022 May 16.
3
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding.
空间转录组学在兽医学中的应用:研究、诊断和治疗策略的范围综述
Int J Mol Sci. 2025 Jun 26;26(13):6163. doi: 10.3390/ijms26136163.
4
scMGCL: accurate and efficient integration representation of single-cell multi-omics data.scMGCL:单细胞多组学数据的准确高效整合表示
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf392.
5
SCOT+: A Comprehensive Software Suite for Single-Cell alignment Using Optimal Transport.SCOT+:一个使用最优传输进行单细胞比对的综合软件套件。
bioRxiv. 2025 May 27:2025.05.21.655322. doi: 10.1101/2025.05.21.655322.
6
sciLaMA: A Single-Cell Representation Learning Framework to Leverage Prior Knowledge from Large Language Models.sciLaMA:一种利用大语言模型先验知识的单细胞表示学习框架。
bioRxiv. 2025 May 29:2025.01.28.635153. doi: 10.1101/2025.01.28.635153.
7
Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis.将动态系统建模与时空单细胞RNA测序数据分析相结合。
Entropy (Basel). 2025 Apr 22;27(5):453. doi: 10.3390/e27050453.
8
Partially characterized topology guides reliable anchor-free scRNA-integration.部分特征化的拓扑结构指导可靠的无锚单细胞RNA整合。
Commun Biol. 2025 Apr 4;8(1):561. doi: 10.1038/s42003-025-07988-y.
9
Deep learning in single-cell and spatial transcriptomics data analysis: advances and challenges from a data science perspective.从数据科学视角看深度学习在单细胞和空间转录组学数据分析中的进展与挑战
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf136.
10
COME: contrastive mapping learning for spatial reconstruction of single-cell RNA sequencing data.COME:用于单细胞RNA测序数据空间重建的对比映射学习
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf083.
基于图链接嵌入的多组学单细胞数据整合与调控推断。
Nat Biotechnol. 2022 Oct;40(10):1458-1466. doi: 10.1038/s41587-022-01284-4. Epub 2022 May 2.
4
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.
5
Tertiary lymphoid structures in cancer.癌症中的三级淋巴结构。
Science. 2022 Jan 7;375(6576):eabf9419. doi: 10.1126/science.abf9419.
6
Single-Cell Multiomics Integration by SCOT.单细胞多组学整合分析技术(SCOT)
J Comput Biol. 2022 Jan;29(1):19-22. doi: 10.1089/cmb.2021.0477. Epub 2022 Jan 5.
7
Cobolt: integrative analysis of multimodal single-cell sequencing data.科博尔特:多模态单细胞测序数据的综合分析。
Genome Biol. 2021 Dec 28;22(1):351. doi: 10.1186/s13059-021-02556-z.
8
MultiMAP: dimensionality reduction and integration of multimodal data.MultiMAP:多模态数据的降维和整合。
Genome Biol. 2021 Dec 20;22(1):346. doi: 10.1186/s13059-021-02565-y.
9
Pancreatic Cancer and Platelets Crosstalk: A Potential Biomarker and Target.胰腺癌与血小板的相互作用:一种潜在的生物标志物和靶点。
Front Cell Dev Biol. 2021 Nov 10;9:749689. doi: 10.3389/fcell.2021.749689. eCollection 2021.
10
Single-cell chromatin state analysis with Signac.使用 Signac 进行单细胞染色质状态分析。
Nat Methods. 2021 Nov;18(11):1333-1341. doi: 10.1038/s41592-021-01282-5. Epub 2021 Nov 1.