• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 Pamona 对异质单细胞多组学数据进行多样本整合

Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona.

机构信息

LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China.

School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Bioinformatics. 2021 Dec 22;38(1):211-219. doi: 10.1093/bioinformatics/btab594.

DOI:10.1093/bioinformatics/btab594
PMID:34398192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8696097/
Abstract

MOTIVATION

Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure.

RESULTS

In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment framework that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold alignment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset-specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods.

AVAILABILITYAND IMPLEMENTATION

Pamona software is available at https://github.com/caokai1073/Pamona.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单细胞多组学测序数据可以提供细胞的全面分子视图。然而,有效的综合分析方法仍然具有挑战性。现有的流形对齐方法在单细胞多组学数据集成方面表现出了最先进的性能,但它们通常受到限制,要求单细胞数据集来自相同的基础细胞结构。

结果

在这项研究中,我们提出了 Pamona,这是一种基于部分 Gromov-Wasserstein 距离的流形对齐框架,旨在描绘和表示跨模态的共享和数据集特定的细胞结构。我们将此任务表述为部分流形对齐问题,并开发了部分 Gromov-Wasserstein 最优传输框架来解决它。Pamona 根据跨数据集的细胞计算出的概率耦合来识别共享和数据集特定的细胞,并在公共低维空间中对齐细胞模态,同时保留共享和数据集特定的结构。我们的框架可以轻松地合并先验信息,例如细胞类型注释或细胞-细胞对应关系,以进一步提高对齐质量。我们在一组全面的公开基准数据集上评估了 Pamona。我们证明了 Pamona 可以准确地识别共享和数据集特定的细胞,以及忠实地恢复和对齐公共空间中异构单细胞模态的细胞结构,优于可比的现有方法。

可用性和实现

Pamona 软件可在 https://github.com/caokai1073/Pamona 上获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/dad0bdb7179c/btab594f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/af21bf4bc264/btab594f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/7c9eccdbcf83/btab594f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/855166720420/btab594f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/9537784e1316/btab594f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/dad0bdb7179c/btab594f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/af21bf4bc264/btab594f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/7c9eccdbcf83/btab594f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/855166720420/btab594f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/9537784e1316/btab594f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/492c/8696097/dad0bdb7179c/btab594f5.jpg

相似文献

1
Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona.使用 Pamona 对异质单细胞多组学数据进行多样本整合
Bioinformatics. 2021 Dec 22;38(1):211-219. doi: 10.1093/bioinformatics/btab594.
2
Unsupervised topological alignment for single-cell multi-omics integration.无监督拓扑对齐单细胞多组学整合。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i48-i56. doi: 10.1093/bioinformatics/btaa443.
3
A universal framework for single-cell multi-omics data integration with graph convolutional networks.基于图卷积网络的单细胞多组学数据集成通用框架
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad081.
4
SCOT: Single-Cell Multi-Omics Alignment with Optimal Transport.SCOT:基于最优传输的单细胞多组学整合分析。
J Comput Biol. 2022 Jan;29(1):3-18. doi: 10.1089/cmb.2021.0446.
5
scTopoGAN: unsupervised manifold alignment of single-cell data.scTopoGAN:单细胞数据的无监督流形对齐
Bioinform Adv. 2023 Nov 24;3(1):vbad171. doi: 10.1093/bioadv/vbad171. eCollection 2023.
6
MRGCN: cancer subtyping with multi-reconstruction graph convolutional network using full and partial multi-omics dataset.MRGCN:基于全和部分多组学数据集的多重建图卷积网络进行癌症亚型分类。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad353.
7
Clustering single-cell multi-omics data with MoClust.使用 MoClust 对单细胞多组学数据进行聚类。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac736.
8
Con-AAE: contrastive cycle adversarial autoencoders for single-cell multi-omics alignment and integration.Con-AAE:用于单细胞多组学比对和整合的对比循环对抗自动编码器。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad162.
9
Deep cross-omics cycle attention model for joint analysis of single-cell multi-omics data.用于单细胞多组学数据联合分析的深度跨组学循环注意力模型。
Bioinformatics. 2021 Nov 18;37(22):4091-4099. doi: 10.1093/bioinformatics/btab403.
10
Integrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis.基于结构贝叶斯因子分析,结合生物学信息,对多组学和影像数据进行综合分析。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad073.

引用本文的文献

1
Supervised Gromov-Wasserstein Optimal Transport with Metric-Preserving Constraints.具有度量保持约束的监督Gromov-Wasserstein最优传输
SIAM J Math Data Sci. 2025;7(1):301-328. doi: 10.1137/24m1630499. Epub 2025 Feb 20.
2
Securing diagonal integration of multimodal single-cell data against ambiguous mapping.确保多模态单细胞数据的对角整合以防止模糊映射。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf345.
3
SCOT+: A Comprehensive Software Suite for Single-Cell alignment Using Optimal Transport.SCOT+:一个使用最优传输进行单细胞比对的综合软件套件。

本文引用的文献

1
Machine learning for deciphering cell heterogeneity and gene regulation.用于解读细胞异质性和基因调控的机器学习
Nat Comput Sci. 2021 Mar;1(3):183-191. doi: 10.1038/s43588-021-00038-7. Epub 2021 Mar 15.
2
Jointly Embedding Multiple Single-Cell Omics Measurements.联合嵌入多个单细胞组学测量值
Algorithms Bioinform. 2019 Sep 3;143. doi: 10.4230/LIPIcs.WABI.2019.10.
3
Unsupervised manifold alignment for single-cell multi-omics data.单细胞多组学数据的无监督流形对齐
bioRxiv. 2025 May 27:2025.05.21.655322. doi: 10.1101/2025.05.21.655322.
4
HABiC: an algorithm based on the exact computation of the Kantorovich-Rubinstein optimizer for binary classification in transcriptomics.HABiC:一种基于康德罗维奇-鲁宾斯坦优化器精确计算的算法,用于转录组学中的二元分类。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf310.
5
Single-cell genomics of the mouse olfactory cortex reveals contrasts with neocortex and ancestral signatures of cell type evolution.小鼠嗅觉皮层的单细胞基因组学揭示了与新皮层的差异以及细胞类型进化的祖先特征。
Nat Neurosci. 2025 May;28(5):937-948. doi: 10.1038/s41593-025-01924-3. Epub 2025 Apr 8.
6
Optimal Transport Based Graph Kernels for Drug Property Prediction.用于药物性质预测的基于最优传输的图核
IEEE Open J Eng Med Biol. 2024 Oct 14;6:152-157. doi: 10.1109/OJEMB.2024.3480708. eCollection 2025.
7
Integration of single-cell transcriptome and chromatin accessibility and its application on tumor investigation.单细胞转录组与染色质可及性的整合及其在肿瘤研究中的应用。
Life Med. 2024 Apr 26;3(2):lnae015. doi: 10.1093/lifemedi/lnae015. eCollection 2024 Apr.
8
Knowledge-based inductive bias and domain adaptation for cell type annotation.基于知识的归纳偏差和领域自适应在细胞类型注释中的应用。
Commun Biol. 2024 Nov 5;7(1):1440. doi: 10.1038/s42003-024-07171-9.
9
Modal-nexus auto-encoder for multi-modality cellular data integration and imputation.模态关联自动编码器用于多模态细胞数据的整合和插补。
Nat Commun. 2024 Oct 18;15(1):9021. doi: 10.1038/s41467-024-53355-6.
10
Graspot: a graph attention network for spatial transcriptomics data integration with optimal transport.Graspot:一种基于图注意力网络的最优传输算法的空间转录组学数据整合方法。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii137-ii145. doi: 10.1093/bioinformatics/btae394.
ACM BCB. 2020 Sep;2020:1-10. doi: 10.1145/3388440.3412410.
4
Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data.单细胞转录组和开放染色质可及性数据的深度联合学习分析模型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa287.
5
Integrative analyses of single-cell transcriptome and regulome using MAESTRO.使用 MAESTRO 进行单细胞转录组和调控组的综合分析。
Genome Biol. 2020 Aug 7;21(1):198. doi: 10.1186/s13059-020-02116-x.
6
Unsupervised topological alignment for single-cell multi-omics integration.无监督拓扑对齐单细胞多组学整合。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i48-i56. doi: 10.1093/bioinformatics/btaa443.
7
Poincaré maps for analyzing complex hierarchies in single-cell data.用于分析单细胞数据中复杂层次结构的庞加莱映射。
Nat Commun. 2020 Jun 11;11(1):2966. doi: 10.1038/s41467-020-16822-4.
8
scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles.scAI:一种用于平行单细胞转录组学和表观基因组学综合分析的无监督方法。
Genome Biol. 2020 Feb 3;21(1):25. doi: 10.1186/s13059-020-1932-8.
9
Computational methods for single-cell omics across modalities.跨模态单细胞组学的计算方法
Nat Methods. 2020 Jan;17(1):14-17. doi: 10.1038/s41592-019-0692-4.
10
Multi-omics profiling of mouse gastrulation at single-cell resolution.单细胞分辨率下的小鼠原肠胚形成的多组学分析。
Nature. 2019 Dec;576(7787):487-491. doi: 10.1038/s41586-019-1825-8. Epub 2019 Dec 11.