• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞马赛克整合和细胞状态转移与自动缩放自注意机制。

Single-cell mosaic integration and cell state transfer with auto-scaling self-attention mechanism.

机构信息

Department of Biostatistics, School of Public Health, Peking University, 38 Xueyuan Rd., Haidian District, Beijing 100191, China.

Peking University Cancer Hospital, 52 Fucheng Rd., Haidian District, Beijing 100142, China.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae540.

DOI:10.1093/bib/bbae540
PMID:39438079
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11495875/
Abstract

The integration of data from multiple modalities generated by single-cell omics technologies is crucial for accurately identifying cell states. One challenge in comprehending multi-omics data resides in mosaic integration, in which different data modalities are profiled in different subsets of cells, as it requires simultaneous batch effect removal and modality alignment. Here, we develop Multi-omics Mosaic Auto-scaling Attention Variational Inference (mmAAVI), a scalable deep generative model for single-cell mosaic integration. Leveraging auto-scaling self-attention mechanisms, mmAAVI can map arbitrary combinations of omics to the common embedding space. If existing well-annotated cell states, the model can perform semisupervised learning to utilize existing these annotations. We validated the performance of mmAAVI and five other commonly used methods on four benchmark datasets, which vary in cell numbers, omics types, and missing patterns. mmAAVI consistently demonstrated its superiority. We also validated mmAAVI's ability for cell state knowledge transfer, achieving balanced accuracies of 0.82 and 0.97 with less 1% labeled cells between batches with completely different omics. The full package is available at https://github.com/luyiyun/mmAAVI.

摘要

单细胞多组学技术产生的多模态数据的整合对于准确识别细胞状态至关重要。理解多组学数据的一个挑战在于嵌合体整合,其中不同的数据模态在不同的细胞亚群中进行分析,因为它需要同时去除批次效应和模态对齐。在这里,我们开发了 Multi-omics Mosaic Auto-scaling Attention Variational Inference (mmAAVI),这是一种用于单细胞嵌合体整合的可扩展深度生成模型。利用自缩放自注意机制,mmAAVI 可以将任意组合的组学映射到共同的嵌入空间中。如果存在经过良好注释的细胞状态,该模型可以进行半监督学习以利用这些现有注释。我们在四个基准数据集上验证了 mmAAVI 和其他五种常用方法的性能,这些数据集在细胞数量、组学类型和缺失模式方面有所不同。mmAAVI 始终表现出优越性。我们还验证了 mmAAVI 在细胞状态知识转移方面的能力,在批次之间具有完全不同的组学的情况下,使用少于 1%的标记细胞实现了平衡准确率为 0.82 和 0.97。完整的软件包可在 https://github.com/luyiyun/mmAAVI 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/408d9b3bf59a/bbae540f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/658a0a961df8/bbae540f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/56426390d02a/bbae540f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/e920d7460413/bbae540f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/408d9b3bf59a/bbae540f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/658a0a961df8/bbae540f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/56426390d02a/bbae540f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/e920d7460413/bbae540f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70cc/11495875/408d9b3bf59a/bbae540f4.jpg

相似文献

1
Single-cell mosaic integration and cell state transfer with auto-scaling self-attention mechanism.单细胞马赛克整合和细胞状态转移与自动缩放自注意机制。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae540.
2
Mosaic integration and knowledge transfer of single-cell multimodal data with MIDAS.使用 MIDAS 进行单细胞多模态数据的嵌合体整合和知识转移。
Nat Biotechnol. 2024 Oct;42(10):1594-1605. doi: 10.1038/s41587-023-02040-y. Epub 2024 Jan 23.
3
Multi-omics integration for both single-cell and spatially resolved data based on dual-path graph attention auto-encoder.基于双通道图注意自动编码器的单细胞和空间分辨数据的多组学整合。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae450.
4
scCross: a deep generative model for unifying single-cell multi-omics with seamless integration, cross-modal generation, and in silico exploration.scCross:一个深度生成模型,用于将单细胞多组学数据进行统一,实现无缝集成、跨模态生成和计算探索。
Genome Biol. 2024 Jul 29;25(1):198. doi: 10.1186/s13059-024-03338-z.
5
mosaicMPI: a framework for modular data integration across cohorts and -omics modalities.马赛克 MPI:一个跨队列和组学模式进行模块化数据集成的框架。
Nucleic Acids Res. 2024 Jul 8;52(12):e53. doi: 10.1093/nar/gkae442.
6
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding.基于图链接嵌入的多组学单细胞数据整合与调控推断。
Nat Biotechnol. 2022 Oct;40(10):1458-1466. doi: 10.1038/s41587-022-01284-4. Epub 2022 May 2.
7
Robust probabilistic modeling for single-cell multimodal mosaic integration and imputation via scVAEIT.通过 scVAEIT 对单细胞多模态镶嵌整合和插补进行稳健的概率建模。
Proc Natl Acad Sci U S A. 2022 Dec 6;119(49):e2214414119. doi: 10.1073/pnas.2214414119. Epub 2022 Dec 2.
8
Unsupervised topological alignment for single-cell multi-omics integration.无监督拓扑对齐单细胞多组学整合。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i48-i56. doi: 10.1093/bioinformatics/btaa443.
9
scGCN is a graph convolutional networks algorithm for knowledge transfer in single cell omics.scGCN 是一种用于单细胞组学中知识迁移的图卷积网络算法。
Nat Commun. 2021 Jun 22;12(1):3826. doi: 10.1038/s41467-021-24172-y.
10
SIMBA: single-cell embedding along with features.SIMBA:单细胞特征嵌入。
Nat Methods. 2024 Jun;21(6):1003-1013. doi: 10.1038/s41592-023-01899-8. Epub 2023 May 29.

本文引用的文献

1
Mosaic integration and knowledge transfer of single-cell multimodal data with MIDAS.使用 MIDAS 进行单细胞多模态数据的嵌合体整合和知识转移。
Nat Biotechnol. 2024 Oct;42(10):1594-1605. doi: 10.1038/s41587-023-02040-y. Epub 2024 Jan 23.
2
Omics-based deep learning approaches for lung cancer decision-making and therapeutics development.基于组学的深度学习方法在肺癌决策和治疗开发中的应用。
Brief Funct Genomics. 2024 May 15;23(3):181-192. doi: 10.1093/bfgp/elad031.
3
The technological landscape and applications of single-cell multi-omics.
单细胞多组学的技术领域和应用。
Nat Rev Mol Cell Biol. 2023 Oct;24(10):695-713. doi: 10.1038/s41580-023-00615-w. Epub 2023 Jun 6.
4
Stabilized mosaic single-cell data integration using unshared features.使用非共享特征稳定镶嵌单细胞数据集成。
Nat Biotechnol. 2024 Feb;42(2):284-292. doi: 10.1038/s41587-023-01766-z. Epub 2023 May 25.
5
scMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection.scMoMaT 联合执行单细胞马赛克整合和多模式生物标志物检测。
Nat Commun. 2023 Jan 24;14(1):384. doi: 10.1038/s41467-023-36066-2.
6
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding.基于图链接嵌入的多组学单细胞数据整合与调控推断。
Nat Biotechnol. 2022 Oct;40(10):1458-1466. doi: 10.1038/s41587-022-01284-4. Epub 2022 May 2.
7
UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。
Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.
8
Cobolt: integrative analysis of multimodal single-cell sequencing data.科博尔特:多模态单细胞测序数据的综合分析。
Genome Biol. 2021 Dec 28;22(1):351. doi: 10.1186/s13059-021-02556-z.
9
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
10
MultiMAP: dimensionality reduction and integration of multimodal data.MultiMAP:多模态数据的降维和整合。
Genome Biol. 2021 Dec 20;22(1):346. doi: 10.1186/s13059-021-02565-y.