• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。

UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.

Department of Computer Science and Engineering, University of Michigan, Ann Arbor, MI, USA.

出版信息

Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.

DOI:10.1038/s41467-022-28431-4
PMID:35140223
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8828882/
Abstract

Single-cell genomic technologies provide an unprecedented opportunity to define molecular cell types in a data-driven fashion, but present unique data integration challenges. Many analyses require "mosaic integration", including both features shared across datasets and features exclusive to a single experiment. Previous computational integration approaches require that the input matrices share the same number of either genes or cells, and thus can use only shared features. To address this limitation, we derive a nonnegative matrix factorization algorithm for integrating single-cell datasets containing both shared and unshared features. The key advance is incorporating an additional metagene matrix that allows unshared features to inform the factorization. We demonstrate that incorporating unshared features significantly improves integration of single-cell RNA-seq, spatial transcriptomic, SNARE-seq, and cross-species datasets. We have incorporated the UINMF algorithm into the open-source LIGER R package ( https://github.com/welch-lab/liger ).

摘要

单细胞基因组技术提供了一个前所未有的机会,可以以数据驱动的方式定义分子细胞类型,但也带来了独特的数据集成挑战。许多分析需要“镶嵌式集成”,包括跨数据集共享的特征和单个实验特有的特征。以前的计算集成方法要求输入矩阵共享相同数量的基因或细胞,因此只能使用共享特征。为了解决这个限制,我们推导出了一种非负矩阵分解算法,用于集成包含共享和非共享特征的单细胞数据集。关键的进展是引入了一个额外的元基因矩阵,允许非共享特征为分解提供信息。我们证明了纳入非共享特征可以显著改善单细胞 RNA-seq、空间转录组学、SNARE-seq 和跨物种数据集的集成。我们已经将 UINMF 算法纳入了开源 LIGER R 包(https://github.com/welch-lab/liger)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/104b1c16d83e/41467_2022_28431_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/d05a5a52e540/41467_2022_28431_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/3eed6097bef9/41467_2022_28431_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/e82bda06cda5/41467_2022_28431_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/8868dd4d874c/41467_2022_28431_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/82f9eb098dfc/41467_2022_28431_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/d76ea68f24bf/41467_2022_28431_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/0d41c2357dca/41467_2022_28431_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/104b1c16d83e/41467_2022_28431_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/d05a5a52e540/41467_2022_28431_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/3eed6097bef9/41467_2022_28431_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/e82bda06cda5/41467_2022_28431_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/8868dd4d874c/41467_2022_28431_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/82f9eb098dfc/41467_2022_28431_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/d76ea68f24bf/41467_2022_28431_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/0d41c2357dca/41467_2022_28431_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9613/8828882/104b1c16d83e/41467_2022_28431_Fig8_HTML.jpg

相似文献

1
UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。
Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.
2
Jointly defining cell types from multiple single-cell datasets using LIGER.使用 LIGER 从多个单细胞数据集联合定义细胞类型。
Nat Protoc. 2020 Nov;15(11):3632-3662. doi: 10.1038/s41596-020-0391-8. Epub 2020 Oct 12.
3
Integration and transfer learning of single-cell transcriptomes via cFIT.通过 cFIT 实现单细胞转录组的整合和迁移学习。
Proc Natl Acad Sci U S A. 2021 Mar 9;118(10). doi: 10.1073/pnas.2024383118.
4
BEM: Mining Coregulation Patterns in Transcriptomics via Boolean Matrix Factorization.BEM:通过布尔矩阵分解挖掘转录组学中的核心调控模式。
Bioinformatics. 2020 Jul 1;36(13):4030-4037. doi: 10.1093/bioinformatics/btz977.
5
STACAS: Sub-Type Anchor Correction for Alignment in Seurat to integrate single-cell RNA-seq data.STACAS:在 Seurat 中进行亚型锚定校正以整合单细胞 RNA-seq 数据。
Bioinformatics. 2021 May 5;37(6):882-884. doi: 10.1093/bioinformatics/btaa755.
6
PyLiger: scalable single-cell multi-omic data integration in Python.PyLiger:用于 Python 的可扩展单细胞多组学数据集成。
Bioinformatics. 2022 May 13;38(10):2946-2948. doi: 10.1093/bioinformatics/btac190.
7
Iterative single-cell multi-omic integration using online learning.基于在线学习的迭代单细胞多组学整合。
Nat Biotechnol. 2021 Aug;39(8):1000-1007. doi: 10.1038/s41587-021-00867-x. Epub 2021 Apr 19.
8
NMFLRR: Clustering scRNA-Seq Data by Integrating Nonnegative Matrix Factorization With Low Rank Representation.NMFLRR:基于非负矩阵分解和低秩表示整合的 scRNA-Seq 数据聚类
IEEE J Biomed Health Inform. 2022 Mar;26(3):1394-1405. doi: 10.1109/JBHI.2021.3099127. Epub 2022 Mar 7.
9
SMILE: mutual information learning for integration of single-cell omics data.SMILE:单细胞组学数据集成的互信息学习。
Bioinformatics. 2022 Jan 3;38(2):476-486. doi: 10.1093/bioinformatics/btab706.
10
Bayesian semi-nonnegative matrix tri-factorization to identify pathways associated with cancer phenotypes.贝叶斯半非负矩阵三因子分解鉴定与癌症表型相关的途径。
Pac Symp Biocomput. 2020;25:427-438.

引用本文的文献

1
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.多组学数据整合方法的技术综述:从经典统计方法到深度生成方法
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.
2
scMultiSim: simulation of single-cell multi-omics and spatial data guided by gene regulatory networks and cell-cell interactions.scMultiSim:由基因调控网络和细胞间相互作用引导的单细胞多组学和空间数据模拟。
Nat Methods. 2025 May;22(5):982-993. doi: 10.1038/s41592-025-02651-0. Epub 2025 Apr 17.
3
Empowering integrative and collaborative exploration of single-cell and spatial multimodal data with SGS genome browser.

本文引用的文献

1
MultiMAP: dimensionality reduction and integration of multimodal data.MultiMAP:多模态数据的降维和整合。
Genome Biol. 2021 Dec 20;22(1):346. doi: 10.1186/s13059-021-02565-y.
2
Jointly Embedding Multiple Single-Cell Omics Measurements.联合嵌入多个单细胞组学测量值
Algorithms Bioinform. 2019 Sep 3;143. doi: 10.4230/LIPIcs.WABI.2019.10.
3
Computational principles and challenges in single-cell data integration.单细胞数据整合的计算原理与挑战。
借助SGS基因组浏览器助力单细胞和空间多模态数据的整合与协作探索。
Cell Genom. 2025 May 14;5(5):100848. doi: 10.1016/j.xgen.2025.100848. Epub 2025 Apr 14.
4
Single-Cell Multi-Omics: Insights into Therapeutic Innovations to Advance Treatment in Cancer.单细胞多组学:对推动癌症治疗进展的治疗创新的见解
Int J Mol Sci. 2025 Mar 9;26(6):2447. doi: 10.3390/ijms26062447.
5
Examining the dynamics of three-dimensional genome organization with multitask matrix factorization.利用多任务矩阵分解研究三维基因组组织的动力学
Genome Res. 2025 May 2;35(5):1179-1193. doi: 10.1101/gr.279930.124.
6
AI-driven multi-omics integration for multi-scale predictive modeling of genotype-environment-phenotype relationships.用于基因型-环境-表型关系多尺度预测建模的人工智能驱动的多组学整合
Comput Struct Biotechnol J. 2025 Jan 2;27:265-277. doi: 10.1016/j.csbj.2024.12.030. eCollection 2025.
7
Single-cell mosaic integration and cell state transfer with auto-scaling self-attention mechanism.单细胞马赛克整合和细胞状态转移与自动缩放自注意机制。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae540.
8
Benchmarking algorithms for single-cell multi-omics prediction and integration.单细胞多组学预测和整合算法的基准测试。
Nat Methods. 2024 Nov;21(11):2182-2194. doi: 10.1038/s41592-024-02429-w. Epub 2024 Sep 25.
9
scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization.scRNMF:一种基于鲁棒非负矩阵分解的单细胞 RNA-seq 数据插补方法。
PLoS Comput Biol. 2024 Aug 8;20(8):e1012339. doi: 10.1371/journal.pcbi.1012339. eCollection 2024 Aug.
10
A scalable approach to topic modelling in single-cell data by approximate pseudobulk projection.基于近似伪总体投影的单细胞数据主题建模可扩展方法。
Life Sci Alliance. 2024 Aug 6;7(10). doi: 10.26508/lsa.202402713. Print 2024 Oct.
Nat Biotechnol. 2021 Oct;39(10):1202-1215. doi: 10.1038/s41587-021-00895-7. Epub 2021 May 3.
4
Joint probabilistic modeling of single-cell multi-omic data with totalVI.单细胞多组学数据的总变分联合概率建模。
Nat Methods. 2021 Mar;18(3):272-282. doi: 10.1038/s41592-020-01050-x. Epub 2021 Feb 15.
5
Method of the Year 2020: spatially resolved transcriptomics.2020年度方法:空间分辨转录组学。
Nat Methods. 2021 Jan;18(1):1. doi: 10.1038/s41592-020-01042-x.
6
Chromatin Potential Identified by Shared Single-Cell Profiling of RNA and Chromatin.基于 RNA 和染色质的共享单细胞分析鉴定染色质潜能
Cell. 2020 Nov 12;183(4):1103-1116.e20. doi: 10.1016/j.cell.2020.09.056. Epub 2020 Oct 23.
7
Hybridization-based in situ sequencing (HybISS) for spatially resolved transcriptomics in human and mouse brain tissue.基于杂交的原位测序(HybISS)在人类和小鼠脑组织中进行空间分辨转录组学分析。
Nucleic Acids Res. 2020 Nov 4;48(19):e112. doi: 10.1093/nar/gkaa792.
8
MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data.MOFA+:一种全面整合多模态单细胞数据的统计框架。
Genome Biol. 2020 May 11;21(1):111. doi: 10.1186/s13059-020-02015-1.
9
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.
10
Method of the Year 2019: Single-cell multimodal omics.2019年度方法:单细胞多组学
Nat Methods. 2020 Jan;17(1):1. doi: 10.1038/s41592-019-0703-5.