• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

概率张量分解从单细胞多组学数据中提取更好的潜在嵌入。

Probabilistic tensor decomposition extracts better latent embeddings from single-cell multiomic data.

机构信息

Department of Computer Science, City University of Hong Kong, Hong Kong.

出版信息

Nucleic Acids Res. 2023 Aug 25;51(15):e81. doi: 10.1093/nar/gkad570.

DOI:10.1093/nar/gkad570
PMID:37403780
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10450184/
Abstract

Single-cell sequencing technology enables the simultaneous capture of multiomic data from multiple cells. The captured data can be represented by tensors, i.e. the higher-rank matrices. However, the existing analysis tools often take the data as a collection of two-order matrices, renouncing the correspondences among the features. Consequently, we propose a probabilistic tensor decomposition framework, SCOIT, to extract embeddings from single-cell multiomic data. SCOIT incorporates various distributions, including Gaussian, Poisson, and negative binomial distributions, to deal with sparse, noisy, and heterogeneous single-cell data. Our framework can decompose a multiomic tensor into a cell embedding matrix, a gene embedding matrix, and an omic embedding matrix, allowing for various downstream analyses. We applied SCOIT to eight single-cell multiomic datasets from different sequencing protocols. With cell embeddings, SCOIT achieves superior performance for cell clustering compared to nine state-of-the-art tools under various metrics, demonstrating its ability to dissect cellular heterogeneity. With the gene embeddings, SCOIT enables cross-omics gene expression analysis and integrative gene regulatory network study. Furthermore, the embeddings allow cross-omics imputation simultaneously, outperforming current imputation methods with the Pearson correlation coefficient increased by 3.38-39.26%; moreover, SCOIT accommodates the scenario that subsets of the cells are with merely one omic profile available.

摘要

单细胞测序技术能够同时从多个细胞中捕获多组学数据。捕获的数据可以用张量表示,即更高阶矩阵。然而,现有的分析工具通常将数据视为二维矩阵的集合,放弃了特征之间的对应关系。因此,我们提出了一个概率张量分解框架 SCOIT,用于从单细胞多组学数据中提取嵌入。SCOIT 结合了各种分布,包括高斯分布、泊松分布和负二项式分布,以处理稀疏、嘈杂和异质的单细胞数据。我们的框架可以将一个多组学张量分解为一个细胞嵌入矩阵、一个基因嵌入矩阵和一个组学嵌入矩阵,允许进行各种下游分析。我们将 SCOIT 应用于来自不同测序方案的八个单细胞多组学数据集。使用细胞嵌入,SCOIT 在各种指标下与九种最先进的工具相比,在细胞聚类方面表现出优异的性能,证明了其解析细胞异质性的能力。使用基因嵌入,SCOIT 可以进行跨组学基因表达分析和综合基因调控网络研究。此外,嵌入还允许同时进行跨组学插补,与当前的插补方法相比,Pearson 相关系数提高了 3.38-39.26%;此外,SCOIT 还适应了只有一个组学谱可用的细胞子集的情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/8fd078d782df/gkad570fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/ea54cc31eefe/gkad570figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/caa1a4fd031e/gkad570fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/e0ad01082db0/gkad570fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/0d27811d77bc/gkad570fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/cfa501399502/gkad570fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/78a1ce7cf396/gkad570fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/ccc5d519c7df/gkad570fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/41736bc4e34e/gkad570fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/8fd078d782df/gkad570fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/ea54cc31eefe/gkad570figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/caa1a4fd031e/gkad570fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/e0ad01082db0/gkad570fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/0d27811d77bc/gkad570fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/cfa501399502/gkad570fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/78a1ce7cf396/gkad570fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/ccc5d519c7df/gkad570fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/41736bc4e34e/gkad570fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f52/10450184/8fd078d782df/gkad570fig8.jpg

相似文献

1
Probabilistic tensor decomposition extracts better latent embeddings from single-cell multiomic data.概率张量分解从单细胞多组学数据中提取更好的潜在嵌入。
Nucleic Acids Res. 2023 Aug 25;51(15):e81. doi: 10.1093/nar/gkad570.
2
GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering.GSTRPCA:用于单细胞多组学数据聚类的不规则张量奇异值分解
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae649.
3
MultiSC: a deep learning pipeline for analyzing multiomics single-cell data.MultiSC:用于分析多组学单细胞数据的深度学习管道。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae492.
4
ScLRTC: imputation for single-cell RNA-seq data via low-rank tensor completion.ScLRTC:基于低秩张量补全的单细胞 RNA-seq 数据插补。
BMC Genomics. 2021 Nov 29;22(1):860. doi: 10.1186/s12864-021-08101-3.
5
Robust joint clustering of multi-omics single-cell data via multi-modal high-order neighborhood Laplacian matrix optimization.基于多模态高阶邻域拉普拉斯矩阵优化的多组学单细胞数据稳健联合聚类。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad414.
6
scMIC: A Deep Multi-Level Information Fusion Framework for Clustering Single-Cell Multi-Omics Data.scMIC:一种用于聚类单细胞多组学数据的深度多层信息融合框架。
IEEE J Biomed Health Inform. 2023 Dec;27(12):6121-6132. doi: 10.1109/JBHI.2023.3317272. Epub 2023 Dec 5.
7
JSNMFuP: a unsupervised method for the integrative analysis of single-cell multi-omics data based on non-negative matrix factorization.JSNMFuP:一种基于非负矩阵分解的单细胞多组学数据综合分析的无监督方法。
BMC Genomics. 2025 Mar 20;26(1):274. doi: 10.1186/s12864-025-11462-8.
8
TriTan: an efficient triple nonnegative matrix factorization method for integrative analysis of single-cell multiomics data.TriTan:一种用于单细胞多组学数据综合分析的高效三重非负矩阵分解方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae615.
9
SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency.SMURF:通过保持自一致性的矩阵分解来嵌入单细胞RNA测序数据
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad026.
10
FactVAE: a factorized variational autoencoder for single-cell multi-omics data integration analysis.FactVAE:用于单细胞多组学数据整合分析的因子分解变分自编码器。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf157.

引用本文的文献

1
scMMAE: masked cross-attention network for single-cell multimodal omics fusion to enhance unimodal omics.scMMAE:用于单细胞多组学融合以增强单一组学的掩码交叉注意力网络。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf010.
2
Benchmarking algorithms for single-cell multi-omics prediction and integration.单细胞多组学预测和整合算法的基准测试。
Nat Methods. 2024 Nov;21(11):2182-2194. doi: 10.1038/s41592-024-02429-w. Epub 2024 Sep 25.
3
Identify gestational diabetes mellitus by deep learning model from cell-free DNA at the early gestation stage.

本文引用的文献

1
MultiVI: deep generative model for the integration of multimodal data.MultiVI:用于多模态数据集成的深度生成模型。
Nat Methods. 2023 Aug;20(8):1222-1231. doi: 10.1038/s41592-023-01909-9. Epub 2023 Jun 29.
2
Single nucleus multi-omics identifies human cortical cell regulatory genome diversity.单核多组学技术揭示人类皮质细胞调控基因组多样性。
Cell Genom. 2022 Mar 9;2(3). doi: 10.1016/j.xgen.2022.100107.
3
Cobolt: integrative analysis of multimodal single-cell sequencing data.科博尔特:多模态单细胞测序数据的综合分析。
利用早期妊娠阶段的游离细胞 DNA 通过深度学习模型来识别妊娠糖尿病。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad492.
Genome Biol. 2021 Dec 28;22(1):351. doi: 10.1186/s13059-021-02556-z.
4
Single-cell chromatin state analysis with Signac.使用 Signac 进行单细胞染色质状态分析。
Nat Methods. 2021 Nov;18(11):1333-1341. doi: 10.1038/s41592-021-01282-5. Epub 2021 Nov 1.
5
Multi-omics integration in the age of million single-cell data.多组学整合在百万单细胞数据时代。
Nat Rev Nephrol. 2021 Nov;17(11):710-724. doi: 10.1038/s41581-021-00463-x. Epub 2021 Aug 20.
6
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
7
Decoding dynamic epigenetic landscapes in human oocytes using single-cell multi-omics sequencing.使用单细胞多组学测序技术解析人类卵母细胞中的动态表观遗传景观。
Cell Stem Cell. 2021 Sep 2;28(9):1641-1656.e7. doi: 10.1016/j.stem.2021.04.012. Epub 2021 May 5.
8
Computational principles and challenges in single-cell data integration.单细胞数据整合的计算原理与挑战。
Nat Biotechnol. 2021 Oct;39(10):1202-1215. doi: 10.1038/s41587-021-00895-7. Epub 2021 May 3.
9
Joint probabilistic modeling of single-cell multi-omic data with totalVI.单细胞多组学数据的总变分联合概率建模。
Nat Methods. 2021 Mar;18(3):272-282. doi: 10.1038/s41592-020-01050-x. Epub 2021 Feb 15.
10
Single-cell proteomic and transcriptomic analysis of macrophage heterogeneity using SCoPE2.单细胞蛋白质组学和转录组学分析使用 SCoPE2 分析巨噬细胞异质性。
Genome Biol. 2021 Jan 27;22(1):50. doi: 10.1186/s13059-021-02267-5.