• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SMURF:通过保持自一致性的矩阵分解来嵌入单细胞RNA测序数据

SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency.

作者信息

Pu Juhua, Wang Bingchen, Liu Xingwu, Chen Lingxi, Li Shuai Cheng

机构信息

State Key Laboratory of Software Development Environment, Beihang University, Beijing, China.

Beihang Hangzhou Innovation Institute Yuhang, Xixi Octagon City, Yuhang District, Hangzhou 310023, China.

出版信息

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad026.

DOI:10.1093/bib/bbad026
PMID:36715274
Abstract

The advance in single-cell RNA-sequencing (scRNA-seq) sheds light on cell-specific transcriptomic studies of cell developments, complex diseases and cancers. Nevertheless, scRNA-seq techniques suffer from 'dropout' events, and imputation tools are proposed to address the sparsity. Here, rather than imputation, we propose a tool, SMURF, to extract the low-dimensional embeddings from cells and genes utilizing matrix factorization with a mixture of Poisson-Gamma divergent as objective while preserving self-consistency. SMURF exhibits feasible cell subpopulation discovery efficacy with obtained cell embeddings on replicated in silico and eight web lab scRNA datasets with ground truth cell types. Furthermore, SMURF can reduce the cell embedding to a 1D-oval space to recover the time course of cell cycle. SMURF can also serve as an imputation tool; the in silico data assessment shows that SMURF parades the most robust gene expression recovery power with low root mean square error and high Pearson correlation. Moreover, SMURF recovers the gene distribution for the WM989 Drop-seq data. SMURF is available at https://github.com/deepomicslab/SMURF.

摘要

单细胞RNA测序(scRNA-seq)技术的进步为细胞发育、复杂疾病和癌症的细胞特异性转录组学研究提供了线索。然而,scRNA-seq技术存在“缺失”事件,因此人们提出了插补工具来解决数据稀疏性问题。在此,我们提出了一种名为SMURF的工具,它不是进行插补,而是利用矩阵分解从细胞和基因中提取低维嵌入,以泊松-伽马散度混合作为目标,同时保持自一致性。在具有真实细胞类型的模拟复制和八个网络实验室scRNA数据集中,SMURF通过获得的细胞嵌入展现出可行的细胞亚群发现效果。此外,SMURF可以将细胞嵌入简化到一维椭圆空间,以恢复细胞周期的时间进程。SMURF还可以用作插补工具;模拟数据评估表明,SMURF具有最强的基因表达恢复能力,均方根误差低,皮尔逊相关性高。此外,SMURF恢复了WM989 Drop-seq数据的基因分布。可通过https://github.com/deepomicslab/SMURF获取SMURF。

相似文献

1
SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency.SMURF:通过保持自一致性的矩阵分解来嵌入单细胞RNA测序数据
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad026.
2
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
3
I-Impute: a self-consistent method to impute single cell RNA sequencing data.I-Impute:一种用于单细胞 RNA 测序数据插补的自洽方法。
BMC Genomics. 2020 Nov 18;21(Suppl 10):618. doi: 10.1186/s12864-020-07007-w.
4
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
5
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
6
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
7
GNN-based embedding for clustering scRNA-seq data.基于图神经网络的 scRNA-seq 数据聚类嵌入方法。
Bioinformatics. 2022 Jan 27;38(4):1037-1044. doi: 10.1093/bioinformatics/btab787.
8
Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data.Bubble:一种利用受批量RNA测序数据约束的自动编码器进行的快速单细胞RNA测序插补方法。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac580.
9
Accurate and interpretable gene expression imputation on scRNA-seq data using IGSimpute.使用 IGSimpute 实现 scRNA-seq 数据的准确和可解释的基因表达推断。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad124.
10
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.

引用本文的文献

1
DCRELM: dual correlation reduction network-based extreme learning machine for single-cell RNA-seq data clustering.基于双相关降维网络的极限学习机用于单细胞 RNA-seq 数据聚类。
Sci Rep. 2024 Jun 12;14(1):13541. doi: 10.1038/s41598-024-64217-y.
2
Incorporating cell hierarchy to decipher the functional diversity of single cells.将细胞层次结构纳入其中,以破译单细胞的功能多样性。
Nucleic Acids Res. 2023 Jan 25;51(2):e9. doi: 10.1093/nar/gkac1044.
3
Detecting TAD-like domains from RNA-associated interactions.从 RNA 相关相互作用中检测 TAD 样结构域。
Nucleic Acids Res. 2022 Aug 26;50(15):e88. doi: 10.1093/nar/gkac422.