• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CMF-Impute:一种用于单细胞 RNA-seq 数据的精确插补工具。

CMF-Impute: an accurate imputation tool for single-cell RNA-seq data.

机构信息

College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, P.R. China.

School of Mathematics and Statistics, Hainan Normal University, Haikou 570100, P.R. China.

出版信息

Bioinformatics. 2020 May 1;36(10):3139-3147. doi: 10.1093/bioinformatics/btaa109.

DOI:10.1093/bioinformatics/btaa109
PMID:32073612
Abstract

MOTIVATION

Single-cell RNA-sequencing (scRNA-seq) technology provides a powerful tool for investigating cell heterogeneity and cell subpopulations by allowing the quantification of gene expression at single-cell level. However, scRNA-seq data analysis remains challenging because of various technical noises such as dropout events (i.e. excessive zero counts in the expression matrix).

RESULTS

By taking consideration of the association among cells and genes, we propose a novel collaborative matrix factorization-based method called CMF-Impute to impute the dropout entries of a given scRNA-seq expression matrix. We test CMF-Impute and compare it with the other five state-of-the-art methods on six popular real scRNA-seq datasets of various sizes and three simulated datasets. For simulated datasets, CMF-Impute outperforms other methods in imputing the closest dropouts to the original expression values as evaluated by both the sum of squared error and Pearson correlation coefficient. For real datasets, CMF-Impute achieves the most accurate cell classification results in spite of the choice of different clustering methods like SC3 or T-SNE followed by K-means as evaluated by both adjusted rand index and normalized mutual information. Finally, we demonstrate that CMF-Impute is powerful in reconstructing cell-to-cell and gene-to-gene correlation, and in inferring cell lineage trajectories.

AVAILABILITY AND IMPLEMENTATION

CMF-Impute is written as a Matlab package which is available at https://github.com/xujunlin123/CMFImpute.git.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单细胞 RNA 测序 (scRNA-seq) 技术通过允许在单细胞水平上定量基因表达,为研究细胞异质性和细胞亚群提供了强大的工具。然而,由于各种技术噪声,如缺失事件(即表达矩阵中过多的零计数),scRNA-seq 数据分析仍然具有挑战性。

结果

通过考虑细胞和基因之间的关联,我们提出了一种新的基于协同矩阵分解的方法,称为 CMF-Impute,用于对给定的 scRNA-seq 表达矩阵中的缺失项进行插补。我们在六个大小不同的流行真实 scRNA-seq 数据集和三个模拟数据集上测试了 CMF-Impute 并将其与其他五种最先进的方法进行了比较。对于模拟数据集,CMF-Impute 在根据均方误差和 Pearson 相关系数评估的最接近原始表达值的缺失值插补上优于其他方法。对于真实数据集,CMF-Impute 尽管选择了不同的聚类方法,如 SC3 或 T-SNE 随后是 K-means,但仍能获得最准确的细胞分类结果,这是根据调整后的兰德指数和归一化互信息评估的。最后,我们证明了 CMF-Impute 在重建细胞间和基因间的相关性以及推断细胞谱系轨迹方面非常有效。

可用性和实现

CMF-Impute 用 Matlab 编写,可在 https://github.com/xujunlin123/CMFImpute.git 获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
CMF-Impute: an accurate imputation tool for single-cell RNA-seq data.CMF-Impute:一种用于单细胞 RNA-seq 数据的精确插补工具。
Bioinformatics. 2020 May 1;36(10):3139-3147. doi: 10.1093/bioinformatics/btaa109.
2
ScLRTC: imputation for single-cell RNA-seq data via low-rank tensor completion.ScLRTC:基于低秩张量补全的单细胞 RNA-seq 数据插补。
BMC Genomics. 2021 Nov 29;22(1):860. doi: 10.1186/s12864-021-08101-3.
3
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
4
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
5
I-Impute: a self-consistent method to impute single cell RNA sequencing data.I-Impute:一种用于单细胞 RNA 测序数据插补的自洽方法。
BMC Genomics. 2020 Nov 18;21(Suppl 10):618. doi: 10.1186/s12864-020-07007-w.
6
TsImpute: an accurate two-step imputation method for single-cell RNA-seq data.TsImpute:一种用于单细胞 RNA-seq 数据的准确两步插补方法。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad731.
7
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.
8
An efficient scRNA-seq dropout imputation method using graph attention network.基于图注意力网络的高效 scRNA-seq 缺失值插补方法。
BMC Bioinformatics. 2021 Dec 7;22(1):582. doi: 10.1186/s12859-021-04493-x.
9
scRMD: imputation for single cell RNA-seq data via robust matrix decomposition.scRMD:基于稳健矩阵分解的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 May 1;36(10):3156-3161. doi: 10.1093/bioinformatics/btaa139.
10
2DImpute: imputation in single-cell RNA-seq data from correlations in two dimensions.2DImpute:基于二维相关性的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 Jun 1;36(11):3588-3589. doi: 10.1093/bioinformatics/btaa148.

引用本文的文献

1
scTsI: an effective two-stage imputation method for single-cell RNA-seq data.scTsI:一种用于单细胞RNA测序数据的有效两阶段插补方法。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf298.
2
Navigating single-cell RNA-sequencing: protocols, tools, databases, and applications.探索单细胞RNA测序:方案、工具、数据库及应用
Genomics Inform. 2025 May 17;23(1):13. doi: 10.1186/s44342-025-00044-5.
3
Protein structure prediction via deep learning: an in-depth review.基于深度学习的蛋白质结构预测:深入综述
Front Pharmacol. 2025 Apr 3;16:1498662. doi: 10.3389/fphar.2025.1498662. eCollection 2025.
4
Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.基于双向f散度的深度生成方法用于插补时间序列数据中的缺失值
Stats (Basel). 2025 Mar;8(1). doi: 10.3390/stats8010007. Epub 2025 Jan 14.
5
Addressing scalability and managing sparsity and dropout events in single-cell representation identification with ZIGACL.使用ZIGACL解决单细胞表示识别中的可扩展性问题并管理稀疏性和缺失事件。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae703.
6
scDTL: enhancing single-cell RNA-seq imputation through deep transfer learning with bulk cell information.scDTL:通过利用批量细胞信息进行深度迁移学习增强单细胞 RNA-seq 推断。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae555.
7
scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization.scRNMF:一种基于鲁棒非负矩阵分解的单细胞 RNA-seq 数据插补方法。
PLoS Comput Biol. 2024 Aug 8;20(8):e1012339. doi: 10.1371/journal.pcbi.1012339. eCollection 2024 Aug.
8
Single-cell omics: experimental workflow, data analyses and applications.单细胞组学:实验工作流程、数据分析及应用
Sci China Life Sci. 2025 Jan;68(1):5-102. doi: 10.1007/s11427-023-2561-0. Epub 2024 Jul 23.
9
scINRB: single-cell gene expression imputation with network regularization and bulk RNA-seq data.scINRB:基于网络正则化和 bulk RNA-seq 数据的单细胞基因表达推断。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae148.
10
Effects of different KRAS mutants and Ki67 expression on diagnosis and prognosis in lung adenocarcinoma.不同KRAS突变体和Ki67表达对肺腺癌诊断及预后的影响
Sci Rep. 2024 Feb 19;14(1):4085. doi: 10.1038/s41598-023-48307-x.