• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 IGSimpute 实现 scRNA-seq 数据的准确和可解释的基因表达推断。

Accurate and interpretable gene expression imputation on scRNA-seq data using IGSimpute.

机构信息

Department of Computer Science, Hong Kong Baptist University, Waterloo Road, Kowloon Tong, Hong Kong.

School of Chinese Medicine, Hong Kong Baptist University, Waterloo Road, Kowloon Tong, Hong Kong.

出版信息

Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad124.

DOI:10.1093/bib/bbad124
PMID:37039664
Abstract

Single-cell ribonucleic acid sequencing (scRNA-seq) enables the quantification of gene expression at the transcriptomic level with single-cell resolution, enhancing our understanding of cellular heterogeneity. However, the excessive missing values present in scRNA-seq data hinder downstream analysis. While numerous imputation methods have been proposed to recover scRNA-seq data, high imputation performance often comes with low or no interpretability. Here, we present IGSimpute, an accurate and interpretable imputation method for recovering missing values in scRNA-seq data with an interpretable instance-wise gene selection layer (GSL). IGSimpute outperforms 12 other state-of-the-art imputation methods on 13 out of 17 datasets from different scRNA-seq technologies with the lowest mean squared error as the chosen benchmark metric. We demonstrate that IGSimpute can give unbiased estimates of the missing values compared to other methods, regardless of whether the average gene expression values are small or large. Clustering results of imputed profiles show that IGSimpute offers statistically significant improvement over other imputation methods. By taking the heart-and-aorta and the limb muscle tissues as examples, we show that IGSimpute can also denoise gene expression profiles by removing outlier entries with unexpectedly high expression values via the instance-wise GSL. We also show that genes selected by the instance-wise GSL could indicate the age of B cells from bladder fat tissue of the Tabula Muris Senis atlas. IGSimpute can impute one million cells using 64 min, and thus applicable to large datasets.

摘要

单细胞核糖核酸测序(scRNA-seq)能够以单细胞分辨率定量转录组水平的基因表达,从而增强我们对细胞异质性的理解。然而,scRNA-seq 数据中存在大量缺失值,这阻碍了下游分析。虽然已经提出了许多用于恢复 scRNA-seq 数据的插补方法,但高插补性能通常伴随着低或无可解释性。在这里,我们提出了 IGSimpute,这是一种用于恢复 scRNA-seq 数据中缺失值的准确且可解释的插补方法,具有可解释的实例级基因选择层(GSL)。IGSimpute 在来自不同 scRNA-seq 技术的 17 个数据集的 13 个数据集上的 12 种其他最先进的插补方法中表现最佳,选择均方误差作为基准指标。我们证明,与其他方法相比,IGSimpute 可以对缺失值进行无偏估计,而与平均基因表达值的大小无关。插补轮廓的聚类结果表明,IGSimpute 与其他插补方法相比提供了统计学上的显著改进。通过以心脏和主动脉以及肢体肌肉组织为例,我们表明 IGSimpute 还可以通过实例级 GSL 去除具有异常高表达值的异常值条目来对基因表达谱进行去噪。我们还表明,实例级 GSL 选择的基因可以指示 Tabula Muris Senis 图谱中膀胱脂肪组织中 B 细胞的年龄。IGSimpute 可以在 64 分钟内对一百万细胞进行插补,因此适用于大型数据集。

相似文献

1
Accurate and interpretable gene expression imputation on scRNA-seq data using IGSimpute.使用 IGSimpute 实现 scRNA-seq 数据的准确和可解释的基因表达推断。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad124.
2
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
3
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
4
scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network.scGGAN:基于图的生成对抗网络的单细胞RNA测序数据插补
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad040.
5
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
6
Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data.Bubble:一种利用受批量RNA测序数据约束的自动编码器进行的快速单细胞RNA测序插补方法。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac580.
7
ScLRTC: imputation for single-cell RNA-seq data via low-rank tensor completion.ScLRTC:基于低秩张量补全的单细胞 RNA-seq 数据插补。
BMC Genomics. 2021 Nov 29;22(1):860. doi: 10.1186/s12864-021-08101-3.
8
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.
9
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
10
SinCWIm: An imputation method for single-cell RNA sequence dropouts using weighted alternating least squares.SinCWIm:一种基于加权交替最小二乘法的单细胞 RNA 序列缺失数据插补方法。
Comput Biol Med. 2024 Mar;171:108225. doi: 10.1016/j.compbiomed.2024.108225. Epub 2024 Feb 27.

引用本文的文献

1
Navigating single-cell RNA-sequencing: protocols, tools, databases, and applications.探索单细胞RNA测序:方案、工具、数据库及应用
Genomics Inform. 2025 May 17;23(1):13. doi: 10.1186/s44342-025-00044-5.
2
Cofea: correlation-based feature selection for single-cell chromatin accessibility data.Cofea:基于相关性的单细胞染色质可及性数据特征选择。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad458.