SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data.

Affiliation

School of Mathematics, Harbin Institute of Technology, Harbin, P.R. China.

Publication information

PLoS Comput Biol. 2021 Jun 17;17(6):e1009118. doi: 10.1371/journal.pcbi.1009118. eCollection 2021 Jun.

DOI:10.1371/journal.pcbi.1009118

PMID:34138847

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8266063/

Abstract

Single-cell RNA sequencing (scRNA-seq) technologies measure gene expression at single-cell resolution and provide a tool for exploring cell heterogeneity and cell types. Because only a small number of mRNA copies is extracted from each cell, scRNA-seq data contain a large number of dropouts, which hinder downstream analysis. We propose a statistical method, SDImpute (Single-cell RNA-seq Dropout Imputation), that performs block imputation for dropout events in scRNA-seq data. SDImpute automatically identifies dropout events based on gene expression levels and on the variation of gene expression across similar cells and similar genes, and it imputes the dropouts block-wise using expression values from similar cells that are unaffected by dropouts. Results on simulated and real datasets suggest that SDImpute is an effective tool for recovering the data while preserving the heterogeneity of gene expression across cells. Compared with state-of-the-art imputation methods, SDImpute improves the accuracy of downstream analyses, including clustering, visualization, and differential expression analysis.
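The general idea described in the abstract (flag zeros that are likely dropouts, then fill them from similar cells using only values unaffected by dropout) can be illustrated with a small sketch. This is not the SDImpute algorithm: the function name `impute_zeros_from_similar_cells`, the parameters `k` and `min_expr_frac`, the use of Pearson correlation as the cell similarity, and the rule "a zero is a likely dropout when enough similar cells express the gene" are all assumptions made for illustration. The published method also uses gene-level similarity and expression variation, which this sketch omits.

```python
import numpy as np

def impute_zeros_from_similar_cells(X, k=2, min_expr_frac=0.5):
    """Toy dropout imputation on a cells x genes matrix (illustrative only).

    Assumes no cell has a constant expression profile, since Pearson
    correlation is undefined for constant rows.
    """
    X = np.asarray(X, dtype=float)
    # Cell-cell similarity: Pearson correlation between expression profiles.
    sim = np.corrcoef(X)
    np.fill_diagonal(sim, -np.inf)  # a cell is never its own neighbour
    imputed = X.copy()
    for i in range(X.shape[0]):
        neighbours = np.argsort(sim[i])[-k:]   # k most similar cells
        block = X[neighbours]                  # shape: k x genes
        expressed = block > 0
        # Hypothetical dropout rule: the cell shows zero, but at least
        # min_expr_frac of its similar cells express the gene.
        likely_dropout = (X[i] == 0) & (expressed.mean(axis=0) >= min_expr_frac)
        # Average only the non-zero neighbour values, so zeros that may
        # themselves be dropouts do not pull the estimate down.
        sums = (block * expressed).sum(axis=0)
        counts = expressed.sum(axis=0)
        means = np.divide(sums, counts, out=np.zeros_like(sums), where=counts > 0)
        imputed[i, likely_dropout] = means[likely_dropout]
    return imputed

# Two made-up cell types, three cells each; cell 1 has a suspicious zero in gene 1.
X = np.array([
    [5, 4, 0, 0],
    [5, 0, 0, 0],
    [4, 5, 0, 0],
    [0, 0, 6, 5],
    [0, 0, 5, 6],
    [0, 0, 4, 6],
], dtype=float)
imputed = impute_zeros_from_similar_cells(X, k=2)
print(imputed[1])  # the zero in gene 1 is replaced by 4.5; other zeros stay
```

In this toy matrix, zeros that are shared by all similar cells are treated as true biological zeros and left untouched, which is the property the abstract refers to as preserving heterogeneity across cells.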

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8df5/8266063/55e9935af479/pcbi.1009118.g001.jpg

Cited by

SAE-Impute: imputation for single-cell data via subspace regression and auto-encoders.

BMC Bioinformatics. 2024 Oct 1;25(1):317. doi: 10.1186/s12859-024-05944-x.

cnnImpute: missing value recovery for single cell RNA sequencing data.

Sci Rep. 2024 Feb 16;14(1):3946. doi: 10.1038/s41598-024-53998-x.

scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information.

Cell Biosci. 2022 Sep 2;12(1):142. doi: 10.1186/s13578-022-00886-4.

scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods.

Nucleic Acids Res. 2022 May 20;50(9):4877-4899. doi: 10.1093/nar/gkac317.

Correction: SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data.

PLoS Comput Biol. 2022 Jan 5;18(1):e1009770. doi: 10.1371/journal.pcbi.1009770. eCollection 2022 Jan.

AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders.

Front Genet. 2021 Sep 8;12:739677. doi: 10.3389/fgene.2021.739677. eCollection 2021.
