单细胞 RNA 测序数据分析工具的差异基因表达分析比较。

Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.

机构信息

Computer Science and Engineering Department, University of Connecticut, Storrs, CT, USA.

Department of Molecular & Cell Biology, University of Connecticut, Storrs, CT, USA.

出版信息

BMC Bioinformatics. 2019 Jan 18;20(1):40. doi: 10.1186/s12859-019-2599-6.

DOI:10.1186/s12859-019-2599-6

PMID:30658573

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6339299/

Abstract

BACKGROUND

The analysis of single-cell RNA sequencing (scRNAseq) data plays an important role in understanding the intrinsic and extrinsic cellular processes in biological and biomedical research. One significant effort in this area is the detection of differentially expressed (DE) genes. scRNAseq data, however, are highly heterogeneous and have a large number of zero counts, which introduces challenges in detecting DE genes. Addressing these challenges requires employing new approaches beyond the conventional ones, which are based on a nonzero difference in average expression. Several methods have been developed for differential gene expression analysis of scRNAseq data. To provide guidance on choosing an appropriate tool or developing a new one, it is necessary to evaluate and compare the performance of differential gene expression analysis methods for scRNAseq data.

RESULTS

In this study, we conducted a comprehensive evaluation of the performance of eleven differential gene expression analysis software tools, which are designed for scRNAseq data or can be applied to them. We used simulated and real data to evaluate the accuracy and precision of detection. Using simulated data, we investigated the effect of sample size on the detection accuracy of the tools. Using real data, we examined the agreement among the tools in identifying DE genes, the run time of the tools, and the biological relevance of the detected DE genes.

CONCLUSIONS

In general, agreement among the tools in calling DE genes is not high. There is a trade-off between true-positive rates and the precision of calling DE genes. Methods with higher true positive rates tend to show low precision due to their introducing false positives, whereas methods with high precision show low true positive rates due to identifying few DE genes. We observed that current methods designed for scRNAseq data do not tend to show better performance compared to methods designed for bulk RNAseq data. Data multimodality and abundance of zero read counts are the main characteristics of scRNAseq data, which play important roles in the performance of differential gene expression analysis methods and need to be considered in terms of the development of new methods.

摘要

背景

单细胞 RNA 测序（scRNAseq）数据分析在理解生物和生物医学研究中内在和外在的细胞过程中起着重要作用。在这个领域的一个重要工作是检测差异表达（DE）基因。然而，scRNAseq 数据高度异质且具有大量零计数，这给检测 DE 基因带来了挑战。要解决这些挑战，需要采用超越传统方法的新方法，传统方法基于平均表达的非零差异。已经开发了几种用于 scRNAseq 数据差异基因表达分析的方法。为了提供选择合适工具或开发新工具的指导，有必要评估和比较 scRNAseq 数据差异基因表达分析方法的性能。

结果

在这项研究中，我们对十一种用于 scRNAseq 数据的差异基因表达分析软件工具的性能进行了全面评估，这些工具是专为 scRNAseq 数据设计的，或者可以应用于 scRNAseq 数据。我们使用模拟和真实数据来评估检测的准确性和精度。使用模拟数据，我们研究了样本量对工具检测准确性的影响。使用真实数据，我们检查了工具在识别 DE 基因方面的一致性、工具的运行时间以及检测到的 DE 基因的生物学相关性。

结论

一般来说，工具在调用 DE 基因方面的一致性不高。在调用 DE 基因的真阳性率和精度之间存在权衡。具有较高真阳性率的方法由于引入了假阳性而往往精度较低，而具有较高精度的方法由于识别的 DE 基因较少而真阳性率较低。我们观察到，为 scRNAseq 数据设计的当前方法并不倾向于比为批量 RNAseq 数据设计的方法表现出更好的性能。数据多模态性和大量零读计数是 scRNAseq 数据的主要特征，它们在差异基因表达分析方法的性能中起着重要作用，需要在新方法的开发中加以考虑。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c235/6339299/b610e44bdb68/12859_2019_2599_Fig1_HTML.jpg

相似文献

Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.单细胞 RNA 测序数据分析工具的差异基因表达分析比较。

BMC Bioinformatics. 2019 Jan 18;20(1):40. doi: 10.1186/s12859-019-2599-6.

SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.SigEMD：一种用于单细胞 RNA 测序数据分析中差异基因表达分析的强大方法。

Methods. 2018 Aug 1;145:25-32. doi: 10.1016/j.ymeth.2018.04.017. Epub 2018 Apr 24.

Differential Expression Analysis in Single-Cell Transcriptomics.单细胞转录组学中的差异表达分析

Methods Mol Biol. 2019;1979:425-432. doi: 10.1007/978-1-4939-9240-9_25.

Identifying cell populations with scRNASeq.单细胞 RNA 测序鉴定细胞群体。

Mol Aspects Med. 2018 Feb;59:114-122. doi: 10.1016/j.mam.2017.07.002. Epub 2017 Jul 25.

aFold - using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data.aFold - 使用多项式不确定性建模进行 RNA 测序数据的差异基因表达估计。

BMC Genomics. 2019 May 10;20(1):364. doi: 10.1186/s12864-019-5686-1.

Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试：基于标准化与基于对数比变换的方法。

BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.

Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.使用具有相关随机效应的障碍模型检测离散单细胞RNA测序数据中的差异表达基因。

Biometrics. 2019 Dec;75(4):1051-1062. doi: 10.1111/biom.13074. Epub 2019 Jun 17.

A fuzzy method for RNA-Seq differential expression analysis in presence of multireads.一种用于存在多重读取情况下RNA测序差异表达分析的模糊方法。

BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):345. doi: 10.1186/s12859-016-1195-2.

MitoTrace: A Computational Framework for Analyzing Mitochondrial Variation in Single-Cell RNA Sequencing Data.MitoTrace：一种用于分析单细胞 RNA 测序数据中线粒体变异的计算框架。

Genes (Basel). 2023 Jun 4;14(6):1222. doi: 10.3390/genes14061222.

Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.通过纳入非外显子映射读数对RNA测序数据进行差异表达分析。

BMC Genomics. 2015;16 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2164-16-S7-S14. Epub 2015 Jun 11.

引用本文的文献

Benchmarking sketching methods on spatial transcriptomics data.基于空间转录组学数据的草图绘制方法基准测试

bioRxiv. 2025 Sep 2:2025.08.26.672376. doi: 10.1101/2025.08.26.672376.

cytoKernel: robust kernel embeddings for assessing differential expression of single-cell data.细胞内核：用于评估单细胞数据差异表达的稳健内核嵌入

Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf399.

scMetaIntegrator: a meta-analysis approach to paired single-cell differential expression analysis.scMetaIntegrator：一种用于配对单细胞差异表达分析的荟萃分析方法。

bioRxiv. 2025 Jun 8:2025.06.04.657898. doi: 10.1101/2025.06.04.657898.

Human endothelial cells promote a human neural stem cell type B phenotype via Notch signaling.人内皮细胞通过Notch信号通路促进B型人神经干细胞表型。

Nat Commun. 2025 May 30;16(1):5031. doi: 10.1038/s41467-025-60194-6.

New Insights and Implications of Cell-Cell Interactions in Developmental Biology.发育生物学中细胞间相互作用的新见解与影响

Int J Mol Sci. 2025 Apr 23;26(9):3997. doi: 10.3390/ijms26093997.

SciGeneX: enhancing transcriptional analysis through gene module detection in single-cell and spatial transcriptomics data.SciGeneX：通过在单细胞和空间转录组学数据中进行基因模块检测来增强转录分析。

NAR Genom Bioinform. 2025 Apr 17;7(2):lqaf043. doi: 10.1093/nargab/lqaf043. eCollection 2025 Jun.

Synaptic deregulation of cholinergic projection neurons causes olfactory dysfunction across five fly Parkinsonism models.胆碱能投射神经元的突触失调在五种果蝇帕金森病模型中导致嗅觉功能障碍。

Elife. 2025 Apr 3;13:RP98348. doi: 10.7554/eLife.98348.

scMUSCL: multi-source transfer learning for clustering scRNA-seq data.scMUSCL：用于单细胞RNA测序数据聚类的多源迁移学习

Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf137.

Exploring cell-to-cell variability and functional insights through differentially variable gene analysis.通过差异可变基因分析探索细胞间变异性和功能见解。

NPJ Syst Biol Appl. 2025 Mar 20;11(1):29. doi: 10.1038/s41540-025-00507-z.

Postmortem Interval Leads to Loss of Disease-Specific Signatures in Brain Tissue.死后间隔时间导致脑组织中疾病特异性特征丧失。

eNeuro. 2025 Mar 14;12(3). doi: 10.1523/ENEURO.0505-24.2025. Print 2025 Mar.

本文引用的文献

Methods. 2018 Aug 1;145:25-32. doi: 10.1016/j.ymeth.2018.04.017. Epub 2018 Apr 24.

DEsingle for detecting three types of differential expression in single-cell RNA-seq data.DEsingle 用于检测单细胞 RNA-seq 数据中的三种差异表达。

Bioinformatics. 2018 Sep 15;34(18):3223-3224. doi: 10.1093/bioinformatics/bty332.

Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods.单细胞RNA测序：差异表达分析方法评估

Front Genet. 2017 May 23;8:62. doi: 10.3389/fgene.2017.00062. eCollection 2017.

Single-cell mRNA quantification and differential analysis with Census.使用Census进行单细胞mRNA定量和差异分析。

Nat Methods. 2017 Mar;14(3):309-315. doi: 10.1038/nmeth.4150. Epub 2017 Jan 23.

A statistical approach for identifying differential distributions in single-cell RNA-seq experiments.一种用于识别单细胞RNA测序实验中差异分布的统计方法。

Genome Biol. 2016 Oct 25;17(1):222. doi: 10.1186/s13059-016-1077-y.

Comparison of methods to detect differentially expressed genes between single-cell populations.单细胞群体间差异表达基因检测方法的比较。

Brief Bioinform. 2017 Sep 1;18(5):735-743. doi: 10.1093/bib/bbw057.

Analysis of aggregated cell-cell statistical distances within pathways unveils therapeutic-resistance mechanisms in circulating tumor cells.对通路内聚集的细胞间统计距离进行分析，揭示了循环肿瘤细胞中的治疗抗性机制。

Bioinformatics. 2016 Jun 15;32(12):i80-i89. doi: 10.1093/bioinformatics/btw248.

Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq.通过单细胞RNA测序剖析转移性黑色素瘤的多细胞生态系统

Science. 2016 Apr 8;352(6282):189-96. doi: 10.1126/science.aad0501.

Design and computational analysis of single-cell RNA-sequencing experiments.单细胞RNA测序实验的设计与计算分析

Genome Biol. 2016 Apr 7;17:63. doi: 10.1186/s13059-016-0927-y.

Discrete distributional differential expression (D3E)--a tool for gene expression analysis of single-cell RNA-seq data.离散分布差异表达（D3E）——一种用于单细胞RNA测序数据基因表达分析的工具。

BMC Bioinformatics. 2016 Feb 29;17:110. doi: 10.1186/s12859-016-0944-6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

单细胞 RNA 测序数据分析工具的差异基因表达分析比较。

Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献