• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞RNA测序数据中统计差异分析方法的评估

An evaluation of statistical differential analysis methods in single-cell RNA-seq data.

作者信息

Li Dongmei, Zand Martin, Dye Timothy, Goniewicz Maciej, Rahman Irfan, Xie Zidian

机构信息

Clinical and Translational Science Institute, School of Medicine and Dentistry, University of Rochester, 265 Crittenden Boulevard CU 420708, 14642 Rochester, NY, US.

Department of Medicine, University of Rochester Medical Center, 601 Elmwood Ave, Box 675, 14642 Rochester, NY, US.

出版信息

Res Sq. 2023 Mar 23:rs.3.rs-2670717. doi: 10.21203/rs.3.rs-2670717/v1.

DOI:10.21203/rs.3.rs-2670717/v1
PMID:36993457
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10055642/
Abstract

BACKGROUND

Single-cell RNA Sequencing is gaining popularity in recent years. Compared to bulk RNA-Seq, single-cell RNA Sequencing allows the gene expression being measured within individual cells instead of mean gene expression levels across all cells in the sample. Thus, cell-to-cell variation of gene expressions could be examined. Gene differential expression analysis remains the major purpose in most single-cell RNA sequencing experiments and many methods have been developed in recent years to conduct gene differential expression analysis for single-cell RNA sequencing data.

RESULTS

Through simulation studies and real data examples, we evaluated the performance of five open-source popular methods used for gene differential expression analysis in single-cell RNA sequencing data. The five methods included DEsingle (Zero-inflated negative binomial model), Linnorm (Empirical Bayes method on transformed count data using the limma package), monocle (An approximate Chi-Square likelihood ratio test), MAST (A generalized linear hurdle model), and DESeq2 (A generalized linear model with empirical Bayes approach and also commonly used for bulk RNA sequencing differential express analyses). We assessed the false discovery rate (FDR) control, sensitivity, specificity, accuracy, and area under the receiver operating characteristics (AUROC) curve for all five methods under different sample sizes, distribution assumptions, and proportions of zeros in the data.

CONCLUSIONS

We found the MAST method performed the best among the five methods compared with the largest AUROC values across all tested sample sizes and different proportion of truly differential expressed genes, when the data followed negative binomial distributions. When the sample size increased to 100 in each group, the MAST method showed the best performance with the highest AUROC regardless of the data distributions. If the excess zeros were first filtered out before the gene differential analyses, the DESingle, Linnorm, and DESeq2 performed relatively better than the MAST and the monocle methods with higher AUROC values.

摘要

背景

近年来,单细胞RNA测序越来越受欢迎。与批量RNA测序相比,单细胞RNA测序能够测量单个细胞内的基因表达,而不是样本中所有细胞的平均基因表达水平。因此,可以检测基因表达的细胞间差异。基因差异表达分析仍然是大多数单细胞RNA测序实验的主要目的,近年来已经开发了许多方法来进行单细胞RNA测序数据的基因差异表达分析。

结果

通过模拟研究和实际数据示例,我们评估了用于单细胞RNA测序数据基因差异表达分析的五种开源常用方法的性能。这五种方法包括DEsingle(零膨胀负二项式模型)、Linnorm(使用limma软件包对转换后的计数数据进行经验贝叶斯方法)、monocle(近似卡方似然比检验)、MAST(广义线性障碍模型)和DESeq2(具有经验贝叶斯方法的广义线性模型,也常用于批量RNA测序差异表达分析)。我们评估了这五种方法在不同样本量、分布假设和数据中零值比例下的错误发现率(FDR)控制、敏感性、特异性、准确性和受试者工作特征曲线下面积(AUROC)。

结论

我们发现,当数据遵循负二项分布时,在所有测试样本量和不同比例的真正差异表达基因中,MAST方法的AUROC值最大,在这五种方法中表现最佳。当每组样本量增加到100时,无论数据分布如何,MAST方法的AUROC最高,表现最佳。如果在基因差异分析之前先过滤掉过多的零值,DEsingle、Linnorm和DESeq2的表现相对优于MAST和monocle方法,AUROC值更高。

相似文献

1
An evaluation of statistical differential analysis methods in single-cell RNA-seq data.单细胞RNA测序数据中统计差异分析方法的评估
Res Sq. 2023 Mar 23:rs.3.rs-2670717. doi: 10.21203/rs.3.rs-2670717/v1.
2
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.
3
Reproducibility of Methods to Detect Differentially Expressed Genes from Single-Cell RNA Sequencing.从单细胞RNA测序中检测差异表达基因方法的可重复性
Front Genet. 2020 Jan 17;10:1331. doi: 10.3389/fgene.2019.01331. eCollection 2019.
4
DEsingle for detecting three types of differential expression in single-cell RNA-seq data.DEsingle 用于检测单细胞 RNA-seq 数据中的三种差异表达。
Bioinformatics. 2018 Sep 15;34(18):3223-3224. doi: 10.1093/bioinformatics/bty332.
5
lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA.lncDIFF:一种用于分析非编码 RNA 差异表达的新型拟似然方法。
BMC Genomics. 2019 Jul 2;20(1):539. doi: 10.1186/s12864-019-5926-4.
6
Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data.基于刀切方差估计的逆加权法用于单细胞 RNA 测序数据分析中的差异表达分析。
Comput Biol Chem. 2022 Oct;100:107733. doi: 10.1016/j.compbiolchem.2022.107733. Epub 2022 Jul 18.
7
Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studies.RNA-seq 研究中平衡两组比较差异基因表达分析的库大小标准化和统计方法选择。
BMC Genomics. 2020 Jan 28;21(1):75. doi: 10.1186/s12864-020-6502-7.
8
Linnorm: improved statistical analysis for single cell RNA-seq expression data.Linnorm:单细胞RNA测序表达数据的改进统计分析
Nucleic Acids Res. 2017 Dec 15;45(22):e179. doi: 10.1093/nar/gkx828.
9
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.用于RNA测序数据差异表达分析的每个样本全局缩放和每个基因归一化方法的比较。
PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.
10
Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2.三种 RNA 测序差异表达分析方法:limma、EdgeR、DESeq2。
J Vis Exp. 2021 Sep 18(175). doi: 10.3791/62528.

引用本文的文献

1
Postmortem Interval Leads to Loss of Disease-Specific Signatures in Brain Tissue.死后间隔时间导致脑组织中疾病特异性特征丧失。
eNeuro. 2025 Mar 14;12(3). doi: 10.1523/ENEURO.0505-24.2025. Print 2025 Mar.