• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于读段计数的 RNAseq 分析方法评估。

Evaluation of read count based RNAseq analysis methods.

出版信息

BMC Genomics. 2013;14 Suppl 8(Suppl 8):S2. doi: 10.1186/1471-2164-14-S8-S2. Epub 2013 Dec 9.

DOI:10.1186/1471-2164-14-S8-S2
PMID:24564449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4092879/
Abstract

BACKGROUND

RNAseq technology is replacing microarray technology as the tool of choice for gene expression profiling. While providing much richer data than microarray, analysis of RNAseq data has been much more challenging. To date, there has not been a consensus on the best approach for conducting robust RNAseq analysis.

RESULTS

In this study, we designed a thorough experiment to evaluate six read count-based RNAseq analysis methods (DESeq, DEGseq, edgeR, NBPSeq, TSPM and baySeq) using both real and simulated data. We found the six methods produce similar fold changes and reasonable overlapping of differentially expressed genes based on p-values. However, all six methods suffer from over-sensitivity.

CONCLUSIONS

Based on the evaluation of runtime using real data and area under the receiver operating characteristic curve (AUC-ROC) using simulated data, we found that edgeR achieves a better balance between speed and accuracy than the other methods.

摘要

背景

RNAseq 技术正在取代微阵列技术,成为基因表达谱分析的首选工具。虽然 RNAseq 技术比微阵列技术提供了更丰富的数据,但对其数据的分析也更具挑战性。迄今为止,对于进行稳健的 RNAseq 分析,还没有达成共识的最佳方法。

结果

在这项研究中,我们设计了一个全面的实验,使用真实数据和模拟数据来评估六种基于读取计数的 RNAseq 分析方法(DESeq、DEGseq、edgeR、NBPSeq、TSPM 和 baySeq)。我们发现,基于 p 值,这六种方法产生的倍数变化和差异表达基因的合理重叠相似。然而,所有六种方法都存在过度敏感的问题。

结论

基于使用真实数据评估运行时间和使用模拟数据评估接收者操作特征曲线下的面积(AUC-ROC),我们发现 edgeR 在速度和准确性之间取得了比其他方法更好的平衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/4119bd9df393/1471-2164-14-S8-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/6ede551be846/1471-2164-14-S8-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/1f95e13b1942/1471-2164-14-S8-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/4902dac3bd78/1471-2164-14-S8-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/03072a517200/1471-2164-14-S8-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/4119bd9df393/1471-2164-14-S8-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/6ede551be846/1471-2164-14-S8-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/1f95e13b1942/1471-2164-14-S8-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/4902dac3bd78/1471-2164-14-S8-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/03072a517200/1471-2164-14-S8-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a69/4092879/4119bd9df393/1471-2164-14-S8-S2-5.jpg

相似文献

1
Evaluation of read count based RNAseq analysis methods.基于读段计数的 RNAseq 分析方法评估。
BMC Genomics. 2013;14 Suppl 8(Suppl 8):S2. doi: 10.1186/1471-2164-14-S8-S2. Epub 2013 Dec 9.
2
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.用于RNA测序数据差异表达分析的每个样本全局缩放和每个基因归一化方法的比较。
PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.
3
A comparison of statistical methods for detecting differentially expressed genes from RNA-seq data.基于 RNA-seq 数据的差异表达基因检测的统计学方法比较。
Am J Bot. 2012 Feb;99(2):248-56. doi: 10.3732/ajb.1100340. Epub 2012 Jan 20.
4
aFold - using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data.aFold - 使用多项式不确定性建模进行 RNA 测序数据的差异基因表达估计。
BMC Genomics. 2019 May 10;20(1):364. doi: 10.1186/s12864-019-5686-1.
5
TCC: an R package for comparing tag count data with robust normalization strategies.TCC:一个用于比较标签计数数据的 R 包,带有稳健的归一化策略。
BMC Bioinformatics. 2013 Jul 9;14:219. doi: 10.1186/1471-2105-14-219.
6
Differential Expression Analysis in Single-Cell Transcriptomics.单细胞转录组学中的差异表达分析
Methods Mol Biol. 2019;1979:425-432. doi: 10.1007/978-1-4939-9240-9_25.
7
Evaluation of methods for differential expression analysis on multi-group RNA-seq count data.多组RNA测序计数数据差异表达分析方法的评估
BMC Bioinformatics. 2015 Nov 4;16:361. doi: 10.1186/s12859-015-0794-7.
8
Large scale comparison of gene expression levels by microarrays and RNAseq using TCGA data.基于 TCGA 数据的基因表达水平的大规模比较:微阵列和 RNAseq 方法的比较。
PLoS One. 2013 Aug 20;8(8):e71462. doi: 10.1371/journal.pone.0071462. eCollection 2013.
9
Normalization of RNA-Seq data using adaptive trimmed mean with multi-reference.使用自适应修剪均值和多参考对 RNA-Seq 数据进行标准化。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae241.
10
Parallel comparison of Illumina RNA-Seq and Affymetrix microarray platforms on transcriptomic profiles generated from 5-aza-deoxy-cytidine treated HT-29 colon cancer cells and simulated datasets.Illumina RNA-Seq 和 Affymetrix 微阵列平台在 5-aza-去氧胞苷处理的 HT-29 结肠癌细胞和模拟数据集产生的转录组图谱上的平行比较。
BMC Bioinformatics. 2013;14 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2105-14-S9-S1. Epub 2013 Jun 28.

引用本文的文献

1
Nanopore microscope identifies RNA isoforms with structural colours.纳米孔显微镜可识别具有结构颜色的 RNA 异构体。
Nat Chem. 2022 Nov;14(11):1258-1264. doi: 10.1038/s41557-022-01037-5. Epub 2022 Sep 19.
2
Integrated COVID-19 Predictor: Differential expression analysis to reveal potential biomarkers and prediction of coronavirus using RNA-Seq profile data.整合 COVID-19 预测器:差异表达分析揭示潜在生物标志物和使用 RNA-Seq 谱数据预测冠状病毒
Comput Biol Med. 2022 Aug;147:105684. doi: 10.1016/j.compbiomed.2022.105684. Epub 2022 Jun 3.
3
Transcriptome-Wide Gene Expression Plasticity in in Response to Grazing Intensity Differences.

本文引用的文献

1
Large scale comparison of gene expression levels by microarrays and RNAseq using TCGA data.基于 TCGA 数据的基因表达水平的大规模比较:微阵列和 RNAseq 方法的比较。
PLoS One. 2013 Aug 20;8(8):e71462. doi: 10.1371/journal.pone.0071462. eCollection 2013.
2
Comprehensive molecular characterization of human colon and rectal cancer.全面的人类结肠和直肠癌分子特征分析。
Nature. 2012 Jul 18;487(7407):330-7. doi: 10.1038/nature11252.
3
Integrated genomic analyses of ovarian carcinoma.卵巢癌的综合基因组分析。
转录组水平上基因表达对放牧强度差异的响应。
Int J Mol Sci. 2021 Nov 2;22(21):11882. doi: 10.3390/ijms222111882.
4
In vivo commensal control of Clostridioides difficile virulence.体内共生控制艰难梭菌毒力。
Cell Host Microbe. 2021 Nov 10;29(11):1693-1708.e7. doi: 10.1016/j.chom.2021.09.007. Epub 2021 Oct 11.
5
Differential gene expression in Drosophila melanogaster and D. nigrosparsa infected with the same Wolbachia strain.果蝇和黑腹果蝇感染相同沃尔巴克氏体菌株时的差异基因表达。
Sci Rep. 2021 May 31;11(1):11336. doi: 10.1038/s41598-021-90857-5.
6
Mass molecular testing for COVID19 using NGS-based technology and a highly scalable workflow.基于 NGS 技术和高度可扩展工作流程的大规模分子 COVID19 检测。
Sci Rep. 2021 Mar 29;11(1):7122. doi: 10.1038/s41598-021-86498-3.
7
Systematic comparison and assessment of RNA-seq procedures for gene expression quantitative analysis.系统比较和评估 RNA-seq 程序进行基因表达定量分析。
Sci Rep. 2020 Nov 12;10(1):19737. doi: 10.1038/s41598-020-76881-x.
8
Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studies.RNA-seq 研究中平衡两组比较差异基因表达分析的库大小标准化和统计方法选择。
BMC Genomics. 2020 Jan 28;21(1):75. doi: 10.1186/s12864-020-6502-7.
9
Gene expression changes associated with the evolutionary loss of a metabolic trait: lack of lipogenesis in parasitoids.与代谢特征进化丧失相关的基因表达变化:寄生蜂中缺乏脂肪生成。
BMC Genomics. 2019 Apr 23;20(1):309. doi: 10.1186/s12864-019-5673-6.
10
PairedFB: a full hierarchical Bayesian model for paired RNA-seq data with heterogeneous treatment effects.配对 FB:一种具有异质处理效应的配对 RNA-seq 数据的全层次贝叶斯模型。
Bioinformatics. 2019 Mar 1;35(5):787-797. doi: 10.1093/bioinformatics/bty731.
Nature. 2011 Jun 29;474(7353):609-15. doi: 10.1038/nature10166.
4
Differential expression analysis for sequence count data.差异表达分析序列计数数据。
Genome Biol. 2010;11(10):R106. doi: 10.1186/gb-2010-11-10-r106. Epub 2010 Oct 27.
5
baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.baySeq:用于识别序列计数数据中差异表达的经验贝叶斯方法。
BMC Bioinformatics. 2010 Aug 10;11:422. doi: 10.1186/1471-2105-11-422.
6
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.通过 RNA-Seq 进行转录本组装和定量分析揭示了细胞分化过程中未注释的转录本和异构体转换。
Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.
7
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.edgeR:一个用于数字基因表达数据差异表达分析的 Bioconductor 包。
Bioinformatics. 2010 Jan 1;26(1):139-40. doi: 10.1093/bioinformatics/btp616. Epub 2009 Nov 11.
8
DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.DEGseq:一个用于从 RNA-seq 数据中识别差异表达基因的 R 包。
Bioinformatics. 2010 Jan 1;26(1):136-8. doi: 10.1093/bioinformatics/btp612. Epub 2009 Oct 24.
9
RNA-Seq: a revolutionary tool for transcriptomics.RNA测序:转录组学的革命性工具。
Nat Rev Genet. 2009 Jan;10(1):57-63. doi: 10.1038/nrg2484.
10
The beginning of the end for microarrays?微阵列技术走向终结的开端?
Nat Methods. 2008 Jul;5(7):585-7. doi: 10.1038/nmeth0708-585.