利用观测权重稳健检测RNA测序数据中的差异表达。

Robustly detecting differential expression in RNA sequencing data using observation weights.

作者信息

Zhou Xiaobei, Lindsay Helen, Robinson Mark D

机构信息

Institute of Molecular Life Sciences, University of Zurich, CH-8057 Zurich, Switzerland SIB Swiss Institute of Bioinformatics, University of Zurich, CH-8057 Zurich, Switzerland.

Institute of Molecular Life Sciences, University of Zurich, CH-8057 Zurich, Switzerland SIB Swiss Institute of Bioinformatics, University of Zurich, CH-8057 Zurich, Switzerland

出版信息

Nucleic Acids Res. 2014 Jun;42(11):e91. doi: 10.1093/nar/gku310. Epub 2014 Apr 20.

DOI:10.1093/nar/gku310

PMID:24753412

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4066750/

Abstract

A popular approach for comparing gene expression levels between (replicated) conditions of RNA sequencing data relies on counting reads that map to features of interest. Within such count-based methods, many flexible and advanced statistical approaches now exist and offer the ability to adjust for covariates (e.g. batch effects). Often, these methods include some sort of 'sharing of information' across features to improve inferences in small samples. It is important to achieve an appropriate tradeoff between statistical power and protection against outliers. Here, we study the robustness of existing approaches for count-based differential expression analysis and propose a new strategy based on observation weights that can be used within existing frameworks. The results suggest that outliers can have a global effect on differential analyses. We demonstrate the effectiveness of our new approach with real data and simulated data that reflects properties of real datasets (e.g. dispersion-mean trend) and develop an extensible framework for comprehensive testing of current and future methods. In addition, we explore the origin of such outliers, in some cases highlighting additional biological or technical factors within the experiment. Further details can be downloaded from the project website: http://imlspenticton.uzh.ch/robinson_lab/edgeR_robust/.

摘要

一种用于比较RNA测序数据（重复）条件下基因表达水平的常用方法依赖于对映射到感兴趣特征的 reads 进行计数。在这些基于计数的方法中，现在存在许多灵活且先进的统计方法，并且能够针对协变量（例如批次效应）进行调整。通常，这些方法包括某种跨特征的“信息共享”，以改善小样本中的推断。在统计功效和抵御异常值之间实现适当的权衡非常重要。在这里，我们研究了基于计数的差异表达分析现有方法的稳健性，并提出了一种基于观察权重的新策略，该策略可在现有框架内使用。结果表明，异常值可能会对差异分析产生全局影响。我们用反映真实数据集属性（例如离散度 - 均值趋势）的真实数据和模拟数据证明了我们新方法的有效性，并开发了一个可扩展框架，用于对当前和未来的方法进行全面测试。此外，我们探索了此类异常值的来源，在某些情况下突出了实验中其他的生物学或技术因素。更多详细信息可从项目网站下载：http://imlspenticton.uzh.ch/robinson_lab/edgeR_robust/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7339/4066750/87d64c6e538f/gku310fig1.jpg

相似文献

Robustly detecting differential expression in RNA sequencing data using observation weights.

Nucleic Acids Res. 2014 Jun;42(11):e91. doi: 10.1093/nar/gku310. Epub 2014 Apr 20.

A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.

PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.

Robust identification of differentially expressed genes from RNA-seq data.

Genomics. 2020 Mar;112(2):2000-2010. doi: 10.1016/j.ygeno.2019.11.012. Epub 2019 Nov 20.

Benchmarking RNA-seq differential expression analysis methods using spike-in and simulation data.

PLoS One. 2020 Apr 30;15(4):e0232271. doi: 10.1371/journal.pone.0232271. eCollection 2020.

LPEseq: Local-Pooled-Error Test for RNA Sequencing Experiments with a Small Number of Replicates.

PLoS One. 2016 Aug 17;11(8):e0159182. doi: 10.1371/journal.pone.0159182. eCollection 2016.

Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.

BMC Genomics. 2015;16 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2164-16-S7-S14. Epub 2015 Jun 11.

Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data.

Nucleic Acids Res. 2017 Nov 2;45(19):10978-10988. doi: 10.1093/nar/gkx754.

Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing.

BMC Genomics. 2012 Sep 17;13:484. doi: 10.1186/1471-2164-13-484.

Count-based differential expression analysis of RNA sequencing data using R and Bioconductor.

Nat Protoc. 2013 Sep;8(9):1765-86. doi: 10.1038/nprot.2013.099. Epub 2013 Aug 22.

A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.

BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.

引用本文的文献

The Relevance of G-Quadruplexes in Gene Promoters and the First Introns Associated with Transcriptional Regulation in Breast Cancer.

Int J Mol Sci. 2025 Jul 17;26(14):6874. doi: 10.3390/ijms26146874.

The Human Myometrial Transcriptome and the DNA Methylome of Testosterone-treated Patients Resemble the Myometria from Fibroid Patients.

Reprod Sci. 2025 Jun 5. doi: 10.1007/s43032-025-01893-9.

Circulating immune cells exhibit distinct traits linked to metastatic burden in breast cancer.

Breast Cancer Res. 2025 May 8;27(1):73. doi: 10.1186/s13058-025-01982-2.

In vivo functional screens reveal loss as a driver of chemoresistance in small cell lung cancer.

Sci Adv. 2025 Apr 25;11(17):eadq7084. doi: 10.1126/sciadv.adq7084. Epub 2025 Apr 23.

Spatial transcriptomic analysis identifies epithelium-macrophage crosstalk in endometriotic lesions.

iScience. 2025 Jan 10;28(2):111790. doi: 10.1016/j.isci.2025.111790. eCollection 2025 Feb 21.

edgeR v4: powerful differential analysis of sequencing data with expanded functionality and improved support for small counts and larger datasets.

Nucleic Acids Res. 2025 Jan 11;53(2). doi: 10.1093/nar/gkaf018.

Robust double machine learning model with application to omics data.

BMC Bioinformatics. 2024 Nov 14;25(1):355. doi: 10.1186/s12859-024-05975-4.

YAP1 and WWTR1 are required for murine pregnancy initiation.

Reproduction. 2025 Jan 2;169(1). doi: 10.1530/REP-24-0355. Print 2025 Jan 1.

Histone deacetylase 9 promotes osteogenic trans-differentiation of vascular smooth muscle cells via ferroptosis in chronic kidney disease vascular calcification.

Ren Fail. 2024 Dec;46(2):2422435. doi: 10.1080/0886022X.2024.2422435. Epub 2024 Nov 5.

Sensor-Based and Visual Behavioral Profiling of Dry Holstein Cows Presenting Distinct Median Core Body Temperatures.

Animals (Basel). 2024 Oct 1;14(19):2832. doi: 10.3390/ani14192832.

本文引用的文献

voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.

Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.

Evaluating statistical analysis models for RNA sequencing experiments.

Front Genet. 2013 Sep 17;4:178. doi: 10.3389/fgene.2013.00178. eCollection 2013.

Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.

Genome Biol. 2013;14(9):R95. doi: 10.1186/gb-2013-14-9-r95.

Count-based differential expression analysis of RNA sequencing data using R and Bioconductor.

Nat Protoc. 2013 Sep;8(9):1765-86. doi: 10.1038/nprot.2013.099. Epub 2013 Aug 22.

Higher order asymptotics for negative binomial regression inferences from RNA-sequencing data.

Stat Appl Genet Mol Biol. 2013 Mar 26;12(1):49-70. doi: 10.1515/sagmb-2012-0071.

A comparison of methods for differential expression analysis of RNA-seq data.

BMC Bioinformatics. 2013 Mar 9;14:91. doi: 10.1186/1471-2105-14-91.

EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments.

Bioinformatics. 2013 Apr 15;29(8):1035-43. doi: 10.1093/bioinformatics/btt087. Epub 2013 Feb 21.

A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data.

Biostatistics. 2013 Apr;14(2):232-43. doi: 10.1093/biostatistics/kxs033. Epub 2012 Sep 22.

Bayesian analysis of RNA sequencing data by estimating multiple shrinkage priors.

Biostatistics. 2013 Jan;14(1):113-28. doi: 10.1093/biostatistics/kxs031. Epub 2012 Sep 17.

Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation.

Nucleic Acids Res. 2012 May;40(10):4288-97. doi: 10.1093/nar/gks042. Epub 2012 Jan 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用观测权重稳健检测RNA测序数据中的差异表达。

Robustly detecting differential expression in RNA sequencing data using observation weights.

作者信息

Zhou Xiaobei, Lindsay Helen, Robinson Mark D

机构信息

Institute of Molecular Life Sciences, University of Zurich, CH-8057 Zurich, Switzerland SIB Swiss Institute of Bioinformatics, University of Zurich, CH-8057 Zurich, Switzerland.

Institute of Molecular Life Sciences, University of Zurich, CH-8057 Zurich, Switzerland SIB Swiss Institute of Bioinformatics, University of Zurich, CH-8057 Zurich, Switzerland

出版信息

Nucleic Acids Res. 2014 Jun;42(11):e91. doi: 10.1093/nar/gku310. Epub 2014 Apr 20.

DOI:10.1093/nar/gku310

PMID:24753412

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4066750/

Abstract

摘要

利用观测权重稳健检测RNA测序数据中的差异表达。

Robustly detecting differential expression in RNA sequencing data using observation weights.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用观测权重稳健检测RNA测序数据中的差异表达。

Robustly detecting differential expression in RNA sequencing data using observation weights.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献