Erhard Florian, Zimmer Ralf
Institut für Informatik, Ludwig-Maximilians-Universität München, Amalienstraße 17, 80333 München, Germany
Nucleic Acids Res. 2015 Nov 16;43(20):e136. doi: 10.1093/nar/gkv696. Epub 2015 Jul 8.
Various biases affect high-throughput sequencing read counts. Contrary to the general assumption, we show that bias does not always cancel out when fold changes are computed, and that it affects more than 20% of the genes called differentially regulated in RNA-seq experiments, with drastic effects on subsequent biological interpretation. Here, we propose a novel approach to estimating fold changes. Our method is based on a probabilistic model that directly incorporates count ratios instead of read counts. It provides a theoretical foundation for pseudo-counts and can be used to estimate fold change credible intervals as well as normalization factors that outperform currently used normalization methods. By comparing RNA-seq-derived fold changes against qPCR reference data from the MAQC/SEQC project, and by analyzing random barcoded sequencing data, we show that our method significantly improves fold change estimates. Our software implementation is freely available from the project website http://www.bio.ifi.lmu.de/software/lfc.
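To make the link between a count-ratio model, pseudo-counts, and credible intervals concrete, the following is a minimal sketch and not the authors' implementation. It assumes that the proportion p of reads from condition A among the reads of one gene has a Beta prior with equal parameters, so that the posterior after observing counts a and b is Beta(a + pseudo, b + pseudo). The function names, the default pseudo-count of 1, and the Monte Carlo interval are all illustrative assumptions; the abstract does not specify these details.

```python
import math
import random


def pseudocount_log2_fc(a, b, pseudo=1.0):
    """Log2 fold change with a pseudo-count added to both counts.

    Under the assumed Beta(pseudo, pseudo) prior, the posterior mean of p is
    (a + pseudo) / (a + b + 2 * pseudo), and its odds p / (1 - p) equal
    (a + pseudo) / (b + pseudo).
    """
    return math.log2((a + pseudo) / (b + pseudo))


def log2_fc_credible_interval(a, b, pseudo=1.0, level=0.95, draws=20000, seed=0):
    """Equal-tailed credible interval for log2(p / (1 - p)).

    Samples p from the assumed posterior Beta(a + pseudo, b + pseudo) and
    takes empirical quantiles of the transformed samples.
    """
    rng = random.Random(seed)  # fixed seed so repeated calls agree
    samples = []
    for _ in range(draws):
        p = rng.betavariate(a + pseudo, b + pseudo)
        # Clamp away from 0 and 1 so the log-odds stay finite.
        p = min(max(p, 1e-12), 1.0 - 1e-12)
        samples.append(math.log2(p / (1.0 - p)))
    samples.sort()
    tail = (1.0 - level) / 2.0
    lower = samples[int(tail * draws)]
    upper = samples[int((1.0 - tail) * draws) - 1]
    return lower, upper


if __name__ == "__main__":
    a, b = 30, 10  # hypothetical read counts for one gene in two conditions
    print("log2 fold change:", pseudocount_log2_fc(a, b))
    print("95% credible interval:", log2_fc_credible_interval(a, b))
```

In this sketch the pseudo-count is simply the prior parameter, which is one way to read the claim that the model gives pseudo-counts a theoretical foundation. Genes with few reads get wide intervals, and genes with many reads get narrow ones.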