Department of Chemical Engineering, Universitat Rovira i Virgili, 43007 Tarragona, Spain.
Department of Bioscience, Aarhus University, 8600 Silkeborg, Denmark.
Sci Rep. 2017 Mar 9;7:42460. doi: 10.1038/srep42460.
RNA-Seq and gene expression microarrays provide comprehensive profiles of gene activity, but lack of reproducibility has hindered their application. A key challenge in the data analysis is the normalization of gene expression levels, which is currently performed following the implicit assumption that most genes are not differentially expressed. Here, we present a mathematical approach to normalization that makes no assumption of this sort. We have found that variation in gene expression is much larger than currently believed, and that it can be measured with available assays. Our results also explain, at least partially, the reproducibility problems encountered in transcriptomics studies. We expect that this improvement in detection will help efforts to realize the full potential of gene expression profiling, especially in analyses of cellular processes involving complex modulations of gene expression.
RNA-Seq 和基因表达微阵列提供了全面的基因活性谱,但缺乏可重复性阻碍了它们的应用。数据分析中的一个关键挑战是基因表达水平的标准化,目前是按照大多数基因没有差异表达的隐含假设进行的。在这里,我们提出了一种不需要这种假设的数学方法来进行标准化。我们发现基因表达的变化比目前认为的要大得多,而且可以用现有的检测方法来测量。我们的结果也至少部分地解释了转录组学研究中遇到的可重复性问题。我们预计,这种检测能力的提高将有助于充分发挥基因表达谱分析的潜力,特别是在涉及基因表达复杂调节的细胞过程分析中。