Center for Quantitative Sciences, Vanderbilt University, Nashville, Tennessee, United States of America.
PLoS One. 2013 Aug 20;8(8):e71462. doi: 10.1371/journal.pone.0071462. eCollection 2013.
RNAseq and microarray methods are frequently used to measure gene expression level. While similar in purpose, there are fundamental differences between the two technologies. Here, we present the largest comparative study between microarray and RNAseq methods to date using The Cancer Genome Atlas (TCGA) data. We found high correlations between expression data obtained from the Affymetrix one-channel microarray and RNAseq (Spearman correlations coefficients of ∼0.8). We also observed that the low abundance genes had poorer correlations between microarray and RNAseq data than high abundance genes. As expected, due to measurement and normalization differences, Agilent two-channel microarray and RNAseq data were poorly correlated (Spearman correlations coefficients of only ∼0.2). By examining the differentially expressed genes between tumor and normal samples we observed reasonable concordance in directionality between Agilent two-channel microarray and RNAseq data, although a small group of genes were found to have expression changes reported in opposite directions using these two technologies. Overall, RNAseq produces comparable results to microarray technologies in term of expression profiling. The RNAseq normalization methods RPKM and RSEM produce similar results on the gene level and reasonably concordant results on the exon level. Longer exons tended to have better concordance between the two normalization methods than shorter exons.
RNAseq 和微阵列方法常用于测量基因表达水平。虽然它们的目的相似,但这两种技术在根本上存在差异。在这里,我们使用癌症基因组图谱 (TCGA) 数据展示了迄今为止最大的微阵列和 RNAseq 方法之间的比较研究。我们发现,从 Affymetrix 单通道微阵列获得的表达数据与 RNAseq 之间具有高度相关性(Spearman 相关系数约为 0.8)。我们还观察到,低丰度基因与微阵列和 RNAseq 数据之间的相关性比高丰度基因差。正如预期的那样,由于测量和归一化差异,Agilent 双通道微阵列和 RNAseq 数据之间的相关性很差(Spearman 相关系数仅约为 0.2)。通过检查肿瘤和正常样本之间的差异表达基因,我们观察到 Agilent 双通道微阵列和 RNAseq 数据在方向上具有合理的一致性,尽管发现一小部分基因使用这两种技术报告了相反方向的表达变化。总体而言,RNAseq 在表达谱方面与微阵列技术产生可比的结果。RNAseq 的 RPKM 和 RSEM 归一化方法在基因水平上产生相似的结果,在外显子水平上具有相当一致的结果。较长的外显子往往比较短的外显子在两种归一化方法之间具有更好的一致性。