Chen Li, Sun Fenghao, Yang Xiaodong, Jin Yulin, Shi Mengkun, Wang Lin, Shi Yu, Zhan Cheng, Wang Qun
Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, 20032, China; Department of Orthopedics, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China.
Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, 20032, China.
Gene. 2017 Sep 10;628:200-204. doi: 10.1016/j.gene.2017.07.056. Epub 2017 Jul 20.
RNA sequencing (RNA-Seq) and microarray are two of the most commonly used high-throughput technologies for transcriptome profiling; however, they both have their own inherent strengths and limitations. This research aims to analyze the correlation between microarrays and RNA-Seq detection of transcripts in the same tissue sample to explore the reproducibility between the techniques. Using data of RNA-Seq v2 and three different microarrays provided by The Cancer Genome Atlas, 11,120 genes of 111 lung squamous cell carcinoma samples were simultaneously detected by the four methods. Then we analyzed the Pearson correlation between microarrays and RNA-Seq. Finally, in the six comparison results, 9984 (89.8%) genes, irrespective of which two methods were used, simultaneously showed the existence of correlation, whereas only 83 (0.1%) genes proved to have no significant correlation in either comparison. In addition, the comparisons between 3266 (29.3%) genes showed high correlation (R≥0.8) in all six comparisons, only for 1643 (14.8%) genes correlation were not as high in either comparison. Meanwhile, transcripts with extreme high or low expression levels were more highly discrepant across the methods. In conclusion, we found that, for most transcripts, the results obtained by RNA-Seq and microarrays were highly reproducible.
RNA测序(RNA-Seq)和微阵列是转录组分析中最常用的两种高通量技术;然而,它们都有各自固有的优势和局限性。本研究旨在分析同一组织样本中微阵列与RNA-Seq转录本检测之间的相关性,以探索这两种技术之间的可重复性。利用癌症基因组图谱提供的RNA-Seq v2数据和三种不同的微阵列,通过这四种方法同时检测了111个肺鳞状细胞癌样本中的11120个基因。然后我们分析了微阵列与RNA-Seq之间的Pearson相关性。最后,在六个比较结果中,无论使用哪两种方法,9984个(89.8%)基因同时显示出相关性,而只有83个(0.1%)基因在任何比较中都没有显著相关性。此外,3266个(29.3%)基因在所有六个比较中显示出高度相关性(R≥0.8),只有1643个(14.8%)基因在任何比较中的相关性都没有那么高。同时,表达水平极高或极低的转录本在不同方法之间的差异更大。总之,我们发现,对于大多数转录本,RNA-Seq和微阵列获得的结果具有高度可重复性。