Zhang Runxuan, Calixto Cristiane P G, Tzioutziou Nikoleta A, James Allan B, Simpson Craig G, Guo Wenbin, Marquez Yamile, Kalyna Maria, Patro Rob, Eyras Eduardo, Barta Andrea, Nimmo Hugh G, Brown John W S
Informatics and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
Plant Sciences Division, College of Life Sciences, University of Dundee, Invergowrie, Dundee, DD2 5DA, UK.
New Phytol. 2015 Oct;208(1):96-101. doi: 10.1111/nph.13545. Epub 2015 Jun 25.
RNA-sequencing (RNA-seq) allows global gene expression analysis at the individual transcript level. Accurate quantification of transcript variants generated by alternative splicing (AS) remains a challenge. We have developed a comprehensive, nonredundant Arabidopsis reference transcript dataset (AtRTD) containing over 74 000 transcripts for use with algorithms to quantify AS transcript isoforms in RNA-seq. The AtRTD was formed by merging transcripts from TAIR10 and novel transcripts identified in an AS discovery project. We have estimated transcript abundance in RNA-seq data using the transcriptome-based alignment-free programmes Sailfish and Salmon and have validated quantification of splicing ratios from RNA-seq by high resolution reverse transcription polymerase chain reaction (HR RT-PCR). Good correlations between splicing ratios from RNA-seq and HR RT-PCR were obtained demonstrating the accuracy of abundances calculated for individual transcripts in RNA-seq. The AtRTD is a resource that will have immediate utility in analysing Arabidopsis RNA-seq data to quantify differential transcript abundance and expression.
RNA测序(RNA-seq)可在单个转录本水平进行全基因组基因表达分析。对由可变剪接(AS)产生的转录本变体进行准确量化仍然是一项挑战。我们开发了一个全面、无冗余的拟南芥参考转录本数据集(AtRTD),其中包含超过74000个转录本,用于与算法结合以量化RNA-seq中的AS转录本异构体。AtRTD是通过合并TAIR10中的转录本和在一个AS发现项目中鉴定出的新转录本而形成的。我们使用基于转录组的无比对程序Sailfish和Salmon估计了RNA-seq数据中的转录本丰度,并通过高分辨率逆转录聚合酶链反应(HR RT-PCR)验证了RNA-seq中剪接比率的量化。RNA-seq和HR RT-PCR的剪接比率之间获得了良好的相关性,证明了RNA-seq中单个转录本丰度计算的准确性。AtRTD是一种资源,将在分析拟南芥RNA-seq数据以量化差异转录本丰度和表达方面立即发挥作用。