Zuo Chunman, Blow Matthew, Sreedasyam Avinash, Kuo Rita C, Ramamoorthy Govindarajan Kunde, Torres-Jerez Ivone, Li Guifen, Wang Mei, Dilworth David, Barry Kerrie, Udvardi Michael, Schmutz Jeremy, Tang Yuhong, Xu Ying
1College of Computer Science and Technology, Jilin University, Changchun, China.
2Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, University of Georgia, Athens, GA USA.
Biotechnol Biofuels. 2018 Jun 20;11:170. doi: 10.1186/s13068-018-1167-z. eCollection 2018.
Switchgrass ( L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts.
We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures.
Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.
柳枝稷(Panicum virgatum L.)是一种重要的生物能源作物,广泛用于木质纤维素研究。虽然已经使用基于短读长的测序技术对该物种进行了广泛的转录组分析,但关于可变剪接(AS)转录本的可靠信息却很少。
我们对六种柳枝稷组织类型混合后的转录组进行了分析,使用太平洋生物科学公司(PacBio)的单分子长读长技术进行测序。我们的分析鉴定出105,419个独特转录本,覆盖43,570个已知基因和8795个以前未知的基因。其中45,168个是已知基因的新转录本。总共鉴定出60,096个AS转录本,其中45,628个是新的。我们还预测了1549个参与细胞壁构建和重塑的基因转录本,其中639个是已知细胞壁基因的新转录本。大多数预测的转录本通过基于Illumina的短读长进行了验证。具体而言,所有独特转录本中96%的剪接连接位点至少被五个Illumina读段验证。将我们鉴定的转录本衍生的基因与当前的基因组注释进行比较发现,在两种分析预测的基因集中,有16,640个具有不同的外显子-内含子结构。
总体而言,PacBio RNA数据提供了大量关于柳枝稷转录组和基因组的新信息。