School of Applied Chemistry and Biotechnology, Shenzhen Polytechnic, Shenzhen, 518055, China.
College of Grassland Science, Beijing Forestry University, Beijing, 100083, China.
Mol Genet Genomics. 2020 Mar;295(2):475-489. doi: 10.1007/s00438-019-01635-y. Epub 2020 Jan 1.
Perennial ryegrass (Lolium perenne), one of the most widely used forage and cool-season turfgrasses worldwide, has a breeding history of more than 100 years. However, the current draft genome annotation and transcriptome characterization are incomplete, mainly because full-length transcripts are very difficult to obtain. To explore the complete structure of the mRNA and improve the current draft genome, we performed full-length transcriptome sequencing of perennial ryegrass using PacBio single-molecule long-read sequencing. We generated 29,175 high-confidence non-redundant transcripts from 15,893 genetic loci, among which more than 66.88% of transcripts and 24.99% of genetic loci were not previously annotated in the current reference genome. The 18,327 re-annotated transcripts enriched the reference transcriptome. In particular, 6,709 alternative splicing events and 23,789 alternative polyadenylation sites were detected, providing a comprehensive landscape of the post-transcriptional regulation network. Furthermore, we identified 218 long non-coding RNAs and 478 fusion genes. Finally, we explored the transcriptional regulation mechanism of perennial ryegrass in response to drought stress using the newly updated reference transcriptome sequences, which provided new information on the underlying transcriptional regulation network. Taken together, we analyzed the full-length transcriptome of perennial ryegrass by PacBio single-molecule long-read sequencing. These results improve our understanding of the perennial ryegrass transcriptome and refine the annotation of the reference genome.