Department of Biology, University of Florida, Gainesville, FL, 32611-8525, USA.
Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611-7800, USA.
Mol Ecol Resour. 2017 Nov;17(6):1243-1256. doi: 10.1111/1755-0998.12670. Epub 2017 Jun 5.
Alternative splicing (AS) is a major source of transcript and proteome diversity, but examining AS in species without well-annotated reference genomes remains difficult. Research on both human and mouse has demonstrated the advantages of using Iso-Seq™ data for isoform-level transcriptome analysis, including the study of AS and gene fusion. We applied Iso-Seq™ to investigate AS in Amborella trichopoda, a phylogenetically pivotal species that is sister to all other living angiosperms. Our data show that, compared with RNA-Seq data, the Iso-Seq™ platform provides better recovery on large transcripts, new gene locus identification and gene model correction. Reference-based AS detection with Iso-Seq™ data identifies AS within a higher fraction of multi-exonic genes than observed for published RNA-Seq analysis (45.8% vs. 37.5%). These data demonstrate that the Iso-Seq™ approach is useful for detecting AS events. Using the Iso-Seq-defined transcript collection in Amborella as a reference, we further describe a pipeline for detection of AS isoforms from PacBio Iso-Seq™ without using a reference sequence (de novo). Results using this pipeline show a 66%-76% overall success rate in identifying AS events. This de novoAS detection pipeline provides a method to accurately characterize and identify bona fide alternatively spliced transcripts in any nonmodel system that lacks a reference genome sequence. Hence, our pipeline has huge potential applications and benefits to the broader biology community.
可变剪接 (AS) 是转录组和蛋白质组多样性的主要来源,但对于缺乏良好注释参考基因组的物种,研究 AS 仍然具有挑战性。人类和小鼠的研究都表明,使用 Iso-Seq™ 数据进行异构体水平转录组分析具有优势,包括对 AS 和基因融合的研究。我们应用 Iso-Seq™ 来研究 Amborella trichopoda 中的 AS,该物种在系统发育上是所有其他活被子植物的姐妹群,具有重要意义。我们的数据表明,与 RNA-Seq 数据相比,Iso-Seq™ 平台在大转录本、新基因座鉴定和基因模型校正方面提供了更好的恢复。基于参考的 Iso-Seq™ 数据检测到的 AS 比已发表的 RNA-Seq 分析观察到的多外显子基因中的 AS 更高(45.8%比 37.5%)。这些数据表明,Iso-Seq™ 方法可用于检测 AS 事件。我们进一步描述了一种使用 PacBio Iso-Seq™ 而不使用参考序列(从头开始)从 Amborella 的 Iso-Seq 定义的转录本集合中检测 AS 异构体的管道。使用此管道的结果表明,在鉴定 AS 事件方面总体成功率为 66%-76%。这种从头开始的 AS 检测管道为任何缺乏参考基因组序列的非模式系统提供了一种准确描述和鉴定真实的可变剪接转录本的方法。因此,我们的管道具有广泛的应用潜力和对更广泛的生物学界的益处。