Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, Slovenia.
National Center for Genome Analysis and Support (NCGAS), Indiana University, Bloomington, USA.
Sci Data. 2020 Jul 24;7(1):249. doi: 10.1038/s41597-020-00581-4.
Although the reference genome of Solanum tuberosum Group Phureja double-monoploid (DM) clone is available, knowledge on the genetic diversity of the highly heterozygous tetraploid Group Tuberosum, representing most cultivated varieties, remains largely unexplored. This lack of knowledge hinders further progress in potato research. In conducted investigation, we first merged and manually curated the two existing partially-overlapping DM genome-based gene models, creating a union of genes in Phureja scaffold. Next, we compiled available and newly generated RNA-Seq datasets (cca. 1.5 billion reads) for three tetraploid potato genotypes (cultivar Désirée, cultivar Rywal, and breeding clone PW363) with diverse breeding pedigrees. Short-read transcriptomes were assembled using several de novo assemblers under different settings to test for optimal outcome. For cultivar Rywal, PacBio Iso-Seq full-length transcriptome sequencing was also performed. EvidentialGene redundancy-reducing pipeline complemented with in-house developed scripts was employed to produce accurate and complete cultivar-specific transcriptomes, as well as to attain the pan-transcriptome. The generated transcriptomes and pan-transcriptome represent a valuable resource for potato gene variability exploration, high-throughput omics analyses, and breeding programmes.
尽管已经有了 Solanum tuberosum Group Phureja 双单倍体 (DM) 克隆的参考基因组,但对高度杂合的四倍体 Group Tuberosum 的遗传多样性的了解仍在很大程度上尚未探索。这种缺乏知识的情况阻碍了马铃薯研究的进一步进展。在进行的研究中,我们首先合并并手动整理了现有的两个基于 DM 基因组的部分重叠基因模型,创建了 Phureja 支架中的基因联合体。接下来,我们编译了可用的和新生成的 RNA-Seq 数据集(约 15 亿个读数),用于具有不同育种背景的三个四倍体马铃薯基因型(品种 Désirée、品种 Rywal 和育种克隆 PW363)。使用几种不同设置的从头组装程序对短读转录组进行组装,以测试最佳结果。对于品种 Rywal,还进行了 PacBio Iso-Seq 全长转录组测序。采用证据基因冗余减少管道,并辅以内部开发的脚本,生成准确和完整的品种特异性转录组,并获得泛转录组。生成的转录组和泛转录组为马铃薯基因变异探索、高通量组学分析和育种计划提供了有价值的资源。