Carvalho Daniel S, Nishimwe Aime V, Schnable James C
Department of Agronomy and Horticulture Center for Plant Science Innovation University of Nebraska-Lincoln Lincoln NE USA.
Plant Direct. 2020 Feb 28;4(2):e00203. doi: 10.1002/pld3.203. eCollection 2020 Feb.
The number of plant species with genomic and transcriptomic data has been increasing rapidly. The grasses-Poaceae-have been well represented among species with published reference genomes. However, as a result the genomes of wild grasses are less frequently targeted by sequencing efforts. Sequence data from wild relatives of crop species in the grasses can aid the study of domestication, gene discovery for breeding and crop improvement, and improve our understanding of the evolution of C photosynthesis. Here, we used long-read sequencing technology to characterize the transcriptomes of three C panicoid grass species: , , and . Based on alignments to the sorghum genome, we estimate that assembled consensus transcripts from each species capture between 54.2% and 65.7% of the conserved syntenic gene space in grasses. Genes co-opted into C were also well represented in this dataset, despite concerns that because these genes might play roles unrelated to photosynthesis in the target species, they would be expressed at low levels and missed by transcript-based sequencing. A combined analysis using syntenic orthologous genes from grasses with published reference genomes and consensus long-read sequences from these wild species was consistent with previously published phylogenies. It is hoped that these data, targeting underrepresented classes of species within the PACMAD grasses-wild species and species utilizing C photosynthesis-will aid in future studies of domestication and C evolution by decreasing the evolutionary distance between C and C species within this clade, enabling more accurate comparisons associated with evolution of the C pathway.
拥有基因组和转录组数据的植物物种数量一直在迅速增加。禾本科植物在已发表参考基因组的物种中占比很大。然而,结果是野草的基因组较少成为测序工作的目标。禾本科作物野生近缘种的序列数据有助于驯化研究、育种和作物改良的基因发现,并增进我们对C4光合作用进化的理解。在这里,我们使用长读长测序技术对三种C4类黍族禾本科植物的转录组进行了表征:、和。基于与高粱基因组的比对,我们估计每个物种组装的共有转录本覆盖了禾本科中54.2%至65.7%的保守同线基因空间。尽管有人担心,因为这些基因可能在目标物种中发挥与光合作用无关的作用,它们会以低水平表达并被基于转录本的测序遗漏,但被纳入C4光合作用的基因在该数据集中也有很好的代表性。使用来自已发表参考基因组的禾本科同线直系同源基因和这些野生物种的共有长读长序列进行的联合分析与先前发表的系统发育一致。希望这些针对PACMAD禾本科中代表性不足的物种类别(野生物种和利用C4光合作用的物种)的数据,将有助于未来的驯化和C4进化研究,通过缩短该分支内C4和C3物种之间的进化距离,实现与C4途径进化相关的更准确比较。