Throckmorton Plant Sciences Center, Kansas State University, Manhattan, KS 66506, USA.
Genome Biol. 2011 Sep 14;12(9):R88. doi: 10.1186/gb-2011-12-9-r88.
The ability of grass species to adapt to various habitats is attributed to the dynamic nature of their genomes, which have been shaped by multiple rounds of ancient and recent polyploidization. To gain a better understanding of the nature and extent of variation in functionally relevant regions of a polyploid genome, we developed a sequence capture assay to compare exonic sequences of allotetraploid wheat accessions.
A sequence capture assay was designed for the targeted re-sequencing of 3.5 Mb exon regions that surveyed a total of 3,497 genes from allotetraploid wheat. These data were used to describe SNPs, copy number variation and homoeologous sequence divergence in coding regions. A procedure for variant discovery in the polyploid genome was developed and experimentally validated. About 1% and 24% of discovered SNPs were loss-of-function and non-synonymous mutations, respectively. Under-representation of replacement mutations was identified in several groups of genes involved in translation and metabolism. Gene duplications were predominant in a cultivated wheat accession, while more gene deletions than duplications were identified in wild wheat.
We demonstrate that, even though the level of sequence similarity between targeted polyploid genomes and capture baits can bias enrichment efficiency, exon capture is a powerful approach for variant discovery in polyploids. Our results suggest that allopolyploid wheat can accumulate new variation in coding regions at a high rate. This process has the potential to broaden functional diversity and generate new phenotypic variation that eventually can play a critical role in the origin of new adaptations and important agronomic traits.
草物种适应各种生境的能力归因于其基因组的动态性质,这些基因组是由多轮古老和近期的多倍化塑造的。为了更好地了解多倍体基因组中功能相关区域的变异性质和程度,我们开发了一种序列捕获测定法来比较异源四倍体小麦品系的外显子序列。
设计了一种序列捕获测定法,用于靶向重测序 3.5 Mb 的外显子区域,总共从异源四倍体小麦中调查了 3497 个基因。这些数据用于描述编码区中的 SNP、拷贝数变异和同系序列分歧。开发了一种在多倍体基因组中发现变体的程序,并进行了实验验证。发现的 SNP 中约有 1%和 24%分别为无功能和非同义突变。在涉及翻译和代谢的几个基因组中,发现了替换突变的代表性不足。在栽培小麦品系中,基因重复占主导地位,而在野生小麦中,基因缺失多于基因重复。
我们证明,即使目标多倍体基因组和捕获诱饵之间的序列相似性水平可能会影响富集效率,但外显子捕获是多倍体变体发现的强大方法。我们的结果表明,异源四倍体小麦可以在编码区以高速度积累新的变异。这个过程有可能扩大功能多样性,并产生新的表型变异,最终在新适应和重要农艺性状的起源中发挥关键作用。