Isidore Edwige, Scherrer Beatrice, Chalhoub Boulos, Feuillet Catherine, Keller Beat
Institute of Plant Biology, University of Zürich, 8008 Zürich, Switzerland.
Genome Res. 2005 Apr;15(4):526-36. doi: 10.1101/gr.3131005.
Plant genomes, in particular grass genomes, evolve very rapidly. The closely related A genomes of diploid, tetraploid, and hexaploid wheat are derived from a common ancestor that lived <3 million years ago and represent a good model to study molecular mechanisms involved in such rapid evolution. We have sequenced and compared physical contigs at the Lr10 locus on chromosome 1AS from diploid (211 kb), tetraploid (187 kb), and hexaploid wheat (154 kb). A maximum of 33% of the sequences were conserved between two species. The sequences from diploid and tetraploid wheat shared all of the genes, including Lr10 and RGA2 and define a first haplotype (H1). The 130-kb intergenic region between Lr10 and RGA2 was conserved in size despite its activity as a hot spot for transposon insertion, which resulted in >70% of sequence divergence. The hexaploid wheat sequence lacks both Lr10 and RGA2 genes and defines a second haplotype, H2, which originated from ancient and extensive rearrangements. These rearrangements included insertions of retroelements and transposons deletions, as well as unequal recombination within elements. Gene disruption in haplotype H2 was caused by a deletion and subsequent large inversion. Gene conservation between H1 haplotypes, as well as conservation of rearrangements at the origin of the H2 haplotype at three different ploidy levels indicate that the two haplotypes are ancient and had a stable gene content during evolution, whereas the intergenic regions evolved rapidly. Polyploidization during wheat evolution had no detectable consequences on the structure and evolution of the two haplotypes.
植物基因组,尤其是禾本科植物基因组,进化非常迅速。二倍体、四倍体和六倍体小麦的亲缘关系密切的A基因组源自一个生活在不到300万年前的共同祖先,是研究这种快速进化所涉及分子机制的良好模型。我们对来自二倍体(211 kb)、四倍体(187 kb)和六倍体小麦(154 kb)1AS染色体上Lr10位点的物理重叠群进行了测序和比较。两个物种之间最多33%的序列是保守的。二倍体和四倍体小麦的序列共享所有基因,包括Lr10和RGA2,并定义了第一个单倍型(H1)。尽管Lr10和RGA2之间的130 kb基因间区域作为转座子插入热点具有活性,但其大小是保守的,这导致了>70%的序列差异。六倍体小麦序列缺乏Lr10和RGA2基因,并定义了第二个单倍型H2,它起源于古老且广泛的重排。这些重排包括反转录元件的插入、转座子缺失以及元件内的不等交换。单倍型H2中的基因破坏是由一次缺失和随后的大倒位引起的。H1单倍型之间的基因保守性,以及在三个不同倍性水平上H2单倍型起源处重排的保守性表明,这两个单倍型很古老,在进化过程中基因含量稳定,而基因间区域进化迅速。小麦进化过程中的多倍体化对这两个单倍型的结构和进化没有可检测到的影响。