Hanfstingl U, Berry A, Kellogg E A, Costa J T, Rüdiger W, Ausubel F M
Department of Genetics, Harvard Medical School, Boston, Massachusetts.
Genetics. 1994 Nov;138(3):811-28. doi: 10.1093/genetics/138.3.811.
We designate a region of the alcohol dehydrogenase locus (Adh) of the weedy crucifer, Arabidopsis thaliana, as "hypervariable" on the basis of a comparison of sequences from ecotypes Columbia and Landsberg. We found eight synonymous and two replacement mutations in the first 262 nucleotides of exon 4, and an additional two mutations in the contiguous region of intron 3. The rest of the sequence (2611 bp) has just three mutations, all of them confined to noncoding regions. Our survey of the hypervariable region among 37 ecotypes of A. thaliana revealed two predominant haplotypes, corresponding to the Columbia and Landsberg sequences. We identified five additional haplotypes and 4 additional segregating sites. The lack of haplotype diversity is presumably in part a function of low rates of recombination between haplotypes conferred by A. thaliana's tendency to self-fertilize. However, an analysis in 32 ecotypes of 12 genome-wide polymorphic markers distinguishing Columbia and Landsberg ecotypes indicated levels of outcrossing sufficient at least to erode linkage disequilibrium between dispersed markers. We discuss possible evolutionary explanations for the coupled observation of marked divergence within the hypervariable region and a lack of haplotype diversity among ecotypes. The sequence of the region for closely related species argues against the possibility that one allele is the product of introgression. We note (1) that several loss of function mutations (both naturally and chemically induced) map to the hypervariable region, and (2) the presence of two amino acid replacement polymorphisms, one of which causes the mobility difference between the two major classes of A. thaliana Adh electrophoretic alleles. We argue that protein polymorphism in such a functionally significant part of the molecule may be subject to balancing selection. The observed pattern of extensive divergence between the alleles is consistent with this explanation because balancing selection on a particular site maintains linked neutral polymorphisms at intermediate frequencies.
基于对拟南芥生态型哥伦比亚和兰茨贝格序列的比较,我们将杂草十字花科植物拟南芥乙醇脱氢酶基因座(Adh)的一个区域指定为“高变区”。我们在第4外显子的前262个核苷酸中发现了8个同义突变和2个替换突变,在第3内含子的相邻区域还发现了另外2个突变。序列的其余部分(2611 bp)只有3个突变,均局限于非编码区。我们对37个拟南芥生态型的高变区进行的调查揭示了两种主要单倍型,分别对应于哥伦比亚和兰茨贝格序列。我们还鉴定出另外5种单倍型和4个额外的分离位点。单倍型多样性的缺乏可能部分是由于拟南芥自花授粉倾向导致单倍型之间重组率较低的结果。然而,对区分哥伦比亚和兰茨贝格生态型的12个全基因组多态性标记在32个生态型中的分析表明,异交水平足以至少消除分散标记之间的连锁不平衡。我们讨论了对高变区内明显分化与生态型间单倍型多样性缺乏这一耦合观察结果的可能进化解释。密切相关物种该区域的序列排除了一个等位基因是基因渗入产物的可能性。我们注意到:(1)几个功能丧失突变(自然发生的和化学诱导的)定位于高变区;(2)存在两种氨基酸替换多态性,其中一种导致了拟南芥Adh电泳等位基因两大主要类型之间的迁移率差异。我们认为,分子中如此功能重要部分的蛋白质多态性可能受到平衡选择。观察到的等位基因间广泛分化模式与这一解释一致,因为对特定位点的平衡选择会使连锁中性多态性维持在中等频率。