Nelson K, Selander R K
Institute of Molecular Evolutionary Genetics, Mueller Laboratory, Pennsylvania State University, University Park 16802.
J Bacteriol. 1992 Nov;174(21):6886-95. doi: 10.1128/jb.174.21.6886-6895.1992.
Virtually complete sequences (1,467 bp) of the proline permease gene (putP) and complete sequences (416 to 422 bp) of the control region of the proline utilization operon were determined for 16 strains of Salmonella, representing all eight subspecies, and 13 strains of Escherichia coli recovered from natural populations. Strains of Salmonella and E. coli differed, on average, at 16.3% of putP nucleotide sites and 17.5% of control region sites; the average difference between strains was much larger for Salmonella strains (4.6% of putP sites and 3.4% of control region sites) than for E. coli (2.4 and 0.9%, respectively). There was no difference in the distribution of polymorphic amino acid positions between the membrane-spanning and loop regions of the permease molecule, and rates of synonymous nucleotide substitution were virtually the same for the two domains. Statistical analysis yielded evidence of three probable cases of intragenic recombination, including the acquisition of a large segment of putP by strains of Salmonella subspecies VII from an unidentified source, the exchange of a 21-bp segment between two strains of E. coli, and the acquisition by one strain of E. coli of a cluster of 14 unique polymorphic control region sites from an unknown donor. An evolutionary tree for the putP and control region sequences was generally concordant with a tree for the gapA gene and a tree based on multilocus enzyme electrophoresis, thus providing evidence that for neither gene nor for enzyme genes in general has recombination occurred at rates sufficiently high or over regions sufficiently large to completely obscure phylogenetic relationships dependent on mutational divergence. It is suggested that the recombination rate varies among genes in relation to functional type, being highest for genes encoding cell surface and other proteins for which there is an adaptive advantage in structural diversity.
测定了16株沙门氏菌(代表所有8个亚种)和13株从自然群体中分离出的大肠杆菌脯氨酸通透酶基因(putP)的几乎完整序列(1467 bp)以及脯氨酸利用操纵子控制区的完整序列(416至422 bp)。沙门氏菌和大肠杆菌菌株在putP核苷酸位点上平均有16.3%的差异,在控制区位点上有17.5%的差异;沙门氏菌菌株之间的平均差异(putP位点的4.6%和控制区位点的3.4%)比大肠杆菌(分别为2.4%和0.9%)大得多。通透酶分子跨膜区和环区之间多态性氨基酸位置的分布没有差异,两个结构域的同义核苷酸替换率几乎相同。统计分析提供了三个可能的基因内重组案例的证据,包括VII亚种沙门氏菌菌株从一个未确定来源获得了一大段putP、两株大肠杆菌之间交换了一个21 bp的片段以及一株大肠杆菌从一个未知供体获得了一组14个独特的多态性控制区位点。putP和控制区序列的进化树总体上与gapA基因的树以及基于多位点酶电泳的树一致,因此提供了证据表明,对于该基因以及一般的酶基因,重组发生的速率不够高,或者发生的区域不够大,以至于不会完全模糊依赖于突变差异的系统发育关系。有人提出,重组率因基因的功能类型而异,对于编码细胞表面蛋白和其他在结构多样性方面具有适应性优势的蛋白的基因,重组率最高。