Crozier R H, Crozier Y C
Department of Genetics and Human Variation, La Trobe University, Bundoora, Victoria, Australia.
Genetics. 1993 Jan;133(1):97-117. doi: 10.1093/genetics/133.1.97.
The complete sequence of honeybee (Apis mellifera) mitochondrial DNA is reported being 16,343 bp long in the strain sequenced. Relative to their positions in the Drosophila map, 11 of the tRNA genes are in altered positions, but the other genes and regions are in the same relative positions. Comparisons of the predicted protein sequences indicate that the honeybee mitochondrial genetic code is the same as that for Drosophila; but the anticodons of two tRNAs differ between these two insects. The base composition shows extreme bias, being 84.9% AT (cf. 78.6% in Drosophila yakuba). In protein-encoding genes, the AT bias is strongest at the third codon positions (which in some cases lack guanines altogether), and least in second codon positions. Multiple stepwise regression analysis of the predicted products of the protein-encoding genes shows a significant association between the numbers of occurrences of amino acids and %T in codon family, but not with the number of codons per codon family or other parameters associated with codon family base composition. Differences in amino acid abundances are apparent between the predicted Apis and Drosophila proteins, with a relative abundance in the Apis proteins of lysine and a relative deficiency of alanine. Drosophila alanine residues are as often replaced by serine as conserved in Apis. The differences in abundances between Drosophila and Apis are associated with %AT in the codon families, and the degree of divergence in amino acid composition between proteins correlates with the divergence in %AT at the second codon positions. Overall, transversions are about twice as abundant as transitions when comparing Drosophila and Apis protein-encoding genes, but this ratio varies between codon positions. Marked excesses of transitions over chance expectation are seen for the third positions of protein-coding genes and for the gene for the small subunit of ribosomal RNA. For the third codon positions the excess of transitions is adequately explained as due to the restriction of observable substitutions to transitions for conserved amino acids with two-codon families; the excess of transitions over expectation for the small ribosomal subunit suggests that the conservation of nucleotide size is favored by selection.
据报道,所测序品系的蜜蜂(意大利蜜蜂)线粒体DNA完整序列长度为16343 bp。相对于它们在果蝇图谱中的位置,11个tRNA基因的位置发生了改变,但其他基因和区域的相对位置相同。对预测的蛋白质序列进行比较表明,蜜蜂线粒体遗传密码与果蝇的相同;但这两种昆虫的两个tRNA的反密码子不同。碱基组成表现出极端的偏向性,AT含量为84.9%(相比之下,雅库布果蝇为78.6%)。在蛋白质编码基因中,AT偏向在第三密码子位置最强(在某些情况下完全没有鸟嘌呤),在第二密码子位置最弱。对蛋白质编码基因预测产物的多元逐步回归分析表明,氨基酸出现次数与密码子家族中T%之间存在显著关联,但与每个密码子家族的密码子数量或与密码子家族碱基组成相关的其他参数无关。预测的蜜蜂和果蝇蛋白质之间氨基酸丰度存在明显差异,蜜蜂蛋白质中赖氨酸相对丰度较高,丙氨酸相对缺乏。果蝇中的丙氨酸残基在蜜蜂中被丝氨酸取代的频率与保守的频率相同。果蝇和蜜蜂之间丰度的差异与密码子家族中的AT%相关,蛋白质之间氨基酸组成的差异程度与第二密码子位置的AT%差异相关。总体而言,比较果蝇和蜜蜂的蛋白质编码基因时,颠换的数量约为转换的两倍,但该比例在不同密码子位置有所变化。在蛋白质编码基因的第三位置以及核糖体RNA小亚基基因中,观察到转换明显超过随机预期。对于第三密码子位置,转换的过量可以充分解释为由于对于具有两个密码子家族的保守氨基酸,可观察到的替换仅限于转换;核糖体小亚基转换超过预期表明选择有利于核苷酸大小的保守。