Sheppard H W, Gutman G A
Proc Natl Acad Sci U S A. 1981 Nov;78(11):7064-8. doi: 10.1073/pnas.78.11.7064.
The genes that code for two allotypic forms of the rat kappa light chain constant region (C(kappa)) have been cloned and the nucleotide sequence of 1172 base pairs of coding and flanking sequence has been determined for both alleles. These sequences have been compared to each other and to the corresponding sequences found in the mouse and human. Comparison of the LEW allele with mouse C(kappa) reveals two surprising features: (i) There is an unusually large number of amino acid substitutions (21) relative to the total number of nucleotide changes (37) in the coding region. Comparison among several other mammalian genes reveals a larger proportion of "silent" changes. (ii) The rate of accumulation of base substitutions is the same within the coding region as it is in some 870 base pairs of noncoding sequence (including 3' untranslated, 3' flanking, and 5' intervening sequences). Comparison of the two allelic forms of rat C(kappa) shows the same unusual features in more extreme form. (i) Twelve base substitutions in the coding region determine 11 amino acid differences; only one "silent" change exists. (ii) There are 12 base substitutions in the 318 base pairs of coding sequence (3.7% difference) and only 9 in the remaining 854 base pairs of noncoding DNA (1.1%), a highly significant difference. This degree of conservation of noncoding sequences and of "silent" sites within the coding region is unique among the mammalian genes studied thus far. These patterns suggest that there has been strong selection for conservation of nucleotide sequences, both inside and outside the coding region, independent of the selection required to maintain the function and characteristic structure of the immunoglobulin domain itself. The functions of the nucleotide sequences that account for this selective pressure are unclear at the present time.
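The allelic comparison in point (ii) can be checked with a short calculation. The sketch below recomputes the two substitution rates from the counts given in the abstract and then applies a two-sided Fisher exact test to the 2x2 table of substituted versus unchanged positions. The abstract does not say which test the authors used, so the choice of Fisher's test is an assumption made here for illustration, and the function names are hypothetical.

```python
from math import comb


def percent_diff(substitutions, length):
    """Percentage of positions that differ between the two alleles."""
    return 100.0 * substitutions / length


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed one.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Counts taken from the abstract (rat allele vs. rat allele).
coding_subs, coding_len = 12, 318
noncoding_subs, noncoding_len = 9, 854

coding_pct = percent_diff(coding_subs, coding_len)
noncoding_pct = percent_diff(noncoding_subs, noncoding_len)

# Assumed test: rows are coding / noncoding, columns are substituted / unchanged.
p_value = fisher_exact_two_sided(
    coding_subs, coding_len - coding_subs,
    noncoding_subs, noncoding_len - noncoding_subs,
)

print(f"coding: {coding_pct:.2f}%  noncoding: {noncoding_pct:.2f}%  p = {p_value:.4f}")
```

The coding-region rate works out to 12/318, or about 3.77%, which the abstract reports as 3.7%; the noncoding rate of 9/854 is about 1.05%, reported as 1.1%.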