Dickinson D P, Mirels L, Tabak L A, Gross K W
Department of Biological Chemistry, University of Texas Health Science Center, Houston 77025.
Mol Biol Evol. 1989 Jan;6(1):80-102. doi: 10.1093/oxfordjournals.molbev.a040534.
A survey of polypeptides encoded by RNA isolated from the submandibular glands of members of the Muridae (species of Mus and Rattus), in conjunction with cDNA cloning, has identified a class of salivary proteins that we term "spot proteins." Although clearly homologous, these proteins show dramatic differences between species in their polypeptide length. On the basis of the sequence of the corresponding clones, it is inferred that the rat spot 1 protein has a size of 6,370 daltons (Da), whereas that of the inbred mouse spot 1 is 11,603 Da. A second component is expressed in some stocks and strains of Mus, and this spot 2 protein has a size of up to 19,212 Da. The sizes of the corresponding mRNAs show parallel differences, and the variation in the sizes of mRNAs in different species of Mus correlates with the pattern of speciation, the size increasing with increased relatedness to inbred mice. The spot protein sequence comprises three domains: an N-terminal domain rich in hydroxy and acidic amino acids, a central domain consisting of repeats of a 9-amino-acid sequence, and a C-terminal domain that in the mouse is very basic. Variation in the number of repeats largely accounts for the differences in size between the mouse and rat mRNAs and their encoded polypeptides, and the coding sequence appears to have been expanding during speciation in the Muridae. There is extensive divergence in sequence between the mouse and rat mRNAs and their encoded proteins. The pattern of amino acid replacements and nucleotide substitutions is consistent with little, if any, selection constraint on the precise sequence of the spot proteins, suggesting that it is the overall architecture of the molecule, rather than the precise structure, that is important for function. There is strong evidence for a gene conversion event having occurred between the two mouse sequences. Frequent recombination by unequal crossing-over between spot protein coding sequences, if it occurs between active and silent genes, could account not only for the expansion in their size but also for their rapid divergence.
对从鼠科(小家鼠和大鼠物种)成员下颌下腺分离的RNA编码的多肽进行的一项调查,结合cDNA克隆,确定了一类唾液蛋白,我们将其称为“斑点蛋白”。尽管这些蛋白明显同源,但它们在不同物种间的多肽长度上表现出显著差异。根据相应克隆的序列推断,大鼠斑点1蛋白的大小为6370道尔顿(Da),而近交系小鼠斑点1蛋白的大小为11603 Da。在小家鼠的一些品系和菌株中表达了第二种成分,这种斑点2蛋白的大小可达19212 Da。相应mRNA的大小也呈现出平行差异,不同小家鼠物种中mRNA大小的变化与物种形成模式相关,随着与近交系小鼠亲缘关系的增加,大小也会增加。斑点蛋白序列包含三个结构域:富含羟基和酸性氨基酸的N端结构域、由一个9氨基酸序列重复组成的中央结构域,以及在小鼠中非常碱性的C端结构域。重复次数的变化在很大程度上解释了小鼠和大鼠mRNA及其编码多肽之间大小的差异,并且编码序列在鼠科物种形成过程中似乎一直在扩展。小鼠和大鼠的mRNA及其编码蛋白在序列上存在广泛的差异。氨基酸替换和核苷酸取代的模式表明,斑点蛋白的精确序列几乎没有受到选择限制(如果有也是极少的),这表明对于功能来说重要的是分子的整体结构,而不是精确的结构。有强有力的证据表明在两个小鼠序列之间发生了基因转换事件。如果斑点蛋白编码序列之间通过不等交换频繁重组,且发生在活性基因和沉默基因之间,那么这不仅可以解释它们大小的扩展,还可以解释它们的快速分化。