Department of Biochemistry, University of Georgia, 30602, Athens, GA, USA.
Plant Mol Biol. 1987 Nov;9(6):533-46. doi: 10.1007/BF00020531.
The α globulin storage protein genes of cotton are found to exist as gene tandems that contain a gene from each of the 2 α globulin subfamilies separated by a spacer region of about 2700 or 3400 base pairs. Three different tandems have been identified by restriction endonuclease mapping of genomic DNA. A cDNA that is different from the genes of the tandems in map sites and/or in nucleotide sequence indicates that a fourth tandem probably exists in the cotton genome. Since the species of cotton used here (Gossypium hirsutum) is an amphidiploid, it is likely that two of the tandems are contributed by each genome. Considerable divergence in nucleotide sequence (18%) and in derived amino acid sequence (28%) is found when the 2 genes of a sequenced tandem are compared. The sequence of the cDNA closely resembles one of the genes in the tandem, showing only a 4% divergence in nucleotides and a 4.2% divergence in amino acids. Thus the 2 genes of each tandem represent a relatively ancient gene duplication that has given rise to the 2 α globulin subfamilies of cotton. Only one subfamily has a glycosylation site, and the glycosylation of its derived proteins gives rise to the 2 molecular weight sets of α globulins seen on gel electrophoresis. Other basic features of these genes and their derived proteins are presented.