Cushman J C, Christopher D A, Little M C, Hallick R B, Price C A
Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ 08854.
Curr Genet. 1988 Feb;13(2):173-80. doi: 10.1007/BF00365652.
The genes for cytochrome b559, designated psbE and psbF, and two highly conserved open reading frames of 38 and 42 codons have been located and characterized on the chloroplast genome of Euglena gracilis. The organization of the genes is psbE - 8 bp spacer - psbF - 110 bp spacer - orf38 - 87 bp spacer - orf42. All genes are of the same polarity. The psbE gene contains two introns of 350 and 326 bp. The psbF gene contains a single large intron of 1,042 bp. The orf38 and orf42 loci lack introns. The introns are extremely AT rich with a pronounced base composition bias of T greater than A greater than G greater than C in the mRNA-like strand and group II-like boundary sequences at their 3' and 5' ends having the consensus 5'-GTGTG .. INTRON .. TTAATTTNAT-3'. The psbE gene consists of 82 codons and encodes a polypeptide with a predicted molecular weight of 9,212. The psbF gene consists of 42 codons, which specify a polypeptide with a predicted molecular weight of 4,785. The highly conserved open reading frames of 38 and 42 codons code for polypeptides with predicted molecular weights of 4,405 and 4,426, respectively. The gene products of psbE, psbF, orf38 and orf42 are, respectively, 69.5%, 70% and 61.5% identical to those found in higher plants. The predicted secondary structure of the proteins from hydropathy plots is consistent with each containing a single membrane-spanning domain of at least 20 amino acids. Each of the genes is preceded by sequences which may serve as ribosome binding sites. All four genes are transcribed.
细胞色素b559的基因,命名为psbE和psbF,以及两个由38和42个密码子组成的高度保守的开放阅读框,已在纤细裸藻的叶绿体基因组中定位并进行了特征分析。这些基因的组织形式为:psbE - 8个碱基对的间隔区 - psbF - 110个碱基对的间隔区 - orf38 - 87个碱基对的间隔区 - orf42。所有基因具有相同的极性。psbE基因包含两个分别为350和326个碱基对的内含子。psbF基因包含一个1042个碱基对的大内含子。orf38和orf42位点没有内含子。这些内含子富含AT,在类似mRNA的链中碱基组成明显偏向于T大于A大于G大于C,并且在其3'和5'端具有类似II类的边界序列,共有序列为5'-GTGTG..内含子..TTAATTTNAT-3'。psbE基因由82个密码子组成,编码一个预测分子量为9212的多肽。psbF基因由42个密码子组成,指定一个预测分子量为4785的多肽。由38和42个密码子组成的高度保守的开放阅读框分别编码预测分子量为4405和4426的多肽。psbE、psbF、orf38和orf42的基因产物分别与高等植物中发现的基因产物有69.5%、70%和61.5%的同一性。根据亲水性图谱预测的蛋白质二级结构与每个蛋白质都包含至少20个氨基酸的单个跨膜结构域一致。每个基因之前都有可能作为核糖体结合位点的序列。所有四个基因都被转录。