Gingrich J C, Hallick R B
J Biol Chem. 1985 Dec 25;260(30):16156-61.
The nucleotide sequence of 6225 base pairs (bp) of Euglena gracilis chloroplast DNA is presented, including the complete sequence of the chloroplast-encoded ribulose-1,5-bisphosphate carboxylase large subunit gene (rbcL) and its flanking DNA sequences. The gene is more than 5.5 kilobase pairs long and is organized as 10 exons, coding for 475 amino acids, separated by 9 introns. The exons range in size from 45 to 438 bp, and the introns from 382 to 568 bp. The introns have highly conserved boundary sequences with the consensus 5'-N GTGTGGATTT...(intron)...TTAATTTTAT N-3'. The introns are 82-85 mol% AT, with a pronounced T > A > G > C base bias in the RNA-like strand. They do not appear to encode any polypeptides. In addition, the introns have a conserved sequence 30-50 bp from their 3'-ends with the consensus 5'-TACAGTTTGAAAATGA-3'. The 5'-TACA portion of this sequence bears some homology to the 5'-end of the TACTAACA sequence found in a similar location in yeast nuclear mRNA introns. The conserved sequences of the Euglena rbcL introns may be indicative of a splicing mechanism similar to that of eukaryotic nuclear mRNA introns and group II mitochondrial introns.
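The intron features reported in the abstract (boundary consensus sequences, AT content, and a conserved internal sequence near the 3'-end) can be checked on a candidate intron with a few lines of Python. The following is only a minimal sketch, not the authors' method. The function names are hypothetical. The toy intron is synthetic, built here from the published consensus strings plus arbitrary filler, and is not sequence data from the paper. The distance convention (bases from the first base of the internal motif to the 3'-end of the intron) is an assumption, since the abstract does not say how the 30-50 bp distance was measured.

```python
# Consensus strings quoted in the abstract.
FIVE_PRIME = "GTGTGGATTT"      # first 10 intron bases
THREE_PRIME = "TTAATTTTAT"     # last 10 intron bases
INTERNAL = "TACAGTTTGAAAATGA"  # conserved sequence near the 3'-end


def at_fraction(seq):
    """Fraction of bases in seq that are A or T."""
    seq = seq.upper()
    return sum(base in "AT" for base in seq) / len(seq)


def mismatches(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def best_internal_match(intron, motif=INTERNAL):
    """Return (mismatch_count, distance) for the best match to motif.

    distance is counted from the first base of the match to the 3'-end
    of the intron (an assumed convention). Ties keep the 5'-most match.
    """
    intron = intron.upper()
    best = None
    for start in range(len(intron) - len(motif) + 1):
        count = mismatches(intron[start:start + len(motif)], motif)
        if best is None or count < best[0]:
            best = (count, len(intron) - start)
    return best


def summarize(intron):
    """Collect the checks above into one dictionary."""
    intron = intron.upper()
    internal_mm, internal_dist = best_internal_match(intron)
    return {
        "length": len(intron),
        "at_fraction": at_fraction(intron),
        "five_prime_mismatches": mismatches(intron[:10], FIVE_PRIME),
        "three_prime_mismatches": mismatches(intron[-10:], THREE_PRIME),
        "internal_mismatches": internal_mm,
        "internal_distance_from_3prime": internal_dist,
    }


# Synthetic toy intron (NOT real Euglena sequence): consensus pieces
# joined by arbitrary A/T filler, far shorter than the real introns.
toy_intron = FIVE_PRIME + "AT" * 20 + INTERNAL + "TA" * 10 + THREE_PRIME

if __name__ == "__main__":
    for key, value in summarize(toy_intron).items():
        print(key, value)
```

Counting mismatches rather than requiring an exact match reflects that the published strings are consensus sequences, so individual introns may deviate from them at some positions.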