Hallick R B, Hong L, Drager R G, Favreau M R, Monfort A, Orsat B, Spielmann A, Stutz E
Department of Biochemistry, University of Arizona, Tucson 85721.
Nucleic Acids Res. 1993 Jul 25;21(15):3537-44. doi: 10.1093/nar/21.15.3537.
We report the complete DNA sequence of the chloroplast genome of Euglena gracilis, Pringsheim strain Z. This circular DNA is 143,170 bp long, counting only one copy of a 54 bp tandem repeat sequence that is present in variable copy number within a single culture. The genome is organized as a tandem array of three complete and one partial ribosomal RNA operons, together with a large single copy region. There are genes for the 16S, 5S, and 23S rRNAs of the 70S chloroplast ribosomes, 27 different tRNA species, 21 ribosomal proteins plus the gene for elongation factor EF-Tu, three RNA polymerase subunits, and 27 known photosynthesis-related polypeptides. Several putative genes of unknown function have also been identified, including five within large introns and five with amino acid sequence similarity to genes in other organisms. This genome contains at least 149 introns: 72 individual group II introns; 46 individual group III introns; 10 group II introns and 18 group III introns that are components of twintrons (introns-within-introns); and three additional introns that are suspected to be twintrons composed of multiple group II and/or group III introns but have not yet been characterized. At least 54,804 bp, or 38.3% of the total DNA content, is represented by introns.
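The intron tally and the intron share of the genome quoted above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the dictionary labels are hypothetical names chosen here, and it assumes that each of the three uncharacterized suspected twintrons is counted as a single intron, which is the reading under which the categories sum to 149.

```python
# Figures taken from the abstract; the labels are illustrative, not the authors' terms.
GENOME_BP = 143_170   # genome length, counting one copy of the 54 bp repeat
INTRON_BP = 54_804    # minimum number of bp represented by introns

intron_counts = {
    "individual group II": 72,
    "individual group III": 46,
    "group II within twintrons": 10,
    "group III within twintrons": 18,
    # Assumption: each uncharacterized suspected twintron is counted once.
    "suspected twintrons (uncharacterized)": 3,
}

total_introns = sum(intron_counts.values())
intron_fraction = INTRON_BP / GENOME_BP

print(total_introns)             # 149
print(f"{intron_fraction:.1%}")  # 38.3%
```

Both results agree with the values reported in the abstract (at least 149 introns, 38.3% of the total DNA content).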