Sugita Chieko, Ogata Koretsugu, Shikata Masamitsu, Jikuya Hiroyuki, Takano Jun, Furumichi Miho, Kanehisa Minoru, Omata Tatsuo, Sugiura Masahiro, Sugita Mamoru
Center for Gene Research, Nagoya University, Chikusa, Nagoya 464-8602, Japan.
Photosynth Res. 2007 Jul-Sep;93(1-3):55-67. doi: 10.1007/s11120-006-9122-4. Epub 2007 Jan 9.
The entire genome of the unicellular cyanobacterium Synechococcus elongatus PCC 6301 (formerly Anacystis nidulans Berkeley strain 6301) was sequenced. The genome consisted of a circular chromosome 2,696,255 bp long. A total of 2,525 potential protein-coding genes, two sets of rRNA genes, 45 tRNA genes representing 42 tRNA species, and several genes for small stable RNAs were assigned to the chromosome by similarity searches and computer predictions. The translated products of 56% of the potential protein-coding genes showed sequence similarities to experimentally identified and predicted proteins of known function, and the products of 35% of the genes showed sequence similarities to the translated products of hypothetical genes. The remaining 9% of genes lacked significant similarities to genes for predicted proteins in the public DNA databases. Some 139 genes coding for photosynthesis-related components were identified. Thirty-seven genes for two-component signal transduction systems were also identified. This is the smallest number of such genes identified in cyanobacteria, except for marine cyanobacteria, suggesting that only simple signal transduction systems are found in this strain. The gene arrangement and nucleotide sequence of Synechococcus elongatus PCC 6301 were nearly identical to those of a closely related strain Synechococcus elongatus PCC 7942, except for the presence of a 188.6 kb inversion. The sequences as well as the gene information shown in this paper are available in the Web database, CYORF (http://www.cyano.genome.jp/).
对单细胞蓝藻聚球藻属细长聚球藻PCC 6301(以前称为巢状集胞藻伯克利菌株6301)的全基因组进行了测序。该基因组由一条长度为2,696,255 bp的环状染色体组成。通过相似性搜索和计算机预测,总共2,525个潜在的蛋白质编码基因、两组rRNA基因、代表42种tRNA种类的45个tRNA基因以及几个小的稳定RNA基因被定位到该染色体上。56%的潜在蛋白质编码基因的翻译产物与实验鉴定和预测的已知功能蛋白质具有序列相似性,35%的基因产物与假设基因的翻译产物具有序列相似性。其余9%的基因与公共DNA数据库中预测蛋白质的基因缺乏显著相似性。鉴定出约139个编码光合作用相关成分的基因。还鉴定出37个编码双组分信号转导系统的基因。这是在蓝藻中鉴定出的此类基因的最小数量,海洋蓝藻除外,这表明在该菌株中仅发现了简单的信号转导系统。除了存在一个188.6 kb的倒位外,细长聚球藻PCC 6301的基因排列和核苷酸序列与密切相关的菌株细长聚球藻PCC 7942几乎相同。本文中显示的序列以及基因信息可在网络数据库CYORF(http://www.cyano.genome.jp/)中获得。