Sun Jibin, Gunzer Florian, Westendorf Astrid M, Buer Jan, Scharfe Maren, Jarek Michael, Gössling Frank, Blöcker Helmut, Zeng An-Ping
GBF - German Research Centre for Biotechnology, Experimental Bioinformatics, Mascheroder Weg 1, D-38124 Braunschweig, Germany.
J Biotechnol. 2005 May 4;117(2):147-61. doi: 10.1016/j.jbiotec.2005.01.008.
Probiotic Escherichia coli strain Nissle 1917 (O6:K5:H1) is a commensal E. coli isolate that has a long tradition in medicine for the treatment of various intestinal disorders in humans. To elucidate the molecular basis of its probiotic nature, we started sequencing the genome of this organism with a whole-genome shotgun approach. A 7.8-fold coverage of the genomic sequence has been generated and is now in the finishing stage. To exploit the genome data as early as possible and to generate hypotheses for functional studies, the unfinished sequencing data were analyzed in this work using a new method [Sun, J., Zeng, A.P., 2004. IdentiCS--identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence. BMC Bioinformatics 5, 112] which is particularly suitable for the prediction of coding sequences (CDSs) from unannotated genome sequence. The CDSs predicted for E. coli Nissle 1917 were compared with those of all five other sequenced E. coli strains (E. coli K-12 MG1655, E. coli K-12 W3110, E. coli CFT073, EHEC O157:H7 EDL933 and EHEC O157:H7 Sakai) published to date. Five thousand one hundred and ninety-two CDSs were predicted for E. coli Nissle 1917, of which 1065 were assigned with enzyme EC numbers. The comparison of all predicted CDSs of E. coli Nissle 1917 to the other E. coli strains revealed 108 CDSs specific for this isolate. They are organized as four big genome islands and many other smaller gene clusters. Based on CDSs with EC numbers for enzymes, the potential metabolic network of Nissle 1917 was reconstructed and compared to those of the other five E. coli strains. Overall, the comparative genomic analysis sheds light on the genomic peculiarity of the probiotic E. coli strain Nissle 1917 and is helpful for designing further functional studies long before the sequencing project is completely finished.
益生菌大肠杆菌菌株Nissle 1917(O6:K5:H1)是一种共生大肠杆菌分离株,在医学上长期用于治疗人类各种肠道疾病。为了阐明其益生菌特性的分子基础,我们开始采用全基因组鸟枪法对该菌株的基因组进行测序。目前已获得该基因组序列7.8倍的覆盖度,测序工作正处于收尾阶段。为了尽早利用基因组数据并为功能研究提出假设,我们在这项工作中使用了一种新方法[Sun, J., Zeng, A.P., 2004. IdentiCS--identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence. BMC Bioinformatics 5, 112]对未完成的测序数据进行分析,该方法特别适用于从未注释的基因组序列中预测编码序列(CDS)。将预测的大肠杆菌Nissle 1917的CDS与迄今已公布的其他五种已测序大肠杆菌菌株(大肠杆菌K-12 MG1655、大肠杆菌K-12 W3110、大肠杆菌CFT073、肠出血性大肠杆菌O157:H7 EDL933和肠出血性大肠杆菌O157:H7 Sakai)的CDS进行比较。预测大肠杆菌Nissle 1917有5192个CDS,其中1065个被赋予了酶的EC编号。将大肠杆菌Nissle 1917所有预测的CDS与其他大肠杆菌菌株进行比较,发现该分离株有108个特异的CDS。它们组成了四个大的基因组岛和许多其他较小的基因簇。基于具有酶EC编号的CDS,重建了Nissle 1917的潜在代谢网络,并与其他五种大肠杆菌菌株的代谢网络进行比较。总体而言,比较基因组分析揭示了益生菌大肠杆菌菌株Nissle 1917的基因组特性,并且在测序项目完全完成之前很久就有助于设计进一步的功能研究。