Pfeiffer Friedhelm, Bagyan Irina, Alfaro-Espinoza Gabriela, Zamora-Lagos Maria-A, Habermann Bianca, Marin-Sanguino Alberto, Oesterhelt Dieter, Kunte Hans J
Computational Biology Group, Max-Planck-Institute of Biochemistry, Martinsried, Germany.
Research and Development Division, bitop AG, Witten, Germany.
Microbiologyopen. 2017 Aug;6(4). doi: 10.1002/mbo3.465. Epub 2017 Mar 27.
The genome of the Halomonas elongata type strain DSM 2581, an industrial producer, was reevaluated using the Illumina HiSeq2500 technology. To resolve duplication-associated ambiguities, PCR products were generated and sequenced. Outside of duplications, 72 sequence corrections were required, of which 24 were point mutations and 48 were indels of one or few bases. Most of these were associated with polynucleotide stretches (poly-T stretch overestimated in 19 cases, poly-C underestimated in 15 cases). These problems may be attributed to using 454 technology for original genome sequencing. On average, the original genome sequence had only one error in 56 kb. There were 23 frameshift error corrections in the 29 protein-coding genes affected by sequence revision. The genome has been subjected to major reannotation in order to substantially increase the annotation quality.
利用Illumina HiSeq2500技术对工业生产菌株嗜盐栖热菌模式菌株DSM 2581的基因组进行了重新评估。为了解决与重复相关的歧义,生成了PCR产物并进行测序。在重复区域之外,需要进行72处序列校正,其中24处是点突变,48处是一个或几个碱基的插入或缺失。其中大多数与多核苷酸序列相关(19例中多聚T序列被高估,15例中多聚C序列被低估)。这些问题可能归因于最初使用454技术进行基因组测序。平均而言,原始基因组序列在56 kb中只有一个错误。在受序列修订影响的29个蛋白质编码基因中有23处移码错误校正。为了大幅提高注释质量,对该基因组进行了重大的重新注释。