Baltz R H, Norris F H, Matsushima P, DeHoff B S, Rockey P, Porter G, Burgett S, Peery R, Hoskins J, Braverman L, Jenkins I, Solenberg P, Young M, McHenney M A, Skatrud P L, Rosteck P R
Lilly Research Laboratories, Division of Eli Lilly and Company, Indianapolis, IN 46285, USA.
Microb Drug Resist. 1998 Spring;4(1):1-9. doi: 10.1089/mdr.1998.4.1.
We initiated a survey of the Streptococcus pneumoniae genome by DNA sequence sampling. More than 9,500 random DNA sequences of approximately 500 bases average length were determined. Partial sequences sufficient to identify approximately 95% of the aminoacyl tRNA synthetase genes and ribosomal protein (rps) genes were found by comparing the database of partial sequences to known sequences from other organisms. Many genes involved in DNA replication, repair, and mutagenesis are present in S. pneumoniae. Genes for the major subunits of RNA polymerase are also present, as are genes for two alternative sigma factors, rpoD and rpoN. Many genes necessary for amino acid or cofactor biosynthesis and aerobic energy metabolism in other bacteria appear to be absent from the S. pneumoniae genome. A number of genes involved in cell wall biosynthesis and septation were identified, including six homologs to different penicillin binding proteins. Interestingly, four genes involved in the addition of D-alanine to lipoteicoic acid in other gram positive bacteria were found, even though the lipoteicoic acid in S. pneumoniae has not been shown to contain D-alanine. The S. pneumoniae genome contains a number of chaperonin genes similar to those found in other bacteria, but apparently does not contain genes involved in the type III secretion commonly observed in gram negative pathogens. The G+C content of S. pneumoniae genomic DNA is approximately 43 mole percent and the size of the genome is approximately 2.0 Mb as determined by pulsed-field gel electrophoresis. Many of the genes identified by sequence sampling have been physically mapped to the 19 different SmaI fragments derived from the S. pneumoniae genome. The database of random genome sequence tags (GSTs) provides the starting material for determining the complete genome sequence, gene disruption analysis, and comparative genomics to identify novel targets for antibiotic development.
我们通过DNA序列抽样对肺炎链球菌基因组进行了一项调查。测定了9500多个平均长度约为500个碱基的随机DNA序列。通过将部分序列数据库与其他生物的已知序列进行比较,发现了足以鉴定约95%的氨酰基tRNA合成酶基因和核糖体蛋白(rps)基因的部分序列。肺炎链球菌中存在许多参与DNA复制、修复和诱变的基因。RNA聚合酶主要亚基的基因也存在,还有两个替代sigma因子rpoD和rpoN的基因。肺炎链球菌基因组中似乎没有其他细菌中许多氨基酸或辅因子生物合成以及有氧能量代谢所必需的基因。鉴定出了一些参与细胞壁生物合成和隔膜形成的基因,包括六个与不同青霉素结合蛋白同源的基因。有趣的是,发现了四个在其他革兰氏阳性细菌中参与将D-丙氨酸添加到脂磷壁酸中的基因,尽管肺炎链球菌中的脂磷壁酸尚未显示含有D-丙氨酸。肺炎链球菌基因组包含许多与其他细菌中发现的类似的伴侣蛋白基因,但显然不包含革兰氏阴性病原体中常见的III型分泌相关基因。通过脉冲场凝胶电泳测定,肺炎链球菌基因组DNA的G+C含量约为43摩尔百分比,基因组大小约为2.0 Mb。通过序列抽样鉴定出的许多基因已被物理定位到源自肺炎链球菌基因组的19个不同的SmaI片段上。随机基因组序列标签(GST)数据库为确定完整基因组序列、基因破坏分析和比较基因组学提供了起始材料,以识别抗生素开发的新靶点。