Lanie Joel A, Ng Wai-Leung, Kazmierczak Krystyna M, Andrzejewski Tiffany M, Davidsen Tanja M, Wayne Kyle J, Tettelin Hervé, Glass John I, Winkler Malcolm E
Department of Biology, Indiana University Bloomington, Jordan Hall 142, Bloomington, IN 47405, USA.
J Bacteriol. 2007 Jan;189(1):38-51. doi: 10.1128/JB.01148-06. Epub 2006 Oct 13.
Streptococcus pneumoniae (pneumococcus) is a leading human respiratory pathogen that causes a variety of serious mucosal and invasive diseases. D39 is an historically important serotype 2 strain that was used in experiments by Avery and coworkers to demonstrate that DNA is the genetic material. Although isolated nearly a century ago, D39 remains extremely virulent in murine infection models and is perhaps the strain used most frequently in current studies of pneumococcal pathogenesis. To date, the complete genome sequences have been reported for only two S. pneumoniae strains: TIGR4, a recent serotype 4 clinical isolate, and laboratory strain R6, an avirulent, unencapsulated derivative of strain D39. We report here the genome sequences and new annotation of two different isolates of strain D39 and the corrected sequence of strain R6. Comparisons of these three related sequences allowed deduction of the likely sequence of the D39 progenitor and mutations that arose in each isolate. Despite its numerous repeated sequences and IS elements, the serotype 2 genome has remained remarkably stable during cultivation, and one of the D39 isolates contains only five relatively minor mutations compared to the deduced D39 progenitor. In contrast, laboratory strain R6 contains 71 single-base-pair changes, six deletions, and four insertions and has lost the cryptic pDP1 plasmid compared to the D39 progenitor strain. Many of these mutations are in or affect the expression of genes that play important roles in regulation, metabolism, and virulence. The nature of the mutations that arose spontaneously in these three strains, the relative global transcription patterns determined by microarray analyses, and the implications of the D39 genome sequences to studies of pneumococcal physiology and pathogenesis are presented and discussed.
肺炎链球菌是一种主要的人类呼吸道病原体,可引发多种严重的黏膜和侵袭性疾病。D39是一株具有历史意义的2型菌株,艾弗里及其同事曾用它进行实验,以证明DNA是遗传物质。尽管D39是近一个世纪前分离得到的,但在小鼠感染模型中它仍具有极强的毒性,可能是目前肺炎链球菌致病机制研究中使用最频繁的菌株。迄今为止,仅报道了两株肺炎链球菌的全基因组序列:TIGR4,一株近期分离的4型临床菌株;以及实验室菌株R6,它是D39菌株的无毒、无荚膜衍生物。我们在此报告了D39菌株两个不同分离株的基因组序列和新注释,以及R6菌株的校正序列。对这三个相关序列的比较有助于推断D39祖先菌株的可能序列以及每个分离株中出现的突变。尽管2型基因组有大量重复序列和插入序列元件,但在培养过程中其基因组仍保持显著稳定,其中一个D39分离株与推断的D39祖先菌株相比仅含有五个相对较小的突变。相比之下,实验室菌株R6含有71个单碱基对变化、六个缺失和四个插入,并且与D39祖先菌株相比丢失了隐蔽的pDP1质粒。这些突变中的许多位于或影响在调控、代谢和毒力中起重要作用的基因的表达。本文展示并讨论了这三株菌株中自发产生的突变的性质、通过微阵列分析确定的相对全局转录模式,以及D39基因组序列对肺炎链球菌生理学和致病机制研究的意义。