Shen Kai, Gladitz John, Antalis Patricia, Dice Bethany, Janto Benjamin, Keefe Randy, Hayes Jay, Ahmed Azad, Dopico Richard, Ehrlich Nathan, Jocz Jennifer, Kropp Laura, Yu Shujun, Nistico Laura, Greenberg David P, Barbadora Karen, Preston Robert A, Post J Christopher, Ehrlich Garth D, Hu Fen Z
Center for Genomic Sciences, Allegheny-Singer Research Institute, Allegheny General Hospital, 320 East North Avenue, 11th Floor South Tower, Pittsburgh, PA 15212, USA.
Infect Immun. 2006 Jan;74(1):321-30. doi: 10.1128/IAI.74.1.321-330.2006.
Eight low-passage-number Streptococcus pneumoniae clinical isolates, each of a different serotype and a different multilocus sequence type, were obtained from pediatric participants in a pneumococcal vaccine trial. Comparative genomic analyses were performed with these strains and two S. pneumoniae reference strains. Individual genomic libraries were constructed for each of the eight clinical isolates, with an average insert size of approximately 1 kb. A total of 73,728 clones were picked for arraying, providing more than four times genomic coverage per strain. A subset of 4,793 clones were sequenced, for which homology searches revealed that 750 (15.6%) of the sequences were unique with respect to the TIGR4 reference genome and 263 (5.5%) clones were unrelated to any available streptococcal sequence. Hypothetical translations of the open reading frames identified within these novel sequences showed homologies to a variety of proteins, including bacterial virulence factors not previously identified in S. pneumoniae. The distribution and expression patterns of 58 of these novel sequences among the eight clinical isolates were analyzed by PCR- and reverse transcriptase PCR-based analyses, respectively. These unique sequences were nonuniformly distributed among the eight isolates, and transcription of these genes in planktonic cultures was detected in 81% (172/212) of their genic occurrences. All 58 novel sequences were transcribed in one or more of the clinical strains, suggesting that they all correspond to functional genes. Sixty-five percent (38/58) of these sequences were found in 50% or less of the clinical strains, indicating a significant degree of genomic plasticity among natural isolates.
从一项肺炎球菌疫苗试验的儿科参与者中获得了8株低传代次数的肺炎链球菌临床分离株,每株具有不同的血清型和多位点序列类型。对这些菌株和两株肺炎链球菌参考菌株进行了比较基因组分析。为这8株临床分离株分别构建了个体基因组文库,平均插入片段大小约为1 kb。总共挑选了73,728个克隆用于阵列,每个菌株的基因组覆盖率超过四倍。对4,793个克隆的一个子集进行了测序,同源性搜索显示,其中750个(15.6%)序列相对于TIGR4参考基因组是独特的,263个(5.5%)克隆与任何可用的链球菌序列无关。对这些新序列中鉴定出的开放阅读框进行的假设翻译显示,它们与多种蛋白质具有同源性,包括以前在肺炎链球菌中未鉴定出的细菌毒力因子。分别通过基于PCR和逆转录酶PCR的分析,分析了这58个新序列在8株临床分离株中的分布和表达模式。这些独特序列在8株分离株中分布不均匀,在浮游培养物中这些基因的转录在其基因出现的81%(172/212)中被检测到。所有58个新序列在一种或多种临床菌株中都有转录,表明它们都对应于功能基因。这些序列中有65%(38/58)在50%或更少的临床菌株中被发现,这表明自然分离株之间存在显著程度的基因组可塑性。