Power Peter M, Sweetman W A, Gallacher N J, Woodhall M R, Kumar G A, Moxon E R, Hood D W
Molecular Infectious Diseases Group, Department of Paediatrics, Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DS, UK.
Infect Genet Evol. 2009 Mar;9(2):216-28. doi: 10.1016/j.meegid.2008.11.006. Epub 2008 Nov 28.
Simple sequence repeat (SSRs) of DNA are subject to high rates of mutation and are important mediators of adaptation in Haemophilus influenzae. Previous studies of the Rd KW20 genome identified the primacy of tetranucleotide SSRs in mediating phase variation (the rapid reversible switching of gene expression) of surface exposed structures such as lipopolysaccharide. The recent sequencing of the genomes of multiple strains of H. influenzae allowed the comparison of the SSRs (repeat units of one to nine nucleotides in length) in detail across four complete H. influenzae genomes and then comparison with a further 12 genomes when they became available. The SSR loci were broadly classified into three groups: (1) those that did not vary; (2) those for which some variation between strains was observed but this could not be linked to variation of gene expression; and (3) those that both varied and were located in regions consistent with mediating phase variable gene expression. Comparative analysis of 988 SSR associated loci confirmed that tetranucleotide repeats were the major mediators of phase variation and extended the repertoire of known tetranucleotide SSR loci by identifying ten previously uncharacterised tetranucleotide SSR loci with the potential to mediate phase variation which were unequally distributed across the H. influenzae pan-genome. Further, analysis of non-tetranucleotide SSR in the 16 strains revealed a number of mononucleotide, dinucleotide, pentanucleotide, heptanucleotide, and octanucleotide SSRs which were consistent with these tracts mediating phase variation. This study substantiates previous findings as to the important role that tetranucleotide SSRs play in H. influenzae biology. Two Brazilian isolates showed the most variation in their complement of SSRs suggesting the possibility of geographic and phenotypic influences on SSR distribution.
DNA的简单序列重复(SSRs)具有很高的突变率,是流感嗜血杆菌适应性的重要调节因子。先前对Rd KW20基因组的研究确定了四核苷酸SSRs在介导表面暴露结构(如脂多糖)的相变(基因表达的快速可逆切换)中的首要地位。最近对多株流感嗜血杆菌基因组的测序使得能够详细比较四个完整流感嗜血杆菌基因组中的SSRs(长度为1至9个核苷酸的重复单元),随后在另外12个基因组可用时进行比较。SSRs位点大致分为三类:(1)不变的位点;(2)观察到菌株间存在一些变异但这与基因表达变异无关的位点;(3)既发生变异又位于与介导相变基因表达一致区域的位点。对988个与SSRs相关的位点进行比较分析证实,四核苷酸重复是相变的主要调节因子,并通过鉴定十个先前未表征的具有介导相变潜力的四核苷酸SSRs位点扩展了已知四核苷酸SSRs位点的库,这些位点在流感嗜血杆菌泛基因组中分布不均。此外,对16个菌株中的非四核苷酸SSRs分析揭示了一些单核苷酸、二核苷酸、五核苷酸、七核苷酸和八核苷酸SSRs,这些与这些区域介导相变一致。本研究证实了先前关于四核苷酸SSRs在流感嗜血杆菌生物学中所起重要作用的发现。两株巴西分离株在其SSRs互补序列中表现出最大的变异,这表明地理和表型对SSRs分布可能存在影响。