Miller L, Gray L, Beachey E, Kehoe M
Department of Microbiology, Medical School, University of Newcastle upon Tyne, United Kingdom.
J Biol Chem. 1988 Apr 25;263(12):5668-73.
The 1479-base pair (bp) nucleotide sequence of the serotype 5 M protein gene (smp5) from Streptococcus pyogenes contains three distinct types of tandemly repeated sequences, designated A, B, and C. Repeat A (21 bp x 6, in the 5'-half of smp5), shares no homology with the types 6 or 24 M protein genes (Hollingshead, S. K., Fischetti, V. A., and Scott, J. R. (1986) J. Biol. Chem. 261, 1677-1686; Mouw, A. R., Beachey, E. H., and Burdett, V. (1988) J. Bacteriol., in press). Repeat B (75 bp x 3.6, in the center of smp5) is also present in the M6, but not in the M24 gene. Repeat C (105 bp x 2.7, just distal to the B repeats) shares homology with repeats in both the M6 and M24 genes. All three genes share extensive homology in their 3'-halves and in 5' sequences encoding the N-terminal signal peptides, but between these two regions there are highly variable sequences that are responsible for antigenic diversity. These relationships suggest that both intergenic and intragenic recombination has occurred during the evolution of distinct M protein serotypes. All three M proteins contain conserved hydrophobic and proline-rich sequences at their C-terminal ends, suggestive of a membrane anchor and a peptidoglycan spanning region.
化脓性链球菌5型M蛋白基因(smp5)的1479个碱基对(bp)核苷酸序列包含三种不同类型的串联重复序列,分别命名为A、B和C。重复序列A(21 bp×6,位于smp5的5'端一半)与6型或24型M蛋白基因没有同源性(霍林斯黑德,S.K.,菲谢蒂,V.A.,和斯科特,J.R.(1986年)《生物化学杂志》261,1677 - 1686;穆,A.R.,比奇,E.H.,和伯德特,V.(1988年)《细菌学杂志》,即将发表)。重复序列B(75 bp×3.6,位于smp5的中心)也存在于M6基因中,但不存在于M24基因中。重复序列C(105 bp×2.7,紧挨着B重复序列的远端)与M6和M24基因中的重复序列具有同源性。所有三个基因在其3'端一半以及编码N端信号肽的5'序列中具有广泛的同源性,但在这两个区域之间存在高度可变的序列,这些序列导致了抗原多样性。这些关系表明,在不同M蛋白血清型的进化过程中发生了基因间和基因内的重组。所有三种M蛋白在其C端都含有保守的疏水和富含脯氨酸的序列,提示存在膜锚定和肽聚糖跨膜区域。