Molecular Bacteriology Laboratory, Université libre de Bruxelles, Brussels, Belgium.
Tropical Diseases Research Group, Murdoch Children's Research Institute, Melbourne, Australia.
Microbiology (Reading). 2023 Jan;169(1). doi: 10.1099/mic.0.001280.
Group A (GAS) M and M-like proteins are essential virulence factors and represent the primary epidemiological marker of this pathogen. Protein sequences encoding 1054 M, Mrp and Enn proteins, from 1668 GAS genomes, were analysed by SplitsTree4, partitioning around medoids and co-occurrence. The splits network and groups-based analysis of all M and M-like proteins revealed four large protein groupings, with multiple evolutionary histories as represented by multiple edges for most splits, leading to 'M-family-groups' (FG) of protein sequences: FG I, Mrp; FG II, M protein and Protein H; FG III, Enn; and FG IV, M protein. M and Enn proteins formed two groups with nine sub-groups and Mrp proteins formed four groups with ten sub-groups. Discrete co-occurrence of M and M-like proteins were identified suggesting that while dynamic, evolution may be constrained by a combination of functional and virulence attributes. At a granular level, four distinct family-groups of M, Enn and Mrp proteins are observable, with Mrp representing the most genetically distinct of the family-group of proteins. While M and Enn protein families generally group into three distinct family-groups, horizontal and vertical gene flow between distinct GAS strains is ongoing.
A 组(GAS)M 和类似 M 的蛋白是重要的毒力因子,也是该病原体的主要流行病学标志物。通过 SplitsTree4 对来自 1668 个 GAS 基因组的 1054 个 M、Mrp 和 Enn 蛋白的编码序列进行了分析,采用中位数分区和共同出现进行分区。所有 M 和类似 M 的蛋白的分支网络和基于分组的分析揭示了四个大型蛋白分组,大多数分支具有多个进化历史,代表了多个边缘,从而导致了 M 家族分组(FG)的蛋白序列:FG I,Mrp;FG II,M 蛋白和蛋白 H;FG III,Enn;和 FG IV,M 蛋白。M 和 Enn 蛋白形成了两个具有九个亚组的群组,而 Mrp 蛋白形成了四个具有十个亚组的群组。离散的 M 和类似 M 蛋白的共同出现表明,尽管是动态的,但进化可能受到功能和毒力属性的组合限制。在精细水平上,可以观察到四个不同的 M、Enn 和 Mrp 蛋白家族群组,其中 Mrp 代表了蛋白家族群组中最具遗传差异的蛋白。虽然 M 和 Enn 蛋白家族通常分为三个不同的家族群组,但不同 GAS 菌株之间的水平和垂直基因流仍在继续。