McMillan David J, Beiko R G, Geffers R, Buer Jan, Schouls L M, Vlaminckx B J M, Wannet W J B, Sriprakash K S, Chhatwal G S
German Research Centre for Biotechnology, Braunschweig, Germany.
Clin Infect Dis. 2006 Oct 1;43(7):884-91. doi: 10.1086/507537. Epub 2006 Aug 23.
The factors behind the reemergence of severe, invasive group A streptococcal (GAS) diseases are unclear, but it could be caused by altered genetic endowment in these organisms. However, data from previous studies assessing the association between single genetic factors and invasive disease are often conflicting, suggesting that other, as-yet unidentified factors are necessary for the development of this class of disease.
In this study, we used a targeted GAS virulence microarray containing 226 GAS genes to determine the virulence gene repertoires of 68 GAS isolates (42 associated with invasive disease and 28 associated with noninvasive disease) collected in a defined geographic location during a contiguous time period. We then employed 3 advanced machine learning methods (genetic algorithm neural network, support vector machines, and classification trees) to identify genes with an increased association with invasive disease.
Virulence gene profiles of individual GAS isolates varied extensively among these geographically and temporally related strains. Using genetic algorithm neural network analysis, we identified 3 genes with a marginal overrepresentation in invasive disease isolates. Significantly, 2 of these genes, ssa and mf4, encoded superantigens but were only present in a restricted set of GAS M-types. The third gene, spa, was found in variable distributions in all M-types in the study.
Our comprehensive analysis of GAS virulence profiles provides strong evidence for the incongruent relationships among any of the 226 genes represented on the array and the overall propensity of GAS to cause invasive disease, underscoring the pathogenic complexity of these diseases, as well as the importance of multiple bacteria and/or host factors.
A 组链球菌(GAS)严重侵袭性疾病再度出现的背后因素尚不清楚,但可能是这些病原体的基因禀赋发生了改变所致。然而,以往评估单一遗传因素与侵袭性疾病之间关联的研究数据往往相互矛盾,这表明这类疾病的发生还需要其他尚未明确的因素。
在本研究中,我们使用了一种包含226个GAS基因的靶向GAS毒力微阵列,来确定在一个特定地理位置连续时间段内收集的68株GAS分离株(42株与侵袭性疾病相关,28株与非侵袭性疾病相关)的毒力基因库。然后,我们采用3种先进的机器学习方法(遗传算法神经网络、支持向量机和分类树)来识别与侵袭性疾病关联增加的基因。
在这些地理和时间相关的菌株中,单个GAS分离株的毒力基因谱差异很大。使用遗传算法神经网络分析,我们鉴定出3个在侵袭性疾病分离株中略有过量表达的基因。值得注意的是,其中2个基因ssa和mf4编码超抗原,但仅存在于一组有限的GAS M型中。第三个基因spa在研究的所有M型中分布各不相同。
我们对GAS毒力谱的综合分析为阵列上所代表的226个基因中的任何一个与GAS导致侵袭性疾病的总体倾向之间不一致的关系提供了有力证据,强调了这些疾病的致病复杂性以及多种细菌和/或宿主因素的重要性。