Department of Cell and Molecular Biology, University of Mississippi Medical Center, Jackson, Mississippi, USA.
Department of Pharmacology and Toxicology, University of Mississippi Medical Center, Jackson, Mississippi, USA.
J Clin Microbiol. 2023 Mar 23;61(3):e0141222. doi: 10.1128/jcm.01412-22. Epub 2023 Feb 22.
Staphylococcus epidermidis infections can be challenging to diagnose due to the species frequent contamination of clinical specimens and indolent course of infection. Nevertheless, S. epidermidis is the major cause of late-onset sepsis among premature infants and of intravascular infection in all age groups. Prior work has shown that bacterial virulence factors, antimicrobial resistances, and strains have up to 80% in-sample accuracy to distinguish hospital from community sources, but are unable to distinguish true bacteremia from blood culture contamination. Here, a phylogeny-informed genome-wide association study of 88 isolates was used to estimate effect sizes of particular genomic variants for isolation sources. A "polygenic score" was calculated for each isolate as the summed effect sizes of its repertoire of genomic variants. Predictive models of isolation sources based on polygenic scores were tested with in-samples and out-samples from prior studies of different patient populations. Polygenic scores from accessory genes (AGs) distinguished hospital from community sources with the highest accuracy to date, up to 98% for in-samples and 65% to 91% for various out-samples, whereas scores from single nucleotide polymorphisms (SNPs) had lower accuracy. Scores from AGs and SNPs achieved the highest in-sample accuracy to date, up to 76%, in distinguishing infection from contaminant sources within a hospital. Model training and testing data sets with more similar population structures resulted in more accurate predictions. This study reports the first use of a polygenic score for predicting a complex bacterial phenotype and shows the potential of this approach for enhancing S. epidermidis diagnosis.
表皮葡萄球菌感染的诊断具有挑战性,因为该物种经常污染临床标本,且感染过程较为隐匿。然而,表皮葡萄球菌是早产儿迟发性败血症和所有年龄段人群血管内感染的主要原因。先前的研究表明,细菌毒力因子、抗生素耐药性和菌株具有高达 80%的样本内准确性,可区分医院和社区来源,但无法区分真正的菌血症与血培养污染。在这里,对 88 个分离株进行了基于系统发育的全基因组关联研究,以估计特定基因组变异体对分离源的效应大小。为每个分离株计算了“多基因评分”,即其基因组变异体谱的总和效应大小。基于多基因评分的分离源预测模型在来自不同患者群体的先前研究的内部和外部样本中进行了测试。辅助基因 (AG) 的多基因评分迄今为止能够以最高的准确性区分医院和社区来源,内部样本的准确率高达 98%,各种外部样本的准确率为 65%至 91%,而单核苷酸多态性 (SNP) 的准确率较低。AG 和 SNP 的评分在内部样本中达到了迄今为止最高的准确率,高达 76%,可区分医院内感染和污染来源。具有更相似人群结构的训练和测试数据集可实现更准确的预测。本研究首次报告了使用多基因评分预测复杂细菌表型的情况,并展示了该方法在增强表皮葡萄球菌诊断方面的潜力。