Laboratory of Molecular Epidemiology, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China.
Department of Laboratory Science, Maoming Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China.
Microbiol Spectr. 2024 Oct 3;12(10):e0049324. doi: 10.1128/spectrum.00493-24. Epub 2024 Sep 9.
() is a clinically significant opportunistic pathogen, which can colonize multiple body sites in healthy individuals and cause various life-threatening diseases in both children and adults worldwide. The genetic backgrounds of that cause infection versus asymptomatic carriage vary widely, but the potential genetic elements (k-mers) associated with infection remain unknown, which leads to difficulties in differentiating infection isolates from harmless colonizers. Here, we address the disease-associated k-mers by using a comprehensive genome-wide association study (GWAS) to compare the genetic variation of isolates from clinical infection sites (272 isolates) with nasal carriage (240 isolates). This study uncovers consensus evidence that certain k-mers are overrepresented in infection isolates compared with carriage isolates, indicating the presence of specific genetic elements associated with infection. Moreover, the random forest (RF) model achieved a classification accuracy of 77% for predicting disease status (infection vs carriage), with 68% accuracy for a single highest-ranked k-mer, providing a simple target for identifying high-risk genotypes. Our findings suggest that the disease-causing is a pathogenic subpopulation harboring unique genomic variation that promotes invasion and infection, providing novel targets for clinical interventions.
Defining the disease-causing isolates is the first step toward disease control. However, the disease-associated genetic elements of remain unknown, which leads to difficulties in differentiating infection isolates from harmless carriage isolates. Our comprehensive genome-wide association study (GWAS) found consensus evidence that certain genetic elements are overrepresented among infection isolates than carriage isolates, suggesting that the enrichment of disease-associated elements may promote infection. Notably, a single k-mer predictor achieved a high classification accuracy, which forms the basis for early diagnostics and interventions.
()是一种临床上重要的机会性病原体,它可以定植于健康个体的多个身体部位,并在全球范围内导致儿童和成人的各种危及生命的疾病。导致感染与无症状定植的遗传背景差异很大,但与感染相关的潜在遗传元素(k-mer)尚不清楚,这导致难以区分感染分离株与无害定植者。在这里,我们通过使用全基因组关联研究(GWAS)来解决与疾病相关的 k-mer 问题,比较来自临床感染部位(272 株)和鼻腔定植(240 株)的分离株的遗传变异。这项研究揭示了一致的证据,即与定植分离株相比,感染分离株中某些 k-mer 过度表达,表明存在与感染相关的特定遗传元素。此外,随机森林(RF)模型对预测疾病状态(感染与定植)的准确率达到 77%,单个最高排名的 k-mer 的准确率为 68%,为识别高风险基因型提供了一个简单的目标。我们的研究结果表明,导致疾病的 是一个携带独特基因组变异的致病性亚群,这种变异促进了入侵和感染,为临床干预提供了新的靶点。
确定致病分离株是控制疾病的第一步。然而,与 相关的遗传元素仍不清楚,这导致难以区分感染分离株与无害定植分离株。我们的全基因组关联研究(GWAS)发现了一致的证据,即某些遗传元素在感染分离株中比在定植分离株中过度表达,这表明与疾病相关的元素的富集可能促进了感染。值得注意的是,单个 k-mer 预测器实现了高分类准确率,这为早期诊断和干预提供了基础。