Department of Human Genetics, McGill University, Montreal, Canada.
J Clin Microbiol. 2011 Jun;49(6):2230-8. doi: 10.1128/JCM.00391-11. Epub 2011 Apr 20.
Clostridium difficile is a common cause of infectious diarrhea in hospitalized patients. The increased incidence and severity of C. difficile infection (CDI) are associated predominantly with the NAP1 strain; however, the existence of other severe-disease-associated (SDA) strains and the extensive genetic diversity across C. difficile complicate reliable detection and diagnosis. Comparative genome analysis of 14 sequenced genomes, including those of a subset of NAP1 isolates and five publicly available strains, allowed the assessment of genetic diversity within and between strain types to identify DNA markers that are associated with severe disease. This analysis revealed that C. difficile has a core genome of 3.4 Mb, comprising ~3,000 genes. Analysis of the core genome identified candidate DNA markers that were subsequently evaluated using a multistrain panel of 177 isolates, representing more than 50 pulsovars and 8 toxinotypes. A subset of 117 isolates from the panel had associated patient data, which allowed assessment of the association between the DNA markers and severe CDI. We identified 20 candidate DNA markers for species-wide detection and 10,683 single nucleotide polymorphisms (SNPs) associated with the predominant SDA strain (NAP1). A species-wide detection candidate marker, the sspA gene, was found to be the same across 177 sequenced isolates and lacked significant similarity to genes of other species. Candidate SNPs in genes CD1269 and CD1265 were found to associate more closely with disease severity than currently used diagnostic markers, as they were also present in the toxin A-negative, toxin B-positive (A-B+) strain types. The genetic markers identified illustrate the potential of comparative genomics for the discovery of diagnostic DNA-based targets that are species specific or associated with multiple SDA strains.
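The two ideas the abstract relies on, a core genome shared by every isolate and SNPs associated with one strain type, can be illustrated with a minimal sketch. This is not the authors' pipeline: the isolate names, gene names, positions, and alleles below are invented toy data, and the function names `core_genome` and `strain_associated_snps` are hypothetical. It assumes a simplified definition in which a core gene is present in every genome, and a strain-associated SNP is a position where all isolates of the target strain share one allele that no other isolate carries.

```python
def core_genome(gene_sets):
    """Return the genes present in every genome (plain set intersection)."""
    sets = [set(genes) for genes in gene_sets.values()]
    return set.intersection(*sets) if sets else set()


def strain_associated_snps(alleles, target):
    """Return sorted positions where all target isolates share a single
    allele and no non-target isolate carries that allele."""
    # Only consider positions that were called in every isolate.
    shared = set.intersection(*(set(calls) for calls in alleles.values()))
    hits = []
    for pos in sorted(shared):
        in_target = {alleles[name][pos] for name in target}
        in_others = {alleles[name][pos] for name in alleles if name not in target}
        if len(in_target) == 1 and in_target.isdisjoint(in_others):
            hits.append(pos)
    return hits


# Toy data (hypothetical): gene content per isolate.
genomes = {
    "isolate_A": {"gene1", "gene2", "gene3"},
    "isolate_B": {"gene1", "gene2", "gene4"},
    "isolate_C": {"gene1", "gene2", "gene3", "gene5"},
}

# Toy data (hypothetical): allele observed at each core-genome position.
alleles = {
    "isolate_A": {101: "A", 202: "G", 303: "T"},
    "isolate_B": {101: "A", 202: "C", 303: "T"},
    "isolate_C": {101: "C", 202: "C", 303: "T"},
}

core = core_genome(genomes)
snps = strain_associated_snps(alleles, target={"isolate_A", "isolate_B"})
print(sorted(core))  # genes shared by all three toy isolates
print(snps)          # positions that separate the target isolates from the rest
```

In this toy input, only position 101 separates the two target isolates from the third, because position 202 varies within the target group and position 303 is invariant. A real analysis would work from whole-genome alignments and would need to handle missing calls, recombination, and statistical testing against patient data, none of which this sketch attempts.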