Institute for Genome Sciences, Department of Microbiology and Immunology, University of Maryland School of Medicine, 801 W. Baltimore Street, Suite 600, Baltimore, MD, 21201, USA.
Translational Genomics Research Institute, Flagstaff, AZ, 86001, USA.
Sci Rep. 2017 Jun 13;7(1):3402. doi: 10.1038/s41598-017-03631-x.
Enterotoxigenic Escherichia coli (ETEC) cause more than 500,000 deaths each year in the developing world and are characterized on a molecular level by the presence of genes that encode the heat-stable (ST) and/or heat-labile (LT) enterotoxins, as well as surface structures, known as colonization factors (CFs). Genome sequencing and comparative genomic analyses of 94 previously uncharacterized ETEC isolates demonstrated remarkable genomic diversity, with 28 distinct sequence types identified in three phylogenomic groups. Interestingly, there is a correlation between the genomic sequence type and virulence factor profiles based on prevalence of the isolate, suggesting that there is an optimal combination of genetic factors required for survival, virulence and transmission in the most successful clones. A large-scale BLAST score ratio (LS-BSR) analysis was further applied to identify ETEC-specific genomic regions when compared to non-ETEC genomes, as well as genes that are more associated with clinical presentations or other genotypic markers. Of the strains examined, 21 of 94 ETEC isolates lacked any previously identified CF. Homology searches with the structural subunits of known CFs identified 6 new putative CF variants. These studies provide a roadmap to exploit genomic analyses by directing investigations of pathogenesis, virulence regulation and vaccine development.
肠产毒性大肠杆菌(ETEC)每年在发展中国家导致超过 50 万人死亡,其分子特征是存在编码热稳定(ST)和/或热不稳定(LT)肠毒素的基因,以及称为定植因子(CF)的表面结构。对 94 株先前未表征的 ETEC 分离株进行基因组测序和比较基因组分析表明,存在显著的基因组多样性,在三个系统发育群中鉴定出 28 个不同的序列类型。有趣的是,基于分离株的流行情况,基因组序列类型与毒力因子谱之间存在相关性,这表明在最成功的克隆中,存在生存、毒力和传播所需的最佳遗传因素组合。进一步应用大规模 BLAST 得分比(LS-BSR)分析,以比较非 ETEC 基因组来识别 ETEC 特异性基因组区域,以及与临床表型或其他基因型标志物更相关的基因。在所检查的菌株中,94 株 ETEC 分离株中有 21 株缺乏任何先前鉴定的 CF。与已知 CF 的结构亚基进行同源搜索,鉴定出 6 种新的推定 CF 变体。这些研究为利用基因组分析提供了路线图,指导发病机制、毒力调节和疫苗开发的研究。