School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia.
Microb Genom. 2021 Dec;7(12). doi: 10.1099/mgen.0.000704.
and enteroinvasive (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. Differentiation of from EIEC is important for clinical diagnostic and epidemiological investigations. However, phylogenetically, and EIEC strains are composed of multiple clusters and are different forms of , making it difficult to find genetic markers to discriminate between and EIEC. In this study, we identified 10 clusters, seven EIEC clusters and 53 sporadic types of EIEC by examining over 17000 publicly available and EIEC genomes. We compared and EIEC accessory genomes to identify cluster-specific gene markers for the 17 clusters and 53 sporadic types. The cluster-specific gene markers showed 99.64% accuracy and more than 97.02% specificity. In addition, we developed a freely available serotyping pipeline named EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established and EIEC serotype-specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired-end Illumina sequencing reads or assembled genomes and almost perfectly differentiated from EIEC with 99.70 and 99.74% cluster assignment accuracy for the assembled genomes and read mapping respectively. ShigEiFinder was able to serotype over 59 serotypes and 22 EIEC serotypes and provided a high specificity of 99.40% for assembled genomes and 99.38% for read mapping for serotyping. The cluster-specific gene markers and our new serotyping tool, ShigEiFinder (installable package: https://github.com/LanLab/ShigEiFinder, online tool: https://mgtdb.unsw.edu.au/ShigEiFinder/), will be useful for epidemiological and diagnostic investigations.
和侵袭性肠杆菌(EIEC)引起人类细菌性痢疾,具有相似的入侵机制,并具有相似的生理、生化和遗传特征。将 与 EIEC 区分开来,对于临床诊断和流行病学调查非常重要。然而,从系统发育的角度来看,和 EIEC 菌株由多个聚类组成,是 的不同形式,因此很难找到遗传标记来区分 和 EIEC。在这项研究中,我们通过检查超过 17000 个公开的 和 EIEC 基因组,鉴定了 10 个聚类,7 个 EIEC 聚类和 53 个散发性 EIEC 类型。我们比较了 和 EIEC 辅助基因组,以确定 17 个聚类和 53 个散发性类型的聚类特异性基因标记。聚类特异性基因标记的准确率为 99.64%,特异性超过 97.02%。此外,我们开发了一个免费的 血清型分析工具 ShigEiFinder,通过整合聚类特异性基因标记,并将 血清型特异性 O 抗原基因和修饰基因纳入分型,对 血清型进行分析。ShigEiFinder 可以处理成对的 Illumina 测序reads 或组装的基因组,并且几乎可以完美地区分 和 EIEC,组装基因组和 read 映射的聚类分配准确率分别为 99.70%和 99.74%。ShigEiFinder 能够对 59 个血清型和 22 个 EIEC 血清型进行血清型分析,并提供了组装基因组 99.40%和 read 映射 99.38%的高特异性。聚类特异性基因标记和我们的新血清型分析工具 ShigEiFinder(可安装包:https://github.com/LanLab/ShigEiFinder,在线工具:https://mgtdb.unsw.edu.au/ShigEiFinder/)将有助于流行病学和诊断研究。