不同生物信息学方法从短读基因组数据中识别耐药基因的差异，重点是.

Discordance between different bioinformatic methods for identifying resistance genes from short-read genomic data, with a focus on .

机构信息

Nuffield Department of Medicine, Oxford University, Oxford, UK.

National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford, Oxford, UK.

出版信息

Microb Genom. 2023 Dec;9(12). doi: 10.1099/mgen.0.001151.

DOI:10.1099/mgen.0.001151

PMID:38100178

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10763500/

Abstract

Several bioinformatics genotyping algorithms are now commonly used to characterize antimicrobial resistance (AMR) gene profiles in whole-genome sequencing (WGS) data, with a view to understanding AMR epidemiology and developing resistance prediction workflows using WGS in clinical settings. Accurately evaluating AMR in Enterobacterales, particularly , is of major importance, because this is a common pathogen. However, robust comparisons of different genotyping approaches on relevant simulated and large real-life WGS datasets are lacking. Here, we used both simulated datasets and a large set of real WGS data (=1818 isolates) to systematically investigate genotyping methods in greater detail. Simulated constructs and real sequences were processed using four different bioinformatic programs (ABRicate, ARIBA, KmerResistance and SRST2, run with the ResFinder database) and their outputs compared. For simulation tests where 3079 AMR gene variants were inserted into random sequence constructs, KmerResistance was correct for 3076 (99.9 %) simulations, ABRicate for 3054 (99.2 %), ARIBA for 2783 (90.4 %) and SRST2 for 2108 (68.5 %). For simulation tests where two closely related gene variants were inserted into random sequence constructs, KmerResistance identified the correct alleles in 35 338/46 318 (76.3 %) simulations, ABRicate identified them in 11 842/46 318 (25.6 %) simulations, ARIBA identified them in 1679/46 318 (3.6 %) simulations and SRST2 identified them in 2000/46 318 (4.3 %) simulations. In real data, across all methods, 1392/1818 (76 %) isolates had discrepant allele calls for at least 1 gene. In addition to highlighting areas for improvement in challenging scenarios, (e.g. identification of AMR genes at <10× coverage, identifying multiple closely related AMR genes present in the same sample), our evaluations identified some more systematic errors that could be readily soluble, such as repeated misclassification (i.e. naming) of genes as shorter variants of the same gene present within the reference resistance gene database. Such naming errors accounted for at least 2530/4321 (59 %) of the discrepancies seen in real data. Moreover, many of the remaining discrepancies were likely 'artefactual', with reporting of cut-off differences accounting for at least 1430/4321 (33 %) discrepants. Whilst we found that comparing outputs generated by running multiple algorithms on the same dataset could identify and resolve these algorithmic artefacts, the results of our evaluations emphasize the need for developing new and more robust genotyping algorithms to further improve accuracy and performance.

摘要

目前，有几种生物信息学基因分型算法常用于全基因组测序 (WGS) 数据中抗菌药物耐药 (AMR) 基因谱的特征描述，旨在了解 AMR 流行病学，并在临床环境中使用 WGS 开发耐药性预测工作流程。正确评估肠杆菌科中的 AMR 尤其重要，因为这是一种常见的病原体。然而，不同基因分型方法在相关模拟和大型真实 WGS 数据集上的稳健比较仍然缺乏。在这里，我们使用模拟数据集和一大组真实的 WGS 数据（= 1818 株）更详细地系统研究基因分型方法。使用四个不同的生物信息学程序（ABRicate、ARIBA、KmerResistance 和 SRST2，使用 ResFinder 数据库运行）处理模拟构建体和真实序列，并比较它们的输出。对于将 3079 个 AMR 基因变体插入随机序列构建体的模拟测试，KmerResistance 正确 3076（99.9%）次模拟，ABRicate 正确 3054（99.2%）次模拟，ARIBA 正确 2783（90.4%）次模拟，SRST2 正确 2108（68.5%）次模拟。对于将两个密切相关的基因变体插入随机序列构建体的模拟测试，KmerResistance 在 35338/46318（76.3%）次模拟中正确识别了正确的等位基因，ABRicate 在 11842/46318（25.6%）次模拟中正确识别了正确的等位基因，ARIBA 在 1679/46318（3.6%）次模拟中正确识别了正确的等位基因，而 SRST2 在 2000/46318（4.3%）次模拟中正确识别了正确的等位基因。在真实数据中，所有方法中，1392/1818（76%）株至少有 1 个基因的等位基因检测结果不一致。除了突出在具有挑战性的情况下需要改进的领域（例如，在<10×覆盖的情况下识别 AMR 基因，识别同一样本中存在的多个密切相关的 AMR 基因）外，我们的评估还确定了一些更系统的错误，这些错误很容易解决，例如，将基因重复错误分类（即命名）为同一参考耐药基因数据库中存在的相同基因的较短变体。这种命名错误占真实数据中观察到的差异的至少 2530/4321（59%）。此外，许多剩余的差异可能是“人为的”，报告截止值差异至少占 1430/4321（33%）的差异。虽然我们发现通过在同一数据集上运行多个算法来比较输出可以识别和解决这些算法错误，但我们的评估结果强调需要开发新的、更强大的基因分型算法，以进一步提高准确性和性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d3e/10763500/25942b98eec6/mgen-9-1151-g001.jpg

相似文献

Discordance between different bioinformatic methods for identifying resistance genes from short-read genomic data, with a focus on .不同生物信息学方法从短读基因组数据中识别耐药基因的差异，重点是.

Microb Genom. 2023 Dec;9(12). doi: 10.1099/mgen.0.001151.

Systematic Evaluation of Whole Genome Sequence-Based Predictions of Serotype and Antimicrobial Resistance.基于全基因组序列的血清型和抗菌药物耐药性预测的系统评价

Front Microbiol. 2020 Apr 3;11:549. doi: 10.3389/fmicb.2020.00549. eCollection 2020.

Discordant bioinformatic predictions of antimicrobial resistance from whole-genome sequencing data of bacterial isolates: an inter-laboratory study.从细菌分离物的全基因组测序数据中得出的抗菌药物耐药性的生物信息学预测结果不一致：一项实验室间研究。

Microb Genom. 2020 Feb;6(2). doi: 10.1099/mgen.0.000335. Epub 2020 Feb 12.

Whole-genome sequencing for antimicrobial surveillance: species-specific quality thresholds and data evaluation from the network of the European Union Reference Laboratory for Antimicrobial Resistance genomic proficiency tests of 2021 and 2022.全基因组测序用于抗菌药物监测：来自 2021 年和 2022 年欧盟抗菌药物耐药参考实验室基因组能力测试网络的针对特定物种的质量阈值和数据评估。

mSystems. 2024 Sep 17;9(9):e0016024. doi: 10.1128/msystems.00160-24. Epub 2024 Aug 6.

Using Genomics to Track Global Antimicrobial Resistance.利用基因组学追踪全球抗菌素耐药性。

Front Public Health. 2019 Sep 4;7:242. doi: 10.3389/fpubh.2019.00242. eCollection 2019.

Taking the next-gen step: Comprehensive antimicrobial resistance detection from Burkholderia pseudomallei.迈向下一代：从伯克霍尔德氏菌属假单胞菌中全面检测抗微生物药物耐药性。

EBioMedicine. 2021 Jan;63:103152. doi: 10.1016/j.ebiom.2020.103152. Epub 2020 Dec 4.

Global transmission of extended-spectrum cephalosporin resistance in Escherichia coli driven by epidemic plasmids.大肠杆菌中超广谱头孢菌素耐药性的全球传播由流行质粒驱动。

EBioMedicine. 2024 May;103:105097. doi: 10.1016/j.ebiom.2024.105097. Epub 2024 Apr 11.

Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing using a reference collection extensively characterized with conventional methods.基于经传统方法广泛特征分析的参考集，建立用于产志贺毒素的生物信息学全基因组测序工作流程的验证策略。

Microb Genom. 2021 Mar;7(3). doi: 10.1099/mgen.0.000531. Epub 2021 Mar 3.

External validation of WGS-based antimicrobial susceptibility prediction tools, KOVER-AMR and ResFinder 4.1, for Escherichia coli clinical isolates.基于全基因组测序（WGS）的抗菌药物敏感性预测工具KOVER-AMR和ResFinder 4.1对大肠杆菌临床分离株的外部验证

Clin Microbiol Infect. 2022 Nov;28(11):1465-1470. doi: 10.1016/j.cmi.2022.05.024. Epub 2022 Jun 2.

Exploring uncatalogued genetic variation in antimicrobial resistance gene families in Escherichia coli: an observational analysis.探索大肠杆菌中抗菌药物耐药基因家族未被编目的遗传变异：一项观察性分析。

Lancet Microbe. 2024 Nov;5(11):100913. doi: 10.1016/S2666-5247(24)00152-6. Epub 2024 Oct 5.

引用本文的文献

Assembly-free typing of Nanopore and Illumina data through proximity scoring with KMA.通过KMA的邻近评分对纳米孔和Illumina数据进行无组装分型。

NAR Genom Bioinform. 2025 Sep 1;7(3):lqaf116. doi: 10.1093/nargab/lqaf116. eCollection 2025 Sep.

Large-scale genomic analysis reveals the distribution and diversity of type VI secretion systems in .大规模基因组分析揭示了……中VI型分泌系统的分布和多样性。（原文中“in”后面缺少具体内容）

mSystems. 2025 Jun 18:e0010525. doi: 10.1128/msystems.00105-25.

Nodules-associated Klebsiella oxytoca complex: genomic insights into plant growth promotion and health risk assessment.与结节相关的产酸克雷伯菌复合体：植物生长促进和健康风险评估的基因组学见解

BMC Microbiol. 2025 May 15;25(1):294. doi: 10.1186/s12866-025-04002-7.

Population analysis of heavy metal and biocide resistance genes in from human clinical cases in New Hampshire, United States.美国新罕布什尔州人类临床病例中重金属和抗微生物剂抗性基因的群体分析。

Front Microbiol. 2022 Oct 19;13:983083. doi: 10.3389/fmicb.2022.983083. eCollection 2022.

本文引用的文献

Keeping up with the pathogens: improved antimicrobial resistance detection and prediction from Pseudomonas aeruginosa genomes.紧跟病原体：从铜绿假单胞菌基因组中提高对抗菌药物耐药性的检测和预测。

Genome Med. 2024 Jun 7;16(1):78. doi: 10.1186/s13073-024-01346-z.

An ISO-certified genomics workflow for identification and surveillance of antimicrobial resistance.一个经过 ISO 认证的基因组学工作流程，用于鉴定和监测抗菌素耐药性。

Nat Commun. 2023 Jan 4;14(1):60. doi: 10.1038/s41467-022-35713-4.

Clin Microbiol Infect. 2022 Nov;28(11):1465-1470. doi: 10.1016/j.cmi.2022.05.024. Epub 2022 Jun 2.

Review and Comparison of Antimicrobial Resistance Gene Databases.抗菌耐药基因数据库的综述与比较

Antibiotics (Basel). 2022 Mar 4;11(3):339. doi: 10.3390/antibiotics11030339.

ResFinder 4.0 for predictions of phenotypes from genotypes.ResFinder 4.0 用于基因型到表型的预测。

J Antimicrob Chemother. 2020 Dec 1;75(12):3491-3500. doi: 10.1093/jac/dkaa345.

Reconciling the Potentially Irreconcilable? Genotypic and Phenotypic Amoxicillin-Clavulanate Resistance in .调和潜在的不可调和之处？[具体研究对象]中阿莫西林-克拉维酸的基因型和表型耐药性

Antimicrob Agents Chemother. 2020 May 21;64(6). doi: 10.1128/AAC.02026-19.

Microb Genom. 2020 Feb;6(2). doi: 10.1099/mgen.0.000335. Epub 2020 Feb 12.

Use of whole genome sequencing of commensal in pigs for antimicrobial resistance surveillance, United Kingdom, 2018.利用猪肠道共生菌的全基因组测序进行抗菌药物耐药性监测，英国，2018 年。

Euro Surveill. 2019 Dec;24(50). doi: 10.2807/1560-7917.ES.2019.24.50.1900136.

CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database.CARD 2020：利用综合抗生素耐药数据库进行抗生素耐药组监测。

Nucleic Acids Res. 2020 Jan 8;48(D1):D517-D525. doi: 10.1093/nar/gkz935.

Validating the AMRFinder Tool and Resistance Gene Database by Using Antimicrobial Resistance Genotype-Phenotype Correlations in a Collection of Isolates.通过在分离株集合中使用抗生素耐药基因型-表型相关性来验证 AMRFinder 工具和耐药基因数据库。

Antimicrob Agents Chemother. 2019 Oct 22;63(11). doi: 10.1128/AAC.00483-19. Print 2019 Nov.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

不同生物信息学方法从短读基因组数据中识别耐药基因的差异，重点是.

Discordance between different bioinformatic methods for identifying resistance genes from short-read genomic data, with a focus on .

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献