• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 BioHansel 对克隆细菌病原体进行快速准确的 SNP 基因分型。

Rapid and accurate SNP genotyping of clonal bacterial pathogens with BioHansel.

机构信息

National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada.

Canadian Food Inspection Agency, Winnipeg, MB, Canada.

出版信息

Microb Genom. 2021 Sep;7(9). doi: 10.1099/mgen.0.000651.

DOI:10.1099/mgen.0.000651
PMID:34554082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8715432/
Abstract

Hierarchical genotyping approaches can provide insights into the source, geography and temporal distribution of bacterial pathogens. Multiple hierarchical SNP genotyping schemes have previously been developed so that new isolates can rapidly be placed within pre-computed population structures, without the need to rebuild phylogenetic trees for the entire dataset. This classification approach has, however, seen limited uptake in routine public health settings due to analytical complexity and the lack of standardized tools that provide clear and easy ways to interpret results. The BioHansel tool was developed to provide an organism-agnostic tool for hierarchical SNP-based genotyping. The tool identifies split k-mers that distinguish predefined lineages in whole genome sequencing (WGS) data using SNP-based genotyping schemes. BioHansel uses the Aho-Corasick algorithm to type isolates from assembled genomes or raw read sequence data in a matter of seconds, with limited computational resources. This makes BioHansel ideal for use by public health agencies that rely on WGS methods for surveillance of bacterial pathogens. Genotyping results are evaluated using a quality assurance module which identifies problematic samples, such as low-quality or contaminated datasets. Using existing hierarchical SNP schemes for and Typhi, we compare the genotyping results obtained with the k-mer-based tools BioHansel and SKA, with those of the organism-specific tools TBProfiler and genotyphi, which use gold-standard reference-mapping approaches. We show that the genotyping results are fully concordant across these different methods, and that the k-mer-based tools are significantly faster. We also test the ability of the BioHansel quality assurance module to detect intra-lineage contamination and demonstrate that it is effective, even in populations with low genetic diversity. We demonstrate the scalability of the tool using a dataset of ~8100 . Typhi public genomes and provide the aggregated results of geographical distributions as part of the tool's output. BioHansel is an open source Python 3 application available on PyPI and Conda repositories and as a Galaxy tool from the public Galaxy Toolshed. In a public health context, BioHansel enables rapid and high-resolution classification of bacterial pathogens with low genetic diversity.

摘要

分层基因分型方法可以深入了解细菌病原体的来源、地理位置和时间分布。以前已经开发了多种分层 SNP 基因分型方案,以便可以快速将新分离株放置在预先计算的种群结构中,而无需为整个数据集重建系统发育树。然而,由于分析复杂性以及缺乏提供清晰易用的解释结果的标准化工具,这种分类方法在常规公共卫生环境中的应用受到限制。BioHansel 工具是为基于 SNP 的分层基因分型开发的一种与生物体无关的工具。该工具使用基于 SNP 的基因分型方案,使用 SNP 识别区分全基因组测序 (WGS) 数据中预定义谱系的分裂 k-mer。BioHansel 使用 Aho-Corasick 算法在几秒钟内对组装基因组或原始读取序列数据中的分离株进行分型,所需的计算资源有限。这使得 BioHansel 非常适合依赖 WGS 方法进行细菌病原体监测的公共卫生机构使用。使用质量保证模块评估基因分型结果,该模块可识别有问题的样本,例如低质量或污染的数据集。我们使用 和 沙门氏菌的现有分层 SNP 方案,将基于 k-mer 的工具 BioHansel 和 SKA 与专门针对生物体的工具 TBProfiler 和 genotyphi 获得的基因分型结果进行比较,后者使用黄金标准参考映射方法。我们表明,这些不同方法的基因分型结果完全一致,并且基于 k-mer 的工具速度明显更快。我们还测试了 BioHansel 质量保证模块检测谱系内污染的能力,并证明即使在遗传多样性低的群体中,它也是有效的。我们使用约 8100 个 伤寒沙门氏菌公共基因组数据集来测试工具的可扩展性,并提供工具输出的地理分布的聚合结果。BioHansel 是一个可在 PyPI 和 Conda 存储库中获得的开源 Python 3 应用程序,并且是公共 Galaxy Toolshed 中的 Galaxy 工具。在公共卫生环境中,BioHansel 可以快速实现具有低遗传多样性的细菌病原体的高分辨率分类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/0f8e5dc63917/mgen-7-0651-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/5229b4a98ab1/mgen-7-0651-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/3e7c0eb359a7/mgen-7-0651-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/0acc749c8a85/mgen-7-0651-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/cc0d78580cb7/mgen-7-0651-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/cc4425076308/mgen-7-0651-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/0f8e5dc63917/mgen-7-0651-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/5229b4a98ab1/mgen-7-0651-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/3e7c0eb359a7/mgen-7-0651-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/0acc749c8a85/mgen-7-0651-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/cc0d78580cb7/mgen-7-0651-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/cc4425076308/mgen-7-0651-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0081/8715432/0f8e5dc63917/mgen-7-0651-g006.jpg

相似文献

1
Rapid and accurate SNP genotyping of clonal bacterial pathogens with BioHansel.利用 BioHansel 对克隆细菌病原体进行快速准确的 SNP 基因分型。
Microb Genom. 2021 Sep;7(9). doi: 10.1099/mgen.0.000651.
2
TGS-TB: Total Genotyping Solution for Mycobacterium tuberculosis Using Short-Read Whole-Genome Sequencing.TGS-TB:利用短读长全基因组测序技术的结核分枝杆菌全基因分型解决方案
PLoS One. 2015 Nov 13;10(11):e0142951. doi: 10.1371/journal.pone.0142951. eCollection 2015.
3
MentaLiST - A fast MLST caller for large MLST schemes.MentaLiST - 一种适用于大型 MLST 方案的快速 MLST 调用程序。
Microb Genom. 2018 Feb;4(2). doi: 10.1099/mgen.0.000146. Epub 2018 Jan 10.
4
High Resolution Melting as a rapid, reliable, accurate and cost-effective emerging tool for genotyping pathogenic bacteria and enhancing molecular epidemiological surveillance: a comprehensive review of the literature.高分辨率熔解曲线分析作为一种快速、可靠、准确且经济高效的新兴工具用于致病性细菌基因分型和加强分子流行病学监测:文献综述
Ann Ig. 2017 Jul-Aug;29(4):293-316. doi: 10.7416/ai.2017.2153.
5
Tracking a tuberculosis outbreak over 21 years: strain-specific single-nucleotide polymorphism typing combined with targeted whole-genome sequencing.追踪一场持续21年的结核病疫情:菌株特异性单核苷酸多态性分型结合靶向全基因组测序
J Infect Dis. 2015 Apr 15;211(8):1306-16. doi: 10.1093/infdis/jiu601. Epub 2014 Oct 30.
6
Whole-genome-based Mycobacterium tuberculosis surveillance: a standardized, portable, and expandable approach.基于全基因组的结核分枝杆菌监测:一种标准化、便携且可扩展的方法。
J Clin Microbiol. 2014 Jul;52(7):2479-86. doi: 10.1128/JCM.00567-14. Epub 2014 Apr 30.
7
[Future prospects of molecular epidemiology in tuberculosis].[结核病分子流行病学的未来前景]
Kekkaku. 2009 Dec;84(12):783-4.
8
snpTree--a web-server to identify and construct SNP trees from whole genome sequence data.snpTree——一个从全基因组序列数据中识别和构建 SNP 树的网络服务器。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2164-13-S7-S6. Epub 2012 Dec 13.
9
Whole genome sequencing versus traditional genotyping for investigation of a Mycobacterium tuberculosis outbreak: a longitudinal molecular epidemiological study.全基因组测序与传统基因分型在结核分枝杆菌暴发调查中的比较:一项纵向分子流行病学研究。
PLoS Med. 2013;10(2):e1001387. doi: 10.1371/journal.pmed.1001387. Epub 2013 Feb 12.
10
A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.对低覆盖作物数据集的测序分析方法的比较表明,一种新的工作流程 GB-eaSy 具有优势。
BMC Bioinformatics. 2017 Dec 28;18(1):586. doi: 10.1186/s12859-017-2000-6.

引用本文的文献

1
Typhi Mykrobe: fast and accurate lineage identification and antimicrobial resistance genotyping directly from sequence reads for the typhoid fever agent Typhi.伤寒杆菌:直接从伤寒热病原体伤寒杆菌的序列读取中进行快速准确的谱系鉴定和抗菌药物耐药性基因分型。
bioRxiv. 2025 Jul 31:2024.09.30.613582. doi: 10.1101/2024.09.30.613582.
2
Seamless, rapid, and accurate analyses of outbreak genomic data using split -mer analysis.利用分拆分析实现爆发基因组数据的无缝、快速和准确分析。
Genome Res. 2024 Oct 29;34(10):1661-1673. doi: 10.1101/gr.279449.124.
3
Average Nucleotide Identity and Digital DNA-DNA Hybridization Analysis Following PromethION Nanopore-Based Whole Genome Sequencing Allows for Accurate Prokaryotic Typing.

本文引用的文献

1
Genomic diversity of The UoWUCC 10K genomes project.伍伦贡大学临床癌症中心10K基因组计划的基因组多样性
Wellcome Open Res. 2021 Feb 1;5:223. doi: 10.12688/wellcomeopenres.16291.2. eCollection 2020.
2
GC Content-Associated Sequencing Bias Caused by Library Preparation Method May Infrequently Affect Serotype Prediction Using SeqSero2.文库制备方法导致的与GC含量相关的测序偏差可能偶尔会影响使用SeqSero2进行血清型预测。
Appl Environ Microbiol. 2020 Sep 1;86(18). doi: 10.1128/AEM.00614-20.
3
Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism-calling pipelines.
基于PromethION纳米孔全基因组测序的平均核苷酸同一性和数字DNA-DNA杂交分析可实现准确的原核生物分型。
Diagnostics (Basel). 2024 Aug 17;14(16):1800. doi: 10.3390/diagnostics14161800.
4
Mutations in 406 Are Associated with Low-Level Ethambutol Resistance in Canadian Isolates.406位点的突变与加拿大分离株中的低水平乙胺丁醇耐药性相关。
Antibiotics (Basel). 2024 Jul 4;13(7):624. doi: 10.3390/antibiotics13070624.
5
Evaluation of Whole Genome Sequencing-Based Predictions of Antimicrobial Resistance to TB First Line Agents: A Lesson from 5 Years of Data.基于全基因组测序的结核病一线药物耐药性预测评估:5 年数据的经验教训。
Int J Mol Sci. 2024 Jun 6;25(11):6245. doi: 10.3390/ijms25116245.
6
Characterization of Mycobacterium orygis, Mycobacterium bovis, and Mycobacterium caprae Infections in Humans in Western Canada.鉴定加拿大西部人感染分枝杆菌 orientalis、牛分枝杆菌和山羊分枝杆菌。
J Infect Dis. 2024 Oct 16;230(4):e789-e797. doi: 10.1093/infdis/jiae124.
7
Whole genome sequencing-based identification of human tuberculosis caused by animal-lineage .基于全基因组测序的动物源性人结核分枝杆菌的鉴定
J Clin Microbiol. 2023 Nov 21;61(11):e0026023. doi: 10.1128/jcm.00260-23. Epub 2023 Oct 25.
8
Whole-genome analyses reveal a novel prophage and cgSNPs-derived sublineages of Brachyspira hyodysenteriae ST196.全基因组分析揭示了新型噬菌体和 cgSNP 衍生的猪痢疾短螺旋体 ST196 亚系。
BMC Genomics. 2022 Feb 15;23(1):131. doi: 10.1186/s12864-022-08347-5.
9
How much do smear-negative patients really contribute to tuberculosis transmissions? Re-examining an old question with new tools.涂片阴性患者对结核病传播的实际贡献有多大?用新工具重新审视一个老问题。
EClinicalMedicine. 2022 Jan 3;43:101250. doi: 10.1016/j.eclinm.2021.101250. eCollection 2022 Jan.
10
Generalizable characteristics of false-positive bacterial variant calls.假阳性细菌变异体检测的可推广特征。
Microb Genom. 2021 Aug;7(8). doi: 10.1099/mgen.0.000615.
基因组多样性影响细菌单核苷酸多态性 calling 管道的准确性。
Gigascience. 2020 Feb 1;9(2). doi: 10.1093/gigascience/giaa007.
4
QuantTB - a method to classify mixed Mycobacterium tuberculosis infections within whole genome sequencing data.QuantTB - 一种在全基因组测序数据中分类混合结核分枝杆菌感染的方法。
BMC Genomics. 2020 Jan 28;21(1):80. doi: 10.1186/s12864-020-6486-3.
5
Performance and Accuracy of Four Open-Source Tools for Serotyping of spp. Based on Whole-Genome Short-Read Sequencing Data.基于全基因组短读测序数据的 spp. 分型的四种开源工具的性能和准确性。
Appl Environ Microbiol. 2020 Feb 18;86(5). doi: 10.1128/AEM.02265-19.
6
Within-species contamination of bacterial whole-genome sequence data has a greater influence on clustering analyses than between-species contamination.种内细菌全基因组序列数据污染对聚类分析的影响大于种间污染。
Genome Biol. 2019 Dec 18;20(1):286. doi: 10.1186/s13059-019-1914-x.
7
The EnteroBase user's guide, with case studies on transmissions, phylogeny, and core genomic diversity.《EnteroBase 用户指南》,内含传播、系统发育和核心基因组多样性方面的案例研究。
Genome Res. 2020 Jan;30(1):138-152. doi: 10.1101/gr.251678.119. Epub 2019 Dec 6.
8
Benchmarking Metagenomics Tools for Taxonomic Classification.基于元基因组工具的分类学基准测试。
Cell. 2019 Aug 8;178(4):779-794. doi: 10.1016/j.cell.2019.07.010.
9
Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software.全面评估和特征分析短读通用结构变异调用软件。
Nat Commun. 2019 Jul 19;10(1):3240. doi: 10.1038/s41467-019-11146-4.
10
Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs.整合信息学工具和便携式测序技术,用于快速检测抗结核药物耐药性。
Genome Med. 2019 Jun 24;11(1):41. doi: 10.1186/s13073-019-0650-x.