Identification of highly variable sequence fragments in unmapped reads for rapid bacterial genotyping.

Affiliations

Department of Biomedical Engineering, Faculty of Electrical Engineering and Communication, Brno University of Technology, Brno, Czechia.

Department of Internal Medicine, Hematology and Oncology, University Hospital Brno, Brno, Czechia.

Publication information

BMC Genomics. 2022 Dec 29;23(Suppl 3):445. doi: 10.1186/s12864-022-08550-4.

DOI:10.1186/s12864-022-08550-4

PMID:36581824

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9798552/

Abstract

BACKGROUND

Bacterial genotyping is a crucial process in outbreak investigation and epidemiological studies. Several typing methods, such as pulsed-field gel electrophoresis, multilocus sequence typing (MLST) and whole genome sequencing, are currently used in routine clinical practice. However, these methods are costly, time-consuming and computationally demanding. An alternative is mini-MLST, a quick, cost-effective and robust method based on high-resolution melting analysis. Nevertheless, there is no standardized approach for identifying markers suitable for mini-MLST. Here, we present a pipeline for detecting variable fragments in unmapped reads, based on a modified hybrid assembly approach that uses data from a single sequencing platform.

RESULTS

In routine assembly against a reference sequence, highly variable reads are not aligned and remain unmapped. If these reads are then assembled de novo, variable genomic regions can be located in the resulting scaffolds. By calculating variability rates, it is possible to find a highly variable region with the same discriminatory power as the seven housekeeping gene fragments used in MLST. In the work presented here, we show that one variable fragment can be identified in de novo assembled scaffolds of 21 Escherichia coli genomes, and three variable regions in scaffolds of 31 Klebsiella pneumoniae genomes. For each identified fragment, melting temperatures are calculated with the nearest neighbor method to verify the discriminatory power of mini-MLST.
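The abstract does not say which nearest neighbor parameter set or reaction conditions were used. As a minimal sketch of the general idea only, the following Python function estimates a duplex melting temperature from the unified nearest neighbor parameters of SantaLucia (1998). The function name `nn_tm`, the default strand concentration, and the absence of any salt correction are assumptions made for illustration, and the two-state model behind these parameters is intended for short duplexes, so values for longer amplicons should be read as relative rather than absolute.

```python
import math

# Unified nearest neighbor parameters (SantaLucia 1998, 1 M NaCl).
# Each value is (delta H in kcal/mol, delta S in cal/(mol*K)).
# Complementary stacks (e.g. AA and TT) share one parameter pair.
NN = {
    "AA": (-7.9, -22.2), "TT": (-7.9, -22.2),
    "AT": (-7.2, -20.4),
    "TA": (-7.2, -21.3),
    "CA": (-8.5, -22.7), "TG": (-8.5, -22.7),
    "GT": (-8.4, -22.4), "AC": (-8.4, -22.4),
    "CT": (-7.8, -21.0), "AG": (-7.8, -21.0),
    "GA": (-8.2, -22.2), "TC": (-8.2, -22.2),
    "CG": (-10.6, -27.2),
    "GC": (-9.8, -24.4),
    "GG": (-8.0, -19.9), "CC": (-8.0, -19.9),
}

# Initiation terms, chosen by the base at each end of the duplex.
INIT = {
    "G": (0.1, -2.8), "C": (0.1, -2.8),
    "A": (2.3, 4.1), "T": (2.3, 4.1),
}

R = 1.987  # gas constant in cal/(mol*K)


def nn_tm(seq, strand_conc=0.5e-6):
    """Estimate Tm in degrees Celsius for a non-self-complementary duplex.

    strand_conc is the total strand concentration in mol/L (an assumed
    default, not a value taken from the paper). No salt correction is applied.
    """
    seq = seq.upper()
    if len(seq) < 2 or set(seq) - set("ACGT"):
        raise ValueError("sequence must contain at least two bases from ACGT")

    # Start with the initiation terms for both duplex ends.
    dh = INIT[seq[0]][0] + INIT[seq[-1]][0]
    ds = INIT[seq[0]][1] + INIT[seq[-1]][1]

    # Add the contribution of every overlapping dinucleotide stack.
    for i in range(len(seq) - 1):
        h, s = NN[seq[i:i + 2]]
        dh += h
        ds += s

    # Two-state model: Tm = dH / (dS + R ln(Ct / 4)), converted from K to C.
    return dh * 1000.0 / (ds + R * math.log(strand_conc / 4.0)) - 273.15
```

Comparing `nn_tm` values across the alleles of a candidate fragment gives a rough indication of whether the alleles would be separable by melting analysis. Sequences with ambiguous bases are rejected rather than guessed at.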

CONCLUSIONS

We present a pipeline for a modified hybrid assembly approach consisting of reference-based mapping followed by de novo assembly of the unmapped reads. This approach can be used to identify highly variable genomic fragments in unmapped reads. The identified variable regions can then be used in efficient laboratory methods for bacterial typing, such as mini-MLST with high discriminatory power, replacing expensive methods such as MLST. Results can be delivered in less time, which enables rapid infection monitoring in clinical practice.
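The step that links the two halves of this approach is collecting the reads that failed to map. The abstract does not name the tools used, so the following is only a sketch under stated assumptions: it takes alignment output in SAM text format, where an unmapped read has the 0x4 bit set in its FLAG field, and writes those reads back out as FASTQ records. The function name `unmapped_to_fastq` is hypothetical.

```python
def unmapped_to_fastq(sam_lines):
    """Yield FASTQ records for SAM alignments flagged as unmapped (0x4).

    sam_lines is any iterable of SAM text lines, for example an open file.
    Assumes each record carries the 11 mandatory SAM fields and that
    unmapped reads are stored in their original orientation.
    """
    for line in sam_lines:
        # Skip blank lines and header lines (headers start with "@").
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        name, flag = fields[0], int(fields[1])
        seq, qual = fields[9], fields[10]
        # Bit 0x4 of FLAG marks a read that the aligner could not place.
        if flag & 0x4:
            yield f"@{name}\n{seq}\n+\n{qual}\n"
```

The resulting FASTQ records could then be handed to a de novo assembler of choice to produce the scaffolds in which variable regions are searched. In practice a dedicated tool such as samtools would usually do this filtering; the sketch only makes the selection criterion explicit.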

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/879c/9798552/4f04e718b132/12864_2022_8550_Fig1_HTML.jpg
