Suppr超能文献

鉴定未映射读取中高度可变的序列片段,用于快速细菌基因分型。

Identification of highly variable sequence fragments in unmapped reads for rapid bacterial genotyping.

机构信息

Department of Biomedical Engineering, Faculty of Electrical Engineering and Communication, Brno University of Technology, Brno, Czechia.

Department of Internal Medicine, Hematology and Oncology, University Hospital Brno, Brno, Czechia.

出版信息

BMC Genomics. 2022 Dec 29;23(Suppl 3):445. doi: 10.1186/s12864-022-08550-4.

Abstract

BACKGROUND

Bacterial genotyping is a crucial process in outbreak investigation and epidemiological studies. Several typing methods such as pulsed-field gel electrophoresis, multilocus sequence typing (MLST) and whole genome sequencing are currently used in routine clinical practice. However, these methods are costly, time-consuming and have high computational demands. An alternative to these methods is mini-MLST, a quick, cost-effective and robust method based on high-resolution melting analysis. Nevertheless, no standardized approach to identify markers suitable for mini-MLST exists. Here, we present a pipeline for variable fragment detection in unmapped reads based on a modified hybrid assembly approach using data from one sequencing platform.

RESULTS

In routine assembly against the reference sequence, high variable reads are not aligned and remain unmapped. If de novo assembly of them is performed, variable genomic regions can be located in created scaffolds. Based on the variability rates calculation, it is possible to find a highly variable region with the same discriminatory power as seven housekeeping gene fragments used in MLST. In the work presented here, we show the capability of identifying one variable fragment in de novo assembled scaffolds of 21 Escherichia coli genomes and three variable regions in scaffolds of 31 Klebsiella pneumoniae genomes. For each identified fragment, the melting temperatures are calculated based on the nearest neighbor method to verify the mini-MLST's discriminatory power.

CONCLUSIONS

A pipeline for a modified hybrid assembly approach consisting of reference-based mapping and de novo assembly of unmapped reads is presented. This approach can be employed for the identification of highly variable genomic fragments in unmapped reads. The identified variable regions can then be used in efficient laboratory methods for bacterial typing such as mini-MLST with high discriminatory power, fully replacing expensive methods such as MLST. The results can and will be delivered in a shorter time, which allows immediate and fast infection monitoring in clinical practice.

摘要

背景

细菌基因分型是暴发调查和流行病学研究的关键过程。目前在常规临床实践中使用几种分型方法,如脉冲场凝胶电泳、多位点序列分型(MLST)和全基因组测序。然而,这些方法成本高、耗时且计算需求高。这些方法的替代方法是 mini-MLST,这是一种基于高分辨率熔解分析的快速、经济高效且稳健的方法。然而,不存在用于识别适合 mini-MLST 的标记的标准化方法。在这里,我们提出了一种基于使用来自一个测序平台的数据进行修改的混合组装方法的未映射读段中可变片段检测的管道。

结果

在常规针对参考序列的组装中,高变量读段未对齐且仍未映射。如果对它们进行从头组装,则可以在创建的支架中找到可变基因组区域。基于变异性率的计算,可以找到一个具有与 MLST 中使用的七个管家基因片段相同的鉴别力的高度可变区域。在本文中,我们展示了在 21 个大肠杆菌基因组的从头组装支架中识别一个可变片段和在 31 个肺炎克雷伯菌基因组的支架中识别三个可变区域的能力。对于每个识别的片段,基于最近邻方法计算熔解温度,以验证 mini-MLST 的鉴别力。

结论

提出了一种由基于参考的映射和未映射读段的从头组装组成的修改混合组装方法的管道。该方法可用于鉴定未映射读段中的高度可变基因组片段。然后可以将鉴定的可变区域用于细菌分型的高效实验室方法,例如具有高鉴别力的 mini-MLST,完全替代昂贵的方法,如 MLST。结果可以并且将在更短的时间内交付,这允许在临床实践中立即和快速地进行感染监测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/879c/9798552/4f04e718b132/12864_2022_8550_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验