The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data.

Author information

Pightling Arthur W, Petronella Nicholas, Pagotto Franco

Affiliations

Office of Analytics and Outreach, Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, 5100 Paint Branch Parkway, College Park, MD, 20740, USA.

Biostatistics and Modelling Division, Bureau of Food Surveillance and Science Integration, Food Directorate, Health Products and Food Branch, Health Canada, 251 Sir Frederick Banting Driveway, Ottawa, K1A 0K9, ON, Canada.

Publication information

BMC Microbiol. 2015 Oct 22;15:224. doi: 10.1186/s12866-015-0526-1.

DOI:10.1186/s12866-015-0526-1

PMID:26490433

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4618880/

Abstract

BACKGROUND

Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typing methods. Here, we present the Listeria monocytogenes core-genome sequence typing method for molecular characterization. This method uses a high-confidence core (HCC) genome, calculated to ensure accurate identification of orthologs. We also developed an evolutionarily relevant nomenclature based upon phylogenetic analysis of HCC genomes. Finally, we created a pipeline (LmCGST; https://sourceforge.net/projects/lmcgst/files/) that takes in raw next-generation sequencing reads, calculates a subject HCC profile, compares it to an expandable database, assigns a sequence type, and performs a phylogenetic analysis.
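The compare-and-assign step of such a pipeline can be illustrated with a minimal sketch. The abstract does not describe how LmCGST encodes profiles or decides on a match, so everything here is an assumption for illustration: a profile is modeled as a mapping from locus name to an allele identifier, the database as a mapping from type name to profile, and the names `count_differences` and `assign_type` are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Hypothetical encoding: locus name -> allele identifier.
# The real LmCGST profile format is not given in the abstract.
Profile = Dict[str, str]


def count_differences(subject: Profile, reference: Profile) -> int:
    """Count loci whose alleles differ between two profiles.

    A locus present in only one of the two profiles counts as a difference.
    """
    loci = set(subject) | set(reference)
    return sum(1 for locus in loci if subject.get(locus) != reference.get(locus))


def assign_type(
    subject: Profile, database: Dict[str, Profile]
) -> Tuple[Optional[str], str, int]:
    """Return (exact_match_or_None, closest_type, distance_to_closest).

    Ties on distance are broken alphabetically by type name so the
    result is deterministic.
    """
    if not database:
        raise ValueError("database is empty")
    closest, distance = min(
        ((name, count_differences(subject, ref)) for name, ref in database.items()),
        key=lambda pair: (pair[1], pair[0]),
    )
    exact = closest if distance == 0 else None
    return exact, closest, distance


# Made-up profiles over three loci (the HCC genome in the paper has 1013).
database = {
    "ST-A": {"orf1": "a1", "orf2": "b1", "orf3": "c1"},
    "ST-B": {"orf1": "a2", "orf2": "b1", "orf3": "c2"},
}
known = {"orf1": "a1", "orf2": "b1", "orf3": "c1"}
novel = {"orf1": "a2", "orf2": "b9", "orf3": "c2"}

known_result = assign_type(known, database)
novel_result = assign_type(novel, database)
```

In this sketch a subject with no exact match is reported alongside its nearest database entry, which is one plausible way to keep a database expandable: an unmatched profile could be added under a new type name. Whether LmCGST handles novel profiles this way is not stated in the abstract.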

RESULTS

We analyzed 29 high-quality, closed Listeria monocytogenes chromosome sequences and identified loci that are reliable targets for automated molecular characterization methods. We identified 1013 open-reading frames that comprise our high-confidence core (HCC) genome. We then populated a database with HCC profiles from 114 taxa. We sequenced 84 randomly selected isolates from the Listeriosis Reference Service for Canada's collection and analyzed them with the LmCGST pipeline. In addition, we generated pulsed-field gel electrophoresis, ribotyping, and in silico multi-locus sequence typing (MLST) data for the 84 isolates and compared the results to those obtained using the CGST method. We found that all of the methods yielded results that are generally congruent. However, due to the increased numbers of categories, the CGST method provides much greater discriminatory power than the other methods tested here.
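The link between the number of categories and discriminatory power can be made concrete with Simpson's index of diversity, a measure commonly used to compare typing methods. The abstract does not say which measure the authors used, so this is only an illustrative sketch; the function name and the isolate labels are invented for the example.

```python
from collections import Counter
from typing import Hashable, Iterable


def simpsons_index_of_diversity(type_labels: Iterable[Hashable]) -> float:
    """Probability that two isolates drawn without replacement have different types.

    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)),
    where n_j is the number of isolates of type j and N is the total.
    """
    counts = Counter(type_labels)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("need at least two isolates")
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Made-up labels for the same six isolates under a coarse and a fine scheme.
coarse_scheme = ["X", "X", "X", "Y", "Y", "Y"]
fine_scheme = ["X1", "X2", "X3", "Y1", "Y1", "Y2"]

coarse_d = simpsons_index_of_diversity(coarse_scheme)
fine_d = simpsons_index_of_diversity(fine_scheme)
```

A scheme that splits the same isolates into more, smaller groups yields a higher index, which is the sense in which a method with more categories is more discriminatory.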

CONCLUSIONS

We show that the CGST method provides increased discriminatory power relative to typing methods such as pulsed-field gel electrophoresis, ribotyping, and multi-locus sequence typing while it addresses several shortcomings of other methods of molecular characterization with next-generation sequence data. It uses discrete, well-defined groupings (types) of organisms that are phylogenetically relevant and easily interpreted. In addition, the CGST scheme can be expanded to include additional loci and HCC profiles in the future. In total, the CGST method provides an approach to the molecular characterization of Listeria monocytogenes with next-generation sequence data that is highly reproducible, easily standardized, portable, and accessible.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8215/4618880/18a15b273a16/12866_2015_526_Fig1_HTML.jpg
