• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于表征大肠杆菌和沙门氏菌基因组的 GEA 管道。

The GEA pipeline for characterizing Escherichia coli and Salmonella genomes.

机构信息

US Department of Agriculture, Agricultural Research Service, US Meat Animal Research Center, 844 Rd 313, PO Box 165, Clay Center, NE, 68933, USA.

US Department of Agriculture, Agricultural Research Service, Eastern Regional Research Center, 600 East Mermaid Lane, Wyndmoor, PA, 19038, USA.

出版信息

Sci Rep. 2024 Jun 10;14(1):13257. doi: 10.1038/s41598-024-63832-z.

DOI:10.1038/s41598-024-63832-z
PMID:38858528
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11164923/
Abstract

Salmonella enterica and Escherichia coli are major food-borne human pathogens, and their genomes are routinely sequenced for clinical surveillance. Computational pipelines designed for analyzing pathogen genomes should both utilize the most current information from annotation databases and increase the coverage of these databases over time. We report the development of the GEA pipeline to analyze large batches of E. coli and S. enterica genomes. The GEA pipeline takes as input paired Illumina raw reads files which are then assembled followed by annotation. Alternatively, assemblies can be provided as input and directly annotated. The pipeline provides predictive genome annotations for E. coli and S. enterica with a focus on the Center for Genomic Epidemiology tools. Annotation results are provided as a tab delimited text file. The GEA pipeline is designed for large-scale E. coli and S. enterica genome assembly and characterization using the Center for Genomic Epidemiology command-line tools and high-performance computing. Large scale annotation is demonstrated by an analysis of more than 14,000 Salmonella genome assemblies. Testing the GEA pipeline on E. coli raw reads demonstrates reproducibility across multiple compute environments and computational usage is optimized on high performance computers.

摘要

肠沙门氏菌和大肠杆菌是主要的食源性人类病原体,它们的基因组通常被用于临床监测进行测序。用于分析病原体基因组的计算管道应同时利用注释数据库中的最新信息,并随着时间的推移增加这些数据库的覆盖范围。我们报告了 GEA 管道的开发,用于分析大量的大肠杆菌和肠沙门氏菌基因组。GEA 管道以 Illumina 原始读取文件对作为输入,然后进行组装和注释。或者,可以提供组装作为输入并直接注释。该管道提供了对大肠杆菌和肠沙门氏菌的预测基因组注释,重点是基因组流行病学中心的工具。注释结果以制表符分隔的文本文件形式提供。GEA 管道旨在使用基因组流行病学中心的命令行工具和高性能计算对大规模的大肠杆菌和肠沙门氏菌基因组进行组装和特征分析。通过对 14000 多个沙门氏菌基因组组装的分析,展示了大规模注释的能力。在大肠杆菌原始读取上测试 GEA 管道证明了在多个计算环境中的可重复性,并且在高性能计算机上优化了计算使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb5/11164923/fdf4af7cc991/41598_2024_63832_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb5/11164923/fdf4af7cc991/41598_2024_63832_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb5/11164923/fdf4af7cc991/41598_2024_63832_Fig1_HTML.jpg

相似文献

1
The GEA pipeline for characterizing Escherichia coli and Salmonella genomes.用于表征大肠杆菌和沙门氏菌基因组的 GEA 管道。
Sci Rep. 2024 Jun 10;14(1):13257. doi: 10.1038/s41598-024-63832-z.
2
The automatic annotation of bacterial genomes.细菌基因组的自动注释。
Brief Bioinform. 2013 Jan;14(1):1-12. doi: 10.1093/bib/bbs007. Epub 2012 Mar 9.
3
MicrobeAnnotator: a user-friendly, comprehensive functional annotation pipeline for microbial genomes.微生物注释器:一个用户友好、全面的微生物基因组功能注释管道。
BMC Bioinformatics. 2021 Jan 6;22(1):11. doi: 10.1186/s12859-020-03940-5.
4
Beav: a bacterial genome and mobile element annotation pipeline.Beav:细菌基因组和移动元件注释流水线。
mSphere. 2024 Aug 28;9(8):e0020924. doi: 10.1128/msphere.00209-24. Epub 2024 Jul 22.
5
Evolution of pan-genomes of Escherichia coli, Shigella spp., and Salmonella enterica.大肠杆菌、志贺氏菌和沙门氏菌泛基因组的进化。
J Bacteriol. 2013 Jun;195(12):2786-92. doi: 10.1128/JB.02285-12. Epub 2013 Apr 12.
6
Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.使用 Illumina 和 Oxford Nanopore 测序对细菌病原体进行基因组分析的混合组装方法的基准测试。
BMC Genomics. 2020 Sep 14;21(1):631. doi: 10.1186/s12864-020-07041-8.
7
GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.GI-POP:一种组合注释和基因组岛预测管道,用于正在进行的微生物基因组项目。
Gene. 2013 Apr 10;518(1):114-23. doi: 10.1016/j.gene.2012.11.063. Epub 2013 Jan 12.
8
MEGAnnotator: a user-friendly pipeline for microbial genomes assembly and annotation.MEGAnnotator:一个用于微生物基因组组装和注释的用户友好型流程。
FEMS Microbiol Lett. 2016 Apr;363(7). doi: 10.1093/femsle/fnw049. Epub 2016 Mar 1.
9
Comparative genomics of transport proteins in probiotic and pathogenic Escherichia coli and Salmonella enterica strains.益生菌和致病性大肠杆菌及肠炎沙门氏菌菌株中转运蛋白的比较基因组学
Microb Pathog. 2017 Jun;107:106-115. doi: 10.1016/j.micpath.2017.03.022. Epub 2017 Mar 24.
10
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.当全基因组比对无法奏效时:用于数百个微生物基因组的无比对单核苷酸多态性(SNP)发现及系统发育分析的kSNP v2软件
PLoS One. 2013 Dec 9;8(12):e81760. doi: 10.1371/journal.pone.0081760. eCollection 2013.

本文引用的文献

1
Scalable and versatile container-based pipelines for de novo genome assembly and bacterial annotation.可扩展且通用的基于容器的流水线,用于从头基因组组装和细菌注释。
F1000Res. 2023 Sep 25;12:1205. doi: 10.12688/f1000research.139488.1. eCollection 2023.
2
BacSeq: A User-Friendly Automated Pipeline for Whole-Genome Sequence Analysis of Bacterial Genomes.BacSeq:一种用于细菌基因组全基因组序列分析的用户友好型自动化流程。
Microorganisms. 2023 Jul 6;11(7):1769. doi: 10.3390/microorganisms11071769.
3
simpiTB - a pipeline designed to extract meaningful information from whole genome sequencing data of Mycobacterium tuberculosis complex, allows to combine genomic, phylogenetic and clustering analyses in existing SITVIT databases.
simpiTB - 一个从结核分枝杆菌复合群全基因组测序数据中提取有意义信息的管道,允许将基因组、系统发育和聚类分析结合到现有的 SITVIT 数据库中。
Infect Genet Evol. 2023 Sep;113:105466. doi: 10.1016/j.meegid.2023.105466. Epub 2023 Jun 16.
4
Distribution of Extremely Heat-Resistant Escherichia coli in the Beef Production and Processing Continuum.极端耐热型大肠杆菌在牛肉生产和加工连续体中的分布。
J Food Prot. 2023 Jan;86(1):100031. doi: 10.1016/j.jfp.2022.100031. Epub 2022 Dec 21.
5
Characterization of Escherichia coli harboring colibactin genes (clb) isolated from beef production and processing systems.从牛肉生产和加工系统中分离的携带 colibactin 基因(clb)的大肠杆菌的特性。
Sci Rep. 2022 Mar 29;12(1):5305. doi: 10.1038/s41598-022-09274-x.
6
Twenty-Four-Month Longitudinal Study Suggests Little to No Horizontal Gene Transfer In Situ between Third-Generation Cephalosporin-Resistant Salmonella and Third-Generation Cephalosporin-Resistant Escherichia coli in a Beef Cattle Feedyard.一项为期24个月的纵向研究表明,在肉牛饲养场中,耐第三代头孢菌素的沙门氏菌和耐第三代头孢菌素的大肠杆菌之间几乎没有或不存在原位水平基因转移。
J Food Prot. 2022 Feb 1;85(2):323-335. doi: 10.4315/JFP-21-371.
7
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
8
Easy phylotyping of the EzClermont web app and command-line tool.EzClermont网络应用程序和命令行工具的简易系统发育分型。
Access Microbiol. 2020 Jun 19;2(9):acmi000143. doi: 10.1099/acmi.0.000143. eCollection 2020.
9
ResFinder 4.0 for predictions of phenotypes from genotypes.ResFinder 4.0 用于基因型到表型的预测。
J Antimicrob Chemother. 2020 Dec 1;75(12):3491-3500. doi: 10.1093/jac/dkaa345.
10
BacPipe: A Rapid, User-Friendly Whole-Genome Sequencing Pipeline for Clinical Diagnostic Bacteriology.BacPipe:一种用于临床诊断细菌学的快速、用户友好的全基因组测序流程。
iScience. 2020 Jan 24;23(1):100769. doi: 10.1016/j.isci.2019.100769. Epub 2019 Dec 12.