• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 metashot/prok-quality 对原核生物基因组进行大规模质量评估。

Large-scale quality assessment of prokaryotic genomes with metashot/prok-quality.

机构信息

Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, TN, 38098, Italy.

出版信息

F1000Res. 2021 Aug 17;10:822. doi: 10.12688/f1000research.54418.1. eCollection 2021.

DOI:10.12688/f1000research.54418.1
PMID:35136576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8804904/
Abstract

Metagenomic sequencing allows large-scale identification and genomic characterization. Binning is the process of recovering genomes from complex mixtures of sequence fragments (metagenome contigs) of unknown bacteria and archaeal species. Assessing the quality of genomes recovered from metagenomes requires the use of complex pipelines involving many independent steps, often difficult to reproduce and maintain. A comprehensive, automated and easy-to-use computational workflow for the quality assessment of draft prokaryotic genomes, based on container technology, would greatly improve reproducibility and reusability of published results. We present metashot/prok-quality, a container-enabled Nextflow pipeline for quality assessment and genome dereplication. The metashot/prok-quality tool produces genome quality reports that are compliant with the Minimum Information about a Metagenome-Assembled Genome (MIMAG) standard, and can run out-of-the-box on any platform that supports Nextflow, Docker or Singularity, including computing clusters or batch infrastructures in the cloud. metashot/prok-quality is part of the metashot collection of analysis pipelines. Workflow and documentation are available under GPL3 licence on GitHub.

摘要

宏基因组测序允许大规模识别和基因组特征分析。 分箱是从未知细菌和古菌物种的复杂序列片段(宏基因组序列)混合物中恢复基因组的过程。 评估从宏基因组中恢复的基因组的质量需要使用涉及许多独立步骤的复杂管道,这些步骤通常难以复制和维护。 基于容器技术,为基于容器技术的 draft 原核基因组质量评估提供一种全面、自动化和易于使用的计算工作流程,将极大地提高已发表结果的可重复性和可重用性。 我们提出了 metashot/prok-quality,这是一个基于 Nextflow 的容器化管道,用于质量评估和基因组去重复。 metashot/prok-quality 工具生成符合宏基因组组装基因组最低信息(Minimum Information about a Metagenome-Assembled Genome,MIMAG)标准的基因组质量报告,并且可以在任何支持 Nextflow、Docker 或 Singularity 的平台上开箱即用,包括计算集群或云批次基础设施。 metashot/prok-quality 是 metashot 分析管道集合的一部分。 工作流程和文档可在 GitHub 上根据 GPL3 许可证获得。

相似文献

1
Large-scale quality assessment of prokaryotic genomes with metashot/prok-quality.使用 metashot/prok-quality 对原核生物基因组进行大规模质量评估。
F1000Res. 2021 Aug 17;10:822. doi: 10.12688/f1000research.54418.1. eCollection 2021.
2
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
3
Recovering prokaryotic genomes from host-associated, short-read shotgun metagenomic sequencing data.从宿主相关的短读 shotgun 宏基因组测序数据中回收原核基因组。
Nat Protoc. 2021 May;16(5):2520-2541. doi: 10.1038/s41596-021-00508-2. Epub 2021 Apr 16.
4
VEBA: a modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes.VEBA:一个用于元基因组中细菌、微真核生物和病毒基因组的从头组装、聚类和分析的模块化端到端套件。
BMC Bioinformatics. 2022 Oct 12;23(1):419. doi: 10.1186/s12859-022-04973-8.
5
MAGNETO: An Automated Workflow for Genome-Resolved Metagenomics.MAGNETO:基因组解析宏基因组学的自动化工作流程。
mSystems. 2022 Aug 30;7(4):e0043222. doi: 10.1128/msystems.00432-22. Epub 2022 Jun 15.
6
Genome-resolved metagenomics using environmental and clinical samples.基于环境和临床样本的基因组解析宏基因组学。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab030.
7
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.近 8000 个宏基因组组装基因组的恢复极大地扩展了生命之树。
Nat Microbiol. 2017 Nov;2(11):1533-1542. doi: 10.1038/s41564-017-0012-7. Epub 2017 Sep 11.
8
binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets.binny:一种自动化的分箱算法,可从复杂的宏基因组数据集中恢复高质量的基因组。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac431.
9
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea.细菌和古菌单扩增基因组(MISAG)及宏基因组组装基因组(MIMAG)的最低信息要求
Nat Biotechnol. 2017 Aug 8;35(8):725-731. doi: 10.1038/nbt.3893.
10
ACR: metagenome-assembled prokaryotic and eukaryotic genome refinement tool.ACR:宏基因组组装原核生物和真核生物基因组精修工具。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad381.

引用本文的文献

1
Draft genome sequence of an uncultured archaeon from Antarctic endolithic communities.来自南极石内微生物群落的一种未培养古菌的基因组序列草图
Microbiol Resour Announc. 2025 Aug 14;14(8):e0042725. doi: 10.1128/mra.00427-25. Epub 2025 Jul 23.
2
Multi-omic stock of surface ocean microbiome built by monthly, weekly and daily sampling in Dapeng Bay, China.通过在中国大鹏湾进行月度、每周和每日采样构建的表层海洋微生物群落多组学库。
Sci Data. 2025 Mar 4;12(1):378. doi: 10.1038/s41597-025-04669-7.
3
Navigating the Complex Terrain of Methane Synthesis: Multienzyme Control Points and Data-Driven Strategies.

本文引用的文献

1
Contamination in Reference Sequence Databases: Time for Divide-and-Rule Tactics.参考序列数据库中的污染:是时候采取分而治之的策略了。
Front Microbiol. 2021 Oct 22;12:755101. doi: 10.3389/fmicb.2021.755101. eCollection 2021.
2
GUNC: detection of chimerism and contamination in prokaryotic genomes.GUNC:原核基因组嵌合体和污染的检测。
Genome Biol. 2021 Jun 13;22(1):178. doi: 10.1186/s13059-021-02393-0.
3
Improved metagenome binning and assembly using deep variational autoencoders.利用深度变分自动编码器改进宏基因组的分类和组装。
探索甲烷合成的复杂领域:多酶控制点与数据驱动策略
ACS Omega. 2024 Dec 20;10(1):93-101. doi: 10.1021/acsomega.3c05803. eCollection 2025 Jan 14.
4
Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation.天然产物发现的先进方法:生物活性筛选、去重复、代谢组学分析、基因组测序、数据库和信息工具以及结构解析。
Mar Drugs. 2023 May 19;21(5):308. doi: 10.3390/md21050308.
5
Size-fractionated microbiome observed during an eight-month long sampling in Jiaozhou Bay and the Yellow Sea.在胶州湾和黄海进行的为期八个月的采样中观察到的微生物组大小分级。
Sci Data. 2022 Oct 7;9(1):605. doi: 10.1038/s41597-022-01734-3.
6
Genome sequencing provides new insights on the distribution of Erwinia amylovora lineages in northern Italy.基因组测序为意大利北部肠杆菌属的分布提供了新的见解。
Environ Microbiol Rep. 2022 Aug;14(4):584-590. doi: 10.1111/1758-2229.13074. Epub 2022 Apr 28.
Nat Biotechnol. 2021 May;39(5):555-560. doi: 10.1038/s41587-020-00777-4. Epub 2021 Jan 4.
4
To Dereplicate or Not To Dereplicate?去重还是不去重?
mSphere. 2020 May 20;5(3):e00971-19. doi: 10.1128/mSphere.00971-19.
5
Accurate and complete genomes from metagenomes.从宏基因组中获得准确和完整的基因组。
Genome Res. 2020 Mar;30(3):315-333. doi: 10.1101/gr.258640.119. Epub 2020 Mar 18.
6
The nf-core framework for community-curated bioinformatics pipelines.用于社区策划生物信息学流程的nf-core框架。
Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x.
7
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.MetaBAT 2:一种用于从宏基因组组装中进行稳健且高效的基因组重建的自适应分箱算法。
PeerJ. 2019 Jul 26;7:e7359. doi: 10.7717/peerj.7359. eCollection 2019.
8
Scalable Workflows and Reproducible Data Analysis for Genomics.基因组学的可扩展工作流程和可重复数据分析
Methods Mol Biol. 2019;1910:723-745. doi: 10.1007/978-1-4939-9074-0_24.
9
Composite Metagenome-Assembled Genomes Reduce the Quality of Public Genome Repositories.复合宏基因组组装基因组降低了公共基因组库的质量。
mBio. 2019 Jun 4;10(3):e00725-19. doi: 10.1128/mBio.00725-19.
10
tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences.tRNAscan-SE:在基因组序列中搜索tRNA基因。
Methods Mol Biol. 2019;1962:1-14. doi: 10.1007/978-1-4939-9173-0_1.