• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VEBA:一个用于元基因组中细菌、微真核生物和病毒基因组的从头组装、聚类和分析的模块化端到端套件。

VEBA: a modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes.

机构信息

Department of Environment and Sustainability, J. Craig Venter Institute, 4120 Capricorn Ln, La Jolla, CA, 92037, USA.

Department of Human Biology and Genomic Medicine, J. Craig Venter Institute, La Jolla, CA, 92037, USA.

出版信息

BMC Bioinformatics. 2022 Oct 12;23(1):419. doi: 10.1186/s12859-022-04973-8.

DOI:10.1186/s12859-022-04973-8
PMID:36224545
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9554839/
Abstract

BACKGROUND

With the advent of metagenomics, the importance of microorganisms and how their interactions are relevant to ecosystem resilience, sustainability, and human health has become evident. Cataloging and preserving biodiversity is paramount not only for the Earth's natural systems but also for discovering solutions to challenges that we face as a growing civilization. Metagenomics pertains to the in silico study of all microorganisms within an ecological community in situ, however, many software suites recover only prokaryotes and have limited to no support for viruses and eukaryotes.

RESULTS

In this study, we introduce the Viral Eukaryotic Bacterial Archaeal (VEBA) open-source software suite developed to recover genomes from all domains. To our knowledge, VEBA is the first end-to-end metagenomics suite that can directly recover, quality assess, and classify prokaryotic, eukaryotic, and viral genomes from metagenomes. VEBA implements a novel iterative binning procedure and hybrid sample-specific/multi-sample framework that yields more genomes than any existing methodology alone. VEBA includes a consensus microeukaryotic database containing proteins from existing databases to optimize microeukaryotic gene modeling and taxonomic classification. VEBA also provides a unique clustering-based dereplication strategy allowing for sample-specific genomes and genes to be directly compared across non-overlapping biological samples. Finally, VEBA is the only pipeline that automates the detection of candidate phyla radiation bacteria and implements the appropriate genome quality assessments. VEBA's capabilities are demonstrated by reanalyzing 3 existing public datasets which recovered a total of 948 MAGs (458 prokaryotic, 8 eukaryotic, and 482 viral) including several uncharacterized organisms and organisms with no public genome representatives.

CONCLUSIONS

The VEBA software suite allows for the in silico recovery of microorganisms from all domains of life by integrating cutting edge algorithms in novel ways. VEBA fully integrates both end-to-end and task-specific metagenomic analysis in a modular architecture that minimizes dependencies and maximizes productivity. The contributions of VEBA to the metagenomics community includes seamless end-to-end metagenomics analysis but also provides users with the flexibility to perform specific analytical tasks. VEBA allows for the automation of several metagenomics steps and shows that new information can be recovered from existing datasets.

摘要

背景

随着宏基因组学的出现,微生物及其相互作用与生态系统弹性、可持续性和人类健康的相关性的重要性变得显而易见。对生物多样性进行编目和保存不仅对地球的自然系统至关重要,而且对于发现我们作为一个不断发展的文明所面临的挑战的解决方案也至关重要。宏基因组学涉及对原位生态群落中所有微生物的计算机研究,然而,许多软件套件仅能恢复原核生物,并且对病毒和真核生物的支持有限或没有支持。

结果

在这项研究中,我们介绍了病毒真核生物细菌古菌(VEBA)开源软件套件,该套件旨在从所有领域恢复基因组。据我们所知,VEBA 是第一个端到端的宏基因组学套件,可以直接从宏基因组中恢复、质量评估和分类原核生物、真核生物和病毒基因组。VEBA 实现了一种新颖的迭代分箱过程和混合样本特异性/多样本框架,比任何现有方法单独产生的基因组都多。VEBA 包含一个包含现有数据库中蛋白质的共识微真核生物数据库,以优化微真核生物基因建模和分类学分类。VEBA 还提供了一种独特的基于聚类的去重复策略,允许在非重叠的生物样本中直接比较特定于样本的基因组和基因。最后,VEBA 是唯一自动检测候选门辐射细菌并实施适当基因组质量评估的管道。VEBA 的功能通过重新分析 3 个现有的公共数据集得到了证明,共恢复了 948 个 MAG(458 个原核生物、8 个真核生物和 482 个病毒),包括几个未被表征的生物体和没有公共基因组代表的生物体。

结论

VEBA 软件套件通过以新颖的方式整合前沿算法,允许从生命的所有领域的计算机中恢复微生物。VEBA 以最小化依赖性和最大化生产力的模块化架构完全集成了端到端和特定于任务的宏基因组分析。VEBA 为宏基因组学社区做出的贡献包括无缝的端到端宏基因组学分析,但也为用户提供了执行特定分析任务的灵活性。VEBA 允许自动化几个宏基因组学步骤,并表明可以从现有数据集中恢复新信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/83082da07429/12859_2022_4973_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/3d739d9d8414/12859_2022_4973_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/48d859f7ff02/12859_2022_4973_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/b1510015dfce/12859_2022_4973_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/2d74003a576f/12859_2022_4973_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/83082da07429/12859_2022_4973_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/3d739d9d8414/12859_2022_4973_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/48d859f7ff02/12859_2022_4973_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/b1510015dfce/12859_2022_4973_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/2d74003a576f/12859_2022_4973_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d46/9554991/83082da07429/12859_2022_4973_Fig5_HTML.jpg

相似文献

1
VEBA: a modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes.VEBA:一个用于元基因组中细菌、微真核生物和病毒基因组的从头组装、聚类和分析的模块化端到端套件。
BMC Bioinformatics. 2022 Oct 12;23(1):419. doi: 10.1186/s12859-022-04973-8.
2
Unveiling the microbial realm with VEBA 2.0: a modular bioinformatics suite for end-to-end genome-resolved prokaryotic, (micro)eukaryotic and viral multi-omics from either short- or long-read sequencing.揭示微生物世界的 VEBA 2.0:一个用于从短读或长读测序中进行端到端基因组解析的原核生物、(微)真核生物和病毒多组学的模块化生物信息学套件。
Nucleic Acids Res. 2024 Aug 12;52(14):e63. doi: 10.1093/nar/gkae528.
3
Unveiling the Microbial Realm with VEBA 2.0: A modular bioinformatics suite for end-to-end genome-resolved prokaryotic, (micro)eukaryotic, and viral multi-omics from either short- or long-read sequencing.利用VEBA 2.0揭示微生物领域:一个模块化生物信息学套件,用于从短读长或长读长测序进行端到端的基因组解析原核生物、(微)真核生物和病毒多组学分析。
bioRxiv. 2024 Mar 11:2024.03.08.583560. doi: 10.1101/2024.03.08.583560.
4
ACR: metagenome-assembled prokaryotic and eukaryotic genome refinement tool.ACR:宏基因组组装原核生物和真核生物基因组精修工具。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad381.
5
MAGNETO: An Automated Workflow for Genome-Resolved Metagenomics.MAGNETO:基因组解析宏基因组学的自动化工作流程。
mSystems. 2022 Aug 30;7(4):e0043222. doi: 10.1128/msystems.00432-22. Epub 2022 Jun 15.
6
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
7
Anomalous Phylogenetic Behavior of Ribosomal Proteins in Metagenome-Assembled Asgard Archaea.宏基因组组装的古菌“阿斯加德”中核糖体蛋白的异常系统发育行为。
Genome Biol Evol. 2021 Jan 7;13(1). doi: 10.1093/gbe/evaa238.
8
Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.利用长读长宏基因组学分解旧金山河口微生物组,揭示了从微微型真核生物到病毒的种属和菌株水平的优势。
mSystems. 2024 Sep 17;9(9):e0024224. doi: 10.1128/msystems.00242-24. Epub 2024 Aug 19.
9
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.近 8000 个宏基因组组装基因组的恢复极大地扩展了生命之树。
Nat Microbiol. 2017 Nov;2(11):1533-1542. doi: 10.1038/s41564-017-0012-7. Epub 2017 Sep 11.
10
Ecosystem-wide metagenomic binning enables prediction of ecological niches from genomes.全生态系统宏基因组分箱使从基因组预测生态位成为可能。
Commun Biol. 2020 Mar 13;3(1):119. doi: 10.1038/s42003-020-0856-x.

引用本文的文献

1
Live bacteria in gut microbiome dictate asthma onset triggered by environmental particles via modulation of DNA methylation in dendritic cells.肠道微生物群中的活细菌通过调节树突状细胞中的DNA甲基化来决定由环境颗粒引发的哮喘发作。
Cell Rep. 2025 May 27;44(5):115684. doi: 10.1016/j.celrep.2025.115684. Epub 2025 May 13.
2
Eukfinder: a pipeline to retrieve microbial eukaryote genome sequences from metagenomic data.Eukfinder:一种从宏基因组数据中检索微生物真核生物基因组序列的流程。
mBio. 2025 May 14;16(5):e0069925. doi: 10.1128/mbio.00699-25. Epub 2025 Apr 10.
3
Multiomic Insights into Human Health: Gut Microbiomes of Hunter-Gatherer, Agropastoral, and Western Urban Populations.

本文引用的文献

1
Eukaryotic genomes from a global metagenomic data set illuminate trophic modes and biogeography of ocean plankton.从全球宏基因组数据集揭示真核生物基因组揭示海洋浮游生物的营养模式和生物地理学。
mBio. 2023 Dec 19;14(6):e0167623. doi: 10.1128/mbio.01676-23. Epub 2023 Nov 10.
2
CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning.CheckM2:一种使用机器学习快速、可扩展且准确评估微生物基因组质量的工具。
Nat Methods. 2023 Aug;20(8):1203-1212. doi: 10.1038/s41592-023-01940-w. Epub 2023 Jul 27.
3
Functional repertoire convergence of distantly related eukaryotic plankton lineages abundant in the sunlit ocean.
对人类健康的多组学洞察:狩猎采集者、农牧民和西方城市人群的肠道微生物群
bioRxiv. 2024 Sep 4:2024.09.03.611095. doi: 10.1101/2024.09.03.611095.
4
Sputum production and salivary microbiome in COVID-19 patients reveals oral-lung axis.新冠病毒患者的痰液产生和唾液微生物组揭示了口-肺轴。
PLoS One. 2024 Jul 25;19(7):e0300408. doi: 10.1371/journal.pone.0300408. eCollection 2024.
5
Unveiling the microbial realm with VEBA 2.0: a modular bioinformatics suite for end-to-end genome-resolved prokaryotic, (micro)eukaryotic and viral multi-omics from either short- or long-read sequencing.揭示微生物世界的 VEBA 2.0:一个用于从短读或长读测序中进行端到端基因组解析的原核生物、(微)真核生物和病毒多组学的模块化生物信息学套件。
Nucleic Acids Res. 2024 Aug 12;52(14):e63. doi: 10.1093/nar/gkae528.
6
Host-microbiome associations in saliva predict COVID-19 severity.唾液中的宿主-微生物组关联可预测新冠病毒疾病的严重程度。
PNAS Nexus. 2024 Mar 25;3(4):pgae126. doi: 10.1093/pnasnexus/pgae126. eCollection 2024 Apr.
7
Unveiling the Microbial Realm with VEBA 2.0: A modular bioinformatics suite for end-to-end genome-resolved prokaryotic, (micro)eukaryotic, and viral multi-omics from either short- or long-read sequencing.利用VEBA 2.0揭示微生物领域:一个模块化生物信息学套件,用于从短读长或长读长测序进行端到端的基因组解析原核生物、(微)真核生物和病毒多组学分析。
bioRxiv. 2024 Mar 11:2024.03.08.583560. doi: 10.1101/2024.03.08.583560.
8
Metagenomics and metatranscriptomics as potential driving forces for the exploration of diversity and functions of micro-eukaryotes in soil.宏基因组学和宏转录组学作为探索土壤中微型真核生物多样性和功能的潜在驱动力。
3 Biotech. 2023 Dec;13(12):423. doi: 10.1007/s13205-023-03841-3. Epub 2023 Nov 30.
9
Genus-Wide Transcriptional Landscapes Reveal Correlated Gene Networks Underlying Microevolutionary Divergence in Diatoms.属水平转录组图谱揭示了硅藻微观进化分歧中相关基因网络的基础。
Mol Biol Evol. 2023 Oct 4;40(10). doi: 10.1093/molbev/msad218.
10
Illuminating the oral microbiome and its host interactions: recent advancements in omics and bioinformatics technologies in the context of oral microbiome research.阐明口腔微生物组及其宿主相互作用:组学和生物信息学技术在口腔微生物组研究背景下的最新进展。
FEMS Microbiol Rev. 2023 Sep 5;47(5). doi: 10.1093/femsre/fuad051.
阳光照射的海洋中丰富的远缘真核浮游生物谱系的功能库趋同。
Cell Genom. 2022 Apr 28;2(5):100123. doi: 10.1016/j.xgen.2022.100123. eCollection 2022 May 11.
4
Metagenome-assembled genomes of phytoplankton microbiomes from the Arctic and Atlantic Oceans.北极和大西洋浮游植物微生物组的宏基因组组装基因组。
Microbiome. 2022 Apr 28;10(1):67. doi: 10.1186/s40168-022-01254-7.
5
Association of zoonotic protozoan parasites with microplastics in seawater and implications for human and wildlife health.海水中人畜共患原生动物寄生虫与微塑料的关联及其对人类和野生动物健康的影响。
Sci Rep. 2022 Apr 26;12(1):6532. doi: 10.1038/s41598-022-10485-5.
6
mAbs N-glycosylation: Implications for biotechnology and analytics.单克隆抗体 N-糖基化:对生物技术和分析学的影响。
Carbohydr Res. 2022 Apr;514:108541. doi: 10.1016/j.carres.2022.108541. Epub 2022 Mar 17.
7
A review on architecture with fungal biomaterials: the desired and the feasible.关于真菌生物材料构建的综述:理想与可行
Fungal Biol Biotechnol. 2021 Nov 19;8(1):17. doi: 10.1186/s40694-021-00124-5.
8
Polystyrene microplastics induced female reproductive toxicity in mice.聚苯乙烯微塑料诱导小鼠雌性生殖毒性。
J Hazard Mater. 2022 Feb 15;424(Pt C):127629. doi: 10.1016/j.jhazmat.2021.127629. Epub 2021 Oct 30.
9
Tiara: deep learning-based classification system for eukaryotic sequences.Tiara:基于深度学习的真核序列分类系统。
Bioinformatics. 2022 Jan 3;38(2):344-350. doi: 10.1093/bioinformatics/btab672.
10
coronaSPAdes: from biosynthetic gene clusters to RNA viral assemblies.coronaSPAdes:从生物合成基因簇到 RNA 病毒组装。
Bioinformatics. 2021 Dec 22;38(1):1-8. doi: 10.1093/bioinformatics/btab597.