• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPHoP:一种集成机器学习框架,用于最大化基于宏基因组的古菌和细菌病毒的宿主预测。

iPHoP: An integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria.

机构信息

DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America.

Instituto de Ciencias del Mar (ICM-CSIC), Barcelona, Spain.

出版信息

PLoS Biol. 2023 Apr 21;21(4):e3002083. doi: 10.1371/journal.pbio.3002083. eCollection 2023 Apr.

DOI:10.1371/journal.pbio.3002083
PMID:37083735
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10155999/
Abstract

The extraordinary diversity of viruses infecting bacteria and archaea is now primarily studied through metagenomics. While metagenomes enable high-throughput exploration of the viral sequence space, metagenome-derived sequences lack key information compared to isolated viruses, in particular host association. Different computational approaches are available to predict the host(s) of uncultivated viruses based on their genome sequences, but thus far individual approaches are limited either in precision or in recall, i.e., for a number of viruses they yield erroneous predictions or no prediction at all. Here, we describe iPHoP, a two-step framework that integrates multiple methods to reliably predict host taxonomy at the genus rank for a broad range of viruses infecting bacteria and archaea, while retaining a low false discovery rate. Based on a large dataset of metagenome-derived virus genomes from the IMG/VR database, we illustrate how iPHoP can provide extensive host prediction and guide further characterization of uncultivated viruses.

摘要

目前,通过宏基因组学主要研究感染细菌和古菌的病毒的非凡多样性。虽然宏基因组能够高通量地探索病毒的序列空间,但与分离的病毒相比,宏基因组衍生的序列缺乏关键信息,特别是宿主相关性。有不同的计算方法可用于根据基因组序列预测未培养病毒的宿主,但到目前为止,每种方法要么在精度上,要么在召回率上都受到限制,即对于一些病毒,它们会产生错误的预测或根本没有预测。在这里,我们描述了 iPHoP,这是一个两步框架,它集成了多种方法,可以可靠地预测感染细菌和古菌的广泛病毒的属分类群宿主,同时保持低假阳性率。基于来自 IMG/VR 数据库的大量宏基因组衍生病毒基因组数据集,我们说明了 iPHoP 如何能够提供广泛的宿主预测,并指导未培养病毒的进一步特征描述。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/4f21b402f9e8/pbio.3002083.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/7f72a49f2fcb/pbio.3002083.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/e583d9a12e6b/pbio.3002083.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/ee2d55fe560f/pbio.3002083.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/8c84d5ce59d3/pbio.3002083.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/4f21b402f9e8/pbio.3002083.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/7f72a49f2fcb/pbio.3002083.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/e583d9a12e6b/pbio.3002083.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/ee2d55fe560f/pbio.3002083.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/8c84d5ce59d3/pbio.3002083.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7872/10155999/4f21b402f9e8/pbio.3002083.g005.jpg

相似文献

1
iPHoP: An integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria.iPHoP:一种集成机器学习框架,用于最大化基于宏基因组的古菌和细菌病毒的宿主预测。
PLoS Biol. 2023 Apr 21;21(4):e3002083. doi: 10.1371/journal.pbio.3002083. eCollection 2023 Apr.
2
VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences.VIBRANT:从基因组序列中自动恢复、注释和培养微生物病毒,并评估病毒群落功能。
Microbiome. 2020 Jun 10;8(1):90. doi: 10.1186/s40168-020-00867-0.
3
A genomic catalog of Earth's microbiomes.地球微生物组的基因组目录。
Nat Biotechnol. 2021 Apr;39(4):499-509. doi: 10.1038/s41587-020-0718-6. Epub 2020 Nov 9.
4
Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks.基于基因共享网络的未培养原核病毒基因组的分类学分配。
Nat Biotechnol. 2019 Jun;37(6):632-639. doi: 10.1038/s41587-019-0100-8. Epub 2019 May 6.
5
IMG/VR v4: an expanded database of uncultivated virus genomes within a framework of extensive functional, taxonomic, and ecological metadata.IMG/VR v4:一个扩展的未培养病毒基因组数据库,其中包含广泛的功能、分类和生态元数据框架。
Nucleic Acids Res. 2023 Jan 6;51(D1):D733-D743. doi: 10.1093/nar/gkac1037.
6
IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses.IMG/VR v3:用于探究未培养病毒基因组的综合生态和进化框架。
Nucleic Acids Res. 2021 Jan 8;49(D1):D764-D775. doi: 10.1093/nar/gkaa946.
7
Comprehensive discovery of CRISPR-targeted terminally redundant sequences in the human gut metagenome: Viruses, plasmids, and more.全面发现人类肠道宏基因组中 CRISPR 靶向的末端冗余序列:病毒、质粒等。
PLoS Comput Biol. 2021 Oct 21;17(10):e1009428. doi: 10.1371/journal.pcbi.1009428. eCollection 2021 Oct.
8
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data.VirFinder:一种新型的基于 k-mer 的工具,用于从组装的宏基因组数据中识别病毒序列。
Microbiome. 2017 Jul 6;5(1):69. doi: 10.1186/s40168-017-0283-5.
9
Lytic archaeal viruses infect abundant primary producers in Earth's crust.溶原性古菌病毒感染了地壳中丰富的初级生产者。
Nat Commun. 2021 Jul 30;12(1):4642. doi: 10.1038/s41467-021-24803-4.
10
Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.利用长读长宏基因组学分解旧金山河口微生物组,揭示了从微微型真核生物到病毒的种属和菌株水平的优势。
mSystems. 2024 Sep 17;9(9):e0024224. doi: 10.1128/msystems.00242-24. Epub 2024 Aug 19.

引用本文的文献

1
Phage quest: a beginner's guide to explore viral diversity in the prokaryotic world.噬菌体探索:探索原核生物世界中病毒多样性的初学者指南。
Brief Bioinform. 2025 Aug 31;26(5). doi: 10.1093/bib/bbaf449.
2
A prevalent huge phage clade in human and animal gut microbiomes.在人类和动物肠道微生物群中普遍存在的一个巨大噬菌体分支。
Res Sq. 2025 Aug 19:rs.3.rs-7356405. doi: 10.21203/rs.3.rs-7356405/v1.
3
Unique plastisphere viromes with habitat-dependent potential for modulating global methane cycle.具有依赖栖息地调节全球甲烷循环潜力的独特塑料球病毒群落。

本文引用的文献

1
vHULK, a New Tool for Bacteriophage Host Prediction Based on Annotated Genomic Features and Neural Networks.vHULK,一种基于注释基因组特征和神经网络的噬菌体宿主预测新工具。
Phage (New Rochelle). 2022 Dec 1;3(4):204-212. doi: 10.1089/phage.2021.0016. Epub 2022 Dec 19.
2
INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes.噬菌体参考数据库的基础设施:识别当前培养噬菌体基因组集合中的大规模偏差
Phage (New Rochelle). 2021 Dec 1;2(4):214-223. doi: 10.1089/phage.2021.0007. Epub 2021 Dec 16.
3
CHERRY: a Computational metHod for accuratE pRediction of virus-pRokarYotic interactions using a graph encoder-decoder model.
Nat Commun. 2025 Aug 29;16(1):8098. doi: 10.1038/s41467-025-63215-6.
4
Zeta CrAss-like Phages, a Separate Phage Family Using a Variety of Adaptive Mechanisms to Persist in Their Hosts.类Zeta CrAss噬菌体,一个利用多种适应性机制在宿主中持续存在的独立噬菌体家族。
Int J Mol Sci. 2025 Aug 8;26(16):7694. doi: 10.3390/ijms26167694.
5
The hidden genetic reservoir: structural variants as drivers of marine microbial and viral microdiversity.隐藏的基因库:结构变异作为海洋微生物和病毒微多样性的驱动因素
Environ Microbiome. 2025 Aug 25;20(1):110. doi: 10.1186/s40793-025-00773-8.
6
The genetic diversity and populational specificity of the human gut virome at single-nucleotide resolution.单核苷酸分辨率下人类肠道病毒组的遗传多样性和群体特异性
Microbiome. 2025 Aug 20;13(1):188. doi: 10.1186/s40168-025-02185-9.
7
A prevalent huge phage clade in human and animal gut microbiomes.人类和动物肠道微生物群中普遍存在的一个巨大噬菌体分支。
bioRxiv. 2025 Aug 11:2025.08.10.669567. doi: 10.1101/2025.08.10.669567.
8
Delivery mode impacts gut bacteriophage colonization during infancy.分娩方式影响婴儿期肠道噬菌体的定植。
Gut Microbes Rep. 2025;2(1). doi: 10.1080/29933935.2025.2464631. Epub 2025 Mar 14.
9
Global dominance of Haloquadratum walsbyi by a single genomovar with distinct gene content and viral cohorts from close relatives.由具有独特基因内容和来自近亲的病毒群落的单个基因组变种在全球范围内主导沃氏嗜盐碱杆菌。
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf165.
10
Genomic insights into bacteriophages: a new frontier in AMR detection and phage therapy.噬菌体的基因组洞察:抗菌药物耐药性检测和噬菌体疗法的新前沿。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elaf011.
CHERRY:一种基于图编解码模型的病毒-原核生物相互作用精确预测的计算方法。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac182.
4
PHISDetector: A Tool to Detect Diverse In Silico Phage-host Interaction Signals for Virome Studies.PHISDetector:一种用于检测病毒组研究中多样化的计算噬菌体-宿主相互作用信号的工具。
Genomics Proteomics Bioinformatics. 2022 Jun;20(3):508-523. doi: 10.1016/j.gpb.2022.02.003. Epub 2022 Mar 8.
5
PHIST: fast and accurate prediction of prokaryotic hosts from metagenomic viral sequences.PHIST:从宏基因组病毒序列中快速准确地预测原核宿主。
Bioinformatics. 2022 Feb 7;38(5):1447-1449. doi: 10.1093/bioinformatics/btab837.
6
Minnesota peat viromes reveal terrestrial and aquatic niche partitioning for local and global viral populations.明尼苏达泥炭病毒组揭示了本地和全球病毒群体的陆地和水生小生境分区。
Microbiome. 2021 Nov 26;9(1):233. doi: 10.1186/s40168-021-01156-0.
7
Taxonomy-aware, sequence similarity ranking reliably predicts phage-host relationships.基于分类学的序列相似性排序可可靠地预测噬菌体-宿主关系。
BMC Biol. 2021 Oct 8;19(1):223. doi: 10.1186/s12915-021-01146-6.
8
DeepHost: phage host prediction with convolutional neural network.DeepHost:基于卷积神经网络的噬菌体宿主预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab385.
9
GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy.GTDB:通过系统发生一致、等级归一化和基于完整基因组的分类学,对细菌和古菌多样性进行持续普查。
Nucleic Acids Res. 2022 Jan 7;50(D1):D785-D794. doi: 10.1093/nar/gkab776.
10
PHIAF: prediction of phage-host interactions with GAN-based data augmentation and sequence-based feature fusion.PHIAF:基于 GAN 的数据增强和基于序列的特征融合的噬菌体-宿主相互作用预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab348.