• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

原核病毒宿主预测器:一种用于宏基因组中原核病毒宿主预测的高斯模型。

Prokaryotic virus host predictor: a Gaussian model for host prediction of prokaryotic viruses in metagenomics.

机构信息

Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha, China.

Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.

出版信息

BMC Biol. 2021 Jan 14;19(1):5. doi: 10.1186/s12915-020-00938-6.

DOI:10.1186/s12915-020-00938-6
PMID:33441133
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7807511/
Abstract

BACKGROUND

Viruses are ubiquitous biological entities, estimated to be the largest reservoirs of unexplored genetic diversity on Earth. Full functional characterization and annotation of newly discovered viruses requires tools to enable taxonomic assignment, the range of hosts, and biological properties of the virus. Here we focus on prokaryotic viruses, which include phages and archaeal viruses, and for which identifying the viral host is an essential step in characterizing the virus, as the virus relies on the host for survival. Currently, the method for determining the viral host is either to culture the virus, which is low-throughput, time-consuming, and expensive, or to computationally predict the viral hosts, which needs improvements at both accuracy and usability. Here we develop a Gaussian model to predict hosts for prokaryotic viruses with better performances than previous computational methods.

RESULTS

We present here Prokaryotic virus Host Predictor (PHP), a software tool using a Gaussian model, to predict hosts for prokaryotic viruses using the differences of k-mer frequencies between viral and host genomic sequences as features. PHP gave a host prediction accuracy of 34% (genus level) on the VirHostMatcher benchmark dataset and a host prediction accuracy of 35% (genus level) on a new dataset containing 671 viruses and 60,105 prokaryotic genomes. The prediction accuracy exceeded that of two alignment-free methods (VirHostMatcher and WIsH, 28-34%, genus level). PHP also outperformed these two alignment-free methods much (24-38% vs 18-20%, genus level) when predicting hosts for prokaryotic viruses which cannot be predicted by the BLAST-based or the CRISPR-spacer-based methods alone. Requiring a minimal score for making predictions (thresholding) and taking the consensus of the top 30 predictions further improved the host prediction accuracy of PHP.

CONCLUSIONS

The Prokaryotic virus Host Predictor software tool provides an intuitive and user-friendly API for the Gaussian model described herein. This work will facilitate the rapid identification of hosts for newly identified prokaryotic viruses in metagenomic studies.

摘要

背景

病毒是无处不在的生物实体,据估计是地球上尚未探索的遗传多样性最大的储存库。对新发现的病毒进行全面的功能特征描述和注释需要工具来进行分类学归属、宿主范围和病毒的生物学特性。在这里,我们专注于原核病毒,包括噬菌体和古菌病毒,对于这些病毒,确定病毒的宿主是对病毒进行特征描述的必要步骤,因为病毒的生存依赖于宿主。目前,确定病毒宿主的方法要么是培养病毒,这种方法低通量、耗时且昂贵,要么是通过计算预测病毒的宿主,这需要在准确性和可用性方面都进行改进。在这里,我们开发了一种高斯模型来预测原核病毒的宿主,其性能优于以前的计算方法。

结果

我们在这里提出了原核病毒宿主预测器(PHP),这是一种使用高斯模型的软件工具,使用病毒和宿主基因组序列之间的 k-mer 频率差异作为特征来预测原核病毒的宿主。PHP 在 VirHostMatcher 基准数据集上的宿主预测准确率为 34%(属水平),在包含 671 种病毒和 60105 种原核基因组的新数据集上的宿主预测准确率为 35%(属水平)。预测精度高于两种无比对方法(VirHostMatcher 和 WIsH,属水平 28-34%)。当预测无法仅通过 BLAST 或 CRISPR-spacer 方法预测的原核病毒的宿主时,PHP 也明显优于这两种无比对方法(24-38%对 18-20%,属水平)。通过设定一个最小得分进行预测(阈值)和对前 30 个预测结果取共识,可以进一步提高 PHP 的宿主预测准确性。

结论

原核病毒宿主预测器软件工具为本文描述的高斯模型提供了一个直观且用户友好的 API。这项工作将促进在宏基因组研究中快速鉴定新发现的原核病毒的宿主。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/1e4d4b39b27e/12915_2020_938_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/28663b46d349/12915_2020_938_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/56e11a44eb11/12915_2020_938_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/3e848e69c425/12915_2020_938_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/1d0526b682cb/12915_2020_938_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/711fb0d374ac/12915_2020_938_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/1e4d4b39b27e/12915_2020_938_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/28663b46d349/12915_2020_938_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/56e11a44eb11/12915_2020_938_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/3e848e69c425/12915_2020_938_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/1d0526b682cb/12915_2020_938_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/711fb0d374ac/12915_2020_938_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14be/7807511/1e4d4b39b27e/12915_2020_938_Fig6_HTML.jpg

相似文献

1
Prokaryotic virus host predictor: a Gaussian model for host prediction of prokaryotic viruses in metagenomics.原核病毒宿主预测器:一种用于宏基因组中原核病毒宿主预测的高斯模型。
BMC Biol. 2021 Jan 14;19(1):5. doi: 10.1186/s12915-020-00938-6.
2
Alignment-free $d_2^*$ oligonucleotide frequency dissimilarity measure improves prediction of hosts from metagenomically-derived viral sequences.无比对的$d_2^*$寡核苷酸频率差异度量法可改善从宏基因组来源的病毒序列预测宿主的效果。
Nucleic Acids Res. 2017 Jan 9;45(1):39-53. doi: 10.1093/nar/gkw1002. Epub 2016 Nov 28.
3
Computational approaches to predict bacteriophage-host relationships.预测噬菌体-宿主关系的计算方法。
FEMS Microbiol Rev. 2016 Mar;40(2):258-72. doi: 10.1093/femsre/fuv048. Epub 2015 Dec 9.
4
PHIST: fast and accurate prediction of prokaryotic hosts from metagenomic viral sequences.PHIST:从宏基因组病毒序列中快速准确地预测原核宿主。
Bioinformatics. 2022 Feb 7;38(5):1447-1449. doi: 10.1093/bioinformatics/btab837.
5
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data.VirFinder:一种新型的基于 k-mer 的工具,用于从组装的宏基因组数据中识别病毒序列。
Microbiome. 2017 Jul 6;5(1):69. doi: 10.1186/s40168-017-0283-5.
6
WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs.WIsH:谁是宿主?从宏基因组噬菌体序列预测原核宿主。
Bioinformatics. 2017 Oct 1;33(19):3113-3114. doi: 10.1093/bioinformatics/btx383.
7
Host Taxon Predictor - A Tool for Predicting Taxon of the Host of a Newly Discovered Virus.宿主分类预测器 - 一种预测新发现病毒宿主分类的工具。
Sci Rep. 2019 Mar 5;9(1):3436. doi: 10.1038/s41598-019-39847-2.
8
Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter.简化基于 CRISPR 间隔物的细菌宿主预测,以破译病毒暗物质。
Nucleic Acids Res. 2021 Apr 6;49(6):3127-3138. doi: 10.1093/nar/gkab133.
9
Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning.基于图卷积网络的半监督学习预测原核病毒宿主。
BMC Biol. 2021 Nov 24;19(1):250. doi: 10.1186/s12915-021-01180-4.
10
Prediction of virus-host infectious association by supervised learning methods.通过监督学习方法预测病毒-宿主感染关联。
BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):60. doi: 10.1186/s12859-017-1473-7.

引用本文的文献

1
Phage quest: a beginner's guide to explore viral diversity in the prokaryotic world.噬菌体探索:探索原核生物世界中病毒多样性的初学者指南。
Brief Bioinform. 2025 Aug 31;26(5). doi: 10.1093/bib/bbaf449.
2
Identification and profiling of novel metagenome assembled uncultivated virus genomes from human gut.从人类肠道中鉴定和分析新的宏基因组组装未培养病毒基因组
Virol J. 2025 Jul 25;22(1):254. doi: 10.1186/s12985-025-02739-1.
3
Arctic Ocean virus communities and their seasonality, bipolarity, and prokaryotic associations.北冰洋病毒群落及其季节性、两极分布和原核生物关联。

本文引用的文献

1
Changes to virus taxonomy and the Statutes ratified by the International Committee on Taxonomy of Viruses (2020).病毒分类学的变化和国际病毒分类委员会批准的法规(2020 年)。
Arch Virol. 2020 Nov;165(11):2737-2748. doi: 10.1007/s00705-020-04752-x.
2
NCBI Taxonomy: a comprehensive update on curation, resources and tools.NCBI 分类学:在管理、资源和工具方面的全面更新。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa062.
3
A Review on Viral Metagenomics in Extreme Environments.极端环境下病毒宏基因组学综述
Nat Commun. 2025 Jul 11;16(1):6427. doi: 10.1038/s41467-025-61568-6.
4
Salinity-driven shifts in estuarine viral community composition and diversity near the Shenzhen coast.深圳海岸附近河口病毒群落组成和多样性的盐度驱动变化
Appl Environ Microbiol. 2025 Jul 23;91(7):e0040725. doi: 10.1128/aem.00407-25. Epub 2025 Jul 2.
5
Unveiling the role of phages in shaping the periodontal microbial ecosystem.揭示噬菌体在塑造牙周微生物生态系统中的作用。
mSystems. 2025 Apr 22;10(4):e0020125. doi: 10.1128/msystems.00201-25. Epub 2025 Mar 28.
6
Red Mud Potentially Alleviates Ammonia Nitrogen Inhibition in Swine Manure Anaerobic Digestion by Enhancing Phage-Mediated Ammonia Assimilation.赤泥通过增强噬菌体介导的氨同化作用,可能减轻猪粪厌氧消化过程中的氨氮抑制作用。
Microorganisms. 2025 Mar 19;13(3):690. doi: 10.3390/microorganisms13030690.
7
PHPGAT: predicting phage hosts based on multimodal heterogeneous knowledge graph with graph attention network.PHPGAT:基于具有图注意力网络的多模态异构知识图谱预测噬菌体宿主
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf017.
8
PHIStruct: improving phage-host interaction prediction at low sequence similarity settings using structure-aware protein embeddings.PHIStruct:使用结构感知蛋白质嵌入在低序列相似性设置下改进噬菌体-宿主相互作用预测。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btaf016.
9
A novel framework for phage-host prediction via logical probability theory and network sparsification.一种基于逻辑概率理论和网络稀疏化的噬菌体-宿主预测新框架。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae708.
10
Predicting phage-host interactions via feature augmentation and regional graph convolution.通过特征增强和区域图卷积预测噬菌体-宿主相互作用。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae672.
Front Microbiol. 2019 Oct 18;10:2403. doi: 10.3389/fmicb.2019.02403. eCollection 2019.
4
Defining the human gut host-phage network through single-cell viral tagging.通过单细胞病毒标记定义人类肠道宿主-噬菌体网络。
Nat Microbiol. 2019 Dec;4(12):2192-2203. doi: 10.1038/s41564-019-0526-2. Epub 2019 Aug 5.
5
Marine DNA Viral Macro- and Microdiversity from Pole to Pole.从极地到极地的海洋 DNA 病毒宏多样性和微多样性。
Cell. 2019 May 16;177(5):1109-1123.e14. doi: 10.1016/j.cell.2019.03.040. Epub 2019 Apr 25.
6
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D23-D28. doi: 10.1093/nar/gky1069.
7
Molecular and Evolutionary Determinants of Bacteriophage Host Range.噬菌体宿主范围的分子和进化决定因素。
Trends Microbiol. 2019 Jan;27(1):51-63. doi: 10.1016/j.tim.2018.08.006. Epub 2018 Sep 1.
8
A human gut phage catalog correlates the gut phageome with type 2 diabetes.人类肠道噬菌体目录将肠道噬菌体组与 2 型糖尿病相关联。
Microbiome. 2018 Feb 1;6(1):24. doi: 10.1186/s40168-018-0410-y.
9
WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs.WIsH:谁是宿主?从宏基因组噬菌体序列预测原核宿主。
Bioinformatics. 2017 Oct 1;33(19):3113-3114. doi: 10.1093/bioinformatics/btx383.
10
A Review of Phage Therapy against Bacterial Pathogens of Aquatic and Terrestrial Organisms.针对水生和陆生生物细菌病原体的噬菌体疗法综述
Viruses. 2017 Mar 18;9(3):50. doi: 10.3390/v9030050.