• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VIRify:一种使用病毒特异性蛋白特征隐马尔可夫模型进行综合检测、注释和分类学分类的管道。

VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.

机构信息

The Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.

Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogota, Colombia.

出版信息

PLoS Comput Biol. 2023 Aug 28;19(8):e1011422. doi: 10.1371/journal.pcbi.1011422. eCollection 2023 Aug.

DOI:10.1371/journal.pcbi.1011422
PMID:37639475
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10491390/
Abstract

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them.

摘要

病毒群落的研究揭示了这些生物实体在各种生态系统中所具有的巨大多样性和影响力。这些观察结果引发了人们广泛的兴趣,促使开发出基于测序数据支持全面描述病毒群落的计算策略。在这里,我们介绍了 VIRify,这是一种新的计算管道,旨在为病毒群落提供用户友好且准确的功能和分类特征描述。VIRify 从宏基因组组装中识别病毒 contigs 和原噬菌体,并使用一系列病毒特征隐马尔可夫模型 (HMM) 对其进行注释。这些包括我们手动 curated 的特征 HMM,它们是广泛的原核和真核病毒分类群的特定分类标记,因此可用于可靠地对病毒 contigs 进行分类。我们在两个微生物模拟群落的组装体、一个大型宏基因组研究以及人类肠道中公开可用的病毒基因组序列集合上测试了 VIRify。结果表明,VIRify 可以识别出原核和真核病毒的序列,并提供从属到科的分类学分类,平均准确率为 86.6%。此外,VIRify 还可以检测和分类 243 个海洋宏基因组组装体中存在的一系列原核和真核病毒。最后,使用 VIRify 导致分类学上分类的人类肠道病毒序列数量大幅增加,并改进了过时和浅层的分类学分类。总体而言,我们证明 VIRify 是一种新颖而强大的资源,提供了增强的检测广泛的病毒 contigs 并对其进行分类学分类的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/5d2a9fbdecf1/pcbi.1011422.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/237a25fc3f16/pcbi.1011422.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/449b8681e949/pcbi.1011422.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/0ec8f299ae35/pcbi.1011422.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/255f871c9442/pcbi.1011422.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/dfc63be4a471/pcbi.1011422.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/5a3a2e502746/pcbi.1011422.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/5d2a9fbdecf1/pcbi.1011422.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/237a25fc3f16/pcbi.1011422.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/449b8681e949/pcbi.1011422.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/0ec8f299ae35/pcbi.1011422.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/255f871c9442/pcbi.1011422.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/dfc63be4a471/pcbi.1011422.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/5a3a2e502746/pcbi.1011422.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060d/10491390/5d2a9fbdecf1/pcbi.1011422.g007.jpg

相似文献

1
VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.VIRify:一种使用病毒特异性蛋白特征隐马尔可夫模型进行综合检测、注释和分类学分类的管道。
PLoS Comput Biol. 2023 Aug 28;19(8):e1011422. doi: 10.1371/journal.pcbi.1011422. eCollection 2023 Aug.
2
The genomic underpinnings of eukaryotic virus taxonomy: creating a sequence-based framework for family-level virus classification.真核病毒分类学的基因组基础:创建基于序列的病毒科分类框架。
Microbiome. 2018 Feb 20;6(1):38. doi: 10.1186/s40168-018-0422-7.
3
Exploring the Complexity of the Human Respiratory Virome through an In Silico Analysis of Shotgun Metagenomic Data Retrieved from Public Repositories.通过对公共存储库中获取的鸟枪法宏基因组数据进行计算分析,探索人类呼吸道病毒组的复杂性。
Viruses. 2024 Jun 13;16(6):953. doi: 10.3390/v16060953.
4
Simulation study and comparative evaluation of viral contiguous sequence identification tools.病毒连续序列识别工具的模拟研究与比较评估
BMC Bioinformatics. 2021 Jun 16;22(1):329. doi: 10.1186/s12859-021-04242-0.
5
Microbial Diversity and Phage-Host Interactions in the Georgian Coastal Area of the Black Sea Revealed by Whole Genome Metagenomic Sequencing.通过全基因组宏基因组测序揭示黑海格鲁吉亚沿海地区的微生物多样性和噬菌体-宿主相互作用。
Mar Drugs. 2020 Nov 14;18(11):558. doi: 10.3390/md18110558.
6
MVP: a modular viromics pipeline to identify, filter, cluster, annotate, and bin viruses from metagenomes.MVP:一个模块化的病毒组学分析流程,用于从宏基因组中识别、过滤、聚类、注释和分类病毒。
mSystems. 2024 Oct 22;9(10):e0088824. doi: 10.1128/msystems.00888-24. Epub 2024 Oct 1.
7
Contigs directed gene annotation (ConDiGA) for accurate protein sequence database construction in metaproteomics.宏基因组学中用于准确蛋白质序列数据库构建的 Contigs 定向基因注释(ConDiGA)。
Microbiome. 2024 Mar 19;12(1):58. doi: 10.1186/s40168-024-01775-3.
8
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.比较不同的组装和注释工具在分析肠道中模拟病毒宏基因组群落中的应用。
BMC Genomics. 2014 Jan 18;15:37. doi: 10.1186/1471-2164-15-37.
9
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
10
Eukfinder: a pipeline to retrieve microbial eukaryote genome sequences from metagenomic data.Eukfinder:一种从宏基因组数据中检索微生物真核生物基因组序列的流程。
mBio. 2025 May 14;16(5):e0069925. doi: 10.1128/mbio.00699-25. Epub 2025 Apr 10.

引用本文的文献

1
First detection of a lizard-associated papillomavirus in the splendid japalure () from southwestern China.首次在中国西南部丽纹攀蜥()中检测到一种与蜥蜴相关的乳头瘤病毒。
Front Microbiol. 2025 Jul 28;16:1590538. doi: 10.3389/fmicb.2025.1590538. eCollection 2025.
2
A bacterial and viral genome catalogue from Atlantic salmon highlights diverse gut microbiome compositions at pre- and post-smolt life stages.一份来自大西洋鲑鱼的细菌和病毒基因组目录突出了其在稚鱼期前后肠道微生物群的不同组成。
Anim Microbiome. 2025 Aug 11;7(1):85. doi: 10.1186/s42523-025-00453-5.
3
VITALdb: to select the best viroinformatics tools for a desired virus or application.

本文引用的文献

1
Phage-inclusive profiling of human gut microbiomes with Phanta.使用 Phanta 进行人类肠道微生物组的噬菌体包容谱分析。
Nat Biotechnol. 2024 Apr;42(4):651-662. doi: 10.1038/s41587-023-01799-4. Epub 2023 May 25.
2
Gauge your phage: benchmarking of bacteriophage identification tools in metagenomic sequencing data.评估噬菌体:宏基因组测序数据中噬菌体鉴定工具的基准测试。
Microbiome. 2023 Apr 21;11(1):84. doi: 10.1186/s40168-023-01533-x.
3
Large-scale phage cultivation for commensal human gut bacteria.大规模培养共生人类肠道细菌的噬菌体。
VITALdb:为所需病毒或应用选择最佳的病毒信息学工具。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf084.
4
mettannotator: a comprehensive and scalable Nextflow annotation pipeline for prokaryotic assemblies.Mettannotator:一种用于原核生物组装的全面且可扩展的Nextflow注释管道。
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf037.
5
HoloFood Data Portal: holo-omic datasets for analysing host-microbiota interactions in animal production.全食物数据门户:用于分析动物生产中宿主-微生物群相互作用的全组学数据集。
Database (Oxford). 2025 Jan 11;2025. doi: 10.1093/database/baae112.
6
Improving the reporting of metagenomic virome-scale data.改善宏基因组病毒组规模数据的报告。
Commun Biol. 2024 Dec 20;7(1):1687. doi: 10.1038/s42003-024-07212-3.
7
The CABANA model 2017-2022: research and training synergy to facilitate bioinformatics applications in Latin America.2017 - 2022年CABANA模型:促进生物信息学在拉丁美洲应用的研究与培训协同作用。
Front Educ (Lausanne). 2024 Jul 4;9. doi: 10.3389/feduc.2024.1358620.
8
ViraLM: empowering virus discovery through the genome foundation model.ViraLM:通过基因组基础模型助力病毒发现
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae704.
9
Impact of Alignments on the Accuracy of Protein Subcellular Localization Predictions.序列比对对蛋白质亚细胞定位预测准确性的影响。
Proteins. 2025 Mar;93(3):745-759. doi: 10.1002/prot.26767. Epub 2024 Nov 22.
10
Case report: Local bacteriophage therapy for fracture-related infection with polymicrobial multi-resistant bacteria: hydrogel application and postoperative phage analysis through metagenomic sequencing.病例报告:局部噬菌体疗法治疗骨折相关的多微生物多重耐药菌感染:水凝胶应用及术后通过宏基因组测序进行噬菌体分析
Front Med (Lausanne). 2024 Jul 12;11:1428432. doi: 10.3389/fmed.2024.1428432. eCollection 2024.
Cell Host Microbe. 2023 Apr 12;31(4):665-677.e7. doi: 10.1016/j.chom.2023.03.013.
4
iVirus 2.0: Cyberinfrastructure-supported tools and data to power DNA virus ecology.iVirus 2.0:支持网络基础设施的工具和数据助力DNA病毒生态学研究
ISME Commun. 2021 Dec 14;1(1):77. doi: 10.1038/s43705-021-00083-3.
5
Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the ICTV bacterial viruses subcommittee.基于形态学的分类群废除和二名法物种名称变更:ICTV 细菌病毒小组委员会 2022 年分类学更新。
Arch Virol. 2023 Jan 23;168(2):74. doi: 10.1007/s00705-022-05694-2.
6
What the Phage: a scalable workflow for the identification and analysis of phage sequences.噬菌体是什么:一种用于鉴定和分析噬菌体序列的可扩展工作流程。
Gigascience. 2022 Nov 18;11. doi: 10.1093/gigascience/giac110.
7
MetaPhage: an Automated Pipeline for Analyzing, Annotating, and Classifying Bacteriophages in Metagenomics Sequencing Data.MetaPhage:一个用于分析、注释和分类宏基因组测序数据中噬菌体的自动化管道。
mSystems. 2022 Oct 26;7(5):e0074122. doi: 10.1128/msystems.00741-22. Epub 2022 Sep 7.
8
Metagenomic Analyses of Multiple Gut Datasets Revealed the Association of Phage Signatures in Colorectal Cancer.基于多个肠道数据集的宏基因组分析揭示了噬菌体特征与结直肠癌的关联。
Front Cell Infect Microbiol. 2022 Jun 15;12:918010. doi: 10.3389/fcimb.2022.918010. eCollection 2022.
9
ChromoMap: an R package for interactive visualization of multi-omics data and annotation of chromosomes.ChromoMap:一个用于多组学数据交互式可视化和染色体注释的 R 包。
BMC Bioinformatics. 2022 Jan 11;23(1):33. doi: 10.1186/s12859-021-04556-z.
10
Illuminating the Virosphere Through Global Metagenomics.通过全球宏基因组学揭示病毒圈。
Annu Rev Biomed Data Sci. 2021 Jul 20;4:369-391. doi: 10.1146/annurev-biodatasci-012221-095114.