• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepViral:基于蛋白质序列和传染病表型预测新型病毒与宿主的相互作用

DeepViral: prediction of novel virus-host interactions from protein sequences and infectious disease phenotypes.

作者信息

Liu-Wei Wang, Kafkas Şenay, Chen Jun, Dimonaco Nicholas J, Tegnér Jesper, Hoehndorf Robert

机构信息

Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia.

Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia.

出版信息

Bioinformatics. 2021 Sep 9;37(17):2722-2729. doi: 10.1093/bioinformatics/btab147.

DOI:10.1093/bioinformatics/btab147
PMID:33682875
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8428617/
Abstract

MOTIVATION

Infectious diseases caused by novel viruses have become a major public health concern. Rapid identification of virus-host interactions can reveal mechanistic insights into infectious diseases and shed light on potential treatments. Current computational prediction methods for novel viruses are based mainly on protein sequences. However, it is not clear to what extent other important features, such as the symptoms caused by the viruses, could contribute to a predictor. Disease phenotypes (i.e. signs and symptoms) are readily accessible from clinical diagnosis and we hypothesize that they may act as a potential proxy and an additional source of information for the underlying molecular interactions between the pathogens and hosts.

RESULTS

We developed DeepViral, a deep learning based method that predicts protein-protein interactions (PPI) between humans and viruses. Motivated by the potential utility of infectious disease phenotypes, we first embedded human proteins and viruses in a shared space using their associated phenotypes and functions, supported by formalized background knowledge from biomedical ontologies. By jointly learning from protein sequences and phenotype features, DeepViral significantly improves over existing sequence-based methods for intra- and inter-species PPI prediction.

AVAILABILITY AND IMPLEMENTATION

Code and datasets for reproduction and customization are available at https://github.com/bio-ontology-research-group/DeepViral. Prediction results for 14 virus families are available at https://doi.org/10.5281/zenodo.4429824.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

新型病毒引起的传染病已成为主要的公共卫生问题。快速识别病毒与宿主的相互作用可以揭示传染病的机制,并为潜在治疗方法提供线索。目前针对新型病毒的计算预测方法主要基于蛋白质序列。然而,尚不清楚其他重要特征(如病毒引起的症状)在多大程度上有助于预测。疾病表型(即体征和症状)可从临床诊断中轻易获得,我们假设它们可能作为病原体与宿主之间潜在分子相互作用的潜在替代指标和额外信息来源。

结果

我们开发了DeepViral,这是一种基于深度学习的方法,用于预测人类与病毒之间的蛋白质-蛋白质相互作用(PPI)。受传染病表型潜在效用的启发,我们首先利用生物医学本体的形式化背景知识,根据人类蛋白质和病毒的相关表型及功能,将它们嵌入到一个共享空间中。通过联合学习蛋白质序列和表型特征,DeepViral在种内和种间PPI预测方面比现有的基于序列的方法有显著改进。

可用性和实现

可在https://github.com/bio-ontology-research-group/DeepViral获取用于重现和定制的代码及数据集。14个病毒家族的预测结果可在https://doi.org/10.5281/zenodo.4429824获取。

补充信息

补充数据可在《生物信息学》在线版获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/205743c85796/btab147f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/93717d31ce6e/btab147f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/aa3f42830967/btab147f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/205743c85796/btab147f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/93717d31ce6e/btab147f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/aa3f42830967/btab147f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c25/8428617/205743c85796/btab147f3.jpg

相似文献

1
DeepViral: prediction of novel virus-host interactions from protein sequences and infectious disease phenotypes.DeepViral:基于蛋白质序列和传染病表型预测新型病毒与宿主的相互作用
Bioinformatics. 2021 Sep 9;37(17):2722-2729. doi: 10.1093/bioinformatics/btab147.
2
mOWL: Python library for machine learning with biomedical ontologies.mOWL:用于生物医学本体机器学习的 Python 库。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac811.
3
Predicting candidate genes from phenotypes, functions and anatomical site of expression.从表型、功能和表达的解剖部位预测候选基因。
Bioinformatics. 2021 May 5;37(6):853-860. doi: 10.1093/bioinformatics/btaa879.
4
Formal axioms in biomedical ontologies improve analysis and interpretation of associated data.生物医学本体论中的形式公理可改善相关数据的分析和解释。
Bioinformatics. 2020 Apr 1;36(7):2229-2236. doi: 10.1093/bioinformatics/btz920.
5
DeNovo: virus-host sequence-based protein-protein interaction prediction.DeNovo:基于病毒-宿主序列的蛋白质-蛋白质相互作用预测。
Bioinformatics. 2016 Apr 15;32(8):1144-50. doi: 10.1093/bioinformatics/btv737. Epub 2015 Dec 16.
6
VIDHOP, viral host prediction with deep learning.VIDHOP,基于深度学习的病毒宿主预测。
Bioinformatics. 2021 Apr 20;37(3):318-325. doi: 10.1093/bioinformatics/btaa705.
7
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction.OPA2Vec:结合生物医学本体的正式和非正式内容以改进基于相似度的预测。
Bioinformatics. 2019 Jun 1;35(12):2133-2140. doi: 10.1093/bioinformatics/bty933.
8
Virtifier: a deep learning-based identifier for viral sequences from metagenomes.Virtifier:一种基于深度学习的宏基因组病毒序列标识符。
Bioinformatics. 2022 Feb 7;38(5):1216-1222. doi: 10.1093/bioinformatics/btab845.
9
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations.Onto2Vec:基于向量的生物实体联合表示及其基于本体论的标注。
Bioinformatics. 2018 Jul 1;34(13):i52-i60. doi: 10.1093/bioinformatics/bty259.
10
Alignment free sequence comparison methods and reservoir host prediction.无比对序列比较方法与宿主预测
Bioinformatics. 2021 Oct 11;37(19):3337-3342. doi: 10.1093/bioinformatics/btab338.

引用本文的文献

1
Graph neural network integrated with pretrained protein language model for predicting human-virus protein-protein interactions.结合预训练蛋白质语言模型的图神经网络用于预测人-病毒蛋白质-蛋白质相互作用
Brief Bioinform. 2025 Aug 31;26(5). doi: 10.1093/bib/bbaf461.
2
An Update on RNA Virus Discovery: Current Challenges and Future Perspectives.RNA病毒发现的最新进展:当前挑战与未来展望
Viruses. 2025 Jul 15;17(7):983. doi: 10.3390/v17070983.
3
VHI-Pred: A Multi-Feature-Based Tool for Predicting Human-Virus Protein-Protein Interactions.VHI-Pred:一种基于多特征的人类病毒蛋白质-蛋白质相互作用预测工具。
Mol Biotechnol. 2025 Apr 5. doi: 10.1007/s12033-025-01417-5.
4
Nanobody screening and machine learning guided identification of cross-variant anti-SARS-CoV-2 neutralizing heavy-chain only antibodies.纳米抗体筛选及机器学习辅助鉴定针对新冠病毒变异株的仅重链抗SARS-CoV-2中和抗体
PLoS Pathog. 2025 Jan 23;21(1):e1012903. doi: 10.1371/journal.ppat.1012903. eCollection 2025 Jan.
5
HBFormer: a single-stream framework based on hybrid attention mechanism for identification of human-virus protein-protein interactions.HBFormer:一种基于混合注意力机制的单流框架,用于识别人类-病毒蛋白质-蛋白质相互作用。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae724.
6
Bioinformatic Resources for Exploring Human-virus Protein-protein Interactions Based on Binding Modes.基于结合模式探索人类-病毒蛋白质-蛋白质相互作用的生物信息学资源
Genomics Proteomics Bioinformatics. 2024 Dec 3;22(5). doi: 10.1093/gpbjnl/qzae075.
7
DNASimCLR: a contrastive learning-based deep learning approach for gene sequence data classification.DNASimCLR:一种基于对比学习的深度学习方法,用于基因序列数据分类。
BMC Bioinformatics. 2024 Oct 14;25(1):328. doi: 10.1186/s12859-024-05955-8.
8
Integrating multi-omics to unravel host-microbiome interactions in inflammatory bowel disease.整合多组学以揭示炎症性肠病中的宿主-微生物组相互作用。
Cell Rep Med. 2024 Sep 17;5(9):101738. doi: 10.1016/j.xcrm.2024.101738.
9
A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond.蛋白质中心预测因子在生物分子相互作用研究中的综合综述:从蛋白质到核酸及其他。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae162.
10
A microbial knowledge graph-based deep learning model for predicting candidate microbes for target hosts.基于微生物知识图谱的深度学习模型,用于预测目标宿主的候选微生物。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae119.