• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图卷积网络的新型病毒-人类蛋白质-蛋白质相互作用预测。

Graph convolutional network based virus-human protein-protein interaction prediction for novel viruses.

机构信息

Department of Computer Engineering, Faculty of Engineering, Gebze Technical University, Kocaeli, Turkey.

Department of Information Technology, Faculty of Computer Engineering and Information Technology, Azarbaijan Shahid Madani University, Tabriz, Iran.

出版信息

Comput Biol Chem. 2022 Dec;101:107755. doi: 10.1016/j.compbiolchem.2022.107755. Epub 2022 Aug 13.

DOI:10.1016/j.compbiolchem.2022.107755
PMID:36037723
Abstract

Computational identification of human-virus protein-protein interactions (PHIs) is a worthwhile step towards understanding infection mechanisms. Analysis of the PHI networks is important for the determination of pathogenic diseases. Prediction of these interactions is a popular problem since experimental detection of PHIs is both time-consuming and expensive. The available methods use biological features like amino acid sequences, molecular structure, or biological activities for prediction. Recent studies show that the topological properties of proteins in protein-protein interaction (PPI) networks increase the performance of the predictions. The basic network projections, random-walk-based models, or graph neural networks are used for generating topologically enriched (hybrid) protein embeddings. In this study, we propose a three-stage machine learning pipeline that generates and uses hybrid embeddings for PHI prediction. In the first stage, numerical features are extracted from the amino acid sequences using the Doc2Vec and Byte Pair Encoding method. The amino acid embeddings are used as node features while training a modified GraphSAGE model, which is an improved version of the graph convolutional network. Lastly, the hybrid protein embeddings are used for training a binary interaction classifier model that predicts whether there is an interaction between the given two proteins or not. The proposed method is evaluated with comprehensive experiments to test its functionality and compare it with the state-of-art methods. The experimental results on the benchmark dataset prove the efficiency of the proposed model by having a 3-23% better area under curve (AUC) score than its competitors.

摘要

计算识别人类-病毒蛋白质-蛋白质相互作用(PHI)是理解感染机制的重要步骤。分析 PHI 网络对于确定致病性疾病很重要。由于实验检测 PHI 既耗时又昂贵,因此预测这些相互作用是一个热门问题。这些方法利用生物特征,如氨基酸序列、分子结构或生物活性进行预测。最近的研究表明,蛋白质-蛋白质相互作用(PPI)网络中蛋白质的拓扑性质提高了预测的性能。基本的网络投影、基于随机游走的模型或图神经网络用于生成拓扑丰富的(混合)蛋白质嵌入。在这项研究中,我们提出了一个三阶段机器学习管道,用于生成和使用混合嵌入进行 PHI 预测。在第一阶段,使用 Doc2Vec 和字节对编码方法从氨基酸序列中提取数值特征。氨基酸嵌入被用作节点特征,同时训练一个修改后的 GraphSAGE 模型,这是图卷积网络的改进版本。最后,使用混合蛋白质嵌入来训练二进制交互分类器模型,该模型预测给定的两个蛋白质之间是否存在相互作用。通过与最先进的方法进行比较,我们使用全面的实验来评估该方法的功能,以验证其有效性。在基准数据集上的实验结果表明,该模型的 AUC 得分比竞争对手高出 3-23%,证明了其效率。

相似文献

1
Graph convolutional network based virus-human protein-protein interaction prediction for novel viruses.基于图卷积网络的新型病毒-人类蛋白质-蛋白质相互作用预测。
Comput Biol Chem. 2022 Dec;101:107755. doi: 10.1016/j.compbiolchem.2022.107755. Epub 2022 Aug 13.
2
A novel candidate disease gene prioritization method using deep graph convolutional networks and semi-supervised learning.一种使用深度图卷积网络和半监督学习的新型候选疾病基因优先级排序方法。
BMC Bioinformatics. 2022 Oct 14;23(1):422. doi: 10.1186/s12859-022-04954-x.
3
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.基于属性有向图嵌入的蛋白质-蛋白质相互作用的图预测。
BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8.
4
Long-distance dependency combined multi-hop graph neural networks for protein-protein interactions prediction.长程依赖的多跳图神经网络用于蛋白质相互作用预测。
BMC Bioinformatics. 2022 Dec 5;23(1):521. doi: 10.1186/s12859-022-05062-6.
5
xCAPT5: protein-protein interaction prediction using deep and wide multi-kernel pooling convolutional neural networks with protein language model.xCAPT5:使用深度和广泛的多核池卷积神经网络与蛋白质语言模型进行蛋白质-蛋白质相互作用预测。
BMC Bioinformatics. 2024 Mar 10;25(1):106. doi: 10.1186/s12859-024-05725-6.
6
HN-PPISP: a hybrid network based on MLP-Mixer for protein-protein interaction site prediction.HN-PPISP:一种基于MLP-Mixer的用于蛋白质-蛋白质相互作用位点预测的混合网络。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac480.
7
CGINet: graph convolutional network-based model for identifying chemical-gene interaction in an integrated multi-relational graph.CGINet:基于图卷积网络的模型,用于在集成的多关系图中识别化学-基因相互作用。
BMC Bioinformatics. 2020 Nov 26;21(1):544. doi: 10.1186/s12859-020-03899-3.
8
All-Cause Death Prediction Method for CHD Based on Graph Convolutional Networks.基于图卷积网络的冠心病全因死亡预测方法。
Comput Intell Neurosci. 2022 Jul 18;2022:2389560. doi: 10.1155/2022/2389560. eCollection 2022.
9
Graph embedding-based novel protein interaction prediction via higher-order graph convolutional network.基于图嵌入的高阶图卷积网络新型蛋白质相互作用预测。
PLoS One. 2020 Sep 24;15(9):e0238915. doi: 10.1371/journal.pone.0238915. eCollection 2020.
10
GNNGL-PPI: multi-category prediction of protein-protein interactions using graph neural networks based on global graphs and local subgraphs.GNNGL-PPI:基于全局图和局部子图的图神经网络的蛋白质-蛋白质相互作用多类别预测。
BMC Genomics. 2024 May 9;25(1):406. doi: 10.1186/s12864-024-10299-x.

引用本文的文献

1
ISLRWR: A network diffusion algorithm for drug-target interactions prediction.ISLRWR:一种用于药物-靶点相互作用预测的网络扩散算法。
PLoS One. 2025 Jan 30;20(1):e0302281. doi: 10.1371/journal.pone.0302281. eCollection 2025.
2
Computational analysis of pathogen-host interactome for fast and low-risk in-silico drug repurposing in emerging viral threats like Mpox.针对如猴痘等新兴病毒威胁,进行病原体-宿主相互作用组的计算分析,以实现快速且低风险的计算机药物再利用。
Sci Rep. 2024 Aug 12;14(1):18736. doi: 10.1038/s41598-024-69617-8.
3
Pre-trained protein language model sheds new light on the prediction of Arabidopsis protein-protein interactions.
预训练蛋白质语言模型为拟南芥蛋白质-蛋白质相互作用的预测带来新曙光。
Plant Methods. 2023 Dec 7;19(1):141. doi: 10.1186/s13007-023-01119-6.
4
Bioinformatics approaches for unveiling virus-host interactions.用于揭示病毒-宿主相互作用的生物信息学方法。
Comput Struct Biotechnol J. 2023;21:1774-1784. doi: 10.1016/j.csbj.2023.02.044. Epub 2023 Feb 27.