基于多模态深度表示学习的蛋白质相互作用识别和蛋白质家族分类。

Multimodal deep representation learning for protein interaction identification and protein family classification.

机构信息

Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, U.S..

Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, U.S.

出版信息

BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):531. doi: 10.1186/s12859-019-3084-y.

DOI:10.1186/s12859-019-3084-y

PMID:31787089

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6886253/

Abstract

BACKGROUND

Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge.

RESULTS

In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods.

CONCLUSION

To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks.

摘要

背景

蛋白质-蛋白质相互作用（PPIs）在我们的生活中不断参与动态的病理和生物学过程。因此，深入了解 PPIs 至关重要，这样我们才能阐明疾病的发生，实现最佳的药物靶点治疗效果，并描述蛋白质复合物的结构。然而，与从各种物种和生物体获得的蛋白质序列相比，揭示的蛋白质-蛋白质相互作用的数量相对有限。为了解决这个难题，许多研究都致力于促进新的 PPIs 的发现。在这些方法中，仅依赖蛋白质序列数据的 PPI 预测技术比其他需要广泛生物学领域知识的方法更为广泛。

结果

在本文中，我们提出了一种多模态深度表示学习结构，将蛋白质理化特性与 PPI 网络的图拓扑特征相结合。具体来说，我们的方法不仅考虑了蛋白质序列信息，还辨别了 PPI 网络中每个蛋白质节点的拓扑表示。在本文中，我们构建了一个堆叠自动编码器架构和一个基于生成元路径的连续袋字（CBOW）模型，以研究 PPI 预测。之后，我们利用有监督的深度神经网络来识别 PPIs 和分类蛋白质家族。八种物种的 PPI 预测准确率从 96.76%到 99.77%不等，这表明我们的多模态深度表示学习框架比其他计算方法具有更优异的性能。

结论

据我们所知，这是第一个用于研究 PPI 网络的多模态深度表示学习框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e76/6886253/662285f2b63b/12859_2019_3084_Fig1_HTML.jpg

相似文献

Multimodal deep representation learning for protein interaction identification and protein family classification.

BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):531. doi: 10.1186/s12859-019-3084-y.

Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.

BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8.

Completing sparse and disconnected protein-protein network by deep learning.

BMC Bioinformatics. 2018 Mar 22;19(1):103. doi: 10.1186/s12859-018-2112-7.

DSSGNN-PPI: A Protein-Protein Interactions prediction model based on Double Structure and Sequence graph neural networks.

Comput Biol Med. 2024 Jul;177:108669. doi: 10.1016/j.compbiomed.2024.108669. Epub 2024 May 29.

DL-PPI: a method on prediction of sequenced protein-protein interaction based on deep learning.

BMC Bioinformatics. 2023 Dec 14;24(1):473. doi: 10.1186/s12859-023-05594-5.

SDNN-PPI: self-attention with deep neural network effect on protein-protein interaction prediction.

BMC Genomics. 2022 Jun 27;23(1):474. doi: 10.1186/s12864-022-08687-2.

Protein-Protein Interactions Prediction via Multimodal Deep Polynomial Network and Regularized Extreme Learning Machine.

IEEE J Biomed Health Inform. 2019 May;23(3):1290-1303. doi: 10.1109/JBHI.2018.2845866. Epub 2018 Jun 12.

Graph embedding-based novel protein interaction prediction via higher-order graph convolutional network.

PLoS One. 2020 Sep 24;15(9):e0238915. doi: 10.1371/journal.pone.0238915. eCollection 2020.

DeepEP: a deep learning framework for identifying essential proteins.

BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):506. doi: 10.1186/s12859-019-3076-y.

Identification of Protein-Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information.

Int J Mol Sci. 2016 Sep 24;17(10):1623. doi: 10.3390/ijms17101623.

引用本文的文献

Recent advances in deep learning for protein-protein interaction: a review.

BioData Min. 2025 Jun 16;18(1):43. doi: 10.1186/s13040-025-00457-6.

Negative sampling strategies impact the prediction of scale-free biomolecular network interactions with machine learning.

BMC Biol. 2025 May 9;23(1):123. doi: 10.1186/s12915-025-02231-w.

Sensitivity analysis on protein-protein interaction networks through deep graph networks.

BMC Bioinformatics. 2025 May 8;26(1):124. doi: 10.1186/s12859-025-06140-1.

Product Manifold Representations for Learning on Biological Pathways.

ArXiv. 2025 Feb 4:arXiv:2401.15478v2.

Protein engineering in the deep learning era.

mLife. 2024 Dec 26;3(4):477-491. doi: 10.1002/mlf2.12157. eCollection 2024 Dec.

GNNMF: a multi-view graph neural network for ATAC-seq motif finding.

BMC Genomics. 2024 Mar 21;25(1):300. doi: 10.1186/s12864-024-10218-0.

Deep learning-empowered crop breeding: intelligent, efficient and promising.

Front Plant Sci. 2023 Oct 3;14:1260089. doi: 10.3389/fpls.2023.1260089. eCollection 2023.

Synthetic whole-slide image tile generation with gene expression profile-infused deep generative models.

Cell Rep Methods. 2023 Jul 19;3(8):100534. doi: 10.1016/j.crmeth.2023.100534. eCollection 2023 Aug 28.

Struct2Graph: a graph attention network for structure based predictions of protein-protein interactions.

BMC Bioinformatics. 2022 Sep 10;23(1):370. doi: 10.1186/s12859-022-04910-9.

Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Front Bioeng Biotechnol. 2022 Jul 7;10:788300. doi: 10.3389/fbioe.2022.788300. eCollection 2022.

本文引用的文献

Predicting human protein function with multi-task deep neural networks.

PLoS One. 2018 Jun 11;13(6):e0198216. doi: 10.1371/journal.pone.0198216. eCollection 2018.

iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.

Bioinformatics. 2018 Jul 15;34(14):2499-2502. doi: 10.1093/bioinformatics/bty140.

Near perfect protein multi-label classification with deep neural networks.

Methods. 2018 Jan 1;132:50-56. doi: 10.1016/j.ymeth.2017.06.034. Epub 2017 Jul 3.

Sequence-based prediction of protein protein interaction using a deep-learning algorithm.

BMC Bioinformatics. 2017 May 25;18(1):277. doi: 10.1186/s12859-017-1700-2.

DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks.

J Chem Inf Model. 2017 Jun 26;57(6):1499-1510. doi: 10.1021/acs.jcim.7b00028. Epub 2017 May 26.

Predicting Protein Functions by Using Unbalanced Random Walk Algorithm on Three Biological Networks.

IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):360-369. doi: 10.1109/TCBB.2015.2394314.

HIPPI: highly accurate protein family classification with ensembles of HMMs.

BMC Genomics. 2016 Nov 11;17(Suppl 10):765. doi: 10.1186/s12864-016-3097-0.

Detection of Interactions between Proteins through Rotation Forest and Local Phase Quantization Descriptors.

Int J Mol Sci. 2015 Dec 24;17(1):21. doi: 10.3390/ijms17010021.

Using Weighted Sparse Representation Model Combined with Discrete Cosine Transformation to Predict Protein-Protein Interactions from Protein Sequence.

Biomed Res Int. 2015;2015:902198. doi: 10.1155/2015/902198. Epub 2015 Oct 28.

Deep learning.

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于多模态深度表示学习的蛋白质相互作用识别和蛋白质家族分类。

Multimodal deep representation learning for protein interaction identification and protein family classification.

机构信息

Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, U.S..

Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, U.S.