Protein Interaction Network Reconstruction Through Ensemble Deep Learning With Attention Mechanism.

Authors

Li Feifei, Zhu Fei, Ling Xinghong, Liu Quan

Affiliations

School of Computer Science and Technology, Soochow University, Suzhou, China.

Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, Suzhou, China.

Publication information

Front Bioeng Biotechnol. 2020 May 5;8:390. doi: 10.3389/fbioe.2020.00390. eCollection 2020.

DOI:10.3389/fbioe.2020.00390

PMID:32432096

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7215070/

Abstract

Protein interactions play an essential role in studying living systems and life phenomena. A considerable amount of literature has been published on analyzing and predicting protein interactions, including support vector machine, homology-based, and similarity-based methods, each with its pros and cons. Most existing methods for predicting protein interactions require prior domain knowledge, which makes it difficult to extract protein features effectively. A single method rarely predicts protein interactions satisfactorily, which motivates a comprehensive method that combines the advantages of several methods. On this basis, a deep ensemble learning method called EnAmDNN (Ensemble Deep Neural Networks with Attention Mechanism) is proposed to predict protein interactions; it is well suited to comprehensive learning because it combines multiple models and draws on the advantages of each. Specifically, it encodes protein sequences with the local descriptor, auto covariance, conjoint triad, and pseudo amino acid composition, and combines the vector representation of each protein in the protein interaction network. It then uses multi-layer convolutional neural networks to extract protein features automatically and constructs an attention mechanism to analyze deep-seated relationships between proteins. We set up four different structures of deep learning models. In the ensemble learning model, second-layer data sets are generated from the base learners with five-fold cross-validation, and the protein interaction network is then predicted by combining 16 models. Results on five independent PPI data sets demonstrate that EnAmDNN achieves better prediction performance than the other methods compared.
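
Of the four sequence encodings named in the abstract, the conjoint triad is the simplest to illustrate. The sketch below is not taken from the paper: it assumes the seven-class amino acid grouping commonly used with this descriptor and a (count - min) / max normalization, and the function names `conjoint_triad` and `encode_pair` are hypothetical. Each protein becomes a 343-dimensional vector (7 x 7 x 7 class triads), and a candidate pair is represented here by simple concatenation.

```python
from itertools import product

# Seven-class amino acid grouping commonly used with the conjoint triad
# descriptor (an assumption; the abstract does not state the grouping).
GROUPS = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
AA_TO_CLASS = {aa: idx for idx, group in enumerate(GROUPS) for aa in group}

# Fixed ordering of the 7 * 7 * 7 = 343 possible class triads.
TRIAD_INDEX = {triad: i for i, triad in enumerate(product(range(7), repeat=3))}


def conjoint_triad(sequence):
    """Return a 343-dimensional normalized triad frequency vector."""
    counts = [0] * len(TRIAD_INDEX)
    classes = [AA_TO_CLASS.get(aa) for aa in sequence.upper()]
    # Slide a window of three consecutive residues along the sequence.
    for window in zip(classes, classes[1:], classes[2:]):
        if None in window:  # skip windows containing non-standard residues
            continue
        counts[TRIAD_INDEX[window]] += 1
    lo, hi = min(counts), max(counts)
    if hi == 0:  # sequence too short or no valid triads
        return [0.0] * len(counts)
    return [(c - lo) / hi for c in counts]


def encode_pair(seq_a, seq_b):
    """Represent a candidate protein pair by concatenating both vectors."""
    return conjoint_triad(seq_a) + conjoint_triad(seq_b)
```

Concatenation is only one way to represent a pair; the abstract does not say how EnAmDNN joins the two protein representations before the convolutional layers.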

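
The ensemble step in the abstract (second-layer data sets generated from the base learners with five-fold cross-validation) matches the general pattern of stacking. The following is a minimal sketch of that pattern only, under loud assumptions: the toy `CentroidScorer` stands in for the deep networks, the meta-learner is a plain logistic regression, the data are synthetic, and none of the names come from the paper.

```python
import math
import random


def k_fold_indices(n, k=5, seed=0):
    """Shuffle sample indices and deal them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


class CentroidScorer:
    """Toy base learner (a stand-in for a deep network, not the paper's model).

    Scores a sample by its relative distance to the two class centroids,
    looking only at the feature columns listed in `cols`.
    """

    def __init__(self, cols):
        self.cols = cols

    def fit(self, X, y):
        self.centroids = {}
        for label in (0, 1):
            rows = [[x[c] for c in self.cols] for x, t in zip(X, y) if t == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict_proba(self, X):
        scores = []
        for x in X:
            v = [x[c] for c in self.cols]
            d0 = math.dist(v, self.centroids[0])
            d1 = math.dist(v, self.centroids[1])
            # Far from centroid 0 relative to centroid 1 means "more like class 1".
            scores.append(d0 / (d0 + d1) if d0 + d1 > 0 else 0.5)
        return scores


def out_of_fold_features(make_learners, X, y, k=5):
    """Second-layer data: one column per base learner, filled fold by fold."""
    meta = [[0.0] * len(make_learners) for _ in X]
    for held_out in k_fold_indices(len(X), k):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        X_tr, y_tr = [X[i] for i in train], [y[i] for i in train]
        X_ho = [X[i] for i in held_out]
        for m, make in enumerate(make_learners):
            # Each sample is scored by a model that never saw it in training.
            preds = make().fit(X_tr, y_tr).predict_proba(X_ho)
            for i, p in zip(held_out, preds):
                meta[i][m] = p
    return meta


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def fit_logistic(Z, y, lr=0.5, epochs=500):
    """Meta-learner: logistic regression trained by batch gradient descent."""
    w = [0.0] * (len(Z[0]) + 1)  # last entry is the bias
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for z, t in zip(Z, y):
            err = sigmoid(sum(wi * zi for wi, zi in zip(w, z)) + w[-1]) - t
            for j, zj in enumerate(z):
                grad[j] += err * zj
            grad[-1] += err
        w = [wi - lr * g / len(Z) for wi, g in zip(w, grad)]
    return w


# Synthetic two-class data (hypothetical; not a PPI data set).
rng = random.Random(1)
y = [label for label in (0, 1) for _ in range(50)]
X = [[rng.gauss(2.0 * label, 1.0) for _ in range(4)] for label in y]

learners = [lambda: CentroidScorer([0, 1]),
            lambda: CentroidScorer([2, 3]),
            lambda: CentroidScorer([0, 1, 2, 3])]
meta = out_of_fold_features(learners, X, y, k=5)
w = fit_logistic(meta, y)
predicted = [int(sigmoid(sum(wi * zi for wi, zi in zip(w, z)) + w[-1]) > 0.5)
             for z in meta]
accuracy = sum(p == t for p, t in zip(predicted, y)) / len(y)
```

Because every row of `meta` is produced by models that did not train on that sample, the meta-learner is fitted on scores that resemble what the base learners would give on unseen pairs. In a typical stacking setup the base learners are then refit on the full training set before scoring new data; whether EnAmDNN does exactly this is not stated in the abstract.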
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2eda/7215070/6fdafd9e8393/fbioe-08-00390-g0001.jpg
