Wu Chuanyan, Lin Bentao, Zhang Jialin, Gao Rui, Song Rui, Liu Zhi-Ping
School of Intelligent Engineering, Shandong Management University, No.3500 Dingxiang Road, Jinan, Shandong, 250357, China.
School of Control Science and Engineering, Shandong University, No.17923 Jingshi Road, Jinan, Shandong, 250061, China.
Comput Struct Biotechnol J. 2024 Nov 29;23:4315-4323. doi: 10.1016/j.csbj.2024.11.039. eCollection 2024 Dec.
Identifying essential proteins is of utmost importance in the field of biomedical research due to their essential functions in cellular activities and their involvement in mechanisms related to diseases. In this research, a novel approach called AttentionEP for predicting essential proteins (EP) is introduced by attention mechanisms. This method leverages both cross-attention and self-attention frameworks, focusing on enhancing prediction accuracy through the integration of features across diverse scales. Spatial characteristics of proteins are obtained from the protein-protein interaction (PPI) network by employing Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT). Following this, Bidirectional Long Short-Term Memory networks (BiLSTM) are employed to derive temporal features from gene expression datasets. Furthermore, spatial characteristics are derived by integrating data on subcellular localization with the application of Deep Neural Networks (DNN). In order to effectively integrate features across multiple scales, initial steps involve the application of self-attention techniques to derive essential insights from each unique data set. Following this, mechanisms involving self-attention and cross-attention are employed to enhance the interaction between diverse information sources. To identify essential proteins, a classifier based on the ResNet architecture is developed. The findings from the experiments indicate that the method introduced here shows superior performance in identifying essential proteins, recording an Area Under the Curve (AUC) value of 0.9433. This approach shows a considerable advantage over established techniques. The findings of this study provide a significant advancement in the comprehension of critical proteins, revealing promising potential for applications in the development of therapeutics and addressing various diseases.
在生物医学研究领域,识别必需蛋白质至关重要,因为它们在细胞活动中具有重要功能,且参与与疾病相关的机制。在本研究中,通过注意力机制引入了一种名为AttentionEP的预测必需蛋白质(EP)的新方法。该方法利用交叉注意力和自注意力框架,通过整合不同尺度的特征来提高预测准确性。通过使用图卷积网络(GCN)和图注意力网络(GAT)从蛋白质 - 蛋白质相互作用(PPI)网络中获取蛋白质的空间特征。在此之后,使用双向长短期记忆网络(BiLSTM)从基因表达数据集中提取时间特征。此外,通过应用深度神经网络(DNN)将亚细胞定位数据整合来导出空间特征。为了有效整合多尺度特征,初始步骤包括应用自注意力技术从每个独特数据集中获得重要见解。在此之后,采用涉及自注意力和交叉注意力的机制来增强不同信息源之间的相互作用。为了识别必需蛋白质,开发了一种基于ResNet架构的分类器。实验结果表明,本文介绍的方法在识别必需蛋白质方面表现出卓越性能,曲线下面积(AUC)值达到0.9433。该方法相对于现有技术具有显著优势。本研究结果在关键蛋白质的理解方面取得了重大进展,在治疗药物开发和解决各种疾病方面显示出有前景的应用潜力。