Performing protein fold recognition by exploiting a stack convolutional neural network with the attention mechanism.

Author information

School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing, 210094, China.

Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria, 3800, Australia; Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, Victoria, 3800, Australia.

Publication information

Anal Biochem. 2022 Aug 15;651:114695. doi: 10.1016/j.ab.2022.114695. Epub 2022 Apr 26.

DOI:10.1016/j.ab.2022.114695

PMID:35487269

Abstract

Protein fold recognition is a critical step in protein structure and function prediction, and aims to ascertain the most likely fold type of the query protein. As a typical pattern recognition problem, designing a powerful feature extractor and metric function to extract relevant and representative fold-specific features from protein sequences is the key to improving protein fold recognition. In this study, we propose an effective sequence-based approach, called RattnetFold, to identify protein fold types. The basic concept of RattnetFold is to employ a stack convolutional neural network with the attention mechanism that acts as a feature extractor to extract fold-specific features from protein residue-residue contact maps. Moreover, based on the fold-specific features, we leverage metric learning to project fold-specific features into a subspace where similar proteins are closer together and name this approach RattnetFoldPro. Benchmarking experiments illustrate that RattnetFold and RattnetFoldPro enable the convolutional neural networks to efficiently learn the underlying subtle patterns in residue-residue contact maps, thereby improving the performance of protein fold recognition. An online web server of RattnetFold and the benchmark datasets are freely available at http://csbio.njust.edu.cn/bioinf/rattnetfold/.
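The abstract does not specify how the attention mechanism is wired into the convolutional stack. As a rough illustration of the general idea only, the sketch below applies softmax attention pooling to a handful of feature vectors. The names `attention_pool`, `features`, and `query` are hypothetical, the toy vectors stand in for CNN outputs computed from a contact map, and none of this should be read as the RattnetFold architecture.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(features, query):
    """Weight each feature vector by softmax(dot(feature, query)) and sum.

    features: list of equal-length vectors (hypothetical per-region CNN outputs)
    query:    vector of the same length (learned in a real model; fixed here)
    """
    scores = [sum(f_i * q_i for f_i, q_i in zip(f, query)) for f in features]
    weights = softmax(scores)
    dim = len(query)
    pooled = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return pooled, weights

# Toy input: three 2-dimensional vectors standing in for CNN feature maps.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 1.0]
pooled, weights = attention_pool(features, query)
```

The third vector has the largest dot product with the query, so it receives the largest weight. In a trained network the query (or an equivalent scoring layer) would be learned, which is how attention lets the model emphasize the regions of a contact map that are most informative for a fold.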

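Summary

Protein structure prediction is an important research direction in biology and biomedicine, with significance for understanding protein function, drug design, and disease diagnosis. Within protein structure prediction, recognizing the fold type of a protein is a key step. This paper proposes a method based on convolutional neural networks, called RattnetFold, for identifying protein fold types. The method uses a convolutional neural network to extract fold-specific features from protein residue contact maps and uses metric learning to project these features into a subspace in which similar proteins lie closer together. Benchmark experiments show that RattnetFold effectively learns the underlying patterns in residue contact maps, thereby improving fold recognition performance. The paper also provides an online server where users can run RattnetFold to predict protein fold types.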
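The text says that metric learning places similar proteins closer together in a subspace, but it does not name the loss or the distance used. The sketch below assumes a triplet loss with Euclidean distance and a nearest-template lookup. These are common choices picked purely for illustration, and the function names, fold labels, and embeddings are all hypothetical.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge loss that reaches zero once the negative (different fold) is at
    least `margin` farther from the anchor than the positive (same fold)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)

def nearest_fold(query, templates):
    """Return the fold label of the template embedding closest to the query."""
    return min(templates, key=lambda t: euclidean(query, t[1]))[0]

# Toy embeddings (assumed 2-D for readability).
templates = [("fold_A", [0.0, 0.0]), ("fold_B", [5.0, 5.0])]
query_embedding = [0.5, 0.2]
predicted = nearest_fold(query_embedding, templates)

# Same-fold pair at distance 1, different-fold pair at distance 5.
loss = triplet_loss([0.0, 0.0], [0.0, 1.0], [3.0, 4.0], margin=1.0)
```

Once the embedding satisfies the margin for most triplets, assigning a fold reduces to finding the closest labeled template, which is what `nearest_fold` does here.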

Similar articles

Why can deep convolutional neural networks improve protein fold recognition? A visual explanation by interpretation.

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab001.

Improving protein fold recognition using triplet network and ensemble deep learning.

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab248.

Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.

Bioinformatics. 2017 Dec 1;33(23):3749-3757. doi: 10.1093/bioinformatics/btx514.

ResCNNT-fold: Combining residual convolutional neural network and Transformer for protein fold recognition from language model embeddings.

Comput Biol Med. 2023 Nov;166:107571. doi: 10.1016/j.compbiomed.2023.107571. Epub 2023 Oct 17.

Signal-3L 3.0: Improving Signal Peptide Prediction through Combining Attention Deep Learning with Window-Based Scoring.

J Chem Inf Model. 2020 Jul 27;60(7):3679-3686. doi: 10.1021/acs.jcim.0c00401. Epub 2020 Jul 1.

CoCoPRED: coiled-coil protein structural feature prediction from amino acid sequence using deep neural networks.

Bioinformatics. 2022 Jan 12;38(3):720-729. doi: 10.1093/bioinformatics/btab744.

SSCpred: Single-Sequence-Based Protein Contact Prediction Using Deep Fully Convolutional Network.

J Chem Inf Model. 2020 Jun 22;60(6):3295-3303. doi: 10.1021/acs.jcim.9b01207. Epub 2020 May 15.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

deepNEC: a novel alignment-free tool for the identification and classification of nitrogen biochemical network-related enzymes using deep learning.

Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac071.
