基于空洞空间金字塔网络的深度集成学习用于蛋白质二级结构预测

Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction.

作者信息

Guo Yuzhi, Wu Jiaxiang, Ma Hehuan, Wang Sheng, Huang Junzhou

机构信息

Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, TX 76019, USA.

AI Lab, Tencent, Shenzhen 508929, China.

出版信息

Biomolecules. 2022 Jun 2;12(6):774. doi: 10.3390/biom12060774.

DOI:10.3390/biom12060774

PMID:35740899

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9221033/

Abstract

The secondary structure of proteins is significant for studying the three-dimensional structure and functions of proteins. Several models from image understanding and natural language modeling have been successfully adapted in the protein sequence study area, such as Long Short-term Memory (LSTM) network and Convolutional Neural Network (CNN). Recently, Gated Convolutional Neural Network (GCNN) has been proposed for natural language processing. It has achieved high levels of sentence scoring, as well as reduced the latency. Conditionally Parameterized Convolution (CondConv) is another novel study which has gained great success in the image processing area. Compared with vanilla CNN, CondConv uses extra sample-dependant modules to conditionally adjust the convolutional network. In this paper, we propose a novel Conditionally Parameterized Convolutional network (CondGCNN) which utilizes the power of both CondConv and GCNN. CondGCNN leverages an ensemble encoder to combine the capabilities of both LSTM and CondGCNN to encode protein sequences by better capturing protein sequential features. In addition, we explore the similarity between the secondary structure prediction problem and the image segmentation problem, and propose an ASP network (Atrous Spatial Pyramid Pooling (ASPP) based network) to capture fine boundary details in secondary structure. Extensive experiments show that the proposed method can achieve higher performance on protein secondary structure prediction task than existing methods on CB513, Casp11, CASP12, CASP13, and CASP14 datasets. We also conducted ablation studies over each component to verify the effectiveness. Our method is expected to be useful for any protein related prediction tasks, which is not limited to protein secondary structure prediction.

摘要

蛋白质的二级结构对于研究蛋白质的三维结构和功能具有重要意义。图像理解和自然语言建模中的几种模型已成功应用于蛋白质序列研究领域，如长短期记忆（LSTM）网络和卷积神经网络（CNN）。最近，门控卷积神经网络（GCNN）被提出用于自然语言处理。它在句子评分方面取得了很高的水平，并减少了延迟。条件参数化卷积（CondConv）是另一项在图像处理领域取得巨大成功的新颖研究。与普通CNN相比，CondConv使用额外的样本相关模块来有条件地调整卷积网络。在本文中，我们提出了一种新颖的条件参数化卷积网络（CondGCNN），它利用了CondConv和GCNN的优势。CondGCNN利用一个集成编码器来结合LSTM和CondGCNN的能力，通过更好地捕捉蛋白质序列特征来编码蛋白质序列。此外，我们探索了二级结构预测问题与图像分割问题之间的相似性，并提出了一个基于空洞空间金字塔池化（ASPP）的网络（ASP网络）来捕捉二级结构中的精细边界细节。大量实验表明，所提出的方法在CB513、Casp11、CASP12、CASP13和CASP14数据集上的蛋白质二级结构预测任务中比现有方法具有更高的性能。我们还对每个组件进行了消融研究以验证其有效性。我们的方法有望对任何与蛋白质相关的预测任务有用，而不仅限于蛋白质二级结构预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5484/9221033/47ab3039fb28/biomolecules-12-00774-g001.jpg

相似文献

Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction.基于空洞空间金字塔网络的深度集成学习用于蛋白质二级结构预测

Biomolecules. 2022 Jun 2;12(6):774. doi: 10.3390/biom12060774.

Ensemble deep learning models for protein secondary structure prediction using bidirectional temporal convolution and bidirectional long short-term memory.使用双向时间卷积和双向长短期记忆的集成深度学习模型用于蛋白质二级结构预测。

Front Bioeng Biotechnol. 2023 Feb 13;11:1051268. doi: 10.3389/fbioe.2023.1051268. eCollection 2023.

A multiple-channel and atrous convolution network for ultrasound image segmentation.一种用于超声图像分割的多通道多孔卷积网络。

Med Phys. 2020 Dec;47(12):6270-6285. doi: 10.1002/mp.14512. Epub 2020 Oct 18.

Boundary-aware context neural network for medical image segmentation.边界感知上下文神经网络在医学图像分割中的应用。

Med Image Anal. 2022 May;78:102395. doi: 10.1016/j.media.2022.102395. Epub 2022 Feb 14.

Prediction of 8-state protein secondary structures by a novel deep learning architecture.一种新型深度学习架构预测 8 态蛋白质二级结构。

BMC Bioinformatics. 2018 Aug 3;19(1):293. doi: 10.1186/s12859-018-2280-5.

OCLSTM: Optimized convolutional and long short-term memory neural network model for protein secondary structure prediction.OCLSTM：用于蛋白质二级结构预测的优化卷积和长短期记忆神经网络模型。

PLoS One. 2021 Feb 3;16(2):e0245982. doi: 10.1371/journal.pone.0245982. eCollection 2021.

Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks.通过与二维卷积神经网络集成的循环神经网络改进蛋白质二级结构预测。

J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X.

Synthetic CT reconstruction using a deep spatial pyramid convolutional framework for MR-only breast radiotherapy.基于深度空间金字塔卷积框架的合成 CT 重建技术在仅 MRI 乳腺癌放疗中的应用。

Med Phys. 2019 Sep;46(9):4135-4147. doi: 10.1002/mp.13716. Epub 2019 Aug 7.

DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction.DeepACLSTM：用于蛋白质二级结构预测的深度非对称卷积长短时记忆神经模型。

BMC Bioinformatics. 2019 Jun 17;20(1):341. doi: 10.1186/s12859-019-2940-0.

Efficient deep learning architecture with dimension-wise pyramid pooling for nuclei segmentation of histopathology images.基于维度金字塔池化的高效深度学习架构，用于组织病理学图像的细胞核分割。

Comput Med Imaging Graph. 2021 Oct;93:101975. doi: 10.1016/j.compmedimag.2021.101975. Epub 2021 Aug 23.

引用本文的文献

DNA promoter task-oriented dictionary mining and prediction model based on natural language technology.基于自然语言技术的DNA启动子任务导向型词典挖掘与预测模型

Sci Rep. 2025 Jan 2;15(1):153. doi: 10.1038/s41598-024-84105-9.

Advances in Computational Intelligence-Based Methods of Structure and Function Prediction of Proteins.基于计算智能的蛋白质结构与功能预测方法的进展。

Biomolecules. 2024 Aug 29;14(9):1083. doi: 10.3390/biom14091083.

本文引用的文献

Comprehensive Study on Enhancing Low-Quality Position-Specific Scoring Matrix with Deep Learning for Accurate Protein Structure Property Prediction: Using Bagging Multiple Sequence Alignment Learning.利用Bagging多序列比对学习，通过深度学习增强低质量位置特异性评分矩阵以进行准确蛋白质结构特性预测的综合研究

J Comput Biol. 2021 Apr;28(4):346-361. doi: 10.1089/cmb.2020.0416. Epub 2021 Feb 22.

EPTool: A New Enhancing PSSM Tool for Protein Secondary Structure Prediction.EPTool：一种用于蛋白质二级结构预测的新型增强 PSSM 工具。

J Comput Biol. 2021 Apr;28(4):362-364. doi: 10.1089/cmb.2020.0417. Epub 2020 Dec 1.

BMC Bioinformatics. 2019 Jun 17;20(1):341. doi: 10.1186/s12859-019-2940-0.

Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks.利用预测的接触图和递归与残差卷积神经网络的集合来改进蛋白质二级结构、主链角度、溶剂可及性和接触数的预测。

Bioinformatics. 2019 Jul 15;35(14):2403-2410. doi: 10.1093/bioinformatics/bty1006.

J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X.

Detecting Proline and Non-Proline Cis Isomers in Protein Structures from Sequences Using Deep Residual Ensemble Learning.基于深度残差集成学习从序列中检测蛋白质结构中的脯氨酸和顺式异构体和非脯氨酸异构体。

J Chem Inf Model. 2018 Sep 24;58(9):2033-2042. doi: 10.1021/acs.jcim.8b00442. Epub 2018 Aug 29.

MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.MUFOLD-SS：用于蛋白质二级结构预测的新深度 inception-inside-inception 网络。

Proteins. 2018 May;86(5):592-598. doi: 10.1002/prot.25487. Epub 2018 Mar 12.

Direct detection of a break in the teraelectronvolt cosmic-ray spectrum of electrons and positrons.直接探测到太电子伏特能区电子和正电子宇宙射线能谱的断裂。

Nature. 2017 Dec 7;552(7683):63-66. doi: 10.1038/nature24475. Epub 2017 Nov 29.

PDBsum: Structural summaries of PDB entries.PDBsum：蛋白质数据库（PDB）条目的结构摘要。

Protein Sci. 2018 Jan;27(1):129-134. doi: 10.1002/pro.3289. Epub 2017 Oct 27.

Protein secondary structure prediction: A survey of the state of the art.蛋白质二级结构预测：最新技术综述。

J Mol Graph Model. 2017 Sep;76:379-402. doi: 10.1016/j.jmgm.2017.07.015. Epub 2017 Jul 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于空洞空间金字塔网络的深度集成学习用于蛋白质二级结构预测

Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献