Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

Authors

Wang Sheng, Peng Jian, Ma Jianzhu, Xu Jinbo

Affiliations

Toyota Technological Institute at Chicago, Chicago, IL.

Department of Human Genetics, University of Chicago, Chicago, IL.

Publication information

Sci Rep. 2016 Jan 11;6:18962. doi: 10.1038/srep18962.

DOI:10.1038/srep18962

PMID:26752681

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4707437/

Abstract

Protein secondary structure (SS) prediction is important for studying protein structure and function. When only sequence (profile) information is used as the input feature, the best current predictors obtain ~80% Q3 accuracy, a figure that has not improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a deep learning extension of Conditional Neural Fields (CNF), which integrate Conditional Random Fields (CRF) with shallow neural networks. DeepCNF models not only the complex sequence-structure relationship, through a deep hierarchical architecture, but also the interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF obtains ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can also be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
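The Q3 and Q8 figures quoted in the abstract are per-residue accuracies over 3-state and 8-state label alphabets. The following is a minimal sketch of how such scores are typically computed, not the authors' evaluation code. The 8-to-3 reduction table, the function names, and the toy label strings are all assumptions made for illustration; the reduction shown is one common convention and may differ from the mapping used in the paper.

```python
# Assumed 8-to-3 reduction of secondary structure states (one common
# convention; not necessarily the exact mapping used in the paper).
SS8_TO_SS3 = {
    "H": "H", "G": "H", "I": "H",   # helix types -> helix
    "E": "E", "B": "E",             # strand and bridge -> strand
    "T": "C", "S": "C", "L": "C",   # turn, bend, loop -> coil
}


def reduce_to_3_state(ss8: str) -> str:
    """Map an 8-state label string to 3 states; unknown symbols become coil."""
    return "".join(SS8_TO_SS3.get(label, "C") for label in ss8)


def per_residue_accuracy(predicted: str, observed: str) -> float:
    """Fraction of positions where the predicted label equals the observed one."""
    if len(predicted) != len(observed):
        raise ValueError("label strings must have equal length")
    if not observed:
        raise ValueError("label strings must be non-empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


# Toy 10-residue example (made-up labels, not data from the paper).
observed_ss8 = "LHHHHGTEEL"
predicted_ss8 = "LHHHHHTEBL"

q8 = per_residue_accuracy(predicted_ss8, observed_ss8)
q3 = per_residue_accuracy(
    reduce_to_3_state(predicted_ss8),
    reduce_to_3_state(observed_ss8),
)
print(f"Q8 = {q8:.2f}, Q3 = {q3:.2f}")  # prints "Q8 = 0.80, Q3 = 1.00"
```

In this toy case the two mismatches (G predicted as H, E predicted as B) fall within the same 3-state class, which illustrates why Q3 is generally higher than Q8 for the same prediction. SOV is a segment-overlap measure and is not covered by this sketch.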

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbe6/4707437/afc122377320/srep18962-f1.jpg

Similar articles

RaptorX-Property: a web server for protein structure property prediction.

Nucleic Acids Res. 2016 Jul 8;44(W1):W430-5. doi: 10.1093/nar/gkw306. Epub 2016 Apr 25.

AUCpreD: proteome-level protein disorder prediction by AUC-maximized deep convolutional neural fields.

Bioinformatics. 2016 Sep 1;32(17):i672-i679. doi: 10.1093/bioinformatics/btw446.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

AUC-Maximized Deep Convolutional Neural Fields for Protein Sequence Labeling.

Mach Learn Knowl Discov Databases. 2016 Sep;9852:1-16. doi: 10.1007/978-3-319-46227-1_1. Epub 2016 Sep 4.

AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model.

Biomed Res Int. 2015;2015:678764. doi: 10.1155/2015/678764. Epub 2015 Aug 3.

DeepCNF-D: Predicting Protein Order/Disorder Regions by Weighted Deep Convolutional Neural Fields.

Int J Mol Sci. 2015 Jul 29;16(8):17315-30. doi: 10.3390/ijms160817315.

Protein 8-class secondary structure prediction using conditional neural fields.

Proteomics. 2011 Oct;11(19):3786-92. doi: 10.1002/pmic.201100196. Epub 2011 Aug 31.

Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks.

J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X.

A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):103-12. doi: 10.1109/TCBB.2014.2343960. Epub 2014 Aug 7.

Cited by

Artificial intelligence in orthopedics: fundamentals, current applications, and future perspectives.

Mil Med Res. 2025 Aug 4;12(1):42. doi: 10.1186/s40779-025-00633-z.

Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning.

Proteins. 2025 Aug 1. doi: 10.1002/prot.70033.

Design of a multi-epitope vaccine against drug-resistant mycobacterium tuberculosis and mycobacterium bovis using reverse vaccinology.

Sci Rep. 2025 Jul 26;15(1):27298. doi: 10.1038/s41598-025-11768-3.

NTMFF-DTA: Prediction of Drug-Target Affinity Based on Network Topology and Multi-feature Fusion.

Interdiscip Sci. 2025 Feb 25. doi: 10.1007/s12539-025-00692-9.

Accurate prediction of nucleic acid binding proteins using protein language model.

Bioinform Adv. 2025 Jan 20;5(1):vbaf008. doi: 10.1093/bioadv/vbaf008. eCollection 2025.

Post-processing enhances protein secondary structure prediction with second order deep learning and embeddings.

Comput Struct Biotechnol J. 2025 Jan 2;27:243-251. doi: 10.1016/j.csbj.2024.12.022. eCollection 2025.

MHTAPred-SS: A Highly Targeted Autoencoder-Driven Deep Multi-Task Learning Framework for Accurate Protein Secondary Structure Prediction.

Int J Mol Sci. 2024 Dec 15;25(24):13444. doi: 10.3390/ijms252413444.

Designing and comparative analysis of anti-oxidant and heat shock proteins based multi-epitopic filarial vaccines.

BMC Infect Dis. 2024 Dec 18;24(1):1436. doi: 10.1186/s12879-024-10272-9.

Immunoinformatics design of a multi-epitope vaccine for Chlamydia trachomatis major outer membrane proteins.

Sci Rep. 2024 Dec 2;14(1):29919. doi: 10.1038/s41598-024-81736-w.

Prediction of the Stability of Protein Substructures Using AI/ML Techniques.

Methods Mol Biol. 2025;2870:153-182. doi: 10.1007/978-1-0716-4213-9_9.
