• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于蛋白质二级结构预测的深度剖面和级联递归与卷积神经网络。

Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction.

机构信息

School of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland.

出版信息

Sci Rep. 2019 Aug 26;9(1):12374. doi: 10.1038/s41598-019-48786-x.

DOI:10.1038/s41598-019-48786-x
PMID:31451723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6710256/
Abstract

Protein Secondary Structure prediction has been a central topic of research in Bioinformatics for decades. In spite of this, even the most sophisticated ab initio SS predictors are not able to reach the theoretical limit of three-state prediction accuracy (88-90%), while only a few predict more than the 3 traditional Helix, Strand and Coil classes. In this study we present tests on different models trained both on single sequence and evolutionary profile-based inputs and develop a new state-of-the-art system with Porter 5. Porter 5 is composed of ensembles of cascaded Bidirectional Recurrent Neural Networks and Convolutional Neural Networks, incorporates new input encoding techniques and is trained on a large set of protein structures. Porter 5 achieves 84% accuracy (81% SOV) when tested on 3 classes and 73% accuracy (70% SOV) on 8 classes on a large independent set. In our tests Porter 5 is 2% more accurate than its previous version and outperforms or matches the most recent predictors of secondary structure we tested. When Porter 5 is retrained on SCOPe based sets that eliminate homology between training/testing samples we obtain similar results. Porter is available as a web server and standalone program at http://distilldeep.ucd.ie/porter/ alongside all the datasets and alignments.

摘要

蛋白质二级结构预测是生物信息学研究的核心课题之一。尽管如此,即使是最复杂的从头预测 SS 也无法达到理论上的三态预测精度极限(88-90%),而只有少数预测方法可以预测出超过 3 种传统的螺旋、链和卷曲类别。在这项研究中,我们对基于单序列和基于进化轮廓的输入进行训练的不同模型进行了测试,并开发了一个新的基于 Porter 5 的最先进系统。Porter 5 由级联的双向递归神经网络和卷积神经网络的集合组成,采用了新的输入编码技术,并在大量蛋白质结构数据集上进行了训练。当在一个大型独立数据集上测试 3 个类别时,Porter 5 的准确率达到 84%(81% SOV),当测试 8 个类别时,准确率达到 73%(70% SOV)。在我们的测试中,Porter 5 比其前一版本精确了 2%,并且优于或匹配我们测试的最近的二级结构预测器。当 Porter 5 在基于 SCOPe 的数据集上进行重新训练时,该数据集消除了训练/测试样本之间的同源性,我们得到了类似的结果。Porter 可作为一个网络服务器和独立程序,在 http://distilldeep.ucd.ie/porter/上提供,同时提供所有数据集和比对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce44/6710256/ec6efabc81b1/41598_2019_48786_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce44/6710256/ec6efabc81b1/41598_2019_48786_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce44/6710256/ec6efabc81b1/41598_2019_48786_Fig1_HTML.jpg

相似文献

1
Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction.用于蛋白质二级结构预测的深度剖面和级联递归与卷积神经网络。
Sci Rep. 2019 Aug 26;9(1):12374. doi: 10.1038/s41598-019-48786-x.
2
Porter: a new, accurate server for protein secondary structure prediction.波特:一种用于蛋白质二级结构预测的新型精确服务器。
Bioinformatics. 2005 Apr 15;21(8):1719-20. doi: 10.1093/bioinformatics/bti203. Epub 2004 Dec 7.
3
Cascaded bidirectional recurrent neural networks for protein secondary structure prediction.用于蛋白质二级结构预测的级联双向递归神经网络
IEEE/ACM Trans Comput Biol Bioinform. 2007 Oct-Dec;4(4):572-582. doi: 10.1109/tcbb.2007.1055.
4
Protein structural motif prediction in multidimensional phi-psi space leads to improved secondary structure prediction.多维φ-ψ空间中的蛋白质结构基序预测可改进二级结构预测。
J Comput Biol. 2006 Oct;13(8):1489-502. doi: 10.1089/cmb.2006.13.1489.
5
Brewery: deep learning and deeper profiles for the prediction of 1D protein structure annotations.啤酒厂:深度学习和更深入的蛋白质一维结构注释预测。
Bioinformatics. 2020 Jun 1;36(12):3897-3898. doi: 10.1093/bioinformatics/btaa204.
6
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.基于二维递归神经网络的多类别距离图的从头预测和基于模板的预测。
BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.
7
Prediction of 8-state protein secondary structures by a novel deep learning architecture.一种新型深度学习架构预测 8 态蛋白质二级结构。
BMC Bioinformatics. 2018 Aug 3;19(1):293. doi: 10.1186/s12859-018-2280-5.
8
Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks.利用预测的接触图和递归与残差卷积神经网络的集合来改进蛋白质二级结构、主链角度、溶剂可及性和接触数的预测。
Bioinformatics. 2019 Jul 15;35(14):2403-2410. doi: 10.1093/bioinformatics/bty1006.
9
SCLpred-EMS: subcellular localization prediction of endomembrane system and secretory pathway proteins by Deep N-to-1 Convolutional Neural Networks.SCLpred-EMS:基于深度 N 到 1 卷积神经网络的内膜系统和分泌途径蛋白的亚细胞定位预测。
Bioinformatics. 2020 Jun 1;36(11):3343-3349. doi: 10.1093/bioinformatics/btaa156.
10
A two-stage approach for improved prediction of residue contact maps.一种用于改进残基接触图预测的两阶段方法。
BMC Bioinformatics. 2006 Mar 30;7:180. doi: 10.1186/1471-2105-7-180.

引用本文的文献

1
Integration of proteomics and bioinformatics in traumatic brain injury biomarker discovery.蛋白质组学与生物信息学在创伤性脑损伤生物标志物发现中的整合
BioTechnologia (Pozn). 2025 Jun 30;106(2):123-150. doi: 10.5114/bta/202470. eCollection 2025.
2
DeepPredict: a state-of-the-art web server for protein secondary structure and relative solvent accessibility prediction.DeepPredict:用于蛋白质二级结构和相对溶剂可及性预测的先进网络服务器。
Front Bioinform. 2025 Jun 6;5:1607402. doi: 10.3389/fbinf.2025.1607402. eCollection 2025.
3
Advancements in one-dimensional protein structure prediction using machine learning and deep learning.

本文引用的文献

1
NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning.NetSurfP-2.0:通过集成深度学习改进蛋白质结构特征预测。
Proteins. 2019 Jun;87(6):520-527. doi: 10.1002/prot.25674. Epub 2019 Mar 9.
2
Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks.利用预测的接触图和递归与残差卷积神经网络的集合来改进蛋白质二级结构、主链角度、溶剂可及性和接触数的预测。
Bioinformatics. 2019 Jul 15;35(14):2403-2410. doi: 10.1093/bioinformatics/bty1006.
3
利用机器学习和深度学习进行一维蛋白质结构预测的进展。
Comput Struct Biotechnol J. 2025 Apr 3;27:1416-1430. doi: 10.1016/j.csbj.2025.04.005. eCollection 2025.
4
PUNCH2: Explore the strategy for intrinsically disordered protein predictor.PUNCH2:探索内在无序蛋白质预测器的策略。
PLoS One. 2025 Mar 26;20(3):e0319208. doi: 10.1371/journal.pone.0319208. eCollection 2025.
5
PaleAle 6.0: Prediction of Protein Relative Solvent Accessibility by Leveraging Pre-Trained Language Models (PLMs).淡色艾尔6.0:利用预训练语言模型预测蛋白质相对溶剂可及性
Biomolecules. 2025 Jan 2;15(1):49. doi: 10.3390/biom15010049.
6
Porter 6: Protein Secondary Structure Prediction by Leveraging Pre-Trained Language Models (PLMs).波特6:利用预训练语言模型(PLMs)进行蛋白质二级结构预测。
Int J Mol Sci. 2024 Dec 27;26(1):130. doi: 10.3390/ijms26010130.
7
Impact of Alignments on the Accuracy of Protein Subcellular Localization Predictions.序列比对对蛋白质亚细胞定位预测准确性的影响。
Proteins. 2025 Mar;93(3):745-759. doi: 10.1002/prot.26767. Epub 2024 Nov 22.
8
SCLpred-ECL: Subcellular Localization Prediction by Deep N-to-1 Convolutional Neural Networks.SCLpred-ECL:基于深度 N-to-1 卷积神经网络的亚细胞定位预测。
Int J Mol Sci. 2024 May 16;25(10):5440. doi: 10.3390/ijms25105440.
9
SERT-StructNet: Protein secondary structure prediction method based on multi-factor hybrid deep model.SERT-StructNet:基于多因素混合深度模型的蛋白质二级结构预测方法。
Comput Struct Biotechnol J. 2024 Mar 22;23:1364-1375. doi: 10.1016/j.csbj.2024.03.018. eCollection 2024 Dec.
10
Characterizing the flavodoxin landscape in .阐明. 中的黄素蛋白景观
Microbiol Spectr. 2024 Mar 5;12(3):e0189523. doi: 10.1128/spectrum.01895-23. Epub 2024 Feb 6.
Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks.
通过将残差二维双向长短期记忆与卷积神经网络相结合,准确预测蛋白质接触图。
Bioinformatics. 2018 Dec 1;34(23):4039-4045. doi: 10.1093/bioinformatics/bty481.
4
SOV_refine: A further refined definition of segment overlap score and its significance for protein structure similarity.SOV细化:片段重叠分数的进一步细化定义及其对蛋白质结构相似性的意义。
Source Code Biol Med. 2018 Apr 20;13:1. doi: 10.1186/s13029-018-0068-7. eCollection 2018.
5
MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.MUFOLD-SS:用于蛋白质二级结构预测的新深度 inception-inside-inception 网络。
Proteins. 2018 May;86(5):592-598. doi: 10.1002/prot.25487. Epub 2018 Mar 12.
6
Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12.连续自动模型评估(CAMEO)对蛋白质结构预测关键评估(CASP12)的补充
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):387-398. doi: 10.1002/prot.25431. Epub 2017 Dec 17.
7
Assessment of contact predictions in CASP12: Co-evolution and deep learning coming of age.蛋白质结构预测技术关键评估第12轮(CASP12)中的接触预测评估:协同进化与深度学习走向成熟。
Proteins. 2018 Mar;86 Suppl 1(Suppl Suppl 1):51-66. doi: 10.1002/prot.25407. Epub 2017 Nov 7.
8
Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility.利用长短期记忆双向递归神经网络捕捉非局部相互作用,提高蛋白质二级结构、主链角度、接触数和溶剂可及性的预测能力。
Bioinformatics. 2017 Sep 15;33(18):2842-2849. doi: 10.1093/bioinformatics/btx218.
9
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?蛋白质二级结构预测的长征:终章?
Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129.
10
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.