• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

快速学习优化预测方法(FLOPRED)在蛋白质二级结构预测中的应用。

Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction.

机构信息

Battelle Center for Mathematical Medicine, The Research Institute at Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, USA.

出版信息

J Mol Model. 2012 Sep;18(9):4275-89. doi: 10.1007/s00894-012-1410-7. Epub 2012 May 8.

DOI:10.1007/s00894-012-1410-7
PMID:22562230
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3694724/
Abstract

Computational methods are rapidly gaining importance in the field of structural biology, mostly due to the explosive progress in genome sequencing projects and the large disparity between the number of sequences and the number of structures. There has been an exponential growth in the number of available protein sequences and a slower growth in the number of structures. There is therefore an urgent need to develop computational methods to predict structures and identify their functions from the sequence. Developing methods that will satisfy these needs both efficiently and accurately is of paramount importance for advances in many biomedical fields, including drug development and discovery of biomarkers. A novel method called fast learning optimized prediction methodology (FLOPRED) is proposed for predicting protein secondary structure, using knowledge-based potentials combined with structure information from the CATH database. A neural network-based extreme learning machine (ELM) and advanced particle swarm optimization (PSO) are used with this data that yield better and faster convergence to produce more accurate results. Protein secondary structures are predicted reliably, more efficiently and more accurately using FLOPRED. These techniques yield superior classification of secondary structure elements, with a training accuracy ranging between 83 % and 87 % over a widerange of hidden neurons and a cross-validated testing accuracy ranging between 81 % and 84 % and a segment overlap (SOV) score of 78 % that are obtained with different sets of proteins. These results are comparable to other recently published studies, but are obtained with greater efficiencies, in terms of time and cost.

摘要

计算方法在结构生物学领域的重要性日益增加,主要是由于基因组测序项目的飞速发展,以及序列数量和结构数量之间的巨大差距。可用蛋白质序列的数量呈指数级增长,而结构数量的增长则较慢。因此,迫切需要开发计算方法,从序列中预测结构并识别其功能。开发既高效又准确的方法对于许多生物医学领域的进展至关重要,包括药物开发和生物标志物的发现。提出了一种名为快速学习优化预测方法(FLOPRED)的新方法,用于预测蛋白质二级结构,该方法结合了基于知识的势能和 CATH 数据库中的结构信息。使用神经网络的极限学习机(ELM)和高级粒子群优化(PSO)来处理这些数据,以更好更快地收敛,从而产生更准确的结果。FLOPRED 可可靠、高效、准确地预测蛋白质二级结构。这些技术可实现二级结构元素的分类,在不同的隐藏神经元范围内,训练准确率在 83%到 87%之间,交叉验证测试准确率在 81%到 84%之间,片段重叠(SOV)分数为 78%,这些结果与其他最近发表的研究相当,但效率更高,无论是在时间还是成本方面。

相似文献

1
Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction.快速学习优化预测方法(FLOPRED)在蛋白质二级结构预测中的应用。
J Mol Model. 2012 Sep;18(9):4275-89. doi: 10.1007/s00894-012-1410-7. Epub 2012 May 8.
2
Distributions of amino acids suggest that certain residue types more effectively determine protein secondary structure.氨基酸的分布表明,某些残基类型更有效地决定蛋白质的二级结构。
J Mol Model. 2013 Oct;19(10):4337-48. doi: 10.1007/s00894-013-1911-z. Epub 2013 Aug 2.
3
Improving protein secondary structure prediction using a multi-modal BP method.利用多模态 BP 方法改进蛋白质二级结构预测。
Comput Biol Med. 2011 Oct;41(10):946-59. doi: 10.1016/j.compbiomed.2011.08.005. Epub 2011 Aug 30.
4
Protein secondary structure prediction: A survey of the state of the art.蛋白质二级结构预测:最新技术综述。
J Mol Graph Model. 2017 Sep;76:379-402. doi: 10.1016/j.jmgm.2017.07.015. Epub 2017 Jul 19.
5
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.基于预测的二级结构集合和多重比对,以超过80%的准确率预测β转角。
BMC Bioinformatics. 2008 Oct 10;9:430. doi: 10.1186/1471-2105-9-430.
6
PATSIM: Prediction and analysis of protein sequences using hybrid Knuth-Morris Pratt (KMP) and Boyer-Moore (BM) algorithm.PATSIM:使用混合的克努特-莫里斯-普拉特(KMP)算法和博耶-摩尔(BM)算法对蛋白质序列进行预测和分析。
Gene. 2018 May 30;657:50-59. doi: 10.1016/j.gene.2018.02.069. Epub 2018 Mar 6.
7
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.β-桶状外膜蛋白拓扑结构预测方法的评估及一种共识预测方法
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.
8
Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training.通过大规模训练实现二级结构预测的80%十折交叉验证准确率。
Proteins. 2007 Mar 1;66(4):838-45. doi: 10.1002/prot.21298.
9
Protein secondary structure prediction based on the fuzzy support vector machine with the hyperplane optimization.基于超平面优化的模糊支持向量机的蛋白质二级结构预测
Gene. 2018 Feb 5;642:74-83. doi: 10.1016/j.gene.2017.11.005. Epub 2017 Nov 14.
10
DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.DeepCDpred:用于改进蛋白质结构预测的残差间距离和接触预测。
PLoS One. 2019 Jan 8;14(1):e0205214. doi: 10.1371/journal.pone.0205214. eCollection 2019.

引用本文的文献

1
Sulfatase 2 Is Associated with Steroid Resistance in Childhood Nephrotic Syndrome.硫酸酯酶2与儿童肾病综合征的类固醇抵抗有关。
J Clin Med. 2021 Feb 2;10(3):523. doi: 10.3390/jcm10030523.
2
Boosting the accuracy of protein secondary structure prediction through nearest neighbor search and method hybridization.通过最近邻搜索和方法杂交提高蛋白质二级结构预测的准确性。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i317-i325. doi: 10.1093/bioinformatics/btaa336.
3
Prediction of Protein Tertiary Structure via Regularized Template Classification Techniques.基于正则化模板分类技术的蛋白质三级结构预测。
Molecules. 2020 May 26;25(11):2467. doi: 10.3390/molecules25112467.
4
Predicting protein tertiary structure and its uncertainty analysis via particle swarm sampling.通过粒子群采样预测蛋白质三级结构及其不确定性分析。
J Mol Model. 2019 Feb 27;25(3):79. doi: 10.1007/s00894-019-3956-0.
5
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?蛋白质二级结构预测的长征:终章?
Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129.
6
Protein secondary structure prediction using a small training set (compact model) combined with a Complex-valued neural network approach.使用小型训练集(紧凑模型)结合复值神经网络方法进行蛋白质二级结构预测。
BMC Bioinformatics. 2016 Sep 13;17(1):362. doi: 10.1186/s12859-016-1209-0.
7
Distributions of amino acids suggest that certain residue types more effectively determine protein secondary structure.氨基酸的分布表明,某些残基类型更有效地决定蛋白质的二级结构。
J Mol Model. 2013 Oct;19(10):4337-48. doi: 10.1007/s00894-013-1911-z. Epub 2013 Aug 2.
8
Enhanced hybrid search algorithm for protein structure prediction using the 3D-HP lattice model.使用 3D-HP 格子模型的蛋白质结构预测的增强型混合搜索算法。
J Mol Model. 2013 Sep;19(9):3883-91. doi: 10.1007/s00894-013-1907-8. Epub 2013 Jul 4.

本文引用的文献

1
SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.SPINE X:通过多步骤学习与溶剂可及表面积和骨架扭转角预测相结合来改进蛋白质二级结构预测。
J Comput Chem. 2012 Jan 30;33(3):259-67. doi: 10.1002/jcc.21968. Epub 2011 Nov 2.
2
ICGA-PSO-ELM approach for accurate multiclass cancer classification resulting in reduced gene sets in which genes encoding secreted proteins are highly represented.ICGA-PSO-ELM 方法可实现精确的多癌症分类,减少了基因集,其中高度代表了编码分泌蛋白的基因。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):452-63. doi: 10.1109/TCBB.2010.13.
3
Proteins that switch folds.构象转换蛋白。
Curr Opin Struct Biol. 2010 Aug;20(4):482-8. doi: 10.1016/j.sbi.2010.06.002. Epub 2010 Jun 28.
4
Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks.使用模块化互反双向递归神经网络进行蛋白质二级结构预测。
Comput Methods Programs Biomed. 2010 Dec;100(3):237-47. doi: 10.1016/j.cmpb.2010.04.005. Epub 2010 May 15.
5
Improving protein secondary structure prediction using a simple k-mer model.利用简单的 k- -mer 模型改进蛋白质二级结构预测。
Bioinformatics. 2010 Mar 1;26(5):596-602. doi: 10.1093/bioinformatics/btq020. Epub 2010 Feb 3.
6
A minimal sequence code for switching protein structure and function.一种用于切换蛋白质结构和功能的最小序列码。
Proc Natl Acad Sci U S A. 2009 Dec 15;106(50):21149-54. doi: 10.1073/pnas.0906408106. Epub 2009 Nov 18.
7
Protein secondary structure prediction using three neural networks and a segmental semi Markov model.使用三个神经网络和一个分段半马尔可夫模型进行蛋白质二级结构预测。
Math Biosci. 2009 Feb;217(2):145-50. doi: 10.1016/j.mbs.2008.11.001. Epub 2008 Nov 18.
8
Assessing the reliability of sequence similarities detected through hydrophobic cluster analysis.评估通过疏水簇分析检测到的序列相似性的可靠性。
Proteins. 2008 Mar;70(4):1588-94. doi: 10.1002/prot.21803.
9
Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information.通过序列和结构信息的共识组合器准确预测蛋白质二级结构和溶剂可及性。
BMC Bioinformatics. 2007 Jun 14;8:201. doi: 10.1186/1471-2105-8-201.
10
Improving the accuracy of protein secondary structure prediction using structural alignment.利用结构比对提高蛋白质二级结构预测的准确性。
BMC Bioinformatics. 2006 Jun 14;7:301. doi: 10.1186/1471-2105-7-301.