识别并应用对于准确可靠的蛋白质二级结构预测至关重要的概念。

Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.

作者信息

King R D, Sternberg M J

机构信息

Biomolecular Modelling Laboratory, Imperial Cancer Research Fund, London, United Kingdom.

出版信息

Protein Sci. 1996 Nov;5(11):2298-310. doi: 10.1002/pro.5560051116.

DOI:10.1002/pro.5560051116

PMID:8931148

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2143286/

Abstract

A protein secondary structure prediction method from multiply aligned homologous sequences is presented with an overall per residue three-state accuracy of 70.1%. There are two aims: to obtain high accuracy by identification of a set of concepts important for prediction followed by use of linear statistics; and to provide insight into the folding process. The important concepts in secondary structure prediction are identified as: residue conformational propensities, sequence edge effects, moments of hydrophobicity, position of insertions and deletions in aligned homologous sequence, moments of conservation, auto-correlation, residue ratios, secondary structure feedback effects, and filtering. Explicit use of edge effects, moments of conservation, and auto-correlation are new to this paper. The relative importance of the concepts used in prediction was analyzed by stepwise addition of information and examination of weights in the discrimination function. The simple and explicit structure of the prediction allows the method to be reimplemented easily. The accuracy of a prediction is predictable a priori. This permits evaluation of the utility of the prediction: 10% of the chains predicted were identified correctly as having a mean accuracy of > 80%. Existing high-accuracy prediction methods are "black-box" predictors based on complex nonlinear statistics (e.g., neural networks in PHD: Rost & Sander, 1993a). For medium- to short-length chains (> or = 90 residues and < 170 residues), the prediction method is significantly more accurate (P < 0.01) than the PHD algorithm (probably the most commonly used algorithm). In combination with the PHD, an algorithm is formed that is significantly more accurate than either method, with an estimated overall three-state accuracy of 72.4%, the highest accuracy reported for any prediction method.

摘要

本文提出了一种基于多重比对同源序列的蛋白质二级结构预测方法，其每个残基的整体三态准确率为70.1%。该方法有两个目标：一是通过识别一组对预测重要的概念，然后使用线性统计来获得高精度；二是深入了解折叠过程。二级结构预测中的重要概念被确定为：残基构象倾向、序列边缘效应、疏水性矩、比对同源序列中插入和缺失的位置、保守性矩、自相关、残基比率、二级结构反馈效应和过滤。本文首次明确使用了边缘效应、保守性矩和自相关。通过逐步添加信息并检查判别函数中的权重，分析了预测中使用的概念的相对重要性。该预测方法结构简单明了，易于重新实现。预测的准确性可以先验预测。这允许评估预测的效用：预测的链中有10%被正确识别，其平均准确率>80%。现有的高精度预测方法是基于复杂非线性统计的“黑箱”预测器（例如，PHD中的神经网络：Rost和Sander，1993a）。对于中短长度的链（≥90个残基且<170个残基），该预测方法比PHD算法（可能是最常用的算法）显著更准确（P<0.01）。与PHD相结合，形成了一种比任何一种方法都显著更准确的算法，估计整体三态准确率为72.4%，这是任何预测方法所报道的最高准确率。

相似文献

Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.识别并应用对于准确可靠的蛋白质二级结构预测至关重要的概念。

Protein Sci. 1996 Nov;5(11):2298-310. doi: 10.1002/pro.5560051116.

Combining prediction of secondary structure and solvent accessibility in proteins.蛋白质二级结构预测与溶剂可及性预测相结合。

Proteins. 2005 May 15;59(3):467-75. doi: 10.1002/prot.20441.

Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。

J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.

Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. II. Secondary structures.在从同源蛋白质的比对序列进行结构预测中使用氨基酸环境依赖性替换表和构象倾向。II. 二级结构。

J Mol Biol. 1994 May 20;238(5):693-708. doi: 10.1006/jmbi.1994.1330.

Highly accurate and consistent method for prediction of helix and strand content from primary protein sequences.一种从蛋白质一级序列预测螺旋和链含量的高度准确且一致的方法。

Artif Intell Med. 2005 Sep-Oct;35(1-2):19-35. doi: 10.1016/j.artmed.2005.02.006.

Combining evolutionary information and neural networks to predict protein secondary structure.结合进化信息与神经网络预测蛋白质二级结构。

Proteins. 1994 May;19(1):55-72. doi: 10.1002/prot.340190108.

Prediction of protein secondary structure content for the twilight zone sequences.预测处于模糊区域序列的蛋白质二级结构含量。

Proteins. 2007 Nov 15;69(3):486-98. doi: 10.1002/prot.21527.

A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%.一种从多重比对序列预测蛋白质二级结构的简单快速方法，准确率高于70%。

Protein Sci. 1995 Dec;4(12):2517-25. doi: 10.1002/pro.5560041208.

Prediction of protein secondary structure at better than 70% accuracy.蛋白质二级结构预测准确率高于70%。

J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.

Protein secondary structure prediction using nearest-neighbor methods.使用最近邻方法进行蛋白质二级结构预测。

J Mol Biol. 1993 Aug 20;232(4):1117-29. doi: 10.1006/jmbi.1993.1464.

引用本文的文献

The crosstalk between neuropilin-1 and tumor necrosis factor-α in endothelial cells.内皮细胞中神经纤毛蛋白-1与肿瘤坏死因子-α之间的相互作用。

Front Cell Dev Biol. 2024 Jun 27;12:1210944. doi: 10.3389/fcell.2024.1210944. eCollection 2024.

KCNQ1 is an essential mediator of the sex-dependent perception of moderate cold temperatures.KCNQ1 是性别依赖性感知中等寒冷温度的重要介质。

Proc Natl Acad Sci U S A. 2024 Jun 18;121(25):e2322475121. doi: 10.1073/pnas.2322475121. Epub 2024 Jun 10.

Allelic variation and haplotype diversity of () gene governing in vivo maternal haploid induction in maize.控制玉米体内母本单倍体诱导的（）基因的等位变异和单倍型多样性。（注：括号部分原文缺失具体基因名称）

Physiol Mol Biol Plants. 2024 May;30(5):823-838. doi: 10.1007/s12298-024-01456-3. Epub 2024 May 13.

Comparison, Analysis, and Molecular Dynamics Simulations of Structures of a Viral Protein Modeled Using Various Computational Tools.使用各种计算工具建模的病毒蛋白结构的比较、分析及分子动力学模拟

Bioengineering (Basel). 2023 Aug 24;10(9):1004. doi: 10.3390/bioengineering10091004.

Unveiling an indole alkaloid diketopiperazine biosynthetic pathway that features a unique stereoisomerase and multifunctional methyltransferase.揭示一种吲哚生物碱二酮哌嗪生物合成途径，其具有独特的立体异构酶和多功能甲基转移酶。

Nat Commun. 2023 May 3;14(1):2558. doi: 10.1038/s41467-023-38168-3.

Substitution of PINK1 Gly411 modulates substrate receptivity and turnover.PINK1 Gly411 的取代会调节底物的接受性和周转率。

Autophagy. 2023 Jun;19(6):1711-1732. doi: 10.1080/15548627.2022.2151294. Epub 2022 Dec 5.

Propensities of Some Amino Acid Pairings in α-Helices Vary with Length.某些氨基酸对在α-螺旋中的倾向性随长度而变化。

Protein J. 2022 Dec;41(6):551-562. doi: 10.1007/s10930-022-10076-3. Epub 2022 Sep 28.

Intrinsic and extrinsic regulators of Aux/IAA protein degradation dynamics.Aux/IAA 蛋白降解动力学的内在和外在调节因子。

Trends Biochem Sci. 2022 Oct;47(10):865-874. doi: 10.1016/j.tibs.2022.06.004. Epub 2022 Jul 8.

HrpA anchors meningococci to the dynein motor and affects the balance between apoptosis and pyroptosis.HrpA 将脑膜炎球菌锚定在动力蛋白上，并影响细胞凋亡和细胞焦亡之间的平衡。

J Biomed Sci. 2022 Jun 28;29(1):45. doi: 10.1186/s12929-022-00829-8.

Performance of Novel Antimicrobial Protein Bg_9562 and In Silico Predictions on Its Properties with Reference to Its Antimicrobial Efficiency against .新型抗菌蛋白Bg_9562的性能及其抗菌效率相关特性的计算机模拟预测

Antibiotics (Basel). 2022 Mar 8;11(3):363. doi: 10.3390/antibiotics11030363.

本文引用的文献

Protein Sci. 1995 Dec;4(12):2517-25. doi: 10.1002/pro.5560041208.

Predicting the conformation of proteins. Man versus machine.

FEBS Lett. 1993 Jun 28;325(1-2):29-33. doi: 10.1016/0014-5793(93)81408-r.

Protein secondary structure prediction using nearest-neighbor methods.使用最近邻方法进行蛋白质二级结构预测。

J Mol Biol. 1993 Aug 20;232(4):1117-29. doi: 10.1006/jmbi.1993.1464.

Prediction of protein secondary structure at better than 70% accuracy.蛋白质二级结构预测准确率高于70%。

J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.

Comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment.三种蛋白质二级结构分配算法的比较：一致性分配的优势

Protein Eng. 1993 Jun;6(4):377-82. doi: 10.1093/protein/6.4.377.

Redefining the goals of protein secondary structure prediction.重新定义蛋白质二级结构预测的目标。

J Mol Biol. 1994 Jan 7;235(1):13-26. doi: 10.1016/s0022-2836(05)80007-5.

The limits of protein secondary structure prediction accuracy from multiple sequence alignment.基于多序列比对的蛋白质二级结构预测准确性的局限性。

J Mol Biol. 1993 Dec 20;234(4):951-7. doi: 10.1006/jmbi.1993.1649.

J Mol Biol. 1994 May 20;238(5):693-708. doi: 10.1006/jmbi.1994.1330.

SOPM: a self-optimized method for protein secondary structure prediction.SOPM：一种用于蛋白质二级结构预测的自优化方法。

Protein Eng. 1994 Feb;7(2):157-64. doi: 10.1093/protein/7.2.157.

Evaluating predictions of secondary structure in proteins.

Biochem Biophys Res Commun. 1994 Apr 15;200(1):149-55. doi: 10.1006/bbrc.1994.1427.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验