• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用局部比对进行蛋白质二级结构预测。

Protein secondary structure prediction using local alignments.

作者信息

Salamov A A, Solovyev V V

机构信息

Department of Cell Biology, Baylor College of Medicine, Houston, TX 77030, USA.

出版信息

J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.

DOI:10.1006/jmbi.1997.0958
PMID:9149139
Abstract

The accuracy of secondary structure prediction methods has been improved significantly by the use of aligned protein sequences. The PHD method and the NNSSP method reach 71 to 72% of sustained overall three-state accuracy when multiple sequence alignments are with neural networks and nearest-neighbor algorithms, respectively. We introduce a variant of the nearest-neighbor approach that can achieve similar accuracy using a single sequence as the query input. We compute the 50 best non-intersecting local alignments of the query sequence with each sequence from a set of proteins with known 3D structures. Each position of the query sequence is aligned with the database amino acids in alpha-helical, beta-strand or coil states. The prediction type of secondary structure is selected as the type of aligned position with the maximal total score. On the dataset of 124 non-membrane non-homologous proteins, used earlier as a benchmark for secondary structure predictions, our method reaches an overall three-state accuracy of 71.2%. The performance accuracy is verified by an additional test on 461 non-homologous proteins giving an accuracy of 71.0%. The main strength of the method is the high level of prediction accuracy for proteins without any known homolog. Using multiple sequence alignments as input the method has a prediction accuracy of 73.5%. Prediction of secondary structure by the SSPAL method is available via Baylor College of Medicine World Wide Web server.

摘要

通过使用比对后的蛋白质序列,二级结构预测方法的准确性得到了显著提高。当分别使用神经网络和最近邻算法进行多序列比对时,PHD方法和NNSSP方法的持续总体三态准确率分别达到71%至72%。我们引入了一种最近邻方法的变体,该变体可以使用单个序列作为查询输入来实现类似的准确率。我们计算查询序列与一组具有已知三维结构的蛋白质中的每个序列的50个最佳不相交局部比对。查询序列的每个位置与处于α螺旋、β链或卷曲状态的数据库氨基酸进行比对。二级结构的预测类型被选为具有最大总分的比对位置的类型。在124个非膜非同源蛋白质的数据集上(该数据集早期用作二级结构预测的基准),我们的方法达到了71.2%的总体三态准确率。通过对461个非同源蛋白质的额外测试验证了性能准确率,其准确率为71.0%。该方法的主要优势在于对没有任何已知同源物的蛋白质具有较高水平的预测准确率。使用多序列比对作为输入时,该方法的预测准确率为73.5%。通过贝勒医学院万维网服务器可以使用SSPAL方法进行二级结构预测。

相似文献

1
Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。
J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.
2
Prediction of protein secondary structure by combining nearest-neighbor algorithms and multiple sequence alignments.结合最近邻算法和多序列比对预测蛋白质二级结构。
J Mol Biol. 1995 Mar 17;247(1):11-5. doi: 10.1006/jmbi.1994.0116.
3
A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%.一种从多重比对序列预测蛋白质二级结构的简单快速方法,准确率高于70%。
Protein Sci. 1995 Dec;4(12):2517-25. doi: 10.1002/pro.5560041208.
4
A neural network method for prediction of beta-turn types in proteins using evolutionary information.一种利用进化信息预测蛋白质中β-转角类型的神经网络方法。
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
5
Prediction of protein secondary structure at better than 70% accuracy.蛋白质二级结构预测准确率高于70%。
J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.
6
Protein secondary structure prediction using nearest-neighbor methods.使用最近邻方法进行蛋白质二级结构预测。
J Mol Biol. 1993 Aug 20;232(4):1117-29. doi: 10.1006/jmbi.1993.1464.
7
A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.一种用于蛋白质折叠识别的3D-1D替换矩阵,其包含序列的预测二级结构。
J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.
8
The influence of gapped positions in multiple sequence alignments on secondary structure prediction methods.多序列比对中的空位对二级结构预测方法的影响。
Comput Biol Chem. 2004 Dec;28(5-6):351-66. doi: 10.1016/j.compbiolchem.2004.09.005.
9
Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. II. Secondary structures.在从同源蛋白质的比对序列进行结构预测中使用氨基酸环境依赖性替换表和构象倾向。II. 二级结构。
J Mol Biol. 1994 May 20;238(5):693-708. doi: 10.1006/jmbi.1994.1330.
10
NdPASA: a novel pairwise protein sequence alignment algorithm that incorporates neighbor-dependent amino acid propensities.NdPASA:一种整合了邻域依赖氨基酸倾向的新型双序列蛋白质序列比对算法。
Proteins. 2005 Feb 15;58(3):628-37. doi: 10.1002/prot.20359.

引用本文的文献

1
In silico interaction analysis of cannabinoid receptor interacting protein 1b (CRIP1b) - CB1 cannabinoid receptor.大麻素受体相互作用蛋白1b(CRIP1b)与CB1大麻素受体的计算机模拟相互作用分析
J Mol Graph Model. 2017 Oct;77:311-321. doi: 10.1016/j.jmgm.2017.09.006. Epub 2017 Sep 6.
2
Distributions of amino acids suggest that certain residue types more effectively determine protein secondary structure.氨基酸的分布表明,某些残基类型更有效地决定蛋白质的二级结构。
J Mol Model. 2013 Oct;19(10):4337-48. doi: 10.1007/s00894-013-1911-z. Epub 2013 Aug 2.
3
Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction.
快速学习优化预测方法(FLOPRED)在蛋白质二级结构预测中的应用。
J Mol Model. 2012 Sep;18(9):4275-89. doi: 10.1007/s00894-012-1410-7. Epub 2012 May 8.
4
Improving protein secondary structure prediction based on short subsequences with local structure similarity.基于局部结构相似性的短序列提高蛋白质二级结构预测。
BMC Genomics. 2010 Dec 2;11 Suppl 4(Suppl 4):S4. doi: 10.1186/1471-2164-11-S4-S4.
5
How many 3D structures do we need to train a predictor?我们需要训练一个预测器需要多少个 3D 结构?
Genomics Proteomics Bioinformatics. 2009 Sep;7(3):128-37. doi: 10.1016/S1672-0229(08)60041-8.
6
A reexamination of correlations of amino acids with particular secondary structures.对氨基酸与特定二级结构相关性的重新审视。
Protein J. 2009 Feb;28(2):74-86. doi: 10.1007/s10930-009-9166-3.
7
Prediction of protein secondary structure by mining structural fragment database.通过挖掘结构片段数据库预测蛋白质二级结构。
Polymer (Guildf). 2005 May 26;46(12):4314-4321. doi: 10.1016/j.polymer.2005.02.040.
8
Protein secondary structure prediction for a single-sequence using hidden semi-Markov models.使用隐藏半马尔可夫模型对单序列进行蛋白质二级结构预测。
BMC Bioinformatics. 2006 Mar 30;7:178. doi: 10.1186/1471-2105-7-178.
9
Early-stage folding in proteins (in silico) sequence-to-structure relation.蛋白质早期折叠(计算机模拟)中的序列-结构关系。
J Biomed Biotechnol. 2005 Jun 30;2005(2):65-79. doi: 10.1155/JBB.2005.65.
10
The importance of intrinsic disorder for protein phosphorylation.内在无序对蛋白质磷酸化的重要性。
Nucleic Acids Res. 2004 Feb 11;32(3):1037-49. doi: 10.1093/nar/gkh253. Print 2004.