• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索最近邻二级结构预测的极限。

Exploring the limits of nearest neighbour secondary structure prediction.

作者信息

Levin J M

机构信息

Unité de Bioinformatique, Bat. de Biotechnologie, INRA, Jouy-en-Josas, France.

出版信息

Protein Eng. 1997 Jul;10(7):771-6. doi: 10.1093/protein/10.7.771.

DOI:10.1093/protein/10.7.771
PMID:9342143
Abstract

This paper presents a simple and robust secondary structure prediction scheme (SIMPA96) based on an updated version of the nearest neighbour method. Using a larger database of known structures, the Blosum 62 substitution matrix and a regularization algorithm, the three state prediction accuracy is increased by 4.7 percentage points to 67.7% for a single sequence and up to 72.8% when using multiple alignments. The increase in prediction accuracy with respect to the previous version can be almost entirely ascribed to the sevenfold increase in the size of the database. A more detailed analysis of the results shows that badly predicted regions of a protein sequence are randomly distributed throughout the database and that the goal of perfect secondary structure predictions by methods which use only local sequence information is illusory.

摘要

本文提出了一种基于最近邻方法更新版本的简单且稳健的二级结构预测方案(SIMPA96)。通过使用更大的已知结构数据库、Blosum 62替换矩阵和一种正则化算法,单序列的三状态预测准确率提高了4.7个百分点,达到67.7%,使用多序列比对时则高达72.8%。相对于先前版本,预测准确率的提高几乎完全归因于数据库规模扩大了七倍。对结果进行更详细的分析表明,蛋白质序列预测不佳的区域在整个数据库中随机分布,并且仅使用局部序列信息的方法实现完美二级结构预测的目标是不切实际的。

相似文献

1
Exploring the limits of nearest neighbour secondary structure prediction.探索最近邻二级结构预测的极限。
Protein Eng. 1997 Jul;10(7):771-6. doi: 10.1093/protein/10.7.771.
2
Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。
J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.
3
Prediction of protein secondary structure using Large Margin Nearest Neighbour classification.使用大间隔最近邻分类法预测蛋白质二级结构。
Int J Bioinform Res Appl. 2013;9(2):207-19. doi: 10.1504/IJBRA.2013.052445.
4
Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix.使用进化速率结合氨基酸替换矩阵进行稳健的序列比对。
BMC Bioinformatics. 2015 Aug 14;16:255. doi: 10.1186/s12859-015-0688-8.
5
Protein secondary structure prediction using nearest-neighbor methods.使用最近邻方法进行蛋白质二级结构预测。
J Mol Biol. 1993 Aug 20;232(4):1117-29. doi: 10.1006/jmbi.1993.1464.
6
Improving the quality of twilight-zone alignments.提高暮光区对准的质量。
Protein Sci. 2000 Aug;9(8):1487-96. doi: 10.1110/ps.9.8.1487.
7
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.OXBench:一种用于评估蛋白质多序列比对准确性的基准。
BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.
8
Quantification of secondary structure prediction improvement using multiple alignments.使用多序列比对对二级结构预测改进进行量化。
Protein Eng. 1993 Nov;6(8):849-54. doi: 10.1093/protein/6.8.849.
9
Prediction of protein secondary structure by combining nearest-neighbor algorithms and multiple sequence alignments.结合最近邻算法和多序列比对预测蛋白质二级结构。
J Mol Biol. 1995 Mar 17;247(1):11-5. doi: 10.1006/jmbi.1994.0116.
10
The influence of gapped positions in multiple sequence alignments on secondary structure prediction methods.多序列比对中的空位对二级结构预测方法的影响。
Comput Biol Chem. 2004 Dec;28(5-6):351-66. doi: 10.1016/j.compbiolchem.2004.09.005.

引用本文的文献

1
Modern and prebiotic amino acids support distinct structural profiles in proteins.现代和益生菌氨基酸支持蛋白质中不同的结构特征。
Open Biol. 2022 Jun;12(6):220040. doi: 10.1098/rsob.220040. Epub 2022 Jun 22.
2
Deep learning methods in protein structure prediction.蛋白质结构预测中的深度学习方法。
Comput Struct Biotechnol J. 2020 Jan 22;18:1301-1310. doi: 10.1016/j.csbj.2019.12.011. eCollection 2020.
3
Propensities of Amino Acid Pairings in Secondary Structure of Globular Proteins.球状蛋白质二级结构中氨基酸配对倾向。
Protein J. 2020 Feb;39(1):21-32. doi: 10.1007/s10930-020-09880-6.
4
Identification and Molecular Characterization of the Chloroplast Targeting Domain of Replication Proteins.复制蛋白叶绿体靶向结构域的鉴定与分子特征分析
Front Plant Sci. 2017 Dec 19;8:2138. doi: 10.3389/fpls.2017.02138. eCollection 2017.
5
Plant cell wall glycosyltransferases: High-throughput recombinant expression screening and general requirements for these challenging enzymes.植物细胞壁糖基转移酶:高通量重组表达筛选及这些具有挑战性的酶的一般要求
PLoS One. 2017 Jun 9;12(6):e0177591. doi: 10.1371/journal.pone.0177591. eCollection 2017.
6
VAPC, an human endogenous inhibitor for hepatitis C virus (HCV) infection, is intrinsically unstructured but forms a "fuzzy complex" with HCV NS5B.VAPC,一种人类内源性 HCV(丙型肝炎病毒)感染抑制剂,本质上无结构,但与 HCV NS5B 形成“模糊复合物”。
PLoS One. 2012;7(7):e40341. doi: 10.1371/journal.pone.0040341. Epub 2012 Jul 17.
7
PSP_MCSVM: brainstorming consensus prediction of protein secondary structures using two-stage multiclass support vector machines.PSP_MCSVM:使用两阶段多类支持向量机进行蛋白质二级结构的头脑风暴共识预测。
J Mol Model. 2011 Sep;17(9):2191-201. doi: 10.1007/s00894-011-1102-8. Epub 2011 May 19.
8
Characterization and comparison of two binding sites on obscurin for small ankyrin 1.鉴定和比较锚蛋白重复域 1 小分子与 obscurin 的两个结合位点。
Biochemistry. 2010 Nov 23;49(46):9948-56. doi: 10.1021/bi101165p. Epub 2010 Nov 1.
9
Increased hydrophobicity at the N terminus/membrane interface impairs gating of the severe combined immunodeficiency-related ORAI1 mutant.N端/膜界面处疏水性增加会损害与重症联合免疫缺陷相关的ORAI1突变体的门控功能。
J Biol Chem. 2009 Jun 5;284(23):15903-15. doi: 10.1074/jbc.M808312200. Epub 2009 Apr 14.
10
NMR evidence for forming highly populated helical conformations in the partially folded hNck2 SH3 domain.核磁共振证据表明,部分折叠的人Nck2 SH3结构域中形成了高丰度的螺旋构象。
Biophys J. 2008 Nov 15;95(10):4803-12. doi: 10.1529/biophysj.107.125641. Epub 2008 Jul 3.