• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用预测的形状字符串提高 γ-转角预测的准确性。

Using predicted shape string to enhance the accuracy of γ-turn prediction.

机构信息

Department of Chemistry, Tongji University, Room 438, No.1239, Siping Road, Shanghai, 200092, People's Republic of China.

出版信息

Amino Acids. 2012 May;42(5):1749-55. doi: 10.1007/s00726-011-0889-z. Epub 2011 Mar 22.

DOI:10.1007/s00726-011-0889-z
PMID:21424809
Abstract

Numerous methods for predicting γ-turns in proteins have been developed. However, the results they generally provided are not very good, with a Matthews correlation coefficient (MCC)≤0.18. Here, an attempt has been made to develop a method to improve the accuracy of γ-turn prediction. First, we employ the geometric mean metric as optimal criterion to evaluate the performance of support vector machine for the highly imbalanced γ-turn dataset. This metric tries to maximize both the sensitivity and the specificity while keeping them balanced. Second, a predictor to generate protein shape string by structure alignment against the protein structure database has been designed and the predicted shape string is introduced as new variable for γ-turn prediction. Based on this perception, we have developed a new method for γ-turn prediction. After training and testing the benchmark dataset of 320 non-homologous protein chains using a fivefold cross-validation technique, the present method achieves excellent performance. The overall prediction accuracy Qtotal can achieve 92.2% and the MCC is 0.38, which outperform the existing γ-turn prediction methods. Our results indicate that the protein shape string is useful for predicting protein tight turns and it is reasonable to use the dihedral angle information as a variable for machine learning to predict protein folding. The dataset used in this work and the software to generate predicted shape string from structure database can be obtained from anonymous ftp site ftp://cheminfo.tongji.edu.cn/GammaTurnPrediction/ freely.

摘要

已经开发出许多预测蛋白质 γ-转角的方法。然而,它们通常提供的结果并不是很好,马修斯相关系数 (MCC)≤0.18。在这里,我们尝试开发一种方法来提高 γ-转角预测的准确性。首先,我们采用几何平均值度量作为最优标准来评估支持向量机对高度不平衡的 γ-转角数据集的性能。该度量试图在保持两者平衡的同时最大化敏感性和特异性。其次,设计了一种通过结构比对蛋白质结构数据库生成蛋白质形状字符串的预测器,并将预测的形状字符串引入到 γ-转角预测中作为新的变量。基于这一认识,我们开发了一种新的 γ-转角预测方法。通过使用五重交叉验证技术对 320 个非同源蛋白质链的基准数据集进行训练和测试,该方法表现出优异的性能。总体预测准确性 Qtotal 可达 92.2%,MCC 为 0.38,优于现有的 γ-转角预测方法。我们的结果表明,蛋白质形状字符串对于预测蛋白质紧密转角非常有用,并且使用二面角信息作为机器学习的变量来预测蛋白质折叠是合理的。这项工作中使用的数据集和从结构数据库生成预测形状字符串的软件可以从匿名 ftp 站点 ftp://cheminfo.tongji.edu.cn/GammaTurnPrediction/ 免费获得。

相似文献

1
Using predicted shape string to enhance the accuracy of γ-turn prediction.利用预测的形状字符串提高 γ-转角预测的准确性。
Amino Acids. 2012 May;42(5):1749-55. doi: 10.1007/s00726-011-0889-z. Epub 2011 Mar 22.
2
A neural-network based method for prediction of gamma-turns in proteins from multiple sequence alignment.一种基于神经网络的方法,用于从多序列比对预测蛋白质中的γ-转角。
Protein Sci. 2003 May;12(5):923-9. doi: 10.1110/ps.0241703.
3
Improved method for predicting beta-turn using support vector machine.使用支持向量机预测β-转角的改进方法。
Bioinformatics. 2005 May 15;21(10):2370-4. doi: 10.1093/bioinformatics/bti358. Epub 2005 Mar 29.
4
A neural network method for prediction of beta-turn types in proteins using evolutionary information.一种利用进化信息预测蛋白质中β-转角类型的神经网络方法。
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
5
gamma-Turn types prediction in proteins using the support vector machines.使用支持向量机预测蛋白质中的γ-转角类型
J Theor Biol. 2007 Dec 21;249(4):785-90. doi: 10.1016/j.jtbi.2007.09.002. Epub 2007 Sep 11.
6
Using support vector machine to predict beta- and gamma-turns in proteins.使用支持向量机预测蛋白质中的β-转角和γ-转角。
J Comput Chem. 2008 Sep;29(12):1867-75. doi: 10.1002/jcc.20929.
7
Prediction of turn types in protein structure by machine-learning classifiers.通过机器学习分类器预测蛋白质结构中的转角类型。
Proteins. 2009 Feb 1;74(2):344-52. doi: 10.1002/prot.22164.
8
Prediction of pi-turns in proteins using PSI-BLAST profiles and secondary structure information.利用PSI-BLAST序列谱和二级结构信息预测蛋白质中的π转角
Biochem Biophys Res Commun. 2006 Sep 1;347(3):574-80. doi: 10.1016/j.bbrc.2006.06.066. Epub 2006 Jun 21.
9
Better prediction of the location of alpha-turns in proteins with support vector machine.利用支持向量机更好地预测蛋白质中α-转角的位置。
Proteins. 2006 Oct 1;65(1):49-54. doi: 10.1002/prot.21062.
10
Improving protein secondary structure prediction using a multi-modal BP method.利用多模态 BP 方法改进蛋白质二级结构预测。
Comput Biol Med. 2011 Oct;41(10):946-59. doi: 10.1016/j.compbiomed.2011.08.005. Epub 2011 Aug 30.

引用本文的文献

1
Improving Protein Gamma-Turn Prediction Using Inception Capsule Networks.利用 inception 胶囊网络提高蛋白质 γ-转角预测。
Sci Rep. 2018 Oct 24;8(1):15741. doi: 10.1038/s41598-018-34114-2.
2
NMRDSP: an accurate prediction of protein shape strings from NMR chemical shifts and sequence data.NMRDSP:基于核磁共振化学位移和序列数据对蛋白质形状字符串进行准确预测
PLoS One. 2013 Dec 23;8(12):e83532. doi: 10.1371/journal.pone.0083532. eCollection 2013.
3
DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.
使用铰链区策略准确识别蛋白质的结构域边界。
PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013.
4
Predicting turns in proteins with a unified model.用统一模型预测蛋白质的转折。
PLoS One. 2012;7(11):e48389. doi: 10.1371/journal.pone.0048389. Epub 2012 Nov 7.
5
DSP: a protein shape string and its profile prediction server.DSP:蛋白质形状字符串及其轮廓预测服务器。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W298-302. doi: 10.1093/nar/gks361. Epub 2012 May 2.
6
Retrieving backbone string neighbors provides insights into structural modeling of membrane proteins.提取骨干字符串邻居为膜蛋白的结构建模提供了深入了解。
Mol Cell Proteomics. 2012 Jul;11(7):M111.016808. doi: 10.1074/mcp.M111.016808. Epub 2012 Mar 13.
7
Improving the performance of β-turn prediction using predicted shape strings and a two-layer support vector machine model.利用预测形状字符串和两层支持向量机模型提高 β-转角预测的性能。
BMC Bioinformatics. 2011 Jul 13;12:283. doi: 10.1186/1471-2105-12-283.