• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将优化的氨基酸分组嵌入基于相对溶剂可及性(RSA)的线性模型来改进对残基灵活性的预测。

Improved prediction of residue flexibility by embedding optimized amino acid grouping into RSA-based linear models.

作者信息

Zhang Hua, Kurgan Lukasz

机构信息

School of Computer and Information Engineering, Zhejiang Gongshang University, 310018, Hangzhou, Zhejiang, People's Republic of China,

出版信息

Amino Acids. 2014 Dec;46(12):2665-80. doi: 10.1007/s00726-014-1817-9. Epub 2014 Aug 9.

DOI:10.1007/s00726-014-1817-9
PMID:25106508
Abstract

Knowledge of protein flexibility is vital for deciphering the corresponding functional mechanisms. This knowledge would help, for instance, in improving computational drug design and refinement in homology-based modeling. We propose a new predictor of the residue flexibility, which is expressed by B-factors, from protein chains that use local (in the chain) predicted (or native) relative solvent accessibility (RSA) and custom-derived amino acid (AA) alphabets. Our predictor is implemented as a two-stage linear regression model that uses RSA-based space in a local sequence window in the first stage and a reduced AA pair-based space in the second stage as the inputs. This method is easy to comprehend explicit linear form in both stages. Particle swarm optimization was used to find an optimal reduced AA alphabet to simplify the input space and improve the prediction performance. The average correlation coefficients between the native and predicted B-factors measured on a large benchmark dataset are improved from 0.65 to 0.67 when using the native RSA values and from 0.55 to 0.57 when using the predicted RSA values. Blind tests that were performed on two independent datasets show consistent improvements in the average correlation coefficients by a modest value of 0.02 for both native and predicted RSA-based predictions.

摘要

了解蛋白质的灵活性对于解读相应的功能机制至关重要。例如,这些知识将有助于改进基于同源性建模的计算药物设计和优化。我们提出了一种新的预测残基灵活性的方法,该方法通过B因子来表示,适用于使用局部(在链中)预测(或天然)相对溶剂可及性(RSA)和自定义衍生氨基酸(AA)字母表的蛋白质链。我们的预测器实现为一个两阶段线性回归模型,在第一阶段使用局部序列窗口中基于RSA的数据空间,在第二阶段使用基于简化AA对的数据空间作为输入。该方法在两个阶段都易于理解显式线性形式。使用粒子群优化算法来寻找最优的简化AA字母表,以简化输入空间并提高预测性能。在一个大型基准数据集上测量的天然B因子与预测B因子之间的平均相关系数,使用天然RSA值时从0.65提高到0.67,使用预测RSA值时从0.55提高到0.57。在两个独立数据集上进行的盲测表明,基于天然和预测RSA的预测,平均相关系数均有一致的适度提高,提高值为0.02。

相似文献

1
Improved prediction of residue flexibility by embedding optimized amino acid grouping into RSA-based linear models.通过将优化的氨基酸分组嵌入基于相对溶剂可及性(RSA)的线性模型来改进对残基灵活性的预测。
Amino Acids. 2014 Dec;46(12):2665-80. doi: 10.1007/s00726-014-1817-9. Epub 2014 Aug 9.
2
On the relation between residue flexibility and local solvent accessibility in proteins.关于蛋白质中残基柔性与局部溶剂可及性之间的关系。
Proteins. 2009 Aug 15;76(3):617-36. doi: 10.1002/prot.22375.
3
Combining prediction of secondary structure and solvent accessibility in proteins.蛋白质二级结构预测与溶剂可及性预测相结合。
Proteins. 2005 May 15;59(3):467-75. doi: 10.1002/prot.20441.
4
Accurate prediction of solvent accessibility using neural networks-based regression.使用基于神经网络的回归准确预测溶剂可及性。
Proteins. 2004 Sep 1;56(4):753-67. doi: 10.1002/prot.20176.
5
Linear regression models for solvent accessibility prediction in proteins.用于蛋白质溶剂可及性预测的线性回归模型。
J Comput Biol. 2005 Apr;12(3):355-69. doi: 10.1089/cmb.2005.12.355.
6
Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility.从序列和序列衍生的残基柔性及溶剂可及性预测蛋白质折叠速率的准确性。
Proteins. 2010 Jul;78(9):2114-30. doi: 10.1002/prot.22727.
7
Improved protein relative solvent accessibility prediction using deep multi-view feature learning framework.利用深度多视图特征学习框架提高蛋白质相对溶剂可及性预测。
Anal Biochem. 2021 Oct 15;631:114358. doi: 10.1016/j.ab.2021.114358. Epub 2021 Aug 31.
8
Enhanced prediction of conformational flexibility and phosphorylation in proteins.增强蛋白质构象柔性和磷酸化的预测。
Adv Exp Med Biol. 2010;680:307-19. doi: 10.1007/978-1-4419-5913-3_35.
9
Gaussian network model can be enhanced by combining solvent accessibility in proteins.高斯网络模型可以通过结合蛋白质中的溶剂可及性来增强。
Sci Rep. 2017 Aug 8;7(1):7486. doi: 10.1038/s41598-017-07677-9.
10
Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training.通过大规模训练实现二级结构预测的80%十折交叉验证准确率。
Proteins. 2007 Mar 1;66(4):838-45. doi: 10.1002/prot.21298.

引用本文的文献

1
Research progress of reduced amino acid alphabets in protein analysis and prediction.蛋白质分析与预测中简化氨基酸字母表的研究进展
Comput Struct Biotechnol J. 2022 Jul 4;20:3503-3510. doi: 10.1016/j.csbj.2022.07.001. eCollection 2022.
2
Protein flexibility in the light of structural alphabets.从结构字母表看蛋白质的柔韧性。
Front Mol Biosci. 2015 May 27;2:20. doi: 10.3389/fmolb.2015.00020. eCollection 2015.