• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过氨基酸的属性编码对突变诱导的蛋白质稳定性变化进行稳健预测。

Robust prediction of mutation-induced protein stability change by property encoding of amino acids.

作者信息

Kang Shuli, Chen Gang, Xiao Gengfu

机构信息

State Key Laboratory of Virology, College of Life Sciences, Wuhan University, Wuhan, Hubei 430072, China.

出版信息

Protein Eng Des Sel. 2009 Feb;22(2):75-83. doi: 10.1093/protein/gzn063. Epub 2008 Dec 2.

DOI:10.1093/protein/gzn063
PMID:19054789
Abstract

Current methods of predicting mutation-induced protein stability change are imprecise. Machine learning methods have been introduced for this prediction recently; however, the available experimental data used for training these predictors are biased. Abundant data are available for several frequently occurring amino acid substitutions, whereas only limited data have been accumulated for some other mutation types. Generally, current statistical models do not account for this bias toward the commoner amino acids during the encoding process and are thus less effective in making predictions on less frequently occurring mutations. In this paper, we propose a method based on support vector machines and property encoding of amino acids. The predictor we constructed outperforms other methods on the same data sets and is more robust with poor training data. The prediction accuracy for mutations with no training data exceeded 80%. This advantage is critical for practical application, where the prediction could be applied for any type of mutations. Further analysis demonstrates our model relies on biological significant features to make predictions. To overcome the drawbacks of classifying mutations into 'stabilizing' and 'destabilizing' ones, a three-class classification of mutations was also discussed, where our method obtained an overall accuracy of 79.1%.

摘要

目前预测突变引起的蛋白质稳定性变化的方法并不精确。机器学习方法最近已被引入用于这种预测;然而,用于训练这些预测器的现有实验数据存在偏差。对于几种常见的氨基酸替换有大量数据,而对于其他一些突变类型仅积累了有限的数据。一般来说,当前的统计模型在编码过程中没有考虑到对较常见氨基酸的这种偏差,因此在对较少出现的突变进行预测时效果较差。在本文中,我们提出了一种基于支持向量机和氨基酸性质编码的方法。我们构建的预测器在相同数据集上优于其他方法,并且在训练数据较差时更稳健。对于没有训练数据的突变,预测准确率超过80%。这一优势对于实际应用至关重要,在实际应用中该预测可应用于任何类型的突变。进一步分析表明我们的模型依赖于生物学上有意义的特征来进行预测。为了克服将突变分为“稳定化”和“去稳定化”突变的缺点,还讨论了突变的三类分类,我们的方法在其中获得了79.1%的总体准确率。

相似文献

1
Robust prediction of mutation-induced protein stability change by property encoding of amino acids.通过氨基酸的属性编码对突变诱导的蛋白质稳定性变化进行稳健预测。
Protein Eng Des Sel. 2009 Feb;22(2):75-83. doi: 10.1093/protein/gzn063. Epub 2008 Dec 2.
2
Predicting protein stability changes from sequences using support vector machines.使用支持向量机从序列预测蛋白质稳定性变化。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii54-8. doi: 10.1093/bioinformatics/bti1109.
3
Accurate prediction of stability changes in protein mutants by combining machine learning with structure based computational mutagenesis.通过将机器学习与基于结构的计算诱变相结合,准确预测蛋白质突变体的稳定性变化。
Bioinformatics. 2008 Sep 15;24(18):2002-9. doi: 10.1093/bioinformatics/btn353. Epub 2008 Jul 16.
4
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.基于支持向量机,利用氨基酸残基和氨基酸残基对的结构特性对蛋白质折叠进行分类。
Bioinformatics. 2007 Dec 15;23(24):3320-7. doi: 10.1093/bioinformatics/btm527. Epub 2007 Nov 7.
5
Knowledge acquisition and development of accurate rules for predicting protein stability changes.知识获取以及用于预测蛋白质稳定性变化的精确规则的开发。
Comput Biol Chem. 2006 Dec;30(6):408-15. doi: 10.1016/j.compbiolchem.2006.06.004. Epub 2006 Sep 26.
6
A neural-network-based method for predicting protein stability changes upon single point mutations.一种基于神经网络的方法,用于预测单点突变后蛋白质稳定性的变化。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i63-8. doi: 10.1093/bioinformatics/bth928.
7
Computational modeling of protein mutant stability: analysis and optimization of statistical potentials and structural features reveal insights into prediction model development.蛋白质突变体稳定性的计算建模:统计势和结构特征的分析与优化为预测模型开发提供了见解。
BMC Struct Biol. 2007 Aug 16;7:54. doi: 10.1186/1472-6807-7-54.
8
Average assignment method for predicting the stability of protein mutants.预测蛋白质突变体稳定性的平均分配方法。
Biopolymers. 2006 May;82(1):80-92. doi: 10.1002/bip.20462.
9
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.
10
Prediction of protein stability upon point mutations.点突变后蛋白质稳定性的预测。
Biochem Soc Trans. 2007 Dec;35(Pt 6):1569-73. doi: 10.1042/BST0351569.

引用本文的文献

1
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
2
Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.变异影响预测数据库(VIPdb),版本2:25年基因变异影响预测的趋势
bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.
3
Critical assessment of structure-based approaches to improve protein resistance in aqueous ionic liquids by enzyme-wide saturation mutagenesis.
通过全酶饱和诱变对基于结构的方法进行批判性评估,以提高蛋白质在水性离子液体中的抗性。
Comput Struct Biotechnol J. 2021 Dec 16;20:399-409. doi: 10.1016/j.csbj.2021.12.018. eCollection 2022.
4
VIPdb, a genetic Variant Impact Predictor Database.VIPdb,一个遗传变异影响预测数据库。
Hum Mutat. 2019 Sep;40(9):1202-1214. doi: 10.1002/humu.23858. Epub 2019 Aug 17.
5
Feature-based multiple models improve classification of mutation-induced stability changes.基于特征的多模型改进了对突变诱导稳定性变化的分类。
BMC Genomics. 2014;15 Suppl 4(Suppl 4):S6. doi: 10.1186/1471-2164-15-S4-S6. Epub 2014 May 20.
6
Computational and experimental approaches to reveal the effects of single nucleotide polymorphisms with respect to disease diagnostics.揭示单核苷酸多态性对疾病诊断影响的计算方法和实验方法。
Int J Mol Sci. 2014 May 30;15(6):9670-717. doi: 10.3390/ijms15069670.
7
Towards sequence-based prediction of mutation-induced stability changes in unseen non-homologous proteins.基于序列预测未见的非同源蛋白质中突变诱导的稳定性变化。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2164-15-S1-S4. Epub 2014 Jan 24.
8
Molecular mechanisms of disease-causing missense mutations.致病错义突变的分子机制。
J Mol Biol. 2013 Nov 1;425(21):3919-36. doi: 10.1016/j.jmb.2013.07.014. Epub 2013 Jul 16.
9
A rational free energy-based approach to understanding and targeting disease-causing missense mutations.基于合理自由能的方法来理解和靶向致病变异。
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):643-51. doi: 10.1136/amiajnl-2012-001505. Epub 2013 Feb 13.
10
Grading amino acid properties increased accuracies of single point mutation on protein stability prediction.对氨基酸性质进行分级提高了单点突变预测蛋白质稳定性的准确性。
BMC Bioinformatics. 2012 Mar 22;13:44. doi: 10.1186/1471-2105-13-44.