Suppr超能文献

SAAFEC-SEQ:一种基于序列的方法,用于预测单点突变对蛋白质热力学稳定性的影响。

SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability.

机构信息

Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.

出版信息

Int J Mol Sci. 2021 Jan 9;22(2):606. doi: 10.3390/ijms22020606.

Abstract

Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers' code.

摘要

建模突变对蛋白质热力学稳定性的影响对于蛋白质工程和理解致病变异的分子机制很有用。在这里,我们报告了 SAAFEC 方法的一个新进展,即 SAAFEC-SEQ,这是一种梯度提升决策树机器学习方法,用于预测氨基酸取代引起的折叠自由能变化。该方法不需要对应蛋白质的 3D 结构,而只需要其序列,因此可以应用于结构信息非常稀疏的基因组规模研究。SAAFEC-SEQ 使用物理化学性质、序列特征和进化信息特征进行预测。在几个独立的数据集上进行基准测试时,它在 Pearson 相关系数和均方根误差参数方面始终优于所有现有的基于序列的方法。SAAFEC-SEQ 已被实现为一个网络服务器,并作为独立的代码提供,可以下载并嵌入到其他研究人员的代码中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/84939447dfd6/ijms-22-00606-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验