• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SAAFEC-SEQ:一种基于序列的方法,用于预测单点突变对蛋白质热力学稳定性的影响。

SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability.

机构信息

Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.

出版信息

Int J Mol Sci. 2021 Jan 9;22(2):606. doi: 10.3390/ijms22020606.

DOI:10.3390/ijms22020606
PMID:33435356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7827184/
Abstract

Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers' code.

摘要

建模突变对蛋白质热力学稳定性的影响对于蛋白质工程和理解致病变异的分子机制很有用。在这里,我们报告了 SAAFEC 方法的一个新进展,即 SAAFEC-SEQ,这是一种梯度提升决策树机器学习方法,用于预测氨基酸取代引起的折叠自由能变化。该方法不需要对应蛋白质的 3D 结构,而只需要其序列,因此可以应用于结构信息非常稀疏的基因组规模研究。SAAFEC-SEQ 使用物理化学性质、序列特征和进化信息特征进行预测。在几个独立的数据集上进行基准测试时,它在 Pearson 相关系数和均方根误差参数方面始终优于所有现有的基于序列的方法。SAAFEC-SEQ 已被实现为一个网络服务器,并作为独立的代码提供,可以下载并嵌入到其他研究人员的代码中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/a3d7e1f9e8fb/ijms-22-00606-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/84939447dfd6/ijms-22-00606-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/603261686f5d/ijms-22-00606-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/a3d7e1f9e8fb/ijms-22-00606-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/84939447dfd6/ijms-22-00606-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/603261686f5d/ijms-22-00606-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a185/7827184/a3d7e1f9e8fb/ijms-22-00606-g003.jpg

相似文献

1
SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability.SAAFEC-SEQ:一种基于序列的方法,用于预测单点突变对蛋白质热力学稳定性的影响。
Int J Mol Sci. 2021 Jan 9;22(2):606. doi: 10.3390/ijms22020606.
2
SAAFEC: Predicting the Effect of Single Point Mutations on Protein Folding Free Energy Using a Knowledge-Modified MM/PBSA Approach.SAAFEC:使用知识修正的MM/PBSA方法预测单点突变对蛋白质折叠自由能的影响。
Int J Mol Sci. 2016 Apr 7;17(4):512. doi: 10.3390/ijms17040512.
3
PROST: AlphaFold2-aware Sequence-Based Predictor to Estimate Protein Stability Changes upon Missense Mutations.PROST:基于序列的 AlphaFold2 感知预测器,用于预测错义突变后蛋白质稳定性的变化。
J Chem Inf Model. 2022 Sep 12;62(17):4270-4282. doi: 10.1021/acs.jcim.2c00799. Epub 2022 Aug 16.
4
SAAMBE-SEQ: a sequence-based method for predicting mutation effect on protein-protein binding affinity.SAAMBE-SEQ:一种基于序列的方法,用于预测突变对蛋白质-蛋白质结合亲和力的影响。
Bioinformatics. 2021 May 17;37(7):992-999. doi: 10.1093/bioinformatics/btaa761.
5
Predicting the Effect of Amino Acid Single-Point Mutations on Protein Stability-Large-Scale Validation of MD-Based Relative Free Energy Calculations.预测氨基酸单点突变对蛋白质稳定性的影响——基于分子动力学的相对自由能计算的大规模验证
J Mol Biol. 2017 Apr 7;429(7):948-963. doi: 10.1016/j.jmb.2016.12.007. Epub 2016 Dec 10.
6
Predicting folding free energy changes upon single point mutations.预测单点突变后折叠自由能的变化。
Bioinformatics. 2012 Mar 1;28(5):664-71. doi: 10.1093/bioinformatics/bts005. Epub 2012 Jan 11.
7
INPS: predicting the impact of non-synonymous variations on protein stability from sequence.INPS:从序列预测非同义变异对蛋白质稳定性的影响。
Bioinformatics. 2015 Sep 1;31(17):2816-21. doi: 10.1093/bioinformatics/btv291. Epub 2015 May 7.
8
STRUM: structure-based prediction of protein stability changes upon single-point mutation.STRUM:基于结构预测单点突变后蛋白质稳定性的变化
Bioinformatics. 2016 Oct 1;32(19):2936-46. doi: 10.1093/bioinformatics/btw361. Epub 2016 Jun 17.
9
SAMPDI-3D: predicting the effects of protein and DNA mutations on protein-DNA interactions.SAMPDI-3D:预测蛋白质和 DNA 突变对蛋白质-DNA 相互作用的影响。
Bioinformatics. 2021 Nov 5;37(21):3760-3765. doi: 10.1093/bioinformatics/btab567.
10
EASE-MM: Sequence-Based Prediction of Mutation-Induced Stability Changes with Feature-Based Multiple Models.EASE-MM:基于序列的突变诱导稳定性变化预测与基于特征的多模型方法
J Mol Biol. 2016 Mar 27;428(6):1394-1405. doi: 10.1016/j.jmb.2016.01.012. Epub 2016 Jan 22.

引用本文的文献

1
CDKL5 Deficiency Disorder: Revealing the Molecular Mechanism of Pathogenic Variants.CDKL5缺陷障碍:揭示致病变异的分子机制
Int J Mol Sci. 2025 Aug 29;26(17):8399. doi: 10.3390/ijms26178399.
2
Genotype Distribution and Migration Patterns of Hepatitis C Virus in Shandong Province, China: Molecular Epidemiology and Phylogenetic Study.中国山东省丙型肝炎病毒的基因型分布与传播模式:分子流行病学与系统发育研究
JMIR Med Inform. 2025 Aug 18;13:e60207. doi: 10.2196/60207.
3
Impact of SARS-CoV-2 Variant NSP6 on Pathogenicity: Genetic Analysis and Cell Biology.

本文引用的文献

1
Prognosis and personalized treatment prediction in TP53-mutant hepatocellular carcinoma: an in silico strategy towards precision oncology.TP53 突变型肝细胞癌的预后和个体化治疗预测:精准肿瘤学的一种计算策略。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa164.
2
Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine.基因组变异后蛋白质稳定性预测的局限性与挑战:迈向精准医学的未来应用
Comput Struct Biotechnol J. 2020 Jul 24;18:1968-1979. doi: 10.1016/j.csbj.2020.07.011. eCollection 2020.
3
The Human Genome Project changed everything.
严重急性呼吸综合征冠状病毒2变体NSP6对致病性的影响:遗传分析与细胞生物学
Curr Issues Mol Biol. 2025 May 14;47(5):361. doi: 10.3390/cimb47050361.
4
Shared-weight graph framework for comprehensive protein stability prediction across diverse mutation types.用于跨多种突变类型进行全面蛋白质稳定性预测的共享权重图框架。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf190.
5
Development of a β-glucosidase improved for glucose retroinhibition for cellulosic ethanol production: an integrated bioinformatics and genetic engineering approach.用于纤维素乙醇生产的、对葡萄糖反馈抑制具有改善作用的β-葡萄糖苷酶的开发:一种生物信息学与基因工程相结合的方法
Biotechnol Biofuels Bioprod. 2025 Apr 5;18(1):44. doi: 10.1186/s13068-025-02643-4.
6
Adoption of an in-silico analysis approach to assess the functional and structural impacts of rpoB-encoded protein mutations on Chlamydia pneumoniae sensitivity to antibiotics.采用计算机模拟分析方法评估肺炎衣原体rpoB编码蛋白突变对抗生素敏感性的功能和结构影响。
BMC Microbiol. 2025 Mar 19;25(1):157. doi: 10.1186/s12866-025-03860-5.
7
Structural insights into SOD1: from in silico and molecular dynamics to experimental analyses of ALS-associated E49K and R115G mutants.超氧化物歧化酶1的结构见解:从计算机模拟和分子动力学到肌萎缩侧索硬化症相关E49K和R115G突变体的实验分析
Front Mol Biosci. 2025 Feb 25;12:1532375. doi: 10.3389/fmolb.2025.1532375. eCollection 2025.
8
Assessing the predicted impact of single amino acid substitutions in MAPK proteins for CAGI6 challenges.评估丝裂原活化蛋白激酶(MAPK)蛋白中单个氨基酸取代对CAGI6挑战的预测影响。
Hum Genet. 2025 Mar;144(2-3):265-280. doi: 10.1007/s00439-024-02724-8. Epub 2025 Feb 20.
9
Exploring the impact of deleterious missense nonsynonymous single nucleotide polymorphisms in the DRD4 gene using computational approaches.使用计算方法探索DRD4基因中有害错义非同义单核苷酸多态性的影响。
Sci Rep. 2025 Jan 24;15(1):3150. doi: 10.1038/s41598-025-86916-w.
10
Impact of frequent ARID1A mutations on protein stability provides insights into cancer pathogenesis.频繁的ARID1A突变对蛋白质稳定性的影响为癌症发病机制提供了见解。
Sci Rep. 2025 Jan 24;15(1):3072. doi: 10.1038/s41598-025-87103-7.
人类基因组计划改变了一切。
Nat Rev Genet. 2020 Oct;21(10):575-576. doi: 10.1038/s41576-020-0275-3.
4
Predicting the stability of mutant proteins by computational approaches: an overview.计算方法预测突变蛋白的稳定性:概述。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa074.
5
SAAMBE-3D: Predicting Effect of Mutations on Protein-Protein Interactions.SAAMBE-3D:预测突变对蛋白质-蛋白质相互作用的影响。
Int J Mol Sci. 2020 Apr 7;21(7):2563. doi: 10.3390/ijms21072563.
6
Accurately Predicting Mutation-Caused Stability Changes from Protein Sequences Using Extreme Gradient Boosting.利用极端梯度提升从蛋白质序列准确预测突变引起的稳定性变化。
J Chem Inf Model. 2020 Apr 27;60(4):2388-2395. doi: 10.1021/acs.jcim.0c00064. Epub 2020 Mar 30.
7
A critical review of five machine learning-based algorithms for predicting protein stability changes upon mutation.五种基于机器学习的算法预测蛋白质突变稳定性变化的综述。
Brief Bioinform. 2020 Jul 15;21(4):1285-1292. doi: 10.1093/bib/bbz071.
8
DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations.DDGun:一种未经训练的方法,用于预测单点和多点变异后蛋白质稳定性的变化。
BMC Bioinformatics. 2019 Jul 3;20(Suppl 14):335. doi: 10.1186/s12859-019-2923-1.
9
Evaluating the predictions of the protein stability change upon single amino acid substitutions for the FXN CAGI5 challenge.评估 FXN CAGI5 挑战中单氨基酸替换对蛋白质稳定性变化的预测。
Hum Mutat. 2019 Sep;40(9):1392-1399. doi: 10.1002/humu.23843. Epub 2019 Jul 12.
10
PhyreRisk: A Dynamic Web Application to Bridge Genomics, Proteomics and 3D Structural Data to Guide Interpretation of Human Genetic Variants.PhyreRisk:一个动态的 Web 应用程序,将基因组学、蛋白质组学和 3D 结构数据联系起来,指导人类遗传变异的解释。
J Mol Biol. 2019 Jun 14;431(13):2460-2466. doi: 10.1016/j.jmb.2019.04.043. Epub 2019 May 7.