• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将失败转化为应用:蛋白质 ΔΔG 预测的问题。

Turning Failures into Applications: The Problem of Protein ΔΔG Prediction.

机构信息

Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy.

Department of Medical Sciences, University of Torino, Turin, Italy.

出版信息

Methods Mol Biol. 2022;2449:169-185. doi: 10.1007/978-1-0716-2095-3_6.

DOI:10.1007/978-1-0716-2095-3_6
PMID:35507262
Abstract

After nearly two decades of research in the field of computational methods based on machine learning and knowledge-based potentials for ΔG and ΔΔG prediction upon variations, we now realize that all the approaches are poorly performing when tested on specific cases and that there is large space for improvement. Why this is so? Is it wrong the underlying assumption that experimental protein thermodynamics in solution reflects the thermodynamics of a single protein? Both machine learning and knowledge-based computational methods are rigorous and we know the solid theory behind. We are now in a critical situation, which suggests that predictions of protein instability upon variation should be considered with care. In the following, we will show how to cope with the problem of understanding which protein positions may be of interest for biotechnological and biomedical purposes. By applying a consensus procedure, we indicate possible strategies for the result interpretation.

摘要

经过近二十年的研究,我们在基于机器学习的计算方法和基于知识的势能领域取得了进展,这些方法可用于预测结构变化对 ΔG 和 ΔΔG 的影响。现在我们意识到,当在特定情况下进行测试时,所有方法的性能都很差,还有很大的改进空间。为什么会这样呢?是否错误地假设了溶液中实验蛋白质热力学反映了单个蛋白质的热力学?机器学习和基于知识的计算方法都是严谨的,我们也了解其背后的坚实理论。我们现在正处于一个关键的情况,这表明应该谨慎考虑对蛋白质变异不稳定性的预测。在下面,我们将展示如何应对理解哪些蛋白质位置可能对生物技术和生物医学目的有意义的问题。通过应用共识程序,我们为结果解释指明了可能的策略。

相似文献

1
Turning Failures into Applications: The Problem of Protein ΔΔG Prediction.将失败转化为应用:蛋白质 ΔΔG 预测的问题。
Methods Mol Biol. 2022;2449:169-185. doi: 10.1007/978-1-0716-2095-3_6.
2
Are machine learning based methods suited to address complex biological problems? Lessons from CAGI-5 challenges.基于机器学习的方法是否适合解决复杂的生物学问题?来自 CAGI-5 挑战赛的经验教训。
Hum Mutat. 2019 Sep;40(9):1455-1462. doi: 10.1002/humu.23784. Epub 2019 Jun 18.
3
Prediction of hot spot residues at protein-protein interfaces by combining machine learning and energy-based methods.通过机器学习和基于能量的方法相结合来预测蛋白质-蛋白质界面的热点残基。
BMC Bioinformatics. 2009 Oct 30;10:365. doi: 10.1186/1471-2105-10-365.
4
Using diverse potentials and scoring functions for the development of improved machine-learned models for protein-ligand affinity and docking pose prediction.利用多样化的势能和评分函数,开发改进的基于机器学习的蛋白质-配体亲和力和对接构象预测模型。
J Comput Aided Mol Des. 2021 Nov;35(11):1095-1123. doi: 10.1007/s10822-021-00423-4. Epub 2021 Oct 28.
5
Examining the assumptions underlying continuum-solvent models.审视连续介质溶剂模型背后的假设。
J Chem Theory Comput. 2015 Oct 13;11(10):4593-600. doi: 10.1021/acs.jctc.5b00684. Epub 2015 Sep 11.
6
Accurate prediction of stability changes in protein mutants by combining machine learning with structure based computational mutagenesis.通过将机器学习与基于结构的计算诱变相结合,准确预测蛋白质突变体的稳定性变化。
Bioinformatics. 2008 Sep 15;24(18):2002-9. doi: 10.1093/bioinformatics/btn353. Epub 2008 Jul 16.
7
Computational prediction of protein functional sites-Applications in biotechnology and biomedicine.计算预测蛋白质功能位点——在生物技术和生物医学中的应用。
Adv Protein Chem Struct Biol. 2022;130:39-57. doi: 10.1016/bs.apcsb.2021.12.001. Epub 2022 Jan 17.
8
Computationally predicting binding affinity in protein-ligand complexes: free energy-based simulations and machine learning-based scoring functions.计算预测蛋白质-配体复合物中的结合亲和力:基于自由能的模拟和基于机器学习的评分函数。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa107.
9
Development of a machine-learning model to predict Gibbs free energy of binding for protein-ligand complexes.开发一种用于预测蛋白-配体复合物结合自由能的机器学习模型。
Biophys Chem. 2018 Sep;240:63-69. doi: 10.1016/j.bpc.2018.05.010. Epub 2018 Jun 7.
10
Volume-based solvation models out-perform area-based models in combined studies of wild-type and mutated protein-protein interfaces.在野生型和突变型蛋白质-蛋白质界面的联合研究中,基于体积的溶剂化模型比基于面积的模型表现更优。
BMC Bioinformatics. 2008 Oct 21;9:448. doi: 10.1186/1471-2105-9-448.

引用本文的文献

1
Homologous Pairs of Low and High Temperature Originating Proteins Spanning the Known Prokaryotic Universe.同源对的低温和高温起源蛋白跨越已知的原核生物界。
Sci Data. 2023 Oct 7;10(1):682. doi: 10.1038/s41597-023-02553-w.
2
Stability and expression of SARS-CoV-2 spike-protein mutations.SARS-CoV-2 刺突蛋白突变的稳定性和表达。
Mol Cell Biochem. 2023 Jun;478(6):1269-1280. doi: 10.1007/s11010-022-04588-w. Epub 2022 Oct 27.
3
Conformational Stability and Denaturation Processes of Proteins Investigated by Electrophoresis under Extreme Conditions.

本文引用的文献

1
A base measure of precision for protein stability predictors: structural sensitivity.蛋白质稳定性预测器的基本精度度量:结构敏感性。
BMC Bioinformatics. 2021 Feb 25;22(1):88. doi: 10.1186/s12859-021-04030-w.
2
Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks.利用深度 3D 卷积神经网络预测点突变对蛋白质热力学稳定性的影响。
PLoS Comput Biol. 2020 Nov 30;16(11):e1008291. doi: 10.1371/journal.pcbi.1008291. eCollection 2020 Nov.
3
ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years.
极端条件下电泳研究蛋白质的构象稳定性和变性过程。
Molecules. 2022 Oct 13;27(20):6861. doi: 10.3390/molecules27206861.
4
Structural heterogeneity and precision of implications drawn from cryo-electron microscopy structures: SARS-CoV-2 spike-protein mutations as a test case.结构异质性和从冷冻电子显微镜结构中得出的结论的精确性:以 SARS-CoV-2 刺突蛋白突变为例。
Eur Biophys J. 2022 Dec;51(7-8):555-568. doi: 10.1007/s00249-022-01619-8. Epub 2022 Sep 27.
5
A Glance into MTHFR Deficiency at a Molecular Level.从分子水平看 MTHFR 缺乏症。
Int J Mol Sci. 2021 Dec 23;23(1):167. doi: 10.3390/ijms23010167.
ProThermDB:经过 15 年的回顾,蛋白质和突变体的热力学数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D420-D424. doi: 10.1093/nar/gkaa1035.
4
FireProtDB: database of manually curated protein stability data.FireProtDB:人工 curated 蛋白质稳定性数据数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D319-D324. doi: 10.1093/nar/gkaa981.
5
ThermoMutDB: a thermodynamic database for missense mutations.ThermoMutDB:一个错义突变热力学数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D475-D479. doi: 10.1093/nar/gkaa925.
6
Systematic Investigation of the Data Set Dependency of Protein Stability Predictors.系统研究蛋白质稳定性预测因子对数据集的依赖性。
J Chem Inf Model. 2020 Oct 26;60(10):4772-4784. doi: 10.1021/acs.jcim.0c00591. Epub 2020 Aug 24.
7
Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine.基因组变异后蛋白质稳定性预测的局限性与挑战:迈向精准医学的未来应用
Comput Struct Biotechnol J. 2020 Jul 24;18:1968-1979. doi: 10.1016/j.csbj.2020.07.011. eCollection 2020.
8
Evaluating Protein Engineering Thermostability Prediction Tools Using an Independently Generated Dataset.使用独立生成的数据集评估蛋白质工程热稳定性预测工具
ACS Omega. 2020 Mar 20;5(12):6487-6493. doi: 10.1021/acsomega.9b04105. eCollection 2020 Mar 31.
9
Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis.通过全域全面突变揭示蛋白质稳定性工程的见解。
Proc Natl Acad Sci U S A. 2019 Aug 13;116(33):16367-16377. doi: 10.1073/pnas.1903888116. Epub 2019 Aug 1.
10
Reports from the fifth edition of CAGI: The Critical Assessment of Genome Interpretation.来自第五版 CAGI 的报告:基因组解读的关键评估。
Hum Mutat. 2019 Sep;40(9):1197-1201. doi: 10.1002/humu.23876. Epub 2019 Aug 26.