• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的蛋白质 p 值准确预测。

Basis for Accurate Protein p Prediction with Machine Learning.

机构信息

College of Computer Engineering, Jimei University, Xiamen 361021, China.

出版信息

J Chem Inf Model. 2023 May 22;63(10):2936-2947. doi: 10.1021/acs.jcim.3c00254. Epub 2023 May 5.

DOI:10.1021/acs.jcim.3c00254
PMID:37146199
Abstract

pH regulates protein structures and the associated functions in many biological processes via protonation and deprotonation of ionizable side chains where the titration equilibria are determined by p's. To accelerate pH-dependent molecular mechanism research in the life sciences or industrial protein and drug designs, fast and accurate p prediction is crucial. Here we present a theoretical p data set PHMD549, which was successfully applied to four distinct machine learning methods, including DeepKa, which was proposed in our previous work. To reach a valid comparison, EXP67S was selected as the test set. Encouragingly, DeepKa was improved significantly and outperforms other state-of-the-art methods, except for the constant-pH molecular dynamics, which was utilized to create PHMD549. More importantly, DeepKa reproduced experimental p orders of acidic dyads in five enzyme catalytic sites. Apart from structural proteins, DeepKa was found applicable to intrinsically disordered peptides. Further, in combination with solvent exposures, it is revealed that DeepKa offers the most accurate prediction under the challenging circumstance that hydrogen bonding or salt bridge interaction is partly compensated by desolvation for a buried side chain. Finally, our benchmark data qualify PHMD549 and EXP67S as the basis for future developments of protein p prediction tools driven by artificial intelligence. In addition, DeepKa built on PHMD549 has been proven an efficient protein p predictor and thus can be applied immediately to, for example, p database construction, protein design, drug discovery, and so on.

摘要

pH 通过对可离子化侧链的质子化和去质子化来调节许多生物过程中的蛋白质结构和相关功能,其中滴定平衡由 p 值决定。为了加速生命科学或工业蛋白质和药物设计中依赖 pH 的分子机制研究,快速准确的 p 值预测至关重要。在这里,我们提出了一个理论 pH 值数据集 PHMD549,该数据集已成功应用于四种不同的机器学习方法,包括我们之前工作中提出的 DeepKa。为了进行有效的比较,选择 EXP67S 作为测试集。令人鼓舞的是,DeepKa 得到了显著改进,优于其他最先进的方法,除了常 pH 值分子动力学,该方法用于创建 PHMD549。更重要的是,DeepKa 再现了五个酶催化位点中酸性偶联物的实验 pH 值顺序。除了结构蛋白,DeepKa 还被发现适用于固有无序肽。此外,结合溶剂暴露情况,结果表明,在部分由去溶剂化补偿氢键或盐桥相互作用的埋置侧链的挑战性环境下,DeepKa 提供了最准确的预测。最后,我们的基准数据将 PHMD549 和 EXP67S 作为未来人工智能驱动的蛋白质 pH 值预测工具的基础。此外,基于 PHMD549 构建的 DeepKa 已被证明是一种有效的蛋白质 pH 值预测器,因此可以立即应用于 pH 值数据库构建、蛋白质设计、药物发现等领域。

相似文献

1
Basis for Accurate Protein p Prediction with Machine Learning.基于机器学习的蛋白质 p 值准确预测。
J Chem Inf Model. 2023 May 22;63(10):2936-2947. doi: 10.1021/acs.jcim.3c00254. Epub 2023 May 5.
2
DeepKa Web Server: High-Throughput Protein p Prediction.DeepKa Web 服务器:高通量蛋白质 p 预测。
J Chem Inf Model. 2024 Apr 22;64(8):2933-2940. doi: 10.1021/acs.jcim.3c02013. Epub 2024 Mar 26.
3
Comparative Performance of High-Throughput Methods for Protein p Predictions.高通量方法进行蛋白质 p 预测的性能比较。
J Chem Inf Model. 2023 Aug 28;63(16):5169-5181. doi: 10.1021/acs.jcim.3c00165. Epub 2023 Aug 7.
4
GPU-Accelerated Implementation of Continuous Constant pH Molecular Dynamics in Amber: p Predictions with Single-pH Simulations.GPU 加速的 Amber 中连续恒 pH 分子动力学的实现:使用单 pH 模拟进行 p 预测。
J Chem Inf Model. 2019 Nov 25;59(11):4821-4832. doi: 10.1021/acs.jcim.9b00754. Epub 2019 Nov 14.
5
Coupled molecular dynamics and continuum electrostatic method to compute the ionization pKa's of proteins as a function of pH. Test on a large set of proteins.耦合分子动力学和连续静电方法计算蛋白质在 pH 值变化下的离解常数 pKa。在大量蛋白质上进行测试。
J Biomol Struct Dyn. 2018 Feb;36(3):561-574. doi: 10.1080/07391102.2017.1288169. Epub 2017 Feb 24.
6
p Calculations with the Polarizable Drude Force Field and Poisson-Boltzmann Solvation Model.用极化德拜力场和泊松-玻尔兹曼溶剂化模型进行计算。
J Chem Theory Comput. 2020 Jul 14;16(7):4655-4668. doi: 10.1021/acs.jctc.0c00111. Epub 2020 Jun 12.
7
Protein p Prediction with Machine Learning.基于机器学习的蛋白质p预测
ACS Omega. 2021 Dec 7;6(50):34823-34831. doi: 10.1021/acsomega.1c05440. eCollection 2021 Dec 21.
8
GPU-Accelerated All-Atom Particle-Mesh Ewald Continuous Constant pH Molecular Dynamics in Amber.GPU 加速全原子粒子网格 Ewald 连续常数 pH 分子动力学在 Amber 中。
J Chem Theory Comput. 2022 Dec 13;18(12):7510-7527. doi: 10.1021/acs.jctc.2c00586. Epub 2022 Nov 15.
9
Protein p Prediction by Tree-Based Machine Learning.基于树的机器学习进行蛋白质 p 预测。
J Chem Theory Comput. 2022 Apr 12;18(4):2673-2686. doi: 10.1021/acs.jctc.1c01257. Epub 2022 Mar 15.
10
Reliable and Accurate Prediction of Single-Residue p Values through Free Energy Perturbation Calculations.通过自由能微扰计算可靠且准确地预测单残基 p 值。
J Chem Theory Comput. 2022 Dec 13;18(12):7193-7204. doi: 10.1021/acs.jctc.2c00954. Epub 2022 Nov 16.

引用本文的文献

1
Structure-based rational design of covalent probes.基于结构的共价探针合理设计。
Commun Chem. 2025 Aug 12;8(1):242. doi: 10.1038/s42004-025-01606-y.
2
Harnessing computational technologies to facilitate antibody-drug conjugate development.利用计算技术促进抗体药物偶联物的开发。
Nat Chem Biol. 2025 Jun 27. doi: 10.1038/s41589-025-01950-z.
3
Protonation State of Active-Site Histidines, Reaction Mechanism and Stereoselectivity in β-Alanine Synthase: A Computational Study.β-丙氨酸合酶中活性位点组氨酸的质子化状态、反应机制和立体选择性:一项计算研究
J Phys Chem B. 2025 Jun 12;129(23):5664-5673. doi: 10.1021/acs.jpcb.5c00829. Epub 2025 Jun 3.
4
Protein Electrostatic Properties are Fine-Tuned Through Evolution.蛋白质的静电特性通过进化得到精细调节。
Res Sq. 2025 Apr 28:rs.3.rs-6471091. doi: 10.21203/rs.3.rs-6471091/v1.
5
Accurate Predictions of Molecular Properties of Proteins via Graph Neural Networks and Transfer Learning.通过图神经网络和迁移学习对蛋白质分子特性进行准确预测
J Chem Theory Comput. 2025 May 13;21(9):4830-4845. doi: 10.1021/acs.jctc.4c01682. Epub 2025 Apr 24.
6
KaMLs for Predicting Protein p Values and Ionization States: Are Trees All You Need?用于预测蛋白质p值和电离状态的KaMLs:树模型就足够了吗?
J Chem Theory Comput. 2025 Feb 11;21(3):1446-1458. doi: 10.1021/acs.jctc.4c01602. Epub 2025 Jan 30.
7
Improved Structure-Based Histidine p Prediction for pH-Responsive Protein Design.基于结构改进的组氨酸p预测用于pH响应蛋白设计
J Chem Inf Model. 2025 Feb 10;65(3):1560-1569. doi: 10.1021/acs.jcim.4c01957. Epub 2025 Jan 18.
8
Accurate Predictions of Molecular Properties of Proteins via Graph Neural Networks and Transfer Learning.通过图神经网络和迁移学习对蛋白质分子性质进行准确预测。
bioRxiv. 2024 Dec 12:2024.12.10.627714. doi: 10.1101/2024.12.10.627714.
9
KaMLs for Predicting Protein p Values and Ionization States: Are Trees All You Need?用于预测蛋白质p值和电离状态的KaMLs:你只需要决策树吗?
bioRxiv. 2025 Jan 30:2024.11.09.622800. doi: 10.1101/2024.11.09.622800.
10
Open-Source Machine Learning in Computational Chemistry.开源机器学习在计算化学中的应用。
J Chem Inf Model. 2023 Aug 14;63(15):4505-4532. doi: 10.1021/acs.jcim.3c00643. Epub 2023 Jul 19.