用于定量构效关系研究的蛋白质-配体解离动力学常数公共数据集。

Public Data Set of Protein-Ligand Dissociation Kinetic Constants for Quantitative Structure-Kinetics Relationship Studies.

作者信息

Liu Huisi, Su Minyi, Lin Hai-Xia, Wang Renxiao, Li Yan

机构信息

Department of Chemistry, College of Sciences, Shanghai University, 99 Shangda Road, Shanghai 200444, People's Republic of China.

State Key Laboratory of Bioorganic and Natural Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 200032, People's Republic of China.

出版信息

ACS Omega. 2022 May 26;7(22):18985-18996. doi: 10.1021/acsomega.2c02156. eCollection 2022 Jun 7.

DOI:10.1021/acsomega.2c02156

PMID:35694511

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9178723/

Abstract

Protein-ligand binding affinity reflects the equilibrium thermodynamics of the protein-ligand binding process. Binding/unbinding kinetics is the other side of the coin. Computational models for interpreting the quantitative structure-kinetics relationship (QSKR) aim at predicting protein-ligand binding/unbinding kinetics based on protein structure, ligand structure, or their complex structure, which in principle can provide a more rational basis for structure-based drug design. Thus far, most of the public data sets used for deriving such QSKR models are rather limited in sample size and structural diversity. To tackle this problem, we have compiled a set of 680 protein-ligand complexes with experimental dissociation rate constants ( ), which were mainly curated from the references accumulated for updating our PDBbind database. Three-dimensional structure of each protein-ligand complex in this data set was either retrieved from the Protein Data Bank or carefully modeled based on a proper template. The entire data set covers 155 types of protein, with their dissociation kinetic constants ( ) spanning nearly 10 orders of magnitude. To the best of our knowledge, this data set is the largest of its kind reported publicly. Utilizing this data set, we derived a random forest (RF) model based on protein-ligand atom pair descriptors for predicting values. We also demonstrated that utilizing modeled structures as additional training samples will benefit the model performance. The RF model with mixed structures can serve as a baseline for testifying other more sophisticated QSKR models. The whole data set, namely, , is available for free download at our PDBbind-CN web site (http://www.pdbbind.org.cn/download.php).

摘要

蛋白质-配体结合亲和力反映了蛋白质-配体结合过程的平衡热力学。结合/解离动力学则是问题的另一方面。用于解释定量结构-动力学关系（QSKR）的计算模型旨在基于蛋白质结构、配体结构或它们的复合物结构来预测蛋白质-配体的结合/解离动力学，原则上可为基于结构的药物设计提供更合理的基础。到目前为止，用于推导此类QSKR模型的大多数公共数据集在样本量和结构多样性方面都相当有限。为了解决这个问题，我们整理了一组680个具有实验解离速率常数（）的蛋白质-配体复合物，这些复合物主要是从为更新我们的PDBbind数据库而积累的参考文献中挑选出来的。该数据集中每个蛋白质-配体复合物的三维结构要么从蛋白质数据库中检索，要么基于合适的模板进行精心建模。整个数据集涵盖155种蛋白质，其解离动力学常数（）跨越近10个数量级。据我们所知，这个数据集是公开报道的同类数据集中最大的。利用这个数据集，我们基于蛋白质-配体原子对描述符推导了一个随机森林（RF）模型来预测值。我们还证明，将建模结构用作额外的训练样本将有利于模型性能。具有混合结构的RF模型可作为验证其他更复杂的QSKR模型的基线。整个数据集，即，可在我们的PDBbind-CN网站（http://www.pdbbind.org.cn/download.php）上免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/094b/9178723/cec51d91d4e8/ao2c02156_0002.jpg

相似文献

Public Data Set of Protein-Ligand Dissociation Kinetic Constants for Quantitative Structure-Kinetics Relationship Studies.

ACS Omega. 2022 May 26;7(22):18985-18996. doi: 10.1021/acsomega.2c02156. eCollection 2022 Jun 7.

Baseline Model for Predicting Protein-Ligand Unbinding Kinetics through Machine Learning.

J Chem Inf Model. 2020 Dec 28;60(12):5946-5956. doi: 10.1021/acs.jcim.0c00450. Epub 2020 Nov 13.

Prediction of the Drug-Target Binding Kinetics for Flexible Proteins by Comparative Binding Energy Analysis.

J Chem Inf Model. 2021 Jul 26;61(7):3708-3721. doi: 10.1021/acs.jcim.1c00639. Epub 2021 Jul 1.

A Protocol to Use Comparative Binding Energy Analysis to Estimate Drug-Target Residence Time.

Methods Mol Biol. 2021;2266:171-186. doi: 10.1007/978-1-0716-1209-5_10.

PDB-wide collection of binding data: current status of the PDBbind database.

Bioinformatics. 2015 Feb 1;31(3):405-12. doi: 10.1093/bioinformatics/btu626. Epub 2014 Oct 9.

Computational modeling approaches to quantitative structure-binding kinetics relationships in drug discovery.

Drug Discov Today. 2018 Jul;23(7):1396-1406. doi: 10.1016/j.drudis.2018.03.010. Epub 2018 Mar 21.

Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

Acc Chem Res. 2017 Feb 21;50(2):302-309. doi: 10.1021/acs.accounts.6b00491. Epub 2017 Feb 9.

The PDBbind database: methodologies and updates.

J Med Chem. 2005 Jun 16;48(12):4111-9. doi: 10.1021/jm048957q.

Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction.

ArXiv. 2024 May 3:arXiv:2308.09639v2.

A High-Quality Data Set of Protein-Ligand Binding Interactions Via Comparative Complex Structure Modeling.

J Chem Inf Model. 2024 Apr 8;64(7):2454-2466. doi: 10.1021/acs.jcim.3c01170. Epub 2024 Jan 5.

引用本文的文献

Machine learning methods for developments of binding kinetic models in predicting protein-ligand dissociation rate constants.

Smart Mol. 2023 Nov 10;1(3):e20230012. doi: 10.1002/smo.20230012. eCollection 2023 Dec.

The Quasi-Bound State as a Predictor of Relative Binding Free Energy.

J Chem Inf Model. 2025 Jun 9;65(11):5544-5552. doi: 10.1021/acs.jcim.5c00289. Epub 2025 May 20.

High-Throughput Ligand Dissociation Kinetics Predictions Using Site Identification by Ligand Competitive Saturation.

J Chem Theory Comput. 2025 May 13;21(9):4964-4978. doi: 10.1021/acs.jctc.5c00265. Epub 2025 Apr 26.

Advances in Protein-Ligand Binding Affinity Prediction via Deep Learning: A Comprehensive Study of Datasets, Data Preprocessing Techniques, and Model Architectures.

Curr Drug Targets. 2024;25(15):1041-1065. doi: 10.2174/0113894501330963240905083020.

Binding Curve Viewer: Visualizing the Equilibrium and Kinetics of Protein-Ligand Binding and Competitive Binding.

J Chem Inf Model. 2024 May 27;64(10):4180-4192. doi: 10.1021/acs.jcim.4c00130. Epub 2024 May 8.

Accurate Characterization of Binding Kinetics and Allosteric Mechanisms for the HSP90 Chaperone Inhibitors Using AI-Augmented Integrative Biophysical Studies.

JACS Au. 2024 Apr 1;4(4):1632-1645. doi: 10.1021/jacsau.4c00123. eCollection 2024 Apr 22.

Fingerprinting Interactions between Proteins and Ligands for Facilitating Machine Learning in Drug Discovery.

Biomolecules. 2024 Jan 5;14(1):72. doi: 10.3390/biom14010072.

Structure-Kinetic Relationship for Drug Design Revealed by a PLS Model with Retrosynthesis-Based Pre-Trained Molecular Representation and Molecular Dynamics Simulation.

ACS Omega. 2023 May 12;8(20):18312-18322. doi: 10.1021/acsomega.3c02294. eCollection 2023 May 23.

Predicting Biomolecular Binding Kinetics: A Review.

J Chem Theory Comput. 2023 Apr 25;19(8):2135-2148. doi: 10.1021/acs.jctc.2c01085. Epub 2023 Mar 29.

本文引用的文献

Baseline Model for Predicting Protein-Ligand Unbinding Kinetics through Machine Learning.

J Chem Inf Model. 2020 Dec 28;60(12):5946-5956. doi: 10.1021/acs.jcim.0c00450. Epub 2020 Nov 13.

Ligand Gaussian Accelerated Molecular Dynamics (LiGaMD): Characterization of Ligand Binding Thermodynamics and Kinetics.

J Chem Theory Comput. 2020 Sep 8;16(9):5526-5547. doi: 10.1021/acs.jctc.0c00395. Epub 2020 Aug 7.

ClusPro LigTBM: Automated Template-based Small Molecule Docking.

J Mol Biol. 2020 May 15;432(11):3404-3410. doi: 10.1016/j.jmb.2019.12.011. Epub 2019 Dec 19.

KOFFI and Anabel 2.0-a new binding kinetics database and its integration in an open-source binding analysis software.

Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz101.

Updates to Binding MOAD (Mother of All Databases): Polypharmacology Tools and Their Utility in Drug Repurposing.

J Mol Biol. 2019 Jun 14;431(13):2423-2433. doi: 10.1016/j.jmb.2019.05.024. Epub 2019 May 22.

Prediction of Drug-Target Binding Kinetics by Comparative Binding Energy Analysis.

ACS Med Chem Lett. 2018 Oct 4;9(11):1134-1139. doi: 10.1021/acsmedchemlett.8b00397. eCollection 2018 Nov 8.

ChEMBL: towards direct deposition of bioassay data.

Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940. doi: 10.1093/nar/gky1075.

Computational modeling approaches to quantitative structure-binding kinetics relationships in drug discovery.

Drug Discov Today. 2018 Jul;23(7):1396-1406. doi: 10.1016/j.drudis.2018.03.010. Epub 2018 Mar 21.

New approaches for computing ligand-receptor binding kinetics.

Curr Opin Struct Biol. 2018 Apr;49:1-10. doi: 10.1016/j.sbi.2017.10.001. Epub 2017 Nov 11.

Lessons learned from participating in D3R 2016 Grand Challenge 2: compounds targeting the farnesoid X receptor.

J Comput Aided Mol Des. 2018 Jan;32(1):103-111. doi: 10.1007/s10822-017-0082-x. Epub 2017 Nov 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于定量构效关系研究的蛋白质-配体解离动力学常数公共数据集。

Public Data Set of Protein-Ligand Dissociation Kinetic Constants for Quantitative Structure-Kinetics Relationship Studies.

作者信息

Liu Huisi, Su Minyi, Lin Hai-Xia, Wang Renxiao, Li Yan

机构信息

Department of Chemistry, College of Sciences, Shanghai University, 99 Shangda Road, Shanghai 200444, People's Republic of China.

State Key Laboratory of Bioorganic and Natural Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 200032, People's Republic of China.

出版信息

ACS Omega. 2022 May 26;7(22):18985-18996. doi: 10.1021/acsomega.2c02156. eCollection 2022 Jun 7.

DOI:10.1021/acsomega.2c02156

PMID:35694511

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9178723/

Abstract

摘要

用于定量构效关系研究的蛋白质-配体解离动力学常数公共数据集。

Public Data Set of Protein-Ligand Dissociation Kinetic Constants for Quantitative Structure-Kinetics Relationship Studies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于定量构效关系研究的蛋白质-配体解离动力学常数公共数据集。

Public Data Set of Protein-Ligand Dissociation Kinetic Constants for Quantitative Structure-Kinetics Relationship Studies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献