IDL-PPBopt：一种通过可解释深度学习方法预测和优化化合物与人血浆蛋白结合的策略。

IDL-PPBopt: A Strategy for Prediction and Optimization of Human Plasma Protein Binding of Compounds via an Interpretable Deep Learning Method.

机构信息

Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China.

出版信息

J Chem Inf Model. 2022 Jun 13;62(11):2788-2799. doi: 10.1021/acs.jcim.2c00297. Epub 2022 May 24.

DOI:10.1021/acs.jcim.2c00297

PMID:35607907

Abstract

The prediction and optimization of pharmacokinetic properties are essential in lead optimization. Traditional strategies mainly depend on the empirical chemical rules from medicinal chemists. However, with the rising amount of data, it is getting more difficult to manually extract useful medicinal chemistry knowledge. To this end, we introduced IDL-PPBopt, a computational strategy for predicting and optimizing the plasma protein binding (PPB) property based on an interpretable deep learning method. At first, a curated PPB data set was used to construct an interpretable deep learning model, which showed excellent predictive performance with a root mean squared error of 0.112 for the entire test set. Then, we designed a detection protocol based on the model and Wilcoxon test to identify the PPB-related substructures (named privileged substructures, PSubs) for each molecule. In total, 22 general privileged substructures (GPSubs) were identified, which shared some common features such as nitrogen-containing groups, diamines with two carbon units, and azetidine. Furthermore, a series of second-level chemical rules for each GPSub were derived through a statistical test and then summarized into substructure pairs. We demonstrated that these substructure pairs were equally applicable outside the training set and accordingly customized the structural modification schemes for each GPSub, which provided alternatives for the optimization of the PPB property. Therefore, IDL-PPBopt provides a promising scheme for the prediction and optimization of the PPB property and would be helpful for lead optimization of other pharmacokinetic properties.

摘要

在先导优化中，预测和优化药代动力学性质是至关重要的。传统策略主要依赖于药物化学家的经验化学规则。然而，随着数据量的增加，手动提取有用的药物化学知识变得越来越困难。为此，我们引入了 IDLPBopt，这是一种基于可解释深度学习方法预测和优化血浆蛋白结合（PPB）性质的计算策略。首先，我们使用经过精心整理的 PPB 数据集来构建可解释的深度学习模型，该模型在整个测试集上表现出出色的预测性能，均方根误差为 0.112。然后，我们基于该模型和 Wilcoxon 检验设计了一种检测方案，以识别每个分子的与 PPB 相关的亚结构（称为特权亚结构，PSubs）。总共确定了 22 个通用特权亚结构（GPSubs），它们具有一些共同的特征，例如含氮基团、两个碳原子的二胺和氮杂环丁烷。此外，通过统计检验得出了每个 GPSub 的一系列二级化学规则，并将其总结为亚结构对。我们证明了这些亚结构对在训练集之外同样适用，并相应地为每个 GPSub 定制了结构修改方案，为优化 PPB 性质提供了替代方案。因此，IDL-PPBopt 为预测和优化 PPB 性质提供了一个有前途的方案，并将有助于其他药代动力学性质的先导优化。

相似文献

IDL-PPBopt: A Strategy for Prediction and Optimization of Human Plasma Protein Binding of Compounds via an Interpretable Deep Learning Method.

J Chem Inf Model. 2022 Jun 13;62(11):2788-2799. doi: 10.1021/acs.jcim.2c00297. Epub 2022 May 24.

Plasma protein binding prediction focusing on residue-level features and circularity of cyclic peptides by deep learning.

Bioinformatics. 2022 Jan 27;38(4):1110-1117. doi: 10.1093/bioinformatics/btab726.

Chemical rules for optimization of chemical mutagenicity via matched molecular pairs analysis and machine learning methods.

J Cheminform. 2023 Mar 20;15(1):35. doi: 10.1186/s13321-023-00707-x.

In Silico Prediction of Compounds Binding to Human Plasma Proteins by QSAR Models.

ChemMedChem. 2018 Mar 20;13(6):572-581. doi: 10.1002/cmdc.201700582. Epub 2017 Nov 10.

Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques.

BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):527. doi: 10.1186/s12859-018-2529-z.

Interpretable-ADMET: a web service for ADMET prediction and optimization based on deep neural representation.

Bioinformatics. 2022 May 13;38(10):2863-2871. doi: 10.1093/bioinformatics/btac192.

QSAR-assisted-MMPA to expand chemical transformation space for lead optimization.

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa374.

Predicting binding affinities of diverse pharmaceutical chemicals to human serum plasma proteins using QSPR modelling approaches.

SAR QSAR Environ Res. 2016;27(1):67-85. doi: 10.1080/1062936X.2015.1133700.

In vitro, in silico and integrated strategies for the estimation of plasma protein binding. A review.

Adv Drug Deliv Rev. 2015 Jun 23;86:27-45. doi: 10.1016/j.addr.2015.03.011. Epub 2015 Mar 27.

Transformer-based deep learning method for optimizing ADMET properties of lead compounds.

Phys Chem Chem Phys. 2023 Jan 18;25(3):2377-2385. doi: 10.1039/d2cp05332b.

引用本文的文献

Probing the dark chemical matter against PDE4 for the management of psoriasis using in silico, in vitro and in vivo approach.

Mol Divers. 2025 Aug;29(4):3449-3464. doi: 10.1007/s11030-025-11159-w. Epub 2025 Mar 17.

Enhancing property and activity prediction and interpretation using multiple molecular graph representations with MMGX.

Commun Chem. 2024 Apr 5;7(1):74. doi: 10.1038/s42004-024-01155-w.

Deep Learning for Drug Development: Using CNNs in MIA-QSAR to Predict Plasma Protein Binding of Drugs.

AAPS PharmSciTech. 2023 Nov 14;24(8):232. doi: 10.1208/s12249-023-02686-6.

Recent Studies of Artificial Intelligence on In Silico Drug Distribution Prediction.

Int J Mol Sci. 2023 Jan 17;24(3):1815. doi: 10.3390/ijms24031815.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

IDL-PPBopt：一种通过可解释深度学习方法预测和优化化合物与人血浆蛋白结合的策略。

IDL-PPBopt: A Strategy for Prediction and Optimization of Human Plasma Protein Binding of Compounds via an Interpretable Deep Learning Method.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献