分子亲脂性的计算：最新技术及对96000多种化合物的log P方法比较

Calculation of molecular lipophilicity: State-of-the-art and comparison of log P methods on more than 96,000 compounds.

作者信息

Mannhold Raimund, Poda Gennadiy I, Ostermann Claude, Tetko Igor V

机构信息

Molecular Drug Research Group, Heinrich-Heine-Universität, Universitätsstrasse 1, D-40225 Düsseldorf, Germany.

出版信息

J Pharm Sci. 2009 Mar;98(3):861-93. doi: 10.1002/jps.21494.

DOI:10.1002/jps.21494

PMID:18683876

Abstract

We first review the state-of-the-art in development of log P prediction approaches falling in two major categories: substructure-based and property-based methods. Then, we compare the predictive power of representative methods for one public (N = 266) and two in house datasets from Nycomed (N = 882) and Pfizer (N = 95809). A total of 30 and 18 methods were tested for public and industrial datasets, respectively. Accuracy of models declined with the number of nonhydrogen atoms. The Arithmetic Average Model (AAM), which predicts the same value (the arithmetic mean) for all compounds, was used as a baseline model for comparison. Methods with Root Mean Squared Error (RMSE) greater than RMSE produced by the AAM were considered as unacceptable. The majority of analyzed methods produced reasonable results for the public dataset but only seven methods were successful on the both in house datasets. We proposed a simple equation based on the number of carbon atoms, NC, and the number of hetero atoms, NHET: log P = 1.46(+/-0.02) + 0.11(+/-0.001) NC-0.11(+/-0.001) NHET. This equation outperformed a large number of programs benchmarked in this study. Factors influencing the accuracy of log P predictions were elucidated and discussed.

摘要

我们首先回顾了对数P预测方法发展的最新情况，这些方法主要分为两大类：基于子结构的方法和基于性质的方法。然后，我们比较了代表性方法对一个公共数据集（N = 266）以及来自奈科明公司（N = 882）和辉瑞公司（N = 95809）的两个内部数据集的预测能力。分别对公共数据集和工业数据集测试了总共30种和18种方法。模型的准确性随着非氢原子数量的增加而下降。算术平均模型（AAM）对所有化合物预测相同的值（算术平均值），被用作比较的基线模型。均方根误差（RMSE）大于AAM产生的RMSE的方法被认为是不可接受的。大多数分析方法对公共数据集产生了合理的结果，但只有七种方法在两个内部数据集上都取得了成功。我们提出了一个基于碳原子数NC和杂原子数NHET的简单方程：log P = 1.46（±0.02）+ 0.11（±0.001）NC - 0.11（±0.001）NHET。这个方程在本研究中优于大量基准程序。阐明并讨论了影响对数P预测准确性的因素。

相似文献

Calculation of molecular lipophilicity: State-of-the-art and comparison of log P methods on more than 96,000 compounds.

J Pharm Sci. 2009 Mar;98(3):861-93. doi: 10.1002/jps.21494.

Large-scale evaluation of log P predictors: local corrections may compensate insufficient accuracy and need of experimentally testing every other compound.

Chem Biodivers. 2009 Nov;6(11):1837-44. doi: 10.1002/cbdv.200900075.

Substructure and whole molecule approaches for calculating log P.

J Comput Aided Mol Des. 2001 Apr;15(4):337-54. doi: 10.1023/a:1011107422318.

A comparison of methods to handle skew distributed cost variables in the analysis of the resource consumption in schizophrenia treatment.

J Ment Health Policy Econ. 2002 Mar;5(1):21-31.

Comparison of Multiple Linear Regressions and Neural Networks based QSAR models for the design of new antitubercular compounds.

Eur J Med Chem. 2013;70:831-45. doi: 10.1016/j.ejmech.2013.10.029. Epub 2013 Oct 23.

ADME evaluation in drug discovery. 2. Prediction of partition coefficient by atom-additive approach based on atom-weighted solvent accessible surface areas.

J Chem Inf Comput Sci. 2003 May-Jun;43(3):1058-67. doi: 10.1021/ci034007m.

Application of ALOGPS 2.1 to predict log D distribution coefficient for Pfizer proprietary compounds.

J Med Chem. 2004 Nov 4;47(23):5601-4. doi: 10.1021/jm049509l.

Machine learning models for lipophilicity and their domain of applicability.

Mol Pharm. 2007 Jul-Aug;4(4):524-38. doi: 10.1021/mp0700413. Epub 2007 Jul 19.

In silico prediction of ionization constants of drugs.

Mol Pharm. 2007 Jul-Aug;4(4):498-512. doi: 10.1021/mp070019+. Epub 2007 Jul 13.

Lipophilicity of acidic compounds: impact of ion pair partitioning on drug design.

Bioorg Med Chem Lett. 2011 Jun 15;21(12):3550-6. doi: 10.1016/j.bmcl.2011.04.133. Epub 2011 May 5.

引用本文的文献

Study of the Lipophilicity of Tetracyclic Anticancer Azaphenothiazines.

Biomolecules. 2025 Aug 19;15(8):1194. doi: 10.3390/biom15081194.

Multi-fidelity graph neural networks for predicting toluene/water partition coefficients.

J Cheminform. 2025 Aug 8;17(1):123. doi: 10.1186/s13321-025-01057-6.

How THC works: Explaining ligand affinity for, and partial agonism of, cannabinoid receptor 1.

iScience. 2025 May 21;28(7):112706. doi: 10.1016/j.isci.2025.112706. eCollection 2025 Jul 18.

Chemical Targeting of the ATXN1 aa99-163 Interaction Site Suppresses polyQ-Expanded Protein Dimerization.

ACS Omega. 2025 Jun 17;10(25):27194-27205. doi: 10.1021/acsomega.5c02465. eCollection 2025 Jul 1.

The Hidden Crux of Correctly Determining Octanol-Water Partition Coefficients.

Mol Pharm. 2025 Aug 4;22(8):4930-4939. doi: 10.1021/acs.molpharmaceut.5c00552. Epub 2025 Jul 3.

ADME of Bromo-DragonFLY as an example of a new psychoactive substance (NPS) - application of in Silico methods for prediction: absorption, distribution, metabolism and excretion.

Sci Rep. 2025 Jul 2;15(1):22949. doi: 10.1038/s41598-025-06453-4.

Fluorescent PSMA-Targeted Radiotheranostic Compounds for Multiscale Imaging.

Bioconjug Chem. 2025 Jul 16;36(7):1448-1460. doi: 10.1021/acs.bioconjchem.5c00139. Epub 2025 Jun 27.

Enhancing ERα-targeted compound efficacy in breast cancer threapy with ExplainableAI and GeneticAlgorithm.

PLoS One. 2025 May 20;20(5):e0319673. doi: 10.1371/journal.pone.0319673. eCollection 2025.

Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.

Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.

Predicting Distribution Coefficients (LogD) of Cyclic Peptides Using Molecular Dynamics Simulations.

Pharm Res. 2025 Apr;42(4):613-622. doi: 10.1007/s11095-025-03850-2. Epub 2025 Mar 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

分子亲脂性的计算：最新技术及对96000多种化合物的log P方法比较

Calculation of molecular lipophilicity: State-of-the-art and comparison of log P methods on more than 96,000 compounds.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献