Suppr超能文献

利用机器学习通过分子描述符和液相色谱保留时间预测分配系数(log )和分布系数(log )

Using Machine Learning To Predict Partition Coefficient (Log ) and Distribution Coefficient (Log ) with Molecular Descriptors and Liquid Chromatography Retention Time.

作者信息

Win Zaw-Myo, Cheong Allen M Y, Hopkins W Scott

机构信息

Centre for Eye and Vision Research, Hong Kong Science Park, New Territories 999077, Hong Kong.

School of Optometry, The Hong Kong Polytechnic University, Kowloon 999077, Hong Kong.

出版信息

J Chem Inf Model. 2023 Apr 10;63(7):1906-1913. doi: 10.1021/acs.jcim.2c01373. Epub 2023 Mar 16.

Abstract

During preclinical evaluations of drug candidates, several physicochemical (p-chem) properties are measured and employed as metrics to estimate drug efficacy in vivo. Two such p-chem properties are the octanol-water partition coefficient, Log , and distribution coefficient, Log , which are useful in estimating the distribution of drugs within the body. Log and Log are traditionally measured using the shake-flask method and high-performance liquid chromatography. However, it is challenging to measure these properties for species that are very hydrophobic (or hydrophilic) owing to the very low equilibrium concentrations partitioned into octanol (or aqueous) phases. Moreover, the shake-flask method is relatively time-consuming and can require multistep dilutions as the range of analyte concentrations can differ by several orders of magnitude. Here, we circumvent these limitations by using machine learning (ML) to correlate Log and Log with liquid chromatography (LC) retention time (RT). Predictive models based on four ML algorithms, which used molecular descriptors and LC RTs as features, were extensively tested and compared. The inclusion of RT as an additional descriptor improves model performance (MAE = 0.366 and = 0.89), and Shapley additive explanations analysis indicates that RT has the highest impact on model accuracy.

摘要

在候选药物的临床前评估过程中,会测量几种物理化学(p-chem)性质,并将其用作评估体内药物疗效的指标。其中两种这样的p-chem性质是辛醇-水分配系数Log 和分布系数Log ,它们在估计药物在体内的分布方面很有用。传统上,Log 和Log 是使用摇瓶法和高效液相色谱法测量的。然而,对于非常疏水(或亲水)的物种,由于分配到辛醇(或水)相中的平衡浓度非常低,测量这些性质具有挑战性。此外,摇瓶法相对耗时,并且由于分析物浓度范围可能相差几个数量级,可能需要进行多步稀释。在这里,我们通过使用机器学习(ML)将Log 和Log 与液相色谱(LC)保留时间(RT)相关联来规避这些限制。基于四种ML算法的预测模型,使用分子描述符和LC RTs作为特征,进行了广泛的测试和比较。将RT作为额外的描述符纳入可提高模型性能(平均绝对误差=0.366, =0.89),并且Shapley附加解释分析表明RT对模型准确性的影响最大。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验