• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较使用公共数据集和专有数据集训练的 logP 和 logD 校正模型。

Comparison of logP and logD correction models trained with public and proprietary data sets.

机构信息

Discovery Chemistry, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA.

出版信息

J Comput Aided Mol Des. 2022 Mar;36(3):253-262. doi: 10.1007/s10822-022-00450-9. Epub 2022 Apr 1.

DOI:10.1007/s10822-022-00450-9
PMID:35359246
Abstract

In drug discovery, partition and distribution coefficients, logP and logD for octanol/water, are widely used as metrics of the lipophilicity of molecules, which in turn have a strong influence on the bioactivity and bioavailability of potential drugs. There are a variety of established methods, mostly fragment or atom-based, to calculate logP while logD prediction generally relies on calculated logP and pKa for the estimation of neutral and ionized populations at a given pH. Algorithms such as ClogP have limitations generally leading to systematic errors for chemically related molecules while pKa estimation is generally more difficult due to the interplay of electronic, inductive and conjugation effects for ionizable moieties. We propose an integrated machine learning QSAR modeling approach to predict logD by training the model with experimental data while using ClogP and pKa predicted by commercial software as model descriptors. By optimizing the loss function for the ClogD calculated by the software, we build a correction model that incorporates both descriptors from the software and available experimental logD data. Additionally, we calculate logP from the logD model using the software predicted pKa's. Here, we have trained models using publicly or commercial available logD data to show that this approach can improve on commercial software predictions of lipophilicity. When applied to other logD data sets, this approach extends the domain of applicability of logD and logP predictions over commercial software. Performance of these models favorably compare with models built with a larger set of proprietary logD data.

摘要

在药物发现中,分配系数和分布系数(logP 和 logD,用于正辛醇/水)被广泛用作分子亲脂性的指标,而亲脂性又强烈影响潜在药物的生物活性和生物利用度。有多种已建立的方法,主要是基于片段或原子的方法来计算 logP,而 logD 的预测通常依赖于计算的 logP 和 pKa 来估计给定 pH 下中性和离子化群体的比例。ClogP 等算法通常会导致化学相关分子的系统误差,而 pKa 的估计通常更困难,因为可电离部分存在电子、诱导和共轭效应的相互作用。我们提出了一种集成的机器学习 QSAR 建模方法,通过使用商业软件预测的 ClogP 和 pKa 作为模型描述符来训练模型,从而预测 logD。通过优化软件计算的 ClogD 的损失函数,我们构建了一个校正模型,该模型包含了软件中的描述符和可用的实验 logD 数据。此外,我们还使用软件预测的 pKa 从 logD 模型中计算 logP。在这里,我们使用公开或商业上可用的 logD 数据来训练模型,以证明这种方法可以提高商业软件对亲脂性的预测。当应用于其他 logD 数据集时,这种方法扩展了商业软件的 logD 和 logP 预测的适用范围。这些模型的性能与使用更大的一组专有 logD 数据构建的模型相比具有优势。

相似文献

1
Comparison of logP and logD correction models trained with public and proprietary data sets.比较使用公共数据集和专有数据集训练的 logP 和 logD 校正模型。
J Comput Aided Mol Des. 2022 Mar;36(3):253-262. doi: 10.1007/s10822-022-00450-9. Epub 2022 Apr 1.
2
Determination of reversed-phase high performance liquid chromatography based octanol-water partition coefficients for neutral and ionizable compounds: Methodology evaluation.基于反相高效液相色谱法测定中性和可电离化合物的正辛醇-水分配系数:方法学评估
J Chromatogr A. 2017 Dec 15;1528:25-34. doi: 10.1016/j.chroma.2017.10.064. Epub 2017 Oct 27.
3
Application of ALOGPS to predict 1-octanol/water distribution coefficients, logP, and logD, of AstraZeneca in-house database.应用ALOGPS预测阿斯利康内部数据库中化合物的1-辛醇/水分配系数、logP和logD。
J Pharm Sci. 2004 Dec;93(12):3103-10. doi: 10.1002/jps.20217.
4
A deep learning approach for the blind logP prediction in SAMPL6 challenge.一种用于 SAMPL6 挑战赛中盲 logP 预测的深度学习方法。
J Comput Aided Mol Des. 2020 May;34(5):535-542. doi: 10.1007/s10822-020-00292-3. Epub 2020 Jan 30.
5
LogD7.4 prediction enhanced by transferring knowledge from chromatographic retention time, microscopic pKa and logP.通过从色谱保留时间、微观pKa和logP转移知识增强LogD7.4预测
J Cheminform. 2023 Sep 5;15(1):76. doi: 10.1186/s13321-023-00754-4.
6
[Computerized logP prediction using fragment methods].[使用片段方法进行计算机化的logP预测]
Acta Pharm Hung. 1998 Jan;68(1):39-48.
7
Structural Relationship Study of Octanol-Water Partition Coefficient of Some Sulfa Drugs Using GA-MLR and GA-ANN Methods.使用遗传算法-多元线性回归和遗传算法-人工神经网络方法研究一些磺胺类药物的辛醇-水分配系数的结构关系。
Curr Comput Aided Drug Des. 2020;16(3):207-221. doi: 10.2174/1573409915666190301124714.
8
Computation of octanol-water partition coefficients by guiding an additive model with knowledge.通过知识引导加和模型计算正辛醇-水分配系数。
J Chem Inf Model. 2007 Nov-Dec;47(6):2140-8. doi: 10.1021/ci700257y. Epub 2007 Nov 7.
9
Separation methods for estimating octanol-water partition coefficients.用于估算正辛醇-水分配系数的分离方法。
J Chromatogr B Analyt Technol Biomed Life Sci. 2003 Nov 25;797(1-2):3-19. doi: 10.1016/j.jchromb.2003.08.032.
10
Prediction of the -Octanol/Water Partition Coefficients of Basic Compounds Using Multi-Parameter QSRR Models Based on IS-RPLC Retention Behavior in a Wide pH Range.基于宽 pH 范围内 IS-RPLC 保留行为的多参数 QSRR 模型预测碱性化合物的辛醇/水分配系数。
Molecules. 2023 Feb 28;28(5):2270. doi: 10.3390/molecules28052270.

引用本文的文献

1
Characterization of Lipophilicity and Blood Partitioning of Pyrrolizidine Alkaloids and Their N-Oxides In Vitro and In Silico for Toxicokinetic Modeling.用于毒代动力学建模的吡咯里西啶生物碱及其N-氧化物的亲脂性和血液分配的体外和计算机模拟表征
Planta Med. 2025 Apr;91(5):274-282. doi: 10.1055/a-2523-3987. Epub 2025 Jan 23.
2
FormulationBCS: A Machine Learning Platform Based on Diverse Molecular Representations for Biopharmaceutical Classification System (BCS) Class Prediction.FormulationBCS:一种基于多种分子表征的机器学习平台,用于生物药剂分类系统(BCS)类别预测。
Mol Pharm. 2025 Jan 6;22(1):330-342. doi: 10.1021/acs.molpharmaceut.4c00946. Epub 2024 Dec 8.
3

本文引用的文献

1
Open-source QSAR models for pKa prediction using multiple machine learning approaches.使用多种机器学习方法进行pKa预测的开源定量构效关系模型
J Cheminform. 2019 Sep 18;11(1):60. doi: 10.1186/s13321-019-0384-1.
2
The giardicidal activity of lobendazole, fabomotizole, tenatoprazole and ipriflavone: A ligand-based virtual screening and in vitro study.洛苯达唑、法莫替唑、替那拉唑和依普黄酮的杀贾第虫活性:基于配体的虚拟筛选和体外研究。
Eur J Med Chem. 2021 Feb 5;211:113110. doi: 10.1016/j.ejmech.2020.113110. Epub 2020 Dec 17.
3
Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 Part II log P Challenge.
Machine learning in preclinical drug discovery.
机器学习在临床前药物发现中的应用。
Nat Chem Biol. 2024 Aug;20(8):960-973. doi: 10.1038/s41589-024-01679-1. Epub 2024 Jul 19.
4
An Automated Robotic Interface for Assays: Facilitating Machine Learning in Drug Discovery by the Automation of Physicochemical Property Assays.用于分析的自动化机器人接口:通过物理化学性质分析的自动化促进药物发现中的机器学习。
ACS Omega. 2024 May 24;9(23):24948-24958. doi: 10.1021/acsomega.4c02003. eCollection 2024 Jun 11.
5
LogD7.4 prediction enhanced by transferring knowledge from chromatographic retention time, microscopic pKa and logP.通过从色谱保留时间、微观pKa和logP转移知识增强LogD7.4预测
J Cheminform. 2023 Sep 5;15(1):76. doi: 10.1186/s13321-023-00754-4.
6
3D interaction homology: The hydrophobic residues alanine, isoleucine, leucine, proline and valine play different structural roles in soluble and membrane proteins.3D相互作用同源性:疏水性残基丙氨酸、异亮氨酸、亮氨酸、脯氨酸和缬氨酸在可溶性蛋白和膜蛋白中发挥不同的结构作用。
Front Mol Biosci. 2023 Mar 28;10:1116868. doi: 10.3389/fmolb.2023.1116868. eCollection 2023.
7
Machine Learning for Fast, Quantum Mechanics-Based Approximation of Drug Lipophilicity.基于量子力学的药物亲脂性快速近似的机器学习
ACS Omega. 2023 Jan 4;8(2):2046-2056. doi: 10.1021/acsomega.2c05607. eCollection 2023 Jan 17.
8
Revisiting the Use of Quantum Chemical Calculations in LogP Prediction.重新审视量子化学计算在预测 LogP 值中的应用。
Molecules. 2023 Jan 13;28(2):801. doi: 10.3390/molecules28020801.
评估 SAMPL6 第 II 部分 log P 挑战中辛醇-水分配系数预测的准确性。
J Comput Aided Mol Des. 2020 Apr;34(4):335-370. doi: 10.1007/s10822-020-00295-0. Epub 2020 Feb 27.
4
Systematic Modeling of log  Based on Ensemble Machine Learning, Group Contribution, and Matched Molecular Pair Analysis.基于集成机器学习、基团贡献和匹配分子对分析的对数系统建模。
J Chem Inf Model. 2020 Jan 27;60(1):63-76. doi: 10.1021/acs.jcim.9b00718. Epub 2020 Jan 10.
5
An extensive experimental survey of regression methods.回归方法的广泛实验调查。
Neural Netw. 2019 Mar;111:11-34. doi: 10.1016/j.neunet.2018.12.010. Epub 2018 Dec 21.
6
Prediction of pH-Dependent Hydrophobic Profiles of Small Molecules from Miertus-Scrocco-Tomasi Continuum Solvation Calculations.从 Miertus-Scrocco-Tomasi 连续溶剂化计算预测小分子的 pH 依赖的疏水轮廓。
J Phys Chem B. 2017 Oct 26;121(42):9868-9880. doi: 10.1021/acs.jpcb.7b08311. Epub 2017 Oct 13.
7
1-Octanol/Water Partition Coefficients of n-Alkanes from Molecular Simulations of Absolute Solvation Free Energies.基于绝对溶剂化自由能分子模拟的正构烷烃1-辛醇/水分配系数
J Chem Theory Comput. 2009 Sep 8;5(9):2436-46. doi: 10.1021/ct900214y.
8
A novel method for high throughput lipophilicity determination by microscale shake flask and liquid chromatography tandem mass spectrometry.一种通过微量摇瓶和液相色谱串联质谱法进行高通量亲脂性测定的新方法。
Comb Chem High Throughput Screen. 2013 Dec;16(10):817-25. doi: 10.2174/1386207311301010007.
9
Models for identification of erroneous atom-to-atom mapping of reactions performed by automated algorithms.用于识别自动化算法执行的反应中原子到原子映射错误的模型。
J Chem Inf Model. 2012 Dec 21;52(12):3116-22. doi: 10.1021/ci300418q. Epub 2012 Dec 5.
10
Lipophilicity in drug discovery.药物发现中的亲脂性。
Expert Opin Drug Discov. 2010 Mar;5(3):235-48. doi: 10.1517/17460441003605098.