• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习预测不同温度下药物在二元溶剂混合物中的溶解度

Towards the prediction of drug solubility in binary solvent mixtures at various temperatures using machine learning.

作者信息

Bao Zeqing, Tom Gary, Cheng Austin, Watchorn Jeffrey, Aspuru-Guzik Alán, Allen Christine

机构信息

Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, M5S 3M2, Canada.

Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada.

出版信息

J Cheminform. 2024 Oct 28;16(1):117. doi: 10.1186/s13321-024-00911-3.

DOI:10.1186/s13321-024-00911-3
PMID:39468626
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11520512/
Abstract

Drug solubility is an important parameter in the drug development process, yet it is often tedious and challenging to measure, especially for expensive drugs or those available in small quantities. To alleviate these challenges, machine learning (ML) has been applied to predict drug solubility as an alternative approach. However, the majority of existing ML research has focused on the predictions of aqueous solubility and/or solubility at specific temperatures, which restricts the model applicability in pharmaceutical development. To bridge this gap, we compiled a dataset of 27,000 solubility datapoints, including solubility of small molecules measured in a range of binary solvent mixtures under various temperatures. Next, a panel of ML models were trained on this dataset with their hyperparameters tuned using Bayesian optimization. The resulting top-performing models, both gradient boosted decision trees (light gradient boosting machine and extreme gradient boosting), achieved mean absolute errors (MAE) of 0.33 for LogS (S in g/100 g) on the holdout set. These models were further validated through a prospective study, wherein the solubility of four drug molecules were predicted by the models and then validated with in-house solubility experiments. This prospective study demonstrated that the models accurately predicted the solubility of solutes in specific binary solvent mixtures under different temperatures, especially for drugs whose features closely align within the solutes in the dataset (MAE < 0.5 for LogS). To support future research and facilitate advancements in the field, we have made the dataset and code openly available. Scientific contribution Our research advances the state-of-the-art in predicting solubility for small molecules by leveraging ML and a uniquely comprehensive dataset. Unlike existing ML studies that predominantly focus on solubility in aqueous solvents at fixed temperatures, our work enables prediction of drug solubility in a variety of binary solvent mixtures over a broad temperature range, providing practical insights on the modeling of solubility for realistic pharmaceutical applications. These advancements along with the open access dataset and code support significant steps in the drug development process including new molecule discovery, drug analysis and formulation.

摘要

药物溶解度是药物研发过程中的一个重要参数,但测量起来往往繁琐且具有挑战性,尤其是对于昂贵药物或少量可用的药物。为了缓解这些挑战,机器学习(ML)已被应用于预测药物溶解度,作为一种替代方法。然而,大多数现有的ML研究都集中在水溶解度和/或特定温度下的溶解度预测上,这限制了模型在药物研发中的适用性。为了弥补这一差距,我们编制了一个包含27000个溶解度数据点的数据集,包括在不同温度下一系列二元溶剂混合物中测量的小分子溶解度。接下来,在这个数据集上训练了一组ML模型,并使用贝叶斯优化对其超参数进行了调整。由此产生的表现最佳的模型,即梯度提升决策树(轻梯度提升机和极端梯度提升),在验证集上对LogS(S以g/100 g为单位)的平均绝对误差(MAE)为0.33。这些模型通过一项前瞻性研究进一步得到验证,在该研究中,模型预测了四种药物分子的溶解度,然后通过内部溶解度实验进行了验证。这项前瞻性研究表明,这些模型准确地预测了溶质在不同温度下特定二元溶剂混合物中的溶解度,特别是对于其特征与数据集中溶质密切匹配的药物(LogS的MAE < 0.5)。为了支持未来的研究并促进该领域的进步,我们已将数据集和代码公开提供。科学贡献我们的研究通过利用ML和一个独特的全面数据集,推动了小分子溶解度预测方面的技术发展。与现有的主要关注固定温度下水溶性的ML研究不同,我们的工作能够预测药物在广泛温度范围内各种二元溶剂混合物中的溶解度,为实际药物应用中的溶解度建模提供了实用见解。这些进展以及开放获取的数据集和代码支持了药物研发过程中的重要步骤,包括新分子发现、药物分析和制剂开发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/36a02324b582/13321_2024_911_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/accbe8eb2fd3/13321_2024_911_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/19fa68e92f90/13321_2024_911_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/34f3d741ee10/13321_2024_911_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/a5b744e0bd9d/13321_2024_911_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/84b7a0dd3398/13321_2024_911_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/36a02324b582/13321_2024_911_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/accbe8eb2fd3/13321_2024_911_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/19fa68e92f90/13321_2024_911_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/34f3d741ee10/13321_2024_911_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/a5b744e0bd9d/13321_2024_911_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/84b7a0dd3398/13321_2024_911_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11520512/36a02324b582/13321_2024_911_Fig6_HTML.jpg

相似文献

1
Towards the prediction of drug solubility in binary solvent mixtures at various temperatures using machine learning.利用机器学习预测不同温度下药物在二元溶剂混合物中的溶解度
J Cheminform. 2024 Oct 28;16(1):117. doi: 10.1186/s13321-024-00911-3.
2
Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation.预测有机溶剂混合物中的药物溶解度:一种基于高通量实验的机器学习方法。
Int J Pharm. 2024 Jul 20;660:124233. doi: 10.1016/j.ijpharm.2024.124233. Epub 2024 May 18.
3
Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms.基于机器学习算法预测小分子化合物在有机溶剂中的溶解度
J Cheminform. 2021 Dec 11;13(1):98. doi: 10.1186/s13321-021-00575-3.
4
Predicting sulfanilamide solubility in the binary mixtures using a reference solvent approach.采用参考溶剂法预测磺胺类药物在二元混合物中的溶解度。
Polim Med. 2024 Jan-Jun;54(1):27-34. doi: 10.17219/pim/178284.
5
Intermolecular Interactions as a Measure of Dapsone Solubility in Neat Solvents and Binary Solvent Mixtures.分子间相互作用作为衡量氨苯砜在纯溶剂和二元溶剂混合物中溶解度的指标。
Materials (Basel). 2023 Sep 21;16(18):6336. doi: 10.3390/ma16186336.
6
Solubility Characteristics of Acetaminophen and Phenacetin in Binary Mixtures of Aqueous Organic Solvents: Experimental and Deep Machine Learning Screening of Green Dissolution Media.对乙酰氨基酚和非那西丁在有机二元混合水溶液中的溶解度特性:绿色溶解介质的实验研究与深度机器学习筛选
Pharmaceutics. 2022 Dec 16;14(12):2828. doi: 10.3390/pharmaceutics14122828.
7
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
8
Solubility Prediction of Drugs in Binary Solvent Mixtures at Various Temperatures Using a Minimum Number of Experimental Data Points.在不同温度下使用最少实验数据点预测药物在二元溶剂混合物中的溶解度。
AAPS PharmSciTech. 2018 Dec 17;20(1):10. doi: 10.1208/s12249-018-1244-4.
9
Review of the cosolvency models for predicting solubility in solvent mixtures: An update.溶剂混合物中溶解度预测的共溶剂模型综述:更新。
J Pharm Pharm Sci. 2019;22(1):466-485. doi: 10.18433/jpps30611.
10
Prediction of the Aqueous Solubility of Compounds Based on Light Gradient Boosting Machines with Molecular Fingerprints and the Cuckoo Search Algorithm.基于带有分子指纹和布谷鸟搜索算法的轻梯度提升机预测化合物的水溶性
ACS Omega. 2022 Nov 8;7(46):42027-42035. doi: 10.1021/acsomega.2c03885. eCollection 2022 Nov 22.

引用本文的文献

1
Systematic benchmarking of 13 AI methods for predicting cyclic peptide membrane permeability.用于预测环肽膜通透性的13种人工智能方法的系统基准测试。
J Cheminform. 2025 Aug 28;17(1):129. doi: 10.1186/s13321-025-01083-4.
2
A dataset on formulation parameters and characteristics of drug-loaded PLGA microparticles.一个关于载药聚乳酸-羟基乙酸共聚物(PLGA)微粒的制剂参数和特性的数据集。
Sci Data. 2025 Mar 1;12(1):364. doi: 10.1038/s41597-025-04621-9.

本文引用的文献

1
Revolutionizing drug formulation development: The increasing impact of machine learning.颠覆药物制剂研发:机器学习的影响日益增大。
Adv Drug Deliv Rev. 2023 Nov;202:115108. doi: 10.1016/j.addr.2023.115108. Epub 2023 Sep 27.
2
Application of Machine Learning in Material Synthesis and Property Prediction.机器学习在材料合成与性能预测中的应用。
Materials (Basel). 2023 Aug 31;16(17):5977. doi: 10.3390/ma16175977.
3
Practical guidelines for the use of gradient boosting for molecular property prediction.用于分子性质预测的梯度提升法实用指南。
J Cheminform. 2023 Aug 28;15(1):73. doi: 10.1186/s13321-023-00743-7.
4
The role of the indoles in microbiota-gut-brain axis and potential therapeutic targets: A focus on human neurological and neuropsychiatric diseases.吲哚在微生物群-肠道-大脑轴中的作用及其潜在的治疗靶点:关注人类神经和神经精神疾病。
Neuropharmacology. 2023 Nov 15;239:109690. doi: 10.1016/j.neuropharm.2023.109690. Epub 2023 Aug 22.
5
Bridging the Gap between Differential Mobility, Log , and Log Using Machine Learning and SHAP Analysis.利用机器学习和 SHAP 分析在差分迁移率、对数和对数之间架起桥梁。
Anal Chem. 2023 Jul 11;95(27):10309-10321. doi: 10.1021/acs.analchem.3c00921. Epub 2023 Jun 29.
6
Building Machine Learning Small Molecule Melting Points and Solubility Models Using CCDC Melting Points Dataset.使用 CCDC 熔点数据集构建机器学习小分子熔点和溶解度模型。
J Chem Inf Model. 2023 May 22;63(10):2948-2959. doi: 10.1021/acs.jcim.3c00308. Epub 2023 May 1.
7
Machine learning and drug discovery for neglected tropical diseases.机器学习与被忽视热带病药物研发。
BMC Bioinformatics. 2023 Apr 24;24(1):165. doi: 10.1186/s12859-022-05076-0.
8
New insight into gut microbiota and their metabolites in ischemic stroke: A promising therapeutic target.肠道微生物群及其代谢物在缺血性脑卒中的新认识:有希望的治疗靶点。
Biomed Pharmacother. 2023 Jun;162:114559. doi: 10.1016/j.biopha.2023.114559. Epub 2023 Mar 28.
9
Attention-Based Graph Neural Network for Molecular Solubility Prediction.基于注意力机制的图神经网络用于分子溶解度预测。
ACS Omega. 2023 Jan 12;8(3):3236-3244. doi: 10.1021/acsomega.2c06702. eCollection 2023 Jan 24.
10
Machine learning models to accelerate the design of polymeric long-acting injectables.机器学习模型加速聚合物长效注射剂的设计。
Nat Commun. 2023 Jan 10;14(1):35. doi: 10.1038/s41467-022-35343-w.