Alqarni Mohammed, Alqarni Ali
Department of Pharmaceutical Chemistry, College of Pharmacy, Taif University, P.O. Box 11099, 21944, Taif, Saudi Arabia.
Department of Oral & Maxillofacial Surgery and Diagnostic Sciences, Faculty of Dentistry, Taif University, 21944, Taif, Saudi Arabia.
Sci Rep. 2025 Aug 7;15(1):28880. doi: 10.1038/s41598-025-13090-4.
Analysis of small-molecule drug solubility in binary solvents at different temperatures was carried out via several machine learning models and integration of models to optimize. We investigated the solubility of rivaroxaban in both dichloromethane and a variety of primary alcohols at various temperatures and concentrations of solvents to understand its behavior in mixed solvents. Given the complex, non-linear patterns in solubility behavior, three advanced regression approaches were utilized: Polynomial Curve Fitting, a Bayesian-based Neural Network (BNN), and the Neural Oblivious Decision Ensemble (NODE) method. To optimize model performance, hyperparameters were fine-tuned using the Stochastic Fractal Search (SFS) algorithm. Among the tested models, BNN obtained the best precision for fitting, with a test R² of 0.9926 and a MSE of 3.07 × 10⁻⁸, proving outstanding accuracy in fitting the rivaroxaban data. The NODE model followed BNN, showing a test R² of 0.9413 and the lowest MAPE of 0.1835. The Polynomial model yielded a lower test R² of 0.8200 and higher error rates, indicating its limitations in unravelling the underlying relationships for the solubility variations. This study shows that advanced machine learning models, particularly BNN and NODE, can predict pharmaceutical solubility and improve crystallization process design and optimization.
通过几种机器学习模型及模型集成优化,对不同温度下二元溶剂中小分子药物的溶解度进行了分析。我们研究了利伐沙班在二氯甲烷和多种伯醇中在不同温度及溶剂浓度下的溶解度,以了解其在混合溶剂中的行为。鉴于溶解度行为中复杂的非线性模式,采用了三种先进的回归方法:多项式曲线拟合、基于贝叶斯的神经网络(BNN)和神经遗忘决策集成(NODE)方法。为了优化模型性能,使用随机分形搜索(SFS)算法对超参数进行了微调。在测试的模型中,BNN拟合精度最高,测试R²为0.9九十九26,均方误差为3.07×10⁻⁸,在拟合利伐沙班数据方面显示出卓越的准确性。NODE模型仅次于BNN,测试R²为0.9413,最低平均绝对百分比误差为0.1835。多项式模型的测试R²较低,为0.8200,误差率较高,表明其在揭示溶解度变化潜在关系方面存在局限性。本研究表明,先进的机器学习模型,特别是BNN和NODE,可以预测药物溶解度,并改善结晶过程设计和优化。