Alotaibi Hadil Faris, Hassan Waqed H, Al-Nussairi Ahmed Kateb Jumaah, Singh Narinderjit Singh Sawaran, Al-Shati Ahmed Salah, Rekha M M, Ray Subhashree, Sinha Aashna, Garg Gunjan
Department of Pharmaceutical Sciences, College of Pharmacy, Princess Nourah Bint AbdulRahman University, 11671, Riyadh, Saudi Arabia.
University of Warith Al-Anbiyaa, Kerbala, 56001, Iraq.
Sci Rep. 2025 Aug 10;15(1):29248. doi: 10.1038/s41598-025-15049-x.
The solubility of medications in supercritical solvent is the most important factor that can be determined via appropriate computational tools. This work explores the modeling of digitoxin solubility as the case study in supercritical CO and solvent density utilizing ensemble methods. Temperature and pressure are the input parameters, while solvent density and digitoxin solubility are the output parameters. Several machine learning models along with optimizer were used for correlation of the dataset. Employing AdaBoost as an ensemble method, predictions from Bayesian Ridge Regression (BRR), Gaussian process regression (GPR), and K-nearest neighbors (KNN) are amalgamated. Sailfish Optimizer (SFO) is utilized for hyper-parameter tuning to enhance model performance. Results reveal that AdaBoost combined with ADA-GPR exhibits the lowest Average Absolute Relative Deviation (AARD%) values, with solubility achieving 7.74 and solvent density reaching 2.76, respectively. This underscores the efficacy of ensemble methods and hyper-parameter tuning in accurately predicting complex chemical properties in supercritical CO systems.
药物在超临界溶剂中的溶解度是可以通过适当的计算工具确定的最重要因素。本研究以洋地黄毒苷溶解度建模为例,利用集成方法探讨超临界CO和溶剂密度。温度和压力是输入参数,而溶剂密度和洋地黄毒苷溶解度是输出参数。使用了几种机器学习模型以及优化器来关联数据集。采用AdaBoost作为集成方法,将贝叶斯岭回归(BRR)、高斯过程回归(GPR)和K近邻(KNN)的预测结果进行合并。利用旗鱼优化器(SFO)进行超参数调整以提高模型性能。结果表明,AdaBoost与ADA-GPR相结合时平均绝对相对偏差(AARD%)值最低,溶解度分别达到7.74,溶剂密度达到2.76。这突出了集成方法和超参数调整在准确预测超临界CO系统中复杂化学性质方面的有效性。