Alotaibi Hadil Faris, Hsu Chou-Yi, Sead Fadhil Faez, Yadav Anupam, Jyothi S Renuka, Mishra Swati, Purohit Bilakshan, Ashirova Anorgul, Khudayberganov Islom, Chauhan Ashish Singh
Department of Pharmaceutical Sciences, College of Pharmacy, Princess Nourah Bint AbdulRahman University, Riyadh, 11671, Saudi Arabia.
Thunderbird School of Global Management, Arizona State University Tempe Campus, Phoenix, AZ, 85004, USA.
Sci Rep. 2025 Sep 11;15(1):32411. doi: 10.1038/s41598-025-17642-6.
One of the problems with new medications is their poor water solubility that is possible to be addressed by using supercritical method. This study aims to predict the solubility of raloxifene and the density of supercritical CO using temperature and pressure as inputs to analyze the supercritical processing for production of drug nanoparticles. Three regression models, Extra Trees (ET), Random Forest (RF), and Gradient Boosting (GB) were proposed and optimized using Gradient-based optimization to predict density and solubility of drug. In predicting the density of supercritical CO₂, GB attained an R² value of 0.986, reflecting an excellent agreement between its estimates and the true measurements. The model exhibited an RMSE of 23.20, indicating high accuracy, with a maximum error of 33.06. Regarding the solubility of raloxifene, the ET model yielded the highest R-squared score of 0.949, indicating a good fit to the data. The model exhibited an RMSE of 0.41, with a maximum error of 0.90. Comparatively, the RF and GB models obtained slightly lower precision, for the solubility of raloxifene. The RF model exhibited an RMSE of 0.55, while the GB model had an RMSE of 0.72. The optimized models were found to be reliable in predicting solubility and density within the supercritical processing field.
新药物存在的问题之一是其水溶性差,而采用超临界方法有可能解决这一问题。本研究旨在以温度和压力作为输入,预测雷洛昔芬的溶解度和超临界CO的密度,以分析用于生产药物纳米颗粒的超临界工艺。提出了三种回归模型,即极端随机树(ET)、随机森林(RF)和梯度提升(GB),并使用基于梯度的优化方法对其进行优化,以预测药物的密度和溶解度。在预测超临界CO₂的密度时,GB模型的R²值达到0.986,表明其估计值与真实测量值之间具有极好的一致性。该模型的均方根误差(RMSE)为23.20,表明精度较高,最大误差为33.06。关于雷洛昔芬的溶解度,ET模型的决定系数(R²)得分最高,为0.949,表明与数据拟合良好。该模型的RMSE为0.41,最大误差为0.90。相比之下,RF和GB模型对雷洛昔芬溶解度的预测精度略低。RF模型的RMSE为0.55,而GB模型的RMSE为0.72。结果发现,优化后的模型在预测超临界工艺领域内的溶解度和密度方面是可靠的。