Euldji Imane, Si-Moussa Cherif, Hamadache Mabrouk, Benkortbi Othmane
Biomaterials and Transport Phenomena Laboratory (LBMPT), University of Yahia Fares, Faculty of Technology, Department of Process and Environmental Engineering, Medea, 26000, Algeria.
Mol Inform. 2022 Oct;41(10):e2200026. doi: 10.1002/minf.202200026. Epub 2022 Apr 26.
Quantitative structure-property relationship (QSPR) modeling was investigated to predict drug and drug-like compounds solubility in supercritical carbon dioxide. A dataset of 148 drug\drug-like compounds, accounting for 3971 experimental data points (EDPs), was collected and used for modelling the relationship between selected molecular descriptors and solubility fraction data achieved by a nonlinear approach (Artificial neural network, ANN) based on molecular descriptors. Experimental solubility data for a given drug were published as a function of temperature and pressure. In the present study, 11 significant PaDEL descriptors (AATS3v, MATS2e, GATS4c, GATS3v, GATS4e, GATS3 s, nBondsM, AVP-0, SHBd, MLogP, and MLFER_S), the temperature and the pressure were statistically proved to be sufficient inputs. The architecture of the optimized model was found to be {13,10,1}. Several statistical metrics, including average absolute relative deviation (AARD=3.7748 %), root mean square error (RMSE=0.5162), coefficient of correlation (r=0.9761), coefficient of determination (R =0.9528), and robustise (Q =0.9528) were used to validate the obtained model. The model was also subjected to an external test by using 143 EDPs. Sensitivity analysis and domain of application were examined. The overall results confirmed that the optimized ANN-QSPR model is suitable for the correlation and prediction of this property.
研究了定量结构-性质关系(QSPR)建模,以预测药物及类药物化合物在超临界二氧化碳中的溶解度。收集了一个包含148种药物/类药物化合物的数据集,该数据集涵盖3971个实验数据点(EDP),并用于基于分子描述符通过非线性方法(人工神经网络,ANN)建立所选分子描述符与溶解度分数数据之间的关系模型。给定药物的实验溶解度数据作为温度和压力的函数公布。在本研究中,经统计证明11个重要的PaDEL描述符(AATS3v、MATS2e、GATS4c、GATS3v、GATS4e、GATS3s、nBondsM、AVP-0、SHBd、MLogP和MLFER_S)、温度和压力是足够的输入变量。发现优化模型的架构为{13,10,1}。使用了几个统计指标,包括平均绝对相对偏差(AARD = 3.7748%)、均方根误差(RMSE = 0.5162)、相关系数(r = 0.9761)、决定系数(R = 0.9528)和稳健性(Q = 0.9528)来验证所获得的模型。该模型还使用143个EDP进行了外部测试。进行了敏感性分析和应用领域研究。总体结果证实,优化的ANN-QSPR模型适用于该性质的关联和预测。