Faculty of Applied Sciences, Malek Ashtar University of Technology, Iran.
Faculty of Applied Sciences, Malek Ashtar University of Technology, Iran.
Chemosphere. 2024 Feb;349:140855. doi: 10.1016/j.chemosphere.2023.140855. Epub 2023 Dec 2.
A novel approach is introduced for the reliable prediction of PUF-air partition coefficients of organic compounds, which can determine the environmental fate of organic compounds during interactions with air, soil, and water. The biggest accessible measured data of PUF-air partition coefficients for 170 chemicals are used to develop and test the novel model. In comparison to available quantitative structure-property relationship (QSPR) methods for the prediction of PUF-air partition coefficients that need complex descriptors, the here used descriptors are simpler. The assessed various statistical factors of the simple method containing 147 (training) and 23 (test) organic compounds can verify the external and internal cross-validations. Various statistical parameters confirm the high reliability of the novel model as compared with the outputs of complex multiple linear regression (MLR), artificial neural network (ANN) and support vector machine (SVM) methods. The values of R-squared (R), and root mean square error (RMSE) of the new model are for training/test sets are 0.924/0.894 and 0.374/0.318, respectively. Meanwhile, R and RMSE values for three comparative models training/test sets are (i) MLR: 0.848/0.670 (R) and 0.531/0.573 (RMSE); (ii) ANN: 0.902/0.664 (R) and 0.425/0.560 (RMSE); (iii) SVM: 0.935/0.794 (R) and 0.351/0.419 (RMSE). Thus, the new model the simplest approach with higher reliability in comparison to the best available methods.
引入了一种新的方法来可靠地预测有机化合物的 PUF-空气分配系数,这可以决定有机化合物在与空气、土壤和水相互作用时的环境归宿。该方法使用了可获得的最大的 170 种化学物质的 PUF-空气分配系数的实测数据来开发和测试新模型。与可用于预测 PUF-空气分配系数的可用定量结构-性质关系 (QSPR) 方法相比,该方法所需的复杂描述符更简单。评估了包含 147 种(训练)和 23 种(测试)有机化合物的简单方法的各种统计因素,可以验证外部和内部交叉验证。各种统计参数证实了新模型的高度可靠性,与复杂的多元线性回归 (MLR)、人工神经网络 (ANN) 和支持向量机 (SVM) 方法的输出相比。新模型的 R-平方 (R) 和均方根误差 (RMSE) 值对于训练/测试集分别为 0.924/0.894 和 0.374/0.318。同时,三个比较模型的训练/测试集的 R 和 RMSE 值为:(i) MLR:0.848/0.670 (R) 和 0.531/0.573 (RMSE);(ii) ANN:0.902/0.664 (R) 和 0.425/0.560 (RMSE);(iii) SVM:0.935/0.794 (R) 和 0.351/0.419 (RMSE)。因此,与最佳可用方法相比,新模型是一种最简单但可靠性更高的方法。