School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland.
School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland.
J Environ Manage. 2022 Nov 1;321:115923. doi: 10.1016/j.jenvman.2022.115923. Epub 2022 Aug 19.
Coastal water quality assessment is an essential task to keep "good water quality" status for living organisms in coastal ecosystems. The Water quality index (WQI) is a widely used tool to assess water quality but this technique has received much criticism due to the model's reliability and inconsistence. The present study used a recently developed improved WQI model for calculating coastal WQIs in Cork Harbour. The aim of the research is to determine the most reliable and robust machine learning (ML) algorithm(s) to anticipate WQIs at each monitoring point instead of repeatedly employing SI and weight values in order to reduce model uncertainty. In this study, we compared eight commonly used algorithms, including Random Forest (RF), Decision Tree (DT), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGB), Extra Tree (ExT), Support Vector Machine (SVM), Linear Regression (LR), and Gaussian Naïve Bayes (GNB). For the purposes of developing the prediction models, the dataset was divided into two groups: training (70%) and testing (30%), whereas the models were validated using the 10-fold cross-validation method. In order to evaluate the models' performance, the RMSE, MSE, MAE, R, and PREI metrics were used in this study. The tree-based DT (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R = 1.0 and PERI = 0.0) and the ExT (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R = 1.0 and PERI = 0.0) and ensemble tree-based XGB (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R = 1.0 and PERI = +0.16 to -0.17) and RF (RMSE = 2.0, MSE = 3.80, MAE = 1.10, R = 0.98, PERI = +3.52 to -25.38) models outperformed other models. The results of model performance and PREI indicate that the DT, ExT, and GXB models could be effective, robust and significantly reduce model uncertainty in predicting WQIs. The findings of this study are also useful for reducing model uncertainty and optimizing the WQM-WQI model architecture for predicting WQI values.
沿海水质评估是维持沿海生态系统中生物“良好水质”状态的一项重要任务。水质指数(WQI)是一种广泛用于评估水质的工具,但由于模型的可靠性和一致性问题,该技术受到了广泛批评。本研究使用了一种新开发的改进的 WQI 模型来计算科克港的沿海 WQI。本研究的目的是确定最可靠和最强大的机器学习(ML)算法,以预测每个监测点的 WQI,而不是反复使用 SI 和权重值,以减少模型不确定性。在这项研究中,我们比较了包括随机森林(RF)、决策树(DT)、K-最近邻(KNN)、极端梯度提升(XGB)、Extra Tree(ExT)、支持向量机(SVM)、线性回归(LR)和高斯朴素贝叶斯(GNB)在内的八种常用算法。为了开发预测模型,数据集被分为两组:训练(70%)和测试(30%),而模型则使用 10 折交叉验证方法进行验证。为了评估模型的性能,本研究使用了 RMSE、MSE、MAE、R 和 PREI 指标。基于树的 DT(RMSE=0.0,MSE=0.0,MAE=0.0,R=1.0,PERI=0.0)和 ExT(RMSE=0.0,MSE=0.0,MAE=0.0,R=1.0,PERI=0.0)以及基于集合的树 XGB(RMSE=0.0,MSE=0.0,MAE=0.0,R=1.0,PERI=+0.16 到-0.17)和 RF(RMSE=2.0,MSE=3.80,MAE=1.10,R=0.98,PERI=+3.52 到-25.38)模型的性能和 PREI 指标优于其他模型。模型性能和 PREI 的结果表明,DT、ExT 和 GXB 模型在预测 WQI 方面可能是有效、稳健的,并且可以显著降低模型不确定性。本研究的结果也有助于降低模型不确定性,并优化 WQM-WQI 模型架构以预测 WQI 值。