Suppr超能文献

极限学习机:一种以水质变量作为预测因子或不使用水质变量来建模溶解氧(DO)浓度的新方法。

Extreme learning machines: a new approach for modeling dissolved oxygen (DO) concentration with and without water quality variables as predictors.

作者信息

Heddam Salim, Kisi Ozgur

机构信息

Faculty of Science, Agronomy Department, Hydraulics Division University, 20 Août 1955, Route El Hadaik, BP 26, Skikda, Algeria.

School of Natural Sciences and Engineering, Ilia State University, Tbilisi, Georgia.

出版信息

Environ Sci Pollut Res Int. 2017 Jul;24(20):16702-16724. doi: 10.1007/s11356-017-9283-z. Epub 2017 May 30.

Abstract

In this paper, several extreme learning machine (ELM) models, including standard extreme learning machine with sigmoid activation function (S-ELM), extreme learning machine with radial basis activation function (R-ELM), online sequential extreme learning machine (OS-ELM), and optimally pruned extreme learning machine (OP-ELM), are newly applied for predicting dissolved oxygen concentration with and without water quality variables as predictors. Firstly, using data from eight United States Geological Survey (USGS) stations located in different rivers basins, USA, the S-ELM, R-ELM, OS-ELM, and OP-ELM were compared against the measured dissolved oxygen (DO) using four water quality variables, water temperature, specific conductance, turbidity, and pH, as predictors. For each station, we used data measured at an hourly time step for a period of 4 years. The dataset was divided into a training set (70%) and a validation set (30%). We selected several combinations of the water quality variables as inputs for each ELM model and six different scenarios were compared. Secondly, an attempt was made to predict DO concentration without water quality variables. To achieve this goal, we used the year numbers, 2008, 2009, etc., month numbers from (1) to (12), day numbers from (1) to (31) and hour numbers from (00:00) to (24:00) as predictors. Thirdly, the best ELM models were trained using validation dataset and tested with the training dataset. The performances of the four ELM models were evaluated using four statistical indices: the coefficient of correlation (R), the Nash-Sutcliffe efficiency (NSE), the root mean squared error (RMSE), and the mean absolute error (MAE). Results obtained from the eight stations indicated that: (i) the best results were obtained by the S-ELM, R-ELM, OS-ELM, and OP-ELM models having four water quality variables as predictors; (ii) out of eight stations, the OP-ELM performed better than the other three ELM models at seven stations while the R-ELM performed the best at one station. The OS-ELM models performed the worst and provided the lowest accuracy; (iii) for predicting DO without water quality variables, the R-ELM performed the best at seven stations followed by the S-ELM in the second place and the OP-ELM performed the worst with low accuracy; (iv) for the final application where training ELM models with validation dataset and testing with training dataset, the OP-ELM provided the best accuracy using water quality variables and the R-ELM performed the best at all eight stations without water quality variables. Fourthly, and finally, we compared the results obtained from different ELM models with those obtained using multiple linear regression (MLR) and multilayer perceptron neural network (MLPNN). Results obtained using MLPNN and MLR models reveal that: (i) using water quality variables as predictors, the MLR performed the worst and provided the lowest accuracy in all stations; (ii) MLPNN was ranked in the second place at two stations, in the third place at four stations, and finally, in the fourth place at two stations, (iii) for predicting DO without water quality variables, MLPNN is ranked in the second place at five stations, and ranked in the third, fourth, and fifth places in the remaining three stations, while MLR was ranked in the last place with very low accuracy at all stations. Overall, the results suggest that the ELM is more effective than the MLPNN and MLR for modelling DO concentration in river ecosystems.

摘要

在本文中,几种极限学习机(ELM)模型,包括具有 sigmoid 激活函数的标准极限学习机(S-ELM)、具有径向基激活函数的极限学习机(R-ELM)、在线序贯极限学习机(OS-ELM)和最优剪枝极限学习机(OP-ELM),被首次应用于以水质变量作为预测因子和不以水质变量作为预测因子来预测溶解氧浓度。首先,利用美国不同河流流域的八个美国地质调查局(USGS)站点的数据,将 S-ELM、R-ELM、OS-ELM 和 OP-ELM 与使用四个水质变量(水温、电导率、浊度和 pH)作为预测因子测得的溶解氧(DO)进行比较。对于每个站点,我们使用了为期 4 年、每小时测量一次的数据。数据集被分为训练集(70%)和验证集(30%)。我们为每个 ELM 模型选择了几种水质变量的组合作为输入,并比较了六种不同的情况。其次,尝试在不使用水质变量的情况下预测溶解氧浓度。为了实现这一目标,我们使用年份数字(2008、2009 等)、月份数字(1 到 12)、日期数字(1 到 31)和小时数字(00:00 到 24:00)作为预测因子。第三,使用验证数据集训练最佳的 ELM 模型,并使用训练数据集进行测试。使用四个统计指标评估了这四种 ELM 模型的性能:相关系数(R)、纳什 - 萨特克利夫效率(NSE)、均方根误差(RMSE)和平均绝对误差(MAE)。从八个站点获得的结果表明:(i)以四个水质变量作为预测因子的 S-ELM、R-ELM、OS-ELM 和 OP-ELM 模型取得了最佳结果;(ii)在八个站点中,OP-ELM 在七个站点的表现优于其他三种 ELM 模型,而 R-ELM 在一个站点表现最佳。OS-ELM 模型表现最差,准确率最低;(iii)在不使用水质变量预测溶解氧时,R-ELM 在七个站点表现最佳,其次是 S-ELM,OP-ELM 表现最差,准确率较低;(iv)对于使用验证数据集训练 ELM 模型并使用训练数据集进行测试的最终应用,OP-ELM 使用水质变量时提供了最佳准确率,而 R-ELM 在所有八个不使用水质变量的站点表现最佳。第四,也是最后一点,我们将不同 ELM 模型获得的结果与使用多元线性回归(MLR)和多层感知器神经网络(MLPNN)获得的结果进行了比较。使用 MLPNN 和 MLR 模型获得的结果表明:(i)以水质变量作为预测因子时,MLR 在所有站点表现最差,准确率最低;(ii)MLPNN 在两个站点排名第二,在四个站点排名第三,最后在两个站点排名第四;(iii)在不使用水质变量预测溶解氧时,MLPNN 在五个站点排名第二,在其余三个站点排名第三、第四和第五,而 MLR 在所有站点排名最后,准确率非常低。总体而言,结果表明 ELM 在模拟河流生态系统中的溶解氧浓度方面比 MLPNN 和 MLR 更有效。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验