Laboratory of Energy and Environmental Science, Institute of Research and Development, Duy Tan University, Da Nang, 550000, Viet Nam; Faculty of Environmental Chemical Engineering, Duy Tan University, Da Nang, 550000, Viet Nam.
Department of Materials Science and Engineering, Institute of Green Manufacturing Technology, Korea University, Seoul 02841, Republic of Korea.
J Environ Manage. 2022 Jan 1;301:113868. doi: 10.1016/j.jenvman.2021.113868. Epub 2021 Oct 7.
Knowing the effluent quality of treatment systems in advance to enable the design of treatment systems that comply with environmental standards is a realistic strategy. This study aims to develop machine learning - based predictive models for designing the subsurface constructed wetlands (SCW). Data from the SCW literature during the period of 2009-2020 included 618 sets and 10 features. Five algorithms namely, Random forest, Classification and Regression trees, Support vector machines, K-nearest neighbors, and Cubist were compared to determine an optimal algorithm. All nine input features including the influent concentrations, C:N ratio, hydraulic loading rate, height, aeration, flow type, feeding, and filter type were confirmed as relevant features for the predictive algorithms. The comparative result revealed that Cubist is the best algorithm with the lowest RMSE (7.77 and 21.77 mg.L for NH-N and COD, respectively) corresponding to 84% of the variance in the effluents explained. The coefficient of determination of the Cubist algorithm obtained for NH-N and COD prediction from the test data were 0.92 and 0.93, respectively. Five case studies of the application of SCW design were also exercised and verified by the prediction model. Finally, a fully developed Cubist algorithm-based design tool for SCW was proposed.
提前了解处理系统的出水水质,以便设计符合环境标准的处理系统,是一种现实的策略。本研究旨在开发基于机器学习的预测模型,用于设计地下构造湿地(SCW)。2009 年至 2020 年期间,SCW 文献中的数据包括 618 组和 10 个特征。比较了五种算法,即随机森林、分类回归树、支持向量机、K-最近邻和 Cubist,以确定最佳算法。所有 9 个输入特征,包括进水浓度、C:N 比、水力负荷率、高度、曝气、流型、进料和过滤类型,都被确定为预测算法的相关特征。比较结果表明,Cubist 是最好的算法,其 RMSE 最低(NH-N 和 COD 分别为 7.77 和 21.77mg.L),对应的是出水解释方差的 84%。Cubist 算法对测试数据中 NH-N 和 COD 的预测的决定系数分别为 0.92 和 0.93。还对五个地下构造湿地设计的应用案例进行了应用,并通过预测模型进行了验证。最后,提出了一个基于 Cubist 算法的完整开发的地下构造湿地设计工具。