Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan.
Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan.
Int J Environ Res Public Health. 2022 Sep 26;19(19):12180. doi: 10.3390/ijerph191912180.
Monitoring ex-situ water parameters, namely heavy metals, needs time and laboratory work for water sampling and analytical processes, which can retard the response to ongoing pollution events. Previous studies have successfully applied fast modeling techniques such as artificial intelligence algorithms to predict heavy metals. However, neither low-cost feature predictability nor explainability assessments have been considered in the modeling process. This study proposes a reliable and explainable framework to find an effective model and feature set to predict heavy metals in groundwater. The integrated assessment framework has four steps: model selection uncertainty, feature selection uncertainty, predictive uncertainty, and model interpretability. The results show that Random Forest is the most suitable model, and quick-measure parameters can be used as predictors for arsenic (As), iron (Fe), and manganese (Mn). Although the model performance is auspicious, it likely produces significant uncertainties. The findings also demonstrate that arsenic is related to nutrients and spatial distribution, while Fe and Mn are affected by spatial distribution and salinity. Some limitations and suggestions are also discussed to improve the prediction accuracy and interpretability.
监测原地水参数,即重金属,需要时间和实验室工作进行水样采集和分析过程,这可能会延迟对正在发生的污染事件的反应。先前的研究已经成功地应用了快速建模技术,如人工智能算法来预测重金属。然而,在建模过程中既没有考虑低成本特征的可预测性,也没有考虑可解释性评估。本研究提出了一个可靠且可解释的框架,以找到有效的模型和特征集来预测地下水中的重金属。综合评估框架有四个步骤:模型选择不确定性、特征选择不确定性、预测不确定性和模型可解释性。结果表明,随机森林是最合适的模型,快速测量参数可以作为砷(As)、铁(Fe)和锰(Mn)的预测因子。尽管模型性能很好,但它可能会产生很大的不确定性。研究结果还表明,砷与营养物质和空间分布有关,而 Fe 和 Mn 则受空间分布和盐度的影响。还讨论了一些局限性和建议,以提高预测精度和可解释性。