Cranfield University, School of Water, Energy and Environment, Cranfield, MK430AL, UK.
Cranfield University, School of Water, Energy and Environment, Cranfield, MK430AL, UK.
Chemosphere. 2019 Jan;215:388-395. doi: 10.1016/j.chemosphere.2018.10.056. Epub 2018 Oct 11.
Empirical data from a 6-month mesocosms experiment were used to assess the ability and performance of two machine learning (ML) models, including artificial neural network (NN) and random forest (RF), to predict temporal bioavailability changes of complex chemical mixtures in contaminated soils amended with compost or biochar. From the predicted bioavailability data, toxicity response for relevant ecological receptors was then forecasted to establish environmental risk implications and determine acceptable end-point remediation. The dataset corresponds to replicate samples collected over 180 days and analysed for total and bioavailable petroleum hydrocarbons and heavy metals/metalloids content. Further to this, a range of biological indicators including bacteria count, soil respiration, microbial community fingerprint, seeds germination, earthworm's lethality, and bioluminescent bacteria were evaluated to inform the environmental risk assessment. Parameters such as soil type, amendment (biochar and compost), initial concentration of individual compounds, and incubation time were used as inputs of the ML models. The relative importance of the input variables was also analysed to better understand the drivers of temporal changes in bioavailability and toxicity. It showed that toxicity changes can be driven by multiple factors (combined effects), which may not be accounted for in classical linear regression analysis (correlation). The use of ML models could improve our understanding of rate-limiting processes affecting the freely available fraction (bioavailable) of contaminants in soil, therefore contributing to mitigate potential risks and to inform appropriate response and recovery methods.
采用为期 6 个月的中观实验的经验数据,评估了两种机器学习(ML)模型(人工神经网络(NN)和随机森林(RF))的能力和性能,以预测受污染土壤中添加堆肥或生物炭后复杂化学混合物的生物可利用性随时间的变化。然后,根据预测的生物可利用性数据,预测相关生态受体的毒性反应,以建立环境风险影响,并确定可接受的终点修复。该数据集对应于在 180 天以上时间收集并分析总石油烃和生物可利用石油烃以及重金属/类金属含量的重复样本。除此之外,还评估了一系列生物指标,包括细菌计数、土壤呼吸、微生物群落指纹、种子发芽、蚯蚓致死率和发光细菌,以提供环境风险评估信息。土壤类型、添加物(生物炭和堆肥)、单个化合物的初始浓度和培养时间等参数被用作 ML 模型的输入。还分析了输入变量的相对重要性,以更好地了解生物可利用性和毒性随时间变化的驱动因素。结果表明,毒性变化可能由多个因素(综合效应)驱动,而经典线性回归分析(相关性)可能无法考虑这些因素。使用 ML 模型可以帮助我们更好地理解影响土壤中污染物自由部分(生物可利用部分)的限速过程,从而有助于减轻潜在风险,并为适当的应对和恢复方法提供信息。