Institute of New Imaging Technologies (INIT), Universitat Jaume I, Castelló de la Plana, Castellón, Spain.
PLoS One. 2022 Jun 1;17(6):e0269295. doi: 10.1371/journal.pone.0269295. eCollection 2022.
Nitrogen dioxide is one of the pollutants with the most significant health effects. Advanced information on its concentration in the air can help to monitor and control further consequences more effectively, while also making it easier to apply preventive and mitigating measures. Machine learning technologies with available methods and capabilities, combined with the geospatial dimension, can perform predictive analyses with higher accuracy and, as a result, can serve as a supportive tool for productive management. One of the most advanced machine learning algorithms, Bidirectional convolutional LSTM, is being used in ongoing work to predict the concentration of nitrogen dioxide. The model has been validated to perform more accurate spatiotemporal analysis based on the integration of temporal and geospatial factors. The analysis was carried out according to two scenarios developed on the basis of selected features using data from the city of Madrid for the periods January-June 2019 and January-June 2020. Evaluation of the model's performance was conducted using the Root Mean Square Error and the Mean Absolute Error which emphasises the superiority of the proposed model over the reference models. In addition, the significance of a feature selection technique providing improved accuracy was underlined. In terms of execution time, due to the complexity of the Bidirectional convolutional LSTM architecture, convergence and generalisation of the data took longer, resulting in the superiority of the reference models.
二氧化氮是对健康影响最大的污染物之一。关于其在空气中的浓度的先进信息可以帮助更有效地监测和控制进一步的后果,同时也更容易采取预防和缓解措施。具有可用方法和功能的机器学习技术,结合地理空间维度,可以更准确地进行预测分析,因此可以作为一种支持性的管理工具。正在使用最先进的机器学习算法之一双向卷积长短期记忆网络来预测二氧化氮的浓度。该模型已根据时间和地理空间因素的集成进行了验证,以进行更准确的时空分析。根据使用马德里市 2019 年 1 月至 6 月和 2020 年 1 月至 6 月期间选定特征开发的两个场景进行了分析。使用均方根误差和平均绝对误差对模型性能进行了评估,这突出了所提出模型相对于参考模型的优越性。此外,强调了提供改进准确性的特征选择技术的重要性。就执行时间而言,由于双向卷积长短期记忆网络架构的复杂性,数据的收敛和泛化需要更长的时间,因此参考模型具有优势。