Nath Pritthijit, Saha Pratik, Middya Asif Iqbal, Roy Sarbani
Department of Computer Science and Engineering, Jadavpur University, Kolkata, India.
Department of Computer Science, SRM University, Kattankulathur, Chennai, India.
Neural Comput Appl. 2021;33(19):12551-12570. doi: 10.1007/s00521-021-05901-2. Epub 2021 Apr 3.
Tackling air pollution has become of utmost importance since the last few decades. Different statistical as well as deep learning methods have been proposed till now, but seldom those have been used to forecast future long-term pollution trends. Forecasting long-term pollution trends into the future is highly important for government bodies around the globe as they help in the framing of efficient environmental policies. This paper presents a comparative study of various statistical and deep learning methods to forecast long-term pollution trends for the two most important categories of particulate matter (PM) which are PM2.5 and PM10. The study is based on Kolkata, a major city on the eastern side of India. The historical pollution data collected from government set-up monitoring stations in Kolkata are used to analyse the underlying patterns with the help of various time-series analysis techniques, which is then used to produce a forecast for the next two years using different statistical and deep learning methods. The findings reflect that statistical methods such as auto-regressive (AR), seasonal auto-regressive integrated moving average (SARIMA) and Holt-Winters outperform deep learning methods such as stacked, bi-directional, auto-encoder and convolution long short-term memory networks based on the limited data available.
在过去几十年里,应对空气污染已变得至关重要。到目前为止,已经提出了不同的统计方法以及深度学习方法,但很少有方法被用于预测未来长期的污染趋势。对全球各国政府机构而言,预测未来长期的污染趋势非常重要,因为这有助于制定有效的环境政策。本文对各种统计方法和深度学习方法进行了比较研究,以预测两种最重要的颗粒物(PM)类别即PM2.5和PM10的长期污染趋势。该研究以印度东部的主要城市加尔各答为基础。从加尔各答政府设立的监测站收集的历史污染数据,借助各种时间序列分析技术用于分析潜在模式,然后使用不同的统计方法和深度学习方法对未来两年进行预测。研究结果表明,基于现有的有限数据,自回归(AR)、季节性自回归整合移动平均(SARIMA)和霍尔特 - 温特斯等统计方法优于堆叠、双向、自动编码器和卷积长短期记忆网络等深度学习方法。