Hossen Md Khalid, Peng Yan-Tsung, Shao Asher, Chen Meng Chang
Social Networks and Human-Centered Computing, TIGP, Academia Sinica, Taipei, 115, Taiwan.
Department of Computer Science, National Chengchi University, Taipei, 115, Taiwan.
Sci Rep. 2025 Jul 10;15(1):24830. doi: 10.1038/s41598-025-05958-2.
Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics-regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics-ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.
预测时间序列数据本质上是复杂的,这推动了先进神经网络方法的发展。由于影响PM2.5扩散的各种自然和人为因素相互作用,监测和预测PM2.5水平尤其具有挑战性,使得准确预测既昂贵又复杂。预测PM2.5浓度的一个关键挑战在于其变异性,因为数据分布会随时间显著波动。与此同时,神经网络在管理此类复杂性方面提供了一种经济高效且高度准确的解决方案。像长短期记忆(LSTM)和双向LSTM(BiLSTM)这样的深度学习模型已被广泛应用于PM2.5预测任务。然而,随着预测窗口从1小时扩展到72小时,预测误差会增加,这凸显了长期预测中不断上升的不确定性。具有连续时间隐藏状态的递归神经网络(RNN)非常适合对不规则采样的时间序列进行建模,但由于梯度消失或爆炸,在处理长期依赖关系时存在困难,基于常微分方程(ODE)的隐藏状态动态揭示了这一点——无论使用何种ODE求解器。由微分方程定义的连续时间神经过程受到数值求解器的限制,限制了可扩展性,并阻碍了对诸如神经动力学等复杂现象的建模——理想情况下通过闭式解来解决。与基于ODE的连续模型相比,闭式网络在可扩展性方面优于传统深度学习方法。作为连续时间神经网络,神经ODE在对时间序列数据的复杂动态进行建模方面表现出色,为传统LSTM模型提供了一个强大的替代方案。我们提出了两种基于ODE的模型:一种基于Transformer的ODE模型和一种闭式ODE模型。实证评估表明,这些模型显著提高了预测准确性,与基于LSTM的模型相比,1小时到8小时预测的改进幅度在2.91%到14.15%之间。此外,在进行配对t检验后,发现所提出模型(CCCFC)的RMSE值与BILSTM、LSTM、GRU、ODE-LSTM和PCNN、CNN-LSSTM的RMSE值有显著差异。这意味着CCCFC表现出明显的性能优势,增强了其在每小时PM2.5预测中的有效性。