Jia Shuopeng, She Weibin, Pi Zhipeng, Niu Buying, Zhang Jinhua, Lin Xihan, Xu Mingjun, She Weiya, Liao Jun
School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, 211198, Nanjing, Jiangsu Province, China.
Medical Affairs, Science and Education Department, Foshan Fosun Chancheng Hospital, #3 Sanyou South Road, Chancheng District, Foshan, Guangdong Province, 52800, China.
Environ Sci Pollut Res Int. 2022 Feb;29(7):9944-9956. doi: 10.1007/s11356-021-16372-2. Epub 2021 Sep 12.
Meteorological factors, which are periodic and regular in a long run, have an unignorable impact on human health. Accurate health risk prediction based on meteorological factors is essential for optimal allocation of resource in healthcare units. However, due to the non-stationary and non-linear nature of the original hospitalization sequence, traditional methods are less robust in predicting it. This study aims to investigate hospital admission prediction models using time series pre-processing algorithms and deep learning approach based on meteorological factors. Using the electronic medical record data from Panyu Central Hospital and meteorological data of Panyu district from 2003 to 2019, 46,089 eligible patients with lower respiratory tract infections (LRTIs) and four meteorological factors were identified to build and evaluate the prediction models. A novel hybrid model, Cascade GAM-CEEMDAN-LSTM Model (CGCLM), was established in combination with generalized additive model (GAM), complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), and long-short term memory (LSTM) networks for predicting daily admissions of patients with LRTIs. The experimental results show that CGCLM multistep method proposed in this paper outperforms single LSTM model in the prediction of health risk time series at different time window sizes. Moreover, our results also indicate that CGCLM has the best prediction performance when the time window is set to 61 days (RMSE = 1.12, MAE = 0.87, R = 0.93). Adequate extraction of exposure-response relationships between meteorological factors and diseases and suitable handling of sequence pre-processing have an important role in time series prediction. This hybrid climate-based model for predicting LRTIs disease can also be extended to time series prediction of other epidemic disease.
从长期来看具有周期性和规律性的气象因素,对人类健康有着不可忽视的影响。基于气象因素进行准确的健康风险预测,对于医疗单位资源的优化配置至关重要。然而,由于原始住院序列具有非平稳性和非线性的特点,传统方法在预测时稳健性较差。本研究旨在探讨基于气象因素,使用时间序列预处理算法和深度学习方法的医院入院预测模型。利用番禺中心医院的电子病历数据以及番禺区2003年至2019年的气象数据,确定了46089例符合条件的下呼吸道感染(LRTIs)患者和四个气象因素,以构建和评估预测模型。结合广义相加模型(GAM)、自适应噪声完备总体经验模态分解(CEEMDAN)和长短期记忆(LSTM)网络,建立了一种新型混合模型——级联GAM-CEEMDAN-LSTM模型(CGCLM),用于预测LRTIs患者的每日入院情况。实验结果表明,本文提出的CGCLM多步方法在不同时间窗口大小的健康风险时间序列预测中优于单一LSTM模型。此外,我们的结果还表明,当时间窗口设置为61天时,CGCLM具有最佳预测性能(均方根误差RMSE = 1.12,平均绝对误差MAE = 0.87,相关系数R = 0.93)。充分提取气象因素与疾病之间的暴露-反应关系以及对序列预处理进行适当处理,在时间序列预测中具有重要作用。这种基于气候的LRTIs疾病预测混合模型也可扩展到其他流行病的时间序列预测。