Hawas Mohamed
28 El Mobtadayan Street, El Monira, Cairo, Egypt.
Data Brief. 2020 Oct;32:106175. doi: 10.1016/j.dib.2020.106175. Epub 2020 Aug 19.
In light of the COVID-19 pandemic that has struck the world since the end of 2019, many endeavors have been carried out to overcome this crisis. Taking into consideration the uncertainty as a feature of forecasting, this data article introduces long-term time-series predictions for the virus's daily infections in Brazil by training forecasting models on limited raw data (30 time-steps and 40 time-steps alternatives). The primary reuse potential of this forecasting data is to enable decision-makers to develop action plans against the pandemic, and to help researchers working in infection prevention and control to: (1) explore limited data usage in predicting infections. (2) develop a reinforcement learning model on top of this data-lake, which can perform an online game between the trained models to generate a new capable model for predicting future true data. The prediction data was generated by training 4200 recurrent neural networks (54 to 84 days validation periods) on raw data from Johns Hopkins University's online repository, to pave the way for generating reliable extended long-term predictions.
鉴于自2019年底以来席卷全球的新冠疫情,人们为克服这一危机付出了诸多努力。考虑到预测具有不确定性这一特点,本文通过在有限的原始数据(30个时间步长和40个时间步长备选方案)上训练预测模型,介绍了巴西该病毒每日感染情况的长期时间序列预测。这些预测数据的主要潜在用途是使决策者能够制定应对疫情的行动计划,并帮助从事感染预防与控制工作的研究人员:(1)探索在预测感染情况时对有限数据的利用。(2)基于此数据湖开发强化学习模型,该模型可在训练好的模型之间进行在线博弈,以生成一个能够预测未来真实数据的新模型。预测数据是通过对约翰·霍普金斯大学在线存储库的原始数据训练4200个递归神经网络(验证期为54至84天)生成的,为生成可靠的长期扩展预测铺平道路。