运用元学习为时间序列预测模型推荐合适的模型。

Using meta-learning to recommend an appropriate time-series forecasting model.

机构信息

Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad, Iran.

Department of Statistics, Ferdowsi University of Mashhad, Mashhad, Iran.

出版信息

BMC Public Health. 2024 Jan 10;24(1):148. doi: 10.1186/s12889-023-17627-y.

DOI:10.1186/s12889-023-17627-y

PMID:38200512

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10782782/

Abstract

BACKGROUND

There are various forecasting algorithms available for univariate time series, ranging from simple to sophisticated and computational. In practice, selecting the most appropriate algorithm can be difficult, because there are too many algorithms. Although expert knowledge is required to make an informed decision, sometimes it is not feasible due to the lack of such resources as time, money, and manpower.

METHODS

In this study, we used coronavirus disease 2019 (COVID-19) data, including the absolute numbers of confirmed, death and recovered cases per day in 187 countries from February 20, 2020, to May 25, 2021. Two popular forecasting models, including Auto-Regressive Integrated Moving Average (ARIMA) and exponential smoothing state-space model with Trigonometric seasonality, Box-Cox transformation, ARMA errors, Trend, and Seasonal components (TBATS) were used to forecast the data. Moreover, the data were evaluated by the root mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and symmetric mean absolute percentage error (SMAPE) criteria to label time series. The various characteristics of each time series based on the univariate time series structure were extracted as meta-features. After that, three machine-learning classification algorithms, including support vector machine (SVM), decision tree (DT), random forest (RF), and artificial neural network (ANN) were used as meta-learners to recommend an appropriate forecasting model.

RESULTS

The finding of the study showed that the DT model had a better performance in the classification of time series. The accuracy of DT in the training and testing phases was 87.50% and 82.50%, respectively. The sensitivity of the DT algorithm in the training phase was 86.58% and its specificity was 88.46%. Moreover, the sensitivity and specificity of the DT algorithm in the testing phase were 73.33% and 88%, respectively.

CONCLUSION

In general, the meta-learning approach was able to predict the appropriate forecasting model (ARIMA and TBATS) based on some time series features. Considering some characteristics of the desired COVID-19 time series, the ARIMA or TBATS forecasting model might be recommended to forecast the death, confirmed, and recovered trend cases of COVID-19 by the DT model.

摘要

背景

单变量时间序列有各种预测算法，从简单到复杂和计算密集型都有。在实践中，选择最合适的算法可能很困难，因为算法太多了。虽然需要专家知识来做出明智的决策，但有时由于缺乏时间、金钱和人力等资源，这并不可行。

方法

在这项研究中，我们使用了 2020 年 2 月 20 日至 2021 年 5 月 25 日来自 187 个国家的每日确诊、死亡和康复病例的绝对数量的新型冠状病毒疾病 2019（COVID-19）数据。我们使用了两种流行的预测模型，包括自回归综合移动平均（ARIMA）和具有三角函数季节性、Box-Cox 变换、ARMA 误差、趋势和季节性成分的指数平滑状态空间模型（TBATS）来预测数据。此外，我们使用均方根误差（RMSE）、平均绝对误差（MAE）、平均绝对百分比误差（MAPE）和对称平均绝对百分比误差（SMAPE）标准来评估数据，以标记时间序列。根据单变量时间序列结构提取了每个时间序列的各种特征作为元特征。之后，我们使用支持向量机（SVM）、决策树（DT）、随机森林（RF）和人工神经网络（ANN）这三种机器学习分类算法作为元学习者来推荐合适的预测模型。

结果

研究结果表明，DT 模型在时间序列分类方面表现更好。DT 在训练和测试阶段的准确率分别为 87.50%和 82.50%。DT 算法在训练阶段的灵敏度为 86.58%，特异性为 88.46%。此外，DT 算法在测试阶段的灵敏度和特异性分别为 73.33%和 88%。

结论

总的来说，元学习方法能够根据一些时间序列特征预测合适的预测模型（ARIMA 和 TBATS）。考虑到所需 COVID-19 时间序列的一些特征，DT 模型可能会推荐使用 ARIMA 或 TBATS 预测模型来预测 COVID-19 的死亡、确诊和康复病例趋势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c079/10782782/20113af45ffc/12889_2023_17627_Fig1_HTML.jpg

相似文献

Using meta-learning to recommend an appropriate time-series forecasting model.

BMC Public Health. 2024 Jan 10;24(1):148. doi: 10.1186/s12889-023-17627-y.

Forecasting the dynamics of cumulative COVID-19 cases (confirmed, recovered and deaths) for top-16 countries using statistical machine learning models: Auto-Regressive Integrated Moving Average (ARIMA) and Seasonal Auto-Regressive Integrated Moving Average (SARIMA).

Appl Soft Comput. 2021 May;103:107161. doi: 10.1016/j.asoc.2021.107161. Epub 2021 Feb 8.

Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach.

PLoS One. 2021 Mar 16;16(3):e0248597. doi: 10.1371/journal.pone.0248597. eCollection 2021.

A COVID-19 Pandemic Artificial Intelligence-Based System With Deep Learning Forecasting and Automatic Statistical Data Acquisition: Development and Implementation Study.

J Med Internet Res. 2021 May 20;23(5):e27806. doi: 10.2196/27806.

Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy.

Eur J Health Econ. 2022 Aug;23(6):917-940. doi: 10.1007/s10198-021-01347-4. Epub 2021 Aug 4.

Time series prediction of under-five mortality rates for Nigeria: comparative analysis of artificial neural networks, Holt-Winters exponential smoothing and autoregressive integrated moving average models.

BMC Med Res Methodol. 2020 Dec 3;20(1):292. doi: 10.1186/s12874-020-01159-9.

Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods.

Results Phys. 2021 Aug;27:104495. doi: 10.1016/j.rinp.2021.104495. Epub 2021 Jun 26.

A novel comparative study of NNAR approach with linear stochastic time series models in predicting tennis player's performance.

BMC Sports Sci Med Rehabil. 2024 Jan 25;16(1):28. doi: 10.1186/s13102-024-00815-7.

Predicting daily emergency department visits using machine learning could increase accuracy.

Am J Emerg Med. 2023 Mar;65:5-11. doi: 10.1016/j.ajem.2022.12.019. Epub 2022 Dec 21.

Forecasting COVID-19 in Pakistan.

PLoS One. 2020 Nov 30;15(11):e0242762. doi: 10.1371/journal.pone.0242762. eCollection 2020.

本文引用的文献

Neural network models for influenza forecasting with associated uncertainty using Web search activity trends.

PLoS Comput Biol. 2023 Aug 28;19(8):e1011392. doi: 10.1371/journal.pcbi.1011392. eCollection 2023 Aug.

The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA.

Int J Environ Res Public Health. 2022 Feb 7;19(3):1858. doi: 10.3390/ijerph19031858.

Exponentially Increasing Trend of Infected Patients with COVID-19 in Iran: A Comparison of Neural Network and ARIMA Forecasting Models.

Iran J Public Health. 2020 Oct;49(Suppl 1):92-100. doi: 10.18502/ijph.v49iS1.3675.

The COVID-19 pandemic: prediction study based on machine learning models.

Environ Sci Pollut Res Int. 2021 Aug;28(30):40496-40506. doi: 10.1007/s11356-021-13824-7. Epub 2021 Apr 10.

Modeling and forecasting number of confirmed and death caused COVID-19 in IRAN: A comparison of time series forecasting methods.

Biomed Signal Process Control. 2021 Apr;66:102494. doi: 10.1016/j.bspc.2021.102494. Epub 2021 Feb 10.

Manufacturing and service supply chain resilience to the COVID-19 outbreak: Lessons learned from the automobile and airline industries.

Technol Forecast Soc Change. 2021 Feb;163:120447. doi: 10.1016/j.techfore.2020.120447. Epub 2020 Nov 6.

Application of artificial neural networks to predict the COVID-19 outbreak.

Glob Health Res Policy. 2020 Nov 23;5(1):50. doi: 10.1186/s41256-020-00175-y.

Data analysis of Covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods.

Chaos Solitons Fractals. 2021 Jan;142:110512. doi: 10.1016/j.chaos.2020.110512. Epub 2020 Nov 28.

Forecasting COVID-19 daily cases using phone call data.

Appl Soft Comput. 2021 Mar;100:106932. doi: 10.1016/j.asoc.2020.106932. Epub 2020 Nov 25.

Implications of the coronavirus (COVID-19) outbreak for innovation: Which technologies will improve our lives?

Technol Forecast Soc Change. 2021 Feb;163:120451. doi: 10.1016/j.techfore.2020.120451. Epub 2020 Nov 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

运用元学习为时间序列预测模型推荐合适的模型。

Using meta-learning to recommend an appropriate time-series forecasting model.

机构信息

Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad, Iran.

Department of Statistics, Ferdowsi University of Mashhad, Mashhad, Iran.