• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用梯度提升回归树和其他方法预测每日 COVID-19 病例:来自美国城市的证据。

Forecasting daily COVID-19 cases with gradient boosted regression trees and other methods: evidence from U.S. cities.

机构信息

Department of Economics, University of Waterloo, Waterloo, ON, Canada.

Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.

出版信息

Front Public Health. 2023 Dec 11;11:1259410. doi: 10.3389/fpubh.2023.1259410. eCollection 2023.

DOI:10.3389/fpubh.2023.1259410
PMID:38146480
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10749509/
Abstract

INTRODUCTION

There is a vast literature on the performance of different short-term forecasting models for country specific COVID-19 cases, but much less research with respect to city level cases. This paper employs daily case counts for 25 Metropolitan Statistical Areas (MSAs) in the U.S. to evaluate the efficacy of a variety of statistical forecasting models with respect to 7 and 28-day ahead predictions.

METHODS

This study employed Gradient Boosted Regression Trees (GBRT), Linear Mixed Effects (LME), Susceptible, Infectious, or Recovered (SIR), and Seasonal Autoregressive Integrated Moving Average (SARIMA) models to generate daily forecasts of COVID-19 cases from November 2020 to March 2021.

RESULTS

Consistent with other research that have employed Machine Learning (ML) based methods, we find that Median Absolute Percentage Error (MAPE) values for both 7-day ahead and 28-day ahead predictions from GBRTs are lower than corresponding values from SIR, Linear Mixed Effects (LME), and Seasonal Autoregressive Integrated Moving Average (SARIMA) specifications for the majority of MSAs during November-December 2020 and January 2021. GBRT and SARIMA models do not offer high-quality predictions for February 2021. However, SARIMA generated MAPE values for 28-day ahead predictions are slightly lower than corresponding GBRT estimates for March 2021.

DISCUSSION

The results of this research demonstrate that basic ML models can lead to relatively accurate forecasts at the local level, which is important for resource allocation decisions and epidemiological surveillance by policymakers.

摘要

简介

关于针对特定国家 COVID-19 病例的不同短期预测模型的表现,已经有大量文献,但关于城市层面病例的研究却很少。本文利用美国 25 个大都市统计区(MSA)的每日病例数,评估了各种统计预测模型在 7 天和 28 天预测方面的效果。

方法

本研究采用梯度提升回归树(GBRT)、线性混合效应(LME)、易感、感染或康复(SIR)和季节性自回归综合移动平均(SARIMA)模型,从 2020 年 11 月至 2021 年 3 月,对 COVID-19 病例进行每日预测。

结果

与其他采用基于机器学习(ML)的方法的研究一致,我们发现,对于大多数 MSA,GBRT 的 7 天和 28 天预测的中位数绝对百分比误差(MAPE)值均低于 SIR、线性混合效应(LME)和季节性自回归综合移动平均(SARIMA)的对应值,这是在 2020 年 11 月至 12 月和 2021 年 1 月期间。GBRT 和 SARIMA 模型对 2021 年 2 月的预测并不准确。然而,SARIMA 生成的 28 天预测的 MAPE 值略低于相应的 GBRT 估计值,适用于 2021 年 3 月。

讨论

本研究结果表明,基本的 ML 模型可以在地方层面实现相对准确的预测,这对于决策者的资源分配决策和流行病学监测非常重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/207eab74abab/fpubh-11-1259410-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/4acc991c1060/fpubh-11-1259410-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/fbce5cadfb6e/fpubh-11-1259410-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/ec9fc301dcec/fpubh-11-1259410-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/8bfbf1ebccc0/fpubh-11-1259410-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/234b38eac976/fpubh-11-1259410-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/207eab74abab/fpubh-11-1259410-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/4acc991c1060/fpubh-11-1259410-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/fbce5cadfb6e/fpubh-11-1259410-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/ec9fc301dcec/fpubh-11-1259410-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/8bfbf1ebccc0/fpubh-11-1259410-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/234b38eac976/fpubh-11-1259410-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bd/10749509/207eab74abab/fpubh-11-1259410-g0006.jpg

相似文献

1
Forecasting daily COVID-19 cases with gradient boosted regression trees and other methods: evidence from U.S. cities.用梯度提升回归树和其他方法预测每日 COVID-19 病例:来自美国城市的证据。
Front Public Health. 2023 Dec 11;11:1259410. doi: 10.3389/fpubh.2023.1259410. eCollection 2023.
2
Time series analysis-based seasonal autoregressive fractionally integrated moving average to estimate hepatitis B and C epidemics in China.基于时间序列分析的季节性自回归分数阶积分移动平均模型估计中国乙型和丙型肝炎流行情况。
World J Gastroenterol. 2023 Nov 14;29(42):5716-5727. doi: 10.3748/wjg.v29.i42.5716.
3
Do more stringent policies reduce daily COVID-19 case counts? Evidence from Canadian provinces.更严格的政策会降低新冠肺炎每日病例数吗?来自加拿大各省的证据。
Econ Anal Policy. 2023 Jun;78:225-242. doi: 10.1016/j.eap.2023.03.006. Epub 2023 Mar 8.
4
Applying SARIMA, ETS, and hybrid models for prediction of tuberculosis incidence rate in Taiwan.应用 SARIMA、ETS 和混合模型预测台湾的结核病发病率。
PeerJ. 2022 Sep 21;10:e13117. doi: 10.7717/peerj.13117. eCollection 2022.
5
A Hybrid Approach Based on Seasonal Autoregressive Integrated Moving Average and Neural Network Autoregressive Models to Predict Scorpion Sting Incidence in El Oued Province, Algeria, From 2005 to 2020.一种基于季节性自回归积分移动平均模型和神经网络自回归模型的混合方法,用于预测2005年至2020年阿尔及利亚瓦尔格拉省的蝎子蜇伤发病率。
J Res Health Sci. 2023 Sep 29;23(3):e00586. doi: 10.34172/jrhs.2023.121.
6
Application of a combined model with seasonal autoregressive integrated moving average and support vector regression in forecasting hand-foot-mouth disease incidence in Wuhan, China.季节性自回归积分滑动平均与支持向量回归组合模型在中国武汉手足口病发病率预测中的应用
Medicine (Baltimore). 2019 Feb;98(6):e14195. doi: 10.1097/MD.0000000000014195.
7
Forecasting COVID-19 Case Trends Using SARIMA Models during the Third Wave of COVID-19 in Malaysia.利用 SARIMA 模型预测马来西亚第三波 COVID-19 期间的 COVID-19 病例趋势。
Int J Environ Res Public Health. 2022 Jan 28;19(3):1504. doi: 10.3390/ijerph19031504.
8
Temporal trends analysis of tuberculosis morbidity in mainland China from 1997 to 2025 using a new SARIMA-NARNNX hybrid model.利用 SARIMA-NARNNX 混合模型分析 1997 年至 2025 年中国大陆结核病发病率的时间趋势。
BMJ Open. 2019 Jul 31;9(7):e024409. doi: 10.1136/bmjopen-2018-024409.
9
A new hybrid model SARIMA-ETS-SVR for seasonal influenza incidence prediction in mainland China.一种用于中国大陆季节性流感发病率预测的新型SARIMA-ETS-SVR混合模型。
J Infect Dev Ctries. 2023 Nov 30;17(11):1581-1590. doi: 10.3855/jidc.18037.
10
Improvement of Time Forecasting Models Using Machine Learning for Future Pandemic Applications Based on COVID-19 Data 2020-2022.基于2020 - 2022年新冠疫情数据,利用机器学习改进时间预测模型以用于未来大流行应用
Diagnostics (Basel). 2023 Mar 15;13(6):1121. doi: 10.3390/diagnostics13061121.

本文引用的文献

1
An interpretable hybrid predictive model of COVID-19 cases using autoregressive model and LSTM.基于自回归模型和 LSTM 的可解释混合 COVID-19 病例预测模型。
Sci Rep. 2023 Apr 25;13(1):6708. doi: 10.1038/s41598-023-33685-z.
2
Do more stringent policies reduce daily COVID-19 case counts? Evidence from Canadian provinces.更严格的政策会降低新冠肺炎每日病例数吗?来自加拿大各省的证据。
Econ Anal Policy. 2023 Jun;78:225-242. doi: 10.1016/j.eap.2023.03.006. Epub 2023 Mar 8.
3
Limitations of COVID-19 testing and case data for evidence-informed health policy and practice.
新冠病毒检测和病例数据在循证卫生政策和实践方面的局限性。
Health Res Policy Syst. 2023 Jan 25;21(1):11. doi: 10.1186/s12961-023-00963-1.
4
Modeling COVID-19 incidence with Google Trends.利用谷歌趋势对新冠疫情发病率进行建模。
Front Res Metr Anal. 2022 Sep 15;7:1003972. doi: 10.3389/frma.2022.1003972. eCollection 2022.
5
An evaluation of prospective COVID-19 modelling studies in the USA: from data to science translation.评估美国前瞻性 COVID-19 建模研究:从数据到科学转化。
Lancet Digit Health. 2022 Oct;4(10):e738-e747. doi: 10.1016/S2589-7500(22)00148-0.
6
Estimating the Effects of Non-Pharmaceutical Interventions and Population Mobility on Daily COVID-19 Cases: Evidence from Ontario.评估非药物干预措施和人口流动对每日新冠病毒病例的影响:来自安大略省的证据。
Can Public Policy. 2022 Mar 1;48(1):144-161. doi: 10.3138/cpp.2021-022.
7
Application of a data-driven XGBoost model for the prediction of COVID-19 in the USA: a time-series study.基于数据驱动的 XGBoost 模型在预测美国 COVID-19 中的应用:一项时间序列研究。
BMJ Open. 2022 Jul 1;12(7):e056685. doi: 10.1136/bmjopen-2021-056685.
8
Improving performance of deep learning predictive models for COVID-19 by incorporating environmental parameters.通过纳入环境参数提高用于新冠肺炎的深度学习预测模型的性能。
Gondwana Res. 2023 Feb;114:69-77. doi: 10.1016/j.gr.2022.03.014. Epub 2022 Apr 8.
9
A Novel Matrix Profile-Guided Attention LSTM Model for Forecasting COVID-19 Cases in USA.一种基于新型矩阵特征值引导注意力 LSTM 模型的美国新冠肺炎病例预测方法
Front Public Health. 2021 Oct 7;9:741030. doi: 10.3389/fpubh.2021.741030. eCollection 2021.
10
Improving prediction of COVID-19 evolution by fusing epidemiological and mobility data.通过融合流行病学和流动性数据来改进对 COVID-19 演变的预测。
Sci Rep. 2021 Jul 26;11(1):15173. doi: 10.1038/s41598-021-94696-2.