• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习实时预测 215 个国家和地区的每日 COVID-19 发病率:模型开发和验证。

Real-time Prediction of the Daily Incidence of COVID-19 in 215 Countries and Territories Using Machine Learning: Model Development and Validation.

机构信息

School of Electronics and Information Engineering, Soochow University, Suzhou, China.

Joint Shantou International Eye Center, Shantou University & the Chinese University of Hong Kong, Shantou, China.

出版信息

J Med Internet Res. 2021 Jun 14;23(6):e24285. doi: 10.2196/24285.

DOI:10.2196/24285
PMID:34081607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8204940/
Abstract

BACKGROUND

Advanced prediction of the daily incidence of COVID-19 can aid policy making on the prevention of disease spread, which can profoundly affect people's livelihood. In previous studies, predictions were investigated for single or several countries and territories.

OBJECTIVE

We aimed to develop models that can be applied for real-time prediction of COVID-19 activity in all individual countries and territories worldwide.

METHODS

Data of the previous daily incidence and infoveillance data (search volume data via Google Trends) from 215 individual countries and territories were collected. A random forest regression algorithm was used to train models to predict the daily new confirmed cases 7 days ahead. Several methods were used to optimize the models, including clustering the countries and territories, selecting features according to the importance scores, performing multiple-step forecasting, and upgrading the models at regular intervals. The performance of the models was assessed using the mean absolute error (MAE), root mean square error (RMSE), Pearson correlation coefficient, and Spearman correlation coefficient.

RESULTS

Our models can accurately predict the daily new confirmed cases of COVID-19 in most countries and territories. Of the 215 countries and territories under study, 198 (92.1%) had MAEs <10 and 187 (87.0%) had Pearson correlation coefficients >0.8. For the 215 countries and territories, the mean MAE was 5.42 (range 0.26-15.32), the mean RMSE was 9.27 (range 1.81-24.40), the mean Pearson correlation coefficient was 0.89 (range 0.08-0.99), and the mean Spearman correlation coefficient was 0.84 (range 0.2-1.00).

CONCLUSIONS

By integrating previous incidence and Google Trends data, our machine learning algorithm was able to predict the incidence of COVID-19 in most individual countries and territories accurately 7 days ahead.

摘要

背景

对 COVID-19 日发病率的高级预测有助于制定预防疾病传播的政策,这将深刻影响人们的生活。在之前的研究中,对单个或多个国家和地区的预测进行了调查。

目的

我们旨在开发可用于实时预测全球所有单个国家和地区 COVID-19 活动的模型。

方法

收集了来自 215 个国家和地区的过去每日发病率和信息监测数据(通过 Google Trends 的搜索量数据)。使用随机森林回归算法训练模型,以预测 7 天前的每日新确诊病例数。使用多种方法优化模型,包括对国家和地区进行聚类、根据重要性得分选择特征、进行多步预测以及定期升级模型。使用平均绝对误差 (MAE)、均方根误差 (RMSE)、皮尔逊相关系数和斯皮尔曼相关系数评估模型的性能。

结果

我们的模型可以准确预测大多数国家和地区的 COVID-19 日新确诊病例数。在所研究的 215 个国家和地区中,198 个(92.1%)的 MAE<10,187 个(87.0%)的 Pearson 相关系数>0.8。对于 215 个国家和地区,平均 MAE 为 5.42(范围 0.26-15.32),平均 RMSE 为 9.27(范围 1.81-24.40),平均 Pearson 相关系数为 0.89(范围 0.08-0.99),平均斯皮尔曼相关系数为 0.84(范围 0.2-1.00)。

结论

通过整合过去的发病率和 Google Trends 数据,我们的机器学习算法能够准确预测大多数单个国家和地区 7 天前的 COVID-19 发病率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/39b418e6826b/jmir_v23i6e24285_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/13800e598328/jmir_v23i6e24285_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/ddc1f91fe05d/jmir_v23i6e24285_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/00e19ab45d56/jmir_v23i6e24285_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/39b418e6826b/jmir_v23i6e24285_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/13800e598328/jmir_v23i6e24285_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/ddc1f91fe05d/jmir_v23i6e24285_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/00e19ab45d56/jmir_v23i6e24285_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2128/8204940/39b418e6826b/jmir_v23i6e24285_fig4.jpg

相似文献

1
Real-time Prediction of the Daily Incidence of COVID-19 in 215 Countries and Territories Using Machine Learning: Model Development and Validation.利用机器学习实时预测 215 个国家和地区的每日 COVID-19 发病率:模型开发和验证。
J Med Internet Res. 2021 Jun 14;23(6):e24285. doi: 10.2196/24285.
2
Forecasting the COVID-19 Epidemic by Integrating Symptom Search Behavior Into Predictive Models: Infoveillance Study.将症状搜索行为纳入预测模型预测 COVID-19 疫情:信息监测研究。
J Med Internet Res. 2021 Aug 11;23(8):e28876. doi: 10.2196/28876.
3
Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models.COVID-19 每日新增病例和累计病例的预测和分析:时间序列预测和机器学习模型。
BMC Infect Dis. 2022 May 25;22(1):495. doi: 10.1186/s12879-022-07472-6.
4
A COVID-19 Pandemic Artificial Intelligence-Based System With Deep Learning Forecasting and Automatic Statistical Data Acquisition: Development and Implementation Study.一种基于人工智能的新冠肺炎大流行深度学习预测与自动统计数据采集系统:开发与实施研究
J Med Internet Res. 2021 May 20;23(5):e27806. doi: 10.2196/27806.
5
Retrospective analysis of the accuracy of predicting the alert level of COVID-19 in 202 countries using Google Trends and machine learning.利用谷歌趋势和机器学习对 202 个国家的 COVID-19 警戒级别进行预测的准确性进行回顾性分析。
J Glob Health. 2020 Dec;10(2):020511. doi: 10.7189/jogh.10.020511.
6
The Causality Inference of Public Interest in Restaurants and Bars on Daily COVID-19 Cases in the United States: Google Trends Analysis.餐馆和酒吧的公众关注度对美国每日 COVID-19 病例的因果推断:谷歌趋势分析。
JMIR Public Health Surveill. 2021 Apr 6;7(4):e22880. doi: 10.2196/22880.
7
Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants.基于气象因素和空气污染物的结核病发病率机器学习预测模型。
Int J Environ Res Public Health. 2023 Feb 22;20(5):3910. doi: 10.3390/ijerph20053910.
8
Using Reports of Symptoms and Diagnoses on Social Media to Predict COVID-19 Case Counts in Mainland China: Observational Infoveillance Study.利用社交媒体上的症状报告和诊断信息预测中国大陆的新冠肺炎病例数:观察性信息监测研究
J Med Internet Res. 2020 May 28;22(5):e19421. doi: 10.2196/19421.
9
Monitoring SARS-CoV-2 Using Infoveillance, National Reporting Data, and Wastewater in Wales, United Kingdom: Mixed Methods Study.利用信息监测、国家报告数据和英国威尔士的污水监测 SARS-CoV-2:混合方法研究。
JMIR Infodemiology. 2023 Nov 23;3:e43891. doi: 10.2196/43891.
10
Machine Learning-Based Prediction of Growth in Confirmed COVID-19 Infection Cases in 114 Countries Using Metrics of Nonpharmaceutical Interventions and Cultural Dimensions: Model Development and Validation.基于机器学习的 114 个国家确诊 COVID-19 感染病例增长预测:使用非药物干预措施和文化维度指标的模型开发和验证。
J Med Internet Res. 2021 Apr 23;23(4):e26628. doi: 10.2196/26628.

引用本文的文献

1
IECata: interpretable bilinear attention network and evidential deep learning improve the catalytic efficiency prediction of enzymes.IECata:可解释的双线性注意力网络和证据深度学习改进了酶的催化效率预测
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf283.
2
Machine Learning Techniques Applied to COVID-19 Prediction: A Systematic Literature Review.应用于新冠肺炎预测的机器学习技术:系统文献综述
Bioengineering (Basel). 2025 May 13;12(5):514. doi: 10.3390/bioengineering12050514.
3
Association Between School-Related Google Trends Search Volume and Suicides Among Children and Adolescents in Japan During 2016-2020: Retrospective Observational Study With a Time-Series Analysis.

本文引用的文献

1
Predicting the growth and trend of COVID-19 pandemic using machine learning and cloud computing.利用机器学习和云计算预测新冠疫情的发展与趋势。
Internet Things (Amst). 2020 Sep;11:100222. doi: 10.1016/j.iot.2020.100222. Epub 2020 May 12.
2
Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil.新型冠状病毒肺炎累计确诊病例的短期预测:巴西的视角
Chaos Solitons Fractals. 2020 Jun;135:109853. doi: 10.1016/j.chaos.2020.109853. Epub 2020 May 1.
3
Correlations of Online Search Engine Trends With Coronavirus Disease (COVID-19) Incidence: Infodemiology Study.
2016-2020 年日本与学校相关的谷歌趋势搜索量与儿童和青少年自杀之间的关联:一项时间序列分析的回顾性观察研究。
J Med Internet Res. 2024 Oct 21;26:e51710. doi: 10.2196/51710.
4
MPEK: a multitask deep learning framework based on pretrained language models for enzymatic reaction kinetic parameters prediction.MPEK:基于预训练语言模型的酶反应动力学参数预测的多任务深度学习框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae387.
5
Changes to Public Health Surveillance Methods Due to the COVID-19 Pandemic: Scoping Review.因 COVID-19 大流行而改变的公共卫生监测方法:范围综述。
JMIR Public Health Surveill. 2024 Jan 19;10:e49185. doi: 10.2196/49185.
6
Improved healthcare disaster decision-making utilizing information extraction from complementary social media data during the COVID-19 pandemic.在新冠疫情期间,利用从互补性社交媒体数据中提取的信息改进医疗保健灾难决策。
Decis Support Syst. 2023 Apr 24:113983. doi: 10.1016/j.dss.2023.113983.
7
Long Short-term Memory-Based Prediction of the Spread of Influenza-Like Illness Leveraging Surveillance, Weather, and Twitter Data: Model Development and Validation.基于长短时记忆模型的流感样疾病传播预测:利用监测、气象和推特数据:模型开发和验证。
J Med Internet Res. 2023 Feb 6;25:e42519. doi: 10.2196/42519.
8
Bayesian nowcasting with leading indicators applied to COVID-19 fatalities in Sweden.贝叶斯实时预测与领先指标在瑞典 COVID-19 死亡人数中的应用。
PLoS Comput Biol. 2022 Dec 7;18(12):e1010767. doi: 10.1371/journal.pcbi.1010767. eCollection 2022 Dec.
9
Forecasting and Surveillance of COVID-19 Spread Using Google Trends: Literature Review.利用谷歌趋势预测和监测 COVID-19 传播:文献综述。
Int J Environ Res Public Health. 2022 Sep 29;19(19):12394. doi: 10.3390/ijerph191912394.
10
A data-driven hybrid ensemble AI model for COVID-19 infection forecast using multiple neural networks and reinforced learning.基于多神经网络和强化学习的 COVID-19 感染预测数据驱动混合集成人工智能模型。
Comput Biol Med. 2022 Jul;146:105560. doi: 10.1016/j.compbiomed.2022.105560. Epub 2022 Apr 27.
在线搜索引擎趋势与冠状病毒病(COVID-19)发病的相关性:信息流行病学研究。
JMIR Public Health Surveill. 2020 May 21;6(2):e19702. doi: 10.2196/19702.
4
Real-time forecasts and risk assessment of novel coronavirus (COVID-19) cases: A data-driven analysis.新型冠状病毒(COVID-19)病例的实时预测与风险评估:一项数据驱动的分析。
Chaos Solitons Fractals. 2020 Jun;135:109850. doi: 10.1016/j.chaos.2020.109850. Epub 2020 Apr 30.
5
Trends and Prediction in Daily New Cases and Deaths of COVID-19 in the United States: An Internet Search-Interest Based Model.美国新冠疫情每日新增病例及死亡人数的趋势与预测:基于互联网搜索兴趣的模型
Explor Res Hypothesis Med. 2020 Apr 18;5(2):1-6. doi: 10.14218/ERHM.2020.00023.
6
Epidemiology of Coronavirus COVID-19: Forecasting the Future Incidence in Different Countries.新型冠状病毒肺炎(COVID-19)的流行病学:预测不同国家未来的发病率
Healthcare (Basel). 2020 Apr 15;8(2):99. doi: 10.3390/healthcare8020099.
7
Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions.公共卫生干预下中国新冠疫情趋势的改进型SEIR模型及人工智能预测
J Thorac Dis. 2020 Mar;12(3):165-174. doi: 10.21037/jtd.2020.02.64.
8
Tracking COVID-19 in Europe: Infodemiology Approach.追踪欧洲的 COVID-19:信息流行病学方法。
JMIR Public Health Surveill. 2020 Apr 20;6(2):e18941. doi: 10.2196/18941.
9
Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study.通过分析伊朗谷歌趋势数据预测 COVID-19 发病率:数据挖掘和深度学习初步研究。
JMIR Public Health Surveill. 2020 Apr 14;6(2):e18828. doi: 10.2196/18828.
10
Retrospective analysis of the possibility of predicting the COVID-19 outbreak from Internet searches and social media data, China, 2020.回顾性分析 2020 年中国从互联网搜索和社交媒体数据预测 COVID-19 疫情爆发的可能性。
Euro Surveill. 2020 Mar;25(10). doi: 10.2807/1560-7917.ES.2020.25.10.2000199.