• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用数据挖掘预测 COVID-19 的发病率。

Predicting the incidence of COVID-19 using data mining.

机构信息

Department of Computer Engineering, School of Engineering, Behbahan Khatam Alanbia University of Technology, Behbahan, Iran.

School of Medicine, Shahroud University of Medical Sciences, Shahroud, Iran.

出版信息

BMC Public Health. 2021 Jun 7;21(1):1087. doi: 10.1186/s12889-021-11058-3.

DOI:10.1186/s12889-021-11058-3
PMID:34098928
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8182740/
Abstract

BACKGROUND

The high prevalence of COVID-19 has made it a new pandemic. Predicting both its prevalence and incidence throughout the world is crucial to help health professionals make key decisions. In this study, we aim to predict the incidence of COVID-19 within a two-week period to better manage the disease.

METHODS

The COVID-19 datasets provided by Johns Hopkins University, contain information on COVID-19 cases in different geographic regions since January 22, 2020 and are updated daily. Data from 252 such regions were analyzed as of March 29, 2020, with 17,136 records and 4 variables, namely latitude, longitude, date, and records. In order to design the incidence pattern for each geographic region, the information was utilized on the region and its neighboring areas gathered 2 weeks prior to the designing. Then, a model was developed to predict the incidence rate for the coming 2 weeks via a Least-Square Boosting Classification algorithm.

RESULTS

The model was presented for three groups based on the incidence rate: less than 200, between 200 and 1000, and above 1000. The mean absolute error of model evaluation were 4.71, 8.54, and 6.13%, respectively. Also, comparing the forecast results with the actual values in the period in question showed that the proposed model predicted the number of globally confirmed cases of COVID-19 with a very high accuracy of 98.45%.

CONCLUSION

Using data from different geographical regions within a country and discovering the pattern of prevalence in a region and its neighboring areas, our boosting-based model was able to accurately predict the incidence of COVID-19 within a two-week period.

摘要

背景

COVID-19 的高患病率使其成为一种新的大流行病。预测全球范围内的患病率和发病率对于帮助卫生专业人员做出关键决策至关重要。在这项研究中,我们旨在预测 COVID-19 在两周内的发病率,以更好地管理这种疾病。

方法

约翰霍普金斯大学提供的 COVID-19 数据集包含自 2020 年 1 月 22 日以来不同地理区域 COVID-19 病例的信息,并且每天都会更新。截至 2020 年 3 月 29 日,分析了来自 252 个此类地区的数据,其中包含 17,136 条记录和 4 个变量,即纬度、经度、日期和记录。为了为每个地理区域设计发病率模式,利用了该区域及其周边地区在设计前两周收集的信息。然后,通过最小二乘提升分类算法开发了一个模型来预测未来两周的发病率。

结果

根据发病率将模型呈现为三组:少于 200、200 至 1000 之间和大于 1000。模型评估的平均绝对误差分别为 4.71、8.54 和 6.13%。此外,将预测结果与同期的实际值进行比较表明,所提出的模型以非常高的准确度(98.45%)预测了全球 COVID-19 确诊病例的数量。

结论

使用一个国家内不同地理区域的数据,并发现一个地区及其周边地区的流行模式,我们的基于提升的模型能够准确预测 COVID-19 在两周内的发病率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/3598303f10d1/12889_2021_11058_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/fc934ce8f946/12889_2021_11058_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/e54146eb75f8/12889_2021_11058_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/4dfe8239144c/12889_2021_11058_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/92d5389869fa/12889_2021_11058_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/3598303f10d1/12889_2021_11058_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/fc934ce8f946/12889_2021_11058_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/e54146eb75f8/12889_2021_11058_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/4dfe8239144c/12889_2021_11058_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/92d5389869fa/12889_2021_11058_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4e/8186210/3598303f10d1/12889_2021_11058_Fig5_HTML.jpg

相似文献

1
Predicting the incidence of COVID-19 using data mining.利用数据挖掘预测 COVID-19 的发病率。
BMC Public Health. 2021 Jun 7;21(1):1087. doi: 10.1186/s12889-021-11058-3.
2
Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study.通过分析伊朗谷歌趋势数据预测 COVID-19 发病率:数据挖掘和深度学习初步研究。
JMIR Public Health Surveill. 2020 Apr 14;6(2):e18828. doi: 10.2196/18828.
3
Analysis of COVID-19 pandemic in USA, using Topological Weighted Centroid.利用拓扑加权质心分析美国的 COVID-19 疫情。
Comput Biol Med. 2021 Sep;136:104670. doi: 10.1016/j.compbiomed.2021.104670. Epub 2021 Jul 21.
4
Forecasting daily confirmed COVID-19 cases in Malaysia using ARIMA models.使用自回归积分滑动平均(ARIMA)模型预测马来西亚每日新增新冠肺炎确诊病例数。
J Infect Dev Ctries. 2020 Sep 30;14(9):971-976. doi: 10.3855/jidc.13116.
5
The first laboratory-confirmed imported infections of SARS-CoV-2 in Sudan.苏丹首次实验室确诊的输入性 SARS-CoV-2 感染。
Trans R Soc Trop Med Hyg. 2021 Jan 7;115(1):103-109. doi: 10.1093/trstmh/traa151.
6
The Exponentially Increasing Rate of Patients Infected with COVID-19 in Iran.伊朗 COVID-19 感染患者呈指数级增长。
Arch Iran Med. 2020 Apr 1;23(4):235-238. doi: 10.34172/aim.2020.03.
7
Surveillance Metrics of SARS-CoV-2 Transmission in Central Asia: Longitudinal Trend Analysis.中亚地区 SARS-CoV-2 传播的监测指标:纵向趋势分析。
J Med Internet Res. 2021 Feb 3;23(2):e25799. doi: 10.2196/25799.
8
Dynamic Panel Data Modeling and Surveillance of COVID-19 in Metropolitan Areas in the United States: Longitudinal Trend Analysis.动态面板数据分析与美国大都市地区 COVID-19 的监测:纵向趋势分析。
J Med Internet Res. 2021 Feb 9;23(2):e26081. doi: 10.2196/26081.
9
Role of biological Data Mining and Machine Learning Techniques in Detecting and Diagnosing the Novel Coronavirus (COVID-19): A Systematic Review.生物数据挖掘和机器学习技术在检测和诊断新型冠状病毒 (COVID-19) 中的作用:系统评价。
J Med Syst. 2020 May 25;44(7):122. doi: 10.1007/s10916-020-01582-x.
10
Distribution of the SARS-CoV-2 Pandemic and Its Monthly Forecast Based on Seasonal Climate Patterns.基于季节性气候模式的 SARS-CoV-2 大流行分布及其月度预测。
Int J Environ Res Public Health. 2020 May 17;17(10):3493. doi: 10.3390/ijerph17103493.

引用本文的文献

1
Learning from the past: A short term forecast method for the COVID-19 incidence curve.从过去中学习:COVID-19 发病率曲线的短期预测方法。
PLoS Comput Biol. 2023 Jun 21;19(6):e1010790. doi: 10.1371/journal.pcbi.1010790. eCollection 2023 Jun.
2
Tracking machine learning models for pandemic scenarios: a systematic review of machine learning models that predict local and global evolution of pandemics.追踪大流行情景下的机器学习模型:对预测大流行局部和全球演变的机器学习模型的系统综述
Netw Model Anal Health Inform Bioinform. 2022;11(1):40. doi: 10.1007/s13721-022-00384-0. Epub 2022 Oct 11.
3
Prediction of Patients with COVID-19 Requiring Intensive Care: A Cross-sectional Study Based on Machine-learning Approach from Iran.

本文引用的文献

1
A SIR model assumption for the spread of COVID-19 in different communities.一种关于新冠病毒在不同社区传播的易感-感染-康复(SIR)模型假设。
Chaos Solitons Fractals. 2020 Oct;139:110057. doi: 10.1016/j.chaos.2020.110057. Epub 2020 Jun 28.
2
Estimation of COVID-19 dynamics "on a back-of-envelope": Does the simplest SIR model provide quantitative parameters and predictions?“粗略估计”新冠疫情动态:最简单的SIR模型能否提供定量参数和预测?
Chaos Solitons Fractals. 2020 Jun;135:109841. doi: 10.1016/j.chaos.2020.109841. Epub 2020 May 1.
3
1,000,000 cases of COVID-19 outside of China: The date predicted by a simple heuristic.
对需要重症监护的新冠肺炎患者的预测:一项基于机器学习方法的伊朗横断面研究。
Indian J Crit Care Med. 2022 Jun;26(6):688-695. doi: 10.5005/jp-journals-10071-24226.
4
COVIDNearTerm: A simple method to forecast COVID-19 hospitalizations.COVID近期预测:一种预测新冠肺炎住院人数的简单方法。
J Clin Transl Sci. 2022 Apr 19;6(1):e59. doi: 10.1017/cts.2022.389. eCollection 2022.
5
A data-driven hybrid ensemble AI model for COVID-19 infection forecast using multiple neural networks and reinforced learning.基于多神经网络和强化学习的 COVID-19 感染预测数据驱动混合集成人工智能模型。
Comput Biol Med. 2022 Jul;146:105560. doi: 10.1016/j.compbiomed.2022.105560. Epub 2022 Apr 27.
6
Predicting Mortality of COVID-19 Patients based on Data Mining Techniques.基于数据挖掘技术预测COVID-19患者的死亡率
J Biomed Phys Eng. 2021 Oct 1;11(5):653-662. doi: 10.31661/jbpe.v0i0.2104-1300. eCollection 2021 Oct.
中国境外100万例新冠病毒肺炎病例:基于一种简单启发式方法预测的日期。
Glob Epidemiol. 2020 Nov;2:100023. doi: 10.1016/j.gloepi.2020.100023. Epub 2020 Mar 23.
4
COVID-19: towards controlling of a pandemic.2019冠状病毒病:迈向大流行的控制
Lancet. 2020 Mar 28;395(10229):1015-1018. doi: 10.1016/S0140-6736(20)30673-5. Epub 2020 Mar 17.
5
Author Correction: China's response to a novel coronavirus stands in stark contrast to the 2002 SARS outbreak response.作者更正:中国对新型冠状病毒的应对与2002年非典疫情的应对形成鲜明对比。
Nat Med. 2020 Mar;26(3):441. doi: 10.1038/s41591-020-0816-5.
6
Analyzing the epidemiological outbreak of COVID-19: A visual exploratory data analysis approach.分析 COVID-19 的流行病学爆发:一种可视化探索性数据分析方法。
J Med Virol. 2020 Jun;92(6):632-638. doi: 10.1002/jmv.25743. Epub 2020 Mar 11.
7
Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020.2020年2月5日至2月24日中国新冠肺炎疫情的实时预测
Infect Dis Model. 2020 Feb 14;5:256-263. doi: 10.1016/j.idm.2020.02.002. eCollection 2020.
8
Short-term Forecasts of the COVID-19 Epidemic in Guangdong and Zhejiang, China: February 13-23, 2020.中国广东和浙江地区2019冠状病毒病疫情的短期预测:2020年2月13日至23日
J Clin Med. 2020 Feb 22;9(2):596. doi: 10.3390/jcm9020596.
9
First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020.法国 2019 年冠状病毒病(COVID-19)的首批病例:2020 年 1 月的监测、调查和控制措施。
Euro Surveill. 2020 Feb;25(6). doi: 10.2807/1560-7917.ES.2020.25.6.2000094.
10
The First Case of 2019 Novel Coronavirus Pneumonia Imported into Korea from Wuhan, China: Implication for Infection Prevention and Control Measures.韩国首例中国武汉输入性 2019 新型冠状病毒肺炎病例:感染防控措施启示。
J Korean Med Sci. 2020 Feb 10;35(5):e61. doi: 10.3346/jkms.2020.35.e61.