Accurate Forecasting of Emergency Department Arrivals With Internet Search Index and Machine Learning Models: Model Development and Performance Evaluation.

Authors

Fan Bi, Peng Jiaxuan, Guo Hainan, Gu Haobin, Xu Kangkang, Wu Tingting

Affiliations

College of Management, Institute of Business Analysis and Supply Chain Management, Shenzhen University, Shenzhen, China.

Faculty of Science, University of St Andrews, St Andrews, United Kingdom.

Publication information

JMIR Med Inform. 2022 Jul 20;10(7):e34504. doi: 10.2196/34504.

DOI:10.2196/34504

PMID:35857360

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9350824/

Abstract

BACKGROUND

Emergency department (ED) overcrowding is a global health care concern, caused mainly by uncertainty in patient arrivals, especially during a pandemic. Accurate forecasts of patient arrivals allow health resources to be allocated in advance, which reduces overcrowding. Forecasting models are currently built mostly on traditional data, such as historical patient visits, weather, holidays, and calendar variables. Data from internet search engines (eg, Google) are less studied, although they can provide pivotal real-time surveillance information. Such data can be used to improve forecasting performance and provide early warning, especially during an epidemic. Moreover, possible nonlinearities between patient arrivals and these variables are often ignored.

OBJECTIVE

This study aims to develop an intelligent forecasting system that combines machine learning models with an internet search index to predict ED patient arrivals accurately, to verify the effectiveness of the internet search index, and to explore whether nonlinear models can improve forecasting accuracy.

METHODS

Data on ED patient arrivals were collected from July 12, 2009, to June 27, 2010, the period of the 2009 H1N1 pandemic. The data comprised 139,910 ED visits at our collaborating hospital, one of the largest public hospitals in Hong Kong. Traditional data were collected for the same period. The internet search index was generated from 268 Google search queries to comprehensively capture information about potential patients. The relationship between the index and patient arrivals was verified using the Pearson correlation coefficient, the Johansen cointegration test, and the Granger causality test. Linear and nonlinear models were then developed with the internet search index to predict patient arrivals, and their accuracy and robustness were examined.
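Two of the checks named above can be illustrated on synthetic data. The sketch below is not the authors' code: the series, the lag order of 2, and the helper names `pearson_r`, `lagged`, and `granger_f` are assumptions made for illustration, and the Johansen cointegration test is not covered. It computes a Pearson correlation and the F statistic behind a Granger-style comparison, that is, whether past values of a search index reduce the residual error of an autoregressive model of arrivals.

```python
import numpy as np

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc, yc = x - x.mean(), y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))

def lagged(series, lags):
    """Matrix whose k-th column holds series[t-k] for t = lags .. len-1."""
    n = len(series)
    return np.column_stack([series[lags - k : n - k] for k in range(1, lags + 1)])

def granger_f(y, x, lags=2):
    """F statistic for H0: lagged x adds nothing to an AR(lags) model of y."""
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    target = y[lags:]
    ones = np.ones((len(target), 1))
    restricted = np.hstack([ones, lagged(y, lags)])            # own lags only
    unrestricted = np.hstack([restricted, lagged(x, lags)])    # plus lags of x

    def rss(design):
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        resid = target - design @ coef
        return float(resid @ resid)

    rss_r, rss_u = rss(restricted), rss(unrestricted)
    dof = len(target) - unrestricted.shape[1]
    return ((rss_r - rss_u) / lags) / (rss_u / dof)

# Synthetic example (made-up data): arrivals follow yesterday's index plus noise.
rng = np.random.default_rng(0)
index = rng.normal(size=300)
arrivals = 0.5 * rng.normal(size=300)
arrivals[1:] += 0.8 * index[:-1]

r_lag1 = pearson_r(index[:-1], arrivals[1:])
f_forward = granger_f(arrivals, index)   # does the index help predict arrivals?
f_reverse = granger_f(index, arrivals)   # does arrivals help predict the index?
print(f"lag-1 correlation: {r_lag1:.2f}")
print(f"F (index -> arrivals): {f_forward:.1f}, F (arrivals -> index): {f_reverse:.1f}")
```

A full test would compare the F statistic against an F distribution with `lags` and `dof` degrees of freedom to obtain a P value; a statistics package such as statsmodels can do this, and the raw statistic here only shows the direction of the comparison.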

RESULTS

All models could accurately predict patient arrivals. The causality test indicated that the internet search index is a strong predictor of ED patient arrivals. With the internet search index, the mean absolute percentage error (MAPE) and the root mean square error (RMSE) of the linear model fell from 5.3% to 5.0% and from 24.44 to 23.18, respectively, whereas the MAPE and RMSE of the nonlinear model fell further, from 3.5% to 3% and from 16.72 to 14.55, respectively. A comparison across models showed that the forecasting system combining the extreme learning machine with the internet search index performed best in both forecasting accuracy and robustness analysis.
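For readers unfamiliar with the two error measures reported above, the following minimal sketch shows how MAPE and RMSE are conventionally computed. The function names and the two-day example values are illustrative assumptions, not data from the study.

```python
import numpy as np

def mape(actual, predicted):
    """Mean absolute percentage error, in percent. Assumes no zero actuals."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs((actual - predicted) / actual)) * 100)

def rmse(actual, predicted):
    """Root mean square error, in the same units as the series."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))

# Made-up daily arrival counts and forecasts, for illustration only.
actual = [100, 200]
predicted = [110, 180]

example_mape = mape(actual, predicted)   # both days are off by 10%
example_rmse = rmse(actual, predicted)   # sqrt((10**2 + 20**2) / 2)
print(f"MAPE = {example_mape:.1f}%, RMSE = {example_rmse:.2f}")
```

MAPE is scale-free, so it can be compared across hospitals of different sizes, whereas RMSE is expressed in arrivals per period and penalizes large misses more heavily.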

CONCLUSIONS

The proposed forecasting system can make accurate, real-time predictions of ED patient arrivals. Compared with static traditional variables, the internet search index significantly improves forecasting (P=.002), acting as a reliable predictor that tracks both continuous behavioral trends and sudden changes during the epidemic. The nonlinear model outperforms its linear counterpart by capturing the dynamic relationship between the index and patient arrivals. The system can therefore facilitate staff planning and workflow monitoring.
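The abstract names the extreme learning machine (ELM) as the best-performing nonlinear model but gives no implementation details. The sketch below shows the general ELM idea only: a hidden layer with random, fixed weights whose output weights are solved in one least-squares step. The `tanh` activation, the 50 hidden units, the class name, and the toy sine-shaped target are all assumptions for illustration and do not reproduce the authors' system or data.

```python
import numpy as np

class ExtremeLearningMachine:
    """Single-hidden-layer network: random hidden weights, least-squares output."""

    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Hidden-layer activations; W and b are never trained.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y, dtype=float)
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        # Output weights via the Moore-Penrose pseudoinverse of the activations.
        self.beta = np.linalg.pinv(self._hidden(X)) @ y
        return self

    def predict(self, X):
        return self._hidden(np.asarray(X, dtype=float)) @ self.beta

def rmse(actual, predicted):
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))

# Toy nonlinear relationship (made up): the target is a sine of one input.
x_train = np.linspace(-1, 1, 200).reshape(-1, 1)
y_train = np.sin(3 * x_train[:, 0])
x_test = np.linspace(-0.95, 0.95, 77).reshape(-1, 1)
y_test = np.sin(3 * x_test[:, 0])

elm = ExtremeLearningMachine(n_hidden=50).fit(x_train, y_train)
elm_rmse = rmse(y_test, elm.predict(x_test))

# Linear baseline: ordinary least squares with an intercept.
design_train = np.hstack([np.ones_like(x_train), x_train])
coef, *_ = np.linalg.lstsq(design_train, y_train, rcond=None)
design_test = np.hstack([np.ones_like(x_test), x_test])
linear_rmse = rmse(y_test, design_test @ coef)

print(f"ELM RMSE: {elm_rmse:.4f}, linear RMSE: {linear_rmse:.4f}")
```

On this toy target the linear baseline cannot follow the curvature, which is the kind of gap between linear and nonlinear models that the conclusions describe; how large that gap is on real arrival data depends on the data and is not shown here.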
