• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用当地卫生系统数据,采用机器学习方法对全州范围内传染病住院情况进行实时邮政编码和县级估计。

Machine learning approaches for real-time ZIP code and county-level estimation of state-wide infectious disease hospitalizations using local health system data.

作者信息

Ahammed Tanvir, Hossain Md Sakhawat, McMahan Christopher, Rennert Lior

机构信息

Department of Public Health Sciences, Clemson University, Clemson, SC, USA; Center for Public Health Modeling and Response, Clemson University, Clemson, SC, USA.

Center for Public Health Modeling and Response, Clemson University, Clemson, SC, USA; School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC, USA.

出版信息

Epidemics. 2025 Jun;51:100823. doi: 10.1016/j.epidem.2025.100823. Epub 2025 Apr 3.

DOI:10.1016/j.epidem.2025.100823
PMID:40215586
Abstract

The lack of conventional methods of estimating real-time infectious disease burden in granular regions inhibits timely and efficient public health response. Comprehensive data sources (e.g., state health department data) typically needed for such estimation are often limited due to 1) substantial delays in data reporting and 2) lack of geographic granularity in data provided to researchers. Leveraging real-time local health system data presents an opportunity to overcome these challenges. This study evaluates the effectiveness of machine learning and statistical approaches using local health system data to estimate current and previous COVID-19 hospitalizations in South Carolina. Random Forest models demonstrated consistently higher average median percent agreement accuracy compared to generalized linear mixed models for current weekly hospitalizations across 123 ZIP codes (72.29 %, IQR: 63.20-75.62 %) and 28 counties (76.43 %, IQR: 70.33-81.16 %) with sufficient health system coverage. To account for underrepresented populations in health systems, we combined Random Forest models with Classification and Regression Trees (CART) for imputation. The average median percent agreement was 61.02 % (IQR: 51.17-72.29 %) for all ZIP codes and 72.64 % (IQR: 66.13-77.69 %) for all counties. Median percent agreement for cumulative hospitalizations over the previous 6 months was 80.98 % (IQR: 68.99-89.66 %) for all ZIP codes and 81.17 % (IQR: 68.55-91.33 %) for all counties. These findings emphasize the effectiveness of utilizing real-time health system data to estimate infectious disease burden. Moreover, the methodologies developed in this study can be adapted to estimate hospitalizations for other diseases, offering a valuable tool for public health officials to respond swiftly and effectively to various health crises.

摘要

缺乏在精细区域估计实时传染病负担的传统方法,这阻碍了及时有效的公共卫生应对。进行此类估计通常所需的综合数据源(如州卫生部门的数据)往往受到限制,原因如下:1)数据报告存在严重延迟;2)提供给研究人员的数据缺乏地理精细度。利用实时本地卫生系统数据为克服这些挑战提供了契机。本研究评估了使用本地卫生系统数据的机器学习和统计方法在估计南卡罗来纳州当前和既往新冠住院情况方面的有效性。对于123个邮政编码区域(平均中位数百分比一致性准确率为72.29%,四分位距:63.20 - 75.62%)和28个县(平均中位数百分比一致性准确率为76.43%,四分位距:70.33 - 81.16%)且卫生系统覆盖充分的地区,随机森林模型在估计当前每周住院情况时,与广义线性混合模型相比,始终表现出更高的平均中位数百分比一致性准确率。为了考虑卫生系统中代表性不足的人群,我们将随机森林模型与分类回归树(CART)相结合进行插补。所有邮政编码区域的平均中位数百分比一致性为61.02%(四分位距:51.17 - 72.29%),所有县的为72.64%(四分位距:66.13 - 77.69%)。所有邮政编码区域过去6个月累计住院情况的中位数百分比一致性为80.98%(四分位距:68.99 - 89.66%),所有县的为81.17%(四分位距:68.55 - 91.33%)。这些发现强调了利用实时卫生系统数据估计传染病负担的有效性。此外,本研究中开发的方法可用于估计其他疾病的住院情况,为公共卫生官员迅速有效地应对各种健康危机提供了宝贵工具。

相似文献

1
Machine learning approaches for real-time ZIP code and county-level estimation of state-wide infectious disease hospitalizations using local health system data.利用当地卫生系统数据,采用机器学习方法对全州范围内传染病住院情况进行实时邮政编码和县级估计。
Epidemics. 2025 Jun;51:100823. doi: 10.1016/j.epidem.2025.100823. Epub 2025 Apr 3.
2
A flexible framework for local-level estimation of the effective reproductive number in geographic regions with sparse data.一种用于在数据稀疏的地理区域进行地方层面有效繁殖数估计的灵活框架。
BMC Med Res Methodol. 2025 Mar 18;25(1):73. doi: 10.1186/s12874-025-02525-1.
3
A Flexible Framework for Local-Level Estimation of the Effective Reproductive Number in Geographic Regions with Sparse Data.一种用于在数据稀疏的地理区域进行有效再生数的地方层面估计的灵活框架。
medRxiv. 2025 Mar 10:2024.11.06.24316859. doi: 10.1101/2024.11.06.24316859.
4
Assessing the accuracy of California county level COVID-19 hospitalization forecasts to inform public policy decision making.评估加利福尼亚县级 COVID-19 住院预测的准确性,以为公共政策决策提供信息。
BMC Public Health. 2023 Apr 28;23(1):782. doi: 10.1186/s12889-023-15649-0.
5
Estimation of US SARS-CoV-2 Infections, Symptomatic Infections, Hospitalizations, and Deaths Using Seroprevalence Surveys.利用血清流行率调查估计美国 SARS-CoV-2 感染、有症状感染、住院和死亡人数。
JAMA Netw Open. 2021 Jan 4;4(1):e2033706. doi: 10.1001/jamanetworkopen.2020.33706.
6
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
7
Inequities in COVID-19 vaccine and booster coverage across Massachusetts ZIP codes after the emergence of Omicron: A population-based cross-sectional study.在奥密克戎出现后,马萨诸塞州邮政编码区域内 COVID-19 疫苗和加强针接种的不平等:一项基于人群的横断面研究。
PLoS Med. 2023 Jan 31;20(1):e1004167. doi: 10.1371/journal.pmed.1004167. eCollection 2023 Jan.
8
Mathematical Assessment of Wastewater-Based Epidemiology to Predict SARS-CoV-2 Cases and Hospitalizations in Miami-Dade County.基于废水的流行病学对迈阿密-戴德县新冠病毒病例和住院情况预测的数学评估
Acta Biotheor. 2025 Feb 11;73(1):2. doi: 10.1007/s10441-025-09492-6.
9
Variation in COVID-19 Diagnosis by Zip Code and Race and Ethnicity in Indiana.印第安纳州邮政编码和种族差异与 COVID-19 诊断的变化。
Front Public Health. 2020 Dec 11;8:593861. doi: 10.3389/fpubh.2020.593861. eCollection 2020.
10
Real-time estimation and forecasting of COVID-19 cases and hospitalizations in Wisconsin HERC regions for public health decision making processes.威斯康星州 HERC 地区实时估计和预测 COVID-19 病例和住院情况,以支持公共卫生决策过程。
BMC Public Health. 2023 Feb 17;23(1):359. doi: 10.1186/s12889-023-15160-6.

引用本文的文献

1
Mobility-informed metapopulation models predict the spatio-temporal spread of respiratory epidemics across scales.基于移动性的集合种群模型预测了呼吸道流行病在不同尺度上的时空传播。
medRxiv. 2025 Jun 26:2025.06.26.25330297. doi: 10.1101/2025.06.26.25330297.
2
Enhancing Pandemic Prediction: A Deep Learning Approach Using Transformer Neural Networks and Multi-Source Data Fusion for Infectious Disease Forecasting.增强大流行预测:一种使用Transformer神经网络和多源数据融合进行传染病预测的深度学习方法。
medRxiv. 2025 Jun 24:2025.06.24.25330211. doi: 10.1101/2025.06.24.25330211.