Megahed Fadel M, Jones-Farmer L Allison, Ma Yinjiao, Rigdon Steven E
Farmer School of Business, Miami University, Oxford, OH, United States.
Department of Epidemiology and Biostatistics, College for Public Health and Social Justice, Saint Louis University, St Louis, MO, United States.
JMIR Public Health Surveill. 2022 Jul 19;8(7):e32164. doi: 10.2196/32164.
Socially vulnerable communities are at increased risk for adverse health outcomes during a pandemic. Although this association has been established for H1N1, Middle East respiratory syndrome (MERS), and COVID-19 outbreaks, understanding the factors influencing the outbreak pattern for different communities remains limited.
Our 3 objectives are to determine how many distinct clusters of time series there are for COVID-19 deaths in 3108 contiguous counties in the United States, how the clusters are geographically distributed, and what factors influence the probability of cluster membership.
We proposed a 2-stage data analytic framework that can account for different levels of temporal aggregation for the pandemic outcomes and community-level predictors. Specifically, we used time-series clustering to identify clusters with similar outcome patterns for the 3108 contiguous US counties. Multinomial logistic regression was used to explain the relationship between community-level predictors and cluster assignment. We analyzed county-level confirmed COVID-19 deaths from Sunday, March 1, 2020, to Saturday, February 27, 2021.
Four distinct patterns of deaths were observed across the contiguous US counties. The multinomial regression model correctly classified 1904 (61.25%) of the counties' outbreak patterns/clusters.
Our results provide evidence that county-level patterns of COVID-19 deaths are different and can be explained in part by social and political predictors.
在大流行期间,社会弱势群体面临更严重的健康不良后果风险。尽管这种关联已在甲型H1N1流感、中东呼吸综合征(MERS)和新冠疫情中得到证实,但对于影响不同社区疫情爆发模式的因素,我们的了解仍然有限。
我们的三个目标是确定美国3108个相邻县的新冠死亡病例时间序列中有多少个不同的聚类,这些聚类在地理上如何分布,以及哪些因素会影响聚类成员的概率。
我们提出了一个两阶段的数据分析框架,该框架可以考虑疫情结果和社区层面预测因素的不同时间聚合水平。具体而言,我们使用时间序列聚类来识别美国3108个相邻县中具有相似结果模式的聚类。多项逻辑回归用于解释社区层面预测因素与聚类分配之间的关系。我们分析了2020年3月1日星期日至2021年2月27日星期六各县确诊的新冠死亡病例。
在美国相邻各县观察到四种不同的死亡模式。多项回归模型正确分类了1904个(61.25%)县的疫情爆发模式/聚类。
我们的结果提供了证据,表明县级新冠死亡模式各不相同,并且可以部分由社会和政治预测因素来解释。