Environmental and Occupational Health Sciences Institute (EOHSI), Rutgers University, Piscataway, NJ, 08854, USA.
Department of Chemical and Biochemical Engineering, Rutgers University, Piscataway, NJ, 08854, USA.
J Expo Sci Environ Epidemiol. 2024 Mar;34(2):197-207. doi: 10.1038/s41370-023-00518-0. Epub 2023 Feb 1.
Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales.
To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches.
We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches.
We found robust positive associations of COVID-19 mortality with historic exposures to NO, population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models.
The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey.
与 COVID-19 不良健康结果相关的差异与多种社会和环境压力因素有关。然而,需要研究在地方尺度上研究这些关联的方法的一致性和效率。
评估新泽西州 COVID-19 结果的社会暴露组关联,并评估多种建模方法的结果一致性。
我们检索了新泽西州 565 个市/镇的 COVID-19 病例和死亡数据,直至大流行第一阶段结束,并计算了有和没有长期护理(LTC)设施死亡的死亡率。我们考虑了 84 个来自公共数据库的空间异质环境、人口统计学和社会经济因素,包括空气污染、靠近工业场所/设施、交通相关噪声、职业和通勤、邻里和住房特征、年龄结构、种族/族裔构成、贫困等。我们实施了六种地质统计学模型(泊松/负二项回归、泊松/负二项混合效应模型、泊松/负二项贝塞尔-约克-莫利空间模型)和两种机器学习(ML)方法(随机森林、极端梯度提升)来评估关联模式。为可解释的 ML 建立了 Shapley 效应图,并引入支持变更验证来比较不同方法的性能。
我们发现 COVID-19 死亡率与历史上暴露于 NO、人口密度、少数民族比例和低于高中学历以及其他社会和环境因素呈显著正相关。排除 LTC 死亡不会显著影响大多数因素的相关性,但模型结构和假设会极大地影响结果。表现最佳的地质统计学模型涉及代表数据变化的灵活结构。ML 方法捕捉到与表现最佳的地质统计学模型一致的关联模式,并且还检测到地质统计学模型无法捕捉的一致非线性关联。
这项工作的结果提高了对新泽西州 COVID-19 结果中社会和环境差异如何影响的理解。