Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, FL, United States.
Division of Cardiovascular Medicine, Department of Medicine, University of Florida, Gainesville, FL, United States.
JMIR Public Health Surveill. 2024 Sep 26;10:e54421. doi: 10.2196/54421.
Racial disparities in COVID-19 incidence and outcomes have been widely reported. Non-Hispanic Black patients endured worse outcomes disproportionately compared with non-Hispanic White patients, but the epidemiological basis for these observations was complex and multifaceted.
This study aimed to elucidate the potential reasons behind the worse outcomes of COVID-19 experienced by non-Hispanic Black patients compared with non-Hispanic White patients and how these variables interact using an explainable machine learning approach.
In this retrospective cohort study, we examined 28,943 laboratory-confirmed COVID-19 cases from the OneFlorida Research Consortium's data trust of health care recipients in Florida through April 28, 2021. We assessed the prevalence of pre-existing comorbid conditions, geo-socioeconomic factors, and health outcomes in the structured electronic health records of COVID-19 cases. The primary outcome was a composite of hospitalization, intensive care unit admission, and mortality at index admission. We developed and validated a machine learning model using Extreme Gradient Boosting to evaluate predictors of worse outcomes of COVID-19 and rank them by importance.
Compared to non-Hispanic White patients, non-Hispanic Blacks patients were younger, more likely to be uninsured, had a higher prevalence of emergency department and inpatient visits, and were in regions with higher area deprivation index rankings and pollutant concentrations. Non-Hispanic Black patients had the highest burden of comorbidities and rates of the primary outcome. Age was a key predictor in all models, ranking highest in non-Hispanic White patients. However, for non-Hispanic Black patients, congestive heart failure was a primary predictor. Other variables, such as food environment measures and air pollution indicators, also ranked high. By consolidating comorbidities into the Elixhauser Comorbidity Index, this became the top predictor, providing a comprehensive risk measure.
The study reveals that individual and geo-socioeconomic factors significantly influence the outcomes of COVID-19. It also highlights varying risk profiles among different racial groups. While these findings suggest potential disparities, further causal inference and statistical testing are needed to fully substantiate these observations. Recognizing these relationships is vital for creating effective, tailored interventions that reduce disparities and enhance health outcomes across all racial and socioeconomic groups.
COVID-19 发病率和结果的种族差异已得到广泛报道。与非西班牙裔白人患者相比,非西班牙裔黑人患者的预后更差,但这些观察结果的流行病学基础复杂且多方面。
本研究旨在通过可解释的机器学习方法阐明非西班牙裔黑人患者 COVID-19 预后较差的潜在原因,以及这些变量如何相互作用。
在这项回顾性队列研究中,我们检查了 2021 年 4 月 28 日前佛罗里达州 OneFlorida 研究联盟数据信托中来自佛罗里达州医疗保健接受者的 28943 例实验室确诊的 COVID-19 病例。我们评估了 COVID-19 病例的电子病历中预先存在的合并症、地理社会经济因素和健康结果的流行情况。主要结局是指数入院时住院、入住重症监护病房和死亡的复合结局。我们使用极端梯度提升开发和验证了机器学习模型,以评估 COVID-19 不良结局的预测因子,并按重要性对其进行排名。
与非西班牙裔白人患者相比,非西班牙裔黑人患者更年轻,更有可能没有保险,急诊和住院就诊的比例更高,并且所在地区的区域贫困指数排名和污染物浓度更高。非西班牙裔黑人患者的合并症负担和主要结局发生率最高。在所有模型中,年龄都是关键预测因素,在非西班牙裔白人患者中排名最高。然而,对于非西班牙裔黑人患者,充血性心力衰竭是主要预测因素。其他变量,如饮食环境措施和空气污染指标,也排名较高。通过将合并症合并到 Elixhauser 合并症指数中,这成为了首要预测因素,提供了全面的风险衡量标准。
该研究表明,个体和地理社会经济因素对 COVID-19 的结果有重大影响。它还强调了不同种族群体之间不同的风险特征。虽然这些发现表明存在潜在差异,但需要进一步的因果推理和统计检验来充分证实这些观察结果。认识到这些关系对于制定有效的、有针对性的干预措施至关重要,这些措施可以减少所有种族和社会经济群体的差异,并改善健康结果。