Predictive Science Inc., San Diego, California, United States of America.
PLoS One. 2022 Apr 21;17(4):e0266330. doi: 10.1371/journal.pone.0266330. eCollection 2022.
More than a year since the appearance of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), many questions about the disease COVID-19 have been answered; however, many more remain poorly understood. Although the situation continues to evolve, it is crucial to understand what factors may be driving transmission through different populations, both for potential future waves, as well as the implications for future pandemics. In this report, we compiled a database of more than 28 potentially explanatory variables for each of the 50 U.S. states through early May 2020. Using a combination of traditional statistical and modern machine learning approaches, we identified those variables that were the most statistically significant, and, those that were the most important. These variables were chosen to be fiduciaries of a range of possible drivers for COVID-19 deaths in the USA. We found that population-weighted population density (PWPD), some "stay at home" metrics, monthly temperature and precipitation, race/ethnicity, and chronic low-respiratory death rate, were all statistically significant. Of these, PWPD and mobility metrics dominated. This suggests that the biggest impact on COVID-19 deaths was, at least initially, a function of where you lived, and not what you did. However, clearly, increasing social distancing has the net effect of (at least temporarily) reducing the effective PWPD. Our results strongly support the idea that the loosening of "lock-down" orders should be tailored to the local PWPD. In contrast to these variables, while still statistically significant, race/ethnicity, health, and climate effects could only account for a few percent of the variability in deaths. Where associations were anticipated but were not found, we discuss how limitations in the parameters chosen may mask a contribution that might otherwise be present.
自严重急性呼吸系统综合症冠状病毒 2 型(SARS-CoV-2)出现以来,已经有一年多了,关于 COVID-19 疾病的许多问题已经得到解答;然而,仍有许多问题尚未得到很好的理解。尽管情况仍在不断发展,但了解哪些因素可能导致不同人群的传播至关重要,这不仅对未来可能出现的疫情浪潮有影响,对未来的大流行也有影响。在本报告中,我们汇总了截至 2020 年 5 月初美国 50 个州的每个州的 28 个潜在解释变量的数据库。我们使用传统统计学和现代机器学习方法的组合,确定了最具统计学意义的变量,以及最重要的变量。这些变量被选为 COVID-19 在美国死亡的各种可能驱动因素的受托人。我们发现,加权人口密度(PWPD)、一些“居家”指标、月温度和降水、种族/民族以及慢性低呼吸死亡率都是统计学上显著的。在这些变量中,PWPD 和移动性指标占主导地位。这表明,对 COVID-19 死亡人数的最大影响至少最初是由你居住的地方决定的,而不是你所做的事情。然而,显然,增加社交距离的净效应是(至少暂时)降低有效 PWPD。我们的结果强烈支持这样一种观点,即放宽“封锁”命令应该根据当地的 PWPD 进行调整。与这些变量相反,尽管仍具有统计学意义,但种族/民族、健康和气候的影响只能解释死亡人数变化的百分之几。对于那些预期存在关联但实际上不存在的变量,我们将讨论在选择参数时存在的局限性如何掩盖了可能存在的贡献。