Department of Statistics, Brigham Young University, Provo, Utah, United States of America.
PLoS One. 2024 May 16;19(5):e0289254. doi: 10.1371/journal.pone.0289254. eCollection 2024.
The onset of the COVID-19 pandemic commenced an era of widespread disruptions in the academic world, including shut downs, periodic shifts to online learning, and disengagement from students. In an effort to transition back to in-person learning, many universities and schools tried to implement policy that balanced student learning with community health. While academic administrators have little control over some aspects of COVID-19 spread, they often choose to use temporary shutdowns of in-person teaching based on perceived hotspots of COVID-19. Specifically, if administrators have substantial evidence of within-group transmission for a class or other academic unit (a "hotspot"), the activities of that class or division of the university might be temporarily moved online. In this article, we describe an approach used to make these types of decisions. Using demographic information and weekly COVID-19 testing outcomes for university students, we use an XGBoost model that produces an estimated probability of testing positive for each student. We discuss variables engineered from the demographic information that increased model fit. As part of our approach, we simulate semesters under the null hypothesis of no in-class transmission, and compare the distribution of simulated outcomes to the observed group positivity rates to find an initial p-value for each group (e.g., section, housing area, or major). Using a simulation-based modification of a standard false discovery rate procedure, we identify possible hot spots-classes or groups whose COVID-19 rates exceed the levels expected for the demographic mix of students in each group of interest. We use simulation experiments and an anonymized example from Fall 2020 to illustrate the performance of our approach. While our example is based on hotspot detection in a university setting, the approach can be used for monitoring the spread of infectious disease within any interconnected organization or population.
COVID-19 大流行的爆发开启了学术领域广泛中断的时代,包括关闭、定期转向在线学习以及与学生脱节。为了重新过渡到面对面学习,许多大学和学校试图实施平衡学生学习和社区健康的政策。虽然学术管理人员对 COVID-19 传播的某些方面几乎没有控制,但他们经常选择根据 COVID-19 的热点暂时关闭面对面教学。具体来说,如果管理人员有大量班级或其他学术单位(“热点”)内群体传播的证据,该班级或大学部门的活动可能会暂时转移到线上。在本文中,我们描述了一种用于做出这些类型决策的方法。使用大学生的人口统计学信息和每周 COVID-19 检测结果,我们使用 XGBoost 模型为每个学生生成一个检测呈阳性的估计概率。我们讨论了从人口统计学信息中设计的增加模型拟合度的变量。作为我们方法的一部分,我们在没有课堂内传播的零假设下模拟学期,并将模拟结果的分布与观察到的群体阳性率进行比较,以找到每个群体(例如,部分、住房区域或专业)的初始 p 值。我们使用一种基于模拟的标准错误发现率程序的修改,识别出可能的热点——班级或群体,其 COVID-19 率超过每个感兴趣群体中学生的人口统计学组合所预期的水平。我们使用模拟实验和来自 2020 年秋季的匿名示例来说明我们方法的性能。虽然我们的示例基于大学环境中的热点检测,但该方法可用于监测任何相互关联的组织或人群中传染病的传播。