Max Planck Institute for Intelligent Systems, Tübingen, Germany.
ETH Zürich, Zürich, Switzerland.
PLoS Comput Biol. 2023 Jan 23;19(1):e1010799. doi: 10.1371/journal.pcbi.1010799. eCollection 2023 Jan.
Simulating the spread of infectious diseases in human communities is critical for predicting the trajectory of an epidemic and verifying various policies to control the devastating impacts of the outbreak. Many existing simulators are based on compartment models that divide people into a few subsets and simulate the dynamics among those subsets using hypothesized differential equations. However, these models lack the requisite granularity to study the effect of intelligent policies that influence every individual in a particular way. In this work, we introduce a simulator software capable of modeling a population structure and controlling the disease's propagation at an individualistic level. In order to estimate the confidence of the conclusions drawn from the simulator, we employ a comprehensive probabilistic approach where the entire population is constructed as a hierarchical random variable. This approach makes the inferred conclusions more robust against sampling artifacts and gives confidence bounds for decisions based on the simulation results. To showcase potential applications, the simulator parameters are set based on the formal statistics of the COVID-19 pandemic, and the outcome of a wide range of control measures is investigated. Furthermore, the simulator is used as the environment of a reinforcement learning problem to find the optimal policies to control the pandemic. The obtained experimental results indicate the simulator's adaptability and capacity in making sound predictions and a successful policy derivation example based on real-world data. As an exemplary application, our results show that the proposed policy discovery method can lead to control measures that produce significantly fewer infected individuals in the population and protect the health system against saturation.
模拟人类社区中传染病的传播对于预测疫情的轨迹和验证各种控制疫情爆发破坏性影响的政策至关重要。许多现有的模拟器基于 compartment 模型,这些模型将人群分为几个子集,并使用假设的微分方程来模拟这些子集中的动态。然而,这些模型缺乏研究以特定方式影响每个人的智能政策效果的必要粒度。在这项工作中,我们引入了一种能够对人群结构进行建模并在个体层面控制疾病传播的模拟器软件。为了估计从模拟器得出的结论的置信度,我们采用了一种全面的概率方法,即将整个人群构建为一个层次随机变量。这种方法使推断出的结论更能抵抗抽样误差,并为基于模拟结果的决策提供置信区间。为了展示潜在的应用,我们根据 COVID-19 大流行的正式统计数据设置了模拟器参数,并研究了广泛的控制措施的结果。此外,我们还将模拟器用作强化学习问题的环境,以找到控制大流行的最佳策略。所获得的实验结果表明,模拟器具有适应性和能力,可以做出合理的预测,并基于实际数据成功得出策略。作为一个示范应用,我们的结果表明,所提出的策略发现方法可以导致在人群中产生更少感染个体的控制措施,并保护卫生系统免受饱和。