Suppr超能文献

多辖区传染病爆发控制的深度强化学习框架。

Deep reinforcement learning framework for controlling infectious disease outbreaks in the context of multi-jurisdictions.

机构信息

MGH Institute for Technology Assessment, Harvard Medical School, Boston, MA 02114, USA.

Mechanical and Industrial Engineering Department, University of Massachusetts Amherst, Amherst, MA 01003, USA.

出版信息

Math Biosci Eng. 2023 Jun 29;20(8):14306-14326. doi: 10.3934/mbe.2023640.

Abstract

In the absence of pharmaceutical interventions, social distancing and lockdown have been key options for controlling new or reemerging respiratory infectious disease outbreaks. The timely implementation of these interventions is vital for effectively controlling and safeguarding the economy.Motivated by the COVID-19 pandemic, we evaluated whether, when, and to what level lockdowns are necessary to minimize epidemic and economic burdens of new disease outbreaks. We formulated the question as a sequential decision-making Markov Decision Process and solved it using deep Q-network algorithm. We evaluated the question under two objective functions: a 2-objective function to minimize economic burden and hospital capacity violations, suitable for diseases with severe health risks but with minimal death, and a 3-objective function that additionally minimizes the number of deaths, suitable for diseases that have high risk of mortality.A key feature of the model is that we evaluated the above questions in the context of two-geographical jurisdictions that interact through travel but make autonomous and independent decisions, evaluating under cross-jurisdictional cooperation and non-cooperation. In the 2-objective function under cross-jurisdictional cooperation, the optimal policy was to aim for shutdowns at 50 and 25% per day. Though this policy avoided hospital capacity violations, the shutdowns extended until a large proportion of the population reached herd immunity. Delays in initiating this optimal policy or non-cooperation from an outside jurisdiction required shutdowns at a higher level of 75% per day, thus adding to economic burdens. In the 3-objective function, the optimal policy under cross-jurisdictional cooperation was to aim for shutdowns of up to 75% per day to prevent deaths by reducing infected cases. This optimal policy continued for the entire duration of the simulation, suggesting that, until pharmaceutical interventions such as treatment or vaccines become available, contact reductions through physical distancing would be necessary to minimize deaths. Deviating from this policy increased the number of shutdowns and led to several deaths.In summary, we present a decision-analytic methodology for identifying optimal lockdown strategy under the context of interactions between jurisdictions that make autonomous and independent decisions. The numerical analysis outcomes are intuitive and, as expected, serve as proof of the feasibility of such a model. Our sensitivity analysis demonstrates that the optimal policy exhibits robustness to minor alterations in the transmission rate, yet shows sensitivity to more substantial deviations. This finding underscores the dynamic nature of epidemic parameters, thereby emphasizing the necessity for models trained across a diverse range of values to ensure effective policy-making.

摘要

在缺乏药物干预的情况下,社交距离和封锁一直是控制新出现或重新出现的呼吸道传染病爆发的关键选择。这些干预措施的及时实施对于有效控制和保障经济至关重要。受 COVID-19 大流行的启发,我们评估了封锁对于最小化新疾病爆发的疫情和经济负担的必要性、时间和程度。我们将这个问题表述为一个序贯决策马尔可夫决策过程,并使用深度 Q 网络算法进行求解。我们在两个目标函数下评估了这个问题:一个 2 目标函数,用于最小化经济负担和医院容量违规,适用于健康风险严重但死亡率较低的疾病;一个 3 目标函数,除了最小化死亡率外,还额外最小化死亡人数,适用于死亡率较高的疾病。模型的一个关键特征是,我们在两个地理辖区的背景下评估了上述问题,这两个辖区通过旅行相互作用,但做出自主和独立的决策,在跨辖区合作和非合作下进行评估。在跨辖区合作的 2 目标函数下,最优策略是每天将关闭率设定为 50%和 25%。虽然这一政策避免了医院容量违规,但关闭时间会延长,直到很大一部分人口达到群体免疫。延迟启动这一最优政策或来自外部辖区的非合作,需要每天关闭率达到 75%以上,从而增加经济负担。在 3 目标函数下,跨辖区合作的最优政策是每天将关闭率设定为 75%以下,以减少感染病例来防止死亡。这一最优政策持续到模拟的整个期间,这表明,在获得药物干预(如治疗或疫苗)之前,通过物理隔离减少接触将是减少死亡人数的必要措施。偏离这一政策会增加关闭次数,并导致一些死亡。总之,我们提出了一种决策分析方法,用于在自主和独立决策的辖区之间的相互作用背景下确定最优的封锁策略。数值分析结果直观,并且如预期的那样,证明了这种模型的可行性。我们的敏感性分析表明,最优政策对传播率的微小变化具有稳健性,但对更大的偏差较为敏感。这一发现强调了疫情参数的动态性质,从而强调了需要训练跨越广泛范围值的模型,以确保有效的政策制定。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验