Du Xinqi, Chen Hechang, Yang Bo, Long Cheng, Zhao Songwei
School of Artificial Intelligence, Jilin University, Changchun 130012, China.
Engineering Research Center of Knowledge-Driven Human-Machine Intelligence, Ministry of Education, China.
Inf Sci (N Y). 2023 Sep;640:119065. doi: 10.1016/j.ins.2023.119065. Epub 2023 May 9.
Infectious diseases, such as Black Death, Spanish Flu, and COVID-19, have accompanied human history and threatened public health, resulting in enormous infections and even deaths among citizens. Because of their rapid development and huge impact, laying out interventions becomes one of the most critical paths for policymakers to respond to the epidemic. However, the existing studies mainly focus on epidemic control with a single intervention, which makes the epidemic control effectiveness severely compromised. In view of this, we propose a ierarchical einforcement earning decision framework for multi-mode pidemic ontrol with multiple interventions called . We devise an epidemiological model, referred to as , to describe multiple interventions' impact on transmission explicitly, and use it as the environment for HRL4EC. Besides, to address the complexity introduced by multiple interventions, this work transforms the multi-mode intervention decision problem into a multi-level control problem, and employs hierarchical reinforcement learning to find the optimal strategies. Finally, extensive experiments are conducted with real and simulated epidemic data to validate the effectiveness of our proposed method. We further analyze the experiment data in-depth, conclude a series of findings on epidemic intervention strategies, and make a visualization accordingly, which can provide heuristic support for policymakers' pandemic response.
传染病,如黑死病、西班牙流感和新冠疫情,一直伴随着人类历史并威胁着公众健康,导致大量民众感染甚至死亡。由于其快速发展和巨大影响,制定干预措施成为政策制定者应对疫情的最关键途径之一。然而,现有研究主要集中在单一干预措施的疫情控制上,这使得疫情控制效果大打折扣。鉴于此,我们提出了一种用于多模式疫情控制的分层强化学习决策框架,该框架采用多种干预措施,称为HRL4EC。我们设计了一种流行病学模型,称为[具体模型名称未给出],以明确描述多种干预措施对传播的影响,并将其用作HRL4EC的环境。此外,为了解决多种干预措施带来的复杂性,这项工作将多模式干预决策问题转化为多层次控制问题,并采用分层强化学习来寻找最优策略。最后,利用真实和模拟的疫情数据进行了广泛的实验,以验证我们提出的方法的有效性。我们进一步深入分析实验数据,总结了一系列关于疫情干预策略的发现,并据此进行了可视化展示,可为政策制定者应对疫情提供启发式支持。