Larie Dale, An Gary, Cockrell Chase
Department of Surgery, University of Vermont Larner College of Medicine.
bioRxiv. 2022 Feb 18:2022.02.17.480940. doi: 10.1101/2022.02.17.480940.
Despite a great deal of interest in the application of artificial intelligence (AI) to sepsis/critical illness, most current approaches are limited in their potential impact: prediction models do not (and cannot) address the lack of effective therapeutics and current approaches to enhancing the treatment of sepsis focus on optimizing the application of existing interventions, and thus cannot address the development of new treatment options/modalities. The inability to test new therapeutic applications was highlighted by the generally unsatisfactory results from drug repurposing efforts in COVID-19.
Addressing this challenge requires the application of simulation-based, model-free deep reinforcement learning (DRL) in a fashion akin to training the game-playing AIs. We have previously demonstrated the potential of this method in the context of bacterial sepsis in which the microbial infection is responsive to antibiotic therapy. The current work addresses the control problem of multi-modal, adaptive immunomodulation in the circumstance where there is no effective anti-pathogen therapy (e.g., in a novel viral pandemic or in the face of resistant microbes).
This is a proof-of-concept study that determines the controllability of sepsis without the ability to pharmacologically suppress the pathogen. We use as a surrogate system a previously validated agent-based model, the Innate Immune Response Agent-based Model (IIRABM), for control discovery using DRL. The DRL algorithm 'trains' an AI on simulations of infection where both the control and observation spaces are limited to operating upon the defined immune mediators included in the IIRABM (a total of 11). Policies were learned using the Deep Deterministic Policy Gradient approach, with the objective function being a return to baseline system health.
DRL trained an AI policy that improved system mortality from 85% to 10.4%. Control actions affected every one of the 11 targetable cytokines and could be divided into those with static/unchanging controls and those with variable/adaptive controls. Adaptive controls primarily targeted 3 different aspects of the immune response: 2 order pro-inflammation governing TH1/TH2 balance, primary anti-inflammation, and inflammatory cell proliferation.
The current treatment of sepsis is hampered by limitations in therapeutic options able to affect the biology of sepsis. This is heightened in circumstances where no effective antimicrobials exist, as was the case for COVID-19. Current AI methods are intrinsically unable to address this problem; doing so requires training AIs in contexts that fully represent the counterfactual space of potential treatments. The synthetic data needed for this task is only possible through the use of high-resolution, mechanism-based simulations. Finally, being able to treat sepsis will require a reorientation as to the sensing and actuating requirements needed to develop these simulations and bring them to the bedside.
尽管人们对将人工智能(AI)应用于脓毒症/危重病有着浓厚兴趣,但目前大多数方法的潜在影响有限:预测模型无法(也不能)解决缺乏有效治疗方法的问题,而当前增强脓毒症治疗的方法侧重于优化现有干预措施的应用,因此无法解决新治疗选择/方式的开发问题。新冠疫情中药物重新利用的总体结果不尽人意,凸显了无法测试新治疗应用的问题。
应对这一挑战需要以类似于训练游戏AI的方式应用基于模拟的、无模型的深度强化学习(DRL)。我们之前已经在细菌脓毒症的背景下证明了这种方法的潜力,其中微生物感染对抗生素治疗有反应。当前的工作解决了在没有有效抗病原体治疗的情况下(例如,在新型病毒大流行或面对耐药微生物时)多模式、适应性免疫调节的控制问题。
这是一项概念验证研究,旨在确定在无法通过药理学方法抑制病原体的情况下脓毒症的可控性。我们使用一个先前经过验证的基于主体的模型——先天性免疫反应基于主体模型(IIRABM)作为替代系统,通过DRL进行控制发现。DRL算法在感染模拟中“训练”AI,其中控制和观察空间仅限于对IIRABM中定义的免疫介质(共11种)进行操作。使用深度确定性策略梯度方法学习策略,目标函数是恢复到基线系统健康状态。
DRL训练了一种AI策略,将系统死亡率从85%降至10.4%。控制行动影响了11种可靶向细胞因子中的每一种,可分为具有静态/不变控制的和具有可变/自适应控制的。自适应控制主要针对免疫反应的3个不同方面:控制TH1/TH2平衡的二级促炎、主要抗炎和炎症细胞增殖方面。
目前脓毒症的治疗受到能够影响脓毒症生物学的治疗选择的限制。在没有有效抗菌药物的情况下,如新冠疫情期间,这种情况更加突出。当前的AI方法本质上无法解决这个问题;要做到这一点,需要在能够充分代表潜在治疗反事实空间的背景下训练AI。这项任务所需的合成数据只有通过使用高分辨率、基于机制的模拟才有可能获得。最后,要能够治疗脓毒症,需要重新定位开发这些模拟并将其应用于临床所需的传感和驱动要求。