Preparing for the next COVID: Deep Reinforcement Learning trained Artificial Intelligence discovery of multi-modal immunomodulatory control of systemic inflammation in the absence of effective anti-microbials.

Authors

Larie Dale, An Gary, Cockrell Chase

Affiliation

Department of Surgery, University of Vermont Larner College of Medicine.

Publication information

bioRxiv. 2022 Feb 18:2022.02.17.480940. doi: 10.1101/2022.02.17.480940.

DOI:10.1101/2022.02.17.480940

PMID:35194613

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC8863155/

Abstract

BACKGROUND

Despite great interest in applying artificial intelligence (AI) to sepsis and critical illness, most current approaches are limited in their potential impact. Prediction models do not (and cannot) address the lack of effective therapeutics, and current approaches to improving sepsis treatment focus on optimizing the application of existing interventions, so they cannot address the development of new treatment options or modalities. The inability to test new therapeutic applications was highlighted by the generally unsatisfactory results of drug repurposing efforts in COVID-19.

HYPOTHESIS

Addressing this challenge requires the application of simulation-based, model-free deep reinforcement learning (DRL), in a fashion akin to the training of game-playing AIs. We have previously demonstrated the potential of this method in the context of bacterial sepsis, in which the microbial infection is responsive to antibiotic therapy. The current work addresses the control problem of multi-modal, adaptive immunomodulation in circumstances where there is no effective anti-pathogen therapy (e.g., in a novel viral pandemic or in the face of resistant microbes).

METHODS

This is a proof-of-concept study that determines the controllability of sepsis without the ability to pharmacologically suppress the pathogen. As a surrogate system for control discovery using DRL, we use a previously validated agent-based model, the Innate Immune Response Agent-based Model (IIRABM). The DRL algorithm 'trains' an AI on simulations of infection in which both the control and observation spaces are limited to the immune mediators defined in the IIRABM (11 in total). Policies were learned using the Deep Deterministic Policy Gradient approach, with the objective function being a return to baseline system health.
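The control setup described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: `ToyInflammationEnv` is a hypothetical stand-in for a simulator and is not the IIRABM, the multiplicative action effect and the distance-based reward are invented for illustration, and only the count of 11 mediators comes from the abstract. `td_target` and `soft_update` show two standard ingredients of Deep Deterministic Policy Gradient (the critic's bootstrapped target and Polyak averaging of target-network parameters); the actor and critic networks themselves are omitted.

```python
import random

N_MEDIATORS = 11  # from the abstract; every other constant below is an assumption


class ToyInflammationEnv:
    """Hypothetical toy stand-in for a simulator (NOT the IIRABM)."""

    def __init__(self, seed=0, max_steps=50):
        self.rng = random.Random(seed)
        self.max_steps = max_steps
        self.baseline = [1.0] * N_MEDIATORS  # assumed "healthy" mediator levels
        self.reset()

    def reset(self):
        # Start each episode from a perturbed (infected) state.
        self.state = [b + self.rng.uniform(0.5, 2.0) for b in self.baseline]
        self.t = 0
        return list(self.state)

    def distance_from_baseline(self):
        return sum((s - b) ** 2 for s, b in zip(self.state, self.baseline)) ** 0.5

    def step(self, action):
        # action[i] in [-1, 1]: negative inhibits mediator i, positive augments it.
        assert len(action) == N_MEDIATORS
        clipped = [max(-1.0, min(1.0, a)) for a in action]
        self.state = [
            max(0.0, s * (1.0 + 0.2 * a) + self.rng.gauss(0.0, 0.01))
            for s, a in zip(self.state, clipped)
        ]
        self.t += 1
        # Assumed reward: the closer to baseline health, the higher the reward.
        reward = -self.distance_from_baseline()
        done = self.t >= self.max_steps
        return list(self.state), reward, done


def td_target(reward, done, next_q, gamma=0.99):
    """Critic regression target in DDPG: r + gamma * Q'(s', mu'(s'))."""
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging of target-network parameters, as used in DDPG."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target_params, online_params)]


def run_episode(env, policy, noise_std=0.0):
    """Roll out a deterministic policy with optional Gaussian exploration noise."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        action = [a + env.rng.gauss(0.0, noise_std) for a in policy(obs)]
        obs, reward, done = env.step(action)
        total += reward
    return total


def no_control(obs):
    return [0.0] * N_MEDIATORS


def proportional_control(obs):
    # Hand-written placeholder policy: push each mediator toward the assumed baseline of 1.0.
    return [max(-1.0, min(1.0, 1.0 - o)) for o in obs]
```

In this toy setting, `run_episode(ToyInflammationEnv(seed=1), proportional_control)` returns a higher (less negative) total reward than the same call with `no_control`. In a DRL setup, a learned actor network would take the place of the hand-written policy.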

RESULTS

DRL trained an AI policy that reduced system mortality from 85% to 10.4%. Control actions affected every one of the 11 targetable cytokines and could be divided into those with static/unchanging controls and those with variable/adaptive controls. Adaptive controls primarily targeted 3 different aspects of the immune response: second-order pro-inflammation governing TH1/TH2 balance, primary anti-inflammation, and inflammatory cell proliferation.
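To put the reported figures in perspective, a drop from 85% to 10.4% mortality is an absolute reduction of 74.6 percentage points, or roughly 88% in relative terms. The sketch below reproduces that arithmetic and shows one possible way to separate static from adaptive controls in a recorded action trace. The variance threshold `tol` and the trace format are assumptions made for illustration; the abstract does not say how the authors made this distinction.

```python
from statistics import pvariance


def mortality_reduction(untreated_pct, treated_pct):
    """Return (absolute reduction in percentage points, relative reduction as a fraction)."""
    absolute = untreated_pct - treated_pct
    return absolute, absolute / untreated_pct


def classify_controls(action_trace, tol=1e-3):
    """Label each control channel 'static' or 'adaptive'.

    action_trace: list of per-timestep action vectors (hypothetical policy output).
    tol: assumed variance threshold, not taken from the paper.
    """
    n_channels = len(action_trace[0])
    labels = []
    for i in range(n_channels):
        series = [step[i] for step in action_trace]
        # A channel whose action barely changes over time is treated as static.
        labels.append("adaptive" if pvariance(series) > tol else "static")
    return labels
```

For example, `classify_controls([[0.5, 0.0], [0.5, 1.0], [0.5, -1.0]])` labels the first channel static and the second adaptive.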

DISCUSSION

The current treatment of sepsis is hampered by a lack of therapeutic options able to affect the biology of sepsis. This limitation is heightened in circumstances where no effective antimicrobials exist, as was the case for COVID-19. Current AI methods are intrinsically unable to address this problem; doing so requires training AIs in contexts that fully represent the counterfactual space of potential treatments. The synthetic data needed for this task can only be generated through the use of high-resolution, mechanism-based simulations. Finally, being able to treat sepsis will require a reorientation regarding the sensing and actuating capabilities needed to develop these simulations and bring them to the bedside.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d18/8863155/1075242a825d/nihpp-2022.02.17.480940v1-f0001.jpg
