• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

背景很重要:利用强化学习制定具有人类可读性且与状态相关的疫情应对政策。

Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies.

机构信息

1 Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford , Oxford OX3 7LF , UK.

2 Department of Biostatistics, Vanderbilt University , Nashville, TN 37203 , USA.

出版信息

Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180277. doi: 10.1098/rstb.2018.0277.

DOI:10.1098/rstb.2018.0277
PMID:31104604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6558555/
Abstract

The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possible epidemic. Reinforcement learning (RL) and Monte Carlo control have been used to develop machine-readable context-dependent solutions for complex problems with many possible realizations ranging from video-games to the game of Go. RL could be a valuable tool to generate context-dependent policies for outbreak response, though translating the resulting policies into simple rules that can be read and interpreted by human decision-makers remains a challenge. Here we illustrate the application of RL to the development of context-dependent outbreak response policies to minimize outbreaks of foot-and-mouth disease. We show that control based on the resulting context-dependent policies, which adapt interventions to the specific outbreak, result in smaller outbreaks than static policies. We further illustrate two approaches for translating the complex machine-readable policies into simple heuristics that can be evaluated by human decision-makers. This article is part of the theme issue 'Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control'. This theme issue is linked with the earlier issue 'Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes'.

摘要

给定传染病在给定景观上可能发生的所有可能流行病的数量对于现实世界复杂系统来说是巨大的。此外,不能保证平均而言对所有可能的流行病都是最优的控制措施对于每个可能的流行病也是最佳的。强化学习(RL)和蒙特卡罗控制已被用于为具有许多可能实现的复杂问题开发机器可读的上下文相关解决方案,这些问题的范围从视频游戏到围棋游戏。RL 可能是生成爆发应对的上下文相关策略的有价值工具,尽管将产生的策略转换为人类决策者可以阅读和解释的简单规则仍然是一个挑战。在这里,我们说明了 RL 在开发最小化口蹄疫爆发的上下文相关爆发应对策略中的应用。我们表明,基于特定爆发自适应干预措施的基于结果的上下文相关策略的控制导致的爆发比静态策略小。我们进一步说明了将复杂的机器可读策略转换为人类决策者可以评估的简单启发式的两种方法。本文是主题为“人类、动物和植物传染病爆发建模:流行预测和控制”的一部分。这个主题与之前的主题“人类、动物和植物传染病爆发建模:方法和重要主题”有关。

相似文献

1
Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies.背景很重要:利用强化学习制定具有人类可读性且与状态相关的疫情应对政策。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180277. doi: 10.1098/rstb.2018.0277.
2
Detection, forecasting and control of infectious disease epidemics: modelling outbreaks in humans, animals and plants.传染病疫情的检测、预测和控制:人类、动物和植物疫情爆发的建模。
Philos Trans R Soc Lond B Biol Sci. 2019 Jun 24;374(1775):20190038. doi: 10.1098/rstb.2019.0038.
3
How decision makers can use quantitative approaches to guide outbreak responses.决策者如何利用定量方法来指导疫情应对。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180365. doi: 10.1098/rstb.2018.0365.
4
Applying optimal control theory to complex epidemiological models to inform real-world disease management.将最优控制理论应用于复杂的流行病学模型,以提供现实世界疾病管理的信息。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180284. doi: 10.1098/rstb.2018.0284.
5
Outbreak analytics: a developing data science for informing the response to emerging pathogens.疫情分析:一门新兴的数据科学,旨在为应对新出现的病原体提供信息支持。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180276. doi: 10.1098/rstb.2018.0276.
6
Preface to theme issue 'Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control'.前言:主题问题“人类、动物和植物传染病暴发模型:疫情预测和控制”。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20190375. doi: 10.1098/rstb.2019.0375.
7
Limits to forecasting precision for outbreaks of directly transmitted diseases.直接传播疾病暴发预测精度的局限性。
PLoS Med. 2006 Jan;3(1):e3. doi: 10.1371/journal.pmed.0030003. Epub 2005 Nov 22.
8
Rigorous surveillance is necessary for high confidence in end-of-outbreak declarations for Ebola and other infectious diseases.为了对埃博拉和其他传染病的疫情结束声明有高度信心,进行严格监测是必要的。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180431. doi: 10.1098/rstb.2018.0431.
9
Translating surveillance data into incidence estimates.将监测数据转化为发病率估计值。
Philos Trans R Soc Lond B Biol Sci. 2019 Jul 8;374(1776):20180262. doi: 10.1098/rstb.2018.0262.
10
A modelling approach to support dynamic decision-making in the control of FMD epidemics.一种用于支持口蹄疫疫情控制中动态决策的建模方法。
Prev Vet Med. 2010 Jul 1;95(3-4):167-74. doi: 10.1016/j.prevetmed.2010.04.003. Epub 2010 May 14.

引用本文的文献

1
Potential benefits of adaptive control strategies are outweighed by costs of infrequent, but dramatically larger disease outbreaks.适应性控制策略的潜在益处被罕见但规模巨大的疾病暴发所带来的成本所抵消。
R Soc Open Sci. 2025 Aug 21;12(8):250598. doi: 10.1098/rsos.250598. eCollection 2025 Aug.
2
Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions.用于医疗运营管理的强化学习:方法框架、最新进展及未来研究方向。
Health Care Manag Sci. 2025 Jun;28(2):298-333. doi: 10.1007/s10729-025-09699-6. Epub 2025 Apr 9.
3
Optimal control prevents itself from eradicating stochastic disease epidemics.

本文引用的文献

1
Real-time decision-making during emergency disease outbreaks.突发疾病疫情中的实时决策。
PLoS Comput Biol. 2018 Jul 24;14(7):e1006202. doi: 10.1371/journal.pcbi.1006202. eCollection 2018 Jul.
2
Infectious Disease Surveillance in the Big Data Era: Towards Faster and Locally Relevant Systems.大数据时代的传染病监测:迈向更快且与本地相关的系统
J Infect Dis. 2016 Dec 1;214(suppl_4):S380-S385. doi: 10.1093/infdis/jiw376.
3
Quantifying the Value of Perfect Information in Emergency Vaccination Campaigns.量化紧急疫苗接种活动中完美信息的价值。
最优控制无法根除随机疾病流行。
PLoS Comput Biol. 2025 Feb 18;21(2):e1012781. doi: 10.1371/journal.pcbi.1012781. eCollection 2025 Feb.
4
Integrating artificial intelligence with mechanistic epidemiological modeling: a scoping review of opportunities and challenges.将人工智能与机制性流行病学建模相结合:机遇与挑战的范围综述
Nat Commun. 2025 Jan 10;16(1):581. doi: 10.1038/s41467-024-55461-x.
5
Epidemiology and Transmission Dynamics of Infectious Diseases and Control Measures.传染病的流行病学和传播动力学及控制措施。
Viruses. 2022 Nov 12;14(11):2510. doi: 10.3390/v14112510.
6
A deep reinforcement learning based decision-making approach for avoiding crowd situation within the case of Covid'19 pandemic.一种基于深度强化学习的决策方法,用于在新冠疫情情况下避免人群聚集。
Comput Intell. 2022 Apr;38(2):416-437. doi: 10.1111/coin.12516. Epub 2022 Mar 12.
7
Dynamic resource allocation for controlling pathogen spread on a large metapopulation network.用于控制病原体在大型集合种群网络上传播的动态资源分配
J R Soc Interface. 2022 Mar;19(188):20210744. doi: 10.1098/rsif.2021.0744. Epub 2022 Mar 9.
8
Transient disease dynamics across ecological scales.跨生态尺度的瞬态疾病动态
Theor Ecol. 2021;14(4):625-640. doi: 10.1007/s12080-021-00514-w. Epub 2021 May 27.
9
Research perspectives on animal health in the era of artificial intelligence.人工智能时代的动物健康研究视角。
Vet Res. 2021 Mar 6;52(1):40. doi: 10.1186/s13567-021-00902-4.
10
Anticipating future learning affects current control decisions: A comparison between passive and active adaptive management in an epidemiological setting.对未来学习的预期会影响当前的控制决策:流行病学环境中被动与主动适应性管理的比较。
J Theor Biol. 2020 Dec 7;506:110380. doi: 10.1016/j.jtbi.2020.110380. Epub 2020 Jul 19.
PLoS Comput Biol. 2017 Feb 16;13(2):e1005318. doi: 10.1371/journal.pcbi.1005318. eCollection 2017 Feb.
4
Retrospective Parameter Estimation and Forecast of Respiratory Syncytial Virus in the United States.美国呼吸道合胞病毒的回顾性参数估计与预测
PLoS Comput Biol. 2016 Oct 7;12(10):e1005133. doi: 10.1371/journal.pcbi.1005133. eCollection 2016 Oct.
5
Eradication of Ebola Based on Dynamic Programming.基于动态规划的埃博拉病毒根除策略
Comput Math Methods Med. 2016;2016:1580917. doi: 10.1155/2016/1580917. Epub 2016 May 25.
6
Decision-making for foot-and-mouth disease control: Objectives matter.口蹄疫防控决策:目标至关重要。
Epidemics. 2016 Jun;15:10-9. doi: 10.1016/j.epidem.2015.11.002. Epub 2015 Dec 10.
7
Mastering the game of Go with deep neural networks and tree search.用深度神经网络和树搜索掌握围棋游戏。
Nature. 2016 Jan 28;529(7587):484-9. doi: 10.1038/nature16961.
8
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
9
Ebola cases and health system demand in Liberia.利比里亚的埃博拉病例与卫生系统需求
PLoS Biol. 2015 Jan 13;13(1):e1002056. doi: 10.1371/journal.pbio.1002056. eCollection 2015 Jan.
10
Adaptive management and the value of information: learning via intervention in epidemiology.适应性管理与信息价值:通过流行病学干预进行学习
PLoS Biol. 2014 Oct 21;12(10):e1001970. doi: 10.1371/journal.pbio.1001970. eCollection 2014 Oct.