用于医院择期入院马尔可夫决策过程模型的可扩展近似策略

Scalable approximate policies for Markov decision process models of hospital elective admissions.

作者信息

Zhu George, Lizotte Dan, Hoey Jesse

机构信息

School of Computer Science, University of Waterloo, 200 University Avenue W., Waterloo, Ontario, Canada N2L 1Z2.

出版信息

Artif Intell Med. 2014 May;61(1):21-34. doi: 10.1016/j.artmed.2014.04.001. Epub 2014 Apr 13.

DOI:10.1016/j.artmed.2014.04.001

PMID:24791675

Abstract

OBJECTIVE

To demonstrate the feasibility of using stochastic simulation methods for the solution of a large-scale Markov decision process model of on-line patient admissions scheduling.

METHODS

The problem of admissions scheduling is modeled as a Markov decision process in which the states represent numbers of patients using each of a number of resources. We investigate current state-of-the-art real time planning methods to compute solutions to this Markov decision process. Due to the complexity of the model, traditional model-based planners are limited in scalability since they require an explicit enumeration of the model dynamics. To overcome this challenge, we apply sample-based planners along with efficient simulation techniques that given an initial start state, generate an action on-demand while avoiding portions of the model that are irrelevant to the start state. We also propose a novel variant of a popular sample-based planner that is particularly well suited to the elective admissions problem.

RESULTS

Results show that the stochastic simulation methods allow for the problem size to be scaled by a factor of almost 10 in the action space, and exponentially in the state space. We have demonstrated our approach on a problem with 81 actions, four specialities and four treatment patterns, and shown that we can generate solutions that are near-optimal in about 100s.

CONCLUSION

Sample-based planners are a viable alternative to state-based planners for large Markov decision process models of elective admissions scheduling.

摘要

目的

证明使用随机模拟方法解决在线患者入院调度大规模马尔可夫决策过程模型的可行性。

方法

将入院调度问题建模为一个马尔可夫决策过程，其中状态表示使用多种资源中每种资源的患者数量。我们研究当前最先进的实时规划方法来计算该马尔可夫决策过程的解决方案。由于模型的复杂性，传统的基于模型的规划器在可扩展性方面受到限制，因为它们需要对模型动态进行显式枚举。为了克服这一挑战，我们应用基于样本的规划器以及高效的模拟技术，给定初始起始状态，按需生成动作，同时避开与起始状态无关的模型部分。我们还提出了一种流行的基于样本的规划器的新颖变体，它特别适合选择性入院问题。

结果

结果表明，随机模拟方法在动作空间中可将问题规模扩大近10倍，在状态空间中呈指数级扩大。我们在一个具有81个动作、四个专科和四种治疗模式的问题上展示了我们的方法，并表明我们可以在大约100秒内生成接近最优的解决方案。

结论

对于选择性入院调度的大型马尔可夫决策过程模型，基于样本的规划器是基于状态的规划器的可行替代方案。

相似文献

Scalable approximate policies for Markov decision process models of hospital elective admissions.

Artif Intell Med. 2014 May;61(1):21-34. doi: 10.1016/j.artmed.2014.04.001. Epub 2014 Apr 13.

Markov decision process applied to the control of hospital elective admissions.

Artif Intell Med. 2009 Oct;47(2):159-71. doi: 10.1016/j.artmed.2009.07.003. Epub 2009 Aug 21.

Artificial intelligence framework for simulating clinical decision-making: a Markov decision process approach.

Artif Intell Med. 2013 Jan;57(1):9-19. doi: 10.1016/j.artmed.2012.12.003. Epub 2012 Dec 31.

Information space receding horizon control.

IEEE Trans Cybern. 2013 Dec;43(6):2255-60. doi: 10.1109/TSMCB.2012.2236313.

Strategic level proton therapy patient admission planning: a Markov decision process modeling approach.

Health Care Manag Sci. 2017 Jun;20(2):286-302. doi: 10.1007/s10729-016-9354-6. Epub 2016 Jan 25.

Efficient methods for studying stochastic disease and population dynamics.

Theor Popul Biol. 2009 Mar-May;75(2-3):133-41. doi: 10.1016/j.tpb.2009.01.003. Epub 2009 Jan 21.

Simulation-based approximate policy iteration for dynamic patient scheduling for radiation therapy.

Health Care Manag Sci. 2018 Sep;21(3):317-325. doi: 10.1007/s10729-016-9388-9. Epub 2016 Oct 20.

Monte Carlo estimation of total variation distance of Markov chains on large spaces, with application to phylogenetics.

Stat Appl Genet Mol Biol. 2013 Mar 26;12(1):39-48. doi: 10.1515/sagmb-2012-0023.

Markov models in medical decision making: a practical guide.

Med Decis Making. 1993 Oct-Dec;13(4):322-38. doi: 10.1177/0272989X9301300409.

A Bayesian method for construction of Markov models to describe dynamics on various time-scales.

J Chem Phys. 2010 Oct 14;133(14):144113. doi: 10.1063/1.3496438.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于医院择期入院马尔可夫决策过程模型的可扩展近似策略

Scalable approximate policies for Markov decision process models of hospital elective admissions.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献