Li Yafei, Wang Hongfeng, Wang Na, Zhang Tianhong
College of Information Science and Engineering, Northeastern University, Shenyang, China.
Fundamental Teaching Department of Computer and Mathematics, Shenyang Normal University, Shenyang, China.
Complex Intell Systems. 2022;8(6):4603-4618. doi: 10.1007/s40747-022-00776-9. Epub 2022 Jun 23.
Cloud healthcare system (CHS) can provide the telemedicine services, which is helpful to cope with the difficulty of patients getting medical service in the traditional medical systems. However, resource scheduling in CHS has to face with a great of challenges since managing the trade-off of efficiency and quality becomes complicated due to the uncertainty of patient choice behavior. Motivated by this, a resource scheduling problem with multi-stations queueing network in CHS is studied in this paper. A Markov decision model with uncertainty is developed to optimize the match process of patients and scarce resources with the objective of minimizing the total medical costs that consist of three conflicting sub-costs, i.e., medical costs, waiting time costs and the penalty costs caused by unmuting choice behavior of patients. For solving the proposed model, a three-stage dynamic scheduling method is designed, in which an improved Q-learning algorithm is employed to achieve the optimal schedule. Numerical experimental results show that this Q-learning-based scheduling algorithm outperforms two traditional scheduling algorithms significantly, as well as the balance of the three conflicting sub-costs is kept and the service efficiency is improved.
云医疗系统(CHS)可以提供远程医疗服务,这有助于应对患者在传统医疗系统中获得医疗服务的困难。然而,CHS中的资源调度必须面对巨大的挑战,因为由于患者选择行为的不确定性,管理效率和质量之间的权衡变得复杂。受此启发,本文研究了CHS中具有多站排队网络的资源调度问题。建立了一个具有不确定性的马尔可夫决策模型,以优化患者与稀缺资源的匹配过程,目标是最小化由三个相互冲突的子成本组成的总医疗成本,即医疗成本、等待时间成本和患者未选择行为导致的惩罚成本。为了解决所提出的模型,设计了一种三阶段动态调度方法,其中采用改进的Q学习算法来实现最优调度。数值实验结果表明基于Q学习的调度算法明显优于两种传统调度算法,同时保持了三个相互冲突的子成本的平衡并提高了服务效率。