Ngo Phuong D, Wei Susan, Holubová Anna, Muzik Jan, Godtliebsen Fred
UiT The Arctic University of Norway, Tromsø, Norway.
The University of Melbourne, Australia.
Comput Math Methods Med. 2018 Dec 30;2018:4091497. doi: 10.1155/2018/4091497. eCollection 2018.
Type-1 diabetes is a condition caused by the lack of insulin hormone, which leads to an excessive increase in blood glucose level. The glucose kinetics process is difficult to control due to its complex and nonlinear nature and with state variables that are difficult to measure.
This paper proposes a method for automatically calculating the basal and bolus insulin doses for patients with type-1 diabetes using reinforcement learning with feedforward controller. The algorithm is designed to keep the blood glucose stable and directly compensate for the external events such as food intake. Its performance was assessed using simulation on a blood glucose model. The usage of the Kalman filter with the controller was demonstrated to estimate unmeasurable state variables.
Comparison simulations between the proposed controller with the optimal reinforcement learning and the proportional-integral-derivative controller show that the proposed methodology has the best performance in regulating the fluctuation of the blood glucose. The proposed controller also improved the blood glucose responses and prevented hypoglycemia condition. Simulation of the control system in different uncertain conditions provided insights on how the inaccuracies of carbohydrate counting and meal-time reporting affect the performance of the control system.
The proposed controller is an effective tool for reducing postmeal blood glucose rise and for countering the effects of external known events such as meal intake and maintaining blood glucose at a healthy level under uncertainties.
1型糖尿病是一种由胰岛素激素缺乏引起的疾病,会导致血糖水平过度升高。由于葡萄糖动力学过程复杂且非线性,其状态变量难以测量,因此该过程难以控制。
本文提出了一种使用带有前馈控制器的强化学习为1型糖尿病患者自动计算基础胰岛素剂量和大剂量胰岛素剂量的方法。该算法旨在保持血糖稳定,并直接补偿诸如食物摄入等外部事件的影响。通过在血糖模型上进行仿真来评估其性能。证明了将卡尔曼滤波器与控制器一起使用可估计不可测量的状态变量。
将所提出的带有最优强化学习的控制器与比例积分微分控制器进行比较仿真,结果表明所提出的方法在调节血糖波动方面具有最佳性能。所提出的控制器还改善了血糖反应并预防了低血糖情况。在不同不确定条件下对控制系统进行仿真,为碳水化合物计数和进餐时间报告的不准确如何影响控制系统性能提供了见解。
所提出的控制器是一种有效的工具,可减少餐后血糖升高,应对诸如进餐等外部已知事件的影响,并在不确定情况下将血糖维持在健康水平。