Universidad Politecnica de Cartagena (UPCT), Department of Information Technologies and Communications, Cartagena, Spain.
Universidad Politecnica de Cartagena (UPCT), Department of Electronics, Computer Technology and Projects, Cartagena, Spain.
PLoS One. 2022 Sep 13;17(9):e0274608. doi: 10.1371/journal.pone.0274608. eCollection 2022.
Diabetes mellitus is a disease associated with abnormally high levels of blood glucose due to a lack of insulin. Combining an insulin pump and continuous glucose monitor with a control algorithm to deliver insulin is an alternative to patient self-management of insulin doses to control blood glucose levels in diabetes mellitus patients. In this work, we propose a closed-loop control for blood glucose levels based on deep reinforcement learning. We describe the initial evaluation of several alternatives conducted on a realistic simulator of the glucoregulatory system and propose a particular implementation strategy based on reducing the frequency of the observations and rewards passed to the agent, and using a simple reward function. We train agents with that strategy for three groups of patient classes, evaluate and compare it with alternative control baselines. Our results show that our method is able to outperform baselines as well as similar recent proposals, by achieving longer periods of safe glycemic state and low risk.
糖尿病是一种由于缺乏胰岛素导致血糖水平异常升高的疾病。将胰岛素泵和连续血糖监测仪与控制算法相结合,以输送胰岛素,是替代糖尿病患者自我管理胰岛素剂量以控制血糖水平的一种方法。在这项工作中,我们提出了一种基于深度学习的血糖水平闭环控制方法。我们描述了在血糖调节系统的真实模拟器上进行的几种替代方案的初步评估,并提出了一种基于减少传递给代理的观察和奖励的频率,以及使用简单奖励函数的特定实现策略。我们使用该策略为三组患者类别训练代理,并对其进行评估并与替代控制基线进行比较。我们的结果表明,我们的方法能够通过实现更长时间的安全血糖状态和低风险,从而优于基线和类似的最新提案。