Jafar Adnan, Fathi Anas El, Haidar Ahmad
Department of Biomedical Engineering, McGill University, Montreal, Canada.
Department of Electrical and Computer Engineering, McGill University, Montreal, Canada.
Comput Methods Programs Biomed. 2021 Mar;200:105936. doi: 10.1016/j.cmpb.2021.105936. Epub 2021 Jan 14.
The hybrid artificial pancreas regulates glucose levels in people with type 1 diabetes. It delivers (i) insulin boluses at meal times based on the meals' carbohydrate content and the carbohydrate ratios (CRs) and (ii) insulin basal, between meals and at night, continuously modulated around individual-specific programmed basal rate. The CRs and programmed basal rate significantly vary between individuals and within the same individual with type 1 diabetes, and using suboptimal values in the hybrid artificial pancreas may degrade glucose control. We propose a reinforcement learning algorithm to adaptively optimize CRs and programmed basal rate to improve the performance of the hybrid artificial pancreas.
The proposed reinforcement learning algorithm was designed using the Q-learning approach. The algorithm learns the optimal actions (CRs and programmed basal rate) by applying them to the individual's state (previous day's glucose levels and insulin delivery) based on an exploration and exploitation trade-off. First, outcomes from our simulator were compared to those of a clinical study in 23 individuals with type 1 diabetes and have yielded similar results. Second, the learning algorithm was tested using the simulator with two scenarios. Scenario 1 has fixed meal sizes and ingestion times and scenario 2 has a more realistic eating behavior with random meal sizes, ingestion times, and carbohydrate counting errors.
After about five weeks, the reinforcement learning algorithm improved the percentage of time spent in target range from 67% to 86.7% in scenario 1 and 65.5% to 86% in scenario 2. The percentage of time spent below 4.0 mmol/L decreased from 9% to 0.9% in scenario 1 and 9.5% to 1.1% in scenario 2.
Results indicate that the proposed algorithm has the potential to improve glucose control in people with type 1 diabetes using the hybrid artificial pancreas. The proposed algorithm is a key in making the hybrid artificial pancreas adaptive for the long-term real life outpatient studies.
混合式人工胰腺可调节1型糖尿病患者的血糖水平。它在进餐时根据食物的碳水化合物含量和碳水化合物比例(CRs)输注(i)胰岛素大剂量,以及在两餐之间和夜间持续围绕个体特定的预设基础输注率调制(ii)基础胰岛素。1型糖尿病患者个体之间以及同一个体内的CRs和预设基础输注率差异显著,在混合式人工胰腺中使用次优值可能会降低血糖控制效果。我们提出一种强化学习算法,以自适应地优化CRs和预设基础输注率,从而提高混合式人工胰腺的性能。
所提出的强化学习算法采用Q学习方法设计。该算法基于探索与利用的权衡,将最优行动(CRs和预设基础输注率)应用于个体状态(前一天的血糖水平和胰岛素输注情况)来进行学习。首先,将我们模拟器的结果与一项针对23名1型糖尿病患者的临床研究结果进行比较,结果相似。其次,使用模拟器在两种场景下测试学习算法。场景1中进餐量和进餐时间固定,场景2中进餐行为更贴近现实,进餐量、进餐时间和碳水化合物计数存在随机误差。
约五周后,强化学习算法在场景1中将处于目标范围内的时间百分比从67%提高到86.7%,在场景2中从65.5%提高到86%。血糖低于4.0 mmol/L的时间百分比在场景1中从9%降至0.9%,在场景2中从9.5%降至1.1%。
结果表明,所提出的算法有潜力改善使用混合式人工胰腺的1型糖尿病患者的血糖控制。所提出的算法是使混合式人工胰腺适用于长期现实生活门诊研究的关键。