强化学习在糖尿病血糖控制中的应用：一项系统综述。

Tejedor Miguel, Woldaregay Ashenafi Zebene, Godtliebsen Fred

Department of Computer Science, University of Tromsø-The Arctic University of Norway, Norway.

Artif Intell Med. 2020 Apr;104:101836. doi: 10.1016/j.artmed.2020.101836. Epub 2020 Feb 21.

BACKGROUND

Reinforcement learning (RL) is a computational approach to understanding and automating goal-directed learning and decision-making. It is designed for problems which include a learning agent interacting with its environment to achieve a goal. For example, blood glucose (BG) control in diabetes mellitus (DM), where the learning agent and its environment are the controller and the body of the patient respectively. RL algorithms could be used to design a fully closed-loop controller, providing a truly personalized insulin dosage regimen based exclusively on the patient's own data.

OBJECTIVE

In this review we aim to evaluate state-of-the-art RL approaches to designing BG control algorithms in DM patients, reporting successfully implemented RL algorithms in closed-loop, insulin infusion, decision support and personalized feedback in the context of DM.

METHODS

An exhaustive literature search was performed using different online databases, analyzing the literature from 1990 to 2019. In a first stage, a set of selection criteria were established in order to select the most relevant papers according to the title, keywords and abstract. Research questions were established and answered in a second stage, using the information extracted from the articles selected during the preliminary selection.

RESULTS

The initial search using title, keywords, and abstracts resulted in a total of 404 articles. After removal of duplicates from the record, 347 articles remained. An independent analysis and screening of the records against our inclusion and exclusion criteria defined in Methods section resulted in removal of 296 articles, leaving 51 relevant articles. A full-text assessment was conducted on the remaining relevant articles, which resulted in 29 relevant articles that were critically analyzed. The inter-rater agreement was measured using Cohen Kappa test, and disagreements were resolved through discussion.

CONCLUSIONS

The advances in health technologies and mobile devices have facilitated the implementation of RL algorithms for optimal glycemic regulation in diabetes. However, there exists few articles in the literature focused on the application of these algorithms to the BG regulation problem. Moreover, such algorithms are designed for control tasks as BG adjustment and their use have increased recently in the diabetes research area, therefore we foresee RL algorithms will be used more frequently for BG control in the coming years. Furthermore, in the literature there is a lack of focus on aspects that influence BG level such as meal intakes and physical activity (PA), which should be included in the control problem. Finally, there exists a need to perform clinical validation of the algorithms.

背景

强化学习（RL）是一种用于理解和自动化目标导向学习与决策的计算方法。它适用于包括学习智能体与环境交互以实现目标的问题。例如，糖尿病（DM）中的血糖（BG）控制，其中学习智能体和环境分别是控制器和患者的身体。RL算法可用于设计完全闭环控制器，仅根据患者自身数据提供真正个性化的胰岛素剂量方案。

目的

在本综述中，我们旨在评估用于设计糖尿病患者BG控制算法的最新RL方法，报告在糖尿病背景下成功应用于闭环、胰岛素输注、决策支持和个性化反馈的RL算法。

方法

使用不同的在线数据库进行了详尽的文献检索，分析了1990年至2019年的文献。在第一阶段，建立了一组选择标准，以便根据标题、关键词和摘要选择最相关的论文。在第二阶段，利用从初步筛选中选定文章提取的信息，确定并回答了研究问题。

结果

最初使用标题、关键词和摘要进行检索共得到404篇文章。去除记录中的重复项后，还剩347篇文章。根据我们在方法部分定义的纳入和排除标准对记录进行独立分析和筛选，排除了296篇文章，剩下51篇相关文章。对其余相关文章进行了全文评估，最终对29篇相关文章进行了批判性分析。使用Cohen Kappa检验测量评分者间一致性，分歧通过讨论解决。

结论

健康技术和移动设备的进步促进了RL算法在糖尿病最佳血糖调节中的应用。然而，文献中很少有文章关注这些算法在BG调节问题上的应用。此外，此类算法是为BG调整等控制任务设计的，最近在糖尿病研究领域的应用有所增加，因此我们预计未来几年RL算法将更频繁地用于BG控制。此外，文献中缺乏对影响BG水平的因素（如饮食摄入和身体活动（PA））的关注，而这些因素应纳入控制问题。最后，需要对算法进行临床验证。

相似文献

Reinforcement learning application in diabetes blood glucose control: A systematic review.

Artif Intell Med. 2020 Apr;104:101836. doi: 10.1016/j.artmed.2020.101836. Epub 2020 Feb 21.

Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.

Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Data-Driven Blood Glucose Pattern Classification and Anomalies Detection: Machine-Learning Applications in Type 1 Diabetes.

J Med Internet Res. 2019 May 1;21(5):e11030. doi: 10.2196/11030.

Hybrid closed-loop systems for managing blood glucose levels in type 1 diabetes: a systematic review and economic modelling.

Health Technol Assess. 2024 Dec;28(80):1-190. doi: 10.3310/JYPL3536.

Closed-loop artificial pancreas using subcutaneous glucose sensing and insulin delivery and a model predictive control algorithm: the Virginia experience.

J Diabetes Sci Technol. 2009 Sep 1;3(5):1031-8. doi: 10.1177/193229680900300506.

Offline reinforcement learning for safer blood glucose control in people with type 1 diabetes.

J Biomed Inform. 2023 Jun;142:104376. doi: 10.1016/j.jbi.2023.104376. Epub 2023 May 4.

Telemedicine Services for the Arctic: A Systematic Review.

JMIR Med Inform. 2017 Jun 28;5(2):e16. doi: 10.2196/medinform.6323.

Enhancing automatic closed-loop glucose control in type 1 diabetes with an adaptive meal bolus calculator - in silico evaluation under intra-day variability.

Comput Methods Programs Biomed. 2017 Jul;146:125-131. doi: 10.1016/j.cmpb.2017.05.010. Epub 2017 Jun 1.

引用本文的文献

Privacy-Preserving Glycemic Management in Type 1 Diabetes: Development and Validation of a Multiobjective Federated Reinforcement Learning Framework.

JMIR Diabetes. 2025 Jul 4;10:e72874. doi: 10.2196/72874.

Comprehensive review of reinforcement learning for medical ultrasound imaging.

Artif Intell Rev. 2025;58(9):284. doi: 10.1007/s10462-025-11268-w. Epub 2025 Jun 23.

: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes.

Adv Neural Inf Process Syst. 2024;37:130536-130568.

Better Blood Pressure Control for Stroke Patients in the ICU: A Deep Reinforcement Learning with Supervised Guidance Approach for Adaptive Infusion Rate Tuning.

AMIA Annu Symp Proc. 2025 May 22;2024:271-280. eCollection 2024.

Chronic Kidney Disease-Mineral and Bone Disorder Management in 4D: The Case for Dynamic Treatment Regime Methods to Optimize Care.

Curr Osteoporos Rep. 2025 Mar 25;23(1):16. doi: 10.1007/s11914-025-00911-8.

Impact of machine learning on dietary and exercise behaviors in type 2 diabetes self-management: a systematic literature review.

PeerJ Comput Sci. 2025 Feb 3;11:e2568. doi: 10.7717/peerj-cs.2568. eCollection 2025.

A safe-enhanced fully closed-loop artificial pancreas controller based on deep reinforcement learning.

PLoS One. 2025 Jan 27;20(1):e0317662. doi: 10.1371/journal.pone.0317662. eCollection 2025.

Reinforcement Learning: A Paradigm Shift in Personalized Blood Glucose Management for Diabetes.

Biomedicines. 2024 Sep 21;12(9):2143. doi: 10.3390/biomedicines12092143.

Optimal Dynamic Regimes for CO Oxidation Discovered by Reinforcement Learning.

ACS Omega. 2024 Jun 20;9(26):27987-27997. doi: 10.1021/acsomega.3c10422. eCollection 2024 Jul 2.

An automatic deep reinforcement learning bolus calculator for automated insulin delivery systems.

Sci Rep. 2024 Jul 2;14(1):15245. doi: 10.1038/s41598-024-62912-4.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Reinforcement learning application in diabetes blood glucose control: A systematic review.

Artif Intell Med. 2020 Apr;104:101836. doi: 10.1016/j.artmed.2020.101836. Epub 2020 Feb 21.

Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.

Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Data-Driven Blood Glucose Pattern Classification and Anomalies Detection: Machine-Learning Applications in Type 1 Diabetes.

J Med Internet Res. 2019 May 1;21(5):e11030. doi: 10.2196/11030.

Hybrid closed-loop systems for managing blood glucose levels in type 1 diabetes: a systematic review and economic modelling.

Health Technol Assess. 2024 Dec;28(80):1-190. doi: 10.3310/JYPL3536.

Closed-loop artificial pancreas using subcutaneous glucose sensing and insulin delivery and a model predictive control algorithm: the Virginia experience.

J Diabetes Sci Technol. 2009 Sep 1;3(5):1031-8. doi: 10.1177/193229680900300506.

Offline reinforcement learning for safer blood glucose control in people with type 1 diabetes.

J Biomed Inform. 2023 Jun;142:104376. doi: 10.1016/j.jbi.2023.104376. Epub 2023 May 4.

Telemedicine Services for the Arctic: A Systematic Review.

JMIR Med Inform. 2017 Jun 28;5(2):e16. doi: 10.2196/medinform.6323.

Enhancing automatic closed-loop glucose control in type 1 diabetes with an adaptive meal bolus calculator - in silico evaluation under intra-day variability.

Comput Methods Programs Biomed. 2017 Jul;146:125-131. doi: 10.1016/j.cmpb.2017.05.010. Epub 2017 Jun 1.

引用本文的文献

Privacy-Preserving Glycemic Management in Type 1 Diabetes: Development and Validation of a Multiobjective Federated Reinforcement Learning Framework.

JMIR Diabetes. 2025 Jul 4;10:e72874. doi: 10.2196/72874.

Comprehensive review of reinforcement learning for medical ultrasound imaging.

Artif Intell Rev. 2025;58(9):284. doi: 10.1007/s10462-025-11268-w. Epub 2025 Jun 23.

: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes.

Adv Neural Inf Process Syst. 2024;37:130536-130568.

Better Blood Pressure Control for Stroke Patients in the ICU: A Deep Reinforcement Learning with Supervised Guidance Approach for Adaptive Infusion Rate Tuning.

AMIA Annu Symp Proc. 2025 May 22;2024:271-280. eCollection 2024.

Chronic Kidney Disease-Mineral and Bone Disorder Management in 4D: The Case for Dynamic Treatment Regime Methods to Optimize Care.

Curr Osteoporos Rep. 2025 Mar 25;23(1):16. doi: 10.1007/s11914-025-00911-8.

Impact of machine learning on dietary and exercise behaviors in type 2 diabetes self-management: a systematic literature review.

PeerJ Comput Sci. 2025 Feb 3;11:e2568. doi: 10.7717/peerj-cs.2568. eCollection 2025.

A safe-enhanced fully closed-loop artificial pancreas controller based on deep reinforcement learning.

PLoS One. 2025 Jan 27;20(1):e0317662. doi: 10.1371/journal.pone.0317662. eCollection 2025.

Reinforcement Learning: A Paradigm Shift in Personalized Blood Glucose Management for Diabetes.

Biomedicines. 2024 Sep 21;12(9):2143. doi: 10.3390/biomedicines12092143.

Optimal Dynamic Regimes for CO Oxidation Discovered by Reinforcement Learning.

ACS Omega. 2024 Jun 20;9(26):27987-27997. doi: 10.1021/acsomega.3c10422. eCollection 2024 Jul 2.

An automatic deep reinforcement learning bolus calculator for automated insulin delivery systems.

Sci Rep. 2024 Jul 2;14(1):15245. doi: 10.1038/s41598-024-62912-4.

Reinforcement learning application in diabetes blood glucose control: A systematic review.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献