Li Can, Guo Yuqi, Lin Xinyan, Feng Xuezhen, Xu Dachuan, Yang Ruijie
Institute of Operations Research and Information Engineering, Beijing University of Technology, Beijing 100124, PR China.
Department of Radiation Oncology, Cancer Center, Peking University Third Hospital, Beijing, 100191, China; School of Physics, Beihang University, Beijing, 102206, China.
Phys Med. 2024 Sep;125:104498. doi: 10.1016/j.ejmp.2024.104498. Epub 2024 Aug 19.
The formulation and optimization of radiation therapy plans are complex and time-consuming processes that heavily rely on the expertise of medical physicists. Consequently, there is an urgent need for automated optimization methods. Recent advancements in reinforcement learning, particularly deep reinforcement learning (DRL), show great promise for automating radiotherapy planning. This review summarizes the current state of DRL applications in this field, evaluates their effectiveness, and identifies challenges and future directions.
A systematic search was conducted in Google Scholar, PubMed, IEEE Xplore, and Scopus using keywords such as "deep reinforcement learning", "radiation therapy", and "treatment planning". The extracted data were synthesized for an overview and critical analysis.
The application of deep reinforcement learning in radiation therapy plan optimization can generally be divided into three categories: optimizing treatment planning parameters, directly optimizing machine parameters, and adaptive radiotherapy. From the perspective of disease sites, DRL has been applied to cervical cancer, prostate cancer, vestibular schwannoma, and lung cancer. Regarding types of radiation therapy, it has been used in HDRBT, IMRT, SBRT, VMAT, GK, and Cyberknife.
Deep reinforcement learning technology has played a significant role in advancing the automated optimization of radiation therapy plans. However, there is still a considerable gap before it can be widely applied in clinical settings due to three main reasons: inefficiency, limited methods for quality assessment, and poor interpretability. To address these challenges, significant research opportunities exist in the future, such as constructing evaluators, parallelized training, and exploring continuous action spaces.
放射治疗计划的制定和优化是复杂且耗时的过程,严重依赖医学物理学家的专业知识。因此,迫切需要自动化优化方法。强化学习的最新进展,特别是深度强化学习(DRL),在放射治疗计划自动化方面显示出巨大潜力。本综述总结了DRL在该领域的应用现状,评估了其有效性,并确定了挑战和未来方向。
在谷歌学术、PubMed、IEEE Xplore和Scopus中使用“深度强化学习”、“放射治疗”和“治疗计划”等关键词进行系统搜索。对提取的数据进行综合,以进行概述和批判性分析。
深度强化学习在放射治疗计划优化中的应用一般可分为三类:优化治疗计划参数、直接优化机器参数和自适应放射治疗。从疾病部位来看,DRL已应用于宫颈癌、前列腺癌、前庭神经鞘瘤和肺癌。在放射治疗类型方面,它已用于高剂量率近距离放疗(HDRBT)、调强放疗(IMRT)、立体定向体部放疗(SBRT)、容积旋转调强放疗(VMAT)、伽玛刀(GK)和赛博刀。
深度强化学习技术在推进放射治疗计划的自动化优化方面发挥了重要作用。然而,由于三个主要原因,在其能够广泛应用于临床之前仍存在相当大的差距:效率低下、质量评估方法有限以及可解释性差。为应对这些挑战,未来存在重大的研究机会,例如构建评估器、并行训练以及探索连续动作空间。