Sun Wenzheng, Dang Jun, Zhang Lei, Wei Qichun, Li Chao, Liu Ye, Jing Huang, Huang Kanghua, Zhang Yuanpeng, Li Bing
Department of Radiation Oncology, The Second Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang, 310009, China.
Department of Radiation Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital & Shenzhen Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Shenzhen, Guangdong, 518116, China.
Radiat Oncol. 2025 Jun 6;20(1):97. doi: 10.1186/s13014-025-02676-x.
To investigate the optimal training dataset size (TDS) for respiration prediction accuracy using a long short-term memory (LSTM) model.
The respiratory signals of 151 patients acquired with the real-time position management system were retrospectively included in this study. Among the dataset, 101 respiratory signals were utilized to evaluate the impact of the TDS on prediction accuracy, while the remaining 50 signals were employed for setting the default hyperparameters. The prediction accuracy of the LSTM model using eight different TDSs (10 s, 20 s, 30 s, 60 s, 90 s, 110 s, 130 s, and 150 s) was examined and evaluated by the root mean square error (RMSE) between the real and predicted respiratory signals. The interplay effects of the main hyperparameters, the ahead time and the different testing data lengths using different TDSs were also measured.
For the 520 ms ahead time, the root mean square error values of the LSTM model using the eight different training data sizes listed above were 0.146 cm, 0.137 cm, 0.134 cm, 0.125 cm, 0.120 cm, 0.121 cm, 0.121 cm, and 0.119 cm, respectively. The LSTM model achieved the highest prediction accuracy when the TDS was 150 s. The prediction accuracy was stable when the TDS exceeded 90 s.
TDS selection could influence the respiration signal prediction accuracy of the LSTM model. The relationship between TDS and the prediction accuracy of the LSTM model was not linear. The 90 s seemed to be an optimal TDS for the respiration signal prediction tasks using the LSTM model, as it was the shortest time at which a favorable prediction accuracy was maintained in this study.
使用长短期记忆(LSTM)模型研究用于呼吸预测准确性的最佳训练数据集大小(TDS)。
本研究回顾性纳入了151例患者通过实时位置管理系统获取的呼吸信号。在数据集中,101个呼吸信号用于评估TDS对预测准确性的影响,其余50个信号用于设置默认超参数。使用八个不同的TDS(10秒、20秒、30秒、60秒、90秒、110秒、130秒和150秒)的LSTM模型的预测准确性通过真实和预测呼吸信号之间的均方根误差(RMSE)进行检验和评估。还测量了主要超参数、提前时间和使用不同TDS的不同测试数据长度之间的相互作用效应。
对于提前520毫秒的情况,使用上述八种不同训练数据大小的LSTM模型的均方根误差值分别为0.146厘米、0.137厘米、0.134厘米、0.125厘米、0.120厘米、0.121厘米、0.121厘米和0.119厘米。当TDS为150秒时,LSTM模型实现了最高的预测准确性。当TDS超过90秒时,预测准确性稳定。
TDS的选择会影响LSTM模型的呼吸信号预测准确性。TDS与LSTM模型预测准确性之间的关系不是线性的。90秒似乎是使用LSTM模型进行呼吸信号预测任务的最佳TDS,因为它是本研究中保持良好预测准确性的最短时间。