Li Yi, Tang Zimei, Ren Anwen, Tian Gang, Zhang Jianing, Wang Yiran, Liu Jie, Ming Jie
Department of Breast and Thyroid Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
Department of Radiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
Front Endocrinol (Lausanne). 2025 Jun 17;16:1552479. doi: 10.3389/fendo.2025.1552479. eCollection 2025.
Current guidelines provide a recognized yet broad framework for stratifying recurrence risk in differentiated thyroid cancer (DTC) patients. More precise tools are needed for intermediate- and high-risk groups. This study aims to identify recurrence-associated risk factors and develop a machine learning-based predictive model.
In this retrospective analysis, 2,388 DTC patients were randomly assigned to a training group (1,910 cases) and a validation group (478 cases). Predictive factors were identified using univariate and multivariate analyses. Six machine learning models were trained and validated, with performance evaluated through accuracy, area under the curve, and clinical utility via decision curve analysis.
Independent risk factors for recurrence included intraglandular dissemination, total tumor size, bilateral cervical lymph node involvement, and Hashimoto's thyroiditis, while normal/elevated TSH and multifocal nodules were protective. The random forest model demonstrated the best performance (training accuracy: 0.801; validation accuracy: 0.808). A random forest-based online calculator was developed to facilitate individualized risk assessment in clinical settings.
The random forest model effectively predicts DTC recurrence, offering a practical tool for individualized risk assessment and aiding clinical decision-making.
当前指南为分化型甲状腺癌(DTC)患者复发风险分层提供了一个公认但较为宽泛的框架。对于中高危组,需要更精确的工具。本研究旨在确定复发相关危险因素,并开发一种基于机器学习的预测模型。
在这项回顾性分析中,2388例DTC患者被随机分为训练组(1910例)和验证组(478例)。使用单因素和多因素分析确定预测因素。训练并验证了六种机器学习模型,通过准确性、曲线下面积评估性能,并通过决策曲线分析评估临床实用性。
复发的独立危险因素包括腺内播散、肿瘤总大小、双侧颈部淋巴结受累和桥本甲状腺炎,而正常/升高的促甲状腺激素和多灶性结节具有保护作用。随机森林模型表现最佳(训练准确性:0.801;验证准确性:0.808)。开发了一个基于随机森林的在线计算器,以方便临床环境中的个体化风险评估。
随机森林模型有效地预测了DTC复发,为个体化风险评估提供了实用工具,并有助于临床决策。