Ma Fuqiang, Yu Fengchang, Lv Shenhui, Zhang Lihua, Lu Zhilin, Zhou Quan, Mao He-Rong, Zhang Lele, Xiang Nan
Department of Integrated Traditional and Western Medicine,The First Affiliated Hospital, and College of Clinical Medicine of Henan University of Science and Technology, Luoyang, China.
Hubei University of Chinese Medicine, Wuhan, Hubei, China.
BMJ Open. 2025 May 7;15(5):e093466. doi: 10.1136/bmjopen-2024-093466.
To develop and validate a machine learning (ML) model to differentiate malignant from benign thyroid nodules (TNs) based on the routine data and provide diagnostic assistance for medical professionals.
A qualified panel of 1649 patients with TNs from one hospital were stratified by gender, age, free triiodothyronine (FT3), free thyroxine (FT4) and thyroid peroxidase antibody (TPOAB).
Thyroid function (TF) data of 1649 patients with TNs were collected in a single centre from January 2018 to June 2022, with a total of 273 males and 1376 females, respectively.
Seven popular ML models (Random Forest, Decision Tree, Logistic Regression (LR), K-Neighbours, Gaussian Naive Bayes, Multilayer Perception and Gradient Boosting) were developed to predict malignant and benign TNs, whose performance indicators included area under the curve (AUC), accuracy, recall, precision and F1 score.
A total of 1649 patients were enrolled in this study, with the median age of 45.15±13.41 years, and the male to female ratio was 1:5.055. In the multivariate LR analysis, statistically significant differences existed between the TNs group and thyroid cancer group in gender, age, free triiodothyronine (FT3), free thyroxine (FT4) and TPOAB. Among the seven tested ML models, the best performance was achieved in the Gradient Boosting model in terms of precision, AUC, accuracy, recall and F1 score, with the AUC of 0.82, accuracy of 79.4% and precision of 0.814 after experimental verification. FT4, TPOAB and FT3 were validated as the top three features in the Gradient Boosting model.
This study innovatively developed a predictive model for benign and malignant TNs based on the Gradient Boosting Decision Tree algorithm. For the first time, it validated the clinical predictive value of TF parameters (FT4, FT3) and TPOAB as key biomarkers.
开发并验证一种基于常规数据区分甲状腺结节(TN)良恶性的机器学习(ML)模型,为医学专业人员提供诊断辅助。
来自一家医院的1649例TN患者组成的合格样本,按性别、年龄、游离三碘甲状腺原氨酸(FT3)、游离甲状腺素(FT4)和甲状腺过氧化物酶抗体(TPOAB)进行分层。
2018年1月至2022年6月在单中心收集了1649例TN患者的甲状腺功能(TF)数据,其中男性273例,女性1376例。
开发了七种常用的ML模型(随机森林、决策树、逻辑回归(LR)、K近邻、高斯朴素贝叶斯、多层感知器和梯度提升)来预测TN的良恶性,其性能指标包括曲线下面积(AUC)、准确率、召回率、精确率和F1分数。
本研究共纳入1649例患者,中位年龄为45.15±13.41岁,男女比例为1:5.055。在多变量LR分析中,TN组和甲状腺癌组在性别、年龄、游离三碘甲状腺原氨酸(FT3)、游离甲状腺素(FT4)和TPOAB方面存在统计学显著差异。在七种测试的ML模型中,梯度提升模型在精确率、AUC、准确率、召回率和F1分数方面表现最佳,经实验验证,AUC为0.82,准确率为79.4%,精确率为0.814。FT4、TPOAB和FT3被验证为梯度提升模型中的前三大特征。
本研究创新性地基于梯度提升决策树算法开发了一种TN良恶性预测模型。首次验证了TF参数(FT4、FT3)和TPOAB作为关键生物标志物的临床预测价值。