Suppr超能文献

基于甲状腺功能数据区分甲状腺恶性结节与良性结节的机器学习模型

Machine learning model for differentiating malignant from benign thyroid nodules based on the thyroid function data.

作者信息

Ma Fuqiang, Yu Fengchang, Lv Shenhui, Zhang Lihua, Lu Zhilin, Zhou Quan, Mao He-Rong, Zhang Lele, Xiang Nan

机构信息

Department of Integrated Traditional and Western Medicine,The First Affiliated Hospital, and College of Clinical Medicine of Henan University of Science and Technology, Luoyang, China.

Hubei University of Chinese Medicine, Wuhan, Hubei, China.

出版信息

BMJ Open. 2025 May 7;15(5):e093466. doi: 10.1136/bmjopen-2024-093466.

Abstract

OBJECTIVES

To develop and validate a machine learning (ML) model to differentiate malignant from benign thyroid nodules (TNs) based on the routine data and provide diagnostic assistance for medical professionals.

SETTING

A qualified panel of 1649 patients with TNs from one hospital were stratified by gender, age, free triiodothyronine (FT3), free thyroxine (FT4) and thyroid peroxidase antibody (TPOAB).

PARTICIPANTS

Thyroid function (TF) data of 1649 patients with TNs were collected in a single centre from January 2018 to June 2022, with a total of 273 males and 1376 females, respectively.

MEASURES

Seven popular ML models (Random Forest, Decision Tree, Logistic Regression (LR), K-Neighbours, Gaussian Naive Bayes, Multilayer Perception and Gradient Boosting) were developed to predict malignant and benign TNs, whose performance indicators included area under the curve (AUC), accuracy, recall, precision and F1 score.

RESULTS

A total of 1649 patients were enrolled in this study, with the median age of 45.15±13.41 years, and the male to female ratio was 1:5.055. In the multivariate LR analysis, statistically significant differences existed between the TNs group and thyroid cancer group in gender, age, free triiodothyronine (FT3), free thyroxine (FT4) and TPOAB. Among the seven tested ML models, the best performance was achieved in the Gradient Boosting model in terms of precision, AUC, accuracy, recall and F1 score, with the AUC of 0.82, accuracy of 79.4% and precision of 0.814 after experimental verification. FT4, TPOAB and FT3 were validated as the top three features in the Gradient Boosting model.

CONCLUSIONS

This study innovatively developed a predictive model for benign and malignant TNs based on the Gradient Boosting Decision Tree algorithm. For the first time, it validated the clinical predictive value of TF parameters (FT4, FT3) and TPOAB as key biomarkers.

摘要

目的

开发并验证一种基于常规数据区分甲状腺结节(TN)良恶性的机器学习(ML)模型,为医学专业人员提供诊断辅助。

设置

来自一家医院的1649例TN患者组成的合格样本,按性别、年龄、游离三碘甲状腺原氨酸(FT3)、游离甲状腺素(FT4)和甲状腺过氧化物酶抗体(TPOAB)进行分层。

参与者

2018年1月至2022年6月在单中心收集了1649例TN患者的甲状腺功能(TF)数据,其中男性273例,女性1376例。

测量方法

开发了七种常用的ML模型(随机森林、决策树、逻辑回归(LR)、K近邻、高斯朴素贝叶斯、多层感知器和梯度提升)来预测TN的良恶性,其性能指标包括曲线下面积(AUC)、准确率、召回率、精确率和F1分数。

结果

本研究共纳入1649例患者,中位年龄为45.15±13.41岁,男女比例为1:5.055。在多变量LR分析中,TN组和甲状腺癌组在性别、年龄、游离三碘甲状腺原氨酸(FT3)、游离甲状腺素(FT4)和TPOAB方面存在统计学显著差异。在七种测试的ML模型中,梯度提升模型在精确率、AUC、准确率、召回率和F1分数方面表现最佳,经实验验证,AUC为0.82,准确率为79.4%,精确率为0.814。FT4、TPOAB和FT3被验证为梯度提升模型中的前三大特征。

结论

本研究创新性地基于梯度提升决策树算法开发了一种TN良恶性预测模型。首次验证了TF参数(FT4、FT3)和TPOAB作为关键生物标志物的临床预测价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a05b/12060893/3df7a88c8801/bmjopen-15-5-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验