Suppr超能文献

基于临床超声特征的可解释机器学习模型对甲状腺滤泡癌的诊断价值

Diagnostic value of an interpretable machine learning model based on clinical ultrasound features for follicular thyroid carcinoma.

作者信息

Zheng Yuxin, Zhang Yajiao, Lu Kefeng, Wang Jiafeng, Li Linlin, Xu Dong, Liu Junping, Lou Jiangyan

机构信息

Second Clinical College, Zhejiang University of Traditional Chinese Medicine, Hangzhou, China.

Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou, China.

出版信息

Quant Imaging Med Surg. 2024 Sep 1;14(9):6311-6324. doi: 10.21037/qims-24-601. Epub 2024 Aug 20.

Abstract

BACKGROUND

Follicular thyroid carcinoma (FTC) and follicular thyroid adenoma (FTA) present diagnostic challenges due to overlapping clinical and ultrasound features. Improving the diagnosis of FTC can enhance patient prognosis and effectiveness in clinical management. This study seeks to develop a predictive model for FTC based on ultrasound features using machine learning (ML) algorithms and assess its diagnostic effectiveness.

METHODS

Patients diagnosed with FTA or FTC based on surgical pathology between January 2009 and February 2023 at Zhejiang Provincial Cancer Hospital and Zhejiang Provincial People's Hospital were retrospectively included. A total of 562 patients from Zhejiang Provincial Cancer Hospital comprised the training set, and 218 patients from Zhejiang Provincial People's Hospital constituted the validation set. Subsequently, clinical parameters and ultrasound characteristics of the patients were collected. The diagnostic parameters were analyzed using the least absolute shrinkage and selection operator and multivariate logistic regression screening methods. Next, a comparative analysis was performed using seven ML models. The area under the receiver operating characteristic (ROC) curve (AUC), accuracy, sensitivity, specificity, positive predicted value (PPV), negative predicted value (NPV), precision, recall, and comprehensive evaluation index (F-score) were calculated to compare the diagnostic efficacy among the seven models and determine the optimal model. Further, the optimal model was validated, and the SHapley Additive ExPlanations (SHAP) approach was applied to explain the significance of the model variables. Finally, an individualized risk assessment was conducted.

RESULTS

Age, echogenicity, thyroglobulin antibody (TGAb), echotexture, composition, triiodothyronine (T3), thyroglobulin (TG), margin, thyroid-stimulating hormone (TSH), calcification, and halo thickness >2 mm were influential factors for diagnosing FTC. The XGBoost model was identified as the optimal model after a comprehensive evaluation. The AUC of this model in the validation set was 0.969 [95% confidence interval (CI), 0.946-0.992], while its precision sensitivity, specificity, and accuracy were 0.791, 0.930, 0.913 and 0.917, respectively.

CONCLUSIONS

XGBoost model based on ultrasound features was constructed and interpreted using the SHAP method, providing evidence for the diagnosis of FTC and guidance for the personalized treatment of patients.

摘要

背景

滤泡性甲状腺癌(FTC)和滤泡性腺瘤(FTA)因临床及超声特征重叠,给诊断带来挑战。提高FTC的诊断水平可改善患者预后及临床管理效果。本研究旨在利用机器学习(ML)算法,基于超声特征建立FTC预测模型并评估其诊断效能。

方法

回顾性纳入2009年1月至2023年2月期间在浙江省肿瘤医院和浙江省人民医院经手术病理确诊为FTA或FTC的患者。浙江省肿瘤医院的562例患者组成训练集,浙江省人民医院的218例患者构成验证集。随后收集患者的临床参数和超声特征。采用最小绝对收缩和选择算子及多因素逻辑回归筛选方法分析诊断参数。接下来,使用7种ML模型进行对比分析。计算受试者工作特征(ROC)曲线下面积(AUC)、准确率、灵敏度、特异度、阳性预测值(PPV)、阴性预测值(NPV)、精确率、召回率及综合评价指标(F值),以比较7种模型的诊断效能并确定最优模型。进一步对最优模型进行验证,并应用SHapley加性解释(SHAP)方法解释模型变量的重要性。最后进行个体化风险评估。

结果

年龄、回声、甲状腺球蛋白抗体(TGAb)、回声质地、成分、三碘甲状腺原氨酸(T3)、甲状腺球蛋白(TG)、边缘、促甲状腺激素(TSH)、钙化及晕环厚度>2mm是诊断FTC的影响因素。综合评估后,XGBoost模型被确定为最优模型。该模型在验证集中的AUC为0.969[95%置信区间(CI),0.946 - 0.992],其精确率、灵敏度、特异度及准确率分别为0.791、0.930、0.913和0.917。

结论

构建了基于超声特征的XGBoost模型并采用SHAP方法进行解释,为FTC诊断提供依据,为患者个体化治疗提供指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0b5/11400673/7c5850bf528f/qims-14-09-6311-f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验