Cai Xiaojuan, Zhou Ya, Ren Jie, Wei Jinrong, Lu Shiyu, Gu Hanbing, Xu Weizhe, Zhu Xun
Department of Breast and Thyroid Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China.
Front Endocrinol (Lausanne). 2025 May 21;16:1546983. doi: 10.3389/fendo.2025.1546983. eCollection 2025.
Accurate evaluation of thyroid nodules is crucial for effective management; however, methods such as ultrasonography and Fine Needle Aspiration Cytology (FNAC) can be subjective and operator-dependent. Indeterminate thyroid nodules (ITNs) complicate diagnosis, coming at the expense of time, money, and potentially additional FNA samplings, causing more discomfort for the patients. Recent advancements in artificial intelligence (AI) assisted ultrasound diagnosis system have demonstrated excellent diagnostic performance and the potential to aid in the differentiation of ITNs. This study aims to develop an AI classifier that integrates the AI-assisted ultrasound diagnosis system, FNAC, and demographic data to enhance the differentiation of benign and malignant thyroid nodules, and to compare the diagnostic performance of the models, with a focus on diagnosing ITNs.
In the present research, 620 thyroid nodules were collected from a single medical center and divided into training and testing cohorts (Testing1). We developed five AI models using distinct classification algorithms (Logistic Regression, Support Vector Machine, K-Nearest Neighbor, Random Forest, and Gradient Boosting Machine) that integrate demographic data, cytological findings, and an AI-assisted ultrasound diagnostic system for thyroid nodule assessment. These models underwent prospective validation (Testing2, n = 243) to identify the optimal model. A subsequent prospective study (Testing3) involving 70 thyroid nodules further evaluated the model's performance, where the selected optimal model was compared against FNAC combined with BRAF V600E mutation analysis.
After validation with the Testing1 and Testing2 cohorts, the Random Forest (RF) model demonstrated the best overall performance among the five classifiers. The area under the curve (AUC) for the RF model to diagnose thyroid nodules was 0.994 in the training cohort, 0.993 in the testing cohort, and 0.977 in the prospective data. In addition, for 42 included ITNs in the prospective data, the accuracy, sensitivity, and specificity of the RF model were 90.48%, 89.47%, and 91.30%, respectively. In the Testing 3 cohort, the RF model demonstrated superior diagnostic performance compared to both the standalone AI ultrasound auxiliary diagnostic system and FNAC alone. Its accuracy was comparable to FNAC combined with BRAF V600E mutation analysis. Conclusion: Our developed thyroid nodule AI diagnostic model shows favorable predictive value. It can serve as a decision support tool for non-thyroid specialists and assist thyroid surgeons in the management of ITN.
Our developed thyroid nodule AI diagnostic model shows favorable predictive value. It can serve as a decision support tool for non-thyroid specialists and assist thyroid surgeons in the management of ITN.
准确评估甲状腺结节对于有效管理至关重要;然而,超声检查和细针穿刺细胞学检查(FNAC)等方法可能具有主观性且依赖操作人员。不确定甲状腺结节(ITN)使诊断变得复杂,耗费时间、金钱,还可能需要额外的FNA采样,给患者带来更多不适。人工智能(AI)辅助超声诊断系统的最新进展已显示出卓越的诊断性能以及辅助鉴别ITN的潜力。本研究旨在开发一种整合AI辅助超声诊断系统、FNAC和人口统计学数据的AI分类器,以增强良性和恶性甲状腺结节的鉴别能力,并比较各模型的诊断性能,重点是诊断ITN。
在本研究中,从单个医疗中心收集了620个甲状腺结节,并分为训练组和测试组(测试1)。我们使用不同的分类算法(逻辑回归、支持向量机、K近邻、随机森林和梯度提升机)开发了五个AI模型,这些模型整合了人口统计学数据、细胞学检查结果和用于甲状腺结节评估的AI辅助超声诊断系统。这些模型进行了前瞻性验证(测试2,n = 243)以确定最佳模型。随后一项涉及70个甲状腺结节的前瞻性研究(测试3)进一步评估了该模型的性能,将所选的最佳模型与FNAC联合BRAF V600E突变分析进行比较。
经测试1和测试2队列验证后,随机森林(RF)模型在五个分类器中表现出最佳的总体性能。RF模型诊断甲状腺结节的曲线下面积(AUC)在训练组中为0.994,在测试组中为0.993,在前瞻性数据中为0.977。此外,在前瞻性数据中的42个纳入的ITN中,RF模型的准确性、敏感性和特异性分别为90.48%、89.47%和91.30%。在测试3队列中,RF模型显示出比单独的AI超声辅助诊断系统和单独的FNAC都更优的诊断性能。其准确性与FNAC联合BRAF V600E突变分析相当。结论:我们开发的甲状腺结节AI诊断模型显示出良好的预测价值。它可以作为非甲状腺专科医生的决策支持工具,并协助甲状腺外科医生管理ITN。
我们开发的甲状腺结节AI诊断模型显示出良好的预测价值。它可以作为非甲状腺专科医生的决策支持工具,并协助甲状腺外科医生管理ITN。