Wen Wen, Zhang Tingrui, Zhao Haina, Liu Jingyan, Jiang Heng, He Yushuang, Jiang Zekun
Department of Ultrasound, West China Hospital, Sichuan University, Chengdu, China.
Department of Electrical and Computer Engineering, University of Michigan, Ann Arbor, MI, USA.
Gland Surg. 2025 Aug 31;14(8):1558-1571. doi: 10.21037/gs-2025-183. Epub 2025 Aug 26.
Facing challenges in differentiating benign/malignant hypervascular thyroid nodules due to overlapping ultrasound features and limited vascular characterization, this study developed multimodal machine learning models integrating B-mode and power Doppler imaging (PDI) features.
A retrospective cohort of 315 patients with pathologically confirmed hypervascular thyroid nodules (Adler grade 2/3) was divided into training (n=220) and test (n=95) sets. Multimodal ultrasound images were processed using a deep learning-based segmentation model and red-channel thresholding method, followed by radiomics feature extraction (1,910 features via PyRadiomics) and deep learning feature derivation (1,000 ResNet-derived features). Feature selection employed analysis of variance (ANOVA) F-tests, yielding hybrid feature sets. Five machine learning algorithms, including random forest, logistic regression, support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), and Tabular Prior-data Fitted Network (TABPFN), were trained and validated. A fused model integrated optimal B-mode and PDI SVM models. Performance was assessed via area under the curve (AUC), accuracy, precision, recall, and SHapley Additive exPlanations (SHAP) analysis. Clinical trial registration number: ChinCTR2100049742.
SVM outperformed other models in single-modality analyses: B-mode SVM achieved an AUC of 0.89 (accuracy: 0.84; recall: 0.94), while PDI SVM attained an AUC of 0.86 (accuracy: 0.82; recall: 0.97). The combined model demonstrated near-perfect training performance (AUC: 1.00; accuracy: 0.96) but moderated in testing (AUC: 0.89; accuracy: 0.78), indicating potential overfitting. Radiomics features dominated feature importance, including Logarithm_firstorder_Energy (B-mode) and squareroot_firstorder_Minimum (PDI). The fused model showed superior recall (0.95) and F1-score (0.86) compared to single modalities, highlighting complementary diagnostic value.
Multimodal ultrasound fusion models, particularly SVM-based frameworks, enhance diagnostic accuracy for hypervascular thyroid nodules by synergizing morphological and vascular features. Despite challenges in generalizability, the integration of radiomics and deep learning features offers clinically reliable tools to reduce invasive biopsies.
由于超声特征重叠以及血管特征描述有限,在鉴别甲状腺高血管性结节的良恶性时面临挑战,本研究开发了整合B超和能量多普勒成像(PDI)特征的多模态机器学习模型。
对315例经病理证实的甲状腺高血管性结节(Adler分级2/3级)患者的回顾性队列进行研究,分为训练组(n = 220)和测试组(n = 95)。使用基于深度学习的分割模型和红色通道阈值法处理多模态超声图像,随后进行影像组学特征提取(通过PyRadiomics提取1910个特征)和深度学习特征推导(1000个ResNet衍生特征)。采用方差分析(ANOVA)F检验进行特征选择,得到混合特征集。对包括随机森林、逻辑回归、支持向量机(SVM)、极端梯度提升(XGBoost)和表格先验数据拟合网络(TABPFN)在内的五种机器学习算法进行训练和验证。构建一个融合模型,整合最佳的B超和PDI SVM模型。通过曲线下面积(AUC)、准确率、精确率、召回率和SHapley加性解释(SHAP)分析评估性能。临床试验注册号:ChinCTR2100049742。
在单模态分析中,SVM的表现优于其他模型:B超SVM的AUC为0.89(准确率:0.84;召回率:0.94),而PDI SVM的AUC为0.86(准确率:0.82;召回率:0.97)。联合模型在训练时表现出近乎完美的性能(AUC:1.00;准确率:0.96),但在测试时有所下降(AUC:0.89;准确率:0.78),表明存在潜在的过拟合。影像组学特征在特征重要性方面占主导地位,包括Logarithm_firstorder_Energy(B超)和squareroot_firstorder_Minimum(PDI)。与单模态相比,融合模型显示出更高的召回率(0.95)和F1分数(0.86),突出了其互补的诊断价值。
多模态超声融合模型,特别是基于SVM的框架,通过整合形态学和血管特征提高了甲状腺高血管性结节的诊断准确性。尽管在可推广性方面存在挑战,但影像组学和深度学习特征的整合为减少侵入性活检提供了临床可靠的工具。