Liu Rongwei, Yuan Fengqin, Wang Biaoyang, Chen Weihua, Ye Jun, He Yun
Department of Medical Ultrasound, The First Affiliated Hospital of Gannan Medical University, Ganzhou, Jiangxi, China.
Department of Medical Ultrasound, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.
Front Endocrinol (Lausanne). 2025 Jul 24;16:1634875. doi: 10.3389/fendo.2025.1634875. eCollection 2025.
This study aimed to evaluate the value of constructing a multimodal deep-learning video model based on 2D ultrasound and contrast-enhanced ultrasound (CEUS) dynamic video for the preoperative prediction of OLNM in papillary thyroid carcinoma (PTC) patients.
A retrospective analysis was conducted on 396 cases of clinically lymph node-negative PTC cases with ultrasound images collected between January and September 2023. Five representative deep learning architectures were pre-trained to construct deep learning static image models (DL_image), CEUS dynamic video models (DL_CEUSvideo), and combined models (DL_combined). The area under the receiver operating characteristic curve (AUC) was used to evaluate model performance, with comparisons made using the Delong test. A P-value of less than 0.05 was considered statistically significant.
The DL_CEUSvideo, DL_image, and DL_combined models were successfully developed and demonstrated. The AUC values were 0.826 (95% CI: 0.771-0.881), 0.759 (95% CI: 0.690-0.828), and 0.926 (95% CI: 0.891-0.962) in the training set, and 0.701 (95% CI: 0.589-0.813), 0.624 (95% CI: 0.502-0.745), and 0.734 (95% CI: 0.627-0.842) in the test set. Finally, sensitivity, specificity, and accuracy for the DL_CEUSvideo, DL_image, and DL_combined models were 0.836, 0.671, 0.704; 0.673, 0.716, 0.707; and 0.818, 0.902, 0.886 in the training set, and 0.556, 0.775, 0.724; 0.556, 0.674, 0.647; and 0.704, 0.663, 0.672 in the test set, respectively.
These results demonstrated that the multimodal deep learning dynamic video model could preoperatively predict OLNM in PTC patients. The DL_CEUSvideo model outperformed the DL_image model, while the DL_combined model significantly enhanced sensitivity without compromising specificity.
本研究旨在评估基于二维超声和超声造影(CEUS)动态视频构建多模态深度学习视频模型对甲状腺乳头状癌(PTC)患者术前预测颈部淋巴结转移(OLNM)的价值。
对2023年1月至9月收集的396例临床淋巴结阴性的PTC病例的超声图像进行回顾性分析。预训练五种代表性的深度学习架构,以构建深度学习静态图像模型(DL_image)、CEUS动态视频模型(DL_CEUSvideo)和联合模型(DL_combined)。采用受试者操作特征曲线(AUC)下面积评估模型性能,使用德龙检验进行比较。P值小于0.05被认为具有统计学意义。
成功开发并展示了DL_CEUSvideo、DL_image和DL_combined模型。训练集中的AUC值分别为0.826(95%CI:0.7