Du Hai, Chen Feng, Li Hao, Wang Kaifeng, Zhang Jian, Meng Jian, Li Huiwen, Xu Xia, Qu Junpu, Wu Rong, Li Jing, Zhang Meilan, Zhang Fengxiang, Zhu Xuelin
Department of Radiology, Ordos Central Hospital, Ordos, China.
Department of Oncology, Ordos Central Hospital, Ordos, China.
Quant Imaging Med Surg. 2024 Aug 1;14(8):5932-5945. doi: 10.21037/qims-23-1597. Epub 2024 Jul 30.
The incidence rate of thyroid nodules has reached 65%, but only 5-15% of these modules are malignant. Therefore, accurately determining the benign and malignant nature of thyroid nodules can prevent unnecessary treatment. We aimed to develop a deep-learning (DL) radiomics model based on ultrasound (US), explore its diagnostic efficacy for benign and malignant thyroid nodules, and verify whether it improved the diagnostic level of physicians.
We retrospectively included 1,076 thyroid nodules from 817 patients at three institutions. The radiomics and DL features of the US images were extracted and used to construct radiomics signature (Rad_sig) and deep-learning signature (DL_sig). A Pearson correlation analysis and least absolute shrinkage and selection operator (LASSO) regression analysis were used for feature selection. Clinical US semantic signature (C_US_sig) was constructed based on clinical information and US semantic features. Next, a combined model was constructed based on the above three signatures in the form of a nomogram. The model was constructed using a development set (institution 1: 719 nodules), and the model was evaluated using two external validation sets (institution 2: 74 nodules, and institution 3: 283 nodules). The performance of the model was assessed using decision curve analysis (DCA) and calibration curves. Furthermore, the C_US_sigs of junior physicians, senior physicians, and expers were constructed. The DL radiomics model was used to assist the physicians with different levels of experience in the interpretation of thyroid nodules.
In the development and validation sets, the combined model showed the highest performance, with areas under the curve (AUCs) of 0.947, 0.917, and 0.929, respectively. The DCA results showed that the comprehensive nomogram had the best clinical utility. The calibration curves indicated good calibration for all models. The AUCs for distinguishing between benign and malignant thyroid nodules by junior physicians, senior physicians, and experts were 0.714-0.752, 0.740-0.824, and 0.891-0.908, respectively; however, with the assistance of DL radiomics, the AUCs reached 0.858-0.923, 0.888-0.944, and 0.912-0.919, respectively.
The nomogram based on DL radiomics had high diagnostic efficacy for thyroid nodules, and DL radiomics could assist physicians with different levels of experience to improve their diagnostic level.
甲状腺结节的发病率已达65%,但这些结节中只有5%-15%是恶性的。因此,准确判断甲状腺结节的良恶性可避免不必要的治疗。我们旨在开发一种基于超声(US)的深度学习(DL)放射组学模型,探讨其对甲状腺良恶性结节的诊断效能,并验证其是否提高了医生的诊断水平。
我们回顾性纳入了来自三个机构的817例患者的1076个甲状腺结节。提取超声图像的放射组学和深度学习特征,用于构建放射组学特征(Rad_sig)和深度学习特征(DL_sig)。采用Pearson相关分析和最小绝对收缩和选择算子(LASSO)回归分析进行特征选择。基于临床信息和超声语义特征构建临床超声语义特征(C_US_sig)。接下来,以列线图的形式基于上述三种特征构建联合模型。使用开发集(机构1:719个结节)构建模型,并使用两个外部验证集(机构2:74个结节,机构3:283个结节)对模型进行评估。使用决策曲线分析(DCA)和校准曲线评估模型的性能。此外,构建了初级医生、高级医生和专家的C_US_sig。使用DL放射组学模型协助不同经验水平的医生解读甲状腺结节。
在开发集和验证集中,联合模型表现出最高的性能,曲线下面积(AUC)分别为0.947、0.917和0.929。DCA结果表明,综合列线图具有最佳的临床实用性。校准曲线表明所有模型的校准效果良好。初级医生、高级医生和专家区分甲状腺良恶性结节的AUC分别为0.714-0.752、0.740-0.824和0.891-0.908;然而,在DL放射组学的协助下,AUC分别达到0.858-0.923、0.888-0.944和0.912-0.919。
基于DL放射组学的列线图对甲状腺结节具有较高的诊断效能,DL放射组学可协助不同经验水平的医生提高其诊断水平。