Department of Thyroid Surgery, West China Hospital, Sichuan University, Chengdu, China.
West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.
J Clin Endocrinol Metab. 2022 Mar 24;107(4):953-963. doi: 10.1210/clinem/dgab870.
This study investigates the efficiency of deep learning models in the automated diagnosis of Hashimoto's thyroiditis (HT) using real-world ultrasound data from ultrasound examinations by computer-assisted diagnosis (CAD) with artificial intelligence.
We retrospectively collected ultrasound images from patients with and without HT from 2 hospitals in China between September 2008 and February 2018. Images were divided into a training set (80%) and a validation set (20%). We ensembled 9 convolutional neural networks (CNNs) as the final model (CAD-HT) for HT classification. The model's diagnostic performance was validated and compared to 2 hospital validation sets. We also compared the accuracy of CAD-HT against seniors/junior radiologists. Subgroup analysis of CAD-HT performance for different thyroid hormone levels (hyperthyroidism, hypothyroidism, and euthyroidism) was also evaluated.
39 280 ultrasound images from 21 118 patients were included in this study. The accuracy, sensitivity, and specificity of the HT-CAD model were 0.892, 0.890, and 0.895, respectively. HT-CAD performance between 2 hospitals was not significantly different. The HT-CAD model achieved a higher performance (P < 0.001) when compared to senior radiologists, with a nearly 9% accuracy improvement. HT-CAD had almost similar accuracy (range 0.87-0.894) for the 3 subgroups based on thyroid hormone level.
The HT-CAD strategy based on CNN significantly improved the radiologists' diagnostic accuracy of HT. Our model demonstrates good performance and robustness in different hospitals and for different thyroid hormone levels.
本研究旨在利用人工智能计算机辅助诊断 (CAD) 的真实世界超声数据,调查深度学习模型在桥本甲状腺炎 (HT) 的自动诊断中的效率。
我们回顾性地从 2008 年 9 月至 2018 年 2 月期间中国的 2 家医院收集了 HT 患者和无 HT 患者的超声图像。图像分为训练集 (80%)和验证集 (20%)。我们集成了 9 个卷积神经网络 (CNN) 作为最终模型 (CAD-HT) 进行 HT 分类。验证和比较了模型的诊断性能与 2 家医院的验证集。我们还比较了 CAD-HT 与高级/初级放射科医生的准确性。还评估了 CAD-HT 对不同甲状腺激素水平 (甲亢、甲减和甲状腺功能正常) 的性能的亚组分析。
本研究共纳入了 21118 名患者的 39280 张超声图像。HT-CAD 模型的准确率、敏感度和特异度分别为 0.892、0.890 和 0.895。2 家医院之间的 HT-CAD 性能没有显著差异。与高级放射科医生相比,HT-CAD 模型的性能更高 (P < 0.001),准确率提高了近 9%。与高级放射科医生相比,HT-CAD 模型在不同的甲状腺激素水平亚组中具有相似的准确性 (范围为 0.87-0.894)。
基于 CNN 的 HT-CAD 策略显著提高了放射科医生诊断 HT 的准确性。我们的模型在不同医院和不同甲状腺激素水平下均表现出良好的性能和稳健性。