Guan Qing, Wang Yunjun, Du Jiajun, Qin Yu, Lu Hongtao, Xiang Jun, Wang Fen
Department of Head and Neck Surgery, Fudan University Shanghai Cancer Center, Shanghai 200032, China.
Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China.
Ann Transl Med. 2019 Apr;7(7):137. doi: 10.21037/atm.2019.04.34.
To explore the ability of the deep learning network Inception-v3 to differentiate between papillary thyroid carcinomas (PTCs) and benign nodules in ultrasound images.
A total of 2,836 thyroid ultrasound images from 2,235 patients were divided into a training dataset and a test dataset. Inception-v3 was trained and tested to crop the margin of the images of nodules and provide a differential diagnosis. The sizes and sonographic features of nodules were further analysed to identify the factors that may influence diagnostic efficiency. Statistical analyses included χ and Fisher's exact tests and univariate and multivariate analyses.
There were 1,275 PTCs and 1,162 benign nodules in the training group and 209 PTCs and 190 benign nodules in the test group. A margin size of 50 pixels and an input size of 384×384 showed the best outcome after training, and these parameters were selected for the test group. In the test group, the sensitivity and specificity for Inception-v3 were 93.3% (195/209) and 87.4% (166/190), respectively. Inception-v3 displayed the highest accuracy for 0.5-1.0 cm nodules. The accuracy differed according to the margin description (P=0.024). Taller nodules were more accurately diagnosed than were wider nodules (P=0.015). Microcalcification [odds ratio (OR) =0.254, 95% confidence interval (CI): 0.076-0.847, P=0.026] and taller shape (OR =0.243, 95% CI: 0.073-0.810, P=0.021) were negatively associated with misdiagnosis rate.
Inception-v3 can achieve an excellent diagnostic efficiency. Nodules that are 0.5-1.0 cm in size and have microcalcification and a taller shape can be more accurately diagnosed by Inception-v3.
探讨深度学习网络Inception-v3在超声图像中鉴别甲状腺乳头状癌(PTC)与良性结节的能力。
将来自2235例患者的2836幅甲状腺超声图像分为训练数据集和测试数据集。对Inception-v3进行训练和测试,以裁剪结节图像的边缘并提供鉴别诊断。进一步分析结节的大小和超声特征,以确定可能影响诊断效率的因素。统计分析包括χ检验和Fisher精确检验以及单因素和多因素分析。
训练组有1275例PTC和1162个良性结节,测试组有209例PTC和190个良性结节。训练后,边缘大小为50像素、输入大小为384×384时效果最佳,这些参数被用于测试组。在测试组中,Inception-v3的灵敏度和特异度分别为93.3%(195/209)和87.4%(166/190)。Inception-v3对0.5 - 1.0 cm的结节诊断准确率最高。准确率根据边缘描述不同而有所差异(P = 0.024)。高结节比宽结节诊断更准确(P = 0.015)。微钙化[比值比(OR)= 0.254,95%置信区间(CI):0.076 - 0.847,P = 0.026]和高形态(OR = 0.243,95% CI:0.073 - 0.810,P = 0.021)与误诊率呈负相关。
Inception-v3可实现优异的诊断效率。Inception-v3对大小为0.5 - 1.0 cm、有微钙化且形态较高的结节能更准确地诊断。