Department of Radiation Oncology and Winship Cancer Institute, Emory University, Atlanta, GA 30322, United States of America.
Phys Med Biol. 2021 Feb 16;66(5):055007. doi: 10.1088/1361-6560/abc5a6.
The purpose of this study is to develop a deep learning method for thyroid delineation with high accuracy, efficiency, and robustness in noncontrast-enhanced head and neck CTs. The cross-sectional analysis consisted of six tests, including randomized cross-validation and hold-out experiments, tests of prediction accuracy between cancer and benign and cross-gender analysis were performed to evaluate the proposed deep-learning-based performance method. CT images of 1977 patients with suspected thyroid carcinoma were retrospectively investigated. The automatically segmented thyroid gland volume was compared against physician-approved clinical contours using metrics, the Pearson correlation and Bland-Altman analysis. Quantitative metrics included: the Dice similarity coefficient (DSC), sensitivity, specificity, Jaccard index (JAC), Hausdorff distance (HD), mean surface distance (MSD), residual mean square distance (RMSD) and the center of mass distance (CMD). The robustness of the proposed method was further tested using the nonparametric Kruskal-Wallis test to assess the equality of distribution of DSC values. The proposed method's accuracy remained high through all the tests, with the median DSC, JAC, sensitivity and specificity higher than 0.913, 0.839, 0.856 and 0.979, respectively. The proposed method also resulted in median MSD, RMSD, HD and CMD, of less than 0.31 mm, 0.48 mm, 2.06 mm and 0.50 mm, respectively. The MSD and RMSD were 0.40 ± 0.29 mm and 0.70 ± 0.46 mm, respectively. Concurrent testing of the proposed method with 3D U-Net and V-Net showed that the proposed method had significantly improved performance. The proposed deep-learning method achieved accurate and robust performance through six cross-sectional analysis tests.
本研究旨在开发一种高精度、高效率、高鲁棒性的非对比增强头颈部 CT 甲状腺勾画深度学习方法。该横断面分析包括六项测试,包括随机交叉验证和留一实验、癌症与良性之间的预测准确性测试以及跨性别分析,以评估所提出的基于深度学习的性能方法。回顾性研究了 1977 例疑似甲状腺癌患者的 CT 图像。使用度量标准、Pearson 相关和 Bland-Altman 分析将自动分割的甲状腺体积与医师批准的临床轮廓进行比较。定量指标包括:Dice 相似系数(DSC)、灵敏度、特异性、Jaccard 指数(JAC)、Hausdorff 距离(HD)、平均表面距离(MSD)、残余均方距离(RMSD)和质心距离(CMD)。通过非参数 Kruskal-Wallis 检验进一步测试了所提出方法的稳健性,以评估 DSC 值分布的均等性。所提出的方法在所有测试中的准确性仍然很高,中位数 DSC、JAC、灵敏度和特异性均高于 0.913、0.839、0.856 和 0.979。所提出的方法还导致中位数 MSD、RMSD、HD 和 CMD 分别小于 0.31mm、0.48mm、2.06mm 和 0.50mm。MSD 和 RMSD 分别为 0.40±0.29mm 和 0.70±0.46mm。同时对所提出的方法与 3D U-Net 和 V-Net 进行测试,结果表明所提出的方法具有显著提高的性能。所提出的深度学习方法通过六项横断面分析测试实现了准确和稳健的性能。