Zhao Xing, Li Xiaoge, Gao Kun, Shao Hang, Yuan Yongyi, Dai Pu
Senior Department of Otolaryngology Head and Neck Surgery,the 6th Medical Center of Chinese PLA General Hospital,Chinese PLA Medical School,National Clinical Research Center for Otolaryngologic Diseases,Beijing,100853,China.
Yangtze Delta Region Institute of Tsinghua University.
Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2024 Jun;38(6):547-552. doi: 10.13201/j.issn.2096-7993.2024.06.017.
To evaluate the diagnostic efficacy of traditional radiomics, deep learning, and deep learning radiomics in differentiating normal and inner ear malformations on temporal bone computed tomography(CT). A total of 572 temporal bone CT data were retrospectively collected, including 201 cases of inner ear malformation and 371 cases of normal inner ear, and randomly divided into a training cohort(=458) and a test cohort(=114) in a ratio of 4∶1. Deep transfer learning features and radiomics features were extracted from the CT images and feature fusion was performed to establish the least absolute shrinkage and selection operator. The CT results interpretated by two chief otologists from the National Clinical Research Center for Otorhinolaryngological Diseases served as the gold standard for diagnosis. The model performance was evaluated using receiver operating characteristic(ROC), and the accuracy, sensitivity, specificity, and other indicators of the models were calculated. The predictive power of each model was compared using the Delong test. 1 179 radiomics features were obtained from traditional radiomics, 2 048 deep learning features were obtained from deep learning, and 137 features fusion were obtained after feature screening and fusion of the two. The area under the curve(AUC) of the deep learning radiomics model on the test cohort was 0.964 0(95% 0.931 4-0.996 8), with an accuracy of 0.922, sensitivity of 0.881, and specificity of 0.945. The AUC of the radiomics features alone on the test cohort was 0.929 0(95% 0.882 2-0.974 9), with an accuracy of 0.878, sensitivity of 0.881, and specificity of 0.877. The AUC of the deep learning features alone on the test cohort was 0.947 0(95% 0.898 2-0.994 8), with an accuracy of 0.913, sensitivity of 0.810, and specificity of 0.973. The results indicated that the prediction accuracy and AUC of the deep learning radiomics model are the highest. The Delong test showed that the differences between any two models did not reach statistical significance. The feature fusion model can be used for the differential diagnosis of normal and inner ear malformations, and its diagnostic performance is superior to radiomics or deep learning models alone.
评估传统放射组学、深度学习及深度学习放射组学在颞骨计算机断层扫描(CT)上鉴别正常内耳与内耳畸形的诊断效能。回顾性收集572例颞骨CT数据,其中包括201例内耳畸形和371例正常内耳,并按照4∶1的比例随机分为训练队列(=458)和测试队列(=114)。从CT图像中提取深度迁移学习特征和放射组学特征,并进行特征融合以建立最小绝对收缩和选择算子。由国家耳鼻咽喉疾病临床医学研究中心的两名主任医师解读的CT结果作为诊断的金标准。使用受试者工作特征(ROC)曲线评估模型性能,并计算模型的准确性、敏感性、特异性等指标。使用德龙检验比较各模型的预测能力。从传统放射组学中获得1179个放射组学特征,从深度学习中获得2048个深度学习特征,两者经特征筛选和融合后获得137个融合特征。测试队列中深度学习放射组学模型的曲线下面积(AUC)为0.964 0(95%可信区间为0.931 4-0.996 8),准确性为0.922,敏感性为0.881,特异性为0.945。测试队列中单独的放射组学特征的AUC为0.929 0(95%可信区间为0.882 2-0.974 9),准确性为0.878,敏感性为0.881,特异性为0.877。测试队列中单独的深度学习特征的AUC为0.947 0(95%可信区间为0.898 2-0.994 8),准确性为0.913,敏感性为0.810,特异性为0.973。结果表明,深度学习放射组学模型的预测准确性和AUC最高。德龙检验表明,任意两个模型之间的差异均未达到统计学意义。特征融合模型可用于正常内耳与内耳畸形的鉴别诊断,其诊断性能优于单独的放射组学或深度学习模型。