Division of Medical Mathematics, National Institute for Mathematical Sciences, Daejeon, Korea.
Department of Radiology, Seoul National University Hospital, Seoul, Korea.
Korean J Radiol. 2021 Apr;22(4):612-623. doi: 10.3348/kjr.2020.0051. Epub 2020 Nov 26.
To evaluate the diagnostic performance of a deep learning algorithm for the automated detection of developmental dysplasia of the hip (DDH) on anteroposterior (AP) radiographs.
Of 2601 hip AP radiographs, 5076 cropped unilateral hip joint images were used to construct a dataset that was further divided into training (80%), validation (10%), or test sets (10%). Three radiologists were asked to label the hip images as normal or DDH. To investigate the diagnostic performance of the deep learning algorithm, we calculated the receiver operating characteristics (ROC), precision-recall curve (PRC) plots, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) and compared them with the performance of radiologists with different levels of experience.
The area under the ROC plot generated by the deep learning algorithm and radiologists was 0.988 and 0.988-0.919, respectively. The area under the PRC plot generated by the deep learning algorithm and radiologists was 0.973 and 0.618-0.958, respectively. The sensitivity, specificity, PPV, and NPV of the proposed deep learning algorithm were 98.0, 98.1, 84.5, and 99.8%, respectively. There was no significant difference in the diagnosis of DDH by the algorithm and the radiologist with experience in pediatric radiology ( = 0.180). However, the proposed model showed higher sensitivity, specificity, and PPV, compared to the radiologist without experience in pediatric radiology ( < 0.001).
The proposed deep learning algorithm provided an accurate diagnosis of DDH on hip radiographs, which was comparable to the diagnosis by an experienced radiologist.
评估深度学习算法在评估发育性髋关节发育不良(DDH)的前后位(AP)X 光片自动检测中的诊断性能。
在 2601 张髋关节 AP 射线片中,使用 5076 张裁剪后的单侧髋关节图像构建了一个数据集,该数据集进一步分为训练集(80%)、验证集(10%)和测试集(10%)。三名放射科医生被要求对髋关节图像进行正常或 DDH 分类。为了研究深度学习算法的诊断性能,我们计算了受试者工作特征(ROC)曲线、精确召回曲线(PRC)图、敏感性、特异性、阳性预测值(PPV)和阴性预测值(NPV),并与不同经验水平的放射科医生的表现进行了比较。
深度学习算法和放射科医生生成的 ROC 曲线下面积分别为 0.988 和 0.988-0.919。深度学习算法和放射科医生生成的 PRC 曲线下面积分别为 0.973 和 0.618-0.958。所提出的深度学习算法的敏感性、特异性、PPV 和 NPV 分别为 98.0%、98.1%、84.5%和 99.8%。该算法与具有小儿放射学经验的放射科医生在 DDH 诊断方面无显著差异(=0.180)。然而,与没有小儿放射学经验的放射科医生相比,该模型表现出更高的敏感性、特异性和 PPV(<0.001)。
所提出的深度学习算法在髋关节 X 光片中对 DDH 的诊断准确率可与有经验的放射科医生相媲美。