Kim Young Hyun, Lee Chena, Ha Eun-Gyu, Choi Yoon Jeong, Han Sang-Sun
Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, Korea.
Department of Orthodontics, Institute of Craniofacial Deformity, Yonsei University College of Dentistry, Seoul, Korea.
Imaging Sci Dent. 2021 Sep;51(3):299-306. doi: 10.5624/isd.20210077. Epub 2021 Jul 13.
This study aimed to propose a fully automatic landmark identification model based on a deep learning algorithm using real clinical data and to verify its accuracy considering inter-examiner variability.
In total, 950 lateral cephalometric images from Yonsei Dental Hospital were used. Two calibrated examiners manually identified the 13 most important landmarks to set as references. The proposed deep learning model has a 2-step structure-a region of interest machine and a detection machine-each consisting of 8 convolution layers, 5 pooling layers, and 2 fully connected layers. The distance errors of detection between 2 examiners were used as a clinically acceptable range for performance evaluation.
The 13 landmarks were automatically detected using the proposed model. Inter-examiner agreement for all landmarks indicated excellent reliability based on the 95% confidence interval. The average clinically acceptable range for all 13 landmarks was 1.24 mm. The mean radial error between the reference values assigned by 1 expert and the proposed model was 1.84 mm, exhibiting a successful detection rate of 36.1%. The A-point, the incisal tip of the maxillary and mandibular incisors, and ANS showed lower mean radial error than the calibrated expert variability.
This experiment demonstrated that the proposed deep learning model can perform fully automatic identification of cephalometric landmarks and achieve better results than examiners for some landmarks. It is meaningful to consider between-examiner variability for clinical applicability when evaluating the performance of deep learning methods in cephalometric landmark identification.
本研究旨在基于深度学习算法,利用真实临床数据提出一种全自动地标识别模型,并考虑检查者间的变异性来验证其准确性。
总共使用了来自延世牙科医院的950张头颅侧位X线片。两名经过校准的检查者手动识别出13个最重要的地标作为参考。所提出的深度学习模型具有两步结构——感兴趣区域机器和检测机器,每个结构都由8个卷积层、5个池化层和2个全连接层组成。将两名检查者之间的检测距离误差用作性能评估的临床可接受范围。
使用所提出的模型自动检测出了13个地标。基于95%置信区间,所有地标在检查者间的一致性显示出极好的可靠性。所有13个地标平均临床可接受范围为1.24毫米。一名专家指定的参考值与所提出模型之间的平均径向误差为1.84毫米,成功检测率为36.1%。A点、上颌和下颌切牙的切端以及前鼻棘的平均径向误差低于经过校准的专家变异性。
本实验表明,所提出的深度学习模型能够对头影测量地标进行全自动识别,并且在某些地标上比检查者取得了更好的结果。在评估深度学习方法在头影测量地标识别中的性能时,考虑检查者间的变异性对于临床适用性具有重要意义。