Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea.
Dentomaxillofac Radiol. 2022 May 1;51(4):20210383. doi: 10.1259/dmfr.20210383. Epub 2021 Dec 2.
This study aimed to develop a fully automated human identification method based on a convolutional neural network (CNN) with a large-scale dental panoramic radiograph (DPR) data set.
In total, 2760 DPRs from 746 subjects who had 2-17 DPRs with various changes in image characteristics due to various dental treatments (tooth extraction, oral surgery, prosthetics, orthodontics, or tooth development) were collected. The test data set included the latest DPR of each subject (746 images) and the other DPRs (2014 images) were used for model training. A modified VGG16 model with two fully connected layers was applied for human identification. The proposed model was evaluated with rank-1, -3, and -5 accuracies, running time, and gradient-weighted class activation mapping (Grad-CAM)-applied images.
This model had rank-1, -3, and -5 accuracies of 82.84%, 89.14%, and 92.23%, respectively. All rank-1 accuracy values of the proposed model were above 80% regardless of changes in image characteristics. The average running time to train the proposed model was 60.9 s per epoch, and the prediction time for 746 test DPRs was short (3.2 s/image). The Grad-CAM technique verified that the model automatically identified humans by focusing on identifiable dental information.
The proposed model showed good performance in fully automatic human identification despite differing image characteristics of DPRs acquired from the same patients. Our model is expected to assist in the fast and accurate identification by experts by comparing large amounts of images and proposing identification candidates at high speed.
本研究旨在开发一种基于卷积神经网络(CNN)的全自动人脸识别方法,该方法使用大规模的牙科全景放射照片(DPR)数据集。
共收集了 746 名受试者的 2760 张 DPR,这些受试者的 2-17 张 DPR 由于各种牙齿治疗(拔牙、口腔手术、修复体、正畸或牙齿发育)而具有不同的图像特征变化。测试数据集包括每位受试者的最新 DPR(746 张图像),其余 DPR(2014 张图像)用于模型训练。应用了带有两个全连接层的改进的 VGG16 模型进行人脸识别。通过等级 1、3 和 5 的准确率、运行时间和应用梯度加权类激活映射(Grad-CAM)的图像来评估所提出的模型。
该模型的等级 1、3 和 5 的准确率分别为 82.84%、89.14%和 92.23%。无论图像特征如何变化,该模型的所有等级 1 准确率均高于 80%。提出的模型每轮训练的平均运行时间为 60.9 秒,对 746 张测试 DPR 的预测时间很短(每张 3.2 秒)。Grad-CAM 技术验证了该模型通过关注可识别的牙科信息来自动识别人类。
尽管从同一患者获得的 DPR 的图像特征不同,但所提出的模型在全自动人脸识别中表现出良好的性能。我们的模型有望通过比较大量的图像并快速提出识别候选者,来帮助专家快速准确地进行识别。