Liu Jian, Jin Chaoran, Wang Xiaolan, Pan Kexu, Li Zhuoyang, Yi Xinxuan, Shao Yu, Sun Xiaodong, Yu Xijiao
School of Stomatology, Shandong Second Medical University, Weifang, 261053, Shandong, China.
Department of Endodontics, Central Laboratory, Jinan Stomatological Hospital, Jinan Key Laboratory of oral tissue regeneration, Shandong Provincial Key Medical and Health Laboratory of Oral Diseases and Tissue Regeneration, Jinan, 250001, Shandong Province, China.
BMC Oral Health. 2025 May 26;25(1):801. doi: 10.1186/s12903-025-06104-0.
Numerous studies have investigated the use of convolutional neural network (CNN) models for detecting periapical lesions(PLs). However, limited research has focused on evaluating their potential in assisting clinicians with diagnosis. This study aims to utilize two deep learning(DL) models, ConvNeXt and ResNet34, to aid novice dentists in the detection of PLs on periapical radiographs (PRs). By assessing the diagnostic support provided by these models, this research seeks to promote the clinical application of DL in dentistry.
In this study, 1,305 PRs were gathered and then split into a training set of 1,044 images and a validation set of 261 images, following an 80/20 ratio. The model's effectiveness was assessed using various measures, including precision, sensitivity, F1 score, specificity, accuracy, and the area under the curve (AUC). To evaluate the impact of the model on diagnostic performance by novice dentists, we used an additional set of 800 individual teeth PRs, which were not included in the training or validation sets. The diagnostic performance and time of three novice dentists were measured both with and without model assistance.
The precision of ConvNeXt was 85.93%, with an F1 score of 0.92, accuracy of 91.25%, sensitivity of 98.49%, specificity of 84.11%, and an AUC of 0.9693, outperforming ResNet34 across all metrics. In comparison, ResNet34 achieved a precision of 83.08%, an F1 score of 0.84, accuracy of 81.63%, sensitivity of 84.38%, specificity of 78.13%, and an AUC of 0.8988. In the model-assisted diagnosis phase, both ConvNeXt and ResNet34 improved the diagnostic performance of novice dentists. With the help of ConvNeXt, the average AUC of three dentists increased from 0.88 to 0.94, while with ResNet34, the average AUC of the three dentists improved from 0.88 to 0.91. ConvNeXt performed better than ResNet34 (p < 0.05). Additionally, ConvNeXt reduced the average diagnostic time of the three dentists from 178.8 min to 141.9 min, while ResNet34 reduced the average diagnostic time from 178.8 min to 153.6 min.
ConvNeXt significantly improved the diagnostic performance of novice dentists and reduced the time required for diagnosis, thereby enhancing clinical efficiency in both diagnosis and treatment. This model shows potential for application in dental clinics or educational institutions where experienced specialists are limited, but there is a large presence of novice, less-experienced dentists.
众多研究已对使用卷积神经网络(CNN)模型检测根尖周病变(PLs)进行了调查。然而,针对评估其在协助临床医生诊断方面潜力的研究有限。本研究旨在利用两种深度学习(DL)模型,即ConvNeXt和ResNet34,辅助新手牙医在根尖片(PRs)上检测PLs。通过评估这些模型提供的诊断支持,本研究旨在促进DL在牙科领域的临床应用。
在本研究中,收集了1305张PRs,然后按照80/20的比例分为1044张图像的训练集和261张图像的验证集。使用包括精准度、灵敏度、F1分数、特异性、准确率和曲线下面积(AUC)等多种指标评估模型的有效性。为了评估模型对新手牙医诊断性能的影响,我们使用了另外一组800张单颗牙齿的PRs,这些PRs未包含在训练集或验证集中。测量了三名新手牙医在有和没有模型辅助情况下的诊断性能和时间。
ConvNeXt的精准度为85.93%,F1分数为0.92,准确率为91.25%,灵敏度为98.49%,特异性为84.11%,AUC为0.9693,在所有指标上均优于ResNet34。相比之下,ResNet34的精准度为83.08%,F1分数为0.84,准确率为81.63%,灵敏度为84.38%,特异性为78.13%,AUC为0.8988。在模型辅助诊断阶段,ConvNeXt和ResNet34均提高了新手牙医的诊断性能。在ConvNeXt的帮助下,三名牙医的平均AUC从0.88提高到0.94,而在ResNet34的帮助下,三名牙医的平均AUC从0.88提高到0.91。ConvNeXt的表现优于ResNet34(p < 0.05)。此外,ConvNeXt将三名牙医的平均诊断时间从178.8分钟减少到141.9分钟,而ResNet34将平均诊断时间从178.8分钟减少到153.6分钟。
ConvNeXt显著提高了新手牙医的诊断性能并减少了诊断所需时间,从而提高了诊断和治疗的临床效率。该模型在经验丰富的专家有限但新手、经验不足的牙医众多的牙科诊所或教育机构中显示出应用潜力。