Upalananda Witsarut, Charuakkra Arnon, Chaichulee Sitthichok
Department of Oral Diagnostic Sciences, Faculty of Dentistry, Prince of Songkla University, Songkhla, Thailand.
Department of Oral Biology and Oral Diagnostic Sciences, Faculty of Dentistry, Chiang Mai University, Chiang Mai, Thailand.
J Forensic Odontostomatol. 2025 Dec 24;43(3):20-30. doi: 10.5281/zenodo.17776415.
Accurate age classification using mandibular third molar radiographs is crucial for legal and forensic applications. This study evaluated different methods for classifying age as under or over 18 years in a Thai population. We compared three approaches: (i) a traditional human-based method using a modified Demirjian classification adapted for mandibular third molars, (ii) an end-to-end deep learning model in which a convolutional neural network (CNN) directly predicts age group, and (iii) a human-defined feature extraction approach, where a CNN estimates tooth developmental stages that are subsequently used for age classification. The dataset included 3,407 images of individuals aged 14-23 years. The results indicated that the traditional human-based method achieved high specificity (0.99) and a strong Bayes' post-test probability (0.99), but it exhibited low sensitivity (0.45). In comparison, the end-to-end deep learning models showed higher sensitivity (0.65 to 0.74) than the traditional method, along with a specificity of 0.91 to 0.95 and Bayes' post-test probability of 0.93 to 0.95. The human-defined feature extraction approach, which used developmental stages for age determination, achieved an accuracy of 0.88 to 0.92 in developmental stage classification. For age classification, the models demonstrated higher specificity (0.95 to 0.97) and Bayes' post-test probability (0.95 to 0.97) than the end-to-end deep learning method, along with sensitivity ranging from 0.51 to 0.56. Our results indicate that although traditional methods excel in specificity, the human-defined feature extraction approach provides a balanced solution with high specificity and interpretability, suggesting its potential value in clinical practice for age estimation.
利用下颌第三磨牙X光片进行准确的年龄分类对于法律和法医应用至关重要。本研究评估了泰国人群中年龄分类为18岁以下或以上的不同方法。我们比较了三种方法:(i)一种基于传统人工的方法,使用适用于下颌第三磨牙的改良Demirjian分类法;(ii)一种端到端深度学习模型,其中卷积神经网络(CNN)直接预测年龄组;(iii)一种人工定义特征提取方法,其中CNN估计牙齿发育阶段,随后用于年龄分类。数据集包括3407张14 - 23岁个体的图像。结果表明,基于传统人工的方法具有高特异性(0.99)和高贝叶斯检验后概率(0.99),但敏感性较低(0.45)。相比之下,端到端深度学习模型的敏感性(0.65至0.74)高于传统方法,特异性为0.91至0.95,贝叶斯检验后概率为0.93至0.95。使用发育阶段进行年龄判定的人工定义特征提取方法在发育阶段分类中的准确率为0.88至0.92。对于年龄分类,这些模型的特异性(0.95至0.97)和贝叶斯检验后概率(0.95至0.97)高于端到端深度学习方法,敏感性范围为0.51至0.56。我们的结果表明,尽管传统方法在特异性方面表现出色,但人工定义特征提取方法提供了一种具有高特异性和可解释性的平衡解决方案,表明其在临床实践中进行年龄估计的潜在价值。