Faragalli Andrea, Ferrante Luigi, Angelakopoulos Nikolaos, Cameriere Roberto, Skrami Edlira
Center of Epidemiology, Biostatistics and Medical Information Technology, Department of Biomedical Sciences and Public Health, Università Politecnica delle Marche, Ancona 60126, Italy.
Center of Epidemiology, Biostatistics and Medical Information Technology, Department of Biomedical Sciences and Public Health, Università Politecnica delle Marche, Ancona 60126, Italy.
Forensic Sci Int. 2025 Feb;367:112353. doi: 10.1016/j.forsciint.2024.112353. Epub 2024 Dec 20.
Age estimation is crucial in forensic and anthropological fields. Teeth, are valued for their resilience to environmental factors and their preservation over time, making them essential for age estimation when other skeletal remains deteriorate. Recently, Machine Learning algorithms have been used in age estimation, demonstrating high levels of accuracy. However, their precision with respect to the trend of age estimation error, typical in some traditional methods like linear regression, has not been thoroughly investigated.
To evaluate and compare the performance of frequently used Machine Learning-assisted methods against two traditional age estimation methods, linear regression and the Segmented Normal Bayesian Calibration model.
Overall, 1.949 orthopantomographs from black and white South African children aged 5-14 years, with 49 % males, were evaluated. The performance of Random Forest, Support Vector Regression, K-Nearest Neighbors and the Gradient Boosting Method were compared against traditional linear regression and the Segmented Normal Bayesian Calibration model. The comparison was based on accuracy measures, including Mean Absolute Error and Root Mean Squared Error, and precision measures, including the Inter-Quartile Range of the error distribution and the slope of the estimated age error relative to chronological age.
The Machine Learning methods outperformed linear regression and the Segmented Normal Bayesian Calibration models in terms of accuracy, although the differences were small. Gradient Boosting Method and Support Vector Regression achieved the highest levels of accuracy (Mean Absolute Error: 0.69 years, Root Mean Squared Error: 0.85 years). All Machine Learning methods and linear regression exhibited significant bias in residuals, whereas the Segmented Normal Bayesian Calibration model showed no significant bias. Gender-stratified analyses revealed similar results in terms of the accuracy and precision of all considered models.
Although Machine Learning methods demonstrate high levels of accuracy, they may be prone to trends in error distribution when estimating dental age. Evaluating this error is crucial and should be an integral part of model performance evaluation. Future research should aim to improve accuracy while rigorously addressing systematic biases.
年龄估计在法医学和人类学领域至关重要。牙齿因其对环境因素的耐受性及其长期保存性而备受重视,当其他骨骼遗骸 deteriorate 时,牙齿对于年龄估计至关重要。近年来,机器学习算法已被用于年龄估计,显示出较高的准确性。然而,它们在年龄估计误差趋势方面的精度,在一些传统方法(如线性回归)中很典型,尚未得到充分研究。
评估并比较常用的机器学习辅助方法与两种传统年龄估计方法(线性回归和分段正态贝叶斯校准模型)的性能。
总共评估了1949份来自南非5至14岁黑白儿童的曲面断层片,其中49%为男性。将随机森林、支持向量回归、K近邻和梯度提升方法的性能与传统线性回归和分段正态贝叶斯校准模型进行比较。比较基于准确性指标(包括平均绝对误差和均方根误差)和精度指标(包括误差分布的四分位间距和估计年龄误差相对于实际年龄的斜率)。
机器学习方法在准确性方面优于线性回归和分段正态贝叶斯校准模型,尽管差异较小。梯度提升方法和支持向量回归达到了最高的准确性水平(平均绝对误差:0.69岁,均方根误差:0.85岁)。所有机器学习方法和线性回归在残差中均表现出显著偏差,而分段正态贝叶斯校准模型未显示出显著偏差。按性别分层分析显示,所有考虑的模型在准确性和精度方面都有类似结果。
尽管机器学习方法显示出较高的准确性,但在估计牙齿年龄时可能容易出现误差分布趋势。评估这种误差至关重要,应成为模型性能评估的一个组成部分。未来的研究应旨在提高准确性,同时严格解决系统偏差问题。