Štepanovský Michal, Ibrová Alexandra, Buk Zdeněk, Velemínská Jana
Department of Computer Systems, Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague, Czech Republic.
Department of Anthropology and Human Genetics, Faculty of Science, Charles University, Viničná 7, 128 43 Prague, Czech Republic.
Forensic Sci Int. 2017 Oct;279:72-82. doi: 10.1016/j.forsciint.2017.08.005. Epub 2017 Aug 14.
In order to analyze and improve the dental age estimation in children and adolescents for forensic purposes, 22 age estimation methods were compared to a sample of 976 orthopantomographs (662 males, 314 females) of healthy Czech children and adolescents aged between 2.7 and 20.5 years. All methods are compared in terms of the accuracy and complexity and are based on various data mining methods or on simple mathematical operations. The winning method is presented in detail. The comparison showed that only three methods provide the best accuracy while remaining user-friendly. These methods were used to build a tabular multiple linear regression model, an M5P tree model and support vector machine model with first-order polynomial kernel. All of them have mean absolute error (MAE) under 0.7 years for both males and females. The other well-performing data mining methods (RBF neural network, K-nearest neighbors, Kstar, etc.) have similar or slightly better accuracy, but they are not user-friendly as they require computing equipment and the implementation as computer program. The lowest estimation accuracy provides the traditional model based on age averages (MAE under 0.96 years). Different relevancy of various teeth for the age estimation was found. This finding also explains the lowest accuracy of the traditional averages-based model. In this paper, a technique for missing data replacement for the cases with missing teeth is presented in detail as well as the constrained tabular multiple regression model. Also, we provide free age prediction software based on this wining model.
为了分析和改进用于法医目的的儿童和青少年牙齿年龄估计方法,将22种年龄估计方法与976张健康捷克儿童和青少年(年龄在2.7至20.5岁之间)的口腔全景片样本(662名男性,314名女性)进行了比较。所有方法都在准确性和复杂性方面进行了比较,并且基于各种数据挖掘方法或简单的数学运算。详细介绍了获胜方法。比较结果表明,只有三种方法在保持用户友好的同时提供了最佳准确性。这些方法被用于构建表格多元线性回归模型、M5P树模型和具有一阶多项式核的支持向量机模型。对于男性和女性,所有这些模型的平均绝对误差(MAE)均在0.7岁以下。其他表现良好的数据挖掘方法(径向基函数神经网络、K近邻、K星等)具有相似或略高的准确性,但它们不便于用户使用,因为它们需要计算设备并作为计算机程序来实现。基于年龄平均值的传统模型的估计准确性最低(MAE在0.96岁以下)。发现了不同牙齿在年龄估计中的不同相关性。这一发现也解释了基于传统平均值的模型准确性最低的原因。本文还详细介绍了一种针对牙齿缺失情况的数据缺失替换技术以及约束表格多元回归模型。此外,我们基于这个获胜模型提供了免费的年龄预测软件。