Postgraduate Program in Human Communication Health, Tuiuti University of Paraná, Curitiba, Paraná, Brazil.
School of Dentistry, Tuiuti University of Paraná, Curitiba, Brazil.
PLoS One. 2024 Nov 19;19(11):e0310811. doi: 10.1371/journal.pone.0310811. eCollection 2024.
The objective of this study was to develop a predictive model using supervised machine learning to determine sex based on the dimensions of the hyoid bone. Lateral cephalometric radiographs of 495 patients were analyzed, collecting the horizontal and vertical dimensions of the hyoid bone, as well as the distance from the hyoid to the mandible. The following algorithms were trained: Logistic Regression, Gradient Boosting Classifier, K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Multilayer Perceptron Classifier (MLP), Decision Tree, AdaBoost Classifier, and Random Forest Classifier. A 5-fold cross-validation approach was used to validate each model. Model evaluation metrics included areas under the curve (AUC), accuracy, recall, precision, F1 score, and ROC curves. The horizontal dimension of the hyoid bone demonstrated the highest predictive power across all evaluated models. The AUC values of the different trained models ranged from 0.81 to 0.86 on test data and from 0.78 to 0.84 in cross-validation, with the random forest classifier achieving the highest accuracy rates. The supervised machine learning model showed good predictive accuracy, indicating the model's potential for sex determination in forensic and anthropological contexts. These findings suggest that the application of artificial intelligence methods can enhance the accuracy of sex estimation, contributing to significant advancements in the field.
本研究旨在开发一个基于舌骨尺寸的监督机器学习预测模型,以确定性别。对 495 名患者的侧位头颅侧位片进行了分析,收集了舌骨的水平和垂直尺寸以及舌骨到下颌的距离。训练了以下算法:逻辑回归、梯度提升分类器、K 最近邻 (KNN)、支持向量机 (SVM)、多层感知器分类器 (MLP)、决策树、AdaBoost 分类器和随机森林分类器。使用 5 折交叉验证方法对每个模型进行验证。模型评估指标包括曲线下面积 (AUC)、准确性、召回率、精度、F1 得分和 ROC 曲线。舌骨的水平尺寸在所有评估模型中表现出最高的预测能力。不同训练模型的 AUC 值在测试数据中的范围为 0.81 到 0.86,在交叉验证中的范围为 0.78 到 0.84,随机森林分类器的准确率最高。监督机器学习模型显示出良好的预测准确性,表明该模型在法医学和人类学背景下确定性别的潜力。这些发现表明,人工智能方法的应用可以提高性别估计的准确性,为该领域的重大进展做出贡献。