基于腰椎形态测量学的性别预测机器学习模型
Machine Learning Models for Prediction of Sex Based on Lumbar Vertebral Morphometry.
作者信息
Diac Madalina Maria, Toma Gina Madalina, Damian Simona Irina, Fotache Marin, Romanov Nicolae, Tabian Daniel, Sechel Gabriela, Scripcaru Andrei, Hancianu Monica, Iliescu Diana Bulgaru
机构信息
Forensic Medicine Sciences Department, Institute of Legal Medicine, University of Medicine and Pharmacy "Grigore T. Popa", 700115 Iasi, Romania.
Forensic Medicine Department, "Sf. Ioan" Hospital Suceava, University of Medicine and Pharmacy "Grigore T. Popa", 700115 Iasi, Romania.
出版信息
Diagnostics (Basel). 2023 Dec 8;13(24):3630. doi: 10.3390/diagnostics13243630.
BACKGROUND
Identifying skeletal remains has been and will remain a challenge for forensic experts and forensic anthropologists, especially in disasters with multiple victims or skeletal remains in an advanced stage of decomposition. This study examined the performance of two machine learning (ML) algorithms in predicting the person's sex based only on the morphometry of L1-L5 lumbar vertebrae collected recently from Romanian individuals. The purpose of the present study was to assess whether by using the machine learning (ML) techniques one can obtain a reliable prediction of sex in forensic identification based only on the parameters obtained from the metric analysis of the lumbar spine.
METHOD
This paper built and tuned predictive models with two of the most popular techniques for classification, RF (random forest) and XGB (xgboost). Both series of models used cross-validation and a grid search to find the best combination of hyper-parameters. The best models were selected based on the ROC_AUC (area under curve) metric.
RESULTS
The L1-L5 lumbar vertebrae exhibit sexual dimorphism and can be used as predictors in sex prediction. Out of the eight significant predictors for sex, six were found to be particularly important for the RF model, while only three were determined to be important by the XGB model.
CONCLUSIONS
Even if the data set was small (149 observations), both RF and XGB techniques reliably predicted a person's sex based only on the L1-L5 measurements. This can prove valuable, especially when only skeletal remains are available. With minor adjustments, the presented ML setup can be transformed into an interactive web service, freely accessible to forensic anthropologists, in which, after entering the L1-L5 measurements of a body/cadaver, they can predict the person's sex.
背景
识别骨骼遗骸一直是法医专家和法医人类学家面临的挑战,在多受害者灾难或处于高度分解阶段的骨骼遗骸案件中尤其如此。本研究考察了两种机器学习(ML)算法仅基于最近从罗马尼亚个体收集的L1-L5腰椎形态测量数据预测性别的性能。本研究的目的是评估通过使用机器学习(ML)技术,能否仅基于腰椎测量分析获得的参数,在法医鉴定中可靠地预测性别。
方法
本文使用两种最流行的分类技术——随机森林(RF)和极端梯度提升(XGB)构建并调整预测模型。两个系列的模型均使用交叉验证和网格搜索来找到超参数的最佳组合。基于ROC_AUC(曲线下面积)指标选择最佳模型。
结果
L1-L5腰椎呈现出性别二态性,可作为性别预测的指标。在八个重要的性别预测指标中,六个被发现对RF模型尤为重要,而XGB模型仅确定其中三个指标重要。
结论
即使数据集较小(149个观测值),RF和XGB技术都能仅基于L1-L5测量可靠地预测个体性别。这可能具有重要价值,特别是在仅有骨骼遗骸的情况下。只需进行微小调整,所展示的ML设置即可转化为一个法医人类学家可免费访问的交互式网络服务,在其中输入尸体的L1-L5测量数据后,即可预测个体性别。