Huang Jialiang, Chan Ian-Tong, Wang Zhixian, Ding Xiaoyi, Jin Ying, Yang Congchong, Pan Yichen
Department of Orthodontics, Shanghai Stomatological Hospital and School of Stomatology, Fudan University, Shanghai, China.
Shanghai Key Laboratory of Craniomaxillofacial Development and Diseases, Fudan University, Shanghai, China.
Front Bioeng Biotechnol. 2024 Oct 14;12:1483230. doi: 10.3389/fbioe.2024.1483230. eCollection 2024.
The study aims to predict tooth extraction decision based on four machine learning methods and analyze the feature contribution, so as to shed light on the important basis for experts of tooth extraction planning, providing reference for orthodontic treatment planning.
This study collected clinical information of 192 patients with malocclusion diagnosis and treatment plans. This study used four machine learning strategies, including decision tree, random forest, support vector machine (SVM) and multilayer perceptron (MLP) to predict orthodontic extraction decisions on clinical examination data acquired during initial consultant containing Angle classification, skeletal classification, maxillary and mandibular crowding, overjet, overbite, upper and lower incisor inclination, vertical growth pattern, lateral facial profile. Among them, 30% of the samples were randomly selected as testing sets. We used five-fold cross-validation to evaluate the generalization performance of the model and avoid over-fitting. The accuracy of the four models was calculated for the training set and cross-validation set. The confusion matrix was plotted for the testing set, and 6 indicators were calculated to evaluate the performance of the model. For the decision tree and random forest models, we observed the feature contribution.
The accuracy of the four models in the training set ranges from 82% to 90%, and in the cross-validation set, the decision tree and random forest had higher accuracy. In the confusion matrix analysis, decision tree tops the four models with highest accuracy, specificity, precision and F1-score and the other three models tended to classify too many samples as extraction cases. In the feature contribution analysis, crowding, lateral facial profile, and lower incisor inclination ranked at the top in the decision tree model.
Among the machine learning models that only use clinical data for tooth extraction prediction, decision tree has the best overall performance. For tooth extraction decisions, specifically, crowding, lateral facial profile, and lower incisor inclination have the greatest contribution.
本研究旨在基于四种机器学习方法预测拔牙决策并分析特征贡献,从而为拔牙计划专家提供重要依据,为正畸治疗计划提供参考。
本研究收集了192例错颌畸形诊断及治疗计划患者的临床信息。本研究使用四种机器学习策略,包括决策树、随机森林、支持向量机(SVM)和多层感知器(MLP),对初次会诊时获取的临床检查数据进行正畸拔牙决策预测,这些数据包括安氏分类、骨骼分类、上颌和下颌拥挤度、覆盖、覆合、上下切牙倾斜度、垂直生长型、侧面面部轮廓。其中,随机抽取30%的样本作为测试集。我们使用五折交叉验证来评估模型的泛化性能并避免过拟合。计算了四个模型在训练集和交叉验证集上的准确率。为测试集绘制了混淆矩阵,并计算了6个指标来评估模型的性能。对于决策树和随机森林模型,我们观察了特征贡献。
四个模型在训练集中的准确率在82%至90%之间,在交叉验证集中,决策树和随机森林的准确率较高。在混淆矩阵分析中,决策树在四个模型中准确率、特异性、精确率和F1分数最高,其他三个模型倾向于将过多样本分类为拔牙病例。在特征贡献分析中,拥挤度、侧面面部轮廓和下切牙倾斜度在决策树模型中排名靠前。
在仅使用临床数据进行拔牙预测的机器学习模型中,决策树的整体性能最佳。对于拔牙决策而言,具体来说,拥挤度、侧面面部轮廓和下切牙倾斜度的贡献最大。