Etemad Lily E, Heiner J Parker, Amin A A, Wu Tai-Hsien, Chao Wei-Lun, Hsieh Shin-Jung, Sun Zongyang, Guez Camille, Ko Ching-Chang
Division of Orthodontics, The Ohio State University, 305 W. 12th Avenue, Columbus, OH 43210, USA.
College of Dentistry, The Ohio State University, 305 W. 12th Avenue, Columbus, OH 43210, USA.
Bioengineering (Basel). 2024 Aug 31;11(9):888. doi: 10.3390/bioengineering11090888.
The study aimed to evaluate the effectiveness of machine learning in predicting whether orthodontic patients would require extraction or non-extraction treatment using data from two university datasets. A total of 1135 patients, with 297 from University 1 and 838 from University 2, were included during consecutive enrollment periods. The study identified 20 inputs including 9 clinical features and 11 cephalometric measurements based on previous research. Random forest (RF) models were used to make predictions for both institutions. The performance of each model was assessed using sensitivity (SEN), specificity (SPE), accuracy (ACC), and feature ranking. The model trained on the combined data from two universities demonstrated the highest performance, achieving 50% sensitivity, 97% specificity, and 85% accuracy. When cross-predicting, where the University 1 (U1) model was applied to the University 2 (U2) data and vice versa, there was a slight decrease in performance metrics (ranging from 0% to 20%). Maxillary and mandibular crowding were identified as the most significant features influencing extraction decisions in both institutions. This study is among the first to utilize datasets from two United States institutions, marking progress toward developing an artificial intelligence model to support orthodontists in clinical practice.
该研究旨在利用来自两个大学数据集的数据,评估机器学习在预测正畸患者是否需要拔牙或不拔牙治疗方面的有效性。在连续的入组期间,共纳入了1135名患者,其中297名来自大学1,838名来自大学2。该研究基于先前的研究确定了20个输入变量,包括9个临床特征和11个头颅测量指标。随机森林(RF)模型被用于对两个机构的数据进行预测。使用灵敏度(SEN)、特异度(SPE)、准确度(ACC)和特征排名来评估每个模型的性能。在来自两所大学的合并数据上训练的模型表现出最高的性能,灵敏度达到50%,特异度达到97%,准确度达到85%。当进行交叉预测时,即将大学1(U1)的模型应用于大学2(U2)的数据,反之亦然,性能指标会略有下降(范围从0%到20%)。上颌和下颌拥挤被确定为两个机构中影响拔牙决策的最显著特征。这项研究是最早利用来自美国两个机构的数据集之一,标志着在开发人工智能模型以支持正畸医生临床实践方面取得了进展。