Tabib Saghar, Alizadeh Seyed Danial, Andishgar Aref, Pezeshki Babak, Keshavarzian Omid, Tabrizi Reza
Student Research Committee, School of Medicine, Fasa University of Medical Sciences, Fasa, Iran.
Sina Trauma and Surgery Research Centre, Tehran University of Medical Sciences, Tehran, Iran.
Endocrinol Diabetes Metab. 2025 Jan;8(1):e70023. doi: 10.1002/edm2.70023.
In Iran, the assessment of osteoporosis through tools like dual-energy X-ray absorptiometry poses significant challenges due to their high costs and limited availability, particularly in small cities and rural areas. Our objective was to employ a variety of machine learning (ML) techniques to evaluate the accuracy and precision of each method, with the aim of identifying the most accurate pattern for diagnosing the osteoporosis risks.
We analysed the data related to osteoporosis risk factors obtained from the Fasa Adults Cohort Study in eight ML methods, including logistic regression (LR), baseline LR, decision tree classifiers (DT), support vector classifiers (SVC), random forest classifiers (RF), linear discriminant analysis (LDA), K nearest neighbour classifiers (KNN) and extreme gradient boosting (XGB). For each algorithm, we calculated accuracy, precision, sensitivity, specificity, F1 score and area under the curve (AUC) and compared them.
The XGB model with an AUC of 0.78 (95% confidence interval [CI]: 0.74-0.82) and an accuracy of 0.79 (0.75-0.83) demonstrated the best performance, while AUC and accuracy values of RF were achieved (0.78 and 0.77), LR (0.78 and 0.77), LDA (0.78 and 0.76), DT (0.76 and 0.79), SVC (0.71 and 0.64), KNN (0.63 and 0.59) and baseline LR (0.72 and 0.82), respectively.
The XGB model had the best performance in assessing the risk of osteoporosis in the Iranian population. Given the disadvantages and challenges associated with traditional osteoporosis diagnostic tools, the implementation of ML algorithms for the early identification of individuals with osteoporosis can lead to a significant reduction in morbidity and mortality related to this condition. This advancement not only alleviates the substantial financial burden placed on the healthcare systems of various countries due to the treatment of complications arising from osteoporosis but also encourages health policies to shift toward more preventive approaches for managing this disease.
在伊朗,通过双能X线吸收测定法等工具评估骨质疏松症面临重大挑战,因为这些方法成本高昂且可用性有限,尤其是在小城市和农村地区。我们的目标是采用多种机器学习(ML)技术来评估每种方法的准确性和精确性,以确定诊断骨质疏松症风险的最准确模式。
我们分析了从法萨成人队列研究中获得的与骨质疏松症风险因素相关的数据,采用了八种ML方法,包括逻辑回归(LR)、基线LR、决策树分类器(DT)、支持向量分类器(SVC)、随机森林分类器(RF)、线性判别分析(LDA)、K近邻分类器(KNN)和极端梯度提升(XGB)。对于每种算法,我们计算了准确性、精确性、敏感性、特异性、F1分数和曲线下面积(AUC)并进行了比较。
AUC为0.78(95%置信区间[CI]:0.74 - 0.82)且准确性为0.79(0.75 - 0.83)的XGB模型表现最佳,而RF的AUC和准确性值分别为(0.78和0.77),LR为(0.78和0.77),LDA为(0.78和0.76),DT为(0.76和0.79),SVC为(0.71和0.64),KNN为(0.63和0.59),基线LR为(0.72和0.82)。
XGB模型在评估伊朗人群骨质疏松症风险方面表现最佳。鉴于传统骨质疏松症诊断工具存在的缺点和挑战,实施ML算法用于早期识别骨质疏松症患者可显著降低与此病症相关的发病率和死亡率。这一进展不仅减轻了各国医疗系统因治疗骨质疏松症并发症而承受的巨大经济负担,还促使卫生政策转向更具预防性的方法来管理这种疾病。