Inui Atsuyuki, Nishimoto Hanako, Mifune Yutaka, Yoshikawa Tomoya, Shinohara Issei, Furukawa Takahiro, Kato Tatsuo, Tanaka Shuya, Kusunose Masaya, Kuroda Ryosuke
Department of Orthopaedic Surgery, Kobe University Graduate School of Medicine, Kusunoki-cho, 7-5-1, Chuou-ku, Kobe City 650-0017, Japan.
Orthopaedic Surgery Kobe Rosai Hospital, Kagoike-dori 4-1-23, Chuou-ku, Kobe City 651-0053, Japan.
Bioengineering (Basel). 2023 Feb 21;10(3):277. doi: 10.3390/bioengineering10030277.
The diagnosis of osteoporosis is made by measuring bone mineral density (BMD) using dual-energy X-ray absorptiometry (DXA). Machine learning, one of the artificial intelligence methods, was used to predict low BMD without using DXA in elderly women. Medical records from 2541 females who visited the osteoporosis clinic were used in this study. As hyperparameters for machine learning, patient age, body mass index (BMI), and blood test data were used. As machine learning models, logistic regression, decision tree, random forest, gradient boosting trees, and lightGBM were used. Each model was trained to classify and predict low-BMD patients. The model performance was compared using a confusion matrix. The accuracy of each trained model was 0.772 in logistic regression, 0.739 in the decision tree, 0.775 in the random forest, 0.800 in gradient boosting, and 0.834 in lightGBM. The area under the curve (AUC) was 0.595 in the decision tree, 0.673 in logistic regression, 0.699 in the random forest, 0.840 in gradient boosting, and 0.961, which was the highest, in the lightGBM model. Important features were BMI, age, and the number of platelets. Shapley additive explanation scores in the lightGBM model showed that BMI, age, and ALT were ranked as important features. Among several machine learning models, the lightGBM model showed the best performance in the present research.
骨质疏松症的诊断是通过使用双能X线吸收法(DXA)测量骨矿物质密度(BMD)来进行的。机器学习作为人工智能方法之一,被用于在不使用DXA的情况下预测老年女性的低骨密度。本研究使用了2541名到骨质疏松症门诊就诊的女性的病历。作为机器学习的超参数,使用了患者年龄、体重指数(BMI)和血液检测数据。作为机器学习模型,使用了逻辑回归、决策树、随机森林、梯度提升树和轻量级梯度提升机(lightGBM)。每个模型都经过训练以对低骨密度患者进行分类和预测。使用混淆矩阵比较模型性能。逻辑回归中每个训练模型的准确率为0.772,决策树为0.739,随机森林为0.775,梯度提升为0.800,轻量级梯度提升机为0.834。决策树的曲线下面积(AUC)为0.595,逻辑回归为0.673,随机森林为0.699,梯度提升为0.840,轻量级梯度提升机模型的AUC最高,为0.961。重要特征为BMI、年龄和血小板数量。轻量级梯度提升机模型中的Shapley值解释分数表明,BMI、年龄和谷丙转氨酶(ALT)被列为重要特征。在几种机器学习模型中,轻量级梯度提升机模型在本研究中表现最佳。