• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于XGBoost-SHAP的膝骨关节炎可解释诊断框架:一项基于人群的回顾性队列研究

XGBoost-SHAP-based interpretable diagnostic framework for knee osteoarthritis: a population-based retrospective cohort study.

作者信息

Fan Zijuan, Song Wenzhu, Ke Yan, Jia Ligan, Li Songyan, Li Jiao Jiao, Zhang Yuqing, Lin Jianhao, Wang Bin

机构信息

Department of Orthopaedic Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, Qingchun Road No. 79, Hangzhou, China.

Department of Health Statistics, School of Public Health, Sun Yat-sen University, Guangzhou, China.

出版信息

Arthritis Res Ther. 2024 Dec 19;26(1):213. doi: 10.1186/s13075-024-03450-2.

DOI:10.1186/s13075-024-03450-2
PMID:39696605
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11657490/
Abstract

OBJECTIVE

To use routine demographic and clinical data to develop an interpretable individual-level machine learning (ML) model to diagnose knee osteoarthritis (KOA) and to identify highly ranked features.

METHODS

In this retrospective, population-based cohort study, anonymized questionnaire data was retrieved from the Wu Chuan KOA Study, Inner Mongolia, China. After feature selections, participants were divided in a 7:3 ratio into training and test sets. Class balancing was applied to the training set for data augmentation. Four ML classifiers were compared by cross-validation within the training set and their performance was further analyzed with an unseen test set. Classifications were evaluated using sensitivity, specificity, positive predictive value, negative predictive value, accuracy, area under the curve(AUC), G-means, and F1 scores. The best model was explained using Shapley values to extract highly ranked features.

RESULTS

A total of 1188 participants were investigated in this study, among whom 26.3% were diagnosed with KOA. Comparatively, XGBoost with Boruta exhibited the highest classification performance among the four models, with an AUC of 0.758, G-means of 0.800, and F1 scores of 0.703. The SHAP method reveals the top 17 features of KOA according to the importance ranking, and the average of the experience of joint pain was recognized as the most important features.

CONCLUSIONS

Our study highlights the usefulness of machine learning in unveiling important factors that influence the diagnosis of KOA to guide new prevention strategies. Further work is needed to validate this approach.

摘要

目的

利用常规人口统计学和临床数据开发一种可解释的个体水平机器学习(ML)模型,以诊断膝关节骨关节炎(KOA)并识别排名靠前的特征。

方法

在这项基于人群的回顾性队列研究中,从中国内蒙古武川KOA研究中检索匿名问卷数据。经过特征选择后,参与者以7:3的比例分为训练集和测试集。对训练集应用类平衡进行数据增强。在训练集内通过交叉验证比较了四种ML分类器,并使用未见过的测试集进一步分析了它们的性能。使用灵敏度、特异度、阳性预测值、阴性预测值、准确度、曲线下面积(AUC)、G均值和F1分数评估分类。使用Shapley值解释最佳模型以提取排名靠前的特征。

结果

本研究共调查了1188名参与者,其中26.3%被诊断为KOA。相比之下,带有Boruta的XGBoost在四个模型中表现出最高的分类性能,AUC为0.758,G均值为0.800,F1分数为0.703。SHAP方法根据重要性排名揭示了KOA的前17个特征,关节疼痛经历的平均值被认为是最重要的特征。

结论

我们的研究强调了机器学习在揭示影响KOA诊断的重要因素以指导新的预防策略方面的有用性。需要进一步的工作来验证这种方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/41d81a764d25/13075_2024_3450_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/20bbd33f5678/13075_2024_3450_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/ce8ed6f154fe/13075_2024_3450_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/b4026c723af7/13075_2024_3450_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/41d81a764d25/13075_2024_3450_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/20bbd33f5678/13075_2024_3450_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/ce8ed6f154fe/13075_2024_3450_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/b4026c723af7/13075_2024_3450_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec8e/11657490/41d81a764d25/13075_2024_3450_Fig4_HTML.jpg

相似文献

1
XGBoost-SHAP-based interpretable diagnostic framework for knee osteoarthritis: a population-based retrospective cohort study.基于XGBoost-SHAP的膝骨关节炎可解释诊断框架:一项基于人群的回顾性队列研究
Arthritis Res Ther. 2024 Dec 19;26(1):213. doi: 10.1186/s13075-024-03450-2.
2
Identification of biomarkers for knee osteoarthritis through clinical data and machine learning models.通过临床数据和机器学习模型识别膝关节骨关节炎的生物标志物
Sci Rep. 2025 Jan 11;15(1):1703. doi: 10.1038/s41598-025-85945-9.
3
An explainable machine learning-based prediction model for sarcopenia in elderly Chinese people with knee osteoarthritis.一种基于可解释机器学习的中国老年膝骨关节炎患者肌肉减少症预测模型。
Aging Clin Exp Res. 2025 Mar 7;37(1):67. doi: 10.1007/s40520-025-02931-x.
4
Predicting Mortality in Intensive Care Unit Patients With Heart Failure Using an Interpretable Machine Learning Model: Retrospective Cohort Study.利用可解释机器学习模型预测重症监护病房心力衰竭患者的死亡率:回顾性队列研究。
J Med Internet Res. 2022 Aug 9;24(8):e38082. doi: 10.2196/38082.
5
Early prediction of sepsis associated encephalopathy in elderly ICU patients using machine learning models: a retrospective study based on the MIMIC-IV database.使用机器学习模型对老年重症监护病房患者脓毒症相关脑病进行早期预测:一项基于MIMIC-IV数据库的回顾性研究
Front Cell Infect Microbiol. 2025 Apr 17;15:1545979. doi: 10.3389/fcimb.2025.1545979. eCollection 2025.
6
Interpretable machine learning model for early prediction of 28-day mortality in ICU patients with sepsis-induced coagulopathy: development and validation.用于脓毒症诱导性凝血病 ICU 患者 28 天死亡率早期预测的可解释机器学习模型:开发与验证。
Eur J Med Res. 2024 Jan 3;29(1):14. doi: 10.1186/s40001-023-01593-7.
7
Multiparametric MRI-Based Interpretable Radiomics Machine Learning Model Differentiates Medulloblastoma and Ependymoma in Children: A Two-Center Study.基于多参数 MRI 的可解释放射组学机器学习模型鉴别儿童髓母细胞瘤和室管膜瘤:一项双中心研究。
Acad Radiol. 2024 Aug;31(8):3384-3396. doi: 10.1016/j.acra.2024.02.040. Epub 2024 Mar 20.
8
A multicenter study on developing a prognostic model for severe fever with thrombocytopenia syndrome using machine learning.一项利用机器学习开发严重发热伴血小板减少综合征预后模型的多中心研究。
Front Microbiol. 2025 Mar 19;16:1557922. doi: 10.3389/fmicb.2025.1557922. eCollection 2025.
9
Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: a single-centre retrospective cohort study in China.2型糖尿病患者糖尿病周围神经病变可解释性风险预测模型的开发与内部验证:一项中国单中心回顾性队列研究
BMJ Open. 2025 Apr 3;15(4):e092463. doi: 10.1136/bmjopen-2024-092463.
10
Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.基于可解释机器学习模型的腰椎间盘退变预测:回顾性队列研究
Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.

引用本文的文献

1
Machine learning approach for the prediction of 30-day mortality in patients with sepsis-associated delirium.用于预测脓毒症相关性谵妄患者30天死亡率的机器学习方法。
PLoS One. 2025 Apr 9;20(4):e0319519. doi: 10.1371/journal.pone.0319519. eCollection 2025.

本文引用的文献

1
Early identification of macrophage activation syndrome secondary to systemic lupus erythematosus with machine learning.利用机器学习技术早期识别系统性红斑狼疮继发的巨噬细胞活化综合征。
Arthritis Res Ther. 2024 May 9;26(1):92. doi: 10.1186/s13075-024-03330-9.
2
Prediction of ineffectiveness of biological drugs using machine learning and explainable AI methods: data from the Austrian Biological Registry BioReg.使用机器学习和可解释人工智能方法预测生物药物的无效性:来自奥地利生物登记处 BioReg 的数据。
Arthritis Res Ther. 2024 Feb 8;26(1):44. doi: 10.1186/s13075-024-03277-x.
3
Gut microbiota landscape and potential biomarker identification in female patients with systemic lupus erythematosus using machine learning.
基于机器学习的系统性红斑狼疮女性患者肠道微生物组特征及潜在生物标志物鉴定。
Front Cell Infect Microbiol. 2023 Dec 19;13:1289124. doi: 10.3389/fcimb.2023.1289124. eCollection 2023.
4
Predicting the risk of diabetic retinopathy using explainable machine learning algorithms.使用可解释的机器学习算法预测糖尿病视网膜病变的风险。
Diabetes Metab Syndr. 2023 Dec;17(12):102919. doi: 10.1016/j.dsx.2023.102919. Epub 2023 Dec 4.
5
Machine learning and molecular subtype analyses provide insights into PANoptosis-associated genes in rheumatoid arthritis.机器学习和分子亚型分析为类风湿关节炎中与 PANoptosis 相关的基因提供了深入了解。
Arthritis Res Ther. 2023 Dec 1;25(1):233. doi: 10.1186/s13075-023-03222-4.
6
Evidence on risk factors for knee osteoarthritis in middle-older aged: a systematic review and meta analysis.中老年人膝关节骨关节炎危险因素的证据:系统评价和荟萃分析。
J Orthop Surg Res. 2023 Aug 29;18(1):634. doi: 10.1186/s13018-023-04089-6.
7
Fundamentals of osteoarthritis. Rehabilitation: Exercise, diet, biomechanics, and physical therapist-delivered interventions.骨关节炎基础。康复:运动、饮食、生物力学和理疗师提供的干预措施。
Osteoarthritis Cartilage. 2023 Oct;31(10):1312-1326. doi: 10.1016/j.joca.2023.06.011. Epub 2023 Jul 7.
8
Diabetes Detection Models in Mexican Patients by Combining Machine Learning Algorithms and Feature Selection Techniques for Clinical and Paraclinical Attributes: A Comparative Evaluation.基于机器学习算法和特征选择技术的临床和辅助临床特征联合对墨西哥患者的糖尿病检测模型:一项对比评估。
J Diabetes Res. 2023 Jun 26;2023:9713905. doi: 10.1155/2023/9713905. eCollection 2023.
9
Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury.用于预测脓毒症相关性急性肾损伤患者28天死亡风险的可解释集成机器学习模型。
Front Med (Lausanne). 2023 May 18;10:1165129. doi: 10.3389/fmed.2023.1165129. eCollection 2023.
10
Development and internal-external validation of statistical and machine learning models for breast cancer prognostication: cohort study.统计和机器学习模型在乳腺癌预后预测中的开发和内外验证:队列研究。
BMJ. 2023 May 10;381:e073800. doi: 10.1136/bmj-2022-073800.