• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过临床数据和机器学习模型识别膝关节骨关节炎的生物标志物

Identification of biomarkers for knee osteoarthritis through clinical data and machine learning models.

作者信息

Chen Wei, Zheng Haotian, Ye Binglin, Guo Tiefeng, Xu Yude, Fu Zhibin, Ji Xing, Chai Xiping, Li Shenghua, Deng Qiang

机构信息

Clinical College of Chinese Medicine, Gansu University of Chinese Medicine, Lanzhou, Gansu, China.

Department of Orthopaedics, Traditional Chinese Medical Hospital of Gansu Province, Qilihe District, Guazhou Street 418, Lanzhou, 730050,, Gansu, China.

出版信息

Sci Rep. 2025 Jan 11;15(1):1703. doi: 10.1038/s41598-025-85945-9.

DOI:10.1038/s41598-025-85945-9
PMID:39799234
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11724986/
Abstract

Knee osteoarthritis (KOA) represents a progressive degenerative disorder characterized by the gradual erosion of articular cartilage. This study aimed to develop and validate biomarker-based predictive models for KOA diagnosis using machine learning techniques. Clinical data from 2594 samples were obtained and stratified into training and validation datasets in a 7:3 ratio. Key clinical features were identified through differential analysis between KOA and control groups, combined with least absolute shrinkage and selection operator (LASSO) regression. The SHapley Additive Planning (SHAP) method was employed to rank feature importance quantitatively. Based on these rankings, predictive models were constructed using Logistic Regression (LR), Random Forest (RF), eXtreme Gradient Boosting (xGBoost), Naive Bayes (NB), Support Vector Machine (SVM), and Decision Tree (DT) algorithms. Models were developed for subsets of variables, including the top 5, top 10, top 15, and all identified features. Receiver operating characteristic (ROC) curves were applied to compare diagnostic performance across models. Additionally, a risk stratification framework for KOA prediction was designed using recursive partitioning analysis (RPA). Using difference analysis and LASSO, 44 critical clinical features were identified. Among these, age, plasma prothrombin time, gender, body mass index (BMI), and prothrombin time and international normalized ratio (PTINR) emerged as the top five features, with SHAP values of 0.1990, 0.0981, 0.0471, 0.0433, and 0.0422, respectively. Machine learning analysis demonstrated that these variables provided robust diagnostic performance for KOA. In the training set, area under the curve (AUC) values for LR, RF, xGBoost, NB, SVM, and DT models were 0.947, 0.961, 0.892, 0.952, 0.885, and 0.779, respectively. Similarly, in the validation dataset, these models achieved AUC values of 0.961, 0.943, 0.789, 0.957, 0.824, and 0.76. Among them, RF consistently exhibited superior diagnostic accuracy for KOA. Additionally, RPA analysis indicated a higher prevalence of KOA among individuals aged 54 years and older. The integration of the top five clinical variables significantly enhanced the diagnostic accuracy for KOA, particularly when employing the RF model. Moreover, the RPA model offered valuable insights to assist clinicians in refining prognostic assessments and optimizing clinical decision-making processes.

摘要

膝关节骨关节炎(KOA)是一种进行性退行性疾病,其特征是关节软骨逐渐磨损。本研究旨在利用机器学习技术开发并验证基于生物标志物的KOA诊断预测模型。获取了来自2594个样本的临床数据,并按照7:3的比例分层为训练数据集和验证数据集。通过KOA组与对照组之间的差异分析,并结合最小绝对收缩和选择算子(LASSO)回归,确定了关键临床特征。采用SHapley加法规划(SHAP)方法对特征重要性进行定量排序。基于这些排序,使用逻辑回归(LR)、随机森林(RF)、极端梯度提升(xGBoost)、朴素贝叶斯(NB)、支持向量机(SVM)和决策树(DT)算法构建了预测模型。针对变量子集开发了模型,包括排名前5、前10、前15的特征以及所有已识别的特征。应用受试者工作特征(ROC)曲线比较各模型的诊断性能。此外,使用递归划分分析(RPA)设计了一个KOA预测的风险分层框架。通过差异分析和LASSO,确定了44个关键临床特征。其中,年龄、血浆凝血酶原时间、性别、体重指数(BMI)以及凝血酶原时间和国际标准化比值(PTINR)成为排名前五的特征,其SHAP值分别为0.1990、0.0981、0.0471、0.0433和0.0422。机器学习分析表明,这些变量为KOA提供了强大的诊断性能。在训练集中,LR、RF、xGBoost、NB、SVM和DT模型的曲线下面积(AUC)值分别为0.947、0.961、0.892、0.952、0.885和0.779。同样,在验证数据集中,这些模型的AUC值分别为0.961、0.943、0.789、0.957、0.824和0.76。其中,RF对KOA始终表现出卓越的诊断准确性。此外,RPA分析表明,54岁及以上人群中KOA的患病率更高。整合排名前五的临床变量显著提高了KOA的诊断准确性,尤其是在使用RF模型时。此外,RPA模型提供了有价值的见解,有助于临床医生完善预后评估并优化临床决策过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/6248af2f3678/41598_2025_85945_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/e46fd69e4d22/41598_2025_85945_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/82bc1e11aa8c/41598_2025_85945_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/b25fb9ccda7d/41598_2025_85945_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/7c9df5833fa1/41598_2025_85945_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/99f2bed509eb/41598_2025_85945_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/18108a818402/41598_2025_85945_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/6248af2f3678/41598_2025_85945_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/e46fd69e4d22/41598_2025_85945_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/82bc1e11aa8c/41598_2025_85945_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/b25fb9ccda7d/41598_2025_85945_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/7c9df5833fa1/41598_2025_85945_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/99f2bed509eb/41598_2025_85945_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/18108a818402/41598_2025_85945_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8b9/11724986/6248af2f3678/41598_2025_85945_Fig7_HTML.jpg

相似文献

1
Identification of biomarkers for knee osteoarthritis through clinical data and machine learning models.通过临床数据和机器学习模型识别膝关节骨关节炎的生物标志物
Sci Rep. 2025 Jan 11;15(1):1703. doi: 10.1038/s41598-025-85945-9.
2
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
3
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
4
Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS:一项多中心回顾性研究
BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.
5
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
6
Construction and evaluation of machine learning-based prediction model for live birth following fresh embryo transfer in IVF/ICSI patients with polycystic ovary syndrome.基于机器学习的预测模型在多囊卵巢综合征体外受精/卵胞浆内单精子注射患者新鲜胚胎移植后活产率的构建与评估
J Ovarian Res. 2025 Apr 4;18(1):70. doi: 10.1186/s13048-025-01654-x.
7
Interpretable machine learning method to predict the risk of pre-diabetes using a national-wide cross-sectional data: evidence from CHNS.利用全国性横断面数据预测糖尿病前期风险的可解释机器学习方法:来自中国健康与营养调查的证据
BMC Public Health. 2025 Mar 26;25(1):1145. doi: 10.1186/s12889-025-22419-7.
8
Machine learning-based prediction model for patients with recurrent Staphylococcus aureus bacteremia.基于机器学习的复发性金黄色葡萄球菌菌血症患者预测模型
BMC Med Inform Decis Mak. 2025 Feb 24;25(1):99. doi: 10.1186/s12911-025-02878-z.
9
Development and validation of an interpretable machine learning model for predicting in-hospital mortality for ischemic stroke patients in ICU.用于预测ICU中缺血性中风患者院内死亡率的可解释机器学习模型的开发与验证
Int J Med Inform. 2025 Jun;198:105874. doi: 10.1016/j.ijmedinf.2025.105874. Epub 2025 Mar 9.
10
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.

引用本文的文献

1
Beyond the Surface: Nutritional Interventions Integrated with Diagnostic Imaging Tools to Target and Preserve Cartilage Integrity: A Narrative Review.透过表象:结合诊断成像工具的营养干预以靶向和维持软骨完整性:一篇叙述性综述
Biomedicines. 2025 Feb 24;13(3):570. doi: 10.3390/biomedicines13030570.
2
The Mechanism by Which Estrogen Level Affects Knee Osteoarthritis Pain in Perimenopause and Non-Pharmacological Measures.雌激素水平影响围绝经期膝关节骨关节炎疼痛的机制及非药物措施
Int J Mol Sci. 2025 Mar 7;26(6):2391. doi: 10.3390/ijms26062391.

本文引用的文献

1
Changes in urinary renal injury markers in children with Mycoplasma pneumoniae pneumonia and a prediction model for related early renal injury.肺炎支原体肺炎患儿尿肾损伤标志物的变化及相关早期肾损伤的预测模型。
Ital J Pediatr. 2024 Aug 23;50(1):155. doi: 10.1186/s13052-024-01709-7.
2
Machine learning analysis of serum cholesterol's impact on knee osteoarthritis progression.机器学习分析血清胆固醇对膝骨关节炎进展的影响。
Sci Rep. 2024 Aug 14;14(1):18852. doi: 10.1038/s41598-024-69906-2.
3
The value of deep learning-based X-ray techniques in detecting and classifying K-L grades of knee osteoarthritis: a systematic review and meta-analysis.
基于深度学习的X射线技术在检测和分类膝关节骨关节炎K-L分级中的价值:一项系统评价和荟萃分析
Eur Radiol. 2025 Jan;35(1):327-340. doi: 10.1007/s00330-024-10928-9. Epub 2024 Jul 12.
4
Investigating the Effects of Sex Hormones on Macrophage Polarization.研究性激素对巨噬细胞极化的影响。
Int J Mol Sci. 2024 Jan 12;25(2):951. doi: 10.3390/ijms25020951.
5
Efficacy and safety of curcumin therapy for knee osteoarthritis: A Bayesian network meta-analysis.姜黄素治疗膝骨关节炎的疗效和安全性:贝叶斯网状荟萃分析。
J Ethnopharmacol. 2024 Mar 1;321:117493. doi: 10.1016/j.jep.2023.117493. Epub 2023 Nov 28.
6
Lipid alternations in the plasma of COVID-19 patients with various clinical presentations.不同临床表现的 COVID-19 患者血浆中的脂质改变。
Front Immunol. 2023 Aug 29;14:1221493. doi: 10.3389/fimmu.2023.1221493. eCollection 2023.
7
Construction and validation of a chemokine family-based signature for the prediction of prognosis and therapeutic response in colon cancer.用于预测结肠癌预后和治疗反应的基于趋化因子家族的特征构建与验证
Heliyon. 2023 Jun 5;9(6):e16478. doi: 10.1016/j.heliyon.2023.e16478. eCollection 2023 Jun.
8
Artificial neural network-based diagnostic models for lung cancer combining conventional indicators with tumor markers.基于人工神经网络的结合常规指标与肿瘤标志物的肺癌诊断模型。
Exp Biol Med (Maywood). 2023 May;248(10):829-838. doi: 10.1177/15353702231177013. Epub 2023 Jul 4.
9
Acupuncture for treatment of knee osteoarthritis: A clinical practice guideline.针灸治疗膝骨关节炎:临床实践指南。
J Evid Based Med. 2023 Jun;16(2):237-245. doi: 10.1111/jebm.12526. Epub 2023 Mar 31.
10
Mathematical Modelling of Cervical Precancerous Lesion Grade Risk Scores: Linear Regression Analysis of Cellular Protein Biomarkers and Human Papillomavirus / RNA Staining Patterns.子宫颈癌前病变分级风险评分的数学建模:细胞蛋白生物标志物与人类乳头瘤病毒/RNA染色模式的线性回归分析
Diagnostics (Basel). 2023 Mar 13;13(6):1084. doi: 10.3390/diagnostics13061084.