Guo Biao, Li Yuan, Peng Weihang, Liu Yabin, He Fei, Zhai Zhe
Department of Physical Education, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China.
Faculty of Health Sciences and Sports, Macao Polytechnic University, Macao, China.
Front Physiol. 2025 Jun 25;16:1607276. doi: 10.3389/fphys.2025.1607276. eCollection 2025.
Knee pain significantly impairs health and quality of life among middle-aged and older adults. However, the predictive utility of lipid metabolism biomarkers for knee pain risk remains inadequately explored.
This study utilized data from the China Health and Retirement Longitudinal Study (CHARLS, 2011-2013) to investigate the association between lipid-related metabolic indicators and the risk of knee pain. Multiple lipid biomarkers and composite indices-including the lipid accumulation product (LAP), triglyceride-glucose (TyG) index, and TyG-BMI-were incorporated. Five machine learning models were developed and evaluated for predictive performance. Model interpretation was conducted using SHAP (SHapley Additive exPlanations) to identify the most influential predictors.
A higher prevalence of knee pain was observed in high-altitude, cold regions such as Qinghai and Sichuan provinces. Composite metabolic indices (LAP, TyG, and TyG-BMI) exhibited stronger predictive power than traditional single lipid markers. Among the models, the Stacked Ensemble algorithm achieved the best performance, with an AUC of 0.85 and a Brier score of 0.13. SHAP analysis highlighted LAP and TyG-related indices as the top contributors to prediction outcomes.
These findings emphasize the importance of lipid metabolism indicators in the early identification of knee pain risk. The integration of interpretable machine learning approaches and composite metabolic indices offers a promising strategy for personalized prevention in aging populations.
膝关节疼痛严重损害中老年人的健康和生活质量。然而,脂质代谢生物标志物对膝关节疼痛风险的预测效用仍未得到充分探索。
本研究利用中国健康与养老追踪调查(CHARLS,2011 - 2013年)的数据,调查脂质相关代谢指标与膝关节疼痛风险之间的关联。纳入了多种脂质生物标志物和综合指标,包括脂质蓄积产物(LAP)、甘油三酯 - 葡萄糖(TyG)指数和TyG - 体重指数(BMI)。开发并评估了五种机器学习模型的预测性能。使用SHAP(SHapley Additive exPlanations)进行模型解释,以识别最具影响力的预测因素。
在青海和四川等高海拔寒冷地区,膝关节疼痛的患病率较高。综合代谢指标(LAP、TyG和TyG - BMI)比传统的单一脂质标志物表现出更强的预测能力。在这些模型中,堆叠集成算法表现最佳,曲线下面积(AUC)为0.85,布里尔评分(Brier score)为0.13。SHAP分析突出显示LAP和TyG相关指标是预测结果的主要贡献因素。
这些发现强调了脂质代谢指标在早期识别膝关节疼痛风险中的重要性。将可解释的机器学习方法与综合代谢指标相结合,为老年人群的个性化预防提供了一种有前景的策略。