Liu Mengdie, Guo Wen, Peng Jin, Wu Jinhui
Center of Gerontology and Geriatrics, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, China.
Front Public Health. 2025 Aug 7;13:1614374. doi: 10.3389/fpubh.2025.1614374. eCollection 2025.
Sarcopenia (SP) is a progressive, age-related disease that may result in various adverse health outcomes and even mortality in older adults. Accurately predicting the mortality risk of older adults with SP is essential for informed clinical decision-making. This study aims to utilize machine learning techniques that incorporate sociodemographic factors, health-related metrics, lifestyle variables, and biomarker data to improve risk stratification and management in older adults with SP.
We analyzed data from the NHANES from 1999-2006 and 2010-2018, including a total of 1,619 older adult patients with SP, with a 10-year follow-up period for this population, during which 541 (33%) patients died and 1,078 (67%) survived. This study extracted 36 clinical variables for each patient, encompassing sociodemographic factors, health-related metrics, and biochemical markers. Feature selection was performed using Lasso Regression, XGBoost, and Random Forest machine learning algorithms, and a nomogram model was developed using univariate and multivariate Cox regression analyses, with validation of its accuracy, concordance, and clinical applicability.
A total of 12 feature variables were identified through the combined use of three machine learning methods. Univariate and multivariate Cox regression analyses identified Age, Height, Neutrophil count (NENO), The ratio of hemoglobin to red cell distribution width (HRR), Uric Acid (UA), and Creatinine as significant predictors of mortality in older adults with SP, and a nomogram model was constructed based on these feature variables, with model performance assessed through discrimination, calibration curves, and clinical utility evaluation. The model achieved AUC values of 0.753, 0.773, 0.782, and 0.800 at 1, 3, 5, and 10 years, respectively, demonstrating good concordance and adequate calibration. Decision curve analysis (DCA) indicated that the model had broad applicability in predicting short-term and long-term outcomes in older adult patients with SP. Finally, based on the nomogram risk score, patients were stratified into risk groups and survival curves were plotted, illustrating a significantly lower survival probability in the high-risk group compared to the low-risk group ( < 0.0001).
Utilizing advanced statistical and machine learning techniques, we developed and validated a prognostic model for SP in the older adult that integrates multimodal data, enhancing predictive accuracy and reliability. This model provides valuable insights for clinicians, facilitates risk stratification, and provides personalized interventions for older adults with SP.
肌肉减少症(SP)是一种与年龄相关的进行性疾病,可能导致老年人出现各种不良健康后果甚至死亡。准确预测老年SP患者的死亡风险对于明智的临床决策至关重要。本研究旨在利用机器学习技术,纳入社会人口学因素、健康相关指标、生活方式变量和生物标志物数据,以改善老年SP患者的风险分层和管理。
我们分析了1999 - 2006年和2010 - 2018年美国国家健康与营养检查调查(NHANES)的数据,共纳入1619例老年SP患者,对该人群进行了10年的随访,在此期间541例(33%)患者死亡,1078例(67%)存活。本研究为每位患者提取了36个临床变量,包括社会人口学因素、健康相关指标和生化标志物。使用套索回归、极端梯度提升(XGBoost)和随机森林机器学习算法进行特征选择,并使用单变量和多变量Cox回归分析建立列线图模型,验证其准确性、一致性和临床适用性。
通过联合使用三种机器学习方法共识别出12个特征变量。单变量和多变量Cox回归分析确定年龄、身高、中性粒细胞计数(NENO)、血红蛋白与红细胞分布宽度之比(HRR)、尿酸(UA)和肌酐是老年SP患者死亡的重要预测因素,并基于这些特征变量构建了列线图模型,通过区分度、校准曲线和临床效用评估对模型性能进行评估。该模型在1年、3年、5年和10年时的曲线下面积(AUC)值分别为0.753、0.773、0.782和0.800,显示出良好的一致性和充分的校准。决策曲线分析(DCA)表明该模型在预测老年SP患者的短期和长期结局方面具有广泛的适用性。最后,根据列线图风险评分将患者分层为风险组并绘制生存曲线,结果表明高风险组的生存概率显著低于低风险组(<0.0001)。
利用先进的统计和机器学习技术,我们开发并验证了一种针对老年人SP的预后模型,该模型整合了多模态数据,提高了预测的准确性和可靠性。该模型为临床医生提供了有价值的见解,有助于风险分层,并为老年SP患者提供个性化干预。