Yang Siwei, Yu Jianan, Chen Qiyang, Sun Xuedong, Hu Yuefeng, Su Tianhao, Li Jian, Jin Long
Department of Interventional Radiology, Beijing Friendship Hospital, Capital Medical University, China.
Department of Ultrasound, Beijing Tongren Hospital, Capital Medical University, China.
Ann Hepatol. 2024 Oct 5;30(2):101585. doi: 10.1016/j.aohep.2024.101585.
With rising prevalence of pre-sarcopenia in metabolic dysfunction-associated steatotic liver disease (MASLD), this study aimed to develop and validate machine learning-based model to identify pre-sarcopenia in MASLD population.
A total of 571 MASLD subjects were screened from the National Health and Nutrition Examination Survey 2017-2018. This cohort was randomly divided into training set and internal testing set with a ratio of 7:3. Sixty-six MASLD subjects were collected from our institution as external validation set. Four binary classifiers, including Random Forest (RF), support vector machine, and extreme gradient boosting and logistic regression, were fitted to identify pre-sarcopenia. The best-performing model was further validated in external validation set. Model performance was assessed in terms of discrimination and calibration. Shapley Additive explanations were used for model interpretability.
The pre-sarcopenia rate was 17.51 % and 15.16 % in NHANES cohort and external validation set, respectively. RF outperformed other models with area under receiver operating characteristic curve (AUROC) of 0.819 (95 %CI: 0.749, 0.889). When six top-ranking features were retained as per variable importance, including weight-adjusted waist, sex, race, creatinine, education and alkaline phosphatase, a final RF model reached an AUROC being 0.824 (0.737, 0.910) and 0.732 (95 %CI: 0.529, 0.936) in internal and external validation sets, respectively. The model robustness was proved in sensitivity analysis. The calibration curve and decision curve analysis confirmed a good calibration capacity and good clinical usage.
This study proposed a user-friendly model using explainable machine learning algorithm to predict pre-sarcopenia in MASLD population. A web-based tool was provided to screening pre-sarcopenia in community and hospitalization settings.
随着代谢功能障碍相关脂肪性肝病(MASLD)患者中肌肉减少症前期患病率的上升,本研究旨在开发并验证基于机器学习的模型,以识别MASLD人群中的肌肉减少症前期。
从2017 - 2018年国家健康与营养检查调查中筛选出571例MASLD受试者。该队列以7:3的比例随机分为训练集和内部测试集。从我们机构收集了66例MASLD受试者作为外部验证集。拟合了四个二元分类器,包括随机森林(RF)、支持向量机、极端梯度提升和逻辑回归,以识别肌肉减少症前期。在外部验证集中进一步验证表现最佳的模型。从区分度和校准方面评估模型性能。使用夏普利值加法解释进行模型解释。
在NHANES队列和外部验证集中,肌肉减少症前期发生率分别为17.51%和15.16%。RF的表现优于其他模型,其受试者操作特征曲线下面积(AUROC)为0.819(95%CI:0.749,0.889)。根据变量重要性保留六个排名靠前的特征,包括体重调整腰围、性别、种族、肌酐、教育程度和碱性磷酸酶,最终的RF模型在内部和外部验证集中的AUROC分别为0.824(0.737,0.910)和0.732(95%CI:0.529,0.936)。敏感性分析证明了模型的稳健性。校准曲线和决策曲线分析证实了良好的校准能力和良好的临床实用性。
本研究提出了一种使用可解释机器学习算法的用户友好型模型,用于预测MASLD人群中的肌肉减少症前期。提供了一个基于网络的工具,用于在社区和住院环境中筛查肌肉减少症前期。