Evaluation of nutritional status and clinical depression classification using an explainable machine learning method.

Author information

Hosseinzadeh Kasani Payam, Lee Jung Eun, Park Chihyun, Yun Cheol-Heui, Jang Jae-Won, Lee Sang-Ah

Affiliations

Department of Neurology, Kangwon National University Hospital, Chuncheon, Republic of Korea.

Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of Korea.

Publication information

Front Nutr. 2023 May 9;10:1165854. doi: 10.3389/fnut.2023.1165854. eCollection 2023.

DOI:10.3389/fnut.2023.1165854

PMID:37229464

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10203418/

Abstract

INTRODUCTION

Depression is a prevalent disorder worldwide, with potentially severe implications. It contributes significantly to an increased risk of diseases associated with multiple risk factors. Early, accurate diagnosis of depressive symptoms is a critical first step toward management, intervention, and prevention. Various nutritional and dietary compounds have been suggested to be involved in the onset, maintenance, and severity of depressive disorders. The association between nutritional risk factors and the occurrence of depression remains difficult to characterize, and the interplay of these markers has yet to be fully explored with supervised machine learning.

METHODS

This study aimed to determine the ability of machine learning-based decision support methods to identify the presence of depression using publicly available health data from the Korean National Health and Nutrition Examination Survey. Two exploration techniques, uniform manifold approximation and projection (UMAP) and Pearson correlation, were used for exploratory analysis of the datasets. Grid search optimization with cross-validation was performed to fine-tune the models for classifying depression with the highest accuracy. Several performance measures, including accuracy, precision, recall, F1 score, confusion matrix, areas under the precision-recall and receiver operating characteristic curves, and calibration plot, were used to compare classifier performance. We further investigated feature importance and interpreted predictions at both the population and individual levels using ELI5 visualizations, partial dependence plots, local interpretable model-agnostic explanations (LIME), and Shapley additive explanations (SHAP).
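As an illustration of the tuning step, the following is a minimal sketch of grid search with k-fold cross-validation, written with the Python standard library only. It is not the study's pipeline: the k-nearest-neighbour classifier, the hyperparameter grid, the helper names, and the synthetic two-feature data are all assumptions made for the example, and the abstract does not state which grids or fold counts the authors used.

```python
import random
from itertools import product
from statistics import mean


def make_folds(n_samples, n_folds, seed=0):
    """Shuffle the sample indices and deal them into n_folds disjoint folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]


def knn_predict(train_X, train_y, x, n_neighbors, metric):
    """Majority vote among the n_neighbors training points closest to x."""
    def distance(a, b):
        if metric == "manhattan":
            return sum(abs(p - q) for p, q in zip(a, b))
        return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5  # euclidean

    nearest = sorted(range(len(train_X)), key=lambda i: distance(train_X[i], x))
    votes = [train_y[i] for i in nearest[:n_neighbors]]
    return int(sum(votes) * 2 > len(votes))


def cross_val_accuracy(X, y, folds, n_neighbors, metric):
    """Mean held-out accuracy across folds for one hyperparameter setting."""
    scores = []
    for held_out in folds:
        held = set(held_out)
        # "Fit" on every sample outside the held-out fold.
        train = [i for i in range(len(X)) if i not in held]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        hits = sum(
            knn_predict(train_X, train_y, X[i], n_neighbors, metric) == y[i]
            for i in held_out
        )
        scores.append(hits / len(held_out))
    return mean(scores)


def grid_search(X, y, grid, n_folds=5):
    """Try every combination in grid; return (best_params, best_score)."""
    folds = make_folds(len(X), n_folds)
    best_params, best_score = None, -1.0
    for n_neighbors, metric in product(grid["n_neighbors"], grid["metric"]):
        score = cross_val_accuracy(X, y, folds, n_neighbors, metric)
        if score > best_score:
            best_params = {"n_neighbors": n_neighbors, "metric": metric}
            best_score = score
    return best_params, best_score


# Synthetic two-feature data, invented for illustration (not survey data).
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(40)]
X += [[rng.gauss(5, 1), rng.gauss(5, 1)] for _ in range(40)]
y = [0] * 40 + [1] * 40

grid = {"n_neighbors": [1, 3, 5], "metric": ["euclidean", "manhattan"]}
best_params, best_score = grid_search(X, y, grid)
print(best_params, round(best_score, 3))
```

Every hyperparameter combination is scored on the same folds, so the comparison between settings is not confounded by different splits. In practice a library routine such as scikit-learn's `GridSearchCV` would replace the hand-written loop.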

RESULTS

On the original dataset, XGBoost achieved the highest accuracy (86.18%) and the random forest model the highest area under the curve (84.96%). On the quantile-based dataset, XGBoost performed best, with an accuracy of 86.02% and an area under the curve of 85.34%. The explanation methods provided complementary views of the relative changes in feature values, allowing the importance of emerging depression risk factors to be identified.
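The original-dataset results name different models as best on accuracy and on area under the curve. The sketch below shows why that can happen: accuracy depends on a decision threshold, while the area under the receiver operating characteristic curve depends only on how the scores rank positives against negatives. The labels, the scores, and the 0.5 threshold are invented for illustration and are not the study's outputs.

```python
def accuracy(y_true, scores, threshold=0.5):
    """Fraction of samples whose thresholded score matches the label."""
    preds = [int(s >= threshold) for s in scores]
    return sum(p == t for p, t in zip(preds, y_true)) / len(y_true)


def roc_auc(y_true, scores):
    """Probability that a random positive outscores a random negative.

    Ties count as half a win; this pairwise form equals the area under
    the ROC curve.
    """
    pos = [s for s, t in zip(scores, y_true) if t == 1]
    neg = [s for s, t in zip(scores, y_true) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Six negatives followed by four positives (hypothetical labels).
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]

# Model A: mostly on the correct side of 0.5, but two samples are badly ranked.
scores_a = [0.1, 0.2, 0.3, 0.4, 0.4, 0.9, 0.2, 0.6, 0.7, 0.8]

# Model B: ranks every positive above every negative, but all scores sit
# below 0.5, so thresholding at 0.5 labels everything negative.
scores_b = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.48]

acc_a, auc_a = accuracy(y_true, scores_a), roc_auc(y_true, scores_a)
acc_b, auc_b = accuracy(y_true, scores_b), roc_auc(y_true, scores_b)

print(f"Model A: accuracy={acc_a:.2f}, AUC={auc_a:.4f}")  # accuracy=0.80, AUC=0.6875
print(f"Model B: accuracy={acc_b:.2f}, AUC={auc_b:.4f}")  # accuracy=0.60, AUC=1.0000
```

Model A wins on accuracy and Model B on AUC, which is the same kind of split the original-dataset results report between two classifiers.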

DISCUSSION

The strength of our approach is the large sample size used to train a fine-tuned model. The machine learning-based analysis showed that the hyperparameter-tuned model achieves empirically higher accuracy in classifying patients with depressive disorder, as supported by the set of interpretability experiments, and can be an effective solution for disease control.
