Suppr超能文献

基于机器学习的可解释风险模型识别超重人群的影响因素:一项大型回顾性队列研究。

Identification of influence factors in overweight population through an interpretable risk model based on machine learning: a large retrospective cohort.

机构信息

Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China.

Department of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital South Branch, Fujian Provincial Hospital Jinshan Branch, Fujian Provincial Hospital, Fuzhou, 350001, PR China.

出版信息

Endocrine. 2024 Mar;83(3):604-614. doi: 10.1007/s12020-023-03536-y. Epub 2023 Sep 30.

Abstract

BACKGROUND

The identification of associated overweight risk factors is crucial to future health risk predictions and behavioral interventions. Several consensus problems remain in machine learning, such as cross-validation, and the resulting model may suffer from overfitting or poor interpretability.

METHODS

This study employed nine commonly used machine learning methods to construct overweight risk models. The general community are the target of this study, and a total of 10,905 Chinese subjects from Ningde City in Fujian province, southeast China, participated. The best model was selected through appropriate verification and validation and was suitably explained.

RESULTS

The overweight risk models employing machine learning exhibited good performance. It was concluded that CatBoost, which is used in the construction of clinical risk models, may surpass previous machine learning methods. The visual display of the Shapley additive explanation value for the machine model variables accurately represented the influence of each variable in the model.

CONCLUSIONS

The construction of an overweight risk model using machine learning may currently be the best approach. Moreover, CatBoost may be the best machine learning method. Furthermore, combining Shapley's additive explanation and machine learning methods can be effective in identifying disease risk factors for prevention and control.

摘要

背景

识别相关的超重危险因素对于未来的健康风险预测和行为干预至关重要。机器学习中仍然存在一些共识问题,例如交叉验证,由此产生的模型可能存在过拟合或可解释性差的问题。

方法

本研究采用了九种常用的机器学习方法来构建超重风险模型。本研究的目标人群是一般社区,共有来自中国东南部福建省宁德市的 10905 名中国受试者参与。通过适当的验证和确认选择了最佳模型,并进行了适当的解释。

结果

采用机器学习的超重风险模型表现出良好的性能。可以得出结论,在构建临床风险模型时使用的 CatBoost 可能超过了以前的机器学习方法。对机器模型变量的 Shapley 加法解释值的可视化显示准确地表示了模型中每个变量的影响。

结论

使用机器学习构建超重风险模型可能是目前最好的方法。此外,CatBoost 可能是最好的机器学习方法。此外,结合 Shapley 的加法解释和机器学习方法可以有效地识别疾病风险因素,以进行预防和控制。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验