An interpretable machine learning model of cross-sectional U.S. county-level obesity prevalence using explainable artificial intelligence.

Affiliation

Department of Psychology, University of Kansas, Lawrence, Kansas, United States of America.

Publication information

PLoS One. 2023 Oct 5;18(10):e0292341. doi: 10.1371/journal.pone.0292341. eCollection 2023.

DOI: 10.1371/journal.pone.0292341

PMID: 37796874

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10553328/

Abstract

BACKGROUND

Obesity prevalence varies considerably across counties in the United States. Machine learning algorithms accurately predict this geographic variation, but the models are often uninterpretable and viewed as black boxes.

OBJECTIVE

The goal of this study is to extract knowledge from machine learning models for county-level variation in obesity prevalence.

METHODS

This study applies explainable artificial intelligence methods to machine learning models of cross-sectional obesity prevalence data collected from 3,142 counties in the United States. County-level features were drawn from 7 broad categories: health outcomes, health behaviors, clinical care, social and economic factors, physical environment, demographics, and severe housing conditions. The explainable methods applied to the random forest prediction models include feature importance, accumulated local effects, a global surrogate decision tree, and local interpretable model-agnostic explanations.
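As an illustration of the first method listed above, the following is a minimal sketch of permutation feature importance, one common variant of feature importance. The abstract does not say which variant the authors used, so this is an assumption. The `black_box` function is a hypothetical stand-in for a trained model's prediction function (it is not a random forest), and the feature names and data are synthetic, not taken from the study.

```python
import random

def black_box(row):
    # Hypothetical stand-in for a trained model's predict(); NOT a random forest.
    # It uses the first two features and ignores the third.
    inactivity, diabetes, noise = row
    return 10.0 + 0.6 * inactivity + 0.9 * diabetes

def mse(model, X, y):
    # Mean squared error of the model's predictions on (X, y).
    return sum((model(r) - t) ** 2 for r, t in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_repeats=20, seed=0):
    # Importance of feature j = average increase in MSE after shuffling column j,
    # which breaks the link between that feature and the outcome.
    rng = random.Random(seed)
    baseline = mse(model, X, y)
    importances = []
    for j in range(len(X[0])):
        increases = []
        for _ in range(n_repeats):
            column = [r[j] for r in X]
            rng.shuffle(column)
            X_perm = [r[:j] + (v,) + r[j + 1:] for r, v in zip(X, column)]
            increases.append(mse(model, X_perm, y) - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances

# Synthetic "counties": (inactivity %, diabetes %, irrelevant noise feature).
rng = random.Random(42)
X = [(rng.uniform(15, 40), rng.uniform(5, 20), rng.uniform(0, 1)) for _ in range(200)]
y = [black_box(r) + rng.gauss(0, 0.5) for r in X]

importances = permutation_importance(black_box, X, y)
for name, value in zip(["inactivity", "diabetes", "noise"], importances):
    print(f"{name}: {value:.3f}")
```

In this sketch, shuffling the ignored noise feature leaves the predictions unchanged, so its importance is zero, while shuffling either feature the model relies on increases the error.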

RESULTS

The results show that machine learning models explained 79% of the variance in obesity prevalence, with physical inactivity, diabetes, and smoking prevalence being the most important factors in predicting obesity prevalence.
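For readers unfamiliar with "variance explained", the sketch below computes the coefficient of determination (R²), which is the usual meaning of that phrase. It is an assumption that the reported 79% is an R² of this form; the abstract does not say how it was estimated (for example, on held-out or out-of-bag data). The numbers are made up for illustration and are not study data.

```python
def r_squared(y_true, y_pred):
    # R^2 = 1 - (residual sum of squares) / (total sum of squares).
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Made-up obesity prevalence values (%) for four hypothetical counties.
observed = [30.0, 34.0, 28.0, 36.0]
predicted = [31.0, 33.0, 29.0, 35.0]

score = r_squared(observed, predicted)
print(f"R^2 = {score:.2f}")
```

An R² of 0.79 would mean that the model's residual sum of squares is 21% of the total sum of squares around the mean prevalence.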

CONCLUSIONS

Interpretable machine learning models of health behaviors and outcomes provide substantial insight into obesity prevalence variation across counties in the United States.

Similar articles

Identification of Factors Associated With Variation in US County-Level Obesity Prevalence Rates Using Epidemiologic vs Machine Learning Models.

JAMA Netw Open. 2019 Apr 5;2(4):e192884. doi: 10.1001/jamanetworkopen.2019.2884.

Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology.

Can J Cardiol. 2022 Feb;38(2):204-213. doi: 10.1016/j.cjca.2021.09.004. Epub 2021 Sep 14.

Socioeconomic and environmental determinants of asthma prevalence: a cross-sectional study at the U.S. County level using geographically weighted random forests.

Int J Health Geogr. 2023 Aug 10;22(1):18. doi: 10.1186/s12942-023-00343-6.

County-level socio-environmental factors and obesity prevalence in the United States.

Diabetes Obes Metab. 2024 May;26(5):1766-1774. doi: 10.1111/dom.15488. Epub 2024 Feb 14.

Explainable AI: Machine Learning Interpretation in Blackcurrant Powders.

Sensors (Basel). 2024 May 17;24(10):3198. doi: 10.3390/s24103198.

An Explainable Artificial Intelligence Framework for the Deterioration Risk Prediction of Hepatitis Patients.

J Med Syst. 2021 Apr 13;45(5):61. doi: 10.1007/s10916-021-01736-5.

IHCP: interpretable hepatitis C prediction system based on black-box machine learning models.

BMC Bioinformatics. 2023 Sep 6;24(1):333. doi: 10.1186/s12859-023-05456-0.

Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA.

Sci Rep. 2021 Dec 16;11(1):24090. doi: 10.1038/s41598-021-03198-8.

An Explainable AI Approach for the Rapid Diagnosis of COVID-19 Using Ensemble Learning Algorithms.

Front Public Health. 2022 Jun 21;10:874455. doi: 10.3389/fpubh.2022.874455. eCollection 2022.

Cited by

Machine-learning-based model for analysing and accurately predicting factors related to burnout in healthcare workers.

BMJ Public Health. 2025 Sep 4;3(2):e000777. doi: 10.1136/bmjph-2023-000777. eCollection 2025.

Bridging the gap in obesity research: A consensus statement from the European Society for Clinical Investigation.

Eur J Clin Invest. 2025 Aug;55(8):e70059. doi: 10.1111/eci.70059. Epub 2025 May 15.

Using interpretable machine learning methods to identify the relative importance of lifestyle factors for overweight and obesity in adults: pooled evidence from CHNS and NHANES.

BMC Public Health. 2024 Nov 1;24(1):3034. doi: 10.1186/s12889-024-20510-z.
