• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

随机森林分析在流行病学研究中的适用性:在横断面设计中探索与社会人口学和生活方式相关的超重风险因素。

Suitability of random forest analysis for epidemiological research: Exploring sociodemographic and lifestyle-related risk factors of overweight in a cross-sectional design.

机构信息

1 Department of Public Health, University of Helsinki, Finland.

2 Department of Public Health Solutions, National Institute for Health and Welfare, Finland.

出版信息

Scand J Public Health. 2018 Jul;46(5):557-564. doi: 10.1177/1403494817736944. Epub 2017 Oct 30.

DOI:10.1177/1403494817736944
PMID:29082809
Abstract

AIMS

Factors that contribute to the development of overweight are numerous and form a complex structure with many unknown interactions and associations. We aimed to explore this structure (i.e. the mutual importance or hierarchy of sociodemographic and lifestyle-related risk factors of being overweight) using a machine-learning technique called random forest (RF). The results were compared with traditional logistic regression (LR) analysis.

METHODS

The cross-sectional FINRISK 2007 Study included 4757 Finns (aged 25-74 years). Information on participants' lifestyle and sociodemographic characteristics were collected with questionnaires. Diet was assessed, using a validated food-frequency questionnaire. Height and weight were measured. Participants with a body mass index (BMI) ≥25 kg/m were classified as overweight. R-statistical software was used to run RF analysis ('randomForest') to derive estimates for variable importance and out-of-bag error, which were compared to a LR model.

RESULTS

In total, 704 (32%) men and 1119 (44%) women had normal BMI, whereas 1502 (69%) men and 1432 (57%) women had BMI ≥25. Estimated error rates for the models were similar (RF vs. LR: 42% vs. 40% for men, 38% vs. 35% for women). Both models ranked age, education and physical activity as the most important risk factors for being overweight, but RF ranked macronutrients (carbohydrates and protein) as more important compared to LR.

CONCLUSIONS

RF did not demonstrate higher power in variable selection compared to LR in our study. The features of RF are more likely to appear beneficial in settings with a larger number of predictors.

摘要

目的

导致超重的因素很多,形成了一个具有许多未知相互作用和关联的复杂结构。我们旨在使用一种称为随机森林(RF)的机器学习技术来探索这种结构(即超重的社会人口统计学和生活方式相关风险因素的相互重要性或层次结构)。结果与传统的逻辑回归(LR)分析进行了比较。

方法

横断面 FINRISK 2007 研究包括 4757 名芬兰人(年龄 25-74 岁)。使用问卷收集了参与者的生活方式和社会人口统计学特征信息。使用经过验证的食物频率问卷评估饮食。测量身高和体重。BMI≥25kg/m 的参与者被归类为超重。使用 R 统计软件运行 RF 分析('randomForest')以得出变量重要性和袋外误差的估计值,并与 LR 模型进行比较。

结果

共有 704 名(32%)男性和 1119 名(44%)女性的 BMI 正常,而 1502 名(69%)男性和 1432 名(57%)女性的 BMI≥25。模型的估计误差率相似(RF 与 LR:男性为 42%比 40%,女性为 38%比 35%)。两种模型都将年龄、教育程度和体力活动列为超重的最重要危险因素,但 RF 将宏量营养素(碳水化合物和蛋白质)列为比 LR 更重要的危险因素。

结论

在我们的研究中,RF 并没有在变量选择方面表现出比 LR 更高的能力。RF 的特征在预测因子数量较多的情况下更有可能显示出优势。

相似文献

1
Suitability of random forest analysis for epidemiological research: Exploring sociodemographic and lifestyle-related risk factors of overweight in a cross-sectional design.随机森林分析在流行病学研究中的适用性:在横断面设计中探索与社会人口学和生活方式相关的超重风险因素。
Scand J Public Health. 2018 Jul;46(5):557-564. doi: 10.1177/1403494817736944. Epub 2017 Oct 30.
2
Dietary and lifestyle characteristics associated with normal-weight obesity: the National FINRISK 2007 Study.与正常体重肥胖相关的饮食和生活方式特征:芬兰国家 FINRISK 2007 研究。
Br J Nutr. 2014 Mar 14;111(5):887-94. doi: 10.1017/S0007114513002742. Epub 2013 Nov 14.
3
Longitudinal analysis of lifestyle habits in relation to body mass index, onset of overweight and obesity: results from a large population-based cohort in Sweden.与体重指数、超重和肥胖发病相关的生活习惯纵向分析:来自瑞典一项大型基于人群队列的结果。
Scand J Public Health. 2015 May;43(3):236-45. doi: 10.1177/1403494815569865. Epub 2015 Mar 4.
4
Sociodemographic and Lifestyle Factors in relation to Overweight Defined by BMI and "Normal-Weight Obesity".与 BMI 定义的超重和“正常体重肥胖”相关的社会人口学和生活方式因素。
J Obes. 2020 Jan 7;2020:2070297. doi: 10.1155/2020/2070297. eCollection 2020.
5
Association of breakfast consumption with body mass index and prevalence of overweight/obesity in a nationally-representative survey of Canadian adults.在一项针对加拿大成年人的全国代表性调查中,早餐摄入与体重指数及超重/肥胖患病率的关联。
Nutr J. 2016 Mar 31;15:33. doi: 10.1186/s12937-016-0151-3.
6
Trends in BMI, diet and lifestyle between 1976 and 2005 in North Sydney.1976年至2005年北悉尼地区的体重指数、饮食和生活方式趋势。
Asia Pac J Clin Nutr. 2009;18(3):453-61.
7
Assessing obesity and overweight in a high mountain Pakistani population.评估巴基斯坦高山地区人群的肥胖和超重情况。
Trop Med Int Health. 2004 Apr;9(4):526-32. doi: 10.1111/j.1365-3156.2004.01220.x.
8
[Overweight and obesity in Colombian college students and its association with physical activity].[哥伦比亚大学生的超重与肥胖及其与体育活动的关联]
Nutr Hosp. 2014 Nov 1;31(2):629-36. doi: 10.3305/nh.2015.31.2.7757.
9
Alterations in Multiple Lifestyle Factors in Subjects with the Metabolic Syndrome Independently of Obesity.代谢综合征患者中多种生活方式因素的改变与肥胖无关。
Metab Syndr Relat Disord. 2017 Apr;15(3):118-123. doi: 10.1089/met.2016.0120. Epub 2017 Feb 16.
10
Lifestyle and food habits changes after migration: a focus on immigrant women in Modena (Italy).移民后的生活方式和饮食习惯变化:以摩德纳(意大利)的移民女性为重点。
Ann Ig. 2015 Sep-Oct;27(5):748-59. doi: 10.7416/ai.2015.2067.

引用本文的文献

1
Machine learning for modeling and identifying risk factors of pancreatic fistula.用于胰腺瘘风险因素建模与识别的机器学习
World J Gastrointest Oncol. 2025 Apr 15;17(4):100089. doi: 10.4251/wjgo.v17.i4.100089.
2
Using Machine Learning to Predict Weight Gain in Adults: an Observational Analysis From the All of Us Research Program.使用机器学习预测成年人的体重增加:来自“我们所有人”研究项目的观察性分析。
J Surg Res. 2025 Feb;306:43-53. doi: 10.1016/j.jss.2024.11.042. Epub 2024 Dec 31.
3
Classification of Obesity among South African Female Adolescents: Comparative Analysis of Logistic Regression and Random Forest Algorithms.
南非女性青少年肥胖的分类:逻辑回归与随机森林算法的比较分析
Int J Environ Res Public Health. 2023 Dec 19;21(1):2. doi: 10.3390/ijerph21010002.
4
Trends in the ability of socioeconomic position to predict individual body mass index: an analysis of repeated cross-sectional data, 1991-2019.社会经济地位预测个体体重指数能力的趋势:1991-2019 年重复横断面数据分析。
BMC Med. 2023 Nov 13;21(1):434. doi: 10.1186/s12916-023-03103-2.
5
Identification of influence factors in overweight population through an interpretable risk model based on machine learning: a large retrospective cohort.基于机器学习的可解释风险模型识别超重人群的影响因素:一项大型回顾性队列研究。
Endocrine. 2024 Mar;83(3):604-614. doi: 10.1007/s12020-023-03536-y. Epub 2023 Sep 30.
6
Identifying Young Adults at High Risk for Weight Gain Using Machine Learning.利用机器学习识别易增重的青年成年人。
J Surg Res. 2023 Nov;291:7-16. doi: 10.1016/j.jss.2023.05.015. Epub 2023 Jun 15.
7
The prevalence of anxiety and its key influencing factors among the elderly in China.中国老年人焦虑症的患病率及其主要影响因素
Front Psychiatry. 2023 Feb 2;14:1038049. doi: 10.3389/fpsyt.2023.1038049. eCollection 2023.
8
Predicting COVID-19 community infection relative risk with a Dynamic Bayesian Network.利用动态贝叶斯网络预测 COVID-19 社区感染的相对风险。
Front Public Health. 2022 Oct 28;10:876691. doi: 10.3389/fpubh.2022.876691. eCollection 2022.
9
Association of body composition with pubertal timing in children and adolescents from Guangzhou, China.中国广州儿童和青少年的身体成分与青春期发育时间的关联。
Front Public Health. 2022 Aug 17;10:943886. doi: 10.3389/fpubh.2022.943886. eCollection 2022.
10
Machine-Learning for Prescription Patterns: Random Forest in the Prediction of Dose and Number of Antipsychotics Prescribed to People with Schizophrenia.用于处方模式的机器学习:随机森林在预测精神分裂症患者抗精神病药物剂量和处方数量中的应用
Clin Psychopharmacol Neurosci. 2022 Aug 31;20(3):450-461. doi: 10.9758/cpn.2022.20.3.450.