• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习从美国电子病历中识别有体重增加风险的个体。

Identifying individuals at risk for weight gain using machine learning in electronic medical records from the United States.

作者信息

Choong Casey, Xavier Neena, Falcon Beverly, Kan Hong, Lipkovich Ilya, Nowak Callie, Hoyt Margaret, Houle Christy, Kahan Scott

机构信息

Eli Lilly and Company, Indianapolis, Indiana, USA.

National Center for Weight and Wellness, George Washington University School of Medicine, Washington, Washington DC, USA.

出版信息

Diabetes Obes Metab. 2025 Jun;27(6):3061-3071. doi: 10.1111/dom.16311. Epub 2025 Mar 11.

DOI:10.1111/dom.16311
PMID:40069847
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12046438/
Abstract

AIMS

Numerous risk factors for the development of obesity have been identified, yet the aetiology is not well understood. Traditional statistical methods for analysing observational data are limited by the volume and characteristics of large datasets. Machine learning (ML) methods can analyse large datasets to extract novel insights on risk factors for obesity. This study predicted adults at risk of a ≥10% increase in index body mass index (BMI) within 12 months using ML and a large electronic medical records (EMR) database.

MATERIALS AND METHODS

ML algorithms were used with EMR from Optum's de-identified Market Clarity Data, a US database. Models included extreme gradient boosting (XGBoost), random forest, simple logistic regression (no feature selection procedure) and two penalised logistic models (Elastic Net and Least Absolute Shrinkage and Selection Operator [LASSO]). Performance metrics included the area under the curve (AUC) of the receiver operating characteristic curve (used to determine the best-performing model), average precision, Brier score, accuracy, recall, positive predictive value, Youden index, F1 score, negative predictive value and specificity.

RESULTS

The XGBoost model performed best 12 months post-index, with an AUC of 0.75. Lower baseline BMI, having any emergency room visit during the study period, no diabetes mellitus, no lipid disorders and younger age were among the top predictors for ≥10% increase in index BMI.

CONCLUSION

The current study demonstrates an ML approach applied to EMR to identify those at risk for weight gain over 12 months. Providers may use this risk stratification to prioritise prevention strategies or earlier obesity intervention.

摘要

目的

已确定了许多导致肥胖的风险因素,但其病因尚未完全明确。用于分析观察性数据的传统统计方法受到大型数据集的数量和特征的限制。机器学习(ML)方法可以分析大型数据集,以提取有关肥胖风险因素的新见解。本研究使用机器学习和一个大型电子病历(EMR)数据库预测在12个月内指数体重指数(BMI)增加≥10%的成年风险人群。

材料与方法

将机器学习算法与来自美国Optum公司匿名化的市场透明度数据中的电子病历一起使用。模型包括极端梯度提升(XGBoost)、随机森林、简单逻辑回归(无特征选择程序)和两种惩罚逻辑模型(弹性网络和最小绝对收缩和选择算子 [LASSO])。性能指标包括受试者工作特征曲线的曲线下面积(AUC)(用于确定性能最佳的模型)、平均精度、布里尔评分、准确性、召回率、阳性预测值、约登指数、F1评分、阴性预测值和特异性。

结果

XGBoost模型在指数后12个月表现最佳,AUC为0.75。较低的基线BMI、在研究期间有过任何急诊就诊、无糖尿病、无血脂异常以及较年轻是指数BMI增加≥10%的主要预测因素。

结论

本研究展示了一种应用于电子病历的机器学习方法,以识别在12个月内有体重增加风险的人群。医疗服务提供者可以使用这种风险分层来确定预防策略或早期肥胖干预的优先级。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/e97b2d09bf52/DOM-27-3061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/42e9a7f0a5b2/DOM-27-3061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/aa2d4ca5e1af/DOM-27-3061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/e97b2d09bf52/DOM-27-3061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/42e9a7f0a5b2/DOM-27-3061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/aa2d4ca5e1af/DOM-27-3061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df07/12046438/e97b2d09bf52/DOM-27-3061-g003.jpg

相似文献

1
Identifying individuals at risk for weight gain using machine learning in electronic medical records from the United States.利用机器学习从美国电子病历中识别有体重增加风险的个体。
Diabetes Obes Metab. 2025 Jun;27(6):3061-3071. doi: 10.1111/dom.16311. Epub 2025 Mar 11.
2
Applying machine learning approaches for predicting obesity risk using US health administrative claims database.应用机器学习方法,利用美国健康管理数据库预测肥胖风险。
BMJ Open Diabetes Res Care. 2024 Sep 26;12(5):e004193. doi: 10.1136/bmjdrc-2024-004193.
3
Using Machine Learning to Predict Weight Gain in Adults: an Observational Analysis From the All of Us Research Program.使用机器学习预测成年人的体重增加:来自“我们所有人”研究项目的观察性分析。
J Surg Res. 2025 Feb;306:43-53. doi: 10.1016/j.jss.2024.11.042. Epub 2024 Dec 31.
4
Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study.基于电子健康记录的社区获得性急性肾损伤住院风险预测的机器学习模型:开发和验证研究。
J Med Internet Res. 2020 Aug 4;22(8):e16903. doi: 10.2196/16903.
5
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.
6
Machine learning algorithms for diabetic kidney disease risk predictive model of Chinese patients with type 2 diabetes mellitus.用于中国2型糖尿病患者糖尿病肾病风险预测模型的机器学习算法
Ren Fail. 2025 Dec;47(1):2486558. doi: 10.1080/0886022X.2025.2486558. Epub 2025 Apr 7.
7
A machine learning-based algorithm to identify U-500R insulin candidates among adults with type 2 diabetes mellitus in US retrospective databases.基于机器学习的算法,用于从美国回顾性数据库中识别患有 2 型糖尿病的成年人中的 U-500R 胰岛素候选药物。
Curr Med Res Opin. 2024 Mar;40(3):367-375. doi: 10.1080/03007995.2023.2293116. Epub 2024 Jan 23.
8
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
9
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
10
Learning from the machine: is diabetes in adults predicted by lifestyle variables? A retrospective predictive modelling study of NHANES 2007-2018.向机器学习:成人糖尿病能否由生活方式变量预测?一项对2007 - 2018年美国国家健康与营养检查调查(NHANES)的回顾性预测建模研究。
BMJ Open. 2025 Mar 22;15(3):e096595. doi: 10.1136/bmjopen-2024-096595.

引用本文的文献

1
Supervised machine learning algorithms for the classification of obesity levels using anthropometric indices derived from bioelectrical impedance analysis.使用源自生物电阻抗分析的人体测量指标对肥胖水平进行分类的监督式机器学习算法。
Sci Rep. 2025 Aug 21;15(1):30681. doi: 10.1038/s41598-025-15264-6.