• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习和社会行政数据预测在一个存在健康不平等的国家中糖尿病并发症的风险:新西兰。

Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand.

机构信息

Department of Public Health, University of Otago Wellington, Wellington City, Wellington, 6021, New Zealand.

John Curtin School of Medical Research, Australian National University, Canberra City, ACT, 2601, Australia.

出版信息

BMC Med Inform Decis Mak. 2024 Sep 27;24(1):274. doi: 10.1186/s12911-024-02678-x.

DOI:10.1186/s12911-024-02678-x
PMID:39334279
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438423/
Abstract

BACKGROUND

In the age of big data, linked social and administrative health data in combination with machine learning (ML) is being increasingly used to improve prediction in chronic disease, e.g., cardiovascular diseases (CVD). In this study we aimed to apply ML methods on extensive national-level health and social administrative datasets to assess the utility of these for predicting future diabetes complications, including by ethnicity.

METHODS

Five ML models were used to predict CVD events among all people with known diabetes in the population of New Zealand, utilizing nationwide individual-level administrative data.

RESULTS

The Xgboost ML model had the best predictive power for predicting CVD events three years into the future among the population with diabetes (N = 145,600). The optimization procedure also found limited improvement in prediction by ethnicity (using area under the receiver operating curve, [AUC]). The results indicated no trade-off between model predictive performance and equity gap of prediction by ethnicity (that is improving model prediction and reducing performance gaps by ethnicity can be achieved simultaneously). The list of variables of importance was different among different models/ethnic groups, for example: age, deprivation (neighborhood-level), having had a hospitalization event, and the number of years living with diabetes.

DISCUSSION AND CONCLUSIONS

We provide further evidence that ML with administrative health data can be used for meaningful future prediction of health outcomes. As such, it could be utilized to inform health planning and healthcare resource allocation for diabetes management and the prevention of CVD events. Our results may suggest limited scope for developing prediction models by ethnic group and that the major ways to reduce inequitable health outcomes is probably via improved delivery of prevention and management to those groups with diabetes at highest need.

摘要

背景

在大数据时代,结合机器学习(ML)的关联社会和行政健康数据正被越来越多地用于改善慢性病(如心血管疾病(CVD))的预测。在这项研究中,我们旨在应用 ML 方法对广泛的国家级健康和社会行政数据集进行评估,以评估这些数据对预测未来糖尿病并发症(包括按族裔)的效用。

方法

使用五种 ML 模型来预测新西兰人群中所有已知糖尿病患者的 CVD 事件,利用全国范围内的个人级行政数据。

结果

Xgboost ML 模型在预测未来三年糖尿病患者(N=145600)发生 CVD 事件方面具有最佳的预测能力。优化过程也发现,通过族裔来改善预测的能力有限(使用接收者操作曲线下面积[AUC])。结果表明,预测模型的预测性能和族裔预测的公平差距之间没有权衡(即可以同时提高模型预测和减少族裔预测的差距)。不同模型/族裔群体之间的重要变量列表不同,例如年龄、贫困(社区水平)、住院事件和患糖尿病的年数。

讨论和结论

我们提供了进一步的证据表明,使用行政健康数据的 ML 可用于有意义的未来健康结果预测。因此,它可以用于为糖尿病管理和预防 CVD 事件的健康规划和医疗资源分配提供信息。我们的结果可能表明,通过族裔开发预测模型的范围有限,减少不平等健康结果的主要方法可能是通过改善对最高需求糖尿病患者的预防和管理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/ba9187d0a8be/12911_2024_2678_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/bd437c716cbf/12911_2024_2678_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/e042a2beea62/12911_2024_2678_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/dd55001991f8/12911_2024_2678_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/3700753543e0/12911_2024_2678_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/ba9187d0a8be/12911_2024_2678_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/bd437c716cbf/12911_2024_2678_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/e042a2beea62/12911_2024_2678_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/dd55001991f8/12911_2024_2678_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/3700753543e0/12911_2024_2678_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7688/11438423/ba9187d0a8be/12911_2024_2678_Fig5_HTML.jpg

相似文献

1
Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand.利用机器学习和社会行政数据预测在一个存在健康不平等的国家中糖尿病并发症的风险:新西兰。
BMC Med Inform Decis Mak. 2024 Sep 27;24(1):274. doi: 10.1186/s12911-024-02678-x.
2
Development and validation of alternative cardiovascular risk prediction equations for population health planning: a routine health data linkage study of 1.7 million New Zealanders.开发和验证用于人群健康规划的替代心血管风险预测方程:对 170 万新西兰人的常规健康数据链接研究。
Int J Epidemiol. 2018 Oct 1;47(5):1571-1584. doi: 10.1093/ije/dyy137.
3
Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets.利用机器学习和全国性关联社会管理数据集预测心血管疾病患者中的高医疗费用使用者。
Health Econ Rev. 2023 Feb 4;13(1):9. doi: 10.1186/s13561-023-00422-1.
4
Development and Validation of a Machine Learning Model Using Administrative Health Data to Predict Onset of Type 2 Diabetes.利用行政健康数据开发和验证预测 2 型糖尿病发病的机器学习模型。
JAMA Netw Open. 2021 May 3;4(5):e2111315. doi: 10.1001/jamanetworkopen.2021.11315.
5
Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants.使用自动化机器学习进行心血管疾病风险预测:对 423604 名英国生物库参与者的前瞻性研究。
PLoS One. 2019 May 15;14(5):e0213653. doi: 10.1371/journal.pone.0213653. eCollection 2019.
6
Machine learning approaches improve risk stratification for secondary cardiovascular disease prevention in multiethnic patients.机器学习方法可提高多民族患者二级心血管疾病预防的风险分层。
Open Heart. 2021 Oct;8(2). doi: 10.1136/openhrt-2021-001802.
7
Development of a prediction model of postpartum hospital use using an equity-focused approach.采用公平为重点的方法开发产后住院使用预测模型。
Am J Obstet Gynecol. 2024 Jun;230(6):671.e1-671.e10. doi: 10.1016/j.ajog.2023.10.033. Epub 2023 Oct 23.
8
Development and validation of cardiovascular risk prediction equations in 76 000 people with known cardiovascular disease.在 76000 名已知心血管疾病患者中开发和验证心血管风险预测方程。
Eur J Prev Cardiol. 2024 Jan 25;31(2):218-227. doi: 10.1093/eurjpc/zwad314.
9
Machine learning model for cardiovascular disease prediction in patients with chronic kidney disease.机器学习模型预测慢性肾脏病患者心血管疾病
Front Endocrinol (Lausanne). 2024 May 28;15:1390729. doi: 10.3389/fendo.2024.1390729. eCollection 2024.
10
A data-driven approach to predicting diabetes and cardiovascular disease with machine learning.基于机器学习的数据驱动方法预测糖尿病和心血管疾病。
BMC Med Inform Decis Mak. 2019 Nov 6;19(1):211. doi: 10.1186/s12911-019-0918-5.

引用本文的文献

1
Optimized prediction of diabetes complications using ensemble learning with Bayesian optimization: a cost-efficient laboratory-based approach.使用贝叶斯优化的集成学习优化糖尿病并发症预测:一种基于实验室的经济高效方法。
Front Endocrinol (Lausanne). 2025 Jun 20;16:1593068. doi: 10.3389/fendo.2025.1593068. eCollection 2025.

本文引用的文献

1
Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets.利用机器学习和全国性关联社会管理数据集预测心血管疾病患者中的高医疗费用使用者。
Health Econ Rev. 2023 Feb 4;13(1):9. doi: 10.1186/s13561-023-00422-1.
2
Using household economic survey data to assess food expenditure patterns and trends in a high-income country with notable health inequities.利用家庭经济调查数据评估一个高收入国家的食物支出模式和趋势,该国有显著的健康不平等现象。
Sci Rep. 2022 Dec 15;12(1):21703. doi: 10.1038/s41598-022-26301-z.
3
Health promoting and demoting consumption: What accounts for budget share differentials by ethnicity in New Zealand.
促进健康与有损健康的消费:新西兰不同种族预算份额差异的原因
SSM Popul Health. 2022 Aug 15;19:101204. doi: 10.1016/j.ssmph.2022.101204. eCollection 2022 Sep.
4
Relative contribution of trends in myocardial infarction event rates and case fatality to declines in mortality: an international comparative study of 1·95 million events in 80·4 million people in four countries.趋势在心肌梗死发生率和病死率对死亡率下降的相对贡献:四个国家 8040 万人中 195 万例事件的国际比较研究。
Lancet Public Health. 2022 Mar;7(3):e229-e239. doi: 10.1016/S2468-2667(22)00006-8.
5
Improving the Performance of Risk Adjustment Systems: Constrained Regressions, Reinsurance, and Variable Selection.提升风险调整系统的性能:约束回归、再保险与变量选择
Am J Health Econ. 2021 Fall;7(4):497-521. doi: 10.1086/716199. Epub 2021 Oct 4.
6
Cardiovascular risk prediction in type 2 diabetes before and after widespread screening: a derivation and validation study.2 型糖尿病广泛筛查前后的心血管风险预测:一项推导和验证研究。
Lancet. 2021 Jun 12;397(10291):2264-2274. doi: 10.1016/S0140-6736(21)00572-9. Epub 2021 Jun 2.
7
Potential impact of COVID-19 related unemployment on increased cardiovascular disease in a high-income country: Modeling health loss, cost and equity.新冠疫情相关失业对高收入国家心血管疾病增加的潜在影响:建模健康损失、成本和公平性。
PLoS One. 2021 May 27;16(5):e0246053. doi: 10.1371/journal.pone.0246053. eCollection 2021.
8
Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data.利用行政健康数据通过机器学习预测糖尿病并发症导致的不良后果。
NPJ Digit Med. 2021 Feb 12;4(1):24. doi: 10.1038/s41746-021-00394-8.
9
Machine learning-based prediction of adverse events following an acute coronary syndrome (PRAISE): a modelling study of pooled datasets.基于机器学习的急性冠状动脉综合征后不良事件预测(PRAISE):汇总数据集的建模研究。
Lancet. 2021 Jan 16;397(10270):199-207. doi: 10.1016/S0140-6736(20)32519-8.
10
Ethnic differences in mortality and hospital admission rates between Māori, Pacific, and European New Zealanders with type 2 diabetes between 1994 and 2018: a retrospective, population-based, longitudinal cohort study.1994 年至 2018 年间,2 型糖尿病毛利人、太平洋岛民和欧洲裔新西兰人之间的死亡率和住院率的种族差异:一项回顾性、基于人群、纵向队列研究。
Lancet Glob Health. 2021 Feb;9(2):e209-e217. doi: 10.1016/S2214-109X(20)30412-5. Epub 2020 Oct 15.