针对当地人群定制风险预测模型。

Tailoring Risk Prediction Models to Local Populations.

机构信息

Brigham & Women's Hospital, Boston, Massachusetts.

ETH Zurich, Zurich, Switzerland.

出版信息

JAMA Cardiol. 2024 Nov 1;9(11):1018-1028. doi: 10.1001/jamacardio.2024.2912.

DOI:10.1001/jamacardio.2024.2912

PMID:39292486

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11411452/

Abstract

IMPORTANCE

Risk estimation is an integral part of cardiovascular care. Local recalibration of guideline-recommended models could address the limitations of existing tools.

OBJECTIVE

To provide a machine learning (ML) approach to augment the performance of the American Heart Association's Predicting Risk of Cardiovascular Disease Events (AHA-PREVENT) equations when applied to a local population while preserving clinical interpretability.

DESIGN, SETTING, AND PARTICIPANTS: This cohort study used a New England-based electronic health record cohort of patients without prior atherosclerotic cardiovascular disease (ASCVD) who had the data necessary to calculate the AHA-PREVENT 10-year risk of developing ASCVD in the event period (2007-2016). Patients with prior ASCVD events, death prior to 2007, or age 79 years or older in 2007 were subsequently excluded. The final study population of 95 326 patients was split into 3 nonoverlapping subsets for training, testing, and validation. The AHA-PREVENT model was adapted to this local population using the open-source ML model (MLM) Extreme Gradient Boosting model (XGBoost) with minimal predictor variables, including age, sex, and AHA-PREVENT. The MLM was monotonically constrained to preserve known associations between risk factors and ASCVD risk. Along with sex, race and ethnicity data from the electronic health record were collected to validate the performance of ASCVD risk prediction in subgroups. Data were analyzed from August 2021 to February 2024.

MAIN OUTCOMES AND MEASURES

Consistent with the AHA-PREVENT model, ASCVD events were defined as the first occurrence of either nonfatal myocardial infarction, coronary artery disease, ischemic stroke, or cardiovascular death. Cardiovascular death was coded via government registries. Discrimination, calibration, and risk reclassification were assessed using the Harrell C index, a modified Hosmer-Lemeshow goodness-of-fit test and calibration curves, and reclassification tables, respectively.

RESULTS

In the test set of 38 137 patients (mean [SD] age, 64.8 [6.9] years, 22 708 [59.5]% women and 15 429 [40.5%] men; 935 [2.5%] Asian, 2153 [5.6%] Black, 1414 [3.7%] Hispanic, 31 400 [82.3%] White, and 2235 [5.9%] other, including American Indian, multiple races, unspecified, and unrecorded, consolidated owing to small numbers), MLM-PREVENT had improved calibration (modified Hosmer-Lemeshow P > .05) compared to the AHA-PREVENT model across risk categories in the overall cohort (χ23 = 2.2; P = .53 vs χ23 > 16.3; P < .001) and sex subgroups (men: χ23 = 2.1; P = .55 vs χ23 > 16.3; P < .001; women: χ23 = 6.5; P = .09 vs. χ23 > 16.3; P < .001), while also surpassing a traditional recalibration approach. MLM-PREVENT maintained or improved AHA-PREVENT's calibration in Asian, Black, and White individuals. Both MLM-PREVENT and AHA-PREVENT performed equally well in discriminating risk (approximate ΔC index, ±0.01). Using a clinically significant 7.5% risk threshold, MLM-PREVENT reclassified a total of 11.5% of patients. We visualize the recalibration through MLM-PREVENT ASCVD risk charts that highlight preserved risk associations of the original AHA-PREVENT model.

CONCLUSIONS AND RELEVANCE

The interpretable ML approach presented in this article enhanced the accuracy of the AHA-PREVENT model when applied to a local population while still preserving the risk associations found by the original model. This method has the potential to recalibrate other established risk tools and is implementable in electronic health record systems for improved cardiovascular risk assessment.

摘要

重要性

风险评估是心血管护理的一个组成部分。对指南推荐模型进行局部重新校准可以解决现有工具的局限性。

目的

提供一种机器学习 (ML) 方法，在保留临床可解释性的同时，增强美国心脏协会预测心血管疾病事件风险 (AHA-PREVENT) 方程在本地人群中的性能。

设计、地点和参与者：本队列研究使用了基于新英格兰的电子健康记录队列，该队列的患者没有先前的动脉粥样硬化性心血管疾病 (ASCVD)，并且在事件期间（2007-2016 年）有必要计算 AHA-PREVENT 的 10 年 ASCVD 发病风险。排除了有 ASCVD 事件、2007 年之前死亡或 2007 年时年龄在 79 岁或以上的患者。最终的研究人群为 95326 名患者，分为 3 个不重叠的子集进行训练、测试和验证。使用开源 ML 模型（极端梯度提升模型 [XGBoost]）对 AHA-PREVENT 模型进行了调整，使用的最小预测变量包括年龄、性别和 AHA-PREVENT。MLM 受到单调约束，以保留风险因素与 ASCVD 风险之间的已知关联。同时，从电子健康记录中收集了性别、种族和民族数据，以验证亚组中 ASCVD 风险预测的性能。数据分析时间为 2021 年 8 月至 2024 年 2 月。

主要结果和措施

与 AHA-PREVENT 模型一致，ASCVD 事件定义为首次发生非致命性心肌梗死、冠状动脉疾病、缺血性卒中和心血管死亡之一。通过政府登记处对心血管死亡进行编码。使用 Harrell C 指数、改良 Hosmer-Lemeshow 拟合优度检验和校准曲线以及重新分类表分别评估区分度、校准和风险再分类。

结果

在 38137 名患者的测试集中（平均[标准差]年龄为 64.8[6.9]岁，22708[59.5%]为女性和 15429[40.5%]为男性；935[2.5%]为亚裔，2153[5.6%]为黑人，1414[3.7%]为西班牙裔，31400[82.3%]为白人，2235[5.9%]为其他种族，包括美洲印第安人、多种族、未指定和未记录，由于数量较少而合并），与 AHA-PREVENT 模型相比，MLM-PREVENT 在整个队列（χ23=2.2；P>.05）和性别亚组（男性：χ23=2.1；P=.55 与 χ23>16.3；P<.001；女性：χ23=6.5；P=.09 与 χ23>16.3；P<.001）中，在所有风险类别中均具有改善的校准（改良 Hosmer-Lemeshow P>.05），并且在亚裔、黑人和白人个体中也保持或提高了 AHA-PREVENT 的校准。MLM-PREVENT 和 AHA-PREVENT 在区分风险方面表现相当（近似ΔC 指数，±0.01）。使用具有临床意义的 7.5%风险阈值，MLM-PREVENT 重新分类了总共 11.5%的患者。我们通过 MLM-PREVENT ASCVD 风险图表可视化重新校准，该图表突出显示了原始 AHA-PREVENT 模型的风险关联。

结论和相关性

本文提出的可解释 ML 方法在应用于本地人群时增强了 AHA-PREVENT 模型的准确性，同时仍保留了原始模型发现的风险关联。这种方法有可能重新校准其他已建立的风险工具，并可在电子健康记录系统中实施，以改善心血管风险评估。

相似文献

Tailoring Risk Prediction Models to Local Populations.针对当地人群定制风险预测模型。

JAMA Cardiol. 2024 Nov 1;9(11):1018-1028. doi: 10.1001/jamacardio.2024.2912.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

Cardiovascular Disease Risk Estimates in the US CKD Population Using the PREVENT Equation.使用PREVENT方程对美国慢性肾脏病患者群体进行心血管疾病风险评估。

Am J Kidney Dis. 2025 Mar 5. doi: 10.1053/j.ajkd.2025.01.012.

Performance of the pooled cohort equations and D:A:D risk scores among individuals with HIV in a global cardiovascular disease prevention trial: a cohort study leveraging data from REPRIEVE.在一项全球心血管疾病预防试验中，对感染艾滋病毒个体应用汇总队列方程和D:A:D风险评分的情况：一项利用REPRIEVE研究数据的队列研究

Lancet HIV. 2025 Feb;12(2):e118-e129. doi: 10.1016/S2352-3018(24)00276-5. Epub 2025 Jan 17.

Risk Prediction for Atherosclerotic Cardiovascular Disease With and Without Race Stratification.有无种族分层情况下动脉粥样硬化性心血管疾病的风险预测

JAMA Cardiol. 2024 Jan 1;9(1):55-62. doi: 10.1001/jamacardio.2023.4520.

Performance of the American Heart Association's PREVENT Equations Among Disaggregated Racial and Ethnic Subgroups.美国心脏协会预防方程在不同种族和族裔亚组中的表现。

JAMA Cardiol. 2025 Jun 25. doi: 10.1001/jamacardio.2025.1865.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

AHA PREVENT Equations and Lipoprotein(a) for Cardiovascular Disease Risk : Insights From MESA and the UK Biobank.美国心脏协会预防方程与脂蛋白（a）对心血管疾病风险的评估：来自多族裔动脉粥样硬化研究（MESA）和英国生物银行的见解

JAMA Cardiol. 2025 Jun 4. doi: 10.1001/jamacardio.2025.1603.

Machine Learning-Based Analysis of Lifestyle Risk Factors for Atherosclerotic Cardiovascular Disease: Retrospective Case-Control Study.基于机器学习的动脉粥样硬化性心血管疾病生活方式风险因素分析：回顾性病例对照研究

JMIR Med Inform. 2025 Aug 7;13:e74415. doi: 10.2196/74415.

引用本文的文献

Incorporating the STOP-BANG questionnaire improves prediction of cardiovascular events during hospitalization after myocardial infarction.纳入 STOP-BANG 问卷可改善心肌梗死后住院期间心血管事件的预测。

Sci Rep. 2025 May 31;15(1):19180. doi: 10.1038/s41598-025-03882-z.

Improved prediction and risk stratification of major adverse cardiovascular events using an explainable machine learning approach combining plasma biomarkers and traditional risk factors.使用结合血浆生物标志物和传统危险因素的可解释机器学习方法改善主要不良心血管事件的预测和风险分层。

Cardiovasc Diabetol. 2025 Apr 2;24(1):153. doi: 10.1186/s12933-025-02711-x.

Combating cardiovascular disease disparities: The potential role of artificial intelligence.对抗心血管疾病差异：人工智能的潜在作用。

Am J Prev Cardiol. 2025 Mar 9;22:100954. doi: 10.1016/j.ajpc.2025.100954. eCollection 2025 Jun.

Atherosclerosis and the Bidirectional Relationship Between Cancer and Cardiovascular Disease: From Bench to Bedside, Part 2 Management.动脉粥样硬化与癌症和心血管疾病的双向关系：从实验台到病床边，第2部分管理

Int J Mol Sci. 2025 Jan 2;26(1):334. doi: 10.3390/ijms26010334.

本文引用的文献

Adapting cardiovascular risk prediction models to different populations: the need for recalibration.使心血管疾病风险预测模型适用于不同人群：重新校准的必要性。

Eur Heart J. 2024 Jan 7;45(2):129-131. doi: 10.1093/eurheartj/ehad748.

Development and Validation of the American Heart Association's PREVENT Equations.美国心脏协会 PREVENT 方程的制定与验证。

Circulation. 2024 Feb 6;149(6):430-449. doi: 10.1161/CIRCULATIONAHA.123.067626. Epub 2023 Nov 10.

Factors Associated With Variability in the Performance of a Proprietary Sepsis Prediction Model Across 9 Networked Hospitals in the US.美国9家联网医院中，与一种专利脓毒症预测模型性能变异性相关的因素

JAMA Intern Med. 2023 Jun 1;183(6):611-612. doi: 10.1001/jamainternmed.2022.7182.

Generalizability challenges of mortality risk prediction models: A retrospective analysis on a multi-center database.死亡率风险预测模型的可推广性挑战：对多中心数据库的回顾性分析

PLOS Digit Health. 2022 Apr 5;1(4):e0000023. doi: 10.1371/journal.pdig.0000023. eCollection 2022 Apr.

Explainable machine learning framework for predicting long-term cardiovascular disease risk among adolescents.用于预测青少年长期心血管疾病风险的可解释机器学习框架。

Sci Rep. 2022 Dec 19;12(1):21905. doi: 10.1038/s41598-022-25933-5.

XGBoost, A Novel Explainable AI Technique, in the Prediction of Myocardial Infarction: A UK Biobank Cohort Study.XGBoost，一种新型可解释人工智能技术，用于心肌梗死预测：一项英国生物银行队列研究。

Clin Med Insights Cardiol. 2022 Nov 8;16:11795468221133611. doi: 10.1177/11795468221133611. eCollection 2022.

Time to Revisit Using 10-Year Risk to Guide Statin Therapy.是时候重新审视使用10年风险来指导他汀类药物治疗了。

JAMA Cardiol. 2022 Aug 1;7(8):785-786. doi: 10.1001/jamacardio.2022.1883.

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型，转而使用可解释模型。

Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.

Temporal shift and predictive performance of machine learning for heart transplant outcomes.机器学习在心脏移植结果中的时间迁移和预测性能。

J Heart Lung Transplant. 2022 Jul;41(7):928-936. doi: 10.1016/j.healun.2022.03.019. Epub 2022 Mar 31.

Observability and its impact on differential bias for clinical prediction models.可观测性及其对临床预测模型差异偏倚的影响。

J Am Med Inform Assoc. 2022 Apr 13;29(5):937-943. doi: 10.1093/jamia/ocac019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验