Gibson Kyle, Forrest Iain S, Petrazzini Ben O, Duffy Áine, Park Joshua K, Malick Waqas, Rosenson Robert S, Rocheleau Ghislain, Jordan Daniel M, Do Ron
The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Atherosclerosis. 2025 Feb;401:119103. doi: 10.1016/j.atherosclerosis.2024.119103. Epub 2024 Dec 18.
An in silico quantitative score of coronary artery disease (ISCAD), built using machine learning and clinical data from electronic health records, has been shown to result in gradations of risk of subclinical atherosclerosis, coronary artery disease (CAD) sequelae, and mortality. Large-scale metabolite biomarker profiling provides increased portability and objectivity in machine learning for disease prediction and gradation. However, these models have not been fully leveraged. We evaluated a quantitative score of CAD derived from probabilities of a machine learning model trained on metabolomic data.
We developed a CAD-predictive learning model using metabolic data from 93,642 individuals from the UK Biobank (median [IQR] age, 57 [14] years; 39,796 [42 %] male; 5640 [6 %] with diagnosed CAD), and assessed its probabilities as a quantitative metabolic risk score for CAD (M-CAD; range 0 [lowest probability] to 1 [highest probability]) in participants of the UK Biobank. The relationship of M-CAD with arterial stiffness index, ejection fraction, CAD sequelae, and mortality was assessed.
The model predicted CAD with an area under the receiver-operating-characteristic curve of 0.712. Arterial Stiffness Index increased by 0.19 and ejection fraction decreased by 0.2 % per 0.1 increase in M-CAD. Both incident and recurrent myocardial infarction increased stepwise over M-CAD quartiles (odds ratio (OR) 15.3 [4.2 %] and 12.5 [0.2 %]) in top quartiles as compared to the first quartile of incident and recurrent MI respectively). Likewise, the hazard ratio and prevalence of all-cause mortality, CVD-associated mortality, and CAD-associated mortality increased stepwise over M-CAD deciles (2.98 [14 %], 9.34 [4.3 %], 26.7 [2.7 %] in the top deciles as compared to the first decile of all-cause, CVD, and CAD mortality respectively).
Metabolic-based machine learning can be used to build a quantitative risk score for CAD that is associated with atherosclerotic burden, CAD sequelae and mortality.
利用机器学习和电子健康记录中的临床数据构建的冠状动脉疾病计算机定量评分(ISCAD),已被证明可得出亚临床动脉粥样硬化、冠状动脉疾病(CAD)后遗症及死亡率的风险分级。大规模代谢物生物标志物分析在用于疾病预测和分级的机器学习中提高了便携性和客观性。然而,这些模型尚未得到充分利用。我们评估了一种基于代谢组学数据训练的机器学习模型概率得出的CAD定量评分。
我们使用来自英国生物银行93642名个体(年龄中位数[四分位间距]为57[14]岁;39796名[42%]为男性;5640名[6%]患有确诊CAD)的代谢数据开发了一种CAD预测学习模型,并在英国生物银行参与者中将其概率评估为CAD的定量代谢风险评分(M-CAD;范围为0[最低概率]至1[最高概率])。评估了M-CAD与动脉僵硬度指数、射血分数、CAD后遗症及死亡率之间的关系。
该模型预测CAD的受试者工作特征曲线下面积为0.712。M-CAD每增加0.1,动脉僵硬度指数增加0.19,射血分数降低0.2%。与首次发生和复发性心肌梗死的第一四分位数相比,首次发生和复发性心肌梗死在M-CAD四分位数中均呈逐步增加(优势比[OR]分别为15.3[4.2%]和12.5[0.2%])。同样,全因死亡率、心血管疾病相关死亡率和CAD相关死亡率的风险比及患病率在M-CAD十分位数中呈逐步增加(与全因、心血管疾病和CAD死亡率的第一十分位数相比,最高十分位数分别为2.98[14%]、9.34[4.3%]、26.7[2.7%])。
基于代谢的机器学习可用于构建与动脉粥样硬化负担、CAD后遗症及死亡率相关的CAD定量风险评分。