Venkatesh Rasika, Cherlin Tess, Ritchie Marylyn D, Guerraty Marie A, Verma Shefali S
Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, USA.
Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA.
medRxiv. 2025 Aug 21:2025.08.18.25333844. doi: 10.1101/2025.08.18.25333844.
Coronary microvascular disease (CMVD) is an underdiagnosed but significant contributor to the burden of ischemic heart disease, characterized by angina and myocardial infarction. The development of risk prediction models such as polygenic risk scores (PRS) for CMVD has been limited by a lack of large-scale genome-wide association studies (GWAS). However, there is significant overlap between CMVD and enrollment criteria for coronary artery disease (CAD) GWAS. In this study, we developed CMVD PRS models by selecting variants identified in a CMVD GWAS and applying weights from an external CAD GWAS, using CMVD-associated loci as proxies for the genetic risk. We integrated plasma proteomics, clinical measures from perfusion PET imaging, and PRS to evaluate their contributions to CMVD risk prediction in comprehensive machine and deep learning models. We then developed a novel unsupervised endotyping framework for CMVD from perfusion PET-derived myocardial blood flow data, revealing distinct patient subgroups beyond traditional case-control definitions. This imaging-based stratification substantially improved classification performance alongside plasma proteomics and PRS, achieving AUROCs between 0.65 and 0.73 per class, significantly outperforming binary classifiers and existing clinical models, highlighting the potential of this stratification approach to enable more precise and personalized diagnosis by capturing the underlying heterogeneity of CMVD. This work represents the first application of imaging-based endotyping and the integration of genetic and proteomic data for CMVD risk prediction, establishing a framework for multimodal modeling in complex diseases.
冠状动脉微血管疾病(CMVD)是缺血性心脏病负担的一个未被充分诊断但重要的因素,其特征为心绞痛和心肌梗死。诸如CMVD的多基因风险评分(PRS)等风险预测模型的发展一直受到缺乏大规模全基因组关联研究(GWAS)的限制。然而,CMVD与冠状动脉疾病(CAD)GWAS的纳入标准之间存在显著重叠。在本研究中,我们通过选择在CMVD的GWAS中鉴定出的变异,并应用来自外部CAD GWAS的权重,以CMVD相关位点作为遗传风险的代理,开发了CMVD PRS模型。我们整合了血浆蛋白质组学、灌注PET成像的临床测量指标和PRS,以评估它们在综合机器学习和深度学习模型中对CMVD风险预测的贡献。然后,我们从灌注PET衍生的心肌血流数据中为CMVD开发了一种新颖的无监督内型分型框架,揭示了超越传统病例对照定义的不同患者亚组。这种基于成像的分层与血浆蛋白质组学和PRS一起显著提高了分类性能,每类的受试者工作特征曲线下面积(AUROC)在0.65至0.73之间,明显优于二元分类器和现有的临床模型,突出了这种分层方法通过捕捉CMVD潜在异质性实现更精确和个性化诊断的潜力。这项工作代表了基于成像的内型分型以及遗传和蛋白质组学数据整合在CMVD风险预测中的首次应用,为复杂疾病的多模态建模建立了一个框架。