Abramowitz Sarah A, Boulier Kristin, Keat Karl, Cardone Katie M, Shivakumar Manu, DePaolo John, Judy Renae, Kim Dokyoon, Rader Daniel J, Ritchie Marylyn D, Voight Benjamin F, Pasaniuc Bogdan, Levin Michael G, Damrauer Scott M
medRxiv. 2024 Oct 17:2024.07.25.24310931. doi: 10.1101/2024.07.25.24310931.
Polygenic risk scores (PRSs) for coronary artery disease (CAD) are a growing clinical and commercial reality. Whether existing scores provide similar individual-level assessments of disease liability is a critical consideration for clinical implementation that remains uncharacterized.
Characterize the reliability of CAD PRSs that perform equivalently at the population level at predicting individual-level risk.
Cross-sectional Study.
All of Us Research Program (AOU), Penn Medicine Biobank (PMBB), and UCLA ATLAS Precision Health Biobank.
Volunteers of diverse genetic backgrounds enrolled in AOU, PMBB, and UCLA with available electronic health record and genotyping data.
Polygenic risk for CAD from previously published PRSs and new PRSs developed separately from the testing cohorts.
Sets of CAD PRSs that perform population prediction equivalently were identified by comparing calibration and discrimination (Brier score and AUROC) of generalized linear models of prevalent CAD using Bayesian analysis of variance. Among equivalently performing scores, individual-level agreement between risk estimates was tested with intraclass correlation (ICC) and Light's Kappa, measures of inter-rater reliability.
50 PRSs were calculated for 171,095 AOU participants. When included in a model of prevalent CAD, 48 scores had practically equivalent Brier scores and AUROCs (region of practical equivalence = 0.02). Across these scores, 84% of participants had at least one score in both the top and bottom risk quintile. Continuous agreement of individual risk predictions from the 48 scores was poor, with an ICC of 0.351 (95% CI; 0.349, 0.352). Agreement between two statistically equivalent scores was moderate, with an ICC of 0.649 (95% CI; 0.646, 0.652). Light's Kappa, used to evaluate consistency of assignment to high-risk thresholds, did not exceed 0.56 (interpreted as 'fair') across statistically and practically equivalent scores. Repeating the analysis among 41,193 PMBB and 50,748 UCLA participants yielded different sets of statistically and practically equivalent scores which also lacked strong individual agreement.
Across three diverse biobanks, CAD PRSs that performed equivalently at the population level produced unreliable individual risk estimates. Approaches to clinical implementation of CAD PRSs must consider the potential for discordant individual risk estimates from otherwise indistinguishable scores.
冠状动脉疾病(CAD)的多基因风险评分(PRSs)在临床和商业领域的应用日益广泛。现有评分是否能提供相似的个体疾病易感性评估,是临床应用中一个关键的考量因素,但目前仍未明确。
评估在人群水平上预测个体风险表现相当的CAD PRSs的可靠性。
横断面研究。
“我们所有人”研究计划(AOU)、宾夕法尼亚大学医学生物样本库(PMBB)和加州大学洛杉矶分校ATLAS精准健康生物样本库。
参加AOU、PMBB和加州大学洛杉矶分校的具有不同遗传背景的志愿者,他们有可用的电子健康记录和基因分型数据。
来自先前发表的PRSs以及从测试队列中单独开发的新PRSs的CAD多基因风险。
通过使用贝叶斯方差分析比较普遍CAD的广义线性模型的校准和辨别力(Brier评分和AUROC),确定在人群预测方面表现相当的CAD PRSs集合。在表现相当的评分中,使用组内相关系数(ICC)和莱特卡帕检验风险估计之间的个体水平一致性,这是评估评分者间可靠性的指标。
为171,095名AOU参与者计算了50个PRSs。当纳入普遍CAD模型时,48个评分的Brier评分和AUROCs实际上相当(实际等效区域 = 0.02)。在这些评分中,84%的参与者在风险最高和最低五分位数中至少有一个评分。48个评分的个体风险预测的连续一致性较差,ICC为0.351(95%CI;0.349,0.352)。两个统计上等效的评分之间的一致性中等,ICC为0.649(95%CI;0.646,0.652)。用于评估高危阈值分配一致性的莱特卡帕在统计和实际等效评分中均未超过0.56(解释为“一般”)。在41,193名PMBB参与者和50,748名加州大学洛杉矶分校参与者中重复该分析,得到了不同的统计和实际等效评分集,这些评分集也缺乏强烈的个体一致性。
在三个不同的生物样本库中,在人群水平上表现相当的CAD PRSs产生了不可靠的个体风险估计。CAD PRSs的临床应用方法必须考虑到来自其他难以区分的评分可能产生不一致的个体风险估计。