SomaLogic Inc., Boulder, CO 80301, USA.
Johns Hopkins University, Baltimore, MD 21218, USA.
Sci Transl Med. 2022 Apr 6;14(639):eabj9625. doi: 10.1126/scitranslmed.abj9625.
A reliable, individualized, and dynamic surrogate of cardiovascular risk, synoptic for key biologic mechanisms, could shorten the path for drug development, enhance drug cost-effectiveness and improve patient outcomes. We used highly multiplexed proteomics to address these objectives, measuring about 5000 proteins in each of 32,130 archived plasma samples from 22,849 participants in nine clinical studies. We used machine learning to derive a 27-protein model predicting 4-year likelihood of myocardial infarction, stroke, heart failure, or death. The 27 proteins encompassed 10 biologic systems, and 12 were associated with relevant causal genetic traits. We independently validated results in 11,609 participants. Compared to a clinical model, the ratio of observed events in quintile 5 to quintile 1 was 6.7 for proteins versus 2.9 for the clinical model, AUCs (95% CI) were 0.73 (0.72 to 0.74) versus 0.64 (0.62 to 0.65), -statistics were 0.71 (0.69 to 0.72) versus 0.62 (0.60 to 0.63), and the net reclassification index was +0.43. Adding the clinical model to the proteins only improved discrimination metrics by 0.01 to 0.02. Event rates in four predefined protein risk categories were 5.6, 11.2, 20.0, and 43.4% within 4 years; median time to event was 1.71 years. Protein predictions were directionally concordant with changed outcomes. Adverse risks were predicted for aging, approaching an event, anthracycline chemotherapy, diabetes, smoking, rheumatoid arthritis, cancer history, cardiovascular disease, high systolic blood pressure, and lipids. Reduced risks were predicted for weight loss and exenatide. The 27-protein model has potential as a "universal" surrogate end point for cardiovascular risk.
一种可靠、个体化且动态的心血管风险替代指标,能够综合关键生物学机制,可能会缩短药物研发的路径,提高药物成本效益,并改善患者的预后。我们使用高度多重蛋白质组学来解决这些目标,在来自 9 项临床研究的 22849 名参与者的 32130 个存档血浆样本中的每一个中测量了约 5000 种蛋白质。我们使用机器学习得出了一个 27 种蛋白质模型,用于预测 4 年内心肌梗死、中风、心力衰竭或死亡的可能性。这 27 种蛋白质涵盖了 10 个生物学系统,其中 12 种与相关的因果遗传特征有关。我们在 11609 名参与者中独立验证了结果。与临床模型相比,蛋白质五分位数 5 与五分位数 1 之间观察到的事件比为 6.7,而临床模型为 2.9,AUC(95%CI)分别为 0.73(0.72 至 0.74)和 0.64(0.62 至 0.65),-统计量分别为 0.71(0.69 至 0.72)和 0.62(0.60 至 0.63),净重新分类指数为 0.43。将临床模型添加到蛋白质中仅将判别指标提高了 0.01 至 0.02。在 4 年内,四个预先定义的蛋白质风险类别中的事件发生率分别为 5.6%、11.2%、20.0%和 43.4%;中位事件时间为 1.71 年。蛋白质预测与改变的结果呈方向性一致。衰老、接近事件、蒽环类化疗、糖尿病、吸烟、类风湿关节炎、癌症病史、心血管疾病、收缩压升高和血脂异常预示着不良风险。体重减轻和 exenatide 预示着风险降低。该 27 种蛋白质模型有可能成为心血管风险的“通用”替代终点。