netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data.

Affiliation

Department of Biostatistics, Epidemiology & Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Publication information

Pac Symp Biocomput. 2022;27:325-336.

PMID: 34890160

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8682919/

Abstract

The polygenic risk score (PRS) can help identify an individual's genetic susceptibility to various diseases by combining the patient's genetic profile with single-nucleotide polymorphisms (SNPs) identified in genome-wide association studies. Although multiple diseases usually afflict patients at once or in succession, conventional PRSs fail to consider genetic relationships across multiple diseases. Even multi-trait PRSs, which take into account genetic effects for more than one disease at a time, either fail to consider enough phenotypes to accurately reflect the state of disease comorbidity in a patient or are biased in the traits that are selected. Thus, we developed novel network-based comorbidity risk scores to quantify associations among multiple phenotypes from phenome-wide association studies (PheWAS). We first constructed a disease-SNP heterogeneous multi-layered network (DS-Net), which consists of a disease network (disease-layer) and a SNP network (SNP-layer). The disease-layer describes the population-level interactome from PheWAS data. The SNP-layer was constructed according to linkage disequilibrium. The two layers were connected to transform the information from a population-level interactome into individual-level inferences. Then, graph-based semi-supervised learning was applied to predict possible comorbidity scores on the disease-layer for each subject. The SNP-layer receives the individual genotyping data as input to the scoring process, and the disease-layer holds the propagated output, namely the individual's comorbidity scores for multiple diseases. The possible comorbidity scores were combined by logistic regression, and the resulting score is denoted netCRS. The DS-Net was constructed from UK Biobank PheWAS data, and the individual genetic profiles were collected from the Penn Medicine Biobank. As a proof-of-concept study, myocardial infarction (MI) was selected to compare netCRS with the PRS with pruning and thresholding (PRS-PT). The combined model (netCRS + PRS-PT + covariates) achieved an AUC improvement of 6.26% compared to the (PRS-PT + covariates) model. In terms of risk stratification, the combined model was able to capture a risk of MI up to approximately eight-fold higher than that of the low-risk group. The netCRS and PRS-PT complement each other in predicting high-risk groups of patients with MI. We expect that using these risk prediction models will allow for the development of prevention strategies and a reduction of MI morbidity and mortality.
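The propagation step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes one common formulation of graph-based semi-supervised learning, in which the scores f minimize (f - y)^T (f - y) + mu * f^T L f with L = D - W the graph Laplacian, so that (I + mu * L) f = y. The function names, the toy weights, the cross-layer links, and the value of `mu` are all hypothetical and chosen only for illustration.

```python
def build_ds_net(disease_w, snp_w, disease_snp):
    """Stack the disease-layer, SNP-layer, and cross-layer links into one symmetric matrix."""
    n_d, n_s = len(disease_w), len(snp_w)
    n = n_d + n_s
    w = [[0.0] * n for _ in range(n)]
    for i in range(n_d):
        for j in range(n_d):
            w[i][j] = disease_w[i][j]
    for i in range(n_s):
        for j in range(n_s):
            w[n_d + i][n_d + j] = snp_w[i][j]
    for i in range(n_d):
        for j in range(n_s):
            w[i][n_d + j] = disease_snp[i][j]
            w[n_d + j][i] = disease_snp[i][j]
    return w


def propagate(weights, labels, mu=1.0, n_iter=200):
    """Solve (I + mu * L) f = y by Jacobi iteration, where L = D - W."""
    n = len(labels)
    degree = [sum(row) for row in weights]
    f = list(labels)
    for _ in range(n_iter):
        f = [
            (labels[i] + mu * sum(weights[i][j] * f[j] for j in range(n)))
            / (1.0 + mu * degree[i])
            for i in range(n)
        ]
    return f


def comorbidity_scores(disease_w, snp_w, disease_snp, genotype, mu=1.0):
    """SNP nodes receive the genotype as input; disease nodes start at 0 and are read out."""
    n_d = len(disease_w)
    w = build_ds_net(disease_w, snp_w, disease_snp)
    y = [0.0] * n_d + [float(g) for g in genotype]
    return propagate(w, y, mu)[:n_d]


# Toy network (hypothetical values): 3 diseases, 4 SNPs.
disease_w = [[0.0, 0.5, 0.0], [0.5, 0.0, 0.2], [0.0, 0.2, 0.0]]  # disease-layer
snp_w = [                                                        # SNP-layer (LD-like weights)
    [0.0, 0.8, 0.0, 0.0],
    [0.8, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.3],
    [0.0, 0.0, 0.3, 0.0],
]
disease_snp = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]]         # cross-layer links
genotype = [2, 1, 0, 0]                                          # one subject's allele dosages

scores = comorbidity_scores(disease_w, snp_w, disease_snp, genotype)
print(scores)
```

In this toy setup, the per-disease scores for each subject would then serve as the features that a logistic regression combines into a single risk score, which is the role the abstract assigns to netCRS.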

Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9143/8682919/f605bb78afc1/nihms-1760619-f0001.jpg
