• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合电子健康记录和全基因组关联研究汇总统计数据以预测自身免疫性疾病从临床前阶段的进展。

Integrating electronic health records and GWAS summary statistics to predict the progression of autoimmune diseases from preclinical stages.

作者信息

Wang Chen, Markus Havell, Diwadkar Avantika R, Khunsriraksakul Chachrit, Carrel Laura, Li Bingshan, Zhong Xue, Wang Xingyan, Zhan Xiaowei, Foulke Galen T, Olsen Nancy J, Liu Dajiang J, Jiang Bibo

机构信息

Bioinformatics and Genomics Graduate Program, College of Medicine, Penn State University, Hershey, PA, USA.

Department of Public Health Sciences, College of Medicine, Penn State University, Hershey, PA, USA.

出版信息

Nat Commun. 2025 Jan 2;16(1):180. doi: 10.1038/s41467-024-55636-6.

DOI:10.1038/s41467-024-55636-6
PMID:39747168
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11695684/
Abstract

Autoimmune diseases often exhibit a preclinical stage before diagnosis. Electronic health record (EHR) based-biobanks contain genetic data and diagnostic information, which can identify preclinical individuals at risk for progression. Biobanks typically have small numbers of cases, which are not sufficient to construct accurate polygenic risk scores (PRS). Importantly, progression and case-control phenotypes may have shared genetic basis, which we can exploit to improve prediction accuracy. We propose a novel method Genetic Progression Score (GPS) that integrates biobank and case-control study to predict the disease progression risk. Via penalized regression, GPS incorporates PRS weights for case-control studies as prior and forces model parameters to be similar to the prior if the prior improves prediction accuracy. In simulations, GPS consistently yields better prediction accuracy than alternative strategies relying on biobank or case-control samples only and those combining biobank and case-control samples. The improvement is particularly evident when biobank sample is smaller or the genetic correlation is lower. We derive PRS for the progression from preclinical rheumatoid arthritis and systemic lupus erythematosus in the BioVU biobank and validate them in All of Us. For both diseases, GPS achieves the highest prediction and the resulting PRS yields the strongest correlation with progression prevalence.

摘要

自身免疫性疾病在诊断前通常会经历一个临床前期阶段。基于电子健康记录(EHR)的生物样本库包含遗传数据和诊断信息,可识别有病情进展风险的临床前期个体。生物样本库中的病例数量通常较少,不足以构建准确的多基因风险评分(PRS)。重要的是,疾病进展和病例对照表型可能具有共同的遗传基础,我们可以利用这一点来提高预测准确性。我们提出了一种新方法——遗传进展评分(GPS),它整合了生物样本库和病例对照研究来预测疾病进展风险。通过惩罚回归,GPS将病例对照研究的PRS权重作为先验纳入,并在该先验提高预测准确性时,迫使模型参数与先验相似。在模拟中,与仅依赖生物样本库或病例对照样本以及结合生物样本库和病例对照样本的其他策略相比,GPS始终具有更高的预测准确性。当生物样本库样本量较小或遗传相关性较低时,这种改进尤为明显。我们在BioVU生物样本库中推导了临床前期类风湿性关节炎和系统性红斑狼疮进展的PRS,并在“我们所有人”项目中对其进行了验证。对于这两种疾病,GPS都实现了最高的预测,并且所得的PRS与进展患病率的相关性最强。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/cf43b4495bee/41467_2024_55636_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/984daee5d36b/41467_2024_55636_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/f111671a0930/41467_2024_55636_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/686aa2c87884/41467_2024_55636_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/01c62824e594/41467_2024_55636_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/2e714059a215/41467_2024_55636_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/c28178107d91/41467_2024_55636_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/cf43b4495bee/41467_2024_55636_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/984daee5d36b/41467_2024_55636_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/f111671a0930/41467_2024_55636_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/686aa2c87884/41467_2024_55636_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/01c62824e594/41467_2024_55636_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/2e714059a215/41467_2024_55636_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/c28178107d91/41467_2024_55636_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de36/11695684/cf43b4495bee/41467_2024_55636_Fig7_HTML.jpg

相似文献

1
Integrating electronic health records and GWAS summary statistics to predict the progression of autoimmune diseases from preclinical stages.整合电子健康记录和全基因组关联研究汇总统计数据以预测自身免疫性疾病从临床前阶段的进展。
Nat Commun. 2025 Jan 2;16(1):180. doi: 10.1038/s41467-024-55636-6.
2
Efficient Implementation of Penalized Regression for Genetic Risk Prediction.高效实现基于惩罚回归的遗传风险预测。
Genetics. 2019 May;212(1):65-74. doi: 10.1534/genetics.119.302019. Epub 2019 Feb 26.
3
Phenotype integration improves power and preserves specificity in biobank-based genetic studies of major depressive disorder.表型整合提高了基于生物库的重度抑郁症遗传研究的功效并保持了特异性。
Nat Genet. 2023 Dec;55(12):2082-2093. doi: 10.1038/s41588-023-01559-9. Epub 2023 Nov 20.
4
Exploring various polygenic risk scores for skin cancer in the phenomes of the Michigan genomics initiative and the UK Biobank with a visual catalog: PRSWeb.探索密歇根基因组倡议和英国生物库表型中用于皮肤癌的多种多基因风险评分:PRSWeb。
PLoS Genet. 2019 Jun 13;15(6):e1008202. doi: 10.1371/journal.pgen.1008202. eCollection 2019 Jun.
5
Development of a Polygenic Risk Score for Metabolic Dysfunction-Associated Steatotic Liver Disease Prediction in UK Biobank.用于在英国生物银行中预测代谢功能障碍相关脂肪性肝病的多基因风险评分的开发
Genes (Basel). 2024 Dec 28;16(1):33. doi: 10.3390/genes16010033.
6
Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.癌症 PRSweb:一个具有主要癌症特征多基因风险评分的在线知识库及其在两个独立生物库中的评估。
Am J Hum Genet. 2020 Nov 5;107(5):815-836. doi: 10.1016/j.ajhg.2020.08.025. Epub 2020 Sep 28.
7
Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.利用个体水平的遗传数据和 GWAS 汇总统计数据可以提高多基因预测。
Am J Hum Genet. 2021 Jun 3;108(6):1001-1011. doi: 10.1016/j.ajhg.2021.04.014. Epub 2021 May 7.
8
Making the Most of Clumping and Thresholding for Polygenic Scores.充分利用聚类和阈值处理多基因评分。
Am J Hum Genet. 2019 Dec 5;105(6):1213-1221. doi: 10.1016/j.ajhg.2019.11.001. Epub 2019 Nov 21.
9
Polygenic risk score identifies associations between sleep duration and diseases determined from an electronic medical record biobank.多基因风险评分可识别电子病历生物库中确定的睡眠持续时间与疾病之间的关联。
Sleep. 2019 Mar 1;42(3). doi: 10.1093/sleep/zsy247.
10
Phenotype risk scores (PheRS) for pancreatic cancer using time-stamped electronic health record data: Discovery and validation in two large biobanks.利用带时间戳的电子健康记录数据构建胰腺癌的表型风险评分(PheRS):在两个大型生物样本库中的发现与验证
J Biomed Inform. 2021 Jan;113:103652. doi: 10.1016/j.jbi.2020.103652. Epub 2020 Dec 3.

引用本文的文献

1
Medical laboratory data-based models: opportunities, obstacles, and solutions.基于医学实验室数据的模型:机遇、障碍与解决方案。
J Transl Med. 2025 Jul 24;23(1):823. doi: 10.1186/s12967-025-06802-x.

本文引用的文献

1
An ensemble penalized regression method for multi-ancestry polygenic risk prediction.一种用于多祖裔多基因风险预测的集成惩罚回归方法。
Nat Commun. 2024 Apr 15;15(1):3238. doi: 10.1038/s41467-024-47357-7.
2
Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI.通过 GAUDI 明确建模祖先差异效应对混合人群中的多基因风险预测进行改进。
Nat Commun. 2024 Feb 3;15(1):1016. doi: 10.1038/s41467-024-45135-z.
3
mtPGS: Leverage multiple correlated traits for accurate polygenic score construction.
mtPGS:利用多个相关性状进行准确的多基因评分构建。
Am J Hum Genet. 2023 Oct 5;110(10):1673-1689. doi: 10.1016/j.ajhg.2023.08.016. Epub 2023 Sep 15.
4
Multivariate extension of penalized regression on summary statistics to construct polygenic risk scores for correlated traits.基于汇总统计量的惩罚回归的多元扩展,以构建相关性状的多基因风险评分。
HGG Adv. 2023 May 20;4(3):100209. doi: 10.1016/j.xhgg.2023.100209. eCollection 2023 Jul 13.
5
Multi-ancestry and multi-trait genome-wide association meta-analyses inform clinical risk prediction for systemic lupus erythematosus.多血统和多特征全基因组关联荟萃分析为系统性红斑狼疮的临床风险预测提供信息。
Nat Commun. 2023 Feb 7;14(1):668. doi: 10.1038/s41467-023-36306-5.
6
Gastrointestinal Manifestations in Patients with Systemic Lupus Erythematosus.系统性红斑狼疮患者的胃肠道表现
Open Access Rheumatol. 2022 Oct 17;14:243-253. doi: 10.2147/OARRR.S384256. eCollection 2022.
7
The construction of cross-population polygenic risk scores using transfer learning.使用迁移学习构建跨人群多基因风险评分。
Am J Hum Genet. 2022 Nov 3;109(11):1998-2008. doi: 10.1016/j.ajhg.2022.09.010. Epub 2022 Oct 13.
8
Preclinical Autoimmune Disease: a Comparison of Rheumatoid Arthritis, Systemic Lupus Erythematosus, Multiple Sclerosis and Type 1 Diabetes.临床前自身免疫性疾病:类风湿关节炎、系统性红斑狼疮、多发性硬化症和1型糖尿病的比较
Front Immunol. 2022 Jun 30;13:899372. doi: 10.3389/fimmu.2022.899372. eCollection 2022.
9
Development of Open-Angle Glaucoma in Adults With Seropositive Rheumatoid Arthritis in Korea.韩国血清阳性类风湿关节炎成年患者开角型青光眼的发展。
JAMA Netw Open. 2022 Mar 1;5(3):e223345. doi: 10.1001/jamanetworkopen.2022.3345.
10
Association Between Polygenic Risk Score and the Progression from Mild Cognitive Impairment to Alzheimer's Disease.多基因风险评分与轻度认知障碍向阿尔茨海默病进展的关系。
J Alzheimers Dis. 2021;84(3):1323-1335. doi: 10.3233/JAD-210700.