• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AoUPRS:该项目一款经济高效且通用的PRS计算器。

AoUPRS: A Cost-Effective and Versatile PRS Calculator for the Program.

作者信息

Khattab Ahmed, Chen Shang-Fu, Wineinger Nathan, Torkamani Ali

机构信息

Integrative Structural and Computational Biology, Scripps Research, La Jolla, CA, USA.

Scripps Research Translational Institute, La Jolla, California, 92037, USA.

出版信息

bioRxiv. 2024 Jul 16:2024.07.11.603165. doi: 10.1101/2024.07.11.603165.

DOI:10.1101/2024.07.11.603165
PMID:39071377
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11275801/
Abstract

BACKGROUND

The All of Us (AoU) Research Program provides a comprehensive genomic dataset to accelerate health research and medical breakthroughs. Despite its potential, researchers face significant challenges, including high costs and inefficiencies associated with data extraction and analysis. AoUPRS addresses these challenges by offering a versatile and cost-effective tool for calculating polygenic risk scores (PRS), enabling both experienced and novice researchers to leverage the AoU dataset for significant genomic discoveries.

RESULTS

AoUPRS is implemented in Python and utilizes the Hail framework for genomic data analysis. It offers two distinct approaches for PRS calculation: the Hail MatrixTable (MT) and the Hail Variant Dataset (VDS). The MT approach provides a dense representation of genotype data, while the VDS approach offers a sparse representation, significantly reducing computational costs. In performance evaluations, the VDS approach demonstrated a cost reduction of up to 99.51% for smaller scores and 85% for larger scores compared to the MT approach. Both approaches yielded similar predictive power, as shown by logistic regression analyses of PRS for coronary artery disease, atrial fibrillation, and type 2 diabetes. The empirical cumulative distribution functions (ECDFs) for PRS values further confirmed the consistency between the two methods.

CONCLUSIONS

AoUPRS is a versatile and cost-effective tool that addresses the high costs and inefficiencies associated with PRS calculations using the AoU dataset. By offering both dense and sparse data processing approaches, AoUPRS allows researchers to choose the approach best suited to their needs, facilitating genomic discoveries. The tool's open-source availability on GitHub, coupled with detailed documentation and tutorials, ensures accessibility and ease of use for the scientific community.

摘要

背景

“我们所有人”(AoU)研究计划提供了一个全面的基因组数据集,以加速健康研究和医学突破。尽管具有潜力,但研究人员面临重大挑战,包括与数据提取和分析相关的高成本和低效率。AoUPRS通过提供一种通用且经济高效的工具来计算多基因风险评分(PRS),使经验丰富和新手研究人员都能够利用AoU数据集进行重大的基因组发现。

结果

AoUPRS用Python实现,并利用Hail框架进行基因组数据分析。它提供了两种不同的PRS计算方法:Hail矩阵表(MT)和Hail变异数据集(VDS)。MT方法提供了基因型数据的密集表示,而VDS方法提供了稀疏表示,显著降低了计算成本。在性能评估中,与MT方法相比,VDS方法对于较小评分显示成本降低高达99.51%,对于较大评分降低85%。两种方法产生了相似的预测能力,如对冠状动脉疾病、心房颤动和2型糖尿病的PRS进行逻辑回归分析所示。PRS值的经验累积分布函数(ECDF)进一步证实了两种方法之间的一致性。

结论

AoUPRS是一种通用且经济高效的工具,解决了使用AoU数据集进行PRS计算时的高成本和低效率问题。通过提供密集和稀疏数据处理方法,AoUPRS允许研究人员选择最适合其需求的方法,促进基因组发现。该工具在GitHub上的开源可用性,加上详细的文档和教程,确保了科学界能够访问并易于使用。

相似文献

1
AoUPRS: A Cost-Effective and Versatile PRS Calculator for the Program.AoUPRS:该项目一款经济高效且通用的PRS计算器。
bioRxiv. 2024 Jul 16:2024.07.11.603165. doi: 10.1101/2024.07.11.603165.
2
AoUPRS: A cost-effective and versatile PRS calculator for the All of Us Program.AoUPRS:一款用于“我们所有人计划”的经济高效且通用的PRS计算器。
BMC Genomics. 2025 May 22;26(1):521. doi: 10.1186/s12864-025-11693-9.
3
Precision Medicine in Cardiovascular Disease Prevention: Clinical Validation of Multi-Ancestry Polygenic Risk Scores in a U.S. Cohort.心血管疾病预防中的精准医学:美国队列中多血统多基因风险评分的临床验证
Nutrients. 2025 Mar 6;17(5):926. doi: 10.3390/nu17050926.
4
All of Us diversity and scale improve polygenic prediction contextually with greatest improvements for under-represented populations.“我们所有人”项目的多样性和规模在情境中改善了多基因预测,对代表性不足的人群改善最大。
bioRxiv. 2024 Aug 6:2024.08.06.606846. doi: 10.1101/2024.08.06.606846.
5
Cost-effectiveness analysis of implementing polygenic risk score in a workplace cardiovascular disease prevention program.实施多基因风险评分在工作场所心血管疾病预防计划中的成本效益分析。
Front Public Health. 2023 Jul 11;11:1139496. doi: 10.3389/fpubh.2023.1139496. eCollection 2023.
6
Validating and automating learning of cardiometabolic polygenic risk scores from direct-to-consumer genetic and phenotypic data: implications for scaling precision health research.从直接面向消费者的遗传和表型数据中验证和自动化学习心血管代谢多基因风险评分:对精准健康研究扩展的影响。
Hum Genomics. 2022 Sep 8;16(1):37. doi: 10.1186/s40246-022-00406-y.
7
Integrating a Polygenic Risk Score for Coronary Artery Disease as a Risk-Enhancing Factor in the Pooled Cohort Equation: A Cost-Effectiveness Analysis Study.将冠状动脉疾病的多基因风险评分整合为合并队列方程中的风险增强因素:一项成本效益分析研究。
J Am Heart Assoc. 2022 Jun 21;11(12):e025236. doi: 10.1161/JAHA.121.025236. Epub 2022 Jun 14.
8
Polygenic risk scores for the prediction of common cancers in East Asians: A population-based prospective cohort study.基于人群的前瞻性队列研究:东亚常见癌症的多基因风险评分预测。
Elife. 2023 Mar 27;12:e82608. doi: 10.7554/eLife.82608.
9
PRS-on-Spark (PRSoS): a novel, efficient and flexible approach for generating polygenic risk scores.PRS-on-Spark(PRSoS):一种新颖、高效且灵活的多基因风险评分生成方法。
BMC Bioinformatics. 2018 Aug 8;19(1):295. doi: 10.1186/s12859-018-2289-9.
10
Polygenic Prediction of Peripheral Artery Disease and Major Adverse Limb Events.外周动脉疾病和主要肢体不良事件的多基因预测
JAMA Cardiol. 2025 May 21. doi: 10.1001/jamacardio.2025.1182.

本文引用的文献

1
Polygenic Risk Score, Lifestyles, and Type 2 Diabetes Risk: A Prospective Chinese Cohort Study.多基因风险评分、生活方式与 2 型糖尿病风险:一项前瞻性中国队列研究。
Nutrients. 2023 Apr 29;15(9):2144. doi: 10.3390/nu15092144.
2
Type 2 Diabetes Polygenic Score Predicts the Risk of Glucocorticoid-Induced Hyperglycemia in Patients Without Diabetes.2 型糖尿病多基因风险评分可预测无糖尿病患者发生糖皮质激素诱导性高血糖的风险。
Diabetes Care. 2023 Aug 1;46(8):1541-1545. doi: 10.2337/dc23-0353.
3
Polygenic risk scores for cardiovascular diseases and type 2 diabetes.心血管疾病和 2 型糖尿病的多基因风险评分。
PLoS One. 2022 Dec 2;17(12):e0278764. doi: 10.1371/journal.pone.0278764. eCollection 2022.
4
The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation.多基因风险评分目录作为一个开放的数据库,用于可重复性和系统评估。
Nat Genet. 2021 Apr;53(4):420-425. doi: 10.1038/s41588-021-00783-5.
5
Population Bias in Polygenic Risk Prediction Models for Coronary Artery Disease.群体偏倚对冠状动脉疾病多基因风险预测模型的影响。
Circ Genom Precis Med. 2020 Dec;13(6):e002932. doi: 10.1161/CIRCGEN.120.002932. Epub 2020 Nov 10.
6
The personal and clinical utility of polygenic risk scores.多基因风险评分的个体和临床效用。
Nat Rev Genet. 2018 Sep;19(9):581-590. doi: 10.1038/s41576-018-0018-x.