• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于全基因组关联研究汇总统计量的遗传相关性估计

On Genetic Correlation Estimation With Summary Statistics From Genome-Wide Association Studies.

作者信息

Zhao Bingxin, Zhu Hongtu

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, NC.

出版信息

J Am Stat Assoc. 2022;117(537):1-11. doi: 10.1080/01621459.2021.1906684. Epub 2021 May 19.

DOI:10.1080/01621459.2021.1906684
PMID:35757777
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9232179/
Abstract

Cross-trait polygenic risk score (PRS) method has gained popularity for assessing genetic correlation of complex traits using summary statistics from biobank-scale genome-wide association studies (GWAS). However, empirical evidence has shown a common bias phenomenon that highly significant cross-trait PRS can only account for a very small amount of genetic variance ( can be < 1%) in independent testing GWAS. The aim of this paper is to investigate and address the bias phenomenon of cross-trait PRS in numerous GWAS applications. We show that the estimated genetic correlation can be asymptotically biased toward zero. A consistent cross-trait PRS estimator is then proposed to correct such asymptotic bias. In addition, we investigate whether or not SNP screening by GWAS -values can lead to improved estimation and show the effect of overlapping samples among GWAS. We analyze GWAS summary statistics of reaction time and brain structural magnetic resonance imaging-based features measured in the Pediatric Imaging, Neurocognition, and Genetics study. We find that the raw cross-trait PRS estimators heavily underestimate the genetic similarity between cognitive function and human brain structures (mean = 1.32%), whereas the bias-corrected estimators uncover the moderate degree of genetic overlap between these closely related heritable traits (mean = 22.42%). Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.

摘要

跨性状多基因风险评分(PRS)方法已广泛用于利用生物样本库规模的全基因组关联研究(GWAS)的汇总统计数据评估复杂性状的遗传相关性。然而,实证证据表明存在一种常见的偏差现象,即高度显著的跨性状PRS在独立测试的GWAS中只能解释非常少量的遗传方差(可能<1%)。本文的目的是研究并解决跨性状PRS在众多GWAS应用中的偏差现象。我们表明,估计的遗传相关性可能会渐近地偏向于零。然后提出了一种一致的跨性状PRS估计器来纠正这种渐近偏差。此外,我们研究了通过GWAS的P值进行单核苷酸多态性(SNP)筛选是否能改善估计,并展示了GWAS之间样本重叠的影响。我们分析了儿科成像、神经认知和遗传学研究中测量的反应时间以及基于脑结构磁共振成像的特征的GWAS汇总统计数据。我们发现,原始的跨性状PRS估计器严重低估了认知功能与人类脑结构之间的遗传相似性(平均r = 1.32%),而经过偏差校正的估计器揭示了这些密切相关的可遗传性状之间适度的遗传重叠程度(平均r = 22.42%)。本文的补充材料,包括可用于重现该工作的材料的标准化描述,可作为在线补充材料获取。

相似文献

1
On Genetic Correlation Estimation With Summary Statistics From Genome-Wide Association Studies.基于全基因组关联研究汇总统计量的遗传相关性估计
J Am Stat Assoc. 2022;117(537):1-11. doi: 10.1080/01621459.2021.1906684. Epub 2021 May 19.
2
Cross-trait prediction accuracy of summary statistics in genome-wide association studies.全基因组关联研究中汇总统计数据的跨性状预测准确性。
Biometrics. 2023 Jun;79(2):841-853. doi: 10.1111/biom.13661. Epub 2022 Mar 30.
3
Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics.利用全基因组关联研究(GWAS)汇总统计数据得出的性别特异性多基因风险评分(PRSs)的方法比较。
Front Genet. 2022 Jul 8;13:892950. doi: 10.3389/fgene.2022.892950. eCollection 2022.
4
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.
5
Applying polygenic risk score methods to pharmacogenomics GWAS: challenges and opportunities.将多基因风险评分方法应用于药物基因组学全基因组关联研究:挑战与机遇
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad470.
6
PhenoSpD: an integrated toolkit for phenotypic correlation estimation and multiple testing correction using GWAS summary statistics.PhenoSpD:一个整合的工具包,用于使用 GWAS 汇总统计数据进行表型相关性估计和多重检验校正。
Gigascience. 2018 Aug 1;7(8):giy090. doi: 10.1093/gigascience/giy090.
7
Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.癌症 PRSweb:一个具有主要癌症特征多基因风险评分的在线知识库及其在两个独立生物库中的评估。
Am J Hum Genet. 2020 Nov 5;107(5):815-836. doi: 10.1016/j.ajhg.2020.08.025. Epub 2020 Sep 28.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Genome-wide association study identifies novel susceptible loci and evaluation of polygenic risk score for chronic obstructive pulmonary disease in a Taiwanese population.全基因组关联研究鉴定了台湾人群慢性阻塞性肺疾病的新易感位点,并评估了多基因风险评分。
BMC Genomics. 2024 Jun 17;25(1):607. doi: 10.1186/s12864-024-10526-5.
10
Transcriptome-wide association analysis of brain structures yields insights into pleiotropy with complex neuropsychiatric traits.全转录组关联分析大脑结构为复杂神经精神特征的多效性提供了见解。
Nat Commun. 2021 May 17;12(1):2878. doi: 10.1038/s41467-021-23130-y.

引用本文的文献

1
Lifespan investigation of brain volumetric changes associated with substance use disorders.与物质使用障碍相关的脑容量变化的寿命期研究。
Res Sq. 2025 Jun 26:rs.3.rs-6864753. doi: 10.21203/rs.3.rs-6864753/v1.
2
Controlling the False Split Rate in Tree-Based Aggregation.控制基于树的聚合中的误分裂率。
J Am Stat Assoc. 2025;120(550):935-947. doi: 10.1080/01621459.2024.2376285. Epub 2024 Sep 24.
3
A PARTIALLY FUNCTIONAL LINEAR REGRESSION FRAMEWORK FOR INTEGRATING GENETIC, IMAGING, AND CLINICAL DATA.一种用于整合遗传、影像和临床数据的部分功能线性回归框架。

本文引用的文献

1
Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models.高维线性模型中遗传相关性的最优估计
J Am Stat Assoc. 2019;114(525):358-369. doi: 10.1080/01621459.2017.1407774. Epub 2018 Nov 19.
2
Polygenic Architecture of Human Neuroanatomical Diversity.人类神经解剖多样性的多基因结构。
Cereb Cortex. 2020 Apr 14;30(4):2307-2320. doi: 10.1093/cercor/bhz241.
3
Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits.
Ann Appl Stat. 2024 Mar;18(1):704-728. doi: 10.1214/23-aoas1808. Epub 2024 Jan 31.
4
Neural networks for geospatial data.用于地理空间数据的神经网络。
J Am Stat Assoc. 2025;120(549):535-547. doi: 10.1080/01621459.2024.2356293. Epub 2024 Jun 24.
5
Accounting for Twins and Other Multiple Births in Perinatal Studies of Live Births Conducted Using Healthcare Administration Data.在利用医疗保健管理数据进行的活产围产期研究中对双胞胎及其他多胞胎的考量。
Epidemiology. 2025 Mar 1;36(2):165-173. doi: 10.1097/EDE.0000000000001809. Epub 2024 Nov 13.
6
A Generalized Bayesian Stochastic Block Model for Microbiome Community Detection.用于微生物群落检测的广义贝叶斯随机块模型
Stat Med. 2025 Feb 10;44(3-4):e10291. doi: 10.1002/sim.10291.
7
HighDimMixedModels.jl: Robust high-dimensional mixed-effects models across omics data.HighDimMixedModels.jl:跨组学数据的稳健高维混合效应模型。
PLoS Comput Biol. 2025 Jan 13;21(1):e1012143. doi: 10.1371/journal.pcbi.1012143. eCollection 2025 Jan.
8
Estimation of a genetic Gaussian network using GWAS summary data.利用全基因组关联研究(GWAS)汇总数据估计遗传高斯网络。
Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae148.
9
Joint and Individual Component Regression.联合与个体成分回归
J Comput Graph Stat. 2024;33(3):763-773. doi: 10.1080/10618600.2023.2284227. Epub 2023 Dec 29.
10
Estimating trans-ancestry genetic correlation with unbalanced data resources.利用不平衡数据资源估计跨祖先遗传相关性。
J Am Stat Assoc. 2024;119(546):839-850. doi: 10.1080/01621459.2024.2344703. Epub 2024 May 21.
对 19629 个人进行全基因组关联分析,确定了影响区域脑容量的变异,并与认知和精神健康特征一起细化了它们的遗传共构。
Nat Genet. 2019 Nov;51(11):1637-1644. doi: 10.1038/s41588-019-0516-6. Epub 2019 Nov 1.
4
Polygenic prediction via Bayesian regression and continuous shrinkage priors.基于贝叶斯回归和连续收缩先验的多基因预测。
Nat Commun. 2019 Apr 16;10(1):1776. doi: 10.1038/s41467-019-09718-5.
5
Clinical use of current polygenic risk scores may exacerbate health disparities.现行多基因风险评分的临床应用可能会加剧健康差异。
Nat Genet. 2019 Apr;51(4):584-591. doi: 10.1038/s41588-019-0379-x. Epub 2019 Mar 29.
6
Genome-wide by environment interaction studies of depressive symptoms and psychosocial stress in UK Biobank and Generation Scotland.基于 UK Biobank 和 Generation Scotland 研究的抑郁症状和心理社会压力的全基因组与环境交互作用研究。
Transl Psychiatry. 2019 Feb 4;9(1):14. doi: 10.1038/s41398-018-0360-y.
7
The Cognitive Thalamus as a Gateway to Mental Representations.丘脑作为心理表象的门户。
J Neurosci. 2019 Jan 2;39(1):3-14. doi: 10.1523/JNEUROSCI.0479-18.2018. Epub 2018 Nov 2.
8
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
9
Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function.对 300486 人的研究确定了 148 个独立的遗传位置,影响一般认知功能。
Nat Commun. 2018 May 29;9(1):2098. doi: 10.1038/s41467-018-04362-x.
10
Estimation of Genetic Correlation via Linkage Disequilibrium Score Regression and Genomic Restricted Maximum Likelihood.基于连锁不平衡评分回归和基因组约束极大似然估计的遗传相关性估计。
Am J Hum Genet. 2018 Jun 7;102(6):1185-1194. doi: 10.1016/j.ajhg.2018.03.021. Epub 2018 May 10.